Model Library/Kimi K2 Thinking
MoonshotAI

Kimi K2 Thinking

20% OFF
moonshotai/kimi-k2-thinking
The kimi-k2-thinking model is a general-purpose agentic reasoning model developed by Moonshot AI.

Features

Serverless API

Docs

moonshotai/kimi-k2-thinking is available via Novita's serverless API, where you pay per token. There are several ways to call the API, including OpenAI-compatible endpoints with exceptional reasoning performance.

Available Serverless

Run queries immediately, pay only for usage

Input$0.48 / M Tokens$0.6 / M Tokens
Output$2 / M Tokens$2.5 / M Tokens

Use the following code examples to integrate with our API:

1from openai import OpenAI
2
3client = OpenAI(
4    api_key="<Your API Key>",
5    base_url="https://api.novita.ai/openai"
6)
7
8response = client.chat.completions.create(
9    model="moonshotai/kimi-k2-thinking",
10    messages=[
11        {"role": "system", "content": "You are a helpful assistant."},
12        {"role": "user", "content": "Hello, how are you?"}
13    ],
14    max_tokens=262144,
15    temperature=0.7
16)
17
18print(response.choices[0].message.content)

Info

Provider
MoonshotAI
Quantization
bf16

Supported Functionality

Context Length
262144
Max Output
262144
Serverless
Supported
Structured Output
Supported
Reasoning
Supported
Function Calling
Supported
Anthropic API
Supported
Input Capabilities
text
Output Capabilities
text