Model Library/Kimi K2.7 Code
MoonshotAI

Kimi K2.7 Code

moonshotai/kimi-k2.7-code
Kimi K2.7 Code is MoonshotAI's strongest coding & agentic model — a 1T-parameter MoE (32B activated) , 256K context and interleaved thinking with multi-step tool calling. It delivers major gains on long-horizon coding tasks while cutting thinking-token usage by ~30% vs K2.6, and accepts text, image and video inputs for vision-driven development workflows.

Features

Serverless API

Docs

moonshotai/kimi-k2.7-code is available via Novita's serverless API, where you pay per token. There are several ways to call the API, including OpenAI-compatible endpoints with exceptional reasoning performance.

Available Serverless

Run queries immediately, pay only for usage

Input$0.95 / M Tokens
Cache Read$0.19 / M Tokens
Output$4 / M Tokens

Use the following code examples to integrate with our API:

1from openai import OpenAI
2
3client = OpenAI(
4    api_key="<Your API Key>",
5    base_url="https://api.novita.ai/openai"
6)
7
8response = client.chat.completions.create(
9    model="moonshotai/kimi-k2.7-code",
10    messages=[
11        {"role": "system", "content": "You are a helpful assistant."},
12        {"role": "user", "content": "Hello, how are you?"}
13    ],
14    max_tokens=262144,
15    temperature=0.7
16)
17
18print(response.choices[0].message.content)

Info

Provider
-
Quantization
-

Supported Functionality

Context Length
262144
Max Output
262144
Serverless
Supported
Function Calling
Supported
Structured Output
Supported
Reasoning
Supported
Anthropic API
Supported
Input Capabilities
text, image, video
Output Capabilities
text

Everything you need to build production AI.

200+ models, on-demand GPUs, and secure agent runtimes — unified under one API. Free to start, scales as you grow.