Skip to main content
POST
/
v3
/
async
/
kling-v3.0-4k-i2v
Kling v3.0 4K Image-to-Video
curl --request POST \
  --url https://api.novita.ai/v3/async/kling-v3.0-4k-i2v \
  --header 'Authorization: <authorization>' \
  --header 'Content-Type: <content-type>' \
  --data '
{
  "image": "<string>",
  "sound": true,
  "prompt": "<string>",
  "duration": 123,
  "cfg_scale": 123,
  "end_image": "<string>",
  "multi_prompt": [
    {
      "prompt": "<string>",
      "duration": 123
    }
  ],
  "negative_prompt": "<string>"
}
'
{
  "task_id": "<string>"
}

Documentation Index

Fetch the complete documentation index at: https://novita.ai/docs/llms.txt

Use this file to discover all available pages before exploring further.

Kling v3.0 4K Image-to-Video generates native 4K ultra-high-definition videos from a first-frame image with cinematic quality, smooth motion, precise prompt adherence, and optional synchronized audio generation. Supports end-frame guidance, multi-shot composition, and flexible 3-15 second durations.
This is an asynchronous API; only the task_id will be returned. You should use the task_id to request the Task Result API to retrieve the video generation results.

Request Headers

Content-Type
string
required
Supports: application/json
Authorization
string
required
Bearer authentication format, for example: Bearer {{API Key}}.

Request Body

image
string
required
First frame image for video; supports .jpg, .jpeg, .png. Image file size must not exceed 10MB; width and height must be >= 300px; aspect ratio must be between 1:2.5 and 2.5:1.
sound
boolean
default:false
Whether to generate synchronized audio simultaneously with the video. Supports Chinese and English voice output.
prompt
string
required
Positive prompt text for video generation, describing scene motion, camera movement, actions, voice style, atmosphere, and sound effects; must not exceed 2500 characters.Length limit: 0 - 2500
duration
integer
default:5
Duration of generated video in seconds. Supports flexible durations from 3 to 15 seconds.Value range: [3, 15]
cfg_scale
number
Controls flexibility of video generation. Higher values result in content more closely following the prompt; lower values produce more natural motion.Value range: [0, 1]
end_image
string
Ending frame image URL for guided transitions between start and end frames. Same format constraints as image. Cannot be used together with multi_prompt.
multi_prompt
array
Array of prompts for multi-shot video composition. Each item contains a prompt and duration for one segment. Cannot be used together with end_image.
negative_prompt
string
Negative prompt describing elements to avoid in video and audio; must not exceed 2500 characters.Length limit: 0 - 2500

Response

task_id
string
required
Use the task_id to request the Task Result API to retrieve the generated outputs.