Kling v3.0 4K Image-to-Video
Video Generator
Kling v3.0 4K Image-to-Video
POST
Kling v3.0 4K Image-to-Video
Kling v3.0 4K Image-to-Video generates native 4K ultra-high-definition videos from a first-frame image with cinematic quality, smooth motion, precise prompt adherence, and optional synchronized audio generation. Supports end-frame guidance, multi-shot composition, and flexible 3-15 second durations.
Request Headers
Supports:
application/jsonBearer authentication format, for example: Bearer {{API Key}}.
Request Body
First frame image for video; supports
.jpg, .jpeg, .png.
Image file size must not exceed 10MB; width and height must be >= 300px; aspect ratio must be between 1:2.5 and 2.5:1.Whether to generate synchronized audio simultaneously with the video. Supports Chinese and English voice output.
Positive prompt text for video generation, describing scene motion, camera movement, actions, voice style, atmosphere, and sound effects; must not exceed 2500 characters.Length limit: 0 - 2500
Duration of generated video in seconds. Supports flexible durations from 3 to 15 seconds.Value range: [3, 15]
Controls flexibility of video generation. Higher values result in content more closely following the prompt; lower values produce more natural motion.Value range: [0, 1]
Ending frame image URL for guided transitions between start and end frames. Same format constraints as image. Cannot be used together with multi_prompt.
Array of prompts for multi-shot video composition. Each item contains a prompt and duration for one segment. Cannot be used together with end_image.
Negative prompt describing elements to avoid in video and audio; must not exceed 2500 characters.Length limit: 0 - 2500
Response
Use the task_id to request the Task Result API to retrieve the generated outputs.
Last modified on June 10, 2026