Kling-o1 Text to Video
Video Generator
Kling-o1 Text to Video
POST
Kling-o1 Text to Video
Kling Omni Video O1 is Kuaishou’s first unified multimodal video model using MVL (Multimodal Vision Language) technology. Its text-to-video mode can generate cinematic videos based on text prompts while maintaining subject consistency, natural physical simulation, and precise semantic understanding.
Request Headers
Supports:
application/jsonBearer authentication format, for example: Bearer {{API Key}}.
Request Body
The positive prompt for the generation.
The duration of the generated media in seconds.Optional values:
5, 10The aspect ratio of the generated video.Optional values:
16:9, 9:16, 1:1Response
Use the task_id to request the Task Result API to retrieve the generated outputs.
Last modified on December 5, 2025