Kling Omni Video O1 is Kuaishou’s first unified multimodal video model using MVL (Multimodal Vision Language) technology. Its text-to-video mode can generate cinematic videos based on text prompts while maintaining subject consistency, natural physical simulation, and precise semantic understanding.
This API is asynchronous: the response contains only a task_id. Pass that task_id to the Task Result API to retrieve the video generation results.
Supports: application/json
Bearer authentication format, for example: Bearer {{API Key}}.
Request Body
The positive prompt for the generation.
The duration of the generated media in seconds. Allowed values: 5, 10
The aspect ratio of the generated video. Allowed values: 16:9, 9:16, 1:1
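The following is a minimal sketch of how a request with these three parameters might be assembled in Python. This page does not show the field names or the endpoint URL, so `prompt`, `duration`, `aspect_ratio`, and `ENDPOINT` are assumptions for illustration only. Whether the duration is sent as a number or a string is also assumed. Check them against the provider's reference before use.

```python
import json
import urllib.request

# Hypothetical endpoint: replace with the URL from the provider's documentation.
ENDPOINT = "https://api.example.com/text-to-video"

# Allowed values as listed in the parameter descriptions above.
ALLOWED_DURATIONS = {5, 10}
ALLOWED_ASPECT_RATIOS = {"16:9", "9:16", "1:1"}


def build_payload(prompt, duration, aspect_ratio):
    """Validate the parameters and return a JSON-serializable request body."""
    if not prompt:
        raise ValueError("prompt must be a non-empty string")
    if duration not in ALLOWED_DURATIONS:
        raise ValueError(f"duration must be one of {sorted(ALLOWED_DURATIONS)}")
    if aspect_ratio not in ALLOWED_ASPECT_RATIOS:
        raise ValueError(f"aspect_ratio must be one of {sorted(ALLOWED_ASPECT_RATIOS)}")
    # Field names are assumed; the source page does not state them.
    return {"prompt": prompt, "duration": duration, "aspect_ratio": aspect_ratio}


def build_request(api_key, payload, endpoint=ENDPOINT):
    """Build (but do not send) a POST request with JSON body and Bearer auth."""
    headers = {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",
    }
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(endpoint, data=body, headers=headers, method="POST")
```

To send the request, pass the returned object to `urllib.request.urlopen` and decode the JSON response. Validating on the client side catches unsupported durations or aspect ratios before a task is submitted.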
Response
Use the task_id to request the Task Result API to retrieve the generated outputs.
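Because the result has to be fetched separately, a client usually polls the Task Result API until the task is finished. The sketch below shows one way to structure that loop. It assumes the response is a JSON object with a top-level `task_id` key. The shape of the Task Result response is not described on this page, so the caller supplies two hypothetical callables: `fetch_result`, which queries the Task Result API, and `is_finished`, which decides whether a result is final.

```python
import time


def extract_task_id(response_body):
    """Return the task_id from a decoded response (assumed top-level key)."""
    task_id = response_body.get("task_id")
    if not task_id:
        raise ValueError("response does not contain a task_id")
    return task_id


def wait_for_result(task_id, fetch_result, is_finished,
                    interval=5.0, max_attempts=60, sleep=time.sleep):
    """Poll fetch_result(task_id) until is_finished(result) is true.

    fetch_result and is_finished are caller-supplied; this sketch makes no
    assumption about the Task Result API's URL or response fields.
    """
    for attempt in range(max_attempts):
        result = fetch_result(task_id)
        if is_finished(result):
            return result
        # Wait between attempts, but not after the last one.
        if attempt < max_attempts - 1:
            sleep(interval)
    raise TimeoutError(f"task {task_id} did not finish after {max_attempts} attempts")
```

Injecting `fetch_result`, `is_finished`, and `sleep` keeps the loop independent of any particular HTTP client and makes it easy to exercise without network access. The interval and attempt limit are arbitrary defaults, not values from the API documentation.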