Submit an asynchronous video generation task using Vidu’s AI models. The API returns a taskDocumentation Index
Fetch the complete documentation index at: https://docs.xingchaoyiqing.com/llms.txt
Use this file to discover all available pages before exploring further.
id immediately — you then poll the Query Video Task endpoint until the task reaches completed or failed status. Vidu supports pure text-to-video, image-to-video (driven by a first frame), and first/last frame guidance mode for precise scene control.
Base URL
Endpoint
Authentication
Include your API key as a Bearer token in every request:Models
Vidu offers two models with different performance characteristics:| Model | Description |
|---|---|
viduq3-pro | Higher quality output — richer motion, more vivid and cinematic results |
viduq3-turbo | Faster generation — same capabilities, reduced wait time |
Request Parameters
The model to use for generation.Supported values:
viduq3-pro— higher quality, cinematic outputviduq3-turbo— faster generation
A text description of the video content you want to generate. Be specific about subject, action, camera movement, and visual style.Example:
"A cat dancing in the rain, cinematic style"Output video length in seconds. Defaults to
5.- Supported range:
1–16seconds - Default:
5
Output video aspect ratio. Defaults to
16:9.| Value | Description |
|---|---|
16:9 | Landscape (default) |
9:16 | Portrait |
1:1 | Square |
4:3 | Standard screen |
3:4 | Portrait standard |
Output resolution. Defaults to
720p.| Value | Description |
|---|---|
540p | Standard definition |
720p | HD (default) |
1080p | Full HD |
Image-to-Video Parameters
URL of the first-frame image. Enables image-to-video mode.
Value must be a publicly accessible URL.
- Used alone: drives image-to-video generation from this starting frame
- Used with
last_image: enables first/last frame mode — the model transitions from the first frame to the last
jpeg, png, webpValue must be a publicly accessible URL.
URL of the last-frame image. Enables first/last frame guidance mode.
Value must be a publicly accessible URL.
- Must be used together with
first_image— cannot be used alone - The model generates a smooth transition between the two frames
jpeg, png, webpValue must be a publicly accessible URL.
Response Fields
Unique task identifier. Save this value — you’ll use it to poll the Query Video Task endpoint.
Object type. Always
"video".Unix timestamp of when the task was created.
The model name used for this task.
Task status at creation time. Always
"queued" on successful submission.Possible lifecycle values:queued— task accepted and waiting in queueprocessing— model is actively generatingcompleted— generation finished;video_urlis availablefailed— generation failed; seeerrorfor details
Error message.
null on successful submission; populated when status is "failed".Code Examples
Example Response
Usage Modes
Text-to-Video
Send onlymodel, prompt, and optional duration/aspect_ratio/resolution. The model generates content entirely from the text description.
Image-to-Video
Includefirst_image (without last_image). The model anchors the video to your provided first frame and generates a natural continuation.
First/Last Frame Guidance
Include bothfirst_image and last_image. The model creates a smooth transition between your two provided frames — ideal for controlled scene transitions and specific visual storytelling.
All image URLs must be publicly accessible on the internet. Base64-encoded images are not supported.
Next Steps
After receiving the taskid, poll the Query Video Task endpoint to check status and retrieve the video_url when generation completes.