Use this endpoint to submit a new video generation task with Google’s VEO model. You can guide generation in one of two mutually exclusive modes: reference image mode (provide up to 3 stylistic reference images) or first/last frame mode (pin the exact start and end frames of the video). The API responds immediately with a taskDocumentation Index
Fetch the complete documentation index at: https://docs.xingchaoyiqing.com/llms.txt
Use this file to discover all available pages before exploring further.
id — you then poll the Query Video Task endpoint until the video is ready.
Video generation is asynchronous. The create endpoint returns a task
id with status: "queued". Use the Query Video Task endpoint to check progress and retrieve the final video_url.Base URL & Endpoint
Supported Models
| Model | Resolutions | Reference Images | First/Last Frame | Notes |
|---|---|---|---|---|
veo_3_1_fast | 720p · 1080p · 4K | ✅ up to 3 | ✅ | Fast generation |
veo_3_1_pro | 720p · 1080p · 4K | ✅ up to 3 | ✅ | Highest quality |
veo_3_1_pro_stable | 720p · 1080p | ✅ up to 3 | ✅ | Stable variant |
veo_3_1_fast_stable | 720p · 1080p | ❌ | ✅ only | First/last frame only |
Request Parameters
The VEO model variant to use. See the models table above for capability details.Accepted values:
veo_3_1_fast, veo_3_1_pro, veo_3_1_pro_stable, veo_3_1_fast_stableA text description of the video you want to generate.Example:
"A cat dancing in the rain, cinematic style"Output video resolution. Defaults to
720p.720p— supported by all models1080p— supported by all models4K— onlyveo_3_1_fastandveo_3_1_pro
720p, 720P, 4K, and 4k are all accepted).Output video aspect ratio. Defaults to
16:9.16:9— landscape9:16— portrait
resolution field. Do not append resolution suffixes (e.g. -1080p) to this field.Reference Image Mode
Provide up to 3 reference images to guide the overall style and content of the generated video. Mutually exclusive with first/last frame mode.Array of reference image URLs. Maximum 3 images.Supported models:
veo_3_1_pro_stable (max 3), veo_3_1_fast (max 3), veo_3_1_pro.Use this or image_urls — not both. Cannot be combined with first_image / last_image.Array of reference image URLs. Maximum 3 images.Supported models:
veo_3_1_pro_stable (max 3), veo_3_1_fast (max 3), veo_3_1_pro.Use this or input_reference — not both. Cannot be combined with first_image / last_image.First / Last Frame Mode
Pin the exact starting and/or ending frame of the generated video. Mutually exclusive with reference image mode. All four models support this mode.URL of the image to use as the first frame of the video.
- Used alone: image-to-video mode (video starts from this frame)
- Used with
last_image: first/last frame mode (video starts and ends at the specified frames)
input_reference / image_urls.URL of the image to use as the last frame of the video. Must be used together with
first_image.Cannot be combined with input_reference / image_urls.Response Fields
Unique identifier for the video generation task. Save this value — you will use it to poll the Query Video Task endpoint.
Object type. Always
"video".Unix timestamp (seconds) of when the task was created.
The model name you specified in the request.
Initial task status. On successful creation this is always
"queued".Error message. Only present when
status is "failed".Code Examples
Guidance Mode Comparison
| Mode | When to use | Supported models |
|---|---|---|
| Reference image mode | You want the video to match an overall visual style | veo_3_1_pro_stable, veo_3_1_fast, veo_3_1_pro |
| First/last frame mode | You need precise control over the opening and closing frames | All four models |