Use this endpoint to submit a video generation task with ByteDance’s Seedance 1.5 model. Unlike simpler text-only video APIs, Seedance 1.5 accepts aDocumentation Index
Fetch the complete documentation index at: https://docs.xingchaoyiqing.com/llms.txt
Use this file to discover all available pages before exploring further.
content array that lets you mix text prompts and image inputs in a single request. You can generate video from text alone, from a single reference frame, from first and last frames, or from multiple reference images. Audio generation is supported on all model tiers. The API responds immediately with a task id — you then poll the Query Video Task endpoint until the video is ready.
Video generation is asynchronous. The create endpoint returns a task
id with status: "queued". Use the Query Video Task endpoint to check progress and retrieve the final video_url.Base URL & Endpoint
Supported Models
| Model | Resolution | Audio | Image-to-Video |
|---|---|---|---|
seedance_1_5_pro_480p | 480p | ✅ | ✅ |
seedance_1_5_pro_720p | 720p | ✅ | ✅ |
seedance_1_5_pro_1080p | 1080p | ✅ | ✅ |
Request Parameters
The Seedance 1.5 model to use. Accepted values:
seedance_1_5_pro_480pseedance_1_5_pro_720pseedance_1_5_pro_1080p
An array of content objects that define the prompt and any input images. This is not a simple string — it must be an array of typed objects (see the Content Array Structure section below).At minimum, include one text object. You may also include image objects to enable image-guided generation modes.
Output video aspect ratio. Accepted values:
16:9, 4:3, 1:1, 3:4, 9:16, 21:9, adaptive.Defaults:- Text-to-video:
16:9(Seedance 1.5 Pro defaults toadaptive) - Image-to-video:
adaptive
Video duration in seconds.
- Range:
2–12seconds (default5) - Seedance 1.5 Pro also supports
4–12seconds - Set to
-1to let the model automatically choose the best duration (4–12 s)
Whether to include synthesised audio in the output video. Defaults to
true.Audio generation is only available on Seedance 1.5 Pro series models.Content Array Structure
Thecontent array supports two object types: text and image_url.
Text Content Object
Must be
"text".The video generation prompt. Maximum length: 500 Chinese characters or 1000 English words.
Example text object
Image Content Object
Must be
"image_url".A publicly accessible URL for the input image.
The role of this image in the generation. Controls which generation mode is used:
first_frame(or omitted) — use as the first frame of the video (image-to-video mode)last_frame— use as the last frame of the video (must be paired with afirst_frameimage)reference_image— use as a stylistic reference (1–4 images supported)
- Formats: JPEG, PNG, WebP, BMP, TIFF, GIF (Seedance 1.5 Pro also supports HEIC/HEIF)
- Aspect ratio: between 0.4 and 2.5 (width ÷ height)
- Dimensions: 300 px – 6000 px on each side
- File size: less than 30 MB
Example image object
Generation Scenarios
The image roles in thecontent array determine which generation mode is active. These scenarios are mutually exclusive — you cannot mix first-frame, last-frame, and reference image roles in the same request.
| Scenario | Content array contents |
|---|---|
| Text-to-video | One text object only |
| Image-to-video (first frame) | One text object + one image with role: "first_frame" |
| Image-to-video (first + last frame) | One text object + one first_frame image + one last_frame image |
| Reference image guidance | One text object + 1–4 images with role: "reference_image" |
Response Fields
Unique identifier for the video generation task. Task records are saved for 1 day — save this value to query the result in time.
Object type. Always
"video".Unix timestamp (seconds) of when the task was created.
The model name you specified in the request.
Initial task status. On successful creation this is always
"queued".