Video Generating
POST https://api.vidu.com/ent/v2/reference2videoField | Value | Description |
|---|---|---|
Content-Type | application/json | Data Exchange Format |
Authorization | Token {your api key} | Replace {} with your API key |
Field | Sub Field | Type | Required | Description |
|---|---|---|---|---|
model | String | Required | Model name Accepted values: viduq3-turbo viduq3 viduq2 viduq1 vidu2.0 - viduq3-turbo: Supports intelligent scene switching and simultaneous audio and video output, boasts the fastest generation speed, and offers the highest cost-effectiveness - viduq3: Supports intelligent camera switching, supports simultaneous audio and video output, and achieves superior consistency across multiple camera positions - viduq2: Good dynamic effect and rich generated details - viduq1: Clear image, smooth transition, stable camera operation - vidu2.0: Fast generation speed | |
auto_subjects | Bool | Optional | Whether to use the intelligent entity library capability, default is false, optional values are true and false - true: Utilize the capabilities of the intelligent agent library. - false: Do not use the intelligent agent library capability. | |
subjects | List[Array] | Required | When using the q3, q2, q1, and 2.0 models, only the main body of the image and text can be used - The maximum number of images or textual content should not exceed 7 When using the q2-pro model, video subjects, text subjects, and image subjects can be employed - The number of images or textual content should not exceed 4 - The number of video subjects should not exceed 2 (or 1 for temporary video subjects) Note: viduq3-mix does not support the use of entities for the time being | |
name | String | Required | Subject name Usable in prompts via @subjectname | |
images | Array\{String\} | Required | URLs of images corresponding to the subject. Each subject supports up to 3 images. - Assets can be provided via URLs or Base64 encode. - You must use one of the following codecs: PNG, JPEG, JPG, WebP - The dimensions of the images must be at least 128x128 pixels - The aspect ratio of the images must be less than 1:4 or 4:1 - All images are limited to 50MB - The post body of the HTTP request should not exceed 20MB, and it must include an appropriate content type string. For instance | |
voice_id | String | Optional | Voice ID Used to determine the voice character in the video. The system will automatically recommend a suitable voice, and optional values can be found in the Voice List. You can also use the Voice Cloning API to clone any voice. The voice_id parameters can be used interchangeably. | |
prompt | String | Required | Text prompt A textual description for video generation, with a maximum length of 5000 characters Or you can use id to represent subject content. For example: | |
audio | Bool | Optional | Whether to enable audio-video output capability. Default is false. Valid values: - false: Outputs silent video. - true: Outputs video with generated speech and background music based on the prompt. By default, audio is set to true for viduq3 and viduq3-turbo, and false for other models | |
audio_type | String | Optional | Audio type, required when audio is true, defaults to all -All: Sound effects+vocals -Speech_only: Vocal only -Sound-effect_only: Sound effects only | |
duration | Int | Optional | Video duration parameter, with default values depending on the model: - viduq3、viduq3-turbo: Default is 5 seconds, available option: 3-16 - viduq2: Default is 5 seconds, available option: 1-10 - viduq1: Default is 5 seconds, available option: 5 - vidu2.0: Default is 4 seconds, available option: 4 | |
seed | Int | Optional | Random seed - Defaults to a random seed number - Manually set values will override the default random seed | |
aspect_ratio | String | Optional | The aspect ratio of the output video Defaults to 16:9, accepted: 16:9 9:16 1:1 q2、q3 model supports any aspect ratio | |
resolution | String | Optional | Resolution parameter, with default values depending on the model and video duration: - viduq3、viduq3-turbo (3 - 16s): Default is 720p, available option: 540p 720p 1080p - viduq2 (1 - 10s): Default is 720p, available option: 540p 720p 1080p - viduq1 (5s): Default is 1080p, available option: 1080p - vidu2.0 (4s): Default is 360p, available options: 360p, 720p | |
movement_amplitude | String | Optional | The movement amplitude of objects in the frame Defaults to auto, accepted value: auto small medium large This parameter does not take effect when using the q2、q3 model | |
payload | String | Optional | transparent transmission parameters No processing, only data transmission,with a maximum length of 1048576 characters | |
off_peak | Bool | Optional | off peak mode, Defaults to false, accepted value: true false - true:off peak generate mode; - false:normal generate mode; - 1:The off_peak mode consumes lower points, please refer to the details Pricing - 2:Tasks submitted in off peak mode will be generated within 48 hours. Tasks that are not completed will be automatically cancelled and their points will be refunded; - 3:we also support cancle off_peak tasks. - 4:The q3 model supports peak-shifting tasks when the audio parameter is true; the q2, q1, and 2.0 series models support peak-shifting tasks when the audio parameter is false | |
callback_url | String | Optional | Callback When creating a task, you need to actively set the callback_url with a POST request. When the video generation task changes its status, Vidu will send a callback request to this URL, containing the latest status of the task. The structure of the callback request content will be the same as the return body of the GET Generation API. The "status" in the callback response includes the following states: - processing: Task is being processed. - success: Task is completed (if sending fails, it will retry the callback three times). - failed: Task failed (if sending fails, it will retry the callback three times). Vidu uses a callback signature algorithm for verification, check out the details here: Callback Signature |
Field | Type | Description |
|---|---|---|
task_id | String | Task ID |
state | String | It will be returned to a specific processing state: - created created task successfully - queueing task in queue - processing processing - success generation successful - failedtask failed |
model | String | The parameter of the model used for this call |
prompt | String | The text prompt used for this call |
images | Array[String] | The images used for this call |
duration | Int | The video duration parameter used for this call |
seed | Int | The random seed parameter used for this call |
aspect_ratio | String | The aspect ratio parameter used for this call |
resolution | String | The resolution parameter used for this call |
bgm | Bool | Whether to use bgm for this call |
audio | Bool | Whether to use audio for this call |
audio_type | String | The audio_type parameter used for this call |
movement_amplitude | String | The camera movement amplitude parameter used for this call |
payload | String | The payload parameter used for this call |
off_peak | Bool | The off_peak parameter used for this call |
credits | String | The credits used for this call |
created_at | String | Task creation time |
{ "task_id": "your_task_id_here", "state": "created", "model": "viduq3", "prompt": "@1 and @2 are cooking together, and both say they love hot pot.", "duration": 8, "seed": random_number, "resolution": "1080p", "movement_amplitude": "auto", "payload":"", "off_peak": false, "credits":credits_number, "created_at": "2025-01-01T15:41:31.968916Z" }
Field | Sub Field | Type | Required | Description |
|---|---|---|---|---|
model | String | Required | Model name Accepted values: viduq3-mix viduq3-turbo viduq3 viduq2-pro viduq2 viduq1 vidu2.0 - viduq3-mix: Strong visual quality, supports intelligent scene transitions, supports simultaneous audio and video output, good dynamic effects, and the strongest balance - viduq3-turbo: Supports intelligent scene switching and synchronized audio and video output, boasts the fastest generation speed, and offers the highest cost-effectiveness - viduq3: Supports intelligent camera switching, supports simultaneous audio and video output, and achieves superior consistency across multiple camera positions - viduq2-pro: Supports video reference, video editing, and video replacement - viduq2: Good dynamic effect and rich generated details - viduq1: Clear image, smooth transition, stable camera operation - vidu2.0: Fast generation speed | |
images | Array[String] | Optional | The model will use the provided images as references to generate a video with consistent subjects For fields that accept images: - viduq3-mix viduq3-turbo viduq3 viduq2 viduq1 vidu2.0 accepts 1 to 7 images. - viduq2-pro If no video is uploaded, supports 1-7 images ; If Uploading videos, supports 1-4 images. - Assets can be provided via URLs or Base64 encode. - You must use one of the following codecs: PNG, JPEG, JPG, WebP - The dimensions of the images must be at least 128x128 pixels - The aspect ratio of the images must be less than 1:4 or 4:1 - All images are limited to 50MB - The length of the base64 decode must be under 10MB, and it must include an appropriate content type string. For instance, data:image/png;base64,\{base64_encode\} | |
videos | Array[String] | Optional | The video URL corresponding to the subject. All subjects share a 13s duration (each image subject = 1s). Must provide either this or images. - Note 1: This parameter is only supported by the viduq2-pro model. - Note 2: When using the video reference feature, supports uploading at most 1 video of 8s or 2 videos of 5s. - Note 3: Supported video formats: mp4, avi, mov. - Note 4: Video resolution cannot be less than 128*128, aspect ratio must be between 1:4 and 4:1, and size must not exceed 100M. - Note 5: Please note that the byte length after base64 decoding must be less than 20M, and the encoding must include the appropriate content type string, e.g.: data:video/mp4;base64,\{base64_encode\} | |
prompt | String | Required | Text prompt A textual description for video generation, with a maximum length of 5000 characters | |
duration | Int | Optional | Video duration parameter, with default values depending on the model: - viduq3-mix: Default is 5 seconds, available option: 1-16 - viduq3-turbo、viduq3-pro: Default is 5 seconds, available option: 3-16 - viduq2-pro: Default is 5 seconds, available option: 0-10(0 for automatic duration) - viduq2: Default is 5 seconds, available option: 1-10 - viduq1: Default is 5 seconds, available option: 5 - vidu2.0: Default is 4 seconds, available option: 4 | |
seed | Int | Optional | Random seed - Defaults to a random seed number - Manually set values will override the default random seed | |
aspect_ratio | String | Optional | The aspect ratio of the output video Defaults to 16:9, accepted: 16:9 9:16``3:4 4:3 1:1 3:4&4:3only support q2 model | |
resolution | String | Optional | Resolution parameter, with default values depending on the model and video duration: - viduq3-mix(1-16):Default is 720p, available option: 720p 1080p - viduq3-turbo、viduq3-pro(3-16):Default is 720p, available option: 540p 720p 1080p - viduq2-pro (0 - 10s): Default is 720p, available option: 540p 720p 1080p - viduq2 (1 - 10s): Default is 720p, available option: 540p 720p 1080p - viduq1 (5s): Default is 1080p, available option: 1080p - vidu2.0 (4s): Default is 360p, available options: 360p, 720p | |
movement_amplitude | String | Optional | The movement amplitude of objects in the frame Defaults to auto, accepted value: auto small medium large This parameter does not take effect when using the q2 model | |
bgm | Bool | Optional | Whether to add background music to the generated video. Default: false. Acceptable values: true, false. When true, the system will automatically add a suitable BGM. BGM has no time limit and the system automatically adapts. BGM does not take effect when the duration of the q2 model is 9 or 10 seconds q3 model does not support this parameter | |
payload | String | Optional | transparent transmission parameters No processing, only data transmission,with a maximum length of 1048576 characters | |
off_peak | Bool | Optional | off peak mode, Defaults to false, accepted value: true false - true:off peak generate mode; - false:normal generate mode; - 1:The off_peak mode consumes lower points, please refer to the details Pricing - 2:Tasks submitted in off peak mode will be generated within 48 hours. Tasks that are not completed will be automatically cancelled and their points will be refunded; - 3:we also support cancle off_peak tasks. - 4:The q3 model supports peak-shifting tasks when the audio parameter is true; the q2, q1, and 2.0 series models support peak-shifting tasks when the audio parameter is false | |
callback_url | String | Optional | Callback When creating a task, you need to actively set the callback_url with a POST request. When the video generation task changes its status, Vidu will send a callback request to this URL, containing the latest status of the task. The structure of the callback request content will be the same as the return body of the GET Generation API. The "status" in the callback response includes the following states: - processing: Task is being processed. - success: Task is completed (if sending fails, it will retry the callback three times). - failed: Task failed (if sending fails, it will retry the callback three times). Vidu uses a callback signature algorithm for verification, check out the details here: Callback Signature |
Field | Type | Description |
|---|---|---|
task_id | String | Task ID |
state | String | It will be returned to a specific processing state: - created created task successfully - queueing task in queue - processing processing - success generation successful - failedtask failed |
model | String | The parameter of the model used for this call |
prompt | String | The text prompt used for this call |
images | Array[String] | The images used for this call |
videos | Array[String] | The videos used for this call |
duration | Int | The video duration parameter used for this call |
seed | Int | The random seed parameter used for this call |
aspect_ratio | String | The aspect ratio parameter used for this call |
resolution | String | The resolution parameter used for this call |
bgm | Bool | Whether to use bgm for this call |
audio | Bool | Whether to use audio for this call |
movement_amplitude | String | The camera movement amplitude parameter used for this call |
payload | String | The payload parameter used for this call |
off_peak | Bool | The off_peak parameter used for this call |
credits | String | The credits used for this call |
created_at | String | Task creation time |
{ "task_id": "your_task_id_here", "state": "created", "model": "viduq3-mix", "images": ["https://prod-ss-images.s3.cn-northwest-1.amazonaws.com.cn/vidu-maas/template/reference2video-1.png","https://prod-ss-images.s3.cn-northwest-1.amazonaws.com.cn/vidu-maas/template/reference2video-2.png","https://prod-ss-images.s3.cn-northwest-1.amazonaws.com.cn/vidu-maas/template/reference2video-3.png"], "prompt": "Santa Claus and the bear hug by the lakeside.", "duration": 5, "seed": random_number, "aspect_ratio": "16:9", "resolution": "720p", "bgm": false, "movement_amplitude": "auto", "payload":"", "off_peak": false, "credits": credits_number, "created_at": "2025-01-01T15:41:31.968916Z" }
On this page
Overview
- Request Address
- Request Header
- Subjects-Reference to Video
- Request Body
- Response Body
- Reference to Video
- Request Body
- Response Body