Generate videos with the Google Veo 3.1 model from text prompts, images, or reference images.
| Aspect Ratio | Resolution | Duration Options | Notes |
|---|---|---|---|
| 16:9 | 720p | 4s, 6s, 8s | All features supported |
| 9:16 | 720p | 4s, 6s, 8s | All features supported |
| 16:9 | 1080p | 8s only | Reference images supported |
| 9:16 | 1080p | 8s only | Reference images supported |
| 16:9 | 4k | 8s only | Higher latency and cost |
| 9:16 | 4k | 8s only | Higher latency and cost |
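The table above can be encoded as a lookup for client-side checks before a request is submitted. The following is a minimal sketch; the function name `check_combination` and the tuple-keyed dictionary are illustrative choices, not part of the API.

```python
# Allowed durations in seconds per (aspect ratio, resolution), copied from the table above.
ALLOWED_DURATIONS = {
    ("16:9", "720p"): (4, 6, 8),
    ("9:16", "720p"): (4, 6, 8),
    ("16:9", "1080p"): (8,),
    ("9:16", "1080p"): (8,),
    ("16:9", "4k"): (8,),
    ("9:16", "4k"): (8,),
}


def check_combination(aspect_ratio: str, resolution: str, duration: int) -> None:
    """Raise ValueError if the combination does not appear in the table."""
    key = (aspect_ratio, resolution)
    if key not in ALLOWED_DURATIONS:
        raise ValueError(
            f"unsupported combination: {aspect_ratio} at {resolution}"
        )
    if duration not in ALLOWED_DURATIONS[key]:
        allowed = ", ".join(f"{d}s" for d in ALLOWED_DURATIONS[key])
        raise ValueError(
            f"{aspect_ratio} at {resolution} allows only: {allowed}"
        )
```

Calling `check_combination("16:9", "1080p", 4)` raises a `ValueError`, while `check_combination("16:9", "720p", 4)` returns without error.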
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
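As a small illustration of the header format, the sketch below builds the header dictionary that most HTTP clients accept. It assumes the standard `Authorization` header name used for Bearer tokens; the helper name `auth_headers` is hypothetical.

```python
def auth_headers(token: str) -> dict[str, str]:
    """Build the Bearer authentication header from an auth token."""
    token = token.strip()
    if not token:
        raise ValueError("auth token must not be empty")
    # Assumption: the API reads the token from the standard Authorization header.
    return {"Authorization": f"Bearer {token}"}
```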
Text description for the video. Supports audio cues.
Use descriptive language including:
Audio Prompting:
Maximum length: 2000
Text describing what not to include in the video.
Do not use instructive language like "no" or "don't". Instead, describe what you don't want to see (e.g., "wall, frame" instead of "No walls").
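One way to follow this advice is to assemble the negative prompt from a list of unwanted elements and reject entries that contain instructive words. This is only a sketch: the helper name `build_negative_prompt` is hypothetical, and the check covers just the two words named above ("no" and "don't").

```python
import re

# Matches the instructive words named in the text above, as whole words.
_INSTRUCTIVE = re.compile(r"\b(?:no|don't)\b", re.IGNORECASE)


def build_negative_prompt(unwanted: list[str]) -> str:
    """Join unwanted elements into a comma-separated negative prompt."""
    cleaned = []
    for item in unwanted:
        text = item.strip()
        if not text:
            continue  # skip empty entries
        if _INSTRUCTIVE.search(text):
            raise ValueError(
                f"describe the element itself instead of instructing: {text!r}"
            )
        cleaned.append(text)
    return ", ".join(cleaned)
```

For example, `build_negative_prompt(["wall", "frame"])` returns `"wall, frame"`, while `build_negative_prompt(["No walls"])` raises a `ValueError`.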
Maximum length: 500
Initial image to animate (first frame). Can be a URL or Base64-encoded data.
Format for Base64: data:image/png;base64,{base64_data}
Supported formats: JPEG, JPG, PNG, BMP, WEBP
Max file size: 20MB
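The Base64 form can be produced from raw image bytes as sketched below. The source only shows the `data:image/png;base64,...` pattern, so the MIME types used here for the other formats are an assumption, as is interpreting 20MB as 20 × 1024 × 1024 bytes. The helper name `to_data_uri` is hypothetical.

```python
import base64

# Assumption: 20MB is interpreted as binary megabytes.
MAX_BYTES = 20 * 1024 * 1024

# Assumption: formats other than PNG use their usual MIME types in the data URI.
MIME_TYPES = {
    "jpeg": "image/jpeg",
    "jpg": "image/jpeg",
    "png": "image/png",
    "bmp": "image/bmp",
    "webp": "image/webp",
}


def to_data_uri(data: bytes, extension: str) -> str:
    """Encode raw image bytes as a Base64 data URI."""
    ext = extension.lower().lstrip(".")
    if ext not in MIME_TYPES:
        raise ValueError(f"unsupported image format: {extension}")
    if len(data) > MAX_BYTES:
        raise ValueError("image exceeds the 20MB size limit")
    encoded = base64.b64encode(data).decode("ascii")
    return f"data:{MIME_TYPES[ext]};base64,{encoded}"
```

In practice the bytes would come from reading the image file in binary mode, for example `to_data_uri(open(path, "rb").read(), "png")`.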
Final image for interpolation video. Must be used in combination with the image parameter.
Format for Base64: data:image/png;base64,{base64_data}
Supported formats: JPEG, JPG, PNG, BMP, WEBP
Max file size: 20MB
Up to 3 images to be used as style and content references. Provide images of a person, character, or product to preserve the subject's appearance in the output video.
Each item can be a URL or Base64 encoded data.
Format for Base64: data:image/png;base64,{base64_data}
Note: When using reference images, duration must be 8 seconds.
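The two reference-image constraints (at most 3 images, and an 8-second duration whenever any are supplied) can be checked before sending a request. This is a minimal sketch; the function and argument names are hypothetical and do not reflect the API's actual field names.

```python
MAX_REFERENCE_IMAGES = 3
REFERENCE_DURATION = 8  # seconds, required whenever reference images are used


def check_reference_images(reference_images: list[str], duration: int) -> None:
    """Raise ValueError if the reference-image constraints are violated."""
    if len(reference_images) > MAX_REFERENCE_IMAGES:
        raise ValueError(
            f"at most {MAX_REFERENCE_IMAGES} reference images are allowed, "
            f"got {len(reference_images)}"
        )
    if reference_images and duration != REFERENCE_DURATION:
        raise ValueError(
            f"duration must be {REFERENCE_DURATION} seconds with reference images"
        )
```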
Maximum items: 3
Video aspect ratio (width:height).
Options: 16:9, 9:16
Video resolution.
Note: 1080p and 4k only support 8 second duration.
Options: 720p, 1080p, 4k
Length of the generated video in seconds.
Note: Must be 8 when using reference images, 1080p, or 4k resolution.
Options: 4, 6, 8
Accepted - Task created successfully