Bytedance
seedance-v1.5-pro
ByteDance’s native audio-visual joint generation model built on a dual-branch DiT architecture, producing synchronized video and audio in a single pass with multilingual lip-sync, cinematic camera control, and narrative coherence.
| | |
| --- | --- |
| **Provider** | Bytedance |
| **Tasks** | text-to-video · image-to-video |
| **Starting from** | 0.0552 USD / call · Pricing details |
POST
seedance-v1.5-pro
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
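As a minimal sketch of the header format described above, the helper below builds the `Authorization` header for a JSON request. The function name `build_auth_headers` is hypothetical and not part of any documented SDK. The `Content-Type` value is taken from the body type listed for this endpoint.

```python
def build_auth_headers(token: str) -> dict[str, str]:
    """Return request headers using the documented 'Bearer <token>' form.

    `build_auth_headers` is an illustrative helper, not an official API.
    """
    token = token.strip()
    if not token:
        raise ValueError("auth token must not be empty")
    return {
        # Documented form: Bearer <token>
        "Authorization": f"Bearer {token}",
        # The request body for this endpoint is application/json.
        "Content-Type": "application/json",
    }
```

The returned dictionary can be passed to whichever HTTP client you use. The token should come from your own secret storage rather than being hard-coded.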
Body
application/json
| Field | Constraint | Available options |
| --- | --- | --- |
| `model` | Required | `seedance-v1.5-pro` |
| `prompt` | Length must be <= 2000 | Maximum string length: 2000 |
| `resolution` | Must be 480p or 720p | `480P`, `720P` |
| `duration` | Must be between 4 and 12 | 5 |
| `extends.audio` | Must be a boolean | |
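The constraints above can be checked on the client before a request is sent. The sketch below is one possible way to do that, under several assumptions that the page does not confirm:

- `extends.audio` is read as a nested object, `{"extends": {"audio": ...}}`.
- `duration` is treated as an integer with inclusive bounds.
- `resolution` is accepted in either case and sent in the upper-case form shown in the options list.
- `prompt` is passed as a mandatory argument, although only `model` is documented as required.

The function name `build_payload`, its defaults, and the example prompt are hypothetical.

```python
import json

ALLOWED_MODELS = {"seedance-v1.5-pro"}
ALLOWED_RESOLUTIONS = {"480P", "720P"}
MAX_PROMPT_LENGTH = 2000
MIN_DURATION, MAX_DURATION = 4, 12  # assumed inclusive


def build_payload(
    prompt: str,
    resolution: str = "720P",
    duration: int = 5,
    audio: bool = True,
    model: str = "seedance-v1.5-pro",
) -> dict:
    """Validate fields against the documented constraints and return the body."""
    if model not in ALLOWED_MODELS:
        raise ValueError(f"model must be one of {sorted(ALLOWED_MODELS)}")
    if len(prompt) > MAX_PROMPT_LENGTH:
        raise ValueError("prompt length must be <= 2000")
    resolution = resolution.upper()  # assumption: case-insensitive input
    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError("resolution must be 480p or 720p")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(duration, bool) or not isinstance(duration, int):
        raise ValueError("duration must be an integer")
    if not MIN_DURATION <= duration <= MAX_DURATION:
        raise ValueError("duration must be between 4 and 12")
    if not isinstance(audio, bool):
        raise ValueError("extends.audio must be a boolean")
    return {
        "model": model,
        "prompt": prompt,
        "resolution": resolution,
        "duration": duration,
        # Assumption: the dotted name maps to a nested object.
        "extends": {"audio": audio},
    }


if __name__ == "__main__":
    body = json.dumps(build_payload("A lighthouse at dusk, slow dolly-in"))
    print(body)
```

Validating locally avoids paying for a call that the server would reject. The server's own validation remains authoritative, so treat these checks as a convenience.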