Seedance 2.0 arrived in February 2026 and immediately made waves — both for its technical leap and for a Hollywood copyright controversy. If you're here for the craft, this guide covers everything you need to write prompts that actually work.
What is Seedance 2?
Seedance 2.0 is ByteDance's next-generation AI video model, built on a unified multimodal audio-video architecture. Unlike every other model in the space, it accepts text, images, video clips, and audio tracks simultaneously — up to 12 reference files in a single generation.
Key specs at a glance:
| Spec | Value |
|---|---|
| Max resolution | 2K (2560×1440) |
| Max duration | 15 seconds |
| Frame rate | 24fps |
| Multi-modal input | Up to 12 files (9 images + 3 videos + 3 audio) |
| Native audio | Stereo (dialogue, ambient, effects) |
| Aspect ratios | 16:9 · 9:16 · 1:1 |
6 Generation Modes
- T2V — Text to video
- I2V — Image to video
- V2V — Video to video (transfer motion or camera style)
- A2V — Audio-driven video (rhythm and mood from audio)
- R2V — Multi-modal mix: combine text + images + video + audio references
- Video Editing — Target specific segments, characters, or actions in existing footage
How Seedance 2 Compares to the Competition
Honest comparison across the five leading AI video models as of early 2026:
| Feature | Seedance 2 | Sora 2 | Kling 3.0 | Veo 3.1 | Runway Gen-4 |
|---|---|---|---|---|---|
| Max resolution | 2K | 1080p | 4K | 1080p | 4K |
| Max duration | 15s | 25s | 15s | 8s | 10s |
| Frame rate | 24fps | 24-30fps | 60fps | 24fps | 24fps |
| Multi-file input | 12 files | — | — | — | — |
| Native audio | Stereo | — | Limited | Yes | — |
| Audio reference input | Yes (unique) | — | — | — | — |
| Physics realism | Excellent | Best | Excellent | Good | Good |
| Free tier | Yes (watermark) | Limited | 66-day credits | Limited | Limited |
When to choose which model
Choose Seedance 2 when you need precise control over composition, motion, and audio rhythm using reference files — or when you need multi-person interactions with consistent characters.
Choose Sora 2 when you need the longest single shot (25s) or the most physically accurate simulation.
Choose Kling 3.0 when you want the highest visual quality (4K/60fps) at the best price per second, with generous free credits to experiment.
Choose Veo 3.1 when native dialogue, sound effects, and music generation are the priority.
Choose Runway Gen-4 when overall perceived quality and a mature creator ecosystem matter most.
The Seedance 2 Prompt Formula
Every prompt should follow this structure:
Subject + Action + Camera + Scene + Style + ConstraintsThink of yourself as a director briefing a cinematographer — not a poet writing prose.
1. Write Like a Director
Be explicit about who does what, how the camera moves, and where the scene takes place.
Weak: A woman running through a city
Strong: A woman in a red wool coat runs toward the camera through a foggy alley at night, face lit by neon reflections on wet pavement. Tracking shot, handheld, shallow depth of field. Cinematic.
2. Use Present Tense + Intensity Modifiers
Always write actions in present tense and add qualifiers for speed and emotion:
slowly turns toward cameraabruptly stops and looks back over her shouldergently lifts the object, examines it closelyrapidly pans across the crowded market
3. Keep It Concise
1–2 structured sentences outperform long paragraphs. The model fills in details well — focus your prompt on the elements you must control.
4. Append Quality Keywords
End your prompt with one or two quality boosters:
... cinematic, 4K, film grain, shallow depth of field
... commercial product photography style, soft studio lighting
... wide-angle, golden hour, anamorphic lens flare
5. Use Camera Language
The model responds well to cinematography vocabulary:
| Keyword | Effect |
|---|---|
low-angle | Powerful, imposing subject |
bird's-eye view | Overview, establishes scale |
close-up | Emotion, intimacy |
tracking shot | Subject movement |
push in | Tension, focus |
pull out | Reveal, context |
dolly zoom | Vertigo, disorientation |
crane shot | Sweeping, epic feel |
handheld | Raw, documentary feel |
rack focus | Shift attention between planes |
6. Add Consistency Constraints
For character-driven scenes, end with:
maintain face and clothing consistency across all frames, no distortion
Upload a reference image of your character via @Image1 for the tightest control.
Using Multi-Modal References (R2V Mode)
This is Seedance 2's biggest differentiator. Upload up to 12 files and reference them inline:
@Image1 A girl breaks through a dimensional wall and travels through the worlds
of famous paintings. She stands under the rotating starry sky of @Image2
with an excited expression, then curiously watches the couple embracing in @Image3.Reference tag syntax:
@Image1,@Image2, …@Image9@Video1,@Video2,@Video3@Audio1,@Audio2,@Audio3
Each tag tells the model how to use that reference — describe it in context.
Ready-to-Use Prompt Examples
Product Commercial
A minimalist matte black mechanical keyboard on a pure white infinite studio
background, rotating smoothly 360 degrees clockwise. RGB lighting breathing
gently. Keycap text sharp and readable. Fixed macro lens, smooth turntable
motion, commercial product photography style, soft studio lighting.Action Scene (with video reference)
A wuxia-style male hero (@Video1 character reference), wearing a black martial
outfit, fighting enemies in a rainy bamboo forest at night. Fast sword combos
with visible sword-light trails and splashing water droplets. Fast-follow
camera, crane shots, and quick close-ups alternating. Cinematic. Maintain
character appearance and clothing consistency. Realistic physics, wet fabric,
rain interaction.Figure Skating (T2V)
Competitive figure skating live performance. Opens with a low-angle shot
following the skate blades gliding on ice — clear details of ice shavings
and light reflections. Enters a spin sequence. Climax: synchronized jump
combination, upright aerial posture, decisive landings. 24fps, cinematic,
audio-visual alignment.Landscape / Travel
A lone hiker crests a ridge at sunrise in the Scottish Highlands. Wide-angle
pull-out as golden light floods the valley below. Wind moves tall grass in
the foreground. Handheld with slow stabilization, anamorphic lens, cinematic
color grade, film grain.Availability & Pricing
Where to use it:
- Jianying (CapCut China) — live for Chinese users
- CapCut — rolling out globally
- Third-party APIs — Kie AI, APIYI, GlobalGPT, WaveSpeed AI
Cost: Free tier available (watermarked). Paid tiers coming. Third-party API access runs approximately $0.10–$0.80 per minute depending on resolution.
Generate Your Seedance 2 Prompts with image2prompt
Not sure how to describe your idea? Use image2prompt to turn any reference image into an optimized video prompt — ready to paste into Seedance 2 or any other AI video model.

