Seedance 2.0 by ByteDance

Seedance 2.0 AI Video Generator

The most powerful way to make video from your own material. Seedance 2.0 is ByteDance's flagship model - combine up to 9 images, 3 videos, and 3 audio clips in one prompt to create cinematic, multi-shot video with stereo sound. Use it free inside 3D AI Studio's Video Studio.

ByteDance SEED · 9 images · 3 videos · 3 audio · Up to 15s multi-shot · Stereo native audio
Example prompt: Slow walk forward, fashion runway, studio lighting
References per clip: 12 assets
Multi-shot length: 15s
Native audio: Stereo
Input modes: Text · Image · Video · Audio

Ways to create

Three ways to create with Seedance 2.0

Animate an image, write from scratch, or compose a clip from your own references - all in one model.

Image to Video

Seedance 2.0

Upload a single image and describe the motion. Seedance animates it with smooth, production-grade movement and stereo sound.

  • Keeps your exact subject
  • Up to 15 seconds, multi-shot
  • Stereo native audio
Text prompt: “White horses galloping through shallow water, drone shot”

Text to Video

Seedance 2.0 Text

Describe a scene in words and Seedance builds it from scratch - cinematic, multi-shot, with synchronized audio.

  • No image needed
  • Great for original scenes
  • Multiple aspect ratios

Reference to Video

Most powerful

Combine up to 9 images, 3 videos, and 3 audio clips. Tag them with @Image1, @Video1, @Audio1 to control exactly how each is used.

  • Character consistency & subject swaps
  • Style and motion transfer
  • Lip-sync from an audio clip

What Seedance 2.0 can do

ByteDance's flagship model, built for production-grade, reference-driven video.

Multi-reference input

Combine up to 9 images, 3 videos, and 3 audio clips in one generation. Tag them as @Image1, @Video1, @Audio1 in your prompt to control exactly how each one is used.

Character consistency

Lock a character's face, outfit, or a product across shots by referencing it directly - perfect for series and brand content.

Multi-shot stories

Generate cinematic, multi-shot sequences up to 15 seconds, with the composition and camera language taken from your references.

Stereo native audio

Two-channel sound with background music, ambient effects, and character voiceover - all generated together and aligned to the action.

Style & motion transfer

Reference the look of one image and the motion of another. Seedance reads each input's role and blends them into one result.

Flexible output

Choose 480p, 720p, or 1080p and aspect ratios from vertical 9:16 to wide 21:9 to fit any platform.

Image to Video

Single-image to video, production-grade

Even without multiple references, Seedance 2.0 animates one image with the kind of smooth, stable motion that holds up in real production - plus stereo sound.

Example results (input image to AI video):

  • Steady, production-grade motion: he raises his hand to cast as the robe and beard settle naturally and the firelight holds its color.
  • Locked-off framing with subtle life - the knight breathes, grips his axe, and the forest sways gently behind him.
  • A clean, stable loop: she rocks slightly and blinks while the mushrooms glow at a consistent rhythm.

Text to Video

Generate video from a text prompt

No references at all - just describe the scene and Seedance builds a cinematic, multi-shot clip with stereo audio.

Example prompts:

  • White horses galloping through shallow water, drone shot
  • Wild horses running through a mountain meadow, golden hour
  • Two warriors clash in a vast desert, cinematic wide shot
  • Aerial drone shot over a misty river valley at dawn
  • Woman faces a T-Rex in a ruined city, cinematic rain
  • Pixar-style penguin sliding down an icy slope, playful

Reference to Video

Reference to Video: many workflows, one model

This is what makes Seedance 2.0 special. Feed it images, videos, and audio, then point to them in your prompt with @ tags to control exactly what happens.

Inputs: a source video plus a reference image tagged @Image1

Prompt: Replace the penguin with the character in @Image1

Seedance kept the original sliding motion and camera from the source video, and swapped in the referenced character - no masking or rotoscoping needed.

Subject replacement

Swap a subject in an existing clip for one from a reference image, keeping the original motion and camera.

replace the subject with @Image1

Character consistency

Reference the same character across generations so faces, outfits, and props stay identical from shot to shot.

@Image1 as the main character

Style transfer

Match the art style, palette, or look of a reference image while generating new motion and composition.

in the style of @Image2

Motion transfer

Apply the movement and pacing of a reference video to a new scene or character.

use the motion from @Video1

Lip-sync & voice

Drive a character's mouth and timing from an audio clip for talking and singing scenes.

lip-sync to @Audio1

Multi-shot composition

Combine several references into a single multi-shot sequence with consistent identity throughout.

@Image1 @Image2 @Video1

More videos made with Seedance

Real outputs from Seedance 2.0 and Seedance 1.5 inside Video Studio.

  • Seedance 2.0: Slow walk forward, fashion runway
  • Reference: Replace the penguin with the character in @Image1
  • Seedance 1.5: 360° product rotation, studio lighting
  • Seedance 2.0: Wild horses in a mountain meadow, golden hour
  • Multi-shot: Cinematic multi-shot duel
  • Cinematic: Woman faces a T-Rex, cinematic rain

How to use Reference to Video, step by step

Seedance's reference mode is the most powerful - and the most precise. Five steps turn your own material into a finished clip.

01. Open Reference to Video

In Video Studio, switch to the Reference to Video mode (powered by Seedance 2.0). This is where you can feed the model your own images, footage, and audio instead of relying on a prompt alone.

02. Add your assets

Upload up to 9 images, 3 video clips, and 3 audio files, up to 12 references in total. As you add them, they are automatically labelled @Image1, @Image2, @Video1, @Audio1, and so on, so you can refer to each one in your prompt.

03. Write a prompt with @ tags

Describe the result and point to your assets by tag. Be explicit about each one's role: 'replace the subject with @Image1', 'use the motion from @Video1', or 'lip-sync the character to @Audio1'. Also say what should stay unchanged.

04. Set resolution, aspect ratio, and length

Pick 480p, 720p, or 1080p, choose an aspect ratio from 9:16 vertical to 21:9 widescreen, and set a duration up to 15 seconds. Keep stereo audio on so music, effects, and voice come out aligned.

05. Generate, iterate, and download

Generate your clip; Seedance composes the references into one cinematic, multi-shot result. If a reference's role wasn't clear, tighten the @-tag wording and regenerate, then download in HD.
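
The labelling in step 02 can be sketched in a few lines of Python. This is only an illustration of the naming scheme and the limits quoted on this page, not how Video Studio implements it: `label_assets` is a hypothetical helper, and reading "12 total" as a combined cap across all asset types is an assumption.

```python
from collections import Counter

# Per-type limits quoted on this page. Treating 12 as a combined cap
# across all asset types is an assumption, not a confirmed rule.
LIMITS = {"Image": 9, "Video": 3, "Audio": 3}
MAX_TOTAL = 12


def label_assets(kinds):
    """Return tags such as '@Image1' for a list of asset kinds, in upload order."""
    counts = Counter()
    tags = []
    for kind in kinds:
        if kind not in LIMITS:
            raise ValueError(f"unknown asset kind: {kind!r}")
        counts[kind] += 1
        if counts[kind] > LIMITS[kind]:
            raise ValueError(f"too many {kind} assets (max {LIMITS[kind]})")
        tags.append(f"@{kind}{counts[kind]}")
    if len(tags) > MAX_TOTAL:
        raise ValueError(f"too many assets in total (max {MAX_TOTAL})")
    return tags


print(label_assets(["Image", "Image", "Video", "Audio"]))
# ['@Image1', '@Image2', '@Video1', '@Audio1']
```

Numbering each asset type separately means the second image is always @Image2, no matter how many videos or audio clips were added before it.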

Prompt guide

How to write great Seedance 2.0 prompts

The key is telling Seedance what each reference is for. Use @ tags and clear, simple instructions.

Tag every reference

Point to each asset by its tag so Seedance knows its role in the scene.

@Image1 walks through the doorway from @Video1's background

Say what to keep

Be explicit about what should stay consistent - the face, the outfit, the product.

keep @Image1's face and jacket exactly the same

Describe the motion

Even with references, describe how things should move and how the camera behaves.

slow orbit around the subject, smooth and steady

Use audio for timing

Reference an audio clip to drive lip-sync, rhythm, or beat-matched motion.

lip-sync the character to @Audio1
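
Because every reference needs a tag, it can help to check a prompt before generating. The sketch below is a local, hypothetical helper (`check_tags` is not part of any Seedance or 3D AI Studio API). It assumes tags always follow the @Image1 / @Video1 / @Audio1 pattern shown on this page, and it reports tags that point to nothing as well as uploads the prompt never mentions.

```python
import re

# Assumes tags look like @Image1, @Video2, @Audio3, as shown on this page.
TAG_PATTERN = re.compile(r"@(?:Image|Video|Audio)\d+")


def check_tags(prompt, uploaded_tags):
    """Compare the @ tags used in a prompt with the tags of uploaded assets."""
    used = set(TAG_PATTERN.findall(prompt))
    uploaded = set(uploaded_tags)
    return {
        "unknown": sorted(used - uploaded),  # tagged in the prompt, never uploaded
        "unused": sorted(uploaded - used),   # uploaded, never mentioned in the prompt
    }


report = check_tags(
    "@Image1 walks through the doorway from @Video1's background",
    ["@Image1", "@Image2", "@Video1"],
)
print(report)
# {'unknown': [], 'unused': ['@Image2']}
```

An unused reference is not necessarily a mistake, but it is a useful prompt to either give that asset a role or remove it.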

Seedance 2.0 specs

Everything you can control when you generate.

Provider: ByteDance SEED Lab
Released: Seedance 2.0 - February 2026
Input modes: Text · Image · Video · Audio (reference)
Max references: 9 images + 3 videos + 3 audio (12 total)
Duration: Up to 15 seconds, multi-shot
Audio: Stereo native (music, SFX, voiceover)
Resolution: 480p · 720p · 1080p
Aspect ratios: 16:9 · 9:16 · 1:1 · 4:3 · 21:9
Modes in Studio: Seedance 2.0 · Reference · Text (+ Seedance 1.5)
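
If you plan clips in a script or spreadsheet before opening Video Studio, the specs above can be written down as a small validation sketch. `ClipSettings` and its default values are hypothetical, chosen for illustration; only the allowed resolutions, aspect ratios, and the 15-second maximum come from this page.

```python
from dataclasses import dataclass

# Allowed values copied from the specs listed on this page.
RESOLUTIONS = ("480p", "720p", "1080p")
ASPECT_RATIOS = ("16:9", "9:16", "1:1", "4:3", "21:9")
MAX_DURATION_S = 15


@dataclass(frozen=True)
class ClipSettings:
    """Hypothetical container for the options chosen before generating."""

    resolution: str = "720p"    # default is an assumption
    aspect_ratio: str = "16:9"  # default is an assumption
    duration_s: float = 5       # default is an assumption
    stereo_audio: bool = True

    def __post_init__(self):
        # Reject any value that is not listed in the specs.
        if self.resolution not in RESOLUTIONS:
            raise ValueError(f"resolution must be one of {RESOLUTIONS}")
        if self.aspect_ratio not in ASPECT_RATIOS:
            raise ValueError(f"aspect ratio must be one of {ASPECT_RATIOS}")
        if not 0 < self.duration_s <= MAX_DURATION_S:
            raise ValueError(
                f"duration must be above 0 and at most {MAX_DURATION_S} seconds"
            )


vertical = ClipSettings(resolution="1080p", aspect_ratio="9:16", duration_s=15)
print(vertical)
```

The page does not state a minimum duration, so the sketch only rejects values of zero or below.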

Seedance 2.0 vs Seedance 1.5

Seedance 2.0 adds multi-reference input, stereo audio, and multi-shot. 1.5 stays great for fast, stable single clips.

Feature | Seedance 2.0 | Seedance 1.5
Best for | Production, multi-reference | Stable, fast single clips
Reference inputs | 9 img + 3 vid + 3 audio | Single image
Native audio | Stereo | Off (in Studio)
Max duration | 15s, multi-shot | 10s
Resolution | Up to 1080p | Up to 1080p
Relative cost | Premium | Lower

What people make with Seedance 2.0

Character series

Keep the same character across many clips using references.

Product & e-commerce

Drop a product image into a scene and animate it.

Ads with voiceover

Combine visuals and a voice sample into one finished spot.

Subject swaps

Replace a subject in a clip while keeping the motion.

Style matching

Generate new shots that match a brand or reference look.

Lip-sync scenes

Talking and singing characters driven by an audio clip.

Multi-shot stories

Build cinematic sequences from your own material.

Music videos

Beat-matched, reference-driven visuals.

One subscription

Feed Seedance your best material

Generate reference characters and products in Image Studio, composite them with Seedance, and turn a hero asset into a 3D model - all on one account and one credit balance.

What is Seedance 2.0?

Seedance 2.0 is ByteDance's flagship AI video model, released in February 2026. Its standout feature is multimodal reference input: in a single generation you can combine up to 9 images, 3 video clips, and 3 audio clips along with your text prompt. You point to each asset in your prompt using simple tags like @Image1, @Video1, and @Audio1, and Seedance reads how each one should be used - for composition, motion, style, or sound.

The result is cinematic, multi-shot video up to 15 seconds long, with two-channel stereo audio generated at the same time. This is what makes it so powerful for real production work: instead of chaining several tools together, you give Seedance your reference image, a voice sample, and a description, and it returns a finished clip. In 3D AI Studio, Seedance 2.0 is available in three modes: Image to Video, Reference to Video, and Text to Video.

Reference to Video, explained

Reference to Video is the mode that sets Seedance 2.0 apart. You upload a mix of images, videos, and audio, then write a prompt that tells the model what to do with them. A common example: take an existing clip of a penguin sliding on ice, add a character image as @Image1, and prompt 'replace the penguin with the character in @Image1'. Seedance keeps the original motion but swaps in your character - exactly the example shown above on this page.

You can use the same idea for character consistency (keep one face across shots), style transfer (match the look of a reference image), motion transfer (apply the movement of a reference video), and lip-sync (drive a character with an audio clip). Because everything happens in one pass, the timing and audio stay aligned, and you skip the manual masking, rotoscoping, and editing those tasks usually require.

When to use each Seedance mode

Use Image to Video when you have one picture you want to bring to life - a character, a product, or a scene - and you want it to move while staying exactly on-model. Use Text to Video when you have an idea but no footage or image, and want Seedance to build the whole scene from a description.

Use Reference to Video when you have several pieces of material to combine, or when you need precise control: keeping a character consistent, transferring motion or style, swapping a subject, or syncing to audio. It's the most powerful mode and the reason many teams choose Seedance for production work.

Seedance 2.0 vs Veo 3.1 vs Kling 3.0

Seedance 2.0 is the best choice when you're building from your own material - references, products, characters, and voices you want to keep consistent. Google Veo 3.1 is the leader for realistic talking and lip-sync from a simple prompt, and Kling 3.0 excels at long, cinematic single shots and multi-shot storyboards.

All three are included in every 3D AI Studio plan. A common workflow is to generate a character image in Image Studio, animate or composite it with Seedance 2.0, then use Veo or Kling for additional shots - and even turn the character into a 3D model with Image to 3D.

Tips for better Seedance 2.0 videos

Tag your references clearly and tell Seedance what each one is for. A prompt like 'place @Image1 in the setting from @Video1, keep @Image1's outfit, slow dolly-in' gives the model a precise plan to follow.

Start with clean, high-quality reference images and clips - Seedance preserves what you give it, so sharp inputs lead to sharp results. For consistency across a series, reuse the exact same reference image each time. And if you only need a quick, stable single clip without references, Seedance 1.5 is a faster, lower-cost option in the same Studio.
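
The tagged-prompt tip above follows a simple shape: an action, what to keep, then the camera move. As a rough sketch, a hypothetical `build_prompt` helper (an illustration only, not a Seedance feature) could assemble prompts in that order so every clip in a series is worded the same way.

```python
def build_prompt(action, keep=(), camera=None):
    """Join an action, things to keep, and a camera move into one prompt."""
    parts = [action]
    # Spell out each element that must stay unchanged.
    parts.extend(f"keep {item}" for item in keep)
    if camera:
        parts.append(camera)
    return ", ".join(parts)


prompt = build_prompt(
    "place @Image1 in the setting from @Video1",
    keep=["@Image1's outfit"],
    camera="slow dolly-in",
)
print(prompt)
# place @Image1 in the setting from @Video1, keep @Image1's outfit, slow dolly-in
```

Reusing the same wording alongside the same reference image makes it easier to see which change caused a difference between two generations.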

Explore other video models

Every plan includes access to all of them. Pick the right tool for each shot.

Start creating with Seedance 2.0

Open Video Studio and generate your first clip in minutes. Free credits to start.