What is reference-to-video?

Reference-to-video lets you upload a mix of images, videos, and audio and reference them in your prompt with tags like @Image1, @Video1, and @Audio1. For example, you can take a clip and prompt 'replace the subject with @Image1' to swap in your own character while keeping the original motion.

How many references can Seedance 2.0 use?

Up to 12 assets in one generation: a maximum of 9 images, 3 video clips, and 3 audio clips combined. You tag each one in your prompt so the model knows exactly how to use it.

Can Seedance 2.0 swap a subject in a video?

Yes. Add the source clip and a character image, then prompt something like 'replace the subject with @Image1'. Seedance keeps the original motion and camera while swapping in your referenced subject - no manual masking or rotoscoping required.

Does Seedance 2.0 generate audio?

Yes. Seedance 2.0 produces two-channel stereo audio in the same pass as the video, including background music, ambient sound effects, and character voiceover, all aligned to the on-screen action.

Can Seedance 2.0 keep a character consistent across clips?

Yes. By referencing the same character image across generations, Seedance keeps faces, outfits, and products visually consistent - which is ideal for series, ads, and brand content.

Can Seedance do style and motion transfer?

Yes. Reference an image with @Image to match its style, and reference a video with @Video to apply its motion to a new scene or character. You can combine both in a single prompt.

Does Seedance 2.0 support lip-sync?

Yes. Add an audio clip as a reference and prompt the model to lip-sync a character to it, which is useful for talking and singing scenes.

What does the @Image1 syntax mean?

When you add references, Seedance labels them in order - @Image1, @Image2, @Video1, @Audio1, and so on. You type those tags directly in your prompt to tell the model which asset to use and how, like 'put @Image1 in the setting from @Video1'. It's how you direct a multi-reference generation precisely.

Do I have to use references, or can I just type a prompt?

References are optional. Seedance 2.0 also offers plain Text to Video (describe a scene from scratch) and Image to Video (animate one image). Reference to Video is the most powerful mode, but you only need it when you're combining or controlling your own material.

How is Seedance 2.0 different from Seedance 1.0?

Seedance 1.0 was a strong single-clip text-and-image model. Seedance 2.0 adds the unified multimodal architecture: multi-reference input (images, video, and audio together), stereo native audio, multi-shot output, and noticeably higher quality in complex motion and interaction scenes.

What is the difference between Seedance 2.0 and Seedance 1.5?

Seedance 2.0 adds multimodal reference input, stereo native audio, and multi-shot output up to 15 seconds. Seedance 1.5 is a great, lower-cost choice for fast, stable single clips with flexible resolution and aspect-ratio control, and it's available in the same Studio.

What resolutions and aspect ratios does Seedance 2.0 support?

It supports 480p, 720p, and 1080p, with aspect ratios including 16:9, 9:16, 1:1, 4:3, and 21:9 - so you can target everything from cinematic widescreen to vertical social video, all from the same model.

Do I need a CapCut, Dreamina, or ByteDance account?

No. Seedance 2.0 is built into 3D AI Studio's Video Studio. You open the Studio, choose Seedance, and generate - billing runs on 3D AI Studio credits, with no separate ByteDance, Dreamina, or CapCut sign-up.

How long does a Seedance 2.0 generation take?

Because it's a premium, multi-shot model, Seedance 2.0 takes longer than lightweight models - typically a couple of minutes depending on length, resolution, and how many references you include. You can move the job to the background and keep working while it renders.

How many credits does Seedance 2.0 cost?

Seedance 2.0 is a premium model, so it uses more credits than lighter options, scaled by resolution and length. The exact credit cost is shown in Video Studio before you generate. For cheaper, fast single clips, Seedance 1.5 is the budget option.

Is Seedance 2.0 free to try?

Yes. New 3D AI Studio accounts include free credits so you can try Seedance 2.0, including Reference to Video, before choosing a plan.

Can I use Seedance 2.0 videos commercially?

Yes. Videos generated on a paid plan include commercial rights for use in ads, social media, e-commerce, product content, and client projects.

Seedance 2.0 by ByteDance

Seedance 2.0 AI Video Generator

The most powerful way to make video from your own material. Seedance 2.0 is ByteDance's flagship model - combine up to 9 images, 3 videos, and 3 audio clips in one prompt to create cinematic, multi-shot video with stereo sound. Use it free inside 3D AI Studio's Video Studio.

ByteDance SEED9 images · 3 videos · 3 audioUp to 15s multi-shotStereo native audio

Compare all AI video models

Prompt

“Slow walk forward, fashion runway, studio lighting”

Seedance

12 assets

References per clip

15s

Multi-shot length

Stereo

Native audio

Text·Image·Video·Audio

Input modes

Ways to create

Three ways to create with Seedance 2.0

Animate an image, write from scratch, or compose a clip from your own references - all in one model.

Input image

Image to Video

Seedance 2.0

Upload a single image and describe the motion. Seedance animates it with smooth, production-grade movement and stereo sound.

Keeps your exact subject
Up to 15 seconds, multi-shot
Stereo native audio

Try Image to Video

Text prompt

“White horses galloping through shallow water, drone shot”

Text to Video

Seedance 2.0 Text

Describe a scene in words and Seedance builds it from scratch - cinematic, multi-shot, with synchronized audio.

No image needed
Great for original scenes
Multiple aspect ratios

Try Text to Video

@Image1

Reference to Video

Most powerful

Combine up to 9 images, 3 videos, and 3 audio clips. Tag them with @Image1, @Video1, @Audio1 to control exactly how each is used.

Character consistency & subject swaps
Style and motion transfer
Lip-sync from an audio clip

Try Reference to Video

What Seedance 2.0 can do

ByteDance's flagship model, built for production-grade, reference-driven video.

Multi-reference input

Combine up to 9 images, 3 videos, and 3 audio clips in one generation. Tag them as @Image1, @Video1, @Audio1 in your prompt to control exactly how each one is used.

Character consistency

Lock a character's face, outfit, or a product across shots by referencing it directly - perfect for series and brand content.

Multi-shot stories

Generate cinematic, multi-shot sequences up to 15 seconds, with the composition and camera language taken from your references.

Stereo native audio

Two-channel sound with background music, ambient effects, and character voiceover - all generated together and aligned to the action.

Style & motion transfer

Reference the look of one image and the motion of another. Seedance reads each input's role and blends them into one result.

Flexible output

Choose 480p, 720p, or 1080p and aspect ratios from vertical 9:16 to wide 21:9 to fit any platform.

Image to Video

Single-image to video, production-grade

Even without multiple references, Seedance 2.0 animates one image with the kind of smooth, stable motion that holds up in real production - plus stereo sound.

Input image

AI video

motion“Steady, production-grade motion: he raises his hand to cast as the robe and beard settle naturally and the firelight holds its color.”

Input image

AI video

motion“Locked-off framing with subtle life - the knight breathes, grips his axe, and the forest sways gently behind him.”

Input image

AI video

motion“A clean, stable loop: she rocks slightly and blinks while the mushrooms glow at a consistent rhythm.”

Text to Video

Generate video from a text prompt

No references at all - just describe the scene and Seedance builds a cinematic, multi-shot clip with stereo audio.

Text to Video

“White horses galloping through shallow water, drone shot”

Text to Video

“Wild horses running through a mountain meadow, golden hour”

Text to Video

“Two warriors clash in a vast desert, cinematic wide shot”

Text to Video

“Aerial drone shot over a misty river valley at dawn”

Text to Video

“Woman faces a T-Rex in a ruined city, cinematic rain”

Text to Video

“Pixar-style penguin sliding down an icy slope, playful”

Reference to Video

Reference to Video: many workflows, one model

This is what makes Seedance 2.0 special. Feed it images, videos, and audio, then point to them in your prompt with @ tags to control exactly what happens.

Source video

@Image1

Result

prompt“Replace the penguin with the character in @Image1”

Seedance kept the original sliding motion and camera from the source video, and swapped in the referenced character - no masking or rotoscoping needed.

Subject replacement

Swap a subject in an existing clip for one from a reference image, keeping the original motion and camera.

replace the subject with @Image1

Character consistency

Reference the same character across generations so faces, outfits, and props stay identical from shot to shot.

@Image1 as the main character

Style transfer

Match the art style, palette, or look of a reference image while generating new motion and composition.

in the style of @Image2

Motion transfer

Apply the movement and pacing of a reference video to a new scene or character.

use the motion from @Video1

Lip-sync & voice

Drive a character's mouth and timing from an audio clip for talking and singing scenes.

lip-sync to @Audio1

Multi-shot composition

Combine several references into a single multi-shot sequence with consistent identity throughout.

@Image1 @Image2 @Video1

How to use Reference to Video, step by step

Seedance's reference mode is the most powerful - and the most precise. Five steps turn your own material into a finished clip.

Open Reference to Video

In Video Studio, switch to the Reference to Video mode (powered by Seedance 2.0). This is where you can feed the model your own images, footage, and audio instead of relying on a prompt alone.

Add your assets

Upload up to 9 images, 3 video clips, and 3 audio files (12 references total). As you add them they're automatically labelled @Image1, @Image2, @Video1, @Audio1, and so on, so you can refer to each one in your prompt.

Write a prompt with @ tags

Describe the result and point to your assets by tag. Be explicit about each one's role: 'replace the subject with @Image1', 'use the motion from @Video1', or 'lip-sync the character to @Audio1', and say what should stay unchanged.

Set resolution, aspect ratio and length

Pick 480p, 720p, or 1080p, choose an aspect ratio from 9:16 vertical to 21:9 widescreen, and a duration up to 15 seconds. Keep stereo audio on so music, effects, and voice come out aligned.

Generate, iterate and download

Generate your clip; Seedance composes the references into one cinematic, multi-shot result. If a reference's role wasn't clear, tighten the @-tag wording and regenerate, then download in HD.

Prompt guide

How to write great Seedance 2.0 prompts

The key is telling Seedance what each reference is for. Use @ tags and clear, simple instructions.

Tag every reference

Point to each asset by its tag so Seedance knows its role in the scene.

“@Image1 walks through the doorway from @Video1's background”

Say what to keep

Be explicit about what should stay consistent - the face, the outfit, the product.

“keep @Image1's face and jacket exactly the same”

Describe the motion

Even with references, describe how things should move and how the camera behaves.

“slow orbit around the subject, smooth and steady”

Use audio for timing

Reference an audio clip to drive lip-sync, rhythm, or beat-matched motion.

“lip-sync the character to @Audio1”

Seedance 2.0 specs

Everything you can control when you generate.

ProviderByteDance SEED Lab

ReleasedSeedance 2.0 - February 2026

Input modesText · Image · Video · Audio (reference)

Max references9 images + 3 videos + 3 audio (12 total)

DurationUp to 15 seconds, multi-shot

AudioStereo native (music, SFX, voiceover)

Resolution480p · 720p · 1080p

Aspect ratios16:9 · 9:16 · 1:1 · 4:3 · 21:9

Modes in StudioSeedance 2.0 · Reference · Text (+ Seedance 1.5)

Seedance 2.0 vs Seedance 1.5

Seedance 2.0 adds multi-reference input, stereo audio, and multi-shot. 1.5 stays great for fast, stable single clips.

Feature	Seedance 2.0	Seedance 1.5
Best for	Production, multi-reference	Stable, fast single clips
Reference inputs	9 img + 3 vid + 3 audio	Single image
Native audio	Stereo	Off (in Studio)
Max duration	15s, multi-shot	10s
Resolution	Up to 1080p	Up to 1080p
Relative cost	Premium	Lower

What people make with Seedance 2.0

Character series

Keep the same character across many clips using references.

Product & e-commerce

Drop a product image into a scene and animate it.

Ads with voiceover

Combine visuals and a voice sample into one finished spot.

Subject swaps

Replace a subject in a clip while keeping the motion.

Style matching

Generate new shots that match a brand or reference look.

Lip-sync scenes

Talking and singing characters driven by an audio clip.

Multi-shot stories

Build cinematic sequences from your own material.

Music videos

Beat-matched, reference-driven visuals.

One subscription

Feed Seedance your best material

Generate reference characters and products in Image Studio, composite them with Seedance, and turn a hero asset into a 3D model - all on one account and one credit balance.

Image Studio

Generate & edit images

Create the perfect input image with 100+ AI tools, then animate it here.

Image to 3D

Turn images into 3D

Convert any image or video frame into a production-ready 3D model.

Text to 3D

Generate 3D from text

Describe any object and get a textured 3D model in seconds.

What is Seedance 2.0?

Seedance 2.0 is ByteDance's flagship AI video model, released in February 2026. Its standout feature is multimodal reference input: in a single generation you can combine up to 9 images, 3 video clips, and 3 audio clips along with your text prompt. You point to each asset in your prompt using simple tags like @Image1, @Video1, and @Audio1, and Seedance reads how each one should be used - for composition, motion, style, or sound.

The result is cinematic, multi-shot video up to 15 seconds long, with two-channel stereo audio generated at the same time. This is what makes it so powerful for real production work: instead of chaining several tools together, you give Seedance your reference image, a voice sample, and a description, and it returns a finished clip. In 3D AI Studio you get Seedance 2.0 for image-to-video, its Reference-to-Video mode, and a Text-to-Video mode.

Reference to Video, explained

Reference to Video is the mode that sets Seedance 2.0 apart. You upload a mix of images, videos, and audio, then write a prompt that tells the model what to do with them. A common example: take an existing clip of a penguin sliding on ice, add a character image as @Image1, and prompt 'replace the penguin with the character in @Image1'. Seedance keeps the original motion but swaps in your character - exactly the example shown above on this page.

You can use the same idea for character consistency (keep one face across shots), style transfer (match the look of a reference image), motion transfer (apply the movement of a reference video), and lip-sync (drive a character with an audio clip). Because everything happens in one pass, the timing and audio stay perfectly aligned, and you skip the manual masking, rotoscoping, and editing those tasks usually require.

When to use each Seedance mode

Use Image to Video when you have one picture you want to bring to life - a character, a product, or a scene - and you want it to move while staying exactly on-model. Use Text to Video when you have an idea but no footage or image, and want Seedance to build the whole scene from a description.

Use Reference to Video when you have several pieces of material to combine, or when you need precise control: keeping a character consistent, transferring motion or style, swapping a subject, or syncing to audio. It's the most powerful mode and the reason many teams choose Seedance for production work.

Seedance 2.0 vs Veo 3.1 vs Kling 3.0

Seedance 2.0 is the best choice when you're building from your own material - references, products, characters, and voices you want to keep consistent. Google Veo 3.1 is the leader for realistic talking and lip-sync from a simple prompt, and Kling 3.0 excels at long, cinematic single shots and multi-shot storyboards.

All three are included in every 3D AI Studio plan. A common workflow is to generate a character image in Image Studio, animate or composite it with Seedance 2.0, then use Veo or Kling for additional shots - and even turn the character into a 3D model with Image to 3D.

Tips for better Seedance 2.0 videos

Tag your references clearly and tell Seedance what each one is for. A prompt like 'place @Image1 in the setting from @Video1, keep @Image1's outfit, slow dolly-in' gives the model a precise plan to follow.

Start with clean, high-quality reference images and clips - Seedance preserves what you give it, so sharp inputs lead to sharp results. For consistency across a series, reuse the exact same reference image each time. And if you only need a quick, stable single clip without references, Seedance 1.5 is a faster, lower-cost option in the same Studio.

Explore other video models

Every plan includes access to all of them. Pick the right tool for each shot.

Start creating with Seedance 2.0

Open Video Studio and generate your first clip in minutes. Free credits to start.

Seedance 2.0 AI Video Generator

Three ways to create with Seedance 2.0

Image to Video

Text to Video

Reference to Video

What Seedance 2.0 can do

Multi-reference input

Character consistency

Multi-shot stories

Stereo native audio

Style & motion transfer

Flexible output

Single-image to video, production-grade

Generate video from a text prompt

Reference to Video: many workflows, one model

Subject replacement

Character consistency

Style transfer

Motion transfer

Lip-sync & voice

Multi-shot composition

More videos made with Seedance

How to use Reference to Video, step by step

Open Reference to Video

Add your assets

Write a prompt with @ tags

Set resolution, aspect ratio and length

Generate, iterate and download

How to write great Seedance 2.0 prompts

Tag every reference

Say what to keep

Describe the motion

Use audio for timing

Seedance 2.0 specs

Seedance 2.0 vs Seedance 1.5

What people make with Seedance 2.0

Character series

Product & e-commerce

Ads with voiceover

Subject swaps

Style matching

Lip-sync scenes

Multi-shot stories

Music videos

Feed Seedance your best material

Generate & edit images

Turn images into 3D

Generate 3D from text

What is Seedance 2.0?

Reference to Video, explained

When to use each Seedance mode

Seedance 2.0 vs Veo 3.1 vs Kling 3.0

Tips for better Seedance 2.0 videos

Explore other video models

Veo 3.1

Kling 3.0

All video models

What is Seedance 2.0?

What is reference-to-video?

How many references can Seedance 2.0 use?

Can Seedance 2.0 swap a subject in a video?

Does Seedance 2.0 generate audio?

Can Seedance 2.0 keep a character consistent across clips?

Can Seedance do style and motion transfer?

Does Seedance 2.0 support lip-sync?

What does the @Image1 syntax mean?

Do I have to use references, or can I just type a prompt?

How is Seedance 2.0 different from Seedance 1.0?

What is the difference between Seedance 2.0 and Seedance 1.5?

What resolutions and aspect ratios does Seedance 2.0 support?

Do I need a CapCut, Dreamina, or ByteDance account?

How long does a Seedance 2.0 generation take?

How many credits does Seedance 2.0 cost?

Is Seedance 2.0 free to try?

Can I use Seedance 2.0 videos commercially?

Start creating with Seedance 2.0