> For the complete documentation index, see [llms.txt](https://docs.chatvideopro.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.chatvideopro.com/features/video-generation/supported-video-models.md).

# Supported Video Models

<figure><img src="/files/oi5WVzsK6Ea3I1gj6xFr" alt=""><figcaption></figcaption></figure>

Chat Video Pro includes several video models because no single model is best at everything. Some are better for dialogue. Some are better for cinematic camera movement. Some are better for fast drafts, longer clips, reference consistency, or mobile-first content.

This page is a practical chooser guide. You do not need to memorize every model. Start from the type of shot you want, then choose the model that fits the job.

{% hint style="info" %}
If you are using **Studio**, the workflow often chooses the model path for you. For example, Motion Director uses Kling 3.0 Pro Image-to-Video, AI Transitions uses Kling O3 transition models, and Add Effects uses Kling O3 VFX. Use this page when you are choosing models directly from the Video Generation model selector.
{% endhint %}

***

### Quick Recommendations

<table><thead><tr><th width="241">If you need...</th><th>Start with...</th><th>Why</th></tr></thead><tbody><tr><td>Dialogue, speech, or generated audio</td><td><strong>Veo 3.1</strong> or <strong>Veo 3.1 Fast</strong></td><td>Strongest choice when audio and lip-sync matter.</td></tr><tr><td>Fast lower-cost Veo drafts without audio</td><td><strong>Veo 3.1 Lite</strong></td><td>Good for visual drafts, B-roll ideas, and transitions when you will add sound in Premiere.</td></tr><tr><td>Cinematic motion and strong general quality</td><td><strong>Kling 3.0 Pro</strong> or <strong>Kling O3 Pro</strong></td><td>Strong camera movement, action, and polished motion.</td></tr><tr><td>Longer cinematic clips without audio</td><td><strong>Sora 2</strong> or <strong>Sora 2 Pro</strong></td><td>Good for cinematic B-roll and longer visual generations.</td></tr><tr><td>Natural motion with ambient audio</td><td><strong>Seedance 2</strong></td><td>Good all-around model for natural motion and audio-enabled scenes.</td></tr><tr><td>Action, sports, or fast movement</td><td><strong>Hailuo 2.3</strong></td><td>Strong for dynamic action and energetic movement.</td></tr><tr><td>Fast mobile-first content</td><td><strong>Grok</strong></td><td>Fast generations and unusual mobile aspect ratios.</td></tr><tr><td>High-resolution 1080p-style outputs</td><td><strong>Wan 2.7</strong></td><td>Good when clarity, flexible aspect ratios, or reference workflows matter.</td></tr><tr><td>A guided camera move from a still image</td><td><strong>Studio Motion Director</strong></td><td>Easier than hand-prompting image-to-video movement.</td></tr><tr><td>A polished transition between two frames</td><td><strong>Studio AI Transitions</strong></td><td>Easier than manually choosing a transition model.</td></tr></tbody></table>

***

### The Simple Rule

Choose the model based on the hardest part of the shot:

<table><thead><tr><th width="294">Hardest part of the shot</th><th>What to prioritize</th></tr></thead><tbody><tr><td>A person speaking</td><td>Audio and lip-sync.</td></tr><tr><td>Complex camera movement</td><td>Motion quality and scene understanding.</td></tr><tr><td>A client-ready cinematic insert</td><td>Quality and consistency.</td></tr><tr><td>A quick concept</td><td>Speed and cost.</td></tr><tr><td>A character must look the same</td><td>Reference mode or Studio workflow.</td></tr><tr><td>Two frames need to connect</td><td>Transition mode or Studio AI Transitions.</td></tr><tr><td>The clip must be vertical/mobile</td><td>Aspect ratio support.</td></tr></tbody></table>

The best model is not always the highest-quality model. The best model is the one that solves the specific problem in the shot.

***

### Model Guide

#### Veo 3.1

Use Veo 3.1 when the shot needs audio, dialogue, or speaking characters.

Best for:

* Talking-head concepts.
* Product explainers with speech.
* Scenes where sound matters.
* Short dialogue tests.
* Image-to-video or transition shots where audio should be part of the generation.

Use **Veo 3.1 Fast** when you want quicker iterations. Use the regular Veo 3.1 path when quality matters more than speed.

#### Veo 3.1 Lite

Use Veo 3.1 Lite when you want a lower-cost Veo-style visual draft and do not need generated audio.

Best for:

* Silent B-roll concepts.
* Visual drafts before adding voiceover or music in Premiere.
* Budget-conscious text-to-video, image-to-video, or transition tests.
* Shots where 720p or 1080p is enough.

Avoid Lite when the prompt depends on spoken dialogue or synchronized audio. Add sound in Premiere instead.

#### Kling 3.0

Use Kling 3.0 when you want strong cinematic movement, flexible shot types, and polished video generation.

Best for:

* Camera moves.
* Dynamic product shots.
* Cinematic B-roll.
* Image-to-video with strong motion.
* Shots that need native audio but are less dialogue-focused than Veo.

Kling 3.0 is a strong general-purpose choice for classic text-to-video and image-to-video generation.

#### Kling O3

Use Kling O3 when motion quality, scene understanding, or transition quality is the priority.

Best for:

* Advanced motion.
* High-quality transitions.
* Reference-driven shots.
* VFX-oriented generations.
* Complex scenes where the model needs to preserve structure.

If you are creating transitions, consider Studio AI Transitions instead of manually selecting an O3 transition model. The Studio workflow gives you transition styles and better prompting structure.

#### Sora 2

Use Sora 2 when you want cinematic visual quality and do not need generated audio.

Best for:

* Establishing shots.
* Atmospheric B-roll.
* Longer visual clips.
* Cinematic concepts where dialogue is not required.

Choose Sora 2 Pro when you want the higher-quality Sora option and the extra cost makes sense.

#### Seedance 2

Use Seedance 2 when you want natural motion with native audio and a balanced all-around video model.

Best for:

* Natural movement.
* Ambient audio scenes.
* Cinematic clips with sound.
* Reference mode when you need subject consistency.
* Wide or cinematic aspect ratios.

Seedance 2 Fast is useful for quicker iterations when you want the same general family but faster turnaround.

#### Hailuo 2.3

Use Hailuo 2.3 when action and motion are the main challenge.

Best for:

* Sports.
* Fast movement.
* Stunts.
* Energetic social clips.
* Dynamic camera action.

It is less of a first choice for dialogue-heavy or reference-heavy work.

#### Wan 2.7

Use Wan 2.7 when you want flexible aspect ratios, high-resolution visual output, or reference-driven generation without generated audio.

Best for:

* Clean visual generations.
* Flexible formats.
* Reference mode with more image inputs.
* Start/end interpolation.
* Instruction-style video editing tests.

Plan to add sound later if the final edit needs audio.

#### Grok

Use Grok when speed and social/mobile formats matter more than maximum resolution.

Best for:

* Fast drafts.
* Mobile-first clips.
* Unusual vertical or panoramic formats.
* Quick social content experiments.

Grok is useful for fast exploration. For final cinematic polish, compare against Veo, Kling, Sora, or Seedance.

***

### Which Mode Should I Use?

Model choice matters, but mode choice comes first.

| You have...                                                   | Use...                                   |
| ------------------------------------------------------------- | ---------------------------------------- |
| Only a prompt                                                 | Text-to-Video                            |
| One still image                                               | Image-to-Video                           |
| A start frame and end frame                                   | Transition Mode or Studio AI Transitions |
| Several images of the same subject                            | Reference Mode                           |
| An existing video that should continue                        | Generative Extend                        |
| A still image that needs directed camera movement             | Studio Motion Director                   |
| A video that needs cleanup, VFX, relight, reshoot, or upscale | Studio                                   |

If you are not sure, start with the inputs. The number and type of files you attach usually determines the best mode.

***

### Audio Guide

| Need                           | Recommended models                                    |
| ------------------------------ | ----------------------------------------------------- |
| Dialogue or lip-sync           | Veo 3.1 or Veo 3.1 Fast                               |
| Ambient sound or general audio | Kling 3.0, Kling O3, Seedance 2                       |
| Silent visual draft            | Veo 3.1 Lite, Sora 2, Wan 2.7, Grok, Hailuo           |
| Final sound design             | Generate visuals first, then finish audio in Premiere |

Audio generation can be useful, but it is not always the best final audio. For client work, you may still want to add dialogue, voiceover, music, or sound effects in Premiere.

***

### Pro vs. Fast vs. Standard

Some model families have quality or speed variants.

Use the higher-quality option when:

* The shot is for delivery.
* The prompt is complex.
* Character or product consistency matters.
* You are generating a final hero shot.

Use the faster or lighter option when:

* You are exploring ideas.
* You are testing prompts.
* You expect to regenerate several times.
* You care more about speed or cost than final polish.

A good workflow is to draft with a faster model, then regenerate the best prompt with a higher-quality model.

***

### When To Use Studio Instead

Studio is better than manual model selection when the task already has a clear workflow.

<table><thead><tr><th width="404">Goal</th><th>Use Studio workflow</th></tr></thead><tbody><tr><td>Create a cinematic still before video work</td><td>Cinematic Lab</td></tr><tr><td>Animate a still with a camera move</td><td>Motion Director</td></tr><tr><td>Create a polished transition</td><td>AI Transitions</td></tr><tr><td>Generate alternate angles</td><td>Multi-Cam</td></tr><tr><td>Transfer motion to a character image</td><td>Motion Capture</td></tr><tr><td>Add VFX to a video</td><td>Add Effects</td></tr><tr><td>Change lighting</td><td>Relight Scene</td></tr><tr><td>Reshoot a short segment</td><td>Reshoot</td></tr><tr><td>Finish resolution</td><td>Upscale</td></tr></tbody></table>

Studio does more of the prompt and model setup for you. Direct model selection gives you more control when you already know exactly which generation mode you want.

***

### Practical Starting Points

#### If you are new

Start with one of these:

* **Veo 3.1 Fast** for quick audio-enabled video tests.
* **Kling 3.0 Pro** for cinematic motion and image-to-video.
* **Veo 3.1 Lite** for lower-cost silent drafts.
* **Studio Motion Director** if you already have a strong still image.

#### If you are making B-roll

Try:

* Sora 2 for cinematic visual clips.
* Kling 3.0 or Kling O3 for stronger movement.
* Veo 3.1 Lite for lower-cost drafts.
* Seedance 2 if audio/ambient sound is useful.

#### If you are making social content

Try:

* Grok for fast mobile formats.
* Hailuo for action and energy.
* Kling for polished camera movement.
* Veo 3.1 when dialogue or sound matters.

#### If you are making client-facing shots

Use faster models to explore, then move the best idea into a quality pass:

1. Draft with a faster or lighter model.
2. Refine the prompt.
3. Regenerate with a higher-quality model.
4. Use Studio tools for cleanup, transitions, relight, or upscale.

***

### Troubleshooting

#### The model I expected is not available

The model selector changes based on what you attach. No attachments show text-to-video models. One image shows image-to-video options. Two images may activate transition mode. Multiple reference images may show reference models.

#### The result has no audio

Check whether the selected model supports audio. Veo 3.1, Kling, and Seedance can generate audio. Sora 2, Veo 3.1 Lite, Wan 2.7, Grok, and Hailuo are better treated as visual models unless the app shows an audio option for your selected mode.

#### The model looks wrong for my use case

Switch based on the failure:

| Problem                              | Try                                            |
| ------------------------------------ | ---------------------------------------------- |
| Weak dialogue or lip-sync            | Veo 3.1                                        |
| Weak motion                          | Kling, Seedance, or Hailuo                     |
| Need faster drafts                   | Veo 3.1 Lite, Grok, or a Fast/Standard variant |
| Need stronger transition             | Studio AI Transitions                          |
| Need more controlled image animation | Studio Motion Director                         |
| Need a cleaner final                 | Upscale after the creative result is approved  |

#### The model list changes over time

Chat Video Pro adds and updates models as providers improve. If the in-app selector differs from this page, trust the app. This guide is meant to help you choose the right kind of model, not memorize every technical option.

***

### Next Steps

* Use [Text-to-Video](/features/video-generation/text-to-video.md) when starting from a prompt.
* Use [Image-to-Video](/features/video-generation/image-to-video.md) when animating one still.
* Use [Transition Mode](/features/video-generation/transition-mode.md) or [AI Transitions](/features/studio/ai-transitions.md) when connecting two frames.
* Use [Reference Mode](/features/video-generation/reference-mode.md) when subject consistency matters.
* Use [Studio](/features/studio.md) for guided production and post-production workflows.
If you need...	Start with...	Why
Dialogue, speech, or generated audio	Veo 3.1 or Veo 3.1 Fast	Strongest choice when audio and lip-sync matter.
Fast lower-cost Veo drafts without audio	Veo 3.1 Lite	Good for visual drafts, B-roll ideas, and transitions when you will add sound in Premiere.
Cinematic motion and strong general quality	Kling 3.0 Pro or Kling O3 Pro	Strong camera movement, action, and polished motion.
Longer cinematic clips without audio	Sora 2 or Sora 2 Pro	Good for cinematic B-roll and longer visual generations.
Natural motion with ambient audio	Seedance 2	Good all-around model for natural motion and audio-enabled scenes.
Action, sports, or fast movement	Hailuo 2.3	Strong for dynamic action and energetic movement.
Fast mobile-first content	Grok	Fast generations and unusual mobile aspect ratios.
High-resolution 1080p-style outputs	Wan 2.7	Good when clarity, flexible aspect ratios, or reference workflows matter.
A guided camera move from a still image	Studio Motion Director	Easier than hand-prompting image-to-video movement.
A polished transition between two frames	Studio AI Transitions	Easier than manually choosing a transition model.
Hardest part of the shot	What to prioritize
A person speaking	Audio and lip-sync.
Complex camera movement	Motion quality and scene understanding.
A client-ready cinematic insert	Quality and consistency.
A quick concept	Speed and cost.
A character must look the same	Reference mode or Studio workflow.
Two frames need to connect	Transition mode or Studio AI Transitions.
The clip must be vertical/mobile	Aspect ratio support.
Goal	Use Studio workflow
Create a cinematic still before video work	Cinematic Lab
Animate a still with a camera move	Motion Director
Create a polished transition	AI Transitions
Generate alternate angles	Multi-Cam
Transfer motion to a character image	Motion Capture
Add VFX to a video	Add Effects
Change lighting	Relight Scene
Reshoot a short segment	Reshoot
Finish resolution	Upscale