> For the complete documentation index, see [llms.txt](https://docs.chatvideopro.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.chatvideopro.com/features/video-generation/text-to-video.md).

# Text-to-Video

Text-to-Video is the simplest way to create a new AI video from scratch. You write what you want to see, choose a video model, set the basic generation options, and Chat Video Pro generates the shot.

Use it when you do not already have a source image, start frame, end frame, reference set, or existing video. It is best for creating new B-roll, establishing shots, product concepts, social clips, background plates, style tests, and visual ideas that do not exist yet.

{% hint style="info" %}
If you already have a still image, use Image-to-Video or Studio Motion Director. If you have two frames, use Transition Mode or Studio AI Transitions. Text-to-Video is for starting from words.
{% endhint %}

***

### When To Use Text-to-Video

Use Text-to-Video when you want the model to invent the whole shot:

* A quick B-roll insert.
* A cinematic establishing shot.
* A product or brand concept.
* A background plate for an edit.
* A social clip idea.
* A mood, setting, or visual direction test.
* A shot you will later refine with Studio tools.

Choose another workflow when:

<table><thead><tr><th width="362">You have...</th><th>Use instead</th></tr></thead><tbody><tr><td>One still image to animate</td><td>Image-to-Video or Studio Motion Director</td></tr><tr><td>Two images to connect</td><td>Transition Mode or Studio AI Transitions</td></tr><tr><td>Reference images of a character/product</td><td>Reference Mode</td></tr><tr><td>Existing video to extend</td><td>Generative Extend</td></tr><tr><td>Existing video to clean up or edit</td><td>Studio or Video Canvas Editor</td></tr><tr><td>A cinematic still before motion</td><td>Studio Cinematic Lab</td></tr><tr><td>Help writing a strong prompt</td><td>Video Prompter Assistant</td></tr></tbody></table>

***

<figure><img src="/files/GgCY0b8poqtGysvA6vOW" alt=""><figcaption></figcaption></figure>

### How It Works

1. Enable [**Generate Media**](/getting-started/interface-overview/generate-media-button.md) in the composer.
2. Choose a video model from the model dropdown.
3. Set the aspect ratio, duration, resolution, and audio options.
4. Write a prompt that describes the shot.
5. Generate the video.
6. Review the result.
7. Regenerate, edit, extend, send to Studio, or import into Premiere.

The first generation is often a draft. Treat Text-to-Video like directing a model: generate, review what worked, then refine the prompt or switch models if needed.

***

### What A Good Prompt Includes

A strong Text-to-Video prompt usually answers six questions:

<table><thead><tr><th width="311">Prompt element</th><th>What to describe</th></tr></thead><tbody><tr><td>Subject</td><td>Who or what is the shot about?</td></tr><tr><td>Setting</td><td>Where and when does it happen?</td></tr><tr><td>Action</td><td>What changes during the shot?</td></tr><tr><td>Camera</td><td>How is the shot framed or moving?</td></tr><tr><td>Style</td><td>What visual language should it follow?</td></tr><tr><td>Lighting and mood</td><td>What should it feel like?</td></tr></tbody></table>

Useful structure:

{% code overflow="wrap" %}

```
[Camera movement] on [subject] doing [action] in [setting], [lighting], [style], [mood], [important details].
```

{% endcode %}

You do not need to use this exact format every time. The goal is to give the model enough direction to understand the shot, not to write a novel.

***

### Prompt Examples

#### Cinematic B-Roll

{% code overflow="wrap" %}

```
Slow push-in on a steaming coffee cup on a wooden table in a cozy coffee shop during golden hour, warm natural light through large windows, soft bokeh background, peaceful and inviting atmosphere.
```

{% endcode %}

Why it works:

* Clear subject: coffee cup.
* Clear camera move: slow push-in.
* Clear setting and light: cozy coffee shop, golden hour.
* Clear mood: peaceful and inviting.

#### Product Shot

{% code overflow="wrap" %}

```
Smooth orbit around a premium black smartwatch on a dark reflective surface, studio lighting, subtle rim light on the edges, luxury product commercial style, clean background, slow elegant motion.
```

{% endcode %}

Why it works:

* The product is specific.
* The camera move is direct.
* The lighting and style support the use case.

#### Establishing Shot

{% code overflow="wrap" %}

```
Wide aerial shot of a coastal city at sunrise, camera gliding slowly over rooftops toward the ocean, soft golden light, light morning haze, cinematic documentary style, calm optimistic mood.
```

{% endcode %}

Why it works:

* It gives scale and motion.
* The camera has a path.
* The mood fits an establishing shot.

#### Social Clip

{% code overflow="wrap" %}

```
Vertical handheld shot following a runner through a neon-lit city street at night, wet pavement reflections, energetic pacing, quick natural camera movement, modern social ad style.
```

{% endcode %}

Why it works:

* It specifies vertical framing.
* Motion and platform style are clear.
* The scene has visual hooks.

***

### Weak Prompts To Avoid

Weak prompts usually fail because they describe a topic, not a shot.

Too vague:

```
A coffee shop.
```

Better:

{% code overflow="wrap" %}

```
Slow push-in on a barista pouring latte art in a cozy coffee shop, warm morning light, shallow depth of field, cinematic lifestyle ad.
```

{% endcode %}

Missing action:

```
A person in a city.
```

Better:

{% code overflow="wrap" %}

```
Tracking shot following a person crossing a busy city street at sunset, traffic lights glowing, wind moving their coat, cinematic urban energy.
```

{% endcode %}

No camera or style:

```
A car driving.
```

Better:

{% code overflow="wrap" %}

```
Low-angle tracking shot of a vintage red car driving along a coastal highway at sunset, ocean in the background, warm film look, smooth cinematic motion.
```

{% endcode %}

***

### Choosing A Model

You do not need to memorize every model. Choose based on the hardest part of the shot.

| Need                                           | Good starting point       |
| ---------------------------------------------- | ------------------------- |
| Dialogue or generated audio                    | Veo 3.1 or Veo 3.1 Fast   |
| Lower-cost silent drafts                       | Veo 3.1 Lite              |
| Cinematic motion                               | Kling 3.0 Pro or Kling O3 |
| Longer visual clips without audio              | Sora 2 or Sora 2 Pro      |
| Natural motion with ambient audio              | Seedance 2                |
| Action or sports                               | Hailuo 2.3                |
| Mobile-first formats                           | Grok                      |
| Flexible visual output without generated audio | Wan 2.7                   |

For a deeper chooser guide, see Supported Video Models.

***

### Settings That Matter

#### Aspect Ratio

Choose the aspect ratio for the edit, not just the idea.

<table><thead><tr><th width="156">Aspect ratio</th><th>Use for</th></tr></thead><tbody><tr><td>16:9</td><td>YouTube, websites, landscape edits, most Premiere timelines.</td></tr><tr><td>9:16</td><td>TikTok, Reels, Shorts, vertical ads.</td></tr><tr><td>1:1</td><td>Square social placements when supported by the selected model.</td></tr><tr><td>21:9</td><td>Cinematic widescreen when supported.</td></tr><tr><td>4:3 or 3:4</td><td>Vintage, editorial, or portrait alternatives when supported.</td></tr></tbody></table>

If the model does not support the aspect ratio you need, switch models or generate in the closest supported format and reframe in Premiere.

#### Resolution

Generate at the resolution that fits the stage of work.

* Use lower or standard resolution for drafts.
* Use higher resolution for final candidates.
* Use Studio Upscale after the creative result is approved.

#### Audio

Only some models generate audio. If the clip needs speech, sound effects, or ambient audio, choose a model that supports it and describe the audio in the prompt.

If you plan to build the final sound in Premiere, a silent model can be a better choice.

***

### Best Practices

#### Describe A Shot, Not A Concept

The model needs direction. "Luxury watch commercial" is a concept. "Slow orbit around a black watch on a reflective surface with rim lighting" is a shot.

#### Give The Camera Something To Do

Camera movement is one of the strongest levers in video generation. Use phrases like:

* Slow push-in.
* Wide establishing shot.
* Tracking shot.
* Handheld follow shot.
* Low-angle dolly.
* Smooth orbit.
* Aerial glide.

#### Keep The Action Believable

AI video models struggle when too much changes at once. A simple clear action usually beats five competing actions.

Better:

```
The runner turns the corner and accelerates down the wet street.
```

Riskier:

{% code overflow="wrap" %}

```
The runner jumps over a car, changes outfits, enters a building, and the scene becomes a concert.
```

{% endcode %}

#### Mention What Matters Most

If a detail must be right, put it in the prompt. If the car must be red, say red. If the shot must be vertical, set the aspect ratio and mention vertical social ad framing.

#### Use Studio After The First Good Result

Once a Text-to-Video result is close, use Studio to improve it:

<table><thead><tr><th width="327">Next step</th><th>Studio workflow</th></tr></thead><tbody><tr><td>Add VFX or atmosphere</td><td>Add Effects</td></tr><tr><td>Change lighting</td><td>Relight Scene</td></tr><tr><td>Fix a short moment</td><td>Reshoot</td></tr><tr><td>Remove a distraction</td><td>Erase Objects</td></tr><tr><td>Improve resolution</td><td>Upscale</td></tr></tbody></table>

***

### Example Workflows

#### Quick B-Roll

1. Choose a fast or balanced model.
2. Use a simple prompt with subject, camera, setting, and mood.
3. Generate a 4-8 second clip.
4. Regenerate once or twice with clearer camera/action notes.
5. Import the best result into Premiere.

#### Client Concept Shot

1. Start with a detailed prompt.
2. Draft with a faster model or lower resolution.
3. Refine the wording based on the result.
4. Regenerate with a higher-quality model.
5. Use Studio Upscale or Add Effects only after the creative direction is approved.

#### Social Video Insert

1. Set aspect ratio to 9:16.
2. Choose a model that supports the format you need.
3. Prompt for vertical composition and mobile pacing.
4. Keep the action simple and readable.
5. Finish captions, music, or sound design in Premiere.

***

### Troubleshooting

#### The video does not match my prompt

Make the prompt more concrete. Add a clearer subject, camera move, setting, action, and style. If the model keeps missing the same thing, try a different model.

#### The shot feels generic

Add specific production language:

{% code overflow="wrap" %}

```
35mm handheld documentary style, natural window light, shallow depth of field, soft background motion.
```

{% endcode %}

or:

{% code overflow="wrap" %}

```
Premium product commercial style, slow controlled orbit, black reflective surface, sharp rim light.
```

{% endcode %}

#### The motion is messy

Simplify the action. Ask for one clear movement instead of several. If the shot depends on camera movement, try Kling, Seedance, Hailuo, or Studio Motion Director depending on the source.

#### The audio did not generate

Check whether the selected model supports audio and whether the audio toggle is enabled. Veo 3.1, Kling, and Seedance are better choices when audio matters. Veo 3.1 Lite, Sora, Wan, Grok, and Hailuo are better treated as visual models unless the app shows audio support for that mode.

#### The result is close but not final

Do not keep regenerating blindly. Decide what is wrong:

<table><thead><tr><th width="314">Problem</th><th>Better next step</th></tr></thead><tbody><tr><td>Needs better prompt</td><td>Rewrite with clearer camera/action details.</td></tr><tr><td>Needs stronger model</td><td>Switch models using Supported Video Models.</td></tr><tr><td>Needs VFX or lighting</td><td>Use Studio Add Effects or Relight Scene.</td></tr><tr><td>Needs cleanup</td><td>Use Studio Erase Objects, Rotoscope, or Reshoot.</td></tr><tr><td>Needs resolution</td><td>Use Studio Upscale after approval.</td></tr></tbody></table>

***

### Related Pages

* [Supported Video Models](/features/video-generation/supported-video-models.md) - Choose the right model.
* [Image-to-Video](/features/video-generation/image-to-video.md) - Animate a still image.
* [Transition Mode ](/features/video-generation/transition-mode.md)- Connect two frames.
* [Reference Mode](/features/video-generation/reference-mode.md) - Use references for consistency.
* [Video Prompter Assistant](/conversation-starters/video-prompter-assistant.md) - Get help building a stronger prompt.
* [Studio](/features/studio.md) - Use guided production and post-production workflows.

***

**Next:** If you already have an image you want to animate, use Image-to-Video or Studio Motion Director.
You have...	Use instead
One still image to animate	Image-to-Video or Studio Motion Director
Two images to connect	Transition Mode or Studio AI Transitions
Reference images of a character/product	Reference Mode
Existing video to extend	Generative Extend
Existing video to clean up or edit	Studio or Video Canvas Editor
A cinematic still before motion	Studio Cinematic Lab
Help writing a strong prompt	Video Prompter Assistant
Prompt element	What to describe
Subject	Who or what is the shot about?
Setting	Where and when does it happen?
Action	What changes during the shot?
Camera	How is the shot framed or moving?
Style	What visual language should it follow?
Lighting and mood	What should it feel like?
Aspect ratio	Use for
16:9	YouTube, websites, landscape edits, most Premiere timelines.
9:16	TikTok, Reels, Shorts, vertical ads.
1:1	Square social placements when supported by the selected model.
21:9	Cinematic widescreen when supported.
4:3 or 3:4	Vintage, editorial, or portrait alternatives when supported.
Next step	Studio workflow
Add VFX or atmosphere	Add Effects
Change lighting	Relight Scene
Fix a short moment	Reshoot
Remove a distraction	Erase Objects
Improve resolution	Upscale
Problem	Better next step
Needs better prompt	Rewrite with clearer camera/action details.
Needs stronger model	Switch models using Supported Video Models.
Needs VFX or lighting	Use Studio Add Effects or Relight Scene.
Needs cleanup	Use Studio Erase Objects, Rotoscope, or Reshoot.
Needs resolution	Use Studio Upscale after approval.