
# Rotoscoping

{% embed url="<https://youtu.be/6vHPRgheHTQ?si=BqWh4DEvM7BBQpb2>" %}

Rotoscope is the Studio workflow for isolating a subject from its background. Use it when you want to keep a person, product, prop, animal, or object and turn the rest of the frame transparent for compositing.

Chat Video Pro uses **SAM 3 Rotoscoping** to identify the subject, track it through the clip, and create a transparent video output that can be placed over new footage, graphics, generated backgrounds, or motion design inside Premiere Pro.

### What This Tool Is For

Rotoscope is for **keeping the selected subject** and removing everything else.

Use it for:

* Talking-head background removal.
* Product cutouts.
* Isolating a dancer, athlete, actor, or presenter.
* Pulling a foreground object out of a shot.
* Creating transparent overlays for edits, thumbnails, trailers, tutorials, and ads.
* Preparing a subject to place over AI-generated backgrounds or design elements.

Do not use Rotoscope when your goal is to remove a distracting object from the scene while keeping the original background. That is an object-erasing/inpainting problem, so use Erase Objects instead.

{% hint style="info" %}
**Rotoscope answers: "What should stay?"** Erase Objects answers: "What should disappear?"
{% endhint %}

### When To Use It

Use Rotoscope when the subject is worth separating from the scene:

<table><thead><tr><th width="333">Goal</th><th>Why Rotoscope helps</th></tr></thead><tbody><tr><td>Put a presenter over a new background</td><td>Creates a transparent subject layer for compositing.</td></tr><tr><td>Isolate a product demo</td><td>Keeps the product motion while removing the original environment.</td></tr><tr><td>Build a graphic overlay</td><td>Lets you layer a person or object above text, titles, or b-roll.</td></tr><tr><td>Create social cutouts</td><td>Makes subjects reusable across vertical edits, thumbnails, and promo assets.</td></tr><tr><td>Combine live footage with AI scenes</td><td>Extracts the live subject so it can sit over generated backgrounds.</td></tr></tbody></table>

Choose another Studio workflow when:

<table><thead><tr><th width="522">Goal</th><th>Better workflow</th></tr></thead><tbody><tr><td>Remove a person or object and keep the same background</td><td>Erase Objects</td></tr><tr><td>Change part of a clip with a prompt</td><td>Reshoot</td></tr><tr><td>Add rain, fire, smoke, or style</td><td>Add Effects</td></tr><tr><td>Improve lighting or mood</td><td>Relight Scene</td></tr><tr><td>Increase resolution after editing</td><td>Upscale</td></tr></tbody></table>

***

<figure><img src="/files/vKAyrEV2Jkx496RXSrWF" alt=""><figcaption></figcaption></figure>

### Studio Path

The fastest route is through Studio.

1. Open **Studio**.
2. Choose **Rotoscope** from the Post-Production department.
3. Load a video from upload, Recents, or your Premiere timeline.
4. Choose a selection method: **Text**, **Box**, or **Point**.
5. Click **Track Frame** to preview the mask.
6. If the mask is right, click **Track Entire Video**.
7. Click **Remove Background** to create the transparent result.
8. Send the result to your chat/library and bring it back into Premiere.

Studio opens the video editor directly in SAM 3 Rotoscoping mode, so you can start selecting the subject immediately.

***

### Classic/Editor Path

You can also use Rotoscope from the classic video editor path:

1. Import or generate a video.
2. Click **Edit** on the video thumbnail.
3. Choose **SAM 3 Rotoscoping** if it is not already selected.
4. Select the subject.
5. Track the frame, track the full video, and remove the background.

This path is useful when you are already working from a chat result. For new post-production work, Studio is cleaner because it starts in the right tool.

***

### Controls And Constraints

| Control            | What it does                                                                                |
| ------------------ | ------------------------------------------------------------------------------------------- |
| Text               | Selects the subject using a description like `the person`, `the red car`, or `the product`. |
| Box                | Lets you draw a box around the subject for more precise control.                            |
| Point              | Lets you click points on the subject. Shift-click marks areas to exclude.                   |
| Track Frame        | Tests the selection on the current frame before processing the full clip.                   |
| Track Entire Video | Tracks the approved subject through the whole video.                                        |
| Remove Background  | Converts the tracked result into a transparent video for compositing.                       |

Current constraints:

| Constraint              | Detail                                                                                            |
| ----------------------- | ------------------------------------------------------------------------------------------------- |
| Model                   | SAM 3 video segmentation.                                                                         |
| Recommended clip length | Short clips are faster and easier to verify. Under 30 seconds is a good working target.           |
| Maximum clip length     | SAM 3 supports longer clips, up to about 5 minutes, but processing time scales with length.       |
| Source quality          | Clear, well-lit, high-contrast subjects track better.                                             |
| Output                  | Transparent video for Premiere compositing, with a panel-friendly preview generated for playback. |
| Reference images        | Not used. Rotoscope selects from the video itself.                                                |
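
The length guidance above can be turned into a quick pre-flight check before you load a clip. The sketch below is illustrative only: `classify_clip_length` and its constants are hypothetical names that mirror the 30-second working target and the approximate 5-minute ceiling from the table. They are not part of Chat Video Pro.

```python
# Hypothetical pre-flight check; thresholds mirror the constraints table.
RECOMMENDED_MAX_SECONDS = 30   # "under 30 seconds is a good working target"
APPROX_MAX_SECONDS = 5 * 60    # "up to about 5 minutes" (approximate, not a hard limit)


def classify_clip_length(duration_seconds: float) -> str:
    """Return a rough label for how a clip fits the documented length guidance."""
    if duration_seconds <= 0:
        raise ValueError("duration must be positive")
    if duration_seconds < RECOMMENDED_MAX_SECONDS:
        return "recommended"
    if duration_seconds <= APPROX_MAX_SECONDS:
        return "supported, expect longer processing"
    return "consider splitting"


if __name__ == "__main__":
    for seconds in (12, 90, 420):
        print(seconds, "->", classify_clip_length(seconds))
```

Because the 5-minute figure is stated as approximate, treat the last label as a prompt to split the shot rather than as a guaranteed failure.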

***

### Selection Methods

#### Text Selection

Use Text when the subject is obvious and easy to describe.

Good text prompts:

```
the person
```

```
the dancer in the center
```

```
the red car
```

```
the product on the table
```

Text is best for clean scenes with one clear subject. It is usually the fastest way to start.

<figure><img src="/files/Iq25Bjb3Zxp2JLvDvbsD" alt=""><figcaption></figcaption></figure>

#### Box Selection

Use Box when the scene has multiple subjects, busy backgrounds, or a subject that text might misunderstand.

Best practices:

* Draw the box tightly around the subject.
* Include the full subject, not just the face or center.
* Avoid including large background areas.
* Use Shift-drag to mark exclusion areas when needed.
* Start on the first frame when using spatial selection.

Box selection is often the safest choice for production work because it gives the model a strong spatial hint.

<figure><img src="/files/eQphDJgE6wQMsr7WSy3g" alt=""><figcaption></figcaption></figure>

#### Point Selection

Use Point when you need a quick selection or want to guide SAM 3 toward a specific object.

Best practices:

* Click near the center of the subject.
* Add multiple include points for larger subjects.
* Shift-click areas you want excluded.
* Use Box instead if the subject has a complex shape or overlaps other objects.
* Start on the first frame when using spatial selection.

Point selection is fast, but it can be less stable than a good box on difficult footage.

***

### The Rotoscope Workflow

#### 1. Select The Subject

Decide what should remain visible. Rotoscope works best when you think in terms of the final composited layer:

* Keep the presenter.
* Keep the product.
* Keep the car.
* Keep the dancer.
* Keep the foreground prop.

Avoid vague selections like `foreground` or `everything important`. Name the actual subject.

#### 2. Track Frame

Track Frame creates a preview mask before you spend time processing the full clip.

Check that:

* The correct subject is selected.
* Edges are close enough for the intended use.
* Important limbs, hair, products, or props are included.
* Background areas are not being included by mistake.
* Multiple subjects are not being merged unless you want them together.

If the preview is wrong, adjust the selection method and track the frame again.

#### 3. Track Entire Video

Once the frame preview is good, Track Entire Video processes the clip. Chat Video Pro generates the preview video and the mask data in the same pass, so the tracked result can be reused for the final transparent export.

Longer clips take longer. If you are testing a difficult subject, process a short segment first before committing to a long one.

#### 4. Remove Background

After tracking completes, click **Remove Background**. Chat Video Pro uses the cached mask data to create a transparent result. The current pipeline uses the original video plus SAM 3 mask data to produce a cleaner alpha result, with a preview that can still play inside the panel.

The final result is designed for Premiere compositing, not just browser preview.
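
Conceptually, this step attaches the tracked mask to the original pixels as an alpha channel. The sketch below illustrates that idea on a tiny frame stored as nested lists. `apply_mask_as_alpha` is a hypothetical helper written for this page, and it makes no claim about how the Chat Video Pro pipeline is implemented internally.

```python
from typing import List, Tuple

RGB = Tuple[int, int, int]
RGBA = Tuple[int, int, int, int]


def apply_mask_as_alpha(frame: List[List[RGB]], mask: List[List[float]]) -> List[List[RGBA]]:
    """Attach a per-pixel mask (0.0 = background, 1.0 = subject) as an 8-bit alpha channel."""
    same_height = len(frame) == len(mask)
    if not same_height or any(len(f) != len(m) for f, m in zip(frame, mask)):
        raise ValueError("frame and mask must have the same dimensions")
    out = []
    for frame_row, mask_row in zip(frame, mask):
        out_row = []
        for (r, g, b), m in zip(frame_row, mask_row):
            # Clamp the mask value to [0, 1], then scale to 0-255.
            alpha = round(max(0.0, min(1.0, m)) * 255)
            out_row.append((r, g, b, alpha))
        out.append(out_row)
    return out


if __name__ == "__main__":
    frame = [[(200, 150, 100), (10, 20, 30)]]  # one row, two pixels
    mask = [[1.0, 0.0]]                        # keep the first pixel, drop the second
    print(apply_mask_as_alpha(frame, mask))
```

The colour values are left untouched; only the alpha channel decides what stays visible, which is why the result needs a format and an editor that understand transparency.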

***

### Best Practices

#### Start With The Cleanest Frame

For Box and Point selections, start at the first frame. The video workflow expects spatial selections to begin at the start of the clip, and you will get more predictable tracking when the subject is visible right away.

If the subject is not visible on frame one, trim the clip so the first frame is a strong selection frame.

#### Keep Clips Short While Testing

Rotoscoping is easier to diagnose in short pieces. For a long shot, test a 5-10 second segment first. Once you know the selection works, process the longer section or split the scene into manageable parts.
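
If you split a long shot by hand, it helps to plan the cut points first. The following is a minimal sketch, assuming you only need consecutive time ranges to trim in your editor. `plan_segments` is a hypothetical helper and the 10-second default simply follows the 5-10 second test range suggested above.

```python
from typing import List, Tuple


def plan_segments(total_seconds: float, segment_seconds: float = 10.0) -> List[Tuple[float, float]]:
    """Split a clip into consecutive (start, end) ranges no longer than segment_seconds."""
    total = float(total_seconds)
    step = float(segment_seconds)
    if total <= 0 or step <= 0:
        raise ValueError("durations must be positive")
    segments = []
    start = 0.0
    while start < total:
        end = min(start + step, total)  # the last segment may be shorter
        segments.append((start, end))
        start = end
    return segments


if __name__ == "__main__":
    for start, end in plan_segments(25):
        print(f"{start:.1f}s to {end:.1f}s")
```

Test the first range on its own, then process the remaining ranges once the selection holds up.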

#### Use Contrast To Your Advantage

SAM 3 does better when the subject is visually distinct from the background. High-contrast clothing, clean lighting, visible outlines, and stable framing all help.

Hard cases include:

* Dark clothing on a dark background.
* Fast motion blur.
* Thin hair against complex detail.
* Transparent objects.
* Multiple overlapping people.
* Subjects leaving and re-entering frame.

#### Choose The Right Selection Method

Use Text for obvious subjects. Use Box for precision. Use Point for quick correction or simple object picks. If one method fails, switch methods rather than repeating the same bad selection.

#### Composite Intentionally

A transparent subject is only half the shot. Once you bring it into Premiere:

* Place it above the new background.
* Match scale and position.
* Add color correction so the subject belongs in the scene.
* Add shadows, blur, or grain when needed.
* Use feathering or additional masks in Premiere for edge cleanup if the shot demands it.
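
Placing the transparent layer above a background is the standard "over" blend: each output channel is the subject value weighted by alpha plus the background value weighted by one minus alpha. Premiere performs this for you when you stack the layers. The sketch below is only meant to build intuition for why soft, partially transparent edges pick up the new background. It assumes straight (non-premultiplied) alpha, and `composite_over` is a hypothetical name.

```python
from typing import Tuple

RGB = Tuple[int, int, int]
RGBA = Tuple[int, int, int, int]


def composite_over(subject: RGBA, background: RGB) -> RGB:
    """Blend one straight-alpha subject pixel over an opaque background pixel."""
    r, g, b, a = subject
    alpha = a / 255.0
    return tuple(
        round(fg * alpha + bg * (1.0 - alpha))
        for fg, bg in zip((r, g, b), background)
    )


if __name__ == "__main__":
    print(composite_over((255, 0, 0, 255), (0, 0, 255)))  # fully opaque subject pixel
    print(composite_over((255, 0, 0, 0), (0, 0, 255)))    # fully transparent subject pixel
```

A fully opaque pixel returns the subject colour, a fully transparent pixel returns the background colour, and edge pixels land in between. That in-between region is where feathering and colour matching have the most visible effect.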

***

### Examples

#### Talking Head On A New Background

* Source: Presenter on a plain or messy background.
* Selection: Box around the presenter or text prompt `the person`.
* Result: Presenter on transparent background, ready for a branded graphic or generated scene.

#### Product Cutout

* Source: Product demo video.
* Selection: Text prompt `the product` or a tight box around the item.
* Result: Product motion isolated for ads, thumbnails, landing pages, or overlays.

#### Dance Layer

* Source: Dancer footage.
* Selection: Box around the dancer on the first frame.
* Result: Dancer isolated over typography, generated backgrounds, or music-video visuals.

#### Foreground Object Extraction

* Source: Clip with a car, prop, animal, or object in front of the camera.
* Selection: Text prompt or box around the target.
* Result: Object separated for compositing or reuse in another sequence.

***

### Troubleshooting

#### Track Frame is disabled

Make a selection first. Text mode needs a prompt. Box mode needs a drawn box. Point mode needs at least one point.
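
The rule above can be read as a simple per-mode check. The sketch below models it with a hypothetical `Selection` class and `can_track_frame` function. Both are illustrations of the rule as described on this page, not the app's actual data structures or code.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Selection:
    """Hypothetical model of the selection state; not the app's real data structure."""
    mode: str                                          # "text", "box", or "point"
    prompt: str = ""
    box: Optional[Tuple[int, int, int, int]] = None    # x, y, width, height
    points: List[Tuple[int, int]] = field(default_factory=list)


def can_track_frame(selection: Selection) -> bool:
    """Mirror the documented rule: each mode needs its own kind of input."""
    if selection.mode == "text":
        return bool(selection.prompt.strip())   # a non-empty prompt
    if selection.mode == "box":
        return selection.box is not None        # a drawn box
    if selection.mode == "point":
        return len(selection.points) >= 1       # at least one point
    return False                                # unknown mode


if __name__ == "__main__":
    print(can_track_frame(Selection(mode="text", prompt="the person")))
    print(can_track_frame(Selection(mode="box")))
```

If the button stays disabled, check that the input you provided matches the mode that is currently active.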

#### Box or Point selection gives an error

Scrub to the beginning of the clip and make the selection on the first frame. Spatial selections are most reliable when they start there.

#### The wrong subject is selected

Use a more specific text prompt or switch to Box selection. If there are multiple people or objects, describe position, color, or role:

```
the person in the blue jacket
```

```
the car in the foreground
```

#### The mask is right on the first frame but drifts later

Process a shorter segment, use a clearer first-frame selection, or split the shot around difficult moments. Drift often happens when the subject turns, gets blocked, leaves frame, or overlaps another subject.

#### The background is not fully removed

Return to the selection step and tighten the box, add exclusion points, or use a simpler subject description. If the footage has low contrast, try a cleaner source or shorter segment.

#### The transparent result does not preview like a normal MP4

Transparent video formats are different from standard playback formats. Chat Video Pro creates a panel-friendly preview for viewing, while the real transparent output is meant for editing/compositing in Premiere.

***

### Links To Related Studio Pages

* [Studio](/features/studio.md) - Learn how Studio workflows are organized.
* [Erase Objects](/features/studio/object-eraser-tool.md) - Remove unwanted people or objects instead of keeping them.
* [Reshoot](/features/studio/reshoot.md) - Regenerate a targeted part of a clip.
* [Add Effects](/features/studio/kling-vfx.md) - Add atmosphere, VFX, or style to existing footage.
* [Relight Scene](/features/studio/relight-scene.md) - Improve lighting before or after isolating a subject.
* [Upscale](/features/studio/video-upscaling.md) - Improve resolution after rotoscoping or generation.

***

**Next:** If you want to remove something from the shot while keeping the original background, use Erase Objects.

