Rotoscoping
Professional-grade background removal and object isolation using Meta's SAM 3
Rotoscope is the Studio workflow for isolating a subject from its background. Use it when you want to keep a person, product, prop, animal, or object and turn the rest of the frame transparent for compositing.
Chat Video Pro uses SAM 3 Rotoscoping to identify the subject, track it through the clip, and create a transparent video output that can be placed over new footage, graphics, generated backgrounds, or motion design inside Premiere Pro.
What This Tool Is For
Rotoscope is for keeping the selected subject and removing everything else.
Use it for:
Talking-head background removal.
Product cutouts.
Isolating a dancer, athlete, actor, or presenter.
Pulling a foreground object out of a shot.
Creating transparent overlays for edits, thumbnails, trailers, tutorials, and ads.
Preparing a subject to place over AI-generated backgrounds or design elements.
Do not use Rotoscope when your goal is to remove a distracting object from the scene while keeping the original background. That is an object-erasing/inpainting problem, so use Erase Objects instead.
Rotoscope answers: "What should stay?" Erase Objects answers: "What should disappear?"
When To Use It
Use Rotoscope when the subject is worth separating from the scene:
Put a presenter over a new background: Creates a transparent subject layer for compositing.
Isolate a product demo: Keeps the product motion while removing the original environment.
Build a graphic overlay: Lets you layer a person or object above text, titles, or b-roll.
Create social cutouts: Makes subjects reusable across vertical edits, thumbnails, and promo assets.
Combine live footage with AI scenes: Extracts the live subject so it can sit over generated backgrounds.
Choose another Studio workflow when you want to:
Remove a person or object and keep the same background: use Erase Objects.
Change part of a clip with a prompt: use Reshoot.
Add rain, fire, smoke, or style: use Add Effects.
Improve lighting or mood: use Relight Scene.
Increase resolution after editing: use Upscale.

Studio Path
The fastest route is through Studio.
Open Studio.
Choose Rotoscope from the Post-Production department.
Load a video from upload, Recents, or your Premiere timeline.
Choose a selection method: Text, Box, or Point.
Click Track Frame to preview the mask.
If the mask is right, click Track Entire Video.
Click Remove Background to create the transparent result.
Send the result to your chat/library and bring it back into Premiere.
Studio opens the video editor directly in SAM 3 Rotoscoping mode, so you can start selecting the subject immediately.
Classic/Editor Path
You can also use Rotoscope from the classic video editor path:
Import or generate a video.
Click Edit on the video thumbnail.
Choose SAM 3 Rotoscoping if it is not already selected.
Select the subject.
Track the frame, track the full video, and remove the background.
This path is useful when you are already working from a chat result. For new post-production work, Studio is cleaner because it starts in the right tool.
Controls And Constraints
Text: Selects the subject using a description like "the person", "the red car", or "the product".
Box: Lets you draw a box around the subject for more precise control.
Point: Lets you click points on the subject. Shift-click marks areas to exclude.
Track Frame: Tests the selection on the current frame before processing the full clip.
Track Entire Video: Tracks the approved subject through the whole video.
Remove Background: Converts the tracked result into a transparent video for compositing.
Current constraints:
Model: SAM 3 video segmentation.
Recommended clip length: Short clips are faster and easier to verify. Under 30 seconds is a good working target.
Maximum clip length: SAM 3 supports longer clips, up to about 5 minutes, but processing time scales with length.
Source quality: Clear, well-lit, high-contrast subjects track better.
Output: Transparent video for Premiere compositing, with a panel-friendly preview generated for playback.
Reference images: Not used. Rotoscope selects from the video itself.
Selection Methods
Text Selection
Use Text when the subject is obvious and easy to describe.
Good text prompts name the subject directly, such as "the person", "the red car", or "the product".
Text is best for clean scenes with one clear subject. It is usually the fastest way to start.

Box Selection
Use Box when the scene has multiple subjects, busy backgrounds, or a subject that text might misunderstand.
Best practices:
Draw the box tightly around the subject.
Include the full subject, not just the face or center.
Avoid including large background areas.
Use Shift-drag to mark exclusion areas when needed.
Start on the first frame when using spatial selection.
Box selection is often the safest choice for production work because it gives the model a strong spatial hint.

Point Selection
Use Point when you need a quick selection or want to guide SAM 3 toward a specific object.
Best practices:
Click near the center of the subject.
Add multiple include points for larger subjects.
Shift-click areas you want excluded.
Use Box instead if the subject has a complex shape or overlaps other objects.
Start on the first frame when using spatial selection.
Point selection is fast, but it can be less stable than a good box on difficult footage.
The Rotoscope Workflow
1. Select The Subject
Decide what should remain visible. Rotoscope works best when you think in terms of the final composited layer:
Keep the presenter.
Keep the product.
Keep the car.
Keep the dancer.
Keep the foreground prop.
Avoid vague selections like "foreground" or "everything important". Name the actual subject.
2. Track Frame
Track Frame creates a preview mask before you spend time processing the full clip.
Look for:
The correct subject is selected.
Edges are close enough for the intended use.
Important limbs, hair, products, or props are included.
Background areas are not being included by mistake.
Multiple subjects are not being merged unless you want them together.
If the preview is wrong, adjust the selection method and track the frame again.
3. Track Entire Video
Once the frame preview is good, Track Entire Video processes the clip. Chat Video Pro generates the preview video and the mask data together, so the tracked result can feed the final transparent export.
Longer clips take longer. If you are testing a difficult subject, process a short segment first before committing to a long one.
4. Remove Background
After tracking completes, click Remove Background. Chat Video Pro uses the cached mask data to create a transparent result. The current pipeline uses the original video plus SAM 3 mask data to produce a cleaner alpha result, with a preview that can still play inside the panel.
The final result is designed for Premiere compositing, not just browser preview.
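Conceptually, removing the background means treating the tracked mask as the alpha channel of the original frame. The sketch below illustrates that idea for a single frame in plain Python. It is not Chat Video Pro's implementation: the function name `apply_mask`, the nested-list frame layout, and the assumption that mask values lie between 0.0 and 1.0 are all hypothetical choices made for illustration.

```python
from typing import List, Tuple

RGB = Tuple[int, int, int]
RGBA = Tuple[int, int, int, int]


def apply_mask(frame: List[List[RGB]], mask: List[List[float]]) -> List[List[RGBA]]:
    """Attach a per-pixel alpha channel to an RGB frame.

    Assumption: mask values are in [0.0, 1.0], where 1.0 keeps the pixel
    fully visible and 0.0 makes it fully transparent.
    """
    same_rows = len(frame) == len(mask)
    same_cols = all(len(f_row) == len(m_row) for f_row, m_row in zip(frame, mask))
    if not (same_rows and same_cols):
        raise ValueError("frame and mask must have the same dimensions")

    output = []
    for frame_row, mask_row in zip(frame, mask):
        out_row = []
        for (r, g, b), m in zip(frame_row, mask_row):
            clamped = min(1.0, max(0.0, m))  # guard against out-of-range mask values
            out_row.append((r, g, b, round(clamped * 255)))
        output.append(out_row)
    return output


# One row, two pixels: the first belongs to the subject, the second is background.
frame = [[(10, 20, 30), (40, 50, 60)]]
mask = [[1.0, 0.0]]
result = apply_mask(frame, mask)
```

The color values are left untouched and only the alpha changes, which is why a cleaner mask leads directly to cleaner edges in the composite.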
Best Practices
Start With The Cleanest Frame
For Box and Point selections, start at the first frame. The video workflow expects spatial selections to begin at the start of the clip, and you will get more predictable tracking when the subject is visible right away.
If the subject is not visible on frame one, trim the clip so the first frame is a strong selection frame.
Keep Clips Short While Testing
Rotoscoping is easier to diagnose in short pieces. For a long shot, test a 5-10 second segment first. Once you know the selection works, process the longer section or split the scene into manageable parts.
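If you plan your cuts before trimming in Premiere, the splitting advice above can be sketched as a small planning helper. This is only an illustration: `plan_segments` is a hypothetical function, not part of Chat Video Pro, and the 30-second default simply mirrors the working target suggested on this page.

```python
from typing import List, Tuple


def plan_segments(duration: float, segment_length: float = 30.0) -> List[Tuple[float, float]]:
    """Split a clip of `duration` seconds into consecutive (start, end) ranges."""
    if duration <= 0 or segment_length <= 0:
        raise ValueError("duration and segment_length must be positive")

    duration = float(duration)
    segments = []
    start = 0.0
    while start < duration:
        end = min(start + segment_length, duration)  # last segment may be shorter
        segments.append((start, end))
        start = end
    return segments


# A 70-second shot split into parts of at most 30 seconds.
parts = plan_segments(70, 30)

# A short test segment taken from the start of the same shot.
test_segment = plan_segments(70, 10)[0]
```

Each segment becomes its own clip, so check that the subject is clearly visible on the first frame of every part before making a Box or Point selection.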
Use Contrast To Your Advantage
SAM 3 does better when the subject is visually distinct from the background. High-contrast clothing, clean lighting, visible outlines, and stable framing all help.
Hard cases include:
Dark clothing on a dark background.
Fast motion blur.
Thin hair against complex detail.
Transparent objects.
Multiple overlapping people.
Subjects leaving and re-entering frame.
Choose The Right Selection Method
Use Text for obvious subjects. Use Box for precision. Use Point for quick correction or simple object picks. If one method fails, switch methods rather than repeating the same bad selection.
Composite Intentionally
A transparent subject is only half the shot. Once you bring it into Premiere:
Place it above the new background.
Match scale and position.
Add color correction so the subject belongs in the scene.
Add shadows, blur, or grain when needed.
Use feathering or additional masks in Premiere for edge cleanup if the shot demands it.
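Placing the transparent subject above a background is the standard "over" operation: each output color is the subject color weighted by its alpha plus the background color weighted by the remainder. Premiere does this for you when you stack layers; the sketch below only shows the arithmetic for one pixel. It assumes straight (non-premultiplied) 8-bit alpha, and `composite_over` is a hypothetical name used for illustration.

```python
from typing import Tuple


def composite_over(fg_rgba: Tuple[int, int, int, int],
                   bg_rgb: Tuple[int, int, int]) -> Tuple[int, ...]:
    """Blend one foreground pixel over one background pixel.

    Assumption: straight (non-premultiplied) alpha in the range 0-255.
    """
    r, g, b, a = fg_rgba
    alpha = a / 255
    return tuple(
        round(fg * alpha + bg * (1 - alpha))
        for fg, bg in zip((r, g, b), bg_rgb)
    )


opaque = composite_over((200, 100, 50, 255), (0, 0, 0))      # subject pixel wins
transparent = composite_over((200, 100, 50, 0), (10, 20, 30))  # background shows through
```

Pixels with partial alpha, typically along hair and soft edges, mix both layers, which is where color correction and feathering make the most visible difference.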
Examples
Talking Head On A New Background
Source: Presenter on a plain or messy background.
Selection: Box around the presenter or text prompt "the person".
Result: Presenter on a transparent background, ready for a branded graphic or generated scene.
Product Cutout
Source: Product demo video.
Selection: Text prompt "the product" or a tight box around the item.
Result: Product motion isolated for ads, thumbnails, landing pages, or overlays.
Dance Layer
Source: Dancer footage.
Selection: Box around the dancer on the first frame.
Result: Dancer isolated over typography, generated backgrounds, or music-video visuals.
Foreground Object Extraction
Source: Clip with a car, prop, animal, or object in front of the camera.
Selection: Text prompt or box around the target.
Result: Object separated for compositing or reuse in another sequence.
Troubleshooting
Track Frame is disabled
Make a selection first. Text mode needs a prompt. Box mode needs a drawn box. Point mode needs at least one point.
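The rule above can be read as a simple per-mode check. The sketch below models it in Python purely to make the logic explicit. The `Selection` data class, its field names, and `can_track_frame` are hypothetical and do not describe Chat Video Pro's internals.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Selection:
    """Hypothetical model of the current selection state."""
    mode: str  # "text", "box", or "point"
    prompt: str = ""
    box: Optional[Tuple[float, float, float, float]] = None  # x, y, width, height
    points: List[Tuple[float, float]] = field(default_factory=list)


def can_track_frame(selection: Selection) -> bool:
    """Return True when the selection has what its mode requires."""
    if selection.mode == "text":
        return bool(selection.prompt.strip())  # needs a non-empty prompt
    if selection.mode == "box":
        return selection.box is not None       # needs a drawn box
    if selection.mode == "point":
        return len(selection.points) > 0       # needs at least one point
    return False                               # unknown mode


empty_text = can_track_frame(Selection(mode="text"))
ready_text = can_track_frame(Selection(mode="text", prompt="the person"))
ready_point = can_track_frame(Selection(mode="point", points=[(120.0, 80.0)]))
```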
Box or Point selection gives an error
Scrub to the beginning of the clip and select on the first frame. Spatial selections are most reliable from frame zero.
The wrong subject is selected
Use a more specific text prompt or switch to Box selection. If there are multiple people or objects, describe position, color, or role, for example "the red car".
The mask is right on the first frame but drifts later
Process a shorter segment, use a clearer first-frame selection, or split the shot around difficult moments. Drift often happens when the subject turns, gets blocked, leaves frame, or overlaps another subject.
The background is not fully removed
Return to the selection step and tighten the box, add exclusion points, or use a simpler subject description. If the footage has low contrast, try a cleaner source or shorter segment.
The transparent result does not preview like a normal MP4
Transparent video formats are different from standard playback formats. Chat Video Pro creates a panel-friendly preview for viewing, while the real transparent output is meant for editing/compositing in Premiere.
Links To Related Studio Pages
Studio - Learn how Studio workflows are organized.
Erase Objects - Remove unwanted people or objects instead of keeping them.
Reshoot - Regenerate a targeted part of a clip.
Add Effects - Add atmosphere, VFX, or style to existing footage.
Relight Scene - Improve lighting before or after isolating a subject.
Upscale - Improve resolution after rotoscoping or generation.
Next: If you want to remove something from the shot while keeping the original background, use Erase Objects.