SAM 3 Rotoscoping
Professional-grade background removal and object isolation using Meta's SAM 3

Tutorial
How It Works
Import video - Use Import Clip button or drag & drop
Click "Edit" - On video thumbnail
Select SAM 3 - From model selector (default for videos)
Choose selection method - Text prompt, box, or point selection
Track Frame - Preview mask on first frame (optional)
Track Entire Video - Process all frames
Remove Background - Export transparent ProRes 4444 video
Selection Methods

Text Prompt Selection
Best for: Clear subjects, simple scenes
Select "Text" input mode
Type description: "the person in the white shirt"
Click "Track Frame" - Preview on first frame
Review mask - Ensure correct selection
Track Entire Video - Process all frames
Example prompts:
"the person"
"the car"
"the product"
"the dancer in the center"

Box Selection
Best for: Precise control, complex scenes
Select "Box" input mode
Draw rectangle around subject
Optional: Add helper text describing selection
Click "Track Frame" - Preview mask
Track Entire Video - Process all frames
Tips:
Draw box tightly around subject
Include entire subject in box
Helper text improves accuracy

Point Selection
Best for: Quick selection, single objects
Select "Point" input mode
Click on subject to add points
Add multiple points for better accuracy
Optional: Add helper text
Click "Track Frame" - Preview mask on first frame
Track Entire Video - Process all frames
Tips:
Click on center of subject
Multiple points improve tracking
Use for single, well-separated objects; prefer box selection for complex scenes
Workflow Steps
Step 1: Track Frame (Preview)
Purpose: Preview your selection before processing entire video
Make selection (text, box, or point)
Click "Track Frame"
Wait for processing (a few seconds)
Review mask overlay on first frame
Adjust if needed - Change selection and track again
Why this matters:
Saves time and credits
Ensures correct selection
Allows refinement before full processing
Shows mask quality
Step 2: Track Entire Video
Purpose: Process all frames of your video
Confirm selection looks good from Track Frame
Click "Track Entire Video"
Wait for processing (scales with video length)
Progress indicator shows status
Mask data generated for all frames
Processing time:
10-second clip: ~30-60 seconds
30-second clip: ~2-3 minutes
Longer clips take proportionally longer
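The figures above work out to roughly 3x to 6x the clip length. A minimal sketch of that arithmetic, assuming the ratio stays roughly constant; the function name and the ratios are illustrative and not part of the product:

```python
def estimate_processing_seconds(clip_seconds: float) -> tuple[float, float]:
    """Return a rough (low, high) estimate of full-video tracking time in seconds."""
    if clip_seconds <= 0:
        raise ValueError("clip_seconds must be positive")
    # Assumed ratios read off the documented figures:
    # 10 s clip -> 30-60 s (3x-6x), 30 s clip -> 2-3 min (4x-6x).
    low_ratio, high_ratio = 3.0, 6.0
    return clip_seconds * low_ratio, clip_seconds * high_ratio


low, high = estimate_processing_seconds(20)
print(f"20-second clip: about {low:.0f}-{high:.0f} seconds")
```

Treat the result as a planning aid only. Actual times also depend on resolution and connection speed.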
Step 3: Remove Background
Purpose: Export transparent video file
Wait for tracking to complete
Click "Remove Background"
FFmpeg conversion creates transparent video
ProRes 4444 output with alpha channel
Download or Import to Premiere Pro
Output format:
Format: ProRes 4444 (.mov)
Alpha channel: Full transparency
Quality: Professional grade
Compatibility: Premiere Pro ready
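The exact FFmpeg invocation the app runs is not documented here. As a rough sketch of how a ProRes 4444 file with alpha can be produced by hand, the helper below builds an FFmpeg command that merges a grayscale mask video into the source as an alpha channel. The function name and file names are hypothetical, and the existence of a separate mask video is an assumption:

```python
import shlex


def prores4444_command(source: str, mask: str, output: str) -> list[str]:
    """Build an FFmpeg command that writes ProRes 4444 with an alpha channel."""
    return [
        "ffmpeg", "-y",
        "-i", source,   # original footage
        "-i", mask,     # assumed: grayscale mask video, white = keep
        # Convert the mask to gray, then use it as the alpha plane of the source.
        "-filter_complex", "[1:v]format=gray[m];[0:v][m]alphamerge[out]",
        "-map", "[out]",               # video only; audio is not mapped here
        "-c:v", "prores_ks",           # FFmpeg's ProRes encoder
        "-profile:v", "4444",          # ProRes 4444 profile
        "-pix_fmt", "yuva444p10le",    # 10-bit 4:4:4 with alpha
        output,
    ]


cmd = prores4444_command("clip.mp4", "mask.mp4", "clip_alpha.mov")
print(shlex.join(cmd))
```

The printed command can be pasted into a terminal where FFmpeg is installed.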
Best Practices
Video Preparation
Keep clips short - Under 30 seconds for faster processing
Good contrast - Subject should stand out from background
Stable footage - Works best with steady shots
Clear subject - Well-defined subjects track better
Good lighting - Well-lit subjects produce cleaner masks
Selection Tips
Be specific in text prompts - Prefer "the person in the blue shirt" over "the person"
Draw tight boxes - Include entire subject, minimal background
Use multiple points - For point selection, add 2-3 points
Preview first - Always use Track Frame before full processing
Refine if needed - Adjust selection and re-track if mask is wrong
Processing Tips
Start with short clips - Test workflow with 5-10 second clips
Check first frame - Ensure Track Frame looks good
Be patient - Full video processing takes time
Monitor progress - Watch progress indicator
Save results - Download or import when complete
Use Cases
Product Isolation
Example:
Import product demo video
Text prompt: "the product"
Track Entire Video
Remove Background
Result: Product on transparent background for compositing
Portrait Background Removal
Example:
Import talking head video
Box selection around person
Track Entire Video
Remove Background
Result: Person isolated for new background
Object Extraction
Example:
Import video with multiple objects
Text prompt: "the car in the foreground"
Track Entire Video
Remove Background
Result: Car isolated for compositing
Output Details
ProRes 4444 Format
Characteristics:
Codec: Apple ProRes 4444
Alpha channel: Full transparency support
Quality: Professional, lossless alpha
File size: Much larger than MP4 (lightly compressed, high quality)
Compatibility: Premiere Pro, After Effects, Final Cut Pro
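To confirm that a downloaded file really carries an alpha channel, one option is to inspect it with ffprobe. The sketch below builds the ffprobe command and checks the reported pixel format; the helper names are illustrative, and it assumes ffprobe is installed separately:

```python
import json


def ffprobe_command(path: str) -> list[str]:
    """Build an ffprobe command that reports codec and pixel format as JSON."""
    return [
        "ffprobe", "-v", "error",
        "-select_streams", "v:0",
        "-show_entries", "stream=codec_name,pix_fmt",
        "-of", "json",
        path,
    ]


def has_alpha(ffprobe_json: str) -> bool:
    """Return True if the first video stream uses a pixel format with alpha."""
    streams = json.loads(ffprobe_json).get("streams", [])
    if not streams:
        return False
    pix_fmt = streams[0].get("pix_fmt", "")
    # Common FFmpeg pixel formats that carry alpha start with these prefixes.
    return pix_fmt.startswith(("yuva", "rgba", "bgra", "argb", "abgr", "gbrap"))


sample = '{"streams": [{"codec_name": "prores", "pix_fmt": "yuva444p10le"}]}'
print(has_alpha(sample))
```

In practice, run the command from `ffprobe_command` with `subprocess.run(..., capture_output=True, text=True)` and pass its standard output to `has_alpha`.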
Importing to Premiere Pro
Download transparent video
Import into Premiere Pro project
Place on timeline above background
Composite with other footage
Alpha channel automatically used
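Conceptually, placing the transparent clip above a background applies the standard "over" blend per pixel: result = alpha x foreground + (1 - alpha) x background. A minimal sketch for a single pixel, assuming straight (non-premultiplied) alpha and channel values in the 0 to 1 range; how Premiere Pro interprets the alpha internally is not covered here:

```python
def composite_over(fg: tuple[float, float, float],
                   alpha: float,
                   bg: tuple[float, float, float]) -> tuple[float, ...]:
    """Blend one foreground RGB pixel over a background pixel (straight alpha)."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be between 0 and 1")
    return tuple(alpha * f + (1.0 - alpha) * b for f, b in zip(fg, bg))


red, blue = (1.0, 0.0, 0.0), (0.0, 0.0, 1.0)
print(composite_over(red, 1.0, blue))  # fully opaque: foreground wins
print(composite_over(red, 0.0, blue))  # fully transparent: background shows
print(composite_over(red, 0.5, blue))  # soft mask edge: even mix
```

Soft mask edges (alpha between 0 and 1) are what make hair and motion blur blend naturally into the new background.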
Limitations
Video Length
Recommended: Under 30 seconds
Maximum: 5 minutes
Processing time: Scales with length
Longer clips: Consider trimming first
Resolution
Maximum: 2160p (4K) for Kling O1 Edit
Standard: 1080p recommended
Higher resolution: Slower processing
Lower resolution: Faster processing
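The length and resolution limits above can be checked before uploading. This is a small pre-flight sketch that uses only the numbers listed in this section; the function and constant names are hypothetical, and the app may enforce the limits differently:

```python
RECOMMENDED_MAX_SECONDS = 30   # recommended clip length
HARD_MAX_SECONDS = 5 * 60      # maximum clip length
RECOMMENDED_HEIGHT = 1080      # recommended resolution
MAX_HEIGHT = 2160              # maximum resolution (4K)


def preflight(duration_seconds: float, height: int) -> list[str]:
    """Return a list of warnings and errors for a clip; empty means no issues."""
    issues = []
    if duration_seconds > HARD_MAX_SECONDS:
        issues.append("error: clip is longer than 5 minutes; trim it first")
    elif duration_seconds > RECOMMENDED_MAX_SECONDS:
        issues.append("warning: clip is over 30 seconds; processing will be slow")
    if height > MAX_HEIGHT:
        issues.append("error: resolution is above 2160p; downscale first")
    elif height > RECOMMENDED_HEIGHT:
        issues.append("warning: resolution is above 1080p; processing will be slower")
    return issues


for issue in preflight(duration_seconds=45, height=2160):
    print(issue)
```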
Subject Complexity
Simple subjects: Work best (single person, object)
Complex scenes: May require refinement
Multiple subjects: May need separate passes
Overlapping objects: Can be challenging
Troubleshooting
"Track Frame not working"
Solutions:
Ensure you've made a selection (text, box, or point)
Check video is properly loaded
Try different selection method
Verify API keys are configured
"Mask is wrong on some frames"
Solutions:
Use more specific text prompt
Try box selection for better control
Add helper text for box/point selection
Process in shorter segments if needed
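For the last suggestion, a clip can be divided into segments no longer than the recommended 30 seconds and each segment tracked separately. A minimal sketch of the split arithmetic; the function name is illustrative and the 30-second default is taken from the recommendation in this guide:

```python
def split_into_segments(duration_seconds: float,
                        max_segment_seconds: float = 30.0) -> list[tuple[float, float]]:
    """Split a clip into consecutive (start, end) segments of limited length."""
    if duration_seconds <= 0 or max_segment_seconds <= 0:
        raise ValueError("durations must be positive")
    segments = []
    start = 0.0
    while start < duration_seconds:
        end = min(start + max_segment_seconds, duration_seconds)
        segments.append((start, end))
        start = end
    return segments


for start, end in split_into_segments(75.0):
    print(f"segment {start:.0f}s to {end:.0f}s")
```

Each segment needs its own selection and Track Frame preview, since the subject may have moved by the segment's first frame.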
"Processing is too slow"
Solutions:
Trim video to shorter duration
Use lower resolution source
Keep clips under 30 seconds
Check internet connection
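Trimming and downscaling can be done before import with FFmpeg. The helper below is a sketch that builds such a command; the function name, file names, and default values are assumptions, and FFmpeg must be installed separately:

```python
import shlex


def trim_and_downscale_command(source: str, output: str,
                               start_seconds: float = 0.0,
                               duration_seconds: float = 30.0,
                               height: int = 1080) -> list[str]:
    """Build an FFmpeg command that trims a clip and scales it to a given height."""
    return [
        "ffmpeg", "-y",
        "-ss", str(start_seconds),      # seek to the start point before decoding
        "-i", source,
        "-t", str(duration_seconds),    # keep only this many seconds
        "-vf", f"scale=-2:{height}",    # keep aspect ratio, force an even width
        "-c:a", "copy",                 # leave audio untouched
        output,
    ]


cmd = trim_and_downscale_command("long_4k.mp4", "short_1080p.mp4", start_seconds=12)
print(shlex.join(cmd))
```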
"Background not fully removed"
Solutions:
Refine selection with Track Frame
Use box selection for precise control
Try different selection method
Process again with adjusted selection
Next: Learn about Reshoot for video regeneration and style transfer.