Create a Task
This example extracts both the dialogue and the background (music + effects) stems:Outputs
| Target | Role |
|---|---|
dialogue | Clean speech — use as reference or feed into translation/dubbing |
music_fx | Music + effects bed — layer localized voiceover on top |
Use cases
- Prepare content for localization and foreign-language dubbing
- Clean up podcast audio by isolating the host’s voice
- Extract clean dialogue for speech-to-text or AI training data
- Separate effects and ambience for sound design workflows
Stem Separation
Separate music into individual instruments instead.
Music Detection
Find where music appears before separating.