Create a Task
Output format
Music is detected in 10-second intervals. The output JSON contains an array of segments where music is present, each with a confidence score:

| Field | Description |
|---|---|
| start_time | Start of the music segment (seconds) |
| end_time | End of the music segment (seconds) |
| confidence | Detection confidence score (0–1) |
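
As an illustration of how this output might be consumed, the following minimal sketch parses a sample payload and keeps only segments above a confidence threshold. The field names come from the table above; the top-level `segments` key, the sample values, and the `confident_segments` helper are assumptions made for this example, not part of the documented API.

```python
import json

# Hypothetical sample payload. The "segments" wrapper key and the values
# are assumptions; only the three field names come from the documentation.
SAMPLE_OUTPUT = """
{
  "segments": [
    {"start_time": 0, "end_time": 10, "confidence": 0.94},
    {"start_time": 10, "end_time": 20, "confidence": 0.41},
    {"start_time": 50, "end_time": 60, "confidence": 0.88}
  ]
}
"""


def confident_segments(raw_json, min_confidence=0.5):
    """Return the segments whose confidence is at least min_confidence."""
    payload = json.loads(raw_json)
    # Accept either a bare array or an object wrapping the array,
    # since the exact envelope is not specified here.
    if isinstance(payload, list):
        segments = payload
    else:
        segments = payload.get("segments", [])
    return [s for s in segments if s["confidence"] >= min_confidence]


for seg in confident_segments(SAMPLE_OUTPUT):
    print(f'{seg["start_time"]}s-{seg["end_time"]}s  confidence={seg["confidence"]:.2f}')
```

The 0.5 threshold is arbitrary; a suitable cutoff depends on how costly false positives are for your workflow.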
Use cases
- Flag content that requires music licensing review
- Build searchable timelines of music usage across archives
- Trigger stem separation or transcription only on segments containing music
- Monitor broadcast compliance with music usage policies
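
For timeline building or for triggering downstream processing, it can help to merge back-to-back 10-second segments into continuous spans first. The sketch below shows one way to do that; the `merge_adjacent` helper, its `max_gap` parameter, and the sample data are hypothetical and only assume the `start_time` and `end_time` fields described in the output format.

```python
def merge_adjacent(segments, max_gap=0.0):
    """Merge segments separated by at most max_gap seconds into (start, end) spans."""
    spans = []
    for seg in sorted(segments, key=lambda s: s["start_time"]):
        if spans and seg["start_time"] - spans[-1][1] <= max_gap:
            # Extend the current span to cover this segment.
            spans[-1] = (spans[-1][0], max(spans[-1][1], seg["end_time"]))
        else:
            # Gap is too large (or this is the first segment): start a new span.
            spans.append((seg["start_time"], seg["end_time"]))
    return spans


# Hypothetical detection results for illustration only.
example = [
    {"start_time": 0, "end_time": 10, "confidence": 0.94},
    {"start_time": 10, "end_time": 20, "confidence": 0.81},
    {"start_time": 50, "end_time": 60, "confidence": 0.88},
]

music_spans = merge_adjacent(example)
print(music_spans)  # [(0, 20), (50, 60)]
```

Each resulting span could then be passed to a separate job, such as stem separation or transcription, so that only the portions containing music are processed.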
Dialogue Separation
Separate speech from music and effects in your content.
Models
See all available detection and analysis models.