Input formats
Audio
| Format | Extensions |
|---|---|
| WAV | .wav |
| AIFF | .aiff, .aif |
| FLAC | .flac |
| MP3 | .mp3 |
| AAC | .aac, .m4a |
Video
| Format | Extensions | Audio codec |
|---|---|---|
| MP4 | .mp4 | AAC or PCM |
| MOV | .mov | AAC or PCM |
Text
| Format | Extensions | Use case |
|---|---|---|
| JSON | .json | Transcript input for the alignment model |
| TXT | .txt | Transcript input for the alignment model |
Output formats
Set output formats in theformats array when creating a Task. Use the API value shown below.
Audio
| Format | API value | Models |
|---|---|---|
| WAV | wav | All models that output audio |
| MP3 | mp3 | All models that output audio |
| FLAC | flac | All models that output audio |
| AIFF | aiff | All models that output audio |
Video
| Format | API value | Models |
|---|---|---|
| MP4 | mp4 | All models that output audio |
Text
| Format | API value | Models |
|---|---|---|
| JSON | json | transcription, alignment, music_detection |
| SRT | srt | transcription, alignment |
| TXT | txt | transcription, alignment, music_detection |
Limitations
- Encrypted or DRM-protected content is not supported
- Multi-channel or surround formats (e.g. 5.1) are not supported