Formats - AudioShake Developers

Input formats

For best results, use WAV at 16-bit or 24-bit depth. Supports up to 192kHz sample rate. Provide a stereo or mono track.

Format	Extensions	Audio codec
MP4	`.mp4`	AAC or PCM
MOV	`.mov`	AAC or PCM

Only the audio stream is processed — video content is ignored.

Format	Extensions	Use case
JSON	`.json`	Transcript input for the `alignment` model
TXT	`.txt`	Transcript input for the `alignment` model

Set output formats in the formats array when creating a Task. Use the API value shown below.

Format	API value	Models
MP4	`mp4`	All models that output audio

Output video files contain the separated audio stream in the original video container.

Format	API value	Models
JSON	`json`	`transcription`, `alignment`, `music_detection`
SRT	`srt`	`transcription`, `alignment`
TXT	`txt`	`transcription`, `alignment`, `music_detection`