Our Transcription and Alignment API transcribes audio files into written text and aligns each transcription with the corresponding timestamps in the audio file. We provide the following options:

  1. Transcription Only: Given an audio file, AudioShake will transcribe audio into readable text.
  2. Alignment Only: Given an audio file and a transcription JSON file, AudioShake will map the transcribed text to the corresponding timestamps in the audio file.
  3. Alignment + Transcription: Given only an audio file, AudioShake will transcribe the audio and align the resulting text to its timestamps, combining the first two options into a single step.
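
To make the difference in required inputs concrete, here is a minimal sketch that models the three options and reports which inputs are still missing for a given one. It does not call the AudioShake API: the names `Mode`, `REQUIRED_INPUTS`, and `missing_inputs`, and the input labels `audio` and `transcript_json`, are hypothetical and chosen only for illustration.

```python
from enum import Enum
from typing import Optional


class Mode(Enum):
    """The three options listed above (hypothetical names, not API values)."""
    TRANSCRIPTION = "transcription"
    ALIGNMENT = "alignment"
    TRANSCRIPTION_AND_ALIGNMENT = "transcription_and_alignment"


# Inputs each option needs, as described in the list above.
# Only Alignment Only requires an existing transcription JSON file.
REQUIRED_INPUTS = {
    Mode.TRANSCRIPTION: {"audio"},
    Mode.ALIGNMENT: {"audio", "transcript_json"},
    Mode.TRANSCRIPTION_AND_ALIGNMENT: {"audio"},
}


def missing_inputs(mode: Mode,
                   audio: Optional[str] = None,
                   transcript_json: Optional[str] = None) -> set:
    """Return the names of required inputs that were not supplied."""
    supplied = {
        name
        for name, value in (("audio", audio), ("transcript_json", transcript_json))
        if value is not None
    }
    return REQUIRED_INPUTS[mode] - supplied
```

For example, `missing_inputs(Mode.ALIGNMENT, audio="song.wav")` returns `{"transcript_json"}`, while the same call with `Mode.TRANSCRIPTION_AND_ALIGNMENT` returns an empty set, since that option needs only the audio file.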