Skip to main content

Introduction

AudioShake is a cutting-edge audio research company that specializes in producing high-quality sound separation and lyric transcription technology. Stems are essential components of modern content workflows. Our technology gives users access to the different components of a piece of audio for more precise editing and mixing in applications across UGC, gaming, streaming, sports, and more.

This documentation will guide you through the process of using our available stem separation modalities, as well as the services available on each platform. Check out our quick start guide to familiarize yourself with our API.

To begin building with any of our available platforms, get in touch with AudioShake: support@audioshake.ai.

Platforms

API

Implement AudioShake’s stem separation and transcription technology into an existing stack or workflow.

Services

  • Instrument Stem Separation: Separate any track–even mono-track recordings–into stems and instrumentals, for use in sync licensing, immersive mixes, AR/VR, gaming, fitness, UGC, karaoke, and more.
  • Dialogue, Music, and Effects Separation: Isolate dialogue, music and effects tracks for a variety of use cases across film, TV, dubbing, and synthetic voice.
  • Lyric Transcription and Alignment: Transcribe and word-align audio files into readable text, using AudioShake’s leading vocal isolation models.