Introduction

AudioShake is an audio research company that specializes in high-quality sound separation and lyric transcription technology. Stems are essential components of modern content workflows, opening up audio to new uses, and lyrics play an important role in the music industry, from providing metadata for sync briefs and DSP recommendations to driving fan engagement.

Our proprietary, patented technology, including state-of-the-art lyric transcription, has processed millions of minutes of audio for users across industries. AudioShake gives users access to the individual components of a piece of audio for more precise editing and mixing in applications across music, film, television, UGC, gaming, streaming, sports, and more. See how customers and partners have used our technology.

This documentation guides you through our available stem separation modalities and the services available on each platform. Check out our quick start guide to familiarize yourself with our API.

To begin building with any of our available platforms, get in touch with AudioShake: support@audioshake.ai.

Platforms

API

Integrate AudioShake’s stem separation and transcription technology into an existing stack or workflow.

SDK

Bring stem separation to edge devices with a lightweight SDK that delivers isolated stems in real time.

Widget

Quickly perform stem separation and transcription with AudioShake’s embedded JavaScript pop-up tool.

Services

Instrument Stem Separation: Separate any track, even a mono-track recording, into stems and instrumentals for use in sync licensing, immersive mixes, AR/VR, gaming, fitness, UGC, karaoke, and more.
Dialogue, Music, and Effects Separation: Isolate dialogue, music, and effects tracks for a variety of use cases across film, TV, dubbing, and synthetic voice.
Lyric Transcription and Alignment: Transcribe and word-align audio files into readable text, using AudioShake’s leading vocal isolation models.
Multi-Speaker Separation: Isolate individual voices from overlapping speech in podcasts, film, and TV content. Create a separate stream for each speaker for use in dubbing, accessibility, voice AI, and more.
