Models

When making calls with your API, these are the model names used when creating jobs. For example, in this call, the model name is wav:

curl -L -X POST 'https://groovy.audioshake.ai/job' \
    -H 'Content-Type: application/json' \
    -H 'Authorization: Bearer <TOKEN>' \
    --data-raw '{
      "metadata": {
        "format": "wav",
        "name": "vocals"
      },
      "callbackUrl": "https://example.com/webhook/vocals",
      "assetId": "abc123"
    }'

Instrument Stem Separation

Name	Stem Name	Description	API	Widget
Instrumental	instrumental	Music without vocals, only instruments	✅	✅
Drums	drums	Percussion instruments producing rhythmic beats	✅	✅
Vocals	vocals	Isolates singing and vocal sounds	✅	✅
Bass	bass	Instruments producing low-frequency sounds, typically the bass guitar or synthesizer bass lines	✅	✅
Other	other	Remaining instrumentation after removing vocals, drums, and bass	✅	✅
Guitar	guitar	Instruments from the guitar family, including electric, acoustic, and classical guitars	✅
Other-x-Guitar	other-x-guitar	Remaining instrumentation after removing vocals, drums, bass, and guitar	✅	✅
Piano	piano	Instruments like Rhodes piano, upright piano, grand piano, and keyboard	✅	✅
Wind	wind	Instruments like flute, saxophone, producing sound by vibrating air	✅	✅
Strings	strings	Orchestral string instruments like violin, viola, cello, and double bass	✅	✅

Residual

If you would like to generate a residual stem, please set residual in the metadata field to true.

For more info, contact info@audioshake.ai

Dialogue, Music, & Effects

Name	Stem Name	Description	API
Dialogue	dialogue	Speech or vocals isolated from any other sound	✅
Music removal	music_removal	Removing music from audio while retaining dialogue, background effects, and natural sound	✅
Background (Music & FX)	music_fx	Remove dialogue to extracting a clean background stem of music and effects	✅

Transcription & Alignment

Name	Stem Name	Description	API	Widget
Transcription	transcription	Text representation of spoken words or audio content	✅	✅
Alignment	alignment	Synchronization of audio and corresponding text or captions	✅	✅

Example

In the provided cURL command, the "name" field in the JSON payload specifies the desired stem type for processing the audio file. It allows you to indicate what specific part of the audio you want to extract or process.

curl -L -X POST 'https://groovy.audioshake.ai/job' \
    -H 'Content-Type: application/json' \
    -H 'Authorization: Bearer <TOKEN>' \
    --data-raw '{
    "metadata": {
        "format": "wav",
        "name": "vocals"
    },
    "callbackUrl": "https://example.com/webhook/vocals",
    "assetId": "abc123"
    }'

Instrument Stem Separation​

Dialogue, Music, & Effects​

Transcription & Alignment​

Example​

Instrument Stem Separation

Dialogue, Music, & Effects

Transcription & Alignment

Example