Lyric Transcription - AudioShake Developers

Create a Task

import requests

API_KEY = "your_api_key"
HEADERS = {"Content-Type": "application/json", "x-api-key": API_KEY}

response = requests.post(
    "https://api.audioshake.ai/tasks",
    headers=HEADERS,
    json={
        "assetId": "your_asset_id",
        "targets": [
            {"model": "transcription", "formats": ["json"]}
        ]
    }
)

task_id = response.json()["id"]
print(f"Task created: {task_id}")

const API_KEY = "your_api_key";
const headers = { "Content-Type": "application/json", "x-api-key": API_KEY };

const createRes = await fetch("https://api.audioshake.ai/tasks", {
  method: "POST",
  headers,
  body: JSON.stringify({
    assetId: "your_asset_id",
    targets: [
      { model: "transcription", formats: ["json"] }
    ]
  })
});

const { id: taskId } = await createRes.json();
console.log(`Task created: ${taskId}`);

curl -X POST "https://api.audioshake.ai/tasks" \
  -H "Content-Type: application/json" \
  -H "x-api-key: $AUDIOSHAKE_API_KEY" \
  -d '{
    "assetId": "your_asset_id",
    "targets": [
      { "model": "transcription", "formats": ["json"] }
    ]
  }'

Check Task status to monitor progress and download results, or use webhooks to be notified when each target completes.

Use clean, full-length source audio for best results. Heavily compressed or low-bitrate files reduce transcription accuracy.

Need word-level timing for karaoke or subtitles? Use the alignment model instead — it provides precise per-word timestamps. See Models for details.

Use cases

Render karaoke-style highlighted lyrics in real time

Convert to SRT or VTT subtitle formats

Build searchable lyric databases with per-line timestamps

Power lyric-synced visual experiences

Build Karaoke Tracks

Combine lyric transcription with stem separation.

Models

See all available transcription and alignment models.

​Create a Task

​Use cases

Build Karaoke Tracks

Models

Create a Task

Use cases