When to use auto-transcription
Auto-transcription works best when:
- You don’t have access to the song’s lyrics
- The lead vocal is clear and easy to follow
- The song uses standard pronunciation in its language
- You want a karaoke track up and running quickly without sourcing lyrics manually
Steps to use auto-transcription
Open the Create Karaoke page
Click Create Karaoke from the Youka home page, then upload your file or paste a URL.
Choose the AI model (optional)
Open Advanced Settings to select the transcription model. Each model offers a different balance of accuracy and credit cost:
| Model | Strengths | Credit cost |
|---|---|---|
| AudioShake | Best accuracy for most songs | Standard |
| MusicAI | Premium detection with syllable-level timing | Higher |
| Whisper | Budget-friendly; good for common languages | Lower |

If you’re unsure which to choose, AudioShake is a reliable default for most use cases.
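
If it helps to see the trade-offs in the table as a decision rule, here is a minimal Python sketch. It is not part of Youka and does not call any Youka API. `SongNeeds` and `suggest_model` are hypothetical names invented for illustration, and the order of the checks is an assumption based only on the strengths listed in the table.

```python
from dataclasses import dataclass


@dataclass
class SongNeeds:
    """Hypothetical description of what matters for one song."""
    needs_syllable_timing: bool = False  # want syllable-level timing
    minimize_credits: bool = False       # keep credit cost as low as possible
    common_language: bool = True         # lyrics are in a widely spoken language


def suggest_model(needs: SongNeeds) -> str:
    """Return a model name from the table (illustrative rule, not Youka's logic)."""
    if needs.needs_syllable_timing:
        return "MusicAI"     # premium detection, higher credit cost
    if needs.minimize_credits and needs.common_language:
        return "Whisper"     # budget-friendly, good for common languages
    return "AudioShake"      # reliable default for most use cases


print(suggest_model(SongNeeds()))                            # AudioShake
print(suggest_model(SongNeeds(minimize_credits=True)))       # Whisper
print(suggest_model(SongNeeds(needs_syllable_timing=True)))  # MusicAI
```

The sketch falls back to AudioShake whenever no specific need is stated, which mirrors the default recommended for most use cases.
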
Select the song's language
Choose the language the lyrics are sung in. This helps the model apply the correct pronunciation patterns.
Credit cost
Auto-transcription uses more credits than providing your own lyrics because the AI performs an extra analysis pass. The exact cost depends on:
- The duration of the song
- The AI model you select
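
To make the two cost factors concrete, the following Python sketch compares models for a song of a given length. Everything numeric here is an assumption: the per-minute rates are invented placeholders that only preserve the Lower / Standard / Higher ordering, and the linear scaling with duration is a simplification. Neither reflects Youka's actual pricing, so check the app for real credit costs.

```python
# Placeholder rates per minute of audio. These numbers are invented for
# illustration only and are NOT Youka's real pricing.
PLACEHOLDER_CREDITS_PER_MINUTE = {
    "Whisper": 1.0,     # "Lower" tier
    "AudioShake": 2.0,  # "Standard" tier
    "MusicAI": 3.0,     # "Higher" tier
}


def estimate_credits(duration_seconds: float, model: str) -> float:
    """Rough estimate, assuming cost scales linearly with song duration."""
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    rate = PLACEHOLDER_CREDITS_PER_MINUTE[model]
    return duration_seconds / 60 * rate


# Compare the relative cost of a 210-second song across the three models.
for name in PLACEHOLDER_CREDITS_PER_MINUTE:
    print(name, estimate_credits(210, name))
```

Only the relative comparison is meaningful: a longer song or a higher-tier model costs more under these assumptions.
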
Reviewing transcription results
AI transcription is accurate for most songs, but it can make mistakes. After your karaoke is created, open the project and check for:
- Misheard words: the AI may transcribe a word phonetically rather than spelling it correctly
- Names and proper nouns: song-specific references, artist names, and place names are common error points
- Stylized pronunciations: words that are deliberately mispronounced or altered as part of the song’s style
What’s next
Edit lyrics
Fix any transcription errors directly in the Studio editor.
Manual sync
Fine-tune the timing of individual words and lines.