Speaker labels
Two voices, ten voices — every block is tagged with who said it. Rename a speaker once and every reference updates.
- Auto-detection for up to 10 speakers
- One-click rename, applied everywhere
- Survives every export format
Loading...
Upload an audio or video file, get an editable transcript with speaker labels in minutes. $2 per hour. No subscription.
No card needed to sign up.
board_strategy_review.mp3
0:30 · 2 speakers
Two voices, ten voices — every block is tagged with who said it. Rename a speaker once and every reference updates.
Open a transcript in the browser, play the audio alongside it, click any line to jump there. Fix a misheard word in place — autosaved.
Send the transcript out as DOCX, PDF, TXT, SRT, or VTT — or push it straight to a Google Doc you can share with the team.
Works with what you already record
Every file runs through OpenAI's latest speech model — the same family that powers near-human accuracy on clean recordings. Heavy accents, overlapping speakers, and noisy rooms reduce quality, but you can fix anything line by line in the in-browser editor.
Auto-detected for every upload. Same price, same speed, same editor.
No subscription. No tiers. No expiring credits. Pay only for the hours you actually transcribe.
Everything you need to know before uploading your first file.
$2 per hour of audio or video, billed per minute, with a $2 minimum per file. A 15-minute file and a 55-minute file both cost $2. A 90-minute file costs $3. No subscription, no monthly fee.
Most files finish in under 5 minutes. Longer files (2+ hours) can take 10-15 minutes. The page updates automatically — you can leave it open or come back later.
Near-human accuracy on clear audio, using OpenAI's latest speech model. Heavy accents, overlapping speakers, and noisy rooms reduce quality — you can fix anything line by line in the in-browser editor.
You get an automatic refund to your original payment method. No support ticket needed.
MP3, WAV, M4A, MP4, FLAC, OGG, WebM, and Opus. Phone recordings (M4A) and video files (MP4) work out of the box.
100+ languages, auto-detected on every upload. Same price across all of them, same speed, same editor. Includes English, Spanish, German, French, Norwegian, Japanese, Mandarin, Arabic, Hindi, and many more.
Audio is kept 90 days from your last sign-in by default. You can change that per upload or in your profile — anything from instant-delete to one year. The transcript itself stays in your account until you delete it.
Yes. Open any transcript, click into a block, and type. Cmd+Enter saves; Escape reverts. Speaker labels are clickable too — rename "Speaker A" to "Marcos" once and every block updates.
One hour of audio. $2. Editable transcript. No subscription.
No card needed to sign up.