How Much Does Closed Captioning Cost in 2026? Real Per-Minute Rates
Founder · Building TranscribeCat since 2024 · Last updated May 2, 2026
Short answer: closed captioning costs between $0.04 and $7+ per minute of video in 2026, depending on how it's produced. DIY AI tools sit at the bottom ($0.04–$0.10/min), self-serve AI services like TranscribeCat sit in the affordable middle (about $0.03/min, or $2/hour), professional human services run $1–$3/min, and broadcast-grade live captioning can reach $7/min or more.
Below is the actual pricing landscape, what you get at each tier, and how to decide which one fits your video.
Closed captioning rates at a glance
| Tier | Per minute | Accuracy | Turnaround |
|---|---|---|---|
| DIY (Whisper, YouTube auto-captions) | $0.00–$0.10 | 85–93% | Minutes |
| Self-serve AI (TranscribeCat, TurboScribe) | $0.03–$0.17 | 93–97% | Minutes |
| AI + human review (Rev AI, Sonix Pro) | $0.50–$1.50 | 97–99% | Hours |
| Human transcription (Rev, GoTranscript, 3PlayMedia) | $1.25–$3.00 | 99%+ | 12–48 hours |
| Live captioning (broadcast, legal) | $5.00–$7.50 | 99%+ realtime | Real-time |
Prices reflect public per-minute rates as of May 2026. Volume discounts apply at most human-tier services starting around 100 hours per month.
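To turn the table into a budget figure, multiply a video's running time by the low and high rate for each tier. Here is a minimal Python sketch using the rates from the table above. The `TIERS` dictionary and the `cost_range` function are illustrative names, not part of any provider's API.

```python
# Per-minute rate ranges in USD, copied from the table above.
TIERS = {
    "DIY": (0.00, 0.10),
    "Self-serve AI": (0.03, 0.17),
    "AI + human review": (0.50, 1.50),
    "Human transcription": (1.25, 3.00),
    "Live captioning": (5.00, 7.50),
}


def cost_range(minutes, tier):
    """Return the (low, high) cost in USD for a video of the given length."""
    low_rate, high_rate = TIERS[tier]
    return round(minutes * low_rate, 2), round(minutes * high_rate, 2)


if __name__ == "__main__":
    # Example: price a 45-minute video at every tier.
    for name in TIERS:
        low, high = cost_range(45, name)
        print(f"{name}: ${low:.2f} to ${high:.2f}")
```

The ranges ignore volume discounts and any add-on fees, so treat the output as a first estimate rather than a quote.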
Why captioning costs vary 175× across tiers
The price gap between $0.04 and $7+ per minute is real, not arbitrary. Three things drive it:
- Human time. A skilled captioner needs roughly 4× the running time of the audio. One hour of video = about 4 hours of human work, plus QA.
- Accuracy floor. 99% accuracy is the benchmark usually cited for ADA and FCC compliance and written into most broadcaster contracts. AI alone can't guarantee that on noisy audio, accents, or technical jargon, so human review is added.
- Realtime constraint. Live captioning compresses the QA window to zero, requires redundant captioner shifts, and demands specialised software. That drives the price into the $5–$7+ range.
When DIY AI captioning is enough
If you're a creator publishing to YouTube, building an internal training library, or attaching supplementary captions to a podcast, pure AI captioning is fine. At 93–97% accuracy you'll have to fix proper nouns and the occasional phrase, but the math beats spending $1.50/min for the marginal accuracy gain.
For example: 1 hour of clear interview audio at TranscribeCat's $2/hour rate costs $2.00. The same hour at a human service averages $108 ($1.80/min × 60). The $106 difference would pay for 53 more hours of AI captioning at the $2/hour rate.
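The comparison can be reproduced for any amount of audio with a short Python sketch. The two rates come from the example above; the `compare` function and its field names are hypothetical, chosen only for illustration.

```python
AI_RATE_PER_HOUR = 2.00     # USD, TranscribeCat's stated rate
HUMAN_RATE_PER_MIN = 1.80   # USD, the average human rate used in the example


def compare(hours):
    """Compare AI and human captioning costs for the given hours of audio."""
    ai_cost = round(hours * AI_RATE_PER_HOUR, 2)
    human_cost = round(hours * 60 * HUMAN_RATE_PER_MIN, 2)
    savings = round(human_cost - ai_cost, 2)
    return {
        "ai_cost": ai_cost,
        "human_cost": human_cost,
        "savings": savings,
        # How many further hours of AI captioning the savings would cover.
        "extra_ai_hours": round(savings / AI_RATE_PER_HOUR, 1),
    }


if __name__ == "__main__":
    print(compare(1))
```

For one hour of audio this gives a human cost of $108, savings of $106, and 53 further hours of AI captioning covered by those savings.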
See our cheapest transcription service comparison for the full breakdown across 10+ providers.
When you need humans (or AI + human)
Three scenarios where the human tier is actually worth $1+/min:
- Legal compliance. Captions that must stand up in ADA Title III disputes are typically expected to reach 99%+ accuracy. AI captions don't meet that bar without review.
- Heavy accents or multilingual mixing. Code-switching between languages mid-sentence is still where AI most often fails. Human captioners catch this; AI often doesn't.
- Broadcast or theatrical release. Networks and streamers have contractual accuracy floors that AI can't hit unaided.
Hidden costs to watch
The per-minute rate isn't the whole picture. Three line items that frequently surprise people:
- Subscription floors. Otter charges $20/month for 1,200 minutes — but if you only caption 60 minutes that month, your effective rate is $0.33/min, not the $0.017/min the marketing implies.
- Speaker labels and timestamps. Some services charge extra for speaker diarization (the “Speaker 1 / Speaker 2” labels) and SRT/VTT export. TranscribeCat includes both at no extra charge.
- Rush fees. Human services typically charge 50–100% premiums for under-24-hour turnaround. Factor this into your editorial timeline.
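Two of these hidden costs are simple arithmetic and easy to check before you commit. The Python sketch below computes the effective per-minute rate of a flat subscription and the cost of a job after a rush premium. The function names are illustrative, and the figures in the example are the ones quoted in the list above.

```python
def effective_rate(monthly_fee, minutes_used):
    """Effective per-minute cost (USD) of a flat monthly subscription."""
    if minutes_used <= 0:
        raise ValueError("minutes_used must be positive")
    return monthly_fee / minutes_used


def with_rush_fee(base_cost, premium):
    """Apply a rush premium given as a fraction (0.5 means +50%)."""
    return base_cost * (1 + premium)


if __name__ == "__main__":
    # A $20/month plan used for 60 minutes vs. the full 1,200 minutes.
    print(f"light use: ${effective_rate(20, 60):.2f}/min")
    print(f"full use:  ${effective_rate(20, 1200):.3f}/min")
    # A $108 human-transcribed hour with a 50% rush premium.
    print(f"rushed:    ${with_rush_fee(108, 0.5):.2f}")
```

Running the subscription numbers against your real monthly volume, not the plan's ceiling, is what reveals the true per-minute price.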
A working budget for closed captions
A practical budget rule that holds for most creators and small media teams:
- Under 5 hours/month, internal-use video — DIY AI ($0.04–$0.10/min)
- 5–50 hours/month, public content — self-serve AI ($0.03–$0.17/min)
- Public content with legal exposure — AI + human review ($0.50–$1.50/min)
- Broadcast or live — professional human ($1.50–$7.50/min)
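The budget rule can be written as a small decision function. This is a sketch under stated assumptions, not a definitive policy. The rule above does not say what to do with public video under 5 hours/month or any volume above 50 hours/month, so the code assumes those cases fall back to self-serve AI, and it checks broadcast/live and legal exposure before volume. The `suggest_tier` name and its parameters are hypothetical.

```python
def suggest_tier(hours_per_month, public=False, legal_exposure=False,
                 broadcast_or_live=False):
    """Suggest a captioning tier from the budget rule in this section."""
    if hours_per_month < 0:
        raise ValueError("hours_per_month must be non-negative")
    # Broadcast or live work needs professional human captioners.
    if broadcast_or_live:
        return "professional human ($1.50-$7.50/min)"
    # Public content with legal exposure gets AI plus human review.
    if public and legal_exposure:
        return "AI + human review ($0.50-$1.50/min)"
    # Low-volume, internal-use video is fine with DIY AI.
    if not public and hours_per_month < 5:
        return "DIY AI ($0.04-$0.10/min)"
    # Assumption: every remaining case, including volumes the rule
    # does not cover, defaults to self-serve AI.
    return "self-serve AI ($0.03-$0.17/min)"


if __name__ == "__main__":
    print(suggest_tier(3))                    # small internal library
    print(suggest_tier(20, public=True))      # regular public content
    print(suggest_tier(20, public=True, legal_exposure=True))
    print(suggest_tier(2, broadcast_or_live=True))
```

Adjust the order of the checks if your own priorities differ, for example if internal training video also carries compliance requirements.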
Where TranscribeCat fits
TranscribeCat sits in the self-serve AI tier: $2 per hour of audio (~$0.033/min), no subscription, with speaker labels and SRT/VTT export included. Audio is encrypted, you choose the retention period (from immediate deletion to 1 year), and we issue automatic refunds on failed jobs.
See /pricing for the full feature list, or check a sample transcript before signing up.
Try TranscribeCat for $2
Caption your first hour of video for $2. No subscription, no minimum commitment. Output as TXT, SRT, or Word.