
I made a small ElevenLabs cost/workflow calculator for long-form TTS users
I’ve been thinking a lot about the hidden cost of long-form TTS workflows.
Short clips are easy. But once you’re doing YouTube narration, course audio, audiobook chapters, training material, etc., the expensive part is not always the final export. It’s all the retakes:
- one paragraph sounds off
- pacing feels too clean
- you regenerate a section
- then compare 3-4 takes
- then export again
- then the credits slowly disappear
So I made a small calculator/workflow finder for people comparing cloud TTS usage with local/offline draft generation:
https://www.murmurtts.com/tools/elevenlabs-alternative-calculator
It estimates rough monthly voice costs, where credits get burned, and whether a local Mac workflow makes sense for drafts/retakes before using ElevenLabs for final output.
I built it because I’m working on a local Mac TTS app, so full disclosure there. But the calculator is free and might be useful even if you stay fully on ElevenLabs.
Curious how other people here handle long-form iteration: do you regenerate tiny sections, full chunks, or use a separate draft pipeline before final voice?