
I built a CLI so AI agents can generate TTS audio from docs, notes, and summaries
Hey everyone,
I’m building TTSBuddy CLI, a command-line tool that lets AI agents and automation workflows generate text-to-speech audio from text, markdown, notes, scripts, summaries, or long documents.
https://reddit.com/link/1te4hvy/video/3gz9n51nec1h1/player
The idea is simple: agents are good at producing useful written output, but sometimes the better final artifact is audio.
Why I built it:
- turn agent summaries into listenable audio
- generate study audio from notes or papers
- create quick voice previews from scripts
- make long AI outputs easier to consume away from the screen
- support automation pipelines without needing the web UI
I’d love feedback from people building agents:
- Would you use this as a CLI tool, MCP tool, or both?
- Should the default output be a downloaded MP3, an audio URL, or JSON?
- What agent workflows would actually benefit from audio output?
- What would make this easier to plug into your stack?