u/k800elik

I built a CLI so AI agents can generate TTS audio from docs, notes, and summaries
▲ 3 r/AIAgentsDirectory+1 crossposts

I built a CLI so AI agents can generate TTS audio from docs, notes, and summaries

Hey everyone,

I’m building TTSBuddy CLI, a command-line tool that lets AI agents and automation workflows generate text-to-speech audio from text, markdown, notes, scripts, summaries, or long documents.

https://reddit.com/link/1te4hvy/video/3gz9n51nec1h1/player

The idea is simple: agents are good at producing useful written output, but sometimes the better final artifact is audio.

Why I built it:

  • turn agent summaries into listenable audio
  • generate study audio from notes or papers
  • create quick voice previews from scripts
  • make long AI outputs easier to consume away from the screen
  • support automation pipelines without needing the web UI

I’d love feedback from people building agents:

  • Would you use this as a CLI tool, MCP tool, or both?
  • Should the default output be a downloaded MP3, an audio URL, or JSON?
  • What agent workflows would actually benefit from audio output?
  • What would make this easier to plug into your stack?
reddit.com
u/k800elik — 7 days ago
▲ 2 r/SaasDevelopers+1 crossposts

How would you QA 30+ languages as a solo SaaS founder?

https://preview.redd.it/152mo4e0tb1h1.png?width=1046&format=png&auto=webp&s=5f42a5398e58f17ccd5b5edcb64209a1f9de85fe

I’m building TTSBuddy, mainly for students, ADHD folks, and busy people who prefer listening instead of reading.

I recently added 30+ language modes, which sounded like a good milestone until I hit the obvious problem: I can’t personally judge quality in most of those languages.

The QA I need is very human:

- Does the voice sound natural?

- Are there pronunciation issues?

- Is the pacing weird?

- Is the emphasis wrong?

- Does it sound technically correct but unnatural to a native speaker?

My current plan is to recruit native/fluent speakers, give them short generated samples, collect feedback in a simple form.

Languages include Arabic, Bulgarian, British English, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hindi, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Spanish, Swedish, Turkish, Ukrainian, and Vietnamese.

I’m still deciding what a fair thank-you should be for reviewers.

For other micro-SaaS builders: how would you structure this?

I’m trying to keep it lightweight, but not fake the quality bar.

reddit.com
u/k800elik — 7 days ago
▲ 1 r/studytips+1 crossposts

Looking for students/native speakers to help check test-to-speech quality for study use

Hey,

I’m working on a small text-to-speech project to help students, ADHD folks, and busy people turn reading into listening. And recently added support for several more languages.

I’d like to sanity-check the quality with native or fluent speakers before I describe those languages as properly supported.

The feedback I’m looking for is simple:

- Does the voice sound natural?

- Are there pronunciation mistakes?

- Is the pacing or emphasis strange?

- Does anything sound robotic or awkward?

Languages include Arabic, Bulgarian, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hindi, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Spanish, Swedish, Turkish, Ukrainian, Vietnamese, and British English.

If you boring to read study books and prefer listen instead and speak one of these well and are open to giving quick feedback, please comment with the language.

reddit.com
u/k800elik — 7 days ago