r/FishAudio_Official

🐟 Fish Audio Weekly Update (May 6 - May 12)

Hey Fishes!

Thank you for your continued support and feedback! Here's what's new this week 👇

🆕 What's New

  • Text-to-Speech streaming with timestamps: TTS streaming now supports timestamps for improved synchronization and developer workflows.
  • Expanded phoneme control documentation: Added phoneme control support for English, Japanese, and Chinese in the developer documentation.
  • Improved ASR stability and latency: Optimized ASR response delays for a faster and more stable experience.
  • ASR model & pricing visibility: The Developer → Control page now displays ASR model information and pricing ($0.36 per audio hour).

🛠️ Bug Fixes

  • Optimized TTS usage handling for single-speaker generation workflows

🎉 Community Highlights

  • Got something cool to share? Post it here on our subreddit or drop it in ⁠🎧│voice-models channel on our Discord and tag us! We're looking to feature projects from our community!

Stay creative 🎵

— The Fish Audio Team

reddit.com
u/FishAudio — 10 days ago