u/goyetus — reddlx

Voice emotions for cloned voice

I'm using Qwen TTS and I create my own voice models. Then I use the audio to clone the voice and narrate text.

The only problem: I can't get emotions in a cloned voice with Qwen TTS.

I need to add emotions to my cloned voice, and then use them independently in Qwen TTS (Python coding).

What software should I use to add an emotion to my cloned voice and get a .wav export for that emotion?

My plan is to get about 10 emotions for my cloned voice, and then use each of them as the cloned voice in Qwen TTS.
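A minimal sketch of how that library of emotion clips could be organized in Python. Everything here is an assumption for illustration: the emotion labels, the one-file-per-emotion layout (`<root>/<emotion>.wav`), and the function names are hypothetical, and the sketch does not call Qwen TTS at all. It only tracks which reference clip belongs to which emotion.

```python
from pathlib import Path

# Hypothetical labels: the plan only says "about 10 emotions".
EMOTIONS = [
    "neutral", "happy", "sad", "angry", "fearful",
    "surprised", "disgusted", "calm", "excited", "tender",
]


def build_emotion_library(root, emotions=EMOTIONS):
    """Map each emotion to its reference clip, assuming <root>/<emotion>.wav."""
    root = Path(root)
    return {emotion: root / f"{emotion}.wav" for emotion in emotions}


def missing_clips(library):
    """Return the emotions whose reference .wav has not been exported yet."""
    return [emotion for emotion, path in library.items() if not path.is_file()]
```

The clip returned for an emotion would then be passed as the reference audio to whatever cloning call is used.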

UPDATE

I’ve already given up on “cloning + emotions”; not even Fish Audio has managed to do it right. (ElevenLabs is the only one I still need to try.)

I'm using the “Spanish” language.

I've used Qwen TTS and got a beautiful voice that I really like. The problem is that if I “change” the prompt or the seed, the voice changes completely.

That’s why I can’t create a library of similar voices for different moods (at least with Qwen TTS).

I’ve checked out the ZeroVoice repository, and it’s great (too bad it’s only in English).

What do you recommend for designing a voice and adding emotions to it?

Thanks a lot!!

reddit.com

u/goyetus — 5 days ago

▲ 0 r/comfyui

Fastest "Image to Video" model? Maybe interpolation? Maybe low res and rescale? Cache?

Intel 270K Plus + RTX 5070 Ti 16 GB
---------------------------------

I'm trying to get an 800 px video at 30 fps, about 5 seconds long.

What's the fastest "image to video" model that isn't trash?

I'm new, so any pointer on where to look is good!

Maybe interpolation?
Maybe low res and then upscale to about 800 px?
Maybe some kind of cache?
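To put rough numbers on the interpolation and low-res ideas, here is a small sketch of the arithmetic. The 2x interpolation factor, the 2x upscale factor, and the rounding of the width to a multiple of 16 are assumptions (some video models expect dimensions divisible by a fixed step), and the function name is hypothetical. The "relative pixels" figure is only a proxy for how much less the model has to generate, not a runtime prediction.

```python
import math


def generation_plan(target_fps=30, seconds=5, target_width=800,
                    interp_factor=2, upscale_factor=2.0, multiple=16):
    """Estimate what the model must generate before interpolation and upscaling."""
    target_frames = target_fps * seconds

    # Interpolating n frames by a factor k yields (n - 1) * k + 1 frames,
    # so solve for the smallest n that reaches the target frame count.
    generated_frames = math.ceil((target_frames - 1) / interp_factor) + 1

    # Generate at a smaller width, rounded to the assumed size step.
    generated_width = int(round(target_width / upscale_factor / multiple)) * multiple

    # Fraction of pixels generated compared with doing everything natively,
    # assuming height scales by the same factor as width.
    relative_pixels = (generated_frames / target_frames) / upscale_factor ** 2

    return {
        "target_frames": target_frames,
        "generated_frames": generated_frames,
        "generated_fps": target_fps / interp_factor,
        "generated_width": generated_width,
        "relative_pixels": relative_pixels,
    }
```

With the defaults, the target is 150 frames, but only 76 frames at 400 px wide would need to be generated, roughly 13% of the pixels of a native 800 px, 30 fps render.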

Thanks a lot for the help!


u/goyetus — 7 days ago

▲ 2 r/TextToSpeech

Hello and thanks for this subreddit!!

I need help choosing an AI model for local voice synthesis. There are so many options, and I’m not sure which one would work best for me:

Hardware: 2 NVIDIA GeForce RTX 5060 Ti GPUs with 16 GB of RAM

Language: Spanish (not Latin American)

- I need to clone my voice and use it in audio files. (Fine-tuning or LoRA?)

- I need to generate audio clips several minutes long (though I can shorten them)
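For the several-minutes-long clips, one common approach is to split the script into short chunks, synthesize each chunk separately, and join the resulting audio. The sketch below covers only the splitting step and is independent of any particular TTS model. The 300-character limit and the function name are assumptions chosen for illustration.

```python
import re


def split_into_chunks(text, max_chars=300):
    """Pack whole sentences into chunks of at most max_chars characters."""
    # Split after sentence-ending punctuation followed by whitespace.
    sentences = re.split(r"(?<=[.!?…])\s+", text.strip())

    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # The current chunk is full: close it and start a new one.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    # Note: a single sentence longer than max_chars is kept whole.
    return chunks
```

For example, `split_into_chunks("Hola. ¿Qué tal? Muy bien.", max_chars=10)` gives three one-sentence chunks, while the default limit keeps the whole text in one chunk.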

Can you help me? I don’t know where to start.

The AI is recommending some models I’ve never seen mentioned in this subreddit.

Thanks a million!!!!


u/goyetus — 24 days ago