u/Far_Estimate7276

▲ 2 r/Qwen_AI+1 crossposts

I generally use Faster Whisper for all transcription needs and it works very well when making subtitles, but it cannot handle audio containing multiple languages. To this end, I began researching Qwen3-ASR, trying both of these custom nodes in Comfy:

https://github.com/kaushiknishchay/ComfyUI-Qwen3-ASR

https://github.com/diodiogod/TTS-Audio-Suite

The problem is that the kaushiknishchay nodes seem to be able to distinguish between different languages, but can't output subtitles (it produces timestamps of some sort, but only at word-level).

The TTS nodes, on the other hand, will output proper srt-formatted timestamps at sentence level, but force everything into a single language (as with Whisper).

Does anyone know of a viable means of doing what I require? Something that can distinguish between different languages, transcribe them effectively and then output the results as an srt with sentence-level time-stamps.

u/Far_Estimate7276 — 16 days ago

I've never tried audio processing in ComfyUI before and wondered if there's an effective method of removing noise or tape hiss from old recordings. Initial research suggests Demucs is very good at track separation, but can anyone recommend anything geared more specifically to the task of noise removal?

reddit.com
u/Far_Estimate7276 — 24 days ago
▲ 3 r/FluxAI+1 crossposts

One of the main reasons I use Krita AI as a front-end for Comfy is the ease of selective outpainting. However, at the point where the feathered edge of the outpainted area overlaps with the original image, the colours underneath seem to be combining with the new layer to create a distinct coloured band when using F2K 4B. The is most apparent with areas of flat, even colour like sky etc.

Meanwhile, I've never been able to get F2K 9B to outpaint, which I assumed was because I only have an outpaint lora for the 4B model. On a whim, I tried outpainting with F2K 9B whilst adding the image to be extended as a reference layer. Not only did it outpaint perfectly, there were no colour banding issues. Can anyone suggest why that might be?

I tried the same process with my usual 4B models, but there's still banding even when using a reference layer. Is it just a question of how the two different models handle colours (and the number of colours they can produce)?

reddit.com
u/Far_Estimate7276 — 25 days ago