u/nikhilprasanth

Creating character turnaround sheets with Flux 2 Klein in ComfyUI
▲ 20 r/aifilmmaking+2 crossposts

Creating character turnaround sheets with Flux 2 Klein in ComfyUI

I made a small ComfyUI workflow for creating multi angle reference sheets from a single input image.

The main use case is character sheets. You give it one character image, and the workflow tries to generate multiple consistent views like front three quarter, side profile, rear view, rear three quarter, high angle, low angle, and a close detail view. The goal is to keep the same face, outfit, pose, expression, proportions, and general design while only changing the camera angle.

I built it mostly with native ComfyUI nodes. The only non native part, as far as I remember, is the GGUF loader. The prompts are written in a generic way, so it can also work for people, props, vehicles, creatures, or objects, but I mainly made it for character sheet generation.

I tested it with the Flux 2 Klein 4B Q4 GGUF model because I currently have access to only 4 GB VRAM. For such a small setup, it is giving acceptable results. It is not perfect, especially with difficult rear views or fine clothing continuity, but it is usable for blocking out reference angles and building rough character sheets.

I expect the 9B variant to give much better consistency and detail, especially for faces, costume continuity, proportions, and rear view inference.

This is not meant to be a final polished character turnaround solution. It is more of a practical workflow for quickly getting usable angle references from one image, especially when working with AI video, inpainting, first frame last frame generation, or character continuity.

Sharing it in case it is useful to anyone experimenting with Flux 2 Klein on low VRAM setups.

https://pastebin.com/EyRM0zed

https://preview.redd.it/y8v7v06d4o2h1.png?width=5824&format=png&auto=webp&s=3d7acb275bf8652b68501e9efb33af7d324e75ca

reddit.com
u/nikhilprasanth — 8 hours ago
▲ 13 r/TextToSpeech+1 crossposts

Built an offline audiobook reader called Auris.

https://github.com/nikhilprasanth/Auris

The idea started because I wanted a proper local audiobook workflow without depending on cloud TTS services or locked ecosystems. Auris reads EPUB, PDF, and TXT books using local OmniVoice TTS with character aware voices, narrator control, synced text highlighting, subtitle export, and voice cloning support.

You can import a book, assign voices to characters, preview them, upload reference WAV files, and export the final result as audio with subtitles. Everything runs locally after setup.

Currently experimenting with improving emotional delivery using lightweight local models through OpenAI compatible endpoints before TTS generation. Trying to make narration feel less flat without turning the pipeline into something huge and overengineered.

Still early and very experimental, so I’d genuinely appreciate feedback, criticism, or ideas from people working on local AI, audio tooling, TTS pipelines, inference optimization, or ebook workflows.

Also very open to collaborators if anyone wants to contribute or experiment with the codebase.

u/nikhilprasanth — 15 days ago

Have Qwen 3.6 27B and Qwen 3.6 35B basically made most of the older ~30B models irrelevant?

They seem to beat stuff like Qwen coder 30B, GPT OSS 20B, Gemma models, especially for coding and agent workflows.

At this point I’m not really finding a reason to keep the older ones around.

Anyone still using them for something specific?

reddit.com
u/nikhilprasanth — 22 days ago