u/FitContribution2946

Image 1 — Testing Z-Image 6B in ComfyUI | Experimental Pixel-Space Workflow
Image 2 — Testing Z-Image 6B in ComfyUI | Experimental Pixel-Space Workflow
Image 3 — Testing Z-Image 6B in ComfyUI | Experimental Pixel-Space Workflow
Image 4 — Testing Z-Image 6B in ComfyUI | Experimental Pixel-Space Workflow
▲ 12 r/comfyui+1 crossposts

Testing Z-Image 6B in ComfyUI | Experimental Pixel-Space Workflow

This isn't perfect, but I put together a basic experimental ComfyUI workflow for Z-Image 6B / L2P pixel-space generation.

It requires installing a custom node.

JSYK, I used Codex to help generate the workflow and custom node and adapted things from existing Hidream 01 workflow while experimenting with getting this running. I got it working, uploaded it to GitHub as-is, and added some basic instructions.

I'm not claiming this is the ideal implementation or production-ready. Just sharing a working experiment for people who want to poke at it.

On my NVIDIA 4090 I'm seeing roughly 30 seconds at 1024x1024, 30 steps.

GitHub:
https://github.com/gjnave/ggf-ltp-zimage

u/FitContribution2946 — 4 hours ago

I made Dramabox easier to run locally with a standalone app and LoRA tool built in

This TTS is actually amazing and I would say the recent best. Chatterbox is also very good, but I think that Dramabox is better - it has fluid speech movement, near perfect pause, and expressive detail.

Here is the repo: https://github.com/gjnave/GGF-DramaBox

To install:
create a virtual environment
istall torch w/ cuda (if you have a NVIDIA)
pip install -r requirements.txt

uses:

  • hf download unsloth/gemma-3-12b-it-bnb-4bit --local-dir models\gemma-3-12b-it-bnb-4bit
  • hf download Lightricks/LTX-2.3 --include "ltx-2.3-22b-distilled-1.1.safetensors" --local-dir models\ltx-distilled-1.1
u/FitContribution2946 — 1 day ago

Flux Klein T21 STANDALONE App (9b & 4b) - Basic Al Installations Req (CUDA, Python, Miniconda, git) - NO comfyui required

I made this standalone app of Flux Klein for the community and I've been pleased with it. It's very fast and once loaded up can generate images, like the one above, in a matter of seconds. I also use Klein as my image generator for bots due to its low footprint and high speeds at great quality.

https://github.com/gjnave/klein-standalone

FEEL FREE TO IMPROVE ON IT

This standalone app does not require ComfyUl and should work easily as long as your system is set up properly following the Get Going Fast method (basic AI tools)

To install:

  1. Download the zip file and extract it to an empty folder close to root Example: C:\Ai-Apps\Flux-Klein

  2. Double-click installer.bat

  3. Run the app with run.bat

  4. Download a model from the Model Manager tab inside the app

More to come:

. Image editing

. LoRA adding

u/FitContribution2946 — 8 days ago

HiDream-Studio v.01 has been released! It is fast and powerful and open-sourced on Github | Easy Install

Repo: https://github.com/gjnave/HiDreamStudio
Installation:
- clone repo
- double click the install.bat

I've been surprised with how fast and powerful this model is. Usually these apps go much faster in Comfyui, however this PySide app is very fast with inference on a 4090 at about 20 seconds per image

Note: the model is baked to prefers 2048x2048 and 1024x1024 .. ironically odd resolutions can actually slow it down.

u/FitContribution2946 — 12 days ago

Same Prompt for each:
Create a funny, polished, wide landscape digital illustration in a colorful comic-meets-3D style.

Taylor Swift is sitting at a glowing computer desk on a Friday evening, looking amused and tempted as she tries to decide whether to spend the night doing more AI hobby projects. She is in a cozy neon-lit creative studio with music gear, AI tools, laptops, keyboards, notebooks, and glowing monitors around her.

On one shoulder is a tiny Teenage Mutant Ninja Turtle dressed like a mischievous little devil, with small red horns, a tiny cape, and a playful grin. He is pointing toward the computer and saying in a speech bubble:

"Do it...

train one more model!"

On her other shoulder is another tiny Teenage Mutant Ninja Turtle dressed like an angel, with a halo, little white wings, and a sweet supportive smile. He is saying in a speech bubble:

"AI IS pretty cool...

and it IS Friday after all."

Taylor is smiling like she knows she is about to give in. Make the scene funny, charming, and expressive, with readable speech bubbles and strong character acting.

In the background, add bold neon branding that says:

"GGF"

Also include fun little details around the desk, like a mug that says "GGF FUEL", a sticky note that says "just one more workflow", and a notebook titled "Friday Plan" with checkboxes:

- Relax

- Be normal

- AI Projects

The "AI Projects" box is checked.

Use vibrant neon lighting, crisp details, clean composition, and a funny YouTube-thumbnail-worthy look. Make it high-quality, energetic, and visually clear.

u/FitContribution2946 — 21 days ago
▲ 13 r/generativeAI+1 crossposts

I personally think this is a a very cool app and truly something new.

MOSS-Audio is a new open-source AI model designed to go far beyond basic speech transcription. It can listen to recordings, caption what is happening, detect sounds and events, analyze music, and even answer questions about the audio.

Think of it a bit like Joy Caption, but for audio instead of images. Instead of only converting speech to text, it attempts to understand the entire sound environment.

This makes it useful for podcast analysis, dataset creation, LoRA training data preparation, sound event detection, and AI research workflows.

Key Features

  • Audio and video file processing
  • Batch captioning
  • YouTube URL captioning
  • File chunking for large recordings
  • Caption export for LoRA training
  • Sound event and music analysis

Heres the repo with instructions and GUI: https://github.com/gjnave/moss-audio-gff

https://preview.redd.it/l64eiszju0yg1.jpg?width=1682&format=pjpg&auto=webp&s=65128d6eede6937041ea7b7d601b4d0b422eda1f

reddit.com
u/FitContribution2946 — 24 days ago