Image 1 — Testing Z-Image 6B in ComfyUI | Experimental Pixel-Space Workflow

Image 2 — Testing Z-Image 6B in ComfyUI | Experimental Pixel-Space Workflow

Image 3 — Testing Z-Image 6B in ComfyUI | Experimental Pixel-Space Workflow

▲ 12 r/comfyui+1 crossposts

Testing Z-Image 6B in ComfyUI | Experimental Pixel-Space Workflow

This isn't perfect, but I put together a basic experimental ComfyUI workflow for Z-Image 6B / L2P pixel-space generation.

It requires installing a custom node.

JSYK, I used Codex to help generate the workflow and custom node and adapted things from existing Hidream 01 workflow while experimenting with getting this running. I got it working, uploaded it to GitHub as-is, and added some basic instructions.

I'm not claiming this is the ideal implementation or production-ready. Just sharing a working experiment for people who want to poke at it.

On my NVIDIA 4090 I'm seeing roughly 30 seconds at 1024x1024, 30 steps.

GitHub:
https://github.com/gjnave/ggf-ltp-zimage

u/FitContribution2946 — 4 hours ago

▲ 1 r/comfyui

The ChatGPT Cycle .. i think were in phase 3 at the moment

https://preview.redd.it/w3dgdur73l2h1.png?width=1448&format=png&auto=webp&s=1d00abfe87ad0f7932e8ea1eacccd375b0864fb2

reddit.com

u/FitContribution2946 — 1 day ago

▲ 12 r/StableDiffusion

I made Dramabox easier to run locally with a standalone app and LoRA tool built in

This TTS is actually amazing and I would say the recent best. Chatterbox is also very good, but I think that Dramabox is better - it has fluid speech movement, near perfect pause, and expressive detail.

Here is the repo: https://github.com/gjnave/GGF-DramaBox

To install:
create a virtual environment
istall torch w/ cuda (if you have a NVIDIA)
pip install -r requirements.txt

uses:

hf download unsloth/gemma-3-12b-it-bnb-4bit --local-dir models\gemma-3-12b-it-bnb-4bit
hf download Lightricks/LTX-2.3 --include "ltx-2.3-22b-distilled-1.1.safetensors" --local-dir models\ltx-distilled-1.1

u/FitContribution2946 — 1 day ago

▲ 9 r/StableDiffusion

Flux Klein T21 STANDALONE App (9b & 4b) - Basic Al Installations Req (CUDA, Python, Miniconda, git) - NO comfyui required

I made this standalone app of Flux Klein for the community and I've been pleased with it. It's very fast and once loaded up can generate images, like the one above, in a matter of seconds. I also use Klein as my image generator for bots due to its low footprint and high speeds at great quality.

https://github.com/gjnave/klein-standalone

FEEL FREE TO IMPROVE ON IT

This standalone app does not require ComfyUl and should work easily as long as your system is set up properly following the Get Going Fast method (basic AI tools)

To install:

Download the zip file and extract it to an empty folder close to root Example: C:\Ai-Apps\Flux-Klein
Double-click installer.bat
Run the app with run.bat
Download a model from the Model Manager tab inside the app

More to come:

. Image editing

. LoRA adding

u/FitContribution2946 — 8 days ago

▲ 58 r/StableDiffusion

HiDream-Studio v.01 has been released! It is fast and powerful and open-sourced on Github | Easy Install

Repo: https://github.com/gjnave/HiDreamStudio
Installation:
- clone repo
- double click the install.bat

I've been surprised with how fast and powerful this model is. Usually these apps go much faster in Comfyui, however this PySide app is very fast with inference on a 4090 at about 20 seconds per image

Note: the model is baked to prefers 2048x2048 and 1024x1024 .. ironically odd resolutions can actually slow it down.

u/FitContribution2946 — 12 days ago

▲ 2 r/AIcomics

I made this for a friend of mine who is kind of an elitist X'D

u/FitContribution2946 — 12 days ago

▲ 75 r/StableDiffusion

Hi-Dream 01 Out : 2k Images in 20seconds on a 4090 (fp8 dev) ComfyUI

The workflow is the first image on the model page:
https://huggingface.co/drbaph/HiDream-O1-Image-FP8

u/FitContribution2946 — 13 days ago

▲ 255 r/aivids+1 crossposts

https://huggingface.co/SulphurAI/Sulphur-2-base

Workflows are on the repo - the above was t2v with distilled. Also has i2v which is powerful!

u/FitContribution2946 — 15 days ago

▲ 1 r/ChatGPT

u/FitContribution2946 — 21 days ago

▲ 39 r/StableDiffusion

Same Prompt for each:
Create a funny, polished, wide landscape digital illustration in a colorful comic-meets-3D style.

Taylor Swift is sitting at a glowing computer desk on a Friday evening, looking amused and tempted as she tries to decide whether to spend the night doing more AI hobby projects. She is in a cozy neon-lit creative studio with music gear, AI tools, laptops, keyboards, notebooks, and glowing monitors around her.

On one shoulder is a tiny Teenage Mutant Ninja Turtle dressed like a mischievous little devil, with small red horns, a tiny cape, and a playful grin. He is pointing toward the computer and saying in a speech bubble:

"Do it...

train one more model!"

On her other shoulder is another tiny Teenage Mutant Ninja Turtle dressed like an angel, with a halo, little white wings, and a sweet supportive smile. He is saying in a speech bubble:

"AI IS pretty cool...

and it IS Friday after all."

Taylor is smiling like she knows she is about to give in. Make the scene funny, charming, and expressive, with readable speech bubbles and strong character acting.

In the background, add bold neon branding that says:

"GGF"

Also include fun little details around the desk, like a mug that says "GGF FUEL", a sticky note that says "just one more workflow", and a notebook titled "Friday Plan" with checkboxes:

- Relax

- Be normal

- AI Projects

The "AI Projects" box is checked.

Use vibrant neon lighting, crisp details, clean composition, and a funny YouTube-thumbnail-worthy look. Make it high-quality, energetic, and visually clear.

u/FitContribution2946 — 21 days ago

▲ 13 r/generativeAI+1 crossposts

I personally think this is a a very cool app and truly something new.

MOSS-Audio is a new open-source AI model designed to go far beyond basic speech transcription. It can listen to recordings, caption what is happening, detect sounds and events, analyze music, and even answer questions about the audio.

Think of it a bit like Joy Caption, but for audio instead of images. Instead of only converting speech to text, it attempts to understand the entire sound environment.

This makes it useful for podcast analysis, dataset creation, LoRA training data preparation, sound event detection, and AI research workflows.

Key Features

Audio and video file processing
Batch captioning
YouTube URL captioning
File chunking for large recordings
Caption export for LoRA training
Sound event and music analysis

Heres the repo with instructions and GUI: https://github.com/gjnave/moss-audio-gff

https://preview.redd.it/l64eiszju0yg1.jpg?width=1682&format=pjpg&auto=webp&s=65128d6eede6937041ea7b7d601b4d0b422eda1f

reddit.com

u/FitContribution2946 — 24 days ago

▲ 1 r/AIcomics

u/FitContribution2946 — 26 days ago

u/FitContribution2946