u/Famous-Sport7862

LTX 2.3 growing frustration

I have been defending LTX and had moved away from Wan 2.2 since LTX 2.3 came out. Now that I am trying to create a short narrative film I'm getting very frustrated with ltx's inability to follow prompt directions. For example shot of two men standing next to each other and all I want is for the camera to zoom in on one of the men as he talks. LTX keeps giving me a pullout or zoom out instead of a zoom in. No matter how I prompt for it it just won't do it. Something so simple like that shot should not be so difficult to achieve. I have used different workflows for example the new LTX director that has the prompt relay embedded. Anyone else gets frustrated with this model.

reddit.com
u/Famous-Sport7862 — 1 day ago

How to get better acting and better image direction with LTX 2.3

I am creating this video and I want the flying saucers to be seen flying briskly forwards earth as the man is speaking. No matter how I prompt for it LTX refuses to have the saucers fly like I wan them to. Also I would like to get a more natural performance from the character. Any tips or suggestions are greatly appreciated.

u/Famous-Sport7862 — 6 days ago
▲ 45 r/comfyui+1 crossposts

LTX 2.3 audio as standalone speech model.

User @wildmindai from X posted about this new model. Has anyone here tried it yet?

LTX 2.3 audio as standalone speech model.

Emotional TTS with Scenema Audio.

- Zero-shot expressive voice cloning, speech gen

- 8-step distilled with Gemma 3 12B text encoding

- stage directions via <action> tags

- runs at 1.5x real-time on RTX 4090

- fits in 16GB VRAM

- 13 languages, 48kHz stereo output

it also gens matching environment sounds

https://huggingface.co/ScenemaAI/scenema-audio

u/Famous-Sport7862 — 11 days ago