
Qwen 3.6 27B
Qwen 3.6 27B has quietly become my daily driver in Thoth.
It fits within my RTX 5090’s 32 GB of VRAM, which means I get a proper local model running fast enough for real daily use.
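For a rough sense of why a 27B model can fit in 32 GB, here is a back-of-the-envelope sketch of weight memory at a few precisions. The post does not say which quantization is in use, so the bit widths below are assumptions for illustration, and the helper name `weight_memory_gib` is hypothetical. The estimate covers weights only: KV cache, activations, and runtime overhead come on top, and 32 GB is treated as roughly 32 GiB.

```python
# Rough estimate of memory needed for model weights alone.
# Assumptions (not from the post): the bit widths listed below, and that
# KV cache, activations, and runtime overhead are ignored.

def weight_memory_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights in GiB for a given precision."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


VRAM_GIB = 32  # RTX 5090 capacity, treated as GiB for a rough comparison
PARAMS_BILLION = 27

# Hypothetical precisions to compare: 16-bit, 8-bit, and 4-bit weights.
for bits in (16, 8, 4):
    size = weight_memory_gib(PARAMS_BILLION, bits)
    verdict = "fits" if size < VRAM_GIB else "does not fit"
    print(f"{bits:>2}-bit weights: {size:5.1f} GiB ({verdict} in {VRAM_GIB} GiB, weights only)")
```

Under these assumptions, 16-bit weights alone exceed the card, while 8-bit and 4-bit weights leave some room for context. How much context actually fits depends on the runtime and settings.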
No API round trips. No private context leaving my machine. Just 100% local, 100% private AI.
This is exactly why Thoth is designed local-first: your assistant, memory, tools, workflows, and data should live on your machine by default, with cloud models as an option, not a dependency.
I'm curious to hear about your experience with it.