u/Smart-Patient-4828

What's the best local LLM for an RTX 6000 96GB VRAM?

What's the best local LLM for an RTX 6000 96GB VRAM, 300GB RAM, and 196-core 9965 CPU? I need something good for 24/7 code care, suggestions, and reviews.

I have a max Claude 200$ and GPT Pro *100 USD* subscription. I'm also using 3.6 Qwen right now and the coder version.

Claude orchestrator, CEO, GPT 5.5 reviewer, and I want to add a local LLM because I can. 😅

Pretty new to AI and workflows. Any suggestions?

Got one important big project that I want to build more, scale, and *make $10M with no mistakes*. 🤣

reddit.com
u/Smart-Patient-4828 — 2 days ago