
TIL Claude Code has 240+ models via NVIDIA NIM gateway — Nemotron-3 120B for agentic coding is surprisingly good
So I was messing around with /model in Claude Code today and noticed something most people probably don't know about — after the standard Claude models (Opus, Sonnet, Haiku), there's a whole NVIDIA NIM gateway section with +239 additional models you can switch to mid-session.
Some of the models I spotted:
- `nvidia/nemotron-3-super-120b-a12b` (with and without thinking mode)
- `01-ai/yi-large`
- `abacusai/dracarys-llama-3.1-70b-instruct`
- ...and hundreds more
I've been running the Nemotron thinking variant for multi-file refactoring and it's genuinely solid. It reasons through changes before touching your code, which is exactly what you want for agentic tasks. Latency is higher than with Claude, obviously, but if you're burning through Opus credits on long sessions, this is worth experimenting with.
How to try it:
- Open any Claude Code session
- Run `/model`
- Scroll past the four standard Claude options; NIM models appear below
- Hit `d` to set one as your session default, or pass `--model` at launch
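
For the launch-flag route, here is a minimal sketch. It assumes the ID shown in the picker can be passed verbatim to `--model`; that mapping is an assumption and may depend on how your gateway is configured. The `MODEL` variable is only there for illustration.

```shell
# Model ID copied from the list above. ASSUMPTION: the picker's ID is
# accepted as-is by --model; adjust if your setup names it differently.
MODEL="nvidia/nemotron-3-super-120b-a12b"

if command -v claude >/dev/null 2>&1; then
  # Start Claude Code with the chosen model for this session.
  claude --model "$MODEL"
else
  # Claude Code is not installed here, so just show the command.
  echo "claude not found on PATH; would run: claude --model $MODEL"
fi
```

If the launch fails or the model is rejected, picking it interactively via `/model` is the path described in the steps above.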
Anyone else been routing Claude Code through NIM? Curious what models people have had luck with — especially for Python or Rust codegen.