u/shadowBladeO4

Claude Code has 240+ models via NVIDIA NIM gateway
▲ 3 r/FunMachineLearning+1 crossposts

Claude Code has 240+ models via NVIDIA NIM gateway

TIL Claude Code has 240+ models via NVIDIA NIM gateway — Nemotron-3 120B for agentic coding is surprisingly good

So I was messing around with /model in Claude Code today and noticed something most people probably don't know about — after the standard Claude models (Opus, Sonnet, Haiku), there's a whole NVIDIA NIM gateway section with +239 additional models you can switch to mid-session.

Some of the models I spotted:

  • nvidia/nemotron-3-super-120b-a12b (with and without thinking mode)
  • 01-ai/yi-large
  • abacusai/dracarys-llama-3.1-70b-instruct
  • ...and hundreds more

I've been running the Nemotron thinking variant for multi-file refactoring and it's genuinely solid. It reasons through changes before touching your code — exactly what you want for agentic tasks. Latency is higher than Claude obviously, but if you're burning through Opus credits on long sessions this is worth experimenting with.

How to try it:

  1. Open any Claude Code session
  2. Run /model
  3. Scroll past the four standard Claude options — NIM models appear below
  4. Hit d to set one as your session default, or pass --model at launch

Anyone else been routing Claude Code through NIM? Curious what models people have had luck with — especially for Python or Rust codegen.

u/shadowBladeO4 — 3 days ago

I built a J.A.R.V.I.S.-style AI agent from scratch using LangGraph — Day 1 complete (6 experiments, full notebook on GitHub)

Been building AI agents in public, one notebook per day. Day 1 covers the full foundational architecture of a stateful, persona-driven agent.

What I built across 6 experiments:

  1. Basic LangGraph chat — minimal START → chat → END graph with OpenRouter LLM
  2. Jarvis persona — SystemMessage injection for consistent personality
  3. Short-term memory — MemorySaver + thread_id (each user = isolated memory)
  4. Conditional routing — text / voice / image handlers via a router node
  5. Interactive chat loop — ask_jarvis() helper + terminal interface
  6. Graph visualization — ASCII + Mermaid diagrams of the agent architecture

Key insight: LangGraph doesn't chat by itself — it moves a state dict through Python functions called nodes. Every agent is: State → Nodes → Edges → Compile → Invoke.

Full notebook: github.com/Luciferxy/Artificial-Intelligence

u/shadowBladeO4 — 3 days ago

Just hit 25/150 on the LeetCode Interview 150

Started grinding the LeetCode Interview 150 list seriously a few weeks ago and just crossed the 25 problems mark. Wanted to share a quick checkpoint post for anyone else on the same journey.

Where I'm at:

  • 25/150 problems completed
  • Focusing on understanding patterns, not just memorizing solutions
  • Slowly getting comfortable with arrays, hashing, and two pointers

What's actually helping me:

  • Solving first without hints, then checking the editorial even if I get it right
  • Writing down why a solution works, not just what it does
  • Revisiting problems I got wrong after 2–3 days

What's been hard:

  • Staying consistent on busy academic days
  • Not spiraling when a "medium" destroys me for an hour

Still a long way to go but 25 feels like the point where the process starts clicking. If you're also grinding the Interview 150, drop where you're at — would love to know how others are approaching it.

Let's get it. 💪

reddit.com
u/shadowBladeO4 — 3 days ago