u/Fit_Chair2340

Beelink SER10 MAX AMD Ryzen™ AI 9 HX 470 OpenClaw Edition running Qwen 3.5 9B Q4 K M GGUF via llama.cpp
▲ 9 r/BeelinkOfficial+1 crossposts

Beelink SER10 MAX AMD Ryzen™ AI 9 HX 470 OpenClaw Edition running Qwen 3.5 9B Q4 K M GGUF via llama.cpp

Been running the SER10 Max as my dedicated local AI worker for 4 days. Posting because I was spending $10–20/day in cloud credits running OpenClaw and Hermes agents, and wanted to test whether a mini PC could actually replace that. Is it really powerful enough to replace my cloud API? I saw a post on SER9 earlier so thought this could be interesting.

Hardware (OpenClaw Edition):

  • Beelink SER10 Max
  • AMD Ryzen AI 9 HX 470 (12 cores, Zen 5)
  • Radeon 890M iGPU
  • 64GB RAM
  • 1TB SSD
  • Ships with Ubuntu, llama.cpp, Qwen 3.5 9B Q4KM GGUF pre-installed — OpenClaw already configured

LLM inference numbers (llama.cpp, Qwen 3.5 9B Q4KM):

  • Chat/simple tasks: ~12–14 tok/s steady state
  • First Hermes agent response: ~73 seconds (cold init)
  • Subsequent agent responses: ~3 seconds
  • Complex agentic task (multi-tool, browser use): 1.5–2.5 min per loop
  • Same task on Claude Sonnet via cloud: ~58 seconds

The 12 tokens/s number is the honest tradeoff. For simple summarization and writing tasks, it's fine. For heavy tool-calling loops with sub-agents, I had 1 task run for 90-minute! Absolute bananas. So some tasks I need to route to the cloud.

What actually replaced cloud spend for me:

  • Local default: research, summaries, social content drafts, simple routing tasks
  • Cloud fallback (Codex/OpenRouter): debugging loops, complex multi-tool chains, anything time-critical
  • Net result: most of my daily AI usage now hits the Beelink first

Setup that made it actually useful: Running Tailscale so my MacBook and iPhone can SSH in and access the OpenClaw/Hermes dashboard remotely. Without this, you're physically tied to the machine. With it, you can monitor jobs, fix issues, and even run the TUI from your phone.

Honest gripes:

  • 12 tok/s is real. If you're doing heavy agentic work, you'll feel it vs cloud
  • Things will go wrong (model updates, Telegram timeouts, dependency issues) — not a hands-off setup
  • The pre-installed OpenClaw version was slightly behind on first boot — minor but worth noting
  • First Hermes init is slow; don't panic

I upgrade from Jetson Nano: Night and day on stability. The Nano would crash under sustained load and browser use. Zero crashes on the SER10 in weeks of daily use.

Disclosure: Beelink provided this unit for review. All benchmarks and opinions are my own.

Hope this helps someone decide if they want one!

youtu.be
u/Fit_Chair2340 — 8 days ago