
Beelink SER10 MAX AMD Ryzen™ AI 9 HX 470 OpenClaw Edition running Qwen 3.5 9B Q4 K M GGUF via llama.cpp
Been running the SER10 Max as my dedicated local AI worker for 4 days. Posting because I was spending $10–20/day in cloud credits running OpenClaw and Hermes agents, and wanted to test whether a mini PC could actually replace that. Is it really powerful enough to replace my cloud API? I saw a post on SER9 earlier so thought this could be interesting.
Hardware (OpenClaw Edition):
- Beelink SER10 Max
- AMD Ryzen AI 9 HX 470 (12 cores, Zen 5)
- Radeon 890M iGPU
- 64GB RAM
- 1TB SSD
- Ships with Ubuntu, llama.cpp, Qwen 3.5 9B Q4KM GGUF pre-installed — OpenClaw already configured
LLM inference numbers (llama.cpp, Qwen 3.5 9B Q4KM):
- Chat/simple tasks: ~12–14 tok/s steady state
- First Hermes agent response: ~73 seconds (cold init)
- Subsequent agent responses: ~3 seconds
- Complex agentic task (multi-tool, browser use): 1.5–2.5 min per loop
- Same task on Claude Sonnet via cloud: ~58 seconds
The 12 tokens/s number is the honest tradeoff. For simple summarization and writing tasks, it's fine. For heavy tool-calling loops with sub-agents, I had 1 task run for 90-minute! Absolute bananas. So some tasks I need to route to the cloud.
What actually replaced cloud spend for me:
- Local default: research, summaries, social content drafts, simple routing tasks
- Cloud fallback (Codex/OpenRouter): debugging loops, complex multi-tool chains, anything time-critical
- Net result: most of my daily AI usage now hits the Beelink first
Setup that made it actually useful: Running Tailscale so my MacBook and iPhone can SSH in and access the OpenClaw/Hermes dashboard remotely. Without this, you're physically tied to the machine. With it, you can monitor jobs, fix issues, and even run the TUI from your phone.
Honest gripes:
- 12 tok/s is real. If you're doing heavy agentic work, you'll feel it vs cloud
- Things will go wrong (model updates, Telegram timeouts, dependency issues) — not a hands-off setup
- The pre-installed OpenClaw version was slightly behind on first boot — minor but worth noting
- First Hermes init is slow; don't panic
I upgrade from Jetson Nano: Night and day on stability. The Nano would crash under sustained load and browser use. Zero crashes on the SER10 in weeks of daily use.
Disclosure: Beelink provided this unit for review. All benchmarks and opinions are my own.
Hope this helps someone decide if they want one!