u/Fit_Chair2340

Been running the SER10 Max as my dedicated local AI worker for 4 days. Posting because I was spending $10–20/day in cloud credits running OpenClaw and Hermes agents, and wanted to test whether a mini PC could actually replace that. Is it really powerful enough to replace my cloud API? I saw a post on SER9 earlier so thought this could be interesting.

Hardware (OpenClaw Edition):

Beelink SER10 Max
AMD Ryzen AI 9 HX 470 (12 cores, Zen 5)
Radeon 890M iGPU
64GB RAM
1TB SSD
Ships with Ubuntu, llama.cpp, Qwen 3.5 9B Q4KM GGUF pre-installed — OpenClaw already configured

LLM inference numbers (llama.cpp, Qwen 3.5 9B Q4KM):

Chat/simple tasks: ~12–14 tok/s steady state
First Hermes agent response: ~73 seconds (cold init)
Subsequent agent responses: ~3 seconds
Complex agentic task (multi-tool, browser use): 1.5–2.5 min per loop
Same task on Claude Sonnet via cloud: ~58 seconds

The 12 tokens/s number is the honest tradeoff. For simple summarization and writing tasks, it's fine. For heavy tool-calling loops with sub-agents, I had 1 task run for 90-minute! Absolute bananas. So some tasks I need to route to the cloud.

What actually replaced cloud spend for me:

Local default: research, summaries, social content drafts, simple routing tasks
Cloud fallback (Codex/OpenRouter): debugging loops, complex multi-tool chains, anything time-critical
Net result: most of my daily AI usage now hits the Beelink first

Setup that made it actually useful: Running Tailscale so my MacBook and iPhone can SSH in and access the OpenClaw/Hermes dashboard remotely. Without this, you're physically tied to the machine. With it, you can monitor jobs, fix issues, and even run the TUI from your phone.

Honest gripes:

12 tok/s is real. If you're doing heavy agentic work, you'll feel it vs cloud
Things will go wrong (model updates, Telegram timeouts, dependency issues) — not a hands-off setup
The pre-installed OpenClaw version was slightly behind on first boot — minor but worth noting
First Hermes init is slow; don't panic

I upgrade from Jetson Nano: Night and day on stability. The Nano would crash under sustained load and browser use. Zero crashes on the SER10 in weeks of daily use.

Disclosure: Beelink provided this unit for review. All benchmarks and opinions are my own.

Hope this helps someone decide if they want one!

Beelink SER10 MAX AMD Ryzen™ AI 9 HX 470 OpenClaw Edition running Qwen 3.5 9B Q4 K M GGUF via llama.cpp