u/anonrftw

Ongoing SOTA setups?

Hello everybody,

I finally got my Strix Halo (a Minisforum MS S1 MAX) after a very long wait!

Feeling like a kid with the latest gadget, I started researching how to get the most out of my new baby, but I got a bit lost and now I don't know what to do.

When I ask Claude/ChatGPT questions like "what is the best model to run for general use cases?" or "I'm trying to decide between DeepSeek V4 Flash, MiniMax M2.7, the Qwen 3.6 models (either 27B or 35B-A3B), and Gemma 4 31B", all I get is mixed responses.

Sometimes the recommended OS is Ubuntu 26.04 because it has the latest drivers; sometimes it is Ubuntu 24.04 HWE because "the official AMD-built ROCm binaries target Ubuntu 24.04". Sometimes the answer is ROCm, sometimes "Mesa RADV (already in the kernel/userspace) — for llama.cpp Vulkan builds", sometimes both.

Model advice is also all over the place, mostly split between:

DeepSeek V4 Flash at Q4

Qwen 3.6-27B

Gemma 4 31B

MiniMax M2.7

Honestly, I am not technical or knowledgeable enough (yet!) to figure out the best setup, but I think that collectively we could create our own benchmarks for the best models we can run.
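To make the benchmark idea concrete, here is a rough sketch of how we could pool results. It assumes each of us reports a model, a backend, a generated-token count, and the wall-clock time for one run. The names `Run`, `tokens_per_second`, and `summarize` are made up for illustration, and the numbers in the usage example are placeholders, not real measurements.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import median


@dataclass
class Run:
    """One reported generation run (hypothetical record format)."""
    model: str      # e.g. "Qwen 3.6-27B"
    backend: str    # e.g. "vulkan" or "rocm"
    tokens: int     # number of generated tokens
    seconds: float  # wall-clock time spent generating them


def tokens_per_second(run: Run) -> float:
    """Generation speed for a single run."""
    return run.tokens / run.seconds


def summarize(runs: list[Run]) -> dict[tuple[str, str], float]:
    """Median tokens/sec for each (model, backend) pair."""
    grouped: dict[tuple[str, str], list[float]] = defaultdict(list)
    for run in runs:
        grouped[(run.model, run.backend)].append(tokens_per_second(run))
    return {key: median(speeds) for key, speeds in grouped.items()}


if __name__ == "__main__":
    # Placeholder numbers only, NOT real measurements.
    reported = [
        Run("Qwen 3.6-27B", "vulkan", 100, 10.0),
        Run("Qwen 3.6-27B", "vulkan", 300, 10.0),
        Run("Qwen 3.6-27B", "rocm", 200, 10.0),
    ]
    for (model, backend), speed in sorted(summarize(reported).items()):
        print(f"{model} [{backend}]: {speed:.1f} tok/s")
```

I picked the median rather than the mean so that one odd run (thermal throttling, a background job) doesn't skew a model's number too much. We would probably also want to record quantization, context length, and OS/driver version alongside each run.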

I would also love to hear your opinions/preferences.

reddit.com
u/anonrftw — 7 days ago