What would you run on 4x RTX Pro 6000 and why?
I'm currently running Qwen3.5 397b NVFP4 with very good results but I'm wondering if I should look into Qwen3.6 and what size. Or maybe another model. Qwen3.6 seems good but probably a waste to run on anything more than 1 RTX Pro 6000. I'm currently using all of this on vLLM through openwebui for general purpose and vibe coding.