What llama.cpp / local LLM configs are people using on laptops with chips like the Ryzen AI Max+ 395?
I’m experimenting with local LLMs on a laptop and would love to compare configurations with people running similar hardware. I'm not new to this, but I'm not quite an expert either :).
My setup:
- ASUS ROG Flow Z13
- Ryzen AI Max+ 395
- 128GB unified memory
- Radeon 8060S iGPU
- Windows
- llama.cpp with the Vulkan backend, or Lemonade
I'm not expecting desktop GPU performance, but I want to understand what is realistic and what people have found to work well in daily use.
Thanks