u/Character-Blood3482

Ask for the best model use for coding agent in my 6gb vram laptop

I have a RTX 4050 6GB and 16gb ram, I have try pi cli agent + a finetuned Qwen3.5 4gb model (Qwopus3.5-9B-coder-Exp) and got a pretty good result with a todo simple CRUD application.

I try to ask pi cli simple and easy tasks and it done very well but when I try to ask it do write e2e code and do playwright test and it failed 100% times. Also when code base got bigger and I ask it to fix a small checkbox error it looping forever and couldn't solve it.

So my question is is there any model better in cli coding with speed of 30+ token/s. I have try searching on huggingface and ask ChatGPT but nothing pass the Qwopus3.5-9B from my own experience.

reddit.com
u/Character-Blood3482 — 3 days ago