u/nunodonato

/goal or any other way to ensure an agent reaches a good final result?

Hi folks

I'm testing OpenCode to power an agent to work in the background by receiving requests via an HTTP API.

Right now I'm getting good results, but sometimes it can get stuck (thinks but doesnt do anything else afterwards). I understand this is related to a bug parsing Qwen3.6, but regardless of that, other problems might also come up in the future.

So I was wondering if there is already a way to prevent the agent from getting stuck and "dead". Hermes has a really cool feature that detects this cases and nudges it to move forward. I'm not yet familiar with all the ins and outs of OpenCode, so I thought I would ask first.

Thanks!

reddit.com
u/nunodonato — 3 days ago

Which Mistral model do you recommend for a local agent? (Hermes)

Basically, what's in the title.

I'm setting up an agent for my wife, and am looking for a cheap model that can perform well. Mistral sounds good, but I'm a bit confused on the model offering, small 4, medium 3.5, large 3. Seems the bigger (better), the older it is. Would small be a good fit?

thanks in advance

reddit.com
u/nunodonato — 14 days ago
▲ 17 r/Vllm+1 crossposts

I'm running Qwen3.6-27B with vLLM at FP16. There are a few known issues with the chat template (I think), and I do get occasional stop in OpenCode or other harnesses.

But in OpenWebUI is 100x worse. The model stops, sometimes gives me garbage words in a loop and other times fails tool calls due to bad json. It's a 50% chance to actually manage to use it or not.

I don't get it, I'm using the default values and yes, Native tool calls. In vLLM I'm using the recommended params.

What else can I try?

reddit.com
u/nunodonato — 22 days ago