
Solo agent costs less for a reason.
Here are my attempts at using the orchestrator pattern with Kimi as the primary model and DeepSeek V4 Flash as the sub-agent. After a lot of setup, it ended up costing more tokens and taking more time. (In the first run, I accidentally stopped the process and asked the agent to continue.) How do you guys handle it? It keeps failing in my case, even though every attempt used the same prompt.