
Gemini 3.5 Flash looks really interesting with Hermes.
It's available in the web app and AI Studio. The pricing is not cheap, $1.50 input / $9.00 output per million Token, so I don’t think I’d want to blindly route everything through it.
But as part of a Hermes setup, especially with multi-model routing, this could be really useful.
For me, the exciting use case is having Hermes decide when a stronger model is actually worth calling: planning, debugging, monitoring, self-improvement loops, harder coding tasks, or anything where a cheaper model might waste time looping.
That’s what I’ve been experimenting with my set up (Hermes on a Lightnode VPS using a mix of models like GLM, Opus, and Codex): Hermes as less of “one agent using one model” and more like an actual model orchestration layer. Will keep testing Gemini as to see if it differs significantly from Codex in terms a being the "high-level orchestrator" role.