
r/servicenow
Gemini Flash 3.5 benchmarks by AppAgent for ServiceNow tasks
You can run the benchmarks yourself using the "Run ServiceNow Eval" action in the AppAgent Chrome extension:
https://chromewebstore.google.com/detail/appagent-for-servicenow/jafoppdjbleekamickhclhdaoephgimi
Note that Opus 4.7 quality varies drastically from one run to the next (possibly due to aggressive quantization).
u/Used-Muffin1727 — 1 day ago