



We built an open-source context engine for coding agents that got GLM to solve SWE Bench tests Opus could not solve, here's how:
So, after several weeks of frustration with claude code and token spend, we came up with a thesis: with the right context, an open-weight model could match a frontier model on coding. So we decided to build Bitloops to test it.
Bitloops is an open-source memory and context layer for coding agents. We benchmarked it: GLM 5.1 on Opencode paired with Bitloops scored 88 on SWE-bench Verified (for the 43 Rust specific tests). This is higher than Claude Opus 4.6's 81% on the same benchmark.
How it works:
- Targeted context retrieval, not grep. Bitloops continuously models your codebase: structural relationships, dependencies, prior decisions. When the agent asks "how does auth work," it gets back the connected code and reasoning, not 12 random snippets. Agents query through DevQL, a typed GraphQL interface they already understand.
- Shared memory across sessions. Most agents start every session from zero. Bitloops keeps a local knowledge layer scoped to the repo and shared across agents. Cursor in the morning, Claude Code in the afternoon, same memory.
- Git-linked reasoning capture. Every session becomes a Checkpoint tied to your commits. Next session, the model sees why the last change was made, not just what changed. Reviewers get the developer-agent conversation next to the diff.
- Native agent hooks. Bitloops plugs into the agent's own hook surface on Claude Code, Codex, Cursor, Gemini, Copilot, and OpenCode. Context gets injected before the model sees the prompt. No protocol indirection.
- Local-first. Rust daemon, SQLite + DuckDB, local embeddings runtime.
- Local dashboard: still alpha, but it can present the analysis of your codebase in different ways like code-city, architectural structure, etc.
- Languages: works with TS / JS, Python, Rust, Go, Java, C# and PHP
Apache 2.0, everything's on GitHub: https://github.com/bitloops/bitloops
Happy to dig into the architecture, the hook integration, or the benchmark methodology.