u/Charming_You_25

seeking scrubbed telemetry to improve open source local agentic performance.

seeking scrubbed telemetry to improve open source local agentic performance.

Hi all. A couple days ago a new research paper came out on δ-mem adaptors. I have been deep in it applying it to my local agent’s memory system.

I converted the research project to MLX (I run on apple silicon) and made it work with qwen3.5:9b GatedDeltaNet was NOT easy but I think I have it figured out. I haven’t pushed my adaptor trainer yet, but I will once it’s perfectly tuned. I have been benchmarking it for agentic use and results have been very promising, based on the research papers adaptor and passing good attention data, it improves similar questions / answers and summarizations quality by 30%.

Basically, it allows a local model to perform way better.. but there’s a hole that needs to be filled before this is useful.

To make this usable for local agentic use, an adaptor needs to be trained on high quality curated sessions with tool use.

So I’m looking for volunteers that have telemetry data on, and are willing to contribute scrubbed sessions for training. I only have about 100 sessions since I turned on telemetry, for v1 of a trained adaptor I want about 500 more. For a final version, 2400 more sessions at least would be ideal.

I added my telemetry-to-useful-training-data repo to GitHub, here’s a prompt you can give an agent to scrub your own data:

https://github.com/elimaine/agent-memory-forge/blob/main/prompts/telemetry_scrubber_handoff.md

or you can use my toolbelt in the same repo. Scrubbing can be done by any llm that can handle the context length, so if you have a local model you can use that. I used gpt5.4. Though I’d prefer full scrubbed telemetry in cases i find more optimal ways to do formatting/goal awareness.

Send me a link to the scrubbed data and I’ll add it to training.

What you get: I’ll upload an agentic tool use specific delta men adaptor to huggingface as well as the sidecar mlx delta mem repo so anyone can run it. As well as benchmarks so you know what improvement to expect.

Let me know in your message if it’s okay to share your sessions as the training data. If you don’t specify, I will treat it as private. Also let me know if you are using PyTorch/CuDA, if nobody contributing requests it I’ll probably just upload the MLX version with a diy conversion tool.

This will allow you to use the local models and get 8% better answers just for free, with up to 30% if you pass in more relevant context. You can just pass in past relevant sessions or compacted away session history without increasing your context window!

I am first targeting qwen3.5:9b, with my sights set on larger quantized models (27-32b) once everything is stable.

Thanks for reading yall.

u/Charming_You_25 — 22 hours ago

Yesterday I saw a new research paper about δ-mem and integrated with openclaw

Improves agent response quality by 7-32%. Without context window increases depending what you pass it. 7% not passing anything. This project is not yet usable for anything outside mlx qwen3:4b yet until people train adaptors for it. I recommend asking your claw to check huggingface so you know when they drop!

The original paper claimed up to 30%, but I found a way to get even better results up to 32% with openclaw agent use. Benchmark normalized and made with a n of 15.

GitHub for plugin:
https://github.com/elimaine/openclaw-delta-mem-mlx-plugin

Clawhub:
https://clawhub.ai/plugins/@elimaine/openclaw-delta-mem-mlx

Original paper:
https://arxiv.org/abs/2605.12357

I did a ton of benchmark testing you can read about here, if you get bored of the tables scroll down to the graph which shows the important bits. Benchmark results have probably changed though since I made a bunch of improvements/hardenings.
https://github.com/elimaine/delta-mem-mlx-sidecar-w-openclaw/blob/main/wiki/Benchmark-Findings.md

TLDR: pass qmd vsearch for adapter attention state. 32% improvement at cost of 30-61% slowdown.

My project is only usable on Apple Silicon using mlx. Porting it to CUdA would be easy and faster.

Once qwen3.6:27b δ-mem mlx adaptor get released this will be the best local stack on the planet (higher parameters excluded).

Happy experimenting lobsters!

u/Charming_You_25 — 6 days ago

Is adding a Mission Control still worth?

Late to the game adding Mission Control, openclaw gateway has tons of info now that used to be shown in Mission Control. I’m not sure I see the use when other orchestration tools can be built like symphony or in-discord message editing for run status.

Anyone love their Mission Control? What are you doing with it?

reddit.com
u/Charming_You_25 — 13 days ago

Memory wiki defaults only half implemented

Anyone else turn on memory wiki and just hoped it did something?

Yeah turns out it needs regular synthesize crons and instructions when to use it and it creates massive indexes. You basically need to build the whole integration.

After I dove in I realized it was kinda lame and went the obsidian route for phone sync and pretty graphs. Defining my own ontology and rules for using it felt like it could have been implemented better. Ended up gutting and overhauling the openclaw wik installed through config.. Karpathy is one of the greats but something was lost in the integration of his wiki long term memory system.. would have been easier to just design it myself..

reddit.com
u/Charming_You_25 — 13 days ago

First let me say I am not a noob off the street, I’ve been working with openclaw daily since February. I have done my research and tried googling this issue but have not found the results I need. If someone who has solved this could point me me in the right direction I would be much appreciated.

I have had some success with local models.. granite seems basically plug and play, though not too smart. But the most powerful models are eluding me.

What’s worked: gemma4 being served from ollama (using native url, not openai “/v1”) being run through an opencode APC can do tool use and generally works quite well, but that means for the agent I need to use another model to basically act as the go between when starting a session (I used granite to start the APC).

What hasn’t worked: Gemma4 being run as openclaw config defined model, when passed all of openclaws personality and context returns blank results. Local Qwen (every version I’ve tried) set up the same as above (non openai /v1, ollama) also gives errors saying tool use isn’t allowed. I suspect it would work through a more intelligent harness like opencode.. but I want to get the basic agent running the default openclaw pi default working.

What basic thing am I (and my agent) missing? Any detailed tutorials for how to get qwen local running on openclaw?

reddit.com
u/Charming_You_25 — 16 days ago