u/naruto_uzumaki00

Open-sourced a layer that cuts ~87% of LLM API input tokens (GPT-5.5 & Opus 4.8, real billed tokens) - proxy + MCP plugin for Claude Code/Codex

if you build on the LLM APIs, a big chunk of every request is tokens the model doesn't need - resent system prompts + history, whole files dumped into context, easy calls routed to the frontier model. i built a vendor-neutral layer that strips that, and measured it on the providers' own billed tokens (heavy tasks):

gpt-5.5: 16,875 -> 2,232 input tokens (86.8% fewer), quality 3/3 -> 3/3

opus 4.8: 26,573 -> 3,343 (87.4% fewer), 3/3 -> 3/3

two ways to drop it in:

- OpenAI/Anthropic-compatible proxy - point base_url at it, keep your key. every request gets the levers applied + an X-TRL-Tokens-Saved header.

- MCP plugin for Claude Code / Codex - the agent gets retrieve_code(query) / explain_symbol(name) and pulls only the relevant AST slices instead of dumping whole files. since Claude Code and Codex bill by tokens, that stretches your weekly cap.

four levers under the hood: prefix caching, tail compression with a deterministic guard that re-injects any number the compressor drops, AST/text retrieval, and cascading verifiable steps to a local model.

honest negatives, in the repo: static embeddings didn't beat plain keyword retrieval in my eval; a real 3B compressor dropped ~1/3 of load-bearing numbers before i added the guard; suites are small + favorable. Apache-2.0, free, reproducible benchmark included (validate/heavy_bench.py). repo: https://github.com/AryanGonsalves/trl-token-reduction - would love people to break it.

u/naruto_uzumaki00 — 16 hours ago

▲ 0 r/mcp

I built an MCP server that gives Claude Code / Codex only the code slices they need - ~87% fewer input tokens, quality-neutral

I've been building a vendor-neutral token-reduction layer, and the piece most relevant here is the MCP server. instead of your agent grepping and dumping whole files into context, it exposes retrieve_code(query) and explain_symbol(name) and returns only the relevant AST slices (tree-sitter, 12 languages). same STDIO server drops into Claude Code AND Codex via an [mcp_servers] block.

why it matters right now: Claude Code and Codex both bill by tokens, so trimming what the agent pulls into context directly stretches your weekly cap. measured on real billed tokens, heavy tasks:

gpt-5.5: 16,875 -> 2,232 input tokens (86.8% fewer), quality 3/3 -> 3/3

opus 4.8: 26,573 -> 3,343 (87.4% fewer), 3/3 -> 3/3

the broader layer has 3 more levers (prefix caching, tail compression with a fact-guard that can't drop a number, and cascade easy steps to a local model) and also runs as an OpenAI/Anthropic-compatible proxy. there's also a small Claude.ai browser extension that compresses what you paste before you send - it's in Chrome Web Store review right now (not verified yet), so load-unpacked for the moment.

honest caveats in the readme: small favorable suites, and static embeddings didn't beat plain keyword retrieval in my eval.

repo (Apache-2.0, reproducible benchmark in validate/heavy_bench.py): https://github.com/AryanGonsalves/trl-token-reduction

reddit.com

u/naruto_uzumaki00 — 16 hours ago

▲ 6 r/USCIS_FORUM

EAD Card returned to USCIS by USPS

So, my I-765 EAD card was approved on 5th June and was out for delivery on the 11th of June . No case status update after my card was out for delivery.
Since my OPT clock starts today, I checked the USCIS site and it says my card was returned to USCIS on the 11th of June itself since USPS failed to find the delivery address. The USPS tracker on the other hand says that my EAD card was returned back to USCIS on the 30th of June.

What should I be doing? My opt unemployment clock is getting wasted away since I can’t work/ volunteer without the card.
USCIS agents have also been extremely unhelpful (and rude) on call.

Most importantly, I filled the card resend request form today (1st of July) ,hence, I was hoping to know by when I can expect the card to be mailed back to me? Any others that have experienced this issue in the past?

reddit.com

u/naruto_uzumaki00 — 4 days ago

▲ 2 r/immigration+1 crossposts

EAD Card returned to USCIS by USPS

What should I be doing? My opt unemployment clock is getting wasted away since I can’t work/ volunteer without the card.
USCIS agents have also been extremely unhelpful (and rude) on call.

reddit.com

u/naruto_uzumaki00 — 4 days ago

▲ 0 r/resumes

[0 YoE, Unemployed, Data Scientist, United States]

Hi all — I'm a Master's student (Data Science, graduating May 2026) who has been actively applying for full-time Data Scientist / Data Analyst / BI roles in the US for a few months with no callbacks, and I'd really appreciate a critical look at my resume.

Background: MS in Data Science, Analytics & Engineering (GPA 3.8), and before that a B.Eng in Computer Engineering with an AI/ML honors track. I've built several end-to-end ML projects (a fraud-triage platform, a time-series forecasting system, an LLM-powered agent) and had one internship as a game developer.

Targeting: Data Scientist, Data Analyst, and BI / Decision Analytics roles, entry-level / new-grad.

Location: Based in Arizona, open to relocating anywhere in the US, open to remote.

Visa: F-1 OPT (STEM-eligible), so no employer sponsorship is required — not sure if that's still causing some ATS/recruiter filtering.

Challenge: very few interview callbacks despite what I think is solid, quantified project work. Not sure if it's the resume structure, how I'm presenting technical depth, or something else entirely.

Would love feedback on: whether the projects section reads as too dense/technical, whether the summary at the top is doing its job, and anything that would make a recruiter skim past me. Thanks in advance!

u/naruto_uzumaki00 — 4 days ago

▲ 10 r/windowsapps+1 crossposts

[Python] VoiceClaw - an always-listening, hands-free voice agent that controls your Windows PC by voice

github.com

u/naruto_uzumaki00 — 4 days ago

▲ 3 r/LLMDevs+1 crossposts

I built PolyForge — a reliability auditor + fallback router for the MCP servers your agent depends on

Hey all — I made this and wanted to share it for feedback (I'm the author, so flagging this as a showcase/self-promo post).

The problem I kept hitting: when an agent calls a broken or drifted MCP tool, the model often doesn't error out — it improvises around the bad response and keeps going, quietly contaminating downstream steps. No crash, no alert.

PolyForge scores every MCP server your agent depends on on a 9-point rubric (commit recency, sole-maintainer risk, CI status, unpatched CVEs, clean install, hosted uptime, schema stability), buckets them production / light / dead, and can fail CI if a dependency is dead. It can auto-gather the cheap signals (last commit, contributors, CI) straight from a repo on GitHub, and there's a fallback router that resolves a capability to the healthiest server.

Honest about where it's at: early, from-source only (PyPI planned), and routing picks the healthiest server but doesn't execute MCP calls yet. The rubric weights are a sensible default, not validated against a big labeled set — which is exactly the feedback I'm after. If you run agents on a stack of MCP servers, does the scoring match your gut, and what signals would you add?

Repo: https://github.com/AryanGonsalves/polyforge (MIT). Tear it apart — issues/PRs welcome.

u/naruto_uzumaki00 — 4 days ago

▲ 0 r/stemopt

Looking for post OPT opportunities

As the title states, I’m looking for any post OPT opportunities and would be grateful if y’all could provide me with some leads regarding any open volunteer positions .

I’m a recently graduated DSAE (data sci) masters student.

Feel free to DM me.

Thank you and continue having a great summer.

reddit.com

u/naruto_uzumaki00 — 23 days ago

▲ 0 r/ASU

Looking for post OPT opportunities

As the title states, I’m looking for any post OPT opportunities and would be grateful if y’all could provide me with leads in case you work with a professor or know someone who works with a professor that has any open volunteer positions .

I’m a recently graduated DSAE (data sci) masters student.

Feel free to DM me.

Thank you and continue having a great summer.

reddit.com

u/naruto_uzumaki00 — 24 days ago

▲ 2 r/waymo

Waymo.codes shows that codes ain’t available for my account even after clearing cache and history or switching accounts

u/naruto_uzumaki00 — 2 months ago

Open-sourced a layer that cuts ~87% of LLM API input tokens (GPT-5.5 &amp; Opus 4.8, real billed tokens) - proxy + MCP plugin for Claude Code/Codex

I built an MCP server that gives Claude Code / Codex only the code slices they need - ~87% fewer input tokens, quality-neutral

EAD Card returned to USCIS by USPS

EAD Card returned to USCIS by USPS

[0 YoE, Unemployed, Data Scientist, United States]

[Python] VoiceClaw - an always-listening, hands-free voice agent that controls your Windows PC by voice

I built PolyForge — a reliability auditor + fallback router for the MCP servers your agent depends on

Looking for post OPT opportunities

Looking for post OPT opportunities

Waymo.codes shows that codes ain’t available for my account even after clearing cache and history or switching accounts

Open-sourced a layer that cuts ~87% of LLM API input tokens (GPT-5.5 & Opus 4.8, real billed tokens) - proxy + MCP plugin for Claude Code/Codex