Cost-routing tasks across models in one session - cheap model for grunt work, frontier for reasoning, local for sensitive code

Been experimenting with model routing at the workflow level instead of the app level and wanted to share what it looks like in practice.

The setup: Zero, an open source coding agent (github.com/gitlawb/zero) that treats the model as a swappable component. It talks to 25+ providers - OpenAI, Anthropic, Gemini, DeepSeek, Qwen, Groq, plus local models through Ollama or LM Studio - and you switch mid-session with /model without losing context.

The routing pattern I've settled into:

  • Cheap/fast model for scaffolding, file reads, summaries, boilerplate
  • Frontier model only for the steps that need real reasoning - the escalation is a single command, same context
  • Local model for anything touching code or data I don't want leaving the machine

The cost curve changes completely. Instead of paying frontier prices for 100% of tokens, you pay them for the 20% of steps that actually need it. Over a week of heavy use the difference is not subtle.

Implementation details that matter: sessions are files on disk (resumable/forkable, so routing decisions survive restarts), it's a single Go binary, no telemetry, and there's a headless mode (zero exec, streams JSON) if you want to wire the routing into scripts or CI instead of doing it interactively.

Open question I haven't solved: my escalation decisions are still vibes-based. Has anyone built actual heuristics for when a task deserves the expensive model - token-count thresholds, retry-on-failure escalation, task classification? Curious what's working for people running this at scale.

reddit.com
u/amu4biz — 14 hours ago

Coding agents are quietly shifting from "pick our model, use our cloud" to "bring any model, run it yourself" and it feels like a real inflection

Been noticing a pattern across the newer AI coding tools and wanted to see if others see it too.

The first wave (Cursor, Copilot, Claude Code) all share the same shape: the tool is tied to a model or a small curated set, a lot of it runs through the vendor's cloud, and you're basically renting into one company's stack. That was fine when only a few models were any good.

But now that there are a dozen genuinely capable models, and strong local ones via Ollama/LM Studio , that lock-in is starting to feel outdated. And a new crop of tools is being built around the opposite assumption.

The clearest example I've hit is Zero (open source, github.com/gitlawb/zero). The whole pitch is "your model, your machine, your rules" — it talks to 24+ providers, you can switch models mid-task, it runs locally, and it stores nothing remotely (no telemetry). The model is a swappable part, not the identity of the tool.

What's interesting to me isn't the specific tool, it's the architectural bet: that inference is becoming a commodity you route to, the way we already treat storage or compute. If that's right, "which model does your coding agent use" becomes as weird a question as "which brand of electricity powers your laptop."

Do you think provider-agnostic, local-first agents are actually the future here, or does the convenience of an all-in-one cloud tool (Cursor etc.) win for most people regardless? Curious where people land.

reddit.com
u/amu4biz — 1 day ago

My portfolio is down 60% but I just aped into a coin called $babyansem so things are looking up

I've read the whitepapers. I've studied the tokenomics. I've spent three years learning about L2 scaling, MEV, and real yield.

And then last night at 2am I bought a coin whose entire thesis is "what if Ansem, but a baby."

No roadmap. No utility. Just a diaper and a dream. 🍼

My wife asked what I invested in. I said "the heir to the throne." She asked what that means. I don't know either. But I'm up 12% and she isn't, so who's the real baby here.

Anyway not financial advice, I clearly have none.

reddit.com
u/amu4biz — 1 day ago
▲ 0 r/OpenAI

Sam Altman personally replied to an open source dev tool project — and most people missed it

Was digging through Sam Altman's replies and found this — back in May he responded directly to GitLawb, the team building git infrastructure for AI agents.

They'd tweeted that if he replied, they'd make GPT-5.5 the default model and Codex the default provider for their platform (which even then had 25.7K GitHub stars, 8.3K forks, 116K downloads). Altman replied "cool thanks!" — the post has 369K views now.

The OpenAI CEO doesn't hand out replies. His timeline is almost entirely researchers and AI-lab people. So when he stops to acknowledge a project by name, it's worth noticing what they're building.

Since then the same team shipped Zero — an open source terminal coding agent (single Go binary, 24+ model providers, MIT licensed, no telemetry) that's pulled ~800 stars in its first two days.

reddit.com
u/amu4biz — 1 day ago

Backtested the dumbest possible trend rule on BTC vs SOL (3yr) — 30% win rate but still very profitable. The R:R is doing all the work.

Ran the simplest thing I could think of: go long when the daily close crosses above its 20-day average, exit when it closes back below, 8% stop. 3 years, $1,000, slippage included, no fancy filters.

  • BTC**:** +79.5% — CAGR 21.9%, win rate 30%, max drawdown −29%
  • SOL**:** +263.9% — CAGR 55.6%, win rate 36%, max drawdown −62%

What got me: both won under 40% of the time. All the profit came from winners being 3.8–4.7× the size of losers. Same exact rules, but SOL's volatility juiced both the return and the drawdown hard.

(Ran it through an MCP in Claude called Rix so I didn't have to write the code, but the takeaway is really about the R:R, not the tool.)

Makes me wonder how much a regime filter or a volatility-scaled position size would smooth that SOL drawdown. Anyone here run naive trend rules like this across assets

Does the "low win rate, high R:R" pattern hold up on the alts you've tested?

reddit.com
u/amu4biz — 2 days ago
▲ 5 r/ollama+1 crossposts

New open source coding agent written in Go — bring your own model, runs on your machine

A coding agent called Zero launched today and the pitch caught my attention: your model, your machine, your rules.

It's built from scratch in Go — single binary, no Python environment, no dependency hell. Install is one line:

npm install -g u/gitlawb/zero

Repo: https://github.com/Gitlawb/zero

What stood out vs the usual coding agents:

- You pick the model provider, it's not locked to one API

- Runs on your machine, not their cloud

- Go means it's lean — the team behind it (GitLawb) claims 5x faster than their previous harness

It's early ,someone already flagged that only one provider can be active at a time, and the maintainers said to file a feature request. So rough edges exist but they're responsive.

Anyone tried pointing it at a local model through Ollama yet? Curious how it handles smaller coding models vs the big API ones.

u/amu4biz — 3 days ago

$ZAYED - The Golden Bull Has Arrived on Solana | From $ANSEM Caller to Next Big Move

Hey Solana Fam,

If you followed $ANSEM, you already know the name: zayed.eth called it at ~$3M and it ran hard. Now he’s back with his own coin — $ZAYED, positioning as The Golden Bull of Dubai.

Why this one stands out:
• Strong dev conviction: Bought 15% of supply for airdrops/locking (no random dumps). 10% already locked via Streamflow for transparency. 

• Clean chart setup with early momentum — still early (“ugly part” before it becomes obvious).

• Bullish Dubai-inspired branding and community energy. Golden Bull narrative fits perfectly in this market cycle.

• CA: 6XUkLa8fbTzRUkGotni7UtkG4EHtUFe8qPhJU5THpump

This isn’t just another random launch. It’s backed by someone who’s already delivered in this space, with real skin in the game and a clear vision for $10M+ minimum.

DYOR, as always. Check the charts, locked supply, and community. The bull is awake 👑

Links:
• Dexscreener: https://dexscreener.com/solana/F1Q7rvMJQ7KFGutSpmVqJab1yG1vw5kEfNWNNXziH5Mn

• X: @ZayedETH

What do you think — next runner or fade? Let’s discuss.

reddit.com
u/amu4biz — 5 days ago
▲ 5 r/defi

On-Chain Borrowing Against Assets at 3% APR: Worth It Compared to Traditional Brokers?

Traditional margin loans from brokers (Schwab, Fidelity, Robinhood, IBKR etc.) often charge 6-12%+ APR, and the cash just sits there.

On Solana DeFi, some setups now let you borrow stable USD against tokenized assets (including equities) at much lower rates — around 3% APR — while keeping full exposure to the underlying.

Utility breakdown:
• Access dollar liquidity without selling long-term holdings.
• Use the borrowed stable across DeFi (lending, trading, payments).
• Collateral remains productive and transparent on-chain.
• Potential for the stable itself to earn yield when staked.

This could be a meaningful edge for capital efficiency in a DeFi portfolio — cheaper leverage, better composability.

Anyone actively using on-chain borrowing against RWAs or equities?
How do the rates, liquidation risks, and UX compare to TradFi in practice?
Any good protocols or setups worth looking at right now?
For one transparent example and ongoing discussion: https://x.com/nestusd
DYOR.
Borrowing always carries liquidation risk if collateral value drops.

reddit.com
u/amu4biz — 5 days ago

Discussion: NestUSD – Borrowing nUSD against tokenized US equities on Solana

With growing interest in RWAs and on-chain tokenized assets, I’ve been looking into NestUSD on Solana.

It allows holders of xStocks tokenized US equities to use them as collateral to mint nUSD (a dollar-pegged stablecoin) at a fixed 3% APR. Stakers of snUSD can earn a share of protocol stability fees (recent targets around 6% APY). Backing comes from the tokenized equities plus USDC reserves, all visible on-chain.

xStocks endorsed the project recently. It’s still early-stage with modest TVL.

Curious if anyone has checked it out or has thoughts on this setup:

  • How do the liquidation parameters hold up compared to traditional margin on equities?
  • Is using tokenized stocks as stablecoin collateral a meaningful innovation here, or mostly incremental?
  • Broader take on Solana-based RWA/equity experiments right now?

Website: nestusd.com
Twitter: https://x.com/NestUSD

As always, DYOR and manage risk — newer protocols carry smart contract and adoption risks.

u/amu4biz — 5 days ago
▲ 0 r/solana

$ANSEM exploding on Solana after Ansem airdrops millions to users

Big movement in the Solana memecoin scene right now.

Crypto influencer Ansem (@blknoiz06) is airdropping roughly $7M worth of $ANSEM (The Black Bull) across hundreds of wallets on Solana. His stated goal is to push toward 1 million holders.

Key points:

  • Massive distribution play — from small amounts to multiple 5-6 figure airdrops
  • Token has seen extreme volatility and volume (thousands of percent moves in days)
  • Ansem reportedly holding a large portion of supply while actively distributing
  • Ties into the broader “trenches” / Solana meme culture narrative

Whether you love or hate memecoins, this is one of the more interesting distribution experiments happening live on Solana.

CA: 9cRCn9rGT8V2imeM2BaKs13yhMEais3ruM3rPvTGpump

What do you guys think — genius community building or classic influencer coin risk? Anyone aping or just watching from the sidelines?

reddit.com
u/amu4biz — 5 days ago

Decentralized Git for AI Agents: Gitlawb looks like the real deal (must-watch video)

As AI agents get more autonomous and start owning/editing code at scale, the old centralized GitHub + PAT token model is going to break down fast.

This video does a great job explaining the problem and how Gitlawb is building the solution:

  • Decentralized git network where agents are first-class citizens (not just bots)
  • Cryptographic DIDs for identities (human or agent)
  • Every commit signed properly
  • Running on Base L2 with their own nodes
  • Incentives for independent node runners via token
  • Tools like OpenClaude, OpenGateway, Playground, etc.

It feels like the missing infrastructure layer for the agentic era.

Video (4.5 mins, nicely animated)

What are you all using right now for agent code collaboration and version control? Still GitHub + manual oversight, or have you found something better?

Curious to hear thoughts from people actually running fleets of agents.

reddit.com
u/amu4biz — 5 days ago

I was missing every World Cup goal while working, so I built a Chrome extension that smashes a ball through my screen when anyone scores

I keep getting stuck vibecoding, bullposting on X on my browser, during the World Cup and missing goals. So I built Goal , a free Chrome extension that watches live matches and, the second anyone scores, a soccer ball smashes your screen with the score, the scorer, and a commentator yelling GOOOAAAL.

It's completely free, no signup. Built it mostly for myself but figured others might enjoy it.

Video of it in action 👇 (the smash is real, that's the actual overlay)

Would love feedback

https://reddit.com/link/1uhv1u4/video/ru990darf0ah1/player

reddit.com
u/amu4biz — 7 days ago

Someone built a reverse CAPTCHA — instead of proving you're human, it proves you're a bot

Most infrastructure on the internet is built to keep bots out.

But what happens when bots are the legitimate users?

The founder of GitLawb — a git platform built specifically for AI

agents — ran into this exact problem. One of their nodes fell over,

pushes started failing with 500s, and he needed to verify the agent

on the other side of the connection was actually a legitimate agent.

Not a human. Not a scraper. An authenticated AI agent.

Traditional CAPTCHAs are useless here — they're designed to block

exactly what he needed to let through. So he built the opposite. A

CAPTCHA where you prove you're a bot to get access.

This feels like one of those early infrastructure problems that's

going to matter a lot as agents become more autonomous. Right now

most of the internet's identity layer assumes a human is on the other

end. That assumption is breaking down fast.

Curious if anyone else has run into this — how are you handling

agent authentication in your pipelines? Is there any standard

emerging or is everyone rolling their own?

reddit.com
u/amu4biz — 8 days ago
▲ 5 r/Gitlawb+1 crossposts

Open Source & Decentralized Infrastructure for AI Agents (Git Layer Discussion)

With AI agents becoming more capable, the open source community is starting to build the infrastructure layers needed to support them properly.

One recent example shows a suite focused on decentralized Git for AI agents. It includes tools like an open-source coding harness, agentic memory for persistent context, unified model access, and decentralized storage/nodes.

Why this kind of open source work matters:

\- Agents need reliable version control, persistent memory, and collaboration tools that aren't locked into proprietary platforms.
\- Decentralized and open approaches provide censorship resistance, verifiability, and true ownership of code and data.
\- Open standards and community governance help ensure interoperability between different agents, tools, and human developers.
\- As more code and workflows are generated by AI, having auditable, forkable, and distributed systems becomes essential for long-term sustainability.

It’s part of a larger trend toward open, agent-native ecosystems rather than relying solely on centralized services.

What are your thoughts? Are there other open source projects working on similar decentralized tooling for AI agents or code collaboration? Which challenges do you see as most important to solve?

u/amu4biz — 8 days ago

SKALP — read the market, long/short, don't get liquidated (60-sec arcade)

Game Title: SKALP

Playable Link: https://skalp.fun

Platform: Browser (desktop & mobile)

Description: SKALP is a 60-second trading-floor reflex game that turns reading a market into a fast arcade challenge. A price line scrolls live across the screen, and your only job is to read its momentum. Go LONG when it climbs, SHORT when it dives, and FLATTEN to lock in your profit before a reversal wipes you out. You trade on 6x leverage, so every move is amplified.

Free to Play Status:

[x] Free to play

[ ] Demo/Key available

[ ] Paid (Allowed only on Tuesdays with [TT] in the title)

Involvement: Solo developer. I designed the game ,All original work.

reddit.com
u/amu4biz — 9 days ago

Stop missing the early buys on EasyA Kickstart — free bot that alerts you the second money moves in

Been aping EasyA Kickstart tokens and got tired of finding out about the runners after they already 2-3x'd. So I built a free Telegram bot that pings me the instant buys start hitting — figured this crowd would get the most out of it.

Why it helps you catch moonshots early:

🆕 New-launch alerts — know the second a fresh token drops, not 30 min late

🟢 Live buy alerts on any token you follow — SOL + USD size, price, market cap, holders, the buyer

🐋 Whale filter — only get pinged when big money moves in (the real signal)

🎓 Graduation alerts — catch tokens completing their bonding curve before the next leg

Follow one token, or follow ALL active ones and let the alerts find the movers

Important / no BS:

- It's 100% free. No token, no presale, no referral, nothing to buy.

- No wallet connection, no signup — it only reads public on-chain data. It literally can't touch your funds.

- I built it for the community, not to shill a bag. Not financial advice, DYOR.

Open to feature requests — tell me what alert would actually help you snipe earlier and I'll add it.

reddit.com
u/amu4biz — 10 days ago
▲ 2 r/solana

I built a free, open Telegram buy bot for EasyA Kickstart tokens — no wallet, no signup, public data only

EasyA Kickstart launches a lot of tokens and I kept missing buys on the ones I was watching, so I built a Telegram buy bot for it. Sharing it here because it's free and might be useful to others trading these.

What it does

  • Follow any EasyA token — search by name, browse trending, or follow all active ones
  • Live alert on every buy: amount in SOL + USD, price, market cap, bonding-curve %, holder count, and the buyer's wallet
  • Graduation alerts when a token completes its bonding curve and migrates to an AMM
  • New-launch alerts the moment a fresh token drops
  • Whale and market-cap filters so you only see buys above a threshold (cuts the dust)
  • Works in DMs, or admins can add it to a project's group

How it works (for the curious)
EasyA Kickstart has no public API, so I traced its frontend and found it sources data from Jupiter's public data API. The bot polls that for trades and token metadata — so it's all read-only, public on-chain data. No wallet connection, no signup, nothing to sign. It can't touch your funds because it never asks for anything.

Stack is Node + Telegraf, runs 24/7 in Docker. Built it for the EasyA community but it's open to anyone.

Genuinely want feedback — if there's a data point you'd want in the alerts or a filter that's missing, tell me and I'll add it.

EasyA_Buybot on telegram — happy to answer anything about how it pulls data.

reddit.com
u/amu4biz — 10 days ago

Found a way to earn passive income just by using AI tools

So I've been using this AI tool called GitLawb and kind of stumbled onto something.

Every time you send a prompt there's that few seconds where the model is thinking.

Turns out they put a short ad in that gap and split the revenue with you as the

user. You're not doing anything extra, you're just using the tool like normal.

Payouts are in USDC which is the part that actually got my attention. Not points,

not credits, actual stablecoin.

The ad side is interesting too — anyone can buy a slot directly with USDC, no

approval process or sales call. So sponsors are actively putting money in and that's

what flows to users.

I can't tell you exact numbers per prompt yet, I haven't used it long enough. But

the concept is pretty clean and I haven't seen another AI tool do this.

Anyone else found tools that actually pay you to use them?

reddit.com
u/amu4biz — 10 days ago

The inference market is splitting in two and most people haven't noticed

Been thinking about this a lot lately. Everyone's focused on which model is best, but the more interesting battle is happening one layer down — who actually runs the inference.

OpenRouter just raised $113M and is routing 47 trillion tokens a week. That number is insane. It's basically becoming the NYSE of AI requests — developers point to it and let it figure out which provider to hit.

And the provider landscape is quietly splitting:

On one side you have the hyperscalers and the professional inference players (Fireworks, Together, Groq etc.) — they compete on uptime, SLAs, enterprise contracts. Boring but necessary.

On the other side you have the decentralized networks trying to build the permissionless version underneath. Akash, io net, Venice, c0mpute and a handful of others. They're not trying to win on reliability — they're winning on things AWS structurally can't offer. No content filters. No account bans. No rate limits controlled by one company.

DeepSeek now makes up 4 of the 5 most used models on OpenRouter. When the model itself is basically free and open, the infrastructure underneath it starts to matter a lot more.

Some of these networks are doing interesting things architecturally — distributed inference across consumer GPUs with verifiable receipts showing actual GPU IDs and public IPs. Early but the approach is different enough to be worth watching.

Do decentralized inference networks ever actually break mainstream or do they stay a tool for people who specifically need censorship resistance?

reddit.com
u/amu4biz — 10 days ago

I built MeetMoves — a Chrome extension that lets you use any video clip or GIF as your live Google Meet background

Tired of the same boring blurred or static backgrounds in Google Meet? I built MeetMoves so you can upload any video clip or GIF and use it as a real moving background during calls.

It integrates directly into Google Meet’s background panel — just click the + tile, pick your video, and it’s live. No extra apps, everything stays in your browser.

Key features:

  • Supports MP4, WebM, GIFs
  • Privacy-first: nothing is uploaded to servers
  • One-time unlock for unlimited use (free for the first day, then $1 forever)

https://reddit.com/link/1uf8b0q/video/yy4fhdtg7f9h1/player

Try it here:
https://chromewebstore.google.com/detail/meetmoves/pcihfjkbfcfademdaplkhmgngdlkfepg

Would love honest feedback from the community — especially on UX, any bugs you notice, or ideas for improvements. Built this as a quick side project and just shipped it.

Looking forward to your thoughts!

reddit.com
u/amu4biz — 10 days ago