r/learnAIAgents

I built an open-source, self-hosted AI gateway: 237 providers (90+ free), auto-fallback combos, and a 10-engine token-compression pipeline (MIT)

Builders-welcome post with the substance up front (disclosure: I'm the maintainer). OmniRoute is a free, MIT, self-hosted AI gateway — one OpenAI-compatible endpoint over 237 providers — built around two problems: runs dying on a provider 429, and tokens bleeding on tool/log output.

One endpoint, 237 providers — 90+ of them free. You point any tool or agent at a single OpenAI-compatible endpoint (localhost:20128/v1) and it can reach 237 LLM providers without you rewriting anything. 90+ have free tiers and 11 are free forever (no card), which aggregates to ~1.6B documented free tokens/month — and that's honest, pool-deduped math (we count each shared pool once instead of inflating it; the methodology is public in the repo). There's a one-command setup-* for 13+ coding tools (Claude Code, Codex, Cursor, Cline, Roo, Kilo, Gemini CLI…), so switching your existing setup over takes seconds.

Fallback combos — so it never stops mid-task. A "combo" is a ladder of models the router walks automatically: your subscription first, then API keys, then cheap models, then free ones. When a provider returns a 500 or you hit a rate limit, it slides to the next target in milliseconds, mid-request, and your tool never even sees the error. There are 17 routing strategies (priority, weighted, round-robin, cost-optimized, auto/coding:fast…) plus three resilience layers — a per-provider circuit breaker, a per-key cooldown, and a per-model lockout — so one dead key can't take down a whole provider.

Fusion — an ensemble mode for the hard steps. Beyond simple routing, there's a fusion strategy that fans a single prompt out to a panel of different models in parallel and then has a judge model synthesize one best answer (mixture-of-agents, built in). It's cost-aware, so easy turns stay on one fast model and it only fuses when the step is worth it.

A 10-engine compression pipeline — the part most routers don't have. Every request flows through a transparent compression pass you can toggle/stack per combo. Instead of one trick, it stacks the best of the open-source ecosystem: RTK filters command/tool output (git diffs, test logs, builds) at 60–90%, Microsoft's LLMLingua-2 does ML semantic pruning, Caveman handles prose, session-dedup strips repeats across turns. Critically, code, URLs and JSON are preserved byte-perfect, and a default-on inflation guard throws the compressed version away and sends the original if compressing would actually grow the prompt — it never makes things worse. On tool-heavy sessions that's ~89% average input-token reduction (an 8k-token git diff becomes a few hundred). Full credit to every upstream project (RTK, Caveman, LLMLingua-2, Troglodita) is in the README.

Agent-native — the agent can drive the router itself. There's a built-in MCP server (95 tools across 30 audited scopes, over stdio / SSE / streamable-HTTP), plus A2A (v0.3, JSON-RPC 2.0) support. That means an agent can query providers, switch combos, read its own remaining quota and manage memory through the gateway — not just consume tokens through it.

It's 100% local (zero telemetry, AES-256-GCM at rest), MIT-licensed, has a prompt-injection guard on every LLM route, opt-in memory, and runs on npm, Docker, desktop or your phone via Termux.

For context on whether it's worth your time: it's grown to ~9.8K GitHub stars, 1,490+ forks and 280+ contributors in ~4.5 months, with 21,000+ automated tests and 1,830+ issues closed — so it's a battle-tested project, not a brand-new experiment.

npm install -g omniroute

GitHub: https://github.com/diegosouzapw/OmniRoute · Site: https://omniroute.online

Would value a critique of the routing/compression architecture from this crowd.

u/ZombieGold5145 — 3 days ago

▲ 5 r/learnAIAgents+2 crossposts

Is anyone else starting to lose track of all their AI agents / automations?

I’ve been experimenting with different AI tools and agents, and honestly it’s getting messy.

Some are in ChatGPT, some in workflows, some in external tools… and there’s no real “overview” of everything.

I’m wondering:

How are you keeping visibility on all of them in one place (if at all)?

Or is everyone just improvising right now?

reddit.com

u/Gallegos_Daniel — 1 day ago

▲ 2 r/learnAIAgents+3 crossposts

What I Learned by Making an AI Agent Read My Company's Mind

Most of the AI conversation in marketing today runs in one of two directions. There’s the outside-in direction — competitive intelligence, analyst coverage, social listening — where AI helps you understand the market. And there’s the forward direction — campaigns, ABM, lead nurturing — where AI helps you move people through a funnel.

What’s almost never discussed is the third direction: inside-out. What happens when you point an AI agent at your own company’s internal knowledge — specs, tickets, call transcripts, win/loss notes — and ask it to do the unglamorous but critical work of product marketing? Not “tell me about the market,” not “write me an ad,” but: generate the battle card, draft the release note, and tell me if what we’re saying externally still matches what’s true internally.

That third lane is what I wanted to explore. So I built PMM Second Brain — a working AI agent, backed by a real (if fictional) company’s internal wiki, that produces actual PMM deliverables and catches messaging drift before a customer does.

This post is the story of how it came together: the thinking behind it, the architecture, the build process, and a few things I learned along the way — including some genuinely humbling moments getting it to run on my own laptop.

https://yotam.substack.com/p/building-a-second-brain-for-product

u/Cyberthere — 2 days ago

▲ 8 r/learnAIAgents+5 crossposts

Build AI Code Review Agent ( looking for feedbacks and contribution )

I've been learning AI engineering by building instead of just watching tutorials.

To push myself beyond the basics, I started building an AI Code Review Agent. The goal wasn't to create a polished product—it was to force myself to understand how these systems actually work.

Some of the concepts I ended up learning along the way:

Retrieval-Augmented Generation (RAG)
Embeddings and vector search
ReAct-based agent workflows
LLM-powered code analysis
GitHub integrations

One thing I learned quickly is that getting an LLM to answer questions isn't the hard part. Making retrieval reliable, giving the agent the right context, and designing good workflows takes much more iteration than I initially expected.

The project is open source and still a work in progress, with plenty of room for improvement as I continue building and learning.

Repository: https://github.com/RishabhhG/codereview-agent
Linkedin : https://www.linkedin.com/in/rishabh-guptaaa/

If anyone wants to try it out, use it, or contribute, I'd really appreciate the feedback. I'm also happy to discuss the architecture, implementation decisions, or hear suggestions for improving the agent.

u/Aggravating-Drama916 — 3 days ago

▲ 5 r/learnAIAgents+5 crossposts

You will love this site

I love helping people out and I recently launch a small community project for people who are building AI Agent. As someone who has found tons of good ideas over the years. I thought it only made sense to put all agent ideas together.

So I build Agent Idea Hub a place for people to find ideas on agents that are worth building (you can actually make money from it)

Would love for you all to join ! As the spot are limited and I won’t allow more than 50 members to keep the ideas solid.

u/yomatt41 — 3 days ago

▲ 11 r/learnAIAgents

Why are good agent skills so hard to discover?

Every time I discover a cool skill repository I think: "This one looks useful"

Ten minutes later I uninstall it. The README is usually more impressive than the actual workflow improvement :-(

At this point I've become much more skeptical of GitHub stars and flashy demos.

Wondering if anyone else has had the same experience. and if you have some skills you consistently use, pls tell me (No Ads pls)

reddit.com

u/IndependenceGold5902 — 4 days ago

▲ 3 r/learnAIAgents

Looking for agent building friends

hey guys looking to connect with people who know agents to learn how to build and optimize agents to sell to businesses and to use for myself.

reddit.com

u/SunnySAGA03 — 5 days ago

▲ 8 r/learnAIAgents+6 crossposts

Published Part 5 of our Data Explorer architecture series.

This one covers the SSE streaming layer that carries our AI agent's lifecycle from server to browser.

We chose SSE over WebSockets for the agent stream. The conversation is unidirectional during execution — user sends request, agent streams response. No need for browser to push mid-stream. SSE gives us built-in reconnection, HTTP compatibility with every proxy/CDN, and ~50 lines of parser code.

The protocol defines 18+ typed event types across 6 categories:

- Agent lifecycle (status, chat_id, context_meta)

- Reasoning (thinking, thinking_chunk, plan, plan_update, thinking_clear)

- Tool (tool_call, tool_result, tool_retry)

- Artifact (artifact_start, artifact, artifact_rows)

- Content (delta)

- Completion (done, error, quota_exceeded, quota_update)

Each event type maps to a specific UI affordance. Frontend doesn't infer state from text deltas — the protocol tells it what's happening.

Three things worth highlighting:

Chunked payloads for >64KB. SSE has a practical ~64KB limit in many proxy/CDN configs. Our emitter auto-chunks oversized payloads into frames with chunk_index/total_chunks metadata. Frontend reconstructs transparently.
Streaming tables in 50-row batches. Instead of buffering a 10K-row table and chunking it, we emit rows in fixed batches. First 50 render in <200ms, rest append without blocking UI. Memory is bounded.
The Nginx gotcha. X-Accel-Buffering: no is the non-obvious header that makes SSE actually work behind Nginx. Without it, Nginx buffers the entire stream before forwarding. Browser sees nothing until agent finishes, then gets every event at once.

Retry strategy: 4xx errors (auth, validation, quota) are NOT retried — they require user action. Only network errors and 5xx trigger exponential backoff (1s, 2s, 4s, max 3 retries).

Full post with code, Mermaid diagrams, payload examples:

https://vivekmind.com/blog/sse-streaming-architecture-how-schema-weaver-s-data-explorer-streams-18-event-types-and-chunked-payloads-to-the-browser

Happy to answer questions on the protocol design, chunking strategy, or the Nginx gotcha.

u/Vivek-Kumar-yadav — 5 days ago

▲ 11 r/learnAIAgents

It’s so hard to build an actual agent

I doubt this will resonate with anyone here since from what I’ve seen (most) people here are great when it comes to using AI.

For the average Joe, it’s historically been very difficult to set up an AI Agent, somewhere to store data, giving your agent it’s own memory, making it become smarter over time, being able to speak to it from anywhere without your computer being on.

It’s a lot, so for the most part it’s always been avoided, and people resort to using chatbots.

I’ve built a tool that lets anyone create their own super-agent in natural language;

\- Speak to it from anywhere.
\- It has it’s own memory, and becomes smarter over time.
\- You can connect it to 500+ apps
\- Your super-agent can deploy subagents to delegate tasks to.
\- Over time as it learns what you do and your workflow, it can execute tasks fully autonomously based on what it learns from you.
\- It also has its own inbox and computer so it can do practically anything you tell it to.

We’re still in stealth but the product works today.

I’m happy to give a few people full access completely for free to test it out.

Btw there probably still is loads of bugs I need to iron out but we can get there together!

If you’re interested, please message me or reply here, I’ll get back to everyone asap! I’d love to see what use cases you guys have :)!

reddit.com

u/ghostt2x — 8 days ago

▲ 5 r/learnAIAgents

What's one lesson about building AI agents that you wish you knew earlier?

After spending more time experimenting with AI agents, one thing has become clear to me:

Building the agent is usually the easy part. Building one that's actually useful is much harder.

I initially focused on adding more capabilities, multiple tools, longer prompts, memory, and complex workflows, thinking that would make the agent better.

In reality, the biggest improvements came from simplifying things:

Giving the agent one well-defined responsibility.
Spending more time on prompt design than adding new features.
Improving the quality of inputs instead of increasing model complexity.
Testing with real-world scenarios instead of ideal examples.

It reminded me that a reliable agent solving one problem consistently is often more valuable than a sophisticated agent trying to solve ten.

I'm interested to hear from others in this community:

What's one lesson you've learned while building AI agents that completely changed your approach?

Whether it's about prompting, orchestration, tool selection, memory, evaluation, or deployment, I'm sure newer builders (myself included) would benefit from hearing what actually worked in practice.

reddit.com

u/Correct-Address-3735 — 7 days ago

▲ 1 r/learnAIAgents

6 AI micro-saas to $20k/mo. i built a community to share how

yo. going from a buggy MVP to actual recurring revenue is brutal.

i stabilized my 6 apps at $20k/mo mrr only after building a strict system for my tech stack and organic marketing.

i just opened the AI SaaS Launchpad.

the community and daily resources are completely free. for those who want to copy-paste my exact systems, i also host paid, structured sprints (like a 3-Day challenge to get your first 100 users using automated Reddit and LinkedIn outreach).

either way, stop building in isolation. you will quit when things get hard. come build alongside 1000+ other founders.

drop a comment or shoot me a dm and i’ll send the link right now.

Processing img q1e8wqfp49ah1...

reddit.com

u/Wide-Tap-8886 — 7 days ago

▲ 4 r/learnAIAgents+2 crossposts

We’re moving past simple RAG. Microsoft just shared how they are shifting to Agentic Workflows internally.

If you've been building in the AI space lately, you know that transitioning from standard chat interfaces to truly autonomous, agent-based systems is the current frontier.

Microsoft just dropped a solid engineering blog on how they are fundamentally transforming their own software development lifecycle by leveraging an agentic platform. It covers some great architectural insights on moving beyond simple automation into complex, multi-agent reasoning for real-world development tasks.

Definitely worth a read if you're exploring agentic architectures or looking to scale autonomous dev tools.

Check out the full breakdown from Microsoft here

u/ninjitsuytber — 7 days ago

▲ 6 r/learnAIAgents

Want to learn agentic ai

I ama beginner and want to learn agentic ai what should i start from because i am a full beginner. Can anyone recommend me a path

reddit.com

u/HomeworkFit8239 — 9 days ago

▲ 5 r/learnAIAgents

Want to learn agentic ai

I ama beginner and want to learn agentic ai what should i start from because i am a full beginner. Can anyone recommend me a path

reddit.com

u/HomeworkFit8239 — 9 days ago

▲ 1 r/learnAIAgents

Guidance required

Hey all!!

I really want to build an agent but I dont know how to code, all the videos on youtube require some kind of coding, they are all building on claude code

Can we not make an agent just by typing on claude chat and asking it to build an agent, If I am able to prepare the system prompt/instructions beforehand.

A guiding reply will be appreciated since, I have not tried making an agent and an easier approach is not available on youtube tutorials.

How should I start?

Should I just write a that I want to make a prompt or should I give an prompt like I want to make an agent that can update my monthly financial data from the raw data file to in a formatted file a d then further describe the formulas and give it the raw data files and then give it the format and way the format is to be updated.

Is it the right way? What exactly is the right way? I don't want to waste tokens, I know I will end up wasging some while practicing bit I do want to start with the right way.

reddit.com

u/inquisitive_aunt — 8 days ago

▲ 12 r/learnAIAgents

want best Agentic Ai resources on Youtube

I keep seeing “AI agents will replace everything” content everywhere, but most tutorials feel like just prompt chaining.

Are there any **serious resources** that show:

* real multi-step reasoning agents * tool use + memory systems * production-ready AI agent architecture

please suggest me some usefull channels

reddit.com

u/Tribalcheaf123 — 11 days ago

▲ 16 r/learnAIAgents+9 crossposts

CortexPrism — Open-Source AI Agentic Harness

cortexprism.io

u/scarecr0w12 — 12 days ago

▲ 24 r/learnAIAgents+1 crossposts

What is your ai stack to do multiple complex jobs, what have you tried and what are you using now?

Real answers from genuine people, don't just say what you use say what you've made as well. Anyone elses own brain suffering from system overload trying to figure out which one to pick and choose for what? Much appreciated.

reddit.com

u/JaydenMongoose — 13 days ago

▲ 2 r/learnAIAgents+1 crossposts

Help: Need API key for trying my Ai agent is working or not

If any can offer me any api key for testing my ai agent. I'm very grateful for them

reddit.com

u/Noob-editor12 — 10 days ago

▲ 2 r/learnAIAgents

How much is a Claude cost per token?

How much does each Claude model actually cost per token, and why is the output price so much higher than the input price? I'm trying to estimate my monthly API bill, and I can't tell whether the long responses I'm getting are what's driving the cost or if it's the big context windows I'm sending in.

reddit.com

u/Witty_Champion6609 — 11 days ago