r/mcp

▲ 83 r/mcp+9 crossposts

Hey everyone,

I just open-sourced TuneForge.

The goal is simple: let your coding agent manage the full LLM improvement loop without ever leaving the chat window.

You can now tell your agent something like:

“Build me a customer support bot from this FAQ”

…and it can:

• Generate a clean synthetic instruction dataset (with LLM judging for quality)

• Run LoRA supervised fine-tuning on any Hugging Face causal LM

• Do a quick policy-gradient RL step using Ollama as the reward judge

• Merge the adapter, evaluate on a test set, and iterate

Everything runs locally, uses 4-bit quantization so it fits on modest hardware, and uses background jobs (with job_id polling) so long training tasks don’t freeze the MCP connection.

It’s built around the Model Context Protocol (MCP) for seamless integration with Claude Desktop, Cursor, Zed, Continue.dev, etc.

Tech: Python + Transformers + PEFT + bitsandbytes + Ollama + SQLite for job state.

Super early stage (just released), MIT licensed.

Would love feedback or ideas on what to add next. If you’re into agentic fine-tuning workflows, give it a try and let me know how it goes!

u/Just_Vugg_PolyMCP — 2 hours ago
▲ 29 r/mcp+7 crossposts

Local coding models need better repo context, not just bigger context windows

Local coding models have a repo-context problem.

When using llama/qwen/mistral/gemma for coding, the hard part is often not the model itself. It is getting the right files/functions into context without dumping too much raw source.

Long context helps, but it does not solve retrieval.

If the model never sees the right file, it still guesses.

I’ve been building SigMap, a zero-dependency CLI that creates a compact repo map for coding workflows.

Instead of sending raw source first, it extracts:

  • function signatures
  • classes/interfaces
  • exports
  • import relationships
  • ranked file matches per query

The workflow is simple:

repo map first → find likely files → read full source only where needed

Benchmarked across 18 repos / 90 tasks:

  • 81.1% hit@5 vs 13.6% random baseline
  • ~6× better file retrieval
  • 96.9% token reduction in the benchmark setup
  • 41.4% fewer prompts per task

No embeddings. No vector DB. No npm dependencies.

This is not meant to replace LSPs, grep, agent search, MCP tools, or full-file reads.

It is meant to give local coding models / agents a cheap first-pass structure map before deeper inspection.

Repo: https://github.com/manojmallick/sigmap

Benchmark suite: https://github.com/manojmallick/sigmap-benchmark-suite

Curious how people here handle repo context with local coding models.

Are you mostly using grep/search, RAG, repo maps, MCP tools, or just relying on longer-context models?

Edit: Good point from the comments — SigMap core is model-agnostic. The docs currently look too focused on proprietary assistants, so I’ll add clearer examples for VSCodium/Open VSX, Continue, Cline/Roo Code, Aider, OpenHands, and local Ollama/llama.cpp workflows.

u/Independent-Flow3408 — 5 hours ago
▲ 18 r/mcp+1 crossposts

apple's safari mcp server is more interesting than i initially thought

apple's safari mcp server only exposes 17 tools and runs inside an isolated webdriver session, while the community safari-mcp implementation has around 96 tools and can work with existing browser sessions.

the difference is pretty interesting. apple seems to be treating mcp as a clean-room debugging environment rather than giving agents access to your actual browser state.

there's also the bigger issue that most browser automation tooling is still heavily chromium-first.

this comparison goes deeper into both approaches:

https://rune.codes/hub/tech-trends/the-safari-mcp-server-could-change-how-developers-debug-websites

do you think browser mcp tools should be isolated by default, or is access to real browser sessions more useful?

u/Low-Trust2491 — 8 hours ago
▲ 4 r/mcp

Your AI coding agent isn't hallucinating. It's out of date.

We've been building and testing an MCP server for library upgrades and migrations over the last few weeks, and one pattern kept showing up.

The biggest source of wasted effort isn't hallucination in the classic sense. It's an agent operating on outdated knowledge and not realizing it.

An agent that doesn't know what changed between versions doesn't fail cleanly. It writes plausible code, hits a build error, tries a different plausible fix, hits another error, and keeps going in circles. Each of those loops costs tokens and time, and none of them would be necessary if the agent knew the actual breaking change upfront.

Current documentation tools solve a different problem. They tell an agent what the API looks like today. They don't tell it what changed, what replaced it, or what silently behaves differently now. Knowing the current API and knowing the migration path are not the same thing.

For example:

  • Next.js 14 → 15 changed several request APIs from synchronous to async
  • The Vercel AI SDK 4 → 5 removed and renamed multiple interfaces
  • The MCP SDK's upcoming v2 turned out to be a package restructuring, not a version bump

The agent often knows both versions exist. It has no idea what actually changed between them. So it starts guessing.

We've been building a way to give agents that missing piece directly: Asynthetic, an MCP server that serves hand-curated migration maps. Each one contains only the breaking changes for a specific version jump, with before/after code, deprecation timelines, peer compatibility requirements, and a citation back to the original source on every single entry. Nothing in the data path is LLM-generated. When there's no verified answer for what you asked, it returns found: false and tells the agent not to guess instead of making something up.

The part that took the most time wasn't writing the maps. It was verifying them. For the Next.js 14 → 15 map, every breaking change entry was tested against real builds, both Turbopack and Webpack, dev and production. Some of what the official docs say turned out to be incomplete in practice. Turbopack silently doesn't enforce the removal of the old u/next/font package the way Webpack does, so a default dev setup won't show you the break, only CI will. The React 19 "requirement" for the App Router isn't actually build-enforced either. An App Router app runs fine on React 18.3.1 in testing. None of that is a knock on the Next.js team's docs. It's just the gap between what a migration guide says and what a compiler actually does, and that gap is exactly what we wanted the tool to capture. It's in the map now as a caveat the agent can read.

Two maps are live right now, Next 14 → 15 and Vercel AI SDK 4 → 5, 40 breaking changes total, all cited. A third map for the MCP SDK is sitting there marked stale on purpose, because v2 turned out to be a package split rather than a version bump, and shipping nothing was better than shipping something wrong for that transition.

The part worth discussing isn't a specific number. It's that most of the cost in these agent loops seems to come from wrong turns, not from the actual fix once the agent knows what to do. Preventing the wrong turn seems to matter more than making any individual edit faster.

It's free, public beta, works hosted over Streamable HTTP or fully offline through npx asynthetic. Source-available under BSL, converts to Apache in 2030.

Repo's here if you want to look under the hood or find a case where a map is wrong: github.com/asyntheticai/asynthetic

Curious if others running Cursor, Claude Code, Devin, or similar agents are seeing the same pattern during upgrades and large dependency migrations.

u/dr_stefan — 8 hours ago
▲ 3 r/mcp+3 crossposts

Making AI Schema-Aware

I built a small open-source CLI because AI kept struggling with SQL against a large Oracle schema. It makes AI schema-aware before it writes SQL.

The workflow is simple:

  1. Search database metadata
  2. Pull table context
  3. Run small read-only checks
  4. Save useful SQL
  5. Store domain notes in Markdown

It is Oracle-only right now and still early, but it has already helped in my own AI coding workflow. Used it mainly with github copilot

Repo: oracledb-navigator

Curious if others are solving AI + database context in a similar way.

u/clean-apps-dev — 9 hours ago
▲ 19 r/mcp+10 crossposts

I built Curion, a librarian-like memory agent for AI agents

I’ve been working on Curion, a memory system for AI agents built around a simple idea:

The main agent should not have to manage memory manually.

Most AI agents are useful inside a single session, but they still lose important context between sessions. Project decisions, implementation history, constraints, unresolved tasks, and previous reasoning often disappear unless I manually write long handoff notes.

At first, the obvious solution seems to be giving the agent memory tools: save, search, update, delete, edit.

But that creates a second problem.

If the main agent has to manage memory by itself, it can easily receive too many raw memories. Some are relevant, some are stale, some are only partially related, and some may conflict with newer information. The agent then has to spend context and attention deciding what matters.

That creates context bloat.

Curion takes a different approach.

I think of Curion as a librarian for AI agents.

A good librarian does not just throw every possibly related book at you. They understand the question, know how information is organized, filter what matters, notice conflicts, ask clarifying questions when needed, and return the most useful context.

That is what Curion is meant to do for agent memory.

The main agent only needs to say:

“I want to remember this.”

or

“I need to recall something about this.”

Curion handles the rest.

When saving memory, Curion can decide how information should be stored, whether it relates to existing records, whether something should be updated, and whether a conflict requires clarification.

When recalling memory, Curion does not just dump raw search results into the agent’s context. It retrieves relevant records, evaluates what is useful for the current task, synthesizes the context, and clearly says when nothing relevant was found.

The analogy I use is human memory. When we want to remember something, we do not consciously search through billions of memories. We ask for what we need, and the relevant memory appears automatically beneath the surface.

Curion is built around that same interface idea for AI agents.

It is project-first: Curion focuses on the project the agent is currently working in. It can also use cross-project recall when information from another project is actually relevant.

Curion is not just a save/search tool. It is a collaborative memory layer: a specialized memory librarian that helps agents remember responsibly, reduces context bloat, and gives the main agent only the context it actually needs.

GitHub: https://github.com/geanatz/curion

NPM: https://www.npmjs.com/package/@geanatz/curion

Portfolio: https://geanatz.com

u/geanatz — 1 day ago
▲ 0 r/mcp

I built an MCP server that gives Claude Code / Codex only the code slices they need - ~87% fewer input tokens, quality-neutral

I've been building a vendor-neutral token-reduction layer, and the piece most relevant here is the MCP server. instead of your agent grepping and dumping whole files into context, it exposes retrieve_code(query) and explain_symbol(name) and returns only the relevant AST slices (tree-sitter, 12 languages). same STDIO server drops into Claude Code AND Codex via an [mcp_servers] block.

why it matters right now: Claude Code and Codex both bill by tokens, so trimming what the agent pulls into context directly stretches your weekly cap. measured on real billed tokens, heavy tasks:

gpt-5.5: 16,875 -> 2,232 input tokens (86.8% fewer), quality 3/3 -> 3/3

opus 4.8: 26,573 -> 3,343 (87.4% fewer), 3/3 -> 3/3

the broader layer has 3 more levers (prefix caching, tail compression with a fact-guard that can't drop a number, and cascade easy steps to a local model) and also runs as an OpenAI/Anthropic-compatible proxy. there's also a small Claude.ai browser extension that compresses what you paste before you send - it's in Chrome Web Store review right now (not verified yet), so load-unpacked for the moment.

honest caveats in the readme: small favorable suites, and static embeddings didn't beat plain keyword retrieval in my eval.

repo (Apache-2.0, reproducible benchmark in validate/heavy_bench.py): https://github.com/AryanGonsalves/trl-token-reduction

reddit.com
u/naruto_uzumaki00 — 15 hours ago
▲ 31 r/mcp+10 crossposts

SeekYou, unified host intelligence across 15 sources

SeekYou – unified host intelligence across 15 sources, runs free on Cloudflare.
- Built a tool that takes any IP, domain, or ASN and queries 15 sources in parallel: open ports, CVEs, BGP, RDAP, cert history, passive DNS, 5 threat feeds, exposed buckets, Wayback snapshots — all in one report.
- 4-layer parallel execution (total time ≈ slowest source, not sum of all).
- KV caching per source, circuit breakers, per-IP rate limiting.
- Typed diff engine — get alerted when ports open, CVEs appear, or certs expire on monitored hosts.
- Runs entirely on Cloudflare free tier (~5k lookups/day).
Source: https://github.com/Teycir/SeekYou (https://github.com/Teycir/SeekYou)

u/tcoder7 — 1 day ago
▲ 2 r/mcp

i built this instead of sleeping, please tell me if it’s stupid

i got tired of the whole “just let agents call your API” thing sounding simple but being annoying once you actually try to do it.

everyone shows the happy path, but then you hit the boring stuff: auth, API keys, deciding which endpoints are safe, huge JSON responses, logs, rate limits, and not letting the model see half your backend for no reason.

so i built a rough gateway/proxy layer.

basically:

agent → gateway → real API

it’s not exactly MCP. it’s more like a curated agent-facing layer in front of an existing API.

the agent gets a scoped gateway key, not the real API key. the gateway checks what tools/endpoints that key is allowed to call, injects the real upstream auth server-side, calls the actual API, slims/redacts the response, and logs what happened.

it also supports some per-tool settings, like different auth/base URLs/response cleanup rules, because real APIs are messy and not every endpoint behaves the same.

the idea is not to replace the API. it’s just the boring wrapper/proxy layer people seem to keep rebuilding when they want agents to use APIs safely.

i haven’t launched it yet because it still needs polish, and i’d rather get roasted now than launch, regret the direction, and realize i built the wrong thing.

now you can roast the f out of me. constructive criticism is welcomed.

reddit.com
u/Decent_Progress7631 — 1 day ago
▲ 326 r/mcp+69 crossposts

I built an open-source, self-hosted AI gateway: 237 providers (90+ free), auto-fallback combos, and a 10-engine token-compression pipeline (MIT)

Builders-welcome post with the substance up front (disclosure: I'm the maintainer). OmniRoute is a free, MIT, self-hosted AI gateway — one OpenAI-compatible endpoint over 237 providers — built around two problems: runs dying on a provider 429, and tokens bleeding on tool/log output.

One endpoint, 237 providers — 90+ of them free. You point any tool or agent at a single OpenAI-compatible endpoint (localhost:20128/v1) and it can reach 237 LLM providers without you rewriting anything. 90+ have free tiers and 11 are free forever (no card), which aggregates to ~1.6B documented free tokens/month — and that's honest, pool-deduped math (we count each shared pool once instead of inflating it; the methodology is public in the repo). There's a one-command setup-* for 13+ coding tools (Claude Code, Codex, Cursor, Cline, Roo, Kilo, Gemini CLI…), so switching your existing setup over takes seconds.

Fallback combos — so it never stops mid-task. A "combo" is a ladder of models the router walks automatically: your subscription first, then API keys, then cheap models, then free ones. When a provider returns a 500 or you hit a rate limit, it slides to the next target in milliseconds, mid-request, and your tool never even sees the error. There are 17 routing strategies (priority, weighted, round-robin, cost-optimized, auto/coding:fast…) plus three resilience layers — a per-provider circuit breaker, a per-key cooldown, and a per-model lockout — so one dead key can't take down a whole provider.

Fusion — an ensemble mode for the hard steps. Beyond simple routing, there's a fusion strategy that fans a single prompt out to a panel of different models in parallel and then has a judge model synthesize one best answer (mixture-of-agents, built in). It's cost-aware, so easy turns stay on one fast model and it only fuses when the step is worth it.

A 10-engine compression pipeline — the part most routers don't have. Every request flows through a transparent compression pass you can toggle/stack per combo. Instead of one trick, it stacks the best of the open-source ecosystem: RTK filters command/tool output (git diffs, test logs, builds) at 60–90%, Microsoft's LLMLingua-2 does ML semantic pruning, Caveman handles prose, session-dedup strips repeats across turns. Critically, code, URLs and JSON are preserved byte-perfect, and a default-on inflation guard throws the compressed version away and sends the original if compressing would actually grow the prompt — it never makes things worse. On tool-heavy sessions that's ~89% average input-token reduction (an 8k-token git diff becomes a few hundred). Full credit to every upstream project (RTK, Caveman, LLMLingua-2, Troglodita) is in the README.

Agent-native — the agent can drive the router itself. There's a built-in MCP server (95 tools across 30 audited scopes, over stdio / SSE / streamable-HTTP), plus A2A (v0.3, JSON-RPC 2.0) support. That means an agent can query providers, switch combos, read its own remaining quota and manage memory through the gateway — not just consume tokens through it.

It's 100% local (zero telemetry, AES-256-GCM at rest), MIT-licensed, has a prompt-injection guard on every LLM route, opt-in memory, and runs on npm, Docker, desktop or your phone via Termux.

For context on whether it's worth your time: it's grown to ~9.8K GitHub stars, 1,490+ forks and 280+ contributors in ~4.5 months, with 21,000+ automated tests and 1,830+ issues closed — so it's a battle-tested project, not a brand-new experiment.

npm install -g omniroute

GitHub: https://github.com/diegosouzapw/OmniRoute · Site: https://omniroute.online

Would value a critique of the routing/compression architecture from this crowd.

u/ZombieGold5145 — 2 days ago
▲ 5 r/mcp+3 crossposts

FINALLY Claude can DoomScroll for me

I've finally had enough. I have wasted too many hours doomscrolling Insta to find reference videos and creators for my niche for my own startup, so I built an MCP to do it for me.

ScrollScanner lets Claude search Instagram, then a vision model actually watches each candidate reel so you can search by format in plain English "single-cut talking head, on-screen text, advertising dog walking software" and only true matches come back, each with the reason it matched. Everything ranks by outlier score (views ÷ the account's own median) instead of follower count, which is great for surfacing micro-creators.

One-line add in Claude Code, or drop the URL into Settings → Connectors on Desktop/web. Would love feedback from this crowd on the tool design and what other research moves you'd want it to make.

Link is scrollscanner.com

▲ 14 r/mcp+2 crossposts

basemind: an MCP server that indexes your repo so agents answer from signatures, not full file reads

I kept watching coding agents answer "what calls this function" by grepping, opening three files, and reading them top to bottom to find four call sites. On a big repo that eats the context window fast.

basemind indexes a repo once and answers structurally. The MCP tools return paths, line numbers, and signatures instead of file bodies, so a lookup costs a fraction of reading the source. What it exposes:

  • Code map (300+ languages): outline, search_symbols, find_references, find_callers, call_graph, find_implementations. An expand escape hatch pulls a single function's full body when the agent actually needs it.
  • Git at symbol resolution: blame_symbol, symbol_history (when a symbol's body changed), recent_changes, diff_outline.
  • Document RAG over 90+ formats with text extraction and OCR built in, plus semantic and full-text search.
  • Shared memory and an agent-to-agent comms channel (rooms, DMs, inbox) for running more than one agent on the same repo.

Runs three ways over one local index: a Claude Code plugin, a plain MCP server, or a CLI. Works with Claude Code, Codex, Cursor, Gemini CLI, Copilot CLI, OpenCode and a few others. Rust, MIT.

On token savings: it ships a heuristic counter (an outline is modelled at about 1/5 of reading the file, a caller lookup about 1/3 of grep plus read). It's an honest estimate, not a benchmark, and tools with no fair baseline (memory, git wrappers) count zero.

Honest limitations: it's an index, so it lags edits between scans. serve watches by default and there's a rescan, but a cold first scan is slower (worst case in my tests is the TypeScript compiler, 81k files, about 18s), and the git-history index costs 6 to 22% of your .git on disk.

https://github.com/Goldziher/basemind

Curious how others here are feeding repo structure to agents over MCP.

u/Goldziher — 1 day ago
▲ 61 r/mcp+40 crossposts

Ask questions across your Markdown notes using a fully local Graph RAG engine. Built for Obsidian vaults, works with any folder of Markdown files. Extracts entity-relation triples from wikilinks & YAML frontmatter, retrieves answers via hybrid search (vector + BM25 + temporal). Multilingual. No cloud. Runs on Ollama.

https://github.com/benmaster82/Kwipu

u/WritHerAI — 2 days ago
▲ 8 r/mcp

Is there actually a “best” MCP gateway yet, or is everyone just solving different halves of the problem?

Spent the last few weeks trying to answer this for our own stack and came away thinking the question itself is slightly wrong right now. Docker’s mcp gateway is genuinely nice for local dev - container isolation per server, credential handling baked into docker desktop but it’s not really built for cross-team, crossregion enterprise governance. The community mcp-gateway-registry project is solid if you want to bring your own keycloak/entra OAuth and don’t mind assembling the pieces yourself. Kong shipped an mcp layer as part of their broader ai gateway, which makes sense if mcp is one traffic type among several you already govern with Kong, but feels heavy if mcp is your only concern. Truefoundry approaches it as identity-and-token-scoping first, resolving agent identity separately from user identity and minting scoped tokens per mcp server which matters a lot once you have agents acting on behalf of users, less if you’re still single-user, (this is the one I ended up using for my team)

The honest answer is, the “best” depends on whether your problem is discovery (which servers exist), governance (who can call what), or just getting something running fast for a demo. I think, what problem people are others facing and how are you actually optimizing for that seems to determine the right answer more than any feature checklist does..

reddit.com
u/Background-Job-862 — 2 days ago
▲ 35 r/mcp+11 crossposts

Multi-model consensus debate via the filesystem. LLMs propose, peer-review, rebut, vote and synthesize a group-confirmed answer. CLI + MCP.

github.com
u/raiyanyahya — 2 days ago
▲ 9 r/mcp

How are you handling auth and billing in your MCP servers?

Building an MCP server for the first time and I'm stuck on the boring parts — OAuth 2.1, API key management, usage metering, Stripe integration.

I expected the hard part to be the actual server logic. Turns out it's wiring all the auth and billing infrastructure around it. Took me way longer than expected and none of it made the server itself any smarter.

Curious how others are handling this:

- Are you rolling your own auth from scratch?

- Using any existing boilerplate or template?

- Just skipping monetization entirely for now?

Would love to know what's working (or not working) for people.

reddit.com
u/OpeningSir9287 — 2 days ago
▲ 8 r/mcp+1 crossposts

Cross agents assistance/memory layer - ideal solution

My first post in a while, so bare with me.
A bit about myself, exited a company on 2023. worked since on Software architecture, and in the last couple of years, around the AI architecture to make an organization (R&D mostly) utilize AI in a better way.

In a recent project i did, i was requested to build a knowledge layer for a small startup (10 R&D employees). I researched quite extensively (Supermemory, etc.) but all seem like something that won't sustain and won't be called by the devs in their agents.

Another issue was that even if it works, how would we utilize it for other agents like a KB slackbot that their sales team use, or an SRE bot that need to decide if an event it seen in the logs is a bug or a feature?

So bottom line, the project is somewhat a success, somewhat a failure. Not something i'm proud of. Which got me into thinking on how to effectively capture and share context across the organization with zero/minimal burden to people?

What i envision is how we did buddy training for a new employee (back in the old days...), we would sit a new employee next to a senior one (who likes it or not), and let them look how it work and ask questions.

  • Taking notes on design choices
  • How to troubleshoot some problems
  • How to raise a local environment
  • Where to look for the ticket
  • What is a known issue that we should tackle later after we do X
  • What dashboard in Grafana has the important logs about this system
  • etc.

But instead of putting a person next to the developer, there is already an AI agent working with it.

Such a system (and i need your help on defining it❤️) would:

  • Work on every agent type: coding, internal bot, framework, etc.
  • Capture and recall memories natively during the conversation with the AI agent
  • Capture and recall needs natively
  • Create and optimize workflows (skills) natively as we activate and feedback these workflows
  • Promote/Graduate memories/needs/skills from a local level to team/org level as they mature and get more traction
  • Share the collected memories/needs with other agents (plugin?)

Basically, doing compound knowledge growth via the conversations with AI agents

Would be happy to hear your thoughts.

reddit.com
u/Yarharel — 3 days ago
▲ 1 r/mcp

Question: does improving MCP security can drive adoption?

From one side, I'm reading threads about building local vs remote MCP, also how a large amount of remote MCPs are not using any authentication-authorization, or using static keys/tokens instead of OAuth. I imagine each builder may have different reasons on why.

I'm also seeing in enterprise companies the IT/security team blocking custom MCP access and going through a process to unlock MCPs to employees, where they assess the MCP security, scope, documentation,...

When building an MCP how much not building it remote and using OAuth can impact the overall adoption, considering enterprise users may have additional friction to access it.

I imagine the target audience for the MCP can influence the answer, but curious on any learning experience or thoughts?

reddit.com
u/Additional_Fig_9234 — 2 days ago
▲ 9 r/mcp

I'm going to let Claude run a real $100k portfolio through an MCP server I built. Help me not blow it.

For starters I'm a software engineer with basically zero quant experience.

I work on a product is built around alternative data for researching stocks, think social media, hiring data, insider and congress trades, web traffic, that kind of stuff. We've been collecting it for about five years. It's pretty well established by now in the investing space that the right alternative data has an edge. A model built on nothing but credit card data out of MIT beat the analysts' consensus 57% of the time. Changes in Glassdoor ratings have led forward returns by about 10% a year in peer reviewed work. We've had some institutional interest, but we've never once traded on our own signal.

So I want to. And I want Claude to run it.

The plan is to wire Claude to two things. An MCP server I built that exposes all this alt-data across a few thousand US names, and an Alpaca brokerage account for execution. Claude pulls the signals through the MCP tools, figures out what fits the strategy, and places the trades through Alpaca. I think a lot more people are about to start building LLM driven strategies, and I'd rather learn it in public with real money on the line than paper trade it.

If I land on a strategy I actually believe in, my company will even fund it with $100k for three months and we'll post some updates around it.

Here's the rough starting point. Please pick it apart:

- Universe: liquid US equities, 2B+ market cap, ~3,000 tickers
- Signals: social sentiment and mention volume (Reddit, X, Stocktwits), insider buying, congress trades, hiring acceleration, web traffic and wikipedia pageviews, plus some fundamentals
- 10 names, equal weight
- Entry: 3+ signals fire and hold across 2 weekly reads, so I'm not chasing one print
- Exit: 2+ of those signals reverse
- Rebalance weekly, only act on a trigger
- Benchmark: QQQ

The part I actually want help on is how to run it. My plan is to put Claude on a weekly routine that pulls the signals, decides the changes, and sends the orders to Alpaca, If you've set up a recurring Claude agent that touches a real API or real money, I'd love to hear how you did it and what broke.

Happy to get into the MCP side too. If anyone wants to know what the server exposes or how Claude actually uses the tools, ask and I'll go deep on it.

reddit.com
u/CoolioBeansTTV — 3 days ago