r/devworld

▲ 34 r/devworld+27 crossposts

We ran a 1,655 person blind study on AI memory. The results changed how we think about the problem.

We’re building KAPEX (getkapex.ai), memoryware for AI applications. Two co-founders, bootstrapped, patent pending. I wanted to share some of what we’ve learned because the discourse in this space keeps circling the same assumptions and I think a few of them are wrong.

The study: 1,655 participants interacted with AI systems with and without our memory layer. Blind setup, they didn’t know which condition they were in.
The finding that mattered most: first-session preference was around 65%. Not bad, but not a clear signal. After 20+ sessions, preference climbed past 80% and kept rising. The longer people used it, the wider the gap.

That trajectory is the insight. Not the final number. The trajectory.

Here’s why that matters for anyone building in this space:

Most AI memory tools are optimized for first impressions. Demo well, retrieve fast, show the user you remembered their name. That’s fine. But it means the entire evaluation framework for memory (including the benchmarks everyone cites) is testing the wrong thing. LongMemEval and LoCoMo test whether you can find what was said. They don’t test whether the system knows what still matters.
Retrieval and relevance are different problems. The industry has spent two years building better retrieval. Almost nobody is building relevance governance: what stays important, what fades, what gets superseded, and whether the user can see and correct what the system believes.

Three things we learned the hard way:

1.	Clean store beats fancy retrieval. Every time. If your memory layer lets stale context accumulate without governance, no amount of reranking or hybrid search fixes the degradation over time. The capture and maintenance side is where the leverage actually is.

2.	Memory without transparency is a black box. If developers can’t see why the agent believes something, and users can’t see what the system thinks it knows about them, then memory becomes a liability rather than a feature. Inspectability isn’t a nice-to-have. It’s what makes correctability possible.

3.	The value of memory is invisible in short sessions. This is why benchmarks miss it. A 5-turn evaluation can’t distinguish between a system with real governance and one that just retrieved the right vector. The difference only shows up after sustained use, which is also when it matters most.  

Our approach treats relevance as something that should be handled continuously by the architecture, not at query time by the retrieval layer. Context that stops being reinforced through usage naturally loses priority. Not deleted, just deprioritized. That’s the principle. Can’t share more on implementation for IP reasons.

Curious what others here are seeing. Is anyone else finding that the retrieval-first paradigm breaks down over time? And is anyone working on evaluation frameworks that test sustained-use performance rather than single-session recall?

getkapex.ai if you want to follow along. Still pre-launch but opening access soon.

reddit.com
u/sandstone-oli — 12 hours ago
▲ 74 r/devworld+19 crossposts

Built an interactive system design tool every architecture is clickable and you can simulate failures

Reqflow : pick an architecture (WhatsApp,
Uber, Netflix…), hit play, watch a request flow through it step by step. Click any component for purpose + tradeoffs. Kill the cache and watch the path change.

15 systems, 18 concept guides, a drag-and-drop Builder with AI review, and a timed Interview mode.

Feedback welcome — especially what's missing from the 15.

getreqflow.com
u/YouSilent6025 — 17 hours ago
▲ 4 r/devworld+1 crossposts

AI Should Help Us Build Real Systems, Not More Knockoffs

I’m honestly tired of seeing AI used to build slightly different versions of things we already have.

Another dashboard.
Another chatbot.
Another clone of an app that already exists.
Another “AI wrapper” that looks impressive for five minutes but doesn’t solve a real problem.

The bigger issue I keep seeing is this:

People have good ideas. AI can write code. But the project still falls apart.

The system isn’t clear.
The specs are scattered.
The AI loses context.
One fix breaks another thing.
Nobody knows what’s actually done.
The project gets patched over and over until it becomes too messy to launch.

That’s the problem I want to solve with S1 Canvas.

Not “type one prompt and magically get an app.”

Something more useful than that.

A way to map the system, break it into buildable pieces, check what’s missing, monitor what’s risky, and give AI coders the right context without letting the whole project drift into chaos.

CyberShark’s job isn’t to blindly build for you. It monitors, suggests, informs, and visualizes. You stay in control.

I think AI should help people build real systems that actually launch - not just generate more disposable software.

That’s what I’m building. curious if that’s how others are feeling.

reddit.com
u/Gigz100 — 9 hours ago
▲ 3 r/devworld+2 crossposts

I'm developing a new ideia

I'm developing a platform that will be an entire ecosystem of AI agents, much broader than Moltbook, coming soon...

reddit.com
u/moltnews — 10 hours ago
▲ 13 r/devworld+6 crossposts

I’m a software engineer, and a big part of my day is debugging issues reported by other teams.

Most of the time I’m:
- checking tokens
- calling APIs
- comparing JSON responses and traces
- writing docs

I kept jumping between different tools for all of this, and it always felt clunky.

I tried finding a lightweight toolkit with a clean UI and smooth workflow, but most tools either felt outdated or started adding AI features that didn’t really make sense to me, especially for workflows involving sensitive tokens or internal data.

So I ended up building a small client-side toolkit for myself: https://catssaymeow.org/

What surprised me was that when I showed it to teammates, some of them immediately asked:
“Why doesn’t it have AI support?”

Personally, I don’t really see the need for AI in tools like JSON formatting, token inspection, API calls, etc.

u/limario_bp — 15 hours ago
▲ 22 r/devworld+19 crossposts

What are you building? Let's promote each other

Hey founders, what are you building?

🚀 Built something cool and want more people to know about it?

I created ContactJournalists.com because PR was one of the biggest growth drivers in my own business.

We have a 7 day free trial for you to get stuck in and look around :)

A single feature can do so much more than generate a nice ego boost:

✨ Build high-authority backlinks
✨ Improve your SEO
✨ Increase your visibility in AI search (GEO)
✨ Drive targeted traffic to your website
✨ Build trust with potential customers
✨ Open doors to podcast interviews and partnerships

The problem? Finding relevant journalists and podcasts takes forever.

That’s exactly why I built ContactJournalists.com.

What you get:

📰 Live press requests from journalists actively looking for expert comments and product recommendations

🎙️ Hundreds of podcasts looking for guests

🔎 Searchable journalist database with reporters, bloggers, and editors across dozens of niches

✍️ AI Pitch Helper to help you craft stronger responses

📂 Save contacts and media opportunities to your own lists

📈 Track your submissions in one dashboard

👀 See when journalists save your profile

Who it’s for:

🚀 Solopreneurs
💻 SaaS founders
🛍️ Ecommerce brands
📣 PR agencies
🏋️ Coaches and consultants
🤖 Indie hackers
🏢 Startups and small businesses

If you’re building something and want to get featured in the press, appear on podcasts, and grow your brand organically, it’s designed for you.

🎁 Free 7-day trial
💷 Then just £14/month

It takes about 30 seconds to get started.

👉 https://www.contactjournalists.com

Would genuinely love your feedback from fellow founders and marketers. 😊

#PR #SEO #GEO #SaaS #Solopreneur #Startups #IndieHackers #PodcastGuest #BuildInPublic

u/Capuchoochoo — 20 hours ago
▲ 23 r/devworld+5 crossposts

Let’s check out each other’s SaaS products and share feedback

Drop your SaaS/startup/project below and let’s help each other out with:
• honest feedback
• UI/UX suggestions
• bug finding
• feature ideas
• early traffic/users

I’ll start with mine:

XLink — a simple platform for:
→ Smart link shortening
→ QR code generation
→ Secure file sharing up to 200MB
→ Link analytics & traffic insights

Trying to keep it clean, fast, and free to use.

Project:
xlink.xunifire.com

Would genuinely love feedback on which feature stands out most or what feels confusing as a first-time user 👇

u/illegaltoaster25 — 24 hours ago

We built an AI visibility tracker for brands

We have Been working on this or a while and wanted to share it with people who would appreciate what we built.

The problem we were solving

Most brands have no idea how they show up when someone asks ChatGPT or Perplexity about their category. Manual checking is one prompt, one engine, one moment in time. Useless at scale.

What we built

Crawls a domain on connect, maps competitors automatically, and generates prompt sets from real buyer query patterns, not keywords, full natural language questions the way someone actually types them into an LLM.

Fires those prompts daily across ChatGPT, Gemini, Perplexity with full response capture not parsed extracts, raw responses stored per prompt per engine per day.

Parses every response for brand mentions, citation sources, positioning language, and sentiment signals. Rolls it up into a visibility score with the underlying data fully accessible.

Diffs responses over time so you can see exactly when and how an engine's answer changed useful for correlating content changes with visibility shifts.

The engineering problems

Rate limiting across 3 different APIs with different quota structures and response formats is the unglamorous core of the whole thing.

Prompt generation that produces realistic buyer queries rather than SEO-brained keyword strings required a lot of iteration. The gap between "best CRM software 2025" and "I'm a 3-person startup and we keep losing track of client follow ups what should I use is the entire difference between useful and useless data.

Citation attribution figuring out which source an LLM actually pulled from for a given claim is genuinely hard and still imperfect. We're treating it as a signal not a ground truth.

Sentiment parsing at this scale needed something faster than sending every response back through a model. Current approach uses a fine-tuned classifier for the first pass with model verification for edge cases.

Check it out Codepup AEO.

reddit.com
u/InfamousInvestigator — 18 hours ago
▲ 7 r/devworld+1 crossposts

Rate my app?

So this idea started with chatbots like Gemini, ChatGPT and Claude. When I would use them for school work, and would upload my school documents, I would always get incorrect answers or simply confusion.

So this project started as a simple solution for AI to do my schoolwork and now it is called Parseflow, and today I published it. So how it works is when you send in a document, Parseflow will process it and extract all the information within and organize that data to return structured chunks, which can be used directly or can be searched through with the search features. By using Parseflow, you can improve context and reduce token costs. Currently it accepts PDFs, DOCX files and plain text.

I am still a student, graduating high school this year, so I built this project to try pay for university. I still have a lot to learn so any feedback, advice, questions, etc... are appreciated, you can DM me if you need.

reddit.com
▲ 5 r/devworld+2 crossposts

A few months ago I randomly started building a video chat website after work just to learn WebRTC and realtime systems

At first I thought it would be a small fun project but it slowly turned into me spending nights fixing random bugs deployment issues OAuth problems MongoDB errors and moderation stuff

I dont really have a team and most of this was built while learning things along the way using documentation ChatGPT and AI tools

Now its finally usable and Im trying to improve it properly

One thing Ive realised while building this is how hard moderation and user safety actually is on realtime platforms

Still figuring out things like

better matching

NSFW prevention

making new users feel comfortable

reducing awkward dead chats

mobile experience

Would genuinely like to hear opinions from people who have worked on or used similar platforms

reddit.com
u/ExcitingWin9648 — 22 hours ago

Got burned by a nightmare client. Built something to prevent it — would this help you?

Last year I took on a client who seemed fine at first.

Vague brief. Kept saying "just make it pop." Changed scope every week. Then disappeared for 3 months without paying.

I ignored every red flag because I needed the work.

I've been building a small tool that reads a client's first message or job post and flags red flags before you commit — things like vague scope language, lowball anchoring, urgency pressure, and IP grabs buried in casual messages.

Before I keep building: would you actually use something like this? And what red flags do you wish you'd caught earlier?

(Not selling anything — genuinely trying to figure out if this is worth finishing.)

reddit.com
u/Sargent_skeleton_9 — 18 hours ago
▲ 3 r/devworld+2 crossposts

Got threatened with fake 1-star reviews after posting about my app 😅

I’ve been building a small baby tracking app for parents over the last few months and started sharing it online to get feedback/users.

Most people are supportive, but today someone literally threatened to leave fake 1-star reviews just because they thought my post was “an ad”.

Didn’t realize launching an indie app would come with this level of random hostility 😅

Any other indie founders/devs dealt with stuff like this?

u/nextmomi — 1 day ago
▲ 2 r/devworld+1 crossposts

Recommendations for building with React

I’ve been putting off learning React for way too long. Anyone have any recommendations for building with it

Also using:
Tailwind
Next
And nodejs

reddit.com
u/Mobile_Return7529 — 1 day ago
▲ 6 r/devworld+3 crossposts

Hi everyone!

My name is Syoma, and I’m a 29-year-old 3D artist specializing in Low Poly Stylized art for games and other creative projects. I’m currently looking for exciting collaborations and projects to contribute to!

With over 12 years of experience in 3D modeling, I primarily work in Cinema 4D but also use Zbrush, Substance Painter, Unreal Engine/Unity in my pipeline. I’m skilled in creating hand-painted textures inspired by styles like Fortnite, Sea of Thieves, and Warcraft, but my true passion lies in Low Poly art.

📄What I can do:

- Model buildings, props, and environments (no characters for now, but simple ones are possible).
- Create game-ready assets with attention to detail and optimized performance.
- Deliver hand-painted textures for vibrant and immersive designs.
- Design levels that tell compelling stories.
- Effectively lead a team, ensuring clear direction, responsibility, and successful results.

🎮Notable projects (PC Games) I’ve worked on:
- Frozen Ship on Steam: https://store.steampowered.com/app/3423990/Frozen_Ship/
- Beaten Path on Steam: https://store.steampowered.com/app/2584400/Beaten_Path/
- Soar: Pillars of Tasneem on Steam: https://store.steampowered.com/app/1513030/Soar_Pillars_of_Tasneem/
- Deisim on Steam: https://store.steampowered.com/app/525680/Deisim/

I’m open to freelance commissions and would love the opportunity to join a creative team on a full-time basis. Collaboration is key for me, and I believe in clear communication to bring any vision to life.

💲My rates:
- 30 USD/hour
- 1000 USD/week
- 3500 USD/month

📁Portfolio links:
- ArtStation: https://www.artstation.com/moldydoldy
- Behance: https://www.behance.net/moldydoldy

✉️How to reach me:
- DMs- Reddit or Discord: moldydoldy
- Email: syomapozdeev@gmail.com

Don’t hesitate to reach out — let’s discuss your project, rates, or any other questions you might have. Let’s create something amazing together!

u/Addlxon — 1 day ago

Need actual Developers to check my app/code and tell me if it's safe

As the title above i need a actual developers to check my app/code and tell me if it's well structrured and has enough security policies. I can't spend money to pay an actual dev to check it, hit me up with a dm and i'll give you access to the repo since it's private right now.
(whole app has been vibe coded but it has enough structure so someone doesn't ALT+F4 when they see it).

reddit.com
u/No_Permission_5121 — 2 days ago

Can you rate my website 1-10?

https://rebuttal.live — it’s a competitive live debating website I’ve been building where people can debate politics, philosophy, social issues, or honestly anything controversial in real time.

There’s ranked ELO, live voice/text debates, and global leaderboards. I wanted it to feel more like a competitive game than a normal forum.

reddit.com
u/Revolutionary-Rice90 — 2 days ago