u/MrBemz

Why are indian Ceos so Dumb? Over promise and understand delivery

Real incident that I faced, Ceo goes on stage says we are going all in on Ai and hypes everyone up saying we will print a bajjilion dollar.

Then the product and ops team got excited thinking we finally got approval and funding for projects we wanted to implement.

But then Legal, InfoSec and compliance team start putting up blocks, slowing approval and finally they say forget about it and such.

Like bruh

Put money where your mouth is.

Use some open source tool like lyzr governor or smth to protect your data? Maybe do local hosting? Build ur own chatgpt wrapper idk man

But honestly? Legal is right to block it

Throwing so much sensitive data especially financial data into a traditional SaaS platform is just asking for compliance nightmare. Yes even in india we have to follow some level of compliance.

Me and my team have been thinking of solutions

One thing we came up with is that instead of chatboxes we should pivot to Sovereign Infrastructure.

Our goals are

  1. 100% data sovereignty

  2. Infrastructure Level PII redaction

3 . Zero Vendor Lock in \ Multiple Model Agility using lyzr studio

Any thing we are missing? Or should change?

How do u guys convince higher up ?

Can't name company

reddit.com
u/MrBemz — 14 hours ago

Frame work for getting Enterprise Ai Features past InfoSec

Idiotic CEOs love giving mandates to "implement AI," but the moment you try to move a pipeline to production, THEY FREAK OUT AND START SCREAMING STUFF LIKE Legal and Compliance AND THAT kill it because they are either terrified of data leaks, hallucinations, and compliance audits or dont ACTUALLY WANT AI just the tag to pump their stock up or are just following hype. Over the last few quarters handling enterprise facing data infrastructure, I've had to map out a repeatable playbook to get these projects approved.

HERE ARE 4 METHODS TO TRICK YOUR USELESS AI COMPLIANCE MANAGER

1.LIE AND GASLIGHT THEM

NO NOT RLY BUT LIKE Build the "VPC-First" Architecture Shield Never tell an enterprise client Or (YOUR MANAGER, NEVER EXPLAIN HOW IT WORKS SO THEY CANT REPLACE YOU) that your application relies on public APIs or shared endpoints. The moment you say "OpenAI public endpoints," your deal is dead. PEOPLE HEAR BRAND NAME AND THINK THEY CAN DO IT THEMSELVES SPOILER -> THEY CAN'T Your architectural pitch must be centered around isolated cloud environments (AWS/GCP VPCs) where zero data leaves their perimeter to train public models.

Just keep dropping technical words that sound smart

  1. Move from "Chatboxes" to Deterministic Workflows

Enterprise buyers hate chat windows. They view them as a massive liability because users can prompt-inject them or get unpredictable answers. Instead, frame your AI strategy around background processing loops. The AI works in the background, runs an audit/validation loop, and only outputs verified, clean data straight to their internal dashboards. Fewer inputs = lower risk.

ALSO GASLIGHT THEM INTO THINKING THAT THEY CAN REPLACE THEIR WHOLE CONSUMER SUPPORT WITH AI JUST GASLIGHT

  1. Establish "Human-in-the-Loop" Webhooks Upfront

Do not give an autonomous agent final API execution authority over high-risk actions (like moving money or editing user databases). Build asynchronous pause-states natively. When an agent calculates an outcome, it triggers an internal Slack or email approval button to a manager. The execution halts until a human clicks "Approve." InfoSec teams love seeing this manual circuit breaker. ALSO LETTING THE CLIENT PRESS THE APPROVAL BUTTON HELPS WITH THEIR EGO

Create an Audit Trail via Prompts-as-Code

  1. Compliance teams (especially under SEC or FINRA guidelines) need to be able to audit why a system made a specific decision. Treat your system prompts and agent rules like production code. Use version-controlled repositories so that if an agent's behavior shifts, your legal team can visually inspect a git diff to see exactly what changed in the underlying system rules. Stop selling the "magic" of AI to corporate stakeholders. (AND START GASLIGHTING THEM) Sell the guardrails, isolation, and predictability.

Drop your fav workaround that u use

reddit.com
u/MrBemz — 1 day ago

4 step framework for Enterprise Ai features

Idiotic CEOs love giving mandates to "implement AI," but the moment you try to move a pipeline to production, THEY FREAK OUT AND START SCREAMING STUFF LIKE Legal and Compliance AND THAT kill it because they are either terrified of data leaks, hallucinations, and compliance audits or dont ACTUALLY WANT AI just the tag to pump their stock up or are just following hype.

Over the last few quarters handling enterprise facing data infrastructure, I’ve had to map out a repeatable playbook to get these projects approved.

HERE ARE 4 METHODS TO TRICK YOUR USELESS AI COMPLIANCE MANAGER

  1. LIE AND GASLIGHT THEM

NO NOT RLY BUT LIKE

Build the "VPC-First" Architecture Shield

Never tell an enterprise client Or (YOUR MANAGER, NEVER EXPLAIN HOW IT WORKS SO THEY CANT REPLACE YOU) that your application relies on public APIs or shared endpoints. The moment you say "OpenAI public endpoints," your deal is dead.

PEOPLE HEAR BRAND NAME AND THINK THEY CAN DO IT THEMSELVES

SPOILER -> THEY CAN'T

Your architectural pitch must be centered around isolated cloud environments (AWS/GCP VPCs) where zero data leaves their perimeter to train public models.

And just spam the f out of them with jargon

Just keep dropping technical words that sound smart

  1. Move from "Chatboxes" to Deterministic Workflows

Enterprise buyers hate chat windows. They view them as a massive liability because users can prompt-inject them or get unpredictable answers.

Instead, frame your AI strategy around background processing loops. The AI works in the background, runs an audit/validation loop, and only outputs verified, clean data straight to their internal dashboards. Fewer inputs = lower risk.

ALSO GASLIGHT THEM INTO THINKING THAT THEY CAN REPLACE THEIR WHOLE CONSUMER SUPPORT WITH AI

JUST GASLIGHT

  1. Establish "Human-in-the-Loop" Webhooks Upfront

Do not give an autonomous agent final API execution authority over high-risk actions (like moving money or editing user databases). Build asynchronous pause-states natively. When an agent calculates an outcome, it triggers an internal Slack or email approval button to a manager. The execution halts until a human clicks "Approve." InfoSec teams love seeing this manual circuit breaker.

ALSO LETTING THE CLIENT PRESS THE APPROVAL BUTTON HELPS WITH THEIR EGO

  1. Create an Audit Trail via Prompts-as-Code

Compliance teams (especially under SEC or FINRA guidelines) need to be able to audit why a system made a specific decision. Treat your system prompts and agent rules like production code. Use version-controlled repositories so that if an agent's behavior shifts, your legal team can visually inspect a git diff to see exactly what changed in the underlying system rules.

Stop selling the "magic" of AI to corporate stakeholders.

(AND START GASLIGHTING THEM)

Sell the guardrails, isolation, and predictability.

reddit.com
u/MrBemz — 1 day ago
▲ 3 r/rust

Is it worth Rewriting a high performance Go web crawler in rust for RAG data ingestion? (update)

Appreciate the feedback on my last post about whether I should rewrite the web crawler layer of my local RAG system from Go to Rust. Someone rightfully called me out for not explicitly defining my project goals and looking for a "how" in search of a "why."

To clear up the confusion: this isn't just a textbook learning project for fun (Goal A), nor is it an enterprise application with thousands of active users that I'm terrified of breaking (Goal B).

This is a self-hosted, personal production stack designed to ingest thousands of dynamic, JS-heavy data sources. My operational goals are system reliability, predictable long-term maintenance, and memory efficiency under heavy parallel loads.

Right now, the architecture is split:

Ingestion Layer: Built on Lyzr-Crawl (github.com/LyzrCore/lyzr-crawl). It’s a dedicated open-source Go engine that handles the heavy lifting—headless JS rendering, parallel URL discovery, and clean Markdown extraction so my LLM context windows don't get flooded with raw HTML/CSS bloat.

Core Pipeline Layer: A custom Rust stack utilizing tokio for heavy async coordination, text splitting, chunk embeddings, and streaming updates to a local vector store.

The Go binary is incredibly fast out of the box due to native goroutines, and it successfully saves me from paying insane per-page SaaS scraping API fees. But running a split-language stack introduces a lot of micro-frustrations that are pushing me toward a full Rust unification:

The Maintenance "Why": Debugging across a language boundary sucks. Passing structured data from Go's runtime over IPC/gRPC into Rust's async environment introduces extra serialization overhead and means I'm maintaining two completely different error_handling mental models. If a network timeout or headless crash occurs inside the crawler, bubbling that up deterministically to my Rust logic is clunky.embedding models.

The Resource "Why": Since I run this pipeline locally on a single machine alongside the LLM/Vector store, every megabyte counts. Go’s garbage collector is aggressive under high concurrent I/O, causing random memory spikes. I want Rust's zero-cost abstractions and strict memory predictability so the ingestion layer doesn't choke out the resources needed for local embedding models.

So my specific technical question for the sub remains:

If my goal is long-term stability and rock-solid error handling across the entire pipeline, is it worth writing a custom, production-grade equivalent to Lyzr-Crawl using reqwest + headless_chrome in Rust? Or will the sheer development overhead of rebuilding highly optimized, multi-threaded Go crawling primitives from scratch outweigh any performance and architectural synchronization gains I get from a single-language stack?

Also likes being efficient is good overall

Like I need a good mixture of efficiency and reliability

English isn't my first language as u can tell from my previous post where I messed up some verbs so I first wrote my post in a notepad and then ran it through grammarly this time around

Oh also what would happen if I switch over to watercrawl from lyzrcrawl?

reddit.com
u/MrBemz — 3 days ago

Made my own tracker

Rn just me and some close friends are using it to beat the scalpers

Method -> used WaterCrawl to scrape api and then used Lyzr Architect to make a multi agent setup and at last implemented a discord webhook for notifications and front end. Got a server and some proxies and thats about it.

u/MrBemz — 3 days ago