u/founders_keepers — reddlx

▲ 7 r/singularity

Firms that adopt AI grow headcount 10.2% over the two years following adoption

Key takeaways:

Firms that adopt AI grow headcount 10.2% over the two years following adoption, but these gains are entirely driven by high-intensity adopters. Low-intensity adopters see no statistically significant change.
Entry-level headcount grew even faster. At the companies making the largest AI investments, entry-level headcount grew 12% over the two years following adoption.
AI adoption and the associated gains are unevenly distributed. AI adopters are already larger, more engineering-intensive, more likely to be venture-backed, and faster-growing than non-adopters. These firms then grow faster upon adoption.

All three findings are extremely counterintuitive.

ramp.com

u/founders_keepers — 5 days ago

▲ 9 r/ZaiGLM

not hitting 200+ tok/s anymore in serverless

Cool to see providers advertising these top speed numbers, but once you get in the door you actually barely get 50 tok/s.

It's hard to get a real sense of the speeds under normal usage. What I'm confused about is how these benchmarks are measured. From Chat:

>Imagine a GPU is like a school bus delivering kids to school. Every time the bus leaves, there's a fixed cost whether it carries 1 kid or 50 kids. A batch size is just how many requests (or tokens) you put on the bus before sending it. If the batch is too small, the GPU spends a lot of time driving half-empty, so it's inefficient and expensive. If the batch is larger, the GPU can process many requests at once, making each one cheaper because they share the work. The downside is that the first request may have to wait for the bus to fill up, increasing latency. So inference systems constantly balance batch size: small batches give faster responses, while larger batches maximize throughput and reduce cost.
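The bus analogy above can be turned into a toy calculation. This is only a sketch under made-up assumptions: the function `batch_stats` and the numbers `fixed_ms`, `per_request_ms`, and `arrival_gap_ms` are hypothetical and do not describe any real provider or GPU.

```python
def batch_stats(batch_size, fixed_ms=20.0, per_request_ms=1.0, arrival_gap_ms=5.0):
    """Toy model of batching: every GPU step has a fixed cost plus a per-request cost.

    All default numbers are illustrative assumptions, not measurements.
    """
    # Cost of one "bus trip": fixed overhead plus work for each request on board.
    step_ms = fixed_ms + per_request_ms * batch_size
    # The first request has to wait for the other batch_size - 1 requests to arrive.
    wait_ms = arrival_gap_ms * (batch_size - 1)
    return {
        "cost_per_request_ms": step_ms / batch_size,
        "first_request_latency_ms": wait_ms + step_ms,
    }


if __name__ == "__main__":
    for size in (1, 8, 50):
        print(size, batch_stats(size))
```

In this model, larger batches lower the cost per request while raising the latency seen by the first request, which is the trade-off the quote describes.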

So true speed is mostly hard to measure. OpenRouter's numbers are much more accurate because they draw on larger sample sizes and more batches.

Either way, I'm excited to see 200+ tok/s actually working in serverless, and holding for weeks.

u/founders_keepers — 11 days ago

▲ 3 r/betpandacasino

[MEGA THREAD] Betpanda website down

Website has been down since 8:00 PM UTC

Keep all discussion about the status here please. Will update as soon as we hear more from the support team.

u/founders_keepers — 11 days ago

▲ 40 r/ZaiGLM

Competition is getting fierce

img1 Wafer fast on Vercel AI router $10/$3/0.5 - sustained 150+ tps
img2 Vercel AI router - provider comparisons
img3 Openrouter provider comparisons

u/founders_keepers — 12 days ago

▲ 13 r/CFO

Why ‘Tokenmaxxing’ Is Out And ‘Valuemaxxing’ Is In

Very interesting Forbes article published a few weeks ago, and currently a hot topic among companies I'm connected to.

AI companies have pushed "tokenmaxxing": measuring dev productivity by how many AI tokens developers burn. Predictably, this was a financial disaster.

Uber reportedly blew its entire AI budget in four months, and one company accidentally torched $500 million in one month on uncapped employee Claude licenses.

Now, the pendulum is swinging to "valuemaxxing." Microsoft announced it is cutting unchecked subscriptions and demanding ROI. Imo the days of treating expensive compute as a vanity metric are over, and the focus is officially back on basic finance 101: removing redundancies and setting strict usage caps.

How have these conversations gone when you've spoken to your tech leaders? What's the best way to think through this?

forbes.com

u/founders_keepers — 12 days ago

▲ 121 r/ZaiGLM

GLM5.2 is blowing my mind right now 300+ TPS

it's definitely as good as opus 4.8

mostly coding loops

300+ TPS

u/founders_keepers — 18 days ago

▲ 0 r/ZaiGLM

50% off tokens code REDDIT2X

Used their pass until it got killed. Still really fast, so I'm going to test it out for the next month or so. Might as well.

u/founders_keepers — 26 days ago

▲ 25 r/ZaiGLM

Might give this a go, it was damn fast.

u/founders_keepers — 1 month ago

▲ 30 r/legaltech

Legal AI has a growing token price problem?

So much conversation going on about the end of tokenmaxxing.

Just this week in the news (not in legaltech):

Uber capping token spend to $1500/month per dev
Sam Altman admits AI token costs are becoming 'a huge issue'
Ramp raises $750M on backs of helping CFOs solve token burn issue
Microsoft is canceling most internal Claude Code licenses

Any thoughts from legal professionals in this space?

u/founders_keepers — 1 month ago

▲ 43 r/BuyItForLife

Washable rug that lasts?

Got this at Marshalls for my standing desk, so it's in a pretty high-traffic spot, and it's my dog's favorite day spot = mess collector.

Unfortunately the washing part is kind of a gimmick: the stains are very obvious and don't come out.

This was only $50 for a 6 x 7, and I'm willing to go up to $400. Not sure about Ruggable. I know people who bought them, and they're alright for the price but definitely not buy-for-life material.

u/founders_keepers — 1 month ago

▲ 4 r/ZaiGLM

Comparing Wafer with other token based plans

Well, good things never last. Wafer pass is no more and they've switched to a per token billing model. Thoughts?

u/founders_keepers — 1 month ago

▲ 4 r/devsecops

Upwind AI Agents for Cloud Security

upwind.io

u/founders_keepers — 1 month ago

▲ 17 r/ZaiGLM

GLM 5.1 comparison on wafer pass vs zai

Both plans compared. There are hiccups, but overall it's decent enough to be used as my primary provider. I've been thinking about cancelling the legacy plan, but I'll hold off for just a while longer.

u/founders_keepers — 2 months ago

▲ 104 r/betpandacasino+8 crossposts

r/Ramp Giveaway: We're giving away 2 FIFA World Cup 2026 tickets!!!

We're giving away two tickets to a FIFA World Cup 2026 group stage match here on r/Ramp!!! Entering takes all of 30 seconds.

How to enter:

Join the r/Ramp subreddit
Leave a comment on this post!

That's it. One comment per username = one entry. The winner will be selected at random.

Details:

Entry deadline: May 21, 2026 at 5:00 PM PT
Winner will be notified via Reddit DM from u/ramplovesyou and must respond within 7 days

No purchase necessary. Open to US residents 21+, excluding NY, FL, and RI. Full terms and conditions.

u/Hot-Yogurtcloset5181 — 1 month ago

▲ 11 r/opencodeCLI

Opencode + Cheap DS V4 Pro on Wafer Pass vs Other providers

Extension of my last post about GLM here.

Not there yet in terms of token output per second, but for a flat fee of $10/wk you can set it on background jobs.

u/founders_keepers — 2 months ago

▲ 22 r/LocalLLM

GitHub - antirez/ds4: DeepSeek 4 Flash local inference engine for Metal

Dropped by the founder of Redis. This is a custom native inference engine built specifically for DeepSeek v4 Flash.

On an M3 Max, 128GB, stock ds4 settings:
- 14–15 t/s at 62K pre-filled actual coding conversation
- memory usage was flat during gen ~85GB res
- disk cache is ~8GB for a full 100K context window
- thermals were normal, light fan activity
- inference server is rock solid so far

Haven't played around with it yet but going to give it a go tomorrow when I get time.

github.com

u/founders_keepers — 2 months ago

▲ 8 r/opencodeCLI

MiniMax-M2.7 added via Wafer.ai

Has anyone tried this provider? Would love your genuine feedback.

github.com

u/founders_keepers — 2 months ago

▲ 2 r/MiniMax_AI

MiniMax-M2.7 live with a 204,800 token context window, built for long-context coding agents and production engineering workflows. Starting at $10/week.

u/founders_keepers — 2 months ago

▲ 41 r/ZaiGLM

I've been tracking comparisons for the past few weeks. Fuller comparison here: https://www.reddit.com/r/ZaiGLM/comments/1sz0gv3/glm51_on_wafer_pass_vs_zai/

For open source I'm very bullish on small providers, especially if they're local.

u/founders_keepers — 2 months ago

▲ 7 r/ZaiGLM

Comparison is in tokens per second, measured end-to-end (E2E) on real usage, so it includes time to first token (TTFT).

For a flat fee of $10/week, I'm very bullish on small inference providers.
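For anyone wondering how much including TTFT changes a tok/s figure, here is a rough sketch. The function name `tokens_per_second` and the example timings are hypothetical, not taken from the comparison above.

```python
def tokens_per_second(output_tokens, ttft_s, total_s):
    """Return (e2e_tps, decode_tps) for one streamed response.

    e2e_tps counts the wait for the first token (TTFT); decode_tps ignores it.
    Inputs are illustrative; real measurements would come from request timestamps.
    """
    if output_tokens < 2 or ttft_s < 0 or total_s <= ttft_s:
        raise ValueError("need at least 2 tokens and total_s > ttft_s >= 0")
    # End-to-end: all tokens divided by the full wall-clock time.
    e2e_tps = output_tokens / total_s
    # Decode only: tokens generated after the first one, over the time after TTFT.
    decode_tps = (output_tokens - 1) / (total_s - ttft_s)
    return e2e_tps, decode_tps


if __name__ == "__main__":
    # Hypothetical request: 500 output tokens, 2 s to first token, 7 s in total.
    e2e, decode = tokens_per_second(500, ttft_s=2.0, total_s=7.0)
    print(f"E2E: {e2e:.1f} tok/s, decode only: {decode:.1f} tok/s")
```

The E2E number is always the lower of the two, which is one reason advertised peak speeds and real-usage speeds can differ.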

u/founders_keepers — 2 months ago