u/read_too_many_books

All local models suck. Even DeepseekV4 can only handle instructions. Prove me wrong plssss

I can spend $25k-$100k on a local computer, call it a business expense. I'll spare the details.

But no... These models just suck.

I've tried every single model outside the 1.6T DeepSeekv4. Maybe if people think that is useful, I'll do a trial Vast.ai server.

I feel like I'm letting my ultra privacy focused customers down. I've been trying for 1+ months and probably spent $400+ on vast.ai server tryouts.

I know some people are getting weather and stonk prices with crappy 35B models... We have significantly more complex stuff. Combining 260 page pdf docs with a massive dropbox.

Only Opus has worked.

Maybe I need to lower expectations? Maybe I need to have Opus make MCP/CLI-like skills for ~500B models?

reddit.com
u/read_too_many_books — 2 days ago

Whats it like not using Claude Opus?

People using 2nd rate models. What are you accomplishing? (Chinese models, Codex, Sonnet)

People using 3rd rate with GPU. What are you accomplishing? (Local tier with Nvidia Chips)

People with 3rd rate with CPU. What are you accomplishing? (Everyone else)

I'm most interested in why you are still interested in OpenClaw. I have been miserable trying Sonnet and Deepseekv4 after getting used to Opus.

I joke when I say: "Oh you got the weather or stock prices?" but... its not like you are single shotting full stack HIPAA approved apps.

reddit.com
u/read_too_many_books — 2 days ago

Looking to hire a US 1099 contractor to teach me how to use small models on OpenClaw

I basically need training. I've been using Opus for so long, when switching to anything else, nothing works.

DM me.

I'll want to look at your memory files, and see how you talk to my robust openclaw.

reddit.com
u/read_too_many_books — 2 days ago

Local LLM CPU users... How long is it taking you to do anything?

I hear about people on CPU using 35B models... From everything I read, when you load up a context of like 100k tokens, this takes like 5-15 minutes.

Is this what life is like running local models? My 9B models on old 6gb VRAM wont even run with OpenClaw because it needs 16k tokens just to send its first command.

reddit.com
u/read_too_many_books — 3 days ago

CLI is cheaper than Tokens, but it makes more mistakes. (Duh, but real world usecase)

I have sent out ~61 messages RCS messages x 4 times(1 month before event, 1 week before, day before, day after)

The 1st time I used it, I prob spent $25 on Opus tokens. It worked perfectly.

3rd and 4th times I used it, I told it to use CLI as much as possible. It thought it sent 61 invites, but really it sent ~10 before I disconnected my phone due to having to leave the house.

But................

One more thing to mention. The second time... The second time Tokens screwed up worse.

For some reason, despite having a solution in its memory file, Opus decided to use something called KDE connect. This was different than how it was successful time #1 using Google Messages.

Why did it decide a different path? Non-deterministic.

Anyway, you want some non AI slop? Here you go. I bet crappy local models can follow CLI instructions that Opus figured out.

DM me for a openclaw Opus snapchat group.

reddit.com
u/read_too_many_books — 4 days ago

$2,500 of Opus token spend on Openclaw... "Whats a workflow?"

Admittedly I own a software shop and have been using OpenClaw to upgrade and bug fix my programs. I taught it vision to click buttons and look at the screen to determine if things were correct. Its been amazing. I've also used it to manage a server with a few customer's full stack apps. Occasionally I use it as an assistant, it fills out forms on websites.

But 'Whats a workflow?'

I wonder if I have a hard time understanding because whenever I have a 'workflow', I tell my openclaw to build software for it.

The closest thing I can imagine to a workflow, and this is saved in a separate memory file is paying contractor invoices:

>Open the invoice tracking file

>Go to this week's pay period

>line up who submitted an invoice with name

>open each person's invoice file.

>go to this week's spreadsheet in each invoice file

None of this is programmatic I believe. Is that a workflow?

reddit.com
u/read_too_many_books — 9 days ago

Transfering OpenClaw workspaces between computers? How about between Windows and Linux(Fedora ofc)

I have a main computer with 500 memory files made by openclaw. Its my favorite OpenClaw.

However, I sometimes need this computer for other things, work, compute, etc...

I want to be able to use my openclaw on another computer, same memories. Even better, sync them.

And... oof this is the hard one, and probably close to impossible without spending a ton of tokens:

Convert my memory files from Windows(full of powershell commands) to Linux Fedora....

Curious if there are any tricks here, or its going to be as painful as I imagine.

reddit.com
u/read_too_many_books — 11 days ago

How long should I expect for video generation?

I'm using OpenClaw to rent Vast.AI servers, it told me 10 minutes per video with 96GB.

Not sure if it screwed something up, but was curious what kind of expectations I should have.

Any settings I should set?

reddit.com
u/read_too_many_books — 11 days ago

Looking for ideas... Openclaw doing video editing

I can spend hundreds of dollars in Opus tokens to do this. Any ideas on having openclaw edit videos?

EDIT:

Is it smart enough to know where to clip and make edits/transitions?

reddit.com
u/read_too_many_books — 12 days ago

What have you accomplished with Qwen 35B (include your context size too)

I see people claiming they used local models, and even more insane, sub 200B local models.

I tried hard, and I have come to 2 conclusions on people who claim they are successful:

>They are lying

>They are doing something so unbelievably easy like opening a website or getting the weather.

At best, I can see using Opus to create a memory file for it to follow.

Maybe using it for heartbeats...

But I want to hear the most complex things you've done with Qwen, specifically 1 shot, with basically 0 corrections.

reddit.com
u/read_too_many_books — 13 days ago
▲ 105 r/openclaw

Deepseek v4 Flash is pretty amazing, about to buy a $25k computer

My customers have confidential data, they won't even use AWS.

I've been trying to solve this problem for them and they are more than fine with buying an on-premise device for Local LLMs + AI Agents.

Up until today, I have been extremely dissapointed with every model not named Opus.

However, Deepseek 4 Flash is doing near-Opus level performance. This is something I can actually use.

Upon this whole process things I dont understand:

>How are Qwen 35b people are using it? Not even sonnet can do the job.

>Do Mac users just say they are using local LLMs but not actually? That stuff is unbelievably slow. Heck, even with NVIDIA GPUs, it can be a bit frustrating when doing 1M tokens.

Anyway, thanks China for the free LLM. Not sure what they get out of it, I'm running it locally.

reddit.com
u/read_too_many_books — 14 days ago

https://github.com/antirez/ds4

I saw the phrase 'long prompt', and was amazed to see it was getting 20k tk/s. Its CPU based, that is pretty amazing for CPU... until I saw it say 12k tokens.

My Claude Opus is using 40-60k on its subagents. My current main is 3M.

Don't get me wrong, I'm looking forward to trying deepseek, but on CPU? That sounds miserable. I'll spin up a vast server with 2 RTX 6000s.

u/read_too_many_books — 16 days ago

Probably the best work I've ever done.

The new insult in the household:

>"You are being a last man!"

I've also gotten a kid out of bed by saying:

>"You are just going to lay in bed all day? Are you a last man? Or a superman?

And the kid got up, both me and my wife were surprised.

>"We've invented happiness. Says the last man, and he blinks" - My 6 year old

Fun times here. Seems like good fun.

reddit.com
u/read_too_many_books — 18 days ago

I am no fanboy, I run a software shop.

I want cheap labor(openclaw as an assistant and openclaw as a senior programmer) + I've been selling OpenClaws to various small businesses in my area.

I installed Hermes this weekend, and it was okay. It seemed to figure out how to spend money on vast.ai and spin up a llama4 maverick instance with Opus tokens. That was cool.

But...

>The UI seems lacking. No easy toggle for models, the web UI didnt have an chat feature without toggling it, and even then, it seems like a crappy terminal that isnt very fast.

>It kept using Opus despite me telling it to use local AI. Idk... I suppose even openclaw struggled with that.

>Idk... something seems off. Like its just not as dynamic as openclaw. It asks for my help. Idk... Anyone share the same vibe here? I see the potential.

>it seems to spend lots of tokens, more than openclaw.

Anyone else use both and have opinions?

reddit.com
u/read_too_many_books — 19 days ago

I did all 44 hours, I really liked the book, I still think about the scene at the end:

>"no Ivan, Katrina doesn't love Dmitri anymore, shes totally over him" - little Alyosha

>1 day later

>At that instant Katya(Katrina) appeared in the doorway. For a moment she stood still, gazing at Mitya(Demitri) with a dazed expression. He leapt impulsively to his feet, and a scared look came into his face. He turned pale, but a timid, pleading smile appeared on his lips at once, and with an irresistible impulse he held out both hands to Katya. Seeing it, she flew impetuously to him. She seized him by the hands, and almost by force made him sit down on the bed. She sat down beside him, and still keeping his hands pressed them violently. Several times they both strove to speak, but stopped short and again gazed speechless with a strange smile, their eyes fastened on one another. So passed two minutes. “Have you forgiven me?” Mitya faltered at last, and at the same moment turning to Alyosha, his face working with joy, he cried, “Do you hear what I am asking, do you hear?” “That's what I loved you for, that you are generous at heart!” broke from Katya. “My forgiveness is no good to you, nor [866] yours to me; whether you forgive me or not, you will always be a sore place in my heart, and I in yours—so it must be....” She stopped to take breath. “What have I come for?” she began again with nervous haste: “to embrace your feet, to press your hands like this, till it hurts—you remember how in Moscow I used to squeeze them—to tell you again that you are my god, my joy, to tell you that I love you madly,” she moaned in anguish, and suddenly pressed his hand greedily to her lips. Tears streamed from her eyes. Alyosha stood speechless and confounded; he had never expected what he was seeing.

But I didn't think this was absurdism. https://en.wikipedia.org/wiki/The_Grand_Inquisitor was quite philosophical, and such was Ivan and his alter ego...

What should I be viewing under that lens?

u/read_too_many_books — 20 days ago

I wanted to use vast.ai, but ollama doesnt have it, and when i used vLLM I didn't have success.

I genuinely don't know what failed. Maybe the VPS didnt have enough HDD/SSD space.

I do not want to use someone elses server with this already installed. I want to live through the entire process.

Any suggestions? I am open to new VPS companies and different instances.

reddit.com
u/read_too_many_books — 21 days ago