u/H3OErikilious

Hey everyone,

I’ve been trying to build some type of local llm wiki with local LLMs as I decided that the Claude cowork limits were too small for me and I just wanted to have some more “token freedom” so to speak. My tech stack before I get into the problems:

M1 Max MacBook Pro with 64gb of ram and 8tb of storage

Claude code running in terminal linked to LM studio

100k token context with GPU offload set to 40, CPU Thread pool size set to 10

It’s been processing requests and stuff like that pretty slowly and didn’t really do anything right so far. Is there anything I need to fix? There’s a Claude.md and stuff like that but what should I be fixing for it to work quicker and make less errors? (E.g. making snippets in obsidian with a tutorial in css, it just doesn’t work at all lol)

Thanks for the help!

reddit.com
u/H3OErikilious — 1 month ago