u/Immediate-Grand8403

LLM-wiki - first test drive. Am I doing this right?

If you've heard of Karpathy's llm-wiki (it's a GitHub gist), I could use some extra eyeballs.

I mkdir'd /llm-wiki in my root directory, copied the gist into a CLAUDE.md, then asked Claude to ingest a few example files: one .md, one .pdf, one .txt. It's the token usage on the text file that piques my curiosity. Here are the session results. (This session covers only the .txt file ingestion.)

Total cost: $1.72

Total duration (API): 5m 10s

Total duration (wall): 11m 23s

Total code changes: 132 lines added, 0 lines removed

Usage by model:

claude-haiku-4-5: 455 input, 16 output, 0 cache read, 0 cache write ($0.0005)

claude-sonnet-4-6: 7.4k input, 13.4k output, 2.5m cache read, 197.7k cache write ($1.72)
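As a rough sanity check on that breakdown, here is a minimal sketch that recomputes the cost from the token counts above. The per-million-token prices are assumptions and are not taken from the session output, so check them against the current pricing page. The token counts are the rounded figures shown above, so the result can be off by a cent or so. The names `ASSUMED_PRICES`, `USAGE`, and `estimate_cost` are made up for illustration.

```python
# ASSUMPTION: USD prices per million tokens. These are not from the session
# output; verify them against the provider's current pricing page.
ASSUMED_PRICES = {
    "claude-sonnet-4-6": {"input": 3.00, "output": 15.00,
                          "cache_read": 0.30, "cache_write": 3.75},
    "claude-haiku-4-5": {"input": 1.00, "output": 5.00,
                         "cache_read": 0.10, "cache_write": 1.25},
}

# Token counts as reported in the session summary (rounded there, e.g. "2.5m").
USAGE = {
    "claude-sonnet-4-6": {"input": 7_400, "output": 13_400,
                          "cache_read": 2_500_000, "cache_write": 197_700},
    "claude-haiku-4-5": {"input": 455, "output": 16,
                         "cache_read": 0, "cache_write": 0},
}


def estimate_cost(model: str, usage: dict[str, int]) -> dict[str, float]:
    """Return the estimated USD cost per token category for one model."""
    prices = ASSUMED_PRICES[model]
    return {kind: count * prices[kind] / 1_000_000
            for kind, count in usage.items()}


if __name__ == "__main__":
    for model, usage in USAGE.items():
        parts = estimate_cost(model, usage)
        total = sum(parts.values())
        print(f"{model}: ${total:.4f}")
        for kind, cost in parts.items():
            share = cost / total if total else 0.0
            print(f"  {kind:<12} ${cost:.4f} ({share:.0%})")
```

Under those assumed prices the Sonnet total lands within a cent of the reported $1.72, and most of it comes from cache reads and cache writes rather than from the plain input and output tokens.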

The text file is 486kB.

I'm mainly curious about the token usage and therefore cost. I haven't settled on a decent way of tracking usage. Bottom line: am I doing this correctly?

u/Immediate-Grand8403 — 2 days ago