u/HumanoidMuppet

LLMDB.org - A place to share benchmarks and configs

I basically vibecoded this entire app over the last few days because I was frustrated with the current state of local LLM benchmarking.

Whenever I wanted to check how fast a model runs on a specific setup, I couldn't find a single unified database. The few platforms that do exist are either outdated, focus exclusively on Nvidia or enterprise cloud GPUs (like H100s/A100s), or don't accept community submissions. I really couldn't find a similar app out there that allowed people to submit benchmarks for any arbitrary hardware entries.

So I built LLMDB (LLM Benchmarks Database).

URL: https://www.llmdb.org/

GitHub: https://github.com/secretdino/llmdb-org

It is completely open source (MIT license) and community-driven. The goal is to make it easy to compare, search, and submit runs across any setup - whether you're running on an Apple Silicon Mac, AMD ROCm, an old Nvidia GTX card, or a wild multi-GPU rig.

A few quick things about how it works:

  • Any hardware entries allowed: No rigid dropdowns or pre-approved configurations. You can submit and document performance for any custom setup.
  • Console log and Settings auto-parsing: To make submissions easy, you can literally just copy and paste raw console output from llama.cpp or vLLM. The backend will parse out the model name, tokens per second, engine parameters (like KV cache precision). You can also paste in your llama.cpp settings and it will parse those automatically too.
  • Fully Open Source: The repository is open-source under the MIT license, so anyone can check out the codebase or contribute.

Some disclaimers:

  • Probably full of bugs, I used Gemini 3.5 Flash for most of the development. I didn't even spring for Gemini Pro or Claude Opus.
  • It's as secure as asking Claude Opus to make it secure, so probably not very.
  • I asked Gemini to find some seed data, so the starting data might be missing a lot of detail. It did provide links to the sources, so the seed data isn't hallucinations.
  • Absolutely feel free to trash the code and my development skills (or lack there of). I know how much effort I put into this (not much) and that I should do better.
  • Will this be standard vibe-coded abandonware? Maybe, but that's why I open sourced it.

If you have a few minutes, check it out, play around with the search/filters, or upload some of your own local benchmark logs to help populate the database. I'd love to hear your feedback on it!

reddit.com
u/HumanoidMuppet — 1 day ago

Qwen3.6-35b has jokes

I've been benchmarking MTP on Qwen3.6-35b-Q5_K_M and performance has been good so far, but that's not the point of this post. To benchmark, I have a few prompts that I try - write a story about my dog, create a single-file html aquarium, create a html sheepdog herding simulator, analyze an old kubernetes book in PDF format to test 80k context and tell me what has changed since this book was written.

But today, I wanted to test my custom orchestrator's new Attachment feature. I decided to test the Pelican riding a Bicycle benchmark. Successfully showed in chat! But it was Qwen's next suggestion that led to this post - Do I want an SVG of my dog on a Skateboard? Done! A dinosaur in a rocket going to the moon - ✅. Then I wanted to try non-image attachments, so I asked Qwen to attach a .md file, and apparently I'm using these SVGs to take over the world!

https://preview.redd.it/j929rg4b8d2h1.png?width=390&format=png&auto=webp&s=b3d69a6b056142f58b993a3d104a5181b7846b51

https://preview.redd.it/096fsgvc8d2h1.png?width=395&format=png&auto=webp&s=9fa8c3856c09cab6fba181ba886c3269fbfaffc3

https://preview.redd.it/z1jfrs1f8d2h1.png?width=392&format=png&auto=webp&s=8861eb6982253d6903d506dff7e484c376615054

# 🦖 Muppet's Secret Plan

## Objective
Conquer the world... one SVG at a time.

## Steps
1. Generate cute animals
2. Put them on vehicles
3. Watch them zoom around
4. ???
5. Profit 🦖💰

## Notes
- Remember to add more sparkles ✨
- Dog needs a jetpack next
reddit.com
u/HumanoidMuppet — 1 day ago