u/anitamaxwynnn69

Corsair RMA experience (the good kind)

I see a ton of terrible Corsair support stories on here. I wanted to share a different experience I had just to balance it out.

Had a Corsair Vengeance RGB 64GB (2x32GB DDR4) that was gifted to me years ago. One stick gave my pc reboots since day #1 so I was using it on and off. Always thought it was some compatibility issues with my mobo. Got to know about MemTest and tested it out a few weeks ago - it would abort at 58% from sheer error count lol. Kit was probably 3-4 years old easy. The person who gifted me this was no longer a part of my life and there was no way to reach out to them for receipts.

Submitted a ticket in late April with no expectations.

Timeline:

  • Apr 22: Submitted ticket, explained everything honestly.
  • Apr 23: Was asked for some pictures and some other information. RMA approved within 24 hours.
  • Apr 25: Agent Wayne L (GOAT) confirmed everything, and kept me in the loop
  • Shipped the bad kit back somewhere around Apr 28
  • May 4: Replacement shipped
  • May 7: Delivered at my door (3 days from Taiwan to WA, US lol)

Genuinely hope y'all also have good experiences like this 😸 (not sponsored)

reddit.com
u/anitamaxwynnn69 — 2 days ago
▲ 27 r/LocalLLM+1 crossposts

Finding the 4x 3090 Sweet Spot

https://preview.redd.it/8o43bjhe9d1h1.png?width=5346&format=png&auto=webp&s=1c87c2ee8b8ffff43495f543266056b0e26d3947

In another post I had someone ask me about the power draw of the 4x 3090 setup so I'm sharing a a full test I conducted to understand the efficiency curve. Used this blog post (not mine) as a reference.

Setup:

  • GPUs: 4x RTX 3090 (Dell OEM, EVGA XC3, 2x ASUS Strix)
  • PCIe Topology: Gen 3 (Bifurcated: x16 / x8 / x8 / x4)
  • Model: Qwen3.6-27B (FP16)
  • Backend: vLLM v0.20.2 (TP=4)
Power Limit (W) Output (t/s) Prompt Processing (t/s) Total Throughput (t/s) Efficiency (t/joule)
350/390 (Unrestricted) 29 239 269 0.77
300 29 238 268 0.89
275 29 236 265 0.96
250 29 232 261 1.04
220 27 220 248 1.13
200 24 196 221 1.11

Takeaways:

  1. The 220W Sweet Spot: Peak efficiency (matches the blog's findings)
  2. Diminishing Returns: Increasing the limit beyond 250W provides diminishing returns

Hope this helps someone. Happy to answer any questions.

I'm VERY satisfied with Qwen 3.6 27B as a daily driver, but I would still like to know if there are any better/bigger models I can run on this setup. My understanding is that the best I can do is DSv4 at Q2 - not sure if it's fully supported yet though.

Additional context: it's an open build on a generic mining frame. I'm cooling it with 10x TL-C12C-S (5 on each side of gpus perpendicularly). I finished building this very recently so I'm open to suggestions on how to improve it.

Edit: Added prompt processing to the table

reddit.com
u/anitamaxwynnn69 — 8 days ago
▲ 23 r/ollama+1 crossposts

Simpler self hosted alt to Open WebUI

Got Qwen3.6 27B running on my newly assembled 4x 3090 rig (s/o 3090-club) and I'm trying to get the people in my house to adopt the local workflow.

Open WebUI has improved a lot in the recent updates, but I still found it pretty rough for non-technical people. It often feels more like a dev tool than a self-hosted ChatGPT-style app that "just works". I built overtchat to focus mainly on getting the core chat experience right: a polished ui, simple setup and fewer moving parts. The goal is not to compete on agentic workflow with LibreChat/LobeChat/OWUI but to provide a cleaner self-hosted interface for local models.

Ships with its own tried & tested searxng config for web search, kokoro tts (no api keys needed). Single docker compose file. MIT licensed of course, no telemetry. Optimized for mobile as PWA. Github.

Also being upfront - I write code for a living and have been actively reviewing/debugging/changing things, but I did use quite a lot of AI lol. I promise it's not slop tho 😿 . Feedback is welcome!

u/anitamaxwynnn69 — 3 days ago