u/Forward_Jackfruit813

It's a perfect time to pivot to local LLMs and learn real skills

Don't shortcut anything. Get you a GPU (32GB VRAM recommend, multiple cards work too), Mac Mini, or Ryzen AI 395+ PC (what I went with). Budgets vary, but there are options even if you dont want to spend thousands. (Which you probably should if you're heavily using AI)

If you go the GPU or Ryzen AI route, install Linux server. Learn how it works. Set up your workflow exactly how you want to use it. Compile llama.cpp (not ollama, they are different and Ollama is easier but way worse). If it's too daunting you can even use Gemini CLI to set it up for you the first time. You will pick up on useful skills just working through everything with it.

You'll learn how to set up your firewall, understand how LLMs work, understand how Linux works, understand how to do simple stuff like SSH. You can secure knowledge sources like wikipedia or stack overflow through kiwix and use it as a RAG (searchable database for your LLM) to use it when the Internet is not available or when AIs start getting blocked. Your LLM quality will be unphased regardless of what happens, you only need electricity. AI is significantly more impressive when you can pull the Ethernet plug and still have all that intelligence running only just for you. And you will never have a limit again.

You will gain personal skills and guarantee you will have a workflow that works for you even if changes in cloud providers or web searching happens. You can custom build applications for yourself to make the LLM more effective for what you want it to do, or even integrate the LLM into it.

You can still use a low tier cloud model for stuff that you can't do locally yet, but local tends to follow about 8 months behind frontier. Open-Weight models come out almost every week. And when they do you can drop it into your setup and see immediate benefit. Gemini will help you start, that's how I got into it at first.

For most users at this moment, you will want to run Qwen 3.6 27B (Dense model which is slower but smarter, better for GPUs) or Qwen 3.6 35A3B (MoE model which is faster and better for the Ryzen AI 395+ PCs) at Q6 quants or higher (if not coding, q4 is fine). They are you best bang for your buck and will do most, if not all you are currently doing with Gemini. Opencode is one of the open source alternatives to Claude code, Codex, and Gemini CLI. Local LLMs can run agentically just like cloud models.

Big tech will rug pull you, it will only get worse from here and you can go ahead and learn what you need while Gemini is still good enough to guide you through the process.

I started this process a little over a month ago and everything described I already have set up and working. It's been extremely fun and worthwhile.

reddit.com

UI feedback on retro console?

Hey everybody, just wanting to get some quick feedback for a nostalgia console I am making. It's aiming for a Dreamcast/PSX aesthetic. The icons bounce and hover similar to the Dreamcast. The target resolution is 640x360 with 5 bit dithering that scales to a 720P output. Is there anything that looks off? I have a video, but I can't upload it on here. It's very important that it hits some nostalgia vibes. Screenshots aren't the highest quality due to running on a capture card on the console.

u/Forward_Jackfruit813 — 4 days ago

This is a fantasy console (except it's real working hardware) that I am working on inspired by my favorite console the Dreamcast. It's called Replay and is built using Rogue Engine and is intended to run Rogue Engine and Three.js games. The Chromium instance runs on top of a Linux Ubuntu Server Distro. The Rest API and Chromium Kiosk instance starts on boot as a service and is the only GUI you see. There is a boot up intro and it's all streamlined and simple as expected.

The goal is to make it dead simple and easy for demakes and retro indie projects and ports. The developer can build to HTML and utilize only a few simple API calls (save game filesystem, stat tracking, achievement tracking, and gamepad mapping) for the Replay port. You can easily move over to another platform for more market reach. There are no limitations other than the hardware performance with overhead. 60FPS has required a lower ~640x480 class resolution which fits in to the Dreamcast aesthetic. I did have a lower color palette forced with dithering, but I didn't want to hinder potential with other art styles of different styles of games. So your game can run any resolution and any color depth you choose. I recommend the ~640x480 class resolution though for consistency and reliable performance.

The SDCards only need to contain your index.html and a details.txt file for metadata. CORS restrictions are disabled so if your game needs to pull data off the SDCard for app data, that works with no intervention. The console is intended to be offline always for security with the exception of developers/tinkers going in through SSH.

The hardware itself is a 4GB Orange Pi running on a 128GB NVME SSD (overkill on storage, but the cheapest I could find as the OrangePi requires NVME). The gamepad is a N64 clone with more of a Dreamcast style mold. The case is a kit that comes with a cooling fan, outer case, and power supply. Hardware costs are around $60 not including the OrangePI (prices for devboards are inflated right now).

I have 3 launch titles planned: Sunder (silent hill style game), a space dogfighting game, a racing game (PGR inspiration), and a marble blast style casual game.

I will post more footage after the dashboard is officially complete. Stat, achievement, save game management, gamepad remapping, and factory restore options need UI pages finished.

u/Forward_Jackfruit813 — 2 months ago