Image 1 — LLMs Gateway – Model Management for llama.cpp

Image 2 — LLMs Gateway – Model Management for llama.cpp

LLMs Gateway – Model Management for llama.cpp

I got tired of manually juggling GGUF downloads, symlinks, and llama-server restarts every time I wanted to swap models. So I built LLMs Gateway – a lightweight CLI + REST API that sits on top of llama.cpp and handles the entire model lifecycle.

What My Project Does

LLMs Gateway simplifies local LLM management by providing a single interface for discovering, installing, validating, activating, and serving GGUF models.

Features:

Search Hugging Face repositories directly from the CLI
Inspect model metadata before downloading
Download and install GGUF models with a single command
Maintain a local JSON-based model registry
Validate downloaded files using hashes
Activate models through symlink switching
Automatically restart llama-server when a model changes
Expose all functionality through both a CLI and REST API

Example workflow:

docker compose up -d

modelctl search llama
modelctl inspect unsloth/gemma-4-E2B-it-qat-GGUF
modelctl install unsloth/gemma-4-E2B-it-qat-GGUF model.gguf
modelctl activate &lt;model-id&gt;

Once activated, llama-server automatically picks up the new model without manual intervention.

Target Audience

LLMs Gateway is designed for:

Developers running local LLMs with llama.cpp
Self-hosted AI enthusiasts
Homelab users
Teams building local AI services or internal tooling
Anyone managing multiple GGUF models on a single machine

The project is intended to be production-capable for small to medium deployments while remaining lightweight enough for personal use.

Comparison

Unlike tools such as Ollama that manage their own model ecosystem and runtime, LLMs Gateway focuses on model lifecycle management for llama.cpp.

Key differences:

Works directly with GGUF repositories from Hugging Face
Keeps a transparent local JSON registry instead of a hidden database
Provides explicit control over installed artifacts
Uses symlink-based activation to switch models
Integrates directly with existing llama.cpp deployments
Combines model management and serving orchestration in a single workflow

The goal is not to replace llama.cpp, but to make operating multiple local models on top of llama.cpp significantly easier.

Architecture

Stack:

Python monorepo (uv workspace)
FastAPI
llama.cpp
Single Docker image

Two services, one image.

The coolest part is the container entrypoint. It watches for model activation changes and seamlessly restarts llama-server with the selected weights. No manual process management, no PID hunting, and no server reconfiguration.

GitHub: https://github.com/regisx001/llms-gateway

I'm interested in hearing how others manage local models today. Are you using symlinks, Ollama, custom scripts, or something else?

u/regisx001 — 13 days ago

▲ 1 r/OpenSourceeAI

LLMs Gateway – Model Management for llama.cpp

What My Project Does

LLMs Gateway simplifies local LLM management by providing a single interface for discovering, installing, validating, activating, and serving GGUF models.

Features:

Search Hugging Face repositories directly from the CLI
Inspect model metadata before downloading
Download and install GGUF models with a single command
Maintain a local JSON-based model registry
Validate downloaded files using hashes
Activate models through symlink switching
Automatically restart llama-server when a model changes
Expose all functionality through both a CLI and REST API

Example workflow:

docker compose up -d

modelctl search llama
modelctl inspect unsloth/gemma-4-E2B-it-qat-GGUF
modelctl install unsloth/gemma-4-E2B-it-qat-GGUF model.gguf
modelctl activate &lt;model-id&gt;

Once activated, llama-server automatically picks up the new model without manual intervention.

Target Audience

LLMs Gateway is designed for:

Developers running local LLMs with llama.cpp
Self-hosted AI enthusiasts
Homelab users
Teams building local AI services or internal tooling
Anyone managing multiple GGUF models on a single machine

The project is intended to be production-capable for small to medium deployments while remaining lightweight enough for personal use.

Comparison

Unlike tools such as Ollama that manage their own model ecosystem and runtime, LLMs Gateway focuses on model lifecycle management for llama.cpp.

Key differences:

Works directly with GGUF repositories from Hugging Face
Keeps a transparent local JSON registry instead of a hidden database
Provides explicit control over installed artifacts
Uses symlink-based activation to switch models
Integrates directly with existing llama.cpp deployments
Combines model management and serving orchestration in a single workflow

The goal is not to replace llama.cpp, but to make operating multiple local models on top of llama.cpp significantly easier.

Architecture

Stack:

Python monorepo (uv workspace)
FastAPI
llama.cpp
Single Docker image

Two services, one image.

GitHub: https://github.com/regisx001/llms-gateway

I'm interested in hearing how others manage local models today. Are you using symlinks, Ollama, custom scripts, or something else?

u/regisx001 — 13 days ago

▲ 92 r/coolgithubprojects

I built a local-first desktop project manager and I’m looking for testers + contributors

I’ve been building a project management desktop app called Worklog.

fast
local-first
offline-capable
lightweight
no unnecessary cloud dependency

Tech stack:

Rust + Tauri
SvelteKit
SQLite

Current features:

Kanban boards
timeline view
table view
ticket management
command palette
comments
local database storage
cross-platform support (Linux/macOS/Windows)

I’m actively improving it and adding more features, but I need real users to test it and break things.

I’m mainly looking for:

bug reports
UI/UX feedback
performance feedback
feature suggestions
contributors interested in local-first desktop software

If you test it and find issues, open an issue or send feedback on GitHub.

The project is still evolving quickly, so feedback now has a direct impact on how it develops.

u/regisx001 — 2 months ago

▲ 7 r/Moroccopreneur

Struggling to grow as an AI/Data Science student in Morocco despite strong technical skills

I'm currently a first year Master's student in AI & Data Science in Morocco, and lately I've been questioning how people in tech here actually grow professionally outside of traditional jobs.

I’ve been trying to freelance, build products, and use my skills seriously, but honestly I find the ecosystem very limiting. I can build advanced software and AI systems, yet it feels difficult to find:

serious communities
ambitious technical people
interesting real-world projects
clients who understand software value
opportunities to work on high-impact products

Most freelance opportunities I find are either very low-budget or focused on basic web work, while I’m more interested in building scalable systems, AI applications, backend infrastructure, automation tools, data systems, etc.

Another issue is that I sometimes struggle to find meaningful projects to work on. I can technically build a lot of things, but I don’t want to waste time creating random portfolio projects with no actual value or users.

My goal is:

make income from my skills
refine myself technically
work on useful and challenging products
eventually build something scalable

For people who are more experienced:

How did you start making income with advanced technical skills in Morocco?
Did you focus on freelancing, remote work, SaaS, open source, networking, or something else?
How do you find good project ideas that are actually useful?
Is the solution simply to ignore the local market and target international clients directly?
Are there Moroccan communities/Discords/groups where serious builders actually collaborate?

Here’s my GitHub if anyone wants to see the kind of things I work on:
GitHub profile

I’d appreciate honest advice from people who already went through this stage.

u/regisx001 — 2 months ago

▲ 13 r/coolgithubprojects+1 crossposts

Hey everyone!

I've been building Worklog — a local-first desktop project manager for small dev teams — and just shipped v1.2.

The core idea is simple: a fast, keyboard-driven Kanban tool where your data lives on your machine, not someone else's server. No accounts, no subscriptions, no vendor lock-in. The .worklog/worklog.db file sits right in your workspace folder — portable and transparent.

The Stack

Desktop shell: Tauri v2 (Rust)
Frontend: SvelteKit + TypeScript
Runtime: Bun
Persistence: SQLite (via Tauri SQL plugin)

The architecture is straightforward:

UI → hooks → repository layer → SQLite

Everything is local, predictable, and fast.

What's new in v1.2?

Git Auto-Sync (Background Scheduler)

Worklog can now automatically sync your workspace to a remote GitHub repo in the background. Configure intervals from 5 minutes to 6 hours. It uses a pull-before-push strategy to keep history clean. PATs are stored securely.

Global Zoom

Scale the entire UI from 50% to 200% using Ctrl/Cmd + +/-/0. Preference persists across restarts.

Edit Board from Sidebar

Right-click a board in the sidebar to rename it or update its description. No need to go into settings.

Robust DB Migrations

Implemented a schema migration system (currently at schema v11). Data upgrades cleanly with each release.

Keyboard-first

Ctrl+K opens the command palette
Ctrl+N creates a ticket
M moves a focused ticket to the next status

The tool is designed to be used primarily from the keyboard.

Install (Linux)

    # Arch via AUR
    yay -S worklog-bin

GitHub: https://github.com/regisx001/Worklog
📦 v1.2.0 Release: https://github.com/regisx001/Worklog/releases/tag/app-v1.2.0

Would love feedback, bug reports, or feature ideas. This is still early but the foundation feels solid. Thanks for checking it out! 🙏

u/regisx001 — 2 months ago

▲ 30 r/sveltejs+1 crossposts

I’ve been working on a project called Worklog, a desktop project manager designed for small dev teams that don’t want to rely on cloud tools.

The idea is simple: fast, keyboard-driven planning with a Kanban workflow, and all data stored locally. No accounts, no sync servers, no hidden state.

Core structure:
Workspace -> Board -> Ticket

What it currently does:

Kanban boards (Todo / In Progress / Done)
Ticket editing with comments
Command palette + keyboard shortcuts for most actions
Persistent workspaces that restore on startup
Local SQLite storage (your data lives in your workspace folder)

Key decisions:

Local-first by default (works fully offline)
No forced cloud or SaaS model
Data is transparent and portable
Git-friendly direction for teams that want versioned project state

Tech stack:

Tauri (desktop shell)
SvelteKit + TypeScript
Bun
SQLite

Architecture is straightforward:
UI -> hooks -> repository layer -> SQLite

What I’m trying to avoid:

Turning it into another bloated PM suite
Locking users into a hosted backend
Hiding data behind APIs or proprietary formats

Planned direction:

Better filtering and board views
Richer ticket metadata
Search across workspaces
Backup/export workflows
Git-native workflows for teams

This is still early, but already usable for small projects.

I’m interested in feedback from people who:

Prefer local-first tools
Are tired of Jira/Linear-style overhead
Care about data ownership

Repo: https://github.com/regisx001/Worklog

Latest Release : https://github.com/regisx001/Worklog/releases/tag/app-v0.3.0

don't forget to give a star in github !!!

u/regisx001 — 2 months ago