Image 1 — LLMs Gateway – Model Management for llama.cpp
Image 2 — LLMs Gateway – Model Management for llama.cpp
Image 3 — LLMs Gateway – Model Management for llama.cpp
Image 4 — LLMs Gateway – Model Management for llama.cpp

LLMs Gateway – Model Management for llama.cpp

LLMs Gateway – Model Management for llama.cpp

I got tired of manually juggling GGUF downloads, symlinks, and llama-server restarts every time I wanted to swap models. So I built LLMs Gateway – a lightweight CLI + REST API that sits on top of llama.cpp and handles the entire model lifecycle.

What My Project Does

LLMs Gateway simplifies local LLM management by providing a single interface for discovering, installing, validating, activating, and serving GGUF models.

Features:

  • Search Hugging Face repositories directly from the CLI
  • Inspect model metadata before downloading
  • Download and install GGUF models with a single command
  • Maintain a local JSON-based model registry
  • Validate downloaded files using hashes
  • Activate models through symlink switching
  • Automatically restart llama-server when a model changes
  • Expose all functionality through both a CLI and REST API

Example workflow:

docker compose up -d

modelctl search llama
modelctl inspect unsloth/gemma-4-E2B-it-qat-GGUF
modelctl install unsloth/gemma-4-E2B-it-qat-GGUF model.gguf
modelctl activate <model-id>

Once activated, llama-server automatically picks up the new model without manual intervention.

Target Audience

LLMs Gateway is designed for:

  • Developers running local LLMs with llama.cpp
  • Self-hosted AI enthusiasts
  • Homelab users
  • Teams building local AI services or internal tooling
  • Anyone managing multiple GGUF models on a single machine

The project is intended to be production-capable for small to medium deployments while remaining lightweight enough for personal use.

Comparison

Unlike tools such as Ollama that manage their own model ecosystem and runtime, LLMs Gateway focuses on model lifecycle management for llama.cpp.

Key differences:

  • Works directly with GGUF repositories from Hugging Face
  • Keeps a transparent local JSON registry instead of a hidden database
  • Provides explicit control over installed artifacts
  • Uses symlink-based activation to switch models
  • Integrates directly with existing llama.cpp deployments
  • Combines model management and serving orchestration in a single workflow

The goal is not to replace llama.cpp, but to make operating multiple local models on top of llama.cpp significantly easier.

Architecture

Stack:

  • Python monorepo (uv workspace)
  • FastAPI
  • llama.cpp
  • Single Docker image

Two services, one image.

The coolest part is the container entrypoint. It watches for model activation changes and seamlessly restarts llama-server with the selected weights. No manual process management, no PID hunting, and no server reconfiguration.

GitHub: https://github.com/regisx001/llms-gateway

I'm interested in hearing how others manage local models today. Are you using symlinks, Ollama, custom scripts, or something else?

u/regisx001 — 13 days ago

LLMs Gateway – Model Management for llama.cpp

LLMs Gateway – Model Management for llama.cpp

I got tired of manually juggling GGUF downloads, symlinks, and llama-server restarts every time I wanted to swap models. So I built LLMs Gateway – a lightweight CLI + REST API that sits on top of llama.cpp and handles the entire model lifecycle.

What My Project Does

LLMs Gateway simplifies local LLM management by providing a single interface for discovering, installing, validating, activating, and serving GGUF models.

Features:

  • Search Hugging Face repositories directly from the CLI
  • Inspect model metadata before downloading
  • Download and install GGUF models with a single command
  • Maintain a local JSON-based model registry
  • Validate downloaded files using hashes
  • Activate models through symlink switching
  • Automatically restart llama-server when a model changes
  • Expose all functionality through both a CLI and REST API

Example workflow:

docker compose up -d

modelctl search llama
modelctl inspect unsloth/gemma-4-E2B-it-qat-GGUF
modelctl install unsloth/gemma-4-E2B-it-qat-GGUF model.gguf
modelctl activate <model-id>

Once activated, llama-server automatically picks up the new model without manual intervention.

Target Audience

LLMs Gateway is designed for:

  • Developers running local LLMs with llama.cpp
  • Self-hosted AI enthusiasts
  • Homelab users
  • Teams building local AI services or internal tooling
  • Anyone managing multiple GGUF models on a single machine

The project is intended to be production-capable for small to medium deployments while remaining lightweight enough for personal use.

Comparison

Unlike tools such as Ollama that manage their own model ecosystem and runtime, LLMs Gateway focuses on model lifecycle management for llama.cpp.

Key differences:

  • Works directly with GGUF repositories from Hugging Face
  • Keeps a transparent local JSON registry instead of a hidden database
  • Provides explicit control over installed artifacts
  • Uses symlink-based activation to switch models
  • Integrates directly with existing llama.cpp deployments
  • Combines model management and serving orchestration in a single workflow

The goal is not to replace llama.cpp, but to make operating multiple local models on top of llama.cpp significantly easier.

Architecture

Stack:

  • Python monorepo (uv workspace)
  • FastAPI
  • llama.cpp
  • Single Docker image

Two services, one image.

The coolest part is the container entrypoint. It watches for model activation changes and seamlessly restarts llama-server with the selected weights. No manual process management, no PID hunting, and no server reconfiguration.

GitHub: https://github.com/regisx001/llms-gateway

I'm interested in hearing how others manage local models today. Are you using symlinks, Ollama, custom scripts, or something else?

u/regisx001 — 13 days ago

I built a local-first desktop project manager and I’m looking for testers + contributors

I’ve been building a project management desktop app called Worklog.

  • fast
  • local-first
  • offline-capable
  • lightweight
  • no unnecessary cloud dependency

Tech stack:

  • Rust + Tauri
  • SvelteKit
  • SQLite

Current features:

  • Kanban boards
  • timeline view
  • table view
  • ticket management
  • command palette
  • comments
  • local database storage
  • cross-platform support (Linux/macOS/Windows)

I’m actively improving it and adding more features, but I need real users to test it and break things.

I’m mainly looking for:

  • bug reports
  • UI/UX feedback
  • performance feedback
  • feature suggestions
  • contributors interested in local-first desktop software

If you test it and find issues, open an issue or send feedback on GitHub.

The project is still evolving quickly, so feedback now has a direct impact on how it develops.

u/regisx001 — 2 months ago

Struggling to grow as an AI/Data Science student in Morocco despite strong technical skills

I'm currently a first year Master's student in AI & Data Science in Morocco, and lately I've been questioning how people in tech here actually grow professionally outside of traditional jobs.

I’ve been trying to freelance, build products, and use my skills seriously, but honestly I find the ecosystem very limiting. I can build advanced software and AI systems, yet it feels difficult to find:

  • serious communities
  • ambitious technical people
  • interesting real-world projects
  • clients who understand software value
  • opportunities to work on high-impact products

Most freelance opportunities I find are either very low-budget or focused on basic web work, while I’m more interested in building scalable systems, AI applications, backend infrastructure, automation tools, data systems, etc.

Another issue is that I sometimes struggle to find meaningful projects to work on. I can technically build a lot of things, but I don’t want to waste time creating random portfolio projects with no actual value or users.

My goal is:

  • make income from my skills
  • refine myself technically
  • work on useful and challenging products
  • eventually build something scalable

For people who are more experienced:

  • How did you start making income with advanced technical skills in Morocco?
  • Did you focus on freelancing, remote work, SaaS, open source, networking, or something else?
  • How do you find good project ideas that are actually useful?
  • Is the solution simply to ignore the local market and target international clients directly?
  • Are there Moroccan communities/Discords/groups where serious builders actually collaborate?

Here’s my GitHub if anyone wants to see the kind of things I work on:
GitHub profile

I’d appreciate honest advice from people who already went through this stage.

u/regisx001 — 2 months ago
▲ 13 r/coolgithubprojects+1 crossposts

Hey everyone!

I've been building Worklog — a local-first desktop project manager for small dev teams — and just shipped v1.2.

The core idea is simple: a fast, keyboard-driven Kanban tool where your data lives on your machine, not someone else's server. No accounts, no subscriptions, no vendor lock-in. The .worklog/worklog.db file sits right in your workspace folder — portable and transparent.

The Stack

  • Desktop shell: Tauri v2 (Rust)
  • Frontend: SvelteKit + TypeScript
  • Runtime: Bun
  • Persistence: SQLite (via Tauri SQL plugin)

The architecture is straightforward:

UI → hooks → repository layer → SQLite

Everything is local, predictable, and fast.

What's new in v1.2?

Git Auto-Sync (Background Scheduler)

Worklog can now automatically sync your workspace to a remote GitHub repo in the background. Configure intervals from 5 minutes to 6 hours. It uses a pull-before-push strategy to keep history clean. PATs are stored securely.

Global Zoom

Scale the entire UI from 50% to 200% using Ctrl/Cmd + +/-/0. Preference persists across restarts.

Edit Board from Sidebar

Right-click a board in the sidebar to rename it or update its description. No need to go into settings.

Robust DB Migrations

Implemented a schema migration system (currently at schema v11). Data upgrades cleanly with each release.

Keyboard-first

  • Ctrl+K opens the command palette
  • Ctrl+N creates a ticket
  • M moves a focused ticket to the next status

The tool is designed to be used primarily from the keyboard.

Install (Linux)

    # Arch via AUR
    yay -S worklog-bin

GitHub: https://github.com/regisx001/Worklog
📦 v1.2.0 Release: https://github.com/regisx001/Worklog/releases/tag/app-v1.2.0

Would love feedback, bug reports, or feature ideas. This is still early but the foundation feels solid. Thanks for checking it out! 🙏

u/regisx001 — 2 months ago
▲ 30 r/sveltejs+1 crossposts

I’ve been working on a project called Worklog, a desktop project manager designed for small dev teams that don’t want to rely on cloud tools.

The idea is simple: fast, keyboard-driven planning with a Kanban workflow, and all data stored locally. No accounts, no sync servers, no hidden state.

Core structure:
Workspace -> Board -> Ticket

What it currently does:

  • Kanban boards (Todo / In Progress / Done)
  • Ticket editing with comments
  • Command palette + keyboard shortcuts for most actions
  • Persistent workspaces that restore on startup
  • Local SQLite storage (your data lives in your workspace folder)

Key decisions:

  • Local-first by default (works fully offline)
  • No forced cloud or SaaS model
  • Data is transparent and portable
  • Git-friendly direction for teams that want versioned project state

Tech stack:

  • Tauri (desktop shell)
  • SvelteKit + TypeScript
  • Bun
  • SQLite

Architecture is straightforward:
UI -> hooks -> repository layer -> SQLite

What I’m trying to avoid:

  • Turning it into another bloated PM suite
  • Locking users into a hosted backend
  • Hiding data behind APIs or proprietary formats

Planned direction:

  • Better filtering and board views
  • Richer ticket metadata
  • Search across workspaces
  • Backup/export workflows
  • Git-native workflows for teams

This is still early, but already usable for small projects.

I’m interested in feedback from people who:

  • Prefer local-first tools
  • Are tired of Jira/Linear-style overhead
  • Care about data ownership

Repo: https://github.com/regisx001/Worklog

Latest Release : https://github.com/regisx001/Worklog/releases/tag/app-v0.3.0

don't forget to give a star in github !!!

u/regisx001 — 2 months ago