u/Emergency-Device2599

DeepSeek-R1 Production-Ready Guide: Ollama + Open WebUI with Docker Compose (Nginx & GPU Passthrough)

https://preview.redd.it/ukevg03e912h1.jpg?width=1672&format=pjpg&auto=webp&s=076cc4390d9181c83e27e8e89b0adb38bad0ba51

Hey everyone, I just put together a comprehensive, production-oriented guide on deploying DeepSeek-R1 locally. Instead of the usual "one-click install" scripts, it focuses on building a secure, scalable private AI setup for small teams or enterprise environments.

Key Technical Highlights:

  • Separated Containers: Using docker-compose to run Ollama and Open WebUI independently, making future upgrades seamless.
  • Hardware Blueprint: VRAM and system RAM requirements mapped to the 8B, 14B, 32B, and 70B models.
  • Security Hardening: Binding Ollama’s API port (11434) to localhost (127.0.0.1) only, so it's never exposed to the public internet.
  • Reverse Proxying: Nginx configuration optimized with WebSocket support (critical for Open WebUI streaming responses) and Let’s Encrypt SSL.
  • NVIDIA Passthrough: Quick setup guide for NVIDIA Container Toolkit on Ubuntu 22.04/24.04.
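
For anyone who wants a feel for the separated-container and localhost-only ideas before clicking through, here is a rough Python sketch. It is not the config from the full guide. It builds a Compose-style mapping for the two services and flags any port that is not pinned to 127.0.0.1. The image names, the Open WebUI host port (3000), the volume paths, and the helper names `build_compose` and `exposed_ports` are my own assumptions for illustration, so check them against the upstream docs before relying on them.

```python
import json


def build_compose(ollama_port: int = 11434, webui_port: int = 3000) -> dict:
    """Return a Compose-style mapping with Ollama and Open WebUI as separate services.

    Image names, container paths, and the default web UI host port are
    assumptions for this sketch, not values taken from the linked guide.
    """
    return {
        "services": {
            "ollama": {
                "image": "ollama/ollama",
                # Publish the API on loopback only, so it is unreachable from outside the host.
                "ports": [f"127.0.0.1:{ollama_port}:11434"],
                "volumes": ["ollama:/root/.ollama"],
                # GPU reservation; this needs NVIDIA Container Toolkit on the host.
                "deploy": {
                    "resources": {
                        "reservations": {
                            "devices": [
                                {"driver": "nvidia", "count": "all", "capabilities": ["gpu"]}
                            ]
                        }
                    }
                },
            },
            "open-webui": {
                "image": "ghcr.io/open-webui/open-webui:main",
                # Also loopback only; the reverse proxy is the single public entry point.
                "ports": [f"127.0.0.1:{webui_port}:8080"],
                # Reach Ollama over the Compose network by service name.
                "environment": {"OLLAMA_BASE_URL": "http://ollama:11434"},
                "volumes": ["open-webui:/app/backend/data"],
                "depends_on": ["ollama"],
            },
        },
        "volumes": {"ollama": {}, "open-webui": {}},
    }


def exposed_ports(compose: dict) -> list[str]:
    """List every published port mapping that is not pinned to 127.0.0.1."""
    bad = []
    for name, service in compose.get("services", {}).items():
        for mapping in service.get("ports", []):
            if not str(mapping).startswith("127.0.0.1:"):
                bad.append(f"{name}: {mapping}")
    return bad


if __name__ == "__main__":
    compose = build_compose()
    leaks = exposed_ports(compose)
    if leaks:
        raise SystemExit(f"Ports reachable from outside the host: {leaks}")
    # JSON is a subset of YAML, so this output should be usable as a Compose file.
    print(json.dumps(compose, indent=2))
```

Keeping both services on loopback means Nginx is the only thing listening publicly, and the `exposed_ports` check is easy to drop into CI so a later edit like `"11434:11434"` gets caught before deploy.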

If you're planning to migrate from OpenAI/Anthropic APIs to an on-premise solution for privacy or cost predictability, check out the full breakdown here: 🔗 https://www.taki.com.tw/blog/deepseek-local-deployment-ollama-open-webui/

Would love to hear your thoughts on how you handle user authentication and RAG scaling with this specific stack!

#DeepSeekR1 #Ollama #OpenWebUI #DockerCompose #SelfHosted #NvidiaGPU #LocalLLM #Nginx

u/Emergency-Device2599 — 4 days ago