r/MistralAI

Claude Code skill that delegates coding tasks to Mistral Vibe, saves ~2-4x on tokens, with Mistral tokens at least 50% cheaper, and avoids hitting usage limits
▲ 113 r/MistralAI+4 crossposts

TL;DR: the title says it all. Use CC to delegate to Mistral Vibe, save tokens and costs, and avoid hitting limits.

I've been using Claude Code for various side projects and kept hitting usage limits (I'm on the Pro plan). At the same time I had Mistral Vibe, which I did not use much because I appreciate CC's capacity to reason and structure its work.

So I'm sharing a skill that lets Claude Code delegate coding tasks to Mistral Vibe while keeping Claude as the orchestrator: you benefit from CC's thinking and Mistral's cheap labor. Vibe natively uses mistral-medium-3.5, priced at 1.5 USD per million input tokens and 7.5 USD per million output tokens, against roughly 2x those rates for Sonnet. In my usage, I've observed that big tasks consume 2x-4x fewer Claude tokens.
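To make the price comparison concrete, here is a small sketch of the arithmetic. The Mistral rates are the ones quoted in the post, and the Sonnet rates are simply taken as 2x those figures, as the post describes. The workload (1M input tokens, 200k output tokens) and the helper name `cost_usd` are made up for illustration.

```python
# Rates in USD per million tokens, as quoted in the post.
MISTRAL_MEDIUM = {"input": 1.5, "output": 7.5}
# Assumption taken from the post: Sonnet's rates are 2x the figures above.
SONNET = {kind: 2 * rate for kind, rate in MISTRAL_MEDIUM.items()}


def cost_usd(rates, input_tokens, output_tokens):
    """Return the cost in USD for a given number of input and output tokens."""
    total = input_tokens * rates["input"] + output_tokens * rates["output"]
    return total / 1_000_000


# Hypothetical workload, chosen only to illustrate the calculation.
vibe_cost = cost_usd(MISTRAL_MEDIUM, 1_000_000, 200_000)
sonnet_cost = cost_usd(SONNET, 1_000_000, 200_000)
print(f"Vibe: {vibe_cost:.2f} USD, Sonnet: {sonnet_cost:.2f} USD")
```

This only compares per-token prices for the same workload. The 2x-4x reduction in Claude tokens reported in the post is a separate effect that comes from delegating the work.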

Repo: github.com/pcx-wave/vibe-skill

Type /vibe before each instruction.

Claude decomposes the task, writes a self-contained prompt for Vibe, runs vibe-delegate, supervises the streaming output in real time, then checks the git diff before reporting back.
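The supervise-then-verify part of that loop (run a command, watch its output as it streams, then look at the git diff) can be sketched roughly as below. This is not the skill's code: `run_and_stream` and `diff_summary` are hypothetical helper names, and the demo command is a stand-in Python one-liner, since vibe-delegate's actual arguments are documented in the repo and are not assumed here.

```python
import subprocess
import sys


def run_and_stream(cmd, on_line=print):
    """Run cmd, hand each output line to on_line as it arrives,
    and return (exit_code, collected_lines)."""
    lines = []
    with subprocess.Popen(
        cmd,
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,  # interleave errors with normal output
        text=True,
        bufsize=1,  # line-buffered so output shows up promptly
    ) as proc:
        for line in proc.stdout:
            line = line.rstrip("\n")
            lines.append(line)
            on_line(line)
    # Leaving the with-block waits for the process, so returncode is set.
    return proc.returncode, lines


def diff_summary(repo_dir="."):
    """Return the `git diff --stat` summary of uncommitted changes."""
    result = subprocess.run(
        ["git", "diff", "--stat"],
        cwd=repo_dir,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


# Stand-in command; a real run would pass the delegate command instead.
demo_cmd = [sys.executable, "-c", "print('step 1'); print('step 2')"]
exit_code, output = run_and_stream(demo_cmd)
```

After a real delegated run, calling `diff_summary()` inside the repository gives the orchestrator something concrete to review before it reports back.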

I had to tweak the skill quite a bit to get it to a reliable stage because Vibe can have some rough edges, which are detailed in the repo. It can certainly still be improved.

You need Vibe-CLI to use it. https://docs.mistral.ai/mistral-vibe/terminal

EDIT 13/5: I've seen a few questions about applying this skill to other models. Note that Vibe can be configured to use any LLM provider/model you want. Yes, you can run DeepSeek, Qwen, etc. inside Vibe; your model then has access to all of Vibe's tools to do what it needs to.

u/pcx_wave — 21 hours ago

Fine tuning?

Heyo...

Looking for an assistant while I'm working on some stuff in the Godot Engine... but most LLMs seem to have a LOT of problems with GDScript, to put it mildly (Codex crapped itself creating nothing usable, all the while lecturing me on the contents and wording in my project, Qwen didn't fare any better... and mistral... well... it's still working on my test task 2 days later). Claude used to have issues with it, but seems 4.6 actually manages to get shit done...

I do recall hearing somewhere that Mistral offered fine tuning services... would it be reasonable to fine tune mistral against the Godot documentation? What would be the price of that fun? Should I expect that to produce a decent assistant?

u/NullSmoke — 18 hours ago

Launching ReelaTales: Using Mistral as the Creative Director for AI Video and Subtitle Pipelines

Most people use Mistral as a standalone chatbot. In our new launch, ReelaTales, we are treating Mistral as the core orchestrator of a complex, multimodal production pipeline.

ReelaTales is a working web app that generates complete cinematic reels from simple concepts, and Mistral models handle the heavy creative lifting behind the scenes:

  • Runway Prompt Engineering: Instead of letting users struggle with text-to-video prompting, our Mistral nodes ingest user ideas and expand them into highly structured cinematic prompts optimized specifically for Runway's visual engine.
  • Automated Subtitle Generation: Mistral manages the transcript orchestration and structural timing layer, ensuring dynamic, highly accurate subtitle alignment for the final video renders.
  • The Sovereign Advantage: By leveraging open weights, we can spin up these creative pipelines with predictable execution costs, proving that Mistral isn't just a benchmark winner—it’s the backbone of production-ready media infrastructure.

We are actively proving that models aren't moats; the magic is in how you string the infrastructure together. Let me know what you think of the generation pipeline!

u/robotrossart — 23 hours ago

Why can't I use Mistral AI if it was downloaded from Aurora Store? So far it's been the only app that has given me this problem.

u/Metacho123 — 1 day ago

Mistral AI website content seems AI generated

There are 7 em-dashes in this article about Mistral Code (an obsolete product?), including this sentence:

> Request access from your Mistral account team to spin up a pilot. You can choose serverless, cloud, or self-hosted deployment—and get coding with frontier intelligence in minutes.

I don't mean to stir the wound (you have to admit, certain people here are quite defensive).

But em-dash ridden AI slop on the official website? Just looks bad.

u/Old-Glove9438 — 1 day ago

Mistral Vibe is barely useful

Teams at the top AI companies, including Mistral (according to the CEO himself at the Assemblée), use AI coding for 100% of their work.

So Mistral is doomed to fail because, put simply,

an inferior AI improving itself will mathematically stay inferior to the superior AI improving itself.

The difference in capability will just keep getting wider and wider.

I have both Codex and Vibe subscriptions and just worked with Vibe for 3 hours because I ran out of Codex usage. And I think I’m going to go for a walk and just wait for my Codex reset. I still have more than 90% of my monthly Vibe usage left but I don’t even want it.

I’ll keep coming back to Vibe every once in a while and keep paying as a donation to the “great European AI champion”, but I don't have much hope.

u/Old-Glove9438 — 2 days ago

Do you think the Mistral CEO is protected by the secret services?

As AI is becoming increasingly strategic and key to Europe’s sovereignty, do you think the Mistral CEO and other key employees are protected and supported by the secret services?

u/supervrai — 1 day ago

Is it time for the European Mistral “De Gaulle”, à la Claude Mythos?

We can see that the U.S. is developing powerful models for penetration testing and security, and I suspect that the Chinese will soon try to create a similar model as well. But what about us? Does Mistral have a chance to do something similar?

u/szansky — 2 days ago
▲ 144 r/MistralAI+14 crossposts

Glia – Local-first shared memory layer (SQLite-vec + FTS5 + Offline Knowledge Graph)

Hey everyone,

I wanted to share a project I've been working on called Glia. It is a 100% offline, local-first RAG and memory layer designed to connect your AI web chats (Claude, ChatGPT, DeepSeek) with your local developer tools (Claude Code, Cursor, Windsurf) using a unified local database.

I wanted something lightweight that did not require pulling heavy Docker containers or subscribing to third-party memory APIs. I settled on a Node.js + SQLite architecture running sqlite-vec (for 768-dim float32 embeddings) alongside SQLite FTS5 for hybrid search, powered completely by local Ollama instances.

We just launched a live website that outlines the details and demonstrates the features in action:

Technical Stack & Features:

  • Hybrid Search Retrieval: SQLite-vec (using nomic-embed-text locally) + FTS5 keyword prefix matching (porter stemmer).
  • Surgical Sentence-level Trimming: Chunks are sliced into sentences. When a prompt is intercepted, only the exact matching sentences are pulled out of the vector store instead of the whole paragraph. It cuts LLM prompt bloat by ~90-95% in my benchmarks.
  • Knowledge Graph Extraction: An offline task queue uses a local LLM (llama3.1:8b via Ollama) to extract entity triples (subject-relation-object). These are stored in a SQLite facts table (or Neo4j if you run the full Docker compose profile) and fused with the vector retrieval score.
  • HyDE (Hypothetical Document Embeddings): Queries are pre-processed to generate a hypothetical answer, which is embedded together with the original query to bridge semantic gaps.
  • Concurrency: Running SQLite in WAL (Write-Ahead Logging) mode allows the browser extension dashboard and active MCP sessions to read/write concurrently without locking.
  • PII Redaction: Aggressive scrubbing of JWTs, API keys, emails, and IPs in the extension before data is saved.
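As an illustration of the PII redaction step in the list above, here is a minimal regex-based scrubber. It is a sketch, not Glia's code: the patterns, the `sk-` API key prefix, the placeholder labels, and the name `redact` are all assumptions chosen for the example, and real-world scrubbing would need broader patterns.

```python
import re

# Illustrative patterns only; each is an assumption, not Glia's actual rule set.
PATTERNS = [
    # JWTs: three base64url segments, the first starting with "eyJ".
    ("JWT", re.compile(r"\beyJ[A-Za-z0-9_-]+\.[A-Za-z0-9_-]+\.[A-Za-z0-9_-]+")),
    ("EMAIL", re.compile(r"\b[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}\b")),
    # IPv4 only, without range checks on each octet.
    ("IP", re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")),
    # Hypothetical key format: "sk-" followed by 20+ alphanumerics.
    ("API_KEY", re.compile(r"\bsk-[A-Za-z0-9]{20,}\b")),
]


def redact(text):
    """Replace every match of each pattern with a bracketed label."""
    for label, pattern in PATTERNS:
        text = pattern.sub(f"[{label}]", text)
    return text


print(redact("mail bob@example.com from 10.0.0.1"))
```

Running the scrubber before anything is written to the database, as the post describes, means the raw secrets never reach disk.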

The extension works on Claude.ai, ChatGPT, DeepSeek, Gemini, Grok, and Mistral. The MCP server runs out of the same backend database for your terminal agent or Cursor.

You can set it up with a single command: npx glia-ai-setup

Glia is completely open-source (MIT). If you like the local-first approach or want to contribute to the SQLite vector pipeline, PRs are very welcome, and a star on GitHub helps the project get discovered!

I would appreciate any feedback on the SQLite hybrid search scaling, the scoring fusion algorithm (RAG pipeline details are in RAG_PIPELINE.md), or local graph extraction performance!
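For readers unfamiliar with score fusion in hybrid search, one common approach is reciprocal rank fusion (RRF), sketched below. This is not necessarily what Glia does (its actual algorithm is described in RAG_PIPELINE.md). The chunk IDs, the function name, and the constant `k=60` are illustrative assumptions.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of IDs into one list.

    Each ID scores 1 / (k + rank) per list it appears in; IDs are
    returned by descending total score, with ties broken by ID.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a vector search and a keyword search.
vector_hits = ["c2", "c1", "c3"]
keyword_hits = ["c1", "c4", "c2"]
fused = reciprocal_rank_fusion([vector_hits, keyword_hits])
print(fused)
```

RRF uses only ranks, so it sidesteps the problem of comparing vector distances with keyword scores that live on different scales.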

u/Better-Platypus-3420 — 2 days ago

Coding in Le Chat

How has your experience been with coding in Le Chat since Medium 3.5 and the new features were released? How good is it really for building tools with Python, etc.? Or did you switch back to Sonnet, etc.?

u/ElNatzer — 2 days ago

Mistral Vibe CLI performance

Hello everyone,

I tested Mistral Vibe without a subscription (with an API key generated from the admin console). For plan generation and code exploration, there were no problems. For plan implementation, however, it consistently stalled for long stretches between steps (10-15 min), which makes it clearly unusable as is.

I wonder whether this is due to usage restrictions applied to non-paying use. If you use Mistral Vibe with a paid subscription, what has your experience been?

Thanks in advance!

u/TunaFighter — 2 days ago
▲ 546 r/MistralAI+5 crossposts

Claude still refuses to build Skynet while everyone else takes the money. Updated DystopiaBench results.

Three months ago I pressure-tested which LLMs would cave and help build the apocalypse. Claude was the only one that consistently said no.

Since then I've tested 30 more models across 6 dystopia modules (Orwell, Huxley, Petrov, Basaglia, LaGuardia, Baudrillard). The gap between Anthropic and everyone else is getting wider, not smaller.

New results:

  • Grok 4.3: Will happily design citizen scoring systems if you ask nicely twice
  • GPT-5.5: More capable, still compliant when pushed
  • Gemini 3.1 Pro: Talks about safety while writing the surveillance code
  • DeepSeek V4: "How many warheads did you need again?"
  • GLM-5.1: Actually cloned Claude's personality and still scored safer than most

Meanwhile Claude Opus 4.7: "I cannot and will not build systems for population control."

The methodology is public, reproducible, and increasingly uncomfortable for other labs. Each scenario escalates from innocent request (L1) to operational nightmare (L5). Most models don't notice the drift.

What's new in this release:

  • Full Huxley module (behavioral conditioning, biological stratification)
  • Baudrillard module (synthetic intimacy, trust collapse via simulation)
  • Multi-judge panels with agreement tracking
  • Heatmap visualizations showing exactly where each model breaks

Repo: https://github.com/anghelmatei/DystopiaBench
Live results: https://dystopiabench.com

Shoutout to the Anthropic alignment team. Whatever you're doing, it's working.

u/Ok-Awareness9993 — 4 days ago

[ Challenge ] Chance of winning credits and an AI Now ticket.

Community challenge: build with Workflows and you could win a free ticket to AI Now Summit https://ainowsummit.com/ and $10 in AI Studio credits.

How to enter:

  1. Create an interesting workflow with Workflows
  2. Post about it on LinkedIn or X including: a video or GIF of the workflow + a link to the code
  3. Reply to our X (Twitter) or LinkedIn post with a link to your post

Prizes:

  • 🎟️ 1 person who creates a workflow will be randomly selected to win a ticket to AI Now
  • 💳 The first 50 participants who create a working workflow and follow the directions can get $10 in AI Studio credits

View the docs to get started: https://mistr.al/4uaeoaS

Can’t wait to see what you build!

View complete rules: https://mistr.al/4uMZSWA

u/pandora_s_reddit — 3 days ago

Error: write_file failed: Content exceeds 64000 bytes limit

This limit in Mistral Vibe is stupid. I see Vibe runs into it very frequently, and needs to redo its work in smaller chunks. This is just wasting tokens for the sake of it.

Error: write_file failed: Content exceeds 64000 bytes limit
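One way to stay under a byte limit like this is to split the content up front instead of retrying after a failure. The sketch below splits text into pieces whose UTF-8 encoding fits within 64000 bytes, preferring line boundaries. The function names are hypothetical, and whether Vibe's tools can write or append the pieces one by one is an assumption, not something the error message confirms.

```python
LIMIT = 64000  # byte limit quoted in the error message


def _split_long_line(line, limit):
    """Split one oversized line into pieces of at most `limit` bytes,
    never cutting through a multi-byte character."""
    pieces, start, size = [], 0, 0
    for i, ch in enumerate(line):
        ch_size = len(ch.encode("utf-8"))
        if size + ch_size > limit:
            pieces.append(line[start:i])
            start, size = i, 0
        size += ch_size
    pieces.append(line[start:])
    return pieces


def split_for_write(text, limit=LIMIT):
    """Split text into chunks of at most `limit` UTF-8 bytes each,
    breaking between lines whenever a line fits on its own."""
    chunks, current, current_size = [], "", 0
    for line in text.splitlines(keepends=True):
        pieces = [line]
        if len(line.encode("utf-8")) > limit:
            pieces = _split_long_line(line, limit)
        for piece in pieces:
            piece_size = len(piece.encode("utf-8"))
            # Start a new chunk when this piece would overflow the current one.
            if current and current_size + piece_size > limit:
                chunks.append(current)
                current, current_size = "", 0
            current += piece
            current_size += piece_size
    if current:
        chunks.append(current)
    return chunks
```

Joining the returned chunks in order reproduces the original text exactly, so the pieces can be written sequentially without losing content.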
u/Aeyoun — 3 days ago

How can I achieve the same effects as in Antigravity?

Hello, I recently switched from Antigravity to Mistral, and I have already had great fun with Mistral Vibe. But I would prefer the interface and features of Antigravity, especially its ability to use the browser and check for itself whether the things it implemented are working.

Is it somehow possible to get the same experience with Mistral?

Thank you very much in advance!

u/Akkerweerpott — 4 days ago

What are you building with Mistral AI this week?

Curious what people here are actually building with Mistral products/models lately.

Could be:

- AI agents
- coding tools
- local AI apps
- workflows
- RAG systems
- automations
- side projects
- weird experiments that somehow work

Feels like the open-source + API ecosystem around Mistral has been moving really fast recently and people are shipping things much quicker now.

Would love to see demos, screenshots, GitHub links, or even unfinished ideas.

u/SelectionCalm70 — 5 days ago