u/robotrossart

Aesop Fables from 1707 adapted to Reels format: Fables on the price of vanity

u/robotrossart — 24 hours ago

Beyond Chatbots: Building a Multi-Agent E-Commerce Backend for Automated Reels Generation

Most agentic pipelines are playground demos. For our launch of ReelaTales (reelatales.robotross.art), we wanted to build a hardened, transaction-driven agent infrastructure.

The problem we are trying to solve: helping authors generate Reels of their work without having to dig deep into tooling and prompt writing.

We’ve mapped out our complete engineering architecture to show how we productized the fleet, from customer checkout to asset delivery:

  • The Transaction Boundary: We don't trigger agents on arbitrary user inputs. A custom Shopify customer order fires a secure webhook directly to our DigitalOcean Server, dropping the job payload into a deterministic Order Queue.
  • Language-Optimized Orchestration: The core routing engine splits the workload across three specialized model nodes depending on the target locale. Claude Sonnet owns English, Mistral Large handles French, and our local Apertus node anchors the hyper-regional Swiss German and Rumantsch generations.
  • Downstream Generation & Stitching: These coordinator agents don't just write text; they engineer structured prompts for the Runway API and drive ElevenLabs for professional narration, passing the raw multimedia assets to an automated FFmpeg Assembly microservice for final stitching.
  • Autonomous Distribution: The loop finishes with an automated upload worker pushing directly to YouTube, returning a valid Proof of Work URL straight to the customer ecosystem.
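
For anyone curious what the transaction boundary could look like in code, here is a minimal sketch, not our production handler. It assumes the webhook is authenticated the way Shopify signs webhooks (a base64 HMAC-SHA256 of the raw body, sent in the X-Shopify-Hmac-Sha256 header). The names `order_queue`, `verify_shopify_hmac` and `handle_order_webhook` are hypothetical, and the in-memory queue stands in for whatever durable queue you actually run.

```python
import base64
import hashlib
import hmac
import json
import queue

# Hypothetical in-process queue; a real deployment would likely use a durable store.
order_queue: "queue.Queue[dict]" = queue.Queue()


def verify_shopify_hmac(raw_body: bytes, header_hmac: str, secret: str) -> bool:
    """Compare the signature header against an HMAC-SHA256 of the raw body."""
    digest = hmac.new(secret.encode("utf-8"), raw_body, hashlib.sha256).digest()
    expected = base64.b64encode(digest)
    # Constant-time comparison on bytes avoids leaking timing information.
    return hmac.compare_digest(expected, header_hmac.encode("utf-8"))


def handle_order_webhook(raw_body: bytes, header_hmac: str, secret: str) -> bool:
    """Enqueue the order only if the signature checks out; return True if accepted."""
    if not verify_shopify_hmac(raw_body, header_hmac, secret):
        return False
    order = json.loads(raw_body)
    order_queue.put({"order_id": order["id"], "payload": order})
    return True
```

The point of the design is that nothing reaches an agent unless it arrived as a signed order, so the queue only ever holds paid jobs.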

By keeping orchestration firmly in our control plane and treating individual LLMs as modular workers, we can swap models in and out without breaking the transactional backbone of the business.
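
A rough sketch of what "modular workers" can mean in practice, under our own simplifying assumptions: the routing table, the `make_stub` helper and the worker labels are hypothetical, and the stubs stand in for real model clients. Locale codes use `gsw` for Swiss German and `rm` for Rumantsch.

```python
from typing import Callable, Dict

Worker = Callable[[str], str]


def make_stub(name: str) -> Worker:
    """Return a placeholder worker; a real one would call the model's API."""
    def worker(brief: str) -> str:
        return f"[{name}] {brief}"
    return worker


# Hypothetical routing table: language code -> worker.
ROUTES: Dict[str, Worker] = {
    "en": make_stub("claude-sonnet"),
    "fr": make_stub("mistral-large"),
    "gsw": make_stub("apertus-local"),  # Swiss German
    "rm": make_stub("apertus-local"),   # Rumantsch
}


def route(locale: str, brief: str) -> str:
    """Pick a worker from the language part of the locale and run the brief."""
    lang = locale.replace("_", "-").split("-")[0].lower()
    try:
        worker = ROUTES[lang]
    except KeyError:
        raise ValueError(f"no worker registered for locale {locale!r}")
    return worker(brief)
```

Swapping a model is then a one-line change to `ROUTES`; the queue and the rest of the pipeline never see which worker answered.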

The app is live. Check the architecture out and let us know how you handle background payload queueing for multi-modal generations.

u/robotrossart — 24 hours ago

Launching ReelaTales: Using Mistral as the Creative Director for AI Video and Subtitle Pipelines

Most people use Mistral as a standalone chatbot. In our new launch, ReelaTales, we are treating Mistral as the core orchestrator of a complex, multimodal production pipeline.

ReelaTales is a working web app that generates complete cinematic reels from simple concepts, and Mistral models handle the heavy creative lifting behind the scenes:

  • Runway Prompt Engineering: Instead of letting users struggle with text-to-video prompting, our Mistral nodes ingest user ideas and expand them into highly structured cinematic prompts optimized specifically for Runway's visual engine.
  • Automated Subtitle Generation: Mistral manages the transcript orchestration and structural timing layer, ensuring dynamic, highly accurate subtitle alignment for the final video renders.
  • The Sovereign Advantage: By leveraging open weights, we can spin up these creative pipelines with predictable execution costs, proving that Mistral isn't just a benchmark winner—it’s the backbone of production-ready media infrastructure.
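
To make the subtitle timing layer a bit more concrete, here is a small sketch of the last step only: turning timed segments into subtitle text. It assumes the output format is SRT, which this post does not actually specify, and the `Segment` shape and function names are hypothetical.

```python
from typing import Iterable, Tuple

Segment = Tuple[float, float, str]  # (start_seconds, end_seconds, text)


def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: Iterable[Segment]) -> str:
    """Render numbered SRT cues, rejecting segments with non-positive duration."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        if end <= start:
            raise ValueError(f"segment {index} does not end after it starts")
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text.strip()}"
        )
    return "\n\n".join(blocks) + "\n"
```

For example, `to_srt([(0.0, 1.5, "Hello"), (1.5, 3.0, "World")])` yields two cues, the first running from `00:00:00,000` to `00:00:01,500`.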

We are actively proving that models aren't moats; the magic is in how you string the infrastructure together. Let me know what you think of the generation pipeline!

u/robotrossart — 1 day ago

Why the new Claude Agent SDK credits prove the need for Hybrid Orchestration

Anthropic just announced that programmatic Agent SDK usage will soon move to a credit-based model ($100/month for Max subscribers). This is a massive wake-up call: if your agents are 100% cloud-dependent, your "predictable budget" is now at the mercy of provider policy.

In Flotilla v0.5.0, we’ve engineered a way out of the "Credit Trap" through a Zero-Limit Hybrid Architecture:
- The Local Safety Valve: Instead of burning your $100 Claude credit on simple implementation tasks, Flotilla automatically routes high-frequency work to our Local Realm (Gemma4/Voxtral running on an M4 Mac Mini).
- Preserving Subscription Limits: We save Claude’s reasoning for the "Big Sister" roles (Architecting and Code Review), ensuring your subscription limits stay reserved for interactive work rather than being drained by background automation.
- Orchestration Sovereignty: Flotilla doesn't rely on the Claude Agent SDK for coordination. We use a custom state machine and a local Operational Ledger, meaning you own the "Brain" of the fleet, regardless of how cloud providers change their terms.
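
To illustrate the state machine plus local ledger idea, here is a minimal sketch. It is not Flotilla's actual code: the state names, the transition table, the JSON Lines file format and the `Ledger` class are all assumptions made for the example.

```python
import json
from pathlib import Path
from typing import Dict, Optional

# Hypothetical task lifecycle; each state maps to the states it may move to.
TRANSITIONS = {
    "queued": {"assigned"},
    "assigned": {"in_progress", "queued"},
    "in_progress": {"in_review", "failed"},
    "in_review": {"done", "in_progress"},
    "failed": {"queued"},
    "done": set(),
}


class Ledger:
    """Append-only local log of task transitions (one JSON object per line)."""

    def __init__(self, path: Path) -> None:
        self.path = path
        self.state: Dict[str, str] = {}

    def transition(self, task_id: str, new_state: str, agent: str) -> None:
        current: Optional[str] = self.state.get(task_id)
        # A task that has never been seen may only enter as "queued".
        allowed = {"queued"} if current is None else TRANSITIONS[current]
        if new_state not in allowed:
            raise ValueError(f"{task_id}: {current} -> {new_state} is not allowed")
        entry = {"task": task_id, "from": current, "to": new_state, "agent": agent}
        with self.path.open("a", encoding="utf-8") as fh:
            fh.write(json.dumps(entry) + "\n")
        self.state[task_id] = new_state

    @classmethod
    def replay(cls, path: Path) -> "Ledger":
        """Rebuild the in-memory state from the file, e.g. after a restart."""
        ledger = cls(path)
        if path.exists():
            for line in path.read_text(encoding="utf-8").splitlines():
                entry = json.loads(line)
                ledger.state[entry["task"]] = entry["to"]
        return ledger
```

Because the coordination state lives in a plain local file, any agent (cloud or local) can be dropped or replaced and the fleet's state can still be rebuilt with `Ledger.replay`.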

Don't let your fleet's uptime depend on a monthly claim flow. Build on a stack that treats local compute as the foundation, not the exception.

u/robotrossart — 7 days ago

7 Days of 24/7 Agent Operations on the M4 Mac Mini coordinated with Flotilla

We’ve been running our agentic fleet 24/7, and the M4 Mac Mini has officially become the heart of the operation.

The Strategy: Cloud for Thinking, M4 for Doing

We realized that asking cloud models (Claude/Gemini) to handle every minor implementation task was an expensive waste of reasoning depth and token budgets.

How we use the M4:

  • The Local Executor: We onboarded Gemma4 as a fully local agent running via aichat. She handles the straightforward coding, commits, and implementation tasks.
  • Token Conservation: By offloading these high-frequency tasks to the M4, we save our expensive cloud model "seats" for what they are actually for: architectural review and complex logic.
  • Zero Network Latency: Because Gemma runs on-device, implementation tasks start without any network round-trip. This keeps the "Local Realm" active even when our cloud subscriptions hit their monthly limits.
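
The split above boils down to a small routing rule. Here is a sketch under stated assumptions: the task kinds, the `CloudBudget` counter and `pick_executor` are hypothetical names, and a simple call counter stands in for whatever quota the cloud subscription really enforces.

```python
from dataclasses import dataclass

# Hypothetical task kinds that are worth spending cloud reasoning on.
CLOUD_ROLES = {"architecture", "code_review"}


@dataclass
class CloudBudget:
    """Stand-in for a subscription quota, counted in calls."""
    remaining_calls: int

    def try_spend(self) -> bool:
        if self.remaining_calls <= 0:
            return False
        self.remaining_calls -= 1
        return True


def pick_executor(task_kind: str, budget: CloudBudget) -> str:
    """Send review-level work to the cloud while quota lasts, everything else local."""
    if task_kind in CLOUD_ROLES and budget.try_spend():
        return "cloud"
    return "local"
```

Routine tasks never touch the budget, and once it runs out even review work falls back to the local node, so the fleet keeps moving.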

The 7-Day Reality:

Check the attached Shift Timeline. You can see exactly where we onboarded Gemma in early April. While the cloud agents (red bars) occasionally go dark to preserve their quotas, the local M4 node provides a continuous foundation for the fleet.

If you have an M4 on your desk, you have a data-sovereign factory floor that never hits a billing limit.

https://github.com/UrsushoribilisMusic/agentic-fleet-hub

u/robotrossart — 1 month ago