u/AdPlane8191

Backend Engine

Hey for anyone that's built out a backend structure I have a question: I'm requiring some LLM models for compression & aggregation of information. I was looking at Deepseek R1 0528 for my Intent Extraction / Canon Validator / Memory Compression. Seems like it would serve the purpose well, and costs are reasonable.

My questions are:

-Any reason to not let it run the whole behind the scenes...say for diversity, or you had a past experience?

-Is it overkill?

-is the a better cost to performance model out there?

*Moody SciFi RPG Genre

*GLM narration likely (mixed models)

*I will have shadow models set up as a back-up

Thanks 🙏

reddit.com

u/AdPlane8191 — 7 days ago

▲ 2 r/SillyTavernAI

Mixing LLM's RPG Roleplay

Hey there, curious on people's experience with mixing LLM's in rpg roleplay. I'm trying to build a system of hard guardrails on the backend to guide the vibe, ruleset, and memory recall that two different AI would pull from. The goals is to use a more expensive model for high/mid impact decisions & resolution, while using a lower model for simpler moments. Sonnet 4.6 & Deepseek 3.2 for reference. New to this any help would be appreciated.

reddit.com

u/AdPlane8191 — 13 days ago