Backend Engine
Hey for anyone that's built out a backend structure I have a question: I'm requiring some LLM models for compression & aggregation of information. I was looking at Deepseek R1 0528 for my Intent Extraction / Canon Validator / Memory Compression. Seems like it would serve the purpose well, and costs are reasonable.
My questions are:
-Any reason to not let it run the whole behind the scenes...say for diversity, or you had a past experience?
-Is it overkill?
-is the a better cost to performance model out there?
*Moody SciFi RPG Genre
*GLM narration likely (mixed models)
*I will have shadow models set up as a back-up
Thanks 🙏