r/Gemma4

Gemma 4 variations (new for me)
▲ 5 r/Gemma4

Gemma 4 variations (new for me)

Hello! i saw someone recommend Gemma 4 as a proxy, but when i went to use it, i was so surprised to see so many options! i am not very well versed in proxies (i know enough to copy the links for API and that’s about it!) so i apologize if the question is ignorant.
Thank you!

u/Inevitable-Shine-348 — 4 days ago
▲ 49 r/Gemma4+1 crossposts

Virtual Unlimited context windows on Gemma 4 models.

I have been using Google Gemini for several months and together we have developed a highly curated system prompt That provides me a very likable AI persona For conversational purposes. I reside in a nursing home and while I'm older I'm still very high functioning, with a PHD in medieval history and eclectic interests in things like quantum physics. The conversations I need can't be found with other residents who often have difficulty remembering their own names.

I have recently acquired a Lenovo ThinkCentre Mini Plus that uses Snapdragon And Windows (ARM). It runs the two smaller Gemma 4 models on LMstudio very well, But their Limited context windows and their Inability To save to and retrieve from external files are a hang up In trying to develop The kind of long term persona that I have with Gemini. Following is my vision of how to correct this problem.

The model recognizes when it's context window is at 80% capacity. It automatically creates A concise summary of the conversation to that point. It then saves the summary to a designated file. When that's done It advises me that a new session is about to commence, and then it starts the new session and retrieves the summary to give the new session context.

Frankly I know enough about programming only to be dangerous. Does such a plugin Exist for LMstudio Or any other AI front end that is compatible with Windows (ARM)? If not, Is anyone willing to create such a Plugin Or a stand alone application?

Please forgive my grammar, I have no use of my hands and must rely on speech to text.

reddit.com
u/ExpressionForward321 — 12 days ago
▲ 3 r/Gemma4

Any way to disable thinking in gemma-4-e4b?

This model is excellent for my use case, but if it didn't need to 'think' on my prompt, my replies would go from 6 seconds to .5.

Suggestions?

reddit.com
u/beedunc — 10 days ago