u/GuaranteePurple4468

Gemma 4 31B issues with reasoning and completion API question

Gemma 4 31B issues with reasoning and completion API question

So I'm struggling to wrap my head around this and hoping someone smarter than me can help.

I am trying to get reasoning enabled in Gemma 4 31B (eg: Mero-Artemis-31B-v0.3.1 ), but can't seem to figure out **how** to do that.

I am loading the model using KoboldCPP, with the latest SillyTavern update, and I have it set in my Chat Completion template to Request Reasoning, and have Reasoning Effort set to "High", but the model simply does not output any reasoning at all (nothing hidden in the KoboldCPP terminal either, it just is not thinking at all).

Do I have to use Text Completion instead?
And if so, where do I actually add my prompt/preset details? I cannot see anywhere in Text Completion to add my preferred wall of text.

u/GuaranteePurple4468 — 4 days ago

I've seen a few people mention you can set your hardware on Huggingface and it can tell you what models you can run, but for the life of me I cannot find where to do that.

Could someone be kind enough to point me in the right direction?

reddit.com
u/GuaranteePurple4468 — 18 days ago

Anyone have experience with Koboldcpp and troubleshooting it? I can't find any logs so no idea why this is happening.

I have a 16gb Amd Radeon RX 6800 with 80gb desktop memory.

The steps I have done:

  1. Downloaded koboldcpp-1.104 from the YellowRose Rocm github.
  2. Downloaded a model from huggingface (SuperGemma4-31b-abliterated.Q4_K_M.gguf) 17.4gb.
  3. Opened the Koboldcpp exe file and left it on default settings, selected the model and clicked launch.

The result is... nothing.
The exe just closes, then... nothing.
No errors, nothing I can see in the background in task manager, just... nothing.

Tried messing around with context sizes etc but seems like they all do the same.

reddit.com
u/GuaranteePurple4468 — 24 days ago