u/ExtremeScience6342

Hi,

I'm curious how different reasoning levels are enforced at inference time.

Is some extra token passed to the LLM that makes it reason in a certain way, maybe combined with a hard cutoff based on a thinking budget?

Or is there some kind of logit overriding?
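
To make the question concrete, here is a toy Python sketch of the two mechanisms I have in mind. Everything in it is an assumption for illustration: the `<reasoning:...>` control token, the `</think>` marker, and all helper names are hypothetical, and the random `toy_logits` just stands in for a real model's forward pass. I'm not claiming any provider actually works this way.

```python
import math
import random

# Hypothetical toy vocabulary; real models define their own special tokens.
VOCAB = ["<think>", "</think>", "step", "answer", "<eos>"]
END_THINK = VOCAB.index("</think>")


def build_prompt(level, question):
    """Mechanism 1 (assumed): prepend a control token naming the reasoning level."""
    return f"<reasoning:{level}> {question}"


def toy_logits(rng):
    """Stand-in for a model forward pass: one random score per vocab entry."""
    return [rng.uniform(-1.0, 1.0) for _ in VOCAB]


def force_token(logits, token_id):
    """Mechanism 2 (assumed): override logits so only token_id can be chosen."""
    return [0.0 if i == token_id else -math.inf for i in range(len(logits))]


def generate_thinking(budget, seed=0):
    """Greedy-decode a thinking span, forcing </think> once the budget is spent."""
    rng = random.Random(seed)
    out = []
    while True:
        logits = toy_logits(rng)
        if len(out) >= budget:
            # Hard cutoff: the budget is used up, so force the end-of-thinking token.
            logits = force_token(logits, END_THINK)
        token_id = max(range(len(logits)), key=logits.__getitem__)
        out.append(token_id)
        if token_id == END_THINK:
            return out
```

So with `generate_thinking(budget=3)` the thinking span can never be longer than 3 tokens plus the forced `</think>`, no matter what the model "wants" to emit. Is that roughly what happens, or is the level purely learned behavior conditioned on the prompt?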

Thanks!