Here's a thought...
If you make Kimi2.6 (the only usable "free-tier" model) unusable by throttling....maybe just tell the user the model is overloaded and to try again later. This is better than being in the middle of a workflow and have it unresponsive or "model unreachable."
Once I use my paid quota on bigger items, I then fallback on these lighter models. Recently, though, it has become impossible to use reliably. Absolutely frustrating. Kick these free users off or have them use SWE-1.5. We need to see some consistency here.
I refuse to top up my quota with additional usage because I feel like I'm part of some ridiculous guinea pig experiment called "force them to pay." Guess what? I do pay...but out of principle, I refuse to pay more.