u/SveXteZ

Google is focusing on the wrong thing. We don't want faster LLM models, we want more of them

As you may know, in this subreddit it is forbidden to complain from the obvious, so I'm restricted in using some words, otherwise my post will get instantly deleted.

Google talks about how fast the new Flash 3.5 model is. And it is really fast and good, definitely the best model they ever made.
But we never complained from their models being slow, right? As long as the old models worked (and not being constantly bombarded with errors), they were fast enough too. Gemini-cli was an exception, which during peak hours might take you HOURS to respond, but Antigravity didn't have this issue.

I'd rather get 3x slower model, but the same usage as the old models used to have. I don't mind browsing around till my code is being prepared or even review the old changes before the new ones are finished, as it takes me much more time to review the code and to prepare the new prompt, than for the LLM to generate that code.

I lost hope in Google and doubt that they'd listen to us, as they never did.

reddit.com
u/SveXteZ — 2 days ago

The gemini-cli free spins are over too. The new agy-cli shares the same pool as the IDE

Gemini-cli was barely useful, as most requests took over 10 minutes to receive a response, though it occasionally worked quickly and proved very helpful. I knew the free spins would eventually end, and that day has finally arrived.

To make matters worse, Family Sharing has also been discontinued. "Thank you" for this update, Google, even though I never asked for any of these changes.

At least, I hope to finally fix the issue in Antigravity where I would hit 7 days "cooldown" after even a single prompt on the Gemini 3.1 low/pro models.

u/SveXteZ — 3 days ago

I miss the old Gemini Flash 3.0 . It used to be close to unlimited, while with Flash 3.5 I could use no more than 10 prompts

The model is great, but I didn't ask for this at all.

u/SveXteZ — 3 days ago