u/Cute_Dragonfruit4738

▲ 10 r/ZaiGLM

Is there a way to estimate how many users have left Z.AI since price change?

Curious to know if there is a way to see usage/users who left based on the inconsistencies in the actual product (Uptime/rate limits etc.) and the overnight bamboozle of more than doubling prices.

Maybe this was an international thing and the Chinese market is propping them, but in my real day-to-day, anecdotally it seems like 80%-90% of people I know in the dev community who used Z.Ai have cancelled their plan.

Anyone know of any potential way, even rough estimate to figure that out? I'm really keen to do a case study on them.

reddit.com
u/Cute_Dragonfruit4738 — 10 days ago
▲ 2 r/ollama+1 crossposts

Hi everyone,

Moving away from GLM and wondering if anyone had an opinion on the best alternative inference provider. I'm looking for coding + agent use. My current stack:

- Claude Pro ($28)- Max out my weekly sessions every time, and have to ration my asks only using sonnet for non-coding activities.
- Z.AI - Pro ($30) - Crossed 1B tokens this past month, so obviously using quite a bit here. This pricing is now more than doubled so will be expire at the end of the week.
- MiniMax Lite - Honestly insane usage for my OpenClaw - will likely keep this.
- Ad-Hoc Deepseek API - When I need to supplement
- ChatGPT Plus ($20) - Got a free month so trying out codex with GPT5.5 - insanely slow which makes sure I dont hit my session limits, but overall seem to be a fan.

Really wondering the usage and capability of Ollama Pro ($20/month - Or Cloud if need be), OpenCode Go ($10/month) or Alibaba Coding Plan ($50/month). Particularly curious about Alibaba Coding plan and if anyone has enjoyed that experience. Also curious to alternative reliable providers.

Open to using different combinations. Looking for best price to intelligence. Z.ai's subscription is 100% out, while Minimax is definitely staying in the stack.

Appreciate everyone's opinion! Ollama Pro vs. OpenCode Go vs AliBaba Coding Plan [D]

reddit.com
u/Cute_Dragonfruit4738 — 25 days ago

Hi Everyone! I'm looking for a competitive swimming club for my six year old daughter. We've seen plenty of swim schools, but she's showing early promise and we want to feed into her potential. If there are any clubs out there that are a little bit more competitive, and provide a more challenging but warm environments for children, would love any suggestions.

Thanks yall!

reddit.com
u/Cute_Dragonfruit4738 — 26 days ago
▲ 22 r/ZaiGLM

https://preview.redd.it/0sj12oor2rxg1.png?width=621&format=png&auto=webp&s=42dee4c561ad8d46332bb9d6d7f8b0d458009801

Crossed 1B tokens used this last month with the GLM5.1! Farewell GLM! Ollama here we come.

Gemini did a quick cost analysis on how much that would cost using their API:

Cost Scenarios

Since you specified "total tokens" without a breakdown, here is the cost for different scenarios:

Scenario Calculation (1,001.88M×Rate) Total Cost
100% Input Tokens ($1.4/1M) $1,001.88 \times \$1.4$ $1,402.63
100% Output Tokens ($4.4/1M) $1,001.88 \times \$4.4$ $4,408.27
100% Cached Input ($0.26/1M) $1,001.88 \times \$0.26$ $260.49

Estimated "Real-World" Cost

In a typical use case (such as a chatbot or coding assistant), a common token distribution is roughly 80% Input and 20% Output. Using your pricing:

  • Input Cost: $801.50 \text{M tokens} \times \$1.4 = \$1,122.10$
  • Output Cost: $200.38 \text{M tokens} \times \$4.4 = \$881.67$
  • Estimated Total: $2,003.77

Summary

  • Minimum Cost: $260.49 (if all tokens are cached inputs)
  • Maximum Cost: $4,408.27 (if all tokens are model outputs)
  • Likely Cost: Approximately $2,000 for a standard mix of input and output.
reddit.com
u/Cute_Dragonfruit4738 — 26 days ago