Ollama has Quantized LLMs now?
So, after the fiasco of reducing the limit of all LLM to 1/4-1/5 th of what it used to be.
It seems they have now switched to quantized models of LLM silently.
The models are so dumb now since last 3-4 days.
Now Deepseek v4 models (pro and flash), both needs to be instructed 3x-4x times before they do anything right at all.
Anyone else feeling it?
u/Growth2day — 15 days ago