I was completely off thinking the dead neuron is what really matters, only to find out that as a mere symptom and the actual issue is 'the dead gradient '
u/Crazy-Economist-3091 — 16 days ago
I was completely off thinking the dead neuron is what really matters, only to find out that as a mere symptom and the actual issue is 'the dead gradient '