
HuggingFace benchmark datasets now let you filter by model size
Quite useful to see which model under 32B performs best on swebenchverified for example.
https://huggingface.co/datasets?benchmark=benchmark:official&sort=trending
u/paf1138 — 2 days ago