How to evaluate models
Hi folks,
I am setting up local AI agents. I don't know the exact term for that but I want to know is there anyway we can evaluate the models like in which domain they are good and how good the are ? Is there any website stating that or it can be figured out using some tests or some kind of score ?