Those final checker/evaluator models when you have to create a split between models
I was just on a project where I had to create a split between two models and rate them. Then I had to have another model evaluate my rating.
Do those checker/evaluator models have to agree 100% with what you wrote? I'm not talking about ignoring a negative response because you're sure of your rating. I mean when the checker's response is "I agree on this but not on this." If there's a green check at the bottom anyway, does that mean the model agreed enough to submit the task?
I got a response like that and figured I should adjust my rating to take the checker's comments into account. What followed were several regenerations with the checker's response getting more and more negative each time I tried to adjust my rating to take the checker's comments into account. The few times it was positive, it still said part of my response was inaccurate. I ended up using the escape hatch.