How a Short Word Can Turn Your AI Product into a Legal Nightmare
In ML evaluation, particularly with the LLM-as-a-Judge approach, we frequently fall into the "halo effect" trap. When an AI model's response sounds authoritative and professional, the Judge automatically assigns it a high score, completely missing the actual semantic content.