Aligned or Just Agreeable? Why Accuracy Is a Terrible Proxy for AI–Human Alignment
Accuracy is comforting because it gives us a number. The model predicted the right label. The chatbot chose the same option as the survey respondent. The simulated customer picked the same product. Everyone claps, someone updates a dashboard, and the alignment problem is declared mostly solved. Unfortunately, decision-making is where accuracy goes to look respectable while quietly doing very little. ...