More Isn’t Smarter: Why Agent Diversity Beats Agent Count
Opening — Why this matters now Multi-agent LLM systems have quietly become the industry’s favorite way to brute-force intelligence. When one model struggles, the instinct is simple: add more agents. Vote harder. Debate longer. Spend more tokens. And yet, performance curves keep telling the same unflattering story: early gains, fast saturation, wasted compute. This paper asks the uncomfortable question most agent frameworks politely ignore: why does scaling stall so quickly—and what actually moves the needle once it does? The answer, it turns out, has less to do with how many agents you run, and more to do with how different they truly are. ...