Talk Less, Coordinate More: MARL Meets the Real World
A warehouse robot fleet does not fail because one robot forgot how to move. It fails because three robots each saw a slightly different world, one message arrived late, another was dropped, and the coordination policy confidently optimised against yesterday’s reality. Very modern. Very autonomous. Very expensive. That is the uncomfortable premise behind Robust and Efficient Communication in Multi-Agent Reinforcement Learning, a survey of how multi-agent reinforcement learning, or MARL, behaves when the communication layer is no longer treated as magic plumbing.1 The paper is not presenting a new benchmark champion. Its value is quieter and more useful: it organises a scattered body of work around the communication failures that actually matter in deployed multi-agent systems. ...