Cover image

Don’t Self-Sabotage Me Now: Rational Policy Gradients for Sane Multi-Agent Learning

Kitchen work is not hard because chopping onions is metaphysically difficult. It is hard because two people must agree, implicitly and quickly, who gets the onion, who holds the plate, who waits by the pot, and who moves out of the corridor before everyone performs a small culinary traffic accident. That is why Overcooked remains such a useful multi-agent benchmark. It turns coordination into something visible. Agents do not merely need to “perform a task”; they need to infer what another agent is about to do and avoid becoming a sentient obstacle. ...

November 13, 2025 · 14 min · Zelina
Cover image

Noisy but Wise: How Simple Noise Injection Beats Shortcut Learning in Medical AI

X-rays look clinical. To a neural network, they can also look like stationery. A hospital name in the corner. A scanner signature. A compression pattern. A familiar positioning marker. A slightly different way of cropping the lung field. None of these is pneumonia. None of these is COVID-19. Yet a deep learning model trained on small medical datasets can treat them as wonderfully convenient diagnostic evidence, because machines are very good at passing exams and less naturally committed to understanding what the exam is about. ...

November 9, 2025 · 15 min · Zelina