AI Robustness

TL;DR for operators AI systems do not only fail because they are too small, too dumb, or insufficiently blessed by the gods of scale. They often fail because the formal objective supervises one slice of behavior and quietly leaves another slice unmanaged. Three recent papers make that point from different domains. MA-SBI shows how side-channel context can correct simulation-based inference when the simulator is misspecified.1 A paper on non-adversarial LLM robustness shows that semantically neutral prompt changes can systematically shift internal module outputs, and that targeted debiasing can recover robustness without full retraining.2 FiberTune shows that robot policy fine-tuning can preserve action-equivalent visual residuals that ordinary action loss is happy to compress into oblivion.3 ...

AI Robustness

Mind the Loss Gap

Spurious Minds: How Embedding Regularization Could Fix Bias at Its Roots