Patch, Don’t Preach: The Coming Era of Modular AI Safety
Opening — Why this matters now The safety race in AI has been running like a software release cycle: long, expensive, and hopelessly behind the bugs. Major model updates arrive every six months, and every interim week feels like a patch Tuesday with no patches. Meanwhile, the risks—bias, toxicity, and jailbreak vulnerabilities—don’t wait politely for version 2.0. ...