Risk Management

When Agents Behave: Conformal Policy Control and the Business of Safe Autonomy

Deployment has a boring problem. That is usually where the expensive problems live. A company has an existing model, workflow, or agent policy that is not brilliant but has behaved well enough not to frighten legal, compliance, or operations. Then someone improves it. The new version is more capable, more exploratory, perhaps trained with better preference data or optimized for a sharper reward. It also does things the old version would not have done. ...

The Audit of Autonomy: When AI Agents Need More Than Intelligence

Audit is a boring word until the system being audited can move money, approve a refund, escalate a medical triage queue, book logistics capacity, or quietly call six APIs before breakfast. That is the mood shift around AI agents. The question is no longer whether a model can produce a clever answer. It often can. Congratulations to the stochastic parrot; it has learned to use tools. The harder question is whether an organization can prove, after the fact and preferably before disaster, that the agent acted within its assigned authority. ...

The Reliability Gap: Why Smarter AI Agents Still Fail When It Matters

A customer service agent gets the refund policy right on Monday, wrong on Tuesday, and confidently wrong on Wednesday. A coding agent passes the benchmark, then casually rewrites the wrong file in production. A workflow agent behaves perfectly in a demo, then becomes confused when the API returns the same fields in a different order. ...

From Scattered Event Chats to an AI Event Control Board

A mid-sized event agency moved from human-coordination-heavy planning across chats and spreadsheets to a human-reviewed agentic workflow that centralizes event state, flags exceptions, and keeps vendors, budgets, guests, schedules, and risks aligned.

Survival by Swiss Cheese: Why AI Doom Is a Layered Failure, Not a Single Bet

Risk committees love a single number. Give them a probability, a red-yellow-green dashboard, perhaps a polite heatmap, and everyone can pretend the future has agreed to become a spreadsheet. The trouble with AI existential risk is that the interesting question is not simply whether one dramatic doom story is persuasive. The more useful question is uglier: if humanity survives advanced AI, which layer saved us? ...

When Control Towers Learn to Think: Agentic AI Enters the Supply Chain

Control towers are good at showing managers what the company already knows. That is useful. It is also the problem. Most supply-chain control towers watch direct suppliers, shipments, inventory levels, and predefined thresholds. They are strongest when the relevant data has already been structured and admitted into the system. But many serious disruptions begin elsewhere: a Tier-3 materials supplier, a Tier-4 regional dependency, a geopolitical event buried in a news article, or a supplier relationship nobody remembered until the factory schedule started looking nervous. ...

When Your Dataset Needs a Credit Score

A dataset can look respectable for all the wrong reasons. It may have a familiar name. It may sit on a well-known repository. It may come with a license file, a citation, a download button, and just enough academic polish to make procurement, product, and engineering all feel that the risk has been handled. Wonderful. A PDF said it was fine. What could possibly go wrong? ...

Black Boxes, White Coats: AI Epidemiology and the Art of Governing Without Understanding

A hospital does not need a perfect theory of neural network internals before it can notice that one clinical AI keeps recommending the wrong kind of follow-up. A bank does not need to decode every transformer layer before it can see that a credit assistant behaves oddly around post-bankruptcy applicants. A regulator does not need metaphysics. It needs repeatable measurements. ...

The Ethics of Not Knowing: When Uncertainty Becomes an Obligation

Uncertainty is the most convenient word in governance. A model is uncertain, so the system waits. A committee is uncertain, so the decision is deferred. A risk officer is uncertain, so the memo gets another paragraph of decorative caution and nobody quite owns the next step. Very mature. Very responsible. Also, sometimes, very useful for avoiding responsibility while looking intellectually respectable. ...

Stack Overflow for Ethics: Governing AI with Feedback, Not Faith

Dashboards are where good intentions go to look responsible. A company launches an AI triage assistant, lending model, recommender, or eligibility system. The governance slide deck is very respectable. Fairness is mentioned. Transparency is mentioned. Human oversight is mentioned, usually beside a tasteful icon of a person holding a clipboard. Everyone nods. Six months later, users have learned to rubber-stamp the recommendation, one subgroup’s error rate has drifted, appeals are piling up, and nobody can say whether the system is still operating inside the boundaries that were promised at launch. ...