AI Governance

When Privacy Meets Chaos: Making Federated Learning Behave

Privacy is easy to admire in a slide deck. It becomes less elegant when the model begins to behave like a shopping cart with one broken wheel. Federated learning promises a clean bargain: data stay local, clients collaborate, and the central model improves without seeing everyone’s raw records. Add differential privacy, and the promise becomes more formal. Each client update is clipped, noise is injected, and individual influence is bounded. Everyone nods. The architecture looks responsible. ...

First Proofs, No Training Wheels

Proof is where AI systems stop performing confidence and start owing the reader money. A model can restate a theorem elegantly. It can cite the right neighborhood of literature. It can produce LaTeX with the visual manners of a publishable paper. None of that is a proof. It is proof-shaped material. Sometimes useful. Sometimes impressive. Sometimes a very expensive fog machine. ...

Hallucination-Resistant Security Planning: When LLMs Learn to Say No

Security teams do not need an AI that sounds decisive. They already have enough decisive systems. Some of them are called “legacy tools.” Some are called “urgent executive dashboards.” A few are called “we should probably reboot it.” What security operations need is more uncomfortable: an AI system that can propose useful response actions, explain why they might work, and then refuse to act when its own reasoning becomes unstable. That refusal matters. In an incident-response workflow, a hallucinated recommendation is not merely a bad paragraph. It can isolate the wrong host, patch a vulnerability that does not exist, wipe evidence too early, or generate a playbook that looks official while quietly wasting the first thirty minutes of response time. ...

When One Heatmap Isn’t Enough: Layered XAI for Brain Tumour Detection

Diagnosis has a simple business problem hiding inside a clinical one: nobody wants a black box that is confident for the wrong reason. That is especially true in medical imaging. A brain MRI classifier that says “tumour” or “non-tumour” is not automatically useful because it crosses a respectable accuracy threshold. The difficult question comes next: did the model look at the clinically relevant region, or did it discover some convenient artefact in the image pipeline? A single heatmap may answer that question. It may also merely look persuasive, which is not quite the same thing. Medicine, regrettably, is one of those domains where aesthetic confidence is still not a validation method. ...

When RAG Needs Provenance, Not Just Recall: Traceable Answers Across Fragmented Knowledge

RAG has a public-relations problem. It promises grounded answers, then quietly assumes that “grounded” means “retrieved from somewhere nearby.” That assumption is convenient. It is also the kind of convenience that creates compliance incidents, medical confusion, and internal knowledge assistants that cite the wrong document with absolute confidence. A retrieval-augmented system can answer from evidence and still choose the wrong evidence. It can cite something real and still fail provenance. ...

AgenticPay: When LLMs Start Haggling for a Living

Procurement looks boring until the software starts spending money. A human buyer can be slow, inconsistent, and occasionally allergic to spreadsheets. But at least we know what failure looks like: overpaying, accepting bad terms, walking away too late, or trusting the wrong supplier. When the buyer is an LLM agent, the failure mode becomes more polished. It can overpay in fluent English. It can miss a deal while sounding reasonable. It can keep bargaining after the answer is already visible. Progress, apparently, now comes with better punctuation. ...

Simulate This: When LLMs Stop Talking and Start Modeling

A simulation model is not a chatbot with a spreadsheet attached. That sounds obvious until a project team starts treating the LLM as if it were the entire modeling stack: the analyst, the programmer, the validator, the documentation clerk, the statistical package, and occasionally the intern blamed when the result changes on Tuesday. The convenient story is that better prompting will tame the system. Add more examples. Add a RAG. Set temperature to zero. Smile at the demo. ...

When VR Shooters Meet Discrete Events: Training Security Policies Without Endless Human Trials

Training a security policy sounds simple until the training data involves people role-playing traumatic emergencies inside a virtual school. That is the uncomfortable starting point of this paper. Virtual reality can help researchers study rare and dangerous events under controlled conditions, but it does not solve the scaling problem. Every new intervention, policy variation, or robot behavior still needs another human-subject experiment. That is slow, expensive, ethically constrained, and not exactly a cheerful afternoon in the lab. ...

Attention with Doubt: Teaching Transformers When Not to Trust Themselves

Confidence is cheap. A classifier can always give you a probability. The awkward question is whether that probability deserves to be believed. This is not a philosophical problem when the model is recommending a movie. It becomes expensive when the model is screening documents, triaging support tickets, flagging fraud, routing legal clauses, or deciding whether a case should be escalated to a human. In those settings, “92% confident” is not decoration. It is an operating instruction. ...

FIRE-BENCH: Playing Back the Tape of Scientific Discovery

A demo can make an AI research agent look impressive in ten minutes. Give it a task, watch it create files, install packages, run experiments, generate tables, and write something that sounds like a conclusion. Productivity theater, now with terminal logs. The harder question is less cinematic: did it actually discover the right thing? ...