AI Governance

Prompt Wars: When Pedagogy Beats Cleverness

A prompt review meeting usually sounds more scientific than it is. One person likes the “coach” version. Another prefers the “Socratic” version because it sounds more educational. Someone says the prompt should mention metacognition. Someone else adds “be concise,” because apparently every prompt eventually becomes a corporate email with anxiety issues. Then the team ships the one that feels best. ...

Auditing the Illusion of Forgetting: When Unlearning Isn’t Enough

Deletion requests sound simple until the model answers politely. A user asks for data to be removed. A publisher demands that copyrighted passages stop being reproduced. A compliance team wants evidence that a fine-tuned model no longer carries traces of a forbidden dataset. The model is run through an unlearning method, the surface tests improve, the dashboard turns less red, and everyone enjoys the brief spiritual comfort of a green checkmark. ...

Pay to Think: Incentive Design Is the Hidden Variable in Human–AI Research

Payment sounds like the boring part of a user study. Recruit participants. Estimate task time. Set a base rate. Add a small bonus if the budget allows. Put the number in the methods section, preferably somewhere readers can skim past with dignity. Then move on to the interesting material: trust, reliance, explanations, fairness, error rates, cognitive load, and all the other variables that make human–AI decision-making sound like a serious field rather than a procurement spreadsheet. ...

Rebuttal Agents, Not Rebuttal Text: Why ‘Verify‑Then‑Write’ Is the Only Scalable Future

Rebuttal is where polite language goes to be cross-examined. A reviewer asks why the baseline is missing. Another says the theory is unclear. A third implies that the claimed novelty is, shall we say, generously interpreted. The authors have a few days to respond, and every sentence must do three jobs at once: answer the concern, avoid overclaiming, and preserve the paper’s strategic position. ...

When Benchmarks Break: Why Bigger Models Keep Winning (and What That Costs You)

Budget. That is where the benchmark story usually becomes less elegant. A vendor shows a model card with better reasoning scores, stronger multi-task accuracy, and a leaderboard position polished to a mirror finish. Then someone in operations asks the rude question: what does this improvement cost per customer case, per analyst hour, per compliance review, or per failed escalation? ...

Who’s Really in Charge? Epistemic Control After the Age of the Black Box

Control is a comforting word. It suggests a hand on the wheel, a dashboard of indicators, and a human being somewhere nearby who can still say no. Machine learning makes that picture look increasingly theatrical. In AI-assisted science, researchers often do not know exactly which internal representations a model has learned, why a high-dimensional classifier separates one tumor subtype from another, or whether a model’s “useful pattern” corresponds to anything a scientist would recognize as a meaningful mechanism. The black box does not merely sit inside the laboratory. It starts to participate in deciding what the laboratory can see. ...

Think-with-Me: When LLMs Learn to Stop Thinking

A model can be wrong because it did not think enough. That part is easy to understand. The more annoying failure is when the model already had the answer, kept going, second-guessed itself into a ditch, and then presented the ditch with confidence. This is the special comedy of large reasoning models: sometimes the expensive part is not the intelligence, but the hesitation after the intelligence has already done its job. ...

Recommendations With Receipts: When LLMs Have to Prove They Behaved

A recommendation list is rarely just a list. On the surface, it says: “Here are ten movies, products, articles, songs, creators, or courses you may like.” Underneath, it often carries a second instruction: “Also do not bury long-tail items, do not over-concentrate exposure, do not violate diversity rules, do not create an audit nightmare, and please do all of this while still looking personalized.” ...

Survival by Swiss Cheese: Why AI Doom Is a Layered Failure, Not a Single Bet

Risk committees love a single number. Give them a probability, a red-yellow-green dashboard, perhaps a polite heatmap, and everyone can pretend the future has agreed to become a spreadsheet. The trouble with AI existential risk is that the interesting question is not simply whether one dramatic doom story is persuasive. The more useful question is uglier: if humanity survives advanced AI, which layer saved us? ...

Bubble Trouble: Why Top‑K Retrieval Keeps Letting LLMs Down

The problem is not finding documents. It is spending the prompt budget badly. Ask an enterprise RAG system for “scope of work,” and the system may look confident for exactly the wrong reason. The query sounds simple. Somewhere in the document set, there is probably a sheet, paragraph, or clause literally called “Scope of Works.” A flat top-k retriever will happily grab the highest-scoring chunks from that section, stack them into the model context, and call the job done. Very tidy. Very wrong. ...