Causal Reasoning

Synthetic and Sensibility: Why More Data Needs a Control Stack

Synthetic and Sensibility: Why More Data Needs a Control Stack Synthetic data has become the convenient answer to almost every uncomfortable AI training question. Need more reasoning traces? Generate them. Need domain examples? Generate them. Need privacy-preserving replacements for customer data? Generate them. Need a dataset that looks suspiciously like a benchmark but not too suspiciously like a benchmark? Generate it, then call it “curriculum design.” ...

Flip the Script: When Causality Breaks the LLM Illusion

A fire alarm can cause people to evacuate. It can cause a building to enter alert mode. It can trigger emergency procedures, bring firefighters, and make everyone suddenly remember where the stairs are. But does a fire alarm cause a fire? Obviously not. At least, obviously not to a human who understands the causal structure. The alarm is usually an effect or signal of fire risk, not the origin of the fire itself. A model trained on enough sentences of the form “fire alarm causes…” may not be so careful. It may see the familiar phrase pattern, complete the familiar answer, and walk directly into the wrong conclusion with excellent grammar. ...

When Temperature Rises, Who’s to Blame? — Causation in Hybrid Worlds

Temperature is a patient witness. A valve ruptures. A cooling system fails. A technician records a radiation reading. Minutes later, the core temperature crosses a danger threshold. The incident report now asks the question every system audit eventually asks, usually after everyone has already chosen a favorite suspect: Who caused the temperature rise? ...

RAudit: When Models Think Too Much and Still Get It Wrong

The model is not always confused. Sometimes it has already done the work, reached the right answer, and then politely walks away from it because the user sounded confident. That is the quietly irritating problem behind RAudit, a paper that studies how large language models behave when their reasoning is audited without giving the auditor the correct answer.1 The paper is not just another “LLMs can be sycophantic” warning. We have enough of those. At this point, saying models flatter users is like saying spreadsheets contain hidden errors. True, useful, and somehow still not enough to change deployment practice. ...

Agents Without Time: When Reinforcement Learning Meets Higher-Order Causality

Handoffs Are Where Fixed Time Sneaks Into Agent Design Handoffs look harmless. One agent collects evidence, another checks it, a third decides, and a fourth sends the answer to a customer, robot, trader, or dashboard. The workflow diagram has arrows. The arrows have a direction. Someone decided which component acts first. Usually that decision is treated as engineering housekeeping. In Matt Wilson’s paper, it becomes the point of the story.1 ...

Breaking the Tempo: How TempoBench Reframes AI’s Struggle with Time and Causality

A failed deployment usually produces two questions. The first is easy enough to ask: what happened? The second is where the room goes quiet: what actually caused it? Most AI systems are now quite comfortable with the first question. Give them logs, traces, workflows, tool calls, or transition histories, and they can often produce a plausible reconstruction. They can narrate the incident in confident sequence. They can point to every condition that was present. They can provide a tidy post-mortem, ideally before the humans have finished opening the dashboard. ...

Chains of Causality, Not Just Thought

TL;DR for operators Causal Influence Prompting, or CIP, is a safety method for LLM agents that asks the model to build and consult a causal influence diagram before acting. Instead of telling the agent, “be safe,” it asks the agent to represent the task as a graph: what facts matter, what choices are available, what outcomes are useful, and what outcomes are harmful. This is a better shape for the problem, because agents do not merely answer questions. They click buttons, run code, forward messages, use tools, and occasionally behave as if “sure, why not?” were a compliance framework. ...

Bias Busters: Teaching Language Agents to Think Like Scientists

TL;DR for operators Language-model agents do not merely make wrong causal guesses. In this paper, they gather evidence in a biased way, then interpret that evidence through the same bias. That is the uncomfortable part. The study turns the classic Blicket Test from developmental psychology into a text-based active exploration game for LM agents. The agent must test objects, observe whether a machine turns on, then infer which objects are “Blickets” and whether the hidden rule is disjunctive — any Blicket activates the machine — or conjunctive — all relevant Blickets must be present together.1 ...