Decision Intelligence

Forecasting the Forecast: Why Agentic AI Is Learning to Doubt Itself

Opening — Why this matters now Everyone wants AI to predict the future. Markets want alpha. Governments want warning signals. Executives want next quarter to behave politely. Yet most AI forecasting systems still operate like overconfident interns: one quick answer, suspicious certainty, and little memory of how they got there. A recent paper, Agentic Forecasting using Sequential Bayesian Updating of Linguistic Beliefs, proposes something rarer: an AI forecaster that updates its mind step by step, tracks evidence, and occasionally admits uncertainty. Revolutionary behavior, frankly. ...

From Playbooks to Probabilities: When AI Starts Thinking Like a Football Manager

Opening — Why this matters now AI has spent the past decade predicting outcomes. Now it wants to simulate realities. That shift—from prediction to generation—is subtle but consequential. In markets, it means scenario analysis instead of point forecasts. In operations, it means stress-testing decisions rather than merely optimizing them. And, somewhat unexpectedly, one of the clearest demonstrations of this shift comes not from finance or logistics, but from football. ...

Punching Above Baselines: When Boxing Strategy Learns to Differentiate

Opening — Why this matters now Elite sport has quietly become an optimization problem. Marginal gains are no longer found in strength alone, but in decision quality under pressure. Boxing, despite its reputation for instinct and grit, has remained stubbornly analog in this regard. Coaches still scrub footage frame by frame, hunting for patterns that disappear as fast as they emerge. ...

From Field Notes to Farm Operating Intelligence

A high-value commercial farm redesigned daily crop, irrigation, pest, harvest, labor, and buyer-delivery coordination around a reviewed AI operations brief instead of fragmented messages and manager memory.

Skip or Split? How LLMs Can Make Old-School Planners Run Circles Around Complexity

TL;DR Classical planners crack under scale. You can rescue them with LLMs in two ways: (1) Inspire the next action, or (2) Predict an intermediate state and split the search. On diverse benchmarks (Blocks, Logistics, Depot, Mystery), the Predict route generally solves more cases with fewer LLM calls, except when domain semantics are opaque. For enterprise automation, this points to a practical recipe: decompose → predict key waypoints → verify with a trusted solver—and only fall back to “inspire” when your domain model is thin. ...

Meta-Game Theory: What a Pokémon League Taught Us About LLM Strategy

When language models battle, their strategies talk back. In a controlled Pokémon tournament, eight LLMs drafted teams, chose moves, and logged natural‑language rationales every turn. Beyond win–loss records, those explanations exposed how models reason about uncertainty, risk, and resource management—exactly the traits we want in enterprise decision agents. Why Pokémon is a serious benchmark (yes, really) Pokémon delivers the trifecta we rarely get in classic AI games: Structured complexity: 18 interacting types, clear multipliers, and crisp rules. Uncertainty that matters: imperfect information, status effects, and accuracy trade‑offs. Resource management: limited switches, finite HP, role specialization. Crucially, the action space is compact enough for language-first agents to reason step‑by‑step without search trees—so we can see the strategy, not just the score. ...