Cover image

Forecasting the Forecast: Why Agentic AI Is Learning to Doubt Itself

Forecasting is where executive optimism goes to be measured. A sales team says the pipeline is healthy. A policy team says the election risk is manageable. A trading desk says the market has mostly priced in the event. Everyone has a probability. Few people have a disciplined process for updating it. That is also the problem with many AI forecasters. They can produce a number quickly, sometimes impressively, sometimes with the emotional stability of a quarterly sales forecast. But the harder question is not whether an AI can answer, “What is the probability?” The harder question is whether it can revise that probability as evidence arrives, remember why it changed its mind, and avoid turning a confidence score into decorative typography. ...

April 23, 2026 · 18 min · Zelina
Cover image

From Playbooks to Probabilities: When AI Starts Thinking Like a Football Manager

Football is usually explained after the fact. A team “pressed high.” A winger “found space.” A midfield line “lost compactness.” These statements may be accurate, but they arrive with the comforting uselessness of a weather report read after the picnic. The real managerial question is not merely what happened. It is what could have happened if the opponent shifted earlier, if the team protected the half-space, if the attacking line stretched the back four, or if the next pass invited three different futures instead of one. ...

April 14, 2026 · 17 min · Zelina
Cover image

The Sandbox Economy: When LLMs Stop Talking and Start Shopping

Discount. It is a small word, but in retail it is not decorative. It changes what people buy, how much they buy, whether they switch brands, whether they stockpile, whether distributors clear inventory, and whether a manager later pretends the promotion was “strategic” rather than simply expensive. This is where many LLM-agent demos become fragile. They can describe a discount. They can explain why a rational consumer might respond to it. They can even role-play a price-sensitive shopper with theatrical enthusiasm. But describing incentive response is not the same as simulating it. A consumer simulator that treats price as one more piece of text is not an economic simulator. It is a chatbot wearing a shopping cart. ...

March 19, 2026 · 18 min · Zelina
Cover image

Greedy, but Not Blind: Teaching Optimization to Listen

Budget meetings have a familiar rhythm. Someone brings the spreadsheet. Someone brings the map. Someone else brings the sentence that ruins the spreadsheet: “This district looks inefficient on paper, but the roads are worse than the data says.” Classical optimization knows what to do with numbers. It does not naturally know what to do with that sentence. In public health planning, infrastructure rollout, retail site selection, and ESG investment, those sentences are often where the real institutional knowledge lives. Unfortunately, once the sentence enters the room, the algorithm usually leaves through the back door. Or worse, the organization pretends the sentence has been “encoded” into a weight, because apparently all human judgment becomes rigorous once it is multiplied by 0.37. ...

January 19, 2026 · 14 min · Zelina
Cover image

Punching Above Baselines: When Boxing Strategy Learns to Differentiate

Li Qian is the useful part of the paper, not the medal count Boxing is a simple sport only if you watch it from far enough away. Two athletes enter a ring. One wins. The spectators remember the clean punch, the late-round pressure, the judge’s card, maybe the celebration. Coaches remember something less theatrical: distance, lead-hand rhythm, counter timing, target selection, whether a hook was thrown from the wrong range, whether the opponent’s aggression was actually a trap. ...

January 19, 2026 · 18 min · Zelina
Cover image

Storm-Chasing Agents: How EWE Turns Extreme Weather into Actionable Intelligence

Storms are easy to see after they arrive. The harder question is what actually made them happen. That distinction sounds academic until money enters the room. An insurer wants to know whether an event belongs to a changing regional risk pattern. A grid operator wants to understand whether a heatwave was driven by persistent blocking, moisture transport, or local feedback. A government agency wants a report fast enough to support preparedness, not just a polished explanation three months later. The weather event is visible. The mechanism is expensive. ...

November 28, 2025 · 14 min · Zelina
Cover image

Futures, Not Forecasts: How AI Redraws the Boundaries of Foresight

Forecasts are comforting because they pretend the future has already filed its paperwork. A number arrives. A probability. A trend line. A neat dashboard arrow pointing upward, downward, or toward whichever strategic conclusion the executive team secretly preferred anyway. This is why forecasting tools sell so well: they reduce uncertainty into something that looks like management. ...

November 27, 2025 · 14 min · Zelina
Cover image

Bandits, Budgets, and the Art of Waiting: How Delay-Aware Algorithms Rewire Resource Allocation

Budgets arrive before outcomes. That is the small administrative tragedy behind many allocation systems. A university decides which students receive financial aid before it knows who will persist. A workforce programme assigns training slots before employment outcomes appear. A healthcare provider prioritises interventions before the full treatment effect is visible. The decision is immediate; the evidence drips in later, usually after the next decision has already been made. Naturally, many algorithms pretend this is not happening. Very elegant. Also very wrong. ...

November 14, 2025 · 15 min · Zelina

From Field Notes to Farm Operating Intelligence

A high-value commercial farm redesigned daily crop, irrigation, pest, harvest, labor, and buyer-delivery coordination around a reviewed AI operations brief instead of fragmented messages and manager memory.

October 30, 2025 · 8 min · Vox
Cover image

Meta-Game Theory: What a Pokémon League Taught Us About LLM Strategy

TL;DR for operators A Pokémon tournament sounds unserious until you notice what it does better than many enterprise AI pilots: it forces models to make constrained, sequential, adversarial decisions, then records not only what they did but why they said they did it. The paper behind this article introduces LLM Pokémon League, a benchmark where eight models from the GPT, Claude, and Gemini families act as Pokémon trainers. Each model selects a six-member team, then makes turn-by-turn battle decisions in a zero-shot setting. The framework captures team-building rationales, move choices, switching decisions, and explanations throughout the tournament.1 ...

August 9, 2025 · 17 min · Zelina