Cover image

Punching Above Baselines: When Boxing Strategy Learns to Differentiate

Li Qian is the useful part of the paper, not the medal count Boxing is a simple sport only if you watch it from far enough away. Two athletes enter a ring. One wins. The spectators remember the clean punch, the late-round pressure, the judge’s card, maybe the celebration. Coaches remember something less theatrical: distance, lead-hand rhythm, counter timing, target selection, whether a hook was thrown from the wrong range, whether the opponent’s aggression was actually a trap. ...

January 19, 2026 · 18 min · Zelina
Cover image

Pressing by Cosine, Defending by Distance: When Football Learns Semantics

Halftime is where many analytics dashboards become strangely shy. They can tell a coach which zones were overloaded, how many high-intensity runs dropped after minute 30, how pressing recoveries changed, and whether expected goals has decided to be emotionally cruel today. But when the actual question arrives—what should we do now?—the answer often slides back into expert intuition. ...

January 5, 2026 · 16 min · Zelina
Cover image

When Models Forget on Purpose: Why Data Selection Matters More Than Data Volume

Training data has become the AI industry’s favorite comfort blanket. When performance stalls, add more tokens. When a benchmark looks stubborn, add more tokens. When the model behaves badly, add more tokens and call it a roadmap. This worked well enough to become a reflex. Unfortunately, reflexes are not strategies. The uncomfortable question is no longer whether data matters. Of course it matters. The better question is whether every token deserves the same vote during training. ...

December 31, 2025 · 17 min · Zelina
Cover image

LLMs, Gotta Think ’Em All: When Pokémon Battles Become a Serious AI Benchmark

Game AI usually has a familiar job: lose convincingly. Not too quickly, because that feels insulting. Not too brutally, because that feels like homework wearing a boss battle costume. Good game AI sits in the narrow emotional band between “I can beat this” and “I need to think.” The old solution was scripted behavior, heuristics, difficulty sliders, or reinforcement learning trained until the agent stopped embarrassing itself. The newer temptation is simpler: give the game state to an LLM and ask it to play. ...

December 22, 2025 · 14 min · Zelina
Cover image

Reasoning Loops, Not Bigger Brains

Reasoning Loops, Not Bigger Brains Scale is the easiest story in AI because everyone understands the shopping logic: buy more compute, add more parameters, train on more data, and watch the benchmark line move upward. It is also the story vendors enjoy telling, because nobody ever got fired for recommending a larger invoice. ...

December 17, 2025 · 14 min · Zelina
Cover image

Agents Behaving Badly: Why 'Agentic AI' Needs Adult Supervision

A travel agent that books a bad flight is annoying. A travel agent that books the wrong flight, triggers a hotel agent to change the reservation, alerts a finance agent to approve reimbursement, and then lets a calendar agent reschedule meetings around the mistake is no longer annoying. It is an organizational incident with a charming user interface. ...

November 24, 2025 · 20 min · Zelina
Cover image

Mind the Model: When Generative AI Teaches Neuroscience New Tricks

Mind the Model: When Generative AI Teaches Neuroscience New Tricks A model is not a mind. This should not need saying, but then again, neither should “do not use benchmark scores as a personality test,” and here we are. The more useful point is subtler. Modern generative AI does not matter to neuroscience because transformers are secretly brains in a hoodie. It matters because machine learning has turned several once-vague ideas about cognition into working engineering mechanisms. Not perfect mechanisms. Not biological mechanisms by default. But mechanisms clear enough to test, stress, reject, adapt, or steal with appropriate academic manners. ...

November 23, 2025 · 16 min · Zelina
Cover image

Strategy as a Service: When AI Learns How to Think

Every enterprise AI team eventually meets the same annoying bill: the agent that thinks too much. It calls tools when a direct answer would do. It loops through evaluator prompts for tasks that need one clean instruction. It drags a code interpreter into a problem that is mostly reading comprehension. Then, after all that expensive theatre, it may still be wrong. Very impressive. Very modern. Very invoicable. ...

November 17, 2025 · 14 min · Zelina
Cover image

Graph Minds, Game Moves: How Multi‑Agent Learning Is Quietly Redrawing AI Strategy

A traffic light is not just a traffic light once the other lights start learning. That is the uncomfortable starting point for strategic AI systems. A single model can optimise a route, price, recommendation, allocation, or control policy. But the moment other decision-makers are learning at the same time, the environment stops behaving like scenery. It becomes a cast. Each actor updates, reacts, misreads, cooperates, defects, imitates, or quietly ruins the assumptions in your simulator. Very rude, but entirely realistic. ...

November 14, 2025 · 16 min · Zelina
Cover image

Bottleneck or Breakout? Modeling the Compute Barrier to AI's Intelligence Explosion

TL;DR for operators The practical question is not whether AI will “think itself into godhood by Tuesday”. Charming as that spreadsheet would be, this paper is doing something narrower and more useful. Whitfill and Wu ask whether a software-only intelligence explosion can survive a compute bottleneck: if AI systems become good enough to replace human AI researchers, can that extra cognitive labour keep improving AI without a matching increase in research compute?1 ...

August 3, 2025 · 16 min · Zelina