Cover image

Karma, But Make It Causal: Why Simulation Is Finally Growing Up

A hospital monitor, a factory sensor array, and a trading dashboard have a shared irritation: they all produce time-series data that everyone wants to model, almost nobody wants to share, and absolutely nobody fully understands from correlations alone. That is the practical problem behind KarmaTS, a proposed interactive framework for constructing executable, lag-indexed causal simulations for multivariate time series.1 The paper is not trying to sell another magical causal-discovery algorithm. Good. We have enough of those wandering around with heroic acronyms and very delicate assumptions. ...

November 17, 2025 · 14 min · Zelina
Cover image

Play by Automata: How Regular Games Rewrites the Rules of General Game Playing

A game engine is usually where rules go to become software. Someone writes the rules, someone else encodes the rules, and an AI agent then spends its expensive little life asking the engine what moves are legal, what happens next, and whether it has already lost. Very glamorous. Very repetitive. General Game Playing tries to remove the hand-built engine from that loop. Instead of building a custom simulator for chess, backgammon, Amazons, Reversi, or some procedural oddity invented on a tired Wednesday afternoon, a game is described in a formal language and a generic system turns that description into something agents can use. ...

November 14, 2025 · 15 min · Zelina
Cover image

When Algorithms Command: AI's Quiet Revolution in Battlefield Strategy

When Algorithms Command: AI’s Quiet Revolution in Battlefield Strategy Dispatch is rarely elegant. A road closes, a shipment misses its window, a critical machine fails, a storm changes direction, and suddenly the tidy plan becomes a historical artefact. The manager, commander, operator, or incident lead is not looking for a philosophical meditation on uncertainty. They need options, fast, preferably before the situation develops a personality. ...

November 10, 2025 · 16 min · Zelina
Cover image

When the Sandbox Thinks Back: Training AI Agents in Simulated Realities

Workflow software has a deeply unglamorous problem: reality keeps changing. A customer support agent may know the refund policy, but then the customer changes their address, the order record has a missing field, the tool returns a cryptic error, and the next API call requires a schema nobody mentioned in the demo. A spreadsheet agent may know how to summarise a table, but the file path is wrong, the calendar has a conflicting event, and the “obvious” action fails because the world, in its charmingly vindictive way, is not a benchmark prompt. ...

November 6, 2025 · 18 min · Zelina
Cover image

Blueprints of Agency: Compositional Machines and the New Architecture of Intelligence

A prototype begins innocently enough: a product team wants a small machine, a vehicle, a tool, a fixture, perhaps a mechanism that throws something across a room because medieval engineering apparently never left the group chat. The modern AI pitch says the agent can design it. Give it parts, constraints, and a goal; let it reason; let it test; let it improve. ...

October 23, 2025 · 14 min · Zelina
Cover image

Pods over Prompts: Shachi’s Playbook for Serious Agent-Based Simulation

A boardroom simulation is only useful if you know what was being simulated. That sounds obvious. It is also where many AI-agent demos quietly fall apart. Give one hundred language-model agents a set of personas, drop them into a toy market, forum, election, auction, or customer-support queue, and the result will usually look interesting. Someone panics. Someone coordinates. Someone overpays. Someone posts something faintly unhinged. Excellent. We have recreated the internet. ...

October 3, 2025 · 18 min · Zelina
Cover image

Consent, Coaxing, and Countermoves: Simulating Privacy Attacks on LLM Agents

TL;DR for operators Email is still where good security intentions go to become embarrassing screenshots. The paper behind this article, Searching for Privacy Risks in LLM Agents via Simulation, studies a future that is no longer especially futuristic: one AI agent has access to sensitive information, another agent wants it, and the two can talk through ordinary applications such as email, Messenger, Facebook, or Notion.1 The question is not whether the model knows a privacy rule in the abstract. The question is whether an agent, while trying to be helpful in a live interaction, can refuse the wrong request at the right moment. ...

August 18, 2025 · 20 min · Zelina
Cover image

From Sobol to Sinkhorn: A Transport Revolution in Sensitivity Analysis

TL;DR for operators Models rarely fail because nobody ran a sensitivity analysis. They fail because the sensitivity analysis answered the convenient question instead of the relevant one. The paper behind gsaot introduces an R package for Optimal Transport-based global sensitivity analysis.1 Its practical value is not that it makes Sobol’ indices obsolete. It does not. The useful shift is narrower and more interesting: gsaot estimates how much the entire output distribution changes when an input is known, rather than asking only how much of the output variance can be attributed to that input. ...

July 27, 2025 · 17 min · Zelina
Cover image

Agents of Disruption: How LLMs Became Adversarial Testers for Autonomous Driving

TL;DR for operators AGENTS-LLM is not another attempt to make a language model dream up an entire traffic world and then hope the simulator forgives the hallucination. It does something narrower and more operationally useful: it takes an existing real-world driving scenario, accepts a natural-language instruction such as adding a parked vehicle, jaywalker, accident site, or construction zone, and produces an augmented scenario that can be executed in closed-loop autonomous-driving simulation.1 ...

July 21, 2025 · 17 min · Zelina