Unsolvable by Design: Turning AI Plans Into Security Guarantees

A mechanism-first reading of planning task shielding: how AI planning can be used to make dangerous states unreachable, where the guarantee holds, and where the computation breaks.

April 9, 2026 · 16 min · Zelina

When Feelings Negotiate: Why Emotion Might Be the Missing Layer in AI Agents

A mechanism-first reading of EmoMAS and what strategic emotional orchestration means for business-facing AI agents.

April 9, 2026 · 18 min · Zelina

Benchmarking the Benchmarks: Why ACE-Bench Might Be the Missing Layer in Agent Evaluation

A mechanism-first reading of AgentCE-Bench, showing why controllable agent evaluation may be more useful than another realism-heavy leaderboard.

April 8, 2026 · 14 min · Zelina

Blinded by Design: When AI Stops Thinking and Starts Remembering

A practical reading of epistemic blinding: an inference-time audit protocol for separating LLM reasoning from memorized entity priors in business-critical ranking workflows.

April 8, 2026 · 19 min · Zelina

Claw-Eval — When Agents Game the System, the System Needs Claws

Claw-Eval shows why serious AI-agent evaluation must audit behavior, stress-test recovery, and separate lucky success from deployable reliability.

April 8, 2026 · 16 min · Zelina

From Spreadsheets to Swarms: How Agentic AI Rewrites the Retail Supply Chain

A mechanism-first reading of Flowr, an agentic AI framework that turns supermarket replenishment from manual coordination into supervised workflow automation.

April 8, 2026 · 18 min · Zelina

Skill Issue or System Design? How LLMs Actually Follow Instructions

A practical reading of why LLM instruction-following looks less like one universal compliance switch and more like coordination among task-specific skills.

April 8, 2026 · 18 min · Zelina

When Data Decides What Matters: The Quiet Economics of LLM Data Selection

A clearer look at why dynamic data weighting may matter less as a magic shortcut than as a new control layer for LLM training economics.

April 8, 2026 · 15 min · Zelina

Memory That Actually Remembers: Why MemMachine Signals a Shift in AI Agent Architecture

MemMachine shows why useful AI-agent memory is less about compressing chat history and more about preserving auditable episodes, retrieving them well, and knowing when retrieval should become a reasoning process.

April 7, 2026 · 18 min · Zelina

Protocol Over Prompts: Why ANX Rewrites the Rules of AI Agent Interaction

ANX shows why enterprise agents may need protocol-level interaction design more than larger prompts, richer tool schemas, or screen-mimicking automation.

April 7, 2026 · 18 min · Zelina