Cover image

Themis Knows Best: When AI Judges Start Training Other AI

OS-Themis shows that the hard part of training GUI agents is not merely choosing a stronger judge, but building an evidence pipeline that knows which UI steps actually deserve reward.

March 20, 2026 · 20 min · Zelina
Cover image

When EEG Stops Thinking in Squares: Why Linear-Time Models Are Quietly Winning

LuMamba shows how topology-invariant EEG modeling, linear-time Mamba blocks, and a mixed LeJEPA reconstruction objective may make biosignal foundation models more deployable across messy real-world electrode layouts.

March 20, 2026 · 16 min · Zelina
Cover image

Context Rot & The Memory Illusion: Why Bigger Prompts Won’t Save Your AI

A comparison-based reading of Knowledge Objects: why durable AI memory needs structured storage, not just larger prompts or prettier summaries.

March 19, 2026 · 15 min · Zelina
Cover image

From Memory to Machinery: Why AI Agents Are Learning to Write Themselves

AgentFactory shows why the next useful step in AI agents may be less about remembering better and more about preserving executable work as reusable, auditable capability.

March 19, 2026 · 16 min · Zelina
Cover image

Learning Less, Winning More: The Curious Case of Sensi’s Efficiently Wrong Intelligence

Sensi shows why fast agent learning is not enough when perception errors can become verified facts.

March 19, 2026 · 17 min · Zelina
Cover image

The Memory Gap Nobody Budgeted For: Why Your AI Agents Keep Forgetting Each Other

A business reading of Governed Memory, showing why multi-agent AI needs shared memory, policy routing, schema feedback, and entity isolation—not just another RAG store.

March 19, 2026 · 20 min · Zelina
Cover image

The Sandbox Economy: When LLMs Stop Talking and Start Shopping

MALLES shows why useful AI economic agents need transaction alignment, numerical sensitivity, and population calibration—not just better role-play prompts.

March 19, 2026 · 18 min · Zelina
Cover image

When Memory Lies and Rules Save It: Rethinking LLM Agents in Closed Worlds

A mechanism-first reading of RPMS, showing why reliable LLM agents need executable rules, state-aware memory, and conflict arbitration—not larger memory alone.

March 19, 2026 · 18 min · Zelina
Cover image

Beyond Accuracy: When Forecasts Meet Cash Flow

Why demand forecasts should be evaluated by the inventory decisions they trigger, not only by the errors they minimize.

March 18, 2026 · 12 min · Zelina
Cover image

Cultural Alignment: When Prompts Stop Being Instructions and Start Being Policy

A business-focused reading of why cultural alignment in LLM systems should be measured, compared, and optimized rather than handled as a one-line localization prompt.

March 18, 2026 · 17 min · Zelina