Hex Marks the Spot: Terra Nova and the New Frontier of Agent Intelligence

Terra Nova shows why serious agent evaluation must test coupled strategy, uncertainty, cooperation, and long-horizon trade-offs rather than another tidy task list.

November 21, 2025 · 16 min · Zelina

Intent, Actually: Why DeFi Needs a Mind‑Reader

A mechanism-first reading of TIM, a multi-agent LLM framework that turns opaque DeFi transactions into evidence-ranked intent labels without pretending to read private motives.

November 21, 2025 · 16 min · Zelina

Peer Review in the Age of Agents: When Scientists Go Silicon

A field experiment in AI-authored and AI-reviewed science shows that research agents are useful only when wrapped in disclosure, verification, and human judgment.

November 21, 2025 · 16 min · Zelina

RL, Recall, and the Rise of Agentic Memory: What Memory-R1 Means for AI Systems

Memory-R1 shows why durable AI agents need learned memory operations, not just bigger context windows or more enthusiastic vector search.

November 21, 2025 · 15 min · Zelina

Tentacles of Thought: Why Six Is the New One in Multimodal AI

A mechanism-first reading of Octopus, a multimodal agent framework that treats reasoning as capability orchestration rather than a bigger-model contest.

November 21, 2025 · 13 min · Zelina

Compression, But Make It Pedagogical: Rate–Distortion KGs for Smarter AI Learning Assistants

A mechanism-first analysis of how rate–distortion theory and fused Gromov-Wasserstein alignment can make educational knowledge graphs more useful, not merely larger.

November 20, 2025 · 19 min · Zelina

Flip the Switch: How Heterogeneous Agents Learn to Restore the Grid

A mechanism-first look at how heterogeneous multi-agent reinforcement learning could turn distribution-grid restoration into faster, constraint-aware decision support.

November 20, 2025 · 15 min · Zelina

Prompted and Confused: When LLMs Forget the Assignment

A close reading of why LLM-generated optimisation models can look correct, compile occasionally, and still misunderstand the problem hiding in plain sight.

November 20, 2025 · 14 min · Zelina

Skills to Pay the Agent Bills: Why LLMs Need Better Moves, Not Bigger Models

SkillGen shows why the next gain in LLM agents may come from reusable procedural skills, not longer prompts or larger models.

November 20, 2025 · 18 min · Zelina

Thresholds, Trade-offs, and the Art of Not Overthinking Your Robot

How calibrated symbolic uncertainty helps robots decide when to act, when to look again, and when confidence becomes expensive.

November 20, 2025 · 14 min · Zelina