Cognaptus Insights

Mind the Gap: Why AI Still Struggles to Build Common Ground

A case-first reading of DPIP, a multimodal benchmark showing why AI agents still confuse visible task progress with genuinely shared belief.

Reading Between the Lines: How AI Learned to Interpret the Law

A timeline-style reading of how AI moved from encoding legal interpretations, to modeling interpretive disputes, to generating legal arguments that still need human judgment.

The Judge Is Not Always Right: Stress‑Testing LLM Judges

A mechanism-first reading of Judge Reliability Harness and why LLM judges need reliability audits before they become business-critical evaluators.

When Tokens Explode: The Hidden Geometry Behind Attention Sinks

A mechanism-first reading of how massive activations, normalization, and attention-sink geometry interact inside modern Transformer language models.

Bending the Beam, Not the Brain: What RL with Perfect Rewards Still Can’t Teach LLMs

BeamPERL shows that exact physics rewards can specialize compact LLMs, but they do not automatically produce transferable scientific reasoning.

Double Helix, Double Checks: Why Agentic AI Needs Governance Before It Writes Your Code

A WebGIS case study shows why reliable agentic AI depends less on bigger prompts and more on persistent memory, enforceable rules, and auditable workflow structure.

$Cover image$

From Prompt Chains to Algebra: Why Agentics 2.0 Treats AI Workflows Like Math

Agentics 2.0 argues that reliable enterprise AI workflows need typed, composable, evidence-preserving transformations—not just better prompts or louder agents.

Memory Isn’t Personal: Why LLMs Still Forget What You Like

RealPref shows why longer chat history alone does not make an AI assistant genuinely personal, and what businesses should build instead.

Small Model, Big Eyes: Why Microsoft’s Phi‑4 Vision Model Is a Warning Shot to Giant Multimodal AI

A mechanism-first reading of Microsoft’s Phi-4-reasoning-vision-15B report, and why smaller multimodal models may win practical AI deployments through sharper perception, cleaner data, and selective reasoning.

The Ambiguity Advantage: When AI Becomes Your Most Honest (and Sometimes Too Polite) Manager

A mechanism-first reading of how managerial ambiguity makes LLM advice look useful before it is actually grounded.