Persona Non Grata: When LLMs Forget They're AI

A behavioral audit shows why professional personas can suppress AI self-disclosure, why bigger models do not solve it, and how enterprises should test trust before deploying expert-like agents.

November 27, 2025 · 13 min · Zelina

Seeing Is Believing—Planning Is Not: What SpatialBench Reveals About MLLMs

SpatialBench shows why multimodal models that recognize scenes can still fail at the harder work of spatial abstraction, causality, and planning.

November 27, 2025 · 15 min · Zelina

Tile by Tile: Why LLMs Still Can't Plan Their Way Out of a 3×3 Box

A close reading of an 8-puzzle study showing why fluent reasoning traces are still a poor substitute for explicit state management, validators, and real planning systems.

November 27, 2025 · 15 min · Zelina

Fragments, Feedback, and Fast Drugs: When Generative Models Grow a Spine

How FRAGMENTA reframes small-data drug lead optimization as a feedback-loop problem, not merely a bigger-model problem.

November 26, 2025 · 15 min · Zelina

Maps, Models, and Mobility: GPT Goes for a Walk

A mechanism-first reading of how GPT-style Transformers can be adapted from text tokens to continuous mobility trajectories without pretending the tutorial is a benchmark race.

November 26, 2025 · 17 min · Zelina

Pills, Protocols, and Parameters: When LLMs Sit the Pharmacist Exam

A Chinese pharmacist licensure benchmark shows why LLM deployment in professional education should be mapped by task category, not model leaderboard score.

November 26, 2025 · 15 min · Zelina

Reasoning in Stereo: Why Vision-Language Models Need Multi‑Hop Sanity Checks

A mechanism-first reading of a VLM factuality paper showing why multimodal systems need explicit verification paths, not just larger perception models.

November 26, 2025 · 15 min · Zelina

Trust Issues: Why Neural Networks Need Their Own Internal Affairs Department

A mechanism-first reading of PaTAS, a Subjective Logic framework that treats neural-network trust as something propagated through data, parameters, and inference paths—not guessed from accuracy.

November 26, 2025 · 16 min · Zelina

When AI Reviews AI: Turning Foundation Models into Safety Inspectors

A mechanism-first reading of how REACT and SemaLens use LLMs and VLMs to make safety-critical AI systems more inspectable without pretending that AI can certify itself.

November 26, 2025 · 19 min · Zelina

Who Owns Your Words? Copyright, LLMs, and the Quiet Arms Race Over Training Data

A mechanism-first look at how copyright-detection pipelines turn LLM memorization into an operational audit signal, without pretending it is courtroom proof.

November 26, 2025 · 17 min · Zelina