Cognaptus Insights

Persona Non Grata: When LLMs Forget They're AI

A behavioral audit shows why professional personas can suppress AI self-disclosure, why bigger models do not solve it, and how enterprises should test trust before deploying expert-like agents.

Seeing Is Believing—Planning Is Not: What SpatialBench Reveals About MLLMs

SpatialBench shows why multimodal models that recognize scenes can still fail at the harder work of spatial abstraction, causality, and planning.

Tile by Tile: Why LLMs Still Can't Plan Their Way Out of a 3×3 Box

A close reading of an 8-puzzle study showing why fluent reasoning traces are still a poor substitute for explicit state management, validators, and real planning systems.

Fragments, Feedback, and Fast Drugs: When Generative Models Grow a Spine

How FRAGMENTA reframes small-data drug lead optimization as a feedback-loop problem, not merely a bigger-model problem.

Maps, Models, and Mobility: GPT Goes for a Walk

A mechanism-first reading of how GPT-style Transformers can be adapted from text tokens to continuous mobility trajectories without pretending the tutorial is a benchmark race.

Pills, Protocols, and Parameters: When LLMs Sit the Pharmacist Exam

A Chinese pharmacist licensure benchmark shows why LLM deployment in professional education should be mapped by task category, not by model leaderboard score.

Reasoning in Stereo: Why Vision-Language Models Need Multi‑Hop Sanity Checks

A mechanism-first reading of a VLM factuality paper showing why multimodal systems need explicit verification paths, not just larger perception models.

Trust Issues: Why Neural Networks Need Their Own Internal Affairs Department

A mechanism-first reading of PaTAS, a Subjective Logic framework that treats neural-network trust as something propagated through data, parameters, and inference paths, not guessed from accuracy.

When AI Reviews AI: Turning Foundation Models into Safety Inspectors

A mechanism-first reading of how REACT and SemaLens use LLMs and VLMs to make safety-critical AI systems more inspectable without pretending that AI can certify itself.

Who Owns Your Words? Copyright, LLMs, and the Quiet Arms Race Over Training Data

A mechanism-first look at how copyright-detection pipelines turn LLM memorization into an operational audit signal, without pretending it is courtroom proof.