Reasoning Evaluation on Cognaptus

Reasoning Evaluation on Cognaptus https://cognaptus.com/tags/reasoning-evaluation/ Recent content in Reasoning Evaluation on Cognaptus Hugo -- 0.145.0 en-us Sun, 21 Jun 2026 00:00:00 +0000 The Model Agreed With Itself. That Was the Problem. https://cognaptus.com/blog/2026-06-21-the-model-agreed-with-itself-that-was-the-problem/ Sun, 21 Jun 2026 00:00:00 +0000 https://cognaptus.com/blog/2026-06-21-the-model-agreed-with-itself-that-was-the-problem/ A mechanism-first analysis of structural uncertainty, a black-box method for detecting unstable LLM reasoning even when sampled answers agree.