TextWorld on Cognaptus

Recent content in TextWorld on Cognaptus: https://cognaptus.com/tags/textworld/

Knowing Is Not Doing: When LLM Agents Pass the Task but Fail the World
https://cognaptus.com/blog/2026-01-15-knowing-is-not-doing-when-llm-agents-pass-the-task-but-fail-the-world/
Thu, 15 Jan 2026 00:00:00 +0000
Task2Quiz shows why agent evaluation needs to separate task completion from grounded environment understanding.