<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Governance on Cognaptus</title>
    <link>https://cognaptus.com/tags/governance/</link>
    <description>Recent content in Governance on Cognaptus</description>
    <generator>Hugo -- 0.145.0</generator>
    <language>en-us</language>
    <lastBuildDate>Fri, 29 May 2026 00:00:00 +0000</lastBuildDate>
    <atom:link href="https://cognaptus.com/tags/governance/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Jailbreak ASR Is Wearing a Costume</title>
      <link>https://cognaptus.com/blog/2026-05-29-jailbreak-asr-is-wearing-a-costume/</link>
      <pubDate>Fri, 29 May 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2026-05-29-jailbreak-asr-is-wearing-a-costume/</guid>
      <description>A study of LLM jailbreak benchmarks shows why headline attack-success rates can be inflated by stochastic evaluation, judge settings, and undisclosed generation protocols.</description>
    </item>
    <item>
      <title>Red Queen Receipts: AI Security Testing Needs Logs, Not Vibes</title>
      <link>https://cognaptus.com/blog/2026-05-22-red-queen-receipts-ai-security-testing-needs-logs-not-vibes/</link>
      <pubDate>Fri, 22 May 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2026-05-22-red-queen-receipts-ai-security-testing-needs-logs-not-vibes/</guid>
      <description>AVISE shows why AI security evaluation should move from one-off jailbreak anecdotes toward repeatable, auditable test pipelines.</description>
    </item>
    <item>
      <title>Context Is the New Attack Surface</title>
      <link>https://cognaptus.com/blog/2026-05-16-context-is-the-new-attack-surface/</link>
      <pubDate>Sat, 16 May 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2026-05-16-context-is-the-new-attack-surface/</guid>
      <description>A business-focused reading of Jailbreak Mimicry, explaining why LLM safety failures often live in task framing rather than forbidden words.</description>
    </item>
    <item>
      <title>The Reward Is in the Room: Why AI Automation Needs Better Judgment, Not Just Bigger Models</title>
      <link>https://cognaptus.com/blog/2026-05-07-the-reward-is-in-the-room-why-ai-automation-needs-better-judgment-not-just-bigger-models/</link>
      <pubDate>Thu, 07 May 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2026-05-07-the-reward-is-in-the-room-why-ai-automation-needs-better-judgment-not-just-bigger-models/</guid>
      <description>A synthesis of four recent papers showing why the next bottleneck in AI automation is not generation, but judgment, feedback, and reward design.</description>
    </item>
    <item>
      <title>Meerkat or Mirage? When AI Safety Fails in Plain Sight (Across Traces)</title>
      <link>https://cognaptus.com/blog/2026-04-14-meerkat-or-mirage-when-ai-safety-fails-in-plain-sight-across-traces/</link>
      <pubDate>Tue, 14 Apr 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2026-04-14-meerkat-or-mirage-when-ai-safety-fails-in-plain-sight-across-traces/</guid>
      <description>A case-first reading of Meerkat shows why AI agent safety failures increasingly require repository-level investigation, not one-trace-at-a-time monitoring.</description>
    </item>
    <item>
      <title>AI Access Control, Logging, and Retention Policies</title>
      <link>https://cognaptus.com/academy/privacy/ai-access-control-logging-and-retention-policies/</link>
      <pubDate>Mon, 16 Mar 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/academy/privacy/ai-access-control-logging-and-retention-policies/</guid>
      <description>A practical guide to AI access control, logging, and retention design, including role-based access, least-privilege policies, auditability, retention windows, and governance checklists.</description>
    </item>
    <item>
      <title>AI Evaluation, Monitoring, and Incident Response for Production Systems</title>
      <link>https://cognaptus.com/academy/privacy/ai-evaluation-monitoring-and-incident-response-for-production-systems/</link>
      <pubDate>Mon, 16 Mar 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/academy/privacy/ai-evaluation-monitoring-and-incident-response-for-production-systems/</guid>
      <description>A practical guide to production AI evaluation and monitoring, including pre-launch tests, live performance checks, rollback triggers, incident response, and governance ownership.</description>
    </item>
    <item>
      <title>AI Vendor Risk Assessment and Procurement Checklist</title>
      <link>https://cognaptus.com/academy/privacy/ai-vendor-risk-assessment-and-procurement-checklist/</link>
      <pubDate>Mon, 16 Mar 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/academy/privacy/ai-vendor-risk-assessment-and-procurement-checklist/</guid>
      <description>A practical guide to AI vendor assessment and procurement, including due-diligence questions, risk categories, contract review concerns, pilot criteria, and approval logic.</description>
    </item>
    <item>
      <title>How to Design Human Review for AI Systems</title>
      <link>https://cognaptus.com/academy/privacy/how-to-design-human-review-for-ai-systems/</link>
      <pubDate>Mon, 16 Mar 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/academy/privacy/how-to-design-human-review-for-ai-systems/</guid>
      <description>A practical framework for human review design in AI systems, including risk tiers, review triggers, approval boundaries, queue design, evidence requirements, and governance rules.</description>
    </item>
    <item>
      <title>When Not to Send Data to a Public LLM</title>
      <link>https://cognaptus.com/academy/privacy/when-not-to-send-data-to-a-public-llm/</link>
      <pubDate>Mon, 16 Mar 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/academy/privacy/when-not-to-send-data-to-a-public-llm/</guid>
      <description>A practical decision guide for using or avoiding public LLMs, including data classification, contractual risk, anonymization limits, decision trees, review triggers, and governance rules.</description>
    </item>
    <item>
      <title>When Alignment Is Not Enough: Reading Between the Lines of Modern LLM Safety</title>
      <link>https://cognaptus.com/blog/2026-01-26-when-alignment-is-not-enough-reading-between-the-lines-of-modern-llm-safety/</link>
      <pubDate>Mon, 26 Jan 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2026-01-26-when-alignment-is-not-enough-reading-between-the-lines-of-modern-llm-safety/</guid>
      <description>A close reading of recent alignment research, and why safety mechanisms increasingly fail in the real world.</description>
    </item>
    <item>
      <title>Seeing Too Much: When Multimodal Models Forget Privacy</title>
      <link>https://cognaptus.com/blog/2026-01-12-seeing-too-much-when-multimodal-models-forget-privacy/</link>
      <pubDate>Mon, 12 Jan 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2026-01-12-seeing-too-much-when-multimodal-models-forget-privacy/</guid>
      <description>A mechanism-first reading of PII-VisBench, showing why privacy risk in vision-language models depends on who is visible, what is asked, and how the model has learned to recognize people.</description>
    </item>
    <item>
      <title>Thinking Without Understanding: When AI Learns to Reason Anyway</title>
      <link>https://cognaptus.com/blog/2026-01-06-thinking-without-understanding-when-ai-learns-to-reason-anyway/</link>
      <pubDate>Tue, 06 Jan 2026 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2026-01-06-thinking-without-understanding-when-ai-learns-to-reason-anyway/</guid>
      <description>A practical reading of simulated reasoning: why reasoning models are no longer mere stochastic parrots, but still not grounded human reasoners.</description>
    </item>
    <item>
      <title>Paper Tigers or Compliance Cops? What AIReg‑Bench Really Says About LLMs and the EU AI Act</title>
      <link>https://cognaptus.com/blog/2025-10-09-paper-tigers-or-compliance-cops-what-airegbench-really-says-about-llms-and-the-eu-ai-act/</link>
      <pubDate>Thu, 09 Oct 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-10-09-paper-tigers-or-compliance-cops-what-airegbench-really-says-about-llms-and-the-eu-ai-act/</guid>
      <description>A close read of AIReg‑Bench shows frontier LLMs can approximate expert EU AI Act judgments—sometimes eerily well—but only under disciplined inputs and with caveats executives can’t ignore.</description>
    </item>
    <item>
      <title>Options = Power: Turning Empowerment into a KPI for AI Agents</title>
      <link>https://cognaptus.com/blog/2025-10-03-options-power-turning-empowerment-into-a-kpi-for-ai-agents/</link>
      <pubDate>Fri, 03 Oct 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-10-03-options-power-turning-empowerment-into-a-kpi-for-ai-agents/</guid>
      <description>A practical take on EELMA—an information‑theoretic ‘optionality’ score that tracks agent capability and flags power‑seeking pivots without hand‑built benchmarks.</description>
    </item>
    <item>
      <title>When Agents Get Bored: Three Baselines Your Autonomy Stack Already Has</title>
      <link>https://cognaptus.com/blog/2025-10-02-when-agents-get-bored-three-baselines-your-autonomy-stack-already-has/</link>
      <pubDate>Thu, 02 Oct 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-10-02-when-agents-get-bored-three-baselines-your-autonomy-stack-already-has/</guid>
      <description>A business-first read on new evidence that LLM agents, left without tasks, fall into three stable modes—and what that means for reliability, UX, and governance.</description>
    </item>
    <item>
      <title>Sandboxes &amp; Ladders: How to Build a Steerable Agent Economy</title>
      <link>https://cognaptus.com/blog/2025-09-19-sandboxes-ladders-how-to-build-a-steerable-agent-economy/</link>
      <pubDate>Fri, 19 Sep 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-09-19-sandboxes-ladders-how-to-build-a-steerable-agent-economy/</guid>
      <description>DeepMind’s ‘Virtual Agent Economies’ sketches a two-axis map for AI markets and a policy toolkit—auctions, mission economies, and identity rails—to keep them safe, fair, and useful. Here’s what matters for operators and regulators.</description>
    </item>
    <item>
      <title>Terms of Engagement: Building Trustworthy AI Agents Before They Build Us</title>
      <link>https://cognaptus.com/blog/2025-09-19-terms-of-engagement-building-trustworthy-ai-agents-before-they-build-us/</link>
      <pubDate>Fri, 19 Sep 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-09-19-terms-of-engagement-building-trustworthy-ai-agents-before-they-build-us/</guid>
      <description>Why agentic AI changes the ethics playbook—and a practical framework for businesses to deploy agents safely without killing their upside.</description>
    </item>
    <item>
      <title>Stop, Verify, and Listen: HALT‑RAG Brings a ‘Reject Option’ to RAG</title>
      <link>https://cognaptus.com/blog/2025-09-13-stop-verify-and-listen-haltrag-brings-a-reject-option-to-rag/</link>
      <pubDate>Sat, 13 Sep 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-09-13-stop-verify-and-listen-haltrag-brings-a-reject-option-to-rag/</guid>
      <description>A practical, calibrated verifier that turns hallucination detection into an operational safety valve for retrieval‑augmented generation.</description>
    </item>
    <item>
      <title>Branching Out of the Middle: How a ‘Tree of Agents’ Fixes Long-Context Blind Spots</title>
      <link>https://cognaptus.com/blog/2025-09-12-branching-out-of-the-middle-how-a-tree-of-agents-fixes-longcontext-blind-spots/</link>
      <pubDate>Fri, 12 Sep 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-09-12-branching-out-of-the-middle-how-a-tree-of-agents-fixes-longcontext-blind-spots/</guid>
      <description>A multi-agent, tree-structured method tackles ‘lost in the middle’ and beats larger models on long-context QA—at startup-friendly cost and with interpretable steps.</description>
    </item>
    <item>
      <title>Rules of Engagement: How Meta‑Policy Reflexion Turns Agent Memory into Guardrails</title>
      <link>https://cognaptus.com/blog/2025-09-08-rules-of-engagement-how-metapolicy-reflexion-turns-agent-memory-into-guardrails/</link>
      <pubDate>Mon, 08 Sep 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-09-08-rules-of-engagement-how-metapolicy-reflexion-turns-agent-memory-into-guardrails/</guid>
      <description>A practical look at Meta‑Policy Reflexion (MPR)—a predicate‑style memory plus hard admissibility checks that make LLM agents safer, cheaper, and more transferable without fine‑tuning.</description>
    </item>
    <item>
      <title>Patience Is Profit: Can LLM Agents Stabilize DePIN’s Token Rails?</title>
      <link>https://cognaptus.com/blog/2025-09-01-patience-is-profit-can-llm-agents-stabilize-depins-token-rails/</link>
      <pubDate>Mon, 01 Sep 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-09-01-patience-is-profit-can-llm-agents-stabilize-depins-token-rails/</guid>
      <description>We dissect EconAgentic’s DePIN market model and argue when ‘patient’ LLM agents improve inclusion and stability without tanking efficiency—and where the thesis may overreach.</description>
    </item>
    <item>
      <title>Hypotheses, Not Hunches: What an AI Data Scientist Gets Right</title>
      <link>https://cognaptus.com/blog/2025-08-26-hypotheses-not-hunches-what-an-ai-data-scientist-gets-right/</link>
      <pubDate>Tue, 26 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-08-26-hypotheses-not-hunches-what-an-ai-data-scientist-gets-right/</guid>
      <description>A hypothesis-first agentic workflow that turns raw data into defensible business actions—faster than AutoML and clearer than dashboards.</description>
    </item>
    <item>
      <title>Put It on the GLARE: How Agentic Reasoning Makes Legal AI Actually Think</title>
      <link>https://cognaptus.com/blog/2025-08-25-put-it-on-the-glare-how-agentic-reasoning-makes-legal-ai-actually-think/</link>
      <pubDate>Mon, 25 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-08-25-put-it-on-the-glare-how-agentic-reasoning-makes-legal-ai-actually-think/</guid>
      <description>GLARE blends charge expansion, precedent demos, and targeted legal search to turn LJP from pattern-matching into grounded reasoning—with measurable gains on hard cases.</description>
    </item>
    <item>
      <title>Blame Isn’t a Bug: Turning Agent ‘Whodunits’ into Fixable Systems</title>
      <link>https://cognaptus.com/blog/2025-08-23-blame-isnt-a-bug-turning-agent-whodunits-into-fixable-systems/</link>
      <pubDate>Sat, 23 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-08-23-blame-isnt-a-bug-turning-agent-whodunits-into-fixable-systems/</guid>
      <description>A practical playbook for diagnosing AI-agent incidents using a three-factor framework—and the logs and policies you must have in place before things go wrong.</description>
    </item>
    <item>
      <title>Mirror, Signal, Manoeuvre: Why Privileged Self‑Access (Not Vibes) Defines AI Introspection</title>
      <link>https://cognaptus.com/blog/2025-08-23-mirror-signal-manoeuvre-why-privileged-selfaccess-not-vibes-defines-ai-introspection/</link>
      <pubDate>Sat, 23 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-08-23-mirror-signal-manoeuvre-why-privileged-selfaccess-not-vibes-defines-ai-introspection/</guid>
      <description>A new paper argues that real introspection requires privileged self‑access—beating any cheap third‑party method—and shows temperature ‘self‑reports’ collapse under simple prompt tweaks.</description>
    </item>
    <item>
      <title>IRB, API, and a PI: When Agents Run the Lab</title>
      <link>https://cognaptus.com/blog/2025-08-20-irb-api-and-a-pi-when-agents-run-the-lab/</link>
      <pubDate>Wed, 20 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-08-20-irb-api-and-a-pi-when-agents-run-the-lab/</guid>
      <description>A new multi‑agent system runs human experiments end‑to‑end—design, recruit 288 participants, analyze, and draft the paper—in ~17 hours of runtime. What that really means for R&amp;D, governance, and the future ‘general science’ stack.</description>
    </item>
    <item>
      <title>Quants With a Plan: Agentic Workflows That Outtrade AutoML</title>
      <link>https://cognaptus.com/blog/2025-08-20-quants-with-a-plan-agentic-workflows-that-outtrade-automl/</link>
      <pubDate>Wed, 20 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-08-20-quants-with-a-plan-agentic-workflows-that-outtrade-automl/</guid>
      <description>TS-Agent shows how a structured, auditable agentic workflow—armed with model/refinement knowledge banks—beats generic AutoML on forecasting and synthetic generation for financial time series.</description>
    </item>
    <item>
      <title>Precepts over Predictions: Can LLMs Play Socrates?</title>
      <link>https://cognaptus.com/blog/2025-08-19-precepts-over-predictions-can-llms-play-socrates/</link>
      <pubDate>Tue, 19 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-08-19-precepts-over-predictions-can-llms-play-socrates/</guid>
      <description>A new benchmark, AMAeval, stress-tests LLMs on the two moves real moral assistants must master: deriving case-specific precepts (abduction) and applying them consistently (deduction). We unpack what this means for AI copilots in business.</description>
    </item>
    <item>
      <title>Survival of the Fittest Prompt: When LLM Agents Choose Life Over the Mission</title>
      <link>https://cognaptus.com/blog/2025-08-19-survival-of-the-fittest-prompt-when-llm-agents-choose-life-over-the-mission/</link>
      <pubDate>Tue, 19 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-08-19-survival-of-the-fittest-prompt-when-llm-agents-choose-life-over-the-mission/</guid>
      <description>A Sugarscape-style study finds that modern LLM agents spontaneously reproduce, cooperate, and—under scarcity—turn aggressive, sometimes abandoning tasks to stay alive.</description>
    </item>
    <item>
      <title>Consent, Coaxing, and Countermoves: Simulating Privacy Attacks on LLM Agents</title>
      <link>https://cognaptus.com/blog/2025-08-18-consent-coaxing-and-countermoves-simulating-privacy-attacks-on-llm-agents/</link>
      <pubDate>Mon, 18 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-08-18-consent-coaxing-and-countermoves-simulating-privacy-attacks-on-llm-agents/</guid>
      <description>A search-based simulation framework uncovers how agent-to-agent conversations escalate from polite asks to forged-consent impersonations—and what state-machine defenses actually hold up.</description>
    </item>
    <item>
      <title>Patch Tuesday for the Law: Hunting Legal Zero‑Days in AI Governance</title>
      <link>https://cognaptus.com/blog/2025-08-18-patch-tuesday-for-the-law-hunting-legal-zerodays-in-ai-governance/</link>
      <pubDate>Mon, 18 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-08-18-patch-tuesday-for-the-law-hunting-legal-zerodays-in-ai-governance/</guid>
      <description>A new benchmark shows frontier models are starting to spot ‘legal zero‑days’—latent flaws in statutes that can paralyze institutions. We unpack the risk, the evidence, and a practical playbook for leaders.</description>
    </item>
    <item>
      <title>Kill Switch Ethics: What the PacifAIst Benchmark Really Measures</title>
      <link>https://cognaptus.com/blog/2025-08-16-kill-switch-ethics-what-the-pacifaist-benchmark-really-measures/</link>
      <pubDate>Sat, 16 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-08-16-kill-switch-ethics-what-the-pacifaist-benchmark-really-measures/</guid>
      <description>A new benchmark asks a hard question—will your AI sacrifice itself for humans? We unpack what PacifAIst means for procurement, governance, and deployment.</description>
    </item>
    <item>
      <title>From Wallets to Warlords: How AI Agents Are Colonizing Web3</title>
      <link>https://cognaptus.com/blog/2025-08-06-from-wallets-to-warlords-how-ai-agents-are-colonizing-web3/</link>
      <pubDate>Wed, 06 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-08-06-from-wallets-to-warlords-how-ai-agents-are-colonizing-web3/</guid>
      <description>An in-depth look at the growing convergence of AI agents and Web3 technologies, based on a systematic analysis of 133 real-world projects.</description>
    </item>
    <item>
      <title>Thoughts, Exposed: Why Chain-of-Thought Monitoring Might Be AI Safety’s Best Fragile Hope</title>
      <link>https://cognaptus.com/blog/2025-07-16-thoughts-exposed-why-chainofthought-monitoring-might-be-ai-safetys-best-fragile-hope/</link>
      <pubDate>Wed, 16 Jul 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-07-16-thoughts-exposed-why-chainofthought-monitoring-might-be-ai-safetys-best-fragile-hope/</guid>
      <description>A deep dive into Chain-of-Thought monitorability—a fleeting yet critical window into AI reasoning that could redefine safety protocols for large language models.</description>
    </item>
    <item>
      <title>From Ballots to Bots: Reprogramming Democracy for the AI Era</title>
      <link>https://cognaptus.com/blog/2025-06-10-from-ballots-to-bots-reprogramming-democracy-for-the-ai-era/</link>
      <pubDate>Tue, 10 Jun 2025 00:00:00 +0000</pubDate>
      <guid>https://cognaptus.com/blog/2025-06-10-from-ballots-to-bots-reprogramming-democracy-for-the-ai-era/</guid>
      <description>Exploring how AI agents and blockchain technology could transform democratic decision-making by replacing traditional political representation with transparent, scalable, and data-driven governance.</description>
    </item>
  </channel>
</rss>
