AI Governance

When Agents Talk Back: Why AI Collectives Need a Social Theory

Teams are easy to draw and hard to govern. Put five AI agents in a workflow diagram and everything looks reassuringly corporate: one planner, one researcher, one coder, one critic, one manager. Give them arrows. Add a dashboard. Call it orchestration. Investors relax. Engineers nod. Consultants quietly increase the font size on the word “autonomous.” ...

When Models Know They’re Wrong: Catching Jailbreaks Mid-Sentence

Guardrails usually fail quietly. A user sends a malicious prompt. The model begins answering. The safety policy that looked firm in the demo environment starts behaving like office wallpaper: present, decorative, and not especially involved. By the time a post-hoc filter reads the final answer, the model has already produced the thing it should not have produced. The system may block the response from the user, but the real lesson is less flattering: the model crossed the line before the defense noticed. ...

EvoFSM: Teaching AI Agents to Evolve Without Losing Their Minds

Workflow is the unglamorous part of agentic AI. Which is precisely why it matters. A research agent can have a strong language model, a decent search tool, and an impressive ability to produce paragraphs that sound like a McKinsey intern who drank too much espresso. Yet when the task becomes long, ambiguous, and evidence-heavy, the same agent often fails for a boring reason: it does the right actions in the wrong order, repeats the same weak search, summarizes too early, forgets to verify a source, or changes its own instructions so enthusiastically that it becomes a different employee halfway through the job. ...

Agents That Ship, Not Just Think: When LLM Self-Improvement Meets Release Engineering

Shipping Is the Part Agents Usually Skip Shipping is where confidence goes to die. A demo agent can impress everyone on Tuesday, receive a clever prompt update on Wednesday, and quietly break three workflows that were working last week. The aggregate score improves. The release notes look cheerful. Somewhere, a previously solved customer task becomes unsolved again. Naturally, everyone calls this “iteration,” because “we broke production while chasing a benchmark bump” sounds less strategic. ...

Model Cannibalism: When LLMs Learn From Their Own Echo

Feedback is usually sold as the civilized part of AI deployment. Users interact with the model. The product team collects prompts, outputs, ratings, usage logs, corrections, maybe a few thumbs-up signals. The model is fine-tuned. The next version is better. Everybody nods. A dashboard is opened. Someone says “continuous improvement.” The room relaxes. ...

Agents Gone Rogue: Why Multi-Agent AI Quietly Falls Apart

A workflow looks stable on Monday. The planner assigns tasks. The research agent gathers evidence. The calculator checks numbers. The compliance agent says no to the obviously bad idea, which is rude but useful. The whole multi-agent system feels less like a chatbot and more like a small digital department with unusually poor lunch habits. ...

Trading Without Cheating: Teaching LLMs to Reason When Markets Lie

Trade has a special talent for humiliating clean theories. A model reads a market brief. It sees earnings beats, sales guidance, analyst upgrades, and a few scattered corporate events. Asked to behave like a turnaround specialist, it starts building buy signals. Some recommendations are reasonable. Others quietly smuggle in missing assumptions: maybe the company has new management; maybe the earnings beat reflects restructuring; maybe debt reduction is happening somewhere behind the curtain. Very elegant. Also, very convenient. ...

Batch of Thought, Not Chain of Thought: Why LLMs Reason Better Together

Fraud review is not a solo sport. A risk analyst looking at one suspicious seller can notice a strange product description, a vague company name, or a price range that feels wrong. But the real signal often appears only when several sellers are placed side by side. One shop looks unusual. Ten shops with the same naming pattern, same product mismatch, and same pricing behavior start to look less like noise and more like a system. ...

Pulling the Thread: Why LLM Reasoning Often Unravels

Audit is a less glamorous word than intelligence. That is unfortunate, because most business problems with AI agents do not begin with stupidity. They begin with confidence. The agent gives an answer. The answer sounds reasonable. The explanation sounds even better. A manager, analyst, compliance reviewer, or product owner reads the chain of thought and feels the mild comfort of seeing steps. There is a premise, then a bridge, then a conclusion. Very civilized. Very inspectable. Very possibly fake. ...

Causality Remembers: Teaching Social Media Defenses to Learn from the Past

Moderation teams do not usually lose because they see nothing. They lose because they see too much: thousands of accounts posting near the same topic, near the same time, with enough similarity to look suspicious and enough difference to remain deniable. Some are campaign assets. Some are enthusiastic humans. Some are bots. Some are people who simply saw the same trending story and behaved like everyone else, which is annoying for both democracy and data science. ...