Machine Ethics

When RAG Needs Provenance, Not Just Recall: Traceable Answers Across Fragmented Knowledge

Opening — Why this matters now RAG is supposed to make large language models safer. Ground the model in documents, add citations, and hallucinations politely leave the room—or so the story goes. In practice, especially in expert domains, RAG often fails in a quieter, more dangerous way: it retrieves something relevant, but not the right kind of evidence. ...

When Transformers Learn the Map: Why Geography Still Matters in Traffic AI

Opening — Why this matters now Digital twins for transport are no longer futuristic demos. They are quietly becoming operational systems, expected to anticipate congestion, test control policies, and absorb shocks before drivers ever feel them. But a digital twin that only mirrors the present is reactive by definition. To be useful, it must predict. ...

Perspective Without Rewards: When AI Develops a Point of View

Opening — Why this matters now As AI systems grow more autonomous, the uncomfortable question keeps resurfacing: what does it even mean for a machine to have a perspective? Not intelligence, not planning, not goal pursuit—but a situated, history-sensitive way the world is given to the system itself. Most modern agent architectures quietly dodge this question. They optimize rewards, compress states, maximize returns—and call whatever internal structure emerges a day. But subjectivity, if it exists at all in machines, is unlikely to be a side effect of reward maximization. It is more plausibly a structural condition: something slow, global, and stubbornly resistant to momentary incentives. ...

Conformal Thinking: Teaching LLMs When to Stop Thinking

Opening — Why this matters now Reasoning models have learned how to think longer. Unfortunately, they have not learned when to stop. Test-time scaling has become the industry’s favorite blunt instrument: allocate more tokens, get better answers—on average. But averages are a luxury in deployment. In production systems, every additional token is a cost, and every premature stop is a risk. The uncomfortable truth is that “adaptive reasoning” merely replaces one opaque knob (token limits) with another (confidence thresholds), without offering a principled way to tune either. ...

When Your Agent Starts Copying Itself: Breaking Conversational Inertia

Opening — Why this matters now Multi-turn agents are supposed to get better with experience. More context, more feedback, more opportunities to adapt. Yet in practice, the opposite often happens. Agents loop. They fixate. They repeat themselves with growing confidence and shrinking effectiveness. This paper puts a name—and a mechanism—on that failure mode: conversational inertia. And more importantly, it shows that the problem is not a lack of information, but too much of the wrong kind. ...

Click with Confidence: Teaching GUI Agents When Not to Click

Opening — Why this matters now Autonomous GUI agents are finally leaving demos and entering production. They book meetings, fill forms, manage dashboards—and occasionally approve payments they should not. The uncomfortable truth is that one mis-click can be irreversible. Yet most GUI grounding models behave with absolute confidence, even when they are guessing. The paper “SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration” tackles this exact failure mode. Its core argument is simple but sharp: progress in GUI agents is no longer bottlenecked by accuracy alone, but by the absence of calibrated doubt. ...

Identity Crisis: How a Trivial Trick Teaches LLMs to Think Backwards

Opening — Why this matters now Large language models can write poetry, solve Olympiad-level math problems, and simulate entire businesses—yet they reliably fail at a task that feels almost insulting in its simplicity: if Alice’s husband is Bob, they struggle to answer who is Bob’s wife? This failure mode, known as the reversal curse, has become something of an embarrassment for autoregressive models. More troublingly, a growing body of literature has argued that the curse is fundamental: a baked-in limitation of left-to-right next-token prediction. If true, this would place a hard ceiling on what today’s LLM architectures can ever reliably reason about. ...

When Language Learns to Doubt Itself: Self-Contradiction as an Upgrade Path for Multimodal AI

Opening — Why this matters now Multimodal large language models (MLLMs) can describe, caption, and reason about images with impressive fluency. Yet beneath the polished surface lies a persistent flaw: they often say the right thing without truly understanding it. This mismatch—known as the generation–understanding gap—has become a quiet bottleneck as MLLMs move from demos into decision‑support systems, compliance tools, and autonomous agents. ...

Agentic Systems Need Architecture, Not Vibes

Opening — Why this matters now Agentic AI has officially entered its awkward adolescence. It can plan, call tools, collaborate, and occasionally impress investors—but it also hallucinates, forgets, loops endlessly, and collapses under modest real‑world complexity. The problem is no longer model capability. It’s architecture. Today’s agent systems are mostly stitched together through intuition, blog wisdom, and prompt folklore. Powerful, yes—but brittle. What’s missing is not another clever prompt trick, but an engineering discipline. ...

Metric Time Without the Clock: Making ASP Scale Again

Opening — Why this matters now Temporal reasoning has always been the Achilles’ heel of symbolic AI. The moment time becomes quantitative—minutes, deadlines, durations—logic programs tend to balloon, grounders panic, and scalability quietly exits the room. This paper lands squarely in that discomfort zone and does something refreshingly unglamorous: it makes time boring again. And boring, in this case, is good for business. ...