
Peer Review in the Age of Agents: When Scientists Go Silicon

Opening — Why this matters now Artificial intelligence is no longer content with taking your job; it now wants to publish in your favorite journal. If 2024 was the year enterprises raced to bolt LLMs onto every workflow, 2025 is the year science itself became an experiment — with AI as both the subject and the researcher. ...

November 21, 2025 · 5 min · Zelina

RL, Recall, and the Rise of Agentic Memory: What Memory-R1 Means for AI Systems

Opening — Why this matters now The AI ecosystem is shifting from clever parrots to agents that can sustain long‑horizon workflows. Yet even the flashiest models stumble on the simplest human expectation: remembering what happened five minutes ago. Statelessness remains the enemy of reliability. Memory-R1 — introduced in a recent paper from LMU Munich and collaborators — pushes back against this brittleness. Instead of stuffing longer prompts or bolting on static RAG pipelines, it proposes something far more interesting: reinforcement-trained memory management. Think of it as teaching a model not just to recall, but to care about what it chooses to remember. ...

November 21, 2025 · 4 min · Zelina

Tentacles of Thought: Why Six Is the New One in Multimodal AI

Opening — Why this matters now The multimodal AI arms race is no longer about who can see more pixels or generate prettier sketches. It’s about whether models can think across modalities the way humans do—fluidly, strategically, and with the right tool for the moment. Most systems still behave like students who bring one pen to an exam: capable, but painfully limited. The newly proposed Octopus framework—with its six-capability orchestration—suggests a different future: one where a model doesn’t just hold tools, but chooses them. It’s a quiet shift with big implications for enterprise automation. ...

November 21, 2025 · 4 min · Zelina

Compression, But Make It Pedagogical: Rate–Distortion KGs for Smarter AI Learning Assistants

Opening — Why this matters now The age of AI-powered learning assistants has arrived, but most of them still behave like overeager interns—confident, quick, and occasionally catastrophically wrong. The weakest link isn’t the models; it’s the structure (or lack thereof) behind their reasoning. Lecture notes fed directly into an LLM produce multiple-choice questions with the usual suspects: hallucinations, trivial distractors, and the unmistakable scent of “I made this up.” ...

November 20, 2025 · 6 min · Zelina

Flip the Switch: How Heterogeneous Agents Learn to Restore the Grid

Opening — Why this matters now Extreme weather, brittle infrastructure, and decentralised energy markets are converging into one perennial headache: when the power goes out, restoring it is neither quick nor cheap. Utilities increasingly rely on automation and AI assistance, but most existing systems buckle under the messy, nonlinear physics of real distribution networks. Restoration isn’t just an optimisation puzzle — it’s an orchestration of microgrids, generators, constraints, and switching actions that cascade through the system. ...

November 20, 2025 · 4 min · Zelina

Prompted and Confused: When LLMs Forget the Assignment

Opening — Why this matters now The industry narrative says LLMs are marching confidently toward automating everything from tax audits to telescope alignment. Constraint programming — the backbone of scheduling, routing, and resource allocation — is often portrayed as the next domain ripe for “LLM takeover.” Just describe your optimisation problem in plain English and voilà: a clean, executable model. ...

November 20, 2025 · 4 min · Zelina

Skills to Pay the Agent Bills: Why LLMs Need Better Moves, Not Bigger Models

Opening — Why this matters now Large language model agents are expanding into tasks that look suspiciously like real work: navigating UIs, operating tools, and making sequential decisions in messy environments. The industry’s response has been predictable—give the model more context, more examples, more memory, more everything. But bigger prompts aren’t the same as better reasoning. Most agents still wander around like interns on their first day: energetic, but directionless. ...

November 20, 2025 · 4 min · Zelina

Thresholds, Trade-offs, and the Art of Not Overthinking Your Robot

Opening — Why this matters now The current wave of robotics and agentic AI is colliding with a familiar enemy: uncertainty. You can train a visual model to spot a cup, a box, or an inexplicably glossy demo object—but when those predictions get fed into a planner, the whole pipeline begins to wobble. Businesses deploying AI agents in warehouses, kitchens, labs, or digital environments need systems that don’t fold the moment the camera blinks. ...

November 20, 2025 · 4 min · Zelina

Tools of Habit: Why LLM Agents Benefit from a Little Inertia

Opening — Why this matters now LLM agents are finally doing real work—querying APIs, navigating unstructured systems, solving multi-step tasks. But their shiny autonomy hides a quiet tax: every tool call usually means another LLM inference. And when you chain many of them together (as all interesting workflows do), latency and cost balloon. ...

November 20, 2025 · 4 min · Zelina

Value Collision Course: When LLM Alignment Plays Favorites

Opening — Why this matters now The industry is finally waking up to an uncomfortable truth: AI alignment isn’t a monolithic engineering task—it’s a political act wrapped in an optimization problem. Every time we say a model is “safe,” we’re really saying it is safe for whom. A new empirical study puts hard numbers behind what many practitioners suspected but lacked the data to prove: the way we collect, compress, and optimize human feedback implicitly privileges certain groups over others. And in a world where LLMs increasingly mediate customer service, financial advice, hiring flows, and mental-health interactions, this is not an academic quibble—it’s a governance risk hiding in plain sight. ...

November 20, 2025 · 5 min · Zelina