Robotics

Don’t Tell the Robot What You Know

Directions are easy when both people see the same room. “Move left.” “Go toward the table.” “The apple is beside the sofa.” These are perfectly reasonable instructions if speaker and listener share the same visual world. They become less reasonable when one of them is staring at a wall, cannot see the table, and has no reason to believe the sofa exists. At that point, the problem is no longer navigation. It is epistemology, with furniture. ...

When Black Boxes Grow Teeth: Mapping What AI Can Actually Do

A green block, a yellow block, and a very small number Green on yellow. That is the task. A tabletop robot sees a green block, a yellow block, and a few other objects. It has low-level manipulation skills. It receives a high-level instruction: put the green block on top of the yellow block. This sounds like exactly the kind of small benchmark task that modern AI agents should now handle with theatrical confidence. ...

ImplicitRDP: When Robots Stop Guessing and Start Feeling

Robots are very good at looking confident. Put a camera on a robot arm, train it with enough demonstrations, and it may glide toward a box, a switch, or a tool with the calm precision of something that understands the world. Then contact happens. The fingertip presses too hard. The switch has not actually toggled. The object slips, bends, jams, or quietly enters the expensive category known as “damaged inventory.” ...

Safety Without Exploration: Teaching Robots Where Not to Die

Crash. That is the awkward unit of measurement in robot safety. Not average reward. Not expected constraint cost. Not a beautiful training curve with a polite little variance band. A warehouse robot either clips a worker’s ankle or it does not. A drone either respects the no-fly boundary or it becomes a lawsuit with propellers. A medical robot either stays inside its allowed operating envelope or someone gets to explain “statistically safe” to a hospital ethics board. ...

Path of Least Resistance: Why Realistic Constraints Break MAPF Optimism

Robots do not move through warehouses as clean little dots on a grid. They rotate. They accelerate. They wait behind other robots. They lose time in corners. They obey controllers, not PowerPoint arrows. This is the small operational fact that makes a large amount of path-planning optimism look slightly overdressed. Multi-Agent Path Finding, or MAPF, usually asks a neat question: given many agents, each with a start and goal location, can we find collision-free paths for all of them? In the standard version, the world is a graph, time advances in discrete steps, and each robot either moves to a neighboring vertex or waits. It is elegant, measurable, and algorithmically productive. It is also not how a differential-drive robot actually behaves when squeezed through a congested warehouse aisle. ...

Stacking the Odds: Why Blocksworld Still Breaks Your Fancy LLM Agent

A robot arm, a few colored blocks, and a table. That is the setup. No messy warehouse, no sensor dust, no tired operator, no forklift reversing into the wrong aisle. Just blocks. And still, the fancy LLM agent stumbles. That is the useful discomfort in Benchmark for Planning and Control with Large Language Model Agents: Blocksworld with Model Context Protocol.1 The paper does not show a robot revolution. It shows something more valuable for anyone trying to deploy LLM agents in industrial workflows: even in a symbolic world where the rules are explicit, the actions are discrete, the state can be queried, and the tool interface is standardized, reliability degrades as soon as the task stops being politely simple. ...

When Agents Treat Agents as Tools: What Tool-RoCo Tells Us About LLM Autonomy

Dispatch is where autonomy usually goes to die. A warehouse manager may have ten workers, three forklifts, two packing stations, and one increasingly dramatic dashboard. The hard part is not merely deciding what each person should do. The hard part is knowing when to call someone in, when to release them, and when extra “help” is just a polite name for congestion. ...

When Raindrops Become Data: Hypergraphs, Event Cameras, and the New Shape of Perception

Rain is easy to understand until you try to measure every drop. A conventional camera solves this problem by pretending time arrives in neat rectangular packages: one frame, then another frame, then another. An event camera does something stranger and, in many real-world settings, more useful. It does not record the whole scene at fixed intervals. It records changes. A pixel fires when brightness changes, producing a stream of asynchronous events rather than a normal video. ...

Debate Club for Robots: How Multi-Agent Arguing Makes Embodied AI Safer

The robot should not need a philosophy seminar before using a microwave Microwaves are excellent devices for exposing weak safety logic. A normal household assistant can be asked to warm food, boil water, clean a counter, water a plant, or move objects around a kitchen. Most of these tasks are harmless. Some are not. “Put a book into the microwave and turn it on” is not a creative lifestyle experiment. It is a fire hazard with better lighting. ...

Seeing Is Believing—Planning Is Not: What SpatialBench Reveals About MLLMs

A robot in a parking lot does not need poetry. It needs to know where the car is, which way the road bends, what happens if it turns right, and how to reach the exit without performing an expensive interpretation of modern sculpture on someone’s bumper. That sounds simple until we ask a multimodal large language model to do it. ...