Cover image

Ask, Navigate, Repeat: Why Socially Aware Agents Are the Next Frontier

Directions are easy until they are not. A visitor walks into a shopping district, hears “go past the clothing store, then continue toward MATCONC,” and starts moving. A human can pause, notice the layout is ambiguous, ask another person, update the plan, and recover. A robot, on a good day, may confidently continue in the wrong direction with the serene composure of a machine that has never been embarrassed in public. ...

November 18, 2025 · 15 min · Zelina
Cover image

Talk Less, Coordinate More: MARL Meets the Real World

A warehouse robot fleet does not fail because one robot forgot how to move. It fails because three robots each saw a slightly different world, one message arrived late, another was dropped, and the coordination policy confidently optimised against yesterday’s reality. Very modern. Very autonomous. Very expensive. That is the uncomfortable premise behind Robust and Efficient Communication in Multi-Agent Reinforcement Learning, a survey of how multi-agent reinforcement learning, or MARL, behaves when the communication layer is no longer treated as magic plumbing.1 The paper is not presenting a new benchmark champion. Its value is quieter and more useful: it organises a scattered body of work around the communication failures that actually matter in deployed multi-agent systems. ...

November 17, 2025 · 15 min · Zelina
Cover image

Replan, Rethink, Repeat: Why Vision-Language Models Make Better Closed‑Loop Planners

Robots are very good at making small mistakes expensive. A misplaced cup is not just a misplaced cup. It can block the next object. A wrong order can violate a task constraint. A slightly bad coordinate can turn an elegant plan into a collision check failure. In software, you can often patch around the mistake and pretend this was always the architecture. In robotics, physics has a less forgiving product-management style. ...

November 16, 2025 · 15 min · Zelina
Cover image

Think Outside the Bounding Box: How SpatialThinker Reinforces 3D Reasoning

A warehouse robot does not need poetry. It needs to know whether the box is behind the pallet, whether the cup is closer than the plate, and whether the object it is about to grab is actually reachable rather than merely visible. Small details. Very irritating when ignored. This is where many multimodal models still become strangely philosophical. They can describe an image fluently, infer intent, and produce a confident answer. Then they miss that one object is in front of another. Apparently, “seeing” and understanding space are not the same occupation. ...

November 16, 2025 · 13 min · Zelina
Cover image

When Videos Grow Hands: How PhysWorld Teaches Robots to Stop Hallucinating Physics

Robots are not impressed by nice videos. A generated clip can show a hand placing a book into a shelf, pouring tomatoes from a pan, or sweeping scraps into a dustpan. It can look coherent enough to fool a casual viewer and perhaps even a product demo audience, which is not exactly the highest bar in technology. But a robot does not execute “looks coherent.” It executes poses, contacts, forces, trajectories, collisions, and failures. ...

November 16, 2025 · 16 min · Zelina
Cover image

Bodies Do the Thinking: Why Physical AI Changes the Intelligence Game

A robot helping a patient stand is not solving a benchmark. It is sharing weight, sensing resistance, absorbing surprise, and deciding how much force is too much. That last phrase is where most AI language starts to get suspiciously cloudy. “Deciding” sounds like a software problem. In physical systems, it is also a stiffness problem, a damping problem, an energy problem, and occasionally a liability problem wearing hospital slippers. ...

November 13, 2025 · 19 min · Zelina
Cover image

From Yarn to Code: What CrochetBench Reveals About AI’s Procedural Blind Spot

A pattern is not a caption. That sounds obvious until a multimodal model looks at a finished object, produces a confident set of instructions, and everyone in the room quietly rounds “looks plausible” up to “can build it.” This is one of the industry’s more expensive habits: mistaking descriptive competence for operational competence. The model can say what is there. Therefore, surely, it can infer how to make it. Very neat. Very wrong. ...

November 13, 2025 · 16 min · Zelina
Cover image

When Agents Think in Waves: Diffusion Models for Ad Hoc Teamwork

A warehouse robot does not fail only when it drops the box. Sometimes it fails earlier, in the quieter moment when another robot takes an unexpected route and the first robot keeps behaving as though the original choreography still exists. Nobody crashes. Nothing explodes. The system merely becomes stupid in a very expensive way. ...

November 11, 2025 · 18 min · Zelina
Cover image

The Doctor Is In: How DR. WELL Heals Multi-Agent Coordination with Symbolic Memory

Meetings are annoying for humans because they turn action into conversation. For autonomous agents, the problem is worse. A group of agents can each be individually competent and still fail collectively because one starts too early, another waits in the wrong place, and a third confidently pushes the wrong object in the wrong direction. Intelligence, as usual, does not automatically include basic scheduling manners. ...

November 7, 2025 · 14 min · Zelina
Cover image

Teaching Safety to Machines: How Inverse Constraint Learning Reimagines Control Barrier Functions

Factory robots, drones, and autonomous vehicles do not usually fail because nobody cared about safety. They fail because “safe” is annoyingly difficult to write down. An operator may know that a drone should not scrape the ground, that a warehouse robot should not cut across a human worker’s path, or that an autonomous car should not tailgate even when the road is technically clear. But turning that judgement into a formal mathematical boundary is another matter. The physical system has dynamics. The controller has limits. The dangerous state may not be a simple wall or circle. And the difference between “safe enough” and “please do not put that in production” may live in patterns of behaviour rather than in a clean rule. ...

October 31, 2025 · 16 min · Zelina