Cover image

When Plans Break: Relaxing Petri Nets for Smarter Sequential Planning

Plans fail in painfully ordinary ways. A warehouse robot cannot both reserve the last pallet slot and keep the aisle clear. A field-service schedule cannot satisfy every customer window after one technician calls in sick. A compliance workflow cannot approve a transaction before the missing document exists, no matter how passionately the dashboard insists on “urgent priority.” ...

February 26, 2026 · 18 min · Zelina
Cover image

Reasoning Is Optional. Optimization Is Not: Rethinking VLA Training with NORD

Driving teams do not pay for reasoning tokens because they enjoy watching a model narrate its inner life. They pay for them because, at least in current VLA training culture, reasoning traces are treated as a bridge between perception and action. The bridge is expensive. A typical reasoning-heavy Vision-Language-Action pipeline for autonomous driving collects large driving datasets, generates dense chain-of-thought-style annotations, supervised-fine-tunes the model, and then applies reinforcement learning to improve driving metrics. It is a respectable pipeline. It is also the kind of pipeline that quietly converts every research win into an invoice. ...

February 25, 2026 · 14 min · Zelina
Cover image

Diffusing to Coordinate: When Multi-Agent RL Learns to Breathe

Robots are easy to imagine as individuals. A quadruped walks. A drone flies. A warehouse arm picks. The business slide is usually kind enough to show one machine, one task, one satisfying arrow from input to output. Reality is less polite. A quadruped is not one decision-maker. It is a committee of limbs negotiating with gravity. A multi-drone system is not one policy with four propellers. It is a moving argument about timing, local perception, shared goals, and what not to crash into. A factory cell with multiple robotic agents is even worse: every local action changes the environment other agents are trying to understand. ...

February 23, 2026 · 17 min · Zelina
Cover image

When Robots Disagree: Taming Gradient Conflicts in Cross-Embodiment Offline RL

A robot fleet looks efficient on a spreadsheet. One warehouse robot logs a few million movements. Another quadruped logs a few million more. A bipedal platform contributes its own dataset. The obvious managerial instinct is to pour everything into one large training pool and let scale do its polite little miracle. This is where robots become less cooperative than cloud software. ...

February 23, 2026 · 16 min · Zelina
Cover image

From Guesswork to Generative Foresight: Why Diffusion Models May Fix Multi-Agent Blind Spots

A warehouse robot turns a corner and sees three things: a shelf edge, a moving cart, and another robot’s partial path. It does not see the blocked aisle behind the shelf. It does not see whether the cart will stop or continue. It does not see the supervisor system’s full map. Still, it must act. ...

February 18, 2026 · 15 min · Zelina
Cover image

Stable World Models, Unstable Benchmarks: Why Infrastructure Is the Real Bottleneck

A robot does not fail politely. It does not say, “I was trained on a slightly different shade of blue.” It just misses the object, pushes the wrong way, or confidently follows a plan that only works in the tidy little universe where the benchmark was born. That is the uncomfortable lesson behind stable-worldmodel-v1, a paper that is less about inventing a new world model and more about asking whether world-model research has been measuring the right thing in the first place.1 ...

February 10, 2026 · 14 min · Zelina
Cover image

Benchmarks Lie, Rooms Don’t: Why Embodied AI Fails the Moment It Enters Your House

The room is not impressed by your leaderboard A robot that performs well on a public benchmark has not necessarily learned how to operate in your house. It may recognize a chair in a dataset. It may answer a visual question about a tidy image. It may even produce a confident paragraph explaining where the coffee mug should be. Then it enters a real room — with mirrors, partial views, cluttered corners, awkward sightlines, and objects that are not positioned for benchmark convenience — and suddenly the “general intelligence” starts behaving like a tourist holding the map upside down. ...

February 7, 2026 · 17 min · Zelina
Cover image

When VR Shooters Meet Discrete Events: Training Security Policies Without Endless Human Trials

Training a security policy sounds simple until the training data involves people role-playing traumatic emergencies inside a virtual school. That is the uncomfortable starting point of this paper. Virtual reality can help researchers study rare and dangerous events under controlled conditions, but it does not solve the scaling problem. Every new intervention, policy variation, or robot behavior still needs another human-subject experiment. That is slow, expensive, ethically constrained, and not exactly a cheerful afternoon in the lab. ...

February 6, 2026 · 17 min · Zelina
Cover image

Cosmos Policy: When Video Models Stop Watching and Start Acting

A robot in a factory does not need a beautiful video of itself almost doing the job. It needs the gripper to close at the right moment, the wrist to rotate by the right amount, and the next two seconds of motion not to turn a simple pick-and-place task into modern sculpture. This is where many foundation-model stories become less glamorous. Vision-language models can recognize the scene. Video models can imagine motion. Neither of those achievements automatically gives you a usable control policy. ...

January 23, 2026 · 16 min · Zelina
Cover image

Your Agent Remembers—But Can It Forget?

Memory is usually sold as a virtue. An AI agent with memory sounds safer, smarter, more personal, more autonomous. A warehouse robot remembers where boxes were placed. A navigation agent remembers which corridor led to the exit. A workflow agent remembers what the user asked yesterday and uses that context tomorrow. This is the comforting version of memory: the past as an asset. ...

January 22, 2026 · 16 min · Zelina