Cover image

Furniture Has a Chain of Command: Why Dense Scene AI Needs Object Roles, Not One Bigger Generator

Furniture is not democratic. In a real room, the bed, sofa, dining table, and cabinet do not play the same role as the pillow, lamp, monitor, mug, or miniature ornament. Large furniture defines the room’s usable structure. Smaller objects depend on that structure. A chair can stand around a dining table; a book sits on a shelf; a lamp belongs near a bed or desk. The room has a hierarchy before the model begins to generate anything. ...

June 12, 2026 · 16 min · Zelina
Cover image

Packing Memory, Not Problems: How Short Clips Teach AI to Think Long in Video

Memory is usually the boring part of AI demos. The model gets the spotlight. The prompt gets the applause. The generated video either looks magical or embarrassingly haunted. Somewhere underneath, quietly paying the bill, sits the memory system. It decides what the model can still remember, what it must forget, and how much GPU memory gets sacrificed to the gods of temporal coherence. ...

March 28, 2026 · 20 min · Zelina
Cover image

Seeing the Invisible: When MRI Learns to Think Like PET

Seeing the Invisible: When MRI Learns to Think Like PET MRI is easy to respect. It is detailed, familiar, non-radioactive, and available in far more clinical settings than PET. It shows the brain’s structure with admirable discipline: folds, volumes, atrophy, lesions, the anatomical furniture of disease. PET is less polite. FDG-PET asks a different question: not only what has changed in the brain’s shape, but where the brain has stopped consuming glucose normally. In Alzheimer’s disease, that functional signal matters. The cruel part is that PET is expensive, less widely available, and involves radiation exposure. Healthcare, as usual, gives clinicians the useful thing and then hides it behind cost, infrastructure, and risk. ...

March 22, 2026 · 16 min · Zelina
Cover image

Scar Tissue, Synthetic Data: Teaching AI to See the Invisible

Synthetic data has a seductive sales pitch: when real data is scarce, expensive, or ethically awkward to collect, generate more of it. Simple. Almost too simple. Which, in AI, usually means the invoice has not arrived yet. The paper behind this article, LGESynthNet: Controlled Scar Synthesis for Improved Scar Segmentation in Cardiac LGE-MRI Imaging, is interesting because it refuses that easy story.1 It does not merely ask whether a model can generate plausible cardiac MRI images. It asks a more operational question: can generated scar tissue help a downstream model detect and segment real scar tissue better? ...

March 21, 2026 · 18 min · Zelina
Cover image

Diffusion Decoding Gets a Personality: When Diversity Stops Being Accidental

Choices are cheap until they all look the same. That is the awkward little problem behind many “generate multiple answers” interfaces. A model produces five suggestions, ten drafts, or thirty candidate solutions; the UI proudly displays variety; and then a human notices that most options are the same answer wearing different shoes. Good shoes, perhaps. Still the same answer. ...

March 20, 2026 · 16 min · Zelina
Cover image

Diffusing the Periodic Table: How Hierarchy Fixes Molecular AI

A molecule can fail for a very small reason. Not a grand theoretical reason. Not because the model lacks a cinematic vision of drug discovery. Sometimes the failure is an aromatic nitrogen that should carry hydrogen but does not. Sometimes it is a formal charge that disappears because the token vocabulary decided that “nitrogen” was enough detail. Chemistry, unfortunately, does not reward this sort of minimalism. ...

February 20, 2026 · 15 min · Zelina
Cover image

From Guesswork to Generative Foresight: Why Diffusion Models May Fix Multi-Agent Blind Spots

A warehouse robot turns a corner and sees three things: a shelf edge, a moving cart, and another robot’s partial path. It does not see the blocked aisle behind the shelf. It does not see whether the cart will stop or continue. It does not see the supervisor system’s full map. Still, it must act. ...

February 18, 2026 · 15 min · Zelina
Cover image

Cosmos Policy: When Video Models Stop Watching and Start Acting

A robot in a factory does not need a beautiful video of itself almost doing the job. It needs the gripper to close at the right moment, the wrist to rotate by the right amount, and the next two seconds of motion not to turn a simple pick-and-place task into modern sculpture. This is where many foundation-model stories become less glamorous. Vision-language models can recognize the scene. Video models can imagine motion. Neither of those achievements automatically gives you a usable control policy. ...

January 23, 2026 · 16 min · Zelina
Cover image

When Diffusion Learns How to Open Drawers

A drawer is a small test of whether a generated world is lying. A rendered apartment can look plausible from the camera angle. The sofa is against a wall, the table is centered, the cabinet has a tasteful texture, and the lighting politely pretends that nothing is wrong. Then a robot tries to open a drawer and discovers that the drawer path intersects the bed. Or a chair is placed so close to a cabinet that neither object can actually be used. The scene was visually acceptable. It was operationally useless. ...

January 14, 2026 · 17 min · Zelina
Cover image

Gated, Not Gagged: Fixing Reward Hacking in Diffusion RL

A dashboard can improve while the business deteriorates. Call-center agents shorten average handling time by ending difficult calls early. A recommendation system raises clicks by promoting outrage. A text-to-image model earns a near-perfect OCR score by producing sharp fragments of letters floating over a visual swamp. The metric is rising. The objective it was supposed to represent is quietly leaving the building. ...

January 3, 2026 · 17 min · Zelina