Diffusion-Models

The Missing Present Is a Distribution: DUPO for Delayed Control

TL;DR for operators A control system must act now even when its latest sensor reading describes an earlier moment. The usual response is to predict the missing present and let the policy act on that reconstruction. In a stochastic system, however, the same delayed message can correspond to several plausible current states—not one hidden answer waiting to be recovered. ...

State of the Art, Not State of Everything: Why Better AI Remembers Less

TL;DR for operators Production AI does not become reliable by remembering everything. It becomes reliable by preserving the information that defines the current state, explicitly representing what is allowed to change, and discarding history that would contaminate the next decision. Two papers arrive at this conclusion from remarkably different directions. One generates future amyloid-PET scans by anchoring the generative process to a patient’s baseline scan. The other builds persistent enterprise agents by retaining specifications, schemas, tools, and output contracts while deleting prior reasoning traces. ...

Safe on Paper, Lost in the Prompt

TL;DR for operators A safety-aligned image model can keep its FID and CLIPScore nearly unchanged while becoming materially worse at following ordinary instructions. It may still generate a plausible bird, vase, or product scene, but quietly miss the requested color, quantity, relationship, or attribute. The paper identifies a mechanism behind this failure. When safety tuning modifies the text encoder, benign prompt embeddings can become compressed and their semantic neighborhoods can be rearranged. Distinctions that the original model represented clearly begin to blur. The authors call this semantic collapse.1 ...

The Path of Least Assurance: Why AI Reliability Lives Between the Steps

TL;DR for operators AI reliability is increasingly a process problem, not an answer-checking problem. Three recent arXiv papers make that point from very different angles. MoCo-EA shows that adversarial examples are not merely isolated malicious pixels lurking in the shrubbery; they can lie along continuous, optimisable paths.1 ConceptAgent shows that erasing a concept from a diffusion model may disrupt the early text-to-image link while leaving later trajectory dynamics available for concept re-entry.2 BlueFin shows that LLM agents doing finance spreadsheet work fail in ways that only appear when you inspect formulas, recalculation behaviour, workbook mutations, tool choices, and whether the output helps a human analyst do useful work.3 ...

Furniture Has a Chain of Command: Why Dense Scene AI Needs Object Roles, Not One Bigger Generator

Furniture is not democratic. In a real room, the bed, sofa, dining table, and cabinet do not play the same role as the pillow, lamp, monitor, mug, or miniature ornament. Large furniture defines the room’s usable structure. Smaller objects depend on that structure. A chair can stand around a dining table; a book sits on a shelf; a lamp belongs near a bed or desk. The room has a hierarchy before the model begins to generate anything. ...

Packing Memory, Not Problems: How Short Clips Teach AI to Think Long in Video

Memory is usually the boring part of AI demos. The model gets the spotlight. The prompt gets the applause. The generated video either looks magical or embarrassingly haunted. Somewhere underneath, quietly paying the bill, sits the memory system. It decides what the model can still remember, what it must forget, and how much GPU memory gets sacrificed to the gods of temporal coherence. ...

Seeing the Invisible: When MRI Learns to Think Like PET

Seeing the Invisible: When MRI Learns to Think Like PET MRI is easy to respect. It is detailed, familiar, non-radioactive, and available in far more clinical settings than PET. It shows the brain’s structure with admirable discipline: folds, volumes, atrophy, lesions, the anatomical furniture of disease. PET is less polite. FDG-PET asks a different question: not only what has changed in the brain’s shape, but where the brain has stopped consuming glucose normally. In Alzheimer’s disease, that functional signal matters. The cruel part is that PET is expensive, less widely available, and involves radiation exposure. Healthcare, as usual, gives clinicians the useful thing and then hides it behind cost, infrastructure, and risk. ...

Scar Tissue, Synthetic Data: Teaching AI to See the Invisible

Synthetic data has a seductive sales pitch: when real data is scarce, expensive, or ethically awkward to collect, generate more of it. Simple. Almost too simple. Which, in AI, usually means the invoice has not arrived yet. The paper behind this article, LGESynthNet: Controlled Scar Synthesis for Improved Scar Segmentation in Cardiac LGE-MRI Imaging, is interesting because it refuses that easy story.1 It does not merely ask whether a model can generate plausible cardiac MRI images. It asks a more operational question: can generated scar tissue help a downstream model detect and segment real scar tissue better? ...

Diffusion Decoding Gets a Personality: When Diversity Stops Being Accidental

Choices are cheap until they all look the same. That is the awkward little problem behind many “generate multiple answers” interfaces. A model produces five suggestions, ten drafts, or thirty candidate solutions; the UI proudly displays variety; and then a human notices that most options are the same answer wearing different shoes. Good shoes, perhaps. Still the same answer. ...

Diffusing the Periodic Table: How Hierarchy Fixes Molecular AI

A molecule can fail for a very small reason. Not a grand theoretical reason. Not because the model lacks a cinematic vision of drug discovery. Sometimes the failure is an aromatic nitrogen that should carry hydrogen but does not. Sometimes it is a formal charge that disappears because the token vocabulary decided that “nitrogen” was enough detail. Chemistry, unfortunately, does not reward this sort of minimalism. ...