Edit, Actually: Why Visual AI Needs Evidence, Not Eye Candy

A dashboard is rarely confusing because the pixels are ugly. More often, the problem is that the important part is small, crowded, rotated, hidden in a chart corner, split across spatial relations, or buried inside a scene that needs to be mentally transformed before the answer becomes obvious. A human analyst zooms, marks, traces, rearranges, or imagines a new angle. A multimodal model, by contrast, is often asked to stare at the original image and talk harder. ...

June 9, 2026 · 15 min · Zelina

Unchained Distortions: Why Step-by-Step Image Editing Breaks Down While Chain-of-Thought Shines

TL;DR for operators: Image-editing demos are easy. Ask a model to remove one object, recolour a jacket, or add a tasteful lamp, and most modern systems can produce something impressive enough for a product page and a LinkedIn post. Ask it to perform eight connected edits while keeping the original subject, layout, texture, lighting, and realism intact, and the polite showroom smile begins to crack. ...

April 21, 2025 · 16 min · Zelina