When Agents Start Thinking Twice: Teaching Multimodal AI to Doubt Itself
A model that fails its own eye test Mirror. That is where the problem becomes easy to see. Ask a multimodal model to generate an image of a plush lion toy in front of a mirror. The model may produce something plausible at first glance: lion, mirror, warm lighting, adorable synthetic confidence. Then ask the same model, through its understanding branch, whether the image makes physical sense. Suddenly it notices the issue: if the toy faces the camera, the mirror should mostly show its back, not another front-facing lion. ...