When Images Learn to Think in Code: The Rise of Code-as-CoT for Structured Generation
Poster. That is where the problem becomes embarrassingly visible. Ask an image model to make “a beautiful poster for a finance seminar,” and it may produce something visually polished enough to survive a casual scroll. Ask it to place five labeled cards, keep the headline readable, align the icons, preserve the chart, and spell the sponsor name correctly, and the glamour fades. The model may understand the request. It may even describe the right plan. Then it still puts the label where no label should live, mangles the typography, and invents a layout that looks as if the design brief had been translated through fog. ...