From Yarn to Code: What CrochetBench Reveals About AI’s Procedural Blind Spot
A pattern is not a caption. That sounds obvious until a multimodal model looks at a finished object, produces a confident set of instructions, and everyone in the room quietly rounds “looks plausible” up to “can build it.” This is one of the industry’s more expensive habits: mistaking descriptive competence for operational competence. The model can say what is there. Therefore, surely, it can infer how to make it. Very neat. Very wrong. ...