Physical Reasoning

Rewarding Bad Physics Habits: What VLMs Learn When You Pay Them to Reason

A factory camera sees a pressure gauge. The AI reads the image, explains the mechanism, applies the formula, and recommends an action. Everyone in the meeting relaxes, because the model has produced a neat chain of reasoning. That is usually the moment to become nervous. The dangerous part is not that a vision-language model can be wrong. We know that. The more interesting problem is that a model can become wrong in a very specific way because we trained it to chase the wrong reward. Pay it for clean formatting, and it learns to look organized. Pay it for final answers, and it may sacrifice the reasoning path. Pay it to stare at the image, and it may do better on spatial problems while forgetting that physics also contains formulas. Apparently, “look harder” is not a complete theory of mechanics. ...

When Motion Lies: Why Video LLMs Keep Misreading Physics

A car approaches a crosswalk. The frames look simple: car, road, direction, movement. A human can still ask the useful question: is the car speeding up, slowing down, or merely moving at a steady pace? A video language model may answer with the confidence of a dashboard camera that has read too many captions and learned too little physics. It sees a car getting closer. It infers “accelerating.” The problem is not that the model missed the car. The problem is that it saw a visual pattern common to all three cases and failed to model the hidden change in motion. ...

Game of Cones: How Physics Codes Could Fix Agent Reasoning

Controls are where agent intelligence goes to embarrass itself. Give a vision-language model a game frame, a goal, and a list of legal buttons. It may describe the scene beautifully. It may explain that the projectile is approaching, the platform is unstable, and the shiny object is probably a reward. Then it presses the wrong key, late, for the wrong duration, and walks heroically into danger. Excellent commentary. Poor organism. ...