From Pixels to Patterns: Teaching LLMs to Read Physics
Opening — Why this matters now Large models can write poetry, generate code, and debate philosophy. Yet show them a bouncing ball in a physics simulator and ask, “Why did that happen?”—and things get awkward. The problem is not intelligence in the abstract. It is interface. Language models operate in a world of tokens. Physics simulators operate in a world of state vectors and time steps. Somewhere between $(x_t, y_t, v_t)$ and “the ball bounced off the wall,” meaning gets lost. ...