Scientific Reasoning Under the Microscope: How PRiSM Stress-Tests the New Generation of Multimodal Models
Opening — Why this matters now The AI industry is in its “just add reasoning” era—a phase where every model release promises deeper thought, richer chains, and more reliable problem‑solving. Yet nowhere do these promises collapse faster than in scientific reasoning. Physics and mathematics demand rigor: dimensional consistency, symbolic logic, multi‑step derivations, and the ability to distrust misleading visuals. These domains are the natural predators of hand‑wavy reasoning. ...