When Coders Prove Theorems: Agents, Lean, and the Quiet Death of the Specialist Prover
Opening — Why this matters now Formal mathematics has quietly become one of the most revealing stress tests for modern AI. Not because theorems are commercially lucrative, but because they are unforgiving. Proof assistants do not care about vibes, rhetorical fluency, or confident hallucinations. Either the proof compiles, or it does not. Until recently, success in this domain required highly specialized models, intricate pipelines, and months of reinforcement learning. Numina-Lean-Agent proposes something more unsettling: maybe all of that specialization was unnecessary. ...