Long Thoughts, Short Bills: Distilling Mathematical Reasoning at Scale
Opening: Why this matters now

Large language models can solve math problems. The more interesting question in 2025 is whether they can learn how to reason, at scale, across contexts that are long, messy, and computationally expensive. Most math datasets answer the first question. Nemotron-Math answers the second, and does so with a surprisingly pragmatic eye on cost. ...