Stop Wasting Tokens: ESTAR and the Economics of Early Reasoning Exit
Tokens are tiny invoices. One reasoning model writes a long chain of thought, checks itself, circles back, restates the same conclusion in a slightly more spiritual tone, and finally prints an answer. Another model reaches the same answer halfway through but keeps talking because nobody told it that the meter is still running. This is not philosophy. This is unit economics with better typography. ...