Cover image

Reasoning on a Sliding Scale: Why One Size Doesn't Fit All in CoT

TL;DR for operators Ada-R1 is useful because it attacks the expensive part of reasoning models from the right angle: not “make every answer shorter,” but “decide which problems deserve long reasoning in the first place.”1 The paper’s key evidence is uncomfortable for anyone buying premium reasoning capacity by default. Long Chain-of-Thought helps on harder mathematical problems, but nearly half of the analysed samples show no improvement from Long-CoT, and some perform worse. In other words, paying for the model to brood majestically over simple work is not intelligence. It is ceremony with a token meter attached. ...

May 1, 2025 · 16 min · Zelina