Mind the Units: Why LLMs Still Can't Count (And How CONE Fixes It)
Opening — Why this matters now Large language models can write essays, generate code, and even explain quantum physics. Yet ask them a deceptively simple question involving numbers—which value is larger, 9000 or 12000?—and things occasionally fall apart. The problem is structural. Most language models treat numbers as if they were ordinary words. The token “42” is just another symbol, not something that carries magnitude, units, or measurement semantics. ...