Model Cascades

TL;DR for operators Most organisations do not have an AI capability problem. They have an AI allocation problem. They send too many routine, repetitive, low-risk tasks to large frontier models because the demo looked impressive and the invoice arrived later. The slingshot strategy is the opposite instinct: break a workflow into smaller decisions, assign the cheap and reliable parts to specialised models or rules, and escalate only the uncertain or high-value cases to stronger LLMs. The point is not to worship small models. That would be merely replacing one superstition with a smaller, cheaper superstition. The point is to allocate model capacity like an operating resource. ...

Model Cascades

Cheap Thrills, Hard Guarantees: BARGAINing with LLM Cascades

The Slingshot Strategy: Outsmarting Giants with Small AI Models