Why small models exist — five drivers, zero mythology
Before the first equation we build intuition: when a 70B model serves a billion requests, the deployment curve starts to hurt. Here we unpack the five drivers that pushed the frontier back toward small models.