the demand models were built for humans hitting APIs a few times a minute... max.

but millions of agents never sleep, never batch, never wait.

they run inference continuously, across every time zone, with zero tolerance for waiting in line.

AWS can't build data centers fast enough for human demand. they're not even pricing in what agents do to that curve.

"time to update the models" is an understatement.