2026 will be the year when models become leaner and faster (whilst becoming smarter). A big part of human-in-the-loop DX is how fast the model responds.