Karpathy says we need multi-agent systems to ‘self-play’ i.e compete and learn from each other goat and the backrooms early not wrong