After observing my 3-year-old, I’m convinced a good benchmark for robotic AI is simply: can a humanoid put its own clothes on?