1X says its NEO humanoid now runs a video-pretrained “world model” policy: the model generates a text-conditioned video rollout, and an inverse-dynamics model then converts the generated frames into robot actions.
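
The two-stage flow described here (generate a video rollout from a text prompt, then recover actions from consecutive frames) can be illustrated with a toy sketch. This is not 1X's implementation: `toy_world_model`, `toy_inverse_dynamics`, and `rollout_to_actions` are hypothetical names, a "frame" is just a short list of floats standing in for an image, and both models are hand-written rules rather than learned networks.

```python
from typing import Callable, List

Frame = List[float]   # stand-in for an image; here just a small state vector
Action = List[float]  # stand-in for a robot command


def toy_world_model(prompt: str, first_frame: Frame, horizon: int) -> List[Frame]:
    """Hypothetical stand-in for a text-conditioned video generator.

    Toy rule (an assumption for illustration): the prompt picks a direction,
    and every generated frame moves one unit in that direction.
    """
    step = 1.0 if "raise" in prompt else -1.0
    frames = [list(first_frame)]
    for _ in range(horizon):
        frames.append([x + step for x in frames[-1]])
    return frames


def toy_inverse_dynamics(frame: Frame, next_frame: Frame) -> Action:
    """Hypothetical stand-in for an inverse-dynamics model.

    Given two consecutive frames, return the action that explains the change.
    Here that is simply the per-element difference.
    """
    return [b - a for a, b in zip(frame, next_frame)]


def rollout_to_actions(
    frames: List[Frame],
    idm: Callable[[Frame, Frame], Action],
) -> List[Action]:
    """Apply the inverse-dynamics model to each consecutive pair of frames."""
    return [idm(a, b) for a, b in zip(frames, frames[1:])]


# Generate a 3-step rollout, then convert it into 3 actions.
frames = toy_world_model("raise the arm", [0.0, 0.0], horizon=3)
actions = rollout_to_actions(frames, toy_inverse_dynamics)
```

The point of the split is that the generator only has to predict what should happen visually, while the inverse-dynamics step is responsible for turning that prediction into commands; a rollout of N+1 frames yields N actions.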