in the end, it's simply about FLOP efficiency and sample efficiency