Key question: was this due to the RL + test-time compute boost (which probably can't continue), or is to due to an intensifying of the race (and so likely to keep going)?