Only 2% slower inference for a massive 7.5-point GPQA gain. Absolutely worth it.