pressure testing chatgpt 5.4 output against claude opus 4.6