Examples of o3 lying through its teeth in a really concerning way. It’s constantly making absolutely outrageous claims: how much faster the code is after its revisions (it doesn’t even know if the code will *run*), that it’s functioning properly, that it tested things, etc.