I asked it to build a test and it built a simulated test that shows everything works... I'm done with Gemini 2.5, we need 3.