every language model that i have used has gotten worse, except for coding models (i just don't use the non coding ones)