Trendaavat aiheet
#
Bonk Eco continues to show strength amid $USELESS rally
#
Pump.fun to raise $1B token sale, traders speculating on airdrop
#
Boop.Fun leading the way with a new launchpad on Solana.

Taelin
Kind / Bend / HVM / INets / λCalculus
yep no model in the world comes anywhere close to this
going sleep 100% certain I'm right on my judgement
as always, this will soon be common sense, but I said it first (:
see you

Taelin37 minuuttia sitten
Oh, I just noticed GPT-5's solution is identical to mine's
This is incredible
2,65K
Nah you're all wrong, GPT-5 is a leap
I'm 100% doubling down here
I didn't want to post too fast and regret it again, but it just solved a bunch of very, very hard debugging prompts that were previously unsolved (by AI), and then designed a gorgeous pixelated Gameboy game with a level of detail and quality that is clearly beyond anything else I've ever seen.
There is no way this model is bad.
I think you're all traumatized of benchmaxxers, and over-compensating against a model that is actually good. I also think you're underestimating gpt-oss's strengths (but yeah my last post was rushed)
I still don't know if it is usable for serious programming though (o3 wasn't), but it seems so? A coding model as reliable as Opus, yet smarter than o3, would completely change my workflow. Opus doesn't need thinking to be great though, so, that might weight in its favor.
For what it is worth, I only really used 3 models:
- Opus 4.1 for coding
- Gemini 2.5 very rarely for coding when Opus fails
- o3 for everything but coding
22,82K
Nah you're all wrong, GPT-5 is a leap
I'm 100% doubling down here
I didn't want to post too fast and regret it again, but it just solved a bunch of very, very hard debugging prompts that were previously unsolved (by AI), and then designed a gorgeous pixelated Gameboy game with a level of detail and quality that is clearly beyond anything else I've ever seen.
There is no way this model is bad.
I think you're all traumatized of benchmaxxers, and over-compensating against a model that is actually really good. I also think you're underestimating gpt-oss's strengths (but yeah my last post was rushed)
I still don't know if it is usable for serious programming though (4o, o3 definitely weren't), but it seems so? A coding model as reliable as Opus, yet smarter than o3, would completely change my workflow. Opus doesn't need thinking to be great though, so, that might weight in its favor.
For what it is worth, I only really used 3 models:
- Opus 4.1 for coding
- Gemini 2.5 very rarely for coding when Opus fails
- o3 for everything but coding
463
"preventing death is highly unethical"

João Pedro de Magalhães6.8. klo 06.41
"It is highly unethical to stop aging" - reviewer commenting on one of my grant applications.
The grant focused on cellular rejuvenation, no mention to curing aging, but it shows we still have a long way to go to convince even fellow scientists that curing aging is desirable.
16,06K
preventing death is highly unethical

João Pedro de Magalhães6.8. klo 06.41
"It is highly unethical to stop aging" - reviewer commenting on one of my grant applications.
The grant focused on cellular rejuvenation, no mention to curing aging, but it shows we still have a long way to go to convince even fellow scientists that curing aging is desirable.
235
So gpt-oss 120B can't produce correct german, yet nails complex Haskell bugs that even Opus failed to identify?
How does that happen?
I'm genuinely so confused by all of this

Björn Plüster6.8. klo 04.45
gpt-oss 120B is very blatantly incapable of producing linguistically correct german text. 🧵
238
My initial impression on OpenAI's OSS model is aligned with what they advertised. It does feel closer to o3 than to other open models, except it is much faster and cheaper. Some providers offer it at 3000 tokens/s, which is insane. It is definitely smarter than Kimi K2, R1 and Qwen 3. I tested all models for a bit, and got very decisive results in favor of OpenAI-OSS-120b.
Unfortunately, there is one thing these models can't do yet - my damn job. So, hope you guys have fun. I'll be back to debugging superposed λ-calculus evaluation 😭 see you
410,37K
Johtavat
Rankkaus
Suosikit
Ketjussa trendaava
Trendaa X:ssä
Viimeisimmät suosituimmat rahoitukset
Merkittävin