现在的评估真是火热
Lenny Rachitsky
Lenny Rachitsky2025年9月26日
So many people talking about AI evals. @HamelHusain and @sh_reya show me how to actually build one. Live. Hamel and Shreya teach the world’s most popular course on evals, and have long been at the forefront of this emerging and important new skill for AI product builders. I'm so excited to finally have them on the podcast. Learn: 🔸 WTF are evals 🔸 A step-by-step live walkthrough of how to create an eval 🔸 Why evals have become so important 🔸 The debate between “vibes” and systematic evals 🔸 Code-based evals vs. LLM-as-judge 🔸 Why you only need 4-7 evals (not hundreds) 🔸 Building an LLM judge in Google Sheets 🔸 Why you always want to start with error analysis 🔸 Much more Listen now 👇 • YouTube: • Spotify: • Apple: Thank you to our wonderful sponsors for supporting the podcast: 🏆 @Fin_ai — The #1 AI agent for customer service: 🏆 @dscout — The UX platform to capture insights at every stage: from ideation to production: 🏆 @mercury — The art of simplified finances:
干得不错 @HamelHusain @sh_reya
16.82K