Cohort-based courses
Guided programs to get real results.
AI Evals For Engineers & PMs
4.7
·4 weeks·Sep 7 – Oct 2
Hamel Husain ML Engineer with 20 years of experience
Shreya Shankar ML Systems & Applied AI Evals Researcher
AI Evals and Analytics Playbook
5.0
·3 weeks·May 11 – Jun 1

Stella Liu Head of AI Applied Science
Amy Chen Cofounder, AI Evals & Analytics
Beyond Evals: Designing Improvement Flywheels for AI Products
NEW·3 weeks·Jun 6 – Jun 27
.png&w=256&q=75)
Aishwarya Naresh RegantiAI Founder & Advisor to F500s | Ex-AWS
1-day workshops
Short, focused sessions to build specific skills.
Free Lightning Lessons
Interactive sessions to explore new topics.
How to Setup Evals For Agents
·30 minutes1,676 StudentsWatch
Harrison Chase, Hamel Husain, andModern Information Retrieval Evaluation In The RAG Era
·45 minutes5,308 StudentsWatch
Nandan Thakur, Hamel Husain, and Shreya ShankarRaise Your Technical Bar as an AI-Native PM
·30 minutes15,835 StudentsWatch
Jason P. Yoong and Gayathri Keerthana (GK)AI Evals for Product Managers
·60 minutes2,013 StudentsWatch
Anshumani RuddraDebug the weird stuff your AI does (in less than 1 hour)
·45 minutes5,167 StudentsWatch.webp&w=1536&q=75)
Marily Nika and Hamel HusainFrom Automation to Multi-Agent Architectures
·3 lessons1,352 StudentsWatch
Hamza FarooqLearn Agentic AI: Setting agents metrics and evaluations
·45 minutes858 StudentsWatch
Mahesh YadavEvaluation Driven Development for Agentic AI Systems
·45 minutes587 StudentsWatch
Aurimas GriciūnasBuild Your AI Evals & Analytics Playbook
·30 minutes511 StudentsWatch
Stella Liu and Amy ChenProduction Grade AI Evals by Braintrust.dev
·30 minutes492 StudentsWatch
Mengying LiPractical Evaluation Strategies for AI Agents
·45 minutes473 StudentsWatch
Hamza Farooq and Gabriela de QueirozHow to Drive AI Evals Adoption
·30 minutes327 StudentsWatch
Dr Sebastian FoxRun Eval Loops and Guardrails for Cursor Agents
·30 minutes83 StudentsWatch
Carmelo IariaDesign Evals Users Will Trust
·45 minutes771 StudentsWatch
Aishwarya Naresh RegantiPart 3: Building Robust Evaluations for AI Agents
·60 minutes142 StudentsWatch
Hamza Farooq and Gabriela de QueirozEvals for Everyone
·3 lessons2,196 StudentsWatch
Aishwarya & KiritiEvaluating Agentic AI Applications Beyond Vibe Checks
·45 minutes1,251 StudentsWatch
Aishwarya Naresh Reganti, Kiriti Badam, and Claire LongoSetting Eval for AI Agents & Scaling with Auto-Evaluation
·30 minutes862 StudentsWatch
Mahesh YadavVibe Code Annotation UIs for AI Analytics Evals
·Jun 24·60 minutes634 StudentsLive
Shane ButlerEvals for Voice AI: Learnings from Google Evals Team
·30 minutes242 StudentsWatch
Ravin KumarEvals in Action With Arize
·45 minutes202 StudentsWatch
Laurie VossShip a Production Cursor Agent System in 30 Minutes
·Jun 24·30 minutes95 StudentsLive
Carmelo IariaAutomating Evals With Claude Code + Phoenix
·60 minutes2,355 StudentsWatch
Mikyo King and Hamel HusainEvaluating AI Agents
·45 minutes1,429 StudentsWatch
Amir Feizpour and Samuel Dion-GirardeauHow OpenAI Customers Use Evals To Build Better AI Products
·30 minutes1,081 StudentsWatch
Jim Blomo and Hamel HusainHow Evals Made GitHub Copilot Happen
·30 minutes892 StudentsWatch
John Berryman, Shawn Simister, and Hamel HusainOptimize Your Dev Setup For Evals w/ Cursor Rules & MCP
·30 minutes687 StudentsWatch
Isaac Flath, Hamel Husain, and Shreya ShankarBuild Your Own Eval Tools With Notebooks!
·45 minutes612 StudentsWatch
Vincent D. Warmerdam, Hamel Husain, and Shreya ShankarStrategies for building self-improving document processing
·60 minutes429 StudentsWatch
Jason Liu and Eli BadgioMaster Evaluation Techniques for LLM Apps
·30 minutes413 StudentsWatch
Haroon ChouderyReliable RAG Agents: Intent-Driven Failure Detection
·60 minutes298 StudentsWatch
Jason Liu and Ben HylakMastering LLM Application Testing
·30 minutes240 StudentsWatch
Hugo Bowne-Anderson and Stefan KrawczykThe Hidden Signal in Production AI Logs
·60 minutes172 StudentsWatch
Jason Liu and Scott ClarkEvaluating AI Agents before Users Break Them
·60 minutes88 StudentsWatch
Aki Wijesundara, PhD, Marc Klingen, and Lotte VerheydenSetting up your first AI eval with a LLM-as-judge
·45 minutes62 StudentsWatch
Madalina Turlea and Catalina TurleaGo Beyond AI Evals: Diagnose and Decide
·45 minutes53 StudentsWatch
Rajiv ShahDebug Cursor Agent Failures Before Production
·Jun 10·30 minutes39 StudentsLive
Carmelo IariaError Analysis: The AI Engineer’s Best ROI
·60 minutes1,514 StudentsWatch
Hamel Husain and Shreya ShankarUnderstanding Embedding Performance through Generative Evals
·60 minutes1,181 StudentsWatch
Jason Liu and Kelly HongOptimize Structured Data Retrieval With Evals
·45 minutes843 StudentsWatch
Daniel Svonava and Hamel HusainOnline Evals and Production Monitoring
·60 minutes831 StudentsWatch
Jason Liu, Ben Hylak, and Sidhant BendreAI Systems Under Pressure: Red-Team Before You Ship
·60 minutes802 StudentsWatch
Krystal JacksonEvaluate AI agents with Confidence
·45 minutes800 StudentsWatch
Mahesh YadavImprove reliability of your AI applications
·30 minutes747 StudentsWatch
Shreya RajpalHow You Catch Production Hallucinations in Real Time
·60 minutes504 StudentsWatch
Jason Liu and Julia NeaguScaling Judge-Time Compute for Robust Auto LLM Evaluation
·60 minutes489 StudentsWatch
Jason Liu and Leonard TangUnderstand SHAP (SHapley Additive exPlanations)
·30 minutes310 StudentsWatch
Patrick HallCreate MCP Tool Evals Before You Ship
·45 minutes282 StudentsWatch
Emmanuel ParaskakisDon't Tweak Prompts. Engineer Agents.
·30 minutes274 StudentsWatch
Hugo Bowne-Anderson and Skylar PayneScale Evals Without the Chaos
·45 minutes248 StudentsWatch
Aishwarya Naresh RegantiSynthetic RAG evaluation
·60 minutes210 StudentsWatch
Alexey Grigorev and Doug TurnbullCalibrate LLM-as-a-judge for Real-world Impact
·45 minutes205 StudentsWatch
Eddie Landesberg🛠 Synthetic Data Flywheels: Build Reliable LLM Apps Faster
·30 minutes187 StudentsWatch
Hugo Bowne-Anderson and Stefan KrawczykHow to test and improve your AI agents
·45 minutes167 StudentsWatch
Jacob BankDe-Risking LLM Model Switches w Evals & Prompt Optimization
·45 minutes145 StudentsWatch
Amir Feizpour and Hugo MailhotCollaborative AI Evals with Human Feedback
·30 minutes113 StudentsWatch
Rogério ChavesStay Ahead in AI: Evaluate Any New LLM in 15 Minutes
·30 minutes93 StudentsWatch
Sherveen MashayekhiHow to test AI when you don't have any data yet
·45 minutes23 StudentsWatch
Madalina Turlea and Catalina Turlea
