EP. 269 The Secret to Trustworthy AI: "Fuzzing" Your Models with Haize Labs' Co-founder
Manage episode 499819090 series 3641432
How do you test a GenAI application that's constantly changing? In this episode, Shane talks to Leonard Tang, co-founder of Haize Labs, about why traditional testing fails for LLMs and how to adopt a new evaluation strategy. Leonard introduces "fuzzing"—a powerful technique for discovering edge cases, improving reliability, and building AI you can actually trust. He also gives a live demo of the Haize Labs platform, so be sure to watch the video version on YouTube or Spotify to see it in action.
273 episodes