Go offline with the Player FM app!
LLM Training: Superman's Kryptonite-Proof Suit
Manage episode 364606291 series 3427795
Why isn't Superman's suit Kryptonite-proof? This question reveals how large language models are trained. We break down transformers (the T in GPT), self-attention mechanisms, and the inference process—using Superman to explain why GPT-3 can generate coherent answers to questions it's never seen before. Solo episode on LLM architecture.
To stay in touch, sign up for our newsletter at https://www.superprompt.fm
30 episodes
Manage episode 364606291 series 3427795
Why isn't Superman's suit Kryptonite-proof? This question reveals how large language models are trained. We break down transformers (the T in GPT), self-attention mechanisms, and the inference process—using Superman to explain why GPT-3 can generate coherent answers to questions it's never seen before. Solo episode on LLM architecture.
To stay in touch, sign up for our newsletter at https://www.superprompt.fm
30 episodes
All episodes
×Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.