Go offline with the Player FM app!
Cybernetics, Feedback, and Reinventionism in CS
Manage episode 391078246 series 3541632
In this episode, Tom gives us a lesson on all things feedback, mostly where our scientific framings of it came from.
Together, we link this to RLHF, our previous work in RL, and how we were thinking about agentic ML systems before it was cool.
Join us, on another great blast from the past on The Retort!
We also have brought you video this week!
This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.retortai.com
38 episodes
Manage episode 391078246 series 3541632
In this episode, Tom gives us a lesson on all things feedback, mostly where our scientific framings of it came from.
Together, we link this to RLHF, our previous work in RL, and how we were thinking about agentic ML systems before it was cool.
Join us, on another great blast from the past on The Retort!
We also have brought you video this week!
This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.retortai.com
38 episodes
All episodes
×Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.