Search a title or topic

Over 20 million podcasts, powered by 

Player FM logo
Artwork

Content provided by Google DeepMind and Hannah Fry. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Google DeepMind and Hannah Fry or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Player FM - Podcast App
Go offline with the Player FM app!

Is Human Data Enough? With David Silver

49:38
 
Share
 

Manage episode 476267983 series 2532352
Content provided by Google DeepMind and Hannah Fry. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Google DeepMind and Hannah Fry or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

In this episode of Google DeepMind: The Podcast, VP of Reinforcement Learning, David Silver, describes his vision for the future of AI, exploring the concept of the "era of experience" versus the current "era of human data". Using AlphaGo and AlphaZero as examples, he highlights how these systems surpassed human capabilities by engaging in reinforcement learning without prior human knowledge. This approach contrasts with large language models, which depend on human data and feedback. Silver emphasizes the need to explore this path to drive AI progress and achieve artificial superintelligence.

Timestamps

  • 00:00 Introduction
  • 01:50 Era of experience
  • 03:45 AlphaZero
  • 10:19 Move 37
  • 15:20 Reinforcement learning and human feedback
  • 24:30 AlphaProof
  • 29:50 Math Olympiads
  • 35:00 Experience based methods
  • 42:56 Hannah's reflections
  • 44:00 Fan Hui joins

___

Thanks to everyone who made this possible, including but not limited to:

  • Presenter: Professor Hannah Fry
  • Series Producer: Dan Hardoon
  • Series Editor: Rami Tzabar
  • Commissioner & Producer: Emma Yousif
  • Music Composition: Eleni Shaw
  • Audio Engineer: Richard Courtice
  • Production Manager: Dan Lazard
  • Video Director and Editor: Bernardo Resende
  • Video Studio Production: Nicholas Duke
  • Video Editor: Bilal Merhi
  • Audio Engineer: Perry Rogantin
  • Camera and Lighting Operator: Robert Messere
  • Production Coordination: Zoey Roberts, Sarah Ellen Morton
  • Visual Identity and Design: Rob Ashley
  • Commissioned by Google DeepMind

Please leave us a review on Spotify or Apple Podcasts if you enjoyed this episode. We always want to hear from our audience whether that's in the form of feedback, new idea or a guest recommendation!

  continue reading

34 episodes

Artwork
iconShare
 
Manage episode 476267983 series 2532352
Content provided by Google DeepMind and Hannah Fry. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Google DeepMind and Hannah Fry or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

In this episode of Google DeepMind: The Podcast, VP of Reinforcement Learning, David Silver, describes his vision for the future of AI, exploring the concept of the "era of experience" versus the current "era of human data". Using AlphaGo and AlphaZero as examples, he highlights how these systems surpassed human capabilities by engaging in reinforcement learning without prior human knowledge. This approach contrasts with large language models, which depend on human data and feedback. Silver emphasizes the need to explore this path to drive AI progress and achieve artificial superintelligence.

Timestamps

  • 00:00 Introduction
  • 01:50 Era of experience
  • 03:45 AlphaZero
  • 10:19 Move 37
  • 15:20 Reinforcement learning and human feedback
  • 24:30 AlphaProof
  • 29:50 Math Olympiads
  • 35:00 Experience based methods
  • 42:56 Hannah's reflections
  • 44:00 Fan Hui joins

___

Thanks to everyone who made this possible, including but not limited to:

  • Presenter: Professor Hannah Fry
  • Series Producer: Dan Hardoon
  • Series Editor: Rami Tzabar
  • Commissioner & Producer: Emma Yousif
  • Music Composition: Eleni Shaw
  • Audio Engineer: Richard Courtice
  • Production Manager: Dan Lazard
  • Video Director and Editor: Bernardo Resende
  • Video Studio Production: Nicholas Duke
  • Video Editor: Bilal Merhi
  • Audio Engineer: Perry Rogantin
  • Camera and Lighting Operator: Robert Messere
  • Production Coordination: Zoey Roberts, Sarah Ellen Morton
  • Visual Identity and Design: Rob Ashley
  • Commissioned by Google DeepMind

Please leave us a review on Spotify or Apple Podcasts if you enjoyed this episode. We always want to hear from our audience whether that's in the form of feedback, new idea or a guest recommendation!

  continue reading

34 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Listen to this show while you explore
Play