Go offline with the Player FM app!
Jacob Beck and Risto Vuorio
Manage episode 357253007 series 2536330
Jacob Beck and Risto Vuorio on their recent Survey of Meta-Reinforcement Learning. Jacob and Risto are Ph.D. students at Whiteson Research Lab at University of Oxford.
Featured Reference
A Survey of Meta-Reinforcement Learning
Jacob Beck, Risto Vuorio, Evan Zheran Liu, Zheng Xiong, Luisa Zintgraf, Chelsea Finn, Shimon Whiteson
Additional References
- VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning, Luisa Zintgraf et al
- Mastering Diverse Domains through World Models (Dreamerv3), Hafner et al
- Unsupervised Meta-Learning for Reinforcement Learning (MAML), Gupta et al
- Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices (DREAM), Liu et al
- RL2: Fast Reinforcement Learning via Slow Reinforcement Learning, Duan et al
- Learning to reinforcement learn, Wang et al
66 episodes
Manage episode 357253007 series 2536330
Jacob Beck and Risto Vuorio on their recent Survey of Meta-Reinforcement Learning. Jacob and Risto are Ph.D. students at Whiteson Research Lab at University of Oxford.
Featured Reference
A Survey of Meta-Reinforcement Learning
Jacob Beck, Risto Vuorio, Evan Zheran Liu, Zheng Xiong, Luisa Zintgraf, Chelsea Finn, Shimon Whiteson
Additional References
- VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning, Luisa Zintgraf et al
- Mastering Diverse Domains through World Models (Dreamerv3), Hafner et al
- Unsupervised Meta-Learning for Reinforcement Learning (MAML), Gupta et al
- Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices (DREAM), Liu et al
- RL2: Fast Reinforcement Learning via Slow Reinforcement Learning, Duan et al
- Learning to reinforcement learn, Wang et al
66 episodes
Alle Folgen
×Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.