Search a title or topic

Over 20 million podcasts, powered by 

Player FM logo
Artwork

Content provided by Tony Wan. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Tony Wan or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.
Player FM - Podcast App
Go offline with the Player FM app!

AI Safety: Constitutional AI vs Human Feedback

16:38
 
Share
 

Manage episode 424053414 series 3427795
Content provided by Tony Wan. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Tony Wan or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.

With great power comes great responsibility. How do leading AI companies implement safety and ethics as language models scale? OpenAI uses Model Spec combined with RLHF (Reinforcement Learning from Human Feedback). Anthropic uses Constitutional AI. The technical approaches to maximizing usefulness while minimizing harm. Solo episode on AI alignment.
REFERENCE

OpenAI Model Spec

https://cdn.openai.com/spec/model-spec-2024-05-08.html#overview

Anthropic Constitutional AI

https://www.anthropic.com/news/claudes-constitution

To stay in touch, sign up for our newsletter at https://www.superprompt.fm

  continue reading

30 episodes

Artwork
iconShare
 
Manage episode 424053414 series 3427795
Content provided by Tony Wan. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Tony Wan or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.

With great power comes great responsibility. How do leading AI companies implement safety and ethics as language models scale? OpenAI uses Model Spec combined with RLHF (Reinforcement Learning from Human Feedback). Anthropic uses Constitutional AI. The technical approaches to maximizing usefulness while minimizing harm. Solo episode on AI alignment.
REFERENCE

OpenAI Model Spec

https://cdn.openai.com/spec/model-spec-2024-05-08.html#overview

Anthropic Constitutional AI

https://www.anthropic.com/news/claudes-constitution

To stay in touch, sign up for our newsletter at https://www.superprompt.fm

  continue reading

30 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Copyright 2025 | Privacy Policy | Terms of Service | | Copyright
Listen to this show while you explore
Play