Search a title or topic

Over 20 million podcasts, powered by 

Player FM logo
Artwork

Content provided by Ben Lorica. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Ben Lorica or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Player FM - Podcast App
Go offline with the Player FM app!

Beyond Guardrails: Defending LLMs Against Sophisticated Attacks

44:31
 
Share
 

Manage episode 484162025 series 2570898
Content provided by Ben Lorica. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Ben Lorica or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.

Jason Martin is an AI Security Researcher at HiddenLayer. This episode explores “policy puppetry,” a universal attack technique bypassing safety features in all major language models using structured formats like XML or JSON.

Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/

Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.
Detailed show notes - with links to many references - can be found on The Data Exchange web site.

  continue reading

285 episodes

Artwork
iconShare
 
Manage episode 484162025 series 2570898
Content provided by Ben Lorica. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Ben Lorica or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.

Jason Martin is an AI Security Researcher at HiddenLayer. This episode explores “policy puppetry,” a universal attack technique bypassing safety features in all major language models using structured formats like XML or JSON.

Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/

Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.
Detailed show notes - with links to many references - can be found on The Data Exchange web site.

  continue reading

285 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Listen to this show while you explore
Play