Search a title or topic

Over 20 million podcasts, powered by 

Player FM logo
Artwork

Content provided by Adventures in DevOps, Will Button, and Warren Parad. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Adventures in DevOps, Will Button, and Warren Parad or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.
Player FM - Podcast App
Go offline with the Player FM app!

Incident Response Essentials: From Postmortems to Communication Strategies - DevOps 212

1:10:23
 
Share
 

Manage episode 435569291 series 2529949
Content provided by Adventures in DevOps, Will Button, and Warren Parad. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Adventures in DevOps, Will Button, and Warren Parad or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.
In today's episode, Warren, Will, and special guest Falit Jain dive deep into the intricate world of incident management and response, drawing from rich experiences at tech giants like Amazon and Disney. They explore real-life scenarios, including Amazon's complex debugging challenges with over 150 engineers maintaining their detail page, and the high stakes of live streaming events at Disney.\
Join them as they discuss the crucial aspects of effective incident response, from the importance of familiarity with systems and the role of on-call processes to the value of communication and meticulous postmortems. They also deep-dive into cultural influences from leadership, the balance between new feature launches and system stability, and the significance of metrics like mean time to resolution and error budgets.
Socials
  • LinkedIn: Falit Jain

Picks
  continue reading

284 episodes

Artwork
iconShare
 
Manage episode 435569291 series 2529949
Content provided by Adventures in DevOps, Will Button, and Warren Parad. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Adventures in DevOps, Will Button, and Warren Parad or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.
In today's episode, Warren, Will, and special guest Falit Jain dive deep into the intricate world of incident management and response, drawing from rich experiences at tech giants like Amazon and Disney. They explore real-life scenarios, including Amazon's complex debugging challenges with over 150 engineers maintaining their detail page, and the high stakes of live streaming events at Disney.\
Join them as they discuss the crucial aspects of effective incident response, from the importance of familiarity with systems and the role of on-call processes to the value of communication and meticulous postmortems. They also deep-dive into cultural influences from leadership, the balance between new feature launches and system stability, and the significance of metrics like mean time to resolution and error budgets.
Socials
  • LinkedIn: Falit Jain

Picks
  continue reading

284 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Listen to this show while you explore
Play