Best Llm Deployment Strategies Podcasts (2025)

1
If You Can’t Test It, Don’t Deploy It: The New Rule of AI Development? 22:50

Play Pause

3d ago22:50

22:50

Magdalena Picariello reframes how we think about AI, moving the conversation from algorithms and metrics to business impact and outcomes. She champions evaluation systems that don't just measure accuracy but also demonstrate real-world business value, and advocates for iterative development with continuous feedback to build optimal applications.Rea…

1
Effective Error Handling: A Uniform Strategy for Heterogeneous Distributed Systems 37:47

11d ago37:47

37:47

Jenish Shah, a back-end engineer focused on distributed systems at Netflix, provides more insights on how to handle failures in a distributed systems setup. He shares details on how he built a library that handles exceptions uniformly, regardless of the underlying communication protocol. Read a transcript of this interview: http://bit.ly/3JpmIBnSub…

1
Cloud and DevOps InfoQ Trends Report 2025 50:46

15d ago50:46

50:46

In this episode of the podcast, members of the InfoQ editorial staff and friends of InfoQ will discuss current trends in the cloud and DevOps domains as part of our annual trends report creation process. These reports provide InfoQ readers with a high-level overview of key topics to watch and also help the editorial team focus on innovative technol…

1
Mental Models in Architecture & Societal Views of Technology: A Conversation with Nimisha Asthagiri 51:51

25d ago51:51

51:51

In this podcast, Michael Stiefel spoke with Nimisha Asthagiri about the importance of system thinking, multi-agent systems, the consequences of society applying a technology into an area for which it was not designed, and whether we can ever have a healthy relationship with artificial intelligence. System thinking emphasizes the importance of menta…

1
Elena Samuylova on Large Language Model (LLM) Based Application Evaluation and LLM as a Judge 35:44

1M ago35:44

35:44

In this podcast, InfoQ spoke with Elena Samuylova from Evidently AI, on best practices in evaluating Large Language Model (LLM) based applications. She also discussed the tools for evaluating, testing and monitoring applications powered by AI technologies.Read a transcript of this interview: https://bit.ly/4mHAKvN Subscribe to the Software Architec…

1
Http 123 1:58

1M ago1:58

1:58

http 123

1
The Hidden Vulnerability of The Open Source Software Supply Chain: The Underlying Infrastructure 41:48

1M ago41:48

41:48

Software supply chain veteran Brian Fox unpacks the security implications of the new EU Cyber Resilience Act and its profound impact on open-source projects. He reveals the hidden infrastructure risks threatening open-source projects and shares insights for senior software leaders navigating this regulatory landscape.Read a transcript of this inter…

1
AI, ML, and Data Engineering InfoQ Trends Report 2025 53:02

1M ago53:02

53:02

In this episode of the podcast, members of the InfoQ editorial staff and friends of InfoQ discuss the current trends in the domain of AI, ML and Data Engineering.One of the regular features of InfoQ are the trends reports, which each focus on a different aspect of software development. These reports provide the InfoQ readers and listeners with a hi…

1
Scaling Systems, Companies, and Careers with Suhail Patel 46:27

2M ago46:27

46:27

In this episode, Suhail Patel joins Thomas Betts for a discussion about growing yourself as your company grows. When he started at Monzo, Patel was one of four engineers on the then new platform team–there are now over 100 people. The conversation covers how to thrive when the company and the systems you’re building are going through major growth.R…

1
Safely Changing Software to Avoid Incidents: A Conversation with Justin Sheehy 59:48

2M ago59:48

59:48

In this podcast, Michael Stiefel spoke with Justin Sheehy about how to safely put software into production without creating production incidents. Among the topics discussed were the futility of root cause analysis, and the importance of having a shared language for discussing incidents. This discussion included the need for software to be malleable…

1
Smithery Co-founders | Henry Mao (fmr Jenni AI) & Anirudh Kamath | Building AI-native services with MCP, developer distribution challenges & the future of agent tooling 31:56

2M ago31:56

31:56

In this episode of Request Response, I sit down with Henry and Anirudh, co-founders of Smithery, the easiest place to build, deploy, and discover MCP servers. We dive into how Henry and Anirudh went from building custom chatbots for specific use cases to creating infrastructure that makes MCP development accessible to any developer. We explore the …

1
Observability in Java with Micrometer - a Conversation with Marcin Grzejszczak 35:42

2M ago35:42

35:42

Marcin Grzejszczak, a veteran of observability spaces, discusses the current state of the space, including its evolution and the fine-grained details of how to instrument your system to capture all relevant information at every level - both inside services and between services communication.Read a transcript of this interview: http://bit.ly/4mDTkFW…

1
Why Rust Will Help You Deliver Better Low-latency Systems and Happier Developers 42:58

2M ago42:58

42:58

Andrew Lamb, a veteran of database engine development, shares his thoughts on why Rust is the right tool for developing low-latency systems, not only from the perspective of the code’s performance, but also looking at productivity and developer joy. He discusses the overall experience of adopting Rust after a decade of programming in C/C++.Read a t…

1
Is WebAssembly the Secure, Efficient Alternative Everybody was Waiting for? 24:14

3M ago24:14

24:14

Laurent Doguin and Geoffroy Couprie discuss their pioneering work with Wasm on the infrastructure side. They walk us through the benefits and challenges of building a platform over WebAssembly and why it’s the safer alternative to containers.Read a transcript of this interview: http://bit.ly/3HheBWxSubscribe to the Software Architects’ Newsletter f…

1
Continuous Deployment and Pair Programming for Lean Software Delivery Even Without Jira 54:18

3M ago54:18

54:18

Asgaut Mjølne Söderbom and Ola Hast, two developers with Sparebank1 speak about their journey towards continuous deployment and pair programming. During the conversation, they share how they use the "waste clock" to identify areas of improvement or how TDD helps them deliver high-quality code.Read a transcript of this interview: https://bit.ly/4lNv…

1
Sovereign Clouds, Hyperscalers and European Alternatives: InfoQ Dev Summit Munich 2025 Preview 41:04

3M ago41:04

41:04

In this podcast episode, speakers from the upcoming InfoQ Dev Summit Munich 2025 tackle the practical challenges facing European developers caught between regulatory pressures and technological realities. The panel discusses trade-offs between using US cloud providers versus emerging European alternatives, exploring cloud-agnostic architecture stra…

1
The Financial Architecture of Software with Ian Miell 34:40

3M ago34:40

34:40

In this episode, Thomas Betts speaks with Ian Miell about how the financial aspects of a business affect how software is designed and built. If Conway’s Law says organizational structures determine the software design, then following the money helps us understand why those organization structures exist, and ultimately whether software will be succe…

1
Scott Dietzen (CEO, Augment Code) | Context engineering & enterprise-scale AI coding 25:34

3M ago25:34

25:34

In this episode of Request Response, I sit down with Scott Dietzen, CEO of Augment Code and former CEO of Pure Storage. We dive into why context selection has become more critical than prompt engineering, and how his team solved the fundamental challenge of giving AI agents just the right amount of codebase context to be effective without being ove…

1
Microfrontends: Heuristics, Patterns and Antipatterns by Luca Mezzalira 44:57

3M ago44:57

44:57

Luca Mezzalira, a pioneer and enthusiast of microfrontends, discusses the microfrontends' evolution over the past years, underlying a set of heuristics that will allow you to gradually implement them in your product. He also touches on approaches to obtain quick feedback, both in your inner and outer development loops.Read a transcript of this inte…

1
AI-friendly API design buy vs. build, and scaling DevEx | Tom Hacohen (CEO & Founder, Svix) 30:46

4M ago30:46

30:46

In this episode of Request Response, I sat down with Tom Hacohen, founder and CEO of Swix, the Webhooks-as-a-Service platform. Tom shares the origin story of Swix—which started as a side project to escape the pain of webhook maintenance—and how it’s grown into essential infrastructure for API-first companies. We dive into why great developer experi…

1
Season 3 Finale: Top Insights, Hacks, and Lessons from the Frontlines of AI Security 24:15

4M ago24:15

24:15

Send us a text To close out Season 3, we’re revisiting the standout insights, wildest vulnerabilities, and most practical lessons shared by 20+ AI practitioners, researchers, and industry leaders shaping the future of AI security. If you're building, breaking, or defending AI/ML systems, this is your must-listen roundup. Full transcript, video, and…

1
Understanding Event-Driven Architecture in a Multicloud Environment 36:38

4M ago36:38

36:38

Teena Idnani, senior solutions architect at Microsoft, shares her experience on how and when to use event-driven architectures to improve the experience of your customers. She touches on when to use and not use this approach, as well as how to design your system, implement observability, and when to consider using more than one cloud vendor.Read a …

1
Breaking and Securing Real-World LLM Apps 53:31

4M ago53:31

53:31

Send us a text Fresh off their OWASP AppSec EU talk, Rico Komenda and Javan Rasokat join Charlie McCarthy to share real-world insights on breaking and securing LLM-integrated systems. Full transcript, video, and links to episode resources available at https://mlsecops.com/podcast/breaking-and-securing-real-world-llm-apps Ask ChatGPT Thanks for chec…

1
The Java Ecosystem Remains Ever-Green By Continuously Adapting to Developers' Needs 51:15

4M ago51:15

51:15

Kevin Dubois and Thomas Vitale, two cloud-native enthusiasts in the Java ecosystem, discuss the evolution of frameworks and tooling that has led to increased development and developer joy. They cover everything from Testcontainers to incorporating LLMs in existing applications, as well as how to ensure the code quality remains high, even with the p…

1
How Red Teamers Are Exposing Flaws in AI Pipelines 41:46

4M ago41:46

41:46

Send us a text Prolific bug bounty hunter and Offensive Security Lead at Toreon, Robbe Van Roey (PinkDraconian), joins the MLSecOps Podcast to break down how he discovered RCEs in BentoML and LangChain, the risks of unsafe model serialization, and his approach to red teaming AI systems. Full transcript, video, and links to episode resources availab…

1
Mandy Gu on Generative AI (GenAI) Implementation, User Profiles and Adoption of LLMs 23:45

4M ago23:45

23:45

In this podcast, Mandy Gu from WealthSimple discusses how to establish AI programs in organizations and implement Generative AI (GenAI) initiatives, and the relationship between user profiles and adoption of LLMs.Read a transcript of this interview: https://bit.ly/3ZJLtxaSubscribe to the Software Architects’ Newsletter for your monthly guide to the…

1
Achieving Seamless Integration Through User Co-Design 58:53

4M ago58:53

58:53

Savannah Kunovsky and Jenna Fizel, co-managing directors of IDEO’s Emerging Technology division, talk about the future of technology in general and how we can work with our users to build the most impactful product. They explore prototyping and co-design techniques, as well as how generative AI can help with rapid prototyping.Read a transcript of t…

1
Securing AI for Government: Inside the Leidos + Protect AI Partnership 34:04

4M ago34:04

34:04

Send us a text On this episode of the MLSecOps Podcast, Rob Linger, Information Advantage Practice Lead at Leidos, join hosts Jessica Souder, Director of Government and Defense at Protect AI, and Charlie McCarthy to explore what it takes to deploy secure AI/ML systems in government environments. Full transcript, video, and links to episode resource…

1
Building the Middle Tier and Doing Software Migrations: A Conversation with Rashmi Venugopal 52:46

5M ago52:46

52:46

In this podcast, Michael Stiefel spoke with Rashmi Venugopal about two topics. The first is how the middle-tier creates the application from the raw materials in the back-end, and how the front-end uses the middle-tier to present a meaningful workflow to the user. The second is how to manage the usually inevitable software migration that results fr…

Llm Deployment Strategies Podcasts

1
The InfoQ Podcast

InfoQ

1
Programmers Quickie

Software Engineering

51
Last Week in r/LocalLLaMA

Adithyan Ilangovan

1
The MLSecOps Podcast

MLSecOps.com

1
Request // Response

Sagar Batchu

1
If You Can’t Test It, Don’t Deploy It: The New Rule of AI Development? 22:50

1
Effective Error Handling: A Uniform Strategy for Heterogeneous Distributed Systems 37:47

1
Cloud and DevOps InfoQ Trends Report 2025 50:46

1
Mental Models in Architecture & Societal Views of Technology: A Conversation with Nimisha Asthagiri 51:51

1
Elena Samuylova on Large Language Model (LLM) Based Application Evaluation and LLM as a Judge 35:44

1
Http 123 1:58

1
The Hidden Vulnerability of The Open Source Software Supply Chain: The Underlying Infrastructure 41:48

1
AI, ML, and Data Engineering InfoQ Trends Report 2025 53:02

1
Scaling Systems, Companies, and Careers with Suhail Patel 46:27

1
Safely Changing Software to Avoid Incidents: A Conversation with Justin Sheehy 59:48

1
Smithery Co-founders | Henry Mao (fmr Jenni AI) & Anirudh Kamath | Building AI-native services with MCP, developer distribution challenges & the future of agent tooling 31:56

1
Observability in Java with Micrometer - a Conversation with Marcin Grzejszczak 35:42

1
Why Rust Will Help You Deliver Better Low-latency Systems and Happier Developers 42:58

1
Is WebAssembly the Secure, Efficient Alternative Everybody was Waiting for? 24:14

1
Continuous Deployment and Pair Programming for Lean Software Delivery Even Without Jira 54:18

1
Sovereign Clouds, Hyperscalers and European Alternatives: InfoQ Dev Summit Munich 2025 Preview 41:04

1
The Financial Architecture of Software with Ian Miell 34:40

1
Scott Dietzen (CEO, Augment Code) | Context engineering & enterprise-scale AI coding 25:34

1
Microfrontends: Heuristics, Patterns and Antipatterns by Luca Mezzalira 44:57

1
AI-friendly API design buy vs. build, and scaling DevEx | Tom Hacohen (CEO & Founder, Svix) 30:46

1
Season 3 Finale: Top Insights, Hacks, and Lessons from the Frontlines of AI Security 24:15

1
Understanding Event-Driven Architecture in a Multicloud Environment 36:38

1
Breaking and Securing Real-World LLM Apps 53:31

1
The Java Ecosystem Remains Ever-Green By Continuously Adapting to Developers' Needs 51:15

1
How Red Teamers Are Exposing Flaws in AI Pipelines 41:46

1
Mandy Gu on Generative AI (GenAI) Implementation, User Profiles and Adoption of LLMs 23:45

1
Achieving Seamless Integration Through User Co-Design 58:53

1
Securing AI for Government: Inside the Leidos + Protect AI Partnership 34:04

1
Building the Middle Tier and Doing Software Migrations: A Conversation with Rashmi Venugopal 52:46