Search a title or topic

Over 20 million podcasts, powered by 

Player FM logo

Data Lakehouse Podcasts

show episodes
 
This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.
  continue reading
 
Overruled by Data is the podcast for law firms looking to accelerate their data journey without all the pain points. Hosted by Tom Baldwin and brought to you by Entegrata, each episode shares real-world stories from law firm leaders who’ve tackled the tough stuff—getting data from all the right places, navigating the AI hype, and scaling operations in a way that doesn’t leave you with a mountain of tech debt. If you're in a leadership role at a law firm, this show offers valuable insights fr ...
  continue reading
 
Artwork
 
Hosted by Carlos L Chacon, the SQL Data Partners Podcast focuses on Microsoft data platform related topics mixed with a sprinkling of professional development. Carlos and guests discuss new and familiar features and ideas and how you might apply them in your environments. Visit our website for episode show notes at marathonus.com/podcast and leave a comment if you have a topic you think we should discuss. We'll see YOU on the SQL Trail.
  continue reading
 
Partially Redacted brings together leaders in engineering, data, AI, security, and privacy to share knowledge, best practices, and real world experiences. Each episode provides an in-depth conversation with an industry expert who dives into their background and experience. They’ll share practical advice and insights about the techniques, tools, and technologies that every company – and every technology professional – should know about. Learn from an amazing array of founders, engineers, arch ...
  continue reading
 
Artwork

1
Over The Edge

Caspian Studios

icon
Unsubscribe
icon
icon
Unsubscribe
icon
Monthly+
 
Over The Edge is a podcast about edge computing and those in the industry who are creating the future of the internet. On the show we talk to corporate leaders, open-source experts, technologists, journalists, analysts, and the community at large, to discuss technological innovations, trends, practical applications, business models, and the occasional far-flung theory. Over the Edge is brought to you by the generous sponsorship of Dell Technologies.
  continue reading
 
Welcome to Cribl: The Stream Life, a podcast for IT pros trying to take control of their observability data with a no-compromise approach. With each episode, our hosts will cover the latest insights, trends, and emerging technologies to help IT organizations achieve observability in their operations. We'll also address specific challenges we've seen with hundreds of enterprises over the last several years and sketch out the fundamental capabilities required to overcome them.
  continue reading
 
Loading …
show series
 
Data management has traditionally forced organizations into an uncomfortable choice: use data warehouses for structured analytics or data lakes for flexible, large-scale storage. For years, this meant maintaining separate systems, duplicating data, and dealing with the headaches that come with disconnected infrastructure. The data lakehouse emerged…
  continue reading
 
"A flaw of warehouses is that you need to move all your data into them so you can keep it going, and for a lot of organisations that's a big hassle,” says Will Martin, EMEA Evangelist at Dremio. “It can take a long time, it can be expensive, and you ultimately can end up ripping up processes that are there." In this episode of the Don’t Panic It’s …
  continue reading
 
While the role of a chief data officers (CDOs) was traditionally focused on regulatory compliance, it has now expanded to empowering the consistent and effective use of data across organizations to improve business outcomes. One of the most effective ways for CDOs to demonstrate their value is by developing a data strategy that is closely aligned w…
  continue reading
 
Summary In this episode Preeti Somal, EVP of Engineering at Temporal, talks about the durable execution model and how it reshapes the way teams build reliable, stateful systems for data and AI. She explores Temporal’s code‑first programming model—workflows, activities, task queues, and replay—and how it eliminates hand‑rolled retry, checkpoint, and…
  continue reading
 
Summary In this episode of the Data Engineering Podcast Ariel Pohoryles, head of product marketing for Boomi's data management offerings, talks about a recent survey of 300 data leaders on how organizations are investing in data to scale AI. He shares a paradox uncovered in the research: while 77% of leaders trust the data feeding their AI systems,…
  continue reading
 
The challenge all organisations, big and small, face is answering and implementing solutions to solve this key question: How can finance and accounting teams work faster, smarter and more accurately? In the recent episode of the Don’t Panic It’s Just Data podcast, host Scott Taylor, The Data Whisperer and Principal Consultant at MetaMeta Consulting…
  continue reading
 
Summary In this episode of the Data Engineering Podcast Omri Lifshitz (CTO) and Ido Bronstein (CEO) of Upriver talk about the growing gap between AI's demand for high-quality data and organizations' current data practices. They discuss why AI accelerates both the supply and demand sides of data, highlighting that the bottleneck lies in the "middle …
  continue reading
 
AI observability provides deep visibility into AI systems beyond traditional monitoring, tracking data quality, model performance, infrastructure, and ethical compliance. It's essential for detecting silent model degradation, managing complexity at scale, and building trust in AI systems. Organizations implementing AI observability gain faster trou…
  continue reading
 
How do you turn legal data into trusted insight across one of the world’s largest law firms? In this episode, Jennifer Mapp, Senior Director of Data Management and Analytics at Morgan, Lewis & Bockius LLP, shares how her team built a firmwide data foundation that empowers attorneys and business leaders to make faster, smarter decisions. From launch…
  continue reading
 
At Big Data LDN (BDL) 2025, Keboola CEO Pavel Dolezal presented a new data agent designed for all business users, not just engineers. With a mission to make AI, automation, and data easy to access, relevant, and useful across the organisation, Dolezal revealed that the data agent has been embedded with contextual intelligence and generative AI. “Wh…
  continue reading
 
Summary In this episode of the Data Engineering Podcast Matt Topper, president of UberEther, talks about the complex challenge of identity, credentials, and access control in modern data platforms. With the shift to composable ecosystems, integration burdens have exploded, fracturing governance and auditability across warehouses, lakes, files, vect…
  continue reading
 
With an ever-changing business climate, companies have begun to shift their focus to unstructured data. In the past, unstructured data was challenging to deal with, considering the volume, governance and compliance, so organisations mainly focused on structured datasets. However, with the rise of generative AI and large language models (LLMs), Reec…
  continue reading
 
Summary In this episode Kate Shaw, Senior Product Manager for Data and SLIM at SnapLogic, talks about the hidden and compounding costs of maintaining legacy systems—and practical strategies for modernization. She unpacks how “legacy” is less about age and more about when a system becomes a risk: blocking innovation, consuming excess IT time, and cr…
  continue reading
 
Organizations are increasingly exploring new technologies to improve their operations, but adoption comes with real challenges. In the latest episode of Don’t Panic, It’s Just Data, host Trisha Pillay speaks with Kevin Petrie, VP of Research at BARC, about the practical realities of integrating these emerging technologies into business operations, …
  continue reading
 
"Most companies still juggle with multiple different platforms; the communication between these tools and these platforms is happening in spreadsheets, and that is tedious, it's error-prone,” states Søren Lundtoft, Sr. Director of Product Management at Stibo Systems. On the Don't Panic, It's Just Data podcast, host Doug Laney and Søren Lundtoft div…
  continue reading
 
Summary In this episode of the Data Engineering Podcast, host Tobias Macey welcomes back Nick Schrock, CTO and founder of Dagster Labs, to discuss Compass - a Slack-native, agentic analytics system designed to keep data teams connected with business stakeholders. Nick shares his journey from initial skepticism to embracing agentic AI as model and a…
  continue reading
 
With an erratic and fast business environment, finance teams are facing high pressure to process reports. The main challenge lies in how mid-market firms achieve digital transformation, not by abandoning familiar tools but by making them more effective. In this episode of the Don't Panic It's Just Data podcast, host Kevin Petrie, Vice President, Re…
  continue reading
 
Summary In this episode of the Data Engineering Podcast Vijay Subramanian, founder and CEO of Trace, talks about metric trees - a new approach to data modeling that directly captures a company's business model. Vijay shares insights from his decade-long experience building data practices at Rent the Runway and explains how the modern data stack has…
  continue reading
 
What makes a modern data platform actually work inside a law firm? In this episode, Mark Thorogood, Director of Enterprise Data Operations & Software Engineering at Perkins Coie LLP, breaks down how his team moved beyond on-prem constraints to a DIY lakehouse and logical data fabric that unify insights across the firm. Drawing on lessons from the a…
  continue reading
 
Summary In this crossover episode of the AI Engineering Podcast, host Tobias Macey interviews Brijesh Tripathi, CEO of Flex AI, about revolutionizing AI engineering by removing DevOps burdens through "workload as a service". Brijesh shares his expertise from leading AI/HPC architecture at Intel and deploying supercomputers like Aurora, highlighting…
  continue reading
 
What good is a dashboard if no one’s using it? In this episode, Jeannine Zito, senior manager of global knowledge solutions at Cleary Gottlieb, talks about how her background in banking shaped the way she thinks about data, strategy, and transformation inside major law firms. She gets into what happens when firms wait too long for the “perfect” dat…
  continue reading
 
Summary In this episode of the AI Engineering Podcast Mark Brooker, VP and Distinguished Engineer at AWS, talks about how agentic workflows are transforming database usage and infrastructure design. He discusses the evolving role of data in AI systems, from traditional models to more modern approaches like vectors, RAG, and relational databases. Ma…
  continue reading
 
Welcome back to Meeting of the Minds, a special podcast episode series by EM360Tech, where we talk about the future of tech. In this Big Data special episode of the Meeting of the Minds, our expert panel – Ravit Jain, Podcast host, Christina Stathopoulos of Dare to Data and a data and AI evangelist, Wayne Eckerson, data strategy consultant and pres…
  continue reading
 
Summary In this episode of the Data Engineering Podcast Hannes Mühleisen and Mark Raasveldt, the creators of DuckDB, share their work on Duck Lake, a new entrant in the open lakehouse ecosystem. They discuss how Duck Lake, is focused on simplicity, flexibility, and offers a unified catalog and table format compared to other lakehouse formats like I…
  continue reading
 
Generative AI has captured global attention, powering everything from chatbots to intelligent assistants. Yet in the enterprise, its promise often hits a dead end. According to Gartner, 80 per cent of enterprise data remains unused or “dark,” because conventional AI struggles to interpret complex, domain-specific information. In this episode of the…
  continue reading
 
What makes a data strategy actually work inside a law firm? In this episode, Carrie Remhof, Director of Firm Intelligence at Troutman Pepper Locke LLP, shares what she’s learned from building, implementing, and reimagining legal data systems across 30+ firms. She talks about the real reasons legal tech projects succeed or stall, why data perfection…
  continue reading
 
Inclusivity and accessibility remain some of the biggest challenges for data events. True inclusion is not the result of a single initiative, but continuous effort, honest reflection, and a willingness to listen to the community. At Big Data LDN, these values are rooted in the event’s mission. By highlighting both established leaders and emerging v…
  continue reading
 
Summary In this episode of the Data Engineering Podcast Serge Gershkovich, head of product at SQL DBM, talks about the socio-technical aspects of data modeling. Serge shares his background in data modeling and highlights its importance as a collaborative process between business stakeholders and data teams. He debunks common misconceptions that dat…
  continue reading
 
Big Data LDN (BDL), the ultimate data event of the year, celebrates its 10th anniversary this year. This year’s event is scheduled to take place on September 24 and 25, with a brand-new deep-dive conference held on September 23. In this episode of the Don't Panic, It's Just Data podcast, host Shubhangi Dua, Podcast Producer and B2B Tech Journalist …
  continue reading
 
Summary In this episode of the Data Engineering Podcast Professor Paul Groth, from the University of Amsterdam, talks about his research on knowledge graphs and data engineering. Paul shares his background in AI and data management, discussing the evolution of data provenance and lineage, as well as the challenges of data integration. He explores t…
  continue reading
 
What happens when data, clients, and people finally connect? In this episode, Rachel Shields Williams, Director of Client Intelligence at Sidley Austin LLP and incoming president of the Legal Marketing Association, shares how a lifelong instinct to “just help people” evolved into one of the most forward-thinking data roles in big law. Rachel explai…
  continue reading
 
Summary In this episode of the Data Engineering Podcast Prashanth Rao, an AI engineer at KuzuDB, talks about their embeddable graph database. Prashanth explains how KuzuDB addresses performance shortcomings in existing solutions through columnar storage and novel join algorithms. He discusses the usability and scalability of KuzuDB, emphasizing its…
  continue reading
 
In this episode of the Don't Panic, It's Just Data podcast, Kevin Petrie, VP of Research at BARC and the podcast host, is joined by Dainius Jocas, Search Engineer at Vinted, and Radu Gheorghe, Software Engineer at Vespa.ai. They discuss how Vinted, an online marketplace for secondhand products, modernised its data architecture to address new AI sea…
  continue reading
 
Summary In this episode of the Data Engineering Podcast Lucas Thelosen and Drew Gilson from Gravity talk about their development of Orion, an autonomous data analyst that bridges the gap between data availability and business decision-making. Lucas and Drew share their backgrounds in data analytics and how their experiences have shaped their approa…
  continue reading
 
Loading …
Copyright 2025 | Privacy Policy | Terms of Service | | Copyright
Listen to this show while you explore
Play