The O'Reilly Data Show Podcast explores the opportunities and techniques driving big data, data science, and AI.
OReilly Data Podcasts
Insight, analysis, and research about emerging technologies from O'Reilly Media.
Security insight and analysis.
O'Reilly Radar tracks the technologies and people that will shape our world in the years to come. Each episode of O'Reilly Radar features an interview with an industry thought leader, with topics touching on everything from programming to data to experience design. We also take a step back from the breathless pace of the latest tech news to examine why new developments are important and what they might mean down the road.
O'Reilly Media spreads the knowledge of innovators. At O’Reilly, a big part of our business is paying attention to what’s new and interesting in the world of technology. We have a pretty good record at having anticipated some of the big technology developments in recent history. For instance, we launched the first commercial Web site, GNN, in 1993; we organized the meeting at which the term “open source” was first adopted; we were early investors in Blogger, which helped launch the blogging ...
Machine Learning for Operational Analytics and Business Intelligence
51:41
In this episode of the Data Show, I speak with Peter Bailis, founder and CEO of Sisu, a startup that is using machine learning to improve operational analytics. Bailis is also an assistant professor of computer science at Stanford University, where he conducts research into data-intensive systems and where he is co-founder of the DAWN Lab.…
Machine learning and analytics for time series data
40:31
In this episode of the Data Show, I speak with Arun Kejariwal of Facebook and Ira Cohen of Anodot (full disclosure: I’m an advisor to Anodot). This conversation stemmed from a recent online panel discussion we did, where we discussed time series data, and, specifically, anomaly detection and forecasting. Both Kejariwal (at Machine Zone, Twitter, an…
In this episode of the Data Show, I speak with Michael Mahoney, a member of RISELab, the International Computer Science Institute, and the Department of Statistics at UC Berkeley. A physicist by training, Mahoney has been at the forefront of many important problems in large-scale data analysis. On the theoretical side, his work spans algorithmic a…
In this episode of the Data Show, I speak with Kesha Williams, technical instructor at A Cloud Guru, a training company focused on cloud computing. As a full stack web developer, Williams became intrigued by machine learning and started teaching herself the ML tools on Amazon Web Services. Fast forward to today, Williams has built some well-regarde…
Labeling, transforming, and structuring training data sets for machine learning
40:51
In this episode of the Data Show, I speak with Alex Ratner, project lead for Stanford’s Snorkel open source project; Ratner also recently garnered a faculty position at the University of Washington and is currently working on a company supporting and extending the Snorkel project. Snorkel is a framework for building and managing training data. Base…
In this episode of the Data Show, I speak with Cassie Kozyrkov, technical director and chief decision scientist at Google Cloud. She describes “decision intelligence” as an interdisciplinary field concerned with all aspects of decision-making, and which combines data science with the behavioral sciences. Most recently she has been focused on develo…
Taming Chaos: Preparing for Your Next Incident
29:45
In this interview, Tim Craig and fellow Googler Gustavo Franco, a site reliability engineer (SRE), discuss the wide range of events that qualify as “incidents;” the need for a conscious, robust, and well-defined process for understanding them; the role of training; and how to get buy-in from management so you can spread incident response training t…
In this episode of the Data Show, I spoke with Roger Chen, co-founder and CEO of Computable Labs, a startup focused on building tools for the creation of data networks and data exchanges. Chen has also served as co-chair of O’Reilly’s Artificial Intelligence Conference since its inception in 2016. This conversation took place the day after Chen and…
In this week’s episode of the Data Show, we’re featuring an interview Data Show host Ben Lorica participated in for the Software Engineering Daily Podcast, where he was interviewed by Jeff Meyerson. Their conversation mainly centered around data engineering, data architecture and infrastructure, and machine learning (ML). Here are a few highlights:…
Enabling end-to-end machine learning pipelines in real-world applications
42:53
In this episode of the Data Show, I spoke with Nick Pentreath, principal engineer at IBM. Pentreath was an early and avid user of Apache Spark, and he subsequently became a Spark committer and PMC member. Most recently his focus has been on machine learning, particularly deep learning, and he is part of a group within IBM focused on building open s…
How to Get Started With Site Reliability Engineering (SRE)
38:32
At Google’s 2019 Cloud Next conference, I sat down with Stephen Thorne, site reliability engineer on Google’s customer reliability engineering team and co-author of "The Site Reliability Workbook," to talk about how organizations, both large and small, can use SRE to reduce operational costs, improve reliability, and create productive cross-functio…
Bringing scalable real-time analytics to the enterprise
37:12
In this episode of the Data Show, I spoke with Dhruba Borthakur (co-founder and CTO) and Shruti Bhat (SVP of Product) of Rockset, a startup focused on building solutions for interactive data science and live applications. Borthakur was the founding engineer of HDFS and creator of RocksDB, while Bhat is an experienced product and marketing executive…
Bringing Scalable Real-time Analytics to the Enterprise
37:14
In this episode of the Data Show, I spoke with Dhruba Borthakur (co-founder and CTO) and Shruti Bhat (SVP of Marketing) of Rockset, a startup focused on building solutions for interactive data science and live applications. Borthakur was the founding engineer of HDFS and creator of RocksDB, while Bhat is an experienced product and marketing executi…
Applications of data science and machine learning in financial services
42:32
In this episode of the Data Show, I spoke with Jike Chong, chief data scientist at Acorns, a startup focused on building tools for micro-investing. Chong has extensive experience using analytics and machine learning in financial services, and he has experience building data science teams in the U.S. and in China. We had a great conversation spannin…
Real-time entity resolution made accessible
27:09
In this episode of the Data Show, I spoke with Jeff Jonas, CEO, founder and chief scientist of Senzing, a startup focused on making real-time entity resolution technologies broadly accessible. He was previously a fellow and chief scientist of context computing at IBM. Entity resolution (ER) refers to techniques and tools for identifying and linking…