Join DiKayo Data and the brightest voices in data science for a riveting weekly discussion about the future of the data science industry, with a bent on diversity.
Tales at Scale cracks open the world of analytics projects. We’ll be diving into Apache Druid but also hearing from folks in the data ecosystem tackling everything from architecture to open source, from scaling to streaming and everything in between. Brought to you by Imply.
Join DiKayo Data for a riveting recap of Imply’s annual Druid Summit featuring Senior Product Marketing Manager at Imply, Larissa Klitzke. The summit brought together industry leaders from top tech companies to discuss the latest trends, challenges and best practices across the Druid community. Catch the latest in this episode! Want to dive deeper …

Druid Summit 2024 special! A recap with #DataFemme host Danielle DiKayo and Imply's Larissa Klitzke
39:00
Held in October 2024, Druid Summit brought Apache Druid® community contributors at companies including Netflix, Salesforce, Atlassian, Imply, Roblox and more together to discuss the latest trends, challenges, and best practices across the Druid community. The summit explored user experience design, operations and optimization techniques, and lakeho…
Starburst's Senior Developer Advocate Monica Miller champions female representation in the data science world while gearing up for Datanova, Starburst's premier data conference on October 23-24! Register for this free, virtual event here: https://bit.ly/4ewsZpq Check out Data Mishaps Night! => https://datamishapsnight.com This episode is sponsored…

APIs, Analytics, and Apache Druid: How Kong Delivers API Observability and Insights with Hiroshi Fukada
20:15
On this episode, we explore how Kong, a cloud-native API gateway, leverages Apache Druid for real-time data processing and analytics in their platform, Kong Konnect. Hiroshi Fukada, Staff Software Engineer at Kong, shares his insights on managing customer data through Kong Gateway and transitioning to Imply's managed Druid services to simplify thei…
Join DiKayo Data and Datalab Solutions' Reza du Plooy in an exploration of the consulting side of data science. Maybe you're interested in consulting yourself, learning what roles consultants play in fueling data innovation, or both! Listen in and get the latest insights from industry experts. Learn more about this episode's sponsor at datalabsolut…

Apache Druid News! Druid 30.0 is Live and Druid Summit 2024 Announced with Hugh Evans and Will Xu
37:44
On this episode, we are joined by special co-host Hugh Evans and returning guest Will Xu as we announce Druid Summit 2024 and dive into Druid 30.0's new features and enhancements. Improvements include better ingestion for Amazon Kinesis and Apache Kafka, enhanced support for Delta Lake, and advanced integrations with Google Cloud Storage and Azure …

Insights and Airwaves: How Global Delivers Ad Data Freshness with Apache Druid with Miguel Rodrigues
18:57
On this episode, we’re diving into digital ad spend and real-time data with Miguel Rodrigues, Head of Engineering at British media company Global. We’ll discuss their use of Apache Druid to enhance real-time analytics for their digital advertising platform and get the details on their transition from traditional databases to Druid, which added the …

How Finix is Leaning Into Real-Time data with Apache Druid and Imply Polaris with Ross Morrow
36:51
On this episode, we are joined by Ross Morrow, a Software Engineer at Finix, the payment processor working to create the most accessible financial services ecosystem in history. Finix’s B2B payments platform is designed for flexibility and scalability, streamlining financial transactions for businesses and delivering a truly customer-centric experi…

Securing the “Crown Jewels”: A Journey through Druid Database Security with Carrell Jackson
31:02
On this episode, we’re going all in on cybersecurity! Helping us with what critical aspects of security you need to focus on when building analytics applications is Carrell Jackson, CISO at Imply. We’ll discuss the importance of protecting sensitive data by implementing role-based access control and encryption and hear about best practices for secu…
Join DiKayo Data and a dynamic panel from ThoughtSpot as they tune in from the US and India alike to discuss the rewards and challenges of developing GenAI tools, India as a blossoming hub for AI technology and more... Check out the latest news on ThoughtSpot’s $150M investment in its India operations here! This episode is kindly sponsored by Thoug…

Inside Apache Druid 29.0: Getting up to Speed on Druid’s Performance, Ecosystem, and SQL Compliance with Sergio Ferragut
22:34
On this episode, we explore Apache Druid 29.0, focusing on three specific themes: performance, ecosystem, and SQL compliance. Discover new features such as EARLIEST / LATEST support for numerical columns, system fields ingestion, and enhanced array support like UNNEST and JSON_QUERY_ARRAY. In addition, get the full scoop on community-contributed ex…

Smart Data in the Wake of Target Discovery
43:30
Join DiKayo Data and Ontotext as we navigate through compelling use cases that showcase the tangible benefits of target discovery. Our discussion will take you on a deep dive into the symbiotic relationship between large scale biomedical knowledge graphs and LLMs, offering insight into their collaborative potential in driving medical breakthroughs.…

A Year in Review: Apache Druid's 2023 Highlights with Peter Marshall
26:29
In this special episode of Tales at Scale - this is our final episode of our first season! - Peter Marshall, Director of Developer Relations at Imply joins the show to discuss the highlights of 2023 for Apache Druid. We dive into the significant feature releases and enhancements that have transformed Druid over the past year, including the SQL stan…
Join DiKayo Data and Ontotext in a riveting panel discussion about knowledge graphs and their role in communicating the value of data. Have questions about semantic layering? Chances are they’ll be answered here! Learn more about Ontotext on their website. Ontotext Panelists: Doug Kimball, CMO Teodora Petkova, Content Writer Krasimira Bozhanova, So…

From ANSI SQL Support to Multi-topic Kafka Ingestion: What's New in Apache Druid 28 with Will Xu
25:46
On this episode, we dive into Apache Druid 28. This latest Druid release includes improved ANSI SQL and Apache Calcite support, the addition of window functions as an experimental feature, async queries and query from deep storage going GA, array enhancements, multi-topic Apache Kafka ingestion, and so much more! Will Xu, program manager at Imply r…
Join DiKayo Data and Alex Merced of Dremio on an expedition into the heart of Apache Iceberg! This episode was kindly sponsored by dremio.com Got insights to share? Apply to speak at Subsurface LIVE - the industry’s premier cloud data lakehouse conference. Click here to apply by December 22, 2023 => https://sessionize.com/subsurface-live-2024 ~~~~~…

Druid and Joins Debunked! with Sergio Ferragut and Hellmar Becker
15:41
On this episode, we debunk the myth that Druid can't do joins. Druid doesn't function as a traditional relational database because it was purpose-built for lightning-fast queries on large datasets. However, this doesn't mean Druid is entirely devoid of join capabilities – it simply approaches them differently. Our myth-busting team features returni…

Scaling with Speed: How Atlassian's Confluence Big Data Platform Team Delivers Customer-Facing Insights with Apache Druid with Gautam Jethwani and Kasirajan Selladurai Selvakumari
16:24
On this episode, we explore how Atlassian leverages Apache Druid's capabilities to handle millions of daily events and empower users with intelligent data-driven features. We’re joined by Gautam Jethwani and Kasirajan Selladurai Selvakumari from the Confluence Big Data Platform Team who will talk through how they use Druid to power intelligent feat…

Fraud Fighters: How Apache Druid and Imply help Ibotta combat fraud with faster anomaly detection with Jaylyn Stoesz
32:27
When it comes to fraud detection, initial detection is key, but so is the ability to quickly dissect and address the problem to minimize losses. This means access to real-time data is paramount. The only way to combat fraud in the digital age is to fight fire with fire…automation with automation. In this episode, we’re joined by Jaylyn Stoesz, Staf…

Safety and Trust: Two Sides of the Same Coin
45:20
Semantic layers and generative AI are all the rage and it's a lot to unpack! Join DiKayo Data and David Jayatillake of Delphi HQ in a journey to pinpoint where and how these exciting new technologies best enhance our lives. This episode is generously sponsored by Cube. Learn more about them mid episode and at cube.dev! Think you’d be a good guest o…

All things Apache Druid 27.0: From deep storage queries to new visualization with Will Xu
27:34
We’re back again with another Druid release! Here we are at Apache Druid 27.0, thanks to the dedication of the Druid Community. This release was made possible by over 350 commits & 46 contributors. Will Xu, Product Manager at Imply joins the show to discuss new features like Smart Segment Loading, a new mechanism for managing data files as the data…

Orb and Apache Druid: Building customer trust through data correctness with Kshitij Grover
27:04
Real-time data has many applications but one place where it’s extremely valuable is with usage tracking, billing, and generating reports. Ensuring the freshness and availability of this data is not only essential for financial success but also for establishing a more challenging aspect: trust. That's precisely why Orb chose Apache Druid and Imply as…

Confluent, Kafka, Druid, and Flink: The Future of Streaming Data with Kai Waehner
32:22
Apache Kafka® is a streaming platform that can handle large-scale, real-time data streams reliably. It’s used for real-time data pipelines, event sourcing, log aggregation, stream processing, and building analytics applications. Apache® Druid is a database designed to provide fast, interactive, and scalable analytics on time-series and event-based …

Driving Innovation with Open Standards: How Voltron Data is Shaping the Data Ecosystem with Apache Arrow and Ibis with Josh Patterson
34:16
Today's show is all about the world of big data and open source projects, and we've got a real gem to share with you: Voltron Data! They're on a mission to revolutionize the data analytics industry through open standards. To unleash the untapped potential in data, Voltron Data uses cutting-edge tech and provides top-notch support services, with a sp…

How Apache Druid Revolutionized Digital Turbine’s Analytics Infrastructure with Lioz Nudel and Alon Edelman
21:24
Who better to talk about the real-world usage of Apache Druid than Digital Turbine, a leading mobile growth and monetization platform? The folks at DT go way back with Druid. On this episode Lioz Nudel, Engineering Group Manager at Digital Turbine and Alon Edelman, Data Architect at Digital Turbine discuss how Druid has significantly improved their…
In this episode of #DataFemme, join DiKayo Data and Chelsea Douglas, Plotly's VP of Customer Success, as they explore the synergies between Plotly's customer success team and the beautiful visualizations Dash can create for clients. This episode of #DataFemme is sponsored by plot.ly. Like what you hear? Support the creation of future #DataFemme con…

Decoding Emotions: Leveraging ChatGPT and Apache Druid for Sentiment Analysis with Rick Jacobs
19:19
Whether you're a data engineer, data scientist, technology enthusiast, or just a person on the Internet, you’ve heard about ChatGPT. But did you know there are some great use cases for it that work with Apache Druid? Druid and ChatGPT are two cutting-edge technologies that are revolutionizing the world of real-time analytics and natural language pr…

Druid Operator: Simplifying the management of Apache Druid in Kubernetes with Adheip Singh
15:41
Deploying and configuring Apache Druid manually in a Kubernetes environment can be complex and time-consuming. But it doesn’t have to be. Enter Druid Operator, a tool specifically designed for managing Apache Druid deployments in a Kubernetes environment. Adheip Singh, founder of DataInfra and contributor to Druid Operator, walks us through the ben…

Apache Druid 26.0: Breaking Down Druid's Latest Release with Vadim Ogievetsky
30:49
Breaking news! Apache Druid 26.0 is now available! Druid 26.0 has a few key features including schema auto discovery and shuffle JOINs but that’s not all. On this episode, we’re joined by Vadim Ogievetsky, Apache Druid PMC, co-founder of Imply and one of the very first Druid users, to talk through what’s new and why it’s cool. Special thanks to the…

The Dynamic Duo: Apache Druid and Kubernetes with Yoav Nordmann
29:06
Kubernetes, an open-source container orchestration platform, has been making waves in the Apache Druid community. It makes sense - using Druid with Kubernetes can help you build a more scalable, flexible, and resilient data analytics infrastructure. Yoav Nordmann, Tech Lead and Architect at Tikal Knowledge shares why Kubernetes is so hot right now …

Documenting Apache Druid Experiments with Hellmar Becker
20:23
Working on/with Apache Druid is one thing, but talking about it is another. On today’s episode, we get tips and tricks for writing about your technical projects from Hellmar Becker, Apache Druid blogger and sales engineer at Imply. Spoiler alert: It doesn't have to be War and Peace. Learn how to get started with your own blog, the value of document…

The World of Operational Visibility with Will To
45:02
One of Apache Druid's top use cases is operational visibility, which involves monitoring, understanding, and optimizing systems in real time. If that sounds a little boring, you’re in for a treat. We talk about what you need to get started and then dive into some really interesting use cases for operational visibility across industries. Listen to l…

Speed, Scale, and Streaming: Building Analytics Applications with Darin Briskman
27:59
What is an analytics application? We state at the top of every show that they’re different from BI tools but so far, we haven’t said why. It’s time we break it all down. Darin Briskman, Director of Technology at Imply and author of the O'Reilly report Building Real Time Analytics Applications: Operational Workflows with Apache Druid, joins us to ta…

Everything You Need to Know About SQL-Based Ingestion in Apache Druid with Sergio Ferragut
19:03
If you’re a fan of SQL, this episode is for you. The addition of the multi-stage query engine in Apache Druid has enabled SQL-based ingestion. While that’s not something new to the database space, it makes Druid easier to use since more developers know SQL. Sergio Ferragut, Senior Developer Advocate at Imply walks us through using SQL to define and…

Accurate, Validated, and Real Time: Diving into Reddit’s Druid-powered Ad Platform with Lakshmi Ramasubramanian
19:49
How do ads work on the “front page of the internet?” On today’s episode, staff software engineer at Reddit Lakshmi Ramasubramanian discusses Reddit’s ad platform, including how it handles ad pacing, real-time data, and more. We’ll dive into the challenges they needed to solve and why Apache Druid was the right database for the job.…

The Tale of Two Vehicles: Apache Druid's New Shape Takes Form
17:48
Apache Druid today isn’t the Druid that you’re used to. It’s so much more. The addition of the multi-stage query engine didn’t just change the way Druid handles queries but enabled data and transformation on ingestion and inside of Druid from one table to another using SQL. This has made Druid about 40% faster. But why stop there? Get the inside sc…

Who Really Needs Real-Time Data? with Gwen Shapira
37:07
The term “real time” gets thrown around a lot, especially when it comes to data. Gwen Shapira, co-founder and CPO of Nile joins us to help define real-time data, discuss who needs it (and who probably doesn't) and how to not build yourself into a corner with your architecture. When you're able to harness the power of real-time insights, you can do …

Why Apache Druid is Not Like Other OLAP Databases with Muthu Lalapet and David Wang
55:59
On today’s episode, we’re joined by David Wang, VP of Product Marketing at Imply and Muthu Lalapet, Director of Worldwide Sales Engineering at Imply to dig into Apache Druid, a high performance, real-time analytics database. Thousands of companies are already using Druid today, from Netflix to Salesforce. But what is Apache Druid best used for? Wha…
Tired of crumpled overtime worksheets and awkward requests for time off? Join DiKayo Data and two amazing panelists from Verint as they discuss how the streamlining of data processes is improving modern companies’ workflows. This episode of #DataFemme is generously sponsored by Verint, The Customer Engagement Company.…

Apache Druid: A Database Origin Story with Eric Tschetter
36:06
On today’s episode, we’re digging into the origins of Apache Druid, a high-performance, real-time analytics database, and what makes it unique with Druid co-creator and Field CTO at Imply Eric Tschetter. By Imply Data
Tales at Scale cracks open the world of analytics projects. Hosted by Reena Leone from Imply, we’ll be diving into Apache Druid but also hearing from folks in the data ecosystem tackling everything from architecture to open source, from scaling to streaming and everything in between. By Imply Data
Join DiKayo Data and three amazing panelists from Scuba Analytics to discuss how the deconstructing of silos can inspire innovation as a team. Scuba Analytics is Hiring… A FRONT-END ENGINEER who is strong in React and technologies that underpin the Web: https://boards.greenhouse.io/scuba/jobs/5104867002 A BACK-END ENGINEER who brings considerable …
Join Daria Malone, Meg Pirrung and me on the #DataFemme platform for a riveting discussion on mental health, tech careers and the job hunt, specifically for Black and Brown Women. This episode is sponsored by The City of New Orleans. They are hiring for several data-related roles that are sure to interest a lot of you: Data Visualization Develo…
Leigha Jarett, Product Manager @ Google, appears on #DataFemme to talk about career paths, cloud migration and learning new skills both hard and soft. This episode of #DataFemme is sponsored by The Bloor Group. Are you a voice of authority in the information economy? Apply to be a guest on DM Radio with Eric Kavanagh here. Thoughts on this episode?…
Join me on an investigation down the metaphysical rabbit hole to discover how the energy behind the numbers and letters we use without a second thought could truly inform our lives. By DiKayo Data
#DataFemme Season 3 kicks off with an episode featuring Adam Wilson, CEO of Trifacta. Join us in our discussion of the sensitivity of health care data, where Trifacta falls in a data company's pipeline and so much more! Thank you to Trifacta for sponsoring this episode. Learn more about Trifacta and explore their free trial at trifacta.com. Supp…
Join Danielle Oberdier, Founder of DiKayo Data and Sepi Seifzadeh, Program Director of Open Source - Data & AI Technologies at IBM for a riveting discussion on Ethical AI, blogging about data science projects and much more. This episode was kindly sponsored by IBM. Sepi mentions a lot of valuable resources throughout this episode. Click HERE to acc…

Open Source, Open Teams and the Push for Representation
57:30
This #DataFemme episode features Open Teams' Dhavide Aruliah and Fatma Tarlaci as they work to restructure diversity, equity and inclusion efforts in the data science industry and open source community. What have we learned in the past year? This episode is generously sponsored by Open Teams. Visit openteams.com to find out more! Music: Dreams by T…
As live conferences begin to resume, I paired up with my favorite data conference host Lander Analytics to bring you this panel of unique data whizzes, all of whom have found their own niches within the industry. Panel Guests: Ijeamaka Anyene | Kaiser Division of Research Malorie Hughes | Amazon Asmae Toumi | Pursue Care This episode was sponsored b…
This week's episode of #DataFemme is a riveting discussion on the synergies between chess and data science. How does skill at one inform the other? Tableau Zen Master and chess aficionado Neil Richards is our wonderful guest this week and this episode is sponsored by my friends at Cockroach DB. Get a free t-shirt from Cockroach DB when you sign up …