
Content provided by collide.. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by collide. or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.

The Secret Data Engineering Behind Industry AI

45:13
 

This episode is packed with big-picture energy talk and some seriously nerdy (but fun) data breakdowns. John Kalfayan from collide. and Chuck start with what’s really happening in oil and gas today before shifting into the challenges of putting AI to work in the field. From there, things get deep: contract dedications, what RAG actually means, how data chunking works, and the never-ending battle with duplicate info. We also weigh the costs of storage, querying, and running models, plus the tradeoffs between RAG and foundational models. If you’ve ever wondered about vector databases, data strategy, or just why we have a rant about sand, it’s all here. By the end, we hit on the human side too: education, privacy, and making sure the right people can access the right data.


Join the conversation shaping the future of energy.
Collide is the community where oil & gas professionals connect, share insights, and solve real-world problems together. No noise. No fluff. Just the discussions that move our industry forward.
Apply today at collide.io


00:00 - Intro
01:51 - Oil and Gas Industry Insights
06:34 - AI Deployment Challenges
09:12 - Contract Dedications Explained
10:32 - Understanding RAG
12:52 - What is RAG in Data Management
13:43 - Data Chunking Techniques
17:17 - Cost Considerations in Data
18:03 - RAG vs Foundational Models
19:21 - Vectorized Databases Overview
23:47 - Managing Duplicate Data
26:28 - Data Strategy Considerations
28:24 - Sand Rant
31:32 - Identifying Gaps in Data
33:10 - The Cost of Storage
33:56 - Effective Data Querying
35:50 - AI Education and Awareness
37:53 - Privacy Concerns with Language Models
40:54 - Data Access and Availability

https://twitter.com/collide_io

https://www.tiktok.com/@collide.io

https://www.facebook.com/collide.io

https://www.instagram.com/collide.io

https://www.youtube.com/@collide_io

https://bsky.app/profile/digitalwildcatters.bsky.social

https://www.linkedin.com/company/collide-digital-wildcatters


218 episodes
