Go offline with the Player FM app!
The Secret Data Engineering Behind Industry AI
Manage episode 506984615 series 3441209
This episode is packed with big-picture energy talk and some seriously nerdy (but fun) data breakdowns. John Kalfayan from collide. and Chuck start with what’s really happening in oil and gas today before shifting into the challenges of putting AI to work in the field. From there, things get deep: contract dedications, what RAG actually means, how data chunking works, and the never-ending battle with duplicate info. We also weigh the costs of storage, querying, and running models, plus the tradeoffs between RAG and foundational models. If you’ve ever wondered about vector databases, data strategy, or just why we have a rant about sand, it’s all here. By the end, we hit on the human side too: education, privacy, and making sure the right people can access the right data.
Click here to watch a video of this episode.
Join the conversation shaping the future of energy.
Collide is the community where oil & gas professionals connect, share insights, and solve real-world problems together. No noise. No fluff. Just the discussions that move our industry forward.
Apply today at collide.io
Click here to view the episode transcript.
00:00 - Intro
01:51 - Oil and Gas Industry Insights
06:34 - AI Deployment Challenges
09:12 - Contract Dedications Explained
10:32 - Understanding RAG
12:52 - What is RAG in Data Management
13:43 - Data Chunking Techniques
17:17 - Cost Considerations in Data
18:03 - RAG vs Foundational Models
19:21 - Vectorized Databases Overview
23:47 - Managing Duplicate Data
26:28 - Data Strategy Considerations
28:24 - Sand Rant
31:32 - Identifying Gaps in Data
33:10 - The Cost of Storage
33:56 - Effective Data Querying
35:50 - AI Education and Awareness
37:53 - Privacy Concerns with Language Models
40:54 - Data Access and Availability
https://twitter.com/collide_io
https://www.tiktok.com/@collide.io
https://www.facebook.com/collide.io
https://www.instagram.com/collide.io
https://www.youtube.com/@collide_io
https://bsky.app/profile/digitalwildcatters.bsky.social
https://www.linkedin.com/company/collide-digital-wildcatters
218 episodes
Manage episode 506984615 series 3441209
This episode is packed with big-picture energy talk and some seriously nerdy (but fun) data breakdowns. John Kalfayan from collide. and Chuck start with what’s really happening in oil and gas today before shifting into the challenges of putting AI to work in the field. From there, things get deep: contract dedications, what RAG actually means, how data chunking works, and the never-ending battle with duplicate info. We also weigh the costs of storage, querying, and running models, plus the tradeoffs between RAG and foundational models. If you’ve ever wondered about vector databases, data strategy, or just why we have a rant about sand, it’s all here. By the end, we hit on the human side too: education, privacy, and making sure the right people can access the right data.
Click here to watch a video of this episode.
Join the conversation shaping the future of energy.
Collide is the community where oil & gas professionals connect, share insights, and solve real-world problems together. No noise. No fluff. Just the discussions that move our industry forward.
Apply today at collide.io
Click here to view the episode transcript.
00:00 - Intro
01:51 - Oil and Gas Industry Insights
06:34 - AI Deployment Challenges
09:12 - Contract Dedications Explained
10:32 - Understanding RAG
12:52 - What is RAG in Data Management
13:43 - Data Chunking Techniques
17:17 - Cost Considerations in Data
18:03 - RAG vs Foundational Models
19:21 - Vectorized Databases Overview
23:47 - Managing Duplicate Data
26:28 - Data Strategy Considerations
28:24 - Sand Rant
31:32 - Identifying Gaps in Data
33:10 - The Cost of Storage
33:56 - Effective Data Querying
35:50 - AI Education and Awareness
37:53 - Privacy Concerns with Language Models
40:54 - Data Access and Availability
https://twitter.com/collide_io
https://www.tiktok.com/@collide.io
https://www.facebook.com/collide.io
https://www.instagram.com/collide.io
https://www.youtube.com/@collide_io
https://bsky.app/profile/digitalwildcatters.bsky.social
https://www.linkedin.com/company/collide-digital-wildcatters
218 episodes
All episodes
×Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.