Kubernetes Bytes is a podcast bringing you the latest from the world of cloud native data management. Hosts Ryan Wallner and Bhavin Shah come to you from Boston, Massachusetts with experienced backgrounds in cloud-native tech. They'll be sharing their thoughts on recent cloud native news and talking to industry experts about their experiences and challenges managing the wealth of data in today's cloud-native ecosystem.
…
continue reading
Kubernetes Bytes Podcasts
Welcome to Energy Bytes - your essential guide to the intersection of data and energy. Hosted by industry veterans John Kalfayan and Bobby Neelon, this podcast dives deep into the world of energy, shedding light on how data, AI, and technology are revolutionizing this sector. Each episode equips listeners with insights into the most efficient tools and resources, paving the way for a data-driven future in energy. From technical nuances to broader industry trends, Energy Bytes offers an unpar ...
…
continue reading
Green IO with Gaël Duez explores how to reduce the environmental impact of our digital world. Twice a month, on a Tuesdays guests from across the globe share insights, tools, and alternative approaches, enabling all responsible technologists, within the Tech sector and beyond, to build a greener digital world, one byte at a time.
…
continue reading
Podcast about the web industry, tools and techniques upcoming and in use today hosted by Adam Listek. Support this podcast: https://podcasters.spotify.com/pod/show/bit-v-byte/support
…
continue reading
Binary Breakdown is your go-to podcast for exploring the latest in computer science research and technology. Each episode dives into groundbreaking papers, emerging technologies, and the ideas shaping our digital world. Whether you're a tech enthusiast, a computer science student, or a seasoned professional, Binary Breakdown decodes complex topics into insightful discussions, connecting the dots between theory and real-world application. Join us as we break down binary, byte by byte, to unco ...
…
continue reading

1
#61 Scaling Green Software with Anita Schüttler
49:16
49:16
Play later
Play later
Lists
Like
Liked
49:16What does it take to go beyond raising awareness in green software? To avoid checking just boxes? What is required to scale green software practices in a company? To discuss these issues, Gaël Duez welcomes Anita Schüttler on this episode “from the trenches”. Anita is a seasoned software engineer and expert on digital sustainability. She works as H…
…
continue reading

1
EP 69: Bella Kelada-Khalil from Midnight Marketing
1:21:32
1:21:32
Play later
Play later
Lists
Like
Liked
1:21:32AI isn’t just showing up in oil and gas, it’s kicking the door down. We hung out with Bella Kelada-Khalil from Midnight Marketing, who’s diving headfirst into petroleum engineering and AI after starting in sales and events. She’s asking the smart, unfiltered questions you wish you could, and we’re here for it. We swap stories on how AI is automatin…
…
continue reading

1
#60 Why Tech companies should not deprioritize future readiness with Rainer Karcher
49:37
49:37
Play later
Play later
Lists
Like
Liked
49:37“Climate activist in a suit”. This is how Rainer Karcher describes himself. It is an endless debate between people advocating for the system to change from the outside and those willing to change it from the inside. In this episode Gaël Duez welcomes a strong advocate of moving the corporate world into the right direction from within? Having spent …
…
continue reading
Tired of clunky geoscience software that feels like it was built in the '90s? Same. That’s why we brought on Cameron Snow from Danomics to talk about how his team is shaking things up. He’s built a platform that actually makes life easier for geologists and engineers, no more wrestling with legacy tools just to run a few thousand wells. We got into…
…
continue reading

1
EP 67: Jeff Krimmel from Krimmel Strategy Group
1:13:01
1:13:01
Play later
Play later
Lists
Like
Liked
1:13:01If you've ever wrestled with messy datasets, legacy code, or trying to get executives to actually *use* data, you’ll get a kick out of this one. We sat down with Jeff Krimmel from Krimmel Strategy Group to talk shop about how he’s helping energy companies turn their mountain of data into real-world strategy. We hit everything from making peace with…
…
continue reading

1
#59 Debriefing Qcon Sustainability track with Erica Pisani
39:32
39:32
Play later
Play later
Lists
Like
Liked
39:32How is sustainability covered in main tech conferences? Sure cybersecurity, DevOps, or anything related to SRE, is covered at length. Not to mention AI… But what room is left for the environmental impact of our job ? And what are the main trends which are filtered out from specialized conferences in Green IT such as Green IO, GreenTech Forum or eco…
…
continue reading

1
EP 66: Cam Sinclair from SmartChain
1:02:49
1:02:49
Play later
Play later
Lists
Like
Liked
1:02:49SmartChain is on a mission to kill off the chaos of clunky contracts in oil and gas—and they might actually be pulling it off. We caught up with their co-founder Cameron Sinclair, who’s turning heads with a platform that automates the mess out of invoicing, closes those dreaded “leakage” gaps, and makes cash flow way less of a guessing game. From a…
…
continue reading
This research paper introduces Anna, a key-value store (KVS) designed for scalable performance across diverse computing environments, from single multi-core machines to globally distributed cloud deployments. Anna achieves high performance and adaptability through a partitioned, multi-master architecture utilizing wait-free execution and coordinati…
…
continue reading

1
#58b Avoided emissions thanks to Tech: the Vinted use case with Laetitia Bornes - Part 2
39:15
39:15
Play later
Play later
Lists
Like
Liked
39:15Why is the model of a Nobel prize winner not necessarily good science? What is “good” modelling? Is access to information enough to change a system behavior? This episode is the second part of a long interview with Laetitia Bornes, a Doctor in Human-Computer Interaction, Systems Engineering and Systemic Design who is one of the co-authors of a rese…
…
continue reading

1
EP 65: Brandon Davis from AFE Leaks
1:04:23
1:04:23
Play later
Play later
Lists
Like
Liked
1:04:23We're lifting the curtain on the mysterious world of energy data transparency with the guy who's shaking things up at AFE Leaks. He's on a mission to break open the hidden details around oil and gas costs and operations, think: cracking open dusty old filings and turning them into something everyone can actually use. He chats about the wild ride of…
…
continue reading
This academic paper introduces Conflict-free Replicated Data Types (CRDTs), which are abstract data types designed for distributed systems where data is replicated across multiple locations. CRDTs allow any replica to be modified without needing immediate coordination with other replicas, ensuring high availability and low latency. The core concept…
…
continue reading

1
#58a Avoided emissions thanks to Tech: the Vinted use case with Laetitia Bornes - Part 1
48:03
48:03
Play later
Play later
Lists
Like
Liked
48:03Can a digital company be “carbon negative”? What should we think of these claims of “tons of carbon avoided” coming from 2nd hand platforms such as Vinted or Back Market? Dr Laetita Bornes conducted research on Vinted claims, investigating its data sources and the methodology used with her colleague David Ekchazer. Their findings were surprising, e…
…
continue reading

1
CAP Twelve Years Later: How the "Rules" Have Changed
29:47
29:47
Play later
Play later
Lists
Like
Liked
29:47This content from InfoQ provides insights for software architects and developers through various formats like newsletters, articles, and conference information. It highlights topics in architecture, AI, data engineering, culture, methods, and DevOps. Featured pieces discuss Slack's cellular architecture, data stream processing patterns, cultivating…
…
continue reading

1
Raft versus Paxos: An Understandable Consensus Algorithm
33:14
33:14
Play later
Play later
Lists
Like
Liked
33:14Raft, a consensus algorithm designed for managing a replicated log in distributed systems. It aims to be more understandable than Paxos, a widely used but complex alternative, while achieving equivalent efficiency and safety. Raft separates key consensus elements like leader election, log replication, and safety, using techniques such as problem de…
…
continue reading

1
Neo4j Architecture: Graph Database Internals, Performance, and Optimization
17:42
17:42
Play later
Play later
Lists
Like
Liked
17:42This compilation of resources offers a comprehensive examination of Neo4j's graph database architecture. It explains how Neo4j differs from relational and document-oriented databases through its native graph storage. The materials describe how nodes, relationships, and properties are stored and indexed for efficient traversal and query processing. …
…
continue reading

1
#57 Greening Intelligence: Bridging Infrastructure and Governance for a Sustainable AI Future with Pr. PS Lee and Pr. Heng Wang
24:46
24:46
Play later
Play later
Lists
Like
Liked
24:46Description “It's always a case of fit for purpose, or what we call a proper engineering.” Some down-to-earth facts and analysis were coined by Pr PS Lee, one of the world's top experts in liquid cooling - and Pr. Heng Wang - a renowned expert in digital governance - while cross-analysing Singapore’s main challenges from an infrastructure and gover…
…
continue reading

1
EP 64: Joshua Johnston from Capsher Technology
57:39
57:39
Play later
Play later
Lists
Like
Liked
57:39AI isn't just a buzzword in oil and gas, it’s changing the game, and we’re breaking it down with someone who’s been in the trenches making it happen. From smarter fracking and streamlined drilling to navigating the chaos of legacy systems, this convo peels back the curtain on what it *really* takes to build tech that actually works in the field. We…
…
continue reading

1
Sentry: Error Monitoring at Scale - Design Principles Analysis
15:48
15:48
Play later
Play later
Lists
Like
Liked
15:48Sentry is a large-scale, open-source error monitoring platform designed for modern distributed systems. It prioritizes actionable insights by focusing on exceptions and crashes, enriching errors with contextual data, and using features such as breadcrumbs and error grouping. Sentry's architecture employs modular and decoupled components like Relay …
…
continue reading

1
Istio Service Mesh: Architecture, Security, and Traffic Management
33:58
33:58
Play later
Play later
Lists
Like
Liked
33:58These excerpts offer a detailed look at Istio's service mesh architecture, a critical component for managing microservices in cloud-native environments. The architecture is divided into a control plane and data plane, emphasizing security through automated mTLS and traffic management with advanced load balancing techniques. Observability is achieve…
…
continue reading

1
CockroachDB: SQL for Global Scale Design Principles
14:33
14:33
Play later
Play later
Lists
Like
Liked
14:33CockroachDB is a distributed SQL database designed for global scalability and resilience. The database achieves this through a unique architecture built on a monolithic key-value store, Raft-based replication, and hybrid logical clocks. Transaction management is optimized for global workloads using a non-blocking commit protocol and multi-region ca…
…
continue reading

1
#56 Building Green Software, the one year anniversary with Sarah Hsu
49:10
49:10
Play later
Play later
Lists
Like
Liked
49:10A year ago, Building Green Software was released by O’Reilly. Since Tim Frick’s book “Designing for sustainability” (8 years ago!), O’Reilly didn’t publish anything fully focusing on sustainability. So, it’s a fair statement that this book was long awaited. But a year is an eternity in IT. This is why Sarah Hsu, one of its 3 co-authors as well as t…
…
continue reading
What happens when a control room engineer gets fed up with clunky old systems and decides to rebuild them from the ground up? You get CruxOCM, a company shaking up pipeline automation with tools like PipeBot and GatherBot that sound like sci-fi but are very real (and very efficient). Vicki Knott shares how she went from pulp and paper mills to lead…
…
continue reading

1
Snowflake: Revolutionizing Cloud Data Warehousing and Analytics
17:21
17:21
Play later
Play later
Lists
Like
Liked
17:21Snowflake, a cloud-native data warehouse, revolutionizes modern analytics through its unique architecture and capabilities. The platform separates compute and storage layers, enabling independent scaling and optimized performance. Its three-layer design encompasses cloud services, a compute layer using virtual warehouses, and a storage layer levera…
…
continue reading

1
EP 62: Nick Fornicola from Beusa Energy
1:13:45
1:13:45
Play later
Play later
Lists
Like
Liked
1:13:45AI in oil and gas isn't just a buzzword, it's actually solving real problems out in the field. On this episode, we chat with someone who's knee-deep in the data trenches and building tools that make workflows faster, smarter, and way less painful. From using vibration data to predict when gear’s about to break, to training AI to read and summarize …
…
continue reading

1
Kubernetes: Container Orchestration, Architecture, and Evolution
25:56
25:56
Play later
Play later
Lists
Like
Liked
25:56This collection of excerpts comprehensively examines Kubernetes, the leading container orchestration platform. It traces the historical evolution of container orchestration and highlights Kubernetes' architectural foundations, including its control plane and node components. Scalability mechanisms like horizontal pod autoscaling and cell-based arch…
…
continue reading

1
#55 Decarbonizing Kubernetes with Flavia Paganelli and Niki Manoledaki
44:41
44:41
Play later
Play later
Lists
Like
Liked
44:41Did containerisation ship away our environmental responsibility? Containers come with the promise of automation, scalability and reliability. The question is how to add sustainability to the list without breaking its other benefits. To talk about these challenges, Gaël Duez welcomes Flavia Paganelli and Niki Manoledaki, 2 experts in Kubernetes who …
…
continue reading

1
Elasticsearch: Architecture, Applications, and Emerging Trends
18:13
18:13
Play later
Play later
Lists
Like
Liked
18:13This compilation of excerpts thoroughly examines Elasticsearch, focusing on its architecture, applications, and future trends. The core architecture and its integration within the Elastic Stack are highlighted, emphasizing scalability and real-time analytics. Various specialized applications are discussed, including maritime data storage, academic …
…
continue reading

1
# 54 Agility and Sustainability with Joanna Masraff and Joanne Stone
48:44
48:44
Play later
Play later
Lists
Like
Liked
48:44"We are 100% convinced that IT sustainability matters but we can’t add more non business requirements, we have agile teams." This often heard sentence from product managers or CPOs, led to this dedicated episode on agility and sustainability where host Gaël Duez welcomes 2 seasoned agile coaches: Joanne Stone, the founder of Agilist 4 planet and th…
…
continue reading

1
Ray: A Distributed Framework for Emerging AI Applications
19:40
19:40
Play later
Play later
Lists
Like
Liked
19:40This research paper introduces Ray, a distributed framework designed for emerging AI applications, particularly those involving reinforcement learning. It addresses the limitations of existing systems in handling the complex demands of these applications, which require continuous interaction with the environment. Ray unifies task-parallel and actor…
…
continue reading

1
Zanzibar: Google's Global Authorization System
27:21
27:21
Play later
Play later
Lists
Like
Liked
27:21This paper details Zanzibar, Google's globally distributed authorization system, designed to manage access control lists (ACLs) at a massive scale. Zanzibar uses a flexible data model and configuration language to handle diverse access control policies for numerous Google services, achieving high availability and low latency. The system maintains e…
…
continue reading

1
Database as a service with Percona Everest
1:02:44
1:02:44
Play later
Play later
Lists
Like
Liked
1:02:44In this episode of the Kubernetes Bytes podcast, Ryan and Bhavin sit down with Edith (Edi) Puclla, Technology Evangelist at Percona to talk about Percona Everest. The conversation focuses on Percona's investment in the Open-source ecosystem, and how they keep innovating with Percona Monitoring and Management and Percona Everest. The discussion also…
…
continue reading

1
Google Mesa: A Geo-Replicated, Near Real-Time Data Warehouse
15:02
15:02
Play later
Play later
Lists
Like
Liked
15:02**Mesa** is a highly scalable, geo-replicated data warehousing system developed at Google to handle petabytes of data related to its advertising business. **Designed for near real-time data ingestion and querying**, it processes millions of updates per second and serves billions of queries daily. **Key features include strong consistency, high avai…
…
continue reading

1
#53 Scaling GreenOps at Back Market with Dawn Baker
37:03
37:03
Play later
Play later
Lists
Like
Liked
37:03Changing its Cloud provider is never small potatoes, especially when a platform operates up to 40,000 containers and has about 4 million unique visitors a day to its website. Yet Back Market made the move from AWS to Google Cloud Platform motivated primarily by … sustainability concerns! In this episode its CTO, Dawn Backer, chats with Gaël Duez an…
…
continue reading

1
Time, Clocks, and the Ordering of Events in a Distributed System
13:50
13:50
Play later
Play later
Lists
Like
Liked
13:50This paper, "Time, Clocks, and the Ordering of Events in a Distributed System," explores the challenges of defining and managing time in distributed systems. It introduces the concept of a "happened before" relation to partially order events and presents an algorithm for creating a consistent total ordering using logical clocks. The paper then exte…
…
continue reading

1
ZooKeeper: Wait-Free Coordination for Internet-Scale Systems
26:38
26:38
Play later
Play later
Lists
Like
Liked
26:38This paper details the design and implementation of ZooKeeper, a high-performance coordination service for large-scale distributed systems. ZooKeeper provides a simple, wait-free API enabling developers to build various coordination primitives, such as locks and group membership, without server-side modifications. It achieves high throughput throug…
…
continue reading
Oil and gas engineers drowning in data, meet your new best friend—Wise Rock. We’re talking lightning-fast analytics, effortless communication, and a platform built to make your life easier. Brock Meyer, the brains behind it, joins us to share how his tech is slashing downtime, streamlining workflows, and bringing exception-based surveillance to the…
…
continue reading

1
#52 Sustainability at WordPress: an update with Csaba Varszegi, Nahuai Badiola, and Nora Ferreiros
47:46
47:46
Play later
Play later
Lists
Like
Liked
47:46“Today I learned that we have a sustainability team.Thank you for your effort in this area, looking at results of the team so far, and the ROI of time invested, it's probably a good time to officially dissolve the team entirely”. In 3 sentences, almost 3 years of work from the WordPress Sustainability Group vanished and their Slack channel archived…
…
continue reading

1
TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems
17:02
17:02
Play later
Play later
Lists
Like
Liked
17:02This paper details TensorFlow, a large-scale machine learning system developed by Google. TensorFlow uses dataflow graphs to represent computation and manages state across diverse hardware, including CPUs, GPUs, and TPUs. It offers a flexible programming model, allowing developers to experiment with novel optimizations and training algorithms beyon…
…
continue reading

1
#51 Exploring the digital revolution paradox from a UN perspective with Paz Pena and Pablo José Gamez Cersosimo
56:41
56:41
Play later
Play later
Lists
Like
Liked
56:41It’s a 252 pages report with the foreword of António Guterres, the Secretary-General of the United Nations, talking about digitalization and sustainability. And, for once, it’s not another report from the UN stating “let’s digitize everything to boost sustainability”. Quite the contrary as it states a “unequal ecological exchange between developed …
…
continue reading
This paper details Google Firestore, a NoSQL serverless database built on Spanner. It highlights Firestore's ease of use, scalability, real-time query capabilities, and support for disconnected operations. The architecture, which enables multi-tenancy and efficient handling of large datasets, is explained. Performance benchmarks and practical lesso…
…
continue reading

1
Apache Flink: Stream and Batch Processing in a Single Engine
18:12
18:12
Play later
Play later
Lists
Like
Liked
18:12This research paper details Apache Flink, an open-source system unifying stream and batch data processing. Flink uses a dataflow model to handle various data processing needs, including real-time analytics and batch jobs, within a single engine. The paper explores Flink's architecture, APIs (including DataStream and DataSet APIs), and fault-toleran…
…
continue reading

1
Kafka: A Distributed Messaging System for Log Processing
16:52
16:52
Play later
Play later
Lists
Like
Liked
16:52This paper introduces Kafka, a novel distributed messaging system designed for high-throughput log processing. Kafka addresses limitations in existing messaging systems and log aggregators by offering a scalable, efficient architecture with a simple API. Key features include a pull-based consumption model, efficient storage and data transfer mechan…
…
continue reading

1
LinkedIn: Using Set Cover to Optimize a Large-Scale Low Latency Distributed Graph
12:59
12:59
Play later
Play later
Lists
Like
Liked
12:59This research paper details LinkedIn's solution for optimizing low-latency graph computations within their large-scale distributed graph system. To improve performance, they implemented a modified greedy set cover algorithm to minimize the number of machines needed for processing second-degree connection queries. This optimization significantly red…
…
continue reading

1
Monolith: A Real-Time Recommendation System
20:25
20:25
Play later
Play later
Lists
Like
Liked
20:25This research paper details Monolith, a real-time recommendation system developed by Bytedance. Monolith addresses challenges in building scalable recommendation systems, such as sparse and dynamic data, and concept drift, by employing a collisionless embedding table and an online training architecture. Key innovations include a Cuckoo HashMap for …
…
continue reading

1
Meta FlexiRaft: Flexible Quorums for Raft Consensus
24:27
24:27
Play later
Play later
Lists
Like
Liked
24:27This research paper details FlexiRaft, a modified Raft consensus algorithm designed for Meta's petabyte-scale MySQL deployments. The core improvement is the introduction of flexible quorums, allowing configurable trade-offs between latency, throughput, and fault tolerance. Two quorum modes are presented: static and dynamic. The paper explores the a…
…
continue reading
Join Bhavin Shah and Ryan Wallner for a recap of announcements and news from KubeCon North America 2024. Check out our website at https://kubernetesbytes.com/ https://www.businesswire.com/news/home/20241119538933/en/Spectro-Cloud-Closes-75m-Series-C-Led-by-Growth-Equity-at-Goldman-Sachs-Alternatives https://northflank.com/blog/northflank-raises-22m…
…
continue reading
From Kazakhstan to Louisiana and now shaking up the oil and gas industry, Onega Ulanova’s story is as bold as her startup, QMS2GO. This AI-powered tool is turning the grind of quality management into a smooth operation—automating audits, crafting training materials, and capturing wisdom from seasoned machinists. Inspired by her own experience as a …
…
continue reading

1
Spanner: Google’s Globally Distributed Database
13:28
13:28
Play later
Play later
Lists
Like
Liked
13:28This research paper details Spanner, Google's globally-distributed database system. Spanner achieves strong consistency across its geographically dispersed data centers using a novel TrueTime API that accounts for clock uncertainty. The system features automatic sharding, failover, and a semi-relational data model, addressing limitations of previou…
…
continue reading
In this episode of the Kubernetes Bytes podcast, Ryan and Bhavin talk to Tobi Knaup, VP and General Manager of Cloud Native at Nutanix about all things Kubernetes and AI. The discussion focuses on how Kubernetes has evolved since the early days, and why it's architecture is a perfect fit for accelerating adoption of AI workloads inside organization…
…
continue reading

1
EP 59: Jacob Matson from MotherDuck
1:24:21
1:24:21
Play later
Play later
Lists
Like
Liked
1:24:21What happens when a lightning-fast database meets a quirky name like MotherDuck? You get DuckDB—an embeddable powerhouse shaking up the data warehouse world. It’s fast, it’s sleek, and it’s turning traditional multi-node setups into yesterday’s news. Jacob Matson from MotherDuck spills the beans on how they’re turbocharging DuckDB for the cloud, ma…
…
continue reading