Are you on top of the latest innovations in data, analytics, and AI? With data being pivotal to strategy and change, the Data-powered Innovation Jam podcast gives you the key to some of the most crucial aspects of business success. Through our guests, we bring you the latest trends from the world of data and AI, discussing the best ideas and experiences. Our hosts with their decades of profound experience and a background in avant-garde music, will also explore the edges of jazz, rock, and p ...
…
continue reading
Join us each week to hear real-world advice and insights from senior marketing, growth and product leaders who are driving growth at the world's fastest growing companies.
…
continue reading
Ready to think beyond your common data environment and unlock what's possible when people, systems, and organizations are successfully integrated? Discover how the architecture, engineering, and construction (AEC) industry and project owners can save time, reduce risk, and drive ROI with modern collaborative project information management.
…
continue reading
Come on an insightful journey across business, sustainability, technology, strategy and architecture - listen to the people who are influencing the architecture of tomorrow. Hear from the global community for Enterprise, Business, Technology Architects and related roles who want to collaborate and learn from each other. Connect with Oliver Cronk on LinkedIn if you have thoughts on topics or would like to appear on the series. Find the YouTube channel at https://YouTube.com/ArchitectTomorrow/
…
continue reading
Binary Breakdown is your go-to podcast for exploring the latest in computer science research and technology. Each episode dives into groundbreaking papers, emerging technologies, and the ideas shaping our digital world. Whether you're a tech enthusiast, a computer science student, or a seasoned professional, Binary Breakdown decodes complex topics into insightful discussions, connecting the dots between theory and real-world application. Join us as we break down binary, byte by byte, to unco ...
…
continue reading
From punk rock to process automation, from Blondie to BSON — it’s a genre-defying episode of the Data-Powered Innovation Jam where we grab a virtual bench in Central Park with Andrew Davidson, SVP of Products at MongoDB, and let the data conversation run wild. What follows is an improvisational jam session on the evolution of data, the art of distr…
…
continue reading

1
Why Data Centres Belong in Nursing Homes (Not Isolated Warehouses) with David from Leafcloud
28:55
28:55
Play later
Play later
Lists
Like
Liked
28:55In this episode, Oliver speaks with David Kohnstamm from Leafcloud about a fundamental shift in how we think about data centre locations. Forget remote warehouses - the future of sustainable computing lies in nursing homes, apartment complexes, and anywhere with constant heat demand.David reveals why traditional data centres waste enormous amounts …
…
continue reading
This research paper introduces Anna, a key-value store (KVS) designed for scalable performance across diverse computing environments, from single multi-core machines to globally distributed cloud deployments. Anna achieves high performance and adaptability through a partitioned, multi-master architecture utilizing wait-free execution and coordinati…
…
continue reading

1
Using waste heat from data centres: Deep Green - Mark Bjornsgaard
9:57
9:57
Play later
Play later
Lists
Like
Liked
9:57Please note: We apologise for the audio and video quality in this episode - as Oliver mentions, "we're a bit hesitant here because we're kind of squatting in part of Tech Show London trying to record a conversation" - but we couldn't miss the opportunity to capture this discussion!In this episode of Architect Tomorrow, we catch up with Mark from De…
…
continue reading
This academic paper introduces Conflict-free Replicated Data Types (CRDTs), which are abstract data types designed for distributed systems where data is replicated across multiple locations. CRDTs allow any replica to be modified without needing immediate coordination with other replicas, ensuring high availability and low latency. The core concept…
…
continue reading

1
The six pistols of technology trends
1:18:45
1:18:45
Play later
Play later
Lists
Like
Liked
1:18:45Ready for the remix? You might have already listened to one of the most recent episodes of the Cloud Realities podcast, where six dynamic voices from two podcast teams came together to unpack Capgemini’s TechnoVision 2025. From infrastructure and applications to collaboration, user experience, automation, and — naturally — a heavy dose of data and …
…
continue reading

1
CAP Twelve Years Later: How the "Rules" Have Changed
29:47
29:47
Play later
Play later
Lists
Like
Liked
29:47This content from InfoQ provides insights for software architects and developers through various formats like newsletters, articles, and conference information. It highlights topics in architecture, AI, data engineering, culture, methods, and DevOps. Featured pieces discuss Slack's cellular architecture, data stream processing patterns, cultivating…
…
continue reading

1
Raft versus Paxos: An Understandable Consensus Algorithm
33:14
33:14
Play later
Play later
Lists
Like
Liked
33:14Raft, a consensus algorithm designed for managing a replicated log in distributed systems. It aims to be more understandable than Paxos, a widely used but complex alternative, while achieving equivalent efficiency and safety. Raft separates key consensus elements like leader election, log replication, and safety, using techniques such as problem de…
…
continue reading
What happens when Professor Erik Proper of the Technical University of Vienna teams up with the ever-inquisitive ‘Dr. Bob’ Robert Engels – modestly supported by co-hosts Weiwei Feng and Ron Tolido - to discuss AI, models, semantics, ontologies, and context? A true PhD Fest? Or a practical exploration of the synergies between the academic and busine…
…
continue reading

1
Neo4j Architecture: Graph Database Internals, Performance, and Optimization
17:42
17:42
Play later
Play later
Lists
Like
Liked
17:42This compilation of resources offers a comprehensive examination of Neo4j's graph database architecture. It explains how Neo4j differs from relational and document-oriented databases through its native graph storage. The materials describe how nodes, relationships, and properties are stored and indexed for efficient traversal and query processing. …
…
continue reading

1
Futuring Architectures: Ecosystems, AI and Mindset Shifts
47:04
47:04
Play later
Play later
Lists
Like
Liked
47:04In this episode, Oliver Cronk talks with Ron Kersic about "Futuring Architectures" and how architects can navigate today's rapidly changing digital landscape. Ron, a self-described "recovering programmer" now at ING's Tech Strategy group, discusses his journey from early programming to enterprise architecture. He explains his concept of "Futuring A…
…
continue reading
Much like a hammer sees every problem as a nail, data experts tend to see opportunity in every single data field. Collaboration on data is still one of the best ways to bring data to life and build business value on top of it. Sharing data across organizations is one of the most powerful ways to spark innovation. But when data needs to stay private…
…
continue reading

1
Sentry: Error Monitoring at Scale - Design Principles Analysis
15:48
15:48
Play later
Play later
Lists
Like
Liked
15:48Sentry is a large-scale, open-source error monitoring platform designed for modern distributed systems. It prioritizes actionable insights by focusing on exceptions and crashes, enriching errors with contextual data, and using features such as breadcrumbs and error grouping. Sentry's architecture employs modular and decoupled components like Relay …
…
continue reading

1
Istio Service Mesh: Architecture, Security, and Traffic Management
33:58
33:58
Play later
Play later
Lists
Like
Liked
33:58These excerpts offer a detailed look at Istio's service mesh architecture, a critical component for managing microservices in cloud-native environments. The architecture is divided into a control plane and data plane, emphasizing security through automated mTLS and traffic management with advanced load balancing techniques. Observability is achieve…
…
continue reading

1
CockroachDB: SQL for Global Scale Design Principles
14:33
14:33
Play later
Play later
Lists
Like
Liked
14:33CockroachDB is a distributed SQL database designed for global scalability and resilience. The database achieves this through a unique architecture built on a monolithic key-value store, Raft-based replication, and hybrid logical clocks. Transaction management is optimized for global workloads using a non-blocking commit protocol and multi-region ca…
…
continue reading
Ever thought AI practitioners could win a Noble prize? Well, they already have. Not for predicting your next favourite movie or generating poetry, but for groundbreaking advancements in Life Sciences. And it won’t be the last time. In this episode, hosts Ron Tolido, Robert Engels, and Weiwei Feng find themselves venturing into unexpected territory …
…
continue reading

1
Snowflake: Revolutionizing Cloud Data Warehousing and Analytics
17:21
17:21
Play later
Play later
Lists
Like
Liked
17:21Snowflake, a cloud-native data warehouse, revolutionizes modern analytics through its unique architecture and capabilities. The platform separates compute and storage layers, enabling independent scaling and optimized performance. Its three-layer design encompasses cloud services, a compute layer using virtual warehouses, and a storage layer levera…
…
continue reading

1
Kubernetes: Container Orchestration, Architecture, and Evolution
25:56
25:56
Play later
Play later
Lists
Like
Liked
25:56This collection of excerpts comprehensively examines Kubernetes, the leading container orchestration platform. It traces the historical evolution of container orchestration and highlights Kubernetes' architectural foundations, including its control plane and node components. Scalability mechanisms like horizontal pod autoscaling and cell-based arch…
…
continue reading
In for a new venture? This episode of the Data-powered Innovation Jam has you covered. Our hosts Weiwei, Robert, and Ron welcome Brett Clark, Director, Global Business Development at blackshark.ai—a company with the modest mission of creating a digital twin of the entire planet using satellite data. It all started with Microsoft Flight Simulator, b…
…
continue reading

1
Elasticsearch: Architecture, Applications, and Emerging Trends
18:13
18:13
Play later
Play later
Lists
Like
Liked
18:13This compilation of excerpts thoroughly examines Elasticsearch, focusing on its architecture, applications, and future trends. The core architecture and its integration within the Elastic Stack are highlighted, emphasizing scalability and real-time analytics. Various specialized applications are discussed, including maritime data storage, academic …
…
continue reading

1
Ray: A Distributed Framework for Emerging AI Applications
19:40
19:40
Play later
Play later
Lists
Like
Liked
19:40This research paper introduces Ray, a distributed framework designed for emerging AI applications, particularly those involving reinforcement learning. It addresses the limitations of existing systems in handling the complex demands of these applications, which require continuous interaction with the environment. Ray unifies task-parallel and actor…
…
continue reading
It’s the ultimate new hit album for generative AI: intelligent automation, autonomous systems, and—of course—agents. But while ServiceNow has been a pioneering force in this space for a long time, it still seems to fly under the radar for many data and AI experts. Time to change that. Time to bridge the worlds of Planet Process and Planet Data! For…
…
continue reading

1
Zanzibar: Google's Global Authorization System
27:21
27:21
Play later
Play later
Lists
Like
Liked
27:21This paper details Zanzibar, Google's globally distributed authorization system, designed to manage access control lists (ACLs) at a massive scale. Zanzibar uses a flexible data model and configuration language to handle diverse access control policies for numerous Google services, achieving high availability and low latency. The system maintains e…
…
continue reading

1
Google Mesa: A Geo-Replicated, Near Real-Time Data Warehouse
15:02
15:02
Play later
Play later
Lists
Like
Liked
15:02**Mesa** is a highly scalable, geo-replicated data warehousing system developed at Google to handle petabytes of data related to its advertising business. **Designed for near real-time data ingestion and querying**, it processes millions of updates per second and serves billions of queries daily. **Key features include strong consistency, high avai…
…
continue reading
"It Don’t Mean a Thing If It Ain’t Got That Swing"—Duke Ellington’s immortal words composed almost a century ago resonate deeply in the latest episode of the Data-powered Innovation Jam. Hosts Ron Tolido, Robert Engels, and Weiwei Feng explore TechnoVision 2025, Capgemini’s annual technology trend analysis, where ‘The Pendulum Swing’ captures the r…
…
continue reading

1
Time, Clocks, and the Ordering of Events in a Distributed System
13:50
13:50
Play later
Play later
Lists
Like
Liked
13:50This paper, "Time, Clocks, and the Ordering of Events in a Distributed System," explores the challenges of defining and managing time in distributed systems. It introduces the concept of a "happened before" relation to partially order events and presents an algorithm for creating a consistent total ordering using logical clocks. The paper then exte…
…
continue reading

1
ZooKeeper: Wait-Free Coordination for Internet-Scale Systems
26:38
26:38
Play later
Play later
Lists
Like
Liked
26:38This paper details the design and implementation of ZooKeeper, a high-performance coordination service for large-scale distributed systems. ZooKeeper provides a simple, wait-free API enabling developers to build various coordination primitives, such as locks and group membership, without server-side modifications. It achieves high throughput throug…
…
continue reading
Cloud computing has long been a familiar fixture in the digital sky, but that doesn’t mean innovation has stopped soaring. In this episode, hosts Ron Tolido, Robert Engels, and Weiwei Feng discuss all things cloud with Carolin Eggers, Director, Data & AI, EMEA North, Google Cloud. The first kind of cloud? There’s nothing UFO about it. That’s well u…
…
continue reading

1
TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems
17:02
17:02
Play later
Play later
Lists
Like
Liked
17:02This paper details TensorFlow, a large-scale machine learning system developed by Google. TensorFlow uses dataflow graphs to represent computation and manages state across diverse hardware, including CPUs, GPUs, and TPUs. It offers a flexible programming model, allowing developers to experiment with novel optimizations and training algorithms beyon…
…
continue reading
This paper details Google Firestore, a NoSQL serverless database built on Spanner. It highlights Firestore's ease of use, scalability, real-time query capabilities, and support for disconnected operations. The architecture, which enables multi-tenancy and efficient handling of large datasets, is explained. Performance benchmarks and practical lesso…
…
continue reading
Is quantum computing an early ‘70s thing? Well, certainly if you compare it to the new, experimental music that was released then. To many, quantum computing feels like listening to an early Pink Floyd record for the very first time: alien, incomparable, confusing, but above all: intriguing. Quite different from the more mainstream band the Floyd l…
…
continue reading

1
Apache Flink: Stream and Batch Processing in a Single Engine
18:12
18:12
Play later
Play later
Lists
Like
Liked
18:12This research paper details Apache Flink, an open-source system unifying stream and batch data processing. Flink uses a dataflow model to handle various data processing needs, including real-time analytics and batch jobs, within a single engine. The paper explores Flink's architecture, APIs (including DataStream and DataSet APIs), and fault-toleran…
…
continue reading

1
Kafka: A Distributed Messaging System for Log Processing
16:52
16:52
Play later
Play later
Lists
Like
Liked
16:52This paper introduces Kafka, a novel distributed messaging system designed for high-throughput log processing. Kafka addresses limitations in existing messaging systems and log aggregators by offering a scalable, efficient architecture with a simple API. Key features include a pull-based consumption model, efficient storage and data transfer mechan…
…
continue reading
Knock, knock, Neo. Think Agents are a thing of the future? Look again at the iconic movie, The Matrix—they've been hiding in plain sight. In the first episode of 2025, hosts Ron Tolido, Weiwei Feng, and Robert Engels venture down the digital rabbit hole of Virtual Twins with Morgan Zimmerman, CEO of NETVIBES at Dassault Systèmes. It’s a fascinating…
…
continue reading

1
LinkedIn: Using Set Cover to Optimize a Large-Scale Low Latency Distributed Graph
12:59
12:59
Play later
Play later
Lists
Like
Liked
12:59This research paper details LinkedIn's solution for optimizing low-latency graph computations within their large-scale distributed graph system. To improve performance, they implemented a modified greedy set cover algorithm to minimize the number of machines needed for processing second-degree connection queries. This optimization significantly red…
…
continue reading

1
Monolith: A Real-Time Recommendation System
20:25
20:25
Play later
Play later
Lists
Like
Liked
20:25This research paper details Monolith, a real-time recommendation system developed by Bytedance. Monolith addresses challenges in building scalable recommendation systems, such as sparse and dynamic data, and concept drift, by employing a collisionless embedding table and an online training architecture. Key innovations include a Cuckoo HashMap for …
…
continue reading

1
Meta FlexiRaft: Flexible Quorums for Raft Consensus
24:27
24:27
Play later
Play later
Lists
Like
Liked
24:27This research paper details FlexiRaft, a modified Raft consensus algorithm designed for Meta's petabyte-scale MySQL deployments. The core improvement is the introduction of flexible quorums, allowing configurable trade-offs between latency, throughput, and fault tolerance. Two quorum modes are presented: static and dynamic. The paper explores the a…
…
continue reading
Do we hear sleigh bells in that surf rock soundtrack? Well, anything can happen when you tune in to the Data-powered Innovation Jam podcast—especially in the very last episode of the year! Our hosts Weiwei Feng (winning top honours for her Christmas sweater), Robert Engels, and Ron Tolido gather with their producer Arne Rossmann by a cozy fireplace…
…
continue reading

1
The Value and Potential of a Data Warehouse for the AEC Industry
21:34
21:34
Play later
Play later
Lists
Like
Liked
21:34In this episode, we dive into the benefits of implementing a data warehouse in the Architecture, Engineering, and Construction (AEC) industry. We discuss how centralizing data can address issues like fragmented systems, improve search and reporting, and prepare your organization to fully leverage AI. Whether you're migrating to the cloud or seeking…
…
continue reading
Autonomous AI isn’t just a futuristic dream—it’s here, and robots are redefining industries. But who would have thought the human touch is so crucial to success? In this electrifying episode, hosts Robert Engels, Weiwei Feng, and Ron Tolido are joined by Kence Anderson, CEO and co-founder of Composabl and author of “Designing Autonomous AI: A Guide…
…
continue reading

1
Spanner: Google’s Globally Distributed Database
13:28
13:28
Play later
Play later
Lists
Like
Liked
13:28This research paper details Spanner, Google's globally-distributed database system. Spanner achieves strong consistency across its geographically dispersed data centers using a novel TrueTime API that accounts for clock uncertainty. The system features automatic sharding, failover, and a semi-relational data model, addressing limitations of previou…
…
continue reading

1
Meta Minesweeper: Scalable Statistical Root Cause Analysis on App Telemetry
17:54
17:54
Play later
Play later
Lists
Like
Liked
17:54This research paper introduces Minesweeper, a novel technique for automated root cause analysis (RCA) of software bugs at scale. Leveraging telemetry data, Minesweeper efficiently identifies statistically significant patterns in user app traces that correlate with bugs, even in the absence of detailed debugging information. The method uses sequenti…
…
continue reading
Bayer, a global powerhouse with a diverse portfolio spanning pharmaceuticals via consumer health, all the way to crop science. So, what do you get when you have a guest from Bayer discussing data and AI? Even more diverse topics, of course. In this episode, hosts Weiwei Feng, Robert Engels, and Ron Tolido are joined by Mukesh Dubey, Product Managem…
…
continue reading

1
Cassandra- A Decentralized Structured Storage System
14:27
14:27
Play later
Play later
Lists
Like
Liked
14:27This paper details Cassandra, a decentralized structured storage system designed for managing massive amounts of structured data across numerous commodity servers. High availability and scalability are key features, achieved through techniques like consistent hashing for data partitioning and replication strategies across multiple data centers to h…
…
continue reading

1
The Strategic Importance and ROI of Digital Delivery in AEC Projects
45:39
45:39
Play later
Play later
Lists
Like
Liked
45:39In this episode of the ProjectReady Podcast, we explore the role of digital delivery in the Architecture, Engineering, and Construction (AEC) industry with special guests Jeff Walter, Digital Transformation Leader at AECOM Canada and Nicholas Childs, Construction Account Executive at Autodesk. As project complexity grows, digital delivery methods a…
…
continue reading

1
FoundationDB: A Distributed Unbundled Transactional Key Value Store
21:02
21:02
Play later
Play later
Lists
Like
Liked
21:02The provided text is an excerpt from a research paper on FoundationDB, an open-source, distributed transactional key-value store. The paper details FoundationDB's design principles, architecture, and key features, including its unbundled architecture, strict serializability through a combination of optimistic concurrency control (OCC) and multi-ver…
…
continue reading
Sometimes, language is universal – one could win the Eurovision Song Contest with lyrics like “la-la-la” or “ding-a-dong! But when it comes to sharing data across companies, sectors, and countries, we need a little more than catchy choruses. Enter Gaia-X: a pan-European platform designed to enable trusted, decentralized data sharing and digital eco…
…
continue reading

1
Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases
18:56
18:56
Play later
Play later
Lists
Like
Liked
18:56This document describes the design of Amazon Aurora, a cloud-native relational database service built to handle high-throughput, online transaction processing (OLTP) workloads. The paper highlights the challenges of traditional database architectures in cloud environments, specifically the I/O bottleneck created by network traffic. Aurora addresses…
…
continue reading

1
Pregel: A System for Large-Scale Graph Processing
22:39
22:39
Play later
Play later
Lists
Like
Liked
22:39The article is a paper published in 2010 by researchers at Google that introduces Pregel, a large-scale graph processing system. Pregel is designed for processing graphs with billions of vertices and trillions of edges, and it uses a vertex-centric approach where vertices are assigned to individual machines and communicate with each other through m…
…
continue reading

1
Dapper, a Large-Scale Distributed Systems Tracing Infrastructure
20:42
20:42
Play later
Play later
Lists
Like
Liked
20:42This paper from Google describes the design and implementation of Dapper, Google’s system for tracing requests in distributed systems. The authors explain why they chose a distributed tracing system, the design decisions they made for Dapper, and how the Dapper infrastructure has been used in practice. They also discuss the impact of Dapper on appl…
…
continue reading