Search a title or topic

Over 20 million podcasts, powered by 

Player FM logo
show episodes
 
Artwork

1
Streaming Audio: Apache Kafka® & Real-Time Data

Confluent, founded by the original creators of Apache Kafka®

icon
Unsubscribe
icon
Unsubscribe
Monthly
 
Streaming Audio features all things Apache Kafka®, Confluent, real-time data, and the cloud. We cover frequently asked questions, best practices, and use cases from the Kafka community—from Kafka connectors and distributed systems, to data mesh, data integration, modern data architectures, and data mesh built with Confluent and cloud Kafka as a service. Join our hosts as they stream through a series of interviews, stories, and use cases with guests from the data streaming industry. Apache®️, ...
  continue reading
 
Artwork

1
Developer Voices

Kris Jenkins

icon
Unsubscribe
icon
Unsubscribe
Monthly+
 
Deep-dive discussions with the smartest developers we know, explaining what they're working on, how they're trying to move the industry forward, and what we can learn from them. You might find the solution to your next architectural headache, pick up a new programming language, or just hear some good war stories from the frontline of technology. Join your host Kris Jenkins as we try to figure out what tomorrow's computing will look like the best way we know how - by listening directly to the ...
  continue reading
 
Artwork

1
The Binary Breakdown

The Binary Breakdown

icon
Unsubscribe
icon
Unsubscribe
Weekly
 
Binary Breakdown is your go-to podcast for exploring the latest in computer science research and technology. Each episode dives into groundbreaking papers, emerging technologies, and the ideas shaping our digital world. Whether you're a tech enthusiast, a computer science student, or a seasoned professional, Binary Breakdown decodes complex topics into insightful discussions, connecting the dots between theory and real-world application. Join us as we break down binary, byte by byte, to unco ...
  continue reading
 
Artwork

1
The Craft Of Open Source

Ben Rometsch, CEO, Flagsmith

icon
Unsubscribe
icon
Unsubscribe
Monthly
 
Welcome to The Craft of Open Source, hosted by Ben Rometsch, Co-Founder and CEO of Flagsmith. This bi-weekly show is focused on the ins and outs of the Open Source Software Community. Join Ben as he speaks with the brightest minds that have brought us some of the most adopted technologies on earth. Each episode is an interview with creators, maintainers, entrepreneurs, and key contributors to the open source community. We will cover critical topics for open source developers, contributors an ...
  continue reading
 
Welcome to the Preset Podcast, the home of "Analytics Everywhere" and "Designated Driver". Analytics Everywhere discusses wide-ranging topics in business intelligence and data engineering, and Designated Driver is a great way to get to know the database platforms of the world over a beer. These podcasts are dedicated to explore next-generation data tools and the impact they have on data teams.
  continue reading
 
Artwork

1
Java Pub House

Freddy Guime & Bob Paulin

icon
Unsubscribe
icon
Unsubscribe
Monthly
 
This podcast talks about how to program in Java; not your tipical system.out.println("Hello world"), but more like real issues, such as O/R setups, threading, getting certain components on the screen or troubleshooting tips and tricks in general. The format is as a podcast so that you can subscribe to it, and then take it with you and listen to it on your way to work (or on your way home), and learn a little bit more (or reinforce what you knew) from it.
  continue reading
 
Artwork

1
The Computing Podcast

Alex Feinberg & Vikram Rangnekar

icon
Unsubscribe
icon
Unsubscribe
Monthly
 
The Computing Podcast where we talk about all things computer science, direct from Silicon Valley. Into databases distributed system and building large scale software products? Then this is the Podcast for you. Hosted by Alex Feinberg and Vikram Rangnekar. Between the two of us we have worked for companies like Linkedin, Amazon, Microsoft, Cloudera building stateful distributed systems and ad serving engines.
  continue reading
 
Data is the fuel that ignites innovation and revolutionizes our world. It’s potent, undeniable, and the driving force for our future. Data (R)evolution, powered by Aiven, explores the intersections of data, innovation, and economics through the ever-evolving landscape of cloud technology. Led by hosts Janki Patel Westenberg and Francesco Tisiot, they navigate the currents of this digital revolution in conversation with experts at the forefront of change.
  continue reading
 
Loading …
show series
 
How do you retrofit a clustered data-processing system to use cheap commodity storage? That’s the big question in this episode as we look at one of the many attempts to build a version of Kafka that uses object storage services like S3 as its main disk, sacrificing a little latency for cheap, infinitely-scalable disks. There are several companies t…
  continue reading
 
This research paper introduces Anna, a key-value store (KVS) designed for scalable performance across diverse computing environments, from single multi-core machines to globally distributed cloud deployments. Anna achieves high performance and adaptability through a partitioned, multi-master architecture utilizing wait-free execution and coordinati…
  continue reading
 
Java’s has been evolving faster than any 30 year old language has a right to do, and there’s probably no-one more pleased about it than my guest this week - Josh Long. He’s a Java & Kotlin programming, a JVM enthusiast in general, and an advocate for Spring, and he has chapters full of news about what’s been happening in Javaland over the past few …
  continue reading
 
This academic paper introduces Conflict-free Replicated Data Types (CRDTs), which are abstract data types designed for distributed systems where data is replicated across multiple locations. CRDTs allow any replica to be modified without needing immediate coordination with other replicas, ensuring high availability and low latency. The core concept…
  continue reading
 
This content from InfoQ provides insights for software architects and developers through various formats like newsletters, articles, and conference information. It highlights topics in architecture, AI, data engineering, culture, methods, and DevOps. Featured pieces discuss Slack's cellular architecture, data stream processing patterns, cultivating…
  continue reading
 
I’m joined this week by one of the authors of Apache Kafka In Action, to take a look at the state of Kafka, event systems & stream-processing technology. It’s an approach (and a whole market) that’s had at least a decade to mature, so how has it done? What does Kafka offer to developers and businesses, and which parts do they actually care about? W…
  continue reading
 
Raft, a consensus algorithm designed for managing a replicated log in distributed systems. It aims to be more understandable than Paxos, a widely used but complex alternative, while achieving equivalent efficiency and safety. Raft separates key consensus elements like leader election, log replication, and safety, using techniques such as problem de…
  continue reading
 
This compilation of resources offers a comprehensive examination of Neo4j's graph database architecture. It explains how Neo4j differs from relational and document-oriented databases through its native graph storage. The materials describe how nodes, relationships, and properties are stored and indexed for efficient traversal and query processing. …
  continue reading
 
Building a database is a serious undertaking. There are just so many parts that you have to implement before you even get to a decent prototype, and so many hours of work before you could begin working on the ideas that would make your database unique. Apache DataFusion is a project that hopes to change all that, but building an extensible, composa…
  continue reading
 
Sentry is a large-scale, open-source error monitoring platform designed for modern distributed systems. It prioritizes actionable insights by focusing on exceptions and crashes, enriching errors with contextual data, and using features such as breadcrumbs and error grouping. Sentry's architecture employs modular and decoupled components like Relay …
  continue reading
 
These excerpts offer a detailed look at Istio's service mesh architecture, a critical component for managing microservices in cloud-native environments. The architecture is divided into a control plane and data plane, emphasizing security through automated mTLS and traffic management with advanced load balancing techniques. Observability is achieve…
  continue reading
 
Jupyter’s become an incredibly popular programming and data science tool, but how does it actually work? How have they built an interactive language execution engine? And if we understand the architecture, what else could it be used for? Joining me to look inside the Jupyter toolbox are Afshin Darian and Sylvain Corlay, two of Jupyters long-standin…
  continue reading
 
CockroachDB is a distributed SQL database designed for global scalability and resilience. The database achieves this through a unique architecture built on a monolithic key-value store, Raft-based replication, and hybrid logical clocks. Transaction management is optimized for global workloads using a non-blocking commit protocol and multi-region ca…
  continue reading
 
As a corollary to Max's new blog post, "AI in BI: The Path to Full Self Drive", this episode is a bit of.... an experiment. It explores how artificial intelligence is revolutionizing business intelligence (BI), from accelerating data analysis and surfacing key insights to enabling predictive models and automating decision-making. AI is changing how…
  continue reading
 
Snowflake, a cloud-native data warehouse, revolutionizes modern analytics through its unique architecture and capabilities. The platform separates compute and storage layers, enabling independent scaling and optimized performance. Its three-layer design encompasses cloud services, a compute layer using virtual warehouses, and a storage layer levera…
  continue reading
 
Ever since we invented makefiles, the programming world has been wrestling with the problem of building software stacks reliably. This week we’re going to look at one of the most ambitious solutions available - Nix. Nix tries to do everything from invoking your compiler to installing your language, and even providing your operating system. But how …
  continue reading
 
This collection of excerpts comprehensively examines Kubernetes, the leading container orchestration platform. It traces the historical evolution of container orchestration and highlights Kubernetes' architectural foundations, including its control plane and node components. Scalability mechanisms like horizontal pod autoscaling and cell-based arch…
  continue reading
 
This compilation of excerpts thoroughly examines Elasticsearch, focusing on its architecture, applications, and future trends. The core architecture and its integration within the Elastic Stack are highlighted, emphasizing scalability and real-time analytics. Various specialized applications are discussed, including maritime data storage, academic …
  continue reading
 
Graphite is a new image editor with an interesting architecture - it’s a classic UI-driven app, an image-manipulation language, and a library of programmable graphics primitives that any Rust coder could use, extend or add to. The result is something that you can use like Photoshop or Inkscape, or make use of in batch pipelines, a bit like ImageMag…
  continue reading
 
This research paper introduces Ray, a distributed framework designed for emerging AI applications, particularly those involving reinforcement learning. It addresses the limitations of existing systems in handling the complex demands of these applications, which require continuous interaction with the environment. Ray unifies task-parallel and actor…
  continue reading
 
This paper details Zanzibar, Google's globally distributed authorization system, designed to manage access control lists (ACLs) at a massive scale. Zanzibar uses a flexible data model and configuration language to handle diverse access control policies for numerous Google services, achieving high availability and low latency. The system maintains e…
  continue reading
 
**Mesa** is a highly scalable, geo-replicated data warehousing system developed at Google to handle petabytes of data related to its advertising business. **Designed for near real-time data ingestion and querying**, it processes millions of updates per second and serves billions of queries daily. **Key features include strong consistency, high avai…
  continue reading
 
Interview With Malcolm Matalka: Co-Founder, Terrateam Ben Rometsch chats with Malcolm Matalka, co-founder of Terrateam, to discuss the company’s inception and its eventual shift to open source. Together, they discuss how his frustrations with the click ops approach prompted Malcolm to create his own team to disrupt the system and pave the way for m…
  continue reading
 
ReScript is a strongly-typed programming language that compiles to JavaScript, and that puts it squarely in competition with TypeScript. So why would a JavaScript developer choose to learn it next? What does it offer that makes it a tempting proposition? And how are the ReScript developers making life easier for anyone who wants to make the switch?…
  continue reading
 
This paper, "Time, Clocks, and the Ordering of Events in a Distributed System," explores the challenges of defining and managing time in distributed systems. It introduces the concept of a "happened before" relation to partially order events and presents an algorithm for creating a consistent total ordering using logical clocks. The paper then exte…
  continue reading
 
This paper details the design and implementation of ZooKeeper, a high-performance coordination service for large-scale distributed systems. ZooKeeper provides a simple, wait-free API enabling developers to build various coordination primitives, such as locks and group membership, without server-side modifications. It achieves high throughput throug…
  continue reading
 
With almost every transaction being done through the internet, your personal information and finances must be protected at all costs from hackers and scammers. Therefore, cybersecurity is important now more than ever. Leading the charge in keeping the digital world safe and secure is OWASP, a non-profit organization working mainly on software secur…
  continue reading
 
Trustfall is a library based on a simple question - what happens if we can query absolutely anything? If you could join REST APIs and databases with filesystems and dockerfiles? It’s possible in theory because those are all just datasources. Predrag Gruevski is trying to make it easy by building a universal query engine, with pluggable datasources,…
  continue reading
 
This paper details TensorFlow, a large-scale machine learning system developed by Google. TensorFlow uses dataflow graphs to represent computation and manages state across diverse hardware, including CPUs, GPUs, and TPUs. It offers a flexible programming model, allowing developers to experiment with novel optimizations and training algorithms beyon…
  continue reading
 
This paper details Google Firestore, a NoSQL serverless database built on Spanner. It highlights Firestore's ease of use, scalability, real-time query capabilities, and support for disconnected operations. The architecture, which enables multi-tenancy and efficient handling of large datasets, is explained. Performance benchmarks and practical lesso…
  continue reading
 
Dimitris Kyriakoudis is a researcher, programmer and musician who's combining all three talents to build dedicated music hardware. Specifically a device called the µseq, which reads Lisp programs and uses them to drive synthesizers to make music. In this episode we go through the full platform that he's building, from soldering resistors to an RPi …
  continue reading
 
This research paper details Apache Flink, an open-source system unifying stream and batch data processing. Flink uses a dataflow model to handle various data processing needs, including real-time analytics and batch jobs, within a single engine. The paper explores Flink's architecture, APIs (including DataStream and DataSet APIs), and fault-toleran…
  continue reading
 
Interview With Chris Villa: Co-Founder, Puck & Measured Puck, an open-source project, has exploded in popularity, becoming a major focus for Chris Villa and his team. But how do you balance a runaway open-source hit with the demands of running a consultancy? Join Ben Rometsch as he sits down with Chris Villa, co-founder of both Puck and Measured, t…
  continue reading
 
If you want to build really large software systems well, you have to stop thinking of them as just software systems. Beyond a certain size, everything your software touches becomes part of the wider system. You’re part of the system, your users are part of the system, and every other employee & department & priority eventually forms part of that sy…
  continue reading
 
This paper introduces Kafka, a novel distributed messaging system designed for high-throughput log processing. Kafka addresses limitations in existing messaging systems and log aggregators by offering a scalable, efficient architecture with a simple API. Key features include a pull-based consumption model, efficient storage and data transfer mechan…
  continue reading
 
To kick off 2025 we’re looking at Fyrox a game engine built in Rust, largely by one person - Dmitry Stepanov. For an individual project, it’s covered an incredible amount of ground, covering the rendering and animation features you’d expect from a game engine, with some features that might surprise you - like Rust scripting support with hot-reloadi…
  continue reading
 
This research paper details LinkedIn's solution for optimizing low-latency graph computations within their large-scale distributed graph system. To improve performance, they implemented a modified greedy set cover algorithm to minimize the number of machines needed for processing second-degree connection queries. This optimization significantly red…
  continue reading
 
Joining this episode is James McLeod, Open Source Program Lead at NatWest Group. He looks back on his career journey, from his role at the Linux Foundation, his efforts in scaling the FINOS project, to the creation of London.js. Reflecting on the many lessons throughout his professional life, James stressed the importance of proper data documentati…
  continue reading
 
This research paper details Monolith, a real-time recommendation system developed by Bytedance. Monolith addresses challenges in building scalable recommendation systems, such as sparse and dynamic data, and concept drift, by employing a collisionless embedding table and an online training architecture. Key innovations include a Cuckoo HashMap for …
  continue reading
 
This research paper details FlexiRaft, a modified Raft consensus algorithm designed for Meta's petabyte-scale MySQL deployments. The core improvement is the introduction of flexible quorums, allowing configurable trade-offs between latency, throughput, and fault tolerance. Two quorum modes are presented: static and dynamic. The paper explores the a…
  continue reading
 
Integration testing is always a tricky thing, fraught with problems setting up the right environment and attempting to control the system’s state. That’s particularly true when you’re dealing with a mix of software and hardware, and even worse when you don’t have control of what the hardware can do. This week I’m joined by Dave Lucia of TVLab’s, wh…
  continue reading
 
This episode features Moritz Klack and Christopher Möller, Co-Founders of xyflow. They look back on their career journey leading to their current company, from developing webkid, creating Datablocks, and launching React Flow. They discuss their transition from agency work to open source, their pricing strategies, and the importance of community fee…
  continue reading
 
This research paper details Spanner, Google's globally-distributed database system. Spanner achieves strong consistency across its geographically dispersed data centers using a novel TrueTime API that accounts for clock uncertainty. The system features automatic sharding, failover, and a semi-relational data model, addressing limitations of previou…
  continue reading
 
Sam Aaron is the creator of Sonic Pi, one of the most unusual software platforms you’ll encounter. It’s a live-coding playground for making music. A tool that lets you write code that defines sounds and musical phrases, and build up a hole program that plays anything from a short bleep to a whole nightclub set. And Sam’s creator has been using it l…
  continue reading
 
This research paper introduces Minesweeper, a novel technique for automated root cause analysis (RCA) of software bugs at scale. Leveraging telemetry data, Minesweeper efficiently identifies statistically significant patterns in user app traces that correlate with bugs, even in the absence of detailed debugging information. The method uses sequenti…
  continue reading
 
Evan Czaplicki—the creator of the Elm programming language —joins me to discuss the state and future of Elm, the friendly, type-safe functional programming language. On many fronts Elm has been a huge success: it’s been popular with new and seasoned programmers alike; it’s helped push several language ideas into the mainstream; it’s been a key part…
  continue reading
 
This paper details Cassandra, a decentralized structured storage system designed for managing massive amounts of structured data across numerous commodity servers. High availability and scalability are key features, achieved through techniques like consistent hashing for data partitioning and replication strategies across multiple data centers to h…
  continue reading
 
This week we’re going to look at the most essential piece of firmware in a programmer’s toolkit - the brain. I’m joined by Chris Ferdinandi to explore what it’s like to be a programmer with ADHD. It’s an unusual topic for the channel, but the more I spoke to him, the more I wanted to know what coding is like when your brain is wired differently, ho…
  continue reading
 
The provided text is an excerpt from a research paper on FoundationDB, an open-source, distributed transactional key-value store. The paper details FoundationDB's design principles, architecture, and key features, including its unbundled architecture, strict serializability through a combination of optimistic concurrency control (OCC) and multi-ver…
  continue reading
 
Loading …
Copyright 2025 | Privacy Policy | Terms of Service | | Copyright
Listen to this show while you explore
Play