Sign in to confirm you’re not a bot
This helps protect our community. Learn more
KEYNOTE: The Distributed PostgreSQL Problem & How Citus Solves it | Citus Con 2023
77Likes
3,236Views
2023May 8
In this keynote talk by Marco Slot at Citus Con: An Event for Postgres 2023, Marco explores the distributed PostgreSQL problem and how Citus solves it. Building distributed PostgreSQL is perhaps one of the most challenging software engineering projects imaginable. Early on, the Citus team decided to architect Citus as PostgreSQL extension. That way Citus remains part of the PostgreSQL ecosystem even as PostgreSQL keeps developing. Moreover, architecting Citus as an extension made distribution a feature that can simply be added to PostgreSQL without losing the versatile feature set of Postgres, nor its mature, efficient implementations. The goal of Citus is to provide high PostgreSQL performance at any scale, but simply distributing data across machines is rarely sufficient to achieve that. Crisp distribution concepts and careful trade-offs are important to favor workload patterns that benefit from scaling out. There are also many complex engineering problems given the large PostgreSQL feature set, failures and concurrency in distributed systems, and mission-critical nature of databases. Marco discusses the main engineering challenges faced over the past 10 years of developing the fastest, most mature, open-source Distributed PostgreSQL implementation: Citus. Marco Slot is a Principal Software Engineer at Microsoft and is Lead Architect of the Citus team. He has been working on PostgreSQL extensions including Citus and pg_cron since 2014 when he first joined Citus Data, prior to the Microsoft acquisition. Prior to Citus Data, Marco earned a PhD in cooperative self-driving cars at Trinity College Dublin. ► Video bookmarks: ⏩ 00:00 Introducing Marco ⏩ 02:01 Marco’s talk begins: what is Citus? ⏩ 02:56 Why we need distributed PostgreSQL ⏩ 07:44 Data locality problem ⏩ 12:54 How Citus addresses data locality ⏩ 15:12 Distributed SQL benchmark results ⏩ 15:57 Distributed data modeling problem ⏩ 18:53 Versatility problem ⏩ 19:12 Citus features & gaps ⏩ 20:09 Engineering culture problem ⏩ 22:57 Q&A ✅ Learn more: Watch more Citus Con talks: https://aka.ms/cituscon-playlist 📕 Everything you need to know about Citus Con: An Event for Postgres can be found at: https://aka.ms/cituscon 📌 Let’s connect:   Twitter – @CitusCon,   / cituscon   ⭐ Stay connected with the Citus open source project on GitHub: https://github.com/citusdata/citus 💡Take the Azure Cosmos DB for PostgreSQL training at https://aka.ms/pg-azurecosmosdb-training 🔔 Subscribe to the Citus monthly technical newsletter: https://aka.ms/citus-newsletter More resources: ✅ Distributed SQL Performance Benchmark: https://aka.ms/distributed-sql-benchmark ✅ Slides for this keynote talk: https://speakerdeck.com/citusdata/key... #CitusCon #PostgreSQL #Citus

Follow along using the transcript.

Microsoft Developer

589K subscribers