Back
March 22, 2024
5
min read

Checkpoint Chronicle - March 2024

By
Robin Moffatt
Share this post

Welcome to the Checkpoint Chronicle, a monthly roundup of interesting stuff in the data and streaming space. Your hosts and esteemed curators of said content are Gunnar Morling and Robin Moffatt (your editor-in-chief for this edition). Feel free to send our way any choice nuggets that you think we should feature in future editions.

Stream Processing, Streaming SQL, and Streaming Databases

Event Streaming

Change Data Capture

Data Platforms and Architecture

Data Ecosystem

  • Two good articles about real-world database administration, with GitHub.com’s migration from MySQL 5.7 to 8.0, and the UK Government’s Digital Service’s migration of their PostgreSQL database within AWS.
  • The ecosystem around open-table formats (OTF) continues to be an interesting one, with the project formerly known as OneTable now becoming Apache XTable (incubating). This move to an ASF project I think is really important as it defines a well-known and respected governance model. It will still be interesting to see how OTFs play out in the coming months and years—will we end up with one format simply winning outright, or will the cross-format support that XTable (and to a slight extent Databricks’ Uniform) provide give the best of all worlds? Personally, I’m sceptical; I think the formats will slug it out and in the end, one or two will survive—perhaps for different use cases—and the others will either pivot or die.
  • LinkedIn has quite the pedigree for open-source projects (Apache Kafka, anyone?) and this newly open-sourced one looks interesting. OpenHouse is described as a control plane for managed tables, and includes a catalog and services for managing Iceberg tables.
  • I always enjoy reading my former colleague Jack Vanlightly’s writing, and this recent article about Confluent’s new capability called Tableflow—which will support writing from Apache Kafka into Apache Iceberg—is no exception. 

    Papers of the Month

    Events & Call for Papers (CfP)

    New Releases

    A bumper batch of new releases this month:

    Plus a couple of useful-looking projects that I noticed:

    That’s all for this month! We hope you’ve enjoyed the newsletter and would love to hear about any feedback or suggestions you’ve got.

    Gunnar (LinkedIn / X / Mastodon / Email)
    Robin (LinkedIn / X / Mastodon / Email)

    📫 Email signup 👇

    Did you enjoy this issue of Checkpoint Chronicle? Would you like the next edition delivered directly to your email to read from the comfort of your own home?

    Simply enter your email address here and we'll send you the next issue as soon as it's published—and nothing else, we promise!

    👍 Got it!
    Oops! Something went wrong while submitting the form.
    Robin Moffatt

    Robin is a Principal DevEx Engineer at Decodable. He has been speaking at conferences since 2009 including QCon, Devoxx, Strata, Kafka Summit, and Øredev. You can find many of his talks online and his articles on the Decodable blog as well as his own blog.

    Outside of work, Robin enjoys running, drinking good beer, and eating fried breakfasts—although generally not at the same time.

    Table of contents

    Let's Get Decoding

    Welcome to the Checkpoint Chronicle, a monthly roundup of interesting stuff in the data and streaming space. Your hosts and esteemed curators of said content are Gunnar Morling and Robin Moffatt (your editor-in-chief for this edition). Feel free to send our way any choice nuggets that you think we should feature in future editions.

    Stream Processing, Streaming SQL, and Streaming Databases

    Event Streaming

    Change Data Capture

    Data Platforms and Architecture

    Data Ecosystem

    • Two good articles about real-world database administration, with GitHub.com’s migration from MySQL 5.7 to 8.0, and the UK Government’s Digital Service’s migration of their PostgreSQL database within AWS.
    • The ecosystem around open-table formats (OTF) continues to be an interesting one, with the project formerly known as OneTable now becoming Apache XTable (incubating). This move to an ASF project I think is really important as it defines a well-known and respected governance model. It will still be interesting to see how OTFs play out in the coming months and years—will we end up with one format simply winning outright, or will the cross-format support that XTable (and to a slight extent Databricks’ Uniform) provide give the best of all worlds? Personally, I’m sceptical; I think the formats will slug it out and in the end, one or two will survive—perhaps for different use cases—and the others will either pivot or die.
    • LinkedIn has quite the pedigree for open-source projects (Apache Kafka, anyone?) and this newly open-sourced one looks interesting. OpenHouse is described as a control plane for managed tables, and includes a catalog and services for managing Iceberg tables.
    • I always enjoy reading my former colleague Jack Vanlightly’s writing, and this recent article about Confluent’s new capability called Tableflow—which will support writing from Apache Kafka into Apache Iceberg—is no exception. 

      Papers of the Month

      Events & Call for Papers (CfP)

      New Releases

      A bumper batch of new releases this month:

      Plus a couple of useful-looking projects that I noticed:

      That’s all for this month! We hope you’ve enjoyed the newsletter and would love to hear about any feedback or suggestions you’ve got.

      Gunnar (LinkedIn / X / Mastodon / Email)
      Robin (LinkedIn / X / Mastodon / Email)

      📫 Email signup 👇

      Did you enjoy this issue of Checkpoint Chronicle? Would you like the next edition delivered directly to your email to read from the comfort of your own home?

      Simply enter your email address here and we'll send you the next issue as soon as it's published—and nothing else, we promise!

      👍 Got it!
      Oops! Something went wrong while submitting the form.
      Robin Moffatt

      Robin is a Principal DevEx Engineer at Decodable. He has been speaking at conferences since 2009 including QCon, Devoxx, Strata, Kafka Summit, and Øredev. You can find many of his talks online and his articles on the Decodable blog as well as his own blog.

      Outside of work, Robin enjoys running, drinking good beer, and eating fried breakfasts—although generally not at the same time.

      Let's Get Decoding