Microsoft SQL Server CDC Overview
Microsoft SQL Server is a relational database management system (RDBMS) that supports a wide variety of transaction processing, business intelligence and analytics applications in corporate IT environments. The latest version, SQL Server 2022, integrates with Azure Synapse Link and Microsoft Purview to enable customers to drive deeper insights, predictions, and governance from their data at scale. Cloud integration is enhanced with managed disaster recovery (DR) to Azure SQL Managed Instance, along with near real-time analytics, allowing database administrators to manage their data estates with greater flexibility and minimal impact to the end-user. Performance and scalability are automatically enhanced via built-in query intelligence.
The traditional approach to syncing SQL Server with complementary data stores is batch-based, where pipelines extract data from SQL Server and send it to downstream data stores. Change data capture (CDC) is a modern alternative to inefficient bulk imports. CDC extracts change events (INSERTs, UPDATEs, and DELETEs) in real-time from data stores, such as SQL Server, and provides them to downstream systems which can be applied to a copy of the data.
Decodable + Microsoft SQL Server CDC
Decodable can take input from a Microsoft SQL Server database as a source using a Debezium server which converts table changes into a stream of change records. Decodable can be a super convenient means to replicate the contents of a SQL Server database into a range of systems including messaging such as Apache Kafka, analytical databases such as Apache Pinot, or machine learning feature stores. Decodable's SQL-on-the-stream transformation process also includes data virtualization on change streams so you're processing against the table, not the change records.