Solved

How to setup a continuous data replication pipeline from SQL Server to Snowflake?

2 years ago
8 February 2022
1 reply
180 views

Userlevel 5

Drew Kreiger
Senior Community Builder at StreamSets
95 replies

We would like to setup a continuous data replication pipeline from SQL Server to Snowflake, completed with historical data for several hundred tables. Would like some documentation/assistance with creating a test pipeline for our use case.

icon

Best answer by Drew Kreiger 8 February 2022, 22:11

View original

1 reply

Userlevel 5

Drew Kreiger
Author
Senior Community Builder at StreamSets
95 replies
2 years ago
8 February 2022
Answer

Here is a link to our documentation on the SQL Server CDC Client. That will describe how to setup SQL Server for CDC and about all the option for that origin. This link describes how to process change data.

In addition, here is a link to our github repository which shows an example of SQL Server CDC to Snowflake.

To properly set up a full CDC for SQL Server, you will want one pipeline that does the bulk load. That will use the JDBC Multitable Consumer Origin to read all the SQL Server tables and replicate them into Snowflake. Once the bulk load is complete, you can then start the SQL Server CDC pipeline, which runs continuously, to capture the changes and write them into Snowflake.

Reply

Couldn't find what you're looking for?

Sign up

Social Login

Login to the community

Social Login

Scanning file for viruses.

This file cannot be downloaded