Use Case:-
We have a dataset, in which we have columns as follows:-
FIRST_NAME, LAST_NAME, EMAIL, PHONE, GENDER, DEPARTMENT, JOB_TITLE, YEARS_OF_EXPEREIENCE, SALARY.
Lets sort the column SALARY in ascending order and write to a new table with just 4 columns, FIRST_NAME, LAST_NAME, YEARS_OF_EXPEREIENCE, SALARY.
Pipeline Design:-
- Snowflake Table (origin)
- Sort (Processor)
- Column Remover (Processor)
- Snowflake Table (Destination)
Pipeline Working:-
- Snowflake Origin will fetch the table and columns and pass the records to Sort processor
- Sort processor will sort the data based on the configuration and pass it to Column Remover. (SALARY column, Ascending order)
- Column Remover will keep or remove the columns based on the configuration.
- Snowflake Table Destination will write the data to a new snowflake table.