Question

pipeline want to run once my source is update.

1 year ago
June 20, 2023
2 replies
23 views

lakshmi_narayanan_t
Discovered Fame
31 replies

my origin is directory reading csv source and write it to AWS S3 BUCKET when my source any data is updated means I need to rerun my pipeline ,how can I achieve it .

+1

Bikram
Headliner
486 replies
1 year ago
June 20, 2023

@lakshmi_narayanan_t

Can you use kafka processor in your case , if yes then it will be solve your problem.

Pipeline 1:

Read data from source and send it to Kafka producer .

Pipeline 2 :

Fetch data from kafka topic and send to S3 bucket .

In this case you will get the updated data in case of any changes from the source.

Please let me know if it helps , else i will help you on the second approach to come over your issue.

Sanjeev
StreamSets Employee
53 replies
1 year ago
July 11, 2023

@lakshmi_narayanan_t depending upon the read order configured the directory origin will automatically pick up new files as and when they arrive as long as the pipeline is running continuously or runs based on a regular schedule.

Reply

Related topics

Is NWC still being supported and developed?icon

PowerApps and Flow vs Nintex Forms and Workflow

TLS and Start NWC Workflow

Integrating NWC to Node JS example

K2 Cloud: Authentication, Security and other Frequently-asked Questions

Tags

Couldn't find what you're looking for?

Sign up

Social Login

Login to the community

Social Login

Scanning file for viruses.

This file cannot be downloaded