Question

Initial data load in pipeline

1 year ago
July 5, 2023
2 replies
66 views

ccariman
Fan

Hi, is there a way to load data (using JDBC and HTTP Client processors) into memory only once at the start of a pipeline and have it used by all streams?. This in order to avoid loading the data for each stream (in my case from the kafka topic).

I'll look forward for your answer.

Carlos.

+1

Bikram
Headliner
486 replies
1 year ago
July 7, 2023

@ccariman

You can use the data processing mode as batch mode ,to process data in StreamSets.

If you need to process the data in streaming mode then you can check the mode as streaming in HTTP client processor or kafka for real time data processing.

Kindly provide the issue in details so i can try to help you on it.

Sanjeev
StreamSets Employee
53 replies
1 year ago
July 11, 2023

@ccariman if the requirement is to enrich data from Kafka using a JDBC source then you can use the JDBC Lookup processor. More details on the use-case will help to provide further guidance.

Reply

Related topics

Orchestrate Initial Bulk Load to Change Data Capture

Oracle CDC - Snowflake destination throwing SNOWFLAKE_28.

JDBC Multi-Table Consumer able to handle multiple offset column ?icon

timestamp tz to ntz utc conversion.icon

Pipeline load data to Snowflake table, even though the target table does not exit, pipeline still succeededicon

Tags

Couldn't find what you're looking for?

Sign up

Social Login

Login to the community

Social Login

Scanning file for viruses.

This file cannot be downloaded