Question

Transfomer Kafka origin consumer group

1 year ago
July 21, 2023
3 replies
30 views

p_carm
Fan
1 reply

I’m used to configuring the Data Collector Kafka origin with a specific consumer group. I need this to control Kafka offsets and the Kafka broker requires this.

In Transformer, I don’t see any way to define the consumer group. How is this done ?

+1

Bikram
Headliner
486 replies
1 year ago
July 21, 2023

@p_carm

If you need to consume data from Kafka and perform real-time stream processing, you should use StreamSets Data Collector and take advantage of its Kafka Consumer origin. If you require more complex data transformations at scale, you can use StreamSets Transformer for batch processing with Apache Spark.

In the transformer, data will be processed from Kafka, based on the Kafka topic, eliminating the need for consumer details.

p_carm
Author
Fan
1 reply
1 year ago
July 21, 2023

Thanks for that. Yes. We’re a combined SDC and Transformer implementation already. I just had a first look at the Transformer Kafka origin having used the SDC Kafka multitopic origin extensively. If it doesn’t have a consumer group setting it isn’t usable in any scenario I can foresee. We have access controls on consumer groups so you can’t just be an arbitrary consumer.

Sanjeev
StreamSets Employee
53 replies
1 year ago
July 24, 2023

@p_carm it’s a limitation on Spark side which is addressed in Spark v3.x

https://issues.apache.org/jira/browse/SPARK-26350

With Spark v3.0+ you should be able to specify consumer group via additional properties(kafka.group.id)

Reply

Related topics

Let’s introduce ourselves!

Announcing Amplitude’s New Developer Center

Community Spotlight: Saish Redkar

November Product Release Highlights

September Product Release Highlights

Tags

Couldn't find what you're looking for?

Sign up

Social Login

Login to the community

Social Login

Scanning file for viruses.

This file cannot be downloaded