Solved

Is it possible to auto increment values of a field using Field Replacer?

7 months ago
August 1, 2024
2 replies
60 views

Sperch
Opening Band
11 replies

The title sums up what I am looking for.

The context is that I have a pipeline with a column that is always NULL but the business owner wants it to be populated with an auto incrementing integer to use as a primary key in the Snowflake table destination. I tried permitting the column to accept NULLs in Snowflake while still having it configured to add an integer for each record, but it just uses the NULLs. Taking the field out of the pipeline of course fails due to the missing field.

I am hoping to avoid staging this data in another table and then using INSERT to get the auto increment if StreamSets has the capabilities to do this via regex or some other means.

Thank you in advance for your time to anyone that reads this.

Best answer by Sperch

I discovered a solution elsewhere online (I have since lost the URL and don’t recall, but know I did not create this). By inserting a Jython Evaluator and using this code:

# Increment CMPNY_LCTN_ID by 1 for each record
state['CMPNY_LCTN_ID'] = 1

for record in records:
try:
    record.value['CMPNY_LCTN_ID'] = sdc.state['CMPNY_LCTN_ID']
    state['CMPNY_LCTN_ID'] += 1
    output.write(record)
except Exception as e:
    error.write(record, str(e))

I was able to get the desired result. That said, it will restart from 1 every time a batch is sent through so I had to increase the batch size to 10,000 to handle all the records in a single batch and not repeat the assigned ID. This works for us in this scenario due to the relatively small number of records at this time, but this is not an elegant or sustainable long-term solution.

View original

Did this topic help you find an answer to your question?

Sperch
Author
Opening Band
11 replies
Answer
6 months ago
September 4, 2024

I discovered a solution elsewhere online (I have since lost the URL and don’t recall, but know I did not create this). By inserting a Jython Evaluator and using this code:

# Increment CMPNY_LCTN_ID by 1 for each record
state['CMPNY_LCTN_ID'] = 1

for record in records:
try:
    record.value['CMPNY_LCTN_ID'] = sdc.state['CMPNY_LCTN_ID']
    state['CMPNY_LCTN_ID'] += 1
    output.write(record)
except Exception as e:
    error.write(record, str(e))

I was able to get the desired result. That said, it will restart from 1 every time a batch is sent through so I had to increase the batch size to 10,000 to handle all the records in a single batch and not repeat the assigned ID. This works for us in this scenario due to the relatively small number of records at this time, but this is not an elegant or sustainable long-term solution.

lakshman
Fan
2 replies
6 months ago
September 4, 2024

write data into mysql from mongodb how to handle auto increment column

Could you please refer to this? It might give you an idea.

Reply

Related topics

Weekly Product Release Notes - December 21, 2020

Weekly Product Release Notes - December 21, 2022

Weekly Product Release Notes - December 16, 2020

Weekly Product Release Notes - December 9, 2020

Tags

Couldn't find what you're looking for?

Sign up

Social Login

Login to the community

Social Login

Scanning file for viruses.

This file cannot be downloaded