Community Articles and Got a Question?

Can't find what you're looking for? Ask it here or check out the Community articles

1,020 Topics
2,305 Replies


Jagadeesh (Fan)

asked in Community Articles and Got a Question?

Groovy: establishing a connection to MongoDB

Hi, could you please provide a snippet for establishing a connection to MongoDB from a Groovy script?

2 years ago

giulliana_souza (Fan)

asked in Community Articles and Got a Question?

[TeraJDBC 16.20.00.10][Error 640][SQLState HY000] ResultSet: findColumn - column -- not found

We are using an offset, and the WHERE clause's initial value comes from job parameters. Twenty pipelines follow the same structure: one was built first, and we copied it to develop the other 19. On DEV and SIT they work fine. Only in prod does this error occur, for 16 pipelines; the other 4 work fine.

Giving up, after 1 error as per stage configuration. First error: SQLState: HY000 error code: 640 message: [Teradata JDBC Driver][TeraJDBC 16.20.00.10][Error 640][SQLState HY000] ResultSet: findColumn - column -- not found

We captured the SQL statement, ran it directly on Teradata, and it works fine.

2 years ago

refl_jasmel (Fan)

asked in Community Articles and Got a Question?

PostgreSQL CDC Client worked only when I made the PostgreSQL port public. Why?

I am trying to create a pipeline with the PostgreSQL CDC Client origin. The connection would not initialize until I made the PostgreSQL port public. What is the proper solution for this? Can anyone help?

2 years ago

anirbanch (Fan)

asked in Community Articles and Got a Question?

Invoking an SDC job with a set of input parameters

Hi, we know an SDC job can be invoked through an HTTP endpoint. Can a set of parameters be passed as input? Effectively we want the following: originating system X > invoke SDC job with a set of parameters > SDC job looks up a database table with the passed parameters > returns the results to a different external system Y (note: not the originating system X) > sends a response back to the originating system with a pass/fail status. Any demo pipeline will help. Are we advised to use the REST Service origin for this (https://docs.streamsets.com/portal/controlhub/latest/help/datacollector/UserGuide/Microservice/Microservice_Title.html)?

Regards, Anirban

2 years ago

ashok verma (Discovered Fame)

asked in Community Articles and Got a Question?

How to know which values are available for the Field Remover action in the SDK

In Control Hub I can see which values are available for the Field Remover's action, but how do I find them in the SDK?

field_remover = pipeline_builder_14.add_stage('Field Remover')

How can I find out through the SDK which values are valid for field_remover.action?

Thanks, Ashok.

2 years ago

mblahay (Discovered Fame)

asked in Community Articles and Got a Question?

Error Records: Send Response to Origin

A pipeline’s origin is an S3 bucket. Error records are configured to “Send Response to Origin.” What exactly happens to the error records in this instance?

2 years ago

krishnankannan (Fan)

asked in Community Articles and Got a Question?

Build pipeline in Transformer

I am trying to create a Transformer pipeline using the Python SDK, but I am unable to connect to the Transformer engine. I am getting two IDs and URLs from the sch.transformers command. Please help me.

2 years ago

ashok verma (Discovered Fame)

asked in Community Articles and Got a Question?

I want to extract multiple fields from JSON/XML using XML Parser, etc.

I am able to extract them with Groovy, but I want to achieve the following: read a file from S3 using data_format XML (step 1), then extract multiple fields from the XML (step 2).

<body><head>1</head><m>3</m><tail>2</tail></body>

In step 2 I want two values in my output without using Groovy or similar. I want to achieve this using XML Parser, Field Mapper, etc. As of today I can extract only one value, e.g. /body/head, but I want to extract both /body/head and /body/tail.

2 years ago

mySSname (Fan)

asked in Community Articles and Got a Question?

Not all files are copied from one folder to another in the same S3 bucket

Hi, I tried to copy all the files from one folder to another within the same S3 bucket using a StreamSets job, but only 1 or 2 files are copied into the destination folder (for example, the source folder has 7 files, but only 1 or 2 appear in the destination folder). Can anyone help me with this issue?

Thanks, Murali

2 years ago

pranavkatkar (Fan)

asked in Community Articles and Got a Question?

StreamSets asks to connect to Control Hub or enter a registration code after upgrading to version 4.2.0

Hi Team, to fix the Log4j vulnerability, I upgraded StreamSets from streamsets/datacollector:3.18.1 to streamsets/datacollector:4.2.0. After that, I am not able to create a new pipeline or import one. The user interface asks me to connect to Control Hub or enter an activation code, which was not the case in version 3.18.1.

2 years ago

mblahay (Discovered Fame)

asked in Community Articles and Got a Question?

Alternate Git Repo Integration

When will StreamSets introduce the ability to use Git repos such as GitHub or GitLab for version control?

1 year ago

lr123 (Fan)

asked in Community Articles and Got a Question?

JDBC Query executor used to empty table data

When the JDBC Query executor is used to empty table data, the pipeline does not stop after the task is started. Note: a Pipeline Finisher has been used on the JavaScript component.

SQL query: delete from depart_passenger_info

2 years ago

mblahay (Discovered Fame)

asked in Community Articles and Got a Question?

How to get Python SDK Activation Key

I would like to learn how to use the Python SDK. How do I go about getting an activation key for use with a personal account?

2 years ago

mySSname (Fan)

asked in Community Articles and Got a Question?

Data copy mismatch for same S3 bucket

Hi, I tried to copy all the files from one folder to another within the same S3 bucket using a StreamSets job, but more files are copied into the destination folder than exist in the source folder (for example, the source folder has 7 files, but the destination folder shows 8, 10, or 12). This issue occurs only on the first run of the day. If I run the same job again that day, the record counts match between source and destination. Can anyone help me with this issue?

Thanks, Murali

2 years ago

jerri (Roadie)

asked in Community Articles and Got a Question?

Lookups (into DeltaTable) delivering extremely bad performance when used in Transformer

Lookups (into DeltaTable) perform extremely badly (sometimes the pipeline stays in the pre-execution stage forever) when used in Transformer with an origin of 1000 records. They work decently in streaming mode, which I guess is due to the smaller number of incoming records.

2 years ago

ashok verma (Discovered Fame)

asked in Community Articles and Got a Question?

Error when launching S3 from the SDK

I have declared the S3 data format as below, but in the UI/Control Hub the data format shows blank.

pipeline_builder = sch.get_pipeline_builder(engine_type='data_collector', engine_url='XXXX')
s3_origin = pipeline_builder.add_stage('Amazon S3', type='origin')
s3_origin.data_format = 'Text'

How can I see the values allowed for any particular component in the SDK (e.g. s3_origin.data_format, s3_origin.delimiter, etc.)?

2 years ago

santhoshchalla (Fan)

asked in Community Articles and Got a Question?

Find the status of a job using REST API

Hi, I have triggered a job using the REST API in StreamSets and I need to know whether the job failed or succeeded. I am able to get the status of the job, but it is reported as Inactive in both the success and the failure case. I need the exact status (success or failure) and the error message in case of job failure. Please let me know how I can achieve this in StreamSets.

10 months ago

ashok verma (Discovered Fame)

asked in Community Articles and Got a Question?

Refreshing a launched pipeline using the SDK

After launching a pipeline using the SDK, if I have any changes to the pipeline, I want to make them from the SDK instead of the UI, and those changes have to be reflected in the UI. How can I achieve this without launching the pipeline again?

Some more queries:
1. How to preview using the SDK?
2. How to know that a stage has no errors in the SDK?

2 years ago

jmazariegos (Fan)

asked in Community Articles and Got a Question?

Dynamic SQL

Hi, I have a product table named ‘product’ in MySQL as follows:

product_id | Product | FieldName
1 | Milk | milk
2 | Water | water
3 | Coffee | coffee

Then, I have a fully de-normalized source table named ‘raw_transaction’ as follows:

transaction_Id | Date | customer | milk | water | coffee
1 1/1/2021 John 1
2 1/1/2021 Mary 1 1
3 1/1/2021 Anna 1

Can you give me a hint on how I can create a pipeline in StreamSets that uses the product table as metadata to build a dynamic query, so that I can populate a ‘FactCustomerProduct’ table as follows?

For each product in products INSERT INTO FactCustomerProduct (product_id,date_id,customer_id,transaction_id,quantity) SELECT p.product_id,r.date_id,customer_id,r.transaction_id,r.<fieldName> FROM ‘raw_transaction’ r [...] WHERE r.<fiel

2 years ago

rohan_b (Fan)

asked in Community Articles and Got a Question?

Databricks Delta Layer - DELTA_LAKE_13 Validation Error

Hi, I have created a pipeline in StreamSets Data Collector which reads data from an Apache Kafka topic and inserts it into a Databricks Delta table. Table Auto Creation has been disabled. I have created the table on the Databricks instance separately, and the columns and data types in the table are correct. But I am getting the following error while validating the pipeline. What could be the reason for the error?

Caused by: com.streamsets.pipeline.api.StageException: DELTA_LAKE_13 - Table 'gov_src_req_sts_v1' column '' unsupported type ''
 at com.streamsets.pipeline.stage.destination.definitions.JdbcTableDefStore.get(JdbcTableDefStore.java:157)
 at com.streamsets.pipeline.stage.destination.definitions.CacheTableDefStore$1.load(CacheTableDefStore.java:59)
 at com.streamsets.pipeline.stage.destination.definitions.CacheTableDefStore$1.load(CacheTableDefStore.java:53)
 at com.google.common.cache.LocalCache$LoadingValueReference.loadFuture(LocalCache.java:3542)
 at com.google.common.cache.LocalCa

2 years ago

ashok verma (Discovered Fame)

asked in Community Articles and Got a Question?

Launching a pipeline using the SDK

From the Python shell I am unable to launch Data Collector. I have tried the options below.

1.
from streamsets.sdk import DataCollector
dc = DataCollector('https://localhost:18630')
error: None object has no attribute use_websocket_tunneling

2.
from streamsets.sdk import DataCollector, ControlHub
sch = ControlHub(<SCH URL>, credential_id=<credential id>, token=<token>)
pipeline_builder = sch.get_pipeline_builder(engine_type='data_collector', engine_url=<SDC URL>)

In the above step, for engine_url I logged in to StreamSets and used the URL of the active engine under the Engines tab, and I am getting the error below.
error: instance is not in list

2 years ago

levanye (Fan)

asked in Community Articles and Got a Question?

Pipeline in Running Error state cannot be force stopped

A pipeline in the Running Error state cannot be stopped. When I click the Force Stop button, nothing happens. What should I do?

9 months ago
