Skip to main content

- - Knowledge base
Product Updates
Events

Create topic
Login/Register

30-Day Free Trial: It’s Never Been Easier To Get Started With StreamSets

A

2 years ago

Home
Community overview
StreamSets Platform
Community Articles and Got a Question?

Community Articles and Got a Question?

Can't find what you're looking for? Ask it here or check out the Community articles

1,066 Topics
2,341 Replies

When you subscribe we will email you when there is a new topic in this category

1066 Topics

SperchOpening Band

asked in Community Articles and Got a Question?

HTTP Client Loops Unceasingly

We have an HTTP Client origin that pulls from our ServiceNow site records that would be returned if we ran a report in ServiceNow directly. The records pulled appear at first blush to all be accounted for, but unfortunately the origin just keeps looping and pulling the every record over and over. I reduced the batch size to as low as 100 records without effect (i.e., it still loops endlessly).The URL with parameters is:https://<intentionally_redacted>.servicenowservices.com/api/now/table/task?sysparm_query=closed_atISNOTEMPTY%5Eclosed_at%3E%3Djavascript%3Ags.dateGenerate('2023-01-01'%2C'00%3A00%3A00')%5Eclosed_by.department.nameSTARTSWITHPBO%20-%20ETS%5Eassignment_group!%3Daeb1d3bc3772310057c29da543990ea2%5Eassignment_group!%3D4660e3fc3772310057c29da543990e0b%5EnumberNOT%20LIKEGAPRV%5EnumberNOT%20LIKERCC%5Esysparm_display_value=true%5Esysparm_limit=100%5Esysparm_offset=0

asked in Community Articles and Got a Question?

Tableau API with StreamSets pipeline Issue

Hello,I am currently facing a strange error. I am building a pipeline to keep my Tableau Personal Access Token (PAT) active, as Tableau's policy states that the PAT can expire after 14 days of inactivity. The purpose of this pipeline is to ensure that the PAT remains active. However, I received an error when I ran the pipeline.com.streamsets.pipeline.api.StageException: HTTP_32 - Error executing request. HTTP-Status: NULL Reason: java.net.ConnectException: Connection refusedAside from StreamSets, I was able to test the connection using the PAT token in Postman, and it worked fine. I tested a Python script that successfully authenticates with the Tableau API. The Tableau API appears to be working fine when I use it.As for the network, I don't use a proxy for StreamSets. Tableau Server doesn't have any traffic restrictions or any IP addresses on a blacklist. I'm wondering if it has something to do with the engine's configuration.

asked in Community Articles and Got a Question?

StreamSets Data Pipeline Engineer for ETL Modernization

Seeking a StreamSets specialist to build and optimize our data pipeline infrastructure - need assistance with pipeline design using Data Collector, real-time streaming from Kafka/databases to cloud warehouses, data transformation and validation rules, error handling and monitoring setup, Control Hub deployment for pipeline management, CDC implementation for database replication, API integrations with enterprise systems, and performance tuning for high-volume data processing. Requirements include proven StreamSets platform experience, knowledge of big data technologies and streaming architectures, understanding of data governance and lineage tracking, SQL and scripting skills for custom processors. Please demonstrate previous StreamSets implementations and data volume handled in your response.

JohnnyPdarosa956.903.6595Fan

asked in Community Articles and Got a Question?

Wanted FlyZipline, 4CopterPack, Bc/Hydro/Ev, R134a Hvac &Refrig, Visible/Total/Lte, Pcd/Nuu/Blu, TeleSat, Starlink, JohnnyPdarosa@ 956-903-6595, Keywords garden power machines harvesters attachments

Wanted FlyZipline, 4CopterPack, Bc/Hydro/Ev, R134a Hvac &Refrig, Visible/Total/Lte, Pcd/Nuu/Blu, TeleSat, Starlink, but i sell: OliveOil$6Liter, AlmondMilk$2Liter, CashewMilk$2Liter, 4 berry jelly$3Liter, Celery/ChickPeaChips$2Lb, BambaraMilk$2Liter, JohnnyPdarosa@ 956-903-6595, Keywords garden power machines harvesters attachments

JohnnyPdarosa956.903.6595Fan

asked in Community Articles and Got a Question?

.Wanted FlyZipline, 4CopterPack, Bc/Hydro/Ev, R134a Hvac &Refrig, Visible/Total/Lte, Pcd/Nuu/Blu, TeleSat, Starlink, JohnnyPdarosa@ 956-903-6595, Keywords garden power machines harvesters attachments.

.Wanted FlyZipline, 4CopterPack, Bc/Hydro/Ev, R134a Hvac &Refrig, Visible/Total/Lte, Pcd/Nuu/Blu, TeleSat, Starlink, but i sell: OliveOil$6Liter, AlmondMilk$2Liter, CashewMilk$2Liter, 4 berry jelly$3Liter, Celery/ChickPeaChips$2Lb, BambaraMilk$2Liter, JohnnyPdarosa@ 956-903-6595, Keywords garden power machines harvesters attachments.

JohnnyPdarosa9569036595Fan

posted in Community Articles and Got a Question?

.Wanted FlyZipline, 4CopterPack, Bc/Hydro/Ev, R134a Hvac &Refrig, Visible/Total/Lte, Pcd/Nuu/Blu, TeleSat, Starlink, JohnnyPdarosa@ 956-903-6595.

.Wanted FlyZipline, 4CopterPack, Bc/Hydro/Ev, R134a Hvac &Refrig, Visible/Total/Lte, Pcd/Nuu/Blu, TeleSat, Starlink, but i sell: OliveOil$6Liter, AlmondMilk$2Liter, CashewMilk$2Liter, 4 berry jelly$3Liter, Celery/ChickPeaChips$2Lb, BambaraMilk$2Liter, JohnnyPdarosa@ 956-903-6595, Keywords garden power machines harvesters attachments.

JohnnyPdarosa9569036595Fan

asked in Community Articles and Got a Question?

Wanted FlyZipline, 4CopterPack, Bc/Hydro/Ev, R134a Hvac &Refrig, Visible/Total/Lte, Pcd/Nuu/Blu, TeleSat, Starlink, JohnnyPdarosa@ 956-903-6595

Wanted FlyZipline, 4CopterPack, Bc/Hydro/Ev, R134a Hvac &Refrig, Visible/Total/Lte, Pcd/Nuu/Blu, TeleSat, Starlink, but i sell: OliveOil$6Liter, AlmondMilk$2Liter, CashewMilk$2Liter, 4 berry jelly$3Liter, Celery/ChickPeaChips$2Lb, BambaraMilk$2Liter, JohnnyPdarosa@ 956-903-6595, Keywords garden power machines harvesters attachments.

bonthunagireddy03Fan

asked in Community Articles and Got a Question?

how to add new stages to an existing pipelines using streamsets python SDK?

Hi team,how to add new stages to an existing pipelines using streamsets python SDK?

emily.suchanFan

asked in Community Articles and Got a Question?

Remove Stages from an Existing pipeline using SDK

I am looking to remove a stage from a pipeline and then republish it. Does anyone have experience doing this? Here is some sample code:pipeline = control_hub.pipeline.get(name = pipeline_name)stages = pipeline.stagesstage = stages.get(instance_name="HiveMetastore_1")stages.remove(stage) This successfully removes the stage from the ‘stages’ list object, but if I were to run pipeline.stages again, the stage I thought I removed is still there.

rahunlsingjdksFan

asked in Community Articles and Got a Question?

Is Desi Khand good for health?

Yes, Desi Khand is a healthy and natural sweetener packed with essential minerals like calcium, iron, and magnesium. Its unrefined nature retains nutrients that boost energy, support digestion, and strengthen immunity. Rich in fat-soluble vitamins (A, D, E, K), which support immunity, vision, hormone balance, and bone health. It also contains beneficial fats like omega-3s and CLA. Free from chemicals, it’s a wholesome alternative to refined sugar for tea, desserts, and daily use.If you want good desi Khand then get it from this website – https://desikhand.in/.

asked in Community Articles and Got a Question?

badges for Credly

Good evening everyone, I am new to streamsets, is there any available badges for streamsets platform?

SperchOpening Band

posted in Community Articles and Got a Question?

Idea: View pipeline data in the UI in real time

Recently while troubleshooting an issue with tech support I lamented about not being able to watch the data stream being processed in real-time as it would have been incredibly helpful. Realizing it could be helpful in just testing when building a pipeline, too, I thought I’d suggest it.Is there a platform to suggest ideas to StreamSets? If not already suggested or in the pipeline (pun intended) then how do others in the community feel about this?

JohnnyPdarosa956_903_6595Fan

asked in Community Articles and Got a Question?

JohnnyPdarosa956-903-6595 JohnnyPdarosa956_903_6595 JohnnyPdarosa JohnnyPdarosa9569036595

JohnnyPdarosa956-903-6595 JohnnyPdarosa956_903_6595 JohnnyPdarosa JohnnyPdarosa9569036595

JohnnyPdarosaFan

asked in Community Articles and Got a Question?

JohnnyPdarosa956-903-6595 JohnnyPdarosa JohnnyPdarosa9569036595

JohnnyPdarosa956-903-6595 JohnnyPdarosa JohnnyPdarosa9569036595

asked in Community Articles and Got a Question?

(JDBC) Singlestore to Singlestore Pipeline incremental load won't get new records, inserting same lookup record at destination

Greetings!I’ve created a pipeline that should listen and incrementaly load records from a login table, match the info from the logged user through a JDBC lookup and sending the enriched record to another table. the problem is that the pipeline won’t get new records after the 1st one and it keeps inserting it forever. here is the Query consumer configthe lookup config and the the destination operation

rahunlsingjdksFan

asked in Community Articles and Got a Question?

Is Geek Studio involved in scam reports?

No – Geek Studio is not a scam. While some scammers have misused the name by impersonating the company, these complaints involve fake representatives. The genuine Geek Studio has a strong reputation, excellent customer reviews, and a proven track record of providing trustworthy and professional tech support.

asked in Community Articles and Got a Question?

Error while testing a connection and connecting to a SFTP server

Hi ,I am creating a new Connection with SFTP protocol and trying to connect to the SFTP server using a private key .I inputted these while creating the Connection:Authentication as Private Key ,Private Key provider as plain text Private Key as the text that i copied from the ppk file Username the correct one there was no Passphraseand when i hit Test Connection I get the below errorStage 'com_streamsets_pipeline_lib_remote_RemoteConnectionVerifier_01' initialization error: java.lang.NullPointerException: Cannot invoke "net.schmizz.sshj.userauth.keyprovider.KeyProvider.getPublic()" because "this.kProv" is null (CONTAINER_0701) But alternatively if i input the same in the credentials tab of the pipeline and preview ,I am able to successfully connect to the sftp server and read the file there . Problem is when i create the same via a connection Kindly help me to resolve this issue .

SperchOpening Band

asked in Community Articles and Got a Question?

Snowflake Executor Destination Unable to Execute Stored Procedure

I have a stored procedure that works when I call it manually, both within Snowflake and via snowsql. I am trying to automate the call of this procedure via StreamSets, but it continues to fail with the ambiguous error message:Technical details: An exception has arisen while executing the query 'CALL ETS_METRICS.TABLEAU_METADATA_COLLECTION();': SQL compilation error: Unknown user-defined function ETS_METRICS.TABLEAU_METADATA_COLLECTIONI am aware of this post and it is not relevant since it only is about how to do it, not troubleshooting error. That said, I did try making the query a stop event and the same error is thrown. What boggles my mind the most is that I am specifying the schema even though the defined connection already connects to the correct schema, an

asked in Community Articles and Got a Question?

How can we use the aggregation query/pipeline in MongoDB Atlas lookup processor or any other processor?

Hi All, I have a requirement to use the aggregation query using match,group,sum,min,max and counts on a MongoDB Atlas collection in a StreamSets pipeline probably in a lookup processor by passing the input fields.Is it possible? Thanks,Mahender

dhanrajshindeFan

asked in Community Articles and Got a Question?

HTTP Client Processor Pagination for using Page number for Binary data

Hi, I have an API which takes page_no and page_size as parameter for pagination.API response is not a json data or had a records in list/array but a Binary Byte array as string.API returns a zip file.In response header it has total_pages, page_no and page_size. In streamsets for pagination with page number we have to provide result field path for incrementing to next page number. which is not possible in my case.refer doc https://learn.ocp.ai/guides/exports-api Could anyone please suggest solution for it.

SperchOpening Band

asked in Community Articles and Got a Question?

Possible to query the SCH for the historically longest running jobs?

Is it possible to query the SCH for the longest running jobs? For example, say you have 100 jobs, and you want to know the 10 longest running jobs out of the total (I’d happily take just reporting on averages if that is all there is). It seems like this would be a useful report or API endpoint to have.

Eric MencariniFan

asked in Community Articles and Got a Question?

Retreaving more metrics fom Jobs(SDK)

Hello!!Currently, I’m using the SDK to extract metrics from some jobs. However, instead of having only {run_count, input_count, output_count, total_error_count} from sch.job.metrics, I would like to have some other metrics as well, such as counting each record for each step of my job. Is this possible? How can I achieve this?I’m also looking to get the last received record (for each pipeline) that we can see in the real-time summary. How can I get this metric from ControlHub using the SDK?

SperchOpening Band

asked in Community Articles and Got a Question?

Configuration of Postman to Use StreamSets APIs

I am attempting to use Postman to query the runtime history of all jobs so that I can provide a list of the X slowest running jobs. I have been unable to configure Postman correct for this GET call, and would like to know from the hivemind what I am doing wrong here.Trying to use this API endpoint:https://{Control_Hub_URL}.streamsets.com/jobrunner/rest/v1/job/{jobId}/historyAuth is set to “No auth” based upon another forum post I read that stated we should include the ID and key for the API credentials used in the header, though I have tried it with API Key and providing the credentials there, as well.I have read through the 14 questions and 4 conversations that return when searching here for “Postman” and tried a few of the proffered solutions but without success. I have no parameters and the headers are where I have been trying to define API credentials. Note these credentials are used by automation in the Control Hub so they are valid. I thin

asked in Community Articles and Got a Question?

Assigning value to runtime parameter in pipeline start event as jdbc query

I am using Data collector 5.8.1 . I am selected JDBC in my pipeline start event I am using this query in my start event: DECLARE @XYZWorkFlowID INT = (SELECT TOP 1 WorkFlowID FROM dbo.RefWorkFlow WHERE WorkFlowName = 'XYZ')INSERT INTO JobTracker(JobName,WorkFlowID,Campaign,LastExecutedTime,JobStatus)VALUES ('XYZ_ABC_Export',@XYZWorkFlowID ,'Dialer',GETDATE(),'InProgress')I need to get the identity value generated by JobTracker table and assign it to a ‘JobTrackerId’ pipeline parameter. I need to use the same parameter in the stop event jdbc to update JobTracker the table.I tried adding: ${JobTrackerId} = SELECT @@IdentityI'm pretty sure this is the wron

asked in Community Articles and Got a Question?

Using expression language to get the length of a list

Docs and at least one community thread have suggested that the following expression should work in a stream selector conditional:${record:exists('/ids') && length(record:value('/ids')) > 0}This returns:ELException: No function is mapped to the name "length".The field in question is a list of integers. What am I not understanding?Thanks

1
2
3
4
...
43

Page 1 / 43

Badge winners

Sperchhas earned the badge Eager to help
vishwesh.margasahayamhas earned the badge Product expert
ajinkyahas earned the badge Innovator
Sanjeevhas earned the badge Eager to help
AkshayJadhavhas earned the badge Eager to help

Show all badges

Powered by Gainsight

Terms & Conditions Accessibility statement

Sign up

Already have an account? Login

Social Login

or

Username *

E-mail address *

What I do... *

Data Leader Data Architect Data Engineer Data Scientist Other

Company *

Country *

Zip Code *

Marketing Communications

Yes No

Password *

I have read and Agree to the Website Terms of Service and I have read and acknowledged the Privacy Policy.

loginBox.register.email_repeat

Login to the community

No account yet? Create an account

Social Login

or

Username or Email

Password

Remember me

Forgot password?

Enter your E-mail address. We'll send you an e-mail with instructions to reset your password.

Enter your e-mail address

Back to overview

Scanning file for viruses.

Sorry, we're still checking this file's contents to make sure it's safe to download. Please try again in a few minutes.

OK

This file cannot be downloaded

Sorry, our virus scanner detected that this file isn't safe to download.

OK