Welcome

Hi, I build a pipeline using Oracle CDC client, it is a very simple pipeline , have attached my exported pipeline. I currently using sysdba access configured in Streamsets and using this account I ran below in DBeaver can get records from V$LOGMNR_CONTENTS, please refer to attached screenshot "DBeaver_logmnr_screeashot.png"", however the streamsets cdc pipeline keeps running without any input and output, I also attached the sdc.log from server.from the log I can see the pipeline has gotten the timestamp of the starting SCN operation , however it cannot get records from LOGMNR and then insert into destination.can you let know anything wrong here?

Oracle CDC client pipeline keep running without processing any records, but in DBeaver same account can get data from V$LOGMNR_CONTENTS

28 days ago

ArjoFan

"Can values be inserted into another database table from a temporary table that is created inside a function?"

29 days ago

DolphinDiscovered Fame

Hi Team,I generate Credential ID and Token via StreamSets UI → Manage → API Credential, and with the generated ID and Token I can run Curl command, returned status is “HTTP/1.1 200 OK”, also return me a json format that show my organization id, email, extra.. However, When I want to connect with below code snippet (using same credential ID and Token), it returns me error of 403. paste code and error below:>>> from streamsets.sdk import ControlHub>>> sch = ControlHub(credential_id='absd_myid', token="abcd_mytoken.") Traceback (most recent call last): File "<stdin>", line 1, in <module> File "/opt/module/sfsdcvenv/lib/python3.8/site-packages/streamsets/sdk/sch.py", line 141, in __init__ self.api_client = sch_api.ApiClient( File "/opt/module/sfsdcvenv/lib/python3.8/site-packages/streamsets/sdk/sch_api.py", line 96, in __init__ raise ValueError('Encountered error while decoding auth token: {}'.format(e))ValueError: Encountered error while decoding aut

Cannot Connect StreamSets with Python SDK, but Can connect via CURL command with same Credential ID and Token

3 days ago

hrishikeh132609Fan

realmatchaOpening Band

Error: Source : hello everyone, I want to pull data from the source to the warehouse, in the source there is a "customer_id" column, when the streamsets are run why is there an invalid customer error? in streamsets I also flag based on the "id" column. please help, thank you

JDBC ERROR

1 month ago

DolphinDiscovered Fame

Hi team,I am configuring Oracle CDC Client now, our oracle version is 19C, non-CDB oracle.I ensure our oracle instance has enabled Database Archiving Mode, cause the return of statement "select log_mode from v$database" is "ARCHIVELOG". My question is for Step "Enable Supplemental Logging", since I have three "NO" for statement "select supplemental_log_data_min, supplemental_log_data_pk, supplemental_log_data_all from v$database;". Is it mandatory to run this "alter database add supplemental log data;" ?I actually do not want to make all database tables' Supplemental Logging, only want to enable it for some related tables, if this, should I only run like below "alter table <schema name>.<table name> add supplemental log data (all) columns;" , then run "alter system archive log current;" or I need to run below in sequence?Step 1:Enable minimal supplemental logging, run "alter database add supplemental log data;"Step 2:alter table <schema name>.<table name> add su

About Oracle CDC Client Configure

2 months ago

Drew KreigerSenior Community Builder at StreamSets

Hi everyone, As of December 8th 2021 we have sunset ask.streamsets.com. You may have some questions and this post is to help answer. Why did we close down ask.streamsets.com? As we continue to grow our community the ask.streamsets.com website did not meet our level of support we wanted to provide to the community. What content was migrated over? The top 30 relevant topics were migrated over. Most questions you may have that once was found at ask.streamsets.com can be found in our up to date docs. https://streamsets.com/support/documentation-overview/Also, at this same time we have un-gated and made free over 400 knowledge base articles from our support portal. Do I maintain my status and points once had on ask.streamsets.com?Sorry, no. We wish the systems allowed this function. Going forward, Please check out our leaderboard https://community.streamsets.com/leaderboard?period=thisWeek and Where do I post my questions going forward? Post your questions and start a conversations her

Ask.streamsets.com Update

Drew KreigerSenior Community Builder at StreamSets

Welcome to the StreamSets Community! Please read over the following Community Guidelines and Code of Conduct below. Please also view StreamSets work culture Blog and Privacy policy. Who we areThe StreamSets Community Is a global community of data engineers who set out to learn, share, and expand their DataOps skills. “If you want to go fast, go alone. If you want to go far, go together” – African Proverb GuidelinesThe Do’s and Don’ts Do’s: Share helpful docs/resources, tips & tricks, and useful questions Network with other StreamSets users Help answer questions via the forum Challenge and push boundaries Be positive, Be kindDon’ts: Share personal information Attack/Call out others Degrade other members based on their knowledge level Promote a personal brand or company Code Of Conduct The StreamSets Community is dedicated to providing a harassment-free, equitable and inclusive community experience for everyone. We do not tolerate harassment of members in any form. We take this po

StreamSets Community Guidelines

posted in Events & Webinars

Drew KreigerSenior Community Builder at StreamSets

Sources and Destinations Podcast

Join and listen to our latest episode of our Sources and Destinations podcast from our hosts, @iamontheinet and Sean Anderson.S&D is a podcast about data engineering and data science talking about common design patterns and best practices. Listen where ever you get your podcasts. https://linktr.ee/sourcesanddestinations

Drew KreigerSenior Community Builder at StreamSets

We want to make sure your questions are seen and answered. If you ask your question the right way, we will accomplish answering your question fast and precisely. Here are some tips: 1. Before you ask, search first! Make sure to search your question first. The search icon/bar will always be found: At the top of the home page. Within creating a new topic. Within a topic post near your profile photo (Right Top Corner) 2. Don't hesitate ASK! This is a judge-free community. We are all here to support others and build our StreamSets knowledge. 3. Keep your data yours 🧑‍We cannot stress this enough. Do not share any persons' or your personal information (Email, Phone Number, Address, banking info, etc.) in a screenshot, post, messages, or anywhere on the StreamSets Community Platform. 4. Provide all information Please be concise with your topic/ question Title. Regarding your topic/ question description, Please be sure to provide as much detail; Platform, Version, Screenshots, Categories

5 tips to ask your question the right way

Drew KreigerSenior Community Builder at StreamSets

Hi everyone, You might be asking, what is the difference with a question and conversation? That is a great question.A conversation topic is used when you want to share something and involve the community into a discussion.A question topic is used when you need a solution for your question or problem from your community peers. I hope this helps. Lets go create questions and conversations!

Question vs Conversation

Drew KreigerSenior Community Builder at StreamSets

Hello everyone, Welcome to the StreamSets Community. I am excited to help empower members as they continue their journey to learn, share, and grow their knowledge to succeed each day. I want to introduce myself, and I hope to read more about you too!I am Drew Kreiger. I recently started as the Senior Community Manager here at StreamSets in April of 2021. In the past 5 years, I have previously worked with communities at Talend and now called Redis, where I managed community meetups, education programs, hackathons, forums, and many other great programs with fantastic community members.I enjoy working with community members as I enjoy helping users overcome issues/challenges. I also enjoy working with users on community content and seeing the impact a blog, podcast, KB article, and or event has made within the community. A fun fact about me. During college, I had studied to become a sommelier. Cheers!

Meet the Community | Introduce yourself

posted in Events & Webinars

Drew KreigerSenior Community Builder at StreamSets

Demos with Dash! Event Discussion: October 27th

Hi everyone, This post will co-inside with the September 28th Demos with Dash event! Please ask a question, share what you enjoyed or disliked. Dash is excited to chat further!

Drew KreigerSenior Community Builder at StreamSets

Currently the platform does not allow you the members to change this on your own. The team is looking to allow this functionality. I know this is frustrating. To work around this, please email Drew.Kreiger@streamsets.com to change your email and or other profile questions/needs. -Drew

How do I change my Email? (StreamSets Community Platform)

DashSenior Technical Evangelist and Developer Advocate at Snowflake

This pipeline is designed to handle (embrace!) data drift while ingesting web logs from AWS S3 and then transforming and enriching them before storing the curated data in Snowflake Data Cloud. The data drift alert is triggered if/when the data being ingested is missing a key field IP Address which is crucial for downstream analytics.

Embrace Data Drift or get left behind!

DashSenior Technical Evangelist and Developer Advocate at Snowflake

This pipeline is designed to ingest data from Amazon S3 and prepare it for training a ML model using PySpark custom processor. Once the Gradient Boosted model is trained, the model artifacts, features, accuracy of the model and other metrics are registered as an experiment in MLflow. (The pipeline runs on Databricks cluster which comes bundled with MLflow server.)

Train ML Model and register experiment in MLflow

DashSenior Technical Evangelist and Developer Advocate at Snowflake

This pipeline is designed to ingest streaming data from Kafka and load a trained ML model in Scala custom processor to predict sentiment of tweets. The pipeline runs on Databricks cluster and stores the tweets along with its score in Delta Lake.

Real-time scoring using ML model

DashSenior Technical Evangelist and Developer Advocate at Snowflake

This pipeline is designed to capture inserts and updates (SCD Type II) being uploaded to a bucket on Amazon S3 for a slowly changing dimension table -- Customers. The pipeline creates new records with version set to 1 for new customers and with version set to (current version + 1) for existing customers. The customer records are then stored in Snowflake Data Cloud.

Slowly but surely!

Drew KreigerSenior Community Builder at StreamSets

We are excited to share that we have added a new theme to our badges and ranks within the StreamSets Community Platform. Let's go over what's new while also explaining how to earn points, badges, and ranks to reach Rock Star status within the StreamSets Community. StreamSets Community ThemeWe believe we Rocked the theme. 😏 We have designed our community around experts like yourself ranking up to become Data Rock Stars. Today, we have added a new category banner image and rank icons following members reaching the Rock Star status. We hope you like the new look. What are badges? Badges are a great way to share and highlight a user of a specific milestone or involvement within the community. Badges range from; completing a course on StreamSets Academy, becoming StreamSets certified, speaking at DataOps Summit, and becoming a guest on our Sources and Destinations data engineering podcast. Take a Peep at the nifty badges! (only a few here)To earn a badge, sign up for StreamSets Acade

Community Platform Theme, Badges, and Ranks

Drew KreigerSenior Community Builder at StreamSets

Happy New Years everyone! I want to say thank you to each of you for all your hard work and contributions to our StreamSets Community. We are close to welcoming our 300th member! Taking a look back at last year we launched our new community platform with the addition of knowledge base articles, a monthly community-led newsletter, bi-annual member feedback survey, a new and improved StreamSets Academy, and much much more. Looking ahead into 2022 we are excited to launch; Virtual meetups, Pipelines and patterns examples, and much more from the feedback we received from the member feedback survey. (Click image to expand)

StreamSets Community Wrap Up 2021! 🎉

Drew KreigerSenior Community Builder at StreamSets

Hi Everyone, Thank you to those who completed the survey. This has been really helpful to understand what we are doing an awesome job on and where we can improve. We wanted to share the responses and pulse of the community today, which you’ll find in the representative responses below. We will be conducting these surveys on a quarterly basis throughout the year. You can find them on the right side of each community page labeled as “Feedback”. (REPORT) Short Answer Question #1: Thank you for your honest feedback. How can we make the community more helpful for you? Responses With more documentation of how we implement for streaming and transformer Here are some helpful docs and academy course. Not able to open ask.streamsets.com As ofDec 8, 2021ask.streamsets.com has been sunset. Going forward Community.streamsets.com is the go-to community forum and knowledge base managed and hosted by StreamSets. We have migrated the top ask.streamsets.com to our new platform and ungat

Community Member Feedback: Survey Follow Up: Q4 2021

antmcmullenStreamSets Employee

Data comes from a monitor device, with test results of different elements. Those elements have 3 values, their ID, Their Value and any error messages.The customer wants to see a flattened list of just Monitor Device and timestamp with a result of each element on seperate lines.Using the above we can achieve that, we import the data ignoring the header line. (hence labeling them with numbers) First we label the monitor “Parent” fields by using a field renamer We build a Empty map for us to correctly parse the records using expression evaluator we use a field mapping processor, to map those groups of 3 fields ( checking we only remap columns that are numeric) now we have groups, we split the groups into records using a field pivot processor This leaves a single group per record but as a group, Lets tidy that up with a field flattener so all the records are at the same level Finally we use a field renamer to label the 3 fields we have produced We ship that off to our secure storage faci

Split child records that are crosstabbed back into flat records