Unable to Read Data Using S3 Origin in Transformer

  • 6 September 2021
  • 3 replies

Hi Team,

I am facing an issue reading data through the S3 origin within Transformer. I am able to read data through the S3 origin in Data Collector.

I am trying to read data from the S3 origin and copy it to a different location using the S3 destination, with EMR as the compute engine. The job runs for several minutes on EMR and completes successfully, and there is no error in the logs (both the EMR and the StreamSets pipeline logs). I do get the warning below, but I am not sure whether it is causing the issue: java.nio.file.NoSuchFileException: /data/transformer/runInfo/testRun__9e731964-6f21-4956-99fa-82206f3451f5__149e11c1-f697-11eb-b9dc-fd846d33049d__56e36c1c-f8c6-11eb-9295-0fa62e75e081@149e11c1-f697-11eb-b9dc-fd846d33049d/run1630923519827/driver-topLevelError.log

I have verified the staging directory as well; all the required files are being populated there and are eventually read by spark-submit.

At the end, the Transformer pipeline finishes with the status START_ERROR: Job completed successfully.

This is a showstopper for now, as it seems to be a very basic issue.

I would appreciate any resolution or pointers to proceed further.


Best answer by Giuseppe Mura 7 September 2021, 17:15




Hi @ankit, are you able to preview the data? 
Also, are you able to run a very simple Dev Origin → Trash pipeline?


Hi, I was not able to preview; I was getting an UnknownHostException error. However, the actual issue has been resolved. I followed the steps below to resolve it:

  1. Opened the port between EMR/EC2 and the Docker image.
  2. Restarted the Transformer engine after verifying compatibility between the Transformer Scala version and the EMR Scala version.

These two steps resolved the issue. 
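The compatibility check in step 2 can be sketched as a quick shell comparison. The version strings below are hypothetical placeholders: on a real setup you would take the Transformer value from the build you downloaded, and the EMR value from running `spark-submit --version` on the EMR master node.

```shell
# Hedged sketch: compare the Scala binary version (major.minor) used by
# Transformer with the one EMR's Spark build reports. Only the binary
# version matters for compatibility (e.g. 2.11 vs 2.12).
transformer_scala="2.11.12"   # hypothetical: from your Transformer install
emr_scala="2.11.12"           # hypothetical: from spark-submit --version on EMR

# Strip the patch component, keeping major.minor.
transformer_bin="${transformer_scala%.*}"
emr_bin="${emr_scala%.*}"

if [ "$transformer_bin" = "$emr_bin" ]; then
  echo "Scala binary versions match: $transformer_bin"
else
  echo "Mismatch: Transformer $transformer_bin vs EMR $emr_bin" >&2
fi
```

If the binary versions differ, the Spark jobs Transformer submits will fail or behave unpredictably, so the versions should be aligned before starting the engine.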

However, I am still unable to see output in preview; it just shows me a blank screen, even though data has been copied from the origin path to the destination path.


Hi @Ankit, what you need to check is that the Spark cluster is able to “talk” back to your Transformer engine. In your pipeline, edit the Cluster Callback URL property to use your EC2 instance’s hostname.
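For illustration, a hypothetical value might look like the following (the hostname here is made up; replace it with your EC2 instance’s public DNS name, and note that Transformer’s default HTTP port is 19630, which must also be published from the Docker container, e.g. with `docker run -p 19630:19630 ...`):

```
Cluster Callback URL: http://ec2-3-90-12-34.compute-1.amazonaws.com:19630
```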

If you have a tarball install this is not really required, as Transformer automatically takes the hostname from the machine; but since you are running in Docker, Transformer ends up using your container name, which is not reachable from your EMR cluster.

You can also find the documentation for this here: