Question

Need to Read ADLS files name in nested folders

  • 6 September 2023
  • 2 replies
  • 21 views

Hi All,
We have a senario where we need to read all file name in ADLS Origin Stage. We have checked out in Streamsets documentation but its not working 

Its mentioned in Doc - we can access the files with **/<File Pattern>

But its not picking up the filename.

If anyone have the solution you can share with us.
 


2 replies

Userlevel 5
Badge +1

@SUDHANSHOO 

Kindly try to use the absolute path for the nested folder to read the file from it.

If you want to read files from different folders then you can use glob pattern to handle the file from it.

 

Below the example for your reference for fetching multiple files from nested directory.

 

glob.sync("./src/handlebar/{a, b, c, d}/**/*.hbs") 

 

Thanks & regards

Bikram_

Hi Bikram,
We want to go with Generic approach, where we need to drive in each directory and read the file names:

For example:-

A/abc.txt

A/B/def/def.csv

A/B/C/ghi.txt

A/B/C/abc.xlsx

So we want to read all the files which are there in nested directory:- abc.txt, def.csv, ghi.txt, abc.xlsx

Where Dir name A/B/C need to be read at run runtime.
Environment we are using - SDC 5.2

Reply