I used StreamSets Data Collector 5.4.0, Oracle character set is AL32UTF8, In preview mode, I see that the record value is wrapped in a layer of UNISTR functions, convert to unicode
like this :
Oracle CDC Client config :
is it something wrong with my JDBC config or how to set the character set ?
I believe so and also did some analysis from my end but its saying that UTF-8 should solve the issue.
For work around , can you please try out to decode the output string using Jython processor and check if the string value is as expected.
I will give a try from my end as well and let you know if any luck .
Thanks & Regards
thank you for your reply,
i get this error , does the driver not support parameters?
it looks like your data contains some chinese characters : gavin-罗江1408.
Give the following a try and see if any of the work:
You might have to change the ? to /
Thank you for your reply , i add this “jdbc:oracle:thin:@//192.168.100.130:1521/ironman/TMODE=ANSI,CHARSET=UTF8” and get this error at start pipline
can you please try this and let me know if it works for you .