Skip to main content
Question

MySQL Binary Log UTF-8 STRING problem


Mike Arov
Roadie

I am trying to set up a MySQL CDC pipeline where one of the columns is of type BINARY. 
I'm using “MySQL Binary Log" for this and have run into an issue where the processor tries to decode every BINARY column into a UTF-8 STRING, inserting a bunch of UTF replacement characters and corrupting the data (example below). 
Have I misconfigured the processor or is this a bug? 

 

2 replies

john.mcavoy
StreamSets Employee
  • StreamSets Employee
  • 1 reply
  • October 19, 2022

Hi Mike,

It looks like this issue may be caused by your Java Runtime Environment using a non-UTF-8 characterset.

Here is a Community Article which may be helpful to troubleshoot if this is the root cause:

Please let us know if this turns out to be the root cause of your problem.


Bikram
Headliner
Forum|alt.badge.img+1
  • Headliner
  • 486 replies
  • October 21, 2022

@Mike Arov 

 

can you please convert your jdbc url as given below and check if it helps in your issues.

 

jdbc:sql://xxxx.net/TMODE=ANSI,CHARSET=UTF8

 

Thanks & Regards

Bikram_


Reply