
Getting the PySpark processor to work on Databricks


AkshayJadhav
StreamSets Employee

Problem:

How to make the PySpark processor work on Databricks.

 

Solution:

The PySpark processor can be made to work by adding the following environment variables to a Databricks Spark cluster whose runtime comes bundled with Python:

PYTHON_HOME=/databricks/python3/bin/python3

PYSPARK_PYTHON=/databricks/python3/bin/python3

PYSPARK_DRIVER_PYTHON=/databricks/python3/bin/python3

PYTHONPATH=/databricks/python3/bin/python3
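
To confirm the variables actually reached the cluster, a small check along these lines could be run there (for example from a notebook cell). This is only a sketch: the helper name check_python_env is made up for illustration, and it simply compares each variable against the path listed above.

```python
import os

# Variable names and the interpreter path are the ones listed in this post.
EXPECTED_PYTHON = "/databricks/python3/bin/python3"
REQUIRED_VARS = (
    "PYTHON_HOME",
    "PYSPARK_PYTHON",
    "PYSPARK_DRIVER_PYTHON",
    "PYTHONPATH",
)


def check_python_env(env=None, expected=EXPECTED_PYTHON):
    """Hypothetical helper: return {variable: problem} for every variable
    that is unset or does not match the expected path."""
    env = os.environ if env is None else env
    problems = {}
    for name in REQUIRED_VARS:
        value = env.get(name)
        if value is None:
            problems[name] = "not set"
        elif value != expected:
            problems[name] = f"set to {value!r}, expected {expected!r}"
    return problems


if __name__ == "__main__":
    issues = check_python_env()
    if issues:
        for name, problem in issues.items():
            print(f"{name}: {problem}")
    else:
        print("All four variables match the values from this post.")
```

An empty result means all four variables are present with the expected value. Anything else lists the variables that still need to be added or corrected in the cluster configuration.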

 
