A very common approach is to query data straight from Databricks via Power BI. For this you need a Databricks token and the JDBC URL. The token is generated under Account settings.
Save that token; you won't be able to retrieve it again unless you re-create it. Next we need the JDBC information from the main cluster dashboard. It is under Advanced Options.
Copy the URL; it needs a few tweaks. Change the scheme to https://, remove the portion highlighted in the blue box below, and then the token goes at the end.
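The URL tweaks above can be sketched in a few lines of Python. This is a minimal sketch under assumptions, not the definitive recipe: it assumes the JDBC URL has the shape `jdbc:spark://<host>:<port>/default;...;httpPath=<path>;...`, and it assumes the part to remove is everything between the port and the `httpPath` value. The function name `jdbc_to_https` and the sample host and path are hypothetical. The token is deliberately left out; supply it as described in the step above.

```python
def jdbc_to_https(jdbc_url: str) -> str:
    """Turn a Databricks-style JDBC URL into an https:// server address.

    Assumed input shape (an assumption, not confirmed by the article):
    jdbc:spark://<host>:<port>/default;key=value;...;httpPath=<path>;...
    """
    prefix = "jdbc:spark://"
    if not jdbc_url.startswith(prefix):
        raise ValueError("expected a URL starting with jdbc:spark://")

    # Split "<host>:<port>" from the rest of the string.
    rest = jdbc_url[len(prefix):]
    host_port, _, tail = rest.partition("/")

    # The tail is a ';'-separated list; keep only the key=value pairs.
    params = dict(part.split("=", 1) for part in tail.split(";") if "=" in part)

    http_path = params.get("httpPath")
    if not http_path:
        raise ValueError("no httpPath found in the JDBC URL")

    # Swap the scheme to https and keep only host, port, and HTTP path.
    return f"https://{host_port}/{http_path.lstrip('/')}"


if __name__ == "__main__":
    # Hypothetical JDBC URL, for illustration only.
    sample = (
        "jdbc:spark://example.cloud.databricks.com:443/default;"
        "transportMode=http;ssl=true;"
        "httpPath=sql/protocolv1/o/0/0000-000000-abc123;AuthMech=3"
    )
    print(jdbc_to_https(sample))
```

Doing the rewrite in code rather than by hand mainly helps when you connect to several clusters, since each one has its own HTTP path.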
Now open Power BI and get your data through the Spark connector, which Power BI supports natively.
It will pull in the data from the Spark cluster.
This "events" data is real data backed by a Delta Lake table, created like this:
display(spark.sql("DROP TABLE IF EXISTS events"))
display(spark.sql("CREATE TABLE events USING DELTA LOCATION '/mnt/deltadrive/'"))