Visualize Live Databricks Data in Power BI (via CData Connect Cloud)
Use the CData Power BI Connector and CData Connect Cloud to integrate live Databricks data into custom reports in Power BI.
Power BI transforms your company's data into rich visuals that you can collect and organize, so you can focus on what matters to you. When paired with CData Connect Cloud, Power BI gets access to live Databricks data for visualizations, dashboards, and more. This article shows how to use CData Connect Cloud to create a live connection to Databricks, connect to Databricks data from Power BI, and then create reports on that data in Power BI.
About Databricks Data Integration
Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:
- Access all versions of Databricks, from Runtime versions 9.1 through 13.X to both the Pro and Classic Databricks SQL versions.
- Leave Databricks in their preferred environment thanks to compatibility with any hosting solution.
- Securely authenticate in a variety of ways, including personal access token, Azure Service Principal, and Azure AD.
- Upload data to Databricks using Databricks File System, Azure Blob Storage, and AWS S3 Storage.
While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or PolyBase to get live access to Databricks from within their existing RDBMS.
Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.
Getting Started
Configure Databricks Connectivity for Power BI
Connectivity to Databricks from Power BI is made possible through CData Connect Cloud. To work with Databricks data from Power BI, we start by creating and configuring a Databricks connection.
- Log into Connect Cloud, click Connections and click Add Connection
- Select "Databricks" from the Add Connection panel
- Enter the necessary authentication properties to connect to Databricks.
To connect to a Databricks cluster, set the properties as described below.
Note: You can find the needed values in your Databricks instance by navigating to Clusters, selecting the desired cluster, and opening the JDBC/ODBC tab under Advanced Options.
- Server: Set to the Server Hostname of your Databricks cluster.
- HTTPPath: Set to the HTTP Path of your Databricks cluster.
- Token: Set to your personal access token (this value can be obtained by navigating to the User Settings page of your Databricks instance and selecting the Access Tokens tab).
- Click Create & Test
- Navigate to the Permissions tab in the Add Databricks Connection page and update the User-based permissions.
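Before entering the Server, HTTPPath, and Token values, it can help to sanity-check them, since a pasted workspace URL or an empty field is an easy mistake to make. The following is a minimal sketch in plain Python, not part of CData Connect Cloud or Databricks: the names `DatabricksConnectionInputs`, `normalize_server`, and `check_inputs` are hypothetical, and the sample values are placeholders rather than real credentials.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass(frozen=True)
class DatabricksConnectionInputs:
    """Hypothetical container for the three values entered in Connect Cloud."""
    server: str
    http_path: str
    token: str


def normalize_server(value: str) -> str:
    """Reduce a pasted workspace URL to a bare, lowercase hostname."""
    value = value.strip()
    # urlparse only fills in the hostname when a scheme or "//" prefix is present.
    parsed = urlparse(value if "://" in value else "//" + value)
    return parsed.hostname or ""


def check_inputs(inputs: DatabricksConnectionInputs) -> list[str]:
    """Return a list of human-readable problems; an empty list means none found."""
    problems = []
    if not normalize_server(inputs.server):
        problems.append("Server is empty or not a hostname")
    if not inputs.http_path.strip().strip("/"):
        problems.append("HTTPPath is empty")
    token = inputs.token.strip()
    if not token:
        problems.append("Token is empty")
    elif any(ch.isspace() for ch in token):
        problems.append("Token contains whitespace")
    return problems


if __name__ == "__main__":
    # Placeholder values only; substitute the ones from the JDBC/ODBC tab.
    example = DatabricksConnectionInputs(
        server="https://adb-0000000000000000.0.azuredatabricks.net/",
        http_path="sql/protocolv1/o/0/0000-000000-example",
        token="EXAMPLE-NOT-A-REAL-TOKEN",
    )
    print("Server to enter:", normalize_server(example.server))
    print("Problems:", check_inputs(example) or "none found")
```

These checks only catch formatting slips. Whether the values actually work is confirmed by the Create & Test step in Connect Cloud.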
With the connection configured, you are ready to connect to Databricks data from Power BI.
Query Databricks Tables
Follow the steps below to build a query to pull Databricks data into the report:
- Open Power BI Desktop and click Get Data -> Online Services -> CData Connect Cloud and click "Connect"
- Click "Sign in" and authenticate with your CData Connect Cloud account
- After signing in, click "Connect"
- Select tables in the Navigator dialog
- Click Load to establish the connection to your Databricks data from Power BI
Create Databricks Data Visualizations
After loading the data into Power BI, you can create data visualizations in the Report view by dragging fields from the Fields pane onto the canvas. Select the dimensions and measures you wish to visualize along with the chart type.
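To make the dimension and measure idea concrete, the sketch below shows in plain Python the kind of grouping a simple bar chart performs: one bar per distinct dimension value, with the measure summed within each group. It is an illustration only, not how Power BI is implemented. The rows and the column names `region`, `product`, and `revenue` are hypothetical stand-ins for a loaded Databricks table.

```python
from collections import defaultdict

# Hypothetical rows standing in for a table loaded from Databricks.
ROWS = [
    {"region": "East", "product": "A", "revenue": 120.0},
    {"region": "East", "product": "B", "revenue": 80.0},
    {"region": "West", "product": "A", "revenue": 50.0},
]


def summarize(rows, dimension, measure):
    """Sum `measure` for each distinct value of `dimension`."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[dimension]] += row[measure]
    return dict(totals)


if __name__ == "__main__":
    # One entry per bar: the dimension value and its aggregated measure.
    for key, total in summarize(ROWS, "region", "revenue").items():
        print(f"{key}: {total}")
```

Choosing `product` as the dimension instead would regroup the same rows by product, which is what happens when you drag a different field onto the chart's axis.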
Click Refresh to synchronize your report with any changes to the data.
Live Access to Databricks Data from Data Applications
With CData Connect Cloud you have a direct connection to Databricks data from Power BI. You can import more data, create new visualizations, build reports, and more — all without replicating Databricks data.
To get SQL data access to 100+ SaaS, Big Data, and NoSQL sources (including Databricks) directly from your on-premises BI, reporting, ETL, and other data applications, visit the CData Connect page and start a free trial.