Discover how a bimodal integration strategy can address the major data management challenges facing your organization today.
Get the Report →Build Semantic Layer Views for Databricks Data in APOS Live Data Gateway
Use the CData Connector for Databricks in APOS Live Data Gateway to build Semantic Layer Views for Databricks data.
APOS Live Data Gateway (LDG) is a data connection and data transformation solution that enables live data connectivity and expanded data source options for SAP Analytics Cloud and other SAP solutions. When paired with CData Connectors, users can build semantic layer views for live Databricks data, enabling real-time analytics on Databricks just like working with a relational database.
With built-in optimized data processing, the CData Connector offers unmatched performance for interacting with live Databricks data. When you issue complex SQL queries to Databricks, the driver pushes supported SQL operations, like filters and aggregations, directly to Databricks and utilizes the embedded SQL engine to process unsupported operations client-side (often SQL functions and JOIN operations). Its built-in dynamic metadata querying allows you to work with and analyze Databricks data using native data types.
About Databricks Data Integration
Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:
- Access all versions of Databricks from Runtime Versions 9.1 - 13.X to both the Pro and Classic Databricks SQL versions.
- Leave Databricks in their preferred environment thanks to compatibility with any hosting solution.
- Secure authenticate in a variety of ways, including personal access token, Azure Service Principal, and Azure AD.
- Upload data to Databricks using Databricks File System, Azure Blog Storage, and AWS S3 Storage.
While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or Polybase to get live access to Databricks from within their existing RDBMs.
Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.
Getting Started
Download and Install the Connector Files
In order to access Databricks data through the APOS Live Data Gateway, you will need to download the connector files from APOS and install them on the machine hosting the Live Data Gateway. An APOS representative can deliver the necessary files.
- Install the CData DLL file (System.Data.CData.Databricks.dll) to the APOS Live Data Gateway installation directory (C:\Program Files\Live Data Gateway\Admin\ by default).
- Install the CData JAR file (cdata.jdbc.databricks.jar) to the ConnectionTest_lib folder in the installation directory (C:\Program Files\Live Data Gateway\Admin\ConnectionTest_lib\ by default).
- Install the CData JAR file (cdata.jdbc.databricks.jar) to the lib folder in the Web UI installation directory (e.g.: C:\LDG_WebUI\lib\)
Configuring the Databricks Connection String
Before establishing the connection to Databricks from the APOS Live Data Gateway, you need to configure the Databricks JDBC Connection String.
Built-in Connection String Designer
For assistance in constructing the JDBC Connection String, use the connection string designer built into the Databricks JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.
java -jar cdata.jdbc.databricks.jar
Fill in the connection properties and copy the connection string to the clipboard.
To connect to a Databricks cluster, set the properties as described below.
Note: The needed values can be found in your Databricks instance by navigating to Clusters, and selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.
- Server: Set to the Server Hostname of your Databricks cluster.
- HTTPPath: Set to the HTTP Path of your Databricks cluster.
- Token: Set to your personal access token (this value can be obtained by navigating to the User Settings page of your Databricks instance and selecting the Access Tokens tab).
Your connection string will look similar to the following:
jdbc:Databricks:Server=127.0.0.1;Port=443;TransportMode=HTTP;HTTPPath=MyHTTPPath;UseSSL=True;User=MyUser;Password=MyPassword;
Connecting to Databricks & Creating a Semantic Layer View
After installing the connector files and configuring the connection string, you are ready to connect to Databricks in the Live Data Gateway Admin tool and build a semantic layer view in the Live Data Gateway Web UI.
Configuring the Connection to Databricks
- Log into your APOS Live Data Gateway Manager
- If you haven't already, update your APOS LDG license file
- Click File -> Configurations
- Click on the "..." Menu for the License
- Select the license file from the APOS team that includes your CData Connector license
- In the APOS Live Data Gateway Manager, click "Add"
- In the APOS Live Data Gateway On the Connection tab, configure the connection:
- Set Data Source to "Database"
- Set Database to "CData Drivers"
- Set Database CData Drivers to "Databricks"
- Set Connection String to the connection string configured earlier (e.g.:
jdbc:Databricks:Server=127.0.0.1;Port=443;TransportMode=HTTP;HTTPPath=MyHTTPPath;UseSSL=True;User=MyUser;Password=MyPassword;
- Set Driver Class to "cdata.jdbc.databricks.DatabricksDriver" (this should be set by default)
- Click Test Connection
- Click Save
- Give your connection a unique prefix (e.g. "databricks")
- Highlight the newly created connection and click File -> "Approve Users For Web UI"
- Approve the appropriate DB users to create views and click "Save"
At this point, we are ready to build our semantic layer view in the Live Data Gateway Web UI.
Creating a Semantic Layer View
- In your browser, navigate to the APOS Live Data Gateway Portal
- Select a Connection (e.g. "databricks")
- Set User Name and Password to the User and Password properties for Databricks.
- Click "Login"
- Once connected, click "Semantic Layer" to create a new semantic layer view
- Click "New Semantic Layer View"
- Set the Semantic Layer View Prefix and Semantic Layer View Name
- Click "Step 2"
- Select the table(s) and column(s) you wish to include in your view
- Click "Step 3"
- Select the Measures from the available table columns
- Click "Step 5" (we skipped the "Extra Dimensions" step)
- Add any Variable Prompts
- Click "Step 6"
- Define any Table Joins
- Click "Review"
- Review you semantic layer view and click "Save"
With the Semantic Layer View created, you are ready to access your Databricks data through the APOS Live Data Gateway, enabling real-time data connectivity to Databricks data from SAP Analytics Cloud and other SAP solutions.
More Information & Free Evaluation
Please visit APOS Systems - APOS Solutions - Request Evaluation Software to request evaluation software or email [email protected] for more information on working with your live Databricks data in APOS Live Data Gateway.