Connect to and Visualize Live Azure Data Lake Storage Data in Tableau Prep



Use CData Tableau Connectors and Tableau Prep Builder to visualize live Azure Data Lake Storage data.

Tableau is a visual analytics platform transforming the way businesses use data to solve problems. When paired with the CData Tableau Connector for Azure Data Lake Storage, you can easily get access to live Azure Data Lake Storage data within Tableau Prep. This article shows how to connect to Azure Data Lake Storage in Tableau Prep and build a simple chart.

The CData Tableau Connectors enable high-speed access to live Azure Data Lake Storage data in Tableau. Once you install the connector, you simply authenticate with Azure Data Lake Storage and you can immediately start building responsive, dynamic visualizations and dashboards. By surfacing Azure Data Lake Storage data using native Tableau data types and handling complex filters, aggregations, & other operations automatically, CData Tableau Connectors grant seamless access to Azure Data Lake Storage data.

NOTE: The CData Tableau Connectors support Tableau Prep Builder 2020.4.1 or higher. If you are using an older version of Tableau Prep Builder, you will need to use the CData Tableau Connector for Azure Data Lake Storage. If you wish to connect to Azure Data Lake Storage data in Tableau Cloud, you will need to use CData Connect Cloud.

Install the CData Tableau Connector

When you install the CData Tableau Connector for Azure Data Lake Storage, the installer should copy the TACO and JAR files to the appropriate directories. If your data source does not appear in the connection steps below, you will need to copy two files:

  1. Copy the TACO file (cdata.adls.taco) found in the lib folder of the connector's installation location (C:\Program Files\CData\CData Tableau Connector for Azure Data Lake Storage 20XX\lib on Windows) to the Tableau Prep Builder repository:

    • Windows: C:\Users\[Windows User]\Documents\My Tableau Prep Repository\Connectors
    • MacOS: /Users//Documents/My Tableau Prep Repository/Connectors
  2. Copy the JAR file (cdata.tableau.adls.jar) found in the same lib folder to the Tableau drivers directory, typically [Tableau installation location]\Drivers.

Connect to Azure Data Lake Storage in Tableau Prep Builder

Open Tableau Prep Builder and click "Connect to Data" and search for "Azure Data Lake Storage by CData." Configure the connection and click "Sign In."

Authenticating to a Gen 1 DataLakeStore Account

Gen 1 uses OAuth 2.0 in Azure AD for authentication.

For this, an Active Directory web application is required. You can create one as follows:

  1. Sign in to your Azure Account through the .
  2. Select "Azure Active Directory".
  3. Select "App registrations".
  4. Select "New application registration".
  5. Provide a name and URL for the application. Select Web app for the type of application you want to create.
  6. Select "Required permissions" and change the required permissions for this app. At a minimum, "Azure Data Lake" and "Windows Azure Service Management API" are required.
  7. Select "Key" and generate a new key. Add a description, a duration, and take note of the generated key. You won't be able to see it again.

To authenticate against a Gen 1 DataLakeStore account, the following properties are required:

  • Schema: Set this to ADLSGen1.
  • Account: Set this to the name of the account.
  • OAuthClientId: Set this to the application Id of the app you created.
  • OAuthClientSecret: Set this to the key generated for the app you created.
  • TenantId: Set this to the tenant Id. See the property for more information on how to acquire this.
  • Directory: Set this to the path which will be used to store the replicated file. If not specified, the root directory will be used.

Authenticating to a Gen 2 DataLakeStore Account

To authenticate against a Gen 2 DataLakeStore account, the following properties are required:

  • Schema: Set this to ADLSGen2.
  • Account: Set this to the name of the account.
  • FileSystem: Set this to the file system which will be used for this account.
  • AccessKey: Set this to the access key which will be used to authenticate the calls to the API. See the property for more information on how to acquire this.
  • Directory: Set this to the path which will be used to store the replicated file. If not specified, the root directory will be used.

Discover and Prep Data

Drag the tables and views you wish to work with onto the canvas. You can include multiple tables.

Data Cleansing & Filtering

To further prepare the data, you can implement filters, remove duplicates, modify columns and more.

  1. Start by clicking on the plus next to your table and selecting the Clean Step option.
  2. Select the field values to filter by. As you select values, you can see how your selections impact other fields.
  3. Opt to "Keep Only" or "Exclude" entries with your select values and the data changes in response.

Data Joins and Unions

Data joining involves combining data from two or more related tables based on a common field or key.

  1. To join multiple tables, drag a related table next to an existing table in the canvas and place it in the Join box.
  2. Select the foreign keys that exist in both tables.

Exporting Prepped Data

After you perform any cleansing, filtering, transformations, and joins, you can export the data for visualization in Tableau.

  1. Add any other needed transformations then insert an Output node at the end of the flow.
  2. Configure the node to save to a file in the format of your choice.

Once the output data is saved, you can work with it in Tableau, just like you would any other file source.

Using the CData Tableau Connector for Azure Data Lake Storage with Tableau Prep Builder, you can easily join, cleanse, filter, and aggregate Azure Data Lake Storage data for visualizations and reports in Tableau. Download a free, 30-day trial and get started today.

Ready to get started?

Download a free trial of the Azure Data Lake Storage Tableau Connector to get started:

 Download Now

Learn more:

Azure Data Lake Storage Icon Azure Data Lake Storage Tableau Connector

The fastest and easiest way to connect Tableau to Azure Data Lake Storage data. Includes comprehensive high-performance data access, real-time integration, extensive metadata discovery, and robust SQL-92 support.