Ready to get started?

Download a free trial of the Azure Data Lake Storage Tableau Connector to get started:

 Download Now

Learn more:

Azure Data Lake Storage Icon Azure Data Lake Storage Tableau Connector

The fastest and easiest way to connect Tableau to Azure Data Lake Storage data. Includes comprehensive high-performance data access, real-time integration, extensive metadata discovery, and robust SQL-92 support.

Publish Azure Data Lake Storage-Connected Dashboards in Tableau Server



Use CData Tableau Connectors and Tableau Server to visualize live Azure Data Lake Storage data.

Tableau Server is a visual analytics platform transforming the way businesses use data to solve problems. When paired with the CData Tableau Connector for Azure Data Lake Storage, you get access to live Azure Data Lake Storage data within Tableau Server. This article shows how to connect to Azure Data Lake Storage in Tableau Desktop, publish a Data Source to Tableau Server, and build a simple chart from that data.

The CData Tableau Connectors enable high-speed access to live Azure Data Lake Storage data in Tableau Server. Once you install the connector, you simply authenticate with Azure Data Lake Storage and you can immediately start building responsive, dynamic visualizations and dashboards. By surfacing Azure Data Lake Storage data using native Tableau data types and handling complex filters, aggregations, & other operations automatically, CData Tableau Connectors grant seamless access to Azure Data Lake Storage data.

NOTE: The CData Tableau Connectors require Tableau 2020.3 or higher. If you are using an older version of Tableau, you will need to use the CData Tableau Connector for Azure Data Lake Storage. If you wish to connect to Azure Data Lake Storage data in Tableau Cloud, you will need to use CData Connect.

Enable Connectivity to Azure Data Lake Storage in Tableau Server

Start by installing the CData Tableau Connector on the machine hosting Tableau Server.

Installation on a Linux Machine:

  1. Unpack the setup.zip archive. Creating a directory for the connector like /opt/cdata/adls is recommended, but the exact install location does not matter.
  2. Navigate to the lib subdirectory under the install directory. Generate a license file by running this command and following the prompts: java -jar cdata.tableau.adls.jar -l
  3. Copy cdata.tableau.adls.jar and cdata.tableau.adls.lic into the drivers directory, /opt/tableau/tableau_driver/jdbc.
  4. Copy cdata.adls.taco into the connectors directory, MyTableauServerRootDir/data/tabsvc/vizqlserver/Connectors. In most cases MyTableauServerRootDir is located at /var/opt/tableau/tableau_server.
  5. Restart Tableau Server.

If you cannot execute the java commmand from step 2, you will need to install a Java runtime environment. The name of this package differs on Red Hat-based and Debian-based systems:

OSJava Package
Ubuntuopenjdk-8-jre-headless
Debianopenjdk-8-jre-headless
RHELjava-1.8.0-openjdk
CentOSjava-1.8.0-openjdk
Fedorajava-1.8.0-openjdk
SUSEjava-1_8_0-openjdk

You can substitute Java 8 with a later Java release as needed.

Installation on a Windows Machine:

  1. Run the setup.exe installer. The driver JAR file will be automatically placed in the drivers directory, C:\Program Files\Tableau\Drivers.
  2. Navigate to the lib folder under the installation directory. By default the installation directory is a folder in C:\Program Files\CData.
  3. Copy cdata.tableau.adls.lic into the drivers directory.
  4. Copy cdata.tableau.adls.taco into the connectors directory, MyTableauServerRootDir/data/tabsvc/vizqlserver/Connectors. In most cases MyTableauServerRootDir is located under C:\ProgramData.
  5. Restart Tableau Server.

Connect to Azure Data Lake Storage in Tableau Desktop

Once the connectors are installed on the Server machine, we can configure a connection to Azure Data Lake Storage in Tableau Desktop and publish a Azure Data Lake Storage-based Data Source to Tableau Server.

  1. Open Tableau Desktop.
  2. Click More under Connect -> To a Server.
  3. Select "Azure Data Lake Storage by CData".
  4. Configure the connection to the data.

    Authenticating to a Gen 1 DataLakeStore Account

    Gen 1 uses OAuth 2.0 in Azure AD for authentication.

    For this, an Active Directory web application is required. You can create one as follows:

    1. Sign in to your Azure Account through the .
    2. Select "Azure Active Directory".
    3. Select "App registrations".
    4. Select "New application registration".
    5. Provide a name and URL for the application. Select Web app for the type of application you want to create.
    6. Select "Required permissions" and change the required permissions for this app. At a minimum, "Azure Data Lake" and "Windows Azure Service Management API" are required.
    7. Select "Key" and generate a new key. Add a description, a duration, and take note of the generated key. You won't be able to see it again.

    To authenticate against a Gen 1 DataLakeStore account, the following properties are required:

    • Schema: Set this to ADLSGen1.
    • Account: Set this to the name of the account.
    • OAuthClientId: Set this to the application Id of the app you created.
    • OAuthClientSecret: Set this to the key generated for the app you created.
    • TenantId: Set this to the tenant Id. See the property for more information on how to acquire this.
    • Directory: Set this to the path which will be used to store the replicated file. If not specified, the root directory will be used.

    Authenticating to a Gen 2 DataLakeStore Account

    To authenticate against a Gen 2 DataLakeStore account, the following properties are required:

    • Schema: Set this to ADLSGen2.
    • Account: Set this to the name of the account.
    • FileSystem: Set this to the file system which will be used for this account.
    • AccessKey: Set this to the access key which will be used to authenticate the calls to the API. See the property for more information on how to acquire this.
    • Directory: Set this to the path which will be used to store the replicated file. If not specified, the root directory will be used.
  5. Click "Sign In".

Discover Schemas and Query Data

Once you establish the connection to Azure Data Lake Storage data, you can configure which entities to visualize.

  1. Select CData from the Database pull-down menu.
  2. Select ADLS from the Schema pull-down menu.
  3. Drag the tables and views you wish to visualize onto the join area. You can include multiple tables.
  4. Select Update Now or Automatically Update. Update Now lets you preview the first 10,000 rows of the data source (or enter the number of rows you want to see in the Rows text box). Automatically Update auto-loads the changes in the preview area.

Publish Data to Tableau Server

After you configure the data you wish to visualize, you can publish the Data Source to a Tableau Server instance. In Tableau Desktop:

  1. Click Server -> Sign In.
  2. Enter the URL for your Tableau Server.
  3. Authenticate with Tableau Server credentials.
  4. Click Server -> Publish Data Source and select your data source.
  5. Click Publish.
  6. Select the Project, name the Data Source, and optionally add a description.
  7. Click Publish.

This creates a new entry under the server's data source list, from which you an change the data source's permissions, view its history, and perform other management tasks.

Note that workstation connected to the same server will be able to use the same source in Tableau Desktop, even if the connector isn't installed there. Also, workbooks created directly on Tableau Server (via the web interface) can use this source.

Visualize Azure Data Lake Storage Data in Tableau Server

With the Data Source published to Tableau Server, you are ready to visualize Azure Data Lake Storage data.

  1. Login to your Tableau Server instance.
  2. Connect to the remote source using the Search for Data -> Tableau Server in the Connect sidebar.
  3. Click the published Data Source.
  4. Click New Workbook.
  5. In the workbook, Azure Data Lake Storage fields are listed as Dimensions and Measures, depending on the data type. The CData Tableau Connector discovers data types automatically, allowing you to leverage the powerful data processing and visualization features of Tableau.
  6. Drag a field from the Dimensions or Measures area to Rows or Columns. Tableau creates column or row headers.
  7. Select one of the chart types from the Show Me tab. Tableau displays the chart type that you selected.

Using the CData Tableau Connector for Azure Data Lake Storage with Tableau Server, you can easily create robust visualizations and reports on Azure Data Lake Storage data. Download a free, 30-day trial and get started today.