Query Impala Data in DataGrip



Create a Data Source for Impala in DataGrip and use SQL to query live Impala data.

DataGrip is a database IDE that allows SQL developers to query, create, and manage databases. When paired with the CData JDBC Driver for Impala, DataGrip can work with live Impala data. This article shows how to establish a connection to Impala data in DataGrip and use the table editor to load Impala data.

Create a New Driver Definition for Impala

The steps below describe how to create a new driver definition in DataGrip for Impala.

  1. In DataGrip, click File -> New -> Project and name the project.
  2. In the Database Explorer, click the plus icon (+) and select Driver.
  3. In the Driver tab:
    • Set Name to a user-friendly name (e.g. "CData Impala Driver")
    • Set Driver Files to the appropriate JAR file. To add the file, click the plus icon (+), select "Add Files," navigate to the "lib" folder in the driver's installation directory, and select the JAR file (e.g. cdata.jdbc.apacheimpala.jar).
    • Set Class to cdata.jdbc.apacheimpala.ApacheImpalaDriver (the driver class inside the JAR, not the JAR file itself)
    Additionally, in the Advanced tab you can change driver properties and other settings such as VM options, VM environment, VM home path, and DBMS.
    • In most cases, change the DBMS type to "Unknown" under Expert options so that DataGrip does not send native SQL Server (Transact-SQL) queries, which can result in an invalid function error.
  4. Click "Apply" then "OK" to save the driver definition.
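Before pointing DataGrip at the JAR, it can help to confirm that the driver class actually loads from it. The following is a minimal sketch, not part of the driver itself: DriverCheck and isOnClasspath are hypothetical names, and the default class name is an assumption based on CData's usual cdata.jdbc.<product>.<Product>Driver naming, so pass a different name as an argument if your JAR differs.

```java
public class DriverCheck {
    // Assumed driver class name; verify it against the contents of your JAR.
    public static final String ASSUMED_DRIVER_CLASS =
            "cdata.jdbc.apacheimpala.ApacheImpalaDriver";

    /** Returns true if the named class can be loaded from the current classpath. */
    public static boolean isOnClasspath(String className) {
        try {
            Class.forName(className);
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // The class is missing, or one of its dependencies failed to load.
            return false;
        }
    }

    public static void main(String[] args) {
        // Use the first argument as the class name if one is given.
        String name = args.length > 0 ? args[0] : ASSUMED_DRIVER_CLASS;
        String status = isOnClasspath(name) ? "loaded" : "not found on classpath";
        System.out.println(name + ": " + status);
    }
}
```

Compile the class and run it with the driver JAR on the classpath, for example java -cp cdata.jdbc.apacheimpala.jar:. DriverCheck (use ; instead of : as the separator on Windows).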

Configure a Connection to Impala

  1. Once the driver definition is saved, click the plus icon (+), then "Data Source," then "CData Impala Driver" to create a new Impala Data Source.
  2. In the new window, configure the connection to Impala with a JDBC URL.

    Built-in Connection String Designer

    For assistance in constructing the JDBC URL, use the connection string designer built into the Impala JDBC Driver. Either double-click the JAR file or execute it from the command line:

    java -jar cdata.jdbc.apacheimpala.jar

    Fill in the connection properties and copy the connection string to the clipboard.

    In order to connect to Apache Impala, set the Server, Port, and ProtocolVersion. You may optionally specify a default Database. To connect using alternative methods, such as NOSASL, LDAP, or Kerberos, refer to the online Help documentation.

  3. Set URL to the connection string, e.g., jdbc:apacheimpala:Server=127.0.0.1;Port=21050;
  4. Click "Apply" then "OK" to save the data source.

At this point, you will see the data source in the Database Explorer.
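The JDBC URL from step 3 is the jdbc:apacheimpala: prefix followed by Name=Value pairs, each terminated by a semicolon. As a rough illustration of that format, the sketch below assembles the URL from a map of properties. ImpalaUrlBuilder and buildUrl are hypothetical helper names, not part of the driver, and the sketch assumes no property value itself contains a semicolon.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class ImpalaUrlBuilder {
    // URL prefix taken from the example connection string in this article.
    public static final String PREFIX = "jdbc:apacheimpala:";

    /** Joins properties as Name=Value; pairs after the prefix, in insertion order. */
    public static String buildUrl(Map<String, String> props) {
        StringBuilder url = new StringBuilder(PREFIX);
        for (Map.Entry<String, String> entry : props.entrySet()) {
            url.append(entry.getKey())
               .append('=')
               .append(entry.getValue())
               .append(';');
        }
        return url.toString();
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps the properties in the order they were added.
        Map<String, String> props = new LinkedHashMap<>();
        props.put("Server", "127.0.0.1");
        props.put("Port", "21050");
        // Optional properties such as Database or ProtocolVersion
        // could be added to the map in the same way.
        System.out.println(buildUrl(props));
    }
}
```

Run as written, this prints the same string as the example in step 3, which can then be pasted into the URL field in DataGrip.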

Execute SQL Queries Against Impala

To browse through the Impala entities (available as tables) accessible through the JDBC Driver, expand the Data Source.

To execute queries, right-click any table and select "New" -> "Query Console."

In the Console, write the SQL query you wish to execute. For example: SELECT City, CompanyName FROM Customers WHERE Country = 'US'
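The same query can also be run outside DataGrip through plain JDBC, which is a convenient way to check the connection string independently of the IDE. This is a sketch under assumptions: ImpalaQueryExample is a hypothetical class name, the default URL is the example from this article, the Customers table is the one used in the example query, and the driver JAR is assumed to be on the classpath and to register itself with DriverManager.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

public class ImpalaQueryExample {
    // The example query from this article.
    public static final String QUERY =
            "SELECT City, CompanyName FROM Customers WHERE Country = 'US'";

    /** Runs QUERY on the given connection and prints one line per row. */
    public static void printCustomers(Connection conn) throws SQLException {
        try (Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery(QUERY)) {
            while (rs.next()) {
                System.out.println(rs.getString("City") + ", " + rs.getString("CompanyName"));
            }
        }
    }

    public static void main(String[] args) throws SQLException {
        // Use the first argument as the JDBC URL, else the article's example URL.
        String url = args.length > 0
                ? args[0]
                : "jdbc:apacheimpala:Server=127.0.0.1;Port=21050;";
        // Throws SQLException if no registered driver accepts the URL.
        try (Connection conn = DriverManager.getConnection(url)) {
            printCustomers(conn);
        }
    }
}
```

If DriverManager reports that no suitable driver was found, the JAR is most likely missing from the classpath.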

Download a free, 30-day trial of the CData JDBC Driver for Impala and start working with your live Impala data in DataGrip. Reach out to our Support Team if you have any questions.
