Discover how a bimodal integration strategy can address the major data management challenges facing your organization today.
Get the Report →How to Connect DBeaver to HDFS via a JDBC Driver
Manage HDFS data with visual tools in DBeaver like the query browser.
The CData JDBC Driver for HDFS implements JDBC standards that enable third-party tools to interoperate, from wizards in IDEs to business intelligence tools. This article shows how to connect to HDFS data with wizards in DBeaver and browse data in the DBeaver GUI.
Create a JDBC Data Source for HDFS Data
Follow the steps below to load the driver JAR in DBeaver.
- Open the DBeaver application and, in the "Database" menu, select the "Driver Manager" option. Click "New" to open the "Create new driver" form.
- In the Settings tab:
- Set Driver Name to a user-friendly name for the driver (e.g. CData JDBC Driver for HDFS).
- Set Class Name to the class name for the JDBC driver: cdata.jdbc.hdfs.HDFSDriver.
- Set URL Template to jdbc:hdfs:.
- In the Libraries tab, click "Add File," navigate to the "lib" folder in the installation directory (C:\Program Files\CData[product_name] XXXX\) and select the JAR file (cdata.jdbc.HDFS.jar).
Create a Connection to HDFS Data
Follow the steps below to add credentials and other required connection properties.
- In the "Database" menu, click "New Database Connection."
- In the "Connect to a database" wizard that results, select the driver you just created (e.g. CData JDBC Driver for HDFS) and click "Next >."
- On the Main tab of the configuration wizard, set the JDBC URL, using the required connection properties:
In order to authenticate, set the following connection properties:
- Host: Set this value to the host of your HDFS installation.
- Port: Set this value to the port of your HDFS installation. Default port: 50070
Built-in Connection String Designer
For assistance in constructing the JDBC URL, use the connection string designer built into the HDFS JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.
java -jar cdata.jdbc.hdfs.jar
Fill in the connection properties and copy the connection string to the clipboard.
Below is a typical connection string:
jdbc:hdfs:Host=sandbox-hdp.hortonworks.com;Port=50070;Path=/user/root;User=root;
- Click "Test Connection ..." to ensure you have configured the connection properly.
- Click "Finish."
Query HDFS Data
You can now query information from the tables exposed by the connection: Right-click a Table and then click View Table. The data is available on the Data tab.