Discover how a bimodal integration strategy can address the major data management challenges facing your organization today.
Get the Report →The HDFS ODBC Driver is a powerful tool that allows you to connect with live data from HDFS, directly from any applications that support ODBC connectivity.Access HDFS data like you would a database - read, write, and update HDFS HDFSData, etc. through a standard ODBC Driver interface.
Features
- SQL access to Hadoop Distributed File System data
- Use SQL Stored Procedures to perform actions like creating files, appending data, setting permission, and more
- Connect to live Apache HDFS data, for real-time data access
- Full support for data aggregation and complex JOINs in SQL queries
- Generate table schema automatically based on existing Apache HDFS data or manually for greater control of the content you need
- Seamless integration with leading BI, reporting, and ETL tools and with custom applications
Specifications
- Supports 32-bit and 64-bit applications.
- ODBC 3.8 compliant native protocol based drivers.
- Comprehensive ANSI SQL-92 support.
- Full Unicode Support - any language, any data.
- Cross-platform ODBC. Available for Windows, Linux and macOS.
Robust ODBC Access to Apache HDFS
Full-featured and consistent SQL access to any supported data source through ODBC
-
Standards-Compliant ODBC
ODBC 3.8 compliant native protocol based drivers.
-
Unicode, 32-bit and 64-bit support
Support 32-bit and 64-bit applications and unicode ODBC APIs.
-
Cross-platform ODBC (Windows/Linux/Mac)
Available for Microsoft Windows, Unix, and Mac (macOS / OS X) platforms.
-
Certified ODBC Compatibility*
Our drivers undergo extensive testing and are certified to be compatible with leading analytics and reporting applications like Tableau, Microsoft Excel, and many more.
* Please visit our knowledge base to access how-to articles. -
Embedded Remoting Capabilities
Our exclusive Remoting feature allows hosting the ODBC connection on a server to enable connections from various clients on any platform (Java, .NET, C++, PHP, Python, etc.), using any standards based technology (ODBC, JDBC, etc.). ODBC Remoting is enabled using the popular MySQL and SQL (TDS) wire protocol server.
-
String, Date, Numeric SQL Functions
The driver includes a library of 50 plus functions that can manipulate column values into the desired result. Popular examples include Regex, JSON, and XML processing functions.
-
Collaborative Query Processing
Our drivers enhance the data source's capabilities by additional client-side processing, when needed, to enable analytic summaries of data such as SUM, AVG, MAX, MIN, etc.
-
Easily Customizable and Configurable
The data model exposed by our ODBC Drivers can easily be customized to add or remove tables/columns, change data types, etc. without requiring a new build. These customizations are supported at runtime using human-readable schema files that are easy to edit.
-
Enterprise-class Secure Connectivity
Includes standard Enterprise-class security features such as TLS/ SSL data encryption for all client-server communications.
-
Replication and Caching
Our replication and caching commands make it easy to copy data to local and cloud data stores such as Oracle, SQL Server, Google Cloud SQL, etc. The replication commands include many features that allow for intelligent incremental updates to cached data.
Enterprise-Class Remoting
MySQL/SQL Database entry points for Apache HDFS Data
The CData ODBC drivers include powerful fully-integrated remote access capabilities that makes Apache HDFS data accessible from virtually anywhere. The drivers include the CData SQL Gateway, which can the optional ability to accept incoming SQL and MySQL client connections and service standard database requests.
With the CData SQL Gateway, users can interact with Apache HDFS data from any client that supports SQL Server or MySQL: from Web & mobile applications, to CRM and CMS systems, BI tools like SQL Server Analysis Services, and even through popular management applications like MySQL Workbench. Access Apache HDFS data from virtually any application that can access external data. Applications that can access SQL Server or MySQL data can now connect to Apache HDFS with this driver.
- Connect Apache HDFS data with popular BI tools like SQL Server Analysis Services.
- Enable enterprise Apache HDFS data integration through SQL Linked Server connectivity
- Includes support for the MySQL and SQL (TDS) remote access protocols, with advanced wire-protocol SSL security - industry standards for remote database connectivity.
- Easily connect to HDFS from code (PHP, Python, Ruby, Objective-C, etc.) using one of the standard client libraries for accessing SQL/MySQL.
- * Available exclusively in ODBC Server Licenses.
learn more
Frequently Asked HDFS ODBC Driver Questions
Learn more about HDFS ODBC drivers for data and analytics integration
Does HDFS support ODBC?
Not natively. However, CData offers an ODBC driver for HDFS that allows you to connect to HDFS data from any ODBC-compliant application, just like you would access a traditional database. This can be useful for tasks like:
- Accessing HDFS from applications: Connect to HDFS data in popular tools and applications including Microsoft Excel, Tableau, Power BI, and many others.
- Real-time data: You can work with live HDFS data within these applications, enabling tasks like reporting and analysis.
- Connecting systems: Build data integrations between HDFS and other systems.
The HDFS ODBC driver is an ODBC 3.8 compliant native protocol-based driver with comprehensive ANSI SQL-92 support. This means that virtually any application that can connect to data via ODBC, can use the CData ODBC driver for real-time integration. Download a fully functional free trial of the HDFS ODBC driver today to get started.
Is there an ODBC driver for HDFS?
Yes, the CData ODBC driver for HDFS provides universal ODBC data connectivity for HDFS. The HDFS ODBC driver offers a simple SQL-based layer of abstraction that simplifies real-time data access for users and applications, enabling them to communicate with HDFS using a standardized set of functions. Virtually any application on any platform (including Windows, macOS, or *nix) can use the CData ODBC driver for real-time integration.
How do I connect to HDFS via ODBC?
Connectivity to HDFS via ODBC is easy. First, download and install the HDFS ODBC driver. The download page includes separate installers for Windows, macOS, and Unix, so please select the HDFS ODBC driver that matches the platform where you intend to use the driver.
Once the installation is complete, navigate to the ODBC driver documentation page. Here, you'll find a wealth of information about the installed driver. The step-by-step instructions for creating a DSN and using it to connect to HDFS via ODBC are just the beginning. The documentation also provides extensive configuration details for using the HDFS ODBC driver with all your favorite applications and development tools, ensuring you have all the support you need.
Where can I download an ODBC driver for HDFS?
All of the CData ODBC drivers, including the HDFS ODBC driver are available for download online. To get started, download a fully functional free trial of the HDFS ODBC driver today.
How do I install an ODBC driver for HDFS?
To install the HDFS driver, simply download one of the HDFS ODBC driver installers available online. The download page includes separate Windows, macOS, and Unix installers, so please select the HDFS ODBC driver that matches the platform where you intend to use the driver. The installers are comprehensive setup utilities that will install all the components required to use the HDFS ODBC driver on your system.
Is the HDFS ODBC driver 32-bit or 64-bit?
The HDFS ODBC driver is available in both 32-bit and 64-bit binaries. The HDFS driver installer will allow you to install both options, and you can select the ODBC driver binary that is appropriate for the system where you intend to use the driver.
Does pyODBC work with HDFS?
Absolutely. The HDFS ODBC driver seamlessly supports pyODBC connectivity. For example, check out this article on our online knowledgebase that demonstrates how you can connect to HDFS from PyCharm using pyODBC and the HDFS ODBC driver.
However, besides ODBC support, we also provide native Python data connectivity for HDFS. To learn more, check out the HDFS Python Connector online.