HDFS Integration Guides and Tutorials



A list of guides and tutorials for connecting to and working with live HDFS data.

CData Software's connectivity tools enable users to connect directly to live HDFS data data from widely-used BI, analytics, ETL, and custom applications, ensuring that our customers can access their data wherever they desire. Below, you'll find a collection of guides and tutorials on integrating with live HDFS data.

Integration Use-Cases

Click below to jump to articles related to specific integration use-case.

Business Intelligence & Analytics


ProductTechnologyArticle Title
Alteryx DesignerODBCPrepare, Blend, and Analyze HDFS in Alteryx Designer (ODBC)
Amazon QuickSightConnect ServerBuild Interactive Dashboards from HDFS Data in Amazon QuickSight
Aqua Data StudioJDBCConnect to HDFS in Aqua Data Studio
AWS DatabricksJDBCProcess & Analyze HDFS Data in Databricks (AWS)
BirstJDBCBuild Visualizations of HDFS in Birst
BIRTJDBCDesign BIRT Reports on HDFS
Clear AnalyticsODBCBuild Charts with HDFS in Clear Analytics
DBxtraODBCBuild Dashboards with HDFS in DBxtra
DomoODBCCreate Datasets from HDFS in Domo Workbench
Dundas BIODBCBuild Dashboards with HDFS in Dundas BI
Excel (on Mac OS)ODBCWork with HDFS Data in MS Excel on Mac OS X
FineReportJDBCFeed HDFS into FineReport
IBM Cognos BIODBCCreate Data Visualizations in Cognos BI with HDFS
Infragistics RevealConnect ServerAnalyze HDFS Data in Infragistics Reval
JasperServerJDBCCreate HDFS Reports on JasperReports Server
Jaspersoft BI SuiteJDBCConnect to HDFS in Jaspersoft Studio
JReport DesignerJDBCIntegrate with HDFS in JReport Designer
KlipfolioConnect ServerCreate HDFS-Connected Visualizations in Klipfolio
KNIMEJDBCEnable the HDFS JDBC Driver in KNIME
LINQPadADO.NETWorking with HDFS in LINQPad
Microsoft SSASADO.NETBuild an OLAP Cube in SSAS from HDFS
MicroStrategyConnect ServerConnect to Live HDFS Data in MicroStrategy through Connect Server
MicroStrategyJDBCUse the CData JDBC Driver for HDFS in MicroStrategy
Microstrategy DesktopJDBCUse the CData JDBC Driver for HDFS in MicroStrategy Desktop
Microstrategy WebJDBCUse the CData JDBC Driver for HDFS in MicroStrategy Web
OBIEEJDBCHDFS Reporting in OBIEE with the HDFS JDBC Driver
pandasPythonUse pandas to Visualize HDFS in Python
Pentaho Report DesignerJDBCIntegrate HDFS in the Pentaho Report Designer
Power BI DesktopPower BIAuthor Power BI Reports on Real-Time HDFS
Power BI ServiceConnect ServerVisualize Live HDFS Data in the Power BI Service
Power PivotConnect ServerAccess HDFS Data in Microsoft Power Pivot
Power QueryConnect ServerAccess HDFS Data in Microsoft Power Query
Qlik CloudConnect ServerCreate Apps from HDFS Data in Qlik Sense Cloud
QlikViewODBCConnect to and Query HDFS in QlikView over ODBC
RJDBCAnalyze HDFS in R (JDBC)
RODBCAnalyze HDFS in R (ODBC)
RapidMinerJDBCConnect to HDFS in RapidMiner
RedashConnect ServerQuery, Visualize, and Share live HDFS Data in Redash
SAP Analytics CloudConnect ServerAnalyze HDFS Data in SAP Analytics Cloud
SAP Business ObjectsJDBCCreate an SAP BusinessObjects Universe on the CData JDBC Driver for HDFS
SAP Crystal ReportsJDBCPublish Reports with HDFS in Crystal Reports (JDBC)
SASODBCUse the CData ODBC Driver for HDFS in SAS for Real-Time Reporting and Analytics
SAS JMPODBCUse the CData ODBC Driver for HDFS in SAS JMP
SisenseJDBCVisualize Live HDFS in Sisense
Spago BIJDBCConnect to HDFS in SpagoBI
TableauTableauVisualize HDFS in Tableau Desktop
TableauConnect ServerVisualize HDFS in Tableau Desktop (Connect Server)
Tableau CloudConnect ServerBuild HDFS Visualizations in Tableau Cloud
Tableau ServerTableauPublish HDFS-Connected Dashboards in Tableau Server
TIBCO SpotfireADO.NETVisualize HDFS in TIBCO Spotfire through ADO.NET
TIBCO SpotfireConnect ServerVisualize HDFS Data in TIBCO Spotfire
TIBCO Spotfire ServerJDBCOperational Reporting on HDFS from Spotfire Server

Back to top

ETL & Replication


ProductTechnologyArticle Title
Amazon RedshiftCData SyncAutomated Continuous HDFS Replication to Amazon Redshift
Amazon S3CData SyncAutomated Continuous HDFS Replication to Amazon S3
Apache AirflowJDBCBridge HDFS Connectivity with Apache Airflow
Apache CamelJDBCIntegrate with HDFS using Apache Camel
Apache CassandraCData SyncAutomated Continuous HDFS Replication to Apache Cassandra
Apache KafkaCData SyncAutomated Continuous HDFS Replication to Apache Kafka
Apache NiFiJDBCBridge HDFS Connectivity with Apache NiFi
Azure Data LakeCData SyncAutomated Continuous HDFS Replication to Azure Data Lake
Azure SynapseCData SyncAutomated Continuous HDFS Replication to Azure Synapse
BIMLSSISUse Biml to Build SSIS Tasks to Replicate HDFS to SQL Server
CloverDXJDBCConnect to HDFS in CloverDX (formerly CloverETL)
CouchbaseCData SyncAutomated Continuous HDFS Replication to Couchbase
CSVCData SyncAutomated Continuous HDFS Replication to Local Delimited Files
DatabricksCData SyncAutomated Continuous HDFS Replication to Databricks
ETL ValidatorJDBCHow to Work with HDFS in ETL Validator
FoxProODBCWork with HDFS in FoxPro
Google AlloyDBCData SyncAutomated Continuous HDFS Replication to Google AlloyDB
Google BigQueryCData SyncAutomated Continuous HDFS Replication to Google BigQuery
Google Cloud SQLCData SyncAutomated Continuous HDFS Replication to Google Cloud SQL
Google Data FusionJDBCBuild HDFS-Connected ETL Processes in Google Data Fusion
Heroku / Salesforce ConnectCData SyncReplicate HDFS for Use in Salesforce Connect
HULFT IntegrateJDBCConnect to HDFS in HULFT Integrate
IBM DB2CData SyncAutomated Continuous HDFS Replication to IBM DB2
Informatica CloudJDBCIntegrate HDFS in Your Informatica Cloud Instance
Informatica PowerCenterJDBCCreate Informatica Mappings From/To a JDBC Data Source for HDFS
Jaspersoft ETLJDBCConnect to HDFS in Jaspersoft Studio
Microsoft AccessCData SyncAutomated Continuous HDFS Replication to Microsoft Access
Microsoft Azure TablesCData SyncAutomated Continuous HDFS Replication to Azure SQL
Microsoft Power AutomateConnect ServerBuild HDFS-Connected Automated Tasks with Power Automate (Desktop)
MongoDBCData SyncAutomated Continuous HDFS Replication to MongoDB
MySQLCData SyncAutomated Continuous HDFS Replication to MySQL
Oracle Data IntegratorJDBCETL HDFS in Oracle Data Integrator
Oracle DatabaseCData SyncAutomated Continuous HDFS Replication to Oracle
petlPythonExtract, Transform, and Load HDFS in Python
PostgreSQLCData SyncAutomated Continuous HDFS Replication to PostgreSQL
Replicate to MySQLPowerShellReplicate HDFS to MySQL with PowerShell
SAP HANACData SyncAutomated Continuous HDFS Replication to SAP HANA
SingleStoreCData SyncAutomated Continuous HDFS Replication to SingleStore
SnapLogicJDBCIntegrate HDFS with External Services using SnapLogic (JDBC)
SnowflakeCData SyncAutomated Continuous HDFS Replication to Snowflake
SQL ServerCData SyncAutomated Continuous HDFS Replication to SQL Server
SQL Server Linked ServerConnect ServerConnect to HDFS Data as a SQL Server Linked Server
SQLiteCData SyncAutomated Continuous HDFS Replication to SQLite
TalendJDBCConnect to HDFS and Transfer Data in Talend
UiPath StudioODBCCreate an RPA Flow that Connects to HDFS in UiPath Studio
VerticaCData SyncAutomated Continuous HDFS Replication to a Vertica Database

Back to top

Data Virtualization



Back to top

Software Development


ProductTechnologyArticle Title
AWS LambdaJDBCAccess Live HDFS Data in AWS Lambda
.NET ChartsADO.NETDataBind Charts to HDFS
.NET QueryBuilderODBCRapidly Develop HDFS-Driven Apps with Active Query Builder
Angular JSConnect ServerUsing AngularJS to Build Dynamic Web Pages with HDFS
Apache SparkJDBCWork with HDFS in Apache Spark Using SQL
AppSheetConnect ServerCreate HDFS-Connected Business Apps in AppSheet
C++BuilderODBCDataBind Controls to HDFS Data in C++Builder
ColdFusionJDBCQuery HDFS in ColdFusion Using JDBC
ColdFusionODBCQuery HDFS in ColdFusion Using ODBC
DashPythonUse Dash & Python to Build Web Apps on HDFS
DelphiODBCDataBind Controls to HDFS Data in Delphi
DevExpressADO.NETDataBind HDFS to the DevExpress Data Grid
EF - Code FirstADO.NETAccess HDFS with Entity Framework 6
EF - LINQADO.NETLINQ to HDFS
EF - MVCADO.NETBuild MVC Applications with Connectivity to HDFS
Filemaker ProODBCBidirectional Access to HDFS from FileMaker Pro
Filemaker Pro (on Mac)JDBCBidirectional Access to HDFS from FileMaker Pro (on Mac)
GoODBCWrite a Simple Go Application to work with HDFS on Linux
Google Apps ScriptConnect ServerConnect to HDFS Data in Google Apps Script
HibernateJDBCObject-Relational Mapping (ORM) with HDFS Entities in Java
IntelliJJDBCConnect to HDFS in IntelliJ
JBossJDBCConnect to HDFS from a Connection Pool in JBoss
JDBIJDBCCreate a Data Access Object for HDFS using JDBI
JRubyJDBCConnect to HDFS in JRuby
MendixJDBCBuild HDFS-Connected Apps in Mendix (JDBC)
Microsoft Power AppsConnect ServerIntegrate Live HDFS Data into Custom Business Apps Built in Power Apps
NodeJSConnect ServerQuery HDFS Data in Node.js (via Connect Server)
NodeJSODBCQuery HDFS through ODBC in Node.js
PHPConnect ServerAccess HDFS in PHP through Connect Server
PHPODBCNatively Connect to HDFS in PHP
PowerBuilderADO.NETConnect to HDFS from PowerBuilder
PowerShellPowerShellPipe HDFS to CSV in PowerShell
PyCharmODBCUsing the CData ODBC Driver for HDFS in PyCharm
PythonODBCConnect to HDFS in Python on Linux/UNIX
ReactConnect ServerBuild Dynamic React Apps with HDFS Data
RubyODBCConnect to HDFS in Ruby
RunMyProcessConnect ServerConnect to HDFS Data in RunMyProcess
RunMyProcess DSECJDBCConnect to HDFS in DigitalSuite Studio through RunMyProcess DSEC
SAP UI5Connect ServerIntegrate Real-Time Access to HDFS in SAPUI5 MVC Apps
ServoyJDBCBuild HDFS-Connected Apps in Servoy
Spring BootJDBCAccess Live HDFS Data in Spring Boot Apps
SQLAlchemyPythonUse SQLAlchemy ORMs to Access HDFS in Python
TomcatJDBCConfigure the CData JDBC Driver for HDFS in a Connection Pool in Tomcat
UnqorkConnect ServerCreate HDFS-Connected Applications in Unqork
VCL App (RAD Studio)ODBCBuild a Simple VCL Application for HDFS
WebLogicJDBCConnect to HDFS from a Connection Pool in WebLogic

Back to top

Data Management



Back to top

Workflow Automation



Back to top

Ready to get started?

Learn more:

HDFS Connectivity Solutions