Replicate Google Data Catalog Data from PowerShell



Write a quick PowerShell script to query Google Data Catalog data. Use connectivity to the live data to replicate Google Data Catalog data to SQL Server.



The CData ODBC Driver for Google Data Catalog enables out-of-the-box integration with Microsoft's built-in support for ODBC. The ODBC driver instantly integrates connectivity to the real Google Data Catalog data with PowerShell.

You can use the .NET Framework Provider for ODBC built into PowerShell to quickly automate integration tasks like replicating Google Data Catalog data to other databases. This article shows how to replicate Google Data Catalog data to SQL Server in 5 lines of code.

You can also write PowerShell code to download Google Data Catalog data. See the examples below.

Create an ODBC Data Source for Google Data Catalog

If you have not already, first specify connection properties in an ODBC DSN (data source name). This is the last step of the driver installation. You can use the Microsoft ODBC Data Source Administrator to create and configure ODBC DSNs.

Google Data Catalog uses the OAuth authentication standard. Authorize access to Google APIs on behalf on individual users or on behalf of users in a domain.

Before connecting, specify the following to identify the organization and project you would like to connect to:

  • OrganizationId: The ID associated with the Google Cloud Platform organization resource you would like to connect to. Find this by navigating to the cloud console.

    Click the project selection drop-down, and select your organization from the list. Then, click More -> Settings. The organization ID is displayed on this page.

  • ProjectId: The ID associated with the Google Cloud Platform project resource you would like to connect to.

    Find this by navigating to the cloud console dashboard and selecting your project from the Select from drop-down. The project ID will be present in the Project info card.

When you connect, the OAuth endpoint opens in your default browser. Log in and grant permissions to the application to completes the OAuth process. For more information, refer to the OAuth section in the Help documentation.

Connect to Google Data Catalog

The code below shows how to use the DSN to initialize the connection to Google Data Catalog data in PowerShell:

$conn = New-Object System.Data.Odbc.OdbcConnection $conn.ConnectionString = "DSN=CData GoogleDataCatalog Source x64"

Back Up Google Data Catalog Data to SQL Server

After you enable caching, you can use the code below to replicate data to SQL Server.

Set the following connection properties to configure the caching database:

  • CacheProvider: The name of the ADO.NET provider. This can be found in the Machine.config for your version of .NET. For example, to configure SQL Server, enter System.Data.SqlClient.

  • CacheConnection: The connection string of properties required to connect to the database. Below is an example for SQL Server:

    Server=localhost;Database=RSB;User Id=sqltest;Password=sqltest;

The SQL query in the example can be used to refresh the entire cached table, including its schema. Any already existing cache is deleted.

$conn.Open() # Create and execute the SQL Query $SQL = "CACHE DROP EXISTING SELECT * FROM " + $Schemas $cmd = New-Object System.Data.Odbc.OdbcCommand($sql,$conn) $count = $cmd.ExecuteNonQuery() $conn.Close()

The driver gives you complete control over the caching functionality. See the help documentation for more caching commands and usage examples. See the help documentation for steps to replicate to other databases.

Other Operations

To retrieve Google Data Catalog data in PowerShell, call the Fill method of the OdbcDataAdapter method. To execute data manipulation commands, initialize the OdbcCommand object and then call ExecuteNonQuery. Below are some more examples commands to Google Data Catalog through the .NET Framework Provider for ODBC:

Retrieve Google Data Catalog Data

$sql="SELECT Type, DatasetName from Schemas" $da= New-Object System.Data.Odbc.OdbcDataAdapter($sql, $conn) $dt= New-Object System.Data.DataTable $da.Fill($dt) $dt.Rows | foreach { $dt.Columns | foreach ($col in dt{ Write-Host $1[$_] } }

Ready to get started?

Download a free trial of the Google Data Catalog ODBC Driver to get started:

 Download Now

Learn more:

Google Data Catalog Icon Google Data Catalog ODBC Driver

The Google Data Catalog ODBC Driver is a powerful tool that allows you to connect with live data from Google Data Catalog, directly from any applications that support ODBC connectivity.

Access Google Data Catalog data like you would a database - read, write, and update Google Data Catalog Schemas, Tables, etc. through a standard ODBC Driver interface.