TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  ·...

26
The Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver Quick Start for Windows Release 8.0.1

Transcript of TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  ·...

Page 1: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

The Progress DataDirect forODBC for Apache Hive WireProtocol DriverQuick Start for Windows

Release 8.0.1

Page 2: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect
Page 3: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

Copyright

© 2018 Progress Software Corporation and/or one of its subsidiaries or affiliates. Allrights reserved.These materials and all Progress® software products are copyrighted and all rights are reserved by ProgressSoftware Corporation. The information in these materials is subject to change without notice, and ProgressSoftware Corporation assumes no responsibility for any errors that may appear therein. The references in thesematerials to specific platforms supported are subject to change.

Corticon, DataDirect (and design), DataDirect Cloud, DataDirect Connect, DataDirect Connect64, DataDirectXML Converters, DataDirect XQuery, DataRPM, Deliver More Than Expected, Icenium, Kendo UI, NativeScript,OpenEdge, Powered by Progress, Progress, Progress Software Developers Network, Rollbase, SequeLink,Sitefinity (and Design), SpeedScript, Stylus Studio, TeamPulse, Telerik, Telerik (and Design), Test Studio, andWebSpeed are registered trademarks of Progress Software Corporation or one of its affiliates or subsidiariesin the U.S. and/or other countries. Analytics360, AppServer, BusinessEdge, DataDirect Spy, SupportLink,DevCraft, Fiddler, JustAssembly, JustDecompile, JustMock, Kinvey, NativeScript Sidekick, OpenAccess,ProDataSet, Progress Results, Progress Software, ProVision, PSE Pro, Sitefinity, SmartBrowser,SmartComponent, SmartDataBrowser, SmartDataObjects, SmartDataView, SmartDialog, SmartFolder,SmartFrame, SmartObjects, SmartPanel, SmartQuery, SmartViewer, SmartWindow, and WebClient aretrademarks or service marks of Progress Software Corporation and/or its subsidiaries or affiliates in the U.S.and other countries. Java is a registered trademark of Oracle and/or its affiliates. Any other marks containedherein may be trademarks of their respective owners.Please refer to the readme applicable to the particular Progress product release for any third-partyacknowledgements required to be provided in the documentation associated with the Progress product.

Updated: 2018/06/14

3Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.1

Page 4: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.14

Copyright

Page 5: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

Quick Start: Progress DataDirect for ODBCfor Apache Hive Wire Protocol Driver forWindows

This quick start provides basic information that allows you to install, test connect with your driver, and accessyour data with some commonly used third-party applications . To take full advantage of the features andfunctionality available for your driver, refer to the product documentation library at:

Progress DataDirect Documentation Library

Note: OEMCUSTOMERS: Refer to theProgress DataDirect for ODBCDrivers Distribution Guide for informationon installing, branding, unlocking, and distributing your branded drivers.

This quick start covers the following topics:

• Before You Start on page 5

• Requirements and Support on page 6

• Downloading the Driver on page 6

• Installing the ODBC Driver on Windows on page 7

• Configuring a Data Source on page 10

• Testing the Connection on page 13

• Tuning for Performance on page 14

• Accessing Data With Third-Party Applications on page 16

• Troubleshooting Setup/Connection Issues on page 23

• Additional Resources on page 24

Before You StartBefore you get started, you need the following:

• Appropriate user permissions to modify your environment and to read, write, and execute various files inthe DataDirect for ODBC installation directory .

• Connection information:

• Host Name: The name or the IP address of the server to which you want to connect.

• Database: The name of the Apache Hive database to which you want to connect by default. The databasemust exist, or the connection attempt will fail.

• User Name: If required by your Apache Hive configuration, the default user ID that is used to connectto your database.

5Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.1

Page 6: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

• Port Number: The port number of the server listener.

• (For HTTP connections) HTTP/HTTPS Endpoint: The path of the HTTP/HTTPS endpoint used forconnections.

• For licensed installations, you will also need the following information that was provided by ProgressDataDirect:

• IPE Key (control number)

• Serial Number

Requirements and SupportDriver/Client Software Requirements:

• The driver has no client requirements.

Supported Data Sources:

• Supports Apache Hive version 1.0 and higher against the following distributions:

• Amazon Elastic MapReduce (Amazon EMR), version 4.0 and higher

• Apache Hadoop Hive, version 1.0 and higher

• Cloudera's Distribution Including Apache Hadoop (CDH), version 5.4 and higher

• Hortonworks Data Platform (HDP), version 2.3 and higher

• IBM BigInsights, version 4.0 and higher

• MapR Distribution for Apache Hadoop, version 5.0 and higher

• Pivotal HD Enterprise (PHD), version 3.0 and higher

For the latest information on supported data sources, visit the Progress DataDirect Supported Configurationspage.

Downloading the DriverTo download the ODBC Apache Hive driver:

1. Visit the Progress DataDirect Connectors Download page.

2. Enter Hive in the search field; then, select your data source from the list.

3. Select ODBC for the interface.

4. Select the OS and architecture platform for your application:

• Windows 32-bit• Windows 64-bit

Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.16

Chapter 1: Quick Start: Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver for Windows

Page 7: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

5. Fill in the registration form with your contact information.

6. Review the End User License Agreement. If you agree, select the corresponding box; then, clickDownload.

The installer program has been downloaded. See "Installing the ODBC Driver on Windows" for instructions oninstalling the driver.

See alsoInstalling the ODBC Driver on Windows on page 7

Installing the ODBC Driver on WindowsNote: OEMCUSTOMERS: Refer to the Progress DataDirect for ODBCDrivers Distribution Guide for informationon installing, branding, unlocking, and distributing your branded drivers.

This section provides instructions for installing the driver using a GUI. For more detailed installation instructions,refer to the Progress DataDirect for ODBC Drivers Installation Guide.

To install the drivers from downloaded files:

1. Unzip the files, maintaining the directory structure in the zip file, to a temporary directory, for example:c:\TEMP.

2. From Windows Explorer, navigate to the temporary directory that contains the unzipped installer files; then,double-click the installer program:

For 32-bit drivers, the installer program is named:

PROGRESS_DATADIRECT_ODBC_nn_WIN_32_INSTALL.exe

For 64-bit drivers, the installer program is named:

PROGRESS_DATADIRECT_ODBC_nn_WIN_64_INSTALL.exe

3. The Introduction window for the product installation appears. Click Next to continue.

4. The License Agreement window appears. Make sure that you read and understand the license agreement.To continue with the installation, select the I accept the terms of the License Agreement option; then,click Next.

Note: You can exit the installer program at any time by clicking Cancel or return to the previous windowby clicking Previous.

5. The Install Directory window appears. In theWhereWould You Like to Install? field, type the path, includingthe drive letter, of the product installation directory or click the Choose... button to browse to and select aninstallation directory.

• If you are running the 32-bit installer on 64-bit Windows, the default value for the installation directory isC:\Program Files (x86)\Progress\DataDirect\ODBC_nn.

• For all other installations, the default value for the installation directory for a is C:\ProgramFiles\Progress\DataDirect\ODBC_nn

Verify that you have entered (or selected) the correct installation directory. Then, click Next to continue.

To restore the installation directory to its default setting, click Restore Default.

7Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.1

Installing the ODBC Driver on Windows

Page 8: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

6. Choose the type of installation to perform. Select one of the following options:

• Evaluation Installation (will expire in 15 days). Select this option to install an evaluation version ofthe driver. Click Next to continue with the installation. Skip to Step 10 on page 9.

• OEM/Licensed Installation. Select this option if you purchased a licensed version of one or multipledrivers. Click Next. Proceed to the next step.

Note: OEM CUSTOMERS: Refer to the Progress DataDirect for ODBC Drivers Distribution Guide forinformation on installing, branding, unlocking, and distributing your branded drivers.

7. The Product Registration window appears.

Type the IPE key (control number) that was provided by Progress DataDirect in the IPE Key field and clickthe Validate button. A tree menu of drivers with valid licenses appears in the selection box. You can addone or multiple keys, one at a time. For example, the previous image demonstrates the successful validationof the MongoDB driver key.

8. From the tree menu, select the drivers that you want to install. Drivers that are already installed are listedin the Drivers (Installed) tree and cannot be deselected. To remove installed drivers, you must uninstall theproduct. Click Next to continue.

Note: If you are installing a different version of a currently installed driver, the installer will overwrite theexisting driver files with the version that you are installing.

9. The Product Registration window appears.

Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.18

Chapter 1: Quick Start: Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver for Windows

Page 9: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

Provide the following information:

a) Type your name and company name in the corresponding fields.

b) Type the serial number that was provided by Progress DataDirect. Click Next to continue. Skip to Step11 on page 10.

10. The Driver Evaluation Selection window appears.

From the Drivers To Be Evaluated tree, select the drivers that you want to install. Drivers that are alreadyinstalled are listed in the Drivers (Installed) tree and cannot be deselected. For example, the previous imagedemonstrates the installation of evaluation MongoDB driver.

9Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.1

Installing the ODBC Driver on Windows

Page 10: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

11. The Create Default Default Data Source window appears. Select the Create Default Data Source box todefine a user data source in HKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INI for each DataDirectdriver that you install.

Warning: If you select Create Default Data Sources, existing data sources in your registry with thesame DataDirect default name will be overwritten. To maintain your current DataDirect default datasources, rename them before you continue.

Then, click Next to continue.

12. The Pre-Installation Summary window provides the opportunity to review the information you have entered.When you are satisfied with your installation or branding option selections, click Install to begin the installation.

13. When the installation finishes, the Install Complete window appears. You may be prompted to restart yoursystem, but it is not required to use the product. Select one of the following options:

• Yes, restart my system. Select this option if you are ready to restart your system. The installer willrestart your system when you exit the program.

• No, I will restart my systemmyself. Select this option if you prefer to use the product without restartingthe system, or if you are not ready to restart the system.

Click Done to exit the installer program.

This completes the installation.

A Progress DataDirect program group is created as part of the installation. This program group provides thefollowing shortcuts:

• ODBC Administrator

• ODBC Driver Help

• ODBC Performance Tuning Wizard

• ODBC Readme

• Processor Information Utility

• Uninstall Progress DataDirect for ODBC x.x - 32 bit [32-bit only]

• Uninstall Progress DataDirect for ODBC x.x - 64 bit [64-bit only]

• XML Persistence Demo [32-bit only]

After installation, the next step is to configure your data source. See "Configuring a Data Source" for details.

See alsoConfiguring a Data Source on page 10

Configuring a Data SourceTo configure a basic data source to test your connection:

1. From the Progress DataDirect program group, start the ODBC Administrator.

Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.110

Chapter 1: Quick Start: Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver for Windows

Page 11: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

2. Click either the User DSN, System DSN, or File DSN tab to display a list of data sources.

• User DSN: If you installed a default DataDirect ODBC user data source as part of the installation, selectthe appropriate data source name and click Configure to display the driver Setup dialog box.

If you are configuring a new user data source, click Add to display a list of installed drivers. Select theApache Hive Wire Protocol driver and click Finish to display the driver Setup dialog box.

• System DSN: To configure a new system data source, click Add to display a list of installed drivers.Select the Apache Hive Wire Protocol driver and click Finish to display the driver Setup dialog box.

• File DSN: To configure a new file data source, click Add to display a list of installed drivers. Select theApache Hive Wire Protocol driver and click Advanced to specify attributes; otherwise, click Next toproceed. Specify a name for the data source and click Next. Verify the data source information; then,click Finish to display the driver Setup dialog box.

3. The General tab of the Setup dialog box appears by default.

11Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.1

Configuring a Data Source

Page 12: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

Note: The General tab displays the fields that are required for creating a data source. The fields on allother tabs are optional.

On the General tab, provide the following information; then, click Apply.

• Data Source Name: Type a string that identifies this data source configuration, such as Accounting.

• Description: Type an optional long description of a data source name, such as My Accounting Database.

• Host: Type either the name or the IP address of the server to which you want to connect.

• Port Number: Type the port number of the server listener. The default port number for the Apache Hiveserver is 10000. Because of reported concurrency issues, you might want to use a different port number.

• Database Name: Type the name of the Apache Hive database to which you want to connect by default.The database must exist, or the connection attempt will fail.

• Transport Mode: If you are connecting to an HTTP endpoint, set to 1 - HTTP. By default, the driverattempts to establish a binary connection (TCP mode).

• HTTPPath: If Transport Mode is set to 1 - HTTP, type the path of the HTTP/HTTPS endpoint used forconnections. The default is cliservice.

4. If your database configuration requires a user name, on the Security tab, provide the following information;then, click Apply.User Name: Type the default user ID that is used to connect to your database.

Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.112

Chapter 1: Quick Start: Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver for Windows

Page 13: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

5. For Microsoft Access and Tableau users, click on the Advanced tab. Provide the following required valuefor the Max String Size option; then, click Apply.

• For Microsoft Access: Type 255.

• For Tableau: Type a value from 255 to 4000 that suits your environment.

6. Your driver is now configured with the basic settings necessary to connect your driver. Choose from thefollowing:

• If you would like to test the connection, proceed to "Testing the Connection."• If you would like to tune your driver for performance, proceed to "Tuning for Performance."• If you would like to save the values you have specified and close the setup dialog, clickOK. These values

will be used as the default settings when you connect to a data source.

See alsoTuning for Performance on page 14Testing the Connection on page 13

Testing the ConnectionTo test the connection:

1. After you have configured the data source, you can click Test Connect on the Setup dialog box to attemptto connect to the data source using the connection options specified in the dialog box.

2. The logon dialog box appears. The fields have been populated with the values you supplied in "Configuringa Data Source."

Enter your Apache Hive password; then, click OK. Note that the information you enter in the logon dialogbox during a test connect is not saved.

• If the driver can connect, it releases the connection and displays a Connection Establishedmessage.Click OK.

13Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.1

Testing the Connection

Page 14: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

• If the driver cannot connect because of an incorrect environment or connection value, it displays anappropriate error message. Click OK.

3. On the driver Setup dialog box, click OK. The values you have specified are saved and are the defaultsused when you connect to the data source. You can change these defaults by using the previously describedprocedure to modify your data source, or you can override these defaults by connecting to the data sourceusing a connection string with alternate values. See Progress DataDirect for ODBC for Apache Hive WireProtocol Driver User's Guide and Reference for additional information.

See alsoConfiguring a Data Source on page 10

Tuning for PerformanceThe connection options described in this section directly affect the performance of your driver. To tune forperformance, configure your driver according to the recommended settings and your environment.

Array Fetch SizePurpose:

Determines the number of cells the driver retrieves from a server for a fetch. When executing a fetch, the driverdivides the value specified by the number columns in a particular table to determine the number of rows toretrieve.

Performance Impact:

To improve throughput, increase the value of Array Fetch Size. By increasing the value of Array Fetch Size,you increase the number of rows the driver will retrieve from the server for a fetch. In turn, increasing thenumber of rows that the driver can retrieve reduces the number, and expense, of network round trips. Notethat improved throughput does come at the expense of increased demands on memory and slower responsetime. Furthermore, if the fetch size exceeds the available buffer memory of the server, an out of memory erroris returned when attempting to execute a fetch. If you receive this error, decrease the value specified untilfetches are successfully executed.

Recommended Settings:Tune this setting to reflect the typical fetch size of your application. Smaller fetch sizes can improve the initialresponse time of the query. Larger fetch sizes improve overall fetch times at the cost of additional memory.

GUI Tab:

Advanced

Batch MechanismPurpose:

Determines the mechanism that is used to execute batch operations.

Performance Impact:

Unlike the native batch mechanism, the multi-row insert mechanism only returns the total number of updatecounts for batch inserts. Therefore, setting BatchMechanism to MultiRowInsert offers substantial performancegains when performing batch inserts.

Recommended Settings:

Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.114

Chapter 1: Quick Start: Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver for Windows

Page 15: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

If your application does not require individual update counts for each statement or parameter set in the batch,set to 2 (MultiRowInsert) for improved performance when executing batch inserts.

GUI Tab:

Advanced

Catalog ModePurpose:

Determines whether the driver uses native catalog functions to retrieve information returned by the SQLTables,SQLColumns, and SQLStatistics catalog functions.

Performance Impact:

Apache Hive’s native catalog functions return incorrect information in certain scenarios. To address this issue,by default, the driver uses a combination of driver-discovered information and native functions to retrieve moreaccurate catalog information than native functions alone. While using driver-discovered information improvesaccuracy, it does so at an expense to performance. If accurate catalog information is not required, you canimprove performance by setting Catalog Mode connection option to 1 (Native).

Recommended Settings:

If accurate catalog information is not required, set to 1 (Native) for the best performance.

If your application requires accurate catalog information, set to 0 (Mixed) for the optimal balance of performanceand accuracy.

GUI Tab:

Advanced

Default Buffer Size for Long/LOB Columns (in Kb)Purpose:

The maximum length of data (in KB) the driver can fetch from long columns in a single round trip and themaximum length of data that the driver can send using the SQL_DATA_AT_EXEC parameter.

Performance Impact:

To improve performance when your application fetches images, pictures, or long text or binary data, a buffersize can be set to accommodate the maximum size of the data. The buffer size should only be large enoughto accommodate the maximum amount of data retrieved; otherwise, performance is reduced by transferringlarge amounts of data into an oversized buffer. If your application retrieves more than 1 MB of data, the buffersize should be increased accordingly.

Recommended Settings:

Set DefaultLongDataBuffLen to a size no larger than the maximum size of data retrieved.

GUI Tab:

Advanced

Enable Cookie AuthenticationPurpose:

Determines whether the driver attempts to use session cookie based authentication for requests to an HTTPendpoint after the initial authentication to the server.

Performance Impact:

Cookie based authentication improves response time by eliminating the need to re-authenticate with the serverfor each request.

15Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.1

Tuning for Performance

Page 16: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

Recommended Settings:

If your environment is configured to use session cookies, set to 1 (Enabled).

GUI Tab:

General

Encryption MethodPurpose:

Determines the method the driver uses to encrypt data sent between the driver and the database server.

Performance Impact:

Data encryption may adversely affect performance because of the additional overhead (mainly CPU usage)that is required to encrypt and decrypt data.

Recommended Settings:

If data encryption is not required, set to 0 (No Encryption) for improved performance.

GUI Tab:

Security

String Describe TypePurpose:

Specifies whether all string columns are described as SQL_WVARCHAR. This connection option affectsSQL_Columns, SQLDescribeCol, SQLColAttributes, etc. It does not affect SQLGetTypeInfo.

Performance Impact:

To obtain data from String columns with the getClob() method, the StringDescribeType connection propertymust be set to -10 (SQL_WLONGVARCHAR). (Otherwise, calling getClob() results in an "unsupported dataconversion" exception.) When StringDescribeType is set to -10 (SQL_WVARCHAR), the driver not only mapsString to Longvarchar but also allocates more space to cache the long data. Because more space is allocatedfor the long data, your application will incur a performance penalty.

Recommended Settings:

If your application does not need to fetch data as SQL_WLONGVARCHAR, set to -9 (SQL_WVARCHAR) forimproved performance.

GUI Tab:

Advanced

Accessing Data With Third-Party ApplicationsThe following sections guide you through using the driver to access your data with some common third-partyapplications:

• Accessing Data in Tableau on page 17

• Accessing Data in Microsoft Excel on page 19

• Accessing Data in Microsoft Excel from the Query Wizard on page 21

Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.116

Chapter 1: Quick Start: Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver for Windows

Page 17: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

Accessing Data in TableauAfter you have configured your data source, you can use the driver to access your Apache Hive data withTableau. Tableau is a business intelligence software program that allows you to easily create reports andvisualized representations of your data. By using the driver with Tableau, you can improve performance whenretrieving data while leveraging the driver's relational mapping tools.

To use the driver to access data with Tableau:

1. Navigate to the \tools\Tableau subdirectory of the Progress DataDirect installation directory; then, locate the Tableau data source file, DataDirect Apache Hive.tdc.

2. Copy the DataDirect Apache Hive.tdc into the following directory:

C:\Users\user_name\Documents\My Tableau Repository\Datasources

3. Open Tableau. If the Connect menu does not open by default, select Data > New Data Source or the Add

New Data Source button to open the menu.=

4. From the Connect menu, select Other Databases (ODBC).

5. The Server Connection dialog appears.

17Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.1

Accessing Data With Third-Party Applications

Page 18: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

In the DSN field, select the data source you want to use from the drop down menu. For example, My DSN.Then, click Connect. The Logon to Apache Hive dialog appears pre-populated with the connectioninformation you provided in your data source.

6. If required, type your user name and password; then, click OK. The Logon dialog closes. Then, click OKon the Server Connection dialog.

7. The Data Source window appears.

Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.118

Chapter 1: Quick Start: Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver for Windows

Page 19: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

By default, Tableau connects live, or directly, to your data. We recommend that you use the default settingsto avoid extracting all of your data. However, if you prefer, you can import your data by selecting the Extractoption at the top of the dialog.

8. In the Schema field, select the database you want to use. The tables stored in this database are nowavailable for selection in the Table field.

You have successfully accessed your data and are now ready to create reports with Tableau. For moreinformation, refer to the Tableau product documentation at: http://www.tableau.com/support/help.

Accessing Data in Microsoft ExcelAfter you have configured your data source, you can use the driver to access your data with Microsoft Excelfrom the Data Connection Wizard. Using the driver with Excel provides improved performance when retrievingdata, while leveraging the driver's relational-mapping tools.

To use the driver to access data with Excel from the Data Connection Wizard:

1. Open your workbook in Excel.

2. From the Data menu, select Get Data>From Other Sources>From ODBC.

3. The From ODBC dialog appears.

19Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.1

Accessing Data With Third-Party Applications

Page 20: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

Select your data source from the Data Source Name (DSN) drop down; then, click OK.

4. You are prompted for logon credentials for your data source:

• If your data source does not require logon credentials or if you prefer to specify your credentials usinga connection string, select Default or Custom from the menu on the left. Optionally, specify yourcredential-related properties using a connection string in the provided field. Click Connect to proceed.

• If your data source uses Windows credentials, selectWindows from the menu; then, provide yourcredentials. Optionally, specify a connection string with credential-related properties in the provided field.Click Connect proceed.

• If your data source uses credentials stored on the database, select Database; then, provide your username and password. Optionally, specify a connection string in provided field. Click Connect to proceed.

5. The Navigator window appears.

From the list, select the tables you want to access. A preview of your data will appear in the pane on theright. Optionally, click Edit to modify the results using the Query Editor. Refer to the Microsoft Excel productdocumentation for detailed information on using the Query Editor.

Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.120

Chapter 1: Quick Start: Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver for Windows

Page 21: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

6. Load your data:

• Click Load to import your data into your work sheet. Skip to the end.• Click Load>Load To to specify a location to import your data. Proceed to the next step.

7. The Import Data window appears.

Select the desired view and insertion point for the data. Click OK.

You have successfully accessed your data in Excel. For more information, refer to the Microsoft Excel productdocumentation at: https://support.office.com/.

Accessing Data in Microsoft Excel from the Query WizardAfter you have configured your data source, you can use the driver to access your data with Microsoft Excelfrom the Query Wizard. Using the driver with Excel provides improved performance when retrieving data, whileleveraging the driver's relational-mapping tools.

To use the driver to access data with Excel from the Query Wizard:

1. Open your workbook in Excel.

2. From the Data menu, select Get Data>From Other Sources>From Microsoft Query.

3. The Choose Data Source dialog appears.

From the Databases list, select your data source. For example, MyDSN. Click Next.

21Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.1

Accessing Data With Third-Party Applications

Page 22: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

4. The logon dialog appears pre-populated with the connection information you provided in your data source.If required, type your password. Click OK to proceed.

Note: The logon dialog may reappear if Excel needs to access additional information from the data source.If this occurs, re-enter your password; then, click OK to proceed to the next step.

5. The Query Wizard - Choose Columns window appears.

Choose the columns you want to import into your workbook. To add a column, select the column name inAvailable tables and columns pane; then, click the > button. After you add the columns you want to include,click Next to continue.

6. Optionally, filter your data using the drop down menus; then, click Next.

7. Optionally, sort your data using the drop down menus; then, click Next.

8. Select "Return Data to Microsoft Excel"; then, click Finish.

9. In the Import Data window appears.

Select the desired view and insertion point for your data. Click OK.

You have successfully accessed your data in Excel using the Query Wizard. For more information, refer to theMicrosoft Excel product documentation at: https://support.office.com/.

Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.122

Chapter 1: Quick Start: Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver for Windows

Page 23: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

Troubleshooting Setup/Connection IssuesThis section describes common setup/connection issues you may encounter while trying to establish a databaseconnection with the driver as well as some potential reasons for these issues. If you are experiencing a problemnot described in this section, comprehensive troubleshooting resources are available in the "Troubleshooting"section of the User's Guide and Reference for your driver.

In addition, Progress DataDirect offers a variety of options to meet your technical support needs. See Contactingtechnical support on page 24 for details.

Common Setup/Connection IssuesYou are experiencing a setup/connection issue if you are encountering an error or hang while you are tryingto make a database connection with the ODBC driver or are trying to configure the ODBC driver. Some commonerrors that are returned by the ODBC driver if you are experiencing a setup/connection issue include:

• Specified driver could not be loaded.

• Data source name not found and no default driver specified.

• Unable to connect to destination.

• Invalid username/password; logon denied.

Troubleshooting the IssueSome common reasons that setup/connection issues occur are:

• The database and/or listener are not started.

• The ODBC driver’s connection options are not set correctly when configuring the driver. For example, thehost name or port number are not correctly configured. See "Configuring a Data Source" for a list of connectionoptions that are required for the driver to connect properly to the underlying database.

See alsoConfiguring a Data Source on page 10

23Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.1

Troubleshooting Setup/Connection Issues

Page 24: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

Additional ResourcesIn addition to this quick start, the following resources enable you to take full advantage of the features andsupport offered for your driver.

• Product Documentation Library contains a comprehensive set of product documentation, including thefollowing guides:

• Progress DataDirect Connect Series for ODBC Installation Guide details requirements and proceduresfor installing the product.

• Progress DataDirect for ODBC for Apache HiveWire Protocol Driver User's Guide and Reference guidesyou through using and configuring the driver, provides detailed reference information, and explains thetools used to troubleshoot common problems.

• Progress Support Knowledgebase provides answers to questions, access to technical documentation,release notes, product alerts and other support information.

• Progress Community allows you to contribute, share, and network with other Progress users and employees.

• Technical Support provides technical support services, including maintenance services and opening asupport case.

Contacting technical supportProgress DataDirect offers a variety of options to meet your support needs. Please visit our Web site for moredetails and for contact information:

https://www.progress.com/support

The Progress DataDirect Web site provides the latest support information through our global service network.The SupportLink program provides access to support contact details, tools, patches, and valuable information,including a list of FAQs for each product. In addition, you can search our Knowledgebase for technical bulletinsand other information.

When you contact us for assistance, please provide the following information:

• Your number or the serial number that corresponds to the product for which you are seeking support, or acase number if you have been provided one for your issue. If you do not have a SupportLink contract, theSupportLink representative assisting you will connect you with our Sales team.

• Your name, phone number, email address, and organization. For a first-time call, you may be asked for fullinformation, including location.

• The Progress DataDirect product and the version that you are using.

• The type and version of the operating system where you have installed your product.

• Any database, database version, third-party software, or other environment information required to understandthe problem.

• A brief description of the problem, including, but not limited to, any error messages you have received, whatsteps you followed prior to the initial occurrence of the problem, any trace logs capturing the issue, and soon. Depending on the complexity of the problem, you may be asked to submit an example or reproducibleapplication so that the issue can be re-created.

Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.124

Chapter 1: Quick Start: Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver for Windows

Page 25: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

• A description of what you have attempted to resolve the issue. If you have researched your issue on Websearch engines, our Knowledgebase, or have tested additional configurations, applications, or other vendorproducts, you will want to carefully note everything you have already attempted.

• A simple assessment of how the severity of the issue is impacting your organization.

25Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.1

Additional Resources

Page 26: TheProgressDataDirect for ODBC forApacheHiveWire ... · 14.06.2018  · 11.TheCreateDefaultDefaultDataSourcewindowappears.SelecttheCreateDefaultDataSourceboxto defineauserdatasourceinHKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INIforeachDataDirect

Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver: Quick Start for Windows: Version 8.0.126

Chapter 1: Quick Start: Progress DataDirect for ODBC for Apache Hive Wire Protocol Driver for Windows