Table of Contents

Search

  1. Abstract
  2. Supported Versions
  3. Integrating Informatica® Big Data Management 10.2.2 HF1 SP1 with Qubole

Integrating Informatica® Big Data Management 10.2.2 HF1 SP1 with Qubole

Integrating Informatica® Big Data Management 10.2.2 HF1 SP1 with Qubole

Perform Initial Qubole Configuration Tasks

Perform Initial Qubole Configuration Tasks

Before you begin to integrate Big Data Management with Qubole on AWS, perform initial tasks to configure Qubole.

Establish the Qubole Environment on AWS

Go to the Qubole AWS Quick Start Guide and prepare the Qubole environment by performing the steps under the heading "Getting Started with Qubole on AWS."

Enable Spark 2.2.1 on Qubole

By default, Qubole supports Spark version 2.1.1, but the integration with Big Data Management requires Spark 2.2.1. To use Spark version 2.2.1, you must contact Qubole Support to enable it for your Qubole environment.
Create a support ticket with Qubole Support and request the following changes to your Qubole environment:
  • Enable Spark 2.2.1 support for resources that you create.
  • Disable private IP.
    Big Data Management uses VM host names to communicate with the Qubole cluster. When private IP is disabled for the cluster, the cluster identifies its nodes internally by host names, instead of IP addresses. This enables the Qubole cluster to understand resource requests from Informatica.
  • Set the Hive Metastore version to 1.2.

0 COMMENTS

We’d like to hear from you!