Before you begin to integrate Big Data Management with Qubole on AWS, perform initial tasks to configure Qubole.
Establish the Qubole Environment on AWS
Go to the Qubole
AWS Quick Start Guide and prepare the Qubole environment by performing the steps under the heading "Getting Started with Qubole on AWS."
Enable Spark 2.2.1 on Qubole
By default, Qubole supports Spark version 2.1.1, but the integration with Big Data Management requires Spark 2.2.1. To use Spark version 2.2.1, you must contact Qubole Support to enable it for your Qubole environment.
Create a support ticket with Qubole Support and request the following changes to your Qubole environment:
Enable Spark 2.2.1 support for resources that you create.
Disable private IP.
Big Data Management uses VM host names to communicate with the Qubole cluster. When private IP is disabled for the cluster, the cluster identifies its nodes internally by host names, instead of IP addresses. This enables the Qubole cluster to understand resource requests from Informatica.