User Guide

Creating Batch Unmasking Jobs

To create a Batch Unmasking Job:

  1. Select Protected Data from the Navigation sidebar.

    The Protected Data page is displayed, listing the Protected Data Domains (PDDs) that have been created.

  2. Click the name of the PDD in the PDD index listing.

    The Protected Data Domain Details window is displayed for the selected PDD.

  3. Select the Unmasking Jobs tab.

  4. Select Batch from the Create Unmasking Job list box. The Create Unmasking Batch Job page is displayed.

  5. Complete the Job tab. For more information, see Defining Unmasking Jobs.

  6. Complete the Data Locations tab to reference the HDFS data file that is to be processed and to specify where the unmasked data should be written:

    • The entry in the Input root location box determines the base location in HDFS containing the data file to be unmasked.

    • The entry in the Relative path to file box is interpreted relative to the Input root location to identify the file. This may contain wildcards and path replacements. (For more information, see Partitioned Data and Wildcards (HDFS).)

    • The entry in the Output root location box determines the base location for the unmasked output. Each output file is written to this location with the Relative path to file appended.

  7. If you wish to customize the Spark execution of the Unmasking Job, select the Advanced tab.

    For more information, see the Apache Spark documentation.
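
The way the Data Locations entries in step 6 combine can be sketched as follows. This is a minimal illustration, assuming the Relative path to file is simply appended to each root location. The helper name `resolve_locations` and the example HDFS paths are hypothetical, and wildcards and path replacements are not modeled here.

```python
import posixpath


def resolve_locations(input_root, output_root, relative_path):
    """Return (input_file, output_file) for one relative path.

    Assumption for illustration only: both locations are formed by
    appending the same relative path to the respective root.
    """
    # Drop any leading slash so the path is treated as relative to the root.
    rel = relative_path.lstrip("/")
    return posixpath.join(input_root, rel), posixpath.join(output_root, rel)


# Hypothetical example values, not taken from the product.
src, dst = resolve_locations(
    "hdfs:///data/masked",
    "hdfs:///data/unmasked",
    "sales/2020/part-0001.csv",
)
print(src)
print(dst)
```

Under this assumption, the output keeps the same directory layout as the input, only rooted at the Output root location.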