Creating a PDD
In addition to creating PDDs directly from the user interface, PDDs can also be created inline as part of the process for running a Batch Job or creating a Data Flow Job. PDDs may also be created using the Privitar automation APIs.
To create a PDD:
Select Protected Data from the Navigation sidebar. The Protected Data page is displayed, showing an index of all the PDDs that have been created.
Click on Create New Protected Data Domain. The Create Protected Data Domain screen is displayed.
Enter details about the new PDD:
A Name that will be used to refer to this specific PDD.
Whether to attempt to Embed Watermarks in the de-identified files.
Note
Enabling this option does not guarantee that every processed file will contain a watermark. Some small files may not be compatible, and the Policy used with the PDD must include at least one masked column. For more information, see Watermarking a Dataset.
Enter details about the output locations of the PDD in Output Data Locations. PDDs can contain output in various locations. Depending on the Environment configuration, HDFS, Hive and AWS Glue may be available. When using Data Flow or Privitar On Demand Jobs, no output location is required.
If a Hadoop cluster is configured in the Environment:
Enable Batch Jobs.
Enter the HDFS output path of the folder where output data will be written in the Batch Output Path.
If Hive is configured in the Environment:
Enable Hive Batch Jobs.
Enter the Hive Database Name into which output records will be inserted.
(Depending on configuration) Enter the HDFS Output path where Hive table data should be stored in the Write Job output to field. This option is presented only if the value is not specified centrally by a system administrator in the PDD's Environment.
If AWS Glue is configured in the Environment:
Enter the Amazon S3 output path of the folder where output data will be written in the Batch Output Path.
If metadata attribute mappings are configured and enabled (See Managing Metadata Attributes), then they will appear under Properties.
Complete the required Metadata fields. The fields shown in the dialog box depend on the Privitar configuration. (For more information, see Managing Metadata Attributes.)
Click Save to save the new PDD.