Investigating Watermarks
Use the Investigation page in the Privitar Platform to investigate a watermark, that is, to analyze sample data in HDFS for the presence of a watermark.
The investigation process may take some time, depending on the size of the data, so Privitar lets you track multiple ongoing investigations in the Policy Manager interface.
Once the process is complete, the record of the investigation remains listed, with a link to the PDD if one was identified.
Watermark investigations require an Environment with a configured Hadoop Cluster and an HDFS location that contains the sample. Before beginning an investigation, note the location of the data sample on HDFS.
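Because the investigation needs the full pathname of the sample, including the file name, it can help to sanity-check the noted location first. The following is a minimal Python sketch and is not part of the Privitar Platform. The function name check_hdfs_file_path is hypothetical, and the sketch assumes the location is written as a plain absolute path (no hdfs:// scheme). It checks only the shape of the path, not whether the file exists on the cluster.

```python
import posixpath


def check_hdfs_file_path(path: str) -> str:
    """Return a normalized full file path, or raise ValueError.

    Hypothetical helper: assumes a plain absolute HDFS path such as
    /data/samples/out.csv. It does not contact the cluster.
    """
    cleaned = path.strip()
    if not cleaned.startswith("/"):
        raise ValueError("path must be absolute (start with '/')")
    if cleaned.endswith("/"):
        raise ValueError("path must include the file name, not only a directory")
    # Collapse duplicate slashes and '.' or '..' segments.
    normalized = posixpath.normpath(cleaned)
    if posixpath.basename(normalized) == "":
        raise ValueError("path does not name a file")
    return normalized
```

For example, check_hdfs_file_path("/data//samples/./out.csv") returns a cleaned-up path that can be pasted into the File Path edit box, while a relative path or a directory path raises ValueError.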
To investigate a file to see if it contains a watermark:
Select Investigation from the Navigation sidebar.
The Investigation page is displayed, showing a list of the investigations that have previously been performed.
Click on Investigate File to begin an investigation.
The Investigate File dialog box is displayed.
Select the Environment that contains the file to investigate from the Environment list box.
Specify the full pathname of the file (including the file name) in the File Path edit box.
If necessary, specify the type of file from the list box. (Click on the Settings icon to specify options for reading and writing CSV files.)
Click on Start to begin the investigation process.
The pending investigation is shown as a new entry on the Investigations page. The outcome of the process can be:
A PDD was identified by a watermark match.
No PDD was identified.
An error occurred during the investigation process. For example, an incorrect pathname was specified.
Refer to the following sections for more information.
A PDD was Identified
When a PDD is identified, the entry for the investigation on the Investigations page is marked as a Match and shows the name of the identified PDD, along with a link to the PDD's detail page.
No PDD was Identified
There are two reasons why a PDD might not be identified:
The file did not contain a watermark.
The file has been tampered with to the extent that the watermark could not be extracted with confidence.
In both of these cases, the investigation is marked as No Match on the Investigations page.
An Error Occurred During the Investigation
If an error occurs during extraction, then a Failed status is displayed in the entry for the investigation on the Investigations page.
For troubleshooting, a View Log link to the execution log is available in the "..." menu.
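The three outcomes described in the sections above can be summarized as a small lookup. This is an illustrative Python sketch only: the status labels (Match, No Match, Failed) come from the Investigations page as described here, but the names InvestigationStatus, NEXT_STEP, and next_step are hypothetical and do not correspond to any Privitar API.

```python
from enum import Enum


class InvestigationStatus(Enum):
    """Status labels shown for an entry on the Investigations page."""
    MATCH = "Match"
    NO_MATCH = "No Match"
    FAILED = "Failed"


# Hypothetical mapping from each status to the follow-up described in this topic.
NEXT_STEP = {
    InvestigationStatus.MATCH: "Follow the link to the identified PDD's detail page.",
    InvestigationStatus.NO_MATCH: (
        "No watermark was found, or the file was tampered with too heavily "
        "for the watermark to be extracted with confidence."
    ),
    InvestigationStatus.FAILED: "Open View Log from the '...' menu to troubleshoot.",
}


def next_step(status_label: str) -> str:
    """Return the suggested follow-up for a status label; raises ValueError if unknown."""
    return NEXT_STEP[InvestigationStatus(status_label.strip())]
```

Calling next_step("Failed"), for instance, returns the reminder to open the execution log, and an unrecognized label raises ValueError rather than being silently ignored.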