Lookup Text
This section provides a comprehensive description of the Lookup Text Rule.
For a summary of the rule and its compatibility with Privitar jobs and execution environments, see Masking Rule Types.
Data Types
The supported data types for this rule are:
Text
Description
The value is replaced by a randomly selected value read from from a Lookup file stored in HDFS or uploaded locally. For example, here is a typical Lookup file consisting of six names:
Gary Joshua Daniel Thomas Craig Samuel
This file could be used by the rule to map the following input values to new output values:
Input value | Output value | Mapping |
---|---|---|
John | Daniel | John > Daniel |
Bob | Craig | Bob > Craig |
Christian | Gary | Christian > Gary |
Louis | Joshua | Louis > Joshua |
If using Lookup From File, specify the name of a file in Lookup From File. The full HDFS directory path and filename must be specified.
If using Upload Lookup File, use Drag & drop a file or Browse to upload a file stored locally.
The Lookup file must be a CSV file that contains a single column of newline-separated replacement values (no comma is needed to separate the values). The replacement value is drawn at random from this file. The size of the Lookup file can be up to 100k rows.
Tokenization will fail if there are fewer replacement values in the Lookup file than original values to be replaced.
Masking Behavior
The options are described in the following table:
Option | Description |
---|---|
Lookup From File | The full HDFS directory path including filename of the Lookup file. |
Upload Lookup File | File to be uploaded. |