Review documents

This section describes common tasks and interfaces used by Reviewers when working with PI Detect. Reviewers check documents to make sure the machine is accurately predicting what documents contain PII. They correct the machine's predictions when necessary and make sure all PII types are detected.

Spreadsheet review

The PI Detect product uses AI to identify table boundaries, column headers, and the personal information that sits within those tables. The PI Detect product also identifies personal information that sits within cells outside of tables. Below is a high-level overview of the recommended review approach for spreadsheets, as it differs from non-spreadsheet documents:

  • Predict table boundaries and column headers. During the Incorporate Feedback process, PI Detect finds table boundaries with column headers to predict where and what type of personal information is contained in tables.
  • Verify identified table boundaries and column headers. PI identified within tables are dependent on the table boundaries being properly identified. Table boundaries identified by the product can be edited or removed. New table boundaries can be added. For more information, see Creating a table

  • Verify Personal Information identified outside of tables. The PI Detect product can also identify personal information that sits within cells outside of a table. Please use the PI Detection card to navigate to those. That PI can also be edited, removed, or new PI sitting outside a table can be added. For more information, see Capturing PI

Reviewing PI within tables

The PI Detect product identifies table boundaries with column headers to predict where and what type of personal information is contained in tables. There are instances where tables boundaries may need to be edited, removed, or added.

Creating a table

If there is a table in a sheet that has not been identified, a new table can be created. To create a new table:

  1. Select the table icon on the bottom of the native spreadsheet viewer or right click from the viewer and select Add Table Boundary.

  2. From the table card, click Add New Table.

  3. Add a Name for the table.

  4. Insert the Table Boundaries for the table.

  5. If the table does not have headers and the first row of the table contains PI, disable the Header toggle. The header location will be auto filled with the first row of the table.
    If the Header toggle is selected, a header value must be entered.

  6. Click Save after confirming that the cell range is correct.

Editing a table

Table boundaries are marked by a dashed line around the predicted table. Existing table boundaries can be modified:

  1. Right click on the table boundary of the table you wish to edit.

  2. From the Context Menu, select Personal Information. Click Edit Table Boundary.

  3. A table card will appear. Click the Edit button in the table card.

  4. The Name of the table, Table Boundary, and Header values can be changed.

  5. Click Save.

Capturing PI

PI can be captured for a single cell, or for an entire column. It is appropriate to annotate an entire column if the majority of cells in that column represent the same type of PI. If this is not the case and PI is scattered inconsistently across the sheet, then manual annotations for single cells can be created.

Adding PI

To add PI:

  1. Right click the relevant column header.

  2. From the Context Menu, select Personal Information.

  3. Click Add Column Annotation.

  4. Select the PI Type to assign to the column.

  5. Click Save.

The PI will appear in the PI Detection panel.

Editing PI

To edit PI:

  1. From the Context Menu, select Personal Information.
  2. Click Edit Column Annotation.
  3. Select the new PI Type to assign to the column.
  4. Click Save.

Deleting PI

To delete annotations that have been applied to entire columns:

  1. From the Context Menu, select Personal Information.
  2. Click Remove Column Annotation.
  3. A confirmation modal appears. Click Remove.

Adding partial cell PI

To add partial cell PI:

  1. Right click the cell containing the PI in the spreadsheet.
  2. From the Context Menu, select Personal Information.
  3. Click Add.
  4. The PI is captured in the PI Value field.
    Type to edit the PI Value if needed.
  5. Choose the PI Type.

  6. Click Add.

The PI now appears in the PI Detection panel.

Editing partial cell PI:

To edit partial cell PI:

  1. Right click the cell containing the PI in the spreadsheet.
  2. From the Context Menu, select Personal Information.
  3. Click Edit.
  4. Edit the PI Value and PI Type.
  5. Click Save.

Removing partial cell PI:

To remove partial cell PI:

  1. Right click the cell containing the PI in the spreadsheet.
  2. From the Context Menu, select Personal Information.
  3. Click Delete.
    A confirmation modal appears.
  4. Click Remove.

Non-spreadsheet review

PI Detect identifies personal information located within documents, reducing the manual burden on reviewers. When reviewing documents, the goal is to validate the predictions highlighted by the AI.

Capturing PI

Add or remove PI by following the procedures below.

Adding PI

To add PI:

  1. Select the text for annotation using the cursor.

  2. Right click on the document to open the Context Menu.
  3. From the Context Menu, select Personal Information. Click Add.

  4. From the Add PI Detection modal, select the PI type for the auto-filled PI value. You cannot change this text without re-highlighting it.

  5. Click Save.

Editing PI

To edit PI:

  1. Right click the highlighted PI to edit.
  2. From the Context Menu, select Personal Information.
  3. Click Edit.
  4. From the Edit PI Detection modal, select the PI type for the auto-filled PI value.

  5. Click Save.

Removing PI

To remove PI:

  1. Right click the highlighted PI to remove it.

  2. From the Context Menu, select Personal Information.

  3. Click Remove.

  4. Click Remove on the confirmation modal.

Adding drawn annotations

In some cases, PI detections cannot be made on a document due to poor OCR quality. In these instances, PI can be recorded by doing the following:

Note: Drawn annotations can only be applied to PDFs.

  1. In the PI Detection panel, select Draw Annotation.

  2. Draw a box over the text that contains the PI to record.

  3. A panel opens in the Draw Annotation section of the PI Detection panel.
    Enter the PI Value and PI Type.

  4. Click Save.

Locking detections

For PI detections to be preserved on a document when the Incorporate Feedback pipeline is rerun, PI must be saved before moving on to the next document when reviewing.

To save PI detections before Incorporate Feedback is run, click the Lock Annotations button in the PI Detections panel. You can continue to make changes and Add/Edit/Remove PI after a document is locked. Those updates will remain on the document.
Changes can not be made while Incorporate Feedback is running.

If further changes need to be made on the document, select the button again to Unlock Annotations.

Errors

Personal information will not appear in the PI Detection panel under the following circumstances:

  • The AI process, called Incorporate Feedback in PI Detect, has not been run.
  • The AI process, called Incorporate Feedback in PI Detect, is in progress.
    • Detections will be available in the PI Detection panel once Incorporate Feedback is complete. While Incorporate Feedback is in progress, the panel will display the following message:

  • The AI process, called Incorporate Feedback in PI Detect, has failed.

    • If an error has been encountered during PI detection or it has failed, the following message will appear in the PI Detections panel:

  • No PI was detected after the AI was run.

    • If a document does not contain PI, the PI detection panel will display a message indicating this.