Entity Analysis

You can use the Entity Analysis tab to see, modify, and export your entity lists.

The following are definitions of terms used throughout this documentation.

Entity: a unique individual identified by their name and their personal information (PI), such as SSN, date of birth, and full address. Each entity is assigned a unique Entity ID, appears in the Entity Centric Report, and is composed of one or more records.

Record: a pairing of raw name and PI data extracted from documents. Records are evaluated against each other to determine whether they should be consolidated into an entity. A single record becomes its own entity if no related records are found.

Conflict Cluster: a group of records formed on the basis of PI conflicts and name similarity. Entities with conflicting PI or similar names are grouped together to aid in review and potential merging. Entities are not merged if there is no PI match or if a conflict exists.
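The merge rule stated in the Conflict Cluster definition (merge only when there is a PI match and no conflict) can be illustrated with a minimal sketch. This is not the product's implementation. The `Record` class, the PI field names, and the `pi_match`, `pi_conflict`, and `can_merge` helpers are all hypothetical, and the sketch assumes that a "match" means at least one shared PI field has equal values and a "conflict" means at least one shared PI field has different values.

```python
from dataclasses import dataclass, field


@dataclass
class Record:
    """A raw name plus PI values pulled from a document (hypothetical shape)."""
    name: str
    pi: dict = field(default_factory=dict)  # e.g. {"ssn": "...", "dob": "..."}


def pi_match(a: Record, b: Record) -> bool:
    """True if at least one PI field present in both records has the same value."""
    return any(a.pi[k] == b.pi[k] for k in a.pi.keys() & b.pi.keys())


def pi_conflict(a: Record, b: Record) -> bool:
    """True if any PI field present in both records has different values."""
    return any(a.pi[k] != b.pi[k] for k in a.pi.keys() & b.pi.keys())


def can_merge(a: Record, b: Record) -> bool:
    """Assumed rule: merge only when there is a PI match and no conflict."""
    return pi_match(a, b) and not pi_conflict(a, b)


# Illustrative, made-up data only.
r1 = Record("Jane Doe", {"ssn": "000-00-0001", "dob": "1980-01-01"})
r2 = Record("J. Doe", {"ssn": "000-00-0001"})                         # same SSN
r3 = Record("Jane Doe", {"ssn": "000-00-0001", "dob": "1975-05-05"})  # DOB differs
r4 = Record("Jane Doe", {})                                           # no PI at all

print(can_merge(r1, r2))  # PI match, no conflict
print(can_merge(r1, r3))  # PI conflict on date of birth
print(can_merge(r1, r4))  # similar name but no PI match
```

Under these assumptions, only the first pair would be consolidated. The second and third pairs are the kind of cases that would be left for review in a conflict cluster rather than merged.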

Note: Run Normalization has moved to the Data Analysis tab. See Data Analysis for more information.

The Entity Analysis tab includes two subtabs:

  • Entities: shows the current view of your entity list. If this list is blank, you have either not yet extracted any records from your documents or not yet run the Normalizer. Go to the Data Analysis tab to run the Normalizer.
  • Conflicts: shows all entities where the Normalizer encountered conflicts. Resolve those conflicts here to ensure that your Entity Report is accurate.