Clustering
Analytics uses clustering to create groups of conceptually similar documents. With clusters, you can identify conceptual groups in a workspace or subset of documents using an existing Analytics index. Unlike categorization, clustering doesn't require much user input. You can create clusters based on your selected documents without requiring example documents or category definitions. You can view the clusters identified by the index in the Cluster browser available on the Documents tab.
When you submit documents for clustering, the Analytics engine determines the positions of the documents in the conceptual index. Depending on the conceptual similarity, the index identifies the most logical groupings of documents and places them into clusters. Once the Analytics engine creates these clusters, it runs a naming algorithm to label each node in the hierarchy appropriately to refer to the conceptual content of the clustered documents.
Clustering is useful when working with unfamiliar data sets. However, because clustering is unsupervised, the Analytics engine doesn't indicate which concepts are of particular interest to you. Use investigative features such as Sampling, Searching, or Pivot in order to find the clusters that are of most interest.
Note: Clustering doesn't perform conceptual culling of irrelevant documents, but there is a group created for documents without searchable text. The name of the group is UNCLUSTERED. This group can be used to find documents that don’t have enough searchable data. All documents with conceptual data get clustered somewhere once. Clustering may be used on any list of documents. For example: a saved search based on custodians, search term results, date ranges or the entire workspace.
After you've created clusters, you can use those clusters to create batches to speed up a linear review.
See these related pages:
Creating or replacing a cluster
You can create new clusters or replace existing ones. The steps for these tasks are similar, except that you need to select an existing cluster when you perform a replacement.
Note: As of February 2025, the new Feature Permissions redefines Relativity's security management by shifting the focus from Object Types and Tab Visibility to feature-based permissions. This new method is simply another option; any feature-specific permissions information already in this topic is still applicable. This new interface enables administrators to manage permissions at the feature level, offering a more intuitive experience. By viewing granular permissions associated with each feature, administrators can ensure comprehensive control, ultimately reducing complexity and minimizing errors. For details see
Instance-level permissions and
Workspace-level permissions.
Notes:
- You must have the add permission (for both field, Cluster Set, and choice objects) and the cluster mass operations permission to create or replace clusters.
- Permissions for the Cluster Set object and the multiple choice fields that hold cluster data should be kept in sync. Refer to the Cluster Set object description in Workspace permissions.
To automatically create a cluster when creating a conceptual index, leave the Cluster Documents option set to Yes. For more information, see the following:
To manually create a new cluster or replace an existing one:
- Navigate to the Documents tab and find the documents that you wish to cluster. This might be a saved search, folder, or all documents in the workspace.
- Select the documents you want to cluster. You can click the checkboxes of individual documents in the list and select Checked from the drop-down menu. Alternatively, you can select All to include all documents.
- Select Cluster from the Mass Operations drop-down menu.
The Cluster X Documents pop-up displays.
- Complete the fields on the Cluster dialog. See Fields.
- Once you've completed the fields on the Cluster dialog, click Submit for Clustering.
To view the cluster visualization on the Documents tab, do the following:
- On the Documents tab, click the Add Widget button and select Cluster Visualization from the drop-down menu.
- Select the desired Cluster Set from the drop-down menu.
- Click Apply.
The selected cluster visualization displays on the Documents tab.
Fields
The following fields display on the Cluster window:
- Mode - determines whether this is a new cluster or a replacement
- Name - the name of the new cluster. If you are replacing an existing cluster this field becomes Cluster, in which you select the cluster you'd like to replace from a drop-down list of existing ones.
- Relativity Analytics Index - the Analytics index that is used to cluster your documents. The index you select here must have queries enabled for you to be able to submit the selected documents for clustering. All of the selected documents must be in the data source of this index; otherwise, they end up as Not Clustered.
Click + to display the following Advanced Options:
See Deleting a cluster.
Default settings for automatically created clusters
If you choose to automatically create a cluster when setting up a new conceptual index, the cluster will behave as follows:
- It will be called "Default Cluster [index name]."
- It will be created immediately after the conceptual index has been activated.
- It will be linked to the chosen conceptual index.
- The cluster settings will have the following values:
- Title Format: Outline and Title
- Maximum Hierarchy Depth: 3
- Minimum Coherence: 0.7
- Generality: 0.5
- Create Cluster Score Field: No
You can replace the default cluster at any time through the Mass Operations menu. For more information, see Creating or replacing a cluster.
Replacing an existing cluster
Replace existing cluster is the same as Creating or replacing a cluster, except the results replace existing clustering options. When you select Replace Existing Cluster, you're prompted to select the existing cluster set you would like to replace.
Viewing clusters
Note: The Cluster browser has been deprecated. You can access clustering by using the Cluster visualization. To learn more, see Cluster Visualization.
Deleting a cluster
To delete a cluster, perform the following steps:
- Navigate to the Analytics Indexes tab, and click on the Analytics index that was used to create the cluster.
- Select the checkbox next to the cluster you want to delete from the cluster list at the bottom of the Analytics index console.
- Click Delete.