Find similar documents

You can use Find similar documents to identify documents that are conceptually similar to the one you are viewing. Relativity ranks the documents based on the conceptual similarity of their content in the concept space rather than a strict word comparison.

When you click Find Similar Documents in the Viewer, the entire document is submitted as a query string. The process is similar to a concept search, except instead of a query string, the whole document's position in the concept space is used as the query. A hit sphere with minimum concept rank of 60 is drawn around the document, and any documents that are within that hit sphere are returned as search results. This minimum rank value is not configurable.

Special considerations

Note the following special considerations about running conceptual analytics operations:

  • The following security permissions are required to run the operations:
    Object SecurityTab Visibility
    • Document - View
    • Analytics Index - View
    • Analytics Categorization Set - View, Edit, Add
    • Analytics Categorization Category - View, Edit, Add
    • Analytics Example - View, Edit, Add
    Documents
  • In order to run an operation from the viewer, the document must be in the data set of an active Analytics index.
  • You can only run operations in the Native Viewer and Extracted Text Viewer.

Running find similar documents from the viewer

Note: Large documents with many topics are not optimal for finding similar documents. Documents that have more than 8 MB of extracted text are also not suitable for this feature.
Instead of using find similar documents, select the text that is relevant to your query, and then submit that text as a concept search. For more information, see Concept searching.

To find similar documents, perform the following steps:

  1. Select a document from the document list and open it in the Native Viewer or Extracted Text Viewer. This is your primary document.
  2. Click the View Similar Documents button at the bottom of the viewer. Alternatively, right-click in the white space of the document and select Find Similar Documents

When the operation is executed, all of the unfiltered text of the document is used as the query. The Documents list pane opens and displays the Similar Documents tab, which contains other conceptually similar documents. This tab contains the following information about the results:

  • Rank - the conceptual similarity of the document to the primary document. The higher the rank, the higher the relevance to the query. A rank of 100 represents the closest possible distance. The rank doesn't indicate the percentage of shared terms or the percentage of the document that isn't relevant.
  • Control Number - the control number of the document.

Note: If there are more than 10,000 results, only the first 10,000 will display in the Viewer.

Once the conceptual analytics operation is executed, the following takes place:

  1. The breadcrumb navigation includes Conceptual Analytics if you have run a concept search, find similar documents, or keyword expansion. If you navigate back to the Documents tab, this breadcrumb is removed.
  2. The Similar Documents card updates to display the results of the operation and the number of documents returned by the operation.
  3. Optionally, you can click on the Filters icon to enable filtering. Enter the desired terms in a column and press Enter on your keyboard to filter the results.
  4. Optionally, click the Export to CSV icon to download a .csv file that contains the results in the Similar Documents card.
  5. Optionally, to save the current documents in the Similar Documents card as a saved search, do the following:
    1. Click the Save as Search icon in the bottom-left.
      The Saved Search pop-up displays.
    2. Enter the desired name in the Name field.
    3. Click Save.

The Viewer displays with the Similar Documents card expanded.

If you have more than one active index, the Select Index icon displays in the upper-right of the card. The oldest active index (lowest Artifact ID) is chosen by default. To change indexes, click on the Select Index icon and select a different active index from the drop-down menu.

Changing the results panel columns

To change which columns appear in the results panel:

  1. Exit the document viewer and navigate to the Views tab.
  2. Filter the list of Views to search for Search Results Pane View. Click the pencil icon to edit the view.
  3. The first page shows basic view information. Click Next to display the options for Fields.
  4. Edit the view as desired, then click Save.

For more information about editing views, see Views.