Files tab

The Files tab in Relativity Processing allows you to view and analyze a list of all discovered documents and their metadata before deduplication and publishing.

Discovered files view

Views on the Files tab

The Files tab contains the following views:

  • All Files—lists all the files in your workspace.
  • Current Errored Files—lists the documents that yielded errors in your workspace that currently have an Error Status value of Not Resolved.
  • All Errored Files—lists the documents that yielded errors in your workspace, including those with a current Error Status value of Resolved and Unresolved.
  • Deleted Documents—lists the documents you deleted from your workspace.
  • Container files—lists the container files in your workspace.

Note: You can export any file list as a CSV file, which includes the total set of filtered results.

All Files view

The All Files view contains all the discovered files in your workspace. This view does not contain documents that have been deleted and have a Yes value for the Processing Deletion? field. Those documents can only be found in the Deleted Documents view described in the next section.

This view contains the following fields:

  • Details—the details view of all fields, including compressed metadata, of the discovered file selected.
  • File ID—the number value associated with the discovered file in the database.
  • File Name—the original name of the discovered file.
  • File Type—the file type of the discovered file.
  • File Extension - Text—allows file extensions to be filtered by text.
  • Custodian—the custodian associated with the discovered file.
  • Data Source—the data source containing the discovered file.
  • Dedupe Status—the deduplication status of a file. The Dedupe Status field is active after publishing completes and is useful for determining which files are duplicates and, therefore, not published. The Dedupe Status field is empty for container files, discovered files, and any other files previously processed (such as files published prior to the field being available.) If you republish files, the Dedupe Status field updates with the status of the republished files. Child documents inherit their dedupe status from their parent.
    Dedupe Status field values include:
    • Primary—a file that has one or more duplicates after publication.
    • Duplicate—a duplicate of a primary file.
    • Unique—a primary file that does not have any duplicates.
  • File Size (KB)—the size of the discovered file. To specify KB or MB, this field needs to be recreated as a fixed-length text field.
  • Is Published—the yes/no value indicates if a discovered file is published.
  • Sender Domain—the domain of the sender of an email.
  • Sort Date—the date taken from the file's Sent Date, Received Date, and Last Modified Date fields in that order of precedence.
  • Virtual Path - Text—the complete folder structure and path from the original folder or file chosen for processing to the discovered file.

Deleted Documents view

The Deleted Documents view contains the files you deleted from the Documents tab after the files were published.

This view contains the following fields:

  • File ID—the number value associated with the discovered file in the database.
  • File Name—the original name of the discovered file.
  • Custodian—the custodian associated with the discovered file.
  • Data Source—the data source containing the discovered file.
  • Processing Deletion?—the yes/no value indicating if a discovered file or partial family file is deleted.
  • Is Published?—the yes/no value indicating if a discovered file is published.
  • Error Message—the message that details the error, cause, and suggested resolution of the error prioritized by the following processing phases:
    • Delete
    • Publish
    • Discover
    • Text Extraction

Current Errored Files view

The Current Errored Files view contains all the documents that yielded errors in your workspace that currently have an Error Status value of Not Resolved. By default, this view does not contain files with an Error Status of Resolved, as those can be found in the All Errored Files view.

Current Errored Files View

  • Details—the details view of all fields, including compressed metadata, of the discovered file selected.
  • File ID—the unique identifier assigned to an instance of a file.
  • Storage ID—the unique identifier assigned to a file.
  • Note: Differentiating between the File ID and the Storage ID. Every file is assigned a unique Storage ID. Every instance of a file is assigned a unique File ID. For example, you may have a Word document with a unique Storage ID. If that document is attached to two different emails, then each instance of the document attachments is assigned a unique File ID. Therefore, a single file with a Storage ID, may have multiple instances, each with their own unique File ID.

  • File Name—the original name of the discovered file.
  • Error Message—the message that details the error, cause, and suggested resolution of the error. This field will display any of the following values, as dictated by the phases' precedence. For example, if a file has both Text Extraction and Publish errors associated with it, this field will display a value of the Publish error.
    • Delete
    • Publish
    • Discover
    • Text Extraction
  • Error Phase—the phase of processing in which the error occurred. This field will display any of the following values, as dictated by the phases' precedence. For example, if a file has both Text Extraction and Publish errors associated with it, this field will display a value of Publish.
    • Delete
    • Publish
    • Discover
    • Text Extraction
  • Error Category—provides insight into the nature of the errors that have occurred on your processed files. For details, see Error categories and their probability of retry success.
  • Error Status—the current status of the error. The Current Errored Files view only displays files with an Error Status of Not Resolved.
  • File Type—the file type of the discovered file.
  • File Extension - Text—allows file extensions to be filtered by text.
  • File Size (KB)—the size of the discovered file. To specify KB or MB, this field needs to be recreated as a fixed-length text field.
  • Custodian—the custodian associated with the discovered file.
  • Data Source—the data source containing the discovered file.
  • Is Published—the yes/no value indicating if a discovered file is published.

All Errored Files view

The All Errored Files view contains all the documents that yielded errors in your workspace, including those with a current Error Status value of Resolved and Unresolved. These files are sorted by descending file size starting with the largest containers and ending with the smallest loose files.

All Errored Files

This view contains the following fields:

  • Details—the details view of all fields, including compressed metadata, of the discovered file selected.
  • File ID—the unique identifier assigned to an instance of a file.
  • Storage ID—the unique identifier assigned to a file.
  • Note: Differentiating between the File ID and the Storage ID. Every file is assigned a unique Storage ID. Every instance of a file is assigned a unique File ID. For example, you may have a Word document with a unique Storage ID. If that document is attached to two different emails, then each instance of the document attachments is assigned a unique File ID. Therefore, a single file with a Storage ID, may have multiple instances, each with their own unique File ID.

  • File Name—the original name of the discovered file.
  • Error Message—the message that details the error, cause, and suggested resolution of the error. This field will display any of the following values, as dictated by the phases' precedence. For example, if a file has both Text Extraction and Publish errors associated with it, this field will display a value of Publish.
    • Delete
    • Publish
    • Discover
    • Text Extraction
  • Error Phase—the phase of processing in which the error occurred. This field will display any of the following values, as dictated by the phases' precedence. For example, if a file has both Text Extraction and Publish errors associated with it, this field will display a value of Publish.
    • Delete
    • Publish
    • Discover
    • Text Extraction
  • Error Status—the current status of the error. This field displays any of the following values, depending on the current state of the file:
    • Ignored
    • Resolved
    • Resolving
    • Not Resolved
  • File Type—the file type of the discovered file.
  • File Extension - Text—allows file extensions to be filtered by text.
  • File Size (KB)—the size of the discovered file. To specify KB or MB, this field needs to be recreated as a fixed-length text field.
  • Custodian—the custodian associated with the discovered file.
  • Data Source—the data source containing the discovered file.
  • Is Published—the yes/no value indicating if a discovered file is published.

Details modal

You can view file details by clicking the Show details for this file icon ( Expand details icon .) Details include file data, file metadata, and content metadata.

Discovered files details view

Click the Processing Errors tab to view a summary of errors, and the error history.

Discovered files details view

  • The Error History section represents all errors that have ever occurred on a file. This acts as a timeline of the record’s errors, showing when they occurred, what they were about, and if any are still active. This includes errors resulting from retries of previous errors and contains category, phase, date/time, and message information. All times are kept in UTC format.
  • The Error Summary section displays a count of all active errors along with their associated category and phase. This is especially important when investigating errors relating to container files, as there can be many associated to the parent container during file extraction. This helps determine the level of impact the issue has as it may affect many files originating from it.

Retrying delete errors

Navigate to the Deleted Documents view to see a record of all deleted documents. The Processing Deletion? field is the yes/no indicator for deleted documents. You can filter by Error Message to see the errors that occurred during deletion. Errors can be retried using the Retry Errors mass operations option. Once deleted, documents are excluded from further processing operations (for example, deduplication, retry, and republish) and the next duplicate is published as the new primary document, if available. To see a summary of primary documents that have been replaced, see the Master Document Replacement Summary report in Processing Reports. See Mass Delete for more information on deleting documents.

Discovered Files deleted documents view retry delete button

Republishing files from the Files tab

Note: The following information is specific to republishing files at the file level via the Files tab and is distinct from republishing files via the processing set console. For details on republishing via the processing set console, see Republishing a processing set on the Publishing files topic.

The Republish mass operation provides the ability to publish specific documents on a more granular level compared to the processing set page. For example, you can select specific files and re-publish only that subset. In case only a few members of a family are selected, then this mass operation will automatically republish the whole family of documents.

Republish will overlay all metadata fields mapped at the time you started the operation. If fields are unmapped, Relativity will not remove the data from the field that was already published. The Extracted Text and Native file fields will be overwritten if they are different than the initially published document.

Navigate to the All Files view to see a record of all discovered files and filter to published files via the Is Published? field published. You can republish files at the file level using the Republish mass operation.

Discovered Files deleted documents view retry publish button

When you click Republish, a confirmation modal appears containing the following information about the job you are about to submit:

  • Selected republish count of eligible files
  • Total number of files to be republished, including families
  • Total number of mapped fields
  • Number of documents per batch when importing documents during processing

Note: If you have selected files ineligible for republish, the confirmation message reflects this by stating that there are 0 files to be republished. Ineligible files include files from unpublished processing sets or data sources, containers (for example, PST, ZIP), duplicate files, and files where the Processing Deletion? field status is Yes.

Discovered files retry publish modal

If you have selected eligible files, click Republish to proceed or Cancel to return to the All Files view.

Note: Once you republish, you cannot cancel this job.

Republishing files where job errors are present

If you attempt to republish files with job errors, you will see a confirmation message indicating you must first resolve the errors before republishing.

To resolve job errors before republishing:

  1. Click Resolve job errors.
    Relativity attempts to fix the job errors and republish the files. For more information on unresolved job errors, see Processing error resolution.

    Republish with Errors Warning Modal Window

Common use cases for using the Republish mass operation

The following are common situations in which you would opt to use the Republish mass operation:

  • Additional metadata fields were mapped after the initial publish of the processing set/data source completed. For example, you did not map the File Name field during the initial publish, which resulted in no metadata being populated for the documents. Now, you can map the File Name fields, go to the Files tab, filter for that data source, select the returned files and republish them, which will result the File name field getting populated.

  • Files that did not get published because of document or job level publish errors.

  • Newly discovered files came from a retry discovery operation after the initial publish on the set. After the initial publish completes, you can still retry discovery errors, which could result in more files to be discovered. You can select unpublished files and republish only that subset.

Retrying errors

Navigate to the All Files view to see a record of all discovered files, including files with errors found during Discovery via the Discover Error field. The files that have ready to retry errors can be retried at file level using the Retry Errors mass operations option. When you click Retry Errors, you see a confirmation modal confirming the number of files you are about to submit for retry.

Discovered files retry errors modal

Click Retry on the retry errors confirmation modal.

Retry errors mass operations option in dfv

Navigate to the All Errored Files view to see a record of all files with errors.

Retry errors mass operations option in dfv

Saved filters

You can save any filters you set on any of the views in the Files tab and reuse them in future workflows. To do this:

  1. Filter on any of the fields in the view and click Save Filters at the bottom of the view.

  2. In the Saved Filter modal, complete the Name, Notes (optional), and Conditions fields, and click Save.

    • Name—the name you want these saved filters to appear under in the saved filters browser.
    • Notes—any notes you want to enter to clarify the purpose of these saved filters.
    • Conditions—displays the filters applied to the list. To add additional condition, click + Condition. The Add Condition - Select Field modal appears, where you can select addition fields and apply conditions.

      Once you select the additional fields you want to add to the saved filters set, specify the conditions you want to apply to those fields and click Apply.

      The new field(s) and condition(s) are visible in the Saved Filter modal. Click Save to save the filter set.
  3. The saved filter is added to the list of Saved Searches.

Right-click options

You can right-click any saved filter and choose to Edit, Copy, or Delete it.

Edit

Clicking Edit opens the Saved Filter modal. You can add, remove, or modify fields and conditions, then click Save.

Copy

To make a copy of an existing filter set, right-click and select Copy.

Use the Copy option when you need a new filter with similar fields and conditions. When you copy a filter set and save it, the new set name has a (1) appended to the file name. To rename the filter set, edit the filter, then change the name on the Saved Filter modal.

Delete

To remove a filter set from the saved filter browser entirely, right-click and select Delete .

Single Download / Replace

Single Download / Replace provides the ability to download a file to your local machine for investigation. It will also provide the ability to replace and retry an original file with a new version that has been fixed during error remediation. You can only replace and retry files with errors of a status of Not Resolved.

To perform the Single Download / Replace mass action, perform the following steps:

  1. To take action on a specific file, select it, and then select the Single Download / Replace mass action.
  2. Note: If you select multiple files, you will see an error as this action is meant for one file at a time. Similarly, you will see an error message if you do not select a file.

    The Single Download / Replace dialog box opens allowing you to browse for or drop a replacement file.

  3. Once you select or browse and drop the replacement file, click Download.
  4. Once the file downloads, resolve the error, and drag the resolved file back into the Single Download / Replace modal. A message displays that the file is ready for replace and retry. If the file extensions do not match and/or if the new file is larger than the original, you can still proceed with the replace and retry action.
  5. Click Replace & Retry.
  6. To see if the action was successful, you can check the Error History by going into the Details modal. If the retry was successful, the error displays a Resolved status.

Mass download errored files

Use the Download mass operation to download errored files to your local drive. You can only use the Download mass operation on files with an error status of Not Resolved.

To mass download errored files:

  1. Use the search bar to navigate to the Files page.
  2. Select Current Errored Files from the Views drop-down menu.
  3. Select individual files to include in the download package or use the Select All option to mark all files for downloading.
  4. Note: Errored files must be in the Discover phase for mass download. You will see an error message if you attempt to download files in other phases, such as Publish or Text Extraction.

    Use the Select All Option

  5. Select Download from the mass operation drop-down menu.
  6. Download Mass Action

    A confirmation message appears that displays the number of files downloading.

    Note: Relativity downloads unique files from each selection. The number of files selected may differ from the number of files downloaded if duplicates exist.

    Download Confirmation Message

  7. Click Download.
  8. A status message appears that displays the overall progress.
  9. Download Complete Message

    When complete, files are packaged in a zip container and saved to your local drive.