Processing
Use Relativity’s processing feature to ingest raw data directly into your workspace for eventual search, review, and production without the need for an external tool. You can use the various processing objects to create custom processing jobs that handle a wide array of information.
Some of the primary goals of processing are to:
- Discern, at an item level, exactly what data is found in a certain source.
- Record all item-level metadata as it existed prior to processing.
- Enable defensible reduction of data by selecting only items that are appropriate to move forward to review.
Note: Processing does not perform language identification. For information on how to perform language identification using Analytics, see Language identification.
To gain control over more complex processing jobs on a granular level, you can use the Relativity Processing Console desktop application.
- Installing and configuring processing
- Supported file types
- Entities
- Password bank
- Processing profiles
- Processing sets
- Inventory
- Discovering files
- Publishing files
- Processing error workflow
- Reports
- Managing the processing queue
- Deduplication considerations
- Processing duplication workflow
- Relativity Processing Console
Application version considerations
All the content in this section and its related pages correspond to the latest version of the Processing application, which is updated on a monthly basis with the release of each patch of Relativity Server 2022.
If the processing components in your environment do not match the descriptions in this content exactly, it may be because you are using an older version of the Processing application. To get the newest version of the Processing application, upgrade to the latest product update of Relativity Server 2022.
For a list of changes made to processing per monthly product update, see the Release Notes.
You are a litigation support specialist, and the lead attorney hands you a CD containing data on a key custodian. There are about 200,000 files on the disc, and he is only looking for files from an 18-month period.
You use Relativity's processing feature to bring in that custodian’s data to Relativity and then to filter it based on what the lead attorney is looking for in this case. To do this, you first save the files into a folder and create a new custodian, Joe Smith.
Then you create a new processing set, to which you add a data source that has only Joe Smith associated with it. This data source includes a source path that is the folder in which you saved the custodian's files.
Once you save the processing set, you can inventory that custodian's data and eliminate all the files that fall outside of the 18-month period you are dealing with. Once you narrow down the data set to the most relevant files, you can
Basic processing workflow
The following steps depict a typical processing workflow that uses all available processing objects and phases. Note that each user's workflow may vary. You may not be required to follow all of these steps for every processing job you run.
- Processing sets
Entities can be created on the fly, in advance, or automatically through imports or connections to HR systems.
Processing Profiles carry over from template workspaces.
If necessary, create new Password Bank Entries with passwords for any password-protected files to be processed.
- (Optional) Inventory
Inventoried files can be filtered down based on several metadata attributes prior to publish.
Reports can be run to understand and communicate the files culled from publish.
-
Discovering filesand Publishing files
-
Post publish—view, ignore, or retry any errors that occurred during any phase of the processing job. If needed, republish the files.
-
Logging for processing
The logging framework enables you to efficiently gather runtime diagnostic information. You can use logging for troubleshooting application problems when you need a very granular level of detail, for example, when working with a Relativity Support representative.
Relativity system components that can log messages are identified based on the system-subsystem-application designation. When troubleshooting, use the system-subsystem-application matrix to configure logging to target a specific Relativity component, such as the Processing application.
Note: It is recommended that you not set your logging to verbose when publishing documents to a workspace, as doing so can cause your worker to run out of resources, such as CPU, RAM, disk usage, or others, which then causes your publish job to cease entirely. If you need to use verbose logging in order to collect detailed logs, do so for short periods of time only, under 5 minutes, and have a developer on hand to troubleshoot if any issues occur.
For more information, see Logging and Logging system-subsystem-application matrix.