Processing an RSMF file

Relativity Processing is the only viable method of importing RSMF files into Relativity. To process an RSMF successfully, point Processing to the RSMF container. This is because Simple File Upload doesn't recognize RSMF files, and the Relativity Desktop doesn't properly extract attachments and then it requires you to manually map attachments to the appropriate messages.

If you are using a non-Relativity tool, attachments need to be manually tied to the appropriate messages. RSMF is wrapped in an EML so other processing tools should be able to process the other aspects of the RSMF file.

To prepare your RSMF files to be processed into RelativityOne, use Staging Explorer to get the data ready. To learn more, visit RelativityOne Staging Explorer.

This page contains the following sections:

Checklist for processing an RSMF file

Processing largely handles RSMF files like any other Internet Message Format such as EML. Within an RSMF file, there is an EML with a ZIP as an attachment, and within the ZIP is the JSON along with any attachments. Note that to successfully process an RSMF file, only the RSMF file needs to be processed by Relativity Processing. If you include information in the Body field, it will display in the Extracted Text Short Message Viewer.

We recommend preparing the following before you process an RSMF file:

Feature Steps needed to prepare
Processing profiles
  • Default processing profile will handle RSMF and subsequent mapping.

  • Ensure Extract children is set to Yes, otherwise this results in missing attachments.
  • The Processing application comes with one field that is custom to RSMF. The Relativity Attachment ID is the only system-mapped field and cannot be changed.
  • Ensure that you select a relational field such as Group Identifier for the Source field. See screenshots below for an example. Setting up Group Identifier in this manner will ensure that attachments appear in the Short Message Viewer and are tied to the correct parent messages.
RSMF Header fields

The fields marked as required must be included in the RSMF Header. The following fields are populated in the document list: To, From, CC, Subject Sent Date, Last Modified. Any custom RSMF headers are also populated if the mapped fields are created prior to processing. RSMF headers are metadata fields so they become available for mapping once extracted from the actual file. To learn more, see Mapping processing fields.

The following headers are supported: X-RSMF-Version, X-RSMF-Generator, X-RSMF-BeginDate, X-RSMF-EndDate, X-RSMF-EventCount, X-RSMF-Application, X-RSMF-Custodian, X-RSMF-Participants, X-RSMF-AttachmentCount, X-RSMF-EventCollectionID.

Note: RSMF files greater than 2GB are not supported and may be unable to process. We recommend creating and processing RSMF files no greater than 2GB.

Note: We recommend that an RSMF file should have no more than 10,000 events to ensure high performance in Relativity.

How parts of RSMF files are processed

The following subsections describe how different parts of RSMF are handled during processing.

RSMF Header

The optional headers that begin with X-RSMF are extracted automatically during processing to the following mappable fields:

  • X-RSMF-BeginDate → Rsmf/BeginDate

  • X-RSMF-EndDate → Rsmf/EndDate

  • X-RSMF-EventCount → Rsmf/MessageCount

  • X-RSMF-Version → Rsmf/Version

  • X-RSMF-Generator → Rsmf/Generator

  • X-RSMF-Application → Rsmf/Application

  • X-RSMF-Custodian → Rsmf/Custodian

  • X-RSMF-Participants → Rsmf/Participants

  • X-RSMF-AttachmentCount → Rsmf/AttachmentCount

  • X-RSMF-EventCollectionID → Rsmf/EventCollectionID

Since the RSMF file type is an open format, custom headers are supported as well. They are extracted automatically to the mappable fields using the following: X-RSMF-CustomHeader → Rsmf/CustomHeader.

Note: RSMF headers (both standard and custom) are extracted as metadata fields. They become available for mapping after you process a file that contains them. If you don't see the headers from the list above in your workspace, make sure you are processing RSMF files using them.

For metadata to be extracted from fields they should be created before processing. You may be required to process file twice when setting up headers initially. First, to map a header and then the second time to extract the metadata associated with a header into the document. To learn more, Mapping processing fields.

If the RSMF file doesn't include a Sent Date, and the X-RSMF-BeginDate header exists, that header will be mapped to the Sent Date field.

If Sent Date is included, then the value in X-RSMF-BeginDate will map to a new field called Rsmf/BeginDate.

If the RSMF file does not include Last Modified, then X-RSMF-EndDate will map to Last Modified.

If the RSMF file includes Last Modified, then X-RSMF-EndDate will map to Rsmf/EndDate.

Note: The Message Header field should exist prior to Processing an RSMF file. This field will be populated with all of the metadata stored in the EML header of the RSMF file. The Message Header field is a non-relational, ‘Long Text’ field.

To learn more about what should be included in the header and body, see Requirements for RSMF head and body.

RSMF.ZIP

The rsmf_manifest.json file is not discovered as a publishable file.

For any other file within the rsmf.zip, the following rules apply:

  • If the file is referenced within the rsmf_manifest.json:
    • The virtual path will exclude the rsmf.zip portion in order to avoid the creation of an rsmf.zip folder in Relativity once the file is published.
    • When published, a field called Relativity Attachment ID will be populated with additional metadata. If the file is an attachment, the metadata will be the id of the attachment as specified within the rsmf_manifest.json. If the file is an avatar, the metadata will be the name of the file. The Relativity Attachment ID is a system field that the Short Message Viewer uses to provide enhanced support for attachments and avatars.
  • If the file is not referenced within the rsmf_manifest.json, it is processed as any other file contained within a zip within an EML. So, when published, it will create a folder called rsmf.zip and the file will be placed within there.
  • All discovered files, whether referenced within the rsmf_manifest.json or not, will be fully processed and given the same Group Identifier.

RSMF deduplication

The deduplication of RSMF files is based on the same method as the deduplication of emails. Relativity is calculating the hash based on the contents of the file and comparing this hash between files, based on the following factors:

  • Header Hash (Subject, From, Date)

  • Recipient Hash (To)

  • Message Body Hash

  • Attachment Hash

To learn more about this algorithm, visit Deduplication considerations.

Relativity AttachmentID

There are four things you need to make sure RSMF files display correctly:

  1. The Family (Group) Identifier field is mapped.

  2. All documents are contained in the Family (Group) Identifier relational family.

  3. The Relativity AttachmentID field is mapped.

  4. Ensure the Extract children field on the processing profile is enabled. If this option is disabled, enable it and re-process the data.

The Short Message Viewer looks for attachment files in the Family (Group) Identifier's relational group that have the Relativity AttachmentID field set to the ID associated to that file in the RSMF's manfiest.json. If either the Relativity AttachmentID or Family (Group) Identifier field is not set, the Viewer cannot find the relevant image to display, and instead displays an error.

Troubleshooting attachments

If the Short Message Viewer shows an error on the attachments, you can troubleshoot by doing the following:

  • Verify that the Extract children field on the processing profile is enabled. If the setting is disabled, enable it and re-process the data.

  • Verify that the Family (Group) Identifier field is mapped. This is a relational field which indicates the family relationship between the parent RSMF and the children attachments. If this field is not mapped, the Viewer will not know which RSMF file the attachments belong to.

  • Consider running the RSMF file through the RSMF validator. To use the validator, visit RSMF validator.

Results after successful processing

Once processing has been successfully completed, the method in which the attachments\embedded images display in the Document list is as follows:

  • The top-level document, the parent, will contain the entire thread of the conversation. Each of the child IMG files are images that occur within that conversation. These will appear separately, as children of the conversation in much the same way an embedded image in an EML file would occur as a child of the email.

    The document list displays with Family Group highlighted.

Short Message Viewer

Relativity's review interface has been updated with a new viewer that will automatically be displayed when using the Viewer radio button for any document identified as Relativity Short Message Format. The new viewer provides features to make it easy to review all aspects of RSMF files including conversations, participants, messages, and attachments. To learn more, visit the Short Message Viewer.

Troubleshooting steps

If the RSMF file exceeds the limit of 2 GB and is reported as error during processing:

  • To be able to successfully process this RSMF file consider slicing it below 2 GB using the same software that was used to generate the file or contact the party that provided you the original file.

If the Short Message Viewer shows an error on the attachments, troubleshoot issues with the following tips:

  • Check if the “Extract children” setting on the processing profile is enabled. If disabled, enable this setting and re-process the data.
  • Check that the Family (Group) Identifier field is mapped. This is a relational field which indicates the family relationship between the parent RSMF and the children attachments. If this field is not mapped, the Viewer will not know which RSMF file the attachments belong to.
  • Consider running the RSMF file through the RSMF validator if attachments are missing.

If you are missing attachments and see errors in the RSMF file where the attachments should be, do the following:

  • Check if the Extract children setting on the processing profile is enabled. If disabled, enable this setting and re-process the data.

If the attachments are not linked to the RSMF file(s), do the following:

  • Check that the Family (Group) Identifier field is mapped. This is a relational field which indicates the family relationship between the parent RSMF and the children attachments.
  • Consider running the RSMF file through the RSMF validator if attachments are missing.

If it appears that Processing is ignoring the Sent Date field or Last Modified field, you can do the following:

  • If the RSMF file doesn't include a Sent Date, and the X-RSMF-BeginDate header exists, that header will be mapped to the Sent Date field. The latter takes precedent to the former if both are present.
  • The same goes for the ‘Last Modified’ field and X-RSMF-EndDate with taking precedent if both are present.

If all of your data is populated accurately in the Group Identifier field, but your attachments are still not showing up, then verify that the Group Identifier field has the correct GUID. If you have SQL access, query for where the GUID is 1F036749-A691-4AA8-8CF7-5EEB80C36CAF. Otherwise, please contact Relativity Support for assistance.