

Use this guide to learn about the recommended infrastructure for a Relativity installation. It contains general information about system requirements, server functionality, and other guidelines for infrastructure planning. It also includes a list of frequently asked questions (FAQs) grouped by category, such as security and virtualization, for easy reference.
Note: See System Requirements for a complete discussion of server recommendations.
This section briefly explains the types and functionality of the different servers used in a Relativity installation. It also includes basic guidelines for memory requirements, configurations, and other information you can use to develop an effective infrastructure plan.
The Secret Store provides secure auditable storage for Relativity secrets. A secret could be user IDs and passwords stored within the Relativity response file, a database connection string stored in an application configuration file or instance settings containing confidential credentials. Use the following guidelines when configuring Secret Store.
The SQL Server stores the databases, which contain metadata, extracted text, and SQL log files. Use the following guidelines when configuring this server.
Relativity uses several different types of agents to perform background processing, such as imaging, branding, and indexing. These agents run on scalable agent servers. These agents can be processor-intensive depending on the types of jobs that are running. Use the following guidelines when configuring an agent server.
Relativity runs on the web (or application) server. Use the following guidelines when configuring this server.
Analytics is one of the default searching methods Relativity provides, which also includes Keyword Search (SQL Full Text Searching), and dtSearch. You can also use the analytics server for data analysis features such as email threading, near duplicate identification, clustering, and categorization. The size of the indexes and structured analytics sets that these searching analysis methods use depends on the number of columns, records, documents, and type of data being indexed on the workspace level.
Relativity uses the file server to store native and image files. Use the following guidelines when configuring this server.
The following FAQs provide information on Relativity agents and agent servers.
The minimum requirements for the agent server support one instance of all Relativity agents.
See the Agents guide for information on how many agents your version of Relativity supports and how to add them to your environment.
Yes, you can scale the number of agents. In addition, you can use more than one agent server in your environment.
For example, Relativity environments typically have multiple Branding Manager, dtSearch Index Worker, and agents. Partners who use Relativity for productions and imaging often have many Branding Manager or agents to speed up these jobs. If you have a larger Relativity instance, use several agent server.
We suggest allocating one processor core and 1 GB of RAM for each additional agent in the environment.
Agent machines are server class. To use these machines for processing, they must communicate with the SQL Server and storage server.
The following FAQs provide information on Analytics servers, indexes, and disk space requirements.
The web server must communicate with the analytics server via TCP ports 445, 8080, and 8443.
The initial communication from the analytics server to SQL is on ports TCP: 1433 and UDP:1434. By default, communication from the SQL Server to the analytics server occurs on port 8080 or 8443 (REST). However, the response from the analytics server to the SQL Server during a search query can occur on any port, thus all response (ephemeral) ports must remain open. A typical firewall rule would be to allow TCP established from analytics server port 8080 and 8443 to any port range 49152-65535.
The communication between Analytics and SQL is on port 1433.
Using default settings, the average amount of disk space for the Analytics index is equal to 20% of the size of the MDF file of the workspace database. This metric has a high standard deviation and tends to scale up or down linearly. This metric indicates an average amount of disk space usage, but actual indexes may vary considerably in size. The amount of space required depends on the size of the extracted text being indexed, as well as the number of documents, unique terms, and dimensions used to build the index.
The Analytics Index Share houses all of your Analytics data for a particular Analytics server, and it can grow to be very large. We have found that NTFS file systems work for small environments, but if you anticipate running sets of 10 million or more documents through your Analytics Engine, you should use a file system that supports larger files such as exFAT or ReFS. We do not have a recommendation for either file system, so you must determine which is the better fit for you.
No. An analytics server may only reference one disk location for the server’s analytics indexes and structured analytics sets.
Yes. Analytics servers are scalable and are available as Servers in Relativity. You can allocate the Analytics servers to different workspaces via Resource Pools. Additionally, you can dedicate Analytics servers to Analytics Indexing, Structured Analytics, or both feature sets. For more information on configuring these options, see Adding an Analytics server.
The following FAQs provide information on scaling, performance, memory, and other issues related to the architecture of a Relativity installation.
Relativity is a web-based review platform that rests on a distributed, grid-based computational model.
You can use an Ethernet connection for the servers in your Relativity installation. Minimal connectivity speeds between SQL and the storage location are 4Gbps.
We advise against this server configuration. When users are heavily utilizing the agents, this configuration causes performance problems. It also makes troubleshooting performance issues difficult.
Yes, Relativity can run over a WAN if it's set up for pass-through authentication. All authentications must pass through to Relativity.
For a default named SQL Server, switching port 1433 won't work.
For a named instance of SQL Server, you can use a static port for SQL communication. By opening up port 1434 and setting the SQL Server to use another port (for example, 1435), SQL will begin to communicate using port 1435.
However, for Relativity to work and communicate effectively over the chosen static port, UDP port 1434 also needs to remain open. Relativity uses this port for the initial connection to the SQL Server and then begins to use port 1435. Once SQL has been configured to use a static port such at 1435, if port 1434 isn't open, certain actions in Relativity might break. Some of the places where it breaks include upgrading and the Relativity viewer.
The following FAQs provide information on storing and managing search indexes in Relativity.
Indexes for the three searching methods offered in Relativity are stored in the following locations:
Yes, you can configure your Relativity environment with multiple repositories and dtSearch indexes as necessary.
The following FAQs provide information on various file storage solutions for Relativity.
No, a Windows installation isn't required, but you must provide Relativity with the path to the location where the files reside.
We don't recommend using thin virtual provisioning in VMware for the following reasons:
Provisioning can cause Relativity to reach capacity and a disk grow (zeroing) event to occur. If provisioning uses zeroing to grow the disks, performance is throttled at as little as 50 percent of expected disk IO.
We do not have any specific storage speed or IOPS requirements for Relativity. However, you can use the following guidelines when selecting storage devices:
We use Compellent storage, while many of our partners have recently purchased EMC VNX devices. Other devices, such as NetAPP and IBM, also use some type of auto-tiering.
The following FAQs summarize information about integrating Relativity with different systems and platforms.
System or Platform | Integrates with Relativity? | Comments |
---|---|---|
DB2 database platforms | No | |
SQL database platforms | Only Microsoft SQL | |
Lotus Domino email system | Loosely integrated | The Native Imaging platform can read email messages stored in Lotus Notes databases. |
Microsoft Outlook Exchange email system | Yes | |
EMC SourceOne email archiving system | No | |
IBM CM8 (z/OS platform) | No | |
Network file systems | No | |
IBM Lotus Quickr and IBM TeamRoom | No | |
LDAP/Active Directory servers | Yes | |
ImagePlus system | No |
These FAQs provide information about optimal performance configurations for Relativity installations as well as latency detection.
The Virtual Machine Communication Interface (VMCI) is an infrastructure that provides fast and efficient communication between a virtual machine and the host operating system. It also provides fast and efficient communication between two or more virtual machines on the same host.
You can enable your application to use VMCI programmatically. VMCI provides greater inter-guest communication when virtual machine guests share a host. We don't offer built-in support for this feature. However, you may want to research existing tools for porting specific TCP endpoints to VMCI sockets.
These FAQs provide information about Processing in a Relativity environment.
By default, imaging jobs take higher priority over processing jobs. However, you can access the management to change the priority of running jobs. There is never a need to manually configure things. If a processing worker has nothing to do, it asks the Queue Manager to give it something to do. Any processing worker can work on any job, regardless of job type. It's uncommon for even a single worker to have multiple threads working on both imaging and processing simultaneously.
Yes, you should back up processing databases.
We recommend a decent amount of HD space on the worker machine for processing, because up to 16 threads can perform processing on each worker. Some of the files each worker processes could be quite large. 250 GB is a reasonable amount of space to perform temporary work before files get transferred to the network. The space is strictly used for temporary storage of data while workers are processing.
Use Relativity Processing to ingest raw data directly into your workspace for eventual search and review without the need for an external tool.
Yes, Relativity supports virtualized processing SQL Servers as long as the server meets the minimum requirements.
Yes, you can specify the type of jobs a worker can perform by editing the Worker designated work field on the worker server. You can configure a worker for one or more or all of the following jobs:
For smaller tier 1 environments, one server is enough.
Depending on the size of the environment, you can store the Service Bus Databases on the Invariant SQL Server.
The following FAQs provide information about Relativity Legal Hold (RLH).
You can only upgrade from version 2.1.127.5 and above to 3.0. Any version beneath that, including 1.0, does not have a direct upgrade path. If a client really needs to maintain their data during an upgrade from an older version, reach out to PM.
You can see it all in one place from the Configuration tab or look at the installed Applications.
You can install to any workspace that doesn"t have Processing already installed.
We recommend starting out with a single agent because it doesn’t use a lot of system resources. You can add more agents, if necessary.
There is no authentication done when accessing the Portal, but the Portal link is unique to every custodian. To prevent unauthorized use (via email forwarding of the link), you can specify the maximum number of times the link can be used or specify a link expiration date.
No, it doesn't.
The following FAQs provide information about dtSearch in a Relativity environment.
Yes, you can configure your Relativity environment with multiple repositories and dtSearch indexes as necessary.
Each Relativity dtSearch search agent should have its own server or virtual machine, similar to existing dtSearch WebAPI web servers. System admins have the ability to add the search agent to servers with other Relativity agents. However, this is not advisable as one dtSearch search agent will be able to utilize all available processor cores on a server. Therefore, each Relativity agent server that is designated to be a dtSearch search agent server should only have one dtSearch search agent and nothing else.
The dtSearch indexes grow to approximately 25% of the total database size. For example, if you had 10 TB of total files, the database would be 3-4 TB, and you would need about 1 TB for the index storage.
The default size of the sub-index is controlled through an instance setting. It can also be adjusted per each index creation through a setting on the index creation configuration page. The default value will is 250,000 documents. Larger data sets may benefit from having the sub-index size increased to 500,000 documents.
The following FAQs provide information on managing a SQL Server in a Relativity environment.
Yes, you can use a solid-state drive (SSD) on the SQL Server to improve performance.
We don't recommend storing dtSearch indexes on the SQL Server. Other than this location, you can store the indexes in the file repository location.
Multiple files provide better I/O performance and less contention on the global allocation structures. For higher end systems with more than 16 cores, Microsoft suggests beginning with eight database files and adding more if there is still contention.
There are scripts available to measure contention. Most Relativity environments work well with eight data files. The
No, SQL aliasing isn't standard configuration supported by Relativity. Our Customer Support will work with you to determine whether or not aliasing can be successfully used in your environment.
We support Relativity in a mirrored environment, but mirroring is known to fail. If you use this configuration, and experience a failover, follow the instructions in the Configuring Relativity after a Disaster Recovery Failover document in the Relativity Community. Relativity won't be usable until you run the scripts included in that document.
Yes, the focus on connection speed should be between the SQL and the storage system. A SQL Server on 1 GB iSCSI isn’t sufficient. It constrains you during backups, index rebuilds, table scans, etc. These large sequential operations can easily saturate a 1 GB pipe, and storage quickly becomes a bottleneck. The minimum requirement we have is 4GBPS, but if possible, we recommend moving this up to 8-10GBPS.
With the use of multiple active SQL Servers, you can distribute your workspace databases across more than one SQL instance.
Yes, another instance can be added to this configuration and then moved to a second node or converted to an active/active cluster.
If there is a need to change the name of the primary Relativity SQL instance during the reconfiguration, Customer Support will assist you with updating references to it in Relativity. (The primary Relativity SQL instance houses the EDDS database.)
No, you should never activate the Priority Boost setting.
SSL transit is the most commonly used database encryption method in practice. Some clients have been known to use Transparent Data Encryption, which is available in the enterprise edition of SQL Server.
Relativity 9.1 and 9.2 support SQL Server 2014 and Relativity 9.4 supports SQL Server 2016.
The physical memory limitation on Windows Server 2008 R2 Standard Edition is 32 GB, so you need Enterprise Edition if using this version of Windows. Windows Server 2012 R2 Standard Edition has a 4TB physical memory limitation.
This depends mostly upon your memory requirements for the SQL Server. SQL Server 2012 and below support up to 64 GB RAM for Standard Edition. SQL Server 2014 supports up to 128 GB RAM for Standard Edition.
The database is small at the outset. It grows as it ages (from audits) and as data is loaded into it. The amount of SQL data created during a load can be anywhere from 1-2 times the size of the load file being loaded. This can possibly be even more if a lot of ANSI data is loaded into Unicode fields.
We require both a Windows Relativity service account and a SQL Server account (EDDSDBO), but the EDDSDBO server account is the only one that connects to the database itself.
The Windows Relativity service account requires permissions to the shared BCP Path folder on the SQL Server so that it can drop data there for loading.
What are the recommendations for virtualizing servers in a Relativity installation?
We recommend virtualizing all Relativity roles with the exception of SQL. Any hypervisor is supported.
Note: Virtualizing SQL may simplify meeting High Availability (HA) and Disaster Recovery (DR) requirements. However, it can introduce additional layers of complexity when properly configuring and troubleshooting performance and stability related issues.
We do not have any special recommendations for using VMware with Relativity. For general information, see the VMWare white papers.
The following FAQs provide information web servers and other related information.
Use the following guidelines to determine the number of web servers required for your Relativity installation:
To configure the servers for DMZ access, you must open the following ports between the web server in the DMZ and all other servers in your environment:
Only the web servers should be in the DMZ.
Note: Once you click Save, Relativity checks the URL to see if it accessible so make sure the path is set up beforehand.
RULB works well with NLB and needs no extra configuration. All you have to do is turn on RULB, and Relativity does the rest. You probably won't see the NLB URL in IE change, but users should land on different web servers.
RULB also works with SSL certificates.
These are dynamic load balancing methods. They distribute connections based on aspects of real-time server performance analysis, such as the current number of connections per node or the fastest node response time. Load balancing calculations may be localized to each pool (member-based calculation) or they may apply to all pools of which a server is a member (node-based calculation).
With all pool members available (meaning they are enabled, marked up, and have not exceeded their connection limit), the LTM system distributes connections to all members in the highest priority group only. That is, the group designated by the highest priority number. The Priority Group Activation value determines the minimum number of members that must remain available for traffic to be confined to that group. If the number of available members in the highest priority group drops below the minimum number, the LTM system also distributes traffic to the next highest priority group, and so on.
RULB (Relativity.User.Load.Balancing) or NLB are the most commonly used load balancers.
What load balancing algorithms are available as Predictor options?
The following load balancing algorithms are available as Predictor options:
The following FAQs provide information on Secret Store and related information.
The default ports utilized for secret store are TCP 9090 (the service port) and TCP 10000 (the nonce port). All Relativity Servers must be able to communicate with Secret Store over these ports and these ports are configurable.
No, it is not possible to recover secrets from the store without the unseal key. In this scenario, it is recommended that the existing secret store database be backed up, deleted and that Secret Store be reinstalled to create a new database and generate a new unseal key. Secrets should be reimported into the store at this point.
Why was this not helpful?
Check one that applies.
Thank you for your feedback.
Want to tell us more?
Great!