Active Learning performance baselines

This page is a reference for tracking the overall performance of Active Learning in RelativityOne. Do not treat it as a benchmark for what to expect in a production client environment or a Relativity Server environment, because data, infrastructure, and configuration differ. Results may not scale linearly. Exceeding the recommended limits listed below may result in failures or a degraded Active Learning experience. If you have a larger data set, see Scaling Active Learning for options.

Active Learning project size recommendations

These recommendations are the result of extensive testing in RelativityOne. For the best user experience, we advise adding no more than 9 million total documents and no more than 1 million coded documents to an Active Learning project.

We recommend no more than 150 concurrent reviewers per project. Concurrent reviewers are defined as reviewers making coding decisions in an Active Learning queue at the same time. There is no limit to how many reviewers you can add to a queue as long as the number of concurrent reviewers remains at 150 or fewer.

| Max documents in classification index | Max coded documents | Max concurrent reviewers |
|---|---|---|
| 9 million | 1 million | 150 |
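The recommendations above can be sketched as a simple pre-flight check. This is a minimal illustration, not a Relativity API: the function name `check_project_size` and its parameters are hypothetical, and only the three limits come from this page.

```python
# Recommended maximums quoted on this page for RelativityOne.
MAX_INDEX_DOCUMENTS = 9_000_000
MAX_CODED_DOCUMENTS = 1_000_000
MAX_CONCURRENT_REVIEWERS = 150


def check_project_size(index_documents, coded_documents, concurrent_reviewers):
    """Return one warning per exceeded recommendation (hypothetical helper)."""
    checks = [
        ("documents in classification index", index_documents, MAX_INDEX_DOCUMENTS),
        ("coded documents", coded_documents, MAX_CODED_DOCUMENTS),
        ("concurrent reviewers", concurrent_reviewers, MAX_CONCURRENT_REVIEWERS),
    ]
    return [
        f"{label}: {value:,} exceeds the recommended maximum of {limit:,}"
        for label, value, limit in checks
        if value > limit
    ]


if __name__ == "__main__":
    # Illustrative values only, not measurements from this page.
    for warning in check_project_size(2_500_000, 1_200_000, 150):
        print(warning)
```

An empty list means the project is within all three recommendations. Note that the reviewer limit applies to concurrent reviewers only, not to the total number of reviewers added to a queue.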

Data set details

These tests were run on a subset of the following data set in a RelativityOne environment. Results may vary.

| Data set name | File count | Average extracted text size (KB) | Total extracted text size (GB) |
|---|---|---|---|
| Real World | 9,154,516 | 30.18 | 276.25 |

Classification index population + build results

A classification index is required for an Active Learning project. This performance run includes population of all documents and building. Start time was measured as the time the first document was sent to the Analytics server, and end time was measured as when the last document became available in Active Learning.

| Index size (documents) | Pre-coded documents | Population rate (GB/hr) | Population time (h:mm:ss) | Index build (h:mm:ss) | Total operation time (h:mm:ss) | Documents/hr |
|---|---|---|---|---|---|---|
| 1,000,000 | 10 | 13 | 3:03:29 | 0:13:30 | 3:16:59 | 304,878 |
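The relationship between the columns in this table can be checked with a little arithmetic: total operation time is population time plus index build time, and documents per hour is the index size divided by the total time in hours. The sketch below assumes that reading of the columns; the helper names `parse_hms`, `format_hms`, and `documents_per_hour` are illustrative and not part of any Relativity tooling.

```python
def parse_hms(text):
    """Convert an 'h:mm:ss' string to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def format_hms(total_seconds):
    """Convert a number of seconds back to 'h:mm:ss'."""
    hours, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{hours}:{minutes:02d}:{seconds:02d}"


def documents_per_hour(documents, total_seconds):
    """Throughput in documents per hour for a given elapsed time."""
    return documents / (total_seconds / 3600)


# Values copied from the table row for the 1,000,000-document index.
population = parse_hms("3:03:29")
build = parse_hms("0:13:30")
total = population + build

print(format_hms(total))  # 3:16:59
print(round(documents_per_hour(1_000_000, total)))
```

Computed from the exact total, the rate comes out near 304,594 documents per hour. The table's 304,878 equals 1,000,000 divided by 3.28, which suggests the total was rounded to hours before dividing; that is an inference, not something this page states.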

Active Learning index build results

Once the Active Learning model completes its initial build, it rebuilds at most once every 20 minutes to include newly coded documents. For these tests, the documents were randomly coded 50% responsive and 50% non-responsive using Relativity's sampling feature.

The test scenarios in the following table use an index that contained 1,000,000 documents.

The results listed below were measured after the initial model build completed. They do not include the population stage, because the documents had already been added to the index.

| Build | Coded documents | Index build time (h:mm:ss) |
|---|---|---|
| Build 1 | 10 | 0:13:30 |
| Build 2 | 100,000 | 0:27:17 |
| Build 3 | 200,000 | 0:31:02 |
| Build 4 | 300,000 | 0:38:29 |
| Build 5 | 400,000 | 0:50:52 |
| Build 6 | 500,000 | 0:56:21 |
| Build 7 | 600,000 | 1:06:27 |
| Build 8 | 700,000 | 1:09:29 |
| Build 9 | 800,000 | 1:09:33 |
| Build 10 | 900,000 | 2:45:09 |
| Build 11 | 1,000,000 | 3:24:16 |

The test scenarios in the following table use an index that contained 9,154,516 documents.

| Build | Coded documents | Index build time (h:mm:ss) |
|---|---|---|
| Build 1 | 10 | 1:38:30 |
| Build 2 | 100,000 | 2:59:32 |
| Build 3 | 700,000 | 5:14:47 |
| Build 4 | 800,000 | 7:39:47 |
| Build 5 | 900,000 | 6:50:19 |
| Build 6 | 1,000,000 | 6:25:35 |

Update ranks results

In an Active Learning project, you can manually update the document ranks and ensure the rank categorization field is up to date. Once you click Update Ranks, you can monitor the update progress via a fly-out modal. You can update ranks again only after the current modification is complete. Update ranks performs faster on subsequent updates than on the initial update.

| Project size (documents) | Initial update ranks operation time (h:mm:ss) | Subsequent update ranks operation time (h:mm:ss) |
|---|---|---|
| 200,000 | 0:02:48 | 0:01:59 |
| 400,000 | 0:06:12 | 0:04:42 |
| 600,000 | 0:09:42 | 0:07:02 |
| 800,000 | 0:13:48 | 0:11:39 |
| 1,000,000 | 0:22:24 | 0:15:14 |
| 9,154,516 | 5:47:36 | 3:39:54 |
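To put a number on how much faster subsequent updates ran in these tests, the times above can be converted to seconds and compared. This is a minimal sketch: the helper names `parse_hms` and `time_saved_percent` are illustrative, and the figures are copied from the table rather than produced by any Relativity API.

```python
def parse_hms(text):
    """Convert an 'h:mm:ss' string to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def time_saved_percent(initial, subsequent):
    """Percentage reduction in operation time from initial to subsequent update."""
    first = parse_hms(initial)
    later = parse_hms(subsequent)
    return 100 * (first - later) / first


# (project size, initial update ranks, subsequent update ranks) from the table.
UPDATE_RANKS = [
    (200_000, "0:02:48", "0:01:59"),
    (400_000, "0:06:12", "0:04:42"),
    (600_000, "0:09:42", "0:07:02"),
    (800_000, "0:13:48", "0:11:39"),
    (1_000_000, "0:22:24", "0:15:14"),
    (9_154_516, "5:47:36", "3:39:54"),
]

for size, initial, subsequent in UPDATE_RANKS:
    saved = time_saved_percent(initial, subsequent)
    print(f"{size:>9,} documents: subsequent update took {saved:.0f}% less time")
```

These percentages describe this test run only. As noted at the top of the page, results may not scale linearly and will vary with data, infrastructure, and configuration.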