aiR for Review

aiR for Review harnesses the power of large language models (LLM) to review documents. It uses generative artificial intelligence (AI) to simulate the actions of a human reviewer by finding and describing relevant documents according to the review instructions (prompt criteria) that you provide. It identifies the documents, describes why they are relevant using natural language, and demonstrates relevance using citations from the document.

A few benefits of application include:

  • Highly efficient, low-cost document analysis
  • Quick discovery of important issues and criteria
  • Consistent, cohesive analysis across all documents

Below are some common use cases for it:

  • Beginning the review process—prioritize the most important documents to give to reviewers.
  • First-pass review—determine what you need to produce and discover essential insights.
  • Gaining early case insights—learn more about your matter right from the start.
  • Internal investigations—find documents and insights that help you understand the story hidden in your data.
  • Analyzing productions from other parties—reduce the effort to find important material and get it into the hands of decision makers.
  • Quality control for traditional review—compare aiR for Review's coding predictions to decisions made by reviewers to accelerate QC and improve results.

See these related trainings, articles, and white papers:

Analysis review types

aiR for Review offers three analysis types, each suited to a specific review or investigation phase.

Analysis Type Description
Relevance Use to find documents that are relevant to a case or situation that you describe, such as documents responsive to a production request.
Key Documents Use to find documents that are "hot" or important to a case or investigation, such as those that might be critical or embarrassing to one party or another.
Issues Use to find documents that include content that falls under specific categories. For example, you might use this to check whether documents involve coercion, retaliation, or a combination of both.

aiR for Review workflow

aiR for Review's process is similar to training a human reviewer: explain the case and its relevance criteria, hand over the documents, and check the results. If the application misunderstood any part of the prompt criteria, simply explain that part in more detail, then try again.

The workflow has three phases:

  1. Develop—write the Prompt Criteria (review instructions), test on a small document set, and tweak until the results align with human review.
  2. Validate—run the Prompt Criteria on a slightly larger set of documents and compare to results from senior reviewers.
  3. Run—use the verified Prompt Criteria on much larger sets of documents.

Diagram representing workflow from Develop to Validate to Run.

Within RelativityOne, the main steps are:

  1. Select the documents to review.
  2. Create an aiR for Review project. See Creating an aiR for Review project for more information.
  3. Write and submit the review instructions, collectively called Prompt Criteria. See Developing prompt criteria for more information.
  4. Review the results (citations, rationale, considerations, recommendation). See aiR for Review results for more information.

When setting up the first analysis, we recommend running it on a sample set of documents that was already coded by human reviewers. If the resulting predictions are different from the human coding, revise the prompt criteria and try again. This could include rewriting unclear instructions, defining an acronym or a code word, or adding more detail to an issue definition.

For additional workflow help and examples, see Workflows for Applying aiR for Review on the Community site.

How it works

aiR for Review's analysis is powered by Azure OpenAI's GPT-4 Omni large language model. The LLM is designed to understand and generate human language. It is trained on billions of documents from open datasets and the web.

When you submit prompt criteria and a set of documents to aiR for Review, Relativity sends the first document to Azure OpenAI and asks it to review the document according to the prompt criteria. After Azure OpenAI returns its results, Relativity sends the next document. The LLM reviews each document independently, and it does not learn from previous documents. Unlike Review Center, which makes its predictions based on learning from the document set, the LLM makes its predictions based on the prompt criteria and its built-in training.

Azure OpenAI does not retain any data from the documents being analyzed. Data you submit for processing by Azure OpenAI is not retained beyond your organization’s instance, nor is it used to train any other generative AI models from Relativity, Microsoft, or any other third party. For more information, see the white paper A Focus on Security and Privacy in Relativity’s Approach to Generative AI.

For more information on using generative AI for document review, we recommend:

Understanding documents and billing

For billing purposes, a document unit is a single document. The initial pre-run estimate may be higher than the actual units billed because of canceled jobs or document errors. To find the actual document units that are billed, see Cost Explorer .

A document will be billed each time it runs through aiR for Review, regardless of whether that document ran before.

Customer may not consolidate documents or otherwise take steps to circumvent the aiR for Review Document Unit limits, including for the purpose of reducing the Customer's costs. If Customer takes such action, Customer may be subject to additional charges and other corrective measures as deemed appropriate by Relativity.

Regional availability of aiR for Review

aiR for Review's availability may vary by region, as well as the availability of the LLM model used. Once OpenAI releases an LLM model to a region, Relativity tests it and notifies clients before upgrading aiR for Review.

The following table lists the current LLM model available and date it was deployed to aiR for Review per region. Also listed is the current version of aiR for Review, which may vary by region.

Region

Current LLM Model

aiR for Review Model
Deployment Date

Current aiR for Review
Version

United States

GPT-4 Omni - November

2025-06-16

2025.06.1

United Kingdom

GPT-4 Omni - November

2025-06-16 2025.06.1
Australia

GPT-4 Omni - November

2025-06-16 2025.06.1

Canada

GPT-4 Omni - November

2025-06-16 2025.06.1

France

GPT-4 Omni - November

2025-06-16 2025.06.1

Germany

GPT-4 Omni - November

2025-06-16 2025.06.1
Hong Kong

GPT-4 Omni - November

2025-07-08 2025.06.1
India

GPT-4 Omni - November

2025-07-08 2025.06.1

Ireland

GPT-4 Omni - November

2025-06-16 2025.06.1
Japan

GPT-4 Omni - November

2025-07-08 2025.06.1

Netherlands

GPT-4 Omni - November

2025-06-16 2025.06.1
Singapore

GPT-4 Omni - November

2025-07-08 2025.06.1
South Korea

GPT-4 Omni - November

2025-07-08 2025.06.1

Switzerland

GPT-4 Omni - November

2025-06-16 2025.06.1

When using Relativity's AI technology, the selected customer data may be processed outside of your specific Geo location as provided below. If not provided below, please contact your Relativity Success Manager for further information.

RelativityOne Deployment Geography aiR Processing Geography
APAC (Hong Kong, Japan, Singapore, South Korea) Japan
Australia Australia
Canada Canada
EEA (France, Germany, Ireland, Netherlands) EEA (currently Germany)
India India
Switzerland Switzerland
United Kingdom United Kingdom
United States United States

For more details about availability in your region, contact your account representative.

For technical specifications of your region's current LLM model, see documentation on the Azure website.

Language support

The underlying LLM used by aiR for Review has been evaluated for use with 83 languages. While aiR for Review itself has been primarily tested on English-language documents, unofficial testing with non-English datasets shows encouraging results.

If you use the application with non-English data sets, we recommend the following:

  • Rigorously follow best practices for writing. For more information, see Best practices.
  • Iterate on the prompt criteria. For more information, see Revising the prompt criteria.
  • Analyze the extracted text as-is. You do not need to translate it into English.
  • When possible, write the prompt criteria in the same language as the documents being analyzed. This should also be the subject matter expert's native language. If that is not possible, write the prompt criteria in English.

When you view the results of the analysis, all citations stay in the same language as the document they cite. By default, the rationales and considerations are in English. If you want the rationales and considerations to be in a different language, type “Write rationales and considerations in [desired language]” in the Additional Context field of the prompt criteria.

For the study used to evaluate Azure OpenAI's GPT-4 model across languages, see MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks on the arXiv website.

Analyzing emojis

aiR for Review has not been specifically tested for analyzing emojis. However, the underlying LLM does understand Unicode emojis. It also understands other formats that could normally be understood by a human reviewer. For example, an emoji that is extracted to text as :smile: would be understood as smiling.