Revising the prompt criteria

After you run an aiR for Review job on the sample set for the first time, use the initial results on the dashboard as feedback for improving the prompt criteria. The cycle of examining the results, revising the prompt criteria, and then running a new job on the sample documents is known as iterating on the prompt criteria. Refer to Best practices for more information. Also see Job capacity, size limitations, and speed for details on document and prompt limits.

In particular, ask the following questions about each document:

  • Did aiR for Review and the human reviewer agree on the relevance of the document?
  • Read the aiR for Review rationale and considerations. Do they make sense?
  • Do the citations make sense?

For all of these questions, if you see something incorrect, note where aiR for Review seems to be confused and rephrase the prompt criteria. The most common sources of confusion are:

  • Insufficient context—For example, an internal acronym, key person, or code word may not have been defined. To fix this, add it to the proper section of the Case Summary tab.
  • Ambiguous instructions or unclear language—To fix this, edit the instructions on the Relevance, Key Documents, or Issues tabs.

In general, consider how you would help a human reviewer making the same mistakes. For example, if aiR for Review is having trouble identifying a specific issue, try explaining the criteria for that issue with simpler language.

After you have revised the prompt criteria to address any weak points, run the analysis again. Continue refining the prompt criteria until the results accurately predict the human coding decisions for all test documents in the sample.
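One way to gauge progress between iterations is to compare aiR for Review's relevance calls against the human coding decisions for the sample. The sketch below is a minimal, hypothetical example, assuming you have exported the sample results to a CSV file with columns named `control_number`, `air_relevance`, and `reviewer_relevance`; these names are illustrative, not a fixed aiR for Review export format. It reports the agreement rate and lists the disagreements to re-read before revising the prompt criteria.

```python
import csv

# Hypothetical export of sample results with columns:
# control_number, air_relevance, reviewer_relevance.
# Adjust the file name and column names to match your actual export.
agree, disagreements = 0, []

with open("sample_results.csv", newline="") as f:
    for row in csv.DictReader(f):
        if row["air_relevance"].strip().lower() == row["reviewer_relevance"].strip().lower():
            agree += 1
        else:
            disagreements.append(row["control_number"])

total = agree + len(disagreements)
print(f"Agreement: {agree}/{total} ({agree / total:.0%})")
print("Re-read rationale and citations for:", ", ".join(disagreements))
```

Documents in the disagreement list are the best candidates for the questions above: check whether the rationale, considerations, and citations point to missing context or ambiguous instructions.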

aiR for Review only looks at the extracted text of each document. If a human reviewer marked a document as relevant because of an attachment or other criteria beyond the extracted text, aiR for Review will not be able to match that relevance decision.

For additional resources, refer to the related articles on the Community site.

Increasing the job size

When aiR for Review accurately matches human coding decisions on the initial sample documents, increase the sample size. Typically, we recommend starting with an initial sample of about 50 documents, then increasing it to include another 50. However, you may find a different number works better for your project.
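If you want to choose the additional documents at random rather than hand-picking them, a simple random draw keeps the expanded sample representative. The sketch below is hypothetical, assuming you have a plain text file of control numbers for documents that are not yet in the sample; the file name and batch size are illustrative, not part of aiR for Review.

```python
import random

BATCH_SIZE = 50  # roughly matches the recommended increment; adjust for your project

# Hypothetical input: one control number per line for documents not yet sampled.
with open("unsampled_control_numbers.txt") as f:
    candidates = [line.strip() for line in f if line.strip()]

# Draw the next batch at random from the remaining pool.
next_batch = random.sample(candidates, min(BATCH_SIZE, len(candidates)))
print("\n".join(next_batch))
```

Once selected, add these documents to the project's data source as described in the steps below.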

To increase the job size:

  1. Add the new documents to the saved search that acts as the project's data source. For more information about saved searches, see Creating or editing a saved search.
  2. Have a skilled human reviewer code the new documents. We recommend doing this before running aiR for Review so that the reviewer is not biased by aiR's predictions.
  3. On the aiR for Review Projects tab, select the project.
  4. At the top of the project dashboard, click the refresh symbol next to the data source's name.
  5. In the Project Metrics section, click Not Analyzed. This selects the new documents.
  6. After the document count has updated, click Analyze [X] documents.
    The analysis job runs on the new documents, and the previously analyzed documents keep their existing results.

After you have run the job on the larger sample, continue revising the prompt criteria until the results are satisfactory. Keep increasing the job size incrementally until you are confident in the prompt criteria, then run the refined prompt criteria on the larger set of documents, either from the dashboard or as a mass operation.