DISCO AI’s tag predictions help lawyers review documents faster and more accurately. This guide will cover:
- Understanding DISCO AI's tag predictions
- Setting up DISCO AI tag predictions
- Using filters, search visualization, custom columns, and search to find DISCO AI tag predictions
- Using DISCO AI tag predictions while reviewing documents
Understanding DISCO AI's tag predictions
DISCO AI scores
DISCO AI predicts the likelihood that a given document should or should not be assigned a tag based on how documents with similar characteristics have been tagged in the past. For each document and tag pair, DISCO AI assigns a score from -100 to 100, where -100 means it is highly unlikely that the document will receive that tag and 100 means the document is highly likely to receive that tag. DISCO's AI uses a systematic approach to ongoing machine learning, which means that its predictions are continuously updated as more documents are tagged. To improve accuracy, DISCO AI uses language-specific models for English, German, French, Portuguese, Spanish, Chinese, Japanese, and Korean.
DISCO AI's numerical scores are also accompanied by a color code that mimics that of a traffic light; green being likely to highly likely, yellow being somewhat likely or neutral, and red being unlikely to highly unlikely that a certain tag should be applied to a document.
|Color||What it means||Example|
High scores – Highly likely this tag should be applied to this document
Medium scores – Somewhat likely or neutral that a tag should be applied to a document
|Orange or red||
Low scores – Unlikely to highly unlikely that a certain tag should be applied to a document
NOTE: You may notice some of the color squares are solid and some are not. A solid color square means that tag has already been applied to the document, while a hollow square means that tag has not been applied to the document.
Setting up DISCO AI tag predictions
Specifying tags for predictions
To get started, in the DISCO main menu, click Tags. On the Tags page, toggle PREDICTIONS to ON for the tags for which you want to turn on predictions.
For this example, we will focus on the tag predictions for the Broadband tag. As shown in the image below, to start tag predictions for Broadband, toggle PREDICTIONS to ON for that particular tag.
Note: It is okay if you already have applied a particular tag to documents before you enable it for predictions. Those tagging activities are not lost and will be used to make predictions once you enable the toggle.
Reviewing and tagging documents
Review documents as you normally would. As you apply or don’t apply tags to documents, the DISCO AI learns from those actions. Once you have tagged 50 documents, DISCO AI will recommend which documents should or should not be given the Broadband tag and the strength of that recommendation.
Tag predictions are reflected as a score ranging from -100 to 100 applied to every document in your database. A positive number indicates the likelihood a tag should be applied to a document, while a negative number indicates the likelihood a tag should not be applied to a document. This process of learning, training, and scoring documents based on your normal tagging activities happens continuously in the background.
Using filters, search visualization, search and custom columns to find DISCO AI tag predictions
You can filter for documents based on the likelihood that they will be assigned a specific tag in the Predicted Tags filters pane. In the Predicted Tags filters pane, you can select one or more score ranges for each tag of interest. The range begins with Highly likely and ends with Highly unlikely. Likely, Neutral, and Unlikely scores represent selections in the middle of the range. These scores indicate the likelihood a tag should be applied to a document.
NOTE: If you select Likely, Highly likely is automatically selected, and if you select Unlikely, Highly unlikely is automatically selected.
The chart below shows how the DISCO AI predicted score ranges correspond to each likeliness range. For example, the prediction score range Highly likely uses 80 to 100 and Likely has a prediction score range from 20 to 100.
|Prediction score range||Equivalent search syntax|
|Highly likely||80 to 100|
|Likely||20 to 100|
|Neutral||20 to -20|
|Unlikely||-20 to -100|
|Highly unlikely||-80 to -100|
In the Predicted Tag Changes filters pane, you can filter for documents that DISCO AI predicts will either have a tag added or removed. Use the Confidence dropdown to adjust the sensitivity of each category. A High confidence setting means the predicted tagging system is highly confident that the documents returned are likely to receive tag changes, where as a Low confidence setting means the predicted tagging system is less confident that the documents returned will receive tag changes. DISCO returns a larger set of documents when a low confidence setting is selected.
Search Visualization pane
The search visualization panes correspond to the sub-panels in the filtering pane. The Predicted Tags visualization pane shows the distribution of likelihoods that documents in the set being examined will be assigned a certain tag. In the Predicted Tags visualization pane, hover over a colored section of the bar to see the documents associated with the section and the percent of the bar length that it represents.
The Predicted Tag Changes visualization pane shows the distribution of documents that are likely to have certain tags added or removed. In the Predicted Tag Changes visualization pane, click one or more of the bars for tags of interest to filter results by whether tags are predicted to be removed or added to a set of documents.
For more information on how to use search visualization and filters in DISCO, see:
Using predicted tag filters with a custom view can help you quickly view and sort documents using predicted scores. Add columns from the specific tag predictions group to view individual predicted scores while using predicted tag filters. In this example, the view includes Broadband and Enron - Oil, Gas or Energy tag predictions.
Using Search Builder can help you quickly view and sort documents using predicted score ranges. In Search Builder, build the search syntax using the Tag prediction and Tag buttons. Examples of search syntax for confidence settings are shown below.
|Confidence||Predicted to be added||Predicted to be removed|
|High||prediction(Broadband, 80 to 100)%tag(Broadband)||tag(Broadband) & prediction(Broadband, -100 to -80)|
|Medium||prediction(Broadband, 60 to 100)%tag(Broadband)||tag(Broadband) & prediction(Broadband, -100 to -60)|
|Low||prediction(Broadband, 20 to 100)%tag(Broadband)||tag(Broadband) & prediction(Broadband, -100 to -20)|
The predicted tag score ranges are reflected below. The prediction score ranges are mutually exclusive.
|Prediction score range||Equivalent syntax example|
|prediction(Broadband, 80 to 100)|
|Likely||prediction(Broadband, 20 to 79)|
|Neutral||prediction(Broadband, -19 to -19)|
|Unlikely||prediction(Broadband, -79 to -20)|
|Highly unlikely||prediction(Broadband, -100 to -80)|
Using DISCO AI tag predictions while reviewing documents
Tag predictions and recommended tags
When you are viewing a document in the document viewer, you can see recommended tags as well as tag predictions. By default, predictions are not displayed when viewing a document. To view tag predictions, in the TAGS section, click Show predictions.
The recommended tags are tags that have been enabled for predictions and have a score higher than 50 for that document, indicating a higher likelihood of being applied to a particular document.
To apply a recommended tag to the document, in the PREDICTIONS section, click the recommended tag to apply.
NOTE: DISCO AI does not apply any tags to documents automatically. Instead, it recommends tags to you to help reduce errors and find relevant documents faster.
You can also see the predictive score and color for all the tags predictive was turned on for when you click into the apply tag area.
From the prediction scores and colors displayed, you are able to see DISCO's AI predict that the Broadband and Enron Outside USA tags are relevant to this document, while the other three tags show very low relevance. You are able to confirm DISCO's predictions are correct by reading the document these predictions were coded for.
Review stage prioritization in workflow
You can leverage tag predictions to power a prioritized review in workflow review stages. You can configure your review stage to order the documents based on the predictions for a particular tag when they are batched for review. This could allow you to find the more relevant documents earlier in your review than if you performed a linear review. For a more detailed explanation on prioritized reviews with tag predictions, see Review Stage Management.
For a training video about DISCO's AI predictive workflow feature, see AI predictive workflow training.