When producing or exporting documents from DISCO, two advanced options control how family documents are produced: the deduplication level and the sort by option. To find them, go to the top menu and select Create > Productions > Create New > Show advanced options.
While DISCO performs full document deduplication upon ingestion (so that a set of duplicate documents only has to be reviewed/tagged once), the user can split the duplicate documents out again on production (into separately Bates stamped images/files) in several ways.
These options run on a spectrum from the largest number of documents in the production (No deduplication, meaning every duplicate is produced exactly as it was ingested into DISCO) to the smallest (One copy for the entire production, meaning only one copy of each set of duplicated documents is produced, no matter how many duplicates there were).
The default option, One copy per custodian and per parent, is the most common choice. It guarantees that at least one copy of a duplicated document is produced for each custodian, and that one copy of each email attachment is produced for each email to which that duplicate was attached.
- No deduplication
- One copy per custodian and per parent (default)
- One copy per parent across custodians
- One copy per custodian
- One copy for the entire production
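One way to picture the five levels is as different grouping keys applied to each set of duplicates: one copy is kept per distinct key. The sketch below is only an illustration of that idea, not DISCO's implementation. The `Instance` record, its field names, and the reading of "parent" as the content hash of the parent email are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Hashable, List, Optional


@dataclass(frozen=True)
class Instance:
    """One ingested copy of a document (hypothetical model, not a DISCO type)."""
    instance_id: int
    content_hash: str           # identical for all duplicates of a document
    custodian: str
    parent_hash: Optional[str]  # hash of the parent email; None if standalone


# Assumed interpretation: each level keeps one copy per distinct key value
# within a set of duplicates.
LEVEL_KEYS: Dict[str, Callable[[Instance], Hashable]] = {
    "No deduplication": lambda d: d.instance_id,
    "One copy per custodian and per parent": lambda d: (d.custodian, d.parent_hash),
    "One copy per parent across custodians": lambda d: d.parent_hash,
    "One copy per custodian": lambda d: d.custodian,
    "One copy for the entire production": lambda d: None,
}


def select_for_production(instances: List[Instance], level: str) -> List[Instance]:
    """Keep the first instance seen for each (content hash, level key) pair."""
    key = LEVEL_KEYS[level]
    seen = set()
    kept = []
    for inst in instances:
        group = (inst.content_hash, key(inst))
        if group not in seen:
            seen.add(group)
            kept.append(inst)
    return kept


# Five ingested copies of the same attachment (made-up sample data).
instances = [
    Instance(1, "A", "Alice", "P1"),
    Instance(2, "A", "Alice", "P1"),
    Instance(3, "A", "Alice", "P2"),
    Instance(4, "A", "Bob", "P1"),
    Instance(5, "A", "Bob", None),
]

for level in LEVEL_KEYS:
    print(level, len(select_for_production(instances, level)))
```

With this sample data the number of copies produced shrinks as the level becomes more aggressive, which mirrors the spectrum described above.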
The sort by option lets the user choose a sorting order for the documents (and, hence, the order of the Bates stamping) in the production. The word “then” in an option means that the documents are sorted by the first criterion, and documents that share the same value for it are then sorted by the second criterion.
- Custodian then Family Date (default)
- Custodian then Path
- Family Date then Path
- Path Only
- Reference ID
The default option, “Custodian then Family Date”, means that the documents will first be sorted by their custodian, then by family date within that custodian group. The family date of a document is the sent date (or last modified date) of its parent, or just the sent date/last modified date of the document if it has no parent. Family date ordering helps ensure that attachments directly follow their parent emails in the production, where possible.
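The family date rule can be sketched in a few lines. This is a minimal illustration and not DISCO's code: the `Doc` class and its fields are hypothetical, the preference for the sent date over the last modified date when both exist is an assumption, and only one level of parent is modeled.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class Doc:
    """Hypothetical document record used only for this illustration."""
    doc_id: str
    sent_date: Optional[datetime] = None
    last_modified: Optional[datetime] = None
    parent: Optional["Doc"] = None


def own_date(doc: Doc) -> Optional[datetime]:
    # Assumption: use the sent date when present, else the last modified date.
    return doc.sent_date if doc.sent_date is not None else doc.last_modified


def family_date(doc: Doc) -> Optional[datetime]:
    # A child takes its parent's date; a document with no parent uses its own.
    top = doc.parent if doc.parent is not None else doc
    return own_date(top)


email = Doc("email-1", sent_date=datetime(2020, 3, 1, 9, 30))
attachment = Doc("att-1", last_modified=datetime(2019, 12, 24), parent=email)
loose_file = Doc("file-1", last_modified=datetime(2018, 6, 15))
```

Here `family_date(attachment)` returns the email's sent date rather than the attachment's own last modified date, which is why an attachment sorts alongside its parent email.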
“Path” is simply the original file path of the document (i.e., the path of the file on the disk or email container the document came from).
Reference ID is a special custom field in DISCO that the user can set via the initial load file or a later mass update of the documents. This option is rarely used, but it is useful when the user has already applied a control ID to the documents before they come to DISCO and wants to preserve that ordering for the production.
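The sort options above can be read as composite sort keys, with the Bates sequence following the resulting order. The sketch below is an illustration under assumptions: the dictionary field names, the sample documents, and the `ABC0000001` numbering format are all made up, and a single number is assigned per document for simplicity.

```python
from datetime import date

# Each option maps to a tuple key; "then" becomes the next tuple element.
SORT_KEYS = {
    "Custodian then Family Date": lambda d: (d["custodian"], d["family_date"]),
    "Custodian then Path": lambda d: (d["custodian"], d["path"]),
    "Family Date then Path": lambda d: (d["family_date"], d["path"]),
    "Path Only": lambda d: (d["path"],),
    "Reference ID": lambda d: (d["reference_id"],),
}


def production_order(docs, sort_by):
    """Return the documents in the order the chosen option would give."""
    return sorted(docs, key=SORT_KEYS[sort_by])


def assign_bates(docs, prefix="ABC", width=7):
    """Number documents in order (hypothetical format, one number per doc)."""
    return {d["id"]: f"{prefix}{i:0{width}d}" for i, d in enumerate(docs, start=1)}


# Made-up sample documents.
docs = [
    {"id": "a", "custodian": "Jones", "family_date": date(2020, 3, 1),
     "path": "/mail/inbox/a.msg", "reference_id": "R003"},
    {"id": "b", "custodian": "Adams", "family_date": date(2021, 1, 5),
     "path": "/docs/b.docx", "reference_id": "R001"},
    {"id": "c", "custodian": "Jones", "family_date": date(2019, 7, 9),
     "path": "/mail/sent/c.msg", "reference_id": "R002"},
]

ordered = production_order(docs, "Custodian then Family Date")
bates = assign_bates(ordered)
```

Because the sort order drives the numbering, choosing a different option changes which document receives the first Bates number.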