DISCO can now create email conversation threads and identify inclusive emails from email images, even if no email natives or thread metadata are provided.
This means conversation threading and inclusive emails are now available for:
- Collections of scanned or otherwise imaged emails (such as TIFF, JPG, or PDF) without natives (such as PST, EML, or MSG)
- Productions from opposing parties in which only images are produced and thread metadata is not included in the load file or is unreliable
When natives are provided, DISCO threads emails using header information in the native files. DISCO then identifies inclusive emails by comparing the text of the emails: an email whose text is wholly included in another email is non-inclusive, and the emails that remain are inclusive.
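The text comparison described above can be illustrated with a minimal Python sketch. It is not DISCO's implementation: the function names `normalize` and `find_inclusive`, the sample thread, and the simple substring test are all assumptions made for illustration.

```python
import re


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so layout differences do not matter."""
    return re.sub(r"\s+", " ", text).strip().lower()


def find_inclusive(emails: dict[str, str]) -> set[str]:
    """Return the IDs of emails whose text is not wholly contained in a longer email.

    Hypothetical helper: exact duplicates are both kept, because neither one
    is strictly longer than the other.
    """
    normalized = {email_id: normalize(body) for email_id, body in emails.items()}
    inclusive = set()
    for email_id, body in normalized.items():
        contained = any(
            other_id != email_id and len(other) > len(body) and body in other
            for other_id, other in normalized.items()
        )
        if not contained:
            inclusive.add(email_id)
    return inclusive


# Illustrative thread: "a" is quoted in full by both replies "b" and "c".
thread = {
    "a": "Can we meet Tuesday?",
    "b": "Tuesday works.\n> Can we meet Tuesday?",
    "c": "Let's do lunch instead.\n> Can we meet Tuesday?",
}
print(sorted(find_inclusive(thread)))  # ['b', 'c']
```

In this example, reviewing only emails "b" and "c" covers all the text in the thread, because email "a" appears in full inside each reply.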
When only images are provided, most systems cannot thread emails because the images do not contain this header information. Most systems also cannot identify inclusive emails, because the images contain extraneous text, such as Bates stamps or email signature footers, that makes it difficult to determine whether one email is wholly included in another.
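To see why extraneous text defeats a naive comparison, consider the Python sketch below. The Bates stamp pattern (an uppercase prefix followed by five or more digits) and the `strip_stamps` helper are hypothetical, and this is only one possible way to work around the problem, not a description of how DISCO does it.

```python
import re

# Assumed Bates stamp shape for illustration, e.g. "ACME0000017".
BATES = re.compile(r"\b[A-Z]{2,}[-_ ]?\d{5,}\b")


def strip_stamps(page_text: str) -> str:
    """Remove stamp-like tokens, then collapse whitespace and lowercase."""
    without_stamps = BATES.sub(" ", page_text)
    return re.sub(r"\s+", " ", without_stamps).strip().lower()


# Text as it might be extracted from two imaged emails, each with its own stamp.
earlier = "Can we meet Tuesday? ACME0000017"
later = "Tuesday works. > Can we meet Tuesday? ACME0000042"

# The differing stamps hide the fact that the earlier email is quoted in full.
print(earlier.lower() in later.lower())              # False
print(strip_stamps(earlier) in strip_stamps(later))  # True
```

Real productions vary widely in stamp format and placement, and signature footers raise similar issues, so a single regular expression like this one would not be sufficient in practice.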
With this release, DISCO overcomes these problems and makes threading and inclusive email identification available for all email collections and productions, whether uploaded as natives or as images.
For more information, see Conversation browser with inclusive email identification.
Conversation threading and inclusive email identification for non-native emails are available for databases created after February 5, 2018.