Column |
Description |
InstanceId |
A unique ID assigned to each file ingested into DISCO. The numbers to the left of the decimal will become the DISCO ID and will be displayed to the end users. The numbers to the right refer to the number of duplicate files (instances) that have been ingested. |
InstanceHash |
An alphanumeric value that uniquely identifies each file in an ingest session. |
DeDupHash |
DISCO computed value that is used to deduplicate instances within a database. |
IngestTime |
The time the file was ingested into DISCO. |
Custodian |
The name of a person or entity representing from whom the data was collected. For example, the custodian of an email is the person or entity in possession of the inbox that contains the message. Custodians are assigned prior to ingest, or can be added after ingest via overlay. |
FileLength |
The file's size in bytes. |
FilePath |
The file, folder, or directory structure from which the document was collected. |
ContainerPath |
The path within the parent container from which the document was collected. |
ProcessingStatus |
The primary indicator of the overall processing outcome of an ingested file. Files categorized as partial or failure may require additional processing work. |
ProcessingDetails |
Description of processing status including whether the file was one the following: Successfully processed, Partial failure, Unsupported file, Corrupted, Encrypted, Exceeded DISCO limits, Duplicate. |
Ingested |
Indicates if a processed file was ingested into the database. |
ProcessedAsNative |
Indicates that some aspect of processing was unsuccessful. However, DISCO will create a record in the matter with a link to the native file. |
Image |
Indicates if an item was processed as an image or not. An image is sometimes also referred to as a "near-native" or "PDF." |
ImagePageLimitExceeded |
Indicates if DISCO failed to produce a near-native image due to the native file exceeding the supported number of pages. The page limit is set to 30,000 pages. |
SearchText |
Indicates if the record contains searchable text. "No" indicates that items are processed as native files. |
TextLimitExceeded |
Indicates that the ingested file exceeded a maximum amount of text allowed. This limit is configured per database. The default limit is 100 MB. |
OCRText |
Indicates whether or not any of the extracted text was derived via OCR. |
InputFile |
Indicates whether this was an object viewable in the file system when it was received by DISCO for processing. Items within containers will have an N in this column as they cannot be viewed as a file system object. |
ObjectType |
Indicates one of four primary files types:
|
ContainerMember |
Indicates whether or not the file is a member of a container. Container files, such as TAR, RAR, PST, NSF, and MSG files, do not generate records in the DISCO database when successfully processed. |
PartialContainer |
Indicates that a container could not be fully processed. DISCO will create a record within the database and provide a link to the native container file so that it can be retrieved for additional processing. |
SlipsheetIdentified |
DISCO identifies when a slip sheet is produced in conjunction with a native file for load file ingests. DISCO will create a near-native image from the native file and append it to the slip sheet in the document viewer. |
Missing Native |
Indicates when the native file was not supplied. If an image is supplied, DISCO will use that to populate the native file link. If no native or image is supplied, DISCO will create a near-native image from the supplied text and use that to populate the native link. |
IngestType |
Indicates the two types of data deliveries that can be processed:
|
HiddenText |
Indicates whether the file contains hidden content. Files that may contain hidden content include Word, Excel, and PowerPoint files. |
HiddenType |
Indicates the type of hidden content contained within the file. Types include revisions, hidden sheets, very hidden sheets, comments, and notes. |
WrongExtension |
Indicates that the extension of an ingested item is inconsistent with the determined type. |
Extension |
The file extension of the ingested file. |
ContentType |
The content type is a normalized file type. Examples of content types in DISCO are Excluded, Unknown, Text, Email, PDF, Word, Excel, PowerPoint, PST, HTML, Image, RichText, ZIP, LoadFile, Audio, Video, Appointment, Contact, Cad, Project , Xps, Vcard, Visio, OpenXml, ISO, Mbox, and RAR. |
FileName |
The name of the ingested file. |
ContainerName |
The name of the container from which the file was extracted. |
DetectedEmail |
Files that DISCO has identified as emails during processing based on an examination of the file’s text/OCR. |
ImageSize |
Size in bytes of the document’s image file. |
ParentInstanceId |
The instance ID of the ingested file's immediate parent. The immediate parent is not always the family head. |
NativeViewSupport |
Indicates if the Excel document is rendered in the DISCO native viewer. N/A for non-Excel documents. |
DetectedLanguage |
The primary language identified in the document. |
ContainsVirus |
Indicates when DISCO detected a virus signature in the document. You can use the hasVirus(true) search syntax to safely view the documents. |
BatesLimitExceeded |
If Yes, this column indicates that the range of bates numbers exceeded DISCO's limit of 30,000 bates numbers on a single document. The BeginBates and EndBates will be stored on the document. A document in this category generally denotes an error in the provided bates numbering. |