Looking for a particular kind of metadata? Use CTRL + F (on a PC) or ⌘ + F (on a Mac) to search this page.
Here is a list of metadata fields DISCO extracts from native files:
- If any extracted date field appears to be invalid, DISCO handles it differently from valid dates. See how invalid dates are handled here.
- If any time fields are extracted, DISCO normalizes the time zones on all documents so that every document is displayed in a single time zone, which simplifies searching and chronological sorting. This does not change the actual date and time on the documents, only how the metadata is displayed in DISCO. You are asked to choose your desired time zone when you first create your database.
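The time zone note above can be illustrated with a minimal Python sketch. It is not DISCO's implementation: `DATABASE_TZ` and `normalize_for_display` are hypothetical names, and fixed UTC offsets stand in for real time zones to keep the example dependency-free (real zones with daylight saving rules would need something like `zoneinfo`). The point it shows is that converting to one display zone changes only how a timestamp is rendered, not the instant it represents.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical database-level setting: the single zone chosen at database creation.
# A fixed UTC-6 offset is an assumption made to keep the sketch simple.
DATABASE_TZ = timezone(timedelta(hours=-6), "UTC-06:00")


def normalize_for_display(extracted: datetime, database_tz: timezone = DATABASE_TZ) -> datetime:
    """Return the same instant, expressed in the database time zone."""
    if extracted.tzinfo is None:
        raise ValueError("expected a timezone-aware datetime")
    return extracted.astimezone(database_tz)


# Two timestamps extracted with different offsets that refer to the same instant.
sent_utc = datetime(2021, 3, 1, 2, 30, tzinfo=timezone.utc)
sent_tokyo = datetime(2021, 3, 1, 11, 30, tzinfo=timezone(timedelta(hours=9)))

shown_a = normalize_for_display(sent_utc)
shown_b = normalize_for_display(sent_tokyo)

print(shown_a.isoformat())
print(shown_b.isoformat())
```

Both values render with the same wall-clock date and time in the display zone, so they sort together chronologically, while each still compares equal to the original extracted timestamp.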
Field Name |
Description |
Author |
Person who created the file. A file can have one author or none |
Company |
Available company information extracted from MS Office and PDF documents |
Title |
Available title information extracted from MS Office and PDF documents |
Conversation date |
The date the email conversation started (see how invalid dates are handled) |
Conversation time |
The time the email conversation started (see how time zones are handled) |
Create date |
The date the file was created (see how invalid dates are handled) |
Create time |
The time the file was created (see how time zones are handled) |
Date last accessed |
The date the file was last accessed |
Time last accessed |
The time the file was last accessed |
Date last modified |
The date changes were last made to a file |
Time last modified |
The time changes were last made to a file |
Date printed |
The date the file was last printed |
Time printed |
The time the file was last printed |
Date received |
The date an email was received |
Time received |
The time an email was received |
Date sent |
The date an email was sent |
Time sent |
The time an email was sent |
Family date |
The date of the parent document. For emails, this is the date sent; for non-emails, it is the date last modified |
Family time |
The time of the parent document |
From |
Sender of an email |
To |
Recipient(s) on the To line of an email |
Bcc |
Blind carbon copy recipient(s) on an email |
Cc |
Carbon copy recipient(s) on an email |
Subject |
Subject line from an email |
Attachment count |
The number of attachments on an email, or the number of embedded files within a document |
Custodian(s) |
Owner of the mailbox, computer, or share from which an email or file was collected |
Document type |
The kind of file a document came from |
File extension |
The file extension of the native file |
File name |
The original name of the native file |
File size |
The size of the native file in bytes |
Page count |
The number of pages in a document |
Parent count |
The number of parent documents a document has |
Partial path |
The folder or parent folder from which the document was collected |
Path |
The full file, folder, or directory structure from which the document was collected |
Hash |
Numeric value of a fixed length that uniquely identifies data |
Document text |
Extracted text or OCR |
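
The Family date rule in the table above (date sent for emails, date last modified for everything else) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `ParentDocument` class and `family_date` function are hypothetical names, not part of DISCO, and how DISCO treats a parent with a missing or invalid date is not covered here.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class ParentDocument:
    """Hypothetical stand-in for the top-level document of a family."""
    is_email: bool
    date_sent: Optional[date] = None
    date_last_modified: Optional[date] = None


def family_date(parent: ParentDocument) -> Optional[date]:
    """Pick the family date from the parent: date sent for emails,
    date last modified for non-emails."""
    if parent.is_email:
        return parent.date_sent
    return parent.date_last_modified


email = ParentDocument(
    is_email=True,
    date_sent=date(2020, 5, 4),
    date_last_modified=date(2020, 5, 6),
)
memo = ParentDocument(is_email=False, date_last_modified=date(2019, 11, 12))

print(family_date(email))
print(family_date(memo))
```

Because the family date comes from the parent, attachments and embedded files would carry the same value as the parent, which keeps a family together when sorting chronologically.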