
What metadata does DISCO extract?

Looking for a particular kind of metadata? Use CTRL + F (on a PC) or ⌘ + F (on a Mac) to search in this page.

Here is a list of metadata fields DISCO extracts from native files:

  • If any extracted date field appears to be invalid, DISCO treats it differently from a valid date. See how invalid dates are handled.
  • If any time fields are extracted, DISCO normalizes the time zones on all documents so that every document is displayed in a single time zone, which simplifies searching and chronological sorting. Normalization does not change the actual date and time on the documents, only how the metadata is displayed in DISCO. You are asked to choose your time zone when you first create your database.
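
As a rough illustration of what time zone normalization means, the sketch below expresses one timestamp in a chosen display time zone without changing the instant it refers to. It is not DISCO's implementation. The function name `normalize_for_display` and the fixed UTC-06:00 offset are assumptions made for this example.

```python
from datetime import datetime, timedelta, timezone, tzinfo


def normalize_for_display(timestamp: datetime, database_tz: tzinfo) -> datetime:
    """Return the same instant, expressed in the database's time zone."""
    if timestamp.tzinfo is None:
        # A naive timestamp has no offset, so it cannot be converted reliably.
        raise ValueError("timestamp must include a time zone offset")
    return timestamp.astimezone(database_tz)


# Hypothetical database time zone: a fixed UTC-06:00 offset.
# A named zone (for example via zoneinfo.ZoneInfo) would also track daylight saving.
database_tz = timezone(timedelta(hours=-6))

sent_utc = datetime(2020, 3, 2, 23, 30, tzinfo=timezone.utc)
displayed = normalize_for_display(sent_utc, database_tz)

print(displayed.isoformat())  # 2020-03-02T17:30:00-06:00
print(displayed == sent_utc)  # True: only the display changed, not the instant
```

The displayed value moves from 23:30 to 17:30, but both values compare equal because they describe the same moment.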

     

Field Name      Description

Author

Person who created the file. A file can have one author or none

Company

Available company information extracted from MS Office and PDF documents

Title

Available title information extracted from MS Office and PDF documents

Conversation date

The date the email conversation started (see how invalid dates are handled)

Conversation time

The time the email conversation started (see how time zones are handled)

Create date

The date the file was created (see how invalid dates are handled)

Create time

The time the file was created (see how time zones are handled)

Date last accessed

The date the file was last accessed

Time last accessed

The time the file was last accessed (see how time zones are handled)

Date last modified

The date changes were last made to a file

Time last modified

The time changes were last made to a file (see how time zones are handled)

Date printed

The date the file was last printed

Time printed

The time the file was last printed (see how time zones are handled)

Date received

The date an email was received

Time received

The time an email was received

Date sent

The date an email was sent

Time sent

The time an email was sent

Family date

The date of the parent document. For emails, this is the date sent; for non-emails, it is the date last modified

Family time

The time of the parent document

From

Sender of an email

To

Recipient(s) on the To line of an email

Bcc

Blind carbon copy recipient(s) on an email

Cc

Carbon copy recipient(s) on an email

Subject

Subject line from an email

Attachment count

The number of attachments on an email, or the number of embedded files within a document

Custodian(s)

Owner of the mailbox, computer, or share from which an email or file was collected

Document type

The kind of file a document came from

File extension

The file extension of the native file

File name

The original name of the native file

File size

The size of the file in bytes

Page count

The number of pages in a document

Parent count

The number of parent documents a document has

Partial path

The folder or parent folder from which the document was collected

Path

The full file, folder, or directory structure from which the document was collected

Hash

Numeric value of a fixed length that uniquely identifies data

Document text

Extracted text or OCR
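
Two of the fields above follow simple rules that a short sketch can make concrete: Family date (date sent for emails, date last modified for everything else) and Hash (a fixed-length value derived from the data). This is an illustration only, not DISCO's code. The `Document` class and both function names are hypothetical, and SHA-256 is an assumption, since this article does not say which hash algorithm DISCO uses.

```python
import hashlib
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class Document:
    """Hypothetical stand-in for a parent document's extracted metadata."""
    is_email: bool
    date_sent: Optional[date] = None
    date_last_modified: Optional[date] = None


def family_date(parent: Document) -> Optional[date]:
    """Emails use the date sent; all other parents use the date last modified."""
    return parent.date_sent if parent.is_email else parent.date_last_modified


def content_hash(data: bytes) -> str:
    """Fixed-length digest of the data (SHA-256 is assumed for this example)."""
    return hashlib.sha256(data).hexdigest()


email = Document(is_email=True, date_sent=date(2019, 5, 14),
                 date_last_modified=date(2019, 6, 1))
memo = Document(is_email=False, date_last_modified=date(2018, 11, 30))

print(family_date(email))  # 2019-05-14
print(family_date(memo))   # 2018-11-30

# The digest length is the same regardless of input size.
print(len(content_hash(b"short")), len(content_hash(b"x" * 10_000)))  # 64 64
```

Identical data always produces the same digest, which is what makes a hash useful for identifying duplicate files.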
