
What metadata does DISCO extract?

Looking for a particular type of metadata? Use CTRL + F (on a PC) or ⌘ + F (on a Mac) to search on this page.

Here is a list of metadata fields DISCO extracts from native files:

  • If any of the date fields extracted appear to be invalid, DISCO will treat them differently. See Invalid dates for more information.
  • If time fields are extracted, DISCO normalizes them so that all documents are shown in a single time zone, which simplifies searching and chronological sorting. Normalization changes only how the metadata is displayed in DISCO, not the actual date and time on the documents. You choose the time zone when you first create your database.
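To illustrate what normalizing to a single time zone means, here is a minimal Python sketch. It is not DISCO's implementation; the `normalize` function, the sample timestamp, and the fixed UTC offsets are assumptions chosen for illustration (a real system would more likely use named time zones that account for daylight saving time).

```python
from datetime import datetime, timedelta, timezone

# Hypothetical fixed-offset zones, used here only for illustration.
PACIFIC = timezone(timedelta(hours=-8))   # UTC-8
EASTERN = timezone(timedelta(hours=-5))   # UTC-5


def normalize(ts: datetime, target: timezone) -> datetime:
    """Return the same instant expressed in the target time zone."""
    if ts.tzinfo is None:
        raise ValueError("timestamp must carry time zone information")
    return ts.astimezone(target)


# An email sent late in the evening, Pacific time.
sent = datetime(2020, 3, 1, 23, 30, tzinfo=PACIFIC)

# Displayed in a database whose chosen time zone is Eastern.
shown = normalize(sent, EASTERN)

print(shown.isoformat())   # 2020-03-02T02:30:00-05:00
print(shown == sent)       # True: the underlying moment is unchanged
```

Only the displayed value changes: the two timestamps refer to the same moment, but the displayed date moves from March 1 to March 2. This is why the chosen time zone can affect date-based searches and sorting.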

Field name

Description

Author

Person who created the file. A file can have one author or none.

Company

Available company information extracted from Microsoft Office and PDF documents.

Title

Available title information extracted from Microsoft Office and PDF documents.

Conversation date

The date the email conversation started. See Invalid dates for more information.

Conversation time

The time the email conversation started.

Create date

The date the file was created. See Invalid dates for more information.

Create time

The time the file was created.

Date last accessed

The date the file was last accessed.

Time last accessed

The time the file was last accessed.

Date last modified

The date changes were last made to a file.

Time last modified

The time changes were last made to a file. 

Date printed

The date the file was last printed.

Time printed

The time the file was last printed.

Date received

The date an email was received.

Time received

The time an email was received.

Date sent

The date an email was sent.

Time sent

The time an email was sent.

Family date

The date of the parent document. For emails, this will be the date sent, and for non-emails it will be the date last modified.

Family time

The time of the parent document.

From

Sender of an email.

To

Recipient(s) listed on the To line of an email.

Bcc

Blind carbon copy recipient(s) on an email.

Cc

Carbon copy recipient(s) on an email.

Subject

Subject line from an email.

Attachment count

The number of attachments on an email. For other documents, the number of files embedded within the document.

Custodian(s)

Owner of the mailbox, computer, or share from which an email or file was collected.

Document type

The kind of file a document came from.

File extension

The file extension of the native file.

File name

The original name of the native file.

File size

A file's size, in bytes.

Page count

The number of pages in a document.

Parent count

The number of parent documents a document has.

Partial path

The folder or parent folder from which the document was collected.

Path

The full file, folder, or directory structure from which the document was collected.

Hash

Fixed-length numeric value, computed from the data, that uniquely identifies it.

Document text

Text extracted from the document or generated by OCR.
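The Family date rule in the list above (date sent for emails, date last modified for non-emails) can be sketched in Python as follows. This is an illustration only, not DISCO's code; the `ParentDocument` class, its attribute names, and the sample values are hypothetical. The sketch also assumes that Family time is taken from the same timestamp as Family date, which the field description does not state explicitly.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class ParentDocument:
    """Hypothetical stand-in for the parent document of a family."""
    is_email: bool
    sent: Optional[datetime] = None
    last_modified: Optional[datetime] = None


def family_datetime(parent: ParentDocument) -> Optional[datetime]:
    """Pick the timestamp that supplies Family date (and, by assumption, Family time)."""
    # Emails use the date sent; all other files use the date last modified.
    return parent.sent if parent.is_email else parent.last_modified


email = ParentDocument(
    is_email=True,
    sent=datetime(2021, 5, 4, 9, 15),
    last_modified=datetime(2021, 5, 5, 10, 0),
)
spreadsheet = ParentDocument(
    is_email=False,
    last_modified=datetime(2021, 6, 1, 16, 45),
)

print(family_datetime(email).date())        # 2021-05-04
print(family_datetime(spreadsheet).date())  # 2021-06-01
```

Because the value comes from the parent document, attachments and embedded files share their parent's family date, which keeps a document family together when sorting chronologically.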
