Follow

How to prepare your files for ingest and choose an ingest method

Different kinds of files require different preparation and ingest methods. If you want to know what kinds of files DISCO supports, see our list here. (If a file type is not on the list, this does not mean it cannot be ingested, but it may mean that we are not able to create a near-native version.)

No matter what kinds of files you ingest, we will extract all compressed, container, and embedded files. There will be separate, related records for both original and embedded file(s). For example, if you have an Excel document embedded in a PowerPoint, DISCO will create one document/record for both the PowerPoint and the Excel. In addition, the Excel file will be recorded as an attachment to the PowerPoint.

If you want to know what metadata is extracted by DISCO, see our list here.

To get started, choose what kinds of files you need to ingest:

  • Native files: A native file is the original version of a document in the original format, usually collected directly from the custodian.
  • Load files: A load file is used to import data into a database. It can carry extracted, searchable text, metadata about the documents, and information about the relationships between documents.
  • Hard copies: A hard copy is a paper document that must be scanned before it can be ingested.

How to prepare and ingest native files

Before you ingest them, prepare your files:

  1. Organize the files with one folder per custodian.
  2. Unencrypt any encrypted files.
  3. If any documents' file paths are over 260 characters, shorten those file paths.

Then, figure out if you can use the high-speed uploader to ingest your files:

  1. Figure out if your organization has the high-speed uploader in crawl or walk. If your organization does not have this feature, skip steps 2-4 below.
  2. Run this speed test and click Show more info. Find your network’s upload speed.
  3. Use the table below to see how much data you can upload in 24 hours.
    Network upload speed in Mbps How much you can upload in 24 hours
    in GB in TB
    25 251 0.25
    50 503 0.49
    100 1004 0.98
    250 2519 2.46
    500 5028 4.91
    1000 10056 9.82
  4. If the speed in the table is fast enough, use the high-speed uploader to ingest your data.

If your organization does not have the high-speed uploader feature yet or the upload speed using the high-speed uploader is not fast enough, choose the appropriate ingest method below:

We recommend mailing your data to DISCO for ingest. If you need it very quickly, overnight the data to us. After you've prepared your data, put it on one or more hard drives. Then send the hard drives to: 

CS DISCO, Inc
Attn: Media Management
3700 N. Capital of Texas Highway, Suite 150
Austin, TX 78746

After you've mailed the hard drive(s), submit a new data ticket. Be sure to include:

  • A description of the hard drive(s) you sent, including the color and brand.
  • What file type(s) the majority of the files are in. This will help us provide an accurate ETA.
  • What the overall size of your ingest is.
  • How many container files are included in your ingest session.
  • What your review timeline is, so we can help you strategize with document tagging, culling, and production.

If you would rather not mail your data, you can send it to us via FTP. If you have less than 10 GB of data, use the native ingest lite method. If you have more than 10 GB of data, use the regular FTP method.

Please note that when you send files to DISCO, we sometimes do not or cannot ingest all of them. Click here to see more information.

How to prepare and ingest load files

First, prepare your files:

  1. See our guidelines for load files produced by other parties.
  2. We prefer a concordance DAT load file or an OPT load file.
  3. Make sure the file contains fields for the Bates number range: BeginBates and EndBates. This way documents can be searched and sorted by Bates numbers.

Then, we recommend mailing your data to DISCO for ingest. If you need it very quickly, overnight the data to us. After you've prepared your data, put it on one or more hard drives. Then send the hard drives to: 

CS DISCO, Inc
Attn: Media Management
3700 N. Capital of Texas Highway, Suite 150
Austin, TX 78746

After you've mailed the hard drive(s), submit a new data ticket. Be sure to include:

  • A description of the hard drive(s) you sent, including the color and brand.
  • What file type(s) the majority of the files are in. This will help us provide an accurate ETA.
  • What the overall size of your ingest is.
  • How many container files are included in your ingest session.
  • What your review timeline is, so we can help you strategize with document tagging, culling, and production.

If you would rather not mail your data, you can send it to us via FTP:

  1. Navigate to "Ingest" from the main menu in DISCO Review.
  2. Click the blue "New ingest" button in the top right corner.
  3. Select "Load file" in the dropdown menu.
  4. Complete the wizard. Upload_load_files.gif

 Please note that when you send files to DISCO, we sometimes cannot or do not ingest all of them. Click here to see more information.

How to prepare and ingest hard copies

You must scan paper copies of your documents before ingesting them into DISCO. When scanning...

  • Only scan one document per PDF file.
  • Make sure pagination is included and clear.
  • Make sure pages are not skewed, which can happen when a page is misfed into a scanner.
  • Do not use a scanning resolution lower than 150 dpi, but we recommend 300 dpi or higher. The higher the dpi, the better the OCR (optical character recognition) fidelity.
  • We recommend including a reference field titled "box number" or "folder" in the metadata to help you search for and sort documents in DISCO. 

After you have scanned all your documents, you will have a bunch of PDF files. You can then put those files onto a hard drive or series of hard drives. Then, follow the instructions for ingesting native files above

Was this article helpful?
0 out of 0 found this helpful
Have more questions? Submit a request

Comments

Chat is online
Chat is woffline