There are a few different ways you can send data to DISCO, depending on what kind of data you're ingesting. For native files, the fastest way is almost always to use the high-speed uploader. Additional information for how to use the high-speed uploader can be found here. You can also use FTP, or our native-lite uploader for files less than 10 GB. If you need to ship physical media, please contact your DISCO representative or email@example.com to make special arrangements. We also encourage you to leverage the high-speed uploader to transfer large volumes of data to DISCO.
Different kinds of files require different preparation and ingest methods. If you want to know what kinds of files DISCO supports, see DISCO-supported file types. (If a file type is not on the list, this does not mean it cannot be ingested, but it may mean that we are not able to create a near-native version.)
No matter what kinds of files you ingest, we will extract all compressed, container, and embedded files. There will be separate, related records for both original and embedded files. For example, if you have an Excel document embedded in a PowerPoint, DISCO will create one document/record for both the PowerPoint and the Excel. In addition, the Excel file will be recorded as an attachment to the PowerPoint.
If you want to know what metadata is extracted by DISCO, see Extracted metadata.
To get started, choose what kinds of files you need to ingest:
- Native files – A native file is the original version of a document in the original format, usually collected directly from the custodian.
- Load files – A load file is used to import data into a database. It can carry extracted, searchable text, metadata about the documents, and information about the relationships between documents.
- Hard copies – A hard copy is a paper document that must be scanned before it can be ingested.
Sending data to DISCO Professional Services
If your organization does not have the high-speed uploader feature yet or the upload speed using the high-speed uploader is not fast enough, you can mail your data or send it via FTP.
Please contact your DISCO representative or firstname.lastname@example.org to make special arrangements for hard drive deliveries.
After you've mailed the hard drive or added data to the FTP, submit a new data ticket. Be sure to include:
- A description of the hard drive you sent, including the color and brand (if applicable).
- What file types the majority of the files are in. This will help us provide an accurate ETA.
- What the overall size of your ingest is.
- How many container files are included in your ingest session.
- What your review timeline is, so we can help you strategize with document tagging, culling, and production.
If you would rather not mail your data, you can send it to us via FTP. If you have less than 10 GB of data, use the native ingest lite method. If you have more than 10 GB of data, use the regular FTP method.
Please note that when you send files to DISCO, we sometimes do not or cannot ingest all of them. See Family-level deduplication for more information.
Preparing and ingesting hard copies
You must scan paper copies of your documents before ingesting them into DISCO. When scanning:
- Only scan one document per PDF file.
- Make sure pagination is included and clear.
- Make sure pages are not skewed, which can happen when a page is misfed into a scanner.
- Do not use a scanning resolution lower than 150 dpi, but we recommend 300 dpi or higher. The higher the dpi, the better the OCR (optical character recognition) fidelity.
- We recommend including a reference field titled "box number" or "folder" in the metadata to help you search for and sort documents in DISCO.
After you have scanned all your documents, you will have a bunch of PDF files. You can then put those files onto a hard drive or series of hard drives. Then, follow the instructions for ingesting native files.