Initial Run Through
Do you ever miss the days when you could thumb through paper documents and get a “feel” for what they contained? With DISCO sampling, you can pull a randomized subset of documents from your entire database, or from the results of any search, to get that same “feel” for them. Here’s how it works:
From the DISCO search bar, enter one of the following new queries:
Sampling Search Syntax
- sample(0.1, "any query") - gives you a random 10% of the search results
- sample(500, "any query") - gives you a maximum of 500 random documents from the search results
- sample(10%, "any query") - gives you a random 10% of the search results
- sample(10, tag(by email@example.com)) - gives you ten random documents tagged by email@example.com
Within the parentheses, enter any valid search query. For example, to sample 10% of your entire database, enter sample(0.1, "!"); the exclamation point is a wildcard that means “search all documents”. If you would rather see a randomized 10% of all documents containing the word “flowers”, change the search to sample(0.1, flowers).
Some important items to note:
- If you run the search again or refresh the page, you get a new random subset of documents.
- If you ask for more documents than the search returns, you get all of the search results.
- If you use a sample query, sorting of the search results is disabled.
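
To make the sizing rules concrete, here is a minimal Python sketch of how a sample query could behave. It is only an illustration, not DISCO's actual implementation: the function names sample_size and sample are hypothetical, and the rounding of fractional counts is an assumption, since the exact rounding DISCO uses is not described here.

```python
import random


def sample_size(amount, total):
    """Estimate how many documents sample(amount, query) would return.

    amount: a fraction such as 0.1, a percent string such as "10%",
            or a whole number such as 500 (a maximum document count).
    total:  the number of documents matched by the inner query.
    Rounding to the nearest whole document is an assumption.
    """
    if isinstance(amount, str) and amount.endswith("%"):
        fraction = float(amount[:-1]) / 100
        return min(total, round(total * fraction))
    if isinstance(amount, float) and 0 < amount < 1:
        return min(total, round(total * amount))
    # A whole number is a cap: never more than the search results.
    return min(total, int(amount))


def sample(amount, results, rng=None):
    """Return a random subset of results; each call draws a fresh subset."""
    rng = rng or random.Random()
    return rng.sample(results, sample_size(amount, len(results)))


if __name__ == "__main__":
    documents = list(range(2000))  # stand-in for 2,000 search results
    print(len(sample(0.1, documents)))    # 10% of the results
    print(len(sample("10%", documents)))  # same request, percent form
    print(len(sample(500, documents)))    # at most 500 documents
    print(len(sample(5000, documents)))   # capped at the total
```

Because the subset is drawn at random on every call, running the same sample twice will usually return different documents, which mirrors what happens when you rerun the search or refresh the page.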