This directory contains test data that may be used with Lucene's tests. Pass the following flags to Lucene's build system: $ ant test -Dtests.linedocsfile=/path/to/enwiki.random.lines.txt The Lucene Nightly Jenkins Jobs have this configured: https://builds.apache.org/job/Lucene-Solr-NightlyTests-master Source of data: The file contains a preprocessed dump of Wikipedia as CSV file for easy import with Lucene's test-framework or benchmark module.