Class | Description |
---|---|
ExtractReuters |
Split the Reuters SGML documents into Simple Text files containing: Title, Date, Dateline, Body
|
ExtractWikipedia |
Extract the downloaded Wikipedia dump into separate files for indexing.
|
Copyright © 2000-2015 Apache Software Foundation. All Rights Reserved.