Package org.apache.nutch.indexer

Maintain Lucene full-text indexes.

See:
          Description

Interface Summary
IndexingFilter Extension point for indexing.
NutchIndexWriter  
 

Class Summary
DeleteDuplicates Delete duplicate documents in a set of Lucene indexes.
DeleteDuplicates.HashPartitioner  
DeleteDuplicates.HashReducer  
DeleteDuplicates.IndexDoc  
DeleteDuplicates.InputFormat  
DeleteDuplicates.UrlsReducer  
FsDirectory Reads a Lucene index stored in DFS.
HighFreqTerms Lists the most frequent terms in an index.
Indexer Create indexes for segments.
IndexerMapReduce  
IndexerOutputFormat  
IndexingFilters Creates and caches IndexingFilter implementing plugins.
IndexMerger IndexMerger creates an index for the output corresponding to a single fetcher run.
IndexSorter Sort a Nutch index by page score.
NutchDocument A NutchDocument is the unit of indexing.
NutchIndexWriterFactory  
NutchSimilarity Similarity implementatation used by Nutch indexing and search.
 

Exception Summary
IndexingException  
 

Package org.apache.nutch.indexer Description

Maintain Lucene full-text indexes.



Copyright © 2006 The Apache Software Foundation