org.apache.nutch.indexer.more
Class MoreIndexingFilter
java.lang.Object
org.apache.nutch.indexer.more.MoreIndexingFilter
- All Implemented Interfaces:
- Configurable, IndexingFilter, Pluggable
public class MoreIndexingFilter
- extends Object
- implements IndexingFilter
Add (or reset) a few metaData properties as respective fields
(if they are available), so that they can be displayed by more.jsp
(called by search.jsp).
content-type is indexed to support query by type:
last-modifed is indexed to support query by date:
Still need to make content-length searchable!
- Author:
- John Xing
Field Summary |
static org.apache.commons.logging.Log |
LOG
|
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
LOG
public static final org.apache.commons.logging.Log LOG
MoreIndexingFilter
public MoreIndexingFilter()
filter
public NutchDocument filter(NutchDocument doc,
Parse parse,
Text url,
CrawlDatum datum,
Inlinks inlinks)
throws IndexingException
- Description copied from interface:
IndexingFilter
- Adds fields or otherwise modifies the document that will be indexed for a
parse. Unwanted documents can be removed from indexing by returning a null value.
- Specified by:
filter
in interface IndexingFilter
- Parameters:
doc
- document instance for collecting fieldsparse
- parse data instanceurl
- page urldatum
- crawl datum for the pageinlinks
- page inlinks
- Returns:
- modified (or a new) document instance, or null (meaning the document
should be discarded)
- Throws:
IndexingException
addIndexBackendOptions
public void addIndexBackendOptions(Configuration conf)
- Description copied from interface:
IndexingFilter
- Adds index-level configuraition options.
Implementations can update given configuration to pass document-independent
information to indexing backends. As a rule of thumb, prefix meta keys
with the name of the backend intended. For example, when
passing information to lucene backend, prefix keys with "lucene.".
- Specified by:
addIndexBackendOptions
in interface IndexingFilter
- Parameters:
conf
- Configuration instance.
setConf
public void setConf(Configuration conf)
- Specified by:
setConf
in interface Configurable
getConf
public Configuration getConf()
- Specified by:
getConf
in interface Configurable
Copyright © 2006 The Apache Software Foundation