org.apache.nutch.net
Interface URLFilter
- All Superinterfaces:
- org.apache.hadoop.conf.Configurable, Pluggable
- All Known Implementing Classes:
- AutomatonURLFilter, DomainURLFilter, PrefixURLFilter, RegexURLFilter, RegexURLFilterBase, Subcollection, SuffixURLFilter, UrlValidator
public interface URLFilter
- extends Pluggable, org.apache.hadoop.conf.Configurable
Interface used to limit which URLs enter Nutch.
Used by the injector and the db updater.
Methods inherited from interface org.apache.hadoop.conf.Configurable |
getConf, setConf |
X_POINT_ID
static final String X_POINT_ID
- The name of the extension point.
filter
String filter(String urlString)
Copyright © 2013 The Apache Software Foundation