Uses of Interface
org.apache.nutch.plugin.Pluggable

Packages that use Pluggable
org.apache.nutch.analysis.lang Text document language identifier. 
org.apache.nutch.collection Subcollection is a subset of an index. 
org.apache.nutch.indexer Maintain Lucene full-text indexes. 
org.apache.nutch.indexer.anchor An indexing plugin for inbound anchor text. 
org.apache.nutch.indexer.basic A basic indexing plugin. 
org.apache.nutch.indexer.feed   
org.apache.nutch.indexer.metadata   
org.apache.nutch.indexer.more A more indexing plugin. 
org.apache.nutch.indexer.staticfield A simple plugin called at indexing that adds fields with static data. 
org.apache.nutch.indexer.subcollection   
org.apache.nutch.indexer.tld Top Level Domain Indexing plugin. 
org.apache.nutch.indexer.urlmeta URL Meta Tag Indexing Plugin. 
org.apache.nutch.microformats.reltag A microformats Rel-Tag Parser/Indexer/Querier plugin. 
org.apache.nutch.net   
org.apache.nutch.parse   
org.apache.nutch.parse.ext   
org.apache.nutch.parse.feed   
org.apache.nutch.parse.headings   
org.apache.nutch.parse.html An HTML document parsing plugin. 
org.apache.nutch.parse.js   
org.apache.nutch.parse.swf   
org.apache.nutch.parse.tika   
org.apache.nutch.parse.zip   
org.apache.nutch.protocol   
org.apache.nutch.protocol.file Protocol plugin which supports retrieving local file resources. 
org.apache.nutch.protocol.ftp Protocol plugin which supports retrieving documents via the ftp protocol. 
org.apache.nutch.protocol.http Protocol plugin which supports retrieving documents via the http protocol. 
org.apache.nutch.protocol.http.api Common API used by HTTP plugins (http, httpclient). 
org.apache.nutch.scoring   
org.apache.nutch.scoring.link   
org.apache.nutch.scoring.opic   
org.apache.nutch.scoring.tld Top Level Domain Scoring plugin. 
org.apache.nutch.scoring.urlmeta URL Meta Tag Scoring Plugin. 
org.apache.nutch.urlfilter.api   
org.apache.nutch.urlfilter.automaton A URL filter plugin based on dk.brics.automaton Finite-State Automata for Java™. 
org.apache.nutch.urlfilter.domain A url filter plugin that filters by domain. 
org.apache.nutch.urlfilter.domainblacklist   
org.apache.nutch.urlfilter.prefix A url filter plugin. 
org.apache.nutch.urlfilter.regex A url filter plugin. 
org.apache.nutch.urlfilter.suffix   
org.apache.nutch.urlfilter.validator A url filter plugin that validates given urls. 
org.creativecommons.nutch Sample plugins that parse and index Creative Commons metadata. 
 

Uses of Pluggable in org.apache.nutch.analysis.lang
 

Classes in org.apache.nutch.analysis.lang that implement Pluggable
 class HTMLLanguageParser
           
 class LanguageIndexingFilter
          An IndexingFilter that adds a lang (language) field to the document.
 

Uses of Pluggable in org.apache.nutch.collection
 

Classes in org.apache.nutch.collection that implement Pluggable
 class Subcollection
          Subcollection represents a subset of an index. You can define URL patterns that indicate whether a particular page (URL) is part of the subcollection.
 

Uses of Pluggable in org.apache.nutch.indexer
 

Subinterfaces of Pluggable in org.apache.nutch.indexer
 interface IndexingFilter
          Extension point for indexing.
 

Uses of Pluggable in org.apache.nutch.indexer.anchor
 

Classes in org.apache.nutch.indexer.anchor that implement Pluggable
 class AnchorIndexingFilter
          Indexing filter that offers an option to either index all inbound anchor text for a document or deduplicate anchors.
 

Uses of Pluggable in org.apache.nutch.indexer.basic
 

Classes in org.apache.nutch.indexer.basic that implement Pluggable
 class BasicIndexingFilter
          Adds basic searchable fields to a document.
 

Uses of Pluggable in org.apache.nutch.indexer.feed
 

Classes in org.apache.nutch.indexer.feed that implement Pluggable
 class FeedIndexingFilter
           
 

Uses of Pluggable in org.apache.nutch.indexer.metadata
 

Classes in org.apache.nutch.indexer.metadata that implement Pluggable
 class MetadataIndexer
          Indexer which can be configured to extract metadata from the crawldb, parse metadata or content metadata.
 

Uses of Pluggable in org.apache.nutch.indexer.more
 

Classes in org.apache.nutch.indexer.more that implement Pluggable
 class MoreIndexingFilter
          Add (or reset) a few metaData properties as respective fields (if they are available), so that they can be displayed by more.jsp (called by search.jsp).
 

Uses of Pluggable in org.apache.nutch.indexer.staticfield
 

Classes in org.apache.nutch.indexer.staticfield that implement Pluggable
 class StaticFieldIndexer
          A simple plugin called at indexing that adds fields with static data.
 

Uses of Pluggable in org.apache.nutch.indexer.subcollection
 

Classes in org.apache.nutch.indexer.subcollection that implement Pluggable
 class SubcollectionIndexingFilter
           
 

Uses of Pluggable in org.apache.nutch.indexer.tld
 

Classes in org.apache.nutch.indexer.tld that implement Pluggable
 class TLDIndexingFilter
          Adds the top-level domain extensions to the index.
 

Uses of Pluggable in org.apache.nutch.indexer.urlmeta
 

Classes in org.apache.nutch.indexer.urlmeta that implement Pluggable
 class URLMetaIndexingFilter
          This is part of the URL Meta plugin.
 

Uses of Pluggable in org.apache.nutch.microformats.reltag
 

Classes in org.apache.nutch.microformats.reltag that implement Pluggable
 class RelTagIndexingFilter
          An IndexingFilter that adds tag field(s) to the document.
 class RelTagParser
          Adds microformat rel-tags of document if found.
 

Uses of Pluggable in org.apache.nutch.net
 

Subinterfaces of Pluggable in org.apache.nutch.net
 interface URLFilter
          Interface used to limit which URLs enter Nutch.
 

Uses of Pluggable in org.apache.nutch.parse
 

Subinterfaces of Pluggable in org.apache.nutch.parse
 interface HtmlParseFilter
          Extension point for DOM-based HTML parsers.
 interface Parser
          A parser for content generated by a Protocol implementation.
 

Classes in org.apache.nutch.parse that implement Pluggable
 class MetaTagsParser
          Parse HTML meta tags (keywords, description) and store them in the parse metadata so that they can be indexed with the index-metadata plugin with the prefix 'metatag.'
 

Uses of Pluggable in org.apache.nutch.parse.ext
 

Classes in org.apache.nutch.parse.ext that implement Pluggable
 class ExtParser
          A wrapper that invokes an external command to do the actual parsing.
 

Uses of Pluggable in org.apache.nutch.parse.feed
 

Classes in org.apache.nutch.parse.feed that implement Pluggable
 class FeedParser
           
 

Uses of Pluggable in org.apache.nutch.parse.headings
 

Classes in org.apache.nutch.parse.headings that implement Pluggable
 class HeadingsParseFilter
          HtmlParseFilter to retrieve h1 and h2 values from the DOM.
 

Uses of Pluggable in org.apache.nutch.parse.html
 

Classes in org.apache.nutch.parse.html that implement Pluggable
 class HtmlParser
           
 

Uses of Pluggable in org.apache.nutch.parse.js
 

Classes in org.apache.nutch.parse.js that implement Pluggable
 class JSParseFilter
          This class is a heuristic link extractor for JavaScript files and code snippets.
 

Uses of Pluggable in org.apache.nutch.parse.swf
 

Classes in org.apache.nutch.parse.swf that implement Pluggable
 class SWFParser
          Parser for Flash SWF files.
 

Uses of Pluggable in org.apache.nutch.parse.tika
 

Classes in org.apache.nutch.parse.tika that implement Pluggable
 class TikaParser
          Wrapper for Tika parsers.
 

Uses of Pluggable in org.apache.nutch.parse.zip
 

Classes in org.apache.nutch.parse.zip that implement Pluggable
 class ZipParser
          ZipParser class based on MSPowerPointParser class by Stephan Strittmatter.
 

Uses of Pluggable in org.apache.nutch.protocol
 

Subinterfaces of Pluggable in org.apache.nutch.protocol
 interface Protocol
          A retriever of url content.
 

Uses of Pluggable in org.apache.nutch.protocol.file
 

Classes in org.apache.nutch.protocol.file that implement Pluggable
 class File
          File.java deals with file: scheme.
 

Uses of Pluggable in org.apache.nutch.protocol.ftp
 

Classes in org.apache.nutch.protocol.ftp that implement Pluggable
 class Ftp
          Ftp.java deals with ftp: scheme.
 

Uses of Pluggable in org.apache.nutch.protocol.http
 

Classes in org.apache.nutch.protocol.http that implement Pluggable
 class Http
           
 

Uses of Pluggable in org.apache.nutch.protocol.http.api
 

Classes in org.apache.nutch.protocol.http.api that implement Pluggable
 class HttpBase
           
 

Uses of Pluggable in org.apache.nutch.scoring
 

Subinterfaces of Pluggable in org.apache.nutch.scoring
 interface ScoringFilter
          A contract defining behavior of scoring plugins.
 

Classes in org.apache.nutch.scoring that implement Pluggable
 class ScoringFilters
          Creates and caches ScoringFilter implementing plugins.
 

Uses of Pluggable in org.apache.nutch.scoring.link
 

Classes in org.apache.nutch.scoring.link that implement Pluggable
 class LinkAnalysisScoringFilter
           
 

Uses of Pluggable in org.apache.nutch.scoring.opic
 

Classes in org.apache.nutch.scoring.opic that implement Pluggable
 class OPICScoringFilter
          This plugin implements a variant of an Online Page Importance Computation (OPIC) score, described in this paper: Abiteboul, Serge and Preda, Mihai and Cobena, Gregory (2003), Adaptive On-Line Page Importance Computation.
 

Uses of Pluggable in org.apache.nutch.scoring.tld
 

Classes in org.apache.nutch.scoring.tld that implement Pluggable
 class TLDScoringFilter
          Scoring filter to boost tlds.
 

Uses of Pluggable in org.apache.nutch.scoring.urlmeta
 

Classes in org.apache.nutch.scoring.urlmeta that implement Pluggable
 class URLMetaScoringFilter
          For documentation:
 

Uses of Pluggable in org.apache.nutch.urlfilter.api
 

Classes in org.apache.nutch.urlfilter.api that implement Pluggable
 class RegexURLFilterBase
          Generic URL filter based on regular expressions.
 

Uses of Pluggable in org.apache.nutch.urlfilter.automaton
 

Classes in org.apache.nutch.urlfilter.automaton that implement Pluggable
 class AutomatonURLFilter
          RegexURLFilterBase implementation based on the dk.brics.automaton Finite-State Automata for Java™.
 

Uses of Pluggable in org.apache.nutch.urlfilter.domain
 

Classes in org.apache.nutch.urlfilter.domain that implement Pluggable
 class DomainURLFilter
          Filters URLs based on a file containing domain suffixes, domain names, and hostnames.
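
          The idea of matching a URL against domain suffixes, domain names, and hostnames can be illustrated with a small standalone sketch. This is not the DomainURLFilter source: the class name SimpleDomainFilter, its constructor, and the label-by-label suffix walk are assumptions made for illustration, and the entries are passed in directly rather than read from a file. The sketch also assumes the convention that a filter returns the URL to keep it and null to drop it.

```java
import java.net.URI;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

/** Hypothetical sketch: accepts URLs whose host, or any parent suffix of it, is listed. */
final class SimpleDomainFilter {
    private final Set<String> allowed = new HashSet<>();

    /** Entries may be hostnames, domain names, or bare suffixes such as "org". */
    SimpleDomainFilter(String... entries) {
        for (String entry : entries) {
            allowed.add(entry.toLowerCase(Locale.ROOT));
        }
    }

    /** Returns the URL unchanged if it is accepted, or null if it is rejected. */
    String filter(String url) {
        String host;
        try {
            host = URI.create(url).getHost();
        } catch (IllegalArgumentException e) {
            return null; // not parseable as a URI
        }
        if (host == null) {
            return null; // no host component, nothing to match against
        }
        // Try the full host first, then strip one leading label at a time:
        // "www.example.org" -> "example.org" -> "org".
        String candidate = host.toLowerCase(Locale.ROOT);
        while (true) {
            if (allowed.contains(candidate)) {
                return url;
            }
            int dot = candidate.indexOf('.');
            if (dot < 0) {
                return null;
            }
            candidate = candidate.substring(dot + 1);
        }
    }
}
```

          With the single entry "example.org", this sketch keeps http://www.example.org/page and drops http://example.com/. A blacklist variant would simply invert the two outcomes.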
 

Uses of Pluggable in org.apache.nutch.urlfilter.domainblacklist
 

Classes in org.apache.nutch.urlfilter.domainblacklist that implement Pluggable
 class DomainBlacklistURLFilter
          Filters URLs based on a file containing domain suffixes, domain names, and hostnames.
 

Uses of Pluggable in org.apache.nutch.urlfilter.prefix
 

Classes in org.apache.nutch.urlfilter.prefix that implement Pluggable
 class PrefixURLFilter
          Filters URLs based on a file of URL prefixes.
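
          As a rough illustration of prefix-based filtering, the following standalone sketch accepts a URL only when it starts with one of a set of prefixes. It is not the PrefixURLFilter source: the names SimpleUrlFilter and SimplePrefixFilter are hypothetical, the prefixes are passed in directly rather than read from a file, and the linear scan is chosen for brevity. It assumes the convention that a filter returns the URL to keep it and null to drop it.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical stand-in for a URL filter contract; not the Nutch interface itself. */
interface SimpleUrlFilter {
    /** Returns the URL if it is accepted, or null if it is rejected. */
    String filter(String url);
}

/** Accepts a URL only when it starts with one of the configured prefixes. */
final class SimplePrefixFilter implements SimpleUrlFilter {
    private final List<String> prefixes;

    SimplePrefixFilter(List<String> prefixes) {
        this.prefixes = new ArrayList<>(prefixes); // defensive copy
    }

    @Override
    public String filter(String url) {
        if (url == null) {
            return null;
        }
        for (String prefix : prefixes) {
            if (url.startsWith(prefix)) {
                return url;
            }
        }
        return null;
    }
}

public class PrefixFilterDemo {
    public static void main(String[] args) {
        SimpleUrlFilter filter = new SimplePrefixFilter(
                Arrays.asList("http://example.org/", "https://example.org/"));
        System.out.println(filter.filter("http://example.org/index.html"));
        System.out.println(filter.filter("ftp://example.org/file"));
    }
}
```

          For a large prefix list, a trie or sorted structure would avoid scanning every prefix on each call.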
 

Uses of Pluggable in org.apache.nutch.urlfilter.regex
 

Classes in org.apache.nutch.urlfilter.regex that implement Pluggable
 class RegexURLFilter
          Filters URLs based on a file of regular expressions using the Java Regex implementation.
 

Uses of Pluggable in org.apache.nutch.urlfilter.suffix
 

Classes in org.apache.nutch.urlfilter.suffix that implement Pluggable
 class SuffixURLFilter
          Filters URLs based on a file of URL suffixes.
 

Uses of Pluggable in org.apache.nutch.urlfilter.validator
 

Classes in org.apache.nutch.urlfilter.validator that implement Pluggable
 class UrlValidator
          Validates URLs.
 

Uses of Pluggable in org.creativecommons.nutch
 

Classes in org.creativecommons.nutch that implement Pluggable
 class CCIndexingFilter
          Adds basic searchable fields to a document.
 class CCParseFilter
          Adds metadata identifying the Creative Commons license used, if any.
 



Copyright © 2012 The Apache Software Foundation