Uses of Interface
org.apache.nutch.parse.HtmlParseFilter

Packages that use HtmlParseFilter
org.apache.nutch.analysis.lang Text document language identifier. 
org.apache.nutch.microformats.reltag A microformats Rel-Tag Parser/Indexer/Querier plugin. 
org.apache.nutch.parse   
org.apache.nutch.parse.headings   
org.apache.nutch.parse.js   
org.creativecommons.nutch Sample plugins that parse and index Creative Commons medadata. 
 

Uses of HtmlParseFilter in org.apache.nutch.analysis.lang
 

Classes in org.apache.nutch.analysis.lang that implement HtmlParseFilter
 class HTMLLanguageParser
           
 

Uses of HtmlParseFilter in org.apache.nutch.microformats.reltag
 

Classes in org.apache.nutch.microformats.reltag that implement HtmlParseFilter
 class RelTagParser
          Adds microformat rel-tags of document if found.
 

Uses of HtmlParseFilter in org.apache.nutch.parse
 

Classes in org.apache.nutch.parse that implement HtmlParseFilter
 class MetaTagsParser
          Parse HTML meta tags (keywords, description) and store them in the parse metadata so that they can be indexed with the index-metadata plugin with the prefix 'metatag.'
 

Uses of HtmlParseFilter in org.apache.nutch.parse.headings
 

Classes in org.apache.nutch.parse.headings that implement HtmlParseFilter
 class HeadingsParseFilter
          HtmlParseFilter to retrieve h1 and h2 values from the DOM.
 

Uses of HtmlParseFilter in org.apache.nutch.parse.js
 

Classes in org.apache.nutch.parse.js that implement HtmlParseFilter
 class JSParseFilter
          This class is a heuristic link extractor for JavaScript files and code snippets.
 

Uses of HtmlParseFilter in org.creativecommons.nutch
 

Classes in org.creativecommons.nutch that implement HtmlParseFilter
 class CCParseFilter
          Adds metadata identifying the Creative Commons license used, if any.
 



Copyright © 2012 The Apache Software Foundation