org.apache.nutch.microformats.reltag
Class RelTagParser

java.lang.Object
  extended by org.apache.nutch.microformats.reltag.RelTagParser
All Implemented Interfaces:
Configurable, HtmlParseFilter, Pluggable

public class RelTagParser
extends Object
implements HtmlParseFilter

Adds microformat rel-tags of document if found.

Author:
Jérôme Charron
See Also:
http://www.microformats.org/wiki/rel-tag

Field Summary
static org.slf4j.Logger LOG
           
static String REL_TAG
           
 
Fields inherited from interface org.apache.nutch.parse.HtmlParseFilter
X_POINT_ID
 
Constructor Summary
RelTagParser()
           
 
Method Summary
 ParseResult filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)
          Scan the HTML document looking at possible rel-tags
 Configuration getConf()
           
 void setConf(Configuration conf)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

LOG

public static final org.slf4j.Logger LOG

REL_TAG

public static final String REL_TAG
See Also:
Constant Field Values
Constructor Detail

RelTagParser

public RelTagParser()
Method Detail

filter

public ParseResult filter(Content content,
                          ParseResult parseResult,
                          HTMLMetaTags metaTags,
                          DocumentFragment doc)
Scan the HTML document looking at possible rel-tags

Specified by:
filter in interface HtmlParseFilter

setConf

public void setConf(Configuration conf)
Specified by:
setConf in interface Configurable

getConf

public Configuration getConf()
Specified by:
getConf in interface Configurable


Copyright © 2012 The Apache Software Foundation