org.creativecommons.nutch
Class CCParseFilter.Walker
java.lang.Object
org.creativecommons.nutch.CCParseFilter.Walker
- Enclosing class:
- CCParseFilter
public static class CCParseFilter.Walker
- extends Object
Walks DOM tree, looking for RDF in comments and licenses in anchors.
Method Summary |
static void |
walk(Node doc,
URL base,
WebPage page,
org.apache.hadoop.conf.Configuration conf)
Scan the document adding attributes to metadata. |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
walk
public static void walk(Node doc,
URL base,
WebPage page,
org.apache.hadoop.conf.Configuration conf)
throws ParseException
- Scan the document adding attributes to metadata.
- Throws:
ParseException
Copyright © 2013 The Apache Software Foundation