org.creativecommons.nutch
Class CCParseFilter.Walker

java.lang.Object
  extended by org.creativecommons.nutch.CCParseFilter.Walker
Enclosing class:
CCParseFilter

public static class CCParseFilter.Walker
extends Object

Walks DOM tree, looking for RDF in comments and licenses in anchors.


Method Summary
static void walk(Node doc, URL base, Metadata metadata, Configuration conf)
          Scan the document adding attributes to metadata.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Method Detail

walk

public static void walk(Node doc,
                        URL base,
                        Metadata metadata,
                        Configuration conf)
                 throws ParseException
Scan the document adding attributes to metadata.

Throws:
ParseException


Copyright © 2011 The Apache Software Foundation