Package org.creativecommons.nutch

Sample plugins that parse and index Creative Commons medadata.

See:
          Description

Class Summary
CCDeleteUnlicensedTool Deletes documents in a set of Lucene indexes that do not have a Creative Commons license.
CCIndexingFilter Adds basic searchable fields to a document.
CCParseFilter Adds metadata identifying the Creative Commons license used, if any.
CCParseFilter.Walker Walks DOM tree, looking for RDF in comments and licenses in anchors.
CCQueryFilter Handles "cc:" query clauses, causing them to search the "cc" field indexed by CCIndexingFilter.
 

Package org.creativecommons.nutch Description

Sample plugins that parse and index Creative Commons medadata.



Copyright © 2006 The Apache Software Foundation