org.creativecommons.nutch
Class CCDeleteUnlicensedTool

java.lang.Object
  extended by org.creativecommons.nutch.CCDeleteUnlicensedTool

public class CCDeleteUnlicensedTool
extends Object

Deletes documents in a set of Lucene indexes that do not have a Creative Commons license.


Constructor Summary
CCDeleteUnlicensedTool(IndexReader[] readers)
          Constructs a duplicate detector for the provided indexes.
 
Method Summary
 void close()
          Closes the indexes, saving changes.
 int deleteUnlicensed()
          Delete pages without CC licenes.
static void main(String[] args)
          Delete duplicates in the indexes in the named directory.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

CCDeleteUnlicensedTool

public CCDeleteUnlicensedTool(IndexReader[] readers)
Constructs a duplicate detector for the provided indexes.

Method Detail

close

public void close()
           throws IOException
Closes the indexes, saving changes.

Throws:
IOException

deleteUnlicensed

public int deleteUnlicensed()
                     throws IOException
Delete pages without CC licenes.

Throws:
IOException

main

public static void main(String[] args)
                 throws Exception
Delete duplicates in the indexes in the named directory.

Throws:
Exception


Copyright © 2006 The Apache Software Foundation