org.apache.nutch.indexer.solr
Class SolrClean

java.lang.Object
  extended by org.apache.nutch.indexer.solr.SolrClean
All Implemented Interfaces:
Configurable, Tool

public class SolrClean
extends Object
implements Tool

The class scans CrawlDB looking for entries with status DB_GONE (404) and sends delete requests to Solr for those documents.

Author:
Claudio Martella

Nested Class Summary
static class SolrClean.DBFilter
           
static class SolrClean.SolrDeleter
           
 
Field Summary
static org.slf4j.Logger LOG
           
 
Constructor Summary
SolrClean()
           
 
Method Summary
 void delete(String crawldb, String solrUrl, boolean noCommit)
           
 Configuration getConf()
           
static void main(String[] args)
           
 int run(String[] args)
           
 void setConf(Configuration conf)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

LOG

public static final org.slf4j.Logger LOG
Constructor Detail

SolrClean

public SolrClean()
Method Detail

getConf

public Configuration getConf()
Specified by:
getConf in interface Configurable

setConf

public void setConf(Configuration conf)
Specified by:
setConf in interface Configurable

delete

public void delete(String crawldb,
                   String solrUrl,
                   boolean noCommit)
            throws IOException
Throws:
IOException

run

public int run(String[] args)
        throws IOException
Specified by:
run in interface Tool
Throws:
IOException

main

public static void main(String[] args)
                 throws Exception
Throws:
Exception


Copyright © 2012 The Apache Software Foundation