org.apache.nutch.scoring.webgraph
Class NodeReader

java.lang.Object
  extended by org.apache.hadoop.conf.Configured
      extended by org.apache.nutch.scoring.webgraph.NodeReader
All Implemented Interfaces:
Configurable

public class NodeReader
extends Configured

Reads and prints to system out information for a single node from the NodeDb in the WebGraph.


Constructor Summary
NodeReader()
           
NodeReader(Configuration conf)
           
 
Method Summary
 void dumpUrl(Path webGraphDb, String url)
          Prints the content of the Node represented by the url to system out.
static void main(String[] args)
          Runs the NodeReader tool.
 
Methods inherited from class org.apache.hadoop.conf.Configured
getConf, setConf
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

NodeReader

public NodeReader()

NodeReader

public NodeReader(Configuration conf)
Method Detail

dumpUrl

public void dumpUrl(Path webGraphDb,
                    String url)
             throws IOException
Prints the content of the Node represented by the url to system out.

Parameters:
webGraphDb - The webgraph from which to get the node.
url - The url of the node.
Throws:
IOException - If an error occurs while getting the node.

main

public static void main(String[] args)
                 throws Exception
Runs the NodeReader tool. The command line arguments must contain a webgraphdb path and a url. The url must match the normalized url that is contained in the NodeDb of the WebGraph.

Throws:
Exception


Copyright © 2012 The Apache Software Foundation