public class NodeDumper
extends org.apache.hadoop.conf.Configured
implements org.apache.hadoop.util.Tool
Modifier and Type | Class and Description |
---|---|
static class |
NodeDumper.Dumper
Outputs the hosts or domains with an associated value.
|
static class |
NodeDumper.Sorter
Outputs the top urls sorted in descending order.
|
Modifier and Type | Field and Description |
---|---|
static org.slf4j.Logger |
LOG |
Constructor and Description |
---|
NodeDumper() |
Modifier and Type | Method and Description |
---|---|
void |
dumpNodes(org.apache.hadoop.fs.Path webGraphDb,
org.apache.nutch.scoring.webgraph.NodeDumper.DumpType type,
long topN,
org.apache.hadoop.fs.Path output,
boolean asEff,
org.apache.nutch.scoring.webgraph.NodeDumper.NameType nameType,
org.apache.nutch.scoring.webgraph.NodeDumper.AggrType aggrType,
boolean asSequenceFile)
Runs the process to dump the top urls out to a text file.
|
static void |
main(String[] args) |
int |
run(String[] args)
Runs the node dumper tool.
|
public void dumpNodes(org.apache.hadoop.fs.Path webGraphDb, org.apache.nutch.scoring.webgraph.NodeDumper.DumpType type, long topN, org.apache.hadoop.fs.Path output, boolean asEff, org.apache.nutch.scoring.webgraph.NodeDumper.NameType nameType, org.apache.nutch.scoring.webgraph.NodeDumper.AggrType aggrType, boolean asSequenceFile) throws Exception
webGraphDb
- The WebGraph from which to pull values.topN
- output
- IOException
- If an error occurs while dumping the top values.Exception
Copyright © 2014 The Apache Software Foundation