public class NodeDumper extends Configured implements Tool
Modifier and Type | Class and Description |
---|---|
static class |
NodeDumper.Dumper
Outputs the hosts or domains with an associated value.
|
static class |
NodeDumper.Sorter
Outputs the top urls sorted in descending order.
|
Modifier and Type | Field and Description |
---|---|
static org.slf4j.Logger |
LOG |
Constructor and Description |
---|
NodeDumper() |
Modifier and Type | Method and Description |
---|---|
void |
dumpNodes(Path webGraphDb,
org.apache.nutch.scoring.webgraph.NodeDumper.DumpType type,
long topN,
Path output,
boolean asEff,
org.apache.nutch.scoring.webgraph.NodeDumper.NameType nameType,
org.apache.nutch.scoring.webgraph.NodeDumper.AggrType aggrType,
boolean asSequenceFile)
Runs the process to dump the top urls out to a text file.
|
static void |
main(String[] args) |
int |
run(String[] args)
Runs the node dumper tool.
|
getConf, setConf
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
getConf, setConf
public void dumpNodes(Path webGraphDb, org.apache.nutch.scoring.webgraph.NodeDumper.DumpType type, long topN, Path output, boolean asEff, org.apache.nutch.scoring.webgraph.NodeDumper.NameType nameType, org.apache.nutch.scoring.webgraph.NodeDumper.AggrType aggrType, boolean asSequenceFile) throws Exception
webGraphDb
- The WebGraph from which to pull values.topN
- output
- IOException
- If an error occurs while dumping the top values.Exception
Copyright © 2015 The Apache Software Foundation