Package | Description |
---|---|
org.apache.nutch.parse | |
org.apache.nutch.segment |
Modifier and Type | Method and Description |
---|---|
static ParseText |
ParseText.read(DataInput in) |
Modifier and Type | Method and Description |
---|---|
void |
ParseResult.put(String key,
ParseText text,
ParseData data)
Store a result of parsing.
|
void |
ParseResult.put(org.apache.hadoop.io.Text key,
ParseText text,
ParseData data)
Store a result of parsing.
|
Constructor and Description |
---|
ParseImpl(ParseText text,
ParseData data) |
ParseImpl(ParseText text,
ParseData data,
boolean isCanonical) |
Modifier and Type | Method and Description |
---|---|
boolean |
SegmentMergeFilter.filter(org.apache.hadoop.io.Text key,
CrawlDatum generateData,
CrawlDatum fetchData,
CrawlDatum sigData,
Content content,
ParseData parseData,
ParseText parseText,
Collection<CrawlDatum> linked)
The filtering method which gets all information being merged for a given
key (URL).
|
boolean |
SegmentMergeFilters.filter(org.apache.hadoop.io.Text key,
CrawlDatum generateData,
CrawlDatum fetchData,
CrawlDatum sigData,
Content content,
ParseData parseData,
ParseText parseText,
Collection<CrawlDatum> linked)
Iterates over all
SegmentMergeFilter extensions and if any of them
returns false, it will return false as well. |
Copyright © 2014 The Apache Software Foundation