|
||||||||||
PREV NEXT | FRAMES NO FRAMES |
Packages that use org.apache.nutch.crawl | |
---|---|
org.apache.nutch.analysis.lang | Text document language identifier. |
org.apache.nutch.crawl | Crawl control code. |
org.apache.nutch.fetcher | The Nutch robot. |
org.apache.nutch.indexer | Maintain Lucene full-text indexes. |
org.apache.nutch.indexer.basic | A basic indexing plugin. |
org.apache.nutch.indexer.more | A more indexing plugin. |
org.apache.nutch.indexer.solr | |
org.apache.nutch.metadata | A Multi-valued Metadata container, and set of constant fields for Nutch Metadata. |
org.apache.nutch.microformats.reltag | A microformats Rel-Tag Parser/Indexer/Querier plugin. |
org.apache.nutch.protocol | |
org.apache.nutch.protocol.file | Protocol plugin which supports retrieving local file resources. |
org.apache.nutch.protocol.ftp | Protocol plugin which supports retrieving documents via the ftp protocol. |
org.apache.nutch.protocol.http | Protocol plugin which supports retrieving documents via the http protocol. |
org.apache.nutch.protocol.http.api | Common API used by HTTP plugins (http ,
httpclient ) |
org.apache.nutch.scoring | |
org.apache.nutch.scoring.opic | |
org.apache.nutch.scoring.webgraph | |
org.apache.nutch.segment | |
org.apache.nutch.tools | |
org.apache.nutch.tools.arc | |
org.apache.nutch.util.domain | org.apache.nutch.util.domain |
org.creativecommons.nutch | Sample plugins that parse and index Creative Commons medadata. |
Classes in org.apache.nutch.crawl used by org.apache.nutch.analysis.lang | |
---|---|
CrawlDatum
|
|
Inlinks
A list of Inlink s. |
Classes in org.apache.nutch.crawl used by org.apache.nutch.crawl | |
---|---|
AbstractFetchSchedule
This class provides common methods for implementations of FetchSchedule . |
|
CrawlDatum
|
|
FetchSchedule
This interface defines the contract for implementations that manipulate fetch times and re-fetch intervals. |
|
Generator.SelectorEntry
|
|
Inlink
|
|
Inlinks
A list of Inlink s. |
|
MapWritable
Deprecated. Use org.apache.hadoop.io.MapWritable instead. |
|
Signature
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.fetcher | |
---|---|
CrawlDatum
|
|
NutchWritable
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.indexer | |
---|---|
CrawlDatum
|
|
Inlinks
A list of Inlink s. |
|
NutchWritable
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.indexer.basic | |
---|---|
CrawlDatum
|
|
Inlinks
A list of Inlink s. |
Classes in org.apache.nutch.crawl used by org.apache.nutch.indexer.more | |
---|---|
CrawlDatum
|
|
Inlinks
A list of Inlink s. |
Classes in org.apache.nutch.crawl used by org.apache.nutch.indexer.solr | |
---|---|
CrawlDatum
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.metadata | |
---|---|
NutchWritable
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.microformats.reltag | |
---|---|
CrawlDatum
|
|
Inlinks
A list of Inlink s. |
Classes in org.apache.nutch.crawl used by org.apache.nutch.protocol | |
---|---|
CrawlDatum
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.protocol.file | |
---|---|
CrawlDatum
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.protocol.ftp | |
---|---|
CrawlDatum
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.protocol.http | |
---|---|
CrawlDatum
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.protocol.http.api | |
---|---|
CrawlDatum
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.scoring | |
---|---|
CrawlDatum
|
|
Inlinks
A list of Inlink s. |
Classes in org.apache.nutch.crawl used by org.apache.nutch.scoring.opic | |
---|---|
CrawlDatum
|
|
Inlinks
A list of Inlink s. |
Classes in org.apache.nutch.crawl used by org.apache.nutch.scoring.webgraph | |
---|---|
CrawlDatum
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.segment | |
---|---|
CrawlDatum
|
|
NutchWritable
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.tools | |
---|---|
CrawlDatum
|
|
Generator.SelectorEntry
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.tools.arc | |
---|---|
NutchWritable
|
Classes in org.apache.nutch.crawl used by org.apache.nutch.util.domain | |
---|---|
CrawlDatum
|
Classes in org.apache.nutch.crawl used by org.creativecommons.nutch | |
---|---|
CrawlDatum
|
|
Inlinks
A list of Inlink s. |
|
||||||||||
PREV NEXT | FRAMES NO FRAMES |