Revision Log
NUTCH-758 Set subversion eol-style to "native".
NUTCH-662: Upgrade Nutch to use Lucene 2.4
NUTCH-634 Upgrade Nutch to Hadoop 0.17.1.
Avoid NPE when processing empty / corrupted indexes.
NUTCH-598 - Remove deprecated use of ToolBase. Use generics in Hadoop API.
NUTCH-494 - FindBugs: CrawlDbReader and DeleteDuplicates.
NUTCH-552 - Upgrade Nutch to Hadoop 0.15.x.
NUTCH-525 - DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment. Contributed by Vishal Shah.
Prevent NPE when working with small, possibly empty indexes.
Fix NUTCH-420 - DeleteDuplicates depended on the order of IndexDoc processing.
Upgrade to Hadoop 0.10.1. HTTPClient is now a dependency - move it to lib/ and remove it as a plugin. Also add native Linux libraries for Hadoop compression, plus corresponding logic in bin/nutch. Hadoop uses larger buffers now - explicitly set large heap size for JUnit tests. All tests should pass now.
NUTCH-400 update headers
NUTCH-383: upgrade to Hadoop 0.7.1 and Lucene 2.0.0. NUTCH-373: replace DeleteDuplicates with a version that implements both parts of the algorithm. Add JUnit test.