/[Apache-SVN]/lucene/nutch/trunk/src/java/org/apache/nutch/indexer/IndexMerger.java
ViewVC logotype

Log of /lucene/nutch/trunk/src/java/org/apache/nutch/indexer/IndexMerger.java

Parent Directory Parent Directory | Revision Log Revision Log


Links to HEAD: (view) (annotate)
Sticky Revision:

Revision 823614 - (view) (annotate) - [select for diffs]
Modified Fri Oct 9 17:02:32 2009 UTC (6 weeks, 4 days ago) by ab
File length: 5377 byte(s)
Diff to previous 678533 (colored)
NUTCH-758 Set subversion eol-style to "native".

Revision 678533 - (view) (annotate) - [select for diffs]
Modified Mon Jul 21 19:20:21 2008 UTC (16 months ago) by ab
File length: 5377 byte(s)
Diff to previous 638779 (colored)
NUTCH-634 Upgrade Nutch to Hadoop 0.17.1 .

Revision 638779 - (view) (annotate) - [select for diffs]
Modified Wed Mar 19 10:34:14 2008 UTC (20 months, 1 week ago) by ab
File length: 5295 byte(s)
Diff to previous 613378 (colored)
NUTCH-598 - Remove deprecated use of ToolBase. Use generics in Hadoop API.

Revision 613378 - (view) (annotate) - [select for diffs]
Modified Sat Jan 19 08:59:29 2008 UTC (22 months ago) by siren
File length: 5288 byte(s)
Diff to previous 555307 (colored)
NUTCH-580 Remove deprecated hadoop api calls (FS)

Revision 555307 - (view) (annotate) - [select for diffs]
Modified Wed Jul 11 15:30:29 2007 UTC (2 years, 4 months ago) by dogacan
File length: 5212 byte(s)
Diff to previous 515791 (colored)
NUTCH-510 - IndexMerger delete working dir. Contributed by Enis.

Revision 515791 - (view) (annotate) - [select for diffs]
Modified Wed Mar 7 21:59:07 2007 UTC (2 years, 8 months ago) by ab
File length: 5131 byte(s)
Diff to previous 473936 (colored)
Upgrade to Hadoop 0.11.2 and Lucene 2.1.0 releases.

Revision 473936 - (view) (annotate) - [select for diffs]
Modified Sun Nov 12 11:37:02 2006 UTC (3 years ago) by siren
File length: 5149 byte(s)
Diff to previous 464654 (colored)
NUTCH-400 update headers

Revision 464654 - (view) (annotate) - [select for diffs]
Modified Mon Oct 16 20:38:57 2006 UTC (3 years, 1 month ago) by ab
File length: 4961 byte(s)
Diff to previous 438670 (colored)
NUTCH-383: upgrade to Hadoop 0.7.1 and Lucene 2.0.0.

NUTCH-373: replace DeleteDuplicates with a version that implements both
parts of the algorithm. Add JUnit test.

Revision 438670 - (view) (annotate) - [select for diffs]
Modified Wed Aug 30 22:12:53 2006 UTC (3 years, 2 months ago) by ab
File length: 4960 byte(s)
Diff to previous 432674 (colored)
This patch addresses two issues:

* NUTCH-242: The code to activate url normalization and filtering has been
  refactored and extracted into CrawlDbFilter and LinkDbFilter. These
  two concerns (normmaliztion and filtering) have been made independent.
  Command line options have been modified to reflect these changes.

* NUTCH-143: all command-line tools have been modified to return
  meaningful OS exit codes. At the moment this uses a modified copy of
  Hadoop's ToolBase, which will be removed when HADOOP-488 is fixed and
  Nutch upgrades to Hadoop 0.6.0 .

All JUnit tests pass.

Revision 432674 - (view) (annotate) - [select for diffs]
Modified Fri Aug 18 18:48:29 2006 UTC (3 years, 3 months ago) by ab
File length: 4935 byte(s)
Diff to previous 416346 (colored)
NUTCH-341 - if -workingdir is specified, always create a unique subdir.
Also, use unique directory names to allow multiple IndexMergers to run
simultaneously.

Revision 416346 - (view) (annotate) - [select for diffs]
Modified Thu Jun 22 12:20:29 2006 UTC (3 years, 5 months ago) by jerome
File length: 4860 byte(s)
Diff to previous 413742 (colored)
NUTCH-309 : Added logging code guards

Revision 413742 - (view) (annotate) - [select for diffs]
Modified Mon Jun 12 20:51:40 2006 UTC (3 years, 5 months ago) by jerome
File length: 4763 byte(s)
Diff to previous 405204 (colored)
NUTCH-303 : Make use of the Commons Logging API and use log4j as the default implementation

Revision 405204 - (view) (annotate) - [select for diffs]
Modified Mon May 8 22:34:29 2006 UTC (3 years, 6 months ago) by cutting
File length: 4759 byte(s)
Diff to previous 383304 (colored)
Change parameters passed to Hadoop's FileSystem from (now-deprecated) java.io.File to (new) org.apache.hadoop.fs.Path.

Revision 383304 - (view) (annotate) - [select for diffs]
Modified Sun Mar 5 10:55:17 2006 UTC (3 years, 8 months ago) by siren
File length: 4760 byte(s)
Diff to previous 382912 (colored)
NUTCH-221, removed deprecated Lucene API usage

Revision 382912 - (view) (annotate) - [select for diffs]
Modified Fri Mar 3 19:05:41 2006 UTC (3 years, 8 months ago) by cutting
File length: 4746 byte(s)
Diff to previous 381751 (colored)
Undo unintentional changes made in r381751.  Thanks, Jerome, for catching this!

Revision 381751 - (view) (annotate) - [select for diffs]
Modified Tue Feb 28 19:25:12 2006 UTC (3 years, 8 months ago) by cutting
File length: 5129 byte(s)
Diff to previous 374796 (colored)
Adding DOAP for Nutch.  Contributed by Chris Mattmann.

Revision 374796 - (view) (annotate) - [select for diffs]
Modified Sat Feb 4 00:38:32 2006 UTC (3 years, 9 months ago) by cutting
File length: 4746 byte(s)
Diff to previous 374741 (colored)
NUTCH-193: MapReduce and NDFS code moved to new project, Hadoop.  See bug report for details.

Revision 374741 - (view) (annotate) - [select for diffs]
Modified Fri Feb 3 20:56:28 2006 UTC (3 years, 9 months ago) by siren
File length: 4729 byte(s)
Diff to previous 373853 (colored)
removed unused imports

Revision 373853 - (view) (annotate) - [select for diffs]
Modified Tue Jan 31 16:08:58 2006 UTC (3 years, 9 months ago) by ab
File length: 4793 byte(s)
Diff to previous 357197 (colored)
Apply patches from NUTCH-169 (remove static NutchConf).

Submitted by: Marko Bauhardt, Stefan Groschupf, Jerome Charron.


Revision 357197 - (view) (annotate) - [select for diffs]
Modified Fri Dec 16 17:51:05 2005 UTC (3 years, 11 months ago) by cutting
File length: 4873 byte(s)
Diff to previous 179640 (colored)
Merge mapred branch to trunk & remove it.

Revision 179640 - (view) (annotate) - [select for diffs]
Modified Thu Jun 2 20:37:21 2005 UTC (4 years, 5 months ago) by cutting
File length: 5593 byte(s)
Diff to previous 169406 (colored)
Moving Nutch from the Incubator to Lucene.

Revision 169406 - (view) (annotate) - [select for diffs]
Modified Tue May 10 03:20:11 2005 UTC (4 years, 6 months ago) by cutting
Original Path: incubator/nutch/trunk/src/java/org/apache/nutch/indexer/IndexMerger.java
File length: 5593 byte(s)
Diff to previous 155829 (colored)
Add ability to set Lucene's term index interval from config.

Revision 155829 - (view) (annotate) - [select for diffs]
Added Tue Mar 1 22:04:46 2005 UTC (4 years, 8 months ago) by cutting
Original Path: incubator/nutch/trunk/src/java/org/apache/nutch/indexer/IndexMerger.java
File length: 5378 byte(s)
Initial import of Nutch to Apache.

This form allows you to request diffs between any two revisions of this file. For each of the two "sides" of the diff, enter a numeric revision.

  Diffs between and
  Type of Diff should be a

apache@apache.org
ViewVC Help
Powered by ViewVC 1.1.2