
Log of /lucene/nutch/trunk/build.xml


Revision 823614 - (view) (annotate) - [select for diffs]
Modified Fri Oct 9 17:02:32 2009 UTC (7 weeks ago) by ab
File length: 24375 byte(s)
Diff to previous 756218 (colored)
NUTCH-758 Set subversion eol-style to "native".

Revision 756218 - (view) (annotate) - [select for diffs]
Modified Thu Mar 19 21:34:47 2009 UTC (8 months, 1 week ago) by siren
File length: 24375 byte(s)
Diff to previous 745448 (colored)
NUTCH-727

Revision 745448 - (view) (annotate) - [select for diffs]
Modified Wed Feb 18 09:18:07 2009 UTC (9 months, 1 week ago) by siren
File length: 24343 byte(s)
Diff to previous 745416 (colored)
NUTCH-687 add RAT, also check plugins

Revision 745416 - (view) (annotate) - [select for diffs]
Modified Wed Feb 18 08:11:46 2009 UTC (9 months, 1 week ago) by siren
File length: 24269 byte(s)
Diff to previous 733738 (colored)
NUTCH-687 add RAT

Revision 733738 - (view) (annotate) - [select for diffs]
Modified Mon Jan 12 13:26:16 2009 UTC (10 months, 2 weeks ago) by dogacan
File length: 23578 byte(s)
Diff to previous 730845 (colored)
NUTCH-442 - Integrate Solr/Nutch

Revision 730845 - (view) (annotate) - [select for diffs]
Modified Fri Jan 2 21:38:58 2009 UTC (10 months, 3 weeks ago) by kubes
File length: 23437 byte(s)
Diff to previous 686912 (colored)
NUTCH-594: Serve Nutch search results in multiple formats including XML and JSON.

Revision 686912 - (view) (annotate) - [select for diffs]
Modified Tue Aug 19 00:49:45 2008 UTC (15 months, 1 week ago) by ab
File length: 23293 byte(s)
Diff to previous 637837 (colored)
NUTCH-642 - Unit tests fail when run in non-local mode.

Revision 637837 - (view) (annotate) - [select for diffs]
Modified Mon Mar 17 11:05:11 2008 UTC (20 months, 1 week ago) by ab
File length: 23235 byte(s)
Diff to previous 620172 (colored)
Don't add Hadoop config files to Nutch job file.

Revision 620172 - (view) (annotate) - [select for diffs]
Modified Sat Feb 9 18:41:19 2008 UTC (21 months, 2 weeks ago) by kubes
File length: 23225 byte(s)
Diff to previous 521933 (colored)
NUTCH-607 - Update build.xml to include tika jar in war when building the war file.

Revision 521933 - (view) (annotate) - [select for diffs]
Modified Fri Mar 23 22:59:01 2007 UTC (2 years, 8 months ago) by ab
File length: 23181 byte(s)
Diff to previous 517015 (colored)
Upgrade to Hadoop 0.12.2 release.

Fix whitespace issues in platform name in bin/hadoop under Cygwin.

Replace deprecated method call.

Revision 517015 - (view) (annotate) - [select for diffs]
Modified Sun Mar 11 21:18:23 2007 UTC (2 years, 8 months ago) by siren
File length: 23037 byte(s)
Diff to previous 516885 (colored)
merging 517012:516728 excluding changes made by dennis



Revision 516885 - (view) (annotate) - [select for diffs]
Modified Sun Mar 11 11:02:27 2007 UTC (2 years, 8 months ago) by siren
File length: 23494 byte(s)
Diff to previous 511159 (colored)
reduce the size of .job from 19+M down to 14+M

Revision 511159 - (view) (annotate) - [select for diffs]
Modified Fri Feb 23 22:57:06 2007 UTC (2 years, 9 months ago) by cutting
File length: 23037 byte(s)
Diff to previous 497859 (colored)
NUTCH-449.  Make junit output format configurable.  Contributed by Nigel.

Revision 497859 - (view) (annotate) - [select for diffs]
Modified Fri Jan 19 16:17:32 2007 UTC (2 years, 10 months ago) by siren
File length: 22955 byte(s)
Diff to previous 495392 (colored)
NUTCH-400

Revision 495392 - (view) (annotate) - [select for diffs]
Modified Thu Jan 11 21:51:20 2007 UTC (2 years, 10 months ago) by ab
File length: 22180 byte(s)
Diff to previous 468672 (colored)
Upgrade to Hadoop 0.10.1. HTTPClient is now a dependency - move it
to lib/ and remove it as a plugin.

Add also native Linux libraries for Hadoop compression, plus corresponding
logic in bin/nutch.

Hadoop uses larger buffers now - explicitly set large heap size for
JUnit tests. All tests should pass now.

Revision 468672 - (view) (annotate) - [select for diffs]
Modified Sat Oct 28 10:31:57 2006 UTC (3 years, 1 month ago) by ab
File length: 22162 byte(s)
Diff to previous 447931 (colored)
Add missing commons-cli jar file when building the webapp WAR.

Revision 447931 - (view) (annotate) - [select for diffs]
Modified Tue Sep 19 19:10:11 2006 UTC (3 years, 2 months ago) by siren
File length: 22118 byte(s)
Diff to previous 416402 (colored)
fix for fetcher testcase

Revision 416402 - (view) (annotate) - [select for diffs]
Modified Thu Jun 22 15:45:47 2006 UTC (3 years, 5 months ago) by jerome
File length: 21952 byte(s)
Diff to previous 413742 (colored)
NUTCH-303 : Unit tests now use the log4j.properties in src/test

Revision 413742 - (view) (annotate) - [select for diffs]
Modified Mon Jun 12 20:51:40 2006 UTC (3 years, 5 months ago) by jerome
File length: 21859 byte(s)
Diff to previous 413053 (colored)
NUTCH-303 : Make use of the Commons Logging API and use log4j as the default implementation

Revision 413053 - (view) (annotate) - [select for diffs]
Modified Fri Jun 9 14:30:18 2006 UTC (3 years, 5 months ago) by jerome
File length: 21803 byte(s)
Diff to previous 406043 (colored)
Add commons logging and log4j in nutch war : they are now required by hadoop

Revision 406043 - (view) (annotate) - [select for diffs]
Modified Sat May 13 08:20:54 2006 UTC (3 years, 6 months ago) by jerome
File length: 21717 byte(s)
Diff to previous 405165 (colored)
NUTCH-240 : Added a javadoc scoring section and added the scoring-opic plugin in javadoc

Revision 405165 - (view) (annotate) - [select for diffs]
Modified Mon May 8 21:04:01 2006 UTC (3 years, 6 months ago) by jerome
File length: 21585 byte(s)
Diff to previous 397312 (colored)
NUTCH-134 : Added a summarizer extension point and two extensions:
* summary-basic is the current nutch implementation moved into a plugin
* summary-lucene is a raw version of a summarizer plugin based on the lucene highlighter

Revision 397312 - (view) (annotate) - [select for diffs]
Modified Wed Apr 26 21:51:14 2006 UTC (3 years, 7 months ago) by jerome
File length: 21387 byte(s)
Diff to previous 397311 (colored)
Added parse-oo to javadoc

Revision 397311 - (view) (annotate) - [select for diffs]
Modified Wed Apr 26 21:43:38 2006 UTC (3 years, 7 months ago) by jerome
File length: 21328 byte(s)
Diff to previous 394231 (colored)
Copy the plugin.dtd directly into javadoc build dir instead of source dir

Revision 394231 - (view) (annotate) - [select for diffs]
Modified Sat Apr 15 00:13:21 2006 UTC (3 years, 7 months ago) by jerome
File length: 21433 byte(s)
Diff to previous 394228 (colored)
NUTCH-245 : Some minor fixes
  - Added Apache License in DTD (?)
  - Delete the org/apache/nutch/plugin/doc-files once javadoc task completed.

Revision 394228 - (view) (annotate) - [select for diffs]
Modified Fri Apr 14 23:57:24 2006 UTC (3 years, 7 months ago) by jerome
File length: 21323 byte(s)
Diff to previous 392377 (colored)
NUTCH-245 : Added a DTD for Nutch Plugin Manifest
  - Add a commented DTD in src
  - Add the DTD in javadoc
  - Change the implementation element structure : uses name-value parameters instead of proprietary attributes
  - Fix unit tests regarding changes in DTD
  - Fix the plugin.xml file in nutch plugins regarding changes in DTD

Revision 392377 - (view) (annotate) - [select for diffs]
Modified Fri Apr 7 20:13:33 2006 UTC (3 years, 7 months ago) by pkosiorowski
File length: 21148 byte(s)
Diff to previous 390277 (colored)
PMD checks added

Revision 390277 - (view) (annotate) - [select for diffs]
Modified Thu Mar 30 23:11:55 2006 UTC (3 years, 7 months ago) by jerome
File length: 19526 byte(s)
Diff to previous 390254 (colored)
Add query-basic plugin to javadoc

Revision 390254 - (view) (annotate) - [select for diffs]
Modified Thu Mar 30 22:06:18 2006 UTC (3 years, 7 months ago) by jerome
File length: 19464 byte(s)
Diff to previous 389456 (colored)
Add some plugins groups in the javadoc

Revision 389456 - (view) (annotate) - [select for diffs]
Modified Tue Mar 28 09:40:29 2006 UTC (3 years, 8 months ago) by jerome
File length: 18784 byte(s)
Diff to previous 387655 (colored)
NUTCH-210, forgot to commit the nutch.xml xsl generation

Revision 387655 - (view) (annotate) - [select for diffs]
Modified Tue Mar 21 22:35:20 2006 UTC (3 years, 8 months ago) by jerome
File length: 18490 byte(s)
Diff to previous 382948 (colored)
Add lib-regex-filter and urlfilter-automaton to the list of javadoc packages.
Add lib-regex-filter and urlfilter-automaton to the list of deployed, tested and cleaned plugins.
Add the regular expression rule file property for urlfilter-automaton.

Revision 382948 - (view) (annotate) - [select for diffs]
Modified Fri Mar 3 22:33:29 2006 UTC (3 years, 8 months ago) by jerome
File length: 18355 byte(s)
Diff to previous 380789 (colored)
Add a microformats rel-tag parser/indexer/searcher plugin (a la technorati)

Revision 380789 - (view) (annotate) - [select for diffs]
Modified Fri Feb 24 19:11:44 2006 UTC (3 years, 9 months ago) by cutting
File length: 18286 byte(s)
Diff to previous 379492 (colored)
Fix to not use 'exec', but rather 'untar' and 'chmod' which are more portable.

Revision 379492 - (view) (annotate) - [select for diffs]
Modified Tue Feb 21 15:38:31 2006 UTC (3 years, 9 months ago) by jerome
File length: 18271 byte(s)
Diff to previous 377808 (colored)
Add Italian (it) locale (Adriano Palombo)

Revision 377808 - (view) (annotate) - [select for diffs]
Modified Tue Feb 14 19:31:41 2006 UTC (3 years, 9 months ago) by siren
File length: 18172 byte(s)
Diff to previous 377501 (colored)
NUTCH-188, translation for Serbian (sr, Cyrillic) and Serbo-Croatian (sh, Latin) languages.

Revision 377501 - (view) (annotate) - [select for diffs]
Modified Mon Feb 13 21:43:15 2006 UTC (3 years, 9 months ago) by jerome
File length: 17974 byte(s)
Diff to previous 376803 (colored)
Javadoc updates for ms parsers

Revision 376803 - (view) (annotate) - [select for diffs]
Modified Fri Feb 10 19:22:15 2006 UTC (3 years, 9 months ago) by cutting
File length: 17913 byte(s)
Diff to previous 376768 (colored)
Unpack Hadoop webapps from jar so that they can be used.

Revision 376768 - (view) (annotate) - [select for diffs]
Modified Fri Feb 10 17:08:23 2006 UTC (3 years, 9 months ago) by jerome
File length: 17565 byte(s)
Diff to previous 376485 (colored)
NUTCH-52, Add a parser plugin for MS Excel files

Revision 376485 - (view) (annotate) - [select for diffs]
Modified Thu Feb 9 23:20:28 2006 UTC (3 years, 9 months ago) by cutting
File length: 17502 byte(s)
Diff to previous 376089 (colored)
Fix for NUTCH-209.  Nutch now supplies all code to remote MapReduce daemons through a job jar file.  So Hadoop daemons no longer need to be restarted when Nutch code changes.

Revision 376089 - (view) (annotate) - [select for diffs]
Modified Wed Feb 8 21:48:52 2006 UTC (3 years, 9 months ago) by jerome
File length: 16653 byte(s)
Diff to previous 376012 (colored)
NUTCH-139
 * Add standard metadata names
 * Syntax tolerant metadata names container
 * Review usage of metadata among plugins

Revision 376012 - (view) (annotate) - [select for diffs]
Modified Wed Feb 8 18:03:01 2006 UTC (3 years, 9 months ago) by jerome
File length: 16608 byte(s)
Diff to previous 375982 (colored)
Add/Move some plugins javadoc to the Plugins group

Revision 375982 - (view) (annotate) - [select for diffs]
Modified Wed Feb 8 15:22:37 2006 UTC (3 years, 9 months ago) by jerome
File length: 16127 byte(s)
Diff to previous 375965 (colored)
Add link to Hadoop javadoc and proxy support in javadoc

Revision 375965 - (view) (annotate) - [select for diffs]
Modified Wed Feb 8 13:58:08 2006 UTC (3 years, 9 months ago) by jerome
File length: 15990 byte(s)
Diff to previous 375414 (colored)
Fix some javadoc issues with lib-http plugin

Revision 375414 - (view) (annotate) - [select for diffs]
Modified Mon Feb 6 23:36:01 2006 UTC (3 years, 9 months ago) by cutting
File length: 15932 byte(s)
Diff to previous 374872 (colored)
Extract Hadoop's scripts from Hadoop's jar into bin/ directory.

Revision 374872 - (view) (annotate) - [select for diffs]
Modified Sat Feb 4 14:28:46 2006 UTC (3 years, 9 months ago) by siren
File length: 15548 byte(s)
Diff to previous 374799 (colored)
add hadoop jar to war

Revision 374799 - (view) (annotate) - [select for diffs]
Modified Sat Feb 4 00:55:20 2006 UTC (3 years, 9 months ago) by cutting
File length: 15515 byte(s)
Diff to previous 370281 (colored)
Remove vestiges of mapred's webapp.

Revision 370281 - (view) (annotate) - [select for diffs]
Modified Wed Jan 18 22:03:28 2006 UTC (3 years, 10 months ago) by cutting
File length: 15752 byte(s)
Diff to previous 365345 (colored)
Fix NUTCH-102: include webapps in packaged releases.

Revision 365345 - (view) (annotate) - [select for diffs]
Modified Mon Jan 2 13:26:19 2006 UTC (3 years, 10 months ago) by ab
File length: 15660 byte(s)
Diff to previous 357497 (colored)
Not needed anymore.

Revision 357497 - (view) (annotate) - [select for diffs]
Modified Sun Dec 18 19:47:00 2005 UTC (3 years, 11 months ago) by pkosiorowski
File length: 15799 byte(s)
Diff to previous 357197 (colored)
Fixed distribution build problem with missing empty plugin dirs.

Revision 357197 - (view) (annotate) - [select for diffs]
Modified Fri Dec 16 17:51:05 2005 UTC (3 years, 11 months ago) by cutting
File length: 15800 byte(s)
Diff to previous 233147 (colored)
Merge mapred branch to trunk & remove it.

Revision 233147 - (view) (annotate) - [select for diffs]
Modified Wed Aug 17 10:00:37 2005 UTC (4 years, 3 months ago) by pkosiorowski
File length: 15568 byte(s)
Diff to previous 219476 (colored)
Setting executable bit for shell scripts.

Revision 219476 - (view) (annotate) - [select for diffs]
Modified Mon Jul 18 12:12:28 2005 UTC (4 years, 4 months ago) by pkosiorowski
File length: 15400 byte(s)
Diff to previous 189627 (colored)
parse-mp3 and parse-rtf plugins excluded from JavaDoc build

Revision 189627 - (view) (annotate) - [select for diffs]
Modified Wed Jun 8 20:07:54 2005 UTC (4 years, 5 months ago) by ab
File length: 15284 byte(s)
Diff to previous 180146 (colored)
Add local resources for XSLT tasks. Now the build of web pages can be
completed when offline, and much faster at that.

Patch submitted by Piotr Kosiorowski.

Revision 180146 - (view) (annotate) - [select for diffs]
Modified Sun Jun 5 20:30:05 2005 UTC (4 years, 5 months ago) by ab
File length: 14955 byte(s)
Diff to previous 179640 (colored)
This changes the build process to minimize dependency on Unix/Cygwin
utilities, and on availability of symbolic links.

Patch submitted by Dawid Weiss.

Revision 179640 - (view) (annotate) - [select for diffs]
Modified Thu Jun 2 20:37:21 2005 UTC (4 years, 5 months ago) by cutting
File length: 14980 byte(s)
Diff to previous 179436 (colored)
Moving Nutch from the Incubator to Lucene.

Revision 179436 - (view) (annotate) - [select for diffs]
Modified Wed Jun 1 22:20:01 2005 UTC (4 years, 5 months ago) by ab
Original Path: incubator/nutch/trunk/build.xml
File length: 14980 byte(s)
Diff to previous 161630 (colored)
This patchset contains improvements to Fetcher, described in NUTCH-54,
specifically the following:

* protocol- and content-based redirection handling in Fetcher.

* parse-js: heuristic link extractor for JavaScript

* protocol-httpclient: HTTP and HTTPS protocol handler, based on
Jakarta Commons HttpClient library.

* alternative HTML parser based on TagSoup.

* improved status reporting for protocol and parse plugins. Status
information is persisted in segment data, so that other plugins can
use it.

* and other assorted fixes...

This work has been sponsored by EvaluMetrix LLC (http://www.evalumetrix.com).
Thank you!


Revision 161630 - (view) (annotate) - [select for diffs]
Modified Sun Apr 17 06:51:28 2005 UTC (4 years, 7 months ago) by johnx
Original Path: incubator/nutch/trunk/build.xml
File length: 14850 byte(s)
Diff to previous 159320 (colored)
Close Issue #33 - MIME content type detector (using magic char sequences).

Revision 159320 - (view) (annotate) - [select for diffs]
Modified Mon Mar 28 22:37:21 2005 UTC (4 years, 8 months ago) by cutting
Original Path: incubator/nutch/trunk/build.xml
File length: 14770 byte(s)
Diff to previous 159317 (colored)
Fixed a dependency: war needs plugins built too.

Revision 159317 - (view) (annotate) - [select for diffs]
Modified Mon Mar 28 22:22:34 2005 UTC (4 years, 8 months ago) by cutting
Original Path: incubator/nutch/trunk/build.xml
File length: 14766 byte(s)
Diff to previous 157479 (colored)
Fix so that build works when not connected to basedir.

Revision 157479 - (view) (annotate) - [select for diffs]
Modified Mon Mar 14 22:34:37 2005 UTC (4 years, 8 months ago) by cutting
Original Path: incubator/nutch/trunk/build.xml
File length: 14767 byte(s)
Diff to previous 155829 (colored)
Add a target for nightly build.

Revision 155829 - (view) (annotate) - [select for diffs]
Added Tue Mar 1 22:04:46 2005 UTC (4 years, 8 months ago) by cutting
Original Path: incubator/nutch/trunk/build.xml
File length: 14709 byte(s)
Initial import of Nutch to Apache.
