=== Lucene Status Report: 19th of March, 2008 ===

TLP

The top-level project voted to promote Hadoop to it's own TLP,
hadoop.apache.org.  Nigel Daley, Owen O'Malley and Tom White resigned
their positions on the Lucene PMC to focus on the Hadoop PMC.  The TLP
also voted to create a new project, named Apache Mahout, to build
scalable machine learning libraries.   Doug Cutting has resigned as
Chair of the Lucene PMC.  Grant Ingersoll has been elected the new
chair. 

CRYPTOGRAPHY

Nutch uses PDFBox and thus has a dependency on BouncyCastle.  
We have not begun the process specified at
http://www.apache.org/dev/crypto.html, but will do so
in the near term.

LUCENE JAVA

Lucene Java is a search-engine toolkit.  Development has been very
active and there have been many core improvements, especially in the
area of indexing performance and error recovery.  Version 2.3.0 was
released on 2008-01-24, and a minor bug fix release (2.3.1) was made
on 2008-02-23. 

SOLR

Solr is a full text search server.  We continue to see strong adoption
and community interest.  Development has been active with many new
core features being added.  Grant Ingersoll was added as a committer.

NUTCH

Nutch is a web-search engine: crawler, indexer and search
runtime. Development is active.  Recent work has concentrated on
stability and bug-fixing in preparation for the upcoming 1.0 release,
due around April. 

LUCY

Lucy will develop a shared C-based core for ports of Lucene to other
languages, such as Perl, Python and Ruby.  No progress has been made
this quarter. 

LUCENE.NET (incubating)

Lucene.Net is an port of Lucene to C# on the .NET platform.
Lucene.Net struggles with committership.  George Aroush has
effectively stepped down, but other strong contributors have rallied
and are in the process of proposing a few new committers.  

TIKA (incubating)

Tika is a toolkit for detecting and extracting metadata and structured
text content from various documents using existing parser
libraries. Development has been active. 

MAHOUT

Apache Mahout is a new subproject of the Lucene PMC with the goal of
building a suite of scalable machine learning libraries for text and
data mining.  Initial reaction to the project has been positive, with
many people expressing interest and several code contributions already
made.  Initial committers are Grant Ingersoll, Otis Gospodnetic, Erik
Hatcher, Isabel Drost, Ozgur Yilmazel, Niranjan Balasubramanian, Karl
Wettin and Dawid Weiss.  Jeff Eastman was also voted in as a new
committer (after the initial project creation).