=== Lucene Status Report: 19th of March, 2008 === TLP The top-level project voted to promote Hadoop to it's own TLP, hadoop.apache.org. Nigel Daley, Owen O'Malley and Tom White resigned their positions on the Lucene PMC to focus on the Hadoop PMC. The TLP also voted to create a new project, named Apache Mahout, to build scalable machine learning libraries. Doug Cutting has resigned as Chair of the Lucene PMC. Grant Ingersoll has been elected the new chair. CRYPTOGRAPHY Nutch uses PDFBox and thus has a dependency on BouncyCastle. We have not begun the process specified at http://www.apache.org/dev/crypto.html, but will do so in the near term. LUCENE JAVA Lucene Java is a search-engine toolkit. Development has been very active and there have been many core improvements, especially in the area of indexing performance and error recovery. Version 2.3.0 was released on 2008-01-24, and a minor bug fix release (2.3.1) was made on 2008-02-23. SOLR Solr is a full text search server. We continue to see strong adoption and community interest. Development has been active with many new core features being added. Grant Ingersoll was added as a committer. NUTCH Nutch is a web-search engine: crawler, indexer and search runtime. Development is active. Recent work has concentrated on stability and bug-fixing in preparation for the upcoming 1.0 release, due around April. LUCY Lucy will develop a shared C-based core for ports of Lucene to other languages, such as Perl, Python and Ruby. No progress has been made this quarter. LUCENE.NET (incubating) Lucene.Net is an port of Lucene to C# on the .NET platform. Lucene.Net struggles with committership. George Aroush has effectively stepped down, but other strong contributors have rallied and are in the process of proposing a few new committers. TIKA (incubating) Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries. Development has been active. MAHOUT Apache Mahout is a new subproject of the Lucene PMC with the goal of building a suite of scalable machine learning libraries for text and data mining. Initial reaction to the project has been positive, with many people expressing interest and several code contributions already made. Initial committers are Grant Ingersoll, Otis Gospodnetic, Erik Hatcher, Isabel Drost, Ozgur Yilmazel, Niranjan Balasubramanian, Karl Wettin and Dawid Weiss. Jeff Eastman was also voted in as a new committer (after the initial project creation).