=== Lucene Status Report: 25th of June, 2008 ===

TLP

The TLP has added two new PMC members: Mike Klaas and Ryan McKinley. The PMC also voted to mark Christoph Goller as emeritus due to lack of participation for over a year and failure to respond to inquiries.

CRYPTOGRAPHY

Nutch uses PDFBox and thus has a dependency on BouncyCastle. https://issues.apache.org/jira/browse/NUTCH-621 has been opened and is actively being worked on.

LUCENE JAVA

Lucene Java is a search-engine toolkit. Development has been active and there have been many core improvements, especially in the areas of indexing performance and error recovery. Version 2.3.2 was released on 2008-05-06.

SOLR

Solr is a full-text search server. We continue to see strong adoption and community interest. Development has been active, with many new core features being added. Koji Sekiguchi was added as a committer. Solr 1.3 is expected to be released in the next quarter.

NUTCH

Nutch is a web-search engine: crawler, indexer and search runtime. Otis Gospodnetic was added as a Nutch committer. Development of the current code base was limited, and the planned 1.0 release has been moved to Q3 2008. The Nutch community has started discussions about a fundamental redesign of the platform for the next releases.

LUCY

Lucy will develop a shared C-based core for ports of Lucene to other languages, such as Perl, Python and Ruby. No progress has been made this quarter, but there has been some recent email activity.

LUCENE.NET (incubating)

Michael Garski, of MySpace, has a couple of developers to bring on board. Currently there is no active committership, though there is a small community of users present on the mailing list. We're still optimistic that this project will see new life soon.

TIKA (incubating)

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries. Tika entered incubation on March 22nd, 2007.
Niall Pemberton joined the project as a committer and PPMC member. The community is currently working towards the 0.2 release.

MAHOUT

Apache Mahout is a new subproject of the Lucene PMC with the goal of building a suite of scalable machine learning libraries for text and data mining. We have received the Taste Collaborative Filtering engine into Mahout via a software grant from Sean Owen. We have also implemented several machine learning algorithms to date. Sean Owen and Ted Dunning were added as committers.