=== Lucene Status Report: Sept, 2009 ===

TLP

- The PMC added Mahout committers David Hall, Deneche Abdelhakim, Robin Anil
- The PMC added Lucene Java committer Robert Muir

LUCENE JAVA

Lucene Java is a search-engine toolkit. Development has been active and we are in the final stages of releasing Lucene 2.9.

SOLR

Solr is a full text search server. Development and the community are active. Solr is in the final push towards the release of 1.4.

NUTCH

Nutch is a web-search engine: crawler, indexer and search runtime. Development of the 1.0 line has stalled while a major redesign is under discussion to address various shortcomings of the current platform and to avoid duplication of effort with other Apache projects. Two milestone prototypes (OSGi-based and HBase-based) have been created, which will be further examined and probably merged to form the new architecture.

LUCY

Lucy is a loose C port of Lucene targeted at dynamic language bindings. The pace of development has picked up this year, and a significant milestone was achieved with the completion of the core object model. We are working aggressively towards an alpha release.

LUCENE.NET (incubating)

Lucene.NET is a .NET based port of Lucene Java. Development and the community are active. The incubating project needs to look towards graduation soon.

MAHOUT

Apache Mahout is working towards building a suite of scalable machine learning libraries for text and data mining. Development is active and we are working towards a 0.2 release.

PyLucene

PyLucene is a Python integration of Lucene Java. Development is active. Since it is a port of Lucene Java, a release would be expected after Lucene 2.9 is released.

TIKA

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries. Tika 0.4 was released in July. Development continues at the steady pace of a few commits per week. User list activity seems to be increasing.