Apache Tika: http://lucene.apache.org/tika/
Apache Lucene: http://lucene.apache.org/