Apache Tika

Created: 2007-03-22

The Apache Tika toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries.

The Apache Tika toolkit is an ASFv2 licensed open source tool for extracting information from digital documents. Tika allows search engines, content management systems and other applications that work with various kinds of digital documents to easily detect and extract metadata and content from all major file formats.

Programming language: Java

Releases:

Release                       Date         Version
Apache Tika 1.0               2011-11-07   1.0
Apache Tika 0.10              2011-09-30   0.10
Apache Tika 0.9               2011-02-16   0.9
Apache Tika 0.8               2010-10-12   0.8
Apache Tika 0.7               2010-04-02   0.7
Apache Tika 0.6               2010-01-30   0.6
Apache Tika 0.5               2009-11-10   0.5
Apache Tika 0.4               2009-07-29   0.4
Apache Tika 0.3               2009-03-19   0.3
Apache Tika 0.2               2008-12-09   0.2
Apache Tika 0.1-incubating    2007-12-27   0.1-incubating