Download Apache Tika
Apache Tika 3.0.0 is now available. See the CHANGES-3.0.0.txt file for the list of breaking changes and updates.
- Mirrors for tika-3.0.0-src.zip (source archive, PGP signature, SHA512)
- Mirrors for tika-app-3.0.0.jar (runnable jar, PGP signature, SHA512)
- Mirrors for tika-server-standard-3.0.0.jar (runnable jar, PGP signature, SHA512)
- Mirrors for tika-server-standard-3.0.0.bin.zip (tika-server as a service, PGP signature, SHA512)
- Mirrors for tika-eval-app-3.0.0.jar (runnable jar, PGP signature, SHA512)
- Mirrors for tika-parser-scientific-package-3.0.0.jar (scientific parsers, PGP signature, SHA512)
- Mirrors for tika-parser-nlp-package-3.0.0.jar (NLP parser, PGP signature, SHA512)
- Mirrors for tika-parser-sqlite3-package-3.0.0.jar (sqlite3 parser, PGP signature, SHA512)
The latest 2.x release (2.9.2) is available here:
- Mirrors for tika-2.9.2-src.zip (source archive, PGP signature, SHA512)
- Mirrors for tika-app-2.9.2.jar (runnable jar, PGP signature, SHA512)
- Mirrors for tika-server-standard-2.9.2.jar (runnable jar, PGP signature, SHA512)
- Mirrors for tika-server-standard-2.9.2.bin.zip (tika-server as a service, PGP signature, SHA512)
- Mirrors for tika-eval-app-2.9.2.jar (runnable jar, PGP signature, SHA512)
- Mirrors for tika-parser-scientific-package-2.9.2.jar (scientific parsers, PGP signature, SHA512)
- Mirrors for tika-parser-nlp-package-2.9.2.jar (NLP parser, PGP signature, SHA512)
- Mirrors for tika-parser-sqlite3-package-2.9.2.jar (sqlite3 parser, PGP signature, SHA512)
Apache Tika releases are available under the Apache License, Version 2.0. See the NOTICE.txt file contained in each release artifact for applicable copyright attribution notices.
If you are looking for previous releases of Apache Tika, have a look in the archives.
If you are looking for releases of Apache Tika from the Apache Lucene project (pre-0.8 releases), have a look in the lucene archives. If you are looking for releases of Apache Tika from the Apache Incubator (pre-0.2 releases), have a look in the incubator archives.
Export control
Apache Tika includes cryptographic software. The country in which you currently reside may have restrictions on the import, possession, use, and/or re-export to another country, of encryption software. BEFORE using any encryption software, please check your country's laws, regulations and policies concerning the import, possession, or use, and re-export of encryption software, to see if this is permitted. See <http://www.wassenaar.org/> for more information.
The U.S. Government Department of Commerce, Bureau of Industry and Security (BIS), has classified this software as Export Commodity Control Number (ECCN) 5D002.C.1, which includes information security software using or performing cryptographic functions with asymmetric algorithms. The form and manner of this Apache Software Foundation distribution makes it eligible for export under the License Exception ENC Technology Software Unrestricted (TSU) exception (see the BIS Export Administration Regulations, Section 740.13) for both object code and source code.
The following provides more details on the included cryptographic software:
- Apache Tika uses the Bouncy Castle generic encryption libraries for extracting text content and metadata from encrypted PDF files. See https://www.bouncycastle.org/ for more details on Bouncy Castle.
Verify
It is essential that you verify the integrity of the downloaded files using the PGP signatures. Please read How to Verify Downloaded Files for more information on how and why you should verify our releases.
The PGP signatures can be verified using PGP or GPG. First download the KEYS file as well as the .asc signature files for the relevant release packages. Make sure you get these files from the main distribution directory, rather than from a mirror. Then verify the signatures using one of the following:
% pgpk -a KEYS
% pgpv apache-tika-X.Y.Z-src.zip.asc
or
% pgp -ka KEYS
% pgp apache-tika-X.Y.Z-src.zip.asc
or
% gpg --import KEYS
% gpg --verify apache-tika-X.Y.Z-src.zip.asc apache-tika-X.Y.Z-src.zip
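Each download listed above also comes with a SHA512 checksum. As a rough sketch, a checksum could be compared with a small shell function like the one below. The function name verify_sha512 is hypothetical and not part of any Tika tooling. The sketch assumes that GNU sha512sum is installed and that the first whitespace-separated field on the first line of the .sha512 file is the complete hex digest; adjust it if the checksum file uses a different layout. A matching checksum only shows that the download was not corrupted; it does not replace the PGP signature check.

```shell
#!/bin/sh
# Sketch: compare a file's SHA-512 digest with the digest stored in a checksum file.
# Assumption: the first field on the first line of the checksum file is the hex digest.
# verify_sha512 is a hypothetical helper name used only for illustration.
verify_sha512() {
    vs_file="$1"
    vs_checksum_file="$2"

    # Read the expected digest and normalize it to lowercase.
    vs_expected=$(awk 'NR==1 {print tolower($1)}' "$vs_checksum_file")

    # Compute the actual digest of the downloaded file.
    vs_actual=$(sha512sum "$vs_file" | awk '{print tolower($1)}')

    # An empty expected digest is treated as a failure, never as a match.
    if [ -n "$vs_expected" ] && [ "$vs_expected" = "$vs_actual" ]; then
        echo "OK: $vs_file"
        return 0
    fi

    echo "MISMATCH: $vs_file" >&2
    return 1
}
```

To use it, pass the downloaded release package as the first argument and its .sha512 file as the second. The function returns a non-zero status on a mismatch, so it can gate later steps in a script.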