Security

The following is an incomplete list of known and fixed Critical Vulnerabilities and Exposures (CVEs) and other vulnerabilities in Apache Tika or its dependencies. Please help us fill this in with more details.

CVE or Vulnerability Description Reporter Affected Versions
CVE-2018-1339 Infinite loop in ChmParser Tobias Ospelt ?-1.17
CVE-2018-1338 Infinite loop in BPGParser Tobias Ospelt ?-1.17
CVE-2018-1335 Command Execution in tika-server Tim Allison ?-1.17
CVE-2017-12626 Apache POI - Infinite loops in WMF, EMF, MSG and macros; OOMs in DOC, PPT and XLS Tim Allison, Luís Filipe Nassif and Jerome Lacoste ?-1.17
CVE-2018-1324 and COMPRESS-432 Commons Compress - Infinite loop in ZipFile Luís Filipe Nassif and Anton Abashkin ?-1.17
CVE-2018-7489 and TIKA-2634 Jackson - Deserialization vulnerability Richard Cyganiak (notified Tika team) ?-1.17
TIKA-2115 Apache POI - OOM parsing OLE object Thomas Galla ?-1.15
COMPRESS-382 Commons Compress - OOM detecting corrupt LZMA Luís Filipe Nassif ?-1.15
COMPRESS-386 and TIKA-1631 Commons Compress - OOM detecting corrupt x-compress Pavel Micka ?-1.15
TIKA-1866 and TIKA-954 Apache POI - OOM in DOCX and PPTX because of bug in Piccolo parser Rob Tulloh and Shawn Johnson ?-1.13
TIKA-2040 GC-Overload and OOM in CHMParser Luís Filipe Nassif ?-1.13
CVE-2016-6809 jmatio - Deserialization Vulnerability in MATLAB parser Pierre Ernst 1.6-1.13
CVE-2016-4434 XXE Vulnerability in several parsers Arthur Khashaev, Seulgi Kim, Mesut Timur 0.10-1.12
CVE-2015-3271 Remote Access to host files via tika-server Tim Allison 1.9?-1.10
TIKA-788 Infinite Loop in DWG Stas Shaposhnikov ?-1.4?
TIKA-1132 Apache POI - Nearly Infinite Loop in XLS Ryan Krueger ?-1.4
TIKA-1179 Infinite Loop in corrupt MP3 Marius Dumitru Florea ?-1.4
TIKA-866 OOM reading Tika config file Stephan Mühlstrasser ?-1.1

Acronyms and Terms

  • Command Execution -- a malicious client could call anything on tika-server's commandline
  • Deserialization Vulnerability- OWASP's Cheat Sheet. A malicious actor could run arbitrary code on your computer.
  • OOM - Out of Memory Exception - for example, if a parser doesn't do a sanity check, a 4 byte file could cause a parser to allocate 2GB of memory See: TIKA-1631
  • XXE - XML External Entity Processing A malicious client could access data on your system.