----------------
Apache Tika 1.18
----------------
~~ Licensed to the Apache Software Foundation (ASF) under one or more
~~ contributor license agreements. See the NOTICE file distributed with
~~ this work for additional information regarding copyright ownership.
~~ The ASF licenses this file to You under the Apache License, Version 2.0
~~ (the "License"); you may not use this file except in compliance with
~~ the License. You may obtain a copy of the License at
~~
~~ https://www.apache.org/licenses/LICENSE-2.0
~~
~~ Unless required by applicable law or agreed to in writing, software
~~ distributed under the License is distributed on an "AS IS" BASIS,
~~ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
~~ See the License for the specific language governing permissions and
~~ limitations under the License.
Apache Tika 1.20
The most notable changes in Tika 1.20 over the previous release are:
* Upgrade to Apache POI 4.0.1 ({{{https://issues.apache.org/jira/browse/TIKA-2751}TIKA-2751}}).
* Integrate/parameterize new angles handling in
PDFBox ({{{https://issues.apache.org/jira/browse/TIKA-2779}TIKA-2779}}).
* Upgrade to PDFBox 2.0.13 ({{{https://issues.apache.org/jira/browse/TIKA-2788}TIKA-2788}}).
* Prevent content within and elements
to be written in the ToTextContentHandler ({{{https://issues.apache.org/jira/browse/TIKA-2550}TIKA-2550}}).
* Switch child to parent communication to a shared memory-mappedfile in tika-server's -spawnChild mode.
* Fix bug in tika-server when run in legacy mode (not -spawnChild)
that caused it to return 503 on documents submitted after
it hit an OutOfMemoryError ({{{https://issues.apache.org/jira/browse/TIKA-2776}TIKA-2776}}).
* Upgrade jaxb-runtime and javax.activation ({{{https://issues.apache.org/jira/browse/TIKA-2778}TIKA-2778}}).
* tika-app in batch mode now requires an interrupt or
kill signal to the parent process to stop the parent
and the child processes ({{{https://issues.apache.org/jira/browse/TIKA-2780}TIKA-2780}}).
* Bulk upgrade of dependencies ({{{https://issues.apache.org/jira/browse/TIKA-2775}TIKA-2775}}).
* Improve language id efficiency in tika-eval ({{{https://issues.apache.org/jira/browse/TIKA-2777}TIKA-2777}}).
* 25.2 ({{{https://issues.apache.org/jira/browse/TIKA-2773}TIKA-2773}}).
* Remove duplication of notes in PPT slides ({{{https://issues.apache.org/jira/browse/TIKA-2735}TIKA-2735}})
* Use -javaHome or $JAVA_HOME (if they exist) when
spawning child in tika-server's -spawnChild mode.
The following people have contributed to Tika 1.20 by submitting or
commenting on the issues resolved in this release:
* Boris Petrov
* Dave Meikle
* feng ye
* Hans Brende
* Jeroen
* Julien Massiera
* Kristen Cheung
* Lewis John McGibbney
* Mario Bisonti
* Markus Jelsma
* Nick Sincaglia
* Ronan O'Sullivan
* Tim Allison
See {{https://s.apache.org/fScy}} for more details on these contributions.