---------------- Apache Tika 1.12 ---------------- ~~ Licensed to the Apache Software Foundation (ASF) under one or more ~~ contributor license agreements. See the NOTICE file distributed with ~~ this work for additional information regarding copyright ownership. ~~ The ASF licenses this file to You under the Apache License, Version 2.0 ~~ (the "License"); you may not use this file except in compliance with ~~ the License. You may obtain a copy of the License at ~~ ~~ http://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, ~~ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. ~~ See the License for the specific language governing permissions and ~~ limitations under the License. Apache Tika 1.12 The most notable changes in Tika 1.12 over the previous release are: * Support for iFrames and element link extraction is provided inthe link Content Handler ({{{http://issues.apache.org/jira/browse/TIKA-1835}TIKA-1835}}). * Slide notes are now linked to the slide XHTML in the PPT output({{{http://issues.apache.org/jira/browse/TIKA-1840}TIKA-1840}}). * JSON tests in Tika server were updated to remove impossible casts({{{http://github.com/apache/tika/pull/73}Github-73}}). * Fix bug in GeoTopicParser where NER is reused instead of instantiatedwith each request ({{{http://issues.apache.org/jira/browse/TIKA-1834}TIKA-1834}}). * 5.1 && Downgrade Rome dependency to 0.9 to avoidnasty NPE ({{{http://issues.apache.org/jira/browse/TIKA-1820}TIKA-1820}}, {{{http://issues.apache.org/jira/browse/TIKA-1516}TIKA-1516}}) * The NamedEntityParser was enhanced to generate text contentin addition to metadata ({{{http://issues.apache.org/jira/browse/TIKA-1815}TIKA-1815}}, {{{http://issues.apache.org/jira/browse/TIKA-1816}TIKA-1816}}). * A significant speed-up is made to the GeoTopicParser byusing the new REST server capabilities from Lucene GeoGazetteer ({{{http://issues.apache.org/jira/browse/TIKA-1803}TIKA-1803}}). * A parser to compute motion properties in Videos, e.g.,Histogram of Oriented Gradients and Histogram of Optical Flowsusing the Pooled Time Series algorithm, was added ({{{http://issues.apache.org/jira/browse/TIKA-1798}TIKA-1798}}). * Provide NamedEntityParser which exposes Named Entity Recognitionfrom OpenNLP and Stanford NER providers ({{{http://issues.apache.org/jira/browse/TIKA-1787}TIKA-1787}}, {{{http://github.com/apache/tika/pull/61}Github-61}},{{{http://github.com/apache/tika/pull/62}Github-62}}). * Allow XHTMLContentHandler to pass attributes of html elementvia Markus Jelsma ({{{http://issues.apache.org/jira/browse/TIKA-1782}TIKA-1782}}). * Fix regression with spacing in PPT via Andreas Beeker ({{{http://issues.apache.org/jira/browse/TIKA-1777}TIKA-1777}}). * Tika Facade parse methods for Path and File added which take aMetadata object, to mirror the existing InputStream one ({{{http://github.com/apache/tika/pull/60}Github-60}}) The following people have contributed to Tika 1.12 by submitting or commenting on the issues resolved in this release: * Bob Paulin * Chris A. Mattmann * Ken Krugler * Lewis John McGibbney * Madhav Sharan * Markus Jelsma * Nick Burch * Roberto Benedetti * Thamme Gowda N * Tim Allison * Vjeran Marcinko * Yueheng He See {{https://s.apache.org/wDlx}} for more details on these contributions.