                     ---------------
                     Apache Tika 0.8
                     ---------------

~~ Licensed to the Apache Software Foundation (ASF) under one or more
~~ contributor license agreements.  See the NOTICE file distributed with
~~ this work for additional information regarding copyright ownership.
~~ The ASF licenses this file to You under the Apache License, Version 2.0
~~ (the "License"); you may not use this file except in compliance with
~~ the License.  You may obtain a copy of the License at
~~
~~     http://www.apache.org/licenses/LICENSE-2.0
~~
~~ Unless required by applicable law or agreed to in writing, software
~~ distributed under the License is distributed on an "AS IS" BASIS,
~~ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
~~ See the License for the specific language governing permissions and
~~ limitations under the License.

Apache Tika 0.8

 The most notable changes in Tika 0.8 over the previous release are:

   * Language identification is now dynamically configurable,
     managed via a config file loaded from the classpath.
     ({{{http://issues.apache.org/jira/browse/TIKA-490}TIKA-490}})

   * Tika now supports parsing feeds by wrapping the underlying Rome library.
     ({{{http://issues.apache.org/jira/browse/TIKA-466}TIKA-466}})

   * A quick-start guide for Tika parsing was contributed.
     ({{{http://issues.apache.org/jira/browse/TIKA-464}TIKA-464}})

   * An approach for plumbing through XHTML attributes was added.
     ({{{http://issues.apache.org/jira/browse/TIKA-379}TIKA-379}})

   * Media type hierarchy information is now taken into account when
     selecting the best parser for a given input document.
     ({{{http://issues.apache.org/jira/browse/TIKA-298}TIKA-298}})

   * Support for parsing common scientific data formats, including netCDF
     and HDF4/5, was added.
     ({{{http://issues.apache.org/jira/browse/TIKA-400}TIKA-400}} and
     {{{http://issues.apache.org/jira/browse/TIKA-399}TIKA-399}})

   * Unit tests for Windows have been fixed, allowing TestParsers to complete.
     ({{{http://issues.apache.org/jira/browse/TIKA-398}TIKA-398}})

 The following people have contributed to Tika 0.8 by submitting or commenting
 on the issues resolved in this release:

   * Łukasz Wiktor

   * Adam Wilmer

   * Alex Baranau

   * Alex Ott

   * André Ricardo

   * Andrey Barhatov

   * Andrey Sidorenko

   * Antoni Mylka

   * Arturo Beltran

   * Attila Király

   * Brad Greenlee

   * Bruno Dumon

   * Chris A. Mattmann

   * Chris Bamford

   * Christophe Gourmelon

   * Dave Meikle

   * David Weekly

   * Dmitry Kuzmenko

   * Erik Hetzner

   * Geoff Jarrad

   * Gerd Bremer

   * Grant Ingersoll

   * Jan Høydahl

   * Jean-Philippe Ricard

   * Jeremias Maerki

   * Joao Garcia

   * Jukka Zitting

   * Julien Nioche

   * Ken Krugler

   * Liam O'Boyle

   * Mads Hansen

   * Marcel May

   * Markus Goldbach

   * Martijn van Groningen

   * Maxim Valyanskiy

   * Mike Hays

   * Miroslav Pokorny

   * Nick Burch

   * Otis Gospodnetic

   * Peter van Raamsdonk

   * Peter Wolanin

   * Peter_Lenahan@ibi.com

   * Piotr Bartosiewicz

   * Radek

   * Rajiv Kumar

   * Reinhard Schwab

   * rick cameron

   * Robert Muir

   * Sanjeev Rao

   * Simon Tyler

   * Sjoerd Smeets

   * Slavomir Varchula

   * Staffan Olsson

   * Tom De Leu

   * Uwe Schindler

   * Victor Kazakov

 See {{http://s.apache.org/ab0}} for more details on these contributions.