Any23 0.4.0 Release Notes * The any23-service module has been separated from the any23-core module, the Ant build system has been dropped. * Added support for HTML metadata (RDFa / Microformats) validation and correction (validator). * Added flag to disable the nesting relationship property enrichment. * Improved coverage of Microformats tests. * Improved documentation. * Various code consolidation. ========================================================================== Any23 0.3.0 Release Notes * Added detection and enrichment of nested microformats. [Issue #61] * Added detection and support of N-Quads as input and output format. [Issue #7] * General Improvements in RDFa extraction. [Issue #12, Issue #14] * Added support of Turtle embedded in HTML script tag. [Issue #62] * Improvement in encoding support. [Issue #43] * Improvement in Core API. [Issue #27] * Improved support for Species Microformat. [Issue #63] * General Code prettification. ========================================================================== Any23 0.2.2 Release Notes * Fixed dependency management on Maven. A second level dependency of Xerces introduced a conflict on the java.xml.transform API causing wrong XSLT transformations within RDFa extractor. ========================================================================== Any23 0.2.1 Release Notes * Major applyFix on Tika configuration management. This applyFix solves the auto detection of the main Semantic Web related formats. ========================================================================== Any23 0.2 Release Notes ============ Introduction ============ This release features a redesigned API and incorporating enhancements and bug fixes that have accumulated since the 0.1 release. Apart from some new or changed dependencies on the underlying libraries, this version comes with an improved unit test coverage and other features like the automatic charset encoding detection and an improved documentation. Maven build system has been introduced. ================================== Summary of major changes since 0.1 ================================== * Redesigned Java API - Input from string, stream, file, or URI - Allow choosing which extractors to use - Report origin of triples (document/extractor) to client processors - Various processors/serializers for extracted triples * Added flexible command-line tool for easy testing * Vastly improved website and documentation * Media type and encoding detection via Apache Tika * Switched RDF library from Jena to Sesame * Added Maven build * Better RDF extraction from Microformats * Extractors now come with an example file to document typical in- and output * Major refactoring * Lots and lots of bugfixes ================= Supported formats ================= * RDF/XML * Notation3 and Turtle * N-Triples * RDFa Various microformats, see http://sindice.com/developers/microformat on Sindice Microformats support. =================== Dependency Upgrade =================== CyberNeko Html parser has been upgraded to 1.9.14. Apache Tika 0.3 has been replaced with 0.6, with the new support for the automatic encoding detection. EOF