Anything To Triples (any23) is a library, a web service and a command line tool that extracts structured data in RDF format from a variety of Web documents. Currently it supports the following input formats:
A detailed description of available extractors is here.
Apache Any23 is used in major Web of Data applications such as sindice.com and sig.ma. It is written in Java and licensed under the Apache License. Apache Any23 can be used in various ways: * As a library in Java applications that consume structured data from the Web. * As a command-line tool for extracting and converting between the supported formats. * As online service API available at any23.org.
You can download the latest release from Apache Mirrors.
Previous versions are available from the download site at Google Code.
Introduction: this page.
Install: how to install Apache Any23 library and service.
Getting Started: start using Apache Any23 command-line tools.
Supported Formats: complete list of Semantic Web formats supported by Apache Any23.
Configuration: learn how to change default library and service configuration.
REST Service: discover how to use the Apache Any23 REST Service.
Plugins: read how to install and configure the Apache Any23 plugins.
Developers: understand the Apache Any23 code internals, how to write plugins, fixing rules and customize the code.