Apache Any23 0.7.1-SNAPSHOT API

Packages 
Package Description
org.apache.any23
The Any23 root package defines the Any23 library facade and auxiliary classes.
org.apache.any23.cli
This package contains some command-line utilities which allow users to use the main Any23 features via commandline shell.
org.apache.any23.configuration
This packages contains the Any23 Configuration definition.
org.apache.any23.encoding
This package defines utility classes for the Encoding Detection.
org.apache.any23.extractor
This package contains classes and interfaces modeling the Extractor API.
org.apache.any23.extractor.csv
This package contains the definition of a specific Extractor for CSV files as specified on RFC 4180.
org.apache.any23.extractor.html
All the various Extractor needed to distill RDF from Microformats in HTML pages are contained in this package.
org.apache.any23.extractor.html.annotations
This package contains the annotations needed to describe the single nesting relations among different Microformats.
org.apache.any23.extractor.microdata
This package provides classes for dealing with Microdata extraction.
org.apache.any23.extractor.rdf
This package contains the concrete implementations (and utilities classes) to perform the extraction from various RDF serialization formats.
org.apache.any23.extractor.rdfa
This package contains the definition of a specific Extractor for RDFa.
org.apache.any23.extractor.xpath
This package provides an XPath extractor able to generate RDF statements from any HTML page on the basis of a set of specified rules and a list of statement templates.
org.apache.any23.filter
This package defines specific TripleHandlers able to filter document and triples that don't respect particular conditions.
org.apache.any23.http
This package contains classes and interfaces to implement a small multi-threaded HTTP Client used to fetch data sources.
org.apache.any23.io.nquads
This package contains an NQuads parser and writer based on the Sesame API.
org.apache.any23.mime
This package provides support for MIME Type data detection.
org.apache.any23.mime.purifier
This package contains all the logic to implement your own Purifier.
org.apache.any23.plugin
This package provides Plugin Management support for Any23.
org.apache.any23.plugin.crawler
Commandline extension of Rover able to extract semantic content from an entire site.
org.apache.any23.plugin.htmlscraper
The HTMLScraperExtractor is a special extractor to scrape textual content from a generic HTML pages.
org.apache.any23.plugin.officescraper
This package defines a set of Extractors able to extract metadata from various Microsoft Office documents.
org.apache.any23.rdf
This package provides some utility classes to manage RDF models and prefixes.
org.apache.any23.servlet
The package defines a servlet exposing Any23 as a REST Service.
org.apache.any23.servlet.conneg
This package provides the Any23 content type negotiator.
org.apache.any23.source
Package containing some auxiliary classes to handle different types of InputStream sources.
org.apache.any23.util
This package collects a set of internal library utility classes.
org.apache.any23.validator
This package defines a simple rule based validation framework able to detect (Rule) and fix (Fix) common mistakes that can be present within an HTML DOM model and that can prevent the extraction of metadata.
org.apache.any23.validator.rule
A set of predefined validation Rules and associated Fixes.
org.apache.any23.vocab
This package defines the main ontologies used by Any23.
org.apache.any23.writer
This package collects a set of TripleHandler decorators and specific RDF format writers.

Copyright © 2010-2012 The Apache Software Foundation. All Rights Reserved.