A B C D E F G H I J K L M N O P Q R S T U V W X Y 

A

abbreviate(String) - Method in class org.apache.any23.rdf.Prefixes
 
ABOUT_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
AboutNotURIRule - Class in org.apache.any23.validator.rule
This rule detects whether an about value is a valid absolute URL or, failing that, a valid relative URL.
AboutNotURIRule() - Constructor for class org.apache.any23.validator.rule.AboutNotURIRule
 
AcceptHeaderBuilder - Class in org.apache.any23.http
Concatenates a collection of MIME specs in "type/subtype;q=x.x" notation into an HTTP Accept header value, and removes duplicates and types covered by wildcards.
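
The behaviour described for AcceptHeaderBuilder can be illustrated with a minimal standalone sketch. It is not the Any23 implementation: the class AcceptHeaderSketch, its method names, and the exact coverage rule (an entry is dropped when a wildcard of equal or higher q value matches it) are assumptions made for illustration, and specs are assumed to be well-formed "type/subtype" strings.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch, not the Any23 AcceptHeaderBuilder class.
final class AcceptHeaderSketch {

    // Builds an Accept header value from specs such as "text/html;q=0.9".
    static String build(List<String> specs) {
        // Deduplicate: keep the highest q value seen for each media type.
        Map<String, Double> best = new LinkedHashMap<>();
        for (String spec : specs) {
            String[] parts = spec.trim().split(";");
            String type = parts[0].trim().toLowerCase();
            double q = 1.0; // q defaults to 1.0 when absent
            for (int i = 1; i < parts.length; i++) {
                String param = parts[i].trim();
                if (param.startsWith("q=")) {
                    q = Double.parseDouble(param.substring(2));
                }
            }
            best.merge(type, q, Double::max);
        }
        // Drop entries covered by a wildcard, then join the rest.
        List<String> out = new ArrayList<>();
        for (Map.Entry<String, Double> e : best.entrySet()) {
            if (isCovered(e.getKey(), e.getValue(), best)) {
                continue;
            }
            double q = e.getValue();
            out.add(q == 1.0 ? e.getKey() : e.getKey() + ";q=" + q);
        }
        return String.join(", ", out);
    }

    // Assumed rule: covered when "major/*" or "*/*" is present with q >= this q.
    static boolean isCovered(String type, double q, Map<String, Double> all) {
        if (type.equals("*/*")) {
            return false;
        }
        int slash = type.indexOf('/');
        String major = type.substring(0, slash);
        String sub = type.substring(slash + 1);
        if (!sub.equals("*")) {
            Double wildcard = all.get(major + "/*");
            if (wildcard != null && wildcard >= q) {
                return true;
            }
        }
        Double any = all.get("*/*");
        return any != null && any >= q;
    }

    public static void main(String[] args) {
        System.out.println(build(List.of(
                "text/html", "text/html;q=0.5", "text/*",
                "application/rdf+xml;q=0.9", "*/*;q=0.1")));
        // prints: text/*, application/rdf+xml;q=0.9, */*;q=0.1
    }
}
```

In this sketch the duplicate text/html entry collapses to its highest q value and is then dropped because text/* covers it, while text/* itself survives because */* carries a lower q value.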
AcceptHeaderBuilder(Collection<MIMEType>) - Constructor for class org.apache.any23.http.AcceptHeaderBuilder
 
acceptURI(URI) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
acceptURI(URI) - Method in interface org.apache.any23.extractor.xpath.XPathExtractionRule
Checks if the rule can be applied on the given document URI.
accountName - Variable in class org.apache.any23.vocab.FOAF
 
accountServiceHomepage - Variable in class org.apache.any23.vocab.FOAF
 
acquaintance - Variable in class org.apache.any23.vocab.XFN
 
acquireProperty(ItemProp) - Method in class org.apache.any23.extractor.microdata.ItemScope
 
action - Variable in class org.apache.any23.vocab.HLISTING
 
action - Variable in class org.apache.any23.vocab.ICAL
Value type: TEXT. This URI defines the action to be invoked when an alarm is triggered.
add(Variable) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
Adds a variable to the template.
add(QuadTemplate) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
Adds a QuadTemplate to the rule.
add(Variable) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
add(QuadTemplate) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
add(XPathExtractionRule) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
 
add(String, String) - Method in class org.apache.any23.rdf.Prefixes
 
add(Prefixes) - Method in class org.apache.any23.rdf.Prefixes
 
addAliasMediaType(String) - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator.VariantSpec
 
addAttribute(String, String, String) - Method in class org.apache.any23.validator.DefaultDOMDocument
 
addAttribute(String, String, String) - Method in interface org.apache.any23.validator.DOMDocument
Adds an attribute to a node addressed by the given XPath.
addBNodeProperty(Node, Resource, URI, BNode) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Helper method that adds a BNode property to a node.
addBNodeProperty(Resource, URI, BNode) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Helper method that adds a BNode property to a node.
addChild(TripleHandler) - Method in class org.apache.any23.writer.CompositeTripleHandler
Adds a decorated handler.
ADDITIONAL_NAME - Static variable in class org.apache.any23.extractor.html.HCardName
 
additional_name - Variable in class org.apache.any23.vocab.VCARD
An additional part of a person's name.
addListener(CrawlerListener) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
Registers a CrawlerListener to this crawler.
addPropertyPath(Class<? extends MicroformatExtractor>, Resource, Resource, BNode, String[]) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
addPropertyPath(Class<? extends MicroformatExtractor>, Resource, Resource, BNode, String[]) - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
Adds a property path to the list of the extracted data.
addResourceRoot(String[], Resource, Class<? extends MicroformatExtractor>) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
addResourceRoot(String[], Resource, Class<? extends MicroformatExtractor>) - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
Adds a root property to the extraction result, together with the path to the root of the data that generated the property and the extractor responsible for the addition.
Address - Variable in class org.apache.any23.vocab.VCARD
Resources that are vCard (postal) addresses.
addressType - Variable in class org.apache.any23.vocab.VCARD
 
addRule(Class<? extends Rule>, Class<? extends Fix>) - Method in class org.apache.any23.validator.DefaultValidator
 
addRule(Class<? extends Rule>) - Method in class org.apache.any23.validator.DefaultValidator
 
addRule(Class<? extends Rule>) - Method in interface org.apache.any23.validator.Validator
Registers a new rule with this validator.
addRule(Class<? extends Rule>, Class<? extends Fix>) - Method in interface org.apache.any23.validator.Validator
Registers a new rule with this validator and associates it with a fix.
addTextExtractor(String, URI, BoilerpipeExtractor) - Method in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
 
addURIProperty(Resource, URI, URI) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Helper method that adds a URI property to a node.
addUserAgentOverride(Pattern, String, String) - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator
Overrides the Accept header for certain user agents.
addVariant(String) - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator
 
addVolatile(String, String) - Method in class org.apache.any23.rdf.Prefixes
 
addVolatile(Prefixes) - Method in class org.apache.any23.rdf.Prefixes
 
adr - Variable in class org.apache.any23.vocab.VCARD
A postal or street address of a person.
AdrExtractor - Class in org.apache.any23.extractor.html
Extractor for the adr microformat.
AdrExtractor() - Constructor for class org.apache.any23.extractor.html.AdrExtractor
 
affiliation - Variable in class org.apache.any23.vocab.DOAC
 
Agent - Variable in class org.apache.any23.vocab.FOAF
 
agent - Variable in class org.apache.any23.vocab.VCARD
A person that acts as one's agent.
aimChatID - Variable in class org.apache.any23.vocab.FOAF
 
allExtractorsSupportAllContentTypes() - Method in class org.apache.any23.extractor.ExtractorGroup
 
allPrefixes() - Method in class org.apache.any23.rdf.Prefixes
 
alternate - Variable in class org.apache.any23.vocab.XHTML
 
altrep - Variable in class org.apache.any23.vocab.ICAL
Specifies an alternate text representation for the URI value.
announce - Variable in class org.apache.any23.vocab.HLISTING
 
Any23 - Class in org.apache.any23
A facade with convenience methods for typical Any23 extraction operations.
Any23(Configuration, ExtractorGroup) - Constructor for class org.apache.any23.Any23
Constructor that allows the specification of a custom configuration and of a list of extractors.
Any23(ExtractorGroup) - Constructor for class org.apache.any23.Any23
Constructor that allows the specification of a list of extractors.
Any23(Configuration, String...) - Constructor for class org.apache.any23.Any23
Constructor that allows the specification of a custom configuration and of a list of extractor names.
Any23(String...) - Constructor for class org.apache.any23.Any23
Constructor that allows the specification of a list of extractor names.
Any23(Configuration) - Constructor for class org.apache.any23.Any23
Constructor accepting Configuration.
Any23() - Constructor for class org.apache.any23.Any23
Constructor with default configuration.
Any23Negotiator - Class in org.apache.any23.servlet.conneg
Defines a ContentTypeNegotiator for Any23.
Any23Negotiator() - Constructor for class org.apache.any23.servlet.conneg.Any23Negotiator
 
Any23PluginManager - Class in org.apache.any23.plugin
The Any23PluginManager is responsible for dynamically inspecting the classpath and retrieving useful classes.
Any23ValueFactoryWrapper - Class in org.apache.any23.rdf
Any23 specialization of the ValueFactory.
Any23ValueFactoryWrapper(ValueFactory, IssueReport, String) - Constructor for class org.apache.any23.rdf.Any23ValueFactoryWrapper
Constructor with error reporter.
Any23ValueFactoryWrapper(ValueFactory, IssueReport) - Constructor for class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
Any23ValueFactoryWrapper(ValueFactory) - Constructor for class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.AboutNotURIRule
 
applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in interface org.apache.any23.validator.Rule
Applies this rule to the given document.
applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.MetaNameMisuseRule
 
applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.MissingOpenGraphNamespaceRule
 
applyTo(Document, Writer) - Method in class org.apache.any23.extractor.rdfa.XSLTStylesheet
Applies the XSLT transformation.
applyTo(Document, Writer, Map<String, String>) - Method in class org.apache.any23.extractor.rdfa.XSLTStylesheet
Applies the XSLT transformation.
asLines(InputStream) - Static method in class org.apache.any23.util.StreamUtils
Returns all the lines read from an input stream.
asMap() - Method in class org.apache.any23.rdf.Prefixes
 
asString(InputStream, boolean) - Static method in class org.apache.any23.util.StreamUtils
Returns the string content of a stream.
asString(InputStream) - Static method in class org.apache.any23.util.StreamUtils
Returns the string content of a stream; newline characters are removed.
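
The asLines and asString entries above can be approximated with plain JDK classes. The following is a minimal sketch, not the StreamUtils source: the class name StreamSketch is hypothetical, the boolean parameter is assumed to control whether newlines are preserved, and UTF-8 decoding is assumed.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not the Any23 StreamUtils class.
final class StreamSketch {

    // Reads every line of the stream; line terminators are stripped.
    static List<String> asLines(InputStream is) {
        List<String> lines = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(is, StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                lines.add(line);
            }
        } catch (IOException e) {
            // Wrapped so callers need no checked-exception handling (a sketch choice).
            throw new UncheckedIOException(e);
        }
        return lines;
    }

    // Assumption: the flag decides whether lines are rejoined with "\n".
    static String asString(InputStream is, boolean preserveNL) {
        return String.join(preserveNL ? "\n" : "", asLines(is));
    }

    // Single-argument variant: newlines are removed.
    static String asString(InputStream is) {
        return asString(is, false);
    }

    public static void main(String[] args) {
        byte[] data = "a\nb\r\nc".getBytes(StandardCharsets.UTF_8);
        System.out.println(asLines(new ByteArrayInputStream(data)));  // prints: [a, b, c]
        System.out.println(asString(new ByteArrayInputStream(data))); // prints: abc
    }
}
```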
attach - Variable in class org.apache.any23.vocab.ICAL
The URI provides the capability to associate a document object with a calendar component. Default value type: URI.
attendee - Variable in class org.apache.any23.vocab.ICAL
The URI defines an "Attendee" within a calendar component. Value type: CAL-ADDRESS.
ATTRIBUTE_CSS - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
AUDIO - Static variable in class org.apache.any23.vocab.OGP
A URL to an audio file to accompany this object.
audio - Variable in class org.apache.any23.vocab.OGP
 
AUDIO__SECURE_URL - Static variable in class org.apache.any23.vocab.OGP
An alternate url to use if the webpage requires HTTPS.
AUDIO__TYPE - Static variable in class org.apache.any23.vocab.OGP
A MIME type for this audio.
audioSecureURL - Variable in class org.apache.any23.vocab.OGP
 
audioType - Variable in class org.apache.any23.vocab.OGP
 
Author - Annotation Type in org.apache.any23.plugin
Describes the author of the plugin.
author - Variable in class org.apache.any23.vocab.HRECIPE
 
available() - Method in class org.apache.any23.util.ReaderInputStream
 

B

based_near - Variable in class org.apache.any23.vocab.FOAF
 
BaseRDFExtractor - Class in org.apache.any23.extractor.rdf
Base class for a generic RDF Extractor.ContentExtractor.
BaseRDFExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.BaseRDFExtractor
Constructor that allows specifying the validation and error handling policies.
BaseRDFExtractor() - Constructor for class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
bday - Variable in class org.apache.any23.vocab.VCARD
The birthday of a person.
BEGIN_SCRIPT - Static variable in class org.apache.any23.extractor.html.MicroformatExtractor
 
BenchmarkTripleHandler - Class in org.apache.any23.writer
TripleHandler decorator useful to perform benchmarking.
BenchmarkTripleHandler(TripleHandler) - Constructor for class org.apache.any23.writer.BenchmarkTripleHandler
Constructor.
birthday - Variable in class org.apache.any23.vocab.FOAF
 
blockContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
bnode(String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a BNode.
bnode() - Static method in class org.apache.any23.rdf.RDFUtils
 
BODY_TAG - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
build(InputStream) - Static method in class org.apache.any23.extractor.csv.CSVReaderBuilder
Builds a non-null CSVParser, guessing its configuration from the provided CSV file.
byday - Variable in class org.apache.any23.vocab.ICAL
 
byhour - Variable in class org.apache.any23.vocab.ICAL
 
byminute - Variable in class org.apache.any23.vocab.ICAL
 
bymonth - Variable in class org.apache.any23.vocab.ICAL
 
bysecond - Variable in class org.apache.any23.vocab.ICAL
 
bysetpos - Variable in class org.apache.any23.vocab.ICAL
 
ByteArrayDocumentSource - Class in org.apache.any23.source
ByteArray implementation of DocumentSource.
ByteArrayDocumentSource(byte[], String, String) - Constructor for class org.apache.any23.source.ByteArrayDocumentSource
 
ByteArrayDocumentSource(InputStream, String, String) - Constructor for class org.apache.any23.source.ByteArrayDocumentSource
 
byweekno - Variable in class org.apache.any23.vocab.ICAL
 
byyearday - Variable in class org.apache.any23.vocab.ICAL
 

C

calAddress - Variable in class org.apache.any23.vocab.ICAL
 
calscale - Variable in class org.apache.any23.vocab.ICAL
Value type: TEXT. This URI defines the calendar scale used for the calendar information specified in the iCalendar object.
canAbbreviate(String) - Method in class org.apache.any23.rdf.Prefixes
 
canExpand(String) - Method in class org.apache.any23.rdf.Prefixes
 
categories - Variable in class org.apache.any23.vocab.ICAL
Value type: TEXT. This URI defines the categories for a calendar component.
category - Variable in class org.apache.any23.vocab.VCARD
A category of a vCard.
CELL - Static variable in class org.apache.any23.vocab.Excel
 
cell - Variable in class org.apache.any23.vocab.Excel
This resource identifies a cell.
CELL_VALUE - Static variable in class org.apache.any23.vocab.Excel
 
cellValue - Variable in class org.apache.any23.vocab.Excel
This property links the identifier of a cell to the content of the cell.
checkCSVFormat(InputStream) - Static method in class org.apache.any23.mime.TikaMIMETypeDetector
Checks if the stream contains a valid CSV pattern.
checkN3Format(InputStream) - Static method in class org.apache.any23.mime.TikaMIMETypeDetector
Checks if the stream contains the N3 triple patterns.
checkNQuadsFormat(InputStream) - Static method in class org.apache.any23.mime.TikaMIMETypeDetector
Checks if the stream contains the NQuads patterns.
checkTurtleFormat(InputStream) - Static method in class org.apache.any23.mime.TikaMIMETypeDetector
Checks if the stream contains Turtle triple patterns.
child - Variable in class org.apache.any23.vocab.XFN
 
class_ - Variable in class org.apache.any23.vocab.ICAL
Value type: TEXT. This URI defines the access classification for a calendar component.
class_ - Variable in class org.apache.any23.vocab.VCARD
A class (e.g., public, private, etc.) of a vCard.
classClass - Variable in class org.apache.any23.vocab.WO
 
clazz - Variable in class org.apache.any23.vocab.WO
 
clazzName - Variable in class org.apache.any23.vocab.WO
 
CLI_PACKAGE - Static variable in class org.apache.any23.plugin.Any23PluginManager
Any23 Command Line Interface package.
close() - Method in class org.apache.any23.cli.Rover
 
close() - Method in interface org.apache.any23.extractor.ExtractionResult
Close the result.
close() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
close() - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
close() - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
close() - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
close() - Method in class org.apache.any23.http.DefaultHTTPClient
Shuts down the connection manager.
close() - Method in interface org.apache.any23.http.HTTPClient
Releases all static resources held by the instance.
close() - Method in class org.apache.any23.util.ReaderInputStream
Closes the StringReader.
close() - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
close() - Method in class org.apache.any23.writer.CompositeTripleHandler
 
close() - Method in class org.apache.any23.writer.CountingTripleHandler
 
close() - Method in class org.apache.any23.writer.JSONWriter
 
close() - Method in class org.apache.any23.writer.LoggingTripleHandler
 
close() - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
close() - Method in class org.apache.any23.writer.ReportingTripleHandler
 
close() - Method in class org.apache.any23.writer.RepositoryWriter
 
close() - Method in interface org.apache.any23.writer.TripleHandler
Will be called last and exactly once.
close() - Method in class org.apache.any23.writer.URIListWriter
 
closeContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
closeContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
closeContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.JSONWriter
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
 
closeContext(ExtractionContext) - Method in interface org.apache.any23.writer.TripleHandler
Informs the handler that no more triples will come from a previously opened context.
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.URIListWriter
 
closeGracefully(Closeable) - Static method in class org.apache.any23.util.StreamUtils
Closes the given Closeable and reports an error, if any.
cn - Variable in class org.apache.any23.vocab.ICAL
Specifies the common name to be associated with the calendar user specified by the URI.
colleague - Variable in class org.apache.any23.vocab.XFN
 
COLUMN_POSITION - Static variable in class org.apache.any23.vocab.CSV
 
columnPosition - Variable in class org.apache.any23.vocab.CSV
This property expresses the index of a column in a CSV file.
comment - Variable in class org.apache.any23.vocab.ICAL
Value type: TEXT. This URI specifies non-processing information intended to provide a comment to the calendar user.
Comment - Variable in class org.apache.any23.vocab.REVIEW
A comment on a review.
commenter - Variable in class org.apache.any23.vocab.REVIEW
The commenter on the review.
compareTo(MIMEType) - Method in class org.apache.any23.mime.MIMEType
 
completed - Variable in class org.apache.any23.vocab.ICAL
Value type: DATE-TIME. This URI defines the date and time that a to-do was actually completed.
component - Variable in class org.apache.any23.vocab.ICAL
 
CompositeTripleHandler - Class in org.apache.any23.writer
A TripleHandler multi decorator, that wraps zero or more other triple handlers and dispatches all events to each of them.
CompositeTripleHandler() - Constructor for class org.apache.any23.writer.CompositeTripleHandler
Constructor with empty decorated list.
CompositeTripleHandler(Collection<TripleHandler>) - Constructor for class org.apache.any23.writer.CompositeTripleHandler
Constructor with initial list of decorated handlers.
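
The multi-decorator idea behind CompositeTripleHandler, with a counting child in the spirit of CountingTripleHandler, can be sketched as follows. The Handler interface here is a deliberately reduced, hypothetical stand-in with invented method signatures; it is not the Any23 TripleHandler interface, and only the addChild name is taken from this index.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the composite (multi-decorator) pattern.
final class CompositeSketch {

    // Reduced stand-in for a triple handler; signatures are assumptions.
    interface Handler {
        void receiveTriple(String subject, String predicate, String object);
        void close();
    }

    // Counts the triples it receives and remembers whether it was closed.
    static final class CountingHandler implements Handler {
        private int count;
        private boolean closed;

        @Override
        public void receiveTriple(String subject, String predicate, String object) {
            count++;
        }

        @Override
        public void close() {
            closed = true;
        }

        int getCount() { return count; }
        boolean isClosed() { return closed; }
    }

    // Wraps zero or more handlers and forwards every event to each of them.
    static final class CompositeHandler implements Handler {
        private final List<Handler> children = new ArrayList<>();

        void addChild(Handler child) {
            children.add(child);
        }

        @Override
        public void receiveTriple(String subject, String predicate, String object) {
            for (Handler child : children) {
                child.receiveTriple(subject, predicate, object);
            }
        }

        @Override
        public void close() {
            for (Handler child : children) {
                child.close();
            }
        }
    }

    public static void main(String[] args) {
        CountingHandler first = new CountingHandler();
        CountingHandler second = new CountingHandler();
        CompositeHandler composite = new CompositeHandler();
        composite.addChild(first);
        composite.addChild(second);
        composite.receiveTriple("s", "p", "o");
        composite.close();
        System.out.println(first.getCount() + " " + second.getCount()); // prints: 1 1
    }
}
```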
ComputerSkill - Variable in class org.apache.any23.vocab.DOAC
 
conditionallyAddLiteralProperty(Node, Resource, URI, Literal) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Helper method that adds a literal property to a node.
conditionallyAddResourceProperty(Resource, URI, URI) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Helper method that adds a URI property to a node.
conditionallyAddStringProperty(Node, Resource, URI, String) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Helper method that adds a literal property to a subject only if the value of the property is a valid string.
Configuration - Interface in org.apache.any23.configuration
Defines the main Any23 configuration.
configure() - Method in class org.apache.any23.cli.Rover
 
configureExtractors(ExtractorGroup, File...) - Method in class org.apache.any23.plugin.Any23PluginManager
Configures a new list of extractors containing the extractors declared in initialExtractorGroup and also the extractors detected in classpath specified by pluginLocations.
configureExtractors(ExtractorGroup) - Method in class org.apache.any23.plugin.Any23PluginManager
Configures a new list of extractors containing the extractors declared in initialExtractorGroup and also the extractors detected in classpath specified by the default configuration.
contact - Variable in class org.apache.any23.vocab.ICAL
Value type: TEXT. The URI is used to represent contact information or alternately a reference to contact information associated with the calendar component.
contact - Variable in class org.apache.any23.vocab.XFN
 
contains(XPathExtractionRule) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
 
CONTAINS_CELL - Static variable in class org.apache.any23.vocab.Excel
 
CONTAINS_ROW - Static variable in class org.apache.any23.vocab.Excel
 
CONTAINS_SHEET - Static variable in class org.apache.any23.vocab.Excel
 
containsCell - Variable in class org.apache.any23.vocab.Excel
This property links the identifier of a row to the identifier of a cell.
containsField(String) - Method in class org.apache.any23.extractor.html.HCardName
 
containsRow - Variable in class org.apache.any23.vocab.Excel
This property links the identifier of a sheet to the identifier of a row.
containsSheet - Variable in class org.apache.any23.vocab.Excel
This property links the identifier of a document to the identifier of a sheet.
CONTENT_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
ContentTypeNegotiator - Class in org.apache.any23.servlet.conneg
This class defines a negotiator for content types based on scoring.
ContentTypeNegotiator() - Constructor for class org.apache.any23.servlet.conneg.ContentTypeNegotiator
 
ContentTypeNegotiator.VariantSpec - Class in org.apache.any23.servlet.conneg
 
ContentTypeNegotiator.VariantSpec(String) - Constructor for class org.apache.any23.servlet.conneg.ContentTypeNegotiator.VariantSpec
 
convert(String) - Method in class org.apache.any23.cli.Crawler.PatterConverter
 
convert(String) - Method in class org.apache.any23.cli.MicrodataParser.MicrodataParserDocumentSourceConverter
 
convert(String) - Method in class org.apache.any23.cli.MimeDetector.MimeDetectorDocumentSourceConverter
 
convert(String) - Method in class org.apache.any23.cli.Rover.ArgumentToURIConverter
 
convert(String) - Method in class org.apache.any23.cli.Rover.PrintStreamConverter
 
convert(String) - Method in class org.apache.any23.cli.VocabPrinter.VocabularyFormatConverter
 
copy() - Static method in class org.apache.any23.configuration.DefaultConfiguration
 
copy(String) - Method in class org.apache.any23.extractor.ExtractionContext
 
coResident - Variable in class org.apache.any23.vocab.XFN
 
count - Variable in class org.apache.any23.vocab.ICAL
 
CountingTripleHandler - Class in org.apache.any23.writer
A simple TripleHandler that merely counts the number of triples it has received.
CountingTripleHandler(boolean) - Constructor for class org.apache.any23.writer.CountingTripleHandler
 
CountingTripleHandler() - Constructor for class org.apache.any23.writer.CountingTripleHandler
 
countNL(String) - Static method in class org.apache.any23.util.StringUtils
Counts the number of newline characters in the given string.
countOccurrences(String, String) - Static method in class org.apache.any23.util.StringUtils
Counts the non-overlapping occurrences of content within container.
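
Non-overlapping counting, as described for countOccurrences, means that after each match the search resumes past the end of that match. A minimal sketch follows; the class StringCountSketch is hypothetical, and both the (container, content) parameter order and the treatment of an empty content string are assumptions rather than documented StringUtils behaviour.

```java
// Hypothetical sketch, not the Any23 StringUtils class.
final class StringCountSketch {

    // Counts matches of content in container, skipping past each match.
    static int countOccurrences(String container, String content) {
        if (content.isEmpty()) {
            return 0; // assumption: an empty pattern never matches
        }
        int count = 0;
        int from = container.indexOf(content);
        while (from != -1) {
            count++;
            // Resume after the whole match so occurrences cannot overlap.
            from = container.indexOf(content, from + content.length());
        }
        return count;
    }

    // Counts newline characters by reusing the occurrence counter.
    static int countNL(String in) {
        return countOccurrences(in, "\n");
    }

    public static void main(String[] args) {
        System.out.println(countOccurrences("aaaa", "aa")); // prints: 2
        System.out.println(countNL("a\nb\n"));              // prints: 2
    }
}
```

With overlapping matches allowed, "aa" would be found three times in "aaaa"; skipping past each match yields two.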
country_name - Variable in class org.apache.any23.vocab.VCARD
The country of a postal address.
countryName - Variable in class org.apache.any23.vocab.HLISTING
 
Course - Variable in class org.apache.any23.vocab.DOAC
 
coWorker - Variable in class org.apache.any23.vocab.XFN
 
cp(InputStream, File) - Static method in class org.apache.any23.util.FileUtils
Copies the content of the input stream into the given dest file.
cp(File, File) - Static method in class org.apache.any23.util.FileUtils
Copies the file src to dest.
Crawler - Class in org.apache.any23.cli
Implementation of a CLI crawler based on Rover.
Crawler() - Constructor for class org.apache.any23.cli.Crawler
 
Crawler.PatterConverter - Class in org.apache.any23.cli
 
Crawler.PatterConverter() - Constructor for class org.apache.any23.cli.Crawler.PatterConverter
 
CrawlerListener - Interface in org.apache.any23.plugin.crawler
Defines a listener for a SiteCrawler.
create(String, Prefixes, Collection<String>, String, Class<S>) - Static method in class org.apache.any23.extractor.SimpleExtractorFactory
Creates an instance of an ExtractorFactory serving concrete implementation instances of Extractor.
create1(String, String) - Static method in class org.apache.any23.rdf.Prefixes
 
createBNode() - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createBNode(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createClass(String, String) - Method in class org.apache.any23.vocab.Vocabulary
Creates a resource and registers it in the Vocabulary.classes map.
created - Variable in class org.apache.any23.vocab.ICAL
This URI specifies the date and time that the calendar information was created by the calendar user agent in the calendar store.
createDocumentSource(String) - Method in class org.apache.any23.Any23
Returns the most appropriate DocumentSource for the given documentURI.
createExtractor() - Method in interface org.apache.any23.extractor.ExtractorFactory
Creates an extractor instance.
createExtractor() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
 
createFromMap(Map<String, String>, boolean) - Static method in class org.apache.any23.rdf.Prefixes
 
createHTTPDocumentSource(HTTPClient, String) - Method in class org.apache.any23.servlet.Servlet
 
createInitializedHTTPClient() - Static method in class org.apache.any23.http.DefaultHTTPClient
Creates an already initialized DefaultHTTPClient instance.
createLiteral(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(boolean) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(byte) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(short) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(int) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(long) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(float) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(double) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(XMLGregorianCalendar) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(String, String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(String, URI) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLocalCopy(DocumentSource) - Method in interface org.apache.any23.source.LocalCopyFactory
 
createLocalCopy(DocumentSource) - Method in class org.apache.any23.source.MemCopyFactory
 
createProperty(String) - Method in class org.apache.any23.vocab.CSV
 
createProperty(String) - Method in class org.apache.any23.vocab.Excel
 
createProperty(String, String) - Method in class org.apache.any23.vocab.Vocabulary
Creates a property and registers it in the Vocabulary.properties map.
createResource(String) - Method in class org.apache.any23.vocab.CSV
 
createResource(String) - Method in class org.apache.any23.vocab.Excel
 
createStatement(Resource, URI, Value) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createStatement(Resource, URI, Value, Resource) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createSubset(String...) - Static method in class org.apache.any23.rdf.PopularPrefixes
This method performs a prefix lookup.
createSubset(String...) - Method in class org.apache.any23.rdf.Prefixes
 
createURI(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createURI(String, String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createURI(String) - Method in class org.apache.any23.vocab.Vocabulary
Creates a URI.
creator - Variable in class org.apache.any23.vocab.DCTERMS
 
crush - Variable in class org.apache.any23.vocab.XFN
 
CSV - Class in org.apache.any23.vocab
This vocabulary models the structure of a CSV file according to RFC 4180.
CSV_MIMETYPE - Static variable in class org.apache.any23.mime.TikaMIMETypeDetector
 
CSVExtractor - Class in org.apache.any23.extractor.csv
This extractor produces RDF from a CSV file.
CSVExtractor() - Constructor for class org.apache.any23.extractor.csv.CSVExtractor
 
CSVReaderBuilder - Class in org.apache.any23.extractor.csv
This class builds a reader by first guessing the configuration from the file itself and, if that fails, falling back to the DefaultConfiguration.
CSVReaderBuilder() - Constructor for class org.apache.any23.extractor.csv.CSVReaderBuilder
 
CURIE_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
currentProject - Variable in class org.apache.any23.vocab.FOAF
 
cutype - Variable in class org.apache.any23.vocab.ICAL
Specifies the type of calendar user specified by the URI.

D

DATATYPE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
date - Variable in class org.apache.any23.vocab.DCTERMS
 
DATE - Static variable in class org.apache.any23.vocab.SINDICE
 
date - Variable in class org.apache.any23.vocab.SINDICE
Date meta property indicating the data generation time.
date - Variable in class org.apache.any23.vocab.XFN
 
daylight - Variable in class org.apache.any23.vocab.ICAL
 
DCTERMS - Class in org.apache.any23.vocab
The DCTERMS vocabulary.
DEFAULT_BASE_URI - Static variable in class org.apache.any23.servlet.Servlet
 
DEFAULT_CONFIG_FILE - Static variable in class org.apache.any23.configuration.DefaultConfiguration
Default configuration file.
DEFAULT_HTTP_CLIENT_USER_AGENT - Static variable in class org.apache.any23.Any23
Default HTTP User Agent defined in default configuration.
DEFAULT_NUM_OF_CRAWLERS - Static variable in class org.apache.any23.plugin.crawler.SiteCrawler
Default number of crawler instances.
DEFAULT_PAGE_FILTER_RE - Static variable in class org.apache.any23.plugin.crawler.SiteCrawler
 
DEFAULT_PLUGIN_DIR - Static variable in class org.apache.any23.cli.ToolRunner
 
DEFAULT_WEB_CRAWLER - Static variable in class org.apache.any23.plugin.crawler.SiteCrawler
Default crawler implementation.
DefaultConfiguration - Class in org.apache.any23.configuration
Default implementation of Configuration.
DefaultConfiguration(Properties) - Constructor for class org.apache.any23.configuration.DefaultConfiguration
 
DefaultDOMDocument - Class in org.apache.any23.validator
This class wraps the DOM document.
DefaultDOMDocument(URI, Document) - Constructor for class org.apache.any23.validator.DefaultDOMDocument
 
defaultFilters - Variable in class org.apache.any23.plugin.crawler.SiteCrawler
Default filter applied to skip contents.
DefaultHTTPClient - Class in org.apache.any23.http
Opens an InputStream on an HTTP URI.
DefaultHTTPClient() - Constructor for class org.apache.any23.http.DefaultHTTPClient
 
DefaultHTTPClientConfiguration - Class in org.apache.any23.http
Default implementation of HTTPClientConfiguration.
DefaultHTTPClientConfiguration(String, int, int, String) - Constructor for class org.apache.any23.http.DefaultHTTPClientConfiguration
Constructor.
DefaultHTTPClientConfiguration(String) - Constructor for class org.apache.any23.http.DefaultHTTPClientConfiguration
Constructor.
DefaultHTTPClientConfiguration() - Constructor for class org.apache.any23.http.DefaultHTTPClientConfiguration
Constructor.
DefaultModifiableConfiguration - Class in org.apache.any23.configuration
Default implementation of ModifiableConfiguration.
DefaultModifiableConfiguration(Properties) - Constructor for class org.apache.any23.configuration.DefaultModifiableConfiguration
 
DefaultRuleContext - Class in org.apache.any23.validator
Default implementation of RuleContext.
DefaultRuleContext() - Constructor for class org.apache.any23.validator.DefaultRuleContext
 
DefaultValidationReport - Class in org.apache.any23.validator
Default implementation of ValidationReport.
DefaultValidationReport(List<ValidationReport.Issue>, List<ValidationReport.RuleActivation>, List<ValidationReport.Error>) - Constructor for class org.apache.any23.validator.DefaultValidationReport
 
DefaultValidationReportBuilder - Class in org.apache.any23.validator
Default implementation of ValidationReportBuilder.
DefaultValidationReportBuilder() - Constructor for class org.apache.any23.validator.DefaultValidationReportBuilder
 
DefaultValidator - Class in org.apache.any23.validator
Default implementation of Validator.
DefaultValidator() - Constructor for class org.apache.any23.validator.DefaultValidator
 
DefaultWebCrawler - Class in org.apache.any23.plugin.crawler
Default WebCrawler implementation.
DefaultWebCrawler() - Constructor for class org.apache.any23.plugin.crawler.DefaultWebCrawler
 
deferProperties(String...) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
Given a document and a list of itemprop names, this method returns the matching itemprops.
defineProperty(String) - Method in interface org.apache.any23.configuration.Configuration
Checks whether a property is defined or not in configuration.
defineProperty(String) - Method in class org.apache.any23.configuration.DefaultConfiguration
 
Degree - Variable in class org.apache.any23.vocab.DOAC
 
delegatedFrom - Variable in class org.apache.any23.vocab.ICAL
To specify the calendar users that have delegated their participation to the calendar user specified by the URI.
delegatedTo - Variable in class org.apache.any23.vocab.ICAL
To specify the calendar users to whom the calendar user specified by the URI has delegated participation.
depiction - Variable in class org.apache.any23.vocab.FOAF
 
depicts - Variable in class org.apache.any23.vocab.FOAF
 
description - Variable in class org.apache.any23.vocab.HLISTING
 
description - Variable in class org.apache.any23.vocab.ICAL
value type: TEXT. This URI provides a more complete description of the calendar component than that provided by the "SUMMARY" URI.
DESCRIPTION - Static variable in class org.apache.any23.vocab.OGP
A one to two sentence description of your object.
description - Variable in class org.apache.any23.vocab.OGP
 
DETERMINER - Static variable in class org.apache.any23.vocab.OGP
The word that appears before this object's title in a sentence.
determiner - Variable in class org.apache.any23.vocab.OGP
 
dir - Variable in class org.apache.any23.vocab.ICAL
To specify a reference to a directory entry associated with the calendar user specified by the URI.
DiscoveryUtils - Class in org.apache.any23.util
This class provides utility methods for discovering classes in packages.
disownProperty(ItemProp) - Method in class org.apache.any23.extractor.microdata.ItemScope
 
division - Variable in class org.apache.any23.vocab.WO
 
divisionClass - Variable in class org.apache.any23.vocab.WO
 
divisionName - Variable in class org.apache.any23.vocab.WO
 
dnaChecksum - Variable in class org.apache.any23.vocab.FOAF
 
DOAC - Class in org.apache.any23.vocab
The Description Of A Career vocabulary.
Document - Variable in class org.apache.any23.vocab.FOAF
 
DocumentReport - Class in org.apache.any23.extractor.html
Represents the validation report generated by the TagSoupParser when a document is retrieved and validated.
DocumentReport(ValidationReport, Document) - Constructor for class org.apache.any23.extractor.html.DocumentReport
 
DocumentSource - Interface in org.apache.any23.source
A source of input streams.
doGet(HttpServletRequest, HttpServletResponse) - Method in class org.apache.any23.servlet.RedirectServlet
 
doGet(HttpServletRequest, HttpServletResponse) - Method in class org.apache.any23.servlet.Servlet
 
DOMAIN - Static variable in class org.apache.any23.vocab.SINDICE
 
domain - Variable in class org.apache.any23.vocab.SINDICE
This property expresses the DNS domain of the resource on which it is applied.
DomainOf_rrule - Variable in class org.apache.any23.vocab.ICAL
 
DOMDocument - Interface in org.apache.any23.validator
This interface models a document to be processed by the Validator.
DomUtils - Class in org.apache.any23.extractor.html
This class provides utility methods for DOM manipulation.
doPost(HttpServletRequest, HttpServletResponse) - Method in class org.apache.any23.servlet.RedirectServlet
 
doPost(HttpServletRequest, HttpServletResponse) - Method in class org.apache.any23.servlet.Servlet
 
DrivingSkill - Variable in class org.apache.any23.vocab.DOAC
 
dtend - Variable in class org.apache.any23.vocab.ICAL
This URI specifies the date and time that a calendar component ends. Default value type: DATE-TIME.
dtexpired - Variable in class org.apache.any23.vocab.HLISTING
 
dtlisted - Variable in class org.apache.any23.vocab.HLISTING
 
dtstamp - Variable in class org.apache.any23.vocab.ICAL
value type: DATE-TIME. The URI indicates the date/time that the instance of the iCalendar object was created.
dtstart - Variable in class org.apache.any23.vocab.ICAL
default value type: DATE-TIME. This URI specifies when the calendar component begins.
due - Variable in class org.apache.any23.vocab.ICAL
default value type: DATE-TIME. This URI defines the date and time that a to-do is expected to be completed.
dumpContent(File, String) - Static method in class org.apache.any23.util.FileUtils
Dumps the given string within a file.
dumpContent(File, Throwable) - Static method in class org.apache.any23.util.FileUtils
Dumps the stack trace of the given exception into the specified file.
Duration - Variable in class org.apache.any23.vocab.HRECIPE
 
duration - Variable in class org.apache.any23.vocab.HRECIPE
 
duration - Variable in class org.apache.any23.vocab.ICAL
The URI specifies a positive duration of time. Value type: DURATION.
durationTime - Variable in class org.apache.any23.vocab.HRECIPE
 
durationTitle - Variable in class org.apache.any23.vocab.HRECIPE
 

E

education - Variable in class org.apache.any23.vocab.DOAC
 
Education - Variable in class org.apache.any23.vocab.DOAC
 
ELEMENT_LOCATION - Static variable in class org.apache.any23.extractor.html.TagSoupParser
 
email - Variable in class org.apache.any23.vocab.VCARD
An email address.
EMPTY - Static variable in class org.apache.any23.rdf.Prefixes
 
EmptyValidationReport - Class in org.apache.any23.validator
An implementation of ValidationReportBuilder with no data.
encoding - Variable in class org.apache.any23.vocab.ICAL
To specify an alternate inline encoding for the URI value.
EncodingDetector - Interface in org.apache.any23.encoding
Defines a detector for charset encoding.
end_date - Variable in class org.apache.any23.vocab.DOAC
 
END_SCRIPT - Static variable in class org.apache.any23.extractor.html.MicroformatExtractor
 
endDocument(URI) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
endDocument(URI) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
endDocument(URI) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
endDocument(URI) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
endDocument(URI) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
endDocument(URI) - Method in class org.apache.any23.writer.CountingTripleHandler
 
endDocument(URI) - Method in class org.apache.any23.writer.JSONWriter
 
endDocument(URI) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
endDocument(URI) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
endDocument(URI) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
endDocument(URI) - Method in class org.apache.any23.writer.RepositoryWriter
 
endDocument(URI) - Method in interface org.apache.any23.writer.TripleHandler
Informs the handler that the end of the document has been reached.
endDocument(URI) - Method in class org.apache.any23.writer.URIListWriter
 
endRDF() - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
 
endRDF() - Method in class org.apache.any23.io.nquads.NQuadsWriter
 
EntityBasedMicroformatExtractor - Class in org.apache.any23.extractor.html
Base class for microformat extractors based on entities.
EntityBasedMicroformatExtractor() - Constructor for class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
 
equals(Object) - Method in class org.apache.any23.extractor.ExtractionContext
 
equals(Object) - Method in class org.apache.any23.extractor.ExtractionParameters
 
equals(Object) - Method in class org.apache.any23.extractor.microdata.Item
 
equals(Object) - Method in class org.apache.any23.extractor.microdata.ItemProp
 
equals(Object) - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
equals(Object) - Method in class org.apache.any23.extractor.microdata.ItemScope
 
ERRORED_META_NODES - Static variable in class org.apache.any23.validator.rule.MetaNameMisuseRule
 
escapeAsJSONString(String) - Static method in class org.apache.any23.util.StringUtils
Escapes the in string as a JSON string so that it can be embedded within a string field.
escapeDoubleQuotes(String) - Static method in class org.apache.any23.util.StringUtils
Escapes all the unescaped double quotes when needed.
event - Variable in class org.apache.any23.vocab.HLISTING
 
ExampleInputOutput - Class in org.apache.any23.extractor
A reporter for example input and output of an extractor.
ExampleInputOutput(String) - Constructor for class org.apache.any23.extractor.ExampleInputOutput
 
ExampleInputOutput(ExtractorFactory<?>) - Constructor for class org.apache.any23.extractor.ExampleInputOutput
 
Excel - Class in org.apache.any23.vocab
The MS Excel extractor vocabulary.
ExcelExtractor - Class in org.apache.any23.plugin.officescraper
Implementation of Extractor.ContentExtractor able to process MS Excel 97-2007+ files (.xls/.xlsx) and convert the detected content to triples.
ExcelExtractor() - Constructor for class org.apache.any23.plugin.officescraper.ExcelExtractor
 
ExcelPlugin - Class in org.apache.any23.plugin.officescraper
Implementation of ExtractorPlugin for ExcelExtractor.
ExcelPlugin() - Constructor for class org.apache.any23.plugin.officescraper.ExcelPlugin
 
exdate - Variable in class org.apache.any23.vocab.ICAL
default value type: DATE-TIME. This URI defines the list of date/time exceptions for a recurring calendar component.
execute(String...) - Method in class org.apache.any23.cli.ToolRunner
 
execute(Rule, RuleContext, DOMDocument) - Method in interface org.apache.any23.validator.Fix
Executes this fix over a document.
execute(Rule, RuleContext, DOMDocument) - Method in class org.apache.any23.validator.rule.MetaNameMisuseFix
 
execute(Rule, RuleContext, DOMDocument) - Method in class org.apache.any23.validator.rule.OpenGraphNamespaceFix
 
expand(String) - Method in class org.apache.any23.rdf.Prefixes
 
experience - Variable in class org.apache.any23.vocab.DOAC
 
Experience - Variable in class org.apache.any23.vocab.DOAC
 
exrule - Variable in class org.apache.any23.vocab.ICAL
This URI defines a rule or repeating pattern for an exception to a recurrence set. Value type: RECUR.
extended_address - Variable in class org.apache.any23.vocab.VCARD
The extended address of a postal address.
extendedAddress - Variable in class org.apache.any23.vocab.HLISTING
 
extract(ExtractionParameters, DocumentSource, TripleHandler, String) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given in document source, sending the generated events to the specified outputHandler.
extract(String, String, String, String, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction on the in string associated to the documentURI URI, declaring contentType and encoding.
extract(String, String, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction on the in string associated to the documentURI URI, sending the generated events to the specified outputHandler.
extract(File, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given file sending the generated events to the specified outputHandler.
extract(ExtractionParameters, String, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given documentURI sending the generated events to the specified outputHandler.
extract(String, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given documentURI sending the generated events to the specified outputHandler.
extract(DocumentSource, TripleHandler, String) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given in document source, sending the generated events to the specified outputHandler.
extract(DocumentSource, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given in document source, sending the generated events to the specified outputHandler.
extract(ExtractionParameters, DocumentSource, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given in document source, sending the generated events to the specified outputHandler.
extract() - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
 
extract() - Method in class org.apache.any23.extractor.html.HCalendarExtractor
 
extract() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Performs the extraction of the data and writes them to the model.
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.AdrExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
Extracts an entity from a DOM node.
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.GeoExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HCardExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HListingExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HRecipeExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HResumeExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HReviewExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.SpeciesExtractor
Extracts an entity from a DOM node.
EXTRACTION_CONTEXT_URI_PROPERTY - Static variable in class org.apache.any23.extractor.ExtractionParameters
 
ExtractionContext - Class in org.apache.any23.extractor
This class provides the context for the processing of a single Extractor.
ExtractionContext(String, URI, String, String) - Constructor for class org.apache.any23.extractor.ExtractionContext
 
ExtractionContext(String, URI, String) - Constructor for class org.apache.any23.extractor.ExtractionContext
 
ExtractionContext(String, URI) - Constructor for class org.apache.any23.extractor.ExtractionContext
 
ExtractionContextBlocker - Class in org.apache.any23.filter
A wrapper around a TripleHandler that can block and unblock calls to the handler, either for the entire document, or for individual ExtractionContexts.
ExtractionContextBlocker(TripleHandler) - Constructor for class org.apache.any23.filter.ExtractionContextBlocker
 
ExtractionException - Exception in org.apache.any23.extractor
Defines a specific exception raised during the metadata extraction phase.
ExtractionException(String) - Constructor for exception org.apache.any23.extractor.ExtractionException
 
ExtractionException(String, Throwable) - Constructor for exception org.apache.any23.extractor.ExtractionException
 
ExtractionException(String, Throwable, ExtractionResult) - Constructor for exception org.apache.any23.extractor.ExtractionException
 
ExtractionParameters - Class in org.apache.any23.extractor
This class models the parameters to be used to perform an extraction.
ExtractionParameters(Configuration, ExtractionParameters.ValidationMode, Map<String, Boolean>, Map<String, String>) - Constructor for class org.apache.any23.extractor.ExtractionParameters
Constructor.
ExtractionParameters(Configuration, ExtractionParameters.ValidationMode) - Constructor for class org.apache.any23.extractor.ExtractionParameters
Constructor.
ExtractionParameters(Configuration, ExtractionParameters.ValidationMode, boolean) - Constructor for class org.apache.any23.extractor.ExtractionParameters
Constructor that allows explicitly setting the value of the flag SingleDocumentExtraction#METADATA_NESTING_FLAG.
ExtractionParameters.ValidationMode - Enum in org.apache.any23.extractor
Declares the supported validation actions.
ExtractionReport - Class in org.apache.any23
This class contains some statistics and general information about an extraction.
ExtractionReport(List<Extractor>, String, String, ValidationReport, Map<String, Collection<IssueReport.Issue>>) - Constructor for class org.apache.any23.ExtractionReport
 
ExtractionResult - Interface in org.apache.any23.extractor
Interface defining the methods that a representation of an extraction result must have.
ExtractionResultImpl - Class in org.apache.any23.extractor
A default implementation of ExtractionResult; it receives extraction output from one Extractor working on one document, and passes the output on to a TripleHandler.
ExtractionResultImpl(ExtractionContext, Extractor<?>, TripleHandler) - Constructor for class org.apache.any23.extractor.ExtractionResultImpl
 
Extractor<Input> - Interface in org.apache.any23.extractor
It defines the signature of a generic Extractor.
Extractor.BlindExtractor - Interface in org.apache.any23.extractor
This interface specializes an Extractor able to handle URI as input format.
Extractor.ContentExtractor - Interface in org.apache.any23.extractor
This interface specializes an Extractor able to handle InputStream as input format.
Extractor.TagSoupDOMExtractor - Interface in org.apache.any23.extractor
This interface specializes an Extractor able to handle Document as input format.
ExtractorDescription - Interface in org.apache.any23.extractor
It defines a minimal signature for an Extractor description.
ExtractorDocumentation - Class in org.apache.any23.cli
This class provides some command-line documentation about available extractors and their usage.
ExtractorDocumentation() - Constructor for class org.apache.any23.cli.ExtractorDocumentation
 
ExtractorFactory<T extends Extractor<?>> - Interface in org.apache.any23.extractor
Interface defining a factory for Extractor.
ExtractorGroup - Class in org.apache.any23.extractor
It simply models a group of ExtractorFactory instances, providing simple access methods.
ExtractorGroup(Collection<ExtractorFactory<?>>) - Constructor for class org.apache.any23.extractor.ExtractorGroup
 
ExtractorPlugin<T extends Extractor<?>> - Interface in org.apache.any23.plugin
This interface defines an org.apache.any23.cli.Any23 extractor plugin that can be detected and registered from the library classpath.
ExtractorRegistry - Interface in org.apache.any23.extractor
An interface that enables a registry for extractors to be implemented by different implementors of this API.
ExtractorRegistryImpl - Class in org.apache.any23.extractor
Singleton class acting as a registry for all the various Extractor implementations.
ExtractorRegistryImpl() - Constructor for class org.apache.any23.extractor.ExtractorRegistryImpl
 
extractPrefixSections(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Given a prefix declaration, returns a list of prefixID:prefixURL strings, normalizing blanks where present.
extractRelTag(String) - Static method in class org.apache.any23.extractor.html.HTMLDocument
Extracts the href specific rel-tag string.
extractRelTag(NamedNodeMap) - Static method in class org.apache.any23.extractor.html.HTMLDocument
Extracts the href specific rel-tag string.
extractRelTagNodes() - Method in class org.apache.any23.extractor.html.HTMLDocument
Extracts all the rel tag nodes.

F

factory - Static variable in class org.apache.any23.extractor.csv.CSVExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.AdrExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.GeoExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.HCalendarExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.HCardExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.HeadLinkExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.HListingExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.HRecipeExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.HResumeExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.HReviewExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.HTMLMetaExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.ICBMExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.LicenseExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.SpeciesExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.TitleExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.TurtleHTMLExtractor
 
factory - Static variable in class org.apache.any23.extractor.html.XFNExtractor
 
factory - Static variable in class org.apache.any23.extractor.microdata.MicrodataExtractor
 
factory - Static variable in class org.apache.any23.extractor.rdf.NQuadsExtractor
 
factory - Static variable in class org.apache.any23.extractor.rdf.NTriplesExtractor
 
factory - Static variable in class org.apache.any23.extractor.rdf.RDFXMLExtractor
 
factory - Static variable in class org.apache.any23.extractor.rdf.TriXExtractor
 
factory - Static variable in class org.apache.any23.extractor.rdf.TurtleExtractor
 
factory - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Extractor
 
factory - Static variable in class org.apache.any23.extractor.rdfa.RDFaExtractor
 
factory - Static variable in class org.apache.any23.extractor.xpath.XPathExtractor
 
factory - Static variable in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
 
factory - Static variable in class org.apache.any23.plugin.officescraper.ExcelExtractor
 
family - Variable in class org.apache.any23.vocab.WO
A family is a scientific grouping of closely related organisms.
FAMILY_NAME - Static variable in class org.apache.any23.extractor.html.HCardName
 
family_name - Variable in class org.apache.any23.vocab.FOAF
 
family_name - Variable in class org.apache.any23.vocab.VCARD
A family name part of a person's name.
familyName - Variable in class org.apache.any23.vocab.WO
Used to specify the name of a family as part of a Taxon Name.
familyProperty - Variable in class org.apache.any23.vocab.WO
Associates a taxon rank with a family.
fax - Variable in class org.apache.any23.vocab.VCARD
A fax number of a person.
fbtype - Variable in class org.apache.any23.vocab.ICAL
To specify the free or busy time type.
Feedback - Variable in class org.apache.any23.vocab.REVIEW
Feedback on the review.
FIELDS - Static variable in class org.apache.any23.extractor.html.HCardName
 
FILE_DOCUMENT_PREFIX - Static variable in class org.apache.any23.cli.MimeDetector
 
FileDocumentSource - Class in org.apache.any23.source
File implementation of DocumentSource.
FileDocumentSource(File) - Constructor for class org.apache.any23.source.FileDocumentSource
 
FileDocumentSource(File, String) - Constructor for class org.apache.any23.source.FileDocumentSource
 
FileUtils - Class in org.apache.any23.util
Utility class for handling files.
filterByMIMEType(MIMEType) - Method in class org.apache.any23.extractor.ExtractorGroup
Returns an ExtractorGroup with the set of Extractor instances able to process the provided MIME type.
find(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Gets the string value of an XPath expression.
find(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
 
findAll(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Returns a NodeList composed of all the nodes that match an XPath expression, which must be valid.
findAll(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
 
findAllByAttributeContains(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
 
findAllByAttributeName(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Finds all nodes that have a declared attribute.
findAllByClassName(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Finds all nodes that have a declared class.
findAllByClassName(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
Finds all the nodes by class name.
findAllByTag(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
 
findAllByTagAndClassName(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
 
findMicroformattedObjectNode(String, String) - Method in class org.apache.any23.extractor.html.HTMLDocument
 
findMicroformattedValue(String, String, String, String, String) - Method in class org.apache.any23.extractor.html.HTMLDocument
 
findNodeById(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Mimics the JS DOM API, or Prototype's $().
findNodeById(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
 
FIRST_CELL - Static variable in class org.apache.any23.vocab.Excel
 
FIRST_ROW - Static variable in class org.apache.any23.vocab.Excel
 
firstCell - Variable in class org.apache.any23.vocab.Excel
This property links the identifier of a Row to the index of the first declared cell.
firstName - Variable in class org.apache.any23.vocab.FOAF
 
firstRow - Variable in class org.apache.any23.vocab.Excel
This property links the identifier of a Sheet to the index of the first declared row.
Fix - Interface in org.apache.any23.validator
This interface models a fix for an issue.
fixAbsoluteURI(String) - Static method in class org.apache.any23.rdf.RDFUtils
Fixes typical errors in an absolute URI, such as unescaped spaces.
fixLink(String) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
fixLink(String, String) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
fixLink(String, String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
Helper method to conditionally add a schema to a URI unless it is already present; returns null if the link is empty.
fixURI(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
fixURIWithException(String) - Static method in class org.apache.any23.rdf.RDFUtils
Tries to fix a potentially broken relative or absolute URI.
fixWhiteSpace(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
FLAG_PROPERTY_OFF - Static variable in class org.apache.any23.configuration.DefaultConfiguration
 
FLAG_PROPERTY_ON - Static variable in class org.apache.any23.configuration.DefaultConfiguration
 
fmttype - Variable in class org.apache.any23.vocab.ICAL
To specify the content type of a referenced object.
fn - Variable in class org.apache.any23.vocab.HRECIPE
 
fn - Variable in class org.apache.any23.vocab.VCARD
A formatted name of a person.
FOAF - Class in org.apache.any23.vocab
The Friend Of A Friend vocabulary.
formatDateTime(Date) - Static method in class org.apache.any23.extractor.microdata.ItemPropValue
 
FormatWriter - Interface in org.apache.any23.writer
Base interface used for the definition of RDF format writers.
freebusy - Variable in class org.apache.any23.vocab.ICAL
The URI defines one or more free or busy time intervals. Value type: PERIOD.
freq - Variable in class org.apache.any23.vocab.ICAL
 
friend - Variable in class org.apache.any23.vocab.XFN
 
fromStrings(Collection<String>) - Static method in class org.apache.any23.http.AcceptHeaderBuilder
 
fundedBy - Variable in class org.apache.any23.vocab.FOAF
 

G

geekcode - Variable in class org.apache.any23.vocab.FOAF
 
gender - Variable in class org.apache.any23.vocab.FOAF
 
genus - Variable in class org.apache.any23.vocab.WO
 
genusClass - Variable in class org.apache.any23.vocab.WO
 
genusName - Variable in class org.apache.any23.vocab.WO
 
GEO - Class in org.apache.any23.vocab
The GEO Names vocabulary.
geo - Variable in class org.apache.any23.vocab.ICAL
value type: list of FLOAT. This URI specifies information related to the global position for the activity specified by a calendar component.
geo - Variable in class org.apache.any23.vocab.VCARD
A geographic location associated with a person.
GeoExtractor - Class in org.apache.any23.extractor.html
Extractor for the Geo microformat.
GeoExtractor() - Constructor for class org.apache.any23.extractor.html.GeoExtractor
 
get() - Static method in class org.apache.any23.rdf.PopularPrefixes
 
getAcceptHeader() - Method in class org.apache.any23.http.AcceptHeaderBuilder
Builds and returns an accept header.
getAcceptHeader() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
 
getAcceptHeader() - Method in interface org.apache.any23.http.HTTPClientConfiguration
 
getActualDocumentURI() - Method in class org.apache.any23.http.DefaultHTTPClient
 
getActualDocumentURI() - Method in interface org.apache.any23.http.HTTPClient
Returns the actual URI from which the document was fetched.
getAliases() - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator.VariantSpec
 
getAllNames() - Method in interface org.apache.any23.extractor.ExtractorRegistry
Returns the names of all registered extractors, sorted alphabetically.
getAllNames() - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
Returns the names of all registered extractors, sorted alphabetically.
getAllRules() - Method in class org.apache.any23.validator.DefaultValidator
 
getAllRules() - Method in interface org.apache.any23.validator.Validator
Returns all the registered rules.
getApplicableExtractors(ExtractorRegistry, File...) - Method in class org.apache.any23.plugin.Any23PluginManager
Returns an extractor group containing both the default extractors declared by the ExtractorRegistry and the ExtractorPlugins.
getApplicableTools(File...) - Method in class org.apache.any23.plugin.Any23PluginManager
Returns an Iterator of tools that have been detected within the given list of locations.
getAsDate() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getAsFloat() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getAsInteger() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getAsLink() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getAsNested() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getAsPlainLiteral(Node, String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
getAsXMLLiteral(Node) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.AdrExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
Returns the base class name for the extractor.
getBaseClassName() - Method in class org.apache.any23.extractor.html.GeoExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.HCardExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.HListingExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.HRecipeExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.HResumeExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.HReviewExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.SpeciesExtractor
Returns the base class name for the extractor.
getBeginColumnNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
 
getBeginLineNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
 
getBestMatch(String) - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator
Returns the MediaRangeSpec associated with the given accept type.
getBestMatch(String, String) - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator
Returns the MediaRangeSpec associated with the given accept type and userAgent.
getBestMatch(List<MediaRangeSpec>) - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
 
getBlankNodeFor(Node) - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
 
getBNode(String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a BNode.
getCause() - Method in class org.apache.any23.validator.ValidationReport.Error
 
getChilds() - Method in class org.apache.any23.writer.CompositeTripleHandler
 
getClass(String) - Method in class org.apache.any23.vocab.Vocabulary
Returns a class defined within this vocabulary.
getClasses() - Method in class org.apache.any23.vocab.Vocabulary
 
getClassesInPackage(String) - Static method in class org.apache.any23.util.DiscoveryUtils
Scans all classes accessible from the context class loader which belong to the given package and sub-packages.
getClassesInPackage(String, Class) - Static method in class org.apache.any23.util.DiscoveryUtils
Scans all classes accessible from the context class loader which belong to the given package and sub-packages, and filters them to those implementing the specified interface iface.
getCol() - Method in class org.apache.any23.extractor.IssueReport.Issue
 
getCommentFor(URI) - Method in class org.apache.any23.vocab.Vocabulary
Returns the comment for the given resource.
getComments() - Method in class org.apache.any23.vocab.Vocabulary
Returns all the defined comments for resources.
getConfigurationDump() - Method in interface org.apache.any23.configuration.Configuration
Returns a human-readable string containing the configuration dump.
getConfigurationDump() - Method in class org.apache.any23.configuration.DefaultConfiguration
 
getConnectionTimeout() - Method in class org.apache.any23.http.DefaultHTTPClient
 
getContent() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getContentLength() - Method in class org.apache.any23.http.DefaultHTTPClient
 
getContentLength() - Method in interface org.apache.any23.http.HTTPClient
 
getContentLength() - Method in class org.apache.any23.source.ByteArrayDocumentSource
 
getContentLength() - Method in interface org.apache.any23.source.DocumentSource
 
getContentLength() - Method in class org.apache.any23.source.FileDocumentSource
 
getContentLength() - Method in class org.apache.any23.source.HTTPDocumentSource
 
getContentLength() - Method in class org.apache.any23.source.StringDocumentSource
 
getContentType() - Method in class org.apache.any23.http.DefaultHTTPClient
 
getContentType() - Method in interface org.apache.any23.http.HTTPClient
The value of the Content-Type header reported by the server.
getContentType() - Method in class org.apache.any23.source.ByteArrayDocumentSource
 
getContentType() - Method in interface org.apache.any23.source.DocumentSource
 
getContentType() - Method in class org.apache.any23.source.FileDocumentSource
 
getContentType() - Method in class org.apache.any23.source.HTTPDocumentSource
 
getContentType() - Method in class org.apache.any23.source.StringDocumentSource
 
getCount() - Method in class org.apache.any23.writer.CountingTripleHandler
 
getCurrentExtractionResult() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Returns the ExtractionResult associated with the extraction session.
getData(String) - Method in class org.apache.any23.validator.DefaultRuleContext
 
getData(String) - Method in interface org.apache.any23.validator.RuleContext
Retrieves a registered object.
getDefaultLanguage() - Method in class org.apache.any23.extractor.ExtractionContext
 
getDefaultLanguage() - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns the document default language.
getDefaultLiteralLanguage() - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
getDefaultTimeout() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
 
getDefaultTimeout() - Method in interface org.apache.any23.http.HTTPClientConfiguration
 
getDescription() - Method in class org.apache.any23.extractor.csv.CSVExtractor
Returns an ExtractorDescription of this extractor.
getDescription() - Method in interface org.apache.any23.extractor.Extractor
Returns an ExtractorDescription of this extractor.
getDescription() - Method in class org.apache.any23.extractor.html.AdrExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.GeoExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HCalendarExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HCardExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HeadLinkExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HListingExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HRecipeExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HResumeExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HReviewExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HTMLMetaExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.ICBMExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.LicenseExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Returns the description of this extractor.
getDescription() - Method in class org.apache.any23.extractor.html.SpeciesExtractor
Returns the description of this extractor.
getDescription() - Method in class org.apache.any23.extractor.html.TitleExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.TurtleHTMLExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.XFNExtractor
 
getDescription() - Method in class org.apache.any23.extractor.microdata.MicrodataExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.NQuadsExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.NTriplesExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.RDFXMLExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.TriXExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.TurtleExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
 
getDescription() - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
 
getDescription() - Method in class org.apache.any23.extractor.xpath.XPathExtractor
 
getDescription() - Method in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
 
getDescription() - Method in class org.apache.any23.plugin.officescraper.ExcelExtractor
 
getDetectedItemScopes() - Method in class org.apache.any23.extractor.microdata.MicrodataParserReport
 
getDetectedMimeType() - Method in class org.apache.any23.ExtractionReport
 
getDetectedMIMEType() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Returns the detected mimetype for the given DocumentSource.
getDocument() - Method in class org.apache.any23.extractor.html.DocumentReport
 
getDocument() - Method in class org.apache.any23.extractor.html.HTMLDocument
 
getDocumentBase(URL, Document) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
getDocumentURI() - Method in class org.apache.any23.extractor.ExtractionContext
 
getDocumentURI() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
getDocumentURI() - Method in class org.apache.any23.source.ByteArrayDocumentSource
 
getDocumentURI() - Method in interface org.apache.any23.source.DocumentSource
 
getDocumentURI() - Method in class org.apache.any23.source.FileDocumentSource
 
getDocumentURI() - Method in class org.apache.any23.source.HTTPDocumentSource
 
getDocumentURI() - Method in class org.apache.any23.source.StringDocumentSource
 
getDocumentURI() - Method in class org.apache.any23.validator.DefaultDOMDocument
 
getDocumentURI() - Method in interface org.apache.any23.validator.DOMDocument
 
getDOM() - Method in class org.apache.any23.extractor.html.TagSoupParser
Returns the DOM of the given document URI.
getEncoding() - Method in class org.apache.any23.ExtractionReport
 
getEndColumnNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
 
getEndLineNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
 
getErrorLocationBeginCol() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
getErrorLocationBeginRow() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
getErrorLocationEndCol() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
getErrorLocationEndRow() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
getErrorMode() - Method in class org.apache.any23.extractor.microdata.MicrodataParser
 
getErrorPath() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
getErrors() - Method in class org.apache.any23.extractor.microdata.MicrodataParser
 
getErrors() - Method in class org.apache.any23.extractor.microdata.MicrodataParserReport
 
getErrors() - Method in class org.apache.any23.validator.DefaultValidationReport
 
getErrors() - Method in class org.apache.any23.validator.EmptyValidationReport
 
getErrors() - Method in interface org.apache.any23.validator.ValidationReport
Returns the list of detected errors.
getExampleInput() - Method in class org.apache.any23.extractor.ExampleInputOutput
 
getExampleInput() - Method in interface org.apache.any23.extractor.ExtractorFactory
An example input file for the extractor, to be used in auto-generated documentation.
getExampleInput() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
 
getExampleOutput() - Method in class org.apache.any23.extractor.ExampleInputOutput
 
getExampleURI() - Method in class org.apache.any23.extractor.ExampleInputOutput
 
getExtendedProperty(String) - Method in class org.apache.any23.vocab.XFN
 
getExtractionContext() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
getExtractionContext() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
getExtractor() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
getExtractor() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
 
getExtractorFactory() - Method in interface org.apache.any23.plugin.ExtractorPlugin
 
getExtractorFactory() - Method in class org.apache.any23.plugin.htmlscraper.HTMLScraperPlugin
 
getExtractorFactory() - Method in class org.apache.any23.plugin.officescraper.ExcelPlugin
 
getExtractorGroup() - Method in interface org.apache.any23.extractor.ExtractorRegistry
 
getExtractorGroup(List<String>) - Method in interface org.apache.any23.extractor.ExtractorRegistry
Returns an ExtractorGroup containing the ExtractorFactory instances matching the names provided as input.
getExtractorGroup() - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
 
getExtractorGroup(List<String>) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
Returns an ExtractorGroup containing the ExtractorFactory instances matching the names provided as input.
getExtractorIssues(String) - Method in class org.apache.any23.ExtractionReport
 
getExtractorName() - Method in class org.apache.any23.extractor.ExtractionContext
 
getExtractorName() - Method in interface org.apache.any23.extractor.ExtractorDescription
Returns the name of the extractor.
getExtractorName() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
 
getExtractorNames() - Method in class org.apache.any23.writer.ReportingTripleHandler
 
getExtractors() - Method in class org.apache.any23.plugin.Any23PluginManager
List of ExtractorPlugin classes declared within the classpath.
getExtractorToIssues() - Method in class org.apache.any23.extractor.SingleDocumentExtractionReport
 
getExtractorType() - Method in interface org.apache.any23.extractor.ExtractorFactory
Returns the extractor type.
getExtractorType() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
 
getFactory(String) - Method in interface org.apache.any23.extractor.ExtractorRegistry
Retrieves an ExtractorFactory given its name.
getFactory(String) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
Retrieves an ExtractorFactory given its name.
getField(String) - Method in class org.apache.any23.extractor.html.HCardName
 
getFields(String) - Method in class org.apache.any23.extractor.html.HCardName
 
getFixes(Class<? extends Rule>) - Method in class org.apache.any23.validator.DefaultValidator
 
getFixes(Class<? extends Rule>) - Method in interface org.apache.any23.validator.Validator
Returns all fixes registered for the given rule.
getFlag(String) - Method in class org.apache.any23.extractor.ExtractionParameters
Returns the value of the specified extraction flag; if the flag is undefined, it is retrieved from the default Configuration.
getFlagProperty(String) - Method in interface org.apache.any23.configuration.Configuration
Returns the value of a flag property.
getFlagProperty(String) - Method in class org.apache.any23.configuration.DefaultConfiguration
 
getFormatByExtension(String) - Static method in class org.apache.any23.rdf.RDFUtils
Returns a parser type from the given extension.
getFormats() - Static method in class org.apache.any23.rdf.RDFUtils
Returns all the available RDFFormats.
getFullName() - Method in class org.apache.any23.extractor.html.HCardName
 
getFullType() - Method in class org.apache.any23.mime.MIMEType
 
getGraph() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
 
getHRName() - Method in interface org.apache.any23.validator.Fix
 
getHRName() - Method in class org.apache.any23.validator.rule.AboutNotURIRule
 
getHRName() - Method in interface org.apache.any23.validator.Rule
 
getHRName() - Method in class org.apache.any23.validator.rule.MetaNameMisuseFix
 
getHRName() - Method in class org.apache.any23.validator.rule.MetaNameMisuseRule
 
getHRName() - Method in class org.apache.any23.validator.rule.MissingOpenGraphNamespaceRule
 
getHRName() - Method in class org.apache.any23.validator.rule.OpenGraphNamespaceFix
 
getHTMLDocument() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
getHTTPClient() - Method in class org.apache.any23.Any23
Returns the current HTTPClient implementation.
getHTTPUserAgent() - Method in class org.apache.any23.Any23
Returns the HTTP User-Agent header value; see RFC 2616, section 14.43.
getId() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
getIdentifier() - Method in class org.apache.any23.writer.JSONWriterFactory
 
getIdentifier() - Method in class org.apache.any23.writer.NQuadsWriterFactory
 
getIdentifier() - Method in class org.apache.any23.writer.NTriplesWriterFactory
 
getIdentifier() - Method in class org.apache.any23.writer.RDFXMLWriterFactory
 
getIdentifier() - Method in class org.apache.any23.writer.TriXWriterFactory
 
getIdentifier() - Method in class org.apache.any23.writer.TurtleWriterFactory
 
getIdentifier() - Method in class org.apache.any23.writer.URIListWriterFactory
 
getIdentifier() - Method in interface org.apache.any23.writer.WriterFactory
 
getIdentifier(WriterFactory) - Static method in class org.apache.any23.writer.WriterFactoryRegistry
Reads the identifier specified for the given FormatWriter.
getIdentifiers() - Method in class org.apache.any23.writer.WriterFactoryRegistry
 
getIndexInParent(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Given a node, returns the index of that node within the list of children of its parent node.
getInstance() - Static method in class org.apache.any23.extractor.ExtractorRegistryImpl
 
getInstance() - Static method in class org.apache.any23.extractor.rdf.RDFParserFactory
 
getInstance() - Static method in class org.apache.any23.plugin.Any23PluginManager
 
getInstance() - Static method in class org.apache.any23.plugin.crawler.SharedData
 
getInstance() - Static method in class org.apache.any23.validator.EmptyValidationReport
 
getInstance() - Static method in class org.apache.any23.vocab.CSV
 
getInstance() - Static method in class org.apache.any23.vocab.DCTERMS
 
getInstance() - Static method in class org.apache.any23.vocab.DOAC
 
getInstance() - Static method in class org.apache.any23.vocab.Excel
 
getInstance() - Static method in class org.apache.any23.vocab.FOAF
 
getInstance() - Static method in class org.apache.any23.vocab.GEO
 
getInstance() - Static method in class org.apache.any23.vocab.HLISTING
 
getInstance() - Static method in class org.apache.any23.vocab.HRECIPE
 
getInstance() - Static method in class org.apache.any23.vocab.ICAL
 
getInstance() - Static method in class org.apache.any23.vocab.OGP
 
getInstance() - Static method in class org.apache.any23.vocab.REVIEW
 
getInstance() - Static method in class org.apache.any23.vocab.SCHEMAORG
 
getInstance() - Static method in class org.apache.any23.vocab.SINDICE
 
getInstance() - Static method in class org.apache.any23.vocab.VCARD
 
getInstance() - Static method in class org.apache.any23.vocab.WO
 
getInstance() - Static method in class org.apache.any23.vocab.XFN
 
getInstance() - Static method in class org.apache.any23.vocab.XHTML
 
getInstance() - Static method in class org.apache.any23.writer.WriterFactoryRegistry
 
getInternalValue() - Method in class org.apache.any23.extractor.xpath.Term
 
getIssueReport() - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
getIssues() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
getIssues() - Method in interface org.apache.any23.extractor.IssueReport
Returns all the collected issues.
getIssues() - Method in class org.apache.any23.validator.DefaultValidationReport
 
getIssues() - Method in class org.apache.any23.validator.EmptyValidationReport
 
getIssues() - Method in interface org.apache.any23.validator.ValidationReport
Returns the list of detected issues.
getIssuesCount() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
getItemId() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
getItemPropNodes(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns all the itemProps detected within the given root node.
getItemProps(Node, boolean) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns all the itemprops for the given itemscope node.
getItemScope(Node) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns the ItemScope instance described within the specified node.
getItemScopeNodes(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns all the itemScopes detected within the given root node.
getLevel() - Method in class org.apache.any23.extractor.IssueReport.Issue
 
getLevel() - Method in class org.apache.any23.validator.ValidationReport.Issue
 
getMajorType() - Method in class org.apache.any23.mime.MIMEType
 
getMapping(String) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Returns a URI mapping for a given prefix.
getMatchingExtractors() - Method in class org.apache.any23.ExtractionReport
 
getMatchingExtractors() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
 
getMaxConnections() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
 
getMaxConnections() - Method in interface org.apache.any23.http.HTTPClientConfiguration
 
getMaxDepth() - Method in class org.apache.any23.plugin.crawler.SiteCrawler
 
getMaxPages() - Method in class org.apache.any23.plugin.crawler.SiteCrawler
 
getMediaType() - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator.VariantSpec
 
getMediaType() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
 
getMessage() - Method in class org.apache.any23.extractor.IssueReport.Issue
 
getMessage() - Method in class org.apache.any23.validator.ValidationReport.Error
 
getMessage() - Method in class org.apache.any23.validator.ValidationReport.Issue
 
getMicrodata(Document, MicrodataParser.ErrorMode) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns all the Microdata items detected within the given document.
getMicrodata(Document) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns all the Microdata items detected within the given document, works in full report mode.
getMicrodataAsJSON(Document, PrintStream) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns a JSON document containing the list of all extracted Microdata, as described in the Microdata JSON Specification.
getMimeType() - Method in class org.apache.any23.writer.JSONWriterFactory
 
getMimeType() - Method in class org.apache.any23.writer.NQuadsWriterFactory
 
getMimeType() - Method in class org.apache.any23.writer.NTriplesWriterFactory
 
getMimeType() - Method in class org.apache.any23.writer.RDFXMLWriterFactory
 
getMimeType() - Method in class org.apache.any23.writer.TriXWriterFactory
 
getMimeType() - Method in class org.apache.any23.writer.TurtleWriterFactory
 
getMimeType() - Method in class org.apache.any23.writer.URIListWriterFactory
 
getMimeType() - Method in interface org.apache.any23.writer.WriterFactory
 
getMimeType(WriterFactory) - Static method in class org.apache.any23.writer.WriterFactoryRegistry
Reads the MIME Type specified for the given FormatWriter.
getMimeTypes() - Method in class org.apache.any23.writer.WriterFactoryRegistry
 
getName() - Method in class org.apache.any23.extractor.microdata.ItemProp
 
getName() - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
 
getName() - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
getName() - Method in class org.apache.any23.extractor.xpath.Variable
 
getName() - Method in interface org.apache.any23.extractor.xpath.XPathExtractionRule
 
getNamespace() - Method in class org.apache.any23.vocab.Vocabulary
 
getNamespaceURIFor(String) - Method in class org.apache.any23.rdf.Prefixes
 
getNegotiator() - Static method in class org.apache.any23.servlet.conneg.Any23Negotiator
 
getNode(String) - Method in class org.apache.any23.validator.DefaultDOMDocument
 
getNode(String) - Method in interface org.apache.any23.validator.DOMDocument
Returns the node addressed by the given XPath; if more than one node matches, an exception is raised.
getNodeLocation(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Returns the row/col location of the given node.
getNodes(String) - Method in class org.apache.any23.validator.DefaultDOMDocument
 
getNodes(String) - Method in interface org.apache.any23.validator.DOMDocument
Returns the list of nodes addressed by the given XPath.
getNodesWithAttribute(String) - Method in class org.apache.any23.validator.DefaultDOMDocument
 
getNodesWithAttribute(String) - Method in interface org.apache.any23.validator.DOMDocument
Returns all the nodes declaring an attribute with the specified name.
getNQuadsParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured NQuadsParser.
getNTriplesParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured NTriplesParser.
getNumOfCrawlers() - Method in class org.apache.any23.plugin.crawler.SiteCrawler
 
getNumOfExtractors() - Method in class org.apache.any23.extractor.ExtractorGroup
 
getObject() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
getObject() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
 
getOrganization() - Method in class org.apache.any23.extractor.html.HCardName
 
getOrganizationUnit() - Method in class org.apache.any23.extractor.html.HCardName
 
getOrigin() - Method in class org.apache.any23.validator.ValidationReport.FixError
 
getOrigin() - Method in class org.apache.any23.validator.ValidationReport.Issue
 
getOrigin() - Method in class org.apache.any23.validator.ValidationReport.RuleError
 
getOriginalDocument() - Method in class org.apache.any23.validator.DefaultDOMDocument
 
getOriginalDocument() - Method in interface org.apache.any23.validator.DOMDocument
Returns the original document.
getParameter(String) - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
 
getParameterNames() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.NQuadsExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.NTriplesExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFXMLExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.TriXExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.TurtleExtractor
 
getParser() - Method in class org.apache.any23.io.nquads.NQuadsParserFactory
 
getParser(RDFFormat) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a new RDFParser instance.
getParserEncoding() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
 
getPath() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
getPath() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
 
getPathToLocalRoot() - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns the sequence of ancestors from the document root to the local root (document).
getPattern() - Method in class org.apache.any23.plugin.crawler.SharedData
 
getPlugins(Class<T>) - Method in class org.apache.any23.plugin.Any23PluginManager
Returns all classes within the specified packageName satisfying the given class filter.
getPluralTextField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns a plural text field.
getPluralUrlField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns the list of URLs associated with the fields marked with class className.
getPolitenessDelay() - Method in class org.apache.any23.plugin.crawler.SiteCrawler
 
getPrecedence(MediaRangeSpec) - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
 
getPredicate() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
 
getPrefixes() - Method in interface org.apache.any23.extractor.ExtractorDescription
An instance defining the prefixes supported by this extractor.
getPrefixes() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
 
getProperties() - Method in interface org.apache.any23.configuration.Configuration
Returns all the defined configuration properties.
getProperties() - Method in class org.apache.any23.configuration.DefaultConfiguration
 
getProperties() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
getProperties() - Method in class org.apache.any23.vocab.Vocabulary
 
getProperty(String, String) - Method in interface org.apache.any23.configuration.Configuration
Returns the value of a specified property, or the default value if the property is not defined.
getProperty(String, String) - Method in class org.apache.any23.configuration.DefaultConfiguration
 
getProperty(String) - Method in class org.apache.any23.extractor.ExtractionParameters
Returns the value of the specified extraction property; if the property is undefined, it is retrieved from the default Configuration.
getProperty() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
getProperty(String) - Method in class org.apache.any23.vocab.Vocabulary
Returns a property defined within this vocabulary.
getProperty(String, URI) - Method in class org.apache.any23.vocab.Vocabulary
Returns a property defined within this vocabulary; if not found, the defaultValue is returned.
getPropertyByLocalName(String) - Method in class org.apache.any23.vocab.XFN
 
getPropertyCamelCase(String) - Method in class org.apache.any23.vocab.Vocabulary
Returns the property URI for the specified property string.
getPropertyIntOrFail(String) - Method in interface org.apache.any23.configuration.Configuration
Returns the Integer value of the specified propertyName or raises an exception if propertyName is not defined.
getPropertyIntOrFail(String) - Method in class org.apache.any23.configuration.DefaultConfiguration
 
getPropertyOrFail(String) - Method in interface org.apache.any23.configuration.Configuration
Returns the value of the specified propertyName or raises an exception if propertyName is not defined.
getPropertyOrFail(String) - Method in class org.apache.any23.configuration.DefaultConfiguration
 
getPropertyPaths() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
getPropertyPaths() - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
Returns all the collected property paths.
getPropertyValue(Node) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
Reads the value of an itemprop node.
getQuality() - Method in class org.apache.any23.mime.MIMEType
 
getQuality() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
 
getRDFFormat() - Method in class org.apache.any23.io.nquads.NQuadsParser
 
getRDFFormat() - Method in class org.apache.any23.io.nquads.NQuadsParserFactory
 
getRDFFormat() - Method in class org.apache.any23.io.nquads.NQuadsWriter
 
getRDFFormat() - Method in class org.apache.any23.io.nquads.NQuadsWriterFactory
 
getRdfFormat() - Method in class org.apache.any23.writer.JSONWriterFactory
 
getRdfFormat() - Method in class org.apache.any23.writer.NQuadsWriterFactory
 
getRdfFormat() - Method in class org.apache.any23.writer.NTriplesWriterFactory
 
getRdfFormat() - Method in class org.apache.any23.writer.RDFXMLWriterFactory
 
getRdfFormat() - Method in class org.apache.any23.writer.TriXWriterFactory
 
getRdfFormat() - Method in class org.apache.any23.writer.TurtleWriterFactory
 
getRdfFormat() - Method in class org.apache.any23.writer.URIListWriterFactory
 
getRdfFormat() - Method in interface org.apache.any23.writer.WriterFactory
 
getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.JSONWriterFactory
 
getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.NQuadsWriterFactory
 
getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.NTriplesWriterFactory
 
getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.RDFXMLWriterFactory
 
getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.TriXWriterFactory
 
getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.TurtleWriterFactory
 
getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.URIListWriterFactory
 
getRdfWriter(OutputStream) - Method in interface org.apache.any23.writer.WriterFactory
 
getRDFXMLParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured org.openrdf.rio.rdfxml.RDFXMLParser.
getRefs() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
getReport() - Method in class org.apache.any23.extractor.html.DocumentReport
 
getReport() - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
getReport() - Method in interface org.apache.any23.validator.ValidationReportBuilder
 
getResourceRoots() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
getResourceRoots() - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
Returns all the collected resource roots.
getRoot() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
 
getRow() - Method in class org.apache.any23.extractor.IssueReport.Issue
 
getRuleActivations() - Method in class org.apache.any23.validator.DefaultValidationReport
 
getRuleActivations() - Method in class org.apache.any23.validator.EmptyValidationReport
 
getRuleActivations() - Method in interface org.apache.any23.validator.ValidationReport
Returns the list of activated rules.
getRuleStr() - Method in class org.apache.any23.validator.ValidationReport.RuleActivation
 
getSeed() - Method in class org.apache.any23.plugin.crawler.SharedData
 
getSingularTextField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns a singular text field.
getSingularUrlField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns the URL associated to the field marked with class className.
getSoTimeout() - Method in class org.apache.any23.http.DefaultHTTPClient
 
getSubject() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
getSubject() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
 
getSubtype() - Method in class org.apache.any23.mime.MIMEType
 
getSubtype() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
 
getSupportedMIMETypes() - Method in interface org.apache.any23.extractor.ExtractorFactory
Supports wildcards, e.g.
getSupportedMIMETypes() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
 
getText() - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns the text contained inside a node if leaf, null otherwise.
getTextExtractors() - Method in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
 
getTools() - Method in class org.apache.any23.plugin.Any23PluginManager
Returns the list of all the Tool classes declared within the classpath.
getTopLevelItemScopeNodes(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns only the itemScopes that are top level items.
getTotalDocuments() - Method in class org.apache.any23.writer.ReportingTripleHandler
 
getTotalTriples() - Method in class org.apache.any23.writer.ReportingTripleHandler
 
getTriXParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured TriXParser.
getTurtleParserInstance(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured TurtleParser.
getType() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getType() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
getType() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
 
getUniqueID() - Method in class org.apache.any23.extractor.ExtractionContext
 
getUriRegex() - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
getUserAgent() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
 
getUserAgent() - Method in interface org.apache.any23.http.HTTPClientConfiguration
 
getValidatedDOM(boolean) - Method in class org.apache.any23.extractor.html.TagSoupParser
Returns the validated DOM and applies fixes on it if applyFix is set to true.
getValidationReport() - Method in class org.apache.any23.ExtractionReport
 
getValidationReport() - Method in class org.apache.any23.extractor.SingleDocumentExtractionReport
 
getValue() - Method in class org.apache.any23.extractor.microdata.ItemProp
 
getValue(Map<String, String>) - Method in class org.apache.any23.extractor.xpath.Term
Returns the value represented by this Term given the varMapping, the #isVar and #internalValue parameters.
getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplateGraph
 
getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplateObject
 
getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplatePredicate
 
getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplateSubject
 
getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.Term
 
getWebCrawler() - Method in class org.apache.any23.plugin.crawler.SiteCrawler
 
getWriter(OutputStream) - Method in class org.apache.any23.io.nquads.NQuadsWriterFactory
 
getWriter(Writer) - Method in class org.apache.any23.io.nquads.NQuadsWriterFactory
 
getWriter(RDFFormat, Writer) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a new RDFWriter instance.
getWriter(RDFFormat, OutputStream) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a new RDFWriter instance.
getWriterByIdentifier(String) - Method in class org.apache.any23.writer.WriterFactoryRegistry
Returns the FormatWriter identified by id.
getWriterInstanceByIdentifier(String, OutputStream) - Method in class org.apache.any23.writer.WriterFactoryRegistry
Returns an instance of FormatWriter ready to write on the given os OutputStream.
getWriters() - Method in class org.apache.any23.writer.WriterFactoryRegistry
 
getWritersByMimeType(String) - Method in class org.apache.any23.writer.WriterFactoryRegistry
Returns all the writers matching the specified mimeType.
getXpath() - Method in class org.apache.any23.extractor.microdata.Item
 
getxPath() - Method in class org.apache.any23.extractor.xpath.Variable
 
getXPathForNode(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Walks the DOM tree in reverse to generate a unique XPath expression leading to this node.
getXPathListForNode(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Returns a list of tag names representing the path from the document root to the given node n.
getXSDDate(String, String) - Static method in class org.apache.any23.rdf.RDFUtils
Obtains an XML Schema compliant date from a textual representation of a date and a textual pattern for parsing it.
getXSLT() - Static method in class org.apache.any23.extractor.rdfa.RDFaExtractor
Returns an XSLTStylesheet able to distill RDFa from HTML pages.
GIVEN_NAME - Static variable in class org.apache.any23.extractor.html.HCardName
 
given_name - Variable in class org.apache.any23.vocab.VCARD
A given name part of a person's name.
givenname - Variable in class org.apache.any23.vocab.FOAF
 
Group - Variable in class org.apache.any23.vocab.FOAF
 
guessEncoding(InputStream) - Method in interface org.apache.any23.encoding.EncodingDetector
Guesses the data encoding.
guessEncoding(InputStream) - Method in class org.apache.any23.encoding.TikaEncodingDetector
 
guessMIMEType(String, InputStream, MIMEType) - Method in interface org.apache.any23.mime.MIMETypeDetector
Estimates the MIME type of the content of the input file.
guessMIMEType(String, InputStream, MIMEType) - Method in class org.apache.any23.mime.NaiveMIMETypeDetector
 
guessMIMEType(String, InputStream, MIMEType) - Method in class org.apache.any23.mime.TikaMIMETypeDetector
Estimates the MIME type of the content of the input file.

H

handleComment(String) - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
 
handleComment(String) - Method in class org.apache.any23.io.nquads.NQuadsWriter
 
handleNamespace(String, String) - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
 
handleNamespace(String, String) - Method in class org.apache.any23.io.nquads.NQuadsWriter
 
handleStatement(Statement) - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
 
handleStatement(Statement) - Method in class org.apache.any23.io.nquads.NQuadsWriter
 
hasAnyField() - Method in class org.apache.any23.extractor.html.HCardName
 
hasAttribute(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Checks the presence of an attribute value in attributes that contain whitespace-separated lists of values.
hasAttribute(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Checks the presence of an attribute in the given node.
hasClassName(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Tells whether an element has a class name, without checking its parents in the hierarchy, mimicking the CSS .foo match.
hasComment - Variable in class org.apache.any23.vocab.REVIEW
Used to associate a review with a comment on the review.
hasFeedback - Variable in class org.apache.any23.vocab.REVIEW
Associates a review with a feedback on the review.
hasField(String) - Method in class org.apache.any23.extractor.html.HCardName
 
hashCode() - Method in class org.apache.any23.extractor.ExtractionContext
 
hashCode() - Method in class org.apache.any23.extractor.ExtractionParameters
 
hashCode() - Method in class org.apache.any23.extractor.microdata.Item
 
hashCode() - Method in class org.apache.any23.extractor.microdata.ItemProp
 
hashCode() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
hashCode() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
hasIdentifier(String) - Method in class org.apache.any23.writer.WriterFactoryRegistry
Verifies if a FormatWriter with given id identifier has been registered.
hasIssues() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
hasMatchingExtractors() - Method in class org.apache.any23.ExtractionReport
 
hasMatchingExtractors() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Checks whether the given DocumentSource content activates at least one extractor.
hasNamespaceURI(String) - Method in class org.apache.any23.rdf.Prefixes
 
hasPrefix(String) - Method in class org.apache.any23.rdf.Prefixes
 
hasReview - Variable in class org.apache.any23.vocab.REVIEW
Associates a work with a review.
HCalendarExtractor - Class in org.apache.any23.extractor.html
Extractor for the hCalendar microformat.
HCalendarExtractor() - Constructor for class org.apache.any23.extractor.html.HCalendarExtractor
 
HCardExtractor - Class in org.apache.any23.extractor.html
Extractor for the hCard microformat.
HCardExtractor() - Constructor for class org.apache.any23.extractor.html.HCardExtractor
 
HCardName - Class in org.apache.any23.extractor.html
An HCard name, consisting of various parts.
HCardName() - Constructor for class org.apache.any23.extractor.html.HCardName
 
HEAD_TAG - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
HeadLinkExtractor - Class in org.apache.any23.extractor.html
This Extractor.TagSoupDOMExtractor implementation retrieves the LINKs declared within the HTML/HEAD page header.
HeadLinkExtractor() - Constructor for class org.apache.any23.extractor.html.HeadLinkExtractor
 
HLISTING - Class in org.apache.any23.vocab
Class modeling the hListing vocabulary.
HListingExtractor - Class in org.apache.any23.extractor.html
Extractor for the hListing microformat.
HListingExtractor() - Constructor for class org.apache.any23.extractor.html.HListingExtractor
 
holdsAccount - Variable in class org.apache.any23.vocab.FOAF
 
homeAdr - Variable in class org.apache.any23.vocab.VCARD
A home address of a person.
homepage - Variable in class org.apache.any23.vocab.FOAF
 
homeTel - Variable in class org.apache.any23.vocab.VCARD
A home phone number of a person.
HONORIFIC_PREFIX - Static variable in class org.apache.any23.extractor.html.HCardName
 
honorific_prefix - Variable in class org.apache.any23.vocab.VCARD
An honorific prefix part of a person's name.
HONORIFIC_SUFFIX - Static variable in class org.apache.any23.extractor.html.HCardName
 
honorific_suffix - Variable in class org.apache.any23.vocab.VCARD
An honorific suffix part of a person's name.
HRECIPE - Class in org.apache.any23.vocab
Vocabulary to map the hRecipe microformat.
HRecipeExtractor - Class in org.apache.any23.extractor.html
Extractor for the hRecipe microformat.
HRecipeExtractor() - Constructor for class org.apache.any23.extractor.html.HRecipeExtractor
 
HREF_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
HREF_TAGS - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
List of tags providing the href property.
HResumeExtractor - Class in org.apache.any23.extractor.html
Extractor for the hResume microformat.
HResumeExtractor() - Constructor for class org.apache.any23.extractor.html.HResumeExtractor
 
HReviewExtractor - Class in org.apache.any23.extractor.html
Extractor for the hReview microformat.
HReviewExtractor() - Constructor for class org.apache.any23.extractor.html.HReviewExtractor
 
HTMLDocument - Class in org.apache.any23.extractor.html
A wrapper around the DOM representation of an HTML document.
HTMLDocument(Node) - Constructor for class org.apache.any23.extractor.html.HTMLDocument
Constructor accepting the root node.
HTMLDocument.TextField - Class in org.apache.any23.extractor.html
This class represents a text extracted from the HTML DOM, together with the node from which that text was retrieved.
HTMLDocument.TextField(String, Node) - Constructor for class org.apache.any23.extractor.html.HTMLDocument.TextField
 
HTMLMetaExtractor - Class in org.apache.any23.extractor.html
This extractor represents the HTML META tag values according to the HTML4 specification.
HTMLMetaExtractor() - Constructor for class org.apache.any23.extractor.html.HTMLMetaExtractor
 
HTMLScraperExtractor - Class in org.apache.any23.plugin.htmlscraper
Implementation of content extractor for performing HTML scraping.
HTMLScraperExtractor() - Constructor for class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
 
HTMLScraperPlugin - Class in org.apache.any23.plugin.htmlscraper
Implementation of ExtractorPlugin based on the BoilerPipe Library.
HTMLScraperPlugin() - Constructor for class org.apache.any23.plugin.htmlscraper.HTMLScraperPlugin
 
HTTPClient - Interface in org.apache.any23.http
Abstraction for opening an InputStream on an HTTP URI.
HTTPClientConfiguration - Interface in org.apache.any23.http
Defines the HTTPClient configuration.
HTTPDocumentSource - Class in org.apache.any23.source
HTTP implementation of DocumentSource.
HTTPDocumentSource(HTTPClient, String) - Constructor for class org.apache.any23.source.HTTPDocumentSource
 

I

ICAL - Class in org.apache.any23.vocab
Vocabulary definitions from ical.rdf
ICBMExtractor - Class in org.apache.any23.extractor.html
Extractor for "ICBM coordinates" provided as META headers in the head of an HTML page.
ICBMExtractor() - Constructor for class org.apache.any23.extractor.html.ICBMExtractor
 
icqChatID - Variable in class org.apache.any23.vocab.FOAF
 
IDENTIFIER - Static variable in class org.apache.any23.writer.JSONWriterFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.NQuadsWriterFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.NTriplesWriterFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.RDFXMLWriterFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.TriXWriterFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.TurtleWriterFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.URIListWriterFactory
 
IgnoreAccidentalRDFa - Class in org.apache.any23.filter
A TripleHandler that suppresses output of the RDFa parser if the document only contains "accidental" RDFa, like stylesheet links and other non-RDFa uses of HTML's
IgnoreAccidentalRDFa(TripleHandler, boolean) - Constructor for class org.apache.any23.filter.IgnoreAccidentalRDFa
Constructor.
IgnoreAccidentalRDFa(TripleHandler) - Constructor for class org.apache.any23.filter.IgnoreAccidentalRDFa
 
IgnoreTitlesOfEmptyDocuments - Class in org.apache.any23.filter
A TripleHandler that suppresses output of the TitleExtractor unless some other triples could be parsed from the document.
IgnoreTitlesOfEmptyDocuments(TripleHandler) - Constructor for class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
Image - Variable in class org.apache.any23.vocab.FOAF
 
IMAGE - Static variable in class org.apache.any23.vocab.OGP
The canonical URL of your object that will be used as its permanent ID in the graph (Facebook Graph API).
image - Variable in class org.apache.any23.vocab.OGP
 
IMAGE__HEIGHT - Static variable in class org.apache.any23.vocab.OGP
The number of pixels high.
IMAGE__SECURE_URL - Static variable in class org.apache.any23.vocab.OGP
An alternate url to use if the webpage requires HTTPS.
IMAGE__TYPE - Static variable in class org.apache.any23.vocab.OGP
A MIME type for this image.
IMAGE__URL - Static variable in class org.apache.any23.vocab.OGP
Identical to og:image.
IMAGE__WIDTH - Static variable in class org.apache.any23.vocab.OGP
The number of pixels wide.
imageHeight - Variable in class org.apache.any23.vocab.OGP
 
imageSecureURL - Variable in class org.apache.any23.vocab.OGP
 
imageType - Variable in class org.apache.any23.vocab.OGP
 
imageURL - Variable in class org.apache.any23.vocab.OGP
 
imageWidth - Variable in class org.apache.any23.vocab.OGP
 
img - Variable in class org.apache.any23.vocab.FOAF
 
Includes - Annotation Type in org.apache.any23.extractor.html.annotations
This annotation can be used to decorate a MicroformatExtractor to declare which other Microformats it can nest.
includes(Class<? extends MicroformatExtractor>, Class<? extends MicroformatExtractor>) - Static method in class org.apache.any23.extractor.html.MicroformatExtractor
This method checks whether there is a native nesting relationship between two MicroformatExtractor classes.
Ingredient - Variable in class org.apache.any23.vocab.HRECIPE
 
ingredient - Variable in class org.apache.any23.vocab.HRECIPE
 
ingredientName - Variable in class org.apache.any23.vocab.HRECIPE
 
ingredientQuantity - Variable in class org.apache.any23.vocab.HRECIPE
 
ingredientQuantityType - Variable in class org.apache.any23.vocab.HRECIPE
 
init(HTTPClientConfiguration) - Method in class org.apache.any23.http.DefaultHTTPClient
 
init(HTTPClientConfiguration) - Method in interface org.apache.any23.http.HTTPClient
Initializes the HTTP client.
INLINE_DOCUMENT_PREFIX - Static variable in class org.apache.any23.cli.MimeDetector
 
inputURIs - Variable in class org.apache.any23.cli.Rover
 
instructions - Variable in class org.apache.any23.vocab.HRECIPE
 
interest - Variable in class org.apache.any23.vocab.FOAF
 
interval - Variable in class org.apache.any23.vocab.ICAL
 
isAbsoluteURI(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
isAbsoluteURI(String) - Static method in class org.apache.any23.rdf.RDFUtils
Checks if href is absolute or not.
isAncestorOf(Node, Node, boolean) - Static method in class org.apache.any23.extractor.html.DomUtils
Checks whether a node is an ancestor of, or the same as, another node.
isAncestorOf(Node, Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Checks whether a node is an ancestor of, or the same as, another node.
isAnnotated() - Method in interface org.apache.any23.writer.FormatWriter
If true then the produced RDF is annotated with the extractors used to generate the specific statements.
isAnnotated() - Method in class org.apache.any23.writer.JSONWriter
 
isAnnotated() - Method in class org.apache.any23.writer.RDFWriterTripleHandler
If true then the produced RDF is annotated with the extractors used to generate the specific statements.
isAnnotated() - Method in class org.apache.any23.writer.URIListWriter
 
isAnyMajorType() - Method in class org.apache.any23.mime.MIMEType
 
isAnySubtype() - Method in class org.apache.any23.mime.MIMEType
 
isco88_code - Variable in class org.apache.any23.vocab.DOAC
 
isCSV(InputStream) - Static method in class org.apache.any23.extractor.csv.CSVReaderBuilder
Checks whether the given input stream is a CSV or not.
isCURIE(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
isCURIEBNode(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
isDate() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isDefault() - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator.VariantSpec
 
isDocBlocked() - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
isElementNode(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Verifies if the given target node is an element.
isEmpty() - Method in class org.apache.any23.extractor.ExtractorGroup
 
isEmpty() - Method in class org.apache.any23.rdf.Prefixes
 
isExtendedXFNLocalName(String) - Method in class org.apache.any23.vocab.XFN
 
isFix() - Method in class org.apache.any23.extractor.ExtractionParameters
 
isFloat() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isInteger() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isItemProp(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Check whether a node is an itemProp.
isItemScope(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Check whether a node is an itemScope.
isLink() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isLocal() - Method in class org.apache.any23.source.ByteArrayDocumentSource
 
isLocal() - Method in interface org.apache.any23.source.DocumentSource
A value of false indicates that the document resides remotely, and that multiple successive accesses to it should be avoided by copying it to local storage.
isLocal() - Method in class org.apache.any23.source.FileDocumentSource
 
isLocal() - Method in class org.apache.any23.source.HTTPDocumentSource
 
isLocal() - Method in class org.apache.any23.source.StringDocumentSource
 
isMultiField(String) - Method in class org.apache.any23.extractor.html.HCardName
 
isNested() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isNumber() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isOnline(String) - Static method in class org.apache.any23.util.URLUtils
Verifies if the specified URL is reachable online.
isPlain() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isPrefix(String, String) - Static method in class org.apache.any23.util.StringUtils
Checks whether string candidatePrefix is a prefix of string container.
isPrimaryTopicOf - Variable in class org.apache.any23.vocab.FOAF
 
isRegisteredName(String) - Method in interface org.apache.any23.extractor.ExtractorRegistry
 
isRegisteredName(String) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
 
isRelativeNode(Node) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
isStopAtFirstError() - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
isStopAtFirstError() - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
 
isStopAtFirstError() - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
 
isStopAtFirstError() - Method in class org.apache.any23.plugin.officescraper.ExcelExtractor
 
IssueReport - Interface in org.apache.any23.extractor
This interface models an issue reporter.
IssueReport.Issue - Class in org.apache.any23.extractor
This class defines a generic issue traced by this extraction result.
IssueReport.IssueLevel - Enum in org.apache.any23.extractor
Possible issue levels.
isSuffix(String, String) - Static method in class org.apache.any23.util.StringUtils
Checks whether string candidateSuffix is a suffix of string container.
isValidate() - Method in class org.apache.any23.extractor.ExtractionParameters
 
isVar() - Method in class org.apache.any23.extractor.xpath.Term
 
isVerifyDataType() - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
isVerifyDataType() - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
 
isVerifyDataType() - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
 
isVolatile(String) - Method in class org.apache.any23.rdf.Prefixes
 
isWildcardSubtype() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
 
isWildcardType() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
 
isXFNLocalName(String) - Method in class org.apache.any23.vocab.XFN
 
isXMLNSDeclared(Document) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
Item - Class in org.apache.any23.extractor.microdata
Base class for Microdata items.
Item(String) - Constructor for class org.apache.any23.extractor.microdata.Item
Constructor.
Item - Variable in class org.apache.any23.vocab.HLISTING
 
item - Variable in class org.apache.any23.vocab.HLISTING
 
itemName - Variable in class org.apache.any23.vocab.HLISTING
 
itemPhoto - Variable in class org.apache.any23.vocab.HLISTING
 
ItemProp - Class in org.apache.any23.extractor.microdata
Describes a Microdata item property.
ItemProp(String, String, ItemPropValue) - Constructor for class org.apache.any23.extractor.microdata.ItemProp
Constructor.
ITEMPROP_ATTRIBUTE - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
 
ItemPropValue - Class in org.apache.any23.extractor.microdata
Describes a possible value for a Microdata item property.
ItemPropValue(Object, ItemPropValue.Type) - Constructor for class org.apache.any23.extractor.microdata.ItemPropValue
Constructor.
ItemPropValue.Type - Enum in org.apache.any23.extractor.microdata
Supported types.
ItemScope - Class in org.apache.any23.extractor.microdata
This class describes a Microdata itemscope.
ItemScope(String, ItemProp[], String, String[], String, String) - Constructor for class org.apache.any23.extractor.microdata.ItemScope
Constructor.
ITEMSCOPE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
 
itemUrl - Variable in class org.apache.any23.vocab.HLISTING
 
iterator() - Method in class org.apache.any23.extractor.ExtractorGroup
 

J

jabberID - Variable in class org.apache.any23.vocab.FOAF
 
join(String, String...) - Static method in class org.apache.any23.util.StringUtils
Joins the given input string data list using the specified delimiter.
JSONWriter - Class in org.apache.any23.writer
Implementation of JSON format writer.
JSONWriter(OutputStream) - Constructor for class org.apache.any23.writer.JSONWriter
 
JSONWriterFactory - Class in org.apache.any23.writer
 
JSONWriterFactory() - Constructor for class org.apache.any23.writer.JSONWriterFactory
 

K

key - Variable in class org.apache.any23.vocab.VCARD
A key (e.g., PKI key) of a person.
kin - Variable in class org.apache.any23.vocab.XFN
 
kingdom - Variable in class org.apache.any23.vocab.WO
 
kingdomClass - Variable in class org.apache.any23.vocab.WO
 
kingdomName - Variable in class org.apache.any23.vocab.WO
 
knows - Variable in class org.apache.any23.vocab.FOAF
 

L

label - Variable in class org.apache.any23.vocab.VCARD
The formatted version of a postal address (a string with embedded line breaks, punctuation, etc.).
language - Variable in class org.apache.any23.vocab.DOAC
 
language - Variable in class org.apache.any23.vocab.ICAL
Specifies the language for text values in a URI or URI parameter.
LanguageLevel - Variable in class org.apache.any23.vocab.DOAC
 
LanguageSkill - Variable in class org.apache.any23.vocab.DOAC
 
LAST_CELL - Static variable in class org.apache.any23.vocab.Excel
 
LAST_ROW - Static variable in class org.apache.any23.vocab.Excel
 
lastCell - Variable in class org.apache.any23.vocab.Excel
This property links the identifier of a Row to the index of the last declared cell.
lastModified - Variable in class org.apache.any23.vocab.ICAL
value type: DATE-TIME. The URI specifies the date and time that the information associated with the calendar component was last revised in the calendar store.
lastRow - Variable in class org.apache.any23.vocab.Excel
This property links the identifier of a Sheet to the index of the last declared row.
lat - Variable in class org.apache.any23.vocab.GEO
 
latitude - Variable in class org.apache.any23.vocab.VCARD
The latitude of a geographic location.
license - Variable in class org.apache.any23.vocab.DCTERMS
 
license - Variable in class org.apache.any23.vocab.XHTML
 
LicenseExtractor - Class in org.apache.any23.extractor.html
Extractor for the rel-license microformat.
LicenseExtractor() - Constructor for class org.apache.any23.extractor.html.LicenseExtractor
 
List_of_Float - Variable in class org.apache.any23.vocab.ICAL
 
Lister - Variable in class org.apache.any23.vocab.HLISTING
 
lister - Variable in class org.apache.any23.vocab.HLISTING
 
listerLogo - Variable in class org.apache.any23.vocab.HLISTING
 
listerName - Variable in class org.apache.any23.vocab.HLISTING
 
listerOrg - Variable in class org.apache.any23.vocab.HLISTING
 
listerUrl - Variable in class org.apache.any23.vocab.HLISTING
 
listFilesRecursively(File, FilenameFilter) - Static method in class org.apache.any23.util.FileUtils
Lists the content of a dir applying the specified filter.
Listing - Variable in class org.apache.any23.vocab.HLISTING
 
literal(String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(boolean) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(byte) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(short) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(int) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(long) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(float) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(double) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(String, String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(String, URI) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
loadClassDir(File) - Method in class org.apache.any23.plugin.Any23PluginManager
Loads a classes directory into the classpath.
loadClassDirs(File...) - Method in class org.apache.any23.plugin.Any23PluginManager
Loads a list of class directories into the classpath.
loadFiles(File...) - Method in class org.apache.any23.plugin.Any23PluginManager
Loads a generic list of files, trying to determine the type of every file.
loadJAR(File) - Method in class org.apache.any23.plugin.Any23PluginManager
Loads a JAR file into the classpath.
loadJARDir(File) - Method in class org.apache.any23.plugin.Any23PluginManager
Loads all the JARs detected in a given directory.
loadJARs(File...) - Method in class org.apache.any23.plugin.Any23PluginManager
Loads a list of JARs into the classpath.
loadPlugins(File...) - Method in class org.apache.any23.plugin.Any23PluginManager
Loads plugins from a list of specified locations.
LocalCopyFactory - Interface in org.apache.any23.source
A factory that creates local copies of DocumentSources.
LOCALE - Static variable in class org.apache.any23.vocab.OGP
The locale these tags are marked up in.
locale - Variable in class org.apache.any23.vocab.OGP
 
LOCALE__ALTERNATE - Static variable in class org.apache.any23.vocab.OGP
An array of other locales this page is available in.
localeAlternate - Variable in class org.apache.any23.vocab.OGP
 
locality - Variable in class org.apache.any23.vocab.HLISTING
 
locality - Variable in class org.apache.any23.vocab.VCARD
The locality (e.g., city) of a postal address.
location - Variable in class org.apache.any23.vocab.ICAL
value type: TEXT. The URI defines the intended venue for the activity defined by a calendar component.
Location - Variable in class org.apache.any23.vocab.VCARD
Resources that are vCard geographic locations.
logger - Static variable in class org.apache.any23.Any23
 
logger - Static variable in class org.apache.any23.configuration.DefaultConfiguration
 
LoggingTripleHandler - Class in org.apache.any23.writer
Triple handler decorator useful for logging purposes.
LoggingTripleHandler(TripleHandler, PrintWriter) - Constructor for class org.apache.any23.writer.LoggingTripleHandler
 
logo - Variable in class org.apache.any23.vocab.FOAF
 
logo - Variable in class org.apache.any23.vocab.VCARD
A logo associated with a person or their organization.
LogUtils - Class in org.apache.any23.util
Provides specific application logger configuration methods.
LogUtils() - Constructor for class org.apache.any23.util.LogUtils
 
lon - Variable in class org.apache.any23.vocab.GEO
 
longitude - Variable in class org.apache.any23.vocab.VCARD
The longitude of a geographic location.

M

made - Variable in class org.apache.any23.vocab.FOAF
 
mailer - Variable in class org.apache.any23.vocab.VCARD
A mailer associated with a vCard.
main(String[]) - Static method in class org.apache.any23.cli.ToolRunner
 
makeDefault() - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator.VariantSpec
 
maker - Variable in class org.apache.any23.vocab.FOAF
 
mark(int) - Method in class org.apache.any23.util.ReaderInputStream
Marks the read limit of the StringReader.
markSupported() - Method in class org.apache.any23.util.ReaderInputStream
 
MathUtils - Class in org.apache.any23.util
General utility functions.
maxRating - Variable in class org.apache.any23.vocab.REVIEW
A numeric value.
mbox - Variable in class org.apache.any23.vocab.FOAF
 
mbox_sha1sum - Variable in class org.apache.any23.vocab.FOAF
 
md5(String) - Static method in class org.apache.any23.util.MathUtils
 
me - Variable in class org.apache.any23.vocab.XFN
 
MediaRangeSpec - Class in org.apache.any23.servlet.conneg
This class implements the HTTP header media-range specification.
meet - Variable in class org.apache.any23.vocab.HLISTING
 
member - Variable in class org.apache.any23.vocab.FOAF
 
member - Variable in class org.apache.any23.vocab.ICAL
To specify the group or list membership of the calendar user specified by the URI.
membershipClass - Variable in class org.apache.any23.vocab.FOAF
 
MemCopyFactory - Class in org.apache.any23.source
Creates local copies of DocumentSource by reading them into an in-memory buffer.
MemCopyFactory() - Constructor for class org.apache.any23.source.MemCopyFactory
 
mePage - Variable in class org.apache.any23.vocab.XFN
 
met - Variable in class org.apache.any23.vocab.XFN
 
meta - Variable in class org.apache.any23.vocab.XHTML
 
METADATA_DOMAIN_PER_ENTITY_FLAG - Static variable in class org.apache.any23.extractor.ExtractionParameters
 
METADATA_NESTING_FLAG - Static variable in class org.apache.any23.extractor.ExtractionParameters
 
METADATA_TIMESIZE_FLAG - Static variable in class org.apache.any23.extractor.ExtractionParameters
 
MetaNameMisuseFix - Class in org.apache.any23.validator.rule
Fixes the misuse of the meta name attribute.
MetaNameMisuseFix() - Constructor for class org.apache.any23.validator.rule.MetaNameMisuseFix
 
MetaNameMisuseRule - Class in org.apache.any23.validator.rule
Checks whether the meta attribute name is used to contain a property.
MetaNameMisuseRule() - Constructor for class org.apache.any23.validator.rule.MetaNameMisuseRule
 
method - Variable in class org.apache.any23.vocab.ICAL
value type: TEXT. This URI defines the iCalendar object method associated with the calendar object.
MicrodataExtractor - Class in org.apache.any23.extractor.microdata
Default implementation of Microdata extractor, based on Extractor.TagSoupDOMExtractor.
MicrodataExtractor() - Constructor for class org.apache.any23.extractor.microdata.MicrodataExtractor
 
MicrodataParser - Class in org.apache.any23.cli
Command-line Microdata parser, accepting both files and URLs and returning a JSON representation of the extracted metadata as described in the Microdata JSON Specification.
MicrodataParser() - Constructor for class org.apache.any23.cli.MicrodataParser
 
MicrodataParser - Class in org.apache.any23.extractor.microdata
This class provides utility methods for handling Microdata nodes contained within a DOM document.
MicrodataParser(Document) - Constructor for class org.apache.any23.extractor.microdata.MicrodataParser
 
MicrodataParser.MicrodataParserDocumentSourceConverter - Class in org.apache.any23.cli
 
MicrodataParser.MicrodataParserDocumentSourceConverter() - Constructor for class org.apache.any23.cli.MicrodataParser.MicrodataParserDocumentSourceConverter
 
MicrodataParserException - Exception in org.apache.any23.extractor.microdata
Defines an exception occurring while parsing Microdata.
MicrodataParserException(String, Node) - Constructor for exception org.apache.any23.extractor.microdata.MicrodataParserException
 
MicrodataParserException(String, Throwable, Node) - Constructor for exception org.apache.any23.extractor.microdata.MicrodataParserException
 
MicrodataParserReport - Class in org.apache.any23.extractor.microdata
This class describes the report of the MicrodataParser.
MicrodataParserReport(ItemScope[], MicrodataParserException[]) - Constructor for class org.apache.any23.extractor.microdata.MicrodataParserReport
 
MicrodataParserReport(ItemScope[]) - Constructor for class org.apache.any23.extractor.microdata.MicrodataParserReport
 
MicroformatExtractor - Class in org.apache.any23.extractor.html
The abstract base class for any Microformat specification extractor.
MicroformatExtractor() - Constructor for class org.apache.any23.extractor.html.MicroformatExtractor
 
MIME_TYPE - Static variable in class org.apache.any23.writer.JSONWriterFactory
 
MIME_TYPE - Static variable in class org.apache.any23.writer.NQuadsWriterFactory
 
MIME_TYPE - Static variable in class org.apache.any23.writer.NTriplesWriterFactory
 
MIME_TYPE - Static variable in class org.apache.any23.writer.RDFXMLWriterFactory
 
MIME_TYPE - Static variable in class org.apache.any23.writer.TriXWriterFactory
 
MIME_TYPE - Static variable in class org.apache.any23.writer.TurtleWriterFactory
 
MIME_TYPE - Static variable in class org.apache.any23.writer.URIListWriterFactory
 
MimeDetector - Class in org.apache.any23.cli
Commandline tool to detect MIME Types from file, HTTP and direct input sources.
MimeDetector() - Constructor for class org.apache.any23.cli.MimeDetector
 
MimeDetector.MimeDetectorDocumentSourceConverter - Class in org.apache.any23.cli
 
MimeDetector.MimeDetectorDocumentSourceConverter() - Constructor for class org.apache.any23.cli.MimeDetector.MimeDetectorDocumentSourceConverter
 
MIMEType - Class in org.apache.any23.mime
A MIME type with an optional q (quality) value.
MIMETypeDetector - Interface in org.apache.any23.mime
This detector is able to estimate the MIME type of some given raw data.
minRating - Variable in class org.apache.any23.vocab.REVIEW
A numeric value.
MissingOpenGraphNamespaceRule - Class in org.apache.any23.validator.rule
This rule detects the issue of missing Open Graph namespace.
MissingOpenGraphNamespaceRule() - Constructor for class org.apache.any23.validator.rule.MissingOpenGraphNamespaceRule
 
mobileEmail - Variable in class org.apache.any23.vocab.VCARD
A mobile email address of a person.
mobileTel - Variable in class org.apache.any23.vocab.VCARD
A mobile phone number of a person.
ModifiableConfiguration - Interface in org.apache.any23.configuration
Modifiable implementation of Configuration.
msnChatID - Variable in class org.apache.any23.vocab.FOAF
 
multiply(char, int) - Static method in class org.apache.any23.util.StringUtils
Builds a string composed of the given char c repeated n times.
muse - Variable in class org.apache.any23.vocab.XFN
 
mv(File, File) - Static method in class org.apache.any23.util.FileUtils
Moves a target file to a new destination.
myersBriggs - Variable in class org.apache.any23.vocab.FOAF
 

N

n - Variable in class org.apache.any23.vocab.VCARD
The components of the name of a person.
NaiveMIMETypeDetector - Class in org.apache.any23.mime
Basic implementation of MIMETypeDetector based on file extensions.
NaiveMIMETypeDetector() - Constructor for class org.apache.any23.mime.NaiveMIMETypeDetector
 
NAME - Static variable in class org.apache.any23.extractor.html.HTMLMetaExtractor
 
NAME - Static variable in class org.apache.any23.extractor.html.TitleExtractor
 
NAME - Static variable in class org.apache.any23.extractor.html.TurtleHTMLExtractor
 
NAME - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Extractor
 
NAME - Static variable in class org.apache.any23.extractor.rdfa.RDFaExtractor
 
NAME - Static variable in class org.apache.any23.extractor.xpath.XPathExtractor
 
NAME - Static variable in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
 
name - Variable in class org.apache.any23.vocab.FOAF
 
Name - Variable in class org.apache.any23.vocab.VCARD
Resources that are vCard personal names.
NAMESPACE - Variable in class org.apache.any23.vocab.ICAL
The namespace of the vocabulary as a URI.
NAMESPACE - Variable in class org.apache.any23.vocab.OGP
 
NAMESPACE - Variable in class org.apache.any23.vocab.REVIEW
The namespace of the vocabulary as a URI.
NAMESPACE - Variable in class org.apache.any23.vocab.SINDICE
The namespace of the vocabulary as a URI.
NAMESPACE - Variable in class org.apache.any23.vocab.VCARD
The namespace of the vocabulary as a URI.
NAMESPACE - Variable in class org.apache.any23.vocab.WO
The namespace of the vocabulary as a URI.
neighbor - Variable in class org.apache.any23.vocab.XFN
 
NESTING - Static variable in class org.apache.any23.vocab.SINDICE
 
nesting - Variable in class org.apache.any23.vocab.SINDICE
This property links a resource with a blank node that represents a nested Microformat node.
NESTING_ORIGINAL - Static variable in class org.apache.any23.vocab.SINDICE
 
nesting_original - Variable in class org.apache.any23.vocab.SINDICE
This property is used to keep track of the original nested RDF property.
NESTING_STRUCTURED - Static variable in class org.apache.any23.vocab.SINDICE
 
nesting_structured - Variable in class org.apache.any23.vocab.SINDICE
This property links the resource with a node representing the nested Microformat.
newDefault(Configuration) - Static method in class org.apache.any23.extractor.ExtractionParameters
 
newDefault() - Static method in class org.apache.any23.extractor.ExtractionParameters
Creates the default extraction parameters with DefaultConfiguration.
nick - Variable in class org.apache.any23.vocab.FOAF
 
NICKNAME - Static variable in class org.apache.any23.extractor.html.HCardName
 
nickname - Variable in class org.apache.any23.vocab.VCARD
The nickname of a person.
NODES_WITH_INVALID_ABOUT - Static variable in class org.apache.any23.validator.rule.AboutNotURIRule
 
note - Variable in class org.apache.any23.vocab.VCARD
Notes about a person on a vCard.
notifyIssue(IssueReport.IssueLevel, String, int, int) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
notifyIssue(IssueReport.IssueLevel, String, int, int) - Method in interface org.apache.any23.extractor.IssueReport
Notifies an issue that occurred while performing an extraction on an input stream.
notifyPage(Page) - Method in class org.apache.any23.plugin.crawler.SharedData
Notifies all listeners that a page has been discovered.
NQuadsExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor handling N-Quads format.
NQuadsExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.NQuadsExtractor
 
NQuadsExtractor() - Constructor for class org.apache.any23.extractor.rdf.NQuadsExtractor
 
NQuadsParser - Class in org.apache.any23.io.nquads
N-Quads parser implementation based on the RDFParser interface.
NQuadsParser() - Constructor for class org.apache.any23.io.nquads.NQuadsParser
 
NQuadsParserFactory - Class in org.apache.any23.io.nquads
Implementation of RDFParserFactory for NQuads.
NQuadsParserFactory() - Constructor for class org.apache.any23.io.nquads.NQuadsParserFactory
 
NQuadsWriter - Class in org.apache.any23.io.nquads
N-Quads implementation of an RDFWriter.
NQuadsWriter(OutputStream) - Constructor for class org.apache.any23.io.nquads.NQuadsWriter
 
NQuadsWriter(Writer) - Constructor for class org.apache.any23.io.nquads.NQuadsWriter
 
NQuadsWriter - Class in org.apache.any23.writer
Implementation of an NQuads writer.
NQuadsWriter(OutputStream) - Constructor for class org.apache.any23.writer.NQuadsWriter
 
NQuadsWriterFactory - Class in org.apache.any23.io.nquads
Implementation of RDFWriterFactory for NQuads.
NQuadsWriterFactory() - Constructor for class org.apache.any23.io.nquads.NQuadsWriterFactory
 
NQuadsWriterFactory - Class in org.apache.any23.writer
 
NQuadsWriterFactory() - Constructor for class org.apache.any23.writer.NQuadsWriterFactory
 
NS - Static variable in class org.apache.any23.vocab.CSV
The namespace of the vocabulary as a string.
NS - Static variable in class org.apache.any23.vocab.DCTERMS
 
NS - Static variable in class org.apache.any23.vocab.DOAC
 
NS - Static variable in class org.apache.any23.vocab.Excel
The namespace of the vocabulary as a string.
NS - Static variable in class org.apache.any23.vocab.FOAF
 
NS - Static variable in class org.apache.any23.vocab.GEO
 
NS - Static variable in class org.apache.any23.vocab.HLISTING
 
NS - Static variable in class org.apache.any23.vocab.HRECIPE
 
NS - Static variable in class org.apache.any23.vocab.ICAL
The namespace of the vocabulary as a string.
NS - Static variable in class org.apache.any23.vocab.OGP
 
NS - Static variable in class org.apache.any23.vocab.REVIEW
The namespace of the vocabulary as a string.
NS - Static variable in class org.apache.any23.vocab.SCHEMAORG
The namespace of the vocabulary as a string.
NS - Static variable in class org.apache.any23.vocab.SINDICE
The namespace of the vocabulary as a string.
NS - Static variable in class org.apache.any23.vocab.VCARD
The namespace of the vocabulary as a string.
NS - Static variable in class org.apache.any23.vocab.WO
The namespace of the vocabulary as a string.
NS - Static variable in class org.apache.any23.vocab.XFN
 
NS - Static variable in class org.apache.any23.vocab.XHTML
 
NTriplesExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor handling the NTriples format.
NTriplesExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.NTriplesExtractor
 
NTriplesExtractor() - Constructor for class org.apache.any23.extractor.rdf.NTriplesExtractor
Default constructor, with no verification of data types and no stop at first error.
NTriplesWriter - Class in org.apache.any23.writer
N-Triples writer.
NTriplesWriter(OutputStream) - Constructor for class org.apache.any23.writer.NTriplesWriter
 
NTriplesWriterFactory - Class in org.apache.any23.writer
 
NTriplesWriterFactory() - Constructor for class org.apache.any23.writer.NTriplesWriterFactory
 
NULL_CHAR - Static variable in class org.apache.any23.extractor.csv.CSVReaderBuilder
 
NUMBER_OF_COLUMNS - Static variable in class org.apache.any23.vocab.CSV
 
NUMBER_OF_ROWS - Static variable in class org.apache.any23.vocab.CSV
 
numberOfColumns - Variable in class org.apache.any23.vocab.CSV
This property expresses the number of columns in a CSV file.
numberOfRows - Variable in class org.apache.any23.vocab.CSV
This property expresses the number of rows in a CSV file.
Nutrition - Variable in class org.apache.any23.vocab.HRECIPE
 
nutrition - Variable in class org.apache.any23.vocab.HRECIPE
 
nutritionValue - Variable in class org.apache.any23.vocab.HRECIPE
 
nutritionValueType - Variable in class org.apache.any23.vocab.HRECIPE
 

O

offer - Variable in class org.apache.any23.vocab.HLISTING
 
OGP - Class in org.apache.any23.vocab
OnlineAccount - Variable in class org.apache.any23.vocab.FOAF
 
OnlineChatAccount - Variable in class org.apache.any23.vocab.FOAF
 
OnlineEcommerceAccount - Variable in class org.apache.any23.vocab.FOAF
 
OnlineGamingAccount - Variable in class org.apache.any23.vocab.FOAF
 
openContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
openContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
openContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.JSONWriter
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
 
openContext(ExtractionContext) - Method in interface org.apache.any23.writer.TripleHandler
Informs the handler that a new context has been established.
openContext(ExtractionContext) - Method in class org.apache.any23.writer.URIListWriter
 
OPENGRAPH_PROTOCOL_NS - Static variable in class org.apache.any23.validator.rule.OpenGraphNamespaceFix
 
OpenGraphNamespaceFix - Class in org.apache.any23.validator.rule
This fixes the missing Open Graph protocol namespace.
OpenGraphNamespaceFix() - Constructor for class org.apache.any23.validator.rule.OpenGraphNamespaceFix
 
openInputStream(String) - Method in class org.apache.any23.http.DefaultHTTPClient
Opens an InputStream from a given URI.
openInputStream(String) - Method in interface org.apache.any23.http.HTTPClient
Opens the input stream for the given target URI.
openInputStream() - Method in class org.apache.any23.source.ByteArrayDocumentSource
 
openInputStream() - Method in interface org.apache.any23.source.DocumentSource
Returns the input stream for accessing the content of the document.
openInputStream() - Method in class org.apache.any23.source.FileDocumentSource
 
openInputStream() - Method in class org.apache.any23.source.HTTPDocumentSource
 
openInputStream() - Method in class org.apache.any23.source.StringDocumentSource
 
openSubResult(ExtractionContext) - Method in interface org.apache.any23.extractor.ExtractionResult
Opens a result nested in the current one.
openSubResult(ExtractionContext) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
openSubResult(ExtractionContext) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
order - Variable in class org.apache.any23.vocab.WO
 
orderClass - Variable in class org.apache.any23.vocab.WO
 
orderName - Variable in class org.apache.any23.vocab.WO
 
org - Variable in class org.apache.any23.vocab.VCARD
An organization associated with a person.
org.apache.any23 - package org.apache.any23
The Any23 root package defines the Any23 library facade and auxiliary classes.
org.apache.any23.cli - package org.apache.any23.cli
This package contains command-line utilities that allow users to use the main Any23 features from a command-line shell.
org.apache.any23.configuration - package org.apache.any23.configuration
This package contains the Any23 Configuration definition.
org.apache.any23.encoding - package org.apache.any23.encoding
This package defines utility classes for encoding detection.
org.apache.any23.extractor - package org.apache.any23.extractor
This package contains classes and interfaces modeling the Extractor API.
org.apache.any23.extractor.csv - package org.apache.any23.extractor.csv
This package contains the definition of a specific Extractor for CSV files as specified in RFC 4180.
org.apache.any23.extractor.html - package org.apache.any23.extractor.html
This package contains the various Extractors needed to distill RDF from Microformats in HTML pages.
org.apache.any23.extractor.html.annotations - package org.apache.any23.extractor.html.annotations
This package contains the annotations needed to describe the single nesting relations among different Microformats.
org.apache.any23.extractor.microdata - package org.apache.any23.extractor.microdata
This package provides classes for dealing with Microdata extraction.
org.apache.any23.extractor.rdf - package org.apache.any23.extractor.rdf
This package contains the concrete implementations (and utility classes) to perform the extraction from various RDF serialization formats.
org.apache.any23.extractor.rdfa - package org.apache.any23.extractor.rdfa
This package contains the definition of a specific Extractor for RDFa.
org.apache.any23.extractor.xpath - package org.apache.any23.extractor.xpath
This package provides an XPath extractor able to generate RDF statements from any HTML page on the basis of a set of specified rules and a list of statement templates.
org.apache.any23.filter - package org.apache.any23.filter
This package defines specific TripleHandlers able to filter out documents and triples that don't respect particular conditions.
org.apache.any23.http - package org.apache.any23.http
This package contains classes and interfaces to implement a small multi-threaded HTTP Client used to fetch data sources.
org.apache.any23.io.nquads - package org.apache.any23.io.nquads
This package contains an NQuads parser and writer based on the Sesame API.
org.apache.any23.mime - package org.apache.any23.mime
This package provides support for MIME Type data detection.
org.apache.any23.mime.purifier - package org.apache.any23.mime.purifier
This package contains all the logic to implement your own Purifier.
org.apache.any23.plugin - package org.apache.any23.plugin
This package provides Plugin Management support for Any23.
org.apache.any23.plugin.crawler - package org.apache.any23.plugin.crawler
Commandline extension of Rover able to extract semantic content from an entire site.
org.apache.any23.plugin.htmlscraper - package org.apache.any23.plugin.htmlscraper
The HTMLScraperExtractor is a special extractor to scrape textual content from generic HTML pages.
org.apache.any23.plugin.officescraper - package org.apache.any23.plugin.officescraper
This package defines a set of Extractors able to extract metadata from various Microsoft Office documents.
org.apache.any23.rdf - package org.apache.any23.rdf
This package provides some utility classes to manage RDF models and prefixes.
org.apache.any23.servlet - package org.apache.any23.servlet
This package defines a servlet exposing Any23 as a REST Service.
org.apache.any23.servlet.conneg - package org.apache.any23.servlet.conneg
This package provides the Any23 content type negotiator.
org.apache.any23.source - package org.apache.any23.source
Package containing some auxiliary classes to handle different types of InputStream sources.
org.apache.any23.util - package org.apache.any23.util
This package collects a set of internal library utility classes.
org.apache.any23.validator - package org.apache.any23.validator
This package defines a simple rule based validation framework able to detect (Rule) and fix (Fix) common mistakes that can be present within an HTML DOM model and that can prevent the extraction of metadata.
org.apache.any23.validator.rule - package org.apache.any23.validator.rule
A set of predefined validation Rules and associated Fixes.
org.apache.any23.vocab - package org.apache.any23.vocab
This package defines the main ontologies used by Any23.
org.apache.any23.writer - package org.apache.any23.writer
This package collects a set of TripleHandler decorators and specific RDF format writers.
OrganisationalSkill - Variable in class org.apache.any23.vocab.DOAC
 
organization - Variable in class org.apache.any23.vocab.DOAC
 
Organization - Variable in class org.apache.any23.vocab.FOAF
 
Organization - Variable in class org.apache.any23.vocab.VCARD
Resources that are vCard organizations.
organization_name - Variable in class org.apache.any23.vocab.VCARD
The name of an organization.
organization_unit - Variable in class org.apache.any23.vocab.VCARD
The name of a unit within an organization.
organizer - Variable in class org.apache.any23.vocab.ICAL
value type: CAL-ADDRESS. The URI defines the organizer for a calendar component.

P

page - Variable in class org.apache.any23.vocab.FOAF
 
PAGE_CONTENT_AE_PROPERTY - Static variable in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
 
PAGE_CONTENT_CE_PROPERTY - Static variable in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
 
PAGE_CONTENT_DE_PROPERTY - Static variable in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
 
PAGE_CONTENT_LCE_PROPERTY - Static variable in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
 
parent - Variable in class org.apache.any23.vocab.XFN
 
parse(Reader, String) - Method in class org.apache.any23.io.nquads.NQuadsParser
 
parse(InputStream, String) - Method in class org.apache.any23.io.nquads.NQuadsParser
 
parse(String) - Static method in class org.apache.any23.mime.MIMEType
Parses the given MIME type string returning an instance of MIMEType.
parseAccept(String) - Static method in class org.apache.any23.servlet.conneg.MediaRangeSpec
Parses an HTTP Accept header into a List of MediaRangeSpecs.
parseDateTime(String) - Static method in class org.apache.any23.extractor.microdata.ItemPropValue
 
parseRange(String) - Static method in class org.apache.any23.servlet.conneg.MediaRangeSpec
Parses a media range from a string such as text/*;charset=utf-8;q=0.9.
parseRDF(RDFFormat, InputStream, String) - Static method in class org.apache.any23.rdf.RDFUtils
Parses the content of the given input stream with the parser for the specified format, using the given base URI.
parseRDF(RDFFormat, InputStream) - Static method in class org.apache.any23.rdf.RDFUtils
Parses the content of the given input stream with the parser for the specified format, using the empty string as base URI.
parseRDF(RDFFormat, String) - Static method in class org.apache.any23.rdf.RDFUtils
Parses the content of the given string with the parser for the specified format, using the empty string as base URI.
parseRDF(String) - Static method in class org.apache.any23.rdf.RDFUtils
Parses the content of the resource file guessing the content format from the extension.
parseType(String) - Static method in class org.apache.any23.servlet.conneg.MediaRangeSpec
Parses a media type from a string such as text/html;charset=utf-8;q=0.9.
partstat - Variable in class org.apache.any23.vocab.ICAL
To specify the participation status for the calendar user specified by the URI.
pastProject - Variable in class org.apache.any23.vocab.FOAF
 
percentComplete - Variable in class org.apache.any23.vocab.ICAL
value type: INTEGER. This URI is used by an assignee or delegatee of a to-do to convey the percent completion of a to-do to the Organizer.
performExtraction(DocumentSource) - Method in class org.apache.any23.cli.Rover
 
permalink - Variable in class org.apache.any23.vocab.HLISTING
 
Person - Variable in class org.apache.any23.vocab.FOAF
 
personalEmail - Variable in class org.apache.any23.vocab.VCARD
An email address unaffiliated with any particular organization or employer; a personal email address.
PersonalProfileDocument - Variable in class org.apache.any23.vocab.FOAF
 
phone - Variable in class org.apache.any23.vocab.FOAF
 
photo - Variable in class org.apache.any23.vocab.HRECIPE
 
photo - Variable in class org.apache.any23.vocab.VCARD
A photograph of a person.
phylum - Variable in class org.apache.any23.vocab.WO
 
phylumClass - Variable in class org.apache.any23.vocab.WO
 
phylumName - Variable in class org.apache.any23.vocab.WO
 
plan - Variable in class org.apache.any23.vocab.FOAF
 
PLUGIN_DIRS_LIST_SEPARATOR - Static variable in class org.apache.any23.plugin.Any23PluginManager
List separator for the string declaring the plugin list.
PLUGIN_DIRS_PROPERTY - Static variable in class org.apache.any23.plugin.Any23PluginManager
Property in which to look for plugins.
PLUGINS_PACKAGE - Static variable in class org.apache.any23.plugin.Any23PluginManager
Any23 Plugins package.
PluginVerifier - Class in org.apache.any23.cli
Commandline utility to verify the Any23 plugins and extract basic information.
PluginVerifier() - Constructor for class org.apache.any23.cli.PluginVerifier
 
Point - Variable in class org.apache.any23.vocab.GEO
 
PopularPrefixes - Class in org.apache.any23.rdf
This class acts as a container for various well-known and adopted RDF Vocabulary prefixes.
PopularPrefixes() - Constructor for class org.apache.any23.rdf.PopularPrefixes
 
positiveVotes - Variable in class org.apache.any23.vocab.REVIEW
Number of positive usefulness votes (integer).
post_office_box - Variable in class org.apache.any23.vocab.VCARD
The post office box of a postal address.
postal_code - Variable in class org.apache.any23.vocab.VCARD
The postal code (e.g., U.S.
postalCode - Variable in class org.apache.any23.vocab.HLISTING
 
postOfficeBox - Variable in class org.apache.any23.vocab.HLISTING
 
PREFIX_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
Prefixes - Class in org.apache.any23.rdf
A mapping from prefixes to namespace URIs.
Prefixes() - Constructor for class org.apache.any23.rdf.Prefixes
 
Prefixes(Prefixes) - Constructor for class org.apache.any23.rdf.Prefixes
 
price - Variable in class org.apache.any23.vocab.HLISTING
 
PrimarySchool - Variable in class org.apache.any23.vocab.DOAC
 
primaryTopic - Variable in class org.apache.any23.vocab.FOAF
 
printError(String) - Method in class org.apache.any23.cli.ExtractorDocumentation
Prints an error message.
printExampleInput(String, ExtractorRegistry) - Method in class org.apache.any23.cli.ExtractorDocumentation
Prints an example of input for the provided extractor.
printExampleOutput(String, ExtractorRegistry) - Method in class org.apache.any23.cli.ExtractorDocumentation
Prints an output example for the given extractor.
printExtractorList(ExtractorRegistry) - Method in class org.apache.any23.cli.ExtractorDocumentation
Prints the list of all the available extractors.
printOut(ExtractionResult, Map<String, String>) - Method in class org.apache.any23.extractor.xpath.QuadTemplate
Prints out this quad template in the given ExtractionResult, using the passed variableAssignment to expand variables.
printReport(ExtractorRegistry) - Method in class org.apache.any23.cli.ExtractorDocumentation
Prints a complete report on all the available extractors.
printReport(PrintStream) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
printReport(PrintStream) - Method in interface org.apache.any23.extractor.IssueReport
Prints out the content of the report.
printReport() - Method in class org.apache.any23.writer.ReportingTripleHandler
 
printReports() - Method in class org.apache.any23.cli.Rover
 
printStackTrace(PrintStream) - Method in exception org.apache.any23.extractor.ExtractionException
 
printStackTrace(PrintWriter) - Method in exception org.apache.any23.extractor.ExtractionException
 
priority - Variable in class org.apache.any23.vocab.ICAL
The URI defines the relative priority for a calendar component. Value type: INTEGER.
process(Document, ExtractionResult) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
process(Document, ExtractionResult) - Method in interface org.apache.any23.extractor.xpath.XPathExtractionRule
Processes this extraction rule on the given document.
processDocument(URL, Document, ExtractionResult) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
prodid - Variable in class org.apache.any23.vocab.ICAL
Value type: TEXT. This URI specifies the identifier for the product that created the iCalendar object.
PROFILE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
Project - Variable in class org.apache.any23.vocab.FOAF
 
properties - Variable in class org.apache.any23.configuration.DefaultConfiguration
 
PROPERTY_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
publication - Variable in class org.apache.any23.vocab.DOAC
 
Publication - Variable in class org.apache.any23.vocab.DOAC
 
publications - Variable in class org.apache.any23.vocab.FOAF
 
published - Variable in class org.apache.any23.vocab.HRECIPE
 
Purifier - Interface in org.apache.any23.mime.purifier
This interface defines a minimum set of methods that a TikaMIMETypeDetector could call in order to clean the input before performing the MIME type detection.
purify(InputStream) - Method in interface org.apache.any23.mime.purifier.Purifier
Performs the purification of the provided resettable InputStream.
purify(InputStream) - Method in class org.apache.any23.mime.purifier.WhiteSpacesPurifier
Performs the purification of the provided resettable InputStream.
putData(String, Object) - Method in class org.apache.any23.validator.DefaultRuleContext
 
putData(String, T) - Method in interface org.apache.any23.validator.RuleContext
Puts a data item into the context.

Q

quad(Resource, URI, Value, Resource) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Statement.
quad(String, String, String, String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a statement of type: toValue(s), toValue(p), toValue(o), toValue(g)
QuadTemplate - Class in org.apache.any23.extractor.xpath
This class models an NQuads template, that is, a quadruple in which any component can be a variable.
QuadTemplate(TemplateSubject, TemplatePredicate, TemplateObject, TemplateGraph) - Constructor for class org.apache.any23.extractor.xpath.QuadTemplate
Constructor.
QuadTemplate(TemplateSubject, TemplatePredicate, TemplateObject) - Constructor for class org.apache.any23.extractor.xpath.QuadTemplate
Constructor for template with no graph.

R

range - Variable in class org.apache.any23.vocab.ICAL
Specifies the effective range of recurrence instances, starting from the instance identified by the recurrence identifier given by the URI.
rating - Variable in class org.apache.any23.vocab.REVIEW
A numeric value.
rdate - Variable in class org.apache.any23.vocab.ICAL
Default value type: DATE-TIME. This URI defines the list of date/times for a recurrence set.
RDFa11Extractor - Class in org.apache.any23.extractor.rdfa
Extractor implementation for RDFa 1.1 specification.
RDFa11Extractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdfa.RDFa11Extractor
Constructor that allows specifying the validation and error handling policies.
RDFa11Extractor() - Constructor for class org.apache.any23.extractor.rdfa.RDFa11Extractor
Default constructor, with no verification of data types and no stop at the first error.
RDFa11Parser - Class in org.apache.any23.extractor.rdfa
This parser is able to extract RDFa 1.0 and RDFa 1.1 statements from any (X)HTML document.
RDFa11Parser() - Constructor for class org.apache.any23.extractor.rdfa.RDFa11Parser
 
RDFa11ParserException - Exception in org.apache.any23.extractor.rdfa
Exception class raised by RDFa11Parser.
RDFa11ParserException(String) - Constructor for exception org.apache.any23.extractor.rdfa.RDFa11ParserException
 
RDFa11ParserException(String, Throwable) - Constructor for exception org.apache.any23.extractor.rdfa.RDFa11ParserException
 
RDFaExtractor - Class in org.apache.any23.extractor.rdfa
Extractor for RDFa in HTML, based on Fabien Gandon's XSLT transform.
RDFaExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdfa.RDFaExtractor
Constructor that allows specifying the validation and error handling policies.
RDFaExtractor() - Constructor for class org.apache.any23.extractor.rdfa.RDFaExtractor
Default constructor, with no verification of data types and no stop at the first error.
RDFHandlerAdapter - Class in org.apache.any23.extractor.rdf
An RDFHandler that relays statements and prefix definitions to an ExtractionResult.
RDFHandlerAdapter(ExtractionResult) - Constructor for class org.apache.any23.extractor.rdf.RDFHandlerAdapter
 
RDFParserFactory - Class in org.apache.any23.extractor.rdf
This factory provides common logic for correctly creating and configuring any RDF parser used within the library.
RDFParserFactory() - Constructor for class org.apache.any23.extractor.rdf.RDFParserFactory
 
RDFSchemaUtils - Class in org.apache.any23.vocab
This class provides a set of methods for generating RDF Schema.
RDFSchemaUtils.VocabularyFormat - Enum in org.apache.any23.vocab
Supported formats for vocabulary serialization.
RDFUtils - Class in org.apache.any23.rdf
Basic class providing a set of utility methods when dealing with RDF.
RDFWriterTripleHandler - Class in org.apache.any23.writer
A TripleHandler that writes triples to a Sesame RDFWriter, e.g. for serialization using one of Sesame's writers.
RDFXMLExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor able to perform the extraction on RDF/XML documents.
RDFXMLExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.RDFXMLExtractor
Constructor that allows specifying the validation and error handling policies.
RDFXMLExtractor() - Constructor for class org.apache.any23.extractor.rdf.RDFXMLExtractor
Default constructor, with no verification of data types and no stop at the first error.
RDFXMLWriter - Class in org.apache.any23.writer
RDF/XML writer implementation.
RDFXMLWriter(OutputStream) - Constructor for class org.apache.any23.writer.RDFXMLWriter
 
RDFXMLWriterFactory - Class in org.apache.any23.writer
 
RDFXMLWriterFactory() - Constructor for class org.apache.any23.writer.RDFXMLWriterFactory
 
read() - Method in class org.apache.any23.extractor.html.SpanCloserInputStream
 
read() - Method in class org.apache.any23.util.ReaderInputStream
Reads from the Reader, returning the same value.
read(byte[], int, int) - Method in class org.apache.any23.util.ReaderInputStream
Reads from the Reader into a byte array.
readAttribute(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Reads the value of the specified attribute, returning the defaultValue string if not present.
readAttribute(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Reads the value of an attribute, returning the empty string if not present.
readAttribute(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
Reads an attribute while avoiding NullPointerExceptions; if the attribute is missing, it returns an empty string.
readAttributeWithPrefix(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Reads the value of the first attribute whose name matches the specified attributePrefix.
ReaderInputStream - Class in org.apache.any23.util
Adapts a Reader as an InputStream.
ReaderInputStream(Reader) - Constructor for class org.apache.any23.util.ReaderInputStream
Construct a ReaderInputStream for the specified Reader.
ReaderInputStream(Reader, String) - Constructor for class org.apache.any23.util.ReaderInputStream
Construct a ReaderInputStream for the specified Reader, with the specified encoding.
readFileContent(File) - Static method in class org.apache.any23.util.FileUtils
Returns the content of a file as a single string.
readFileLines(File) - Static method in class org.apache.any23.util.FileUtils
Returns all the lines of a file.
readNodeContent(Node, boolean) - Static method in class org.apache.any23.extractor.html.HTMLDocument
Reads the text content of the given node and returns it.
readResourceContent(Class, String) - Static method in class org.apache.any23.util.FileUtils
Reads a resource file and returns the content as a string.
readResourceContent(String) - Static method in class org.apache.any23.util.FileUtils
Reads a resource file and returns the content as a string.
reads - Variable in class org.apache.any23.vocab.DOAC
 
readStream() - Method in class org.apache.any23.source.FileDocumentSource
 
readTextField(Node) - Static method in class org.apache.any23.extractor.html.HTMLDocument
Reads a text field from the given node, adding the content to the given res list.
readUrlField(List<HTMLDocument.TextField>, Node) - Static method in class org.apache.any23.extractor.html.HTMLDocument
Reads a URL field from the given node, adding the content to the given res list.
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.JSONWriter
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
 
receiveNamespace(String, String, ExtractionContext) - Method in interface org.apache.any23.writer.TripleHandler
Invoked with a currently open context, notifies the detection of a namespace.
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.URIListWriter
 
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
 
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.JSONWriter
 
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
 
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in interface org.apache.any23.writer.TripleHandler
Invoked with a currently open context, notifies the detection of a triple.
receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.URIListWriter
 
Recipe - Variable in class org.apache.any23.vocab.HRECIPE
 
recurrenceId - Variable in class org.apache.any23.vocab.ICAL
Default value type: DATE-TIME. This URI is used in conjunction with the "UID" and "SEQUENCE" URIs to identify a specific instance of a recurring "VEVENT", "VTODO" or "VJOURNAL" calendar component.
RedirectServlet - Class in org.apache.any23.servlet
This servlet contains the logic to perform the correct redirects when Any23 is used as an all-in-one web application.
RedirectServlet() - Constructor for class org.apache.any23.servlet.RedirectServlet
 
reference - Variable in class org.apache.any23.vocab.DOAC
 
Reference - Variable in class org.apache.any23.vocab.DOAC
 
reference_type - Variable in class org.apache.any23.vocab.DOAC
 
referer - Variable in class org.apache.any23.vocab.DOAC
 
region - Variable in class org.apache.any23.vocab.HLISTING
 
region - Variable in class org.apache.any23.vocab.VCARD
The region (e.g., state or province) of a postal address.
register(ExtractorFactory<?>) - Method in interface org.apache.any23.extractor.ExtractorRegistry
Registers an ExtractorFactory.
register(ExtractorFactory<?>) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
Registers an ExtractorFactory.
register(WriterFactory) - Method in class org.apache.any23.writer.WriterFactoryRegistry
Registers a new WriterFactory to the registry.
REL_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
related - Variable in class org.apache.any23.vocab.DCTERMS
 
related - Variable in class org.apache.any23.vocab.ICAL
Specifies the relationship of the alarm trigger with respect to the start or end of the calendar component.
relatedTo - Variable in class org.apache.any23.vocab.ICAL
The URI is used to represent a relationship or reference between one calendar component and another. Value type: TEXT.
reltype - Variable in class org.apache.any23.vocab.ICAL
Specifies the type of hierarchical relationship associated with the calendar component specified by the URI.
remove(Variable) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
Removes a variable from the template.
remove(QuadTemplate) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
Removes a quad template from the rule.
remove(Variable) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
remove(QuadTemplate) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
remove(XPathExtractionRule) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
 
removeData(String) - Method in class org.apache.any23.validator.DefaultRuleContext
 
removeData(String) - Method in interface org.apache.any23.validator.RuleContext
Removes a data item from the context.
removeListener(CrawlerListener) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
Deregisters a CrawlerListener from this crawler.
removePrefix(String) - Method in class org.apache.any23.rdf.Prefixes
 
removeRule(Class<? extends Rule>) - Method in class org.apache.any23.validator.DefaultValidator
 
removeRule(Class<? extends Rule>) - Method in interface org.apache.any23.validator.Validator
Removes a rule, and all its related Fixes, from the validator.
rent - Variable in class org.apache.any23.vocab.HLISTING
 
repeat - Variable in class org.apache.any23.vocab.ICAL
This URI defines the number of times the alarm should be repeated after the initial trigger. Value type: INTEGER.
report() - Method in class org.apache.any23.writer.BenchmarkTripleHandler
Returns the report as a human readable string.
reportFixError(Fix, Exception, String) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
reportFixError(Fix, Exception, String) - Method in interface org.apache.any23.validator.ValidationReportBuilder
Reports an error that occurred while executing a Fix.
ReportingTripleHandler - Class in org.apache.any23.writer
A TripleHandler that collects various information about the extraction process, such as the extractors used and the total number of triples.
ReportingTripleHandler(TripleHandler) - Constructor for class org.apache.any23.writer.ReportingTripleHandler
 
reportIssue(ValidationReport.IssueLevel, String, Node) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
reportIssue(ValidationReport.IssueLevel, String) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
reportIssue(ValidationReport.IssueLevel, String, Node) - Method in interface org.apache.any23.validator.ValidationReportBuilder
Reports an issue detected on a specified node.
reportIssue(ValidationReport.IssueLevel, String) - Method in interface org.apache.any23.validator.ValidationReportBuilder
Reports a detected issue.
reportRuleError(Rule, Exception, String) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
reportRuleError(Rule, Exception, String) - Method in interface org.apache.any23.validator.ValidationReportBuilder
Reports an error that occurred while executing a Rule.
RepositoryWriter - Class in org.apache.any23.writer
A Sesame repository triple writer.
RepositoryWriter(RepositoryConnection) - Constructor for class org.apache.any23.writer.RepositoryWriter
 
RepositoryWriter(RepositoryConnection, Resource) - Constructor for class org.apache.any23.writer.RepositoryWriter
 
requestStatus - Variable in class org.apache.any23.vocab.ICAL
Value type: TEXT. This URI defines the status code returned for a scheduling request.
reset() - Method in class org.apache.any23.extractor.html.HCardName
Resets the content of the HName fields.
reset() - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Resets the parser to the original state.
reset() - Method in class org.apache.any23.util.ReaderInputStream
Resets the StringReader.
reset() - Method in class org.apache.any23.writer.CountingTripleHandler
 
resetExtractor() - Method in class org.apache.any23.extractor.html.AdrExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
Resets the internal status of the extractor to prepare it for a new extraction section.
resetExtractor() - Method in class org.apache.any23.extractor.html.GeoExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.HCardExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.HListingExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.HRecipeExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.HResumeExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.HReviewExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.SpeciesExtractor
Resets the internal status of the extractor to prepare it for a new extraction section.
resolveCURIEOrURI(String, boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Resolves a CURIE or URI string.
resolveCurieOrURIList(Node, String, boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Resolves a whitespace-separated list of CURIEs or URIs.
resolveURI(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
 
resolveURI(String) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Resolves a URI string as URI.
resolveURI(String, URI) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
Fixes typical errors in URIs, and resolves relative URIs against a base URI.
RESOURCE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
RESOURCE_NAME - Static variable in class org.apache.any23.mime.TikaMIMETypeDetector
 
resources - Variable in class org.apache.any23.vocab.ICAL
Value type: TEXT. This URI defines the equipment or resources anticipated for an activity specified by a calendar entity.
rev - Variable in class org.apache.any23.vocab.VCARD
The timestamp of a revision of a vCard.
REV_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
REVIEW - Class in org.apache.any23.vocab
Vocabulary definitions from vocabularies/review.rdf
Review - Variable in class org.apache.any23.vocab.REVIEW
A review of a work.
reviewer - Variable in class org.apache.any23.vocab.REVIEW
The person that has written the review.
role - Variable in class org.apache.any23.vocab.ICAL
Specifies the participation role for the calendar user specified by the URI.
role - Variable in class org.apache.any23.vocab.VCARD
A role a person plays within an organization.
ROOT_EXTRACTION_RESULT_ID - Static variable in class org.apache.any23.extractor.ExtractionContext
 
Rover - Class in org.apache.any23.cli
A default rover implementation.
Rover() - Constructor for class org.apache.any23.cli.Rover
 
Rover.ArgumentToURIConverter - Class in org.apache.any23.cli
 
Rover.ArgumentToURIConverter() - Constructor for class org.apache.any23.cli.Rover.ArgumentToURIConverter
 
Rover.PrintStreamConverter - Class in org.apache.any23.cli
 
Rover.PrintStreamConverter() - Constructor for class org.apache.any23.cli.Rover.PrintStreamConverter
 
ROW - Static variable in class org.apache.any23.vocab.CSV
 
row - Variable in class org.apache.any23.vocab.CSV
This property links the identifier of a CSV to an entity representing a row.
ROW - Static variable in class org.apache.any23.vocab.Excel
 
row - Variable in class org.apache.any23.vocab.Excel
This resource identifies a row.
ROW_POSITION - Static variable in class org.apache.any23.vocab.CSV
 
ROW_TYPE - Static variable in class org.apache.any23.vocab.CSV
 
rowPosition - Variable in class org.apache.any23.vocab.CSV
This property expresses the index of a row in a CSV file.
rowType - Variable in class org.apache.any23.vocab.CSV
This resource identifies a Row.
rrule - Variable in class org.apache.any23.vocab.ICAL
This URI defines a rule or repeating pattern for recurring events, to-dos, or time zone definitions. Value type: RECUR.
rsvp - Variable in class org.apache.any23.vocab.ICAL
Specifies whether a reply is expected from the calendar user specified by the URI value.
Rule - Interface in org.apache.any23.validator
Models a rule for detecting an issue.
RuleContext<T> - Interface in org.apache.any23.validator
This context is used to pass data from a Rule to a Fix.
run() - Method in class org.apache.any23.cli.Crawler
 
run() - Method in class org.apache.any23.cli.ExtractorDocumentation
 
run() - Method in class org.apache.any23.cli.MicrodataParser
 
run() - Method in class org.apache.any23.cli.MimeDetector
 
run() - Method in class org.apache.any23.cli.PluginVerifier
 
run() - Method in class org.apache.any23.cli.Rover
 
run() - Method in interface org.apache.any23.cli.Tool
Runs the tool and retrieves the exit code.
run() - Method in class org.apache.any23.cli.VocabPrinter
 
run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.extractor.csv.CSVExtractor
Executes the extractor.
run(ExtractionParameters, ExtractionContext, Input, ExtractionResult) - Method in interface org.apache.any23.extractor.Extractor
Executes the extractor.
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.HeadLinkExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.HTMLMetaExtractor
Executes the extractor.
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.ICBMExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.LicenseExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.TitleExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.TurtleHTMLExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.XFNExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.microdata.MicrodataExtractor
This extraction performs the Microdata to RDF conversion algorithm.
run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
 
run(ExtractionParameters) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Triggers the execution of all the Extractors registered to this class using the specified extraction parameters.
run() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Triggers the execution of all the Extractors registered to this class using the default extraction parameters.
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
 
run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
 
run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.plugin.officescraper.ExcelExtractor
 

S

SCHEMAORG - Class in org.apache.any23.vocab
Vocabulary definition for schema.org.
schoolHomepage - Variable in class org.apache.any23.vocab.FOAF
 
scientificName - Variable in class org.apache.any23.vocab.WO
Specifies the scientific name of a species, allowing this portion of the name to be explicitly described.
SecondarySchool - Variable in class org.apache.any23.vocab.DOAC
 
sell - Variable in class org.apache.any23.vocab.HLISTING
 
sentBy - Variable in class org.apache.any23.vocab.ICAL
Specifies the calendar user that is acting on behalf of the calendar user specified by the URI.
sequence - Variable in class org.apache.any23.vocab.ICAL
Value type: INTEGER. This URI defines the revision sequence number of the calendar component within a sequence of revisions.
SerializationException - Exception in org.apache.any23.validator
Defines an error raised from the ValidationReportSerializer.
SerializationException(String) - Constructor for exception org.apache.any23.validator.SerializationException
 
SerializationException(String, Throwable) - Constructor for exception org.apache.any23.validator.SerializationException
 
serialize(ValidationReport, OutputStream) - Method in interface org.apache.any23.validator.ValidationReportSerializer
Serializes the validation report on the given output stream.
serialize(ValidationReport, OutputStream) - Method in class org.apache.any23.validator.XMLValidationReportSerializer
 
serializeGetterValue(Object, Method, PrintStream) - Method in class org.apache.any23.validator.XMLValidationReportSerializer
 
serializeToXML(Node, boolean) - Static method in class org.apache.any23.extractor.html.DomUtils
Given a DOM Node produces the XML serialization omitting the XML declaration.
serializeVocabularies(RDFSchemaUtils.VocabularyFormat, PrintStream) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
Serializes all the vocabularies to NQuads over the given output stream.
serializeVocabulary(URI, URI[], URI[], Map<URI, String>, RDFWriter) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
Serializes a vocabulary composed of the given namespace, resources and properties.
serializeVocabulary(Vocabulary, RDFWriter) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
Serializes the given vocabulary to triples over the given writer.
serializeVocabulary(Vocabulary, RDFSchemaUtils.VocabularyFormat, boolean, PrintStream) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
Serializes the given vocabulary to NQuads over the given output stream.
serializeVocabulary(Vocabulary, RDFSchemaUtils.VocabularyFormat) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
Serializes the given vocabulary to NQuads and returns them as a string.
service - Variable in class org.apache.any23.vocab.HLISTING
 
Servlet - Class in org.apache.any23.servlet
A Servlet that fetches a client-specified URI, RDFizes the content, and returns it in a format chosen by the client.
Servlet() - Constructor for class org.apache.any23.servlet.Servlet
 
setAnnotated(boolean) - Method in interface org.apache.any23.writer.FormatWriter
Sets the annotation flag.
setAnnotated(boolean) - Method in class org.apache.any23.writer.JSONWriter
 
setAnnotated(boolean) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
Sets the annotation flag.
setAnnotated(boolean) - Method in class org.apache.any23.writer.URIListWriter
 
setCacheFactory(LocalCopyFactory) - Method in class org.apache.any23.Any23
Sets a LocalCopyFactory instance.
setContentLength(long) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
setContentLength(long) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
setContentLength(long) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
setContentLength(long) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
setContentLength(long) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
setContentLength(long) - Method in class org.apache.any23.writer.CountingTripleHandler
 
setContentLength(long) - Method in class org.apache.any23.writer.JSONWriter
 
setContentLength(long) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
setContentLength(long) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
setContentLength(long) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
setContentLength(long) - Method in class org.apache.any23.writer.RepositoryWriter
 
setContentLength(long) - Method in interface org.apache.any23.writer.TripleHandler
Sets the length of the content to be processed.
setContentLength(long) - Method in class org.apache.any23.writer.URIListWriter
 
setCrawlData(String, Pattern, List<CrawlerListener>) - Static method in class org.apache.any23.plugin.crawler.SharedData
Initializes the crawler data.
setDefaultAccept(String) - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator
Sets an Accept header to be used as the default if a client does not send an Accept header, or if the Accept header cannot be parsed.
setDefaultLogging() - Static method in class org.apache.any23.util.LogUtils
 
setErrorMode(MicrodataParser.ErrorMode) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
 
setErrorNode(Node) - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
setField(String, HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
 
setFlag(String, boolean) - Method in class org.apache.any23.extractor.ExtractionParameters
Sets the value for an extraction flag.
setFullName(HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
 
setHTTPClient(HTTPClient) - Method in class org.apache.any23.Any23
Sets the HTTPClient implementation used to retrieve contents.
setHTTPUserAgent(String) - Method in class org.apache.any23.Any23
Sets the HTTP User-Agent header; see RFC 2616, section 14.43.
setIssueReport(IssueReport) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
setLocalCopyFactory(LocalCopyFactory) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Sets the internal factory for generating the document local copy; if null, the MemCopyFactory will be used.
setMaxDepth(int) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
Sets the maximum depth.
setMaxPages(int) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
Sets the maximum collected pages.
setMIMETypeDetector(MIMETypeDetector) - Method in class org.apache.any23.Any23
Sets the MIMETypeDetector instance.
setMIMETypeDetector(MIMETypeDetector) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Sets the internal MIME type detector; if null, MIME type detection will be skipped and all extractors will be activated.
setNumOfCrawlers(int) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
Sets the number of crawler instances.
setOrganization(HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
 
setOrganizationUnit(HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
 
setParserEncoding(String) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Sets the document parser encoding.
setPolitenessDelay(int) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
Sets the politeness delay.
setProperty(String, String) - Method in class org.apache.any23.configuration.DefaultModifiableConfiguration
 
setProperty(String, String) - Method in interface org.apache.any23.configuration.ModifiableConfiguration
Sets a new value propertyValue for the property named propertyName.
setProperty(String, String) - Method in class org.apache.any23.extractor.ExtractionParameters
Sets the value for an extraction property.
setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.csv.CSVExtractor
If true, the extractor will stop at the first parsing error; if false, the extractor will attempt to ignore all parsing errors.
setStopAtFirstError(boolean) - Method in interface org.apache.any23.extractor.Extractor.ContentExtractor
If true, the extractor will stop at the first parsing error; if false, the extractor will attempt to ignore all parsing errors.
setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
 
setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
 
setStopAtFirstError(boolean) - Method in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
 
setStopAtFirstError(boolean) - Method in class org.apache.any23.plugin.officescraper.ExcelExtractor
 
setVerboseLogging() - Static method in class org.apache.any23.util.LogUtils
 
setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
 
setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
 
setWebCrawler(Class<? extends WebCrawler>) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
Sets the actual crawler class.
sha1 - Variable in class org.apache.any23.vocab.FOAF
 
SharedData - Class in org.apache.any23.plugin.crawler
This class hosts shared data structures accessible to all the DefaultWebCrawler instances run by the SiteCrawler.
SHEET - Static variable in class org.apache.any23.vocab.Excel
 
sheet - Variable in class org.apache.any23.vocab.Excel
This resource identifies a Sheet.
SHEET_NAME - Static variable in class org.apache.any23.vocab.Excel
 
sheetName - Variable in class org.apache.any23.vocab.Excel
This property links the identifier of a Sheet to the name of the sheet.
shouldVisit(WebURL) - Method in class org.apache.any23.plugin.crawler.DefaultWebCrawler
Override this method to specify whether the given URL should be visited or not.
SimpleExtractorFactory<T extends Extractor<?>> - Class in org.apache.any23.extractor
This class is a simple and default-like implementation of ExtractorFactory.
SINDICE - Class in org.apache.any23.vocab
This class models an internal Sindice Vocabulary to describe resource domains and Microformat nesting relationships.
SingleDocumentExtraction - Class in org.apache.any23.extractor
This class acts as a facade through which all the extractors are called on a single document.
SingleDocumentExtraction(Configuration, DocumentSource, ExtractorGroup, TripleHandler) - Constructor for class org.apache.any23.extractor.SingleDocumentExtraction
Builds an extractor by the specification of document source, list of extractors and output triple handler.
SingleDocumentExtraction(Configuration, DocumentSource, ExtractorFactory<?>, TripleHandler) - Constructor for class org.apache.any23.extractor.SingleDocumentExtraction
Builds an extractor by the specification of document source, extractors factory and output triple handler.
SingleDocumentExtraction(DocumentSource, ExtractorFactory<?>, TripleHandler) - Constructor for class org.apache.any23.extractor.SingleDocumentExtraction
Builds an extractor by the specification of document source, extractors factory and output triple handler, using the DefaultConfiguration.
SingleDocumentExtractionReport - Class in org.apache.any23.extractor
This class provides the report for a SingleDocumentExtraction run.
SingleDocumentExtractionReport(ValidationReport, Map<String, Collection<IssueReport.Issue>>) - Constructor for class org.apache.any23.extractor.SingleDocumentExtractionReport
 
singleton - Static variable in class org.apache.any23.configuration.DefaultConfiguration
 
singleton() - Static method in class org.apache.any23.configuration.DefaultConfiguration
 
singleton() - Static method in class org.apache.any23.http.DefaultHTTPClientConfiguration
 
SITE_NAME - Static variable in class org.apache.any23.vocab.OGP
If your object is part of a larger web site, the name which should be displayed for the overall site.
SiteCrawler - Class in org.apache.any23.plugin.crawler
A basic site crawler to extract semantic content of small/medium size sites.
SiteCrawler(File) - Constructor for class org.apache.any23.plugin.crawler.SiteCrawler
Constructor.
siteName - Variable in class org.apache.any23.vocab.OGP
 
SIZE - Static variable in class org.apache.any23.vocab.SINDICE
 
size - Variable in class org.apache.any23.vocab.SINDICE
Size meta property indicating the number of triples within the returned dataset.
skill - Variable in class org.apache.any23.vocab.DOAC
 
Skill - Variable in class org.apache.any23.vocab.DOAC
 
SocialSkill - Variable in class org.apache.any23.vocab.DOAC
 
sort_string - Variable in class org.apache.any23.vocab.VCARD
A version of a person's name suitable for collation.
sound - Variable in class org.apache.any23.vocab.VCARD
A sound (e.g., a greeting or pronunciation) of a person.
source() - Method in class org.apache.any23.extractor.html.HTMLDocument.TextField
 
source - Variable in class org.apache.any23.vocab.DCTERMS
 
SpanCloserInputStream - Class in org.apache.any23.extractor.html
Extension of InputStream meant to detect and replace any occurrence of inline span:
SpanCloserInputStream(InputStream) - Constructor for class org.apache.any23.extractor.html.SpanCloserInputStream
 
speaks - Variable in class org.apache.any23.vocab.DOAC
 
species - Variable in class org.apache.any23.vocab.WO
Generic class defining a biological species
SpeciesExtractor - Class in org.apache.any23.extractor.html
Extractor able to extract the Species Microformat.
SpeciesExtractor() - Constructor for class org.apache.any23.extractor.html.SpeciesExtractor
 
speciesName - Variable in class org.apache.any23.vocab.WO
specifies the species part of a binomial name, allowing this portion of the name to be explicitly described.
spouse - Variable in class org.apache.any23.vocab.XFN
 
SRC_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
SRC_TAGS - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
List of tags providing the src property.
standard - Variable in class org.apache.any23.vocab.ICAL
 
start(URL, Pattern, boolean) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
Starts the crawling process.
start(URL, boolean) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
Starts the crawler process with the SiteCrawler.defaultFilters.
start_date - Variable in class org.apache.any23.vocab.DOAC
 
startDocument(URI) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
startDocument(URI) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
startDocument(URI) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
startDocument(URI) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
startDocument(URI) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
startDocument(URI) - Method in class org.apache.any23.writer.CountingTripleHandler
 
startDocument(URI) - Method in class org.apache.any23.writer.JSONWriter
 
startDocument(URI) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
startDocument(URI) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
startDocument(URI) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
startDocument(URI) - Method in class org.apache.any23.writer.RepositoryWriter
 
startDocument(URI) - Method in interface org.apache.any23.writer.TripleHandler
 
startDocument(URI) - Method in class org.apache.any23.writer.URIListWriter
 
startRDF() - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
 
startRDF() - Method in class org.apache.any23.io.nquads.NQuadsWriter
 
status - Variable in class org.apache.any23.vocab.ICAL
Value type: TEXT. This URI defines the overall status or confirmation for the calendar component.
stop() - Method in class org.apache.any23.plugin.crawler.SiteCrawler
Interrupts the crawler process if started with wait flag == false.
StreamUtils - Class in org.apache.any23.util
Contains general utility functions for handling streams.
street_address - Variable in class org.apache.any23.vocab.VCARD
The street address of a postal address.
streetAddress - Variable in class org.apache.any23.vocab.HLISTING
 
StringDocumentSource - Class in org.apache.any23.source
String implementation of DocumentSource.
StringDocumentSource(String, String) - Constructor for class org.apache.any23.source.StringDocumentSource
 
StringDocumentSource(String, String, String) - Constructor for class org.apache.any23.source.StringDocumentSource
 
StringDocumentSource(String, String, String, String) - Constructor for class org.apache.any23.source.StringDocumentSource
 
StringUtils - Class in org.apache.any23.util
This class provides a set of string utility methods.
stylesheet - Variable in class org.apache.any23.vocab.XHTML
 
SUBJECT_ATTRIBUTES - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
summary - Variable in class org.apache.any23.vocab.DOAC
 
summary - Variable in class org.apache.any23.vocab.HLISTING
 
summary - Variable in class org.apache.any23.vocab.HRECIPE
 
summary - Variable in class org.apache.any23.vocab.ICAL
This URI defines a short summary or subject for the calendar component. Value type: TEXT.
surname - Variable in class org.apache.any23.vocab.FOAF
 
sweetheart - Variable in class org.apache.any23.vocab.XFN
 
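The setStopAtFirstError(boolean) entries above describe a flag that makes an extractor either stop at the first parsing error or attempt to ignore parsing errors. The following is a minimal, self-contained sketch of that behaviour. The LineParser class and its integer-per-line input are hypothetical and chosen only for illustration; this is not Any23's extractor code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class StopAtFirstErrorSketch {

    /** Hypothetical parser, for illustration only; not an Any23 class. */
    static class LineParser {
        private boolean stopAtFirstError = false;

        void setStopAtFirstError(boolean flag) {
            this.stopAtFirstError = flag;
        }

        /** Parses each line as an integer; a malformed line is either fatal or skipped. */
        List<Integer> parse(List<String> lines) {
            List<Integer> values = new ArrayList<>();
            for (String line : lines) {
                try {
                    values.add(Integer.parseInt(line.trim()));
                } catch (NumberFormatException e) {
                    if (stopAtFirstError) {
                        // Strict mode: abort the whole run on the first bad line.
                        throw new IllegalStateException("Parsing stopped at: " + line, e);
                    }
                    // Lenient mode: ignore the malformed line and continue.
                }
            }
            return values;
        }
    }

    public static void main(String[] args) {
        LineParser lenient = new LineParser();
        // The malformed line "x" is skipped in lenient mode.
        System.out.println(lenient.parse(Arrays.asList("1", "x", "3"))); // prints [1, 3]
    }
}
```

In strict mode the same input would raise an exception at the second line, so callers can choose between partial output and fail-fast behaviour.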

T

tag - Variable in class org.apache.any23.vocab.HRECIPE
 
TagSoupExtractionResult - Interface in org.apache.any23.extractor
This interface models a specific ExtractionResult able to collect property roots generated by HTML Microformat extractions.
TagSoupExtractionResult.PropertyPath - Class in org.apache.any23.extractor
Defines a property path object.
TagSoupExtractionResult.PropertyPath(String[], Resource, Resource, BNode, Class<? extends MicroformatExtractor>) - Constructor for class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
TagSoupExtractionResult.ResourceRoot - Class in org.apache.any23.extractor
Defines a property root object.
TagSoupExtractionResult.ResourceRoot(String[], Resource, Class<? extends MicroformatExtractor>) - Constructor for class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
 
TagSoupParser - Class in org.apache.any23.extractor.html
Parses an InputStream into an HTML DOM tree using a TagSoup parser.
TagSoupParser(InputStream, String) - Constructor for class org.apache.any23.extractor.html.TagSoupParser
 
TagSoupParser(InputStream, String, String) - Constructor for class org.apache.any23.extractor.html.TagSoupParser
 
TagSoupParser.ElementLocation - Class in org.apache.any23.extractor.html
Describes a DOM Element location.
tel - Variable in class org.apache.any23.vocab.HLISTING
 
tel - Variable in class org.apache.any23.vocab.VCARD
A telephone number of a person.
Telephone - Variable in class org.apache.any23.vocab.VCARD
Resources that are vCard Telephones.
TemplateGraph - Class in org.apache.any23.extractor.xpath
Represents a Quad graph URI template.
TemplateGraph(String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplateGraph
Constructor.
TemplateObject - Class in org.apache.any23.extractor.xpath
Represents a Quad object template.
TemplateObject(TemplateObject.Type, String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplateObject
Constructor.
TemplateObject.Type - Enum in org.apache.any23.extractor.xpath
Supported object types.
TemplatePredicate - Class in org.apache.any23.extractor.xpath
Represents a Quad predicate template.
TemplatePredicate(String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplatePredicate
Constructor.
TemplateSubject - Class in org.apache.any23.extractor.xpath
Represents a Quad subject template.
TemplateSubject(TemplateSubject.Type, String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplateSubject
Constructor.
TemplateSubject.Type - Enum in org.apache.any23.extractor.xpath
Supported subject types.
TemplateXPathExtractionRule - Interface in org.apache.any23.extractor.xpath
Defines an XPathExtractionRule able to expand a QuadTemplate set based on a given template.
TemplateXPathExtractionRuleImpl - Class in org.apache.any23.extractor.xpath
Default implementation of XPathExtractionRule.
TemplateXPathExtractionRuleImpl(String, String) - Constructor for class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
Term<T extends org.openrdf.model.Value> - Class in org.apache.any23.extractor.xpath
Represents a generic template term.
Term(String, boolean) - Constructor for class org.apache.any23.extractor.xpath.Term
Constructor.
text - Variable in class org.apache.any23.vocab.REVIEW
The text of the review.
theme - Variable in class org.apache.any23.vocab.FOAF
 
thumbnail - Variable in class org.apache.any23.vocab.FOAF
 
TikaEncodingDetector - Class in org.apache.any23.encoding
An implementation of EncodingDetector based on Apache Tika.
TikaEncodingDetector() - Constructor for class org.apache.any23.encoding.TikaEncodingDetector
 
TikaMIMETypeDetector - Class in org.apache.any23.mime
Implementation of MIMETypeDetector based on Apache Tika.
TikaMIMETypeDetector(Purifier) - Constructor for class org.apache.any23.mime.TikaMIMETypeDetector
 
TikaMIMETypeDetector() - Constructor for class org.apache.any23.mime.TikaMIMETypeDetector
 
tipjar - Variable in class org.apache.any23.vocab.FOAF
 
title - Variable in class org.apache.any23.vocab.DCTERMS
 
title - Variable in class org.apache.any23.vocab.DOAC
 
title - Variable in class org.apache.any23.vocab.FOAF
 
TITLE - Static variable in class org.apache.any23.vocab.OGP
The title of your object as it should appear within the graph (Facebook Graph API).
title - Variable in class org.apache.any23.vocab.OGP
 
title - Variable in class org.apache.any23.vocab.REVIEW
The title of the review.
title - Variable in class org.apache.any23.vocab.VCARD
A person's title.
TitleExtractor - Class in org.apache.any23.extractor.html
Extracts the value of the <title> element of an HTML or XHTML page.
TitleExtractor() - Constructor for class org.apache.any23.extractor.html.TitleExtractor
 
toByteArray(InputStream) - Static method in class org.apache.any23.source.MemCopyFactory
 
toJSON() - Method in class org.apache.any23.extractor.microdata.Item
 
toJSON() - Method in class org.apache.any23.extractor.microdata.ItemProp
 
toJSON() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
toJSON() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
toJSON() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
Tool - Interface in org.apache.any23.cli
Defines a runnable CLI tool.
ToolRunner - Class in org.apache.any23.cli
This is the main class, responsible for providing a uniform command-line access point to all the other tools, such as Rover.
ToolRunner() - Constructor for class org.apache.any23.cli.ToolRunner
 
topic - Variable in class org.apache.any23.vocab.FOAF
 
topic_interest - Variable in class org.apache.any23.vocab.FOAF
 
toString() - Method in class org.apache.any23.extractor.ExtractionContext
 
toString() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
toString() - Method in class org.apache.any23.extractor.IssueReport.Issue
 
toString() - Method in class org.apache.any23.extractor.microdata.ItemProp
 
toString() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
toString() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
toString() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
toString() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
toString() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
 
toString() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
 
toString() - Method in class org.apache.any23.extractor.xpath.TemplateGraph
 
toString() - Method in class org.apache.any23.extractor.xpath.TemplateObject
 
toString() - Method in class org.apache.any23.extractor.xpath.TemplatePredicate
 
toString() - Method in class org.apache.any23.extractor.xpath.TemplateSubject
 
toString() - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
toString() - Method in class org.apache.any23.extractor.xpath.Term
 
toString() - Method in class org.apache.any23.mime.MIMEType
 
toString() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
 
toString() - Method in class org.apache.any23.validator.DefaultValidationReport
 
toString() - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
toString() - Method in class org.apache.any23.validator.EmptyValidationReport
 
toString() - Method in class org.apache.any23.validator.ValidationReport.Error
 
toString() - Method in class org.apache.any23.validator.ValidationReport.FixError
 
toString() - Method in class org.apache.any23.validator.ValidationReport.Issue
 
toString() - Method in class org.apache.any23.validator.ValidationReport.RuleActivation
 
toString() - Method in class org.apache.any23.validator.ValidationReport.RuleError
 
totalVotes - Variable in class org.apache.any23.vocab.REVIEW
Number of usefulness votes (integer).
toValue(String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Value.
toXSDDateTime(Date) - Static method in class org.apache.any23.rdf.RDFUtils
Prints a date to the XSD datetime format.
traceRuleActivation(Rule) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
traceRuleActivation(Rule) - Method in interface org.apache.any23.validator.ValidationReportBuilder
Traces that a rule has been applied.
trade - Variable in class org.apache.any23.vocab.HLISTING
 
transp - Variable in class org.apache.any23.vocab.ICAL
This URI defines whether an event is transparent or not to busy time searches. Value type: TEXT.
trigger - Variable in class org.apache.any23.vocab.ICAL
This URI specifies when an alarm will trigger. Default value type: DURATION.
triple(Resource, URI, Value) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Statement.
triple(String, String, String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a statement of type: toValue(s), toValue(p), toValue(o)
TripleHandler - Interface in org.apache.any23.writer
Defines a document based triple handler.
TripleHandlerException - Exception in org.apache.any23.writer
Defines a generic exception raised when accessing a TripleHandler.
TripleHandlerException(String) - Constructor for exception org.apache.any23.writer.TripleHandlerException
 
TripleHandlerException(String, Exception) - Constructor for exception org.apache.any23.writer.TripleHandlerException
 
TriXExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor to perform extraction on TriX documents.
TriXExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.TriXExtractor
Constructor that allows specifying the validation and error handling policies.
TriXExtractor() - Constructor for class org.apache.any23.extractor.rdf.TriXExtractor
Default constructor, with no verification of data types and no stop at first error.
TriXWriter - Class in org.apache.any23.writer
TriX format writer implementation.
TriXWriter(OutputStream) - Constructor for class org.apache.any23.writer.TriXWriter
 
TriXWriterFactory - Class in org.apache.any23.writer
 
TriXWriterFactory() - Constructor for class org.apache.any23.writer.TriXWriterFactory
 
TurtleExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor able to perform the extraction on Turtle documents.
TurtleExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.TurtleExtractor
Constructor that allows specifying the validation and error handling policies.
TurtleExtractor() - Constructor for class org.apache.any23.extractor.rdf.TurtleExtractor
Default constructor, with no verification of data types and no stop at first error.
TurtleHTMLExtractor - Class in org.apache.any23.extractor.html
Extractor for Turtle/N3 format embedded within HTML script tags.
TurtleHTMLExtractor() - Constructor for class org.apache.any23.extractor.html.TurtleHTMLExtractor
 
TurtleWriter - Class in org.apache.any23.writer
N3 notation writer.
TurtleWriter(OutputStream) - Constructor for class org.apache.any23.writer.TurtleWriter
Constructor.
TurtleWriterFactory - Class in org.apache.any23.writer
 
TurtleWriterFactory() - Constructor for class org.apache.any23.writer.TurtleWriterFactory
 
TYPE - Static variable in class org.apache.any23.vocab.OGP
The type of your object, e.g., video.movie.
type - Variable in class org.apache.any23.vocab.OGP
 
type - Variable in class org.apache.any23.vocab.REVIEW
The type of media of a work under review.
TYPE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
TYPEOF_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
tz - Variable in class org.apache.any23.vocab.VCARD
A timezone associated with a person.
tzid - Variable in class org.apache.any23.vocab.ICAL
Value type: TEXT. To specify the identifier for the time zone definition for a time component in the URI value. This URI specifies the text value that uniquely identifies the "VTIMEZONE" calendar component.
tzname - Variable in class org.apache.any23.vocab.ICAL
Value type: TEXT. This URI specifies the customary designation for a time zone description.
tzoffsetfrom - Variable in class org.apache.any23.vocab.ICAL
Value type: UTC-OFFSET. This URI specifies the offset which is in use prior to this time zone observance.
tzoffsetto - Variable in class org.apache.any23.vocab.ICAL
Value type: UTC-OFFSET. This URI specifies the offset which is in use in this time zone observance.
tzurl - Variable in class org.apache.any23.vocab.ICAL
The TZURL provides a means for a VTIMEZONE component to point to a network location that can be used to retrieve an up-to-date version of itself. Value type: URI.
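The toXSDDateTime(Date) entry in this section describes printing a date in the XSD datetime format. As a rough illustration of what such a lexical value looks like, the sketch below formats a java.util.Date with the standard java.time API. The helper name toXsdDateTime is hypothetical, and the choice of UTC with a trailing "Z" is an assumption of this sketch; the output of RDFUtils.toXSDDateTime may differ in time zone or precision.

```java
import java.time.format.DateTimeFormatter;
import java.util.Date;

public class XsdDateTimeSketch {

    /** Illustrative helper (not Any23 code): formats a Date as an xsd:dateTime value in UTC. */
    static String toXsdDateTime(Date date) {
        // ISO_INSTANT produces values such as 1970-01-01T00:00:00Z,
        // which is a valid xsd:dateTime lexical form.
        return DateTimeFormatter.ISO_INSTANT.format(date.toInstant());
    }

    public static void main(String[] args) {
        // The Unix epoch, used here because its formatted value is unambiguous.
        System.out.println(toXsdDateTime(new Date(0L))); // prints 1970-01-01T00:00:00Z
    }
}
```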

U

uid - Variable in class org.apache.any23.vocab.ICAL
This URI defines the persistent, globally unique identifier for the calendar component. Value type: TEXT.
uid - Variable in class org.apache.any23.vocab.VCARD
A UID of a person's vCard.
unblockContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
unblockDocument() - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
unlabeledAdr - Variable in class org.apache.any23.vocab.VCARD
An (explicitly) unlabeled address of a person.
unlabeledEmail - Variable in class org.apache.any23.vocab.VCARD
An (explicitly) unlabeled email address of a person.
unlabeledTel - Variable in class org.apache.any23.vocab.VCARD
An (explicitly) unlabeled phone number of a person.
until - Variable in class org.apache.any23.vocab.ICAL
 
updateURIMapping(Node) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Updates the URI mapping with the XMLNS attributes declared in the current node.
updateVocabulary(Node) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Updates the vocabulary context with possible @vocab declarations.
uri(String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a URI.
uri(String, String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a URI.
URI_PATH_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
URI_PREFIX_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
URI_SCHEMA_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
URIListWriter - Class in org.apache.any23.writer
This writer simply produces a list of the unique URIs present in the subject or in the object of every extracted RDF Statement.
URIListWriter(OutputStream) - Constructor for class org.apache.any23.writer.URIListWriter
 
URIListWriterFactory - Class in org.apache.any23.writer
 
URIListWriterFactory() - Constructor for class org.apache.any23.writer.URIListWriterFactory
 
url - Variable in class org.apache.any23.vocab.ICAL
This URI defines a Uniform Resource Locator (URL) associated with the iCalendar object. Value type: URI.
URL - Static variable in class org.apache.any23.vocab.OGP
An image URL which should represent your object within the graph.
url - Variable in class org.apache.any23.vocab.OGP
 
url - Variable in class org.apache.any23.vocab.VCARD
A URL associated with a person.
URL_DOCUMENT_RE - Static variable in class org.apache.any23.cli.MimeDetector
 
URLUtils - Class in org.apache.any23.util
Contains general utility functions for handling URLs.
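The URIListWriter entry in this section describes a writer that lists the unique URIs found in the subject or object position of the extracted statements. The idea can be sketched without any RDF library, as below. The Statement class here is a hypothetical stand-in (it assumes subjects are always URIs and marks URI objects with a boolean); it is not the Sesame Statement type that Any23 uses, and the sketch says nothing about how URIListWriter is actually implemented.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

public class UriListSketch {

    /** Hypothetical stand-in for an RDF statement; the object may be a URI or a literal. */
    static final class Statement {
        final String subject;
        final String predicate;
        final String object;
        final boolean objectIsUri;

        Statement(String subject, String predicate, String object, boolean objectIsUri) {
            this.subject = subject;
            this.predicate = predicate;
            this.object = object;
            this.objectIsUri = objectIsUri;
        }
    }

    /** Collects each subject and URI object once, in order of first appearance. */
    static List<String> uniqueUris(List<Statement> statements) {
        Set<String> seen = new LinkedHashSet<>();
        for (Statement st : statements) {
            seen.add(st.subject);
            if (st.objectIsUri) {
                seen.add(st.object); // literals are skipped
            }
        }
        return new ArrayList<>(seen);
    }

    public static void main(String[] args) {
        List<Statement> statements = Arrays.asList(
            new Statement("http://example.org/a", "http://example.org/knows", "http://example.org/b", true),
            new Statement("http://example.org/a", "http://example.org/name", "Alice", false),
            new Statement("http://example.org/b", "http://example.org/knows", "http://example.org/a", true));
        for (String uri : uniqueUris(statements)) {
            System.out.println(uri);
        }
    }
}
```

A LinkedHashSet is used so that duplicates are dropped while the first-seen order is preserved; predicates are deliberately not collected, matching the entry's wording.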

V

Valarm - Variable in class org.apache.any23.vocab.ICAL
Provides a grouping of component properties that define an alarm.
validate(DOMDocument, boolean) - Method in class org.apache.any23.validator.DefaultValidator
 
validate(URI, Document, boolean) - Method in class org.apache.any23.validator.DefaultValidator
 
validate(DOMDocument, boolean) - Method in interface org.apache.any23.validator.Validator
Performs validation and fixing of the provided document.
validate(URI, Document, boolean) - Method in interface org.apache.any23.validator.Validator
Performs validation and fixing of the provided document.
ValidationReport - Interface in org.apache.any23.validator
This class contains the report of a validation performed by the Validator class.
ValidationReport.Error - Class in org.apache.any23.validator
An error occurred while performing the validation process.
ValidationReport.Error(Exception, String) - Constructor for class org.apache.any23.validator.ValidationReport.Error
 
ValidationReport.FixError - Class in org.apache.any23.validator
An error occurred while executing a fix.
ValidationReport.FixError(Fix, Exception, String) - Constructor for class org.apache.any23.validator.ValidationReport.FixError
 
ValidationReport.Issue - Class in org.apache.any23.validator
An issue found during the validation process.
ValidationReport.Issue(ValidationReport.IssueLevel, String, Node) - Constructor for class org.apache.any23.validator.ValidationReport.Issue
 
ValidationReport.IssueLevel - Enum in org.apache.any23.validator
Defines the different issue levels.
ValidationReport.RuleActivation - Class in org.apache.any23.validator
This class describes the activation of a rule.
ValidationReport.RuleActivation(Rule) - Constructor for class org.apache.any23.validator.ValidationReport.RuleActivation
 
ValidationReport.RuleError - Class in org.apache.any23.validator
An error occurred while executing a rule.
ValidationReport.RuleError(Rule, Exception, String) - Constructor for class org.apache.any23.validator.ValidationReport.RuleError
 
ValidationReportBuilder - Interface in org.apache.any23.validator
The report interface is used to generate diagnostics about validation.
ValidationReportSerializer - Interface in org.apache.any23.validator
Defines a serializer for validation reports.
Validator - Interface in org.apache.any23.validator
The validator performs validation and correction of HTML Document instances.
ValidatorException - Exception in org.apache.any23.validator
Models any exception raised by the Validator.
ValidatorException(String) - Constructor for exception org.apache.any23.validator.ValidatorException
 
ValidatorException(String, Throwable) - Constructor for exception org.apache.any23.validator.ValidatorException
 
value() - Method in class org.apache.any23.extractor.html.HTMLDocument.TextField
 
Value_CAL_ADDRESS - Variable in class org.apache.any23.vocab.ICAL
 
Value_DATE - Variable in class org.apache.any23.vocab.ICAL
 
Value_DURATION - Variable in class org.apache.any23.vocab.ICAL
 
Value_PERIOD - Variable in class org.apache.any23.vocab.ICAL
 
Value_RECUR - Variable in class org.apache.any23.vocab.ICAL
 
valueFactory - Variable in class org.apache.any23.extractor.html.MicroformatExtractor
 
valueOf(String) - Static method in enum org.apache.any23.extractor.ExtractionParameters.ValidationMode
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.any23.extractor.IssueReport.IssueLevel
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.any23.extractor.microdata.ItemPropValue.Type
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.any23.extractor.xpath.TemplateObject.Type
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.any23.extractor.xpath.TemplateSubject.Type
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.any23.validator.ValidationReport.IssueLevel
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.any23.vocab.RDFSchemaUtils.VocabularyFormat
Returns the enum constant of this type with the specified name.
values() - Static method in enum org.apache.any23.extractor.ExtractionParameters.ValidationMode
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.any23.extractor.IssueReport.IssueLevel
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.any23.extractor.microdata.ItemPropValue.Type
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.any23.extractor.xpath.TemplateObject.Type
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.any23.extractor.xpath.TemplateSubject.Type
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.any23.validator.ValidationReport.IssueLevel
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.any23.vocab.RDFSchemaUtils.VocabularyFormat
Returns an array containing the constants of this enum type, in the order they are declared.
Variable - Class in org.apache.any23.extractor.xpath
Represents a variable in TemplateXPathExtractionRule.
Variable(String, String) - Constructor for class org.apache.any23.extractor.xpath.Variable
 
Vcalendar - Variable in class org.apache.any23.vocab.ICAL
 
VCARD - Class in org.apache.any23.vocab
Vocabulary definitions from vcard.owl
VCARD() - Constructor for class org.apache.any23.vocab.VCARD
 
VCard - Variable in class org.apache.any23.vocab.VCARD
Resources that are vCards
VERSION - Static variable in class org.apache.any23.Any23
Any23 core library version.
version - Variable in class org.apache.any23.vocab.ICAL
value type: TEXT. This URI specifies the identifier corresponding to the highest version number or the minimum and maximum range of the iCalendar specification that is required in order to interpret the iCalendar object.
Vevent - Variable in class org.apache.any23.vocab.ICAL
Provide a grouping of component properties that describe an event.
Vfreebusy - Variable in class org.apache.any23.vocab.ICAL
Provide a grouping of component properties that describe either a request for free/busy time, a response to a request for free/busy time, or a published set of busy time.
VIDEO - Static variable in class org.apache.any23.vocab.OGP
A URL to a video file that complements this object.
video - Variable in class org.apache.any23.vocab.OGP
 
VIDEO__HEIGHT - Static variable in class org.apache.any23.vocab.OGP
The number of pixels high.
VIDEO__SECURE_URL - Static variable in class org.apache.any23.vocab.OGP
An alternate URL to use if the webpage requires HTTPS.
VIDEO__TYPE - Static variable in class org.apache.any23.vocab.OGP
A MIME type for this video.
VIDEO__URL - Static variable in class org.apache.any23.vocab.OGP
Video URL.
VIDEO__WIDTH - Static variable in class org.apache.any23.vocab.OGP
The number of pixels wide.
videoHeight - Variable in class org.apache.any23.vocab.OGP
 
videoSecureURL - Variable in class org.apache.any23.vocab.OGP
 
videoType - Variable in class org.apache.any23.vocab.OGP
 
videoURL - Variable in class org.apache.any23.vocab.OGP
 
videoWidth - Variable in class org.apache.any23.vocab.OGP
 
visit(Page) - Method in class org.apache.any23.plugin.crawler.DefaultWebCrawler
Override this method to implement the single page processing logic.
visitedPage(Page) - Method in interface org.apache.any23.plugin.crawler.CrawlerListener
Notifies the listener that a page has been discovered.
Vjournal - Variable in class org.apache.any23.vocab.ICAL
Provide a grouping of component properties that describe a journal entry.
VOCAB_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
VocabPrinter - Class in org.apache.any23.cli
Prints out the vocabulary RDFSchema as NQuads.
VocabPrinter() - Constructor for class org.apache.any23.cli.VocabPrinter
 
VocabPrinter.VocabularyFormatConverter - Class in org.apache.any23.cli
 
VocabPrinter.VocabularyFormatConverter() - Constructor for class org.apache.any23.cli.VocabPrinter.VocabularyFormatConverter
 
Vocabulary - Class in org.apache.any23.vocab
Base class for the definition of a vocabulary.
Vocabulary(String) - Constructor for class org.apache.any23.vocab.Vocabulary
Constructor.
Vtimezone - Variable in class org.apache.any23.vocab.ICAL
Provide a grouping of component properties that define a time zone.
Vtodo - Variable in class org.apache.any23.vocab.ICAL
Provide a grouping of calendar properties that describe a to-do.

W

wanted - Variable in class org.apache.any23.vocab.HLISTING
 
weblog - Variable in class org.apache.any23.vocab.FOAF
 
WhiteSpacesPurifier - Class in org.apache.any23.mime.purifier
Implementation of Purifier that removes any blank characters at the head of a file that might prevent its MIME type detection.
WhiteSpacesPurifier() - Constructor for class org.apache.any23.mime.purifier.WhiteSpacesPurifier
 
WO - Class in org.apache.any23.vocab
This class models the BBC Wildlife Ontology.
workAdr - Variable in class org.apache.any23.vocab.VCARD
A work address of a person.
workEmail - Variable in class org.apache.any23.vocab.VCARD
A work email address of a person.
workInfoHomepage - Variable in class org.apache.any23.vocab.FOAF
 
workplaceHomepage - Variable in class org.apache.any23.vocab.FOAF
 
workTel - Variable in class org.apache.any23.vocab.VCARD
A work phone number of a person.
writeNamespace(String, String) - Method in interface org.apache.any23.extractor.ExtractionResult
Writes a namespace.
writeNamespace(String, String) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
WriterFactory - Interface in org.apache.any23.writer
 
WriterFactoryRegistry - Class in org.apache.any23.writer
Registry class for WriterFactorys.
WriterFactoryRegistry() - Constructor for class org.apache.any23.writer.WriterFactoryRegistry
 
writes - Variable in class org.apache.any23.vocab.DOAC
 
writeTriple(Resource, URI, Value, URI) - Method in interface org.apache.any23.extractor.ExtractionResult
Writes a triple.
writeTriple(Resource, URI, Value) - Method in interface org.apache.any23.extractor.ExtractionResult
Writes a triple.
writeTriple(Resource, URI, Value, URI) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
writeTriple(Resource, URI, Value) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 

X

X_ - Variable in class org.apache.any23.vocab.ICAL
value type: TEXT. This class of URI provides a framework for defining non-standard properties.
XFN - Class in org.apache.any23.vocab
Vocabulary class for XFN, as per Expressing XFN in RDF.
XFNExtractor - Class in org.apache.any23.extractor.html
Extractor for the XFN microformat.
XFNExtractor() - Constructor for class org.apache.any23.extractor.html.XFNExtractor
 
XHTML - Class in org.apache.any23.vocab
The XHTML vocabulary.
XML_LANG_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
XML_LITERAL_DATATYPE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
XMLNS_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
XMLNS_DEFAULT - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
 
XMLValidationReportSerializer - Class in org.apache.any23.validator
Default implementation of ValidationReportSerializer for XML.
XMLValidationReportSerializer() - Constructor for class org.apache.any23.validator.XMLValidationReportSerializer
 
XMLValidationReportSerializer.NodeName - Annotation Type in org.apache.any23.validator
Allows specifying a custom node name.
XPathExtractionRule - Interface in org.apache.any23.extractor.xpath
Defines an extraction rule for the XPathExtractor.
XPathExtractor - Class in org.apache.any23.extractor.xpath
Implementation of an Extractor.TagSoupDOMExtractor able to apply XPathExtractionRules and generate quads.
XPathExtractor(List<XPathExtractionRule>) - Constructor for class org.apache.any23.extractor.xpath.XPathExtractor
 
xsltFilename - Static variable in class org.apache.any23.extractor.rdfa.RDFaExtractor
 
XSLTStylesheet - Class in org.apache.any23.extractor.rdfa
An XSLT stylesheet loaded from an InputStream; it can be applied to DOM trees and writes the result to a Writer.
XSLTStylesheet(InputStream) - Constructor for class org.apache.any23.extractor.rdfa.XSLTStylesheet
 
XSLTStylesheetException - Exception in org.apache.any23.extractor.rdfa
Defines an exception raised by the XSLTStylesheet class.
XSLTStylesheetException(String) - Constructor for exception org.apache.any23.extractor.rdfa.XSLTStylesheetException
 
XSLTStylesheetException(String, Throwable) - Constructor for exception org.apache.any23.extractor.rdfa.XSLTStylesheetException
 

Y

yahooChatID - Variable in class org.apache.any23.vocab.FOAF
 
yield - Variable in class org.apache.any23.vocab.HRECIPE
 

Copyright © 2010-2012 The Apache Software Foundation. All Rights Reserved.