- abbreviate(String) - Method in class org.apache.any23.rdf.Prefixes
-
- ABOUT_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- AboutNotURIRule - Class in org.apache.any23.validator.rule
-
This rule is able to detect whether an about value is a valid URL
or otherwise is a valid relative URL.
- AboutNotURIRule() - Constructor for class org.apache.any23.validator.rule.AboutNotURIRule
-
- AcceptHeaderBuilder - Class in org.apache.any23.http
-
Concatenates a collection of MIME specs in "type/subtype;q=x.x" notation
into an HTTP Accept header value, and removes duplicates and types
covered by wildcards.
- AcceptHeaderBuilder(Collection<MIMEType>) - Constructor for class org.apache.any23.http.AcceptHeaderBuilder
-
- acceptURI(URI) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
-
- acceptURI(URI) - Method in interface org.apache.any23.extractor.xpath.XPathExtractionRule
-
Checks if the rule can be applied on the given document URI.
- accountName - Variable in class org.apache.any23.vocab.FOAF
-
- accountServiceHomepage - Variable in class org.apache.any23.vocab.FOAF
-
- acquaintance - Variable in class org.apache.any23.vocab.XFN
-
- acquireProperty(ItemProp) - Method in class org.apache.any23.extractor.microdata.ItemScope
-
- action - Variable in class org.apache.any23.vocab.HLISTING
-
- action - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThis URI defines the action to be invoked when an alarm
is triggered..
- add(Variable) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
-
Adds a variable to the template.
- add(QuadTemplate) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
-
- add(Variable) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
-
- add(QuadTemplate) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
-
- add(XPathExtractionRule) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
-
- add(String, String) - Method in class org.apache.any23.rdf.Prefixes
-
- add(Prefixes) - Method in class org.apache.any23.rdf.Prefixes
-
- addAliasMediaType(String) - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator.VariantSpec
-
- addAttribute(String, String, String) - Method in class org.apache.any23.validator.DefaultDOMDocument
-
- addAttribute(String, String, String) - Method in interface org.apache.any23.validator.DOMDocument
-
Adds an attribute to a node addressed by the given XPath.
- addBNodeProperty(Node, Resource, URI, BNode) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Helper method that adds a BNode property to a node.
- addBNodeProperty(Resource, URI, BNode) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Helper method that adds a BNode property to a node.
- addChild(TripleHandler) - Method in class org.apache.any23.writer.CompositeTripleHandler
-
Adds a decorated handler.
- ADDITIONAL_NAME - Static variable in class org.apache.any23.extractor.html.HCardName
-
- additional_name - Variable in class org.apache.any23.vocab.VCARD
-
An additional part of a person's name.
- addListener(CrawlerListener) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
- addPropertyPath(Class<? extends MicroformatExtractor>, Resource, Resource, BNode, String[]) - Method in class org.apache.any23.extractor.ExtractionResultImpl
-
- addPropertyPath(Class<? extends MicroformatExtractor>, Resource, Resource, BNode, String[]) - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
-
Adds a property path to the list of the extracted data.
- addResourceRoot(String[], Resource, Class<? extends MicroformatExtractor>) - Method in class org.apache.any23.extractor.ExtractionResultImpl
-
- addResourceRoot(String[], Resource, Class<? extends MicroformatExtractor>) - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
-
Adds a root property to the extraction result, specifying also
the path corresponding to the root of data which generated the property
and the extractor responsible for such addition.
- Address - Variable in class org.apache.any23.vocab.VCARD
-
Resources that are vCard (postal) addresses.
- addressType - Variable in class org.apache.any23.vocab.VCARD
-
- addRule(Class<? extends Rule>, Class<? extends Fix>) - Method in class org.apache.any23.validator.DefaultValidator
-
- addRule(Class<? extends Rule>) - Method in class org.apache.any23.validator.DefaultValidator
-
- addRule(Class<? extends Rule>) - Method in interface org.apache.any23.validator.Validator
-
Allows to register a new rule to this validator
- addRule(Class<? extends Rule>, Class<? extends Fix>) - Method in interface org.apache.any23.validator.Validator
-
Allows to register a new rule to this validator and associating it to a fix.
- addTextExtractor(String, URI, BoilerpipeExtractor) - Method in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
-
- addURIProperty(Resource, URI, URI) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Helper method that adds a URI property to a node.
- addUserAgentOverride(Pattern, String, String) - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator
-
Overrides the Accept header for certain user agents.
- addVariant(String) - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator
-
- addVolatile(String, String) - Method in class org.apache.any23.rdf.Prefixes
-
- addVolatile(Prefixes) - Method in class org.apache.any23.rdf.Prefixes
-
- adr - Variable in class org.apache.any23.vocab.VCARD
-
A postal or street address of a person.
- AdrExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the
adr
microformat.
- AdrExtractor() - Constructor for class org.apache.any23.extractor.html.AdrExtractor
-
- affiliation - Variable in class org.apache.any23.vocab.DOAC
-
- Agent - Variable in class org.apache.any23.vocab.FOAF
-
- agent - Variable in class org.apache.any23.vocab.VCARD
-
A person that acts as one's agent.
- aimChatID - Variable in class org.apache.any23.vocab.FOAF
-
- allExtractorsSupportAllContentTypes() - Method in class org.apache.any23.extractor.ExtractorGroup
-
- allPrefixes() - Method in class org.apache.any23.rdf.Prefixes
-
- alternate - Variable in class org.apache.any23.vocab.XHTML
-
- altrep - Variable in class org.apache.any23.vocab.ICAL
-
To specify an alternate text representation for the URI value..
- announce - Variable in class org.apache.any23.vocab.HLISTING
-
- Any23 - Class in org.apache.any23
-
A facade with convenience methods for typical Any23 extraction
operations.
- Any23(Configuration, ExtractorGroup) - Constructor for class org.apache.any23.Any23
-
Constructor that allows the specification of a
custom configuration and of a list of extractors.
- Any23(ExtractorGroup) - Constructor for class org.apache.any23.Any23
-
Constructor that allows the specification of a list of extractors.
- Any23(Configuration, String...) - Constructor for class org.apache.any23.Any23
-
Constructor that allows the specification of a
custom configuration and of list of extractor names.
- Any23(String...) - Constructor for class org.apache.any23.Any23
-
Constructor that allows the specification of a list of extractor names.
- Any23(Configuration) - Constructor for class org.apache.any23.Any23
-
- Any23() - Constructor for class org.apache.any23.Any23
-
Constructor with default configuration.
- Any23Negotiator - Class in org.apache.any23.servlet.conneg
-
- Any23Negotiator() - Constructor for class org.apache.any23.servlet.conneg.Any23Negotiator
-
- Any23PluginManager - Class in org.apache.any23.plugin
-
The Any23PluginManager is responsible for inspecting
dynamically the classpath and retrieving useful classes.
- Any23ValueFactoryWrapper - Class in org.apache.any23.rdf
-
Any23 specialization of the ValueFactory
.
- Any23ValueFactoryWrapper(ValueFactory, IssueReport, String) - Constructor for class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
Constructor with error reporter.
- Any23ValueFactoryWrapper(ValueFactory, IssueReport) - Constructor for class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- Any23ValueFactoryWrapper(ValueFactory) - Constructor for class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.AboutNotURIRule
-
- applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in interface org.apache.any23.validator.Rule
-
Applies this rule to the given document.
- applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.MetaNameMisuseRule
-
- applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.MissingOpenGraphNamespaceRule
-
- applyTo(Document, Writer) - Method in class org.apache.any23.extractor.rdfa.XSLTStylesheet
-
Applies the XSLT transformation
- applyTo(Document, Writer, Map<String, String>) - Method in class org.apache.any23.extractor.rdfa.XSLTStylesheet
-
Applies the XSLT transformation
- asLines(InputStream) - Static method in class org.apache.any23.util.StreamUtils
-
Returns all the lines read from an input stream.
- asMap() - Method in class org.apache.any23.rdf.Prefixes
-
- asString(InputStream, boolean) - Static method in class org.apache.any23.util.StreamUtils
-
Returns the string content of a stream.
- asString(InputStream) - Static method in class org.apache.any23.util.StreamUtils
-
Returns the string content of a stream, new line chars will be removed.
- attach - Variable in class org.apache.any23.vocab.ICAL
-
The URI provides the capability to associate a document object with a
calendar component.default value type: URI.
- attendee - Variable in class org.apache.any23.vocab.ICAL
-
The URI defines an "Attendee" within a calendar component.value type:
CAL-ADDRESS.
- ATTRIBUTE_CSS - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- AUDIO - Static variable in class org.apache.any23.vocab.OGP
-
A URL to an audio file to accompany this object.
- audio - Variable in class org.apache.any23.vocab.OGP
-
- AUDIO__SECURE_URL - Static variable in class org.apache.any23.vocab.OGP
-
An alternate url to use if the webpage requires HTTPS.
- AUDIO__TYPE - Static variable in class org.apache.any23.vocab.OGP
-
A MIME type for this audio.
- audioSecureURL - Variable in class org.apache.any23.vocab.OGP
-
- audioType - Variable in class org.apache.any23.vocab.OGP
-
- Author - Annotation Type in org.apache.any23.plugin
-
Describes the author of the plugin.
- author - Variable in class org.apache.any23.vocab.HRECIPE
-
- available() - Method in class org.apache.any23.util.ReaderInputStream
-
- calAddress - Variable in class org.apache.any23.vocab.ICAL
-
- calscale - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThis URI defines the calendar scale used for the calendar
information specified in the iCalendar object..
- canAbbreviate(String) - Method in class org.apache.any23.rdf.Prefixes
-
- canExpand(String) - Method in class org.apache.any23.rdf.Prefixes
-
- categories - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThis URI defines the categories for a calendar component..
- category - Variable in class org.apache.any23.vocab.VCARD
-
A category of a vCard.
- CELL - Static variable in class org.apache.any23.vocab.Excel
-
- cell - Variable in class org.apache.any23.vocab.Excel
-
This resource identifies a cell.
- CELL_VALUE - Static variable in class org.apache.any23.vocab.Excel
-
- cellValue - Variable in class org.apache.any23.vocab.Excel
-
This property links the identifier of a cell to the content of the cell.
- checkCSVFormat(InputStream) - Static method in class org.apache.any23.mime.TikaMIMETypeDetector
-
Checks if the stream contains a valid CSV pattern.
- checkN3Format(InputStream) - Static method in class org.apache.any23.mime.TikaMIMETypeDetector
-
Checks if the stream contains the N3 triple patterns.
- checkNQuadsFormat(InputStream) - Static method in class org.apache.any23.mime.TikaMIMETypeDetector
-
Checks if the stream contains the NQuads patterns.
- checkTurtleFormat(InputStream) - Static method in class org.apache.any23.mime.TikaMIMETypeDetector
-
Checks if the stream contains Turtle triple patterns.
- child - Variable in class org.apache.any23.vocab.XFN
-
- class_ - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThis URI defines the access classification for a calendar
component..
- class_ - Variable in class org.apache.any23.vocab.VCARD
-
A class (e.g., public, private, etc.) of a vCard.
- classClass - Variable in class org.apache.any23.vocab.WO
-
- clazz - Variable in class org.apache.any23.vocab.WO
-
- clazzName - Variable in class org.apache.any23.vocab.WO
-
- CLI_PACKAGE - Static variable in class org.apache.any23.plugin.Any23PluginManager
-
Any23 Command Line Interface package.
- close() - Method in class org.apache.any23.cli.Rover
-
- close() - Method in interface org.apache.any23.extractor.ExtractionResult
-
Close the result.
- close() - Method in class org.apache.any23.extractor.ExtractionResultImpl
-
- close() - Method in class org.apache.any23.filter.ExtractionContextBlocker
-
- close() - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
-
- close() - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
-
- close() - Method in class org.apache.any23.http.DefaultHTTPClient
-
Shuts down the connection manager.
- close() - Method in interface org.apache.any23.http.HTTPClient
-
Release all static resources help by the instance.
- close() - Method in class org.apache.any23.util.ReaderInputStream
-
Closes the Stringreader.
- close() - Method in class org.apache.any23.writer.BenchmarkTripleHandler
-
- close() - Method in class org.apache.any23.writer.CompositeTripleHandler
-
- close() - Method in class org.apache.any23.writer.CountingTripleHandler
-
- close() - Method in class org.apache.any23.writer.JSONWriter
-
- close() - Method in class org.apache.any23.writer.LoggingTripleHandler
-
- close() - Method in class org.apache.any23.writer.RDFWriterTripleHandler
-
- close() - Method in class org.apache.any23.writer.ReportingTripleHandler
-
- close() - Method in class org.apache.any23.writer.RepositoryWriter
-
- close() - Method in interface org.apache.any23.writer.TripleHandler
-
Will be called last and exactly once.
- close() - Method in class org.apache.any23.writer.URIListWriter
-
- closeContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
-
- closeContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
-
- closeContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
-
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
-
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
-
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
-
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.JSONWriter
-
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
-
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
-
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
-
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
-
- closeContext(ExtractionContext) - Method in interface org.apache.any23.writer.TripleHandler
-
Informs the handler that no more triples will come from a
previously opened context.
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.URIListWriter
-
- closeGracefully(Closeable) - Static method in class org.apache.any23.util.StreamUtils
-
Closes the closable interface and reports error if any.
- cn - Variable in class org.apache.any23.vocab.ICAL
-
To specify the common name to be associated with the calendar user specified
by the URI..
- colleague - Variable in class org.apache.any23.vocab.XFN
-
- COLUMN_POSITION - Static variable in class org.apache.any23.vocab.CSV
-
- columnPosition - Variable in class org.apache.any23.vocab.CSV
-
This property expresses the index of a column in a CSV file.
- comment - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThis URI specifies non-processing information intended
to provide a comment to the calendar user..
- Comment - Variable in class org.apache.any23.vocab.REVIEW
-
A comment on a review.
- commenter - Variable in class org.apache.any23.vocab.REVIEW
-
The commenter on the review.
- compareTo(MIMEType) - Method in class org.apache.any23.mime.MIMEType
-
- completed - Variable in class org.apache.any23.vocab.ICAL
-
value type: DATE-TIMEThis URI defines the date and time that a to-do
was actually completed..
- component - Variable in class org.apache.any23.vocab.ICAL
-
- CompositeTripleHandler - Class in org.apache.any23.writer
-
A
TripleHandler
multi decorator, that wraps zero or more
other triple handlers and dispatches all events to each of them.
- CompositeTripleHandler() - Constructor for class org.apache.any23.writer.CompositeTripleHandler
-
Constructor with empty decorated list.
- CompositeTripleHandler(Collection<TripleHandler>) - Constructor for class org.apache.any23.writer.CompositeTripleHandler
-
Constructor with initial list of decorated handlers.
- ComputerSkill - Variable in class org.apache.any23.vocab.DOAC
-
- conditionallyAddLiteralProperty(Node, Resource, URI, Literal) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Helper method that adds a literal property to a node.
- conditionallyAddResourceProperty(Resource, URI, URI) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Helper method that adds a URI property to a node.
- conditionallyAddStringProperty(Node, Resource, URI, String) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Helper method that adds a literal property to a subject only if the value of the property
is a valid string.
- Configuration - Interface in org.apache.any23.configuration
-
Defines the main Any23 configuration.
- configure() - Method in class org.apache.any23.cli.Rover
-
- configureExtractors(ExtractorGroup, File...) - Method in class org.apache.any23.plugin.Any23PluginManager
-
Configures a new list of extractors containing the extractors declared in initialExtractorGroup
and also the extractors detected in classpath specified by pluginLocations
.
- configureExtractors(ExtractorGroup) - Method in class org.apache.any23.plugin.Any23PluginManager
-
Configures a new list of extractors containing the extractors declared in initialExtractorGroup
and also the extractors detected in classpath specified by the default configuration.
- contact - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThe URI is used to represent contact information or alternately
a reference to contact information associated with the calendar component..
- contact - Variable in class org.apache.any23.vocab.XFN
-
- contains(XPathExtractionRule) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
-
- CONTAINS_CELL - Static variable in class org.apache.any23.vocab.Excel
-
- CONTAINS_ROW - Static variable in class org.apache.any23.vocab.Excel
-
- CONTAINS_SHEET - Static variable in class org.apache.any23.vocab.Excel
-
- containsCell - Variable in class org.apache.any23.vocab.Excel
-
This property links the identifier of a row to the identifier of a cell.
- containsField(String) - Method in class org.apache.any23.extractor.html.HCardName
-
- containsRow - Variable in class org.apache.any23.vocab.Excel
-
This property links the identifier of a sheet to the identifier of a row.
- containsSheet - Variable in class org.apache.any23.vocab.Excel
-
This property links the identifier of a document to the identifier of a sheet.
- CONTENT_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- ContentTypeNegotiator - Class in org.apache.any23.servlet.conneg
-
This class defines a negotiator for content types based on scoring.
- ContentTypeNegotiator() - Constructor for class org.apache.any23.servlet.conneg.ContentTypeNegotiator
-
- ContentTypeNegotiator.VariantSpec - Class in org.apache.any23.servlet.conneg
-
- ContentTypeNegotiator.VariantSpec(String) - Constructor for class org.apache.any23.servlet.conneg.ContentTypeNegotiator.VariantSpec
-
- convert(String) - Method in class org.apache.any23.cli.Crawler.PatterConverter
-
- convert(String) - Method in class org.apache.any23.cli.MicrodataParser.MicrodataParserDocumentSourceConverter
-
- convert(String) - Method in class org.apache.any23.cli.MimeDetector.MimeDetectorDocumentSourceConverter
-
- convert(String) - Method in class org.apache.any23.cli.Rover.ArgumentToURIConverter
-
- convert(String) - Method in class org.apache.any23.cli.Rover.PrintStreamConverter
-
- convert(String) - Method in class org.apache.any23.cli.VocabPrinter.VocabularyFormatConverter
-
- copy() - Static method in class org.apache.any23.configuration.DefaultConfiguration
-
- copy(String) - Method in class org.apache.any23.extractor.ExtractionContext
-
- coResident - Variable in class org.apache.any23.vocab.XFN
-
- count - Variable in class org.apache.any23.vocab.ICAL
-
- CountingTripleHandler - Class in org.apache.any23.writer
-
A simple
TripleHandler
that merely counts the number
of triples it has received.
- CountingTripleHandler(boolean) - Constructor for class org.apache.any23.writer.CountingTripleHandler
-
- CountingTripleHandler() - Constructor for class org.apache.any23.writer.CountingTripleHandler
-
- countNL(String) - Static method in class org.apache.any23.util.StringUtils
-
Counts the number of NL
in the given in string.
- countOccurrences(String, String) - Static method in class org.apache.any23.util.StringUtils
-
Counts how many times content
appears within container
without string overlapping.
- country_name - Variable in class org.apache.any23.vocab.VCARD
-
The country of a postal address.
- countryName - Variable in class org.apache.any23.vocab.HLISTING
-
- Course - Variable in class org.apache.any23.vocab.DOAC
-
- coWorker - Variable in class org.apache.any23.vocab.XFN
-
- cp(InputStream, File) - Static method in class org.apache.any23.util.FileUtils
-
Copies the content of the input stream within the given dest file.
- cp(File, File) - Static method in class org.apache.any23.util.FileUtils
-
Copies a file src
to the dest
.
- Crawler - Class in org.apache.any23.cli
-
Implementation of a
CLI crawler based on
Rover
.
- Crawler() - Constructor for class org.apache.any23.cli.Crawler
-
- Crawler.PatterConverter - Class in org.apache.any23.cli
-
- Crawler.PatterConverter() - Constructor for class org.apache.any23.cli.Crawler.PatterConverter
-
- CrawlerListener - Interface in org.apache.any23.plugin.crawler
-
- create(String, Prefixes, Collection<String>, String, Class<S>) - Static method in class org.apache.any23.extractor.SimpleExtractorFactory
-
- create1(String, String) - Static method in class org.apache.any23.rdf.Prefixes
-
- createBNode() - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createBNode(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createClass(String, String) - Method in class org.apache.any23.vocab.Vocabulary
-
- created - Variable in class org.apache.any23.vocab.ICAL
-
This URI specifies the date and time that the calendar information was
created by the calendar user agent in the calendar store.
- createDocumentSource(String) - Method in class org.apache.any23.Any23
-
- createExtractor() - Method in interface org.apache.any23.extractor.ExtractorFactory
-
Creates an extractor instance.
- createExtractor() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
-
- createFromMap(Map<String, String>, boolean) - Static method in class org.apache.any23.rdf.Prefixes
-
- createHTTPDocumentSource(HTTPClient, String) - Method in class org.apache.any23.servlet.Servlet
-
- createInitializedHTTPClient() - Static method in class org.apache.any23.http.DefaultHTTPClient
-
- createLiteral(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createLiteral(boolean) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createLiteral(byte) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createLiteral(short) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createLiteral(int) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createLiteral(long) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createLiteral(float) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createLiteral(double) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createLiteral(XMLGregorianCalendar) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createLiteral(String, String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createLiteral(String, URI) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createLocalCopy(DocumentSource) - Method in interface org.apache.any23.source.LocalCopyFactory
-
- createLocalCopy(DocumentSource) - Method in class org.apache.any23.source.MemCopyFactory
-
- createProperty(String) - Method in class org.apache.any23.vocab.CSV
-
- createProperty(String) - Method in class org.apache.any23.vocab.Excel
-
- createProperty(String, String) - Method in class org.apache.any23.vocab.Vocabulary
-
- createResource(String) - Method in class org.apache.any23.vocab.CSV
-
- createResource(String) - Method in class org.apache.any23.vocab.Excel
-
- createStatement(Resource, URI, Value) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createStatement(Resource, URI, Value, Resource) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createSubset(String...) - Static method in class org.apache.any23.rdf.PopularPrefixes
-
This method perform a prefix lookup.
- createSubset(String...) - Method in class org.apache.any23.rdf.Prefixes
-
- createURI(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createURI(String, String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- createURI(String) - Method in class org.apache.any23.vocab.Vocabulary
-
Creates a URI.
- creator - Variable in class org.apache.any23.vocab.DCTERMS
-
- crush - Variable in class org.apache.any23.vocab.XFN
-
- CSV - Class in org.apache.any23.vocab
-
This vocabulary models the structure of a
CSV file
according the
RFC 4180.
- CSV_MIMETYPE - Static variable in class org.apache.any23.mime.TikaMIMETypeDetector
-
- CSVExtractor - Class in org.apache.any23.extractor.csv
-
This extractor produces RDF from a CSV file .
- CSVExtractor() - Constructor for class org.apache.any23.extractor.csv.CSVExtractor
-
- CSVReaderBuilder - Class in org.apache.any23.extractor.csv
-
This class is responsible to build a reader first guessing the configuration
from the file it self and then, if not successful, from the
DefaultConfiguration
.
- CSVReaderBuilder() - Constructor for class org.apache.any23.extractor.csv.CSVReaderBuilder
-
- CURIE_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- currentProject - Variable in class org.apache.any23.vocab.FOAF
-
- cutype - Variable in class org.apache.any23.vocab.ICAL
-
To specify the type of calendar user specified by the URI..
- DATATYPE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- date - Variable in class org.apache.any23.vocab.DCTERMS
-
- DATE - Static variable in class org.apache.any23.vocab.SINDICE
-
- date - Variable in class org.apache.any23.vocab.SINDICE
-
Date meta property indicating the data generation time.
- date - Variable in class org.apache.any23.vocab.XFN
-
- daylight - Variable in class org.apache.any23.vocab.ICAL
-
- DCTERMS - Class in org.apache.any23.vocab
-
The DCTERMS vocabulary.
- DEFAULT_BASE_URI - Static variable in class org.apache.any23.servlet.Servlet
-
- DEFAULT_CONFIG_FILE - Static variable in class org.apache.any23.configuration.DefaultConfiguration
-
Default configuration file.
- DEFAULT_HTTP_CLIENT_USER_AGENT - Static variable in class org.apache.any23.Any23
-
Default HTTP User Agent defined in default configuration.
- DEFAULT_NUM_OF_CRAWLERS - Static variable in class org.apache.any23.plugin.crawler.SiteCrawler
-
Default number of crawler instances.
- DEFAULT_PAGE_FILTER_RE - Static variable in class org.apache.any23.plugin.crawler.SiteCrawler
-
- DEFAULT_PLUGIN_DIR - Static variable in class org.apache.any23.cli.ToolRunner
-
- DEFAULT_WEB_CRAWLER - Static variable in class org.apache.any23.plugin.crawler.SiteCrawler
-
Default crawler implementation.
- DefaultConfiguration - Class in org.apache.any23.configuration
-
- DefaultConfiguration(Properties) - Constructor for class org.apache.any23.configuration.DefaultConfiguration
-
- DefaultDOMDocument - Class in org.apache.any23.validator
-
This class wraps the DOM document.
- DefaultDOMDocument(URI, Document) - Constructor for class org.apache.any23.validator.DefaultDOMDocument
-
- defaultFilters - Variable in class org.apache.any23.plugin.crawler.SiteCrawler
-
Default filter applied to skip contents.
- DefaultHTTPClient - Class in org.apache.any23.http
-
- DefaultHTTPClient() - Constructor for class org.apache.any23.http.DefaultHTTPClient
-
- DefaultHTTPClientConfiguration - Class in org.apache.any23.http
-
- DefaultHTTPClientConfiguration(String, int, int, String) - Constructor for class org.apache.any23.http.DefaultHTTPClientConfiguration
-
Constructor.
- DefaultHTTPClientConfiguration(String) - Constructor for class org.apache.any23.http.DefaultHTTPClientConfiguration
-
Constructor.
- DefaultHTTPClientConfiguration() - Constructor for class org.apache.any23.http.DefaultHTTPClientConfiguration
-
Constructor.
- DefaultModifiableConfiguration - Class in org.apache.any23.configuration
-
- DefaultModifiableConfiguration(Properties) - Constructor for class org.apache.any23.configuration.DefaultModifiableConfiguration
-
- DefaultRuleContext - Class in org.apache.any23.validator
-
- DefaultRuleContext() - Constructor for class org.apache.any23.validator.DefaultRuleContext
-
- DefaultValidationReport - Class in org.apache.any23.validator
-
- DefaultValidationReport(List<ValidationReport.Issue>, List<ValidationReport.RuleActivation>, List<ValidationReport.Error>) - Constructor for class org.apache.any23.validator.DefaultValidationReport
-
- DefaultValidationReportBuilder - Class in org.apache.any23.validator
-
- DefaultValidationReportBuilder() - Constructor for class org.apache.any23.validator.DefaultValidationReportBuilder
-
- DefaultValidator - Class in org.apache.any23.validator
-
- DefaultValidator() - Constructor for class org.apache.any23.validator.DefaultValidator
-
- DefaultWebCrawler - Class in org.apache.any23.plugin.crawler
-
Default WebCrawler
implementation.
- DefaultWebCrawler() - Constructor for class org.apache.any23.plugin.crawler.DefaultWebCrawler
-
- deferProperties(String...) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Given a document and a list of itemprop names this method will return
such itemprops.
- defineProperty(String) - Method in interface org.apache.any23.configuration.Configuration
-
Checks whether a property is defined or not in configuration.
- defineProperty(String) - Method in class org.apache.any23.configuration.DefaultConfiguration
-
- Degree - Variable in class org.apache.any23.vocab.DOAC
-
- delegatedFrom - Variable in class org.apache.any23.vocab.ICAL
-
To specify the calendar users that have delegated their participation to the
calendar user specified by the URI..
- delegatedTo - Variable in class org.apache.any23.vocab.ICAL
-
To specify the calendar users to whom the calendar user specified by the URI
has delegated participation..
- depiction - Variable in class org.apache.any23.vocab.FOAF
-
- depicts - Variable in class org.apache.any23.vocab.FOAF
-
- description - Variable in class org.apache.any23.vocab.HLISTING
-
- description - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThis URI provides a more complete description of the
calendar component, than that provided by the "SUMMARY" URI..
- DESCRIPTION - Static variable in class org.apache.any23.vocab.OGP
-
A one to two sentence description of your object.
- description - Variable in class org.apache.any23.vocab.OGP
-
- DETERMINER - Static variable in class org.apache.any23.vocab.OGP
-
The word that appears before this object's title in a sentence.
- determiner - Variable in class org.apache.any23.vocab.OGP
-
- dir - Variable in class org.apache.any23.vocab.ICAL
-
To specify reference to a directory entry associated with the calendar user
specified by the URI..
- DiscoveryUtils - Class in org.apache.any23.util
-
This class provides utility methods
for discovering classes in packages.
- disownProperty(ItemProp) - Method in class org.apache.any23.extractor.microdata.ItemScope
-
- division - Variable in class org.apache.any23.vocab.WO
-
- divisionClass - Variable in class org.apache.any23.vocab.WO
-
- divisionName - Variable in class org.apache.any23.vocab.WO
-
- dnaChecksum - Variable in class org.apache.any23.vocab.FOAF
-
- DOAC - Class in org.apache.any23.vocab
-
- Document - Variable in class org.apache.any23.vocab.FOAF
-
- DocumentReport - Class in org.apache.any23.extractor.html
-
Represents the validationReportBuilder generated by a
the
TagSoupParser
when a document
is retrieved and validated.
- DocumentReport(ValidationReport, Document) - Constructor for class org.apache.any23.extractor.html.DocumentReport
-
- DocumentSource - Interface in org.apache.any23.source
-
A source of input streams.
- doGet(HttpServletRequest, HttpServletResponse) - Method in class org.apache.any23.servlet.RedirectServlet
-
- doGet(HttpServletRequest, HttpServletResponse) - Method in class org.apache.any23.servlet.Servlet
-
- DOMAIN - Static variable in class org.apache.any23.vocab.SINDICE
-
- domain - Variable in class org.apache.any23.vocab.SINDICE
-
This property expresses the DNS domain of the resource on which
it is applied.
- DomainOf_rrule - Variable in class org.apache.any23.vocab.ICAL
-
- DOMDocument - Interface in org.apache.any23.validator
-
This interface models a document to be processed
by the
Validator
.
- DomUtils - Class in org.apache.any23.extractor.html
-
This class provides utility methods for DOM manipulation.
- doPost(HttpServletRequest, HttpServletResponse) - Method in class org.apache.any23.servlet.RedirectServlet
-
- doPost(HttpServletRequest, HttpServletResponse) - Method in class org.apache.any23.servlet.Servlet
-
- DrivingSkill - Variable in class org.apache.any23.vocab.DOAC
-
- dtend - Variable in class org.apache.any23.vocab.ICAL
-
This URI specifies the date and time that a calendar component ends.default
value type: DATE-TIME.
- dtexpired - Variable in class org.apache.any23.vocab.HLISTING
-
- dtlisted - Variable in class org.apache.any23.vocab.HLISTING
-
- dtstamp - Variable in class org.apache.any23.vocab.ICAL
-
value type: DATE-TIMEThe URI indicates the date/time that the instance
of the iCalendar object was created..
- dtstart - Variable in class org.apache.any23.vocab.ICAL
-
default value type: DATE-TIMEThis URI specifies when the calendar component
begins..
- due - Variable in class org.apache.any23.vocab.ICAL
-
default value type: DATE-TIMEThis URI defines the date and time that
a to-do is expected to be completed..
- dumpContent(File, String) - Static method in class org.apache.any23.util.FileUtils
-
Dumps the given string within a file.
- dumpContent(File, Throwable) - Static method in class org.apache.any23.util.FileUtils
-
Dumps the stack trace of the given exception into the specified file.
- Duration - Variable in class org.apache.any23.vocab.HRECIPE
-
- duration - Variable in class org.apache.any23.vocab.HRECIPE
-
- duration - Variable in class org.apache.any23.vocab.ICAL
-
The URI specifies a positive duration of time.value type: DURATION.
- durationTime - Variable in class org.apache.any23.vocab.HRECIPE
-
- durationTitle - Variable in class org.apache.any23.vocab.HRECIPE
-
- education - Variable in class org.apache.any23.vocab.DOAC
-
- Education - Variable in class org.apache.any23.vocab.DOAC
-
- ELEMENT_LOCATION - Static variable in class org.apache.any23.extractor.html.TagSoupParser
-
- email - Variable in class org.apache.any23.vocab.VCARD
-
An email address.
- EMPTY - Static variable in class org.apache.any23.rdf.Prefixes
-
- EmptyValidationReport - Class in org.apache.any23.validator
-
- encoding - Variable in class org.apache.any23.vocab.ICAL
-
To specify an alternate inline encoding for the URI value..
- EncodingDetector - Interface in org.apache.any23.encoding
-
Defines a detector for charset encoding.
- end_date - Variable in class org.apache.any23.vocab.DOAC
-
- END_SCRIPT - Static variable in class org.apache.any23.extractor.html.MicroformatExtractor
-
- endDocument(URI) - Method in class org.apache.any23.filter.ExtractionContextBlocker
-
- endDocument(URI) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
-
- endDocument(URI) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
-
- endDocument(URI) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
-
- endDocument(URI) - Method in class org.apache.any23.writer.CompositeTripleHandler
-
- endDocument(URI) - Method in class org.apache.any23.writer.CountingTripleHandler
-
- endDocument(URI) - Method in class org.apache.any23.writer.JSONWriter
-
- endDocument(URI) - Method in class org.apache.any23.writer.LoggingTripleHandler
-
- endDocument(URI) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
-
- endDocument(URI) - Method in class org.apache.any23.writer.ReportingTripleHandler
-
- endDocument(URI) - Method in class org.apache.any23.writer.RepositoryWriter
-
- endDocument(URI) - Method in interface org.apache.any23.writer.TripleHandler
-
Informs the handler that the end of the document
has been reached.
- endDocument(URI) - Method in class org.apache.any23.writer.URIListWriter
-
- endRDF() - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
-
- endRDF() - Method in class org.apache.any23.io.nquads.NQuadsWriter
-
- EntityBasedMicroformatExtractor - Class in org.apache.any23.extractor.html
-
Base class for microformat extractors based on entities.
- EntityBasedMicroformatExtractor() - Constructor for class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
-
- equals(Object) - Method in class org.apache.any23.extractor.ExtractionContext
-
- equals(Object) - Method in class org.apache.any23.extractor.ExtractionParameters
-
- equals(Object) - Method in class org.apache.any23.extractor.microdata.Item
-
- equals(Object) - Method in class org.apache.any23.extractor.microdata.ItemProp
-
- equals(Object) - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- equals(Object) - Method in class org.apache.any23.extractor.microdata.ItemScope
-
- ERRORED_META_NODES - Static variable in class org.apache.any23.validator.rule.MetaNameMisuseRule
-
- escapeAsJSONString(String) - Static method in class org.apache.any23.util.StringUtils
-
Escapes the in
string as JSON string
to let it being embeddable within a string field.
- escapeDoubleQuotes(String) - Static method in class org.apache.any23.util.StringUtils
-
Escapes all the unescaped double quotes when needed.
- event - Variable in class org.apache.any23.vocab.HLISTING
-
- ExampleInputOutput - Class in org.apache.any23.extractor
-
A reporter for example input and output of an extractor.
- ExampleInputOutput(String) - Constructor for class org.apache.any23.extractor.ExampleInputOutput
-
- ExampleInputOutput(ExtractorFactory<?>) - Constructor for class org.apache.any23.extractor.ExampleInputOutput
-
- Excel - Class in org.apache.any23.vocab
-
The MS Excel extractor vocabulary.
- ExcelExtractor - Class in org.apache.any23.plugin.officescraper
-
Implementation of
Extractor.ContentExtractor
able to process
a
MS Excel 97-2007+ file format
.xls/.xlsx and
convert the detected content to triples.
- ExcelExtractor() - Constructor for class org.apache.any23.plugin.officescraper.ExcelExtractor
-
- ExcelPlugin - Class in org.apache.any23.plugin.officescraper
-
- ExcelPlugin() - Constructor for class org.apache.any23.plugin.officescraper.ExcelPlugin
-
- exdate - Variable in class org.apache.any23.vocab.ICAL
-
default value type: DATE-TIMEThis URI defines the list of date/time exceptions
for a recurring calendar component..
- execute(String...) - Method in class org.apache.any23.cli.ToolRunner
-
- execute(Rule, RuleContext, DOMDocument) - Method in interface org.apache.any23.validator.Fix
-
Executes this fix over a document.
- execute(Rule, RuleContext, DOMDocument) - Method in class org.apache.any23.validator.rule.MetaNameMisuseFix
-
- execute(Rule, RuleContext, DOMDocument) - Method in class org.apache.any23.validator.rule.OpenGraphNamespaceFix
-
- expand(String) - Method in class org.apache.any23.rdf.Prefixes
-
- experience - Variable in class org.apache.any23.vocab.DOAC
-
- Experience - Variable in class org.apache.any23.vocab.DOAC
-
- exrule - Variable in class org.apache.any23.vocab.ICAL
-
This URI defines a rule or repeating pattern for an exception to a recurrence
set.value type: RECUR.
- extended_address - Variable in class org.apache.any23.vocab.VCARD
-
The extended address of a postal address.
- extendedAddress - Variable in class org.apache.any23.vocab.HLISTING
-
- extract(ExtractionParameters, DocumentSource, TripleHandler, String) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given
in
document source, sending the generated events
to the specified outputHandler
.
- extract(String, String, String, String, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction on the in
string
associated to the documentURI
URI, declaring
contentType
and encoding
.
- extract(String, String, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction on the in
string
associated to the documentURI
URI, sending the generated
events to the specified outputHandler
.
- extract(File, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given file
sending the generated events to the specified outputHandler
.
- extract(ExtractionParameters, String, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given documentURI
sending the generated events to the specified outputHandler
.
- extract(String, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given documentURI
sending the generated events to the specified outputHandler
.
- extract(DocumentSource, TripleHandler, String) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given
in
document source, sending the generated events
to the specified outputHandler
.
- extract(DocumentSource, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given
in
document source, sending the generated events
to the specified outputHandler
.
- extract(ExtractionParameters, DocumentSource, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given
in
document source, sending the generated events
to the specified outputHandler
.
- extract() - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
-
- extract() - Method in class org.apache.any23.extractor.html.HCalendarExtractor
-
- extract() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Performs the extraction of the data and writes them to the model.
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.AdrExtractor
-
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
-
Extracts an entity from a DOM node.
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.GeoExtractor
-
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HCardExtractor
-
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HListingExtractor
-
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HRecipeExtractor
-
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HResumeExtractor
-
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HReviewExtractor
-
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.SpeciesExtractor
-
Extracts an entity from a DOM node.
- EXTRACTION_CONTEXT_URI_PROPERTY - Static variable in class org.apache.any23.extractor.ExtractionParameters
-
- ExtractionContext - Class in org.apache.any23.extractor
-
This class provides the context for the processing of
a single
Extractor
.
- ExtractionContext(String, URI, String, String) - Constructor for class org.apache.any23.extractor.ExtractionContext
-
- ExtractionContext(String, URI, String) - Constructor for class org.apache.any23.extractor.ExtractionContext
-
- ExtractionContext(String, URI) - Constructor for class org.apache.any23.extractor.ExtractionContext
-
- ExtractionContextBlocker - Class in org.apache.any23.filter
-
A wrapper around a
TripleHandler
that can block and unblock
calls to the handler, either for the entire document, or for
individual
ExtractionContext
s.
- ExtractionContextBlocker(TripleHandler) - Constructor for class org.apache.any23.filter.ExtractionContextBlocker
-
- ExtractionException - Exception in org.apache.any23.extractor
-
Defines a specific exception raised during the metadata extraction phase.
- ExtractionException(String) - Constructor for exception org.apache.any23.extractor.ExtractionException
-
- ExtractionException(String, Throwable) - Constructor for exception org.apache.any23.extractor.ExtractionException
-
- ExtractionException(String, Throwable, ExtractionResult) - Constructor for exception org.apache.any23.extractor.ExtractionException
-
- ExtractionParameters - Class in org.apache.any23.extractor
-
This class models the parameters to be used to perform an extraction.
- ExtractionParameters(Configuration, ExtractionParameters.ValidationMode, Map<String, Boolean>, Map<String, String>) - Constructor for class org.apache.any23.extractor.ExtractionParameters
-
Constructor.
- ExtractionParameters(Configuration, ExtractionParameters.ValidationMode) - Constructor for class org.apache.any23.extractor.ExtractionParameters
-
Constructor.
- ExtractionParameters(Configuration, ExtractionParameters.ValidationMode, boolean) - Constructor for class org.apache.any23.extractor.ExtractionParameters
-
Constructor, allows to set explicitly the value for flag
SingleDocumentExtraction#METADATA_NESTING_FLAG
.
- ExtractionParameters.ValidationMode - Enum in org.apache.any23.extractor
-
Declares the supported validation actions.
- ExtractionReport - Class in org.apache.any23
-
This class contains some statistics and general information about
an extraction.
- ExtractionReport(List<Extractor>, String, String, ValidationReport, Map<String, Collection<IssueReport.Issue>>) - Constructor for class org.apache.any23.ExtractionReport
-
- ExtractionResult - Interface in org.apache.any23.extractor
-
Interface defining the methods that a representation of an extraction result must have.
- ExtractionResultImpl - Class in org.apache.any23.extractor
-
- ExtractionResultImpl(ExtractionContext, Extractor<?>, TripleHandler) - Constructor for class org.apache.any23.extractor.ExtractionResultImpl
-
- Extractor<Input> - Interface in org.apache.any23.extractor
-
It defines the signature of a generic Extractor.
- Extractor.BlindExtractor - Interface in org.apache.any23.extractor
-
This interface specializes an
Extractor
able to handle
URI
as input format.
- Extractor.ContentExtractor - Interface in org.apache.any23.extractor
-
- Extractor.TagSoupDOMExtractor - Interface in org.apache.any23.extractor
-
- ExtractorDescription - Interface in org.apache.any23.extractor
-
It defines a minimal signature for an
Extractor
description.
- ExtractorDocumentation - Class in org.apache.any23.cli
-
This class provides some command-line documentation
about available extractors and their usage.
- ExtractorDocumentation() - Constructor for class org.apache.any23.cli.ExtractorDocumentation
-
- ExtractorFactory<T extends Extractor<?>> - Interface in org.apache.any23.extractor
-
- ExtractorGroup - Class in org.apache.any23.extractor
-
It simple models a group of
ExtractorFactory
providing
simple accessing methods.
- ExtractorGroup(Collection<ExtractorFactory<?>>) - Constructor for class org.apache.any23.extractor.ExtractorGroup
-
- ExtractorPlugin<T extends Extractor<?>> - Interface in org.apache.any23.plugin
-
This interface defines an org.apache.any23.cli.Any23
extractor plugin that can be detected and registered from the library classpath.
- ExtractorRegistry - Interface in org.apache.any23.extractor
-
An interface to the enable a registry for extractors to be implemented by
different implementors of this API.
- ExtractorRegistryImpl - Class in org.apache.any23.extractor
-
Singleton class acting as a register for all the various
Extractor
.
- ExtractorRegistryImpl() - Constructor for class org.apache.any23.extractor.ExtractorRegistryImpl
-
- extractPrefixSections(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Given a prefix declaration returns a list of prefixID:prefixURL
strings
normalizing blanks where present.
- extractRelTag(String) - Static method in class org.apache.any23.extractor.html.HTMLDocument
-
Extracts the href specific rel-tag string.
- extractRelTag(NamedNodeMap) - Static method in class org.apache.any23.extractor.html.HTMLDocument
-
Extracts the href specific rel-tag string.
- extractRelTagNodes() - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Extracts all the rel
tag nodes.
- factory - Static variable in class org.apache.any23.extractor.csv.CSVExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.AdrExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.GeoExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.HCalendarExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.HCardExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.HeadLinkExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.HListingExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.HRecipeExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.HResumeExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.HReviewExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.HTMLMetaExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.ICBMExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.LicenseExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.SpeciesExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.TitleExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.TurtleHTMLExtractor
-
- factory - Static variable in class org.apache.any23.extractor.html.XFNExtractor
-
- factory - Static variable in class org.apache.any23.extractor.microdata.MicrodataExtractor
-
- factory - Static variable in class org.apache.any23.extractor.rdf.NQuadsExtractor
-
- factory - Static variable in class org.apache.any23.extractor.rdf.NTriplesExtractor
-
- factory - Static variable in class org.apache.any23.extractor.rdf.RDFXMLExtractor
-
- factory - Static variable in class org.apache.any23.extractor.rdf.TriXExtractor
-
- factory - Static variable in class org.apache.any23.extractor.rdf.TurtleExtractor
-
- factory - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
- factory - Static variable in class org.apache.any23.extractor.rdfa.RDFaExtractor
-
- factory - Static variable in class org.apache.any23.extractor.xpath.XPathExtractor
-
- factory - Static variable in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
-
- factory - Static variable in class org.apache.any23.plugin.officescraper.ExcelExtractor
-
- family - Variable in class org.apache.any23.vocab.WO
-
A family is a scientific grouping of closely related organisms.
- FAMILY_NAME - Static variable in class org.apache.any23.extractor.html.HCardName
-
- family_name - Variable in class org.apache.any23.vocab.FOAF
-
- family_name - Variable in class org.apache.any23.vocab.VCARD
-
A family name part of a person's name.
- familyName - Variable in class org.apache.any23.vocab.WO
-
Used to specify the name of a family as part of a Taxon Name
- familyProperty - Variable in class org.apache.any23.vocab.WO
-
associates a taxon rank with a family
- fax - Variable in class org.apache.any23.vocab.VCARD
-
A fax number of a person.
- fbtype - Variable in class org.apache.any23.vocab.ICAL
-
To specify the free or busy time type..
- Feedback - Variable in class org.apache.any23.vocab.REVIEW
-
Feedback on the review.
- FIELDS - Static variable in class org.apache.any23.extractor.html.HCardName
-
- FILE_DOCUMENT_PREFIX - Static variable in class org.apache.any23.cli.MimeDetector
-
- FileDocumentSource - Class in org.apache.any23.source
-
- FileDocumentSource(File) - Constructor for class org.apache.any23.source.FileDocumentSource
-
- FileDocumentSource(File, String) - Constructor for class org.apache.any23.source.FileDocumentSource
-
- FileUtils - Class in org.apache.any23.util
-
Utility class for handling files.
- filterByMIMEType(MIMEType) - Method in class org.apache.any23.extractor.ExtractorGroup
-
- find(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Gets the string value of an XPath expression.
- find(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
- findAll(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Returns a NodeList composed of all the nodes that match an XPath
expression, which must be valid.
- findAll(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
- findAllByAttributeContains(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
- findAllByAttributeName(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Finds all nodes that have a declared attribute.
- findAllByClassName(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Finds all nodes that have a declared class.
- findAllByClassName(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Finds all the nodes by class name.
- findAllByTag(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
- findAllByTagAndClassName(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
- findMicroformattedObjectNode(String, String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
- findMicroformattedValue(String, String, String, String, String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
- findNodeById(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Mimics the JS DOM API, or prototype's $()
- findNodeById(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
- FIRST_CELL - Static variable in class org.apache.any23.vocab.Excel
-
- FIRST_ROW - Static variable in class org.apache.any23.vocab.Excel
-
- firstCell - Variable in class org.apache.any23.vocab.Excel
-
This property links the identifier of a Row to the index of the first declared cell.
- firstName - Variable in class org.apache.any23.vocab.FOAF
-
- firstRow - Variable in class org.apache.any23.vocab.Excel
-
This property links the identifier of a Sheet to the index of the first declared row.
- Fix - Interface in org.apache.any23.validator
-
This interface models a fix for an issue.
- fixAbsoluteURI(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Fixes typical errors in an absolute URI, such as unescaped spaces.
- fixLink(String) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
- fixLink(String, String) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
- fixLink(String, String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
Helper method to conditionally add a schema to a URI unless it's there, or null if link is empty.
- fixURI(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- fixURIWithException(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Tries to fix a potentially broken relative or absolute URI.
- fixWhiteSpace(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- FLAG_PROPERTY_OFF - Static variable in class org.apache.any23.configuration.DefaultConfiguration
-
- FLAG_PROPERTY_ON - Static variable in class org.apache.any23.configuration.DefaultConfiguration
-
- fmttype - Variable in class org.apache.any23.vocab.ICAL
-
To specify the content type of a referenced object..
- fn - Variable in class org.apache.any23.vocab.HRECIPE
-
- fn - Variable in class org.apache.any23.vocab.VCARD
-
A formatted name of a person.
- FOAF - Class in org.apache.any23.vocab
-
- formatDateTime(Date) - Static method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- FormatWriter - Interface in org.apache.any23.writer
-
Base interface used for the definition of RDF format writers.
- freebusy - Variable in class org.apache.any23.vocab.ICAL
-
The URI defines one or more free or busy time intervals.value type: PERIOD.
- freq - Variable in class org.apache.any23.vocab.ICAL
-
- friend - Variable in class org.apache.any23.vocab.XFN
-
- fromStrings(Collection<String>) - Static method in class org.apache.any23.http.AcceptHeaderBuilder
-
- fundedBy - Variable in class org.apache.any23.vocab.FOAF
-
- geekcode - Variable in class org.apache.any23.vocab.FOAF
-
- gender - Variable in class org.apache.any23.vocab.FOAF
-
- genus - Variable in class org.apache.any23.vocab.WO
-
- genusClass - Variable in class org.apache.any23.vocab.WO
-
- genusName - Variable in class org.apache.any23.vocab.WO
-
- GEO - Class in org.apache.any23.vocab
-
- geo - Variable in class org.apache.any23.vocab.ICAL
-
value type: list of FLOATThis URI specifies information related to the
global position for the activity specified by a calendar component..
- geo - Variable in class org.apache.any23.vocab.VCARD
-
A geographic location associated with a person.
- GeoExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the
Geo
microformat.
- GeoExtractor() - Constructor for class org.apache.any23.extractor.html.GeoExtractor
-
- get() - Static method in class org.apache.any23.rdf.PopularPrefixes
-
- getAcceptHeader() - Method in class org.apache.any23.http.AcceptHeaderBuilder
-
Builds and returns an accept header.
- getAcceptHeader() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
-
- getAcceptHeader() - Method in interface org.apache.any23.http.HTTPClientConfiguration
-
- getActualDocumentURI() - Method in class org.apache.any23.http.DefaultHTTPClient
-
- getActualDocumentURI() - Method in interface org.apache.any23.http.HTTPClient
-
Returns the actual URI from which the document was fetched.
- getAliases() - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator.VariantSpec
-
- getAllNames() - Method in interface org.apache.any23.extractor.ExtractorRegistry
-
Returns the names of all registered extractors, sorted alphabetically.
- getAllNames() - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
-
Returns the names of all registered extractors, sorted alphabetically.
- getAllRules() - Method in class org.apache.any23.validator.DefaultValidator
-
- getAllRules() - Method in interface org.apache.any23.validator.Validator
-
Returns all the registered rules.
- getApplicableExtractors(ExtractorRegistry, File...) - Method in class org.apache.any23.plugin.Any23PluginManager
-
- getApplicableTools(File...) - Method in class org.apache.any23.plugin.Any23PluginManager
-
Returns an
Iterator
of tools that have been detected within the given list of locations.
- getAsDate() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- getAsFloat() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- getAsInteger() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- getAsLink() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- getAsNested() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- getAsPlainLiteral(Node, String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- getAsXMLLiteral(Node) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- getBaseClassName() - Method in class org.apache.any23.extractor.html.AdrExtractor
-
- getBaseClassName() - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
-
Returns the base class name for the extractor.
- getBaseClassName() - Method in class org.apache.any23.extractor.html.GeoExtractor
-
- getBaseClassName() - Method in class org.apache.any23.extractor.html.HCardExtractor
-
- getBaseClassName() - Method in class org.apache.any23.extractor.html.HListingExtractor
-
- getBaseClassName() - Method in class org.apache.any23.extractor.html.HRecipeExtractor
-
- getBaseClassName() - Method in class org.apache.any23.extractor.html.HResumeExtractor
-
- getBaseClassName() - Method in class org.apache.any23.extractor.html.HReviewExtractor
-
- getBaseClassName() - Method in class org.apache.any23.extractor.html.SpeciesExtractor
-
Returns the base class name for the extractor.
- getBeginColumnNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
-
- getBeginLineNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
-
- getBestMatch(String) - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator
-
- getBestMatch(String, String) - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator
-
Returns the
MediaRangeSpec
associated to the given
accept type and
userAgent.
- getBestMatch(List<MediaRangeSpec>) - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
- getBlankNodeFor(Node) - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
-
- getBNode(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a BNode
.
- getCause() - Method in class org.apache.any23.validator.ValidationReport.Error
-
- getChilds() - Method in class org.apache.any23.writer.CompositeTripleHandler
-
- getClass(String) - Method in class org.apache.any23.vocab.Vocabulary
-
Returns a class defined within this vocabulary.
- getClasses() - Method in class org.apache.any23.vocab.Vocabulary
-
- getClassesInPackage(String) - Static method in class org.apache.any23.util.DiscoveryUtils
-
Scans all classes accessible from the context class loader
which belong to the given package and sub-packages.
- getClassesInPackage(String, Class) - Static method in class org.apache.any23.util.DiscoveryUtils
-
Scans all classes accessible from the context class loader
which belong to the given package and sub-packages and filter
them by ones implementing the specified interface iface
.
- getCol() - Method in class org.apache.any23.extractor.IssueReport.Issue
-
- getCommentFor(URI) - Method in class org.apache.any23.vocab.Vocabulary
-
Returns the comment for the given resource.
- getComments() - Method in class org.apache.any23.vocab.Vocabulary
-
Returns all the defined comments for resources.
- getConfigurationDump() - Method in interface org.apache.any23.configuration.Configuration
-
Returns a human readable string containing the configuration dump.
- getConfigurationDump() - Method in class org.apache.any23.configuration.DefaultConfiguration
-
- getConnectionTimeout() - Method in class org.apache.any23.http.DefaultHTTPClient
-
- getContent() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- getContentLength() - Method in class org.apache.any23.http.DefaultHTTPClient
-
- getContentLength() - Method in interface org.apache.any23.http.HTTPClient
-
- getContentLength() - Method in class org.apache.any23.source.ByteArrayDocumentSource
-
- getContentLength() - Method in interface org.apache.any23.source.DocumentSource
-
- getContentLength() - Method in class org.apache.any23.source.FileDocumentSource
-
- getContentLength() - Method in class org.apache.any23.source.HTTPDocumentSource
-
- getContentLength() - Method in class org.apache.any23.source.StringDocumentSource
-
- getContentType() - Method in class org.apache.any23.http.DefaultHTTPClient
-
- getContentType() - Method in interface org.apache.any23.http.HTTPClient
-
The value of the Content-Type header reported by the server.
- getContentType() - Method in class org.apache.any23.source.ByteArrayDocumentSource
-
- getContentType() - Method in interface org.apache.any23.source.DocumentSource
-
- getContentType() - Method in class org.apache.any23.source.FileDocumentSource
-
- getContentType() - Method in class org.apache.any23.source.HTTPDocumentSource
-
- getContentType() - Method in class org.apache.any23.source.StringDocumentSource
-
- getCount() - Method in class org.apache.any23.writer.CountingTripleHandler
-
- getCurrentExtractionResult() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
- getData(String) - Method in class org.apache.any23.validator.DefaultRuleContext
-
- getData(String) - Method in interface org.apache.any23.validator.RuleContext
-
Retrieves a registered object.
- getDefaultLanguage() - Method in class org.apache.any23.extractor.ExtractionContext
-
- getDefaultLanguage() - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns the document default language.
- getDefaultLiteralLanguage() - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- getDefaultTimeout() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
-
- getDefaultTimeout() - Method in interface org.apache.any23.http.HTTPClientConfiguration
-
- getDescription() - Method in class org.apache.any23.extractor.csv.CSVExtractor
-
- getDescription() - Method in interface org.apache.any23.extractor.Extractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.AdrExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.GeoExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.HCalendarExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.HCardExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.HeadLinkExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.HListingExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.HRecipeExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.HResumeExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.HReviewExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.HTMLMetaExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.ICBMExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.LicenseExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Returns the description of this extractor.
- getDescription() - Method in class org.apache.any23.extractor.html.SpeciesExtractor
-
Returns the description of this extractor.
- getDescription() - Method in class org.apache.any23.extractor.html.TitleExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.TurtleHTMLExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.html.XFNExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.microdata.MicrodataExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.rdf.NQuadsExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.rdf.NTriplesExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.rdf.RDFXMLExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.rdf.TriXExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.rdf.TurtleExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
- getDescription() - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
-
- getDescription() - Method in class org.apache.any23.extractor.xpath.XPathExtractor
-
- getDescription() - Method in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
-
- getDescription() - Method in class org.apache.any23.plugin.officescraper.ExcelExtractor
-
- getDetectedItemScopes() - Method in class org.apache.any23.extractor.microdata.MicrodataParserReport
-
- getDetectedMimeType() - Method in class org.apache.any23.ExtractionReport
-
- getDetectedMIMEType() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
- getDocument() - Method in class org.apache.any23.extractor.html.DocumentReport
-
- getDocument() - Method in class org.apache.any23.extractor.html.HTMLDocument
-
- getDocumentBase(URL, Document) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- getDocumentURI() - Method in class org.apache.any23.extractor.ExtractionContext
-
- getDocumentURI() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
- getDocumentURI() - Method in class org.apache.any23.source.ByteArrayDocumentSource
-
- getDocumentURI() - Method in interface org.apache.any23.source.DocumentSource
-
- getDocumentURI() - Method in class org.apache.any23.source.FileDocumentSource
-
- getDocumentURI() - Method in class org.apache.any23.source.HTTPDocumentSource
-
- getDocumentURI() - Method in class org.apache.any23.source.StringDocumentSource
-
- getDocumentURI() - Method in class org.apache.any23.validator.DefaultDOMDocument
-
- getDocumentURI() - Method in interface org.apache.any23.validator.DOMDocument
-
- getDOM() - Method in class org.apache.any23.extractor.html.TagSoupParser
-
Returns the DOM of the given document URI.
- getEncoding() - Method in class org.apache.any23.ExtractionReport
-
- getEndColumnNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
-
- getEndLineNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
-
- getErrorLocationBeginCol() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
-
- getErrorLocationBeginRow() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
-
- getErrorLocationEndCol() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
-
- getErrorLocationEndRow() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
-
- getErrorMode() - Method in class org.apache.any23.extractor.microdata.MicrodataParser
-
- getErrorPath() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
-
- getErrors() - Method in class org.apache.any23.extractor.microdata.MicrodataParser
-
- getErrors() - Method in class org.apache.any23.extractor.microdata.MicrodataParserReport
-
- getErrors() - Method in class org.apache.any23.validator.DefaultValidationReport
-
- getErrors() - Method in class org.apache.any23.validator.EmptyValidationReport
-
- getErrors() - Method in interface org.apache.any23.validator.ValidationReport
-
Returns the list of detected errors.
- getExampleInput() - Method in class org.apache.any23.extractor.ExampleInputOutput
-
- getExampleInput() - Method in interface org.apache.any23.extractor.ExtractorFactory
-
An example input file for the extractor, to be used in auto-generated
documentation.
- getExampleInput() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
-
- getExampleOutput() - Method in class org.apache.any23.extractor.ExampleInputOutput
-
- getExampleURI() - Method in class org.apache.any23.extractor.ExampleInputOutput
-
- getExtendedProperty(String) - Method in class org.apache.any23.vocab.XFN
-
- getExtractionContext() - Method in class org.apache.any23.extractor.ExtractionResultImpl
-
- getExtractionContext() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
- getExtractor() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
-
- getExtractor() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
-
- getExtractorFactory() - Method in interface org.apache.any23.plugin.ExtractorPlugin
-
- getExtractorFactory() - Method in class org.apache.any23.plugin.htmlscraper.HTMLScraperPlugin
-
- getExtractorFactory() - Method in class org.apache.any23.plugin.officescraper.ExcelPlugin
-
- getExtractorGroup() - Method in interface org.apache.any23.extractor.ExtractorRegistry
-
- getExtractorGroup(List<String>) - Method in interface org.apache.any23.extractor.ExtractorRegistry
-
- getExtractorGroup() - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
-
- getExtractorGroup(List<String>) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
-
- getExtractorIssues(String) - Method in class org.apache.any23.ExtractionReport
-
- getExtractorName() - Method in class org.apache.any23.extractor.ExtractionContext
-
- getExtractorName() - Method in interface org.apache.any23.extractor.ExtractorDescription
-
Returns the name of the extractor.
- getExtractorName() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
-
- getExtractorNames() - Method in class org.apache.any23.writer.ReportingTripleHandler
-
- getExtractors() - Method in class org.apache.any23.plugin.Any23PluginManager
-
- getExtractorToIssues() - Method in class org.apache.any23.extractor.SingleDocumentExtractionReport
-
- getExtractorType() - Method in interface org.apache.any23.extractor.ExtractorFactory
-
Returns the extractor type.
- getExtractorType() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
-
- getFactory(String) - Method in interface org.apache.any23.extractor.ExtractorRegistry
-
- getFactory(String) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
-
- getField(String) - Method in class org.apache.any23.extractor.html.HCardName
-
- getFields(String) - Method in class org.apache.any23.extractor.html.HCardName
-
- getFixes(Class<? extends Rule>) - Method in class org.apache.any23.validator.DefaultValidator
-
- getFixes(Class<? extends Rule>) - Method in interface org.apache.any23.validator.Validator
-
Returns all fixes registered for the give rule.
- getFlag(String) - Method in class org.apache.any23.extractor.ExtractionParameters
-
Returns the value of the specified extraction flag, if the flag is undefined
it will be retrieved by the default
Configuration
.
- getFlagProperty(String) - Method in interface org.apache.any23.configuration.Configuration
-
Returns the value of a flag property.
- getFlagProperty(String) - Method in class org.apache.any23.configuration.DefaultConfiguration
-
- getFormatByExtension(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Returns a parser type from the given extension.
- getFormats() - Static method in class org.apache.any23.rdf.RDFUtils
-
Returns all the available RDFFormat
s.
- getFullName() - Method in class org.apache.any23.extractor.html.HCardName
-
- getFullType() - Method in class org.apache.any23.mime.MIMEType
-
- getGraph() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
-
- getHRName() - Method in interface org.apache.any23.validator.Fix
-
- getHRName() - Method in class org.apache.any23.validator.rule.AboutNotURIRule
-
- getHRName() - Method in interface org.apache.any23.validator.Rule
-
- getHRName() - Method in class org.apache.any23.validator.rule.MetaNameMisuseFix
-
- getHRName() - Method in class org.apache.any23.validator.rule.MetaNameMisuseRule
-
- getHRName() - Method in class org.apache.any23.validator.rule.MissingOpenGraphNamespaceRule
-
- getHRName() - Method in class org.apache.any23.validator.rule.OpenGraphNamespaceFix
-
- getHTMLDocument() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
- getHTTPClient() - Method in class org.apache.any23.Any23
-
- getHTTPUserAgent() - Method in class org.apache.any23.Any23
-
Returns the HTTP Header User Agent,
see RFC 2616-14.43.
- getId() - Method in class org.apache.any23.extractor.microdata.ItemScope
-
- getIdentifier() - Method in class org.apache.any23.writer.JSONWriterFactory
-
- getIdentifier() - Method in class org.apache.any23.writer.NQuadsWriterFactory
-
- getIdentifier() - Method in class org.apache.any23.writer.NTriplesWriterFactory
-
- getIdentifier() - Method in class org.apache.any23.writer.RDFXMLWriterFactory
-
- getIdentifier() - Method in class org.apache.any23.writer.TriXWriterFactory
-
- getIdentifier() - Method in class org.apache.any23.writer.TurtleWriterFactory
-
- getIdentifier() - Method in class org.apache.any23.writer.URIListWriterFactory
-
- getIdentifier() - Method in interface org.apache.any23.writer.WriterFactory
-
- getIdentifier(WriterFactory) - Static method in class org.apache.any23.writer.WriterFactoryRegistry
-
- getIdentifiers() - Method in class org.apache.any23.writer.WriterFactoryRegistry
-
- getIndexInParent(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Given a node this method returns the index corresponding to such node
within the list of the children of its parent node.
- getInstance() - Static method in class org.apache.any23.extractor.ExtractorRegistryImpl
-
- getInstance() - Static method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
- getInstance() - Static method in class org.apache.any23.plugin.Any23PluginManager
-
- getInstance() - Static method in class org.apache.any23.plugin.crawler.SharedData
-
- getInstance() - Static method in class org.apache.any23.validator.EmptyValidationReport
-
- getInstance() - Static method in class org.apache.any23.vocab.CSV
-
- getInstance() - Static method in class org.apache.any23.vocab.DCTERMS
-
- getInstance() - Static method in class org.apache.any23.vocab.DOAC
-
- getInstance() - Static method in class org.apache.any23.vocab.Excel
-
- getInstance() - Static method in class org.apache.any23.vocab.FOAF
-
- getInstance() - Static method in class org.apache.any23.vocab.GEO
-
- getInstance() - Static method in class org.apache.any23.vocab.HLISTING
-
- getInstance() - Static method in class org.apache.any23.vocab.HRECIPE
-
- getInstance() - Static method in class org.apache.any23.vocab.ICAL
-
- getInstance() - Static method in class org.apache.any23.vocab.OGP
-
- getInstance() - Static method in class org.apache.any23.vocab.REVIEW
-
- getInstance() - Static method in class org.apache.any23.vocab.SCHEMAORG
-
- getInstance() - Static method in class org.apache.any23.vocab.SINDICE
-
- getInstance() - Static method in class org.apache.any23.vocab.VCARD
-
- getInstance() - Static method in class org.apache.any23.vocab.WO
-
- getInstance() - Static method in class org.apache.any23.vocab.XFN
-
- getInstance() - Static method in class org.apache.any23.vocab.XHTML
-
- getInstance() - Static method in class org.apache.any23.writer.WriterFactoryRegistry
-
- getInternalValue() - Method in class org.apache.any23.extractor.xpath.Term
-
- getIssueReport() - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- getIssues() - Method in class org.apache.any23.extractor.ExtractionResultImpl
-
- getIssues() - Method in interface org.apache.any23.extractor.IssueReport
-
Returns all the collected issues.
- getIssues() - Method in class org.apache.any23.validator.DefaultValidationReport
-
- getIssues() - Method in class org.apache.any23.validator.EmptyValidationReport
-
- getIssues() - Method in interface org.apache.any23.validator.ValidationReport
-
Returns the list of detected issues.
- getIssuesCount() - Method in class org.apache.any23.extractor.ExtractionResultImpl
-
- getItemId() - Method in class org.apache.any23.extractor.microdata.ItemScope
-
- getItemPropNodes(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns all the itemProps detected within the given root node.
- getItemProps(Node, boolean) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns all the itemprops for the given itemscope node.
- getItemScope(Node) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns the
ItemScope
instance described within the specified
node
.
- getItemScopeNodes(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns all the itemScopes detected within the given root node.
- getLevel() - Method in class org.apache.any23.extractor.IssueReport.Issue
-
- getLevel() - Method in class org.apache.any23.validator.ValidationReport.Issue
-
- getMajorType() - Method in class org.apache.any23.mime.MIMEType
-
- getMapping(String) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Returns a URI mapping for a given prefix.
- getMatchingExtractors() - Method in class org.apache.any23.ExtractionReport
-
- getMatchingExtractors() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
- getMaxConnections() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
-
- getMaxConnections() - Method in interface org.apache.any23.http.HTTPClientConfiguration
-
- getMaxDepth() - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
- getMaxPages() - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
- getMediaType() - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator.VariantSpec
-
- getMediaType() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
- getMessage() - Method in class org.apache.any23.extractor.IssueReport.Issue
-
- getMessage() - Method in class org.apache.any23.validator.ValidationReport.Error
-
- getMessage() - Method in class org.apache.any23.validator.ValidationReport.Issue
-
- getMicrodata(Document, MicrodataParser.ErrorMode) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns all the Microdata items detected within the given document
.
- getMicrodata(Document) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns all the Microdata items detected within the given document
,
works in full report mode.
- getMicrodataAsJSON(Document, PrintStream) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
- getMimeType() - Method in class org.apache.any23.writer.JSONWriterFactory
-
- getMimeType() - Method in class org.apache.any23.writer.NQuadsWriterFactory
-
- getMimeType() - Method in class org.apache.any23.writer.NTriplesWriterFactory
-
- getMimeType() - Method in class org.apache.any23.writer.RDFXMLWriterFactory
-
- getMimeType() - Method in class org.apache.any23.writer.TriXWriterFactory
-
- getMimeType() - Method in class org.apache.any23.writer.TurtleWriterFactory
-
- getMimeType() - Method in class org.apache.any23.writer.URIListWriterFactory
-
- getMimeType() - Method in interface org.apache.any23.writer.WriterFactory
-
- getMimeType(WriterFactory) - Static method in class org.apache.any23.writer.WriterFactoryRegistry
-
- getMimeTypes() - Method in class org.apache.any23.writer.WriterFactoryRegistry
-
- getName() - Method in class org.apache.any23.extractor.microdata.ItemProp
-
- getName() - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
-
- getName() - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
-
- getName() - Method in class org.apache.any23.extractor.xpath.Variable
-
- getName() - Method in interface org.apache.any23.extractor.xpath.XPathExtractionRule
-
- getNamespace() - Method in class org.apache.any23.vocab.Vocabulary
-
- getNamespaceURIFor(String) - Method in class org.apache.any23.rdf.Prefixes
-
- getNegotiator() - Static method in class org.apache.any23.servlet.conneg.Any23Negotiator
-
- getNode(String) - Method in class org.apache.any23.validator.DefaultDOMDocument
-
- getNode(String) - Method in interface org.apache.any23.validator.DOMDocument
-
Returns the node addressed by the given XPath, if more then
one an exception will be raised.
- getNodeLocation(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Returns the row/col location of the given node.
- getNodes(String) - Method in class org.apache.any23.validator.DefaultDOMDocument
-
- getNodes(String) - Method in interface org.apache.any23.validator.DOMDocument
-
Returns the list of nodes addressed by the given XPath.
- getNodesWithAttribute(String) - Method in class org.apache.any23.validator.DefaultDOMDocument
-
- getNodesWithAttribute(String) - Method in interface org.apache.any23.validator.DOMDocument
-
Returns all the nodes declaring an attribute with the specified name.
- getNQuadsParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
- getNTriplesParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured NTriplesParser
.
- getNumOfCrawlers() - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
- getNumOfExtractors() - Method in class org.apache.any23.extractor.ExtractorGroup
-
- getObject() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
-
- getObject() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
-
- getOrganization() - Method in class org.apache.any23.extractor.html.HCardName
-
- getOrganizationUnit() - Method in class org.apache.any23.extractor.html.HCardName
-
- getOrigin() - Method in class org.apache.any23.validator.ValidationReport.FixError
-
- getOrigin() - Method in class org.apache.any23.validator.ValidationReport.Issue
-
- getOrigin() - Method in class org.apache.any23.validator.ValidationReport.RuleError
-
- getOriginalDocument() - Method in class org.apache.any23.validator.DefaultDOMDocument
-
- getOriginalDocument() - Method in interface org.apache.any23.validator.DOMDocument
-
Returns the original document.
- getParameter(String) - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
- getParameterNames() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
-
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.NQuadsExtractor
-
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.NTriplesExtractor
-
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFXMLExtractor
-
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.TriXExtractor
-
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.TurtleExtractor
-
- getParser() - Method in class org.apache.any23.io.nquads.NQuadsParserFactory
-
- getParser(RDFFormat) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a new RDFParser
instance.
- getParserEncoding() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
- getPath() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
-
- getPath() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
-
- getPathToLocalRoot() - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns the sequence of ancestors from the document root to the local root (document).
- getPattern() - Method in class org.apache.any23.plugin.crawler.SharedData
-
- getPlugins(Class<T>) - Method in class org.apache.any23.plugin.Any23PluginManager
-
Returns all classes within the specified packageName
satisfying the given class
filter
.
- getPluralTextField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns a plural text field.
- getPluralUrlField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns the list of URLs associated to the fields marked with class className.
- getPolitenessDelay() - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
- getPrecedence(MediaRangeSpec) - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
- getPredicate() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
-
- getPrefixes() - Method in interface org.apache.any23.extractor.ExtractorDescription
-
An instance defining the prefixes supported by this extractor.
- getPrefixes() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
-
- getProperties() - Method in interface org.apache.any23.configuration.Configuration
-
Returns all the defined configuration properties.
- getProperties() - Method in class org.apache.any23.configuration.DefaultConfiguration
-
- getProperties() - Method in class org.apache.any23.extractor.microdata.ItemScope
-
- getProperties() - Method in class org.apache.any23.vocab.Vocabulary
-
- getProperty(String, String) - Method in interface org.apache.any23.configuration.Configuration
-
Returns the value of a specified property, of the default value if property is not defined.
- getProperty(String, String) - Method in class org.apache.any23.configuration.DefaultConfiguration
-
- getProperty(String) - Method in class org.apache.any23.extractor.ExtractionParameters
-
Returns the value of the specified extraction property, if the property is undefined
it will be retrieved by the default
Configuration
.
- getProperty() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
-
- getProperty(String) - Method in class org.apache.any23.vocab.Vocabulary
-
Returns a property defined within this vocabulary.
- getProperty(String, URI) - Method in class org.apache.any23.vocab.Vocabulary
-
Returns a property defined within this vocabulary, if not found the
defaultValue
will be returned.
- getPropertyByLocalName(String) - Method in class org.apache.any23.vocab.XFN
-
- getPropertyCamelCase(String) - Method in class org.apache.any23.vocab.Vocabulary
-
Returns the property URI for the specified property string.
- getPropertyIntOrFail(String) - Method in interface org.apache.any23.configuration.Configuration
-
Returns the
Integer
value of the specified
propertyName
or raises an exception
if
propertyName
is not defined.
- getPropertyIntOrFail(String) - Method in class org.apache.any23.configuration.DefaultConfiguration
-
- getPropertyOrFail(String) - Method in interface org.apache.any23.configuration.Configuration
-
Returns the value of the specified propertyName
or raises an exception
if propertyName
is not defined.
- getPropertyOrFail(String) - Method in class org.apache.any23.configuration.DefaultConfiguration
-
- getPropertyPaths() - Method in class org.apache.any23.extractor.ExtractionResultImpl
-
- getPropertyPaths() - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
-
Returns all the collected property paths.
- getPropertyValue(Node) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Reads the value of a itemprop node.
- getQuality() - Method in class org.apache.any23.mime.MIMEType
-
- getQuality() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
- getRDFFormat() - Method in class org.apache.any23.io.nquads.NQuadsParser
-
- getRDFFormat() - Method in class org.apache.any23.io.nquads.NQuadsParserFactory
-
- getRDFFormat() - Method in class org.apache.any23.io.nquads.NQuadsWriter
-
- getRDFFormat() - Method in class org.apache.any23.io.nquads.NQuadsWriterFactory
-
- getRdfFormat() - Method in class org.apache.any23.writer.JSONWriterFactory
-
- getRdfFormat() - Method in class org.apache.any23.writer.NQuadsWriterFactory
-
- getRdfFormat() - Method in class org.apache.any23.writer.NTriplesWriterFactory
-
- getRdfFormat() - Method in class org.apache.any23.writer.RDFXMLWriterFactory
-
- getRdfFormat() - Method in class org.apache.any23.writer.TriXWriterFactory
-
- getRdfFormat() - Method in class org.apache.any23.writer.TurtleWriterFactory
-
- getRdfFormat() - Method in class org.apache.any23.writer.URIListWriterFactory
-
- getRdfFormat() - Method in interface org.apache.any23.writer.WriterFactory
-
- getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.JSONWriterFactory
-
- getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.NQuadsWriterFactory
-
- getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.NTriplesWriterFactory
-
- getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.RDFXMLWriterFactory
-
- getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.TriXWriterFactory
-
- getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.TurtleWriterFactory
-
- getRdfWriter(OutputStream) - Method in class org.apache.any23.writer.URIListWriterFactory
-
- getRdfWriter(OutputStream) - Method in interface org.apache.any23.writer.WriterFactory
-
- getRDFXMLParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured org.openrdf.rio.rdfxml.RDFXMLParser
.
- getRefs() - Method in class org.apache.any23.extractor.microdata.ItemScope
-
- getReport() - Method in class org.apache.any23.extractor.html.DocumentReport
-
- getReport() - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
-
- getReport() - Method in interface org.apache.any23.validator.ValidationReportBuilder
-
- getResourceRoots() - Method in class org.apache.any23.extractor.ExtractionResultImpl
-
- getResourceRoots() - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
-
Returns all the collected property roots.
- getRoot() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
-
- getRow() - Method in class org.apache.any23.extractor.IssueReport.Issue
-
- getRuleActivations() - Method in class org.apache.any23.validator.DefaultValidationReport
-
- getRuleActivations() - Method in class org.apache.any23.validator.EmptyValidationReport
-
- getRuleActivations() - Method in interface org.apache.any23.validator.ValidationReport
-
Returns the list of activated rules.
- getRuleStr() - Method in class org.apache.any23.validator.ValidationReport.RuleActivation
-
- getSeed() - Method in class org.apache.any23.plugin.crawler.SharedData
-
- getSingularTextField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns a singular text field.
- getSingularUrlField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns the URL associated to the field marked with class className.
- getSoTimeout() - Method in class org.apache.any23.http.DefaultHTTPClient
-
- getSubject() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
-
- getSubject() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
-
- getSubtype() - Method in class org.apache.any23.mime.MIMEType
-
- getSubtype() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
- getSupportedMIMETypes() - Method in interface org.apache.any23.extractor.ExtractorFactory
-
Supports wildcards, e.g.
- getSupportedMIMETypes() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
-
- getText() - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns the text contained inside a node if leaf,
null
otherwise.
- getTextExtractors() - Method in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
-
- getTools() - Method in class org.apache.any23.plugin.Any23PluginManager
-
Returns the list of all the
Tool
classes declared within the classpath.
- getTopLevelItemScopeNodes(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns only the itemScopes that are top level items.
- getTotalDocuments() - Method in class org.apache.any23.writer.ReportingTripleHandler
-
- getTotalTriples() - Method in class org.apache.any23.writer.ReportingTripleHandler
-
- getTriXParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured TriXParser
.
- getTurtleParserInstance(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured TurtleParser
.
- getType() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- getType() - Method in class org.apache.any23.extractor.microdata.ItemScope
-
- getType() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
- getUniqueID() - Method in class org.apache.any23.extractor.ExtractionContext
-
- getUriRegex() - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
-
- getUserAgent() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
-
- getUserAgent() - Method in interface org.apache.any23.http.HTTPClientConfiguration
-
- getValidatedDOM(boolean) - Method in class org.apache.any23.extractor.html.TagSoupParser
-
Returns the validated DOM and applies fixes on it if applyFix
is set to true
.
- getValidationReport() - Method in class org.apache.any23.ExtractionReport
-
- getValidationReport() - Method in class org.apache.any23.extractor.SingleDocumentExtractionReport
-
- getValue() - Method in class org.apache.any23.extractor.microdata.ItemProp
-
- getValue(Map<String, String>) - Method in class org.apache.any23.extractor.xpath.Term
-
Returns the value represented by this
Term
given the
varMapping
, the #isVar and #internalValue
parameters.
- getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplateGraph
-
- getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplateObject
-
- getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplatePredicate
-
- getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplateSubject
-
- getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.Term
-
- getWebCrawler() - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
- getWriter(OutputStream) - Method in class org.apache.any23.io.nquads.NQuadsWriterFactory
-
- getWriter(Writer) - Method in class org.apache.any23.io.nquads.NQuadsWriterFactory
-
- getWriter(RDFFormat, Writer) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a new RDFWriter
instance.
- getWriter(RDFFormat, OutputStream) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a new RDFWriter
instance.
- getWriterByIdentifier(String) - Method in class org.apache.any23.writer.WriterFactoryRegistry
-
- getWriterInstanceByIdentifier(String, OutputStream) - Method in class org.apache.any23.writer.WriterFactoryRegistry
-
- getWriters() - Method in class org.apache.any23.writer.WriterFactoryRegistry
-
- getWritersByMimeType(String) - Method in class org.apache.any23.writer.WriterFactoryRegistry
-
Returns all the writers matching the specified mimeType
.
- getXpath() - Method in class org.apache.any23.extractor.microdata.Item
-
- getxPath() - Method in class org.apache.any23.extractor.xpath.Variable
-
- getXPathForNode(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Does a reverse walking of the DOM tree to generate a unique XPath
expression leading to this node.
- getXPathListForNode(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Returns a list of tag names representing the path from
the document root to the given node n.
- getXSDDate(String, String) - Static method in class org.apache.any23.rdf.RDFUtils
-
This method allows to obtain an
XML Schema compliant date
providing a textual representation of a date and textual a pattern for parsing it.
- getXSLT() - Static method in class org.apache.any23.extractor.rdfa.RDFaExtractor
-
- GIVEN_NAME - Static variable in class org.apache.any23.extractor.html.HCardName
-
- given_name - Variable in class org.apache.any23.vocab.VCARD
-
A given name part of a person's name.
- givenname - Variable in class org.apache.any23.vocab.FOAF
-
- Group - Variable in class org.apache.any23.vocab.FOAF
-
- guessEncoding(InputStream) - Method in interface org.apache.any23.encoding.EncodingDetector
-
Guesses the data encoding.
- guessEncoding(InputStream) - Method in class org.apache.any23.encoding.TikaEncodingDetector
-
- guessMIMEType(String, InputStream, MIMEType) - Method in interface org.apache.any23.mime.MIMETypeDetector
-
Estimates the MIME
type of the content of input file.
- guessMIMEType(String, InputStream, MIMEType) - Method in class org.apache.any23.mime.NaiveMIMETypeDetector
-
- guessMIMEType(String, InputStream, MIMEType) - Method in class org.apache.any23.mime.TikaMIMETypeDetector
-
Estimates the MIME
type of the content of input file.
- ICAL - Class in org.apache.any23.vocab
-
Vocabulary definitions from ical.rdf
- ICBMExtractor - Class in org.apache.any23.extractor.html
-
Extractor for "ICBM coordinates" provided as META headers in the head
of an HTML page.
- ICBMExtractor() - Constructor for class org.apache.any23.extractor.html.ICBMExtractor
-
- icqChatID - Variable in class org.apache.any23.vocab.FOAF
-
- IDENTIFIER - Static variable in class org.apache.any23.writer.JSONWriterFactory
-
- IDENTIFIER - Static variable in class org.apache.any23.writer.NQuadsWriterFactory
-
- IDENTIFIER - Static variable in class org.apache.any23.writer.NTriplesWriterFactory
-
- IDENTIFIER - Static variable in class org.apache.any23.writer.RDFXMLWriterFactory
-
- IDENTIFIER - Static variable in class org.apache.any23.writer.TriXWriterFactory
-
- IDENTIFIER - Static variable in class org.apache.any23.writer.TurtleWriterFactory
-
- IDENTIFIER - Static variable in class org.apache.any23.writer.URIListWriterFactory
-
- IgnoreAccidentalRDFa - Class in org.apache.any23.filter
-
A
TripleHandler
that suppresses output of the RDFa
parser if the document only contains "accidental" RDFa,
like stylesheet links and other non-RDFa uses of HTML's
- IgnoreAccidentalRDFa(TripleHandler, boolean) - Constructor for class org.apache.any23.filter.IgnoreAccidentalRDFa
-
Constructor.
- IgnoreAccidentalRDFa(TripleHandler) - Constructor for class org.apache.any23.filter.IgnoreAccidentalRDFa
-
- IgnoreTitlesOfEmptyDocuments - Class in org.apache.any23.filter
-
- IgnoreTitlesOfEmptyDocuments(TripleHandler) - Constructor for class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
-
- Image - Variable in class org.apache.any23.vocab.FOAF
-
- IMAGE - Static variable in class org.apache.any23.vocab.OGP
-
The canonical URL of your object that will be used as its permanent ID in the graph (Facebook Graph API).
- image - Variable in class org.apache.any23.vocab.OGP
-
- IMAGE__HEIGHT - Static variable in class org.apache.any23.vocab.OGP
-
The number of pixels high.
- IMAGE__SECURE_URL - Static variable in class org.apache.any23.vocab.OGP
-
An alternate url to use if the webpage requires HTTPS.
- IMAGE__TYPE - Static variable in class org.apache.any23.vocab.OGP
-
A MIME type for this image.
- IMAGE__URL - Static variable in class org.apache.any23.vocab.OGP
-
Identical to og:image
.
- IMAGE__WIDTH - Static variable in class org.apache.any23.vocab.OGP
-
The number of pixels wide.
- imageHeight - Variable in class org.apache.any23.vocab.OGP
-
- imageSecureURL - Variable in class org.apache.any23.vocab.OGP
-
- imageType - Variable in class org.apache.any23.vocab.OGP
-
- imageURL - Variable in class org.apache.any23.vocab.OGP
-
- imageWidth - Variable in class org.apache.any23.vocab.OGP
-
- img - Variable in class org.apache.any23.vocab.FOAF
-
- Includes - Annotation Type in org.apache.any23.extractor.html.annotations
-
This annotation could be used to decorate a
MicroformatExtractor
to
represent which of the other
Microformats could it nest.
- includes(Class<? extends MicroformatExtractor>, Class<? extends MicroformatExtractor>) - Static method in class org.apache.any23.extractor.html.MicroformatExtractor
-
- Ingredient - Variable in class org.apache.any23.vocab.HRECIPE
-
- ingredient - Variable in class org.apache.any23.vocab.HRECIPE
-
- ingredientName - Variable in class org.apache.any23.vocab.HRECIPE
-
- ingredientQuantity - Variable in class org.apache.any23.vocab.HRECIPE
-
- ingredientQuantityType - Variable in class org.apache.any23.vocab.HRECIPE
-
- init(HTTPClientConfiguration) - Method in class org.apache.any23.http.DefaultHTTPClient
-
- init(HTTPClientConfiguration) - Method in interface org.apache.any23.http.HTTPClient
-
Initializes the HTTP client.
- INLINE_DOCUMENT_PREFIX - Static variable in class org.apache.any23.cli.MimeDetector
-
- inputURIs - Variable in class org.apache.any23.cli.Rover
-
- instructions - Variable in class org.apache.any23.vocab.HRECIPE
-
- interest - Variable in class org.apache.any23.vocab.FOAF
-
- interval - Variable in class org.apache.any23.vocab.ICAL
-
- isAbsoluteURI(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- isAbsoluteURI(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Checks if href
is absolute or not.
- isAncestorOf(Node, Node, boolean) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Checks whether a node is ancestor or same of another node.
- isAncestorOf(Node, Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Checks whether a node is ancestor or same of another node.
- isAnnotated() - Method in interface org.apache.any23.writer.FormatWriter
-
If true
then the produced RDF is annotated with
the extractors used to generate the specific statements.
- isAnnotated() - Method in class org.apache.any23.writer.JSONWriter
-
- isAnnotated() - Method in class org.apache.any23.writer.RDFWriterTripleHandler
-
If true
then the produced RDF is annotated with
the extractors used to generate the specific statements.
- isAnnotated() - Method in class org.apache.any23.writer.URIListWriter
-
- isAnyMajorType() - Method in class org.apache.any23.mime.MIMEType
-
- isAnySubtype() - Method in class org.apache.any23.mime.MIMEType
-
- isco88_code - Variable in class org.apache.any23.vocab.DOAC
-
- isCSV(InputStream) - Static method in class org.apache.any23.extractor.csv.CSVReaderBuilder
-
Checks whether the given input stream is a CSV or not.
- isCURIE(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- isCURIEBNode(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- isDate() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- isDefault() - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator.VariantSpec
-
- isDocBlocked() - Method in class org.apache.any23.filter.ExtractionContextBlocker
-
- isElementNode(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Verifies if the given target node is an element.
- isEmpty() - Method in class org.apache.any23.extractor.ExtractorGroup
-
- isEmpty() - Method in class org.apache.any23.rdf.Prefixes
-
- isExtendedXFNLocalName(String) - Method in class org.apache.any23.vocab.XFN
-
- isFix() - Method in class org.apache.any23.extractor.ExtractionParameters
-
- isFloat() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- isInteger() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- isItemProp(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Check whether a node is an itemProp.
- isItemScope(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Check whether a node is an itemScope.
- isLink() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- isLocal() - Method in class org.apache.any23.source.ByteArrayDocumentSource
-
- isLocal() - Method in interface org.apache.any23.source.DocumentSource
-
A value of false indicates that the document
resides remotely, and that multiple successive accesses
to it should be avoided by copying it to local storage.
- isLocal() - Method in class org.apache.any23.source.FileDocumentSource
-
- isLocal() - Method in class org.apache.any23.source.HTTPDocumentSource
-
- isLocal() - Method in class org.apache.any23.source.StringDocumentSource
-
- isMultiField(String) - Method in class org.apache.any23.extractor.html.HCardName
-
- isNested() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- isNumber() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- isOnline(String) - Static method in class org.apache.any23.util.URLUtils
-
Verifies if the specified URL is reachable online.
- isPlain() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- isPrefix(String, String) - Static method in class org.apache.any23.util.StringUtils
-
Check whether string candidatePrefix
is prefix of string container
.
- isPrimaryTopicOf - Variable in class org.apache.any23.vocab.FOAF
-
- isRegisteredName(String) - Method in interface org.apache.any23.extractor.ExtractorRegistry
-
- isRegisteredName(String) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
-
- isRelativeNode(Node) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- isStopAtFirstError() - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
-
- isStopAtFirstError() - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
- isStopAtFirstError() - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
-
- isStopAtFirstError() - Method in class org.apache.any23.plugin.officescraper.ExcelExtractor
-
- IssueReport - Interface in org.apache.any23.extractor
-
This interface models an issue reporter.
- IssueReport.Issue - Class in org.apache.any23.extractor
-
This class defines a generic issue traced by this extraction result.
- IssueReport.IssueLevel - Enum in org.apache.any23.extractor
-
Possible issue levels.
- isSuffix(String, String) - Static method in class org.apache.any23.util.StringUtils
-
Check whether string candidateSuffix
is suffix of string container
.
- isValidate() - Method in class org.apache.any23.extractor.ExtractionParameters
-
- isVar() - Method in class org.apache.any23.extractor.xpath.Term
-
- isVerifyDataType() - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
-
- isVerifyDataType() - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
- isVerifyDataType() - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
-
- isVolatile(String) - Method in class org.apache.any23.rdf.Prefixes
-
- isWildcardSubtype() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
- isWildcardType() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
- isXFNLocalName(String) - Method in class org.apache.any23.vocab.XFN
-
- isXMLNSDeclared(Document) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- Item - Class in org.apache.any23.extractor.microdata
-
Base class for Microdata items.
- Item(String) - Constructor for class org.apache.any23.extractor.microdata.Item
-
Constructor.
- Item - Variable in class org.apache.any23.vocab.HLISTING
-
- item - Variable in class org.apache.any23.vocab.HLISTING
-
- itemName - Variable in class org.apache.any23.vocab.HLISTING
-
- itemPhoto - Variable in class org.apache.any23.vocab.HLISTING
-
- ItemProp - Class in org.apache.any23.extractor.microdata
-
Describes a Microdata item property.
- ItemProp(String, String, ItemPropValue) - Constructor for class org.apache.any23.extractor.microdata.ItemProp
-
Constructor.
- ITEMPROP_ATTRIBUTE - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
-
- ItemPropValue - Class in org.apache.any23.extractor.microdata
-
Describes a possible value for a Microdata item property.
- ItemPropValue(Object, ItemPropValue.Type) - Constructor for class org.apache.any23.extractor.microdata.ItemPropValue
-
Constructor.
- ItemPropValue.Type - Enum in org.apache.any23.extractor.microdata
-
Supported types.
- ItemScope - Class in org.apache.any23.extractor.microdata
-
This class describes a Microdata itemscope.
- ItemScope(String, ItemProp[], String, String[], String, String) - Constructor for class org.apache.any23.extractor.microdata.ItemScope
-
Constructor.
- ITEMSCOPE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
-
- itemUrl - Variable in class org.apache.any23.vocab.HLISTING
-
- iterator() - Method in class org.apache.any23.extractor.ExtractorGroup
-
- label - Variable in class org.apache.any23.vocab.VCARD
-
The formatted version of a postal address (a string with embedded line breaks,
punctuation, etc.).
- language - Variable in class org.apache.any23.vocab.DOAC
-
- language - Variable in class org.apache.any23.vocab.ICAL
-
To specify the language for text values in a URI or URI parameter..
- LanguageLevel - Variable in class org.apache.any23.vocab.DOAC
-
- LanguageSkill - Variable in class org.apache.any23.vocab.DOAC
-
- LAST_CELL - Static variable in class org.apache.any23.vocab.Excel
-
- LAST_ROW - Static variable in class org.apache.any23.vocab.Excel
-
- lastCell - Variable in class org.apache.any23.vocab.Excel
-
This property links the identifier of a Row to the index of the last declared cell.
- lastModified - Variable in class org.apache.any23.vocab.ICAL
-
value type: DATE-TIMEThe URI specifies the date and time that the information
associated with the calendar component was last revised in the calendar store.
- lastRow - Variable in class org.apache.any23.vocab.Excel
-
This property links the identifier of a Sheet to the index of the last declared row.
- lat - Variable in class org.apache.any23.vocab.GEO
-
- latitude - Variable in class org.apache.any23.vocab.VCARD
-
The latitude of a geographic location.
- license - Variable in class org.apache.any23.vocab.DCTERMS
-
- license - Variable in class org.apache.any23.vocab.XHTML
-
- LicenseExtractor - Class in org.apache.any23.extractor.html
-
- LicenseExtractor() - Constructor for class org.apache.any23.extractor.html.LicenseExtractor
-
- List_of_Float - Variable in class org.apache.any23.vocab.ICAL
-
- Lister - Variable in class org.apache.any23.vocab.HLISTING
-
- lister - Variable in class org.apache.any23.vocab.HLISTING
-
- listerLogo - Variable in class org.apache.any23.vocab.HLISTING
-
- listerName - Variable in class org.apache.any23.vocab.HLISTING
-
- listerOrg - Variable in class org.apache.any23.vocab.HLISTING
-
- listerUrl - Variable in class org.apache.any23.vocab.HLISTING
-
- listFilesRecursively(File, FilenameFilter) - Static method in class org.apache.any23.util.FileUtils
-
Lists the content of a dir applying the specified filter.
- Listing - Variable in class org.apache.any23.vocab.HLISTING
-
- literal(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Literal
.
- literal(boolean) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Literal
.
- literal(byte) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Literal
.
- literal(short) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Literal
.
- literal(int) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Literal
.
- literal(long) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Literal
.
- literal(float) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Literal
.
- literal(double) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Literal
.
- literal(String, String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Literal
.
- literal(String, URI) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Literal
.
- loadClassDir(File) - Method in class org.apache.any23.plugin.Any23PluginManager
-
Loads a classes directory in the classpath.
- loadClassDirs(File...) - Method in class org.apache.any23.plugin.Any23PluginManager
-
Loads a list of class dirs in the classpath.
- loadFiles(File...) - Method in class org.apache.any23.plugin.Any23PluginManager
-
Loads a generic list of files, trying to determine the type of every file.
- loadJAR(File) - Method in class org.apache.any23.plugin.Any23PluginManager
-
Loads a JAR file in the classpath.
- loadJARDir(File) - Method in class org.apache.any23.plugin.Any23PluginManager
-
Loads all the JARs detected in a given directory.
- loadJARs(File...) - Method in class org.apache.any23.plugin.Any23PluginManager
-
Loads a list of JARs in the classpath.
- loadPlugins(File...) - Method in class org.apache.any23.plugin.Any23PluginManager
-
Loads plugins from a list of specified locations.
- LocalCopyFactory - Interface in org.apache.any23.source
-
- LOCALE - Static variable in class org.apache.any23.vocab.OGP
-
The locale these tags are marked up in.
- locale - Variable in class org.apache.any23.vocab.OGP
-
- LOCALE__ALTERNATE - Static variable in class org.apache.any23.vocab.OGP
-
An array of other locales this page is available in.
- localeAlternate - Variable in class org.apache.any23.vocab.OGP
-
- locality - Variable in class org.apache.any23.vocab.HLISTING
-
- locality - Variable in class org.apache.any23.vocab.VCARD
-
The locality (e.g., city) of a postal address.
- location - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThe URI defines the intended venue for the activity defined
by a calendar component..
- Location - Variable in class org.apache.any23.vocab.VCARD
-
Resources that are vCard geographic locations.
- logger - Static variable in class org.apache.any23.Any23
-
- logger - Static variable in class org.apache.any23.configuration.DefaultConfiguration
-
- LoggingTripleHandler - Class in org.apache.any23.writer
-
Triple handler decorator useful for logging purposes.
- LoggingTripleHandler(TripleHandler, PrintWriter) - Constructor for class org.apache.any23.writer.LoggingTripleHandler
-
- logo - Variable in class org.apache.any23.vocab.FOAF
-
- logo - Variable in class org.apache.any23.vocab.VCARD
-
A logo associated with a person or their organization.
- LogUtils - Class in org.apache.any23.util
-
Provides specific application logger configuration methods.
- LogUtils() - Constructor for class org.apache.any23.util.LogUtils
-
- lon - Variable in class org.apache.any23.vocab.GEO
-
- longitude - Variable in class org.apache.any23.vocab.VCARD
-
The longitude of a geographic location.
- n - Variable in class org.apache.any23.vocab.VCARD
-
The components of the name of a person.
- NaiveMIMETypeDetector - Class in org.apache.any23.mime
-
- NaiveMIMETypeDetector() - Constructor for class org.apache.any23.mime.NaiveMIMETypeDetector
-
- NAME - Static variable in class org.apache.any23.extractor.html.HTMLMetaExtractor
-
- NAME - Static variable in class org.apache.any23.extractor.html.TitleExtractor
-
- NAME - Static variable in class org.apache.any23.extractor.html.TurtleHTMLExtractor
-
- NAME - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
- NAME - Static variable in class org.apache.any23.extractor.rdfa.RDFaExtractor
-
- NAME - Static variable in class org.apache.any23.extractor.xpath.XPathExtractor
-
- NAME - Static variable in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
-
- name - Variable in class org.apache.any23.vocab.FOAF
-
- Name - Variable in class org.apache.any23.vocab.VCARD
-
Resources that are vCard personal names.
- NAMESPACE - Variable in class org.apache.any23.vocab.ICAL
-
The namespace of the vocabulary as a URI.
- NAMESPACE - Variable in class org.apache.any23.vocab.OGP
-
- NAMESPACE - Variable in class org.apache.any23.vocab.REVIEW
-
The namespace of the vocabulary as a URI.
- NAMESPACE - Variable in class org.apache.any23.vocab.SINDICE
-
The namespace of the vocabulary as a URI.
- NAMESPACE - Variable in class org.apache.any23.vocab.VCARD
-
The namespace of the vocabulary as a URI.
- NAMESPACE - Variable in class org.apache.any23.vocab.WO
-
The namespace of the vocabulary as a URI.
- neighbor - Variable in class org.apache.any23.vocab.XFN
-
- NESTING - Static variable in class org.apache.any23.vocab.SINDICE
-
- nesting - Variable in class org.apache.any23.vocab.SINDICE
-
This property links a resource with a blank node that represents
a nested Microformat node.
- NESTING_ORIGINAL - Static variable in class org.apache.any23.vocab.SINDICE
-
- nesting_original - Variable in class org.apache.any23.vocab.SINDICE
-
This property is used to keep track of the original nested RDF property.
- NESTING_STRUCTURED - Static variable in class org.apache.any23.vocab.SINDICE
-
- nesting_structured - Variable in class org.apache.any23.vocab.SINDICE
-
This property links the resource with a node representing the nested Microformat
- newDefault(Configuration) - Static method in class org.apache.any23.extractor.ExtractionParameters
-
- newDefault() - Static method in class org.apache.any23.extractor.ExtractionParameters
-
- nick - Variable in class org.apache.any23.vocab.FOAF
-
- NICKNAME - Static variable in class org.apache.any23.extractor.html.HCardName
-
- nickname - Variable in class org.apache.any23.vocab.VCARD
-
The nickname of a person.
- NODES_WITH_INVALID_ABOUT - Static variable in class org.apache.any23.validator.rule.AboutNotURIRule
-
- note - Variable in class org.apache.any23.vocab.VCARD
-
Notes about a person on a vCard.
- notifyIssue(IssueReport.IssueLevel, String, int, int) - Method in class org.apache.any23.extractor.ExtractionResultImpl
-
- notifyIssue(IssueReport.IssueLevel, String, int, int) - Method in interface org.apache.any23.extractor.IssueReport
-
Notifies an issue occurred while performing an extraction on an input stream.
- notifyPage(Page) - Method in class org.apache.any23.plugin.crawler.SharedData
-
Notifies all listeners that a page has been discovered.
- NQuadsExtractor - Class in org.apache.any23.extractor.rdf
-
- NQuadsExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.NQuadsExtractor
-
- NQuadsExtractor() - Constructor for class org.apache.any23.extractor.rdf.NQuadsExtractor
-
- NQuadsParser - Class in org.apache.any23.io.nquads
-
N-Quads parser implementation based on the
RDFParser
interface.
- NQuadsParser() - Constructor for class org.apache.any23.io.nquads.NQuadsParser
-
- NQuadsParserFactory - Class in org.apache.any23.io.nquads
-
Implementation of RDFParserFactory
for NQuads
.
- NQuadsParserFactory() - Constructor for class org.apache.any23.io.nquads.NQuadsParserFactory
-
- NQuadsWriter - Class in org.apache.any23.io.nquads
-
N-Quads implementation of an RDFWriter
.
- NQuadsWriter(OutputStream) - Constructor for class org.apache.any23.io.nquads.NQuadsWriter
-
- NQuadsWriter(Writer) - Constructor for class org.apache.any23.io.nquads.NQuadsWriter
-
- NQuadsWriter - Class in org.apache.any23.writer
-
Implementation of an NQuads writer.
- NQuadsWriter(OutputStream) - Constructor for class org.apache.any23.writer.NQuadsWriter
-
- NQuadsWriterFactory - Class in org.apache.any23.io.nquads
-
Implementation of RDFWriterFactory
for NQuads
.
- NQuadsWriterFactory() - Constructor for class org.apache.any23.io.nquads.NQuadsWriterFactory
-
- NQuadsWriterFactory - Class in org.apache.any23.writer
-
- NQuadsWriterFactory() - Constructor for class org.apache.any23.writer.NQuadsWriterFactory
-
- NS - Static variable in class org.apache.any23.vocab.CSV
-
The namespace of the vocabulary as a string.
- NS - Static variable in class org.apache.any23.vocab.DCTERMS
-
- NS - Static variable in class org.apache.any23.vocab.DOAC
-
- NS - Static variable in class org.apache.any23.vocab.Excel
-
The namespace of the vocabulary as a string.
- NS - Static variable in class org.apache.any23.vocab.FOAF
-
- NS - Static variable in class org.apache.any23.vocab.GEO
-
- NS - Static variable in class org.apache.any23.vocab.HLISTING
-
- NS - Static variable in class org.apache.any23.vocab.HRECIPE
-
- NS - Static variable in class org.apache.any23.vocab.ICAL
-
The namespace of the vocabulary as a string.
- NS - Static variable in class org.apache.any23.vocab.OGP
-
- NS - Static variable in class org.apache.any23.vocab.REVIEW
-
The namespace of the vocabulary as a string.
- NS - Static variable in class org.apache.any23.vocab.SCHEMAORG
-
The namespace of the vocabulary as a string.
- NS - Static variable in class org.apache.any23.vocab.SINDICE
-
The namespace of the vocabulary as a string.
- NS - Static variable in class org.apache.any23.vocab.VCARD
-
The namespace of the vocabulary as a string.
- NS - Static variable in class org.apache.any23.vocab.WO
-
The namespace of the vocabulary as a string.
- NS - Static variable in class org.apache.any23.vocab.XFN
-
- NS - Static variable in class org.apache.any23.vocab.XHTML
-
- NTriplesExtractor - Class in org.apache.any23.extractor.rdf
-
- NTriplesExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.NTriplesExtractor
-
- NTriplesExtractor() - Constructor for class org.apache.any23.extractor.rdf.NTriplesExtractor
-
Default constructor, with no verification of data types and no stop at first error.
- NTriplesWriter - Class in org.apache.any23.writer
-
N3 triples writer.
- NTriplesWriter(OutputStream) - Constructor for class org.apache.any23.writer.NTriplesWriter
-
- NTriplesWriterFactory - Class in org.apache.any23.writer
-
- NTriplesWriterFactory() - Constructor for class org.apache.any23.writer.NTriplesWriterFactory
-
- NULL_CHAR - Static variable in class org.apache.any23.extractor.csv.CSVReaderBuilder
-
- NUMBER_OF_COLUMNS - Static variable in class org.apache.any23.vocab.CSV
-
- NUMBER_OF_ROWS - Static variable in class org.apache.any23.vocab.CSV
-
- numberOfColumns - Variable in class org.apache.any23.vocab.CSV
-
This property expresses the number of columns in a CSV file.
- numberOfRows - Variable in class org.apache.any23.vocab.CSV
-
This property expresses the number of rows in a CSV file.
- Nutrition - Variable in class org.apache.any23.vocab.HRECIPE
-
- nutrition - Variable in class org.apache.any23.vocab.HRECIPE
-
- nutritionValue - Variable in class org.apache.any23.vocab.HRECIPE
-
- nutritionValueType - Variable in class org.apache.any23.vocab.HRECIPE
-
- offer - Variable in class org.apache.any23.vocab.HLISTING
-
- OGP - Class in org.apache.any23.vocab
-
- OnlineAccount - Variable in class org.apache.any23.vocab.FOAF
-
- OnlineChatAccount - Variable in class org.apache.any23.vocab.FOAF
-
- OnlineEcommerceAccount - Variable in class org.apache.any23.vocab.FOAF
-
- OnlineGamingAccount - Variable in class org.apache.any23.vocab.FOAF
-
- openContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
-
- openContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
-
- openContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
-
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
-
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
-
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
-
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.JSONWriter
-
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
-
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
-
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
-
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
-
- openContext(ExtractionContext) - Method in interface org.apache.any23.writer.TripleHandler
-
Informs the handler that a new context has been established.
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.URIListWriter
-
- OPENGRAPH_PROTOCOL_NS - Static variable in class org.apache.any23.validator.rule.OpenGraphNamespaceFix
-
- OpenGraphNamespaceFix - Class in org.apache.any23.validator.rule
-
This fixes the missing Open Graph protocol.
- OpenGraphNamespaceFix() - Constructor for class org.apache.any23.validator.rule.OpenGraphNamespaceFix
-
- openInputStream(String) - Method in class org.apache.any23.http.DefaultHTTPClient
-
- openInputStream(String) - Method in interface org.apache.any23.http.HTTPClient
-
Opens the input stream for the given target URI.
- openInputStream() - Method in class org.apache.any23.source.ByteArrayDocumentSource
-
- openInputStream() - Method in interface org.apache.any23.source.DocumentSource
-
Returns the input stream for accessing the content of the document.
- openInputStream() - Method in class org.apache.any23.source.FileDocumentSource
-
- openInputStream() - Method in class org.apache.any23.source.HTTPDocumentSource
-
- openInputStream() - Method in class org.apache.any23.source.StringDocumentSource
-
- openSubResult(ExtractionContext) - Method in interface org.apache.any23.extractor.ExtractionResult
-
Open a result nested in the current one.
- openSubResult(ExtractionContext) - Method in class org.apache.any23.extractor.ExtractionResultImpl
-
- openSubResult(ExtractionContext) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
- order - Variable in class org.apache.any23.vocab.WO
-
- orderClass - Variable in class org.apache.any23.vocab.WO
-
- orderName - Variable in class org.apache.any23.vocab.WO
-
- org - Variable in class org.apache.any23.vocab.VCARD
-
An organization associated with a person.
- org.apache.any23 - package org.apache.any23
-
The
Any23 root package defines the
Any23
library facade and auxiliary classes.
- org.apache.any23.cli - package org.apache.any23.cli
-
This package contains some command-line utilities which allow users
to use the main Any23 features via commandline shell.
- org.apache.any23.configuration - package org.apache.any23.configuration
-
This packages contains the Any23 Configuration definition.
- org.apache.any23.encoding - package org.apache.any23.encoding
-
This package defines utility classes for the Encoding Detection.
- org.apache.any23.extractor - package org.apache.any23.extractor
-
This package contains classes and interfaces modeling the
Extractor
API.
- org.apache.any23.extractor.csv - package org.apache.any23.extractor.csv
-
This package contains the definition of a specific
Extractor
for
CSV files as specified on
RFC 4180.
- org.apache.any23.extractor.html - package org.apache.any23.extractor.html
-
All the various
Extractor
needed to distill
RDF
from
Microformats in HTML pages are contained in this package.
- org.apache.any23.extractor.html.annotations - package org.apache.any23.extractor.html.annotations
-
This package contains the annotations needed to describe the
single nesting relations among different
Microformats.
- org.apache.any23.extractor.microdata - package org.apache.any23.extractor.microdata
-
This package provides classes for dealing with
Microdata extraction.
- org.apache.any23.extractor.rdf - package org.apache.any23.extractor.rdf
-
This package contains the concrete implementations (and utilities classes)
to perform the extraction from various RDF serialization formats.
- org.apache.any23.extractor.rdfa - package org.apache.any23.extractor.rdfa
-
This package contains the definition of a specific
Extractor
for
RDFa.
- org.apache.any23.extractor.xpath - package org.apache.any23.extractor.xpath
-
This package provides an XPath extractor able to generate RDF statements
from any HTML page on the basis of a set of specified rules and a list of statement
templates.
- org.apache.any23.filter - package org.apache.any23.filter
-
This package defines specific
TripleHandler
s
able to filter document and triples that don't respect particular conditions.
- org.apache.any23.http - package org.apache.any23.http
-
This package contains classes and interfaces to implement a
small multi-threaded HTTP Client used to fetch data
sources.
- org.apache.any23.io.nquads - package org.apache.any23.io.nquads
-
This package contains an
NQuads
parser and writer based on the
Sesame API.
- org.apache.any23.mime - package org.apache.any23.mime
-
This package provides support for MIME Type
data detection.
- org.apache.any23.mime.purifier - package org.apache.any23.mime.purifier
-
This package contains all the logic to implement your own
Purifier
.
- org.apache.any23.plugin - package org.apache.any23.plugin
-
This package provides Plugin Management support for Any23.
- org.apache.any23.plugin.crawler - package org.apache.any23.plugin.crawler
-
Commandline extension of
Rover
able
to extract semantic content from an entire
site.
- org.apache.any23.plugin.htmlscraper - package org.apache.any23.plugin.htmlscraper
-
The
HTMLScraperExtractor
is a special extractor
to scrape textual content from a generic
HTML pages.
- org.apache.any23.plugin.officescraper - package org.apache.any23.plugin.officescraper
-
This package defines a set of
Extractor
s
able to extract metadata from various
Microsoft Office documents.
- org.apache.any23.rdf - package org.apache.any23.rdf
-
This package provides some utility classes to manage
RDF models and prefixes.
- org.apache.any23.servlet - package org.apache.any23.servlet
-
The package defines a servlet exposing Any23
as a REST Service.
- org.apache.any23.servlet.conneg - package org.apache.any23.servlet.conneg
-
This package provides the Any23 content type negotiator.
- org.apache.any23.source - package org.apache.any23.source
-
Package containing some auxiliary classes to handle different types
of
InputStream
sources.
- org.apache.any23.util - package org.apache.any23.util
-
This package collects a set of internal library utility classes.
- org.apache.any23.validator - package org.apache.any23.validator
-
This package defines a simple rule based validation framework able to detect
(
Rule
) and fix
(
Fix
) common mistakes that can be present
within an
HTML DOM model and that can prevent the extraction of metadata.
- org.apache.any23.validator.rule - package org.apache.any23.validator.rule
-
A set of predefined validation
Rule
s
and associated
Fix
es.
- org.apache.any23.vocab - package org.apache.any23.vocab
-
This package defines the main ontologies used by Any23.
- org.apache.any23.writer - package org.apache.any23.writer
-
This package collects a set of
TripleHandler
decorators and specific
RDF format writers.
- OrganisationalSkill - Variable in class org.apache.any23.vocab.DOAC
-
- organization - Variable in class org.apache.any23.vocab.DOAC
-
- Organization - Variable in class org.apache.any23.vocab.FOAF
-
- Organization - Variable in class org.apache.any23.vocab.VCARD
-
Resources that are vCard organizations.
- organization_name - Variable in class org.apache.any23.vocab.VCARD
-
The name of an organization.
- organization_unit - Variable in class org.apache.any23.vocab.VCARD
-
The name of a unit within an organization.
- organizer - Variable in class org.apache.any23.vocab.ICAL
-
value type: CAL-ADDRESSThe URI defines the organizer for a calendar component..
- page - Variable in class org.apache.any23.vocab.FOAF
-
- PAGE_CONTENT_AE_PROPERTY - Static variable in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
-
- PAGE_CONTENT_CE_PROPERTY - Static variable in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
-
- PAGE_CONTENT_DE_PROPERTY - Static variable in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
-
- PAGE_CONTENT_LCE_PROPERTY - Static variable in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
-
- parent - Variable in class org.apache.any23.vocab.XFN
-
- parse(Reader, String) - Method in class org.apache.any23.io.nquads.NQuadsParser
-
- parse(InputStream, String) - Method in class org.apache.any23.io.nquads.NQuadsParser
-
- parse(String) - Static method in class org.apache.any23.mime.MIMEType
-
Parses the given MIME type string returning an instance of
MIMEType
.
- parseAccept(String) - Static method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
Parses an HTTP Accept header into a List of MediaRangeSpecs
- parseDateTime(String) - Static method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- parseRange(String) - Static method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
Parses a media range from a string such as text/*;charset=utf-8;q=0.9.
- parseRDF(RDFFormat, InputStream, String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Parses the content of is
input stream with the
specified parser p
using baseURI
.
- parseRDF(RDFFormat, InputStream) - Static method in class org.apache.any23.rdf.RDFUtils
-
Parses the content of is
input stream with the
specified parser p
using ''
as base URI.
- parseRDF(RDFFormat, String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Parses the content of in
string with the
specified parser p
using ''
as base URI.
- parseRDF(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Parses the content of the resource
file
guessing the content format from the extension.
- parseType(String) - Static method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
Parses a media type from a string such as text/html;charset=utf-8;q=0.9.
- partstat - Variable in class org.apache.any23.vocab.ICAL
-
To specify the participation status for the calendar user specified by the
URI..
- pastProject - Variable in class org.apache.any23.vocab.FOAF
-
- percentComplete - Variable in class org.apache.any23.vocab.ICAL
-
value type: INTEGERThis URI is used by an assignee or delegatee of a
to-do to convey the percent completion of a to-do to the Organizer..
- performExtraction(DocumentSource) - Method in class org.apache.any23.cli.Rover
-
- permalink - Variable in class org.apache.any23.vocab.HLISTING
-
- Person - Variable in class org.apache.any23.vocab.FOAF
-
- personalEmail - Variable in class org.apache.any23.vocab.VCARD
-
An email address unaffiliated with any particular organization or employer;
a personal email address.
- PersonalProfileDocument - Variable in class org.apache.any23.vocab.FOAF
-
- phone - Variable in class org.apache.any23.vocab.FOAF
-
- photo - Variable in class org.apache.any23.vocab.HRECIPE
-
- photo - Variable in class org.apache.any23.vocab.VCARD
-
A photograph of a person.
- phylum - Variable in class org.apache.any23.vocab.WO
-
- phylumClass - Variable in class org.apache.any23.vocab.WO
-
- phylumName - Variable in class org.apache.any23.vocab.WO
-
- plan - Variable in class org.apache.any23.vocab.FOAF
-
- PLUGIN_DIRS_LIST_SEPARATOR - Static variable in class org.apache.any23.plugin.Any23PluginManager
-
List separator for the string declaring the plugin list.
- PLUGIN_DIRS_PROPERTY - Static variable in class org.apache.any23.plugin.Any23PluginManager
-
Property where look for plugins.
- PLUGINS_PACKAGE - Static variable in class org.apache.any23.plugin.Any23PluginManager
-
Any23 Plugins package.
- PluginVerifier - Class in org.apache.any23.cli
-
Commandline utility to verify the Any23 plugins
and extract basic information.
- PluginVerifier() - Constructor for class org.apache.any23.cli.PluginVerifier
-
- Point - Variable in class org.apache.any23.vocab.GEO
-
- PopularPrefixes - Class in org.apache.any23.rdf
-
This class act as a container for various well-known and adopted RDF Vocabulary prefixes.
- PopularPrefixes() - Constructor for class org.apache.any23.rdf.PopularPrefixes
-
- positiveVotes - Variable in class org.apache.any23.vocab.REVIEW
-
Number of positive usefulness votes (integer).
- post_office_box - Variable in class org.apache.any23.vocab.VCARD
-
The post office box of a postal address.
- postal_code - Variable in class org.apache.any23.vocab.VCARD
-
The postal code (e.g., U.S.
- postalCode - Variable in class org.apache.any23.vocab.HLISTING
-
- postOfficeBox - Variable in class org.apache.any23.vocab.HLISTING
-
- PREFIX_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- Prefixes - Class in org.apache.any23.rdf
-
A mapping from prefixes to namespace URIs.
- Prefixes() - Constructor for class org.apache.any23.rdf.Prefixes
-
- Prefixes(Prefixes) - Constructor for class org.apache.any23.rdf.Prefixes
-
- price - Variable in class org.apache.any23.vocab.HLISTING
-
- PrimarySchool - Variable in class org.apache.any23.vocab.DOAC
-
- primaryTopic - Variable in class org.apache.any23.vocab.FOAF
-
- printError(String) - Method in class org.apache.any23.cli.ExtractorDocumentation
-
Print an error message.
- printExampleInput(String, ExtractorRegistry) - Method in class org.apache.any23.cli.ExtractorDocumentation
-
Prints an example of input for the provided extractor.
- printExampleOutput(String, ExtractorRegistry) - Method in class org.apache.any23.cli.ExtractorDocumentation
-
Prints an output example for the given extractor.
- printExtractorList(ExtractorRegistry) - Method in class org.apache.any23.cli.ExtractorDocumentation
-
Prints the list of all the available extractors.
- printOut(ExtractionResult, Map<String, String>) - Method in class org.apache.any23.extractor.xpath.QuadTemplate
-
Prints out this quad template in the given
ExtractionResult
, using
the passed
variableAssignment to expand variables.
- printReport(ExtractorRegistry) - Method in class org.apache.any23.cli.ExtractorDocumentation
-
Prints a complete report on all the available extractors.
- printReport(PrintStream) - Method in class org.apache.any23.extractor.ExtractionResultImpl
-
- printReport(PrintStream) - Method in interface org.apache.any23.extractor.IssueReport
-
Prints out the content of the report.
- printReport() - Method in class org.apache.any23.writer.ReportingTripleHandler
-
- printReports() - Method in class org.apache.any23.cli.Rover
-
- printStackTrace(PrintStream) - Method in exception org.apache.any23.extractor.ExtractionException
-
- printStackTrace(PrintWriter) - Method in exception org.apache.any23.extractor.ExtractionException
-
- priority - Variable in class org.apache.any23.vocab.ICAL
-
The URI defines the relative priority for a calendar component.value
type: INTEGER.
- process(Document, ExtractionResult) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
-
- process(Document, ExtractionResult) - Method in interface org.apache.any23.extractor.xpath.XPathExtractionRule
-
Processes this extraction rule on the given document.
- processDocument(URL, Document, ExtractionResult) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- prodid - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThis URI specifies the identifier for the product that
created the iCalendar object..
- PROFILE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- Project - Variable in class org.apache.any23.vocab.FOAF
-
- properties - Variable in class org.apache.any23.configuration.DefaultConfiguration
-
- PROPERTY_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- publication - Variable in class org.apache.any23.vocab.DOAC
-
- Publication - Variable in class org.apache.any23.vocab.DOAC
-
- publications - Variable in class org.apache.any23.vocab.FOAF
-
- published - Variable in class org.apache.any23.vocab.HRECIPE
-
- Purifier - Interface in org.apache.any23.mime.purifier
-
This interface defines a minimum set of methods that
a
TikaMIMETypeDetector
could
call in order to clean the input before performing the
MIME type
detection.
- purify(InputStream) - Method in interface org.apache.any23.mime.purifier.Purifier
-
Performs the purification of the provided resettable
InputStream
.
- purify(InputStream) - Method in class org.apache.any23.mime.purifier.WhiteSpacesPurifier
-
Performs the purification of the provided resettable
InputStream
.
- putData(String, Object) - Method in class org.apache.any23.validator.DefaultRuleContext
-
- putData(String, T) - Method in interface org.apache.any23.validator.RuleContext
-
Puts a data within the context.
- range - Variable in class org.apache.any23.vocab.ICAL
-
To specify the effective range of recurrence instances from the instance specified
by the recurrence identifier specified by the URI..
- rating - Variable in class org.apache.any23.vocab.REVIEW
-
A numeric value.
- rdate - Variable in class org.apache.any23.vocab.ICAL
-
default value type: DATE-TIMEThis URI defines the list of date/times
for a recurrence set..
- RDFa11Extractor - Class in org.apache.any23.extractor.rdfa
-
- RDFa11Extractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
Constructor, allows to specify the validation and error handling policies.
- RDFa11Extractor() - Constructor for class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
Default constructor, with no verification of data types and not stop at first error.
- RDFa11Parser - Class in org.apache.any23.extractor.rdfa
-
This parser is able to extract
RDFa 1.0 and
RDFa 1.1 statements from any
(X)HTML document.
- RDFa11Parser() - Constructor for class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- RDFa11ParserException - Exception in org.apache.any23.extractor.rdfa
-
- RDFa11ParserException(String) - Constructor for exception org.apache.any23.extractor.rdfa.RDFa11ParserException
-
- RDFa11ParserException(String, Throwable) - Constructor for exception org.apache.any23.extractor.rdfa.RDFa11ParserException
-
- RDFaExtractor - Class in org.apache.any23.extractor.rdfa
-
Extractor for RDFa in HTML, based on Fabien Gadon's XSLT transform, found
here.
- RDFaExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdfa.RDFaExtractor
-
Constructor, allows to specify the validation and error handling policies.
- RDFaExtractor() - Constructor for class org.apache.any23.extractor.rdfa.RDFaExtractor
-
Default constructor, with no verification of data types and not stop at first error.
- RDFHandlerAdapter - Class in org.apache.any23.extractor.rdf
-
An RDFHandler that relays statements and prefix definitions to
an
ExtractionResult
.
- RDFHandlerAdapter(ExtractionResult) - Constructor for class org.apache.any23.extractor.rdf.RDFHandlerAdapter
-
- RDFParserFactory - Class in org.apache.any23.extractor.rdf
-
This factory provides a common logic for creating and configuring correctly
any RDF parser used within the library.
- RDFParserFactory() - Constructor for class org.apache.any23.extractor.rdf.RDFParserFactory
-
- RDFSchemaUtils - Class in org.apache.any23.vocab
-
This class provides a set of methods for generating
RDF Schema.
- RDFSchemaUtils.VocabularyFormat - Enum in org.apache.any23.vocab
-
Supported formats for vocabulary serialization.
- RDFUtils - Class in org.apache.any23.rdf
-
Basic class providing a set of utility methods when dealing with RDF.
- RDFWriterTripleHandler - Class in org.apache.any23.writer
-
A
TripleHandler
that writes
triples to a Sesame
RDFWriter
,
eg for serialization using one of Sesame's writers.
- RDFXMLExtractor - Class in org.apache.any23.extractor.rdf
-
- RDFXMLExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.RDFXMLExtractor
-
Constructor, allows to specify the validation and error handling policies.
- RDFXMLExtractor() - Constructor for class org.apache.any23.extractor.rdf.RDFXMLExtractor
-
Default constructor, with no verification of data types and not stop at first error.
- RDFXMLWriter - Class in org.apache.any23.writer
-
RDF/XML writer implementation.
- RDFXMLWriter(OutputStream) - Constructor for class org.apache.any23.writer.RDFXMLWriter
-
- RDFXMLWriterFactory - Class in org.apache.any23.writer
-
- RDFXMLWriterFactory() - Constructor for class org.apache.any23.writer.RDFXMLWriterFactory
-
- read() - Method in class org.apache.any23.extractor.html.SpanCloserInputStream
-
- read() - Method in class org.apache.any23.util.ReaderInputStream
-
Reads from the Reader
, returning the same value.
- read(byte[], int, int) - Method in class org.apache.any23.util.ReaderInputStream
-
Reads from the Reader
into a byte array
- readAttribute(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Reads the value of the specified attribute
, returning the
defaultValue
string if not present.
- readAttribute(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Reads the value of an attribute
, returning the
empty string if not present.
- readAttribute(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Read an attribute avoiding NullPointerExceptions, if the attr is
missing it just returns an empty string.
- readAttributeWithPrefix(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Reads the value of the first attribute which name matches with the specified attributePrefix
.
- ReaderInputStream - Class in org.apache.any23.util
-
Adapts a Reader
as an InputStream
.
- ReaderInputStream(Reader) - Constructor for class org.apache.any23.util.ReaderInputStream
-
Construct a ReaderInputStream
for the specified Reader
.
- ReaderInputStream(Reader, String) - Constructor for class org.apache.any23.util.ReaderInputStream
-
Construct a ReaderInputStream
for the specified Reader
,
with the specified encoding.
- readFileContent(File) - Static method in class org.apache.any23.util.FileUtils
-
Returns the content of a file a single string.
- readFileLines(File) - Static method in class org.apache.any23.util.FileUtils
-
Returns all the lines of a file.
- readNodeContent(Node, boolean) - Static method in class org.apache.any23.extractor.html.HTMLDocument
-
Reads the text content of the given node and returns it.
- readResourceContent(Class, String) - Static method in class org.apache.any23.util.FileUtils
-
Reads a resource file and returns the content as a string.
- readResourceContent(String) - Static method in class org.apache.any23.util.FileUtils
-
Reads a resource file and returns the content as a string.
- reads - Variable in class org.apache.any23.vocab.DOAC
-
- readStream() - Method in class org.apache.any23.source.FileDocumentSource
-
- readTextField(Node) - Static method in class org.apache.any23.extractor.html.HTMLDocument
-
Reads a text field from the given node adding the content to the given res list.
- readUrlField(List<HTMLDocument.TextField>, Node) - Static method in class org.apache.any23.extractor.html.HTMLDocument
-
Reads an URL field from the given node adding the content to the given res list.
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
-
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
-
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
-
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
-
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
-
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
-
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.JSONWriter
-
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
-
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
-
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
-
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
-
- receiveNamespace(String, String, ExtractionContext) - Method in interface org.apache.any23.writer.TripleHandler
-
Invoked with a currently open context, notifies the detection of a
namespace.
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.URIListWriter
-
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
-
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
-
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
-
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
-
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
-
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
-
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.JSONWriter
-
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
-
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
-
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
-
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
-
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in interface org.apache.any23.writer.TripleHandler
-
Invoked with a currently open context,
notifies the detection of a triple.
- receiveTriple(Resource, URI, Value, URI, ExtractionContext) - Method in class org.apache.any23.writer.URIListWriter
-
- Recipe - Variable in class org.apache.any23.vocab.HRECIPE
-
- recurrenceId - Variable in class org.apache.any23.vocab.ICAL
-
default value type: DATE-TIMEThis URI is used in conjunction with the
"UID" and "SEQUENCE" URI to identify a specific instance of a recurring
"VEVENT", "VTODO" or "VJOURNAL" calendar component.
- RedirectServlet - Class in org.apache.any23.servlet
-
This servlet contains the logic to perform the correct redirects
when Any23 is used as a all-in-one web application.
- RedirectServlet() - Constructor for class org.apache.any23.servlet.RedirectServlet
-
- reference - Variable in class org.apache.any23.vocab.DOAC
-
- Reference - Variable in class org.apache.any23.vocab.DOAC
-
- reference_type - Variable in class org.apache.any23.vocab.DOAC
-
- referer - Variable in class org.apache.any23.vocab.DOAC
-
- region - Variable in class org.apache.any23.vocab.HLISTING
-
- region - Variable in class org.apache.any23.vocab.VCARD
-
The region (e.g., state or province) of a postal address.
- register(ExtractorFactory<?>) - Method in interface org.apache.any23.extractor.ExtractorRegistry
-
- register(ExtractorFactory<?>) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
-
- register(WriterFactory) - Method in class org.apache.any23.writer.WriterFactoryRegistry
-
- REL_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- related - Variable in class org.apache.any23.vocab.DCTERMS
-
- related - Variable in class org.apache.any23.vocab.ICAL
-
To specify the relationship of the alarm trigger with respect to the start
or end of the calendar component..
- relatedTo - Variable in class org.apache.any23.vocab.ICAL
-
The URI is used to represent a relationship or reference between one
calendar component and another.value type: TEXT.
- reltype - Variable in class org.apache.any23.vocab.ICAL
-
To specify the type of hierarchical relationship associated with the calendar
component specified by the URI..
- remove(Variable) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
-
Removes a variable from the template.
- remove(QuadTemplate) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
-
Removes a quad template from the rule.
- remove(Variable) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
-
- remove(QuadTemplate) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
-
- remove(XPathExtractionRule) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
-
- removeData(String) - Method in class org.apache.any23.validator.DefaultRuleContext
-
- removeData(String) - Method in interface org.apache.any23.validator.RuleContext
-
Removes a data from the context.
- removeListener(CrawlerListener) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
- removePrefix(String) - Method in class org.apache.any23.rdf.Prefixes
-
- removeRule(Class<? extends Rule>) - Method in class org.apache.any23.validator.DefaultValidator
-
- removeRule(Class<? extends Rule>) - Method in interface org.apache.any23.validator.Validator
-
Allows to remove a rule from the validator and all the related
Fix
es.
- rent - Variable in class org.apache.any23.vocab.HLISTING
-
- repeat - Variable in class org.apache.any23.vocab.ICAL
-
This URI defines the number of time the alarm should be repeated, after
the initial trigger.value type: INTEGER.
- report() - Method in class org.apache.any23.writer.BenchmarkTripleHandler
-
Returns the report as a human readable string.
- reportFixError(Fix, Exception, String) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
-
- reportFixError(Fix, Exception, String) - Method in interface org.apache.any23.validator.ValidationReportBuilder
-
Reports an error occurred while executing a
Fix
.
- ReportingTripleHandler - Class in org.apache.any23.writer
-
A
TripleHandler
that collects
various information about the extraction process, such as
the extractors used and the total number of triples.
- ReportingTripleHandler(TripleHandler) - Constructor for class org.apache.any23.writer.ReportingTripleHandler
-
- reportIssue(ValidationReport.IssueLevel, String, Node) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
-
- reportIssue(ValidationReport.IssueLevel, String) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
-
- reportIssue(ValidationReport.IssueLevel, String, Node) - Method in interface org.apache.any23.validator.ValidationReportBuilder
-
Reports an issue detected on a specified node.
- reportIssue(ValidationReport.IssueLevel, String) - Method in interface org.apache.any23.validator.ValidationReportBuilder
-
Reports a detected issue.
- reportRuleError(Rule, Exception, String) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
-
- reportRuleError(Rule, Exception, String) - Method in interface org.apache.any23.validator.ValidationReportBuilder
-
Reports an error occurred while executing a
Rule
.
- RepositoryWriter - Class in org.apache.any23.writer
-
A Sesame repository triple writer.
- RepositoryWriter(RepositoryConnection) - Constructor for class org.apache.any23.writer.RepositoryWriter
-
- RepositoryWriter(RepositoryConnection, Resource) - Constructor for class org.apache.any23.writer.RepositoryWriter
-
- requestStatus - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThis URI defines the status code returned for a scheduling
request..
- reset() - Method in class org.apache.any23.extractor.html.HCardName
-
Resets the content of the HName fields.
- reset() - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Resets the parser to the original state.
- reset() - Method in class org.apache.any23.util.ReaderInputStream
-
Resets the StringReader.
- reset() - Method in class org.apache.any23.writer.CountingTripleHandler
-
- resetExtractor() - Method in class org.apache.any23.extractor.html.AdrExtractor
-
- resetExtractor() - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
-
Resets the internal status of the extractor to prepare it to a new extraction section.
- resetExtractor() - Method in class org.apache.any23.extractor.html.GeoExtractor
-
- resetExtractor() - Method in class org.apache.any23.extractor.html.HCardExtractor
-
- resetExtractor() - Method in class org.apache.any23.extractor.html.HListingExtractor
-
- resetExtractor() - Method in class org.apache.any23.extractor.html.HRecipeExtractor
-
- resetExtractor() - Method in class org.apache.any23.extractor.html.HResumeExtractor
-
- resetExtractor() - Method in class org.apache.any23.extractor.html.HReviewExtractor
-
- resetExtractor() - Method in class org.apache.any23.extractor.html.SpeciesExtractor
-
Resets the internal status of the extractor to prepare it to a new extraction section.
- resolveCURIEOrURI(String, boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Resolves a CURIE or URI string.
- resolveCurieOrURIList(Node, String, boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Resolves a whitelist separated list of CURIE or URI.
- resolveURI(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
- resolveURI(String) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Resolves a URI string as URI.
- resolveURI(String, URI) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
Fixes typical errors in URIs, and resolves relative URIs against a base URI.
- RESOURCE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- RESOURCE_NAME - Static variable in class org.apache.any23.mime.TikaMIMETypeDetector
-
- resources - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThis URI defines the equipment or resources anticipated
for an activity specified by a calendar entity...
- rev - Variable in class org.apache.any23.vocab.VCARD
-
The timestamp of a revision of a vCard.
- REV_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- REVIEW - Class in org.apache.any23.vocab
-
Vocabulary definitions from vocabularies/review.rdf
- Review - Variable in class org.apache.any23.vocab.REVIEW
-
A review of an work.
- reviewer - Variable in class org.apache.any23.vocab.REVIEW
-
The person that has written the review.
- role - Variable in class org.apache.any23.vocab.ICAL
-
To specify the participation role for the calendar user specified by the URI..
- role - Variable in class org.apache.any23.vocab.VCARD
-
A role a person plays within an organization.
- ROOT_EXTRACTION_RESULT_ID - Static variable in class org.apache.any23.extractor.ExtractionContext
-
- Rover - Class in org.apache.any23.cli
-
A default rover implementation.
- Rover() - Constructor for class org.apache.any23.cli.Rover
-
- Rover.ArgumentToURIConverter - Class in org.apache.any23.cli
-
- Rover.ArgumentToURIConverter() - Constructor for class org.apache.any23.cli.Rover.ArgumentToURIConverter
-
- Rover.PrintStreamConverter - Class in org.apache.any23.cli
-
- Rover.PrintStreamConverter() - Constructor for class org.apache.any23.cli.Rover.PrintStreamConverter
-
- ROW - Static variable in class org.apache.any23.vocab.CSV
-
- row - Variable in class org.apache.any23.vocab.CSV
-
This property links the identifier of a CSV to an entity representing
a row.
- ROW - Static variable in class org.apache.any23.vocab.Excel
-
- row - Variable in class org.apache.any23.vocab.Excel
-
This resource identifies a row.
- ROW_POSITION - Static variable in class org.apache.any23.vocab.CSV
-
- ROW_TYPE - Static variable in class org.apache.any23.vocab.CSV
-
- rowPosition - Variable in class org.apache.any23.vocab.CSV
-
This property expresses the index of a row in a CSV file.
- rowType - Variable in class org.apache.any23.vocab.CSV
-
This resource identifies a Row.
- rrule - Variable in class org.apache.any23.vocab.ICAL
-
This URI defines a rule or repeating pattern for recurring events, to-dos,
or time zone definitions.value type: RECUR.
- rsvp - Variable in class org.apache.any23.vocab.ICAL
-
To specify whether there is an expectation of a favor of a reply from the
calendar user specified by the URI value..
- Rule - Interface in org.apache.any23.validator
-
Models a rule for an issue detection.
- RuleContext<T> - Interface in org.apache.any23.validator
-
This context is used to pass data from a
Rule
to a
Fix
.
- run() - Method in class org.apache.any23.cli.Crawler
-
- run() - Method in class org.apache.any23.cli.ExtractorDocumentation
-
- run() - Method in class org.apache.any23.cli.MicrodataParser
-
- run() - Method in class org.apache.any23.cli.MimeDetector
-
- run() - Method in class org.apache.any23.cli.PluginVerifier
-
- run() - Method in class org.apache.any23.cli.Rover
-
- run() - Method in interface org.apache.any23.cli.Tool
-
Runs the tool and retrieves the exit code.
- run() - Method in class org.apache.any23.cli.VocabPrinter
-
- run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.extractor.csv.CSVExtractor
-
Executes the extractor.
- run(ExtractionParameters, ExtractionContext, Input, ExtractionResult) - Method in interface org.apache.any23.extractor.Extractor
-
Executes the extractor.
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.HeadLinkExtractor
-
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.HTMLMetaExtractor
-
Executes the extractor.
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.ICBMExtractor
-
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.LicenseExtractor
-
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.TitleExtractor
-
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.TurtleHTMLExtractor
-
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.XFNExtractor
-
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.microdata.MicrodataExtractor
-
- run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
-
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
-
- run(ExtractionParameters) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
Triggers the execution of all the
Extractor
registered to this class using the specified extraction parameters.
- run() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
Triggers the execution of all the
Extractor
registered to this class using the
default extraction parameters.
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
-
- run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
-
- run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.plugin.officescraper.ExcelExtractor
-
- SCHEMAORG - Class in org.apache.any23.vocab
-
- schoolHomepage - Variable in class org.apache.any23.vocab.FOAF
-
- scientificName - Variable in class org.apache.any23.vocab.WO
-
specifies the scientific name of a species, allowing
this portion of the name to be explicitly described.
- SecondarySchool - Variable in class org.apache.any23.vocab.DOAC
-
- sell - Variable in class org.apache.any23.vocab.HLISTING
-
- sentBy - Variable in class org.apache.any23.vocab.ICAL
-
To specify the calendar user that is acting on behalf of the calendar user
specified by the URI..
- sequence - Variable in class org.apache.any23.vocab.ICAL
-
value type: integerThis URI defines the revision sequence number of the
calendar component within a sequence of revisions..
- SerializationException - Exception in org.apache.any23.validator
-
- SerializationException(String) - Constructor for exception org.apache.any23.validator.SerializationException
-
- SerializationException(String, Throwable) - Constructor for exception org.apache.any23.validator.SerializationException
-
- serialize(ValidationReport, OutputStream) - Method in interface org.apache.any23.validator.ValidationReportSerializer
-
Serializes the validation report on the given output stream.
- serialize(ValidationReport, OutputStream) - Method in class org.apache.any23.validator.XMLValidationReportSerializer
-
- serializeGetterValue(Object, Method, PrintStream) - Method in class org.apache.any23.validator.XMLValidationReportSerializer
-
- serializeToXML(Node, boolean) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Given a
DOM Node
produces the
XML serialization
omitting the
XML declaration.
- serializeVocabularies(RDFSchemaUtils.VocabularyFormat, PrintStream) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
-
Serializes all the vocabularies to NQuads over the given output stream.
- serializeVocabulary(URI, URI[], URI[], Map<URI, String>, RDFWriter) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
-
Serializes a vocabulary composed of the given namespace
,
resources
and properties
.
- serializeVocabulary(Vocabulary, RDFWriter) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
-
Serializes the given vocabulary
to triples over the given writer
.
- serializeVocabulary(Vocabulary, RDFSchemaUtils.VocabularyFormat, boolean, PrintStream) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
-
Serializes the given vocabulary
to NQuads over the given output stream.
- serializeVocabulary(Vocabulary, RDFSchemaUtils.VocabularyFormat) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
-
Serialized the given vocabulary
to NQuads and return them as string.
- service - Variable in class org.apache.any23.vocab.HLISTING
-
- Servlet - Class in org.apache.any23.servlet
-
A Servlet that fetches a client-specified URI,
RDFizes the content, and returns it in a format chosen by the client.
- Servlet() - Constructor for class org.apache.any23.servlet.Servlet
-
- setAnnotated(boolean) - Method in interface org.apache.any23.writer.FormatWriter
-
Sets the annotation flag.
- setAnnotated(boolean) - Method in class org.apache.any23.writer.JSONWriter
-
- setAnnotated(boolean) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
-
Sets the annotation flag.
- setAnnotated(boolean) - Method in class org.apache.any23.writer.URIListWriter
-
- setCacheFactory(LocalCopyFactory) - Method in class org.apache.any23.Any23
-
- setContentLength(long) - Method in class org.apache.any23.filter.ExtractionContextBlocker
-
- setContentLength(long) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
-
- setContentLength(long) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
-
- setContentLength(long) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
-
- setContentLength(long) - Method in class org.apache.any23.writer.CompositeTripleHandler
-
- setContentLength(long) - Method in class org.apache.any23.writer.CountingTripleHandler
-
- setContentLength(long) - Method in class org.apache.any23.writer.JSONWriter
-
- setContentLength(long) - Method in class org.apache.any23.writer.LoggingTripleHandler
-
- setContentLength(long) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
-
- setContentLength(long) - Method in class org.apache.any23.writer.ReportingTripleHandler
-
- setContentLength(long) - Method in class org.apache.any23.writer.RepositoryWriter
-
- setContentLength(long) - Method in interface org.apache.any23.writer.TripleHandler
-
Sets the length of the content to be processed.
- setContentLength(long) - Method in class org.apache.any23.writer.URIListWriter
-
- setCrawlData(String, Pattern, List<CrawlerListener>) - Static method in class org.apache.any23.plugin.crawler.SharedData
-
Initializes the crawler data.
- setDefaultAccept(String) - Method in class org.apache.any23.servlet.conneg.ContentTypeNegotiator
-
Sets an Accept header to be used as the default if a client does
not send an Accept header, or if the Accept header cannot be parsed.
- setDefaultLogging() - Static method in class org.apache.any23.util.LogUtils
-
- setErrorMode(MicrodataParser.ErrorMode) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
-
- setErrorNode(Node) - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
-
- setField(String, HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
-
- setFlag(String, boolean) - Method in class org.apache.any23.extractor.ExtractionParameters
-
Sets the value for an extraction flag.
- setFullName(HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
-
- setHTTPClient(HTTPClient) - Method in class org.apache.any23.Any23
-
Allows to set the
HTTPClient
implementation
used to retrieve contents.
- setHTTPUserAgent(String) - Method in class org.apache.any23.Any23
-
Sets the HTTP Header User Agent,
see RFC 2616-14.43.
- setIssueReport(IssueReport) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
- setLocalCopyFactory(LocalCopyFactory) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
Sets the internal factory for generating the document local copy,
if
null
the
MemCopyFactory
will be used.
- setMaxDepth(int) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
Sets the maximum depth.
- setMaxPages(int) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
Sets the maximum collected pages.
- setMIMETypeDetector(MIMETypeDetector) - Method in class org.apache.any23.Any23
-
- setMIMETypeDetector(MIMETypeDetector) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
Sets the internal mime type detector,
if null
mimetype detection will
be skipped and all extractors will be activated.
- setNumOfCrawlers(int) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
Sets the number of crawler instances.
- setOrganization(HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
-
- setOrganizationUnit(HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
-
- setParserEncoding(String) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
Sets the document parser encoding.
- setPolitenessDelay(int) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
Sets the politeness delay.
- setProperty(String, String) - Method in class org.apache.any23.configuration.DefaultModifiableConfiguration
-
- setProperty(String, String) - Method in interface org.apache.any23.configuration.ModifiableConfiguration
-
Sets a new value propertyValue
for
property which name propertyName
.
- setProperty(String, String) - Method in class org.apache.any23.extractor.ExtractionParameters
-
Sets the value for an extraction property.
- setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.csv.CSVExtractor
-
If true
, the extractor will stop at first parsing error,
iffalse
the extractor will attempt to ignore all parsing errors.
- setStopAtFirstError(boolean) - Method in interface org.apache.any23.extractor.Extractor.ContentExtractor
-
If true
, the extractor will stop at first parsing error,
iffalse
the extractor will attempt to ignore all parsing errors.
- setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
-
- setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
- setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
-
- setStopAtFirstError(boolean) - Method in class org.apache.any23.plugin.htmlscraper.HTMLScraperExtractor
-
- setStopAtFirstError(boolean) - Method in class org.apache.any23.plugin.officescraper.ExcelExtractor
-
- setVerboseLogging() - Static method in class org.apache.any23.util.LogUtils
-
- setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
-
- setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
- setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
-
- setWebCrawler(Class<? extends WebCrawler>) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
Sets the actual crawler class.
- sha1 - Variable in class org.apache.any23.vocab.FOAF
-
- SharedData - Class in org.apache.any23.plugin.crawler
-
- SHEET - Static variable in class org.apache.any23.vocab.Excel
-
- sheet - Variable in class org.apache.any23.vocab.Excel
-
This resource identifies a Sheet.
- SHEET_NAME - Static variable in class org.apache.any23.vocab.Excel
-
- sheetName - Variable in class org.apache.any23.vocab.Excel
-
This property links the identifier of a Sheet to the name of the sheet.
- shouldVisit(WebURL) - Method in class org.apache.any23.plugin.crawler.DefaultWebCrawler
-
Override this method to specify whether the given URL should be visited or not.
- SimpleExtractorFactory<T extends Extractor<?>> - Class in org.apache.any23.extractor
-
- SINDICE - Class in org.apache.any23.vocab
-
This class models an internal Sindice Vocabulary to describe
resource domains and Microformat nesting relationships.
- SingleDocumentExtraction - Class in org.apache.any23.extractor
-
This class acts as facade where all the extractors were called on a single document.
- SingleDocumentExtraction(Configuration, DocumentSource, ExtractorGroup, TripleHandler) - Constructor for class org.apache.any23.extractor.SingleDocumentExtraction
-
Builds an extractor by the specification of document source,
list of extractors and output triple handler.
- SingleDocumentExtraction(Configuration, DocumentSource, ExtractorFactory<?>, TripleHandler) - Constructor for class org.apache.any23.extractor.SingleDocumentExtraction
-
Builds an extractor by the specification of document source,
extractors factory and output triple handler.
- SingleDocumentExtraction(DocumentSource, ExtractorFactory<?>, TripleHandler) - Constructor for class org.apache.any23.extractor.SingleDocumentExtraction
-
Builds an extractor by the specification of document source,
extractors factory and output triple handler, using the
DefaultConfiguration
.
- SingleDocumentExtractionReport - Class in org.apache.any23.extractor
-
- SingleDocumentExtractionReport(ValidationReport, Map<String, Collection<IssueReport.Issue>>) - Constructor for class org.apache.any23.extractor.SingleDocumentExtractionReport
-
- singleton - Static variable in class org.apache.any23.configuration.DefaultConfiguration
-
- singleton() - Static method in class org.apache.any23.configuration.DefaultConfiguration
-
- singleton() - Static method in class org.apache.any23.http.DefaultHTTPClientConfiguration
-
- SITE_NAME - Static variable in class org.apache.any23.vocab.OGP
-
If your object is part of a larger web site, the name which should be
displayed for the overall site.
- SiteCrawler - Class in org.apache.any23.plugin.crawler
-
A basic site crawler to extract semantic content
of small/medium size sites.
- SiteCrawler(File) - Constructor for class org.apache.any23.plugin.crawler.SiteCrawler
-
Constructor.
- siteName - Variable in class org.apache.any23.vocab.OGP
-
- SIZE - Static variable in class org.apache.any23.vocab.SINDICE
-
- size - Variable in class org.apache.any23.vocab.SINDICE
-
Size meta property indicating the number of triples within the returned dataset.
- skill - Variable in class org.apache.any23.vocab.DOAC
-
- Skill - Variable in class org.apache.any23.vocab.DOAC
-
- SocialSkill - Variable in class org.apache.any23.vocab.DOAC
-
- sort_string - Variable in class org.apache.any23.vocab.VCARD
-
A version of a person's name suitable for collation.
- sound - Variable in class org.apache.any23.vocab.VCARD
-
A sound (e.g., a greeting or pronounciation) of a person.
- source() - Method in class org.apache.any23.extractor.html.HTMLDocument.TextField
-
- source - Variable in class org.apache.any23.vocab.DCTERMS
-
- SpanCloserInputStream - Class in org.apache.any23.extractor.html
-
Extension of
InputStream
meant to
detect and replace any occurrence of inline
span:
- SpanCloserInputStream(InputStream) - Constructor for class org.apache.any23.extractor.html.SpanCloserInputStream
-
- speaks - Variable in class org.apache.any23.vocab.DOAC
-
- species - Variable in class org.apache.any23.vocab.WO
-
Generic class defining a biological species
- SpeciesExtractor - Class in org.apache.any23.extractor.html
-
- SpeciesExtractor() - Constructor for class org.apache.any23.extractor.html.SpeciesExtractor
-
- speciesName - Variable in class org.apache.any23.vocab.WO
-
specifies the species part of a binomial name, allowing
this portion of the name to be explicitly described.
- spouse - Variable in class org.apache.any23.vocab.XFN
-
- SRC_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- SRC_TAGS - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
-
List of tags providing the src
property.
- standard - Variable in class org.apache.any23.vocab.ICAL
-
- start(URL, Pattern, boolean) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
Starts the crawling process.
- start(URL, boolean) - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
- start_date - Variable in class org.apache.any23.vocab.DOAC
-
- startDocument(URI) - Method in class org.apache.any23.filter.ExtractionContextBlocker
-
- startDocument(URI) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
-
- startDocument(URI) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
-
- startDocument(URI) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
-
- startDocument(URI) - Method in class org.apache.any23.writer.CompositeTripleHandler
-
- startDocument(URI) - Method in class org.apache.any23.writer.CountingTripleHandler
-
- startDocument(URI) - Method in class org.apache.any23.writer.JSONWriter
-
- startDocument(URI) - Method in class org.apache.any23.writer.LoggingTripleHandler
-
- startDocument(URI) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
-
- startDocument(URI) - Method in class org.apache.any23.writer.ReportingTripleHandler
-
- startDocument(URI) - Method in class org.apache.any23.writer.RepositoryWriter
-
- startDocument(URI) - Method in interface org.apache.any23.writer.TripleHandler
-
- startDocument(URI) - Method in class org.apache.any23.writer.URIListWriter
-
- startRDF() - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
-
- startRDF() - Method in class org.apache.any23.io.nquads.NQuadsWriter
-
- status - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThis URI defines the overall status or confirmation for
the calendar component..
- stop() - Method in class org.apache.any23.plugin.crawler.SiteCrawler
-
Interrupts the crawler process if started with wait
flag == false
.
- StreamUtils - Class in org.apache.any23.util
-
Contains general utility functions for handling streams.
- street_address - Variable in class org.apache.any23.vocab.VCARD
-
The street address of a postal address.
- streetAddress - Variable in class org.apache.any23.vocab.HLISTING
-
- StringDocumentSource - Class in org.apache.any23.source
-
- StringDocumentSource(String, String) - Constructor for class org.apache.any23.source.StringDocumentSource
-
- StringDocumentSource(String, String, String) - Constructor for class org.apache.any23.source.StringDocumentSource
-
- StringDocumentSource(String, String, String, String) - Constructor for class org.apache.any23.source.StringDocumentSource
-
- StringUtils - Class in org.apache.any23.util
-
This class provides a set of string utility methods.
- stylesheet - Variable in class org.apache.any23.vocab.XHTML
-
- SUBJECT_ATTRIBUTES - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- summary - Variable in class org.apache.any23.vocab.DOAC
-
- summary - Variable in class org.apache.any23.vocab.HLISTING
-
- summary - Variable in class org.apache.any23.vocab.HRECIPE
-
- summary - Variable in class org.apache.any23.vocab.ICAL
-
This URI defines a short summary or subject for the calendar component.value
type: TEXT.
- surname - Variable in class org.apache.any23.vocab.FOAF
-
- sweetheart - Variable in class org.apache.any23.vocab.XFN
-
- tag - Variable in class org.apache.any23.vocab.HRECIPE
-
- TagSoupExtractionResult - Interface in org.apache.any23.extractor
-
This interface models a specific
ExtractionResult
able to collect property roots generated by
HTML Microformat extractions.
- TagSoupExtractionResult.PropertyPath - Class in org.apache.any23.extractor
-
Defines a property path object.
- TagSoupExtractionResult.PropertyPath(String[], Resource, Resource, BNode, Class<? extends MicroformatExtractor>) - Constructor for class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
-
- TagSoupExtractionResult.ResourceRoot - Class in org.apache.any23.extractor
-
Defines a property root object.
- TagSoupExtractionResult.ResourceRoot(String[], Resource, Class<? extends MicroformatExtractor>) - Constructor for class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
-
- TagSoupParser - Class in org.apache.any23.extractor.html
-
Parses an
InputStream
into an
HTML DOM tree using a TagSoup parser.
- TagSoupParser(InputStream, String) - Constructor for class org.apache.any23.extractor.html.TagSoupParser
-
- TagSoupParser(InputStream, String, String) - Constructor for class org.apache.any23.extractor.html.TagSoupParser
-
- TagSoupParser.ElementLocation - Class in org.apache.any23.extractor.html
-
Describes a DOM Element location.
- tel - Variable in class org.apache.any23.vocab.HLISTING
-
- tel - Variable in class org.apache.any23.vocab.VCARD
-
A telephone number of a person.
- Telephone - Variable in class org.apache.any23.vocab.VCARD
-
Resources that are vCard Telephones.
- TemplateGraph - Class in org.apache.any23.extractor.xpath
-
Represents an Quad graph URI template.
- TemplateGraph(String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplateGraph
-
Constructor.
- TemplateObject - Class in org.apache.any23.extractor.xpath
-
Represents a Quad object template.
- TemplateObject(TemplateObject.Type, String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplateObject
-
Constructor.
- TemplateObject.Type - Enum in org.apache.any23.extractor.xpath
-
Supported object types.
- TemplatePredicate - Class in org.apache.any23.extractor.xpath
-
Represents a Quad predicate template.
- TemplatePredicate(String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplatePredicate
-
Constructor.
- TemplateSubject - Class in org.apache.any23.extractor.xpath
-
Represents a Quad subject template.
- TemplateSubject(TemplateSubject.Type, String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplateSubject
-
Constructor.
- TemplateSubject.Type - Enum in org.apache.any23.extractor.xpath
-
Supported subject types.
- TemplateXPathExtractionRule - Interface in org.apache.any23.extractor.xpath
-
- TemplateXPathExtractionRuleImpl - Class in org.apache.any23.extractor.xpath
-
- TemplateXPathExtractionRuleImpl(String, String) - Constructor for class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
-
- Term<T extends org.openrdf.model.Value> - Class in org.apache.any23.extractor.xpath
-
Represents a generic template term.
- Term(String, boolean) - Constructor for class org.apache.any23.extractor.xpath.Term
-
Constructor.
- text - Variable in class org.apache.any23.vocab.REVIEW
-
The text of the review.
- theme - Variable in class org.apache.any23.vocab.FOAF
-
- thumbnail - Variable in class org.apache.any23.vocab.FOAF
-
- TikaEncodingDetector - Class in org.apache.any23.encoding
-
- TikaEncodingDetector() - Constructor for class org.apache.any23.encoding.TikaEncodingDetector
-
- TikaMIMETypeDetector - Class in org.apache.any23.mime
-
- TikaMIMETypeDetector(Purifier) - Constructor for class org.apache.any23.mime.TikaMIMETypeDetector
-
- TikaMIMETypeDetector() - Constructor for class org.apache.any23.mime.TikaMIMETypeDetector
-
- tipjar - Variable in class org.apache.any23.vocab.FOAF
-
- title - Variable in class org.apache.any23.vocab.DCTERMS
-
- title - Variable in class org.apache.any23.vocab.DOAC
-
- title - Variable in class org.apache.any23.vocab.FOAF
-
- TITLE - Static variable in class org.apache.any23.vocab.OGP
-
The title of your object as it should appear within the graph (Facebook Graph API).
- title - Variable in class org.apache.any23.vocab.OGP
-
- title - Variable in class org.apache.any23.vocab.REVIEW
-
The title of the review.
- title - Variable in class org.apache.any23.vocab.VCARD
-
A person's title.
- TitleExtractor - Class in org.apache.any23.extractor.html
-
Extracts the value of the <title> element of an
HTML or XHTML page.
- TitleExtractor() - Constructor for class org.apache.any23.extractor.html.TitleExtractor
-
- toByteArray(InputStream) - Static method in class org.apache.any23.source.MemCopyFactory
-
- toJSON() - Method in class org.apache.any23.extractor.microdata.Item
-
- toJSON() - Method in class org.apache.any23.extractor.microdata.ItemProp
-
- toJSON() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- toJSON() - Method in class org.apache.any23.extractor.microdata.ItemScope
-
- toJSON() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
-
- Tool - Interface in org.apache.any23.cli
-
Defines a runnable CLI tool.
- ToolRunner - Class in org.apache.any23.cli
-
This class is the main class responsible to provide a uniform command-line
access points to all the others tools like
Rover
.
- ToolRunner() - Constructor for class org.apache.any23.cli.ToolRunner
-
- topic - Variable in class org.apache.any23.vocab.FOAF
-
- topic_interest - Variable in class org.apache.any23.vocab.FOAF
-
- toString() - Method in class org.apache.any23.extractor.ExtractionContext
-
- toString() - Method in class org.apache.any23.extractor.ExtractionResultImpl
-
- toString() - Method in class org.apache.any23.extractor.IssueReport.Issue
-
- toString() - Method in class org.apache.any23.extractor.microdata.ItemProp
-
- toString() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
-
- toString() - Method in class org.apache.any23.extractor.microdata.ItemScope
-
- toString() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
-
- toString() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
-
- toString() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
-
- toString() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
-
- toString() - Method in class org.apache.any23.extractor.xpath.TemplateGraph
-
- toString() - Method in class org.apache.any23.extractor.xpath.TemplateObject
-
- toString() - Method in class org.apache.any23.extractor.xpath.TemplatePredicate
-
- toString() - Method in class org.apache.any23.extractor.xpath.TemplateSubject
-
- toString() - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
-
- toString() - Method in class org.apache.any23.extractor.xpath.Term
-
- toString() - Method in class org.apache.any23.mime.MIMEType
-
- toString() - Method in class org.apache.any23.servlet.conneg.MediaRangeSpec
-
- toString() - Method in class org.apache.any23.validator.DefaultValidationReport
-
- toString() - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
-
- toString() - Method in class org.apache.any23.validator.EmptyValidationReport
-
- toString() - Method in class org.apache.any23.validator.ValidationReport.Error
-
- toString() - Method in class org.apache.any23.validator.ValidationReport.FixError
-
- toString() - Method in class org.apache.any23.validator.ValidationReport.Issue
-
- toString() - Method in class org.apache.any23.validator.ValidationReport.RuleActivation
-
- toString() - Method in class org.apache.any23.validator.ValidationReport.RuleError
-
- totalVotes - Variable in class org.apache.any23.vocab.REVIEW
-
Number of usefulness votes (integer).
- toValue(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Value
.
- toXSDDateTime(Date) - Static method in class org.apache.any23.rdf.RDFUtils
-
Prints a date
to the XSD datetime format.
- traceRuleActivation(Rule) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
-
- traceRuleActivation(Rule) - Method in interface org.apache.any23.validator.ValidationReportBuilder
-
Traces that a rule has been applied.
- trade - Variable in class org.apache.any23.vocab.HLISTING
-
- transp - Variable in class org.apache.any23.vocab.ICAL
-
This URI defines whether an event is transparent or not to busy time
searches.value type: TEXT.
- trigger - Variable in class org.apache.any23.vocab.ICAL
-
This URI specifies when an alarm will trigger.default value type: DURATION.
- triple(Resource, URI, Value) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Statement
.
- triple(String, String, String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a statement of type: toValue(s), toValue(p), toValue(o)
- TripleHandler - Interface in org.apache.any23.writer
-
Defines a document based triple handler.
- TripleHandlerException - Exception in org.apache.any23.writer
-
Defines a generic exception raised when accessing to
TripleHandler
class.
- TripleHandlerException(String) - Constructor for exception org.apache.any23.writer.TripleHandlerException
-
- TripleHandlerException(String, Exception) - Constructor for exception org.apache.any23.writer.TripleHandlerException
-
- TriXExtractor - Class in org.apache.any23.extractor.rdf
-
- TriXExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.TriXExtractor
-
Constructor, allows to specify the validation and error handling policies.
- TriXExtractor() - Constructor for class org.apache.any23.extractor.rdf.TriXExtractor
-
Default constructor, with no verification of data types and not stop at first error.
- TriXWriter - Class in org.apache.any23.writer
-
TriX format writer implementation.
- TriXWriter(OutputStream) - Constructor for class org.apache.any23.writer.TriXWriter
-
- TriXWriterFactory - Class in org.apache.any23.writer
-
- TriXWriterFactory() - Constructor for class org.apache.any23.writer.TriXWriterFactory
-
- TurtleExtractor - Class in org.apache.any23.extractor.rdf
-
- TurtleExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.TurtleExtractor
-
Constructor, allows to specify the validation and error handling policies.
- TurtleExtractor() - Constructor for class org.apache.any23.extractor.rdf.TurtleExtractor
-
Default constructor, with no verification of data types and no stop at first error.
- TurtleHTMLExtractor - Class in org.apache.any23.extractor.html
-
Extractor for Turtle/N3 format embedded within HTML
script tags.
- TurtleHTMLExtractor() - Constructor for class org.apache.any23.extractor.html.TurtleHTMLExtractor
-
- TurtleWriter - Class in org.apache.any23.writer
-
N3 notation writer.
- TurtleWriter(OutputStream) - Constructor for class org.apache.any23.writer.TurtleWriter
-
Constructor.
- TurtleWriterFactory - Class in org.apache.any23.writer
-
- TurtleWriterFactory() - Constructor for class org.apache.any23.writer.TurtleWriterFactory
-
- TYPE - Static variable in class org.apache.any23.vocab.OGP
-
The type of your object, e.g., video.movie
.
- type - Variable in class org.apache.any23.vocab.OGP
-
- type - Variable in class org.apache.any23.vocab.REVIEW
-
The type of media of a work under review.
- TYPE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- TYPEOF_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- tz - Variable in class org.apache.any23.vocab.VCARD
-
A timezone associated with a person.
- tzid - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTTo specify the identifier for the time zone definition for
a time component in the URI value.This URI specifies the text value
that uniquely identifies the "VTIMEZONE" calendar component..
- tzname - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThis URI specifies the customary designation for a time
zone description..
- tzoffsetfrom - Variable in class org.apache.any23.vocab.ICAL
-
value type: UTC-OFFSETThis URI specifies the offset which is in use prior
to this time zone observance..
- tzoffsetto - Variable in class org.apache.any23.vocab.ICAL
-
value type: UTC-OFFSETThis URI specifies the offset which is in use in
this time zone observance..
- tzurl - Variable in class org.apache.any23.vocab.ICAL
-
The TZURL provides a means for a VTIMEZONE component to point to a network
location that can be used to retrieve an up-to- date version of itself.value
type: URI.
- Valarm - Variable in class org.apache.any23.vocab.ICAL
-
Provide a grouping of component properties that define an alarm..
- validate(DOMDocument, boolean) - Method in class org.apache.any23.validator.DefaultValidator
-
- validate(URI, Document, boolean) - Method in class org.apache.any23.validator.DefaultValidator
-
- validate(DOMDocument, boolean) - Method in interface org.apache.any23.validator.Validator
-
Performs a validation - fixing of the provided document.
- validate(URI, Document, boolean) - Method in interface org.apache.any23.validator.Validator
-
Performs a validation - fixing of the provided document.
- ValidationReport - Interface in org.apache.any23.validator
-
This class contains the report of a validation performed by
the
Validator
class.
- ValidationReport.Error - Class in org.apache.any23.validator
-
An error occurred while performing the validation process.
- ValidationReport.Error(Exception, String) - Constructor for class org.apache.any23.validator.ValidationReport.Error
-
- ValidationReport.FixError - Class in org.apache.any23.validator
-
An error occurred while executing a fix.
- ValidationReport.FixError(Fix, Exception, String) - Constructor for class org.apache.any23.validator.ValidationReport.FixError
-
- ValidationReport.Issue - Class in org.apache.any23.validator
-
An issue found during the validation process.
- ValidationReport.Issue(ValidationReport.IssueLevel, String, Node) - Constructor for class org.apache.any23.validator.ValidationReport.Issue
-
- ValidationReport.IssueLevel - Enum in org.apache.any23.validator
-
Defines the different issue levels.
- ValidationReport.RuleActivation - Class in org.apache.any23.validator
-
This class describes the activation of a rule.
- ValidationReport.RuleActivation(Rule) - Constructor for class org.apache.any23.validator.ValidationReport.RuleActivation
-
- ValidationReport.RuleError - Class in org.apache.any23.validator
-
An error occurred while executing a rule.
- ValidationReport.RuleError(Rule, Exception, String) - Constructor for class org.apache.any23.validator.ValidationReport.RuleError
-
- ValidationReportBuilder - Interface in org.apache.any23.validator
-
The report interface is used to generate diagnostics about validation.
- ValidationReportSerializer - Interface in org.apache.any23.validator
-
Defines a serializer for validation reports.
- Validator - Interface in org.apache.any23.validator
-
The validator class allows to perform validation - correction of
related to
HTML Document
instances.
- ValidatorException - Exception in org.apache.any23.validator
-
Models any exception raised by the
Validator
.
- ValidatorException(String) - Constructor for exception org.apache.any23.validator.ValidatorException
-
- ValidatorException(String, Throwable) - Constructor for exception org.apache.any23.validator.ValidatorException
-
- value() - Method in class org.apache.any23.extractor.html.HTMLDocument.TextField
-
- Value_CAL_ADDRESS - Variable in class org.apache.any23.vocab.ICAL
-
- Value_DATE - Variable in class org.apache.any23.vocab.ICAL
-
- Value_DURATION - Variable in class org.apache.any23.vocab.ICAL
-
- Value_PERIOD - Variable in class org.apache.any23.vocab.ICAL
-
- Value_RECUR - Variable in class org.apache.any23.vocab.ICAL
-
- valueFactory - Variable in class org.apache.any23.extractor.html.MicroformatExtractor
-
- valueOf(String) - Static method in enum org.apache.any23.extractor.ExtractionParameters.ValidationMode
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.any23.extractor.IssueReport.IssueLevel
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.any23.extractor.microdata.ItemPropValue.Type
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.any23.extractor.xpath.TemplateObject.Type
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.any23.extractor.xpath.TemplateSubject.Type
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.any23.validator.ValidationReport.IssueLevel
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.any23.vocab.RDFSchemaUtils.VocabularyFormat
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum org.apache.any23.extractor.ExtractionParameters.ValidationMode
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum org.apache.any23.extractor.IssueReport.IssueLevel
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum org.apache.any23.extractor.microdata.ItemPropValue.Type
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum org.apache.any23.extractor.xpath.TemplateObject.Type
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum org.apache.any23.extractor.xpath.TemplateSubject.Type
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum org.apache.any23.validator.ValidationReport.IssueLevel
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum org.apache.any23.vocab.RDFSchemaUtils.VocabularyFormat
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- Variable - Class in org.apache.any23.extractor.xpath
-
- Variable(String, String) - Constructor for class org.apache.any23.extractor.xpath.Variable
-
- Vcalendar - Variable in class org.apache.any23.vocab.ICAL
-
- VCARD - Class in org.apache.any23.vocab
-
Vocabulary definitions from vcard.owl
- VCARD() - Constructor for class org.apache.any23.vocab.VCARD
-
- VCard - Variable in class org.apache.any23.vocab.VCARD
-
Resources that are vCards
- VERSION - Static variable in class org.apache.any23.Any23
-
Any23 core library version.
- version - Variable in class org.apache.any23.vocab.ICAL
-
value type: TEXTThis URI specifies the identifier corresponding to the
highest version number or the minimum and maximum range of the iCalendar specification
that is required in order to interpret the iCalendar object..
- Vevent - Variable in class org.apache.any23.vocab.ICAL
-
Provide a grouping of component properties that describe an event..
- Vfreebusy - Variable in class org.apache.any23.vocab.ICAL
-
Provide a grouping of component properties that describe either a request
for free/busy time, describe a response to a request for free/busy time or
describe a published set of busy time..
- VIDEO - Static variable in class org.apache.any23.vocab.OGP
-
A URL to a video file that complements this object.
- video - Variable in class org.apache.any23.vocab.OGP
-
- VIDEO__HEIGHT - Static variable in class org.apache.any23.vocab.OGP
-
The number of pixels height.
- VIDEO__SECURE_URL - Static variable in class org.apache.any23.vocab.OGP
-
An alternate url to use if the webpage requires HTTPS.
- VIDEO__TYPE - Static variable in class org.apache.any23.vocab.OGP
-
A MIME type for this video.
- VIDEO__URL - Static variable in class org.apache.any23.vocab.OGP
-
Video URL.
- VIDEO__WIDTH - Static variable in class org.apache.any23.vocab.OGP
-
The number of pixels wide.
- videoHeight - Variable in class org.apache.any23.vocab.OGP
-
- videoSecureURL - Variable in class org.apache.any23.vocab.OGP
-
- videoType - Variable in class org.apache.any23.vocab.OGP
-
- videoURL - Variable in class org.apache.any23.vocab.OGP
-
- videoWidth - Variable in class org.apache.any23.vocab.OGP
-
- visit(Page) - Method in class org.apache.any23.plugin.crawler.DefaultWebCrawler
-
Override this method to implement the single page processing logic.
- visitedPage(Page) - Method in interface org.apache.any23.plugin.crawler.CrawlerListener
-
Notifies to the listener that a page has been discovered.
- Vjournal - Variable in class org.apache.any23.vocab.ICAL
-
Provide a grouping of component properties that describe a journal entry..
- VOCAB_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
- VocabPrinter - Class in org.apache.any23.cli
-
Prints out the vocabulary RDFSchema as NQuads.
- VocabPrinter() - Constructor for class org.apache.any23.cli.VocabPrinter
-
- VocabPrinter.VocabularyFormatConverter - Class in org.apache.any23.cli
-
- VocabPrinter.VocabularyFormatConverter() - Constructor for class org.apache.any23.cli.VocabPrinter.VocabularyFormatConverter
-
- Vocabulary - Class in org.apache.any23.vocab
-
Base class for the definition of a vocabulary.
- Vocabulary(String) - Constructor for class org.apache.any23.vocab.Vocabulary
-
Constructor.
- Vtimezone - Variable in class org.apache.any23.vocab.ICAL
-
Provide a grouping of component properties that defines a time zone..
- Vtodo - Variable in class org.apache.any23.vocab.ICAL
-
Provide a grouping of calendar properties that describe a to-do..