QuadTemplate
to the rule.
Configuration
.
ValueFactory
.
Extractor.ContentExtractor
.
TripleHandler
decorator useful to
perform benchmarking.
BNode
.
null
CSVParser
guessing
from the provided CSV file.
DocumentSource
.
TripleHandler
multi decorator that wraps zero or more
other triple handlers and dispatches all events to each of them.
initialExtractorGroup
and also the extractors detected in classpath specified by pluginLocations
.
initialExtractorGroup
and also the extractors detected in classpath specified by the default configuration.
TripleHandler
that merely counts the number
of triples it has received.
NL
in the given in string.
content
appears within container
without string overlapping.
src
to the dest
.
ExtractorFactory
serving concrete implementation
instances of Extractor
.
Vocabulary.classes
map.
DocumentSource
for the given documentURI
.
Vocabulary.properties
map.
DefaultConfiguration
.
Configuration
.
InputStream
on an HTTP URI.
ModifiableConfiguration
.
RuleContext
.
ValidationReport
.
ValidationReportBuilder
.
Validator
.
TagSoupParser
when a document
is retrieved and validated.
Validator
.
ValidationReportBuilder
with no data.
in
string as JSON string
to make it embeddable within a string field.
in
document source, sending the generated events
to the specified outputHandler
.
in
string
associated to the documentURI
URI, declaring
contentType
and encoding
.
in
string
associated to the documentURI
URI, sending the generated
events to the specified outputHandler
.
file
sending the generated events to the specified outputHandler
.
documentURI
sending the generated events to the specified outputHandler
.
documentURI
sending the generated events to the specified outputHandler
.
in
document source, sending the generated events
to the specified outputHandler
.
in
document source, sending the generated events
to the specified outputHandler
.
in
document source, sending the generated events
to the specified outputHandler
.
Extractor
.
TripleHandler
that can block and unblock
calls to the handler, either for the entire document, or for
individual ExtractionContext
s.
SingleDocumentExtraction.METADATA_NESTING_FLAG
.
ExtractionResult
; it receives
extraction output from one Extractor
working on one document,
and passes the output on to a TripleHandler
.
Extractor
able to handle
URI
as input format.
Extractor
able to handle
InputStream
as input format.
Extractor
able to handle
Document
as input format.
Extractor
description.
Extractor
.
ExtractorFactory
providing
simple accessing methods.
Any23
extractor plugin that can be detected and registered from the library classpath.
Extractor
.
prefixID:prefixURL
strings
normalizing blanks where present.
rel
tag nodes.
DocumentSource
.
ExtractorGroup
with a set of Extractor
able to
process the provided mime type.
ExtractorRegistry
and the ExtractorPlugin
s.
BNode
.
packageName
satisfying the given class
filter
.
iface
.
ExtractionResult
associated
to the extraction session.
ExtractorDescription
of this extractor.
ExtractorDescription
of this extractor.
DocumentSource
.
ExtractorGroup
containing the
ExtractorFactory
matching the names provided as input.
ExtractorFactory
given its name
Configuration
.
HTTPClient
implementation.
FormatWriter
.
ItemScope
instance described within the specified node
.
document
.
document
,
works in full report mode.
FormatWriter
.
NQuadsParser
.
NTriplesParser
.
ExtractorPlugin
classes declared within the classpath.
Configuration
.
defaultValue
will be returned.
Integer
value of the specified propertyName
or raises an exception
if propertyName
is not defined.
propertyName
or raises an exception
if propertyName
is not defined.
RDFParser
instance.
RDFXMLParser
.
null
otherwise.
Tool
classes declared within the classpath.
TriXParser
.
TurtleParser
.
true
.
Term
given the varMapping
, the #isVar and #internalValue
parameters.
FormatWriter
identified by id
.
FormatWriter
ready to write on the given os
OutputStream
.
mimeType
.
XSLTStylesheet
able to distill RDFa from
HTML pages.
MIME
type of the content of input file.
MIME
type of the content of input file.
node
.
FormatWriter
with given id
identifier has been registered.
DocumentSource
content activates at least one extractor or not.
Extractor.TagSoupDOMExtractor
implementation
retrieves the LINK
s declared within the HTML/HEAD
page header.
href
property.
InputStream
on an
HTTP URI.
HTTPClient
configuration.
DocumentSource
.ical.rdf
TripleHandler
that suppresses output of the RDFa
parser if the document only contains "accidental" RDFa,
like stylesheet links and other non-RDFa uses of HTML's
TripleHandler
that suppresses output of the
TitleExtractor
unless some other triples could
be parsed from the document.
og:image
.
MicroformatExtractor
to
represent which of the other Microformats it could nest.
MicroformatExtractor
.
href
is absolute or not.
true
then the produced RDF is annotated with
the extractors used to generate the specific statements.
true
then the produced RDF is annotated with
the extractors used to generate the specific statements.
candidatePrefix
is prefix of string container
.
candidateSuffix
is suffix of string container
.
data
list
using the specified delimiter
.
Literal
.
Literal
.
Literal
.
Literal
.
Literal
.
Literal
.
Literal
.
Literal
.
Literal
.
Literal
.
DocumentSource
s.
DocumentSource
by
reading them into an in-memory buffer.
TagSoupDOMExtractor
.
MicrodataParser
.
MIME
type of
some given raw data.
Configuration
.
c
n
times.
- muse -
Variable in class org.apache.any23.vocab.XFN
-
- mv(File, File) -
Static method in class org.apache.any23.util.FileUtils
- Moves a
target
file to a new dest
location.
- myersBriggs -
Variable in class org.apache.any23.vocab.FOAF
-
MIMETypeDetector
based
on file extensions.
DefaultConfiguration
.
Extractor.ContentExtractor
handling N-Quads format.
RDFParser
interface.
RDFWriter
.
Extractor.ContentExtractor
handling NTriples format.
InputStream
from a given URI.
MIMEType
.
is
input stream with the
specified parser p
using baseURI
.
is
input stream with the
specified parser p
using ''
as base URI.
in
string with the
specified parser p
using ''
as base URI.
resource
file
guessing the content format from the extension.
ExtractionResult
, using
the passed variableAssignment to expand variables.
TikaMIMETypeDetector
could
call in order to clean the input before performing the MIME type
detection.
InputStream
.
InputStream
.
Statement
.
Extractor
implementation for
RDFa 1.1 specification.
RDFa11Parser
.
ExtractionResult
.
TripleHandler
that writes
triples to a Sesame RDFWriter
,
e.g. for serialization using one of Sesame's writers.
Extractor.ContentExtractor
able to perform the extraction on RDF/XML
documents.
Reader
, returning the same value.
Reader
into a byte array
attribute
, returning the
defaultValue
string if not present.
attribute
, returning the
empty string if not present.
attributePrefix
.
Reader
as an InputStream
.
ReaderInputStream
for the specified Reader
.
ReaderInputStream
for the specified Reader
,
with the specified encoding.
ExtractorFactory
.
FormatWriter
to the registry.
Fix
es.
Fix
.
TripleHandler
that collects
various information about the extraction process, such as
the extractors used and the total number of triples.
Rule
.
Rule
to a Fix
.
Extractor
registered to this class using the specified extraction parameters.
Extractor
registered to this class using the default extraction parameters.
ValidationReportSerializer
.
Node
produces the XML serialization
omitting the XML declaration.
namespace
,
resources
and properties
.
vocabulary
to triples over the given writer
.
vocabulary
to NQuads over the given output stream.
vocabulary
to NQuads and return them as string.
LocalCopyFactory
instance.
HTTPClient
implementation
used to retrieve contents.
null
the MemCopyFactory
will be used.
MIMETypeDetector
.
null
mimetype detection will
be skipped and all extractors will be activated.
propertyValue
for
the property whose name is propertyName
.
true
, the extractor will stop at first parsing error,
if false
the extractor will attempt to ignore all parsing errors.
true
, the extractor will stop at first parsing error,
if false
the extractor will attempt to ignore all parsing errors.
ExtractorFactory
.
DefaultConfiguration
.
SingleDocumentExtraction
run.
InputStream
meant to
detect and replace any occurrence of inline span:src
property.
DocumentSource
.
ExtractionResult
able to collect property roots generated by HTML Microformat extractions.
InputStream
into an XPathExtractionRule
able
to expand a QuadTemplate
set based
on a given template.
XPathExtractionRule
.
EncodingDetector
based on
Apache Tika.
MIMETypeDetector
based on
Apache Tika.
Rover
.
Value
.
toRDF(s), toRDF(p), toRDF(o)
date
to the XSD datetime format.
Statement
.
TripleHandler
class.
ContentExtractor
to perform extraction on TriX documents.
Extractor.ContentExtractor
able to perform the
extraction on Turtle documents.
video.movie
.
URI
.
URI
.
Validator
class.
Document
instances.
Validator
.
TemplateXPathExtractionRule
.
Purifier
that removes any blank
characters at the head of a file that might prevent its MIME type detection.
FormatWriter
metadata.
FormatWriter
s.
ValidationReportSerializer
for XML.
XPathExtractor
.
Extractor.TagSoupDOMExtractor
able to
apply XPathExtractionRule
s and generate quads.
Writer
.
XSLTStylesheet
class.