|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.apache.any23.extractor.html.MicroformatExtractor
public abstract class MicroformatExtractor
The abstract base class for any Microformat specification extractor.
Nested Class Summary |
---|
Nested classes/interfaces inherited from interface org.apache.any23.extractor.Extractor |
---|
Extractor.BlindExtractor, Extractor.ContentExtractor, Extractor.TagSoupDOMExtractor |
Field Summary | |
---|---|
static String |
BEGIN_SCRIPT
|
static String |
END_SCRIPT
|
protected Any23ValueFactoryWrapper |
valueFactory
|
Constructor Summary | |
---|---|
MicroformatExtractor()
|
Method Summary | |
---|---|
protected void |
addBNodeProperty(Node n,
org.openrdf.model.Resource subject,
org.openrdf.model.URI property,
org.openrdf.model.BNode bnode)
Helper method that adds a BNode property to a node. |
protected void |
addBNodeProperty(org.openrdf.model.Resource subject,
org.openrdf.model.URI property,
org.openrdf.model.BNode bnode)
Helper method that adds a BNode property to a node. |
protected void |
addURIProperty(org.openrdf.model.Resource subject,
org.openrdf.model.URI property,
org.openrdf.model.URI object)
Helper method that adds a URI property to a node. |
protected boolean |
conditionallyAddLiteralProperty(Node n,
org.openrdf.model.Resource subject,
org.openrdf.model.URI property,
org.openrdf.model.Literal literal)
Helper method that adds a literal property to a node. |
protected boolean |
conditionallyAddResourceProperty(org.openrdf.model.Resource subject,
org.openrdf.model.URI property,
org.openrdf.model.URI uri)
Helper method that adds a URI property to a node. |
protected boolean |
conditionallyAddStringProperty(Node n,
org.openrdf.model.Resource subject,
org.openrdf.model.URI p,
String value)
Helper method that adds a literal property to a subject only if the value of the property is a valid string. |
protected abstract boolean |
extract()
Performs the extraction of the data and writes them to the model. |
protected org.openrdf.model.URI |
fixLink(String link)
|
protected org.openrdf.model.URI |
fixLink(String link,
String defaultSchema)
|
protected ExtractionResult |
getCurrentExtractionResult()
Returns the ExtractionResult associated
to the extraction session. |
abstract ExtractorDescription |
getDescription()
Returns the description of this extractor. |
org.openrdf.model.URI |
getDocumentURI()
|
ExtractionContext |
getExtractionContext()
|
HTMLDocument |
getHTMLDocument()
|
static boolean |
includes(Class<? extends MicroformatExtractor> including,
Class<? extends MicroformatExtractor> included)
This method checks if there is a native nesting relationship between two MicroformatExtractor . |
protected ExtractionResult |
openSubResult(ExtractionContext context)
|
void |
run(ExtractionParameters extractionParameters,
ExtractionContext extractionContext,
Document in,
ExtractionResult out)
Executes the extractor. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final String BEGIN_SCRIPT
public static final String END_SCRIPT
protected final Any23ValueFactoryWrapper valueFactory
Constructor Detail |
---|
public MicroformatExtractor()
Method Detail |
---|
public abstract ExtractorDescription getDescription()
getDescription
in interface Extractor<Document>
protected abstract boolean extract() throws ExtractionException
ExtractionException
public HTMLDocument getHTMLDocument()
public ExtractionContext getExtractionContext()
public org.openrdf.model.URI getDocumentURI()
public final void run(ExtractionParameters extractionParameters, ExtractionContext extractionContext, Document in, ExtractionResult out) throws IOException, ExtractionException
Extractor
run
in interface Extractor<Document>
extractionParameters
- the parameters to be applied during the extraction.extractionContext
- The document context.in
- The extractor input data.out
- the collector for the extracted data.
IOException
- On error while reading from the input stream.
ExtractionException
- On other error, such as parse errors.protected ExtractionResult getCurrentExtractionResult()
ExtractionResult
associated
to the extraction session.
protected ExtractionResult openSubResult(ExtractionContext context)
protected boolean conditionallyAddStringProperty(Node n, org.openrdf.model.Resource subject, org.openrdf.model.URI p, String value)
n
- the HTML node from which the property value has been extracted.subject
- the property subject.p
- the property URI.value
- the property value.
true
if the value has been accepted and added, false
otherwise.protected boolean conditionallyAddLiteralProperty(Node n, org.openrdf.model.Resource subject, org.openrdf.model.URI property, org.openrdf.model.Literal literal)
n
- the HTML node from which the property value has been extracted.subject
- subject the property subject.property
- the property URI.literal
- value the property value.
true
if the literal has been accepted and added, false
otherwise.protected boolean conditionallyAddResourceProperty(org.openrdf.model.Resource subject, org.openrdf.model.URI property, org.openrdf.model.URI uri)
subject
- the property subject.property
- the property URI.uri
- the property object.
true
if the the resource has been added, false
otherwise.protected void addBNodeProperty(Node n, org.openrdf.model.Resource subject, org.openrdf.model.URI property, org.openrdf.model.BNode bnode)
n
- the HTML node used for extracting such property.subject
- the property subject.property
- the property URI.bnode
- the property value.protected void addBNodeProperty(org.openrdf.model.Resource subject, org.openrdf.model.URI property, org.openrdf.model.BNode bnode)
subject
- the property subject.property
- the property URI.bnode
- the property value.protected void addURIProperty(org.openrdf.model.Resource subject, org.openrdf.model.URI property, org.openrdf.model.URI object)
subject
- property
- object
- protected org.openrdf.model.URI fixLink(String link)
protected org.openrdf.model.URI fixLink(String link, String defaultSchema)
public static boolean includes(Class<? extends MicroformatExtractor> including, Class<? extends MicroformatExtractor> included)
MicroformatExtractor
.
including
- the including MicroformatExtractor
included
- the included MicroformatExtractor
true
if there is a declared nesting relationshiporg.apache.any23.extractor.html.annotations.Includes}
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |