|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.apache.any23.extractor.rdf.BaseRDFExtractor
public abstract class BaseRDFExtractor
Base class for a generic RDF
Extractor.ContentExtractor
.
Nested Class Summary |
---|
Nested classes/interfaces inherited from interface org.apache.any23.extractor.Extractor |
---|
Extractor.BlindExtractor, Extractor.ContentExtractor, Extractor.TagSoupDOMExtractor |
Constructor Summary | |
---|---|
BaseRDFExtractor()
|
|
BaseRDFExtractor(boolean verifyDataType,
boolean stopAtFirstError)
Constructor, allows to specify the validation and error handling policies. |
Method Summary | |
---|---|
abstract ExtractorDescription |
getDescription()
Returns a ExtractorDescription of this extractor. |
protected abstract org.openrdf.rio.helpers.RDFParserBase |
getParser(ExtractionContext extractionContext,
ExtractionResult extractionResult)
|
boolean |
isStopAtFirstError()
|
boolean |
isVerifyDataType()
|
void |
run(ExtractionParameters extractionParameters,
ExtractionContext extractionContext,
InputStream in,
ExtractionResult extractionResult)
Executes the extractor. |
void |
setStopAtFirstError(boolean b)
If true , the extractor will stop at first parsing error,
iffalse the extractor will attempt to ignore all parsing errors. |
void |
setVerifyDataType(boolean verifyDataType)
|
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public BaseRDFExtractor(boolean verifyDataType, boolean stopAtFirstError)
verifyDataType
- if true
the data types will be verified,
if false
will be ignored.stopAtFirstError
- if true
the parser will stop at first parsing error,
if false
will ignore non blocking errors.public BaseRDFExtractor()
Method Detail |
---|
public abstract ExtractorDescription getDescription()
Extractor
ExtractorDescription
of this extractor.
getDescription
in interface Extractor<InputStream>
protected abstract org.openrdf.rio.helpers.RDFParserBase getParser(ExtractionContext extractionContext, ExtractionResult extractionResult)
public boolean isVerifyDataType()
public void setVerifyDataType(boolean verifyDataType)
public boolean isStopAtFirstError()
public void setStopAtFirstError(boolean b)
Extractor.ContentExtractor
true
, the extractor will stop at first parsing error,
iffalse
the extractor will attempt to ignore all parsing errors.
setStopAtFirstError
in interface Extractor.ContentExtractor
b
- tolerance flag.public void run(ExtractionParameters extractionParameters, ExtractionContext extractionContext, InputStream in, ExtractionResult extractionResult) throws IOException, ExtractionException
Extractor
run
in interface Extractor<InputStream>
extractionParameters
- the parameters to be applied during the extraction.extractionContext
- The document context.in
- The extractor input data.extractionResult
- the collector for the extracted data.
IOException
- On error while reading from the input stream.
ExtractionException
- On other error, such as parse errors.
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |