Class SXSLFPowerPointExtractorDecorator
java.lang.Object
org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
org.apache.tika.parser.microsoft.ooxml.SXSLFPowerPointExtractorDecorator
- All Implemented Interfaces:
OOXMLExtractor
SAX/Streaming pptx extractor
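As a rough illustration of what SAX/streaming extraction means for slide XML, the sketch below collects the text of DrawingML `a:t` elements as parser events arrive, without building a DOM. It uses only the JDK and is not this class's implementation. The class name `SlideTextSketch`, the method `extractText`, and the simplified sample markup are assumptions made for the example.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of Tika or POI.
final class SlideTextSketch {

    // Namespace of DrawingML, where text runs (<a:t>) are defined.
    static final String DRAWINGML_NS =
            "http://schemas.openxmlformats.org/drawingml/2006/main";

    // Returns the content of every <a:t> element, in document order.
    static List<String> extractText(InputStream slideXml) {
        List<String> runs = new ArrayList<>();
        DefaultHandler handler = new DefaultHandler() {
            private StringBuilder current; // non-null only while inside <a:t>

            @Override
            public void startElement(String uri, String localName,
                                     String qName, Attributes atts) {
                if (DRAWINGML_NS.equals(uri) && "t".equals(localName)) {
                    current = new StringBuilder();
                }
            }

            @Override
            public void characters(char[] ch, int start, int length) {
                // May be called several times for one text node.
                if (current != null) {
                    current.append(ch, start, length);
                }
            }

            @Override
            public void endElement(String uri, String localName, String qName) {
                if (current != null && DRAWINGML_NS.equals(uri)
                        && "t".equals(localName)) {
                    runs.add(current.toString());
                    current = null;
                }
            }
        };
        try {
            SAXParserFactory factory = SAXParserFactory.newInstance();
            factory.setNamespaceAware(true); // needed to match on namespace URI
            factory.newSAXParser().parse(slideXml, handler);
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException("Could not parse slide XML", e);
        }
        return runs;
    }

    public static void main(String[] args) {
        // Simplified, well-formed sample; a real slide nests these elements deeper.
        String xml = "<p:sld"
                + " xmlns:p=\"http://schemas.openxmlformats.org/presentationml/2006/main\""
                + " xmlns:a=\"" + DRAWINGML_NS + "\">"
                + "<a:p><a:r><a:t>Hello</a:t></a:r><a:r><a:t>slides</a:t></a:r></a:p>"
                + "</p:sld>";
        InputStream in = new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8));
        System.out.println(extractText(in)); // prints [Hello, slides]
    }
}
```

Because the handler keeps only the current text run in memory, the cost stays roughly constant per slide regardless of slide size, which is the usual motivation for a streaming extractor.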
-
Field Summary
Fields inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
config, EMBEDDED_RELATIONSHIPS, extractor
-
Constructor Summary
Constructors
Constructor / Description
SXSLFPowerPointExtractorDecorator(Metadata metadata, ParseContext context, XSLFEventBasedPowerPointExtractor extractor)
-
Method Summary
Modifier and Type / Method / Description
protected void
buildXHTML(XHTMLContentHandler xhtml)
Populates the XHTMLContentHandler object received as parameter.
protected List<org.apache.poi.openxml4j.opc.PackagePart>
getMainDocumentParts()
In PowerPoint files, slides have content embedded in them, and slide drawings contain the images.
Methods inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
getDocument, getEmbeddedPartMetadataMap, getJustFileName, getMetadataExtractor, getXHTML, handleEmbeddedFile, loadLinkedRelationships
-
Constructor Details
-
SXSLFPowerPointExtractorDecorator
public SXSLFPowerPointExtractorDecorator(Metadata metadata, ParseContext context, XSLFEventBasedPowerPointExtractor extractor)
-
-
Method Details
-
buildXHTML
Description copied from class: AbstractOOXMLExtractor
Populates the XHTMLContentHandler object received as parameter.
- Specified by:
buildXHTML in class AbstractOOXMLExtractor
- Throws:
SAXException
IOException
- See Also:
-
org.apache.poi.xslf.extractor.XSLFPowerPointExtractor#getText()
-
getMainDocumentParts
In PowerPoint files, slides have content embedded in them, and slide drawings contain the images.
- Specified by:
getMainDocumentParts in class AbstractOOXMLExtractor
-
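To make the idea of "main document parts" more concrete, the following JDK-only sketch lists the entries of a pptx package that look like slides or slide drawings. It is an approximation, not the logic of getMainDocumentParts: it selects by the conventional `ppt/slides/` and `ppt/drawings/` path prefixes, whereas an OPC-aware implementation would follow package relationships. `SlidePartsSketch`, `mainPartNames`, and `fakePackage` are hypothetical names, and the package built in `main` contains empty placeholder entries only.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper, not part of Tika or POI.
final class SlidePartsSketch {

    // Returns names of entries that look like slides or slide drawings.
    // Assumption: parts follow the conventional ppt/slides and ppt/drawings layout.
    static List<String> mainPartNames(InputStream pptx) {
        List<String> names = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(pptx)) {
            for (ZipEntry e = zip.getNextEntry(); e != null; e = zip.getNextEntry()) {
                String name = e.getName();
                // Relationship files end in ".rels", so they are skipped here.
                boolean slide = name.startsWith("ppt/slides/") && name.endsWith(".xml");
                boolean drawing = name.startsWith("ppt/drawings/");
                if (slide || drawing) {
                    names.add(name);
                }
            }
        } catch (IOException ex) {
            throw new UncheckedIOException(ex);
        }
        return names;
    }

    // Builds an in-memory ZIP with empty entries, for demonstration only.
    static byte[] fakePackage(String... entryNames) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            for (String name : entryNames) {
                zip.putNextEntry(new ZipEntry(name));
                zip.closeEntry();
            }
        } catch (IOException ex) {
            throw new UncheckedIOException(ex);
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) {
        byte[] pkg = fakePackage(
                "ppt/presentation.xml",
                "ppt/slides/slide1.xml",
                "ppt/slides/_rels/slide1.xml.rels",
                "ppt/drawings/vmlDrawing1.vml",
                "ppt/media/image1.png");
        System.out.println(mainPartNames(new ByteArrayInputStream(pkg)));
    }
}
```

Matching on path prefixes keeps the sketch short, but part names in a real package are not guaranteed to follow this layout, so relationship traversal is the more robust choice.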