Class XWPFEventBasedWordExtractor

java.lang.Object
org.apache.tika.parser.microsoft.ooxml.xwpf.XWPFEventBasedWordExtractor
All Implemented Interfaces:
Closeable, AutoCloseable, org.apache.poi.extractor.POITextExtractor, org.apache.poi.ooxml.extractor.POIXMLTextExtractor

public class XWPFEventBasedWordExtractor extends Object implements org.apache.poi.ooxml.extractor.POIXMLTextExtractor
Experimental class that is based on POI's XSSFEventBasedExcelExtractor
  • Constructor Details

    • XWPFEventBasedWordExtractor

      public XWPFEventBasedWordExtractor(org.apache.poi.openxml4j.opc.OPCPackage container) throws org.apache.xmlbeans.XmlException, org.apache.poi.openxml4j.exceptions.OpenXML4JException, IOException
      Throws:
      org.apache.xmlbeans.XmlException
      org.apache.poi.openxml4j.exceptions.OpenXML4JException
      IOException
  • Method Details

    • getPackage

      public org.apache.poi.openxml4j.opc.OPCPackage getPackage()
      Specified by:
      getPackage in interface org.apache.poi.ooxml.extractor.POIXMLTextExtractor
    • getCoreProperties

      public org.apache.poi.ooxml.POIXMLProperties.CoreProperties getCoreProperties()
      Specified by:
      getCoreProperties in interface org.apache.poi.ooxml.extractor.POIXMLTextExtractor
    • getExtendedProperties

      public org.apache.poi.ooxml.POIXMLProperties.ExtendedProperties getExtendedProperties()
      Specified by:
      getExtendedProperties in interface org.apache.poi.ooxml.extractor.POIXMLTextExtractor
    • getCustomProperties

      public org.apache.poi.ooxml.POIXMLProperties.CustomProperties getCustomProperties()
      Specified by:
      getCustomProperties in interface org.apache.poi.ooxml.extractor.POIXMLTextExtractor
    • getDocument

      public org.apache.poi.ooxml.POIXMLDocument getDocument()
      Specified by:
      getDocument in interface org.apache.poi.extractor.POITextExtractor
      Specified by:
      getDocument in interface org.apache.poi.ooxml.extractor.POIXMLTextExtractor
    • getText

      public String getText()
      Specified by:
      getText in interface org.apache.poi.extractor.POITextExtractor
    • setCloseFilesystem

      public void setCloseFilesystem(boolean b)
      Specified by:
      setCloseFilesystem in interface org.apache.poi.extractor.POITextExtractor
    • isCloseFilesystem

      public boolean isCloseFilesystem()
      Specified by:
      isCloseFilesystem in interface org.apache.poi.extractor.POITextExtractor
    • getFilesystem

      public Closeable getFilesystem()
      Specified by:
      getFilesystem in interface org.apache.poi.extractor.POITextExtractor