Class ToTextContentHandler

java.lang.Object
org.xml.sax.helpers.DefaultHandler
org.apache.tika.sax.ToTextContentHandler
All Implemented Interfaces:
ContentHandler, DTDHandler, EntityResolver, ErrorHandler
Direct Known Subclasses:
ToXMLContentHandler

public class ToTextContentHandler extends DefaultHandler
SAX event handler that writes all character content out to a character stream. No escaping or other transformation is applied to the character content.

As of Tika 1.20, this handler ignores content within <script> and <style> elements.

Since:
Apache Tika 0.10
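
The behavior described above can be illustrated with a minimal sketch that uses only the JDK's SAX classes. This is not Tika's implementation: the names PlainTextSketch, TextOnlyHandler and extractText are hypothetical and exist only for this example, and matching the skipped elements by qualified name is an assumption. The sketch shows the general technique of forwarding characters() events unchanged to a Writer while skipping anything nested inside <script> or <style>.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.StringWriter;
import java.io.Writer;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical illustration only; this is NOT the Tika class.
class PlainTextSketch {

    /** Writes character events to a Writer, unescaped, skipping script/style. */
    static class TextOnlyHandler extends DefaultHandler {
        private final Writer writer;
        private int skipDepth = 0; // > 0 while inside <script> or <style>

        TextOnlyHandler(Writer writer) {
            this.writer = writer;
        }

        // Assumption: elements are matched by qualified name, case-insensitively.
        private static boolean isSkipped(String qName) {
            return "script".equalsIgnoreCase(qName) || "style".equalsIgnoreCase(qName);
        }

        @Override
        public void startElement(String uri, String localName, String qName, Attributes atts) {
            if (isSkipped(qName)) {
                skipDepth++;
            }
        }

        @Override
        public void endElement(String uri, String localName, String qName) {
            if (isSkipped(qName) && skipDepth > 0) {
                skipDepth--;
            }
        }

        @Override
        public void characters(char[] ch, int start, int length) throws SAXException {
            if (skipDepth > 0) {
                return; // ignore content of skipped elements
            }
            try {
                writer.write(ch, start, length); // no escaping applied
            } catch (IOException e) {
                throw new SAXException("Error writing character content", e);
            }
        }
    }

    /** Parses the given XML string and returns only its character content. */
    static String extractText(String xml) throws Exception {
        StringWriter out = new StringWriter();
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xml)), new TextOnlyHandler(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        String xml = "<html><body><p>a &lt; b</p><script>var x;</script></body></html>";
        System.out.println(extractText(xml)); // prints "a < b"
    }
}
```

Note that the parser resolves the &lt; entity before the handler sees it, and the handler writes the resulting character as-is. The output is therefore plain text, not well-formed markup.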