org.apache.jackrabbit.extractor
Class AbstractTextExtractor
java.lang.Object
org.apache.jackrabbit.extractor.AbstractTextExtractor
- All Implemented Interfaces:
- TextExtractor
- Direct Known Subclasses:
- HTMLTextExtractor, MsExcelTextExtractor, MsOutlookTextExtractor, MsPowerPointTextExtractor, MsWordTextExtractor, OpenOfficeTextExtractor, PdfTextExtractor, PlainTextExtractor, PngTextExtractor, RTFTextExtractor, XMLTextExtractor
public abstract class AbstractTextExtractor
- extends Object
- implements TextExtractor
Base class for text extractor implementations.
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
AbstractTextExtractor
public AbstractTextExtractor(String[] contentTypes)
- Parameters:
contentTypes
- the supported content types by this text extractor.
getContentTypes
public String[] getContentTypes()
- Description copied from interface:
TextExtractor
- Returns the MIME types supported by this extractor. The returned
strings must be in lower case, and the returned array must not be empty.
The returned array must not be modified.
- Specified by:
getContentTypes
in interface TextExtractor
- Returns:
- supported MIME types, lower case
Copyright © 2004-2009 The Apache Software Foundation. All Rights Reserved.