org.apache.jackrabbit.extractor
Class AbstractTextExtractor

java.lang.Object
  extended by org.apache.jackrabbit.extractor.AbstractTextExtractor
All Implemented Interfaces:
TextExtractor
Direct Known Subclasses:
HTMLTextExtractor, MsExcelTextExtractor, MsOutlookTextExtractor, MsPowerPointTextExtractor, MsWordTextExtractor, OpenOfficeTextExtractor, PdfTextExtractor, PlainTextExtractor, PngTextExtractor, RTFTextExtractor, XMLTextExtractor

public abstract class AbstractTextExtractor
extends Object
implements TextExtractor

Base class for text extractor implementations.


Constructor Summary
AbstractTextExtractor(String[] contentTypes)
           
 
Method Summary
 String[] getContentTypes()
          Returns the MIME types supported by this extractor.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface org.apache.jackrabbit.extractor.TextExtractor
extractText
 

Constructor Detail

AbstractTextExtractor

public AbstractTextExtractor(String[] contentTypes)
Parameters:
contentTypes - the content types supported by this text extractor.
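
A minimal sketch of how a subclass might pass its supported types to this constructor. So that it compiles without the Jackrabbit jar, the sketch declares reduced stand-ins for TextExtractor and AbstractTextExtractor. The extractText signature and the way the stand-in stores the array are assumptions, since this page does not show them. CsvTextExtractor and the "text/csv" type are hypothetical.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;

// Stand-in for org.apache.jackrabbit.extractor.TextExtractor.
// Assumption: the extractText signature is not shown on this page.
interface TextExtractor {
    String[] getContentTypes();
    Reader extractText(InputStream stream, String type, String encoding)
            throws IOException;
}

// Stand-in for AbstractTextExtractor. Assumption: it simply keeps the
// array handed to the constructor and returns it from getContentTypes().
abstract class AbstractTextExtractor implements TextExtractor {
    private final String[] contentTypes;

    public AbstractTextExtractor(String[] contentTypes) {
        this.contentTypes = contentTypes;
    }

    public String[] getContentTypes() {
        return contentTypes;
    }
}

// Hypothetical subclass: declares one lower-case MIME type and
// returns the stream content decoded as characters.
class CsvTextExtractor extends AbstractTextExtractor {
    CsvTextExtractor() {
        // Lower case and non-empty, as getContentTypes() requires.
        super(new String[] { "text/csv" });
    }

    public Reader extractText(InputStream stream, String type, String encoding)
            throws IOException {
        // Fall back to UTF-8 when the caller does not know the encoding.
        String charset = (encoding != null) ? encoding : "UTF-8";
        return new InputStreamReader(stream, charset);
    }
}
```

With this arrangement the subclass only implements extractText, and the base class answers getContentTypes.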
Method Detail

getContentTypes

public String[] getContentTypes()
Description copied from interface: TextExtractor
Returns the MIME types supported by this extractor. The returned strings must be in lower case, and the returned array must not be empty.

The returned array must not be modified.

Specified by:
getContentTypes in interface TextExtractor
Returns:
supported MIME types, lower case
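
The contract above (lower-case strings, non-empty array, array not to be modified) can be illustrated from the caller's side. This is a sketch only: the ContentTypes helper and its methods are hypothetical, and the TextExtractor interface here is a reduced stand-in that omits extractText.

```java
import java.util.Arrays;
import java.util.Locale;

// Reduced stand-in for org.apache.jackrabbit.extractor.TextExtractor;
// the real interface also declares extractText.
interface TextExtractor {
    String[] getContentTypes();
}

// Hypothetical helper, not part of Jackrabbit.
final class ContentTypes {
    private ContentTypes() {
    }

    // Returns true if the extractor lists the given MIME type.
    // The query is lower-cased because the returned types are lower case.
    static boolean supports(TextExtractor extractor, String mimeType) {
        String wanted = mimeType.toLowerCase(Locale.ROOT);
        for (String type : extractor.getContentTypes()) {
            if (type.equals(wanted)) {
                return true;
            }
        }
        return false;
    }

    // Returns a sorted copy, leaving the extractor's own array untouched.
    static String[] sortedCopy(TextExtractor extractor) {
        String[] copy = extractor.getContentTypes().clone();
        Arrays.sort(copy);
        return copy;
    }
}
```

Cloning before sorting keeps the caller within the rule that the returned array must not be modified.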


Copyright © 2004-2008 The Apache Software Foundation. All Rights Reserved.