org.apache.any23.extractor
Interface Extractor.TagSoupDOMExtractor

All Superinterfaces:
Extractor<Document>
All Known Implementing Classes:
AdrExtractor, EntityBasedMicroformatExtractor, GeoExtractor, HCalendarExtractor, HCardExtractor, HeadLinkExtractor, HListingExtractor, HRecipeExtractor, HResumeExtractor, HReviewExtractor, HTMLMetaExtractor, ICBMExtractor, LicenseExtractor, MicrodataExtractor, MicroformatExtractor, RDFa11Extractor, RDFaExtractor, SpeciesExtractor, TitleExtractor, TurtleHTMLExtractor, XFNExtractor, XPathExtractor
Enclosing interface:
Extractor<Input>

public static interface Extractor.TagSoupDOMExtractor
extends Extractor<Document>

This interface specializes an Extractor able to handle Document as input format.


Nested Class Summary
 
Nested classes/interfaces inherited from interface org.apache.any23.extractor.Extractor
Extractor.BlindExtractor, Extractor.ContentExtractor, Extractor.TagSoupDOMExtractor
 
Method Summary
 
Methods inherited from interface org.apache.any23.extractor.Extractor
getDescription, run
 



Copyright © 2010-2012 The Apache Software Foundation. All Rights Reserved.