Package org.apache.any23.extractor
Interface Extractor.TagSoupDOMExtractor
All Known Implementing Classes:
AdrExtractor, EmbeddedJSONLDExtractor, EntityBasedMicroformatExtractor, GeoExtractor, HAdrExtractor, HCalendarExtractor, HCardExtractor, HCardExtractor, HeadLinkExtractor, HEntryExtractor, HEventExtractor, HGeoExtractor, HItemExtractor, HListingExtractor, HProductExtractor, HRecipeExtractor, HRecipeExtractor, HResumeExtractor, HResumeExtractor, HReviewAggregateExtractor, HReviewExtractor, HTMLMetaExtractor, ICBMExtractor, LicenseExtractor, MicrodataExtractor, MicroformatExtractor, SpeciesExtractor, TitleExtractor, TurtleHTMLExtractor, XFNExtractor, XPathExtractor
public static interface Extractor.TagSoupDOMExtractor extends Extractor<Document>
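
The declaration above fixes the input type of Extractor to a DOM Document. The following is a minimal sketch of that shape, not the Any23 API itself: the Extractor interface below is a simplified stand-in with reduced method signatures (the real run and getDescription differ), TitleSketchExtractor is a hypothetical implementation, and the DOM is built with the JDK XML parser rather than an HTML tag-soup parser.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;

public class TagSoupDomSketch {

    // Simplified stand-in for org.apache.any23.extractor.Extractor<Input>.
    // Assumption: the method signatures are reduced for illustration and
    // do not match the real interface.
    interface Extractor<Input> {
        String getDescription();

        String run(Input in) throws Exception;

        // Nested sub-interface that pins the input type to a DOM Document,
        // mirroring "extends Extractor<Document>" in the declaration.
        interface TagSoupDOMExtractor extends Extractor<Document> { }
    }

    // Hypothetical implementation: returns the text of the first <title>
    // element, or an empty string when there is none.
    static class TitleSketchExtractor implements Extractor.TagSoupDOMExtractor {
        @Override
        public String getDescription() {
            return "title-sketch";
        }

        @Override
        public String run(Document in) {
            NodeList titles = in.getElementsByTagName("title");
            return titles.getLength() == 0
                    ? ""
                    : titles.item(0).getTextContent().trim();
        }
    }

    // Builds a DOM from well-formed markup using the JDK XML parser.
    // Assumption: real input would be parsed leniently from HTML instead.
    static Document parse(String xml) throws Exception {
        return DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
    }

    public static void main(String[] args) throws Exception {
        Extractor.TagSoupDOMExtractor extractor = new TitleSketchExtractor();
        Document dom = parse("<html><head><title>Hello</title></head><body/></html>");
        // Prints "title-sketch: Hello"
        System.out.println(extractor.getDescription() + ": " + extractor.run(dom));
    }
}
```

Because the sub-interface only narrows the type parameter, an implementing class receives an already parsed Document and never handles raw bytes itself.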
Nested Class Summary
Nested classes/interfaces inherited from interface org.apache.any23.extractor.Extractor
Extractor.BlindExtractor, Extractor.ContentExtractor, Extractor.TagSoupDOMExtractor
Method Summary
Methods inherited from interface org.apache.any23.extractor.Extractor
getDescription, run