org.apache.nutch.parse
Interface Parse

All Known Implementing Classes:
ParseImpl

public interface Parse

The result of parsing a page's raw content.

See Also:
Parser.getParse(Content)

Method Summary
 ParseData getData()
          Other data extracted from the page.
 String getText()
          The textual content of the page.
 boolean isCanonical()
          Indicates whether the parse comes from a URL or a sub-URL.
 

Method Detail

getText

String getText()
The textual content of the page. This is indexed, searched, and used when generating snippets.
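
As an illustration of the snippet use case, the following is a minimal sketch and not Nutch's actual summarizer. TextParse is a hypothetical stand-in for this interface, reduced to getText(), and SnippetSketch and its snippet method are names invented for the example.

```java
import java.util.Locale;

// Hypothetical stand-in for org.apache.nutch.parse.Parse, reduced to getText().
interface TextParse {
    String getText();
}

class SnippetSketch {
    // Returns up to `radius` characters on each side of the first
    // case-insensitive match of `term`, trimmed, or "" when there is no match.
    // Assumes lower-casing does not change the length of the text.
    static String snippet(TextParse parse, String term, int radius) {
        String text = parse.getText();
        int hit = text.toLowerCase(Locale.ROOT).indexOf(term.toLowerCase(Locale.ROOT));
        if (hit < 0) {
            return "";
        }
        int start = Math.max(0, hit - radius);
        int end = Math.min(text.length(), hit + term.length() + radius);
        return text.substring(start, end).trim();
    }

    public static void main(String[] args) {
        TextParse parse = () -> "Apache Nutch is an extensible web crawler.";
        System.out.println(snippet(parse, "nutch", 7)); // prints "Apache Nutch is an"
    }
}
```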


getData

ParseData getData()
Other data extracted from the page.
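
A caller might read this structured data as sketched below. The sketch assumes the data includes a page title and a list of outlinks; PageData is a hypothetical, simplified stand-in for ParseData (outlinks are plain strings here), and DataParse and ParseDataSketch are names invented for the example.

```java
import java.util.List;

// Hypothetical, simplified stand-in for ParseData: title and outlink URLs only.
final class PageData {
    private final String title;
    private final List<String> outlinks;

    PageData(String title, List<String> outlinks) {
        this.title = title;
        this.outlinks = List.copyOf(outlinks);
    }

    String getTitle() { return title; }

    List<String> getOutlinks() { return outlinks; }
}

// Hypothetical stand-in for the Parse interface, reduced to getData().
interface DataParse {
    PageData getData();
}

class ParseDataSketch {
    // Builds a one-line summary from the structured data of a parse.
    static String describe(DataParse parse) {
        PageData data = parse.getData();
        return data.getTitle() + " (" + data.getOutlinks().size() + " outlinks)";
    }

    public static void main(String[] args) {
        PageData data = new PageData("Nutch Home",
                List.of("http://example.org/a", "http://example.org/b"));
        System.out.println(describe(() -> data)); // prints "Nutch Home (2 outlinks)"
    }
}
```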


isCanonical

boolean isCanonical()
Indicates whether the parse comes from a URL or a sub-URL.
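
One possible use of this flag is to separate the parse of the fetched URL from parses of sub-URLs found in the same content. The sketch below assumes that true means the parse belongs to the original URL. CanonicalParse is a hypothetical stand-in for this interface, and the map of URL to parse is an illustrative container, not a Nutch type.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for the Parse interface, reduced to isCanonical().
interface CanonicalParse {
    boolean isCanonical();
}

class CanonicalSketch {
    // Collects the URLs whose parse is flagged canonical, in insertion order.
    static List<String> canonicalUrls(Map<String, CanonicalParse> parsesByUrl) {
        List<String> urls = new ArrayList<>();
        for (Map.Entry<String, CanonicalParse> entry : parsesByUrl.entrySet()) {
            if (entry.getValue().isCanonical()) {
                urls.add(entry.getKey());
            }
        }
        return urls;
    }

    public static void main(String[] args) {
        // Assumed scenario: one fetched feed URL plus two item sub-URLs.
        Map<String, CanonicalParse> parses = new LinkedHashMap<>();
        parses.put("http://example.org/feed", () -> true);
        parses.put("http://example.org/item1", () -> false);
        parses.put("http://example.org/item2", () -> false);
        System.out.println(canonicalUrls(parses)); // prints [http://example.org/feed]
    }
}
```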



Copyright © 2011 The Apache Software Foundation