|
||||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |
See:
Description
Class Summary | |
---|---|
HTMLLanguageParser | Adds metadata identifying language of document if found We could also run statistical analysis here but we'd miss all other formats |
LanguageIdentifier | Identify the language of a content, based on statistical analysis. |
LanguageIndexingFilter | An IndexingFilter that
add a lang (language) field to the document. |
LanguageQueryFilter | Handles "lang:" query clauses, causing them to search the "lang" field indexed by LanguageIdentifier. |
NGramProfile | This class runs a ngram analysis over submitted text, results might be used for automatic language identifiaction. |
Text document language identifier.
Language profiles are based on material from http://www.isi.edu/~koehn/europarl/.
|
||||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |