public class NLPLemmatizerOp extends Object
Supplies OpenNLP lemmatizer tools.
Both a dictionary-based lemmatizer and a MaxEnt lemmatizer are supported. If both are configured, the dictionary-based lemmatizer is tried first, and then the MaxEnt lemmatizer is consulted for out-of-vocabulary tokens.
The MaxEnt implementation requires binary models from the OpenNLP project on SourceForge.
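The lookup order described above can be illustrated with a minimal, dependency-free sketch. It is not the code of this class: `LemmaFallbackSketch`, its `lemmatize` method, the tab-separated dictionary key, and the toy suffix-stripping "model" are all hypothetical stand-ins, and returning the original token when neither source yields a lemma is an assumption made for the illustration.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.function.BiFunction;

// Hypothetical sketch of "dictionary first, statistical model for misses".
public class LemmaFallbackSketch {

    /**
     * Returns one lemma per token. Either source may be null, mirroring the
     * case where only one of the two lemmatizers is configured.
     */
    public static String[] lemmatize(String[] tokens, String[] tags,
                                     Map<String, String> dictionary,
                                     BiFunction<String, String, String> statistical) {
        String[] lemmas = new String[tokens.length];
        for (int i = 0; i < tokens.length; i++) {
            // Assumed key format for this sketch: lowercased token + TAB + POS tag.
            String key = tokens[i].toLowerCase(Locale.ROOT) + "\t" + tags[i];
            String lemma = (dictionary == null) ? null : dictionary.get(key);
            if (lemma == null && statistical != null) {
                // Out-of-vocabulary token: consult the stand-in statistical model.
                lemma = statistical.apply(tokens[i], tags[i]);
            }
            // Assumption: keep the surface form when no lemma is found.
            lemmas[i] = (lemma != null) ? lemma : tokens[i];
        }
        return lemmas;
    }

    public static void main(String[] args) {
        Map<String, String> dictionary = new HashMap<>();
        dictionary.put("mice\tNNS", "mouse");

        // Toy stand-in for a trained model: strips a plural "s" from NNS tokens.
        BiFunction<String, String, String> model = (token, tag) ->
                tag.equals("NNS") && token.endsWith("s")
                        ? token.substring(0, token.length() - 1)
                        : null;

        String[] lemmas = lemmatize(
                new String[] {"mice", "gadgets", "quickly"},
                new String[] {"NNS", "NNS", "RB"},
                dictionary, model);
        System.out.println(String.join(" ", lemmas)); // prints "mouse gadget quickly"
    }
}
```

In the sketch, "mice" is resolved by the dictionary, "gadgets" misses the dictionary and is handled by the stand-in model, and "quickly" is returned unchanged because neither source produces a lemma.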
Constructor and Description |
---|
NLPLemmatizerOp(InputStream dictionary, opennlp.tools.lemmatizer.LemmatizerModel lemmatizerModel) |
public NLPLemmatizerOp(InputStream dictionary, opennlp.tools.lemmatizer.LemmatizerModel lemmatizerModel) throws IOException
Throws: IOException
Copyright © 2000-2018 Apache Software Foundation. All Rights Reserved.