public class MorfologikAnalyzer extends Analyzer
Analyzer using the Morfologik library.
Nested classes inherited from class Analyzer: Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents
Fields inherited from class Analyzer: GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY
Constructor and Description |
---|
MorfologikAnalyzer() Builds an analyzer with Morfologik's default Polish dictionary. |
MorfologikAnalyzer(morfologik.stemming.Dictionary dictionary) Builds an analyzer with an explicit Dictionary resource. |
Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | createComponents(String field) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |
Methods inherited from class Analyzer: close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, getVersion, initReader, setVersion, tokenStream, tokenStream
public MorfologikAnalyzer(morfologik.stemming.Dictionary dictionary)
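Builds an analyzer with an explicit Dictionary resource.
Parameters:
dictionary - A prebuilt automaton with inflected and base word forms.
public MorfologikAnalyzer()
Builds an analyzer with Morfologik's default Polish dictionary.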
protected Analyzer.TokenStreamComponents createComponents(String field)
Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader.
Specified by: createComponents in class Analyzer
Parameters:
field - ignored field name
Returns:
Analyzer.TokenStreamComponents built from a StandardTokenizer filtered with StandardFilter and MorfologikFilter.
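The tokenizer-then-filter chain described for createComponents can be pictured with a small plain-Java model. This is only a sketch and does not use the Lucene or Morfologik classes: the class name `LemmaChainSketch`, the helper methods, and the three-entry `TOY_DICTIONARY` are hypothetical stand-ins, and the pass-through of unknown tokens is an assumption made for illustration rather than a statement about MorfologikFilter.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Toy model of a tokenize-then-lemmatize chain; not the Lucene implementation. */
public class LemmaChainSketch {

    /** Hypothetical stand-in for a dictionary of inflected -> base word forms. */
    static final Map<String, String> TOY_DICTIONARY = Map.of(
            "kotami", "kot",
            "psy", "pies",
            "domy", "dom");

    /** Step 1: split on runs of non-letter, non-digit characters (a crude tokenizer). */
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.split("[^\\p{L}\\p{Nd}]+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    /**
     * Step 2: replace each token with its base form when the dictionary knows it.
     * Unknown tokens pass through unchanged (an assumption of this sketch).
     */
    static List<String> lemmatize(List<String> tokens, Map<String, String> dictionary) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            out.add(dictionary.getOrDefault(token.toLowerCase(Locale.ROOT), token));
        }
        return out;
    }

    /** Chains the two steps, mirroring the tokenizer -> filter ordering. */
    static List<String> analyze(String text) {
        return lemmatize(tokenize(text), TOY_DICTIONARY);
    }

    public static void main(String[] args) {
        System.out.println(analyze("Psy i domy")); // prints [pies, i, dom]
    }
}
```

In this model the dictionary is an ordinary map. The real constructor takes a morfologik.stemming.Dictionary, which the parameter description above characterizes as a prebuilt automaton.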
Copyright © 2000-2016 Apache Software Foundation. All Rights Reserved.