|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.apache.lucene.util.AttributeSource org.apache.lucene.analysis.TokenStream org.apache.lucene.analysis.TokenFilter org.apache.lucene.analysis.stempel.StempelFilter
public final class StempelFilter
Transforms the token stream as per the stemming algorithm.
Note: the input to the stemming filter must already be in lower case, so you will need to use LowerCaseFilter or LowerCaseTokenizer farther down the Tokenizer chain in order for this to work properly!
Nested Class Summary |
---|
Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource |
---|
org.apache.lucene.util.AttributeSource.AttributeFactory, org.apache.lucene.util.AttributeSource.State |
Field Summary | |
---|---|
static int |
DEFAULT_MIN_LENGTH
Minimum length of input words to be processed. |
Fields inherited from class org.apache.lucene.analysis.TokenFilter |
---|
input |
Constructor Summary | |
---|---|
StempelFilter(org.apache.lucene.analysis.TokenStream in,
StempelStemmer stemmer)
Create filter using the supplied stemming table. |
|
StempelFilter(org.apache.lucene.analysis.TokenStream in,
StempelStemmer stemmer,
int minLength)
Create filter using the supplied stemming table. |
Method Summary | |
---|---|
boolean |
incrementToken()
Returns the next input Token, after being stemmed |
Methods inherited from class org.apache.lucene.analysis.TokenFilter |
---|
close, end, reset |
Methods inherited from class org.apache.lucene.util.AttributeSource |
---|
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString |
Methods inherited from class java.lang.Object |
---|
clone, finalize, getClass, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
public static final int DEFAULT_MIN_LENGTH
Constructor Detail |
---|
public StempelFilter(org.apache.lucene.analysis.TokenStream in, StempelStemmer stemmer)
in
- input token streamstemmer
- stemmerpublic StempelFilter(org.apache.lucene.analysis.TokenStream in, StempelStemmer stemmer, int minLength)
in
- input token streamstemmer
- stemmerminLength
- For performance reasons words shorter than minLength
characters are not processed, but simply returned.Method Detail |
---|
public boolean incrementToken() throws IOException
incrementToken
in class org.apache.lucene.analysis.TokenStream
IOException
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |