public class CollationKeyFilterFactory extends BaseTokenFilterFactory implements MultiTermAwareComponent, ResourceLoaderAware
Factory for CollationKeyFilter.
This factory can be created in two ways:
Using a System collator:
Using a Tailored ruleset:
<fieldType name="text_clltnky" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.CollationKeyFilterFactory" language="ja" country="JP"/>
  </analyzer>
</fieldType>
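As a rough illustration of what the `language="ja" country="JP"` attributes select, the sketch below uses only JDK classes: it obtains the system collator for that locale and compares strings through their collation keys. The class `SystemCollatorSketch` and its helpers are hypothetical names introduced for this example, and the sketch does not reproduce how the filter encodes keys into index terms.

```java
import java.text.CollationKey;
import java.text.Collator;
import java.util.Locale;

// Hypothetical illustration; not part of Solr or Lucene.
final class SystemCollatorSketch {

    // Build the system collator for a language/country pair.
    static Collator systemCollator(String language, String country) {
        Locale locale = new Locale.Builder()
                .setLanguage(language)
                .setRegion(country)
                .build();
        return Collator.getInstance(locale);
    }

    // Compare two strings through their collation keys instead of directly.
    static int compareByKey(Collator collator, String a, String b) {
        CollationKey keyA = collator.getCollationKey(a);
        CollationKey keyB = collator.getCollationKey(b);
        return keyA.compareTo(keyB);
    }

    public static void main(String[] args) {
        Collator collator = systemCollator("ja", "JP");
        String first = "\u3042";  // hiragana "a"
        String second = "\u3044"; // hiragana "i"
        int viaKeys = compareByKey(collator, first, second);
        int direct = collator.compare(first, second);
        // Key comparison and direct comparison should agree in sign.
        System.out.println(Integer.signum(viaKeys) == Integer.signum(direct));
    }
}
```

Comparing precomputed keys is what makes collation usable in an index: the expensive locale-aware work happens once per value, and later comparisons only look at the keys.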
See Also: Collator, Locale, RuleBasedCollator
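For the tailored-ruleset option, a minimal JDK-only sketch of what a custom rule string does to sort order may help. The rule string `"< c < b < a"` and the class `TailoredRulesSketch` are made up for this example; they are not taken from the factory, and the sketch does not show how the factory loads its rules.

```java
import java.text.ParseException;
import java.text.RuleBasedCollator;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration; not part of Solr or Lucene.
final class TailoredRulesSketch {

    // Made-up rules: sort "c" before "b" before "a".
    static final String RULES = "< c < b < a";

    // Sort a copy of the words using a collator built from the given rules.
    static List<String> sortWithRules(String rules, List<String> words) {
        RuleBasedCollator collator;
        try {
            collator = new RuleBasedCollator(rules);
        } catch (ParseException e) {
            throw new IllegalArgumentException("Invalid collation rules", e);
        }
        List<String> sorted = new ArrayList<>(words);
        sorted.sort(collator); // Collator is a Comparator<Object>
        return sorted;
    }

    public static void main(String[] args) {
        System.out.println(sortWithRules(RULES, Arrays.asList("a", "b", "c")));
    }
}
```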
Modifier and Type | Field and Description |
---|---|
protected Map<String,String> | args: The init args. |
protected Version | luceneMatchVersion: The luceneVersion arg. |
log
Constructor and Description |
---|
CollationKeyFilterFactory() |
Modifier and Type | Method and Description |
---|---|
protected void | assureMatchVersion(): Can be called in the TokenizerFactory.create(java.io.Reader) or TokenFilterFactory.create(org.apache.lucene.analysis.TokenStream) methods to inform the user that this factory requires a luceneMatchVersion. |
TokenStream | create(TokenStream input): Transforms the specified input TokenStream. |
Map<String,String> | getArgs() |
protected boolean | getBoolean(String name, boolean defaultVal) |
protected boolean | getBoolean(String name, boolean defaultVal, boolean useDefault) |
protected int | getInt(String name) |
protected int | getInt(String name, int defaultVal) |
protected int | getInt(String name, int defaultVal, boolean useDefault) |
Object | getMultiTermComponent(): Returns an analysis component to handle analysis of multi-term queries. |
protected CharArraySet | getSnowballWordSet(ResourceLoader loader, String wordFiles, boolean ignoreCase): Same as getWordSet(ResourceLoader, String, boolean), except the input is in snowball format. |
protected CharArraySet | getWordSet(ResourceLoader loader, String wordFiles, boolean ignoreCase) |
void | inform(ResourceLoader loader) |
void | init(Map<String,String> args) |
protected void | warnDeprecated(String message) |
Methods inherited from class java.lang.Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface TokenFilterFactory: getArgs, init
protected Version luceneMatchVersion
public void inform(ResourceLoader loader)
Specified by: inform in interface ResourceLoaderAware
public TokenStream create(TokenStream input)
Description copied from interface: TokenFilterFactory
Transforms the specified input TokenStream.
Specified by: create in interface TokenFilterFactory
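public Object getMultiTermComponent()
Description copied from interface: MultiTermAwareComponent
Returns an analysis component to handle analysis of multi-term queries.
Specified by: getMultiTermComponent in interface MultiTermAwareComponent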
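Multi-term queries such as range queries compare indexed terms against endpoint terms, so the endpoints presumably need the same collation treatment as the indexed values. The sketch below illustrates that idea with plain JDK classes only; `CollatedRangeSketch` and `inRange` are hypothetical names, and the sketch makes no claim about what this method actually returns.

```java
import java.text.CollationKey;
import java.text.Collator;
import java.util.Locale;

// Hypothetical illustration; not part of Solr or Lucene.
final class CollatedRangeSketch {

    // Inclusive range check performed on collation keys, so the value and
    // both endpoints go through the same collator.
    static boolean inRange(Collator collator, String value, String lower, String upper) {
        CollationKey v = collator.getCollationKey(value);
        CollationKey lo = collator.getCollationKey(lower);
        CollationKey hi = collator.getCollationKey(upper);
        return lo.compareTo(v) <= 0 && v.compareTo(hi) <= 0;
    }

    public static void main(String[] args) {
        Collator english = Collator.getInstance(Locale.ENGLISH);
        // Collated comparison: "Banana" falls between "apple" and "cherry".
        System.out.println(inRange(english, "Banana", "apple", "cherry"));
        // Plain code-unit comparison: uppercase "B" sorts before lowercase "a".
        System.out.println("Banana".compareTo("apple") < 0);
    }
}
```

If the endpoints were compared as raw strings while the index held collated values, the two orderings would disagree for inputs like the mixed-case one above.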
protected final void assureMatchVersion()
This method can be called in the TokenizerFactory.create(java.io.Reader) or TokenFilterFactory.create(org.apache.lucene.analysis.TokenStream) methods to inform the user that this factory requires a luceneMatchVersion.
protected final void warnDeprecated(String message)
protected int getInt(String name)
protected int getInt(String name, int defaultVal)
protected int getInt(String name, int defaultVal, boolean useDefault)
protected boolean getBoolean(String name, boolean defaultVal)
protected boolean getBoolean(String name, boolean defaultVal, boolean useDefault)
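The page gives only the signatures of the `getInt` and `getBoolean` helpers. One plausible reading, assumed here and not confirmed by this page, is that they look up `name` in the init args, fall back to `defaultVal` when `useDefault` is true, and otherwise treat a missing argument as a configuration error. The class `ArgHelpersSketch`, the exception type, and the error message are all hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical re-implementation for illustration; the real helpers may differ.
final class ArgHelpersSketch {

    private final Map<String, String> args;

    ArgHelpersSketch(Map<String, String> args) {
        this.args = args;
    }

    // Assumed: without a default, the argument is mandatory.
    int getInt(String name) {
        return getInt(name, -1, false);
    }

    int getInt(String name, int defaultVal) {
        return getInt(name, defaultVal, true);
    }

    int getInt(String name, int defaultVal, boolean useDefault) {
        String raw = args.get(name);
        if (raw == null) {
            if (useDefault) {
                return defaultVal;
            }
            throw new IllegalArgumentException("Missing parameter '" + name + "'");
        }
        return Integer.parseInt(raw);
    }

    boolean getBoolean(String name, boolean defaultVal) {
        return getBoolean(name, defaultVal, true);
    }

    boolean getBoolean(String name, boolean defaultVal, boolean useDefault) {
        String raw = args.get(name);
        if (raw == null) {
            if (useDefault) {
                return defaultVal;
            }
            throw new IllegalArgumentException("Missing parameter '" + name + "'");
        }
        return Boolean.parseBoolean(raw);
    }

    public static void main(String[] unused) {
        Map<String, String> initArgs = new HashMap<>();
        initArgs.put("size", "100"); // "size" is a made-up argument name
        ArgHelpersSketch helpers = new ArgHelpersSketch(initArgs);
        System.out.println(helpers.getInt("size"));
        System.out.println(helpers.getBoolean("enabled", true));
    }
}
```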
protected CharArraySet getWordSet(ResourceLoader loader, String wordFiles, boolean ignoreCase) throws IOException
Throws: IOException
protected CharArraySet getSnowballWordSet(ResourceLoader loader, String wordFiles, boolean ignoreCase) throws IOException
Same as getWordSet(ResourceLoader, String, boolean), except the input is in snowball format.
Throws: IOException