public abstract class ICUTokenizerConfig extends Object
Modifier and Type | Field and Description |
---|---|
static int |
EMOJI_SEQUENCE_STATUS
Rule status for emoji sequences
|
Constructor and Description |
---|
ICUTokenizerConfig()
Sole constructor.
|
Modifier and Type | Method and Description |
---|---|
abstract boolean |
combineCJ()
true if Han, Hiragana, and Katakana scripts should all be returned as Japanese
|
abstract com.ibm.icu.text.RuleBasedBreakIterator |
getBreakIterator(int script)
Return a breakiterator capable of processing a given script.
|
abstract String |
getType(int script,
int ruleStatus)
Return a token type value for a given script and BreakIterator
rule status.
|
public static final int EMOJI_SEQUENCE_STATUS
public ICUTokenizerConfig()
public abstract com.ibm.icu.text.RuleBasedBreakIterator getBreakIterator(int script)
public abstract String getType(int script, int ruleStatus)
public abstract boolean combineCJ()
Copyright © 2000-2018 Apache Software Foundation. All Rights Reserved.