A LetterTokenizer is a tokenizer that divides text at non-letters. That's to say, it defines tokens as maximal strings of adjacent letters, as defined by java.lang.Character.isLetter() predicate. Note: this does a decent job for most European languages, but does a terrible job for some Asian languages, where words are not separated by spaces.
For a list of all members of this type, see LetterTokenizer Members.
System.Object
Lucene.Net.Analysis.TokenStream
Lucene.Net.Analysis.Tokenizer
Lucene.Net.Analysis.CharTokenizer
Lucene.Net.Analysis.LetterTokenizer
Lucene.Net.Analysis.LowerCaseTokenizer
Public static (Shared in Visual Basic) members of this type are safe for multithreaded operations. Instance members are not guaranteed to be thread-safe.
Namespace: Lucene.Net.Analysis
Assembly: Lucene.Net (in Lucene.Net.dll)