Lucene.Net 1.4.3 Class Library

RussianLetterTokenizer Class

A RussianLetterTokenizer is a tokenizer that extends the behavior of LetterTokenizer by additionally looking up letters in a given "Russian charset". The problem with LetterTokenizer is that it uses the Character.isLetter() method, which does not recognize letters in encodings such as CP1251 and KOI8 (the well-known problems with the 0xD7 and 0xF7 characters).

For a list of all members of this type, see RussianLetterTokenizer Members.

System.Object
   Lucene.Net.Analysis.TokenStream
      Lucene.Net.Analysis.Tokenizer
         Lucene.Net.Analysis.CharTokenizer
            Lucene.Net.Analysis.RU.RussianLetterTokenizer

public class RussianLetterTokenizer : CharTokenizer
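
The charset lookup described in the summary can be illustrated with a small, self-contained sketch. It is not the library's implementation and does not use Lucene.Net. The names CharsetLetterSketch, ExtraLetters, IsTokenChar and Tokenize are hypothetical, and the two-character charset is an assumption chosen only to show the 0xD7 and 0xF7 case, where single-byte text has been read as the characters U+00D7 and U+00F7.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

public static class CharsetLetterSketch
{
    // Hypothetical charset for illustration: bytes 0xD7 and 0xF7 read as
    // U+00D7 and U+00F7, which Char.IsLetter classifies as math symbols.
    public static readonly char[] ExtraLetters = { '\u00D7', '\u00F7' };

    // A character belongs to a token if it is a Unicode letter
    // or if it appears in the supplied charset.
    public static bool IsTokenChar(char c, char[] charset)
    {
        if (Char.IsLetter(c))
        {
            return true;
        }
        return Array.IndexOf(charset, c) >= 0;
    }

    // Splits text into maximal runs of token characters.
    public static List<string> Tokenize(string text, char[] charset)
    {
        List<string> tokens = new List<string>();
        StringBuilder current = new StringBuilder();
        foreach (char c in text)
        {
            if (IsTokenChar(c, charset))
            {
                current.Append(c);
            }
            else if (current.Length > 0)
            {
                tokens.Add(current.ToString());
                current.Length = 0; // reset the buffer for the next token
            }
        }
        if (current.Length > 0)
        {
            tokens.Add(current.ToString());
        }
        return tokens;
    }
}
```

With the charset supplied, a word containing U+00D7 stays in one token. With an empty charset the same word is split at that character, which is the plain letter-only behavior the summary describes as the problem.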

Thread Safety

Public static (Shared in Visual Basic) members of this type are safe for multithreaded operations. Instance members are not guaranteed to be thread-safe.

Requirements

Namespace: Lucene.Net.Analysis.RU

Assembly: Lucene.Net (in Lucene.Net.dll)

See Also

RussianLetterTokenizer Members | Lucene.Net.Analysis.RU Namespace