Creates a shingle filter with ad hoc parameter settings.
Namespace: Lucene.Net.Analyzers.ShingleAssembly: Lucene.Net.Contrib.Analyzers (in Lucene.Net.Contrib.Analyzers.dll) Version: 2.9.2.1 (2.9.2.1)
Syntax
C# |
---|
public ShingleMatrixFilter( TokenStream input, int minimumShingleSize, int maximumShingleSize, Nullable<char> spacerCharacter, bool ignoringSinglePrefixOrSuffixShingle, TokenSettingsCodec settingsCodec ) |
Visual Basic |
---|
Public Sub New ( _ input As TokenStream, _ minimumShingleSize As Integer, _ maximumShingleSize As Integer, _ spacerCharacter As Nullable(Of Char), _ ignoringSinglePrefixOrSuffixShingle As Boolean, _ settingsCodec As TokenSettingsCodec _ ) |
Visual C++ |
---|
public: ShingleMatrixFilter( TokenStream^ input, int minimumShingleSize, int maximumShingleSize, Nullable<wchar_t> spacerCharacter, bool ignoringSinglePrefixOrSuffixShingle, TokenSettingsCodec^ settingsCodec ) |
Parameters
- input
- Type: Lucene.Net.Analysis..::..TokenStream
stream from which to construct the matrix
- minimumShingleSize
- Type: System..::..Int32
minimum number of tokens in any shingle.
- maximumShingleSize
- Type: System..::..Int32
maximum number of tokens in any shingle.
- spacerCharacter
- Type: System..::..Nullable<(Of <(<'Char>)>)>
character to use between texts of the token parts in a shingle. null for none.
- ignoringSinglePrefixOrSuffixShingle
- Type: System..::..Boolean
if true, shingles that only contains permutation of the first of the last column will not be produced as shingles. Useful when adding boundary marker tokens such as '^' and '$'.
- settingsCodec
- Type: Lucene.Net.Analyzers.Shingle.Codec..::..TokenSettingsCodec
codec used to read input token weight and matrix positioning.