Creates a shingle filter using the default TokenSettingsCodec.
See DefaultSettingsCodec
Namespace: Lucene.Net.Analyzers.Shingle
Assembly: Lucene.Net.Contrib.Analyzers (in Lucene.Net.Contrib.Analyzers.dll)
Version: 2.9.2.1 (2.9.2.1)
Syntax
C#
public ShingleMatrixFilter(
    TokenStream input,
    int minimumShingleSize,
    int maximumShingleSize,
    Nullable<char> spacerCharacter,
    bool ignoringSinglePrefixOrSuffixShingle
)
Visual C++
public:
ShingleMatrixFilter(
    TokenStream^ input,
    int minimumShingleSize,
    int maximumShingleSize,
    Nullable<wchar_t> spacerCharacter,
    bool ignoringSinglePrefixOrSuffixShingle
)
Parameters
- input
- Type: Lucene.Net.Analysis.TokenStream
The stream from which to construct the matrix.
- minimumShingleSize
- Type: System.Int32
The minimum number of tokens in any shingle.
- maximumShingleSize
- Type: System.Int32
The maximum number of tokens in any shingle.
- spacerCharacter
- Type: System.Nullable<Char>
The character to insert between the texts of the token parts in a shingle, or null for none.
- ignoringSinglePrefixOrSuffixShingle
- Type: System.Boolean
If true, shingles that contain only permutations of the first or the last column are not produced. Useful when adding boundary marker tokens such as '^' and '$'.
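To make the parameters above concrete, the following is a minimal, self-contained sketch of how the size limits, the spacer character, and the boundary flag might interact. It is not the library's implementation: `ShingleSketch` and `Shingles` are hypothetical names, the input is a plain list of strings rather than a TokenStream, and it covers only the simplest case where every matrix column holds a single token (so no row permutations occur). It also assumes that "only the first or the last column" means a one-token shingle taken from the first or last position.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical illustration only; not part of Lucene.Net.
public static class ShingleSketch
{
    // Builds contiguous shingles from a flat token list.
    // Parameter names mirror the constructor documented above.
    public static List<string> Shingles(
        IList<string> tokens,
        int minimumShingleSize,
        int maximumShingleSize,
        char? spacerCharacter,
        bool ignoringSinglePrefixOrSuffixShingle)
    {
        var result = new List<string>();

        // A null spacer means the token texts are joined with nothing between them.
        string spacer = spacerCharacter.HasValue
            ? spacerCharacter.Value.ToString()
            : "";

        for (int start = 0; start < tokens.Count; start++)
        {
            for (int size = minimumShingleSize;
                 size <= maximumShingleSize && start + size <= tokens.Count;
                 size++)
            {
                // Assumption: a shingle consisting solely of the first or last
                // token (for example a '^' or '$' boundary marker) is skipped.
                bool boundaryOnly =
                    size == 1 && (start == 0 || start == tokens.Count - 1);
                if (ignoringSinglePrefixOrSuffixShingle && boundaryOnly)
                {
                    continue;
                }

                var parts = new string[size];
                for (int i = 0; i < size; i++)
                {
                    parts[i] = tokens[start + i];
                }
                result.Add(string.Join(spacer, parts));
            }
        }
        return result;
    }
}
```

For the tokens `^`, `hello`, `world`, `$` with a minimum size of 1, a maximum size of 2, and `'_'` as the spacer, this sketch yields `^_hello`, `hello`, `hello_world`, `world`, and `world_$` when the flag is true; the lone `^` and `$` shingles are dropped. The order in which the real filter emits shingles may differ.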
See Also