public class EnhancementsDocumentBuilder extends CategoryDocumentBuilder
EnhancementsDocumentBuilder
is a CategoryDocumentBuilder
which adds categories to documents according to the list of
CategoryEnhancement
s from EnhancementsIndexingParams
. The
additions over CategoryDocumentBuilder
could be in both category
tokens, and additional category lists.categoriesMap, fieldList, indexingParams, taxonomyWriter
Constructor and Description |
---|
EnhancementsDocumentBuilder(TaxonomyWriter taxonomyWriter,
EnhancementsIndexingParams params) |
Modifier and Type | Method and Description |
---|---|
protected CategoryListTokenizer |
getCategoryListTokenizer(org.apache.lucene.analysis.TokenStream categoryStream)
Get a category list tokenizer (or a series of such tokenizers) to create
the category list tokens.
|
protected CategoryTokenizer |
getCategoryTokenizer(org.apache.lucene.analysis.TokenStream categoryStream)
Get a
CategoryTokenizer to create the category tokens. |
protected org.apache.lucene.analysis.TokenStream |
getParentsStream(CategoryAttributesStream categoryAttributesStream)
Get a stream of categories which includes the parents, according to
policies defined in indexing parameters.
|
build, fillCategoriesMap, getCountingListTokenizer, setCategories, setCategoryPaths
public EnhancementsDocumentBuilder(TaxonomyWriter taxonomyWriter, EnhancementsIndexingParams params) throws IOException
taxonomyWriter
- params
- Indexing params which include CategoryEnhancement
s.IOException
protected org.apache.lucene.analysis.TokenStream getParentsStream(CategoryAttributesStream categoryAttributesStream)
CategoryDocumentBuilder
getParentsStream
in class CategoryDocumentBuilder
categoryAttributesStream
- The input streamOrdinalPolicy (for policy of adding category tokens for parents)
,
PathPolicy (for policy of adding category list tokens for parents)
protected CategoryListTokenizer getCategoryListTokenizer(org.apache.lucene.analysis.TokenStream categoryStream)
CategoryDocumentBuilder
getCategoryListTokenizer
in class CategoryDocumentBuilder
categoryStream
- A stream containing CategoryAttribute
with the
relevant data.protected CategoryTokenizer getCategoryTokenizer(org.apache.lucene.analysis.TokenStream categoryStream) throws IOException
CategoryDocumentBuilder
CategoryTokenizer
to create the category tokens.
This method can be overridden for adding more attributes to the category
tokens.getCategoryTokenizer
in class CategoryDocumentBuilder
categoryStream
- A stream containing CategoryAttribute
with the
relevant data.CategoryTokenizer
to be used in creating category
tokens.IOException