Highlighter.Net 2.0.0

Lucene.Net.Highlight Namespace

Classes

Class	Description
DefaultEncoder	Simple Encoder implementation that does not modify the output.
GradientFormatter	Formats text with different color intensity depending on the score of the term.
Highlighter	Class used to mark up highlighted terms found in the best sections of a text, using configurable Fragmenter, Scorer, Formatter, Encoder and tokenizers.
NullFragmenter	Fragmenter implementation which does not fragment the text. This is useful for highlighting the entire content of a document or field.
QueryScorer	Scorer implementation which scores text fragments by the number of unique query terms found. This class uses the QueryTermExtractor class to determine the query terms and their boosts to be used.
QueryTermExtractor	Utility class used to extract the terms used in a query, plus any weights. This class will not find terms for the MultiTermQuery, RangeQuery and PrefixQuery classes, so the caller must pass a rewritten query (see Query.rewrite) to obtain a list of expanded terms.
SimpleFragmenter	Fragmenter implementation which breaks text up into same-size fragments with no concerns over spotting sentence boundaries.
SimpleHTMLEncoder	Simple Encoder implementation to escape text for HTML output.
SimpleHTMLFormatter	Simple Formatter implementation to highlight terms with a pre and post tag.
SpanGradientFormatter	Formats text with different color intensity depending on the score of the term, using the span tag. This class exists because GradientFormatter uses a bgcolor argument to the font tag, which does not work in Mozilla.
TextFragment	Low-level class used to record information about a section of a document with a score.
TokenGroup	One token, or several overlapping tokens, along with the score(s) and the scope of the original text.
TokenSources	Hides implementation issues associated with obtaining a TokenStream for use with the highlighter. The stream can be obtained from TermFreqVectors with offsets and (optionally) positions, or from an Analyzer class reparsing the stored content.
TokenSources.StoredTokenStream
WeightedTerm	Lightweight class to hold a term and a weight value used for scoring this term.
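
The fragmenting and scoring policies described for SimpleFragmenter and QueryScorer can be illustrated with a small standalone sketch. It is written in Java for illustration only and does not call the library: the class name FragmentScoringSketch and all of its methods are hypothetical, it splits on character counts rather than on analyzed tokens, and it ignores term boosts.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; none of these names are Highlighter.Net APIs.
class FragmentScoringSketch {

    // Break text into same-size pieces, ignoring sentence boundaries
    // (the policy the SimpleFragmenter description outlines).
    // Assumption: size is measured in characters, so words may be cut.
    static List<String> fixedSizeFragments(String text, int size) {
        if (size <= 0) {
            throw new IllegalArgumentException("size must be positive");
        }
        List<String> fragments = new ArrayList<>();
        for (int start = 0; start < text.length(); start += size) {
            int end = Math.min(text.length(), start + size);
            fragments.add(text.substring(start, end));
        }
        return fragments;
    }

    // Score a fragment by the number of unique query terms it contains
    // (the idea the QueryScorer description outlines, without boosts).
    // Assumption: queryTerms are already lower-cased.
    static int uniqueTermScore(String fragment, Set<String> queryTerms) {
        Set<String> seen = new HashSet<>();
        for (String token : fragment.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (queryTerms.contains(token)) {
                seen.add(token);
            }
        }
        return seen.size();
    }

    // Return the highest-scoring fragment; the earliest fragment wins ties.
    // Returns an empty string when the text is empty.
    static String bestFragment(String text, int size, Set<String> queryTerms) {
        String best = "";
        int bestScore = -1;
        for (String fragment : fixedSizeFragments(text, size)) {
            int score = uniqueTermScore(fragment, queryTerms);
            if (score > bestScore) {
                bestScore = score;
                best = fragment;
            }
        }
        return best;
    }
}
```

Counting each query term once per fragment, rather than counting every occurrence, favors fragments that cover more of the query over fragments that repeat a single term.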

Interfaces

Interface	Description
Encoder	Encodes original text. The Encoder works with the Formatter to generate the output.
Formatter	Processes terms found in the original text, typically by applying some form of mark-up to highlight terms in HTML search results pages.
Fragmenter	Implements the policy for breaking text into multiple fragments for consideration by the Highlighter class. A sophisticated implementation may do this on the basis of detecting end of sentences in the text.
Scorer	Adds to the score for a fragment based on its tokens.
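
How an Encoder and a Formatter might cooperate to produce output can be sketched as follows. This is a standalone Java illustration under stated assumptions, not the library's API: the interfaces TextEncoder and TermFormatter, their method signatures, and the highlight helper are all hypothetical, and a term's score is simply 1 when it matches a query term and 0 otherwise.

```java
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; these interfaces are not Highlighter.Net types.
class EncoderFormatterSketch {

    // Encodes original text, for example by escaping it for HTML.
    interface TextEncoder {
        String encode(String original);
    }

    // Applies mark-up to already-encoded text depending on its score.
    interface TermFormatter {
        String format(String encodedText, float score);
    }

    // Escapes the characters that are significant in HTML.
    // '&' is replaced first so that later entities are not escaped twice.
    static final TextEncoder HTML_ENCODER = s -> s
            .replace("&", "&amp;")
            .replace("<", "&lt;")
            .replace(">", "&gt;")
            .replace("\"", "&quot;");

    // Wraps scoring text in a pre and post tag; leaves other text unchanged.
    static TermFormatter tagFormatter(String pre, String post) {
        return (text, score) -> score > 0 ? pre + text + post : text;
    }

    // Encode every piece of the text, then let the formatter mark up
    // the pieces that match a query term.
    // Assumption: queryTerms are already lower-cased.
    static String highlight(String text, Set<String> queryTerms,
                            TextEncoder encoder, TermFormatter formatter) {
        StringBuilder out = new StringBuilder();
        // Split around non-word characters while keeping them as pieces.
        for (String piece : text.split("(?<=\\W)|(?=\\W)")) {
            boolean hit = queryTerms.contains(piece.toLowerCase(Locale.ROOT));
            out.append(formatter.format(encoder.encode(piece), hit ? 1f : 0f));
        }
        return out.toString();
    }
}
```

Encoding before formatting keeps the formatter's own tags intact while any mark-up characters in the original text are escaped.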