Class AnalyzerManager

java.lang.Object
org.apache.tika.eval.core.tokens.AnalyzerManager

public class AnalyzerManager extends Object
  • Method Details

    • newInstance

      public static AnalyzerManager newInstance(int maxTokens)
    • getGeneralAnalyzer

      public org.apache.lucene.analysis.Analyzer getGeneralAnalyzer()
      This analyzer should be used to extract all tokens.
      Returns:
    • getCommonTokensAnalyzer

      public org.apache.lucene.analysis.Analyzer getCommonTokensAnalyzer()
      This analyzer should be used to generate common tokens lists from large corpora. It is not used by tika-eval in profiling or comparing.
      Returns: