Class CharsetUtils

java.lang.Object
org.apache.tika.utils.CharsetUtils

public class CharsetUtils extends Object
  • Constructor Details

    • CharsetUtils

      public CharsetUtils()
  • Method Details

    • isSupported

      public static boolean isSupported(String charsetName)
      Safely return whether is supported, without throwing exceptions
      Parameters:
      charsetName - Name of charset (can be null)
      Returns:
      true if the character set is supported
    • clean

      public static String clean(String charsetName)
      Handle various common charset name errors, and return something that will be considered valid (and is normalized)
      Parameters:
      charsetName - name of charset to process
      Returns:
      potentially remapped/cleaned up version of charset name
    • forName

      public static Charset forName(String name)
      Returns Charset impl, if one exists. This method optionally uses ICU4J's CharsetICU.forNameICU, if it is found on the classpath, else only uses JDK's builtin Charset.forName.