- CachedTranslator - Class in org.apache.tika.language.translate
-
CachedTranslator.
- CachedTranslator() - Constructor for class org.apache.tika.language.translate.CachedTranslator
-
- CachedTranslator(Translator) - Constructor for class org.apache.tika.language.translate.CachedTranslator
-
Create a new CachedTranslator.
- call() - Method in class org.apache.tika.batch.BatchProcess
-
Runs main execution loop.
- call() - Method in class org.apache.tika.batch.FileResourceConsumer
-
- call() - Method in class org.apache.tika.batch.FileResourceCrawler
-
- call() - Method in class org.apache.tika.batch.fs.strawman.StrawManTikaAppDriver
-
- call() - Method in class org.apache.tika.batch.Interrupter
-
- call() - Method in class org.apache.tika.batch.StatusReporter
-
Startup the reporter.
- CAN_MODIFY - Static variable in interface org.apache.tika.metadata.AccessPermissions
-
Can any modifications be made to the document
- CAN_MODIFY_ANNOTATIONS - Static variable in interface org.apache.tika.metadata.AccessPermissions
-
Can the user modify annotations
- CAN_PRINT - Static variable in interface org.apache.tika.metadata.AccessPermissions
-
Can the user print the document
- CAN_PRINT_DEGRADED - Static variable in interface org.apache.tika.metadata.AccessPermissions
-
Can the user print an image-degraded version of the document.
- canRun() - Static method in class org.apache.tika.langdetect.TextLangDetector
-
- canRun() - Static method in class org.apache.tika.parser.journal.GrobidRESTParser
-
- CAPTION_WRITER - Static variable in interface org.apache.tika.metadata.Photoshop
-
- cast(InputStream) - Static method in class org.apache.tika.io.TikaInputStream
-
Returns the given stream casts to a TikaInputStream, or
null
if the stream is not a TikaInputStream.
- CATEGORY - Static variable in interface org.apache.tika.metadata.IPTC
-
Deprecated.
- CATEGORY - Static variable in interface org.apache.tika.metadata.MSOffice
-
Deprecated.
- CATEGORY - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
-
A categorization of the content of this package.
- CATEGORY - Static variable in interface org.apache.tika.metadata.Photoshop
-
- Cell - Interface in org.apache.tika.parser.microsoft
-
Cell of content.
- cell(String, String, XSSFComment) - Method in class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator.SheetTextAsHTML
-
- CellDecorator - Class in org.apache.tika.parser.microsoft
-
Cell decorator.
- CellDecorator(Cell) - Constructor for class org.apache.tika.parser.microsoft.CellDecorator
-
- CERTIFICATE - Static variable in interface org.apache.tika.metadata.XMPRights
-
A Web URL for a rights management certificate.
- ChannelTypePropertyConverter() - Constructor for class org.apache.tika.metadata.XMPDM.ChannelTypePropertyConverter
-
Deprecated.
- CHARACTER_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
-
Deprecated.
- CHARACTER_COUNT - Static variable in interface org.apache.tika.metadata.Office
-
The number of Characters in the document
- CHARACTER_COUNT_WITH_SPACES - Static variable in interface org.apache.tika.metadata.MSOffice
-
Deprecated.
- CHARACTER_COUNT_WITH_SPACES - Static variable in interface org.apache.tika.metadata.Office
-
The number of Characters in the document, including spaces
- characters - Variable in class org.apache.tika.mime.MimeTypesReader
-
- characters(char[], int, int) - Method in class org.apache.tika.mime.MimeTypesReader
-
- characters(char[], int, int) - Method in class org.apache.tika.parser.ctakes.CTAKESContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.parser.dif.DIFContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.parser.html.BoilerpipeContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator.XSSFSheetInterestingPartsCapturer
-
- characters(char[], int, int) - Method in class org.apache.tika.parser.xml.AttributeDependantMetadataHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.parser.xml.ElementMetadataHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.parser.xml.MetadataHandler
-
Deprecated.
- characters(char[], int, int) - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.DIFContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.ExpandedTitleContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.LinkContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.PhoneExtractingContentHandler
-
The characters method is called whenever a Parser wants to pass raw...
- characters(char[], int, int) - Method in class org.apache.tika.sax.SafeContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.SecureContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.TeeContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.TextContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.ToTextContentHandler
-
Writes the given characters to the given character stream.
- characters(char[], int, int) - Method in class org.apache.tika.sax.ToXMLContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.WriteOutContentHandler
-
Writes the given characters to the given character stream.
- characters(char[], int, int) - Method in class org.apache.tika.sax.XHTMLContentHandler
-
- characters(String) - Method in class org.apache.tika.sax.XHTMLContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
-
- CharsetDetector - Class in org.apache.tika.parser.txt
-
CharsetDetector
provides a facility for detecting the
charset or encoding of character data in an unknown format.
- CharsetDetector() - Constructor for class org.apache.tika.parser.txt.CharsetDetector
-
Constructor
- CharsetMatch - Class in org.apache.tika.parser.txt
-
This class represents a charset that has been identified by a CharsetDetector
as a possible encoding for a set of input data.
- CharsetUtils - Class in org.apache.tika.utils
-
- CharsetUtils() - Constructor for class org.apache.tika.utils.CharsetUtils
-
- check(String, int...) - Static method in class org.apache.tika.embedder.ExternalEmbedder
-
Checks to see if the command can be run.
- check(String[], int...) - Static method in class org.apache.tika.embedder.ExternalEmbedder
-
Checks to see if the command can be run.
- check(String, int...) - Static method in class org.apache.tika.parser.external.ExternalParser
-
Checks to see if the command can be run.
- check(String[], int...) - Static method in class org.apache.tika.parser.external.ExternalParser
-
- check(Metadata) - Method in class org.apache.tika.parser.pdf.AccessChecker
-
- CHECK_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
-
- checkAvail() - Method in class org.apache.tika.parser.geo.topic.gazetteer.GeoGazetteerClient
-
Ping lucene-geo-gazetteer API
- checkBit(int) - Method in class org.apache.tika.parser.chm.lzx.ChmSection
-
- checkCommand(String, int...) - Method in class org.apache.tika.language.translate.ExternalTranslator
-
Checks to see if the command can be run.
- checkForTimedOutMillis(long) - Method in class org.apache.tika.batch.FileResourceConsumer
-
Checks to see if the currentFile being processed (if there is one)
should be timed out (still being worked on after staleThresholdMillis).
- checkThisIsAncestorOfOrSameAsThat(File, File) - Static method in class org.apache.tika.batch.fs.FSUtil
-
Deprecated.
- checkThisIsAncestorOfThat(File, File) - Static method in class org.apache.tika.batch.fs.FSUtil
-
Deprecated.
- ChildMatcher - Class in org.apache.tika.sax.xpath
-
Intermediate evaluation state of a .../*...
XPath expression.
- ChildMatcher(Matcher) - Constructor for class org.apache.tika.sax.xpath.ChildMatcher
-
- CHM_ITSF_V2_LEN - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_ITSF_V3_LEN - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_ITSP_V1_LEN - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_LZXC_MIN_LEN - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_LZXC_RESETTABLE_V1_LEN - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_LZXC_V2_LEN - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_PMGI_LEN - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_PMGI_MARKER - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_PMGL_LEN - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_SIGNATURE_LEN - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_VER_1 - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_VER_2 - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_VER_3 - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CHM_WINDOW_SIZE_BLOCK - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- ChmAccessor<T> - Interface in org.apache.tika.parser.chm.accessor
-
Defines an accessor interface
- ChmAssert - Class in org.apache.tika.parser.chm.assertion
-
Contains chm extractor assertions
- ChmAssert() - Constructor for class org.apache.tika.parser.chm.assertion.ChmAssert
-
- ChmBlockInfo - Class in org.apache.tika.parser.chm.lzx
-
A container that contains chm block information such as: i.
- ChmCommons - Class in org.apache.tika.parser.chm.core
-
- ChmCommons.EntryType - Enum in org.apache.tika.parser.chm.core
-
Represents entry types: uncompressed, compressed
- ChmCommons.IntelState - Enum in org.apache.tika.parser.chm.core
-
Represents intel file states during decompression
- ChmCommons.LzxState - Enum in org.apache.tika.parser.chm.core
-
Represents lzx states: started decoding, not started decoding
- ChmConstants - Class in org.apache.tika.parser.chm.core
-
- ChmDirectoryListingSet - Class in org.apache.tika.parser.chm.accessor
-
Holds chm listing entries
- ChmDirectoryListingSet(byte[], ChmItsfHeader, ChmItspHeader) - Constructor for class org.apache.tika.parser.chm.accessor.ChmDirectoryListingSet
-
Constructs chm directory listing set
- ChmExtractor - Class in org.apache.tika.parser.chm.core
-
Extracts text from chm file.
- ChmExtractor(InputStream) - Constructor for class org.apache.tika.parser.chm.core.ChmExtractor
-
- ChmItsfHeader - Class in org.apache.tika.parser.chm.accessor
-
The Header 0000: char[4] 'ITSF' 0004: DWORD 3 (Version number) 0008: DWORD
Total header length, including header section table and following data.
- ChmItsfHeader() - Constructor for class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
- ChmItspHeader - Class in org.apache.tika.parser.chm.accessor
-
Directory header The directory starts with a header; its format is as
follows: 0000: char[4] 'ITSP' 0004: DWORD Version number 1 0008: DWORD Length
of the directory header 000C: DWORD $0a (unknown) 0010: DWORD $1000 Directory
chunk size 0014: DWORD "Density" of quickref section, usually 2 0018: DWORD
Depth of the index tree - 1 there is no index, 2 if there is one level of
PMGI chunks 001C: DWORD Chunk number of root index chunk, -1 if there is none
(though at least one file has 0 despite there being no index chunk, probably
a bug) 0020: DWORD Chunk number of first PMGL (listing) chunk 0024: DWORD
Chunk number of last PMGL (listing) chunk 0028: DWORD -1 (unknown) 002C:
DWORD Number of directory chunks (total) 0030: DWORD Windows language ID
0034: GUID {5D02926A-212E-11D0-9DF9-00A0C922E6EC} 0044: DWORD $54 (This is
the length again) 0048: DWORD -1 (unknown) 004C: DWORD -1 (unknown) 0050:
DWORD -1 (unknown)
://translated.by/you/microsoft-s-html-help-chm-format-incomplete/original
/?show-translation-form=1
- ChmItspHeader() - Constructor for class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
- ChmLzxBlock - Class in org.apache.tika.parser.chm.lzx
-
Decompresses a chm block.
- ChmLzxBlock(int, byte[], long, ChmLzxBlock) - Constructor for class org.apache.tika.parser.chm.lzx.ChmLzxBlock
-
- ChmLzxcControlData - Class in org.apache.tika.parser.chm.accessor
-
::DataSpace/Storage//ControlData This file contains $20 bytes of
information on the compression.
- ChmLzxcControlData() - Constructor for class org.apache.tika.parser.chm.accessor.ChmLzxcControlData
-
- ChmLzxcResetTable - Class in org.apache.tika.parser.chm.accessor
-
LZXC reset table For ensuring a decompression.
- ChmLzxcResetTable() - Constructor for class org.apache.tika.parser.chm.accessor.ChmLzxcResetTable
-
- ChmLzxState - Class in org.apache.tika.parser.chm.lzx
-
- ChmLzxState(int) - Constructor for class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- ChmParser - Class in org.apache.tika.parser.chm
-
- ChmParser() - Constructor for class org.apache.tika.parser.chm.ChmParser
-
- ChmParsingException - Exception in org.apache.tika.parser.chm.exception
-
- ChmParsingException(String) - Constructor for exception org.apache.tika.parser.chm.exception.ChmParsingException
-
- ChmPmgiHeader - Class in org.apache.tika.parser.chm.accessor
-
Description Note: not always exists An index chunk has the following format:
0000: char[4] 'PMGI' 0004: DWORD Length of quickref/free area at end of
directory chunk 0008: Directory index entries (to quickref/free area) The
quickref area in an PMGI is the same as in an PMGL The format of a directory
index entry is as follows: BYTE: length of name BYTEs: name (UTF-8 encoded)
ENCINT: directory listing chunk which starts with name Encoded Integers aka
ENCINT An ENCINT is a variable-length integer.
- ChmPmgiHeader() - Constructor for class org.apache.tika.parser.chm.accessor.ChmPmgiHeader
-
- ChmPmglHeader - Class in org.apache.tika.parser.chm.accessor
-
Description There are two types of directory chunks -- index chunks, and
listing chunks.
- ChmPmglHeader() - Constructor for class org.apache.tika.parser.chm.accessor.ChmPmglHeader
-
- ChmSection - Class in org.apache.tika.parser.chm.lzx
-
- ChmSection(byte[]) - Constructor for class org.apache.tika.parser.chm.lzx.ChmSection
-
- ChmSection(byte[], byte[]) - Constructor for class org.apache.tika.parser.chm.lzx.ChmSection
-
- ChmWrapper - Class in org.apache.tika.parser.chm.core
-
- ChmWrapper() - Constructor for class org.apache.tika.parser.chm.core.ChmWrapper
-
- CITY - Static variable in interface org.apache.tika.metadata.IPTC
-
Name of the city the content is focussing on -- either the place shown
in visual media or referenced by text or audio media.
- CITY - Static variable in interface org.apache.tika.metadata.Photoshop
-
- ClassLoaderUtil - Class in org.apache.tika.util
-
- ClassLoaderUtil() - Constructor for class org.apache.tika.util.ClassLoaderUtil
-
- className - Variable in class org.apache.tika.server.resource.TikaWelcome.Endpoint
-
- ClassParser - Class in org.apache.tika.parser.asm
-
Parser for Java .class files.
- ClassParser() - Constructor for class org.apache.tika.parser.asm.ClassParser
-
- clean(String) - Static method in class org.apache.tika.sax.CleanPhoneText
-
- clean(String) - Static method in class org.apache.tika.utils.CharsetUtils
-
Handle various common charset name errors, and return something
that will be considered valid (and is normalized)
- CleanPhoneText - Class in org.apache.tika.sax
-
Class to help de-obfuscate phone numbers in text.
- CleanPhoneText() - Constructor for class org.apache.tika.sax.CleanPhoneText
-
- cleanSubstitutions - Static variable in class org.apache.tika.sax.CleanPhoneText
-
- clearProfiles() - Static method in class org.apache.tika.language.LanguageIdentifier
-
Deprecated.
Clears the current map of language profiles
- ClimateForcast - Interface in org.apache.tika.metadata
-
- clone() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- close(Closeable) - Method in class org.apache.tika.batch.FileResourceConsumer
-
- close() - Method in class org.apache.tika.fork.ForkParser
-
- close() - Method in class org.apache.tika.io.CloseShieldInputStream
-
- close() - Method in class org.apache.tika.io.LookaheadInputStream
-
- close() - Method in class org.apache.tika.io.NullInputStream
-
Close this input stream - resets the internal state to
the initial values.
- close() - Method in class org.apache.tika.io.ProxyInputStream
-
Invokes the delegate's close()
method.
- close() - Method in class org.apache.tika.io.TemporaryResources
-
Closes all tracked resources.
- close() - Method in class org.apache.tika.io.TikaInputStream
-
- close() - Method in class org.apache.tika.language.detect.LanguageWriter
-
Ignored.
- close() - Method in class org.apache.tika.language.ProfilingWriter
-
Deprecated.
- close() - Method in class org.apache.tika.parser.mp4.DirectFileReadDataSource
-
- close() - Method in class org.apache.tika.parser.ParsingReader
-
Closes the read end of the pipe.
- close() - Method in class org.apache.tika.utils.RereadableInputStream
-
Closes the input stream and removes the temporary file if one was
created.
- ClosedInputStream - Class in org.apache.tika.io
-
Closed input stream.
- ClosedInputStream() - Constructor for class org.apache.tika.io.ClosedInputStream
-
- closeQuietly(Reader) - Static method in class org.apache.tika.io.IOUtils
-
Unconditionally close an Reader
.
- closeQuietly(Channel) - Static method in class org.apache.tika.io.IOUtils
-
Unconditionally close a Channel
.
- closeQuietly(Writer) - Static method in class org.apache.tika.io.IOUtils
-
Unconditionally close a Writer
.
- closeQuietly(InputStream) - Static method in class org.apache.tika.io.IOUtils
-
Unconditionally close an InputStream
.
- closeQuietly(OutputStream) - Static method in class org.apache.tika.io.IOUtils
-
Unconditionally close an OutputStream
.
- CloseShieldInputStream - Class in org.apache.tika.io
-
Proxy stream that prevents the underlying input stream from being closed.
- CloseShieldInputStream(InputStream) - Constructor for class org.apache.tika.io.CloseShieldInputStream
-
Creates a proxy that shields the given input stream from being
closed.
- COLOR_MODE - Static variable in interface org.apache.tika.metadata.Photoshop
-
- COLUMN_COUNT - Static variable in interface org.apache.tika.metadata.Database
-
- COLUMN_NAME - Static variable in interface org.apache.tika.metadata.Database
-
- COMMAND_LINE - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- COMMAND_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
-
- CommandLineParserBuilder - Class in org.apache.tika.batch.builders
-
Reads configurable options from a config file and returns org.apache.commons.cli.Options
object to be used in commandline parser.
- CommandLineParserBuilder() - Constructor for class org.apache.tika.batch.builders.CommandLineParserBuilder
-
- COMMENT - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- COMMENT_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- COMMENTS - Static variable in interface org.apache.tika.metadata.MSOffice
-
Deprecated.
- COMMENTS - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- COMMENTS - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- CommonsDigester - Class in org.apache.tika.parser.utils
-
- CommonsDigester(int, CommonsDigester.DigestAlgorithm...) - Constructor for class org.apache.tika.parser.utils.CommonsDigester
-
- CommonsDigester.DigestAlgorithm - Enum in org.apache.tika.parser.utils
-
- COMP_OBJ - Static variable in class org.apache.tika.parser.microsoft.POIFSContainerDetector
-
Some other kind of embedded document, in a CompObj container within another OLE2 document
- COMPANY - Static variable in interface org.apache.tika.metadata.MSOffice
-
Deprecated.
- COMPANY - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- compare(String, String) - Method in class org.apache.tika.metadata.serialization.PrettyMetadataKeyComparator
-
- compareTo(Property) - Method in class org.apache.tika.metadata.Property
-
- compareTo(MediaType) - Method in class org.apache.tika.mime.MediaType
-
- compareTo(MimeType) - Method in class org.apache.tika.mime.MimeType
-
- compareTo(CharsetMatch) - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Compare to other CharsetMatch objects.
- COMPILATION - Static variable in interface org.apache.tika.metadata.XMPDM
-
"An album created by various artists."
- COMPOSER - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The composer's name."
- composite(Property, Property[]) - Static method in class org.apache.tika.metadata.Property
-
Constructs a new composite property from the given primary and array of secondary properties.
- CompositeDetector - Class in org.apache.tika.detect
-
Content type detector that combines multiple different detection mechanisms.
- CompositeDetector(MediaTypeRegistry, List<Detector>, Collection<Class<? extends Detector>>) - Constructor for class org.apache.tika.detect.CompositeDetector
-
- CompositeDetector(MediaTypeRegistry, List<Detector>) - Constructor for class org.apache.tika.detect.CompositeDetector
-
- CompositeDetector(List<Detector>) - Constructor for class org.apache.tika.detect.CompositeDetector
-
- CompositeDetector(Detector...) - Constructor for class org.apache.tika.detect.CompositeDetector
-
- CompositeExternalParser - Class in org.apache.tika.parser.external
-
A Composite Parser that wraps up all the available External Parsers,
and provides an easy way to access them.
- CompositeExternalParser() - Constructor for class org.apache.tika.parser.external.CompositeExternalParser
-
- CompositeExternalParser(MediaTypeRegistry) - Constructor for class org.apache.tika.parser.external.CompositeExternalParser
-
- CompositeMatcher - Class in org.apache.tika.sax.xpath
-
Composite XPath evaluation state.
- CompositeMatcher(Matcher, Matcher) - Constructor for class org.apache.tika.sax.xpath.CompositeMatcher
-
- CompositeParser - Class in org.apache.tika.parser
-
Composite parser that delegates parsing tasks to a component parser
based on the declared content type of the incoming document.
- CompositeParser(MediaTypeRegistry, List<Parser>, Collection<Class<? extends Parser>>) - Constructor for class org.apache.tika.parser.CompositeParser
-
- CompositeParser(MediaTypeRegistry, List<Parser>) - Constructor for class org.apache.tika.parser.CompositeParser
-
- CompositeParser(MediaTypeRegistry, Parser...) - Constructor for class org.apache.tika.parser.CompositeParser
-
- CompositeParser() - Constructor for class org.apache.tika.parser.CompositeParser
-
- CompositeTagHandler - Class in org.apache.tika.parser.mp3
-
Takes an array of
ID3Tags
in preference order, and when asked for
a given tag, will return it from the first
ID3Tags
that has it.
- CompositeTagHandler(ID3Tags[]) - Constructor for class org.apache.tika.parser.mp3.CompositeTagHandler
-
- CompressorParser - Class in org.apache.tika.parser.pkg
-
Parser for various compression formats.
- CompressorParser() - Constructor for class org.apache.tika.parser.pkg.CompressorParser
-
- CompressorParserOptions - Interface in org.apache.tika.parser.pkg
-
Interface for setting options for the
CompressorParser
by passing
via the
ParseContext
.
- ConcurrentUtils - Class in org.apache.tika.utils
-
Utility Class for Concurrency in Tika
- ConcurrentUtils() - Constructor for class org.apache.tika.utils.ConcurrentUtils
-
- ConfigurableThreadPoolExecutor - Interface in org.apache.tika.concurrent
-
Allows Thread Pool to be Configurable.
- configure(PDF2XHTML) - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
Configures the given pdf2XHTML.
- ConsumersManager - Class in org.apache.tika.batch
-
Simple interface around a collection of consumers that allows
for initializing and shutting shared resources (e.g.
- ConsumersManager(List<FileResourceConsumer>) - Constructor for class org.apache.tika.batch.ConsumersManager
-
- CONTACT - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- CONTACT_INFO_ADDRESS - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information address part.
- CONTACT_INFO_CITY - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information city part.
- CONTACT_INFO_COUNTRY - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information country part.
- CONTACT_INFO_EMAIL - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information email address part.
- CONTACT_INFO_PHONE - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information phone number part.
- CONTACT_INFO_POSTAL_CODE - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information part denoting the local postal code.
- CONTACT_INFO_STATE_PROVINCE - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information part denoting regional information such as state or province.
- CONTACT_INFO_WEB_URL - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information web address part.
- ContainerExtractor - Interface in org.apache.tika.extractor
-
Tika container extractor interface.
- contains(String, String, String) - Method in class org.apache.tika.language.translate.CachedTranslator
-
Check whether this CachedTranslator's cache contains a translation of the text from the
source language to the target language.
- contains(String, String) - Method in class org.apache.tika.language.translate.CachedTranslator
-
Check whether this CachedTranslator's cache contains a translation of the text to the target language,
attempting to auto-detect the source language.
- CONTENT - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CONTENT_DISPOSITION - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_ENCODING - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_LANGUAGE - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_LENGTH - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_LOCATION - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_MD5 - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_STATUS - Static variable in interface org.apache.tika.metadata.MSOffice
-
Deprecated.
- CONTENT_STATUS - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
-
The status of the content.
- CONTENT_TYPE - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_TYPE_HINT - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
This is currently used to identify Content-Type that may be
included within a document, such as in html documents
(e.g.
- contentEquals(InputStream, InputStream) - Static method in class org.apache.tika.io.IOUtils
-
Compare the contents of two Streams to determine if they are equal or
not.
- contentEquals(Reader, Reader) - Static method in class org.apache.tika.io.IOUtils
-
Compare the contents of two Readers to determine if they are equal or
not.
- ContentHandlerDecorator - Class in org.apache.tika.sax
-
- ContentHandlerDecorator(ContentHandler) - Constructor for class org.apache.tika.sax.ContentHandlerDecorator
-
Creates a decorator for the given SAX event handler.
- ContentHandlerDecorator() - Constructor for class org.apache.tika.sax.ContentHandlerDecorator
-
Creates a decorator that by default forwards incoming SAX events to
a dummy content handler that simply ignores all the events.
- ContentHandlerExample - Class in org.apache.tika.example
-
Examples of using different Content Handlers to
get different parts of the file's contents
- ContentHandlerExample() - Constructor for class org.apache.tika.example.ContentHandlerExample
-
- ContentHandlerFactory - Interface in org.apache.tika.sax
-
Interface to allow easier injection of code for getting a new ContentHandler
- CONTRIBUTOR - Static variable in interface org.apache.tika.metadata.DublinCore
-
An entity responsible for making contributions to the content of the
resource.
- CONTRIBUTOR - Static variable in class org.apache.tika.metadata.Metadata
-
- CONTRIBUTOR - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- CONTROL_DATA - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- CONTROLLED_VOCABULARY_TERM - Static variable in interface org.apache.tika.metadata.IPTC
-
A term to describe the content of the image by a value from a Controlled
Vocabulary.
- CONVENTIONS - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- convert(Object) - Static method in class org.apache.tika.metadata.XMPDM.ChannelTypePropertyConverter
-
Deprecated.
How a standalone converter might work
- convert(Metadata) - Static method in class org.apache.tika.xmp.convert.TikaToXMP
-
- convert(Metadata, String) - Static method in class org.apache.tika.xmp.convert.TikaToXMP
-
Convert the given Tika metadata map to XMP object.
- convertAndSet(Metadata, Object) - Static method in class org.apache.tika.metadata.XMPDM.ChannelTypePropertyConverter
-
Deprecated.
How convert+set might work
- converttoInt(byte[]) - Static method in class org.apache.tika.parser.image.ICNSType
-
- convertToJSONArray(JSONObject, String) - Method in class org.apache.tika.parser.ner.grobid.GrobidNERecogniser
-
Converts JSON Object to JSON Array
- convertToJSONObject(String) - Method in class org.apache.tika.parser.ner.grobid.GrobidNERecogniser
-
Parses a JSON String and converts it to a JSON Object
- copy(InputStream, OutputStream) - Static method in class org.apache.tika.io.IOUtils
-
Copy bytes from an InputStream
to an
OutputStream
.
- copy(InputStream, Writer) - Static method in class org.apache.tika.io.IOUtils
-
Copy bytes from an InputStream
to chars on a
Writer
using the default character encoding of the platform.
- copy(InputStream, Writer, String) - Static method in class org.apache.tika.io.IOUtils
-
Copy bytes from an InputStream
to chars on a
Writer
using the specified character encoding.
- copy(Reader, Writer) - Static method in class org.apache.tika.io.IOUtils
-
Copy chars from a Reader
to a Writer
.
- copy(Reader, OutputStream) - Static method in class org.apache.tika.io.IOUtils
-
Copy chars from a Reader
to bytes on an
OutputStream
using the default character encoding of the
platform, and calling flush.
- copy(Reader, OutputStream, String) - Static method in class org.apache.tika.io.IOUtils
-
Copy chars from a Reader
to bytes on an
OutputStream
using the specified character encoding, and
calling flush.
- copyLarge(InputStream, OutputStream) - Static method in class org.apache.tika.io.IOUtils
-
Copy bytes from a large (over 2GB) InputStream
to an
OutputStream
.
- copyLarge(Reader, Writer) - Static method in class org.apache.tika.io.IOUtils
-
Copy chars from a large (over 2GB) Reader
to a Writer
.
- copyOfRange(byte[], int, int) - Static method in class org.apache.tika.parser.chm.core.ChmCommons
-
- COPYRIGHT - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The copyright information."
- COPYRIGHT_NOTICE - Static variable in interface org.apache.tika.metadata.IPTC
-
Contains any necessary copyright notice for claiming the intellectual
property for this item and should identify the current owner of the
copyright for the item.
- COPYRIGHT_OWNER - Static variable in interface org.apache.tika.metadata.IPTC
-
Owner or owners of the copyright in the licensed image.
- COPYRIGHT_OWNER_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
The ID of the owner or owners of the copyright in the licensed image.
- COPYRIGHT_OWNER_ID_WRONG_CASE - Static variable in interface org.apache.tika.metadata.IPTC
-
- COPYRIGHT_OWNER_NAME - Static variable in interface org.apache.tika.metadata.IPTC
-
The name of the owner or owners of the copyright in the licensed image.
- CoreNLPNERecogniser - Class in org.apache.tika.parser.ner.corenlp
-
This class offers an implementation of
NERecogniser
based on
CRF classifiers from Stanford CoreNLP.
- CoreNLPNERecogniser() - Constructor for class org.apache.tika.parser.ner.corenlp.CoreNLPNERecogniser
-
- CoreNLPNERecogniser(String) - Constructor for class org.apache.tika.parser.ner.corenlp.CoreNLPNERecogniser
-
Creates a NERecogniser by loading model from given path
- count() - Method in class org.apache.tika.detect.TextStatistics
-
Returns the total number of bytes seen so far.
- count(int) - Method in class org.apache.tika.detect.TextStatistics
-
Returns the number of occurrences of the given byte.
- countControl() - Method in class org.apache.tika.detect.TextStatistics
-
Counts control characters (i.e.
- countEightBit() - Method in class org.apache.tika.detect.TextStatistics
-
Counts eight bit characters, i.e.
- CountingInputStream - Class in org.apache.tika.io
-
A decorating input stream that counts the number of bytes that have passed
through the stream so far.
- CountingInputStream(InputStream) - Constructor for class org.apache.tika.io.CountingInputStream
-
Constructs a new CountingInputStream.
- COUNTRY - Static variable in interface org.apache.tika.metadata.IPTC
-
Full name of the country the content is focussing on -- either the
country shown in visual media or referenced in text or audio media.
- COUNTRY - Static variable in interface org.apache.tika.metadata.Photoshop
-
- COUNTRY_CODE - Static variable in interface org.apache.tika.metadata.IPTC
-
Code of the country the content is focussing on -- either the country
shown in visual media or referenced in text or audio media.
- countSafeAscii() - Method in class org.apache.tika.detect.TextStatistics
-
Counts "safe" (i.e.
- COVERAGE - Static variable in interface org.apache.tika.metadata.DublinCore
-
The extent or scope of the content of the resource.
- COVERAGE - Static variable in class org.apache.tika.metadata.Metadata
-
- COVERAGE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- create(String, InputStream, String) - Static method in class org.apache.tika.language.LanguageProfilerBuilder
-
Deprecated.
Creates a new Language profile from (preferably quite large - 5-10k of
lines) text file
- create() - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates an empty instance; same as calling new MimeTypes().
- create(Document) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates and returns a MimeTypes instance from the specified document.
- create(InputStream...) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates and returns a MimeTypes instance from the specified input stream.
- create(InputStream) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
- create(URL...) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates and returns a MimeTypes instance from the resource
at the location specified by the URL.
- create(URL) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
- create(String) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates and returns a MimeTypes instance from the specified file path,
as interpreted by the class loader in getResource().
- create(String, String) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates and returns a MimeTypes instance.
- create(String, String, ClassLoader) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates and returns a MimeTypes instance.
- create() - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
-
- create(ServiceLoader) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
-
- create(String, ServiceLoader) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
-
- create(URL...) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
-
- CREATE_DATE - Static variable in interface org.apache.tika.metadata.XMP
-
The date and time the resource was created.
- createArrayProperty(Property, String, String, int) - Method in class org.apache.tika.xmp.convert.AbstractConverter
-
- createArrayProperty(String, String, String, int) - Method in class org.apache.tika.xmp.convert.AbstractConverter
-
Creates an array property from a list of values.
- createCommaSeparatedArray(Property, String, String, int) - Method in class org.apache.tika.xmp.convert.AbstractConverter
-
- createCommaSeparatedArray(String, String, String, int) - Method in class org.apache.tika.xmp.convert.AbstractConverter
-
Creates an array property from a comma separated list.
- CREATED - Static variable in interface org.apache.tika.metadata.DublinCore
-
Date of creation of the resource.
- CREATED - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- createFrameIfPresent(InputStream) - Static method in class org.apache.tika.parser.mp3.ID3v2Frame
-
Returns the next ID3v2 Frame in
the file, or null if the next batch of data
doesn't correspond to either an ID3v2 header.
- createLangAltProperty(Property, String, String) - Method in class org.apache.tika.xmp.convert.AbstractConverter
-
- createLangAltProperty(String, String, String) - Method in class org.apache.tika.xmp.convert.AbstractConverter
-
Creates a language alternative property in the x-default language
- createParser() - Static method in class org.apache.tika.server.resource.TikaResource
-
- createProperty(Property, String, String) - Method in class org.apache.tika.xmp.convert.AbstractConverter
-
- createProperty(String, String, String) - Method in class org.apache.tika.xmp.convert.AbstractConverter
-
Creates a simple property.
- createTempFile() - Method in class org.apache.tika.io.TemporaryResources
-
Creates a temporary file that will automatically be deleted when
the
TemporaryResources.close()
method is called, returning its path.
- createTemporaryFile() - Method in class org.apache.tika.io.TemporaryResources
-
- CREATION_DATE - Static variable in interface org.apache.tika.metadata.MSOffice
-
Deprecated.
- CREATION_DATE - Static variable in interface org.apache.tika.metadata.Office
-
When was the document created?
- CreativeCommons - Interface in org.apache.tika.metadata
-
A collection of Creative Commons properties names.
- CREATOR - Static variable in interface org.apache.tika.metadata.DublinCore
-
An entity primarily responsible for making the content of the resource.
- CREATOR - Static variable in interface org.apache.tika.metadata.IPTC
-
Contains the name of the person who created the content of this item, a
photographer for photos, a graphic artist for graphics, or a writer for
textual news, but in cases where the photographer should not be
identified the name of a company or organisation may be appropriate.
- CREATOR - Static variable in class org.apache.tika.metadata.Metadata
-
- CREATOR - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- CREATOR_TOOL - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- CREATOR_TOOL - Static variable in interface org.apache.tika.metadata.XMP
-
The name of the first known tool used to create the resource.
- CREATORS_CONTACT_INFO - Static variable in interface org.apache.tika.metadata.IPTC
-
The creator's contact information provides all necessary information to
get in contact with the creator of this item and comprises a set of
sub-properties for proper addressing.
- CREATORS_JOB_TITLE - Static variable in interface org.apache.tika.metadata.IPTC
-
Contains the job title of the person who created the content of this
item.
- CREDIT - Static variable in interface org.apache.tika.metadata.Photoshop
-
- CREDIT_LINE - Static variable in interface org.apache.tika.metadata.IPTC
-
The credit to person(s) and/or organisation(s) required by the supplier
of the item to be used when published.
- CryptoParser - Class in org.apache.tika.parser
-
Decrypts the incoming document stream and delegates further parsing to
another parser instance.
- CryptoParser(String, Provider, Set<MediaType>) - Constructor for class org.apache.tika.parser.CryptoParser
-
- CryptoParser(String, Set<MediaType>) - Constructor for class org.apache.tika.parser.CryptoParser
-
- CSVMessageBodyWriter - Class in org.apache.tika.server.writer
-
- CSVMessageBodyWriter() - Constructor for class org.apache.tika.server.writer.CSVMessageBodyWriter
-
- CTAKES_META_PREFIX - Static variable in class org.apache.tika.parser.ctakes.CTAKESContentHandler
-
- CTAKESAnnotationProperty - Enum in org.apache.tika.parser.ctakes
-
This enumeration includes the properties that an IdentifiedAnnotation
object can provide.
- CTAKESConfig - Class in org.apache.tika.parser.ctakes
-
- CTAKESConfig() - Constructor for class org.apache.tika.parser.ctakes.CTAKESConfig
-
Default constructor.
- CTAKESConfig(InputStream) - Constructor for class org.apache.tika.parser.ctakes.CTAKESConfig
-
Loads properties from InputStream and then tries to close InputStream.
- CTAKESContentHandler - Class in org.apache.tika.parser.ctakes
-
Class used to extract biomedical information while parsing.
- CTAKESContentHandler(ContentHandler, Metadata, CTAKESConfig) - Constructor for class org.apache.tika.parser.ctakes.CTAKESContentHandler
-
- CTAKESContentHandler(ContentHandler, Metadata) - Constructor for class org.apache.tika.parser.ctakes.CTAKESContentHandler
-
- CTAKESContentHandler() - Constructor for class org.apache.tika.parser.ctakes.CTAKESContentHandler
-
Default constructor.
- CTAKESParser - Class in org.apache.tika.parser.ctakes
-
CTAKESParser decorates a
Parser
and leverages on
CTAKESContentHandler
to extract biomedical information from
clinical text using Apache cTAKES.
- CTAKESParser() - Constructor for class org.apache.tika.parser.ctakes.CTAKESParser
-
Wraps the default Parser
- CTAKESParser(TikaConfig) - Constructor for class org.apache.tika.parser.ctakes.CTAKESParser
-
Wraps the default Parser for this Config
- CTAKESParser(Parser) - Constructor for class org.apache.tika.parser.ctakes.CTAKESParser
-
Wraps the specified Parser
- CTAKESSerializer - Enum in org.apache.tika.parser.ctakes
-
Enumeration for types of cTAKES (UIMA) CAS serializer supported by cTAKES.
- CTAKESUtils - Class in org.apache.tika.parser.ctakes
-
This class provides methods to extract biomedical information from plain text
using
CTAKESContentHandler
that relies on Apache cTAKES.
- CTAKESUtils() - Constructor for class org.apache.tika.parser.ctakes.CTAKESUtils
-
- customCompositeDetector() - Static method in class org.apache.tika.example.CustomMimeInfo
-
- CustomMimeInfo - Class in org.apache.tika.example
-
- CustomMimeInfo() - Constructor for class org.apache.tika.example.CustomMimeInfo
-
- customMimeInfo() - Static method in class org.apache.tika.example.CustomMimeInfo
-
- data - Variable in class org.apache.tika.parser.mp3.ID3v2Frame.RawTag
-
- Database - Interface in org.apache.tika.metadata
-
- DATE - Static variable in interface org.apache.tika.metadata.DublinCore
-
A date associated with an event in the life cycle of the resource.
- DATE - Static variable in class org.apache.tika.metadata.Metadata
-
- DATE - Static variable in interface org.apache.tika.parser.ner.NERecogniser
-
- DATE_CREATED - Static variable in interface org.apache.tika.metadata.IPTC
-
Designates the date and optionally the time the intellectual content was
created rather than the date of the creation of the physical
representation.
- DATE_CREATED - Static variable in interface org.apache.tika.metadata.Photoshop
-
- DATE_FILE - Static variable in class org.apache.tika.parser.ner.opennlp.OpenNLPNERecogniser
-
- DateUtils - Class in org.apache.tika.utils
-
Date related utility methods and constants
- DateUtils() - Constructor for class org.apache.tika.utils.DateUtils
-
- DcXMLParser - Class in org.apache.tika.parser.xml
-
Dublin Core metadata parser
- DcXMLParser() - Constructor for class org.apache.tika.parser.xml.DcXMLParser
-
- DECLARED_ENCODING - Static variable in class org.apache.tika.parser.txt.CharsetMatch
-
Bit flag indicating he match is based on the declared encoding.
- decode(String) - Static method in class org.apache.tika.mime.HexCoDec
-
Decode a hex string
- decode(char[]) - Static method in class org.apache.tika.mime.HexCoDec
-
Decode an array of hex chars
- decode(char[], int, int) - Static method in class org.apache.tika.mime.HexCoDec
-
Decode an array of hex chars.
- decompressConcatenated(Metadata) - Method in interface org.apache.tika.parser.pkg.CompressorParserOptions
-
- DEFAULT_CHARSET - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- DEFAULT_HOST - Static variable in class org.apache.tika.server.TikaServerCli
-
- DEFAULT_ID - Static variable in class org.apache.tika.language.translate.MicrosoftTranslator
-
- DEFAULT_MAX_QUEUE_SIZE - Static variable in class org.apache.tika.batch.builders.BatchProcessBuilder
-
- DEFAULT_MODEL_PATH - Static variable in class org.apache.tika.parser.ner.corenlp.CoreNLPNERecogniser
-
default Model path
- DEFAULT_MODELS - Static variable in class org.apache.tika.parser.ner.opennlp.OpenNLPNERecogniser
-
- DEFAULT_NER_IMPL - Static variable in class org.apache.tika.parser.ner.NamedEntityParser
-
- DEFAULT_NGRAM_LENGTH - Static variable in class org.apache.tika.language.LanguageProfile
-
Deprecated.
- DEFAULT_PORT - Static variable in class org.apache.tika.server.TikaServerCli
-
- DEFAULT_SECRET - Static variable in class org.apache.tika.language.translate.MicrosoftTranslator
-
- DefaultContentHandlerFactoryBuilder - Class in org.apache.tika.batch.builders
-
Builds BasicContentHandler with type defined by attribute "basicHandlerType"
with possible values: xml, html, text, body, ignore.
- DefaultContentHandlerFactoryBuilder() - Constructor for class org.apache.tika.batch.builders.DefaultContentHandlerFactoryBuilder
-
- DefaultDetector - Class in org.apache.tika.detect
-
- DefaultDetector(MimeTypes, ServiceLoader, Collection<Class<? extends Detector>>) - Constructor for class org.apache.tika.detect.DefaultDetector
-
- DefaultDetector(MimeTypes, ServiceLoader) - Constructor for class org.apache.tika.detect.DefaultDetector
-
- DefaultDetector(MimeTypes, ClassLoader) - Constructor for class org.apache.tika.detect.DefaultDetector
-
- DefaultDetector(ClassLoader) - Constructor for class org.apache.tika.detect.DefaultDetector
-
- DefaultDetector(MimeTypes) - Constructor for class org.apache.tika.detect.DefaultDetector
-
- DefaultDetector() - Constructor for class org.apache.tika.detect.DefaultDetector
-
- DefaultHtmlMapper - Class in org.apache.tika.parser.html
-
The default HTML mapping rules in Tika.
- DefaultHtmlMapper() - Constructor for class org.apache.tika.parser.html.DefaultHtmlMapper
-
- DefaultParser - Class in org.apache.tika.parser
-
- DefaultParser(MediaTypeRegistry, ServiceLoader, Collection<Class<? extends Parser>>) - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultParser(MediaTypeRegistry, ServiceLoader) - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultParser(MediaTypeRegistry, ClassLoader) - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultParser(ClassLoader) - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultParser(MediaTypeRegistry) - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultParser() - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultProbDetector - Class in org.apache.tika.detect
-
A version of
DefaultDetector
for probabilistic mime
detectors, which use statistical techniques to blend the
results of differing underlying detectors when attempting
to detect the type of a given file.
- DefaultProbDetector(ProbabilisticMimeDetectionSelector, ServiceLoader) - Constructor for class org.apache.tika.detect.DefaultProbDetector
-
- DefaultProbDetector(ProbabilisticMimeDetectionSelector, ClassLoader) - Constructor for class org.apache.tika.detect.DefaultProbDetector
-
- DefaultProbDetector(ClassLoader) - Constructor for class org.apache.tika.detect.DefaultProbDetector
-
- DefaultProbDetector(MimeTypes) - Constructor for class org.apache.tika.detect.DefaultProbDetector
-
- DefaultProbDetector() - Constructor for class org.apache.tika.detect.DefaultProbDetector
-
- DefaultTranslator - Class in org.apache.tika.language.translate
-
- DefaultTranslator(ServiceLoader) - Constructor for class org.apache.tika.language.translate.DefaultTranslator
-
- DefaultTranslator() - Constructor for class org.apache.tika.language.translate.DefaultTranslator
-
- DelegatingParser - Class in org.apache.tika.parser
-
Base class for parser implementations that want to delegate parts of the
task of parsing an input document to another parser.
- DelegatingParser() - Constructor for class org.apache.tika.parser.DelegatingParser
-
- deleteNamespace(String) - Static method in class org.apache.tika.xmp.XMPMetadata
-
Deletes a namespace from the registry.
- DERIVED_FROM_DOCUMENTID - Static variable in interface org.apache.tika.metadata.XMPMM
-
Document id for the document that this document
was derived from
- DERIVED_FROM_INSTANCEID - Static variable in interface org.apache.tika.metadata.XMPMM
-
Instance id for the document instance that this
document was derived from
- descend(String, String) - Method in class org.apache.tika.sax.xpath.ChildMatcher
-
- descend(String, String) - Method in class org.apache.tika.sax.xpath.CompositeMatcher
-
- descend(String, String) - Method in class org.apache.tika.sax.xpath.Matcher
-
Returns the XPath evaluation state that results from descending
to a child element with the given name.
- descend(String, String) - Method in class org.apache.tika.sax.xpath.NamedElementMatcher
-
- descend(String, String) - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
-
- describeMediaType() - Static method in class org.apache.tika.example.MediaTypeExample
-
- DescribeMetadata - Class in org.apache.tika.example
-
Print the supported Tika Metadata models and their fields.
- DescribeMetadata() - Constructor for class org.apache.tika.example.DescribeMetadata
-
- DESCRIPTION - Static variable in interface org.apache.tika.metadata.DublinCore
-
An account of the content of the resource.
- DESCRIPTION - Static variable in interface org.apache.tika.metadata.IPTC
-
A textual description, including captions, of the item's content,
particularly used where the object is not text.
- DESCRIPTION - Static variable in class org.apache.tika.metadata.Metadata
-
- DESCRIPTION - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- DESCRIPTION_WRITER - Static variable in interface org.apache.tika.metadata.IPTC
-
Identifier or the name of the person involved in writing, editing or
correcting the description of the content.
- deserialize(JsonElement, Type, JsonDeserializationContext) - Method in class org.apache.tika.metadata.serialization.JsonMetadataDeserializer
-
Deserializes a json object (equivalent to: Map)
into a Metadata object.
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.CompositeDetector
-
- detect(InputStream, Metadata) - Method in interface org.apache.tika.detect.Detector
-
Detects the content type of the given input document.
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.EmptyDetector
-
- detect(InputStream, Metadata) - Method in interface org.apache.tika.detect.EncodingDetector
-
Detects the character encoding of the given text document, or
null
if the encoding of the document can not be detected.
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.MagicDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.NameDetector
-
Detects the content type of an input document based on the document
name given in the input metadata.
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.TextDetector
-
Looks at the beginning of the document input stream to determine
whether the document is text or not.
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.TrainedModelDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.TypeDetector
-
Detects the content type of an input document based on a type hint
given in the input metadata.
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.ZeroSizeFileDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.example.EncryptedPrescriptionDetector
-
- detect() - Method in class org.apache.tika.language.detect.LanguageDetector
-
- detect(CharSequence) - Method in class org.apache.tika.language.detect.LanguageDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.mime.MimeTypes
-
Automatically detects the MIME type of a document based on magic
markers in the stream prefix and any given metadata hints.
- detect(InputStream, Metadata) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.parser.html.HtmlEncodingDetector
-
- detect(Set<String>) - Static method in class org.apache.tika.parser.microsoft.POIFSContainerDetector
-
- detect(Set<String>, DirectoryEntry) - Static method in class org.apache.tika.parser.microsoft.POIFSContainerDetector
-
Internal detection of the specific kind of OLE2 document, based on the
names of the top-level streams within the file.
- detect(InputStream, Metadata) - Method in class org.apache.tika.parser.microsoft.POIFSContainerDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.parser.pkg.ZipContainerDetector
-
- detect() - Method in class org.apache.tika.parser.txt.CharsetDetector
-
Return the charset that best matches the supplied input data.
- detect(InputStream, Metadata) - Method in class org.apache.tika.parser.txt.Icu4jEncodingDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.parser.txt.UniversalEncodingDetector
-
- detect(InputStream, HttpHeaders, UriInfo) - Method in class org.apache.tika.server.resource.DetectorResource
-
- detect(InputStream) - Method in class org.apache.tika.server.resource.LanguageResource
-
- detect(String) - Method in class org.apache.tika.server.resource.LanguageResource
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.Tika
-
Detects the media type of the given document.
- detect(InputStream, String) - Method in class org.apache.tika.Tika
-
Detects the media type of the given document.
- detect(InputStream) - Method in class org.apache.tika.Tika
-
Detects the media type of the given document.
- detect(byte[], String) - Method in class org.apache.tika.Tika
-
Detects the media type of the given document.
- detect(byte[]) - Method in class org.apache.tika.Tika
-
Detects the media type of the given document.
- detect(Path) - Method in class org.apache.tika.Tika
-
Detects the media type of the file at the given path.
- detect(File) - Method in class org.apache.tika.Tika
-
Detects the media type of the given file.
- detect(URL) - Method in class org.apache.tika.Tika
-
Detects the media type of the resource at the given URL.
- detect(String) - Method in class org.apache.tika.Tika
-
Detects the media type of a document with the given file name.
- detectAll() - Method in class org.apache.tika.langdetect.OptimaizeLangDetector
-
- detectAll() - Method in class org.apache.tika.langdetect.TextLangDetector
-
- detectAll() - Method in class org.apache.tika.language.detect.LanguageDetector
-
Detect languages based on previously submitted text (via addText calls).
- detectAll(String) - Method in class org.apache.tika.language.detect.LanguageDetector
-
Utility wrapper that detects the language of a given chunk of text.
- detectAll() - Method in class org.apache.tika.parser.txt.CharsetDetector
-
Return an array of all charsets that appear to be plausible
matches with the input data.
- detectFilename(MultivaluedMap<String, String>) - Static method in class org.apache.tika.server.resource.TikaResource
-
- detectLanguage(String) - Method in class org.apache.tika.example.LanguageDetectorExample
-
- detectLanguage(String) - Method in class org.apache.tika.language.translate.AbstractTranslator
-
- detectOfficeOpenXML(OPCPackage) - Static method in class org.apache.tika.parser.pkg.ZipContainerDetector
-
Detects the type of an OfficeOpenXML (OOXML) file from
opened Package
- Detector - Interface in org.apache.tika.detect
-
Content type detector.
- DetectorResource - Class in org.apache.tika.server.resource
-
- DetectorResource() - Constructor for class org.apache.tika.server.resource.DetectorResource
-
- detectType(ZipArchiveEntry, ZipFile) - Static method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
-
- detectType(ZipArchiveEntry, ZipArchiveInputStream) - Static method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
-
- detectType(POIFSFileSystem) - Static method in enum org.apache.tika.parser.microsoft.OfficeParser.POIFSDocumentType
-
- detectType(NPOIFSFileSystem) - Static method in enum org.apache.tika.parser.microsoft.OfficeParser.POIFSDocumentType
-
- detectType(DirectoryEntry) - Static method in enum org.apache.tika.parser.microsoft.OfficeParser.POIFSDocumentType
-
- detectWithCustomConfig(String) - Static method in class org.apache.tika.example.AdvancedTypeDetector
-
- detectWithCustomDetector(String) - Static method in class org.apache.tika.example.AdvancedTypeDetector
-
- DIFContentHandler - Class in org.apache.tika.parser.dif
-
- DIFContentHandler(ContentHandler, Metadata) - Constructor for class org.apache.tika.parser.dif.DIFContentHandler
-
- DIFContentHandler - Class in org.apache.tika.sax
-
- DIFContentHandler(ContentHandler, Metadata) - Constructor for class org.apache.tika.sax.DIFContentHandler
-
- DIFParser - Class in org.apache.tika.parser.dif
-
- DIFParser() - Constructor for class org.apache.tika.parser.dif.DIFParser
-
- digest(InputStream, Metadata, ParseContext) - Method in interface org.apache.tika.parser.DigestingParser.Digester
-
Digests an InputStream and sets the appropriate value(s) in the metadata.
- digest(InputStream, Metadata, ParseContext) - Method in class org.apache.tika.parser.utils.CommonsDigester
-
- DigestingAutoDetectParserFactory - Class in org.apache.tika.batch
-
- DigestingAutoDetectParserFactory() - Constructor for class org.apache.tika.batch.DigestingAutoDetectParserFactory
-
- DigestingParser - Class in org.apache.tika.parser
-
- DigestingParser(Parser, DigestingParser.Digester) - Constructor for class org.apache.tika.parser.DigestingParser
-
Creates a decorator for the given parser.
- DigestingParser.Digester - Interface in org.apache.tika.parser
-
Interface for optional digester, if specified during construction.
- DIGITAL_IMAGE_GUID - Static variable in interface org.apache.tika.metadata.IPTC
-
Globally unique identifier for the item.
- DIGITAL_SOURCE_FILE_TYPE - Static variable in interface org.apache.tika.metadata.IPTC
-
Deprecated.
- DIGITAL_SOURCE_TYPE - Static variable in interface org.apache.tika.metadata.IPTC
-
The type of the source of this digital image
- DirectFileReadDataSource - Class in org.apache.tika.parser.mp4
-
A
DataSource
implementation that relies on direct reads from a
RandomAccessFile
.
- DirectFileReadDataSource(File) - Constructor for class org.apache.tika.parser.mp4.DirectFileReadDataSource
-
- DirectoryListingEntry - Class in org.apache.tika.parser.chm.accessor
-
The format of a directory listing entry is as follows: BYTE: length of name
BYTEs: name (UTF-8 encoded) ENCINT: content section ENCINT: offset ENCINT:
length The offset is from the beginning of the content section the file is
in, after the section has been decompressed (if appropriate).
- DirectoryListingEntry() - Constructor for class org.apache.tika.parser.chm.accessor.DirectoryListingEntry
-
- DirectoryListingEntry(int, String, ChmCommons.EntryType, int, int) - Constructor for class org.apache.tika.parser.chm.accessor.DirectoryListingEntry
-
Constructs directoryListingEntry
- DirListParser - Class in org.apache.tika.example
-
Parses the output of /bin/ls and counts the number of files and the number of
executables using Tika.
- DirListParser() - Constructor for class org.apache.tika.example.DirListParser
-
- DISC_NUMBER - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The disc number for part of an album set."
- DisplayMetInstance - Class in org.apache.tika.example
-
Grabs a PDF file from a URL and prints its
Metadata
- DisplayMetInstance() - Constructor for class org.apache.tika.example.DisplayMetInstance
-
- dispose() - Method in class org.apache.tika.example.LazyTextExtractorField
-
Releases all resources associated with this field.
- dispose() - Method in class org.apache.tika.io.TemporaryResources
-
- distance(LanguageProfile) - Method in class org.apache.tika.language.LanguageProfile
-
Deprecated.
Calculates the geometric distance between this and the given
other language profile.
- DOC - Static variable in class org.apache.tika.parser.microsoft.POIFSContainerDetector
-
Microsoft Word
- DOC_SECURITY - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- DOCUMENTID - Static variable in interface org.apache.tika.metadata.XMPMM
-
The common identifier for all versions and renditions of a resource.
- DocumentSelector - Interface in org.apache.tika.extractor
-
Interface for different document selection strategies for purposes like
embedded document extraction by a
ContainerExtractor
instance.
- doubleByte - Variable in class org.apache.tika.parser.mp3.ID3v2Frame.TextEncoding
-
- DRAW_NS - Static variable in class org.apache.tika.parser.odf.OpenDocumentContentParser
-
- DublinCore - Interface in org.apache.tika.metadata
-
A collection of Dublin Core metadata names.
- DumpTikaConfigExample - Class in org.apache.tika.example
-
This class shows how to dump a TikaConfig object to a configuration file.
- DumpTikaConfigExample() - Constructor for class org.apache.tika.example.DumpTikaConfigExample
-
- DURATION - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The duration of the media file."
- DurationFormatUtils - Class in org.apache.tika.util
-
Functionality and naming conventions (roughly) copied from org.apache.commons.lang3
so that we didn't have to add another dependency.
- DurationFormatUtils() - Constructor for class org.apache.tika.util.DurationFormatUtils
-
- DWGParser - Class in org.apache.tika.parser.dwg
-
DWG (CAD Drawing) parser.
- DWGParser() - Constructor for class org.apache.tika.parser.dwg.DWGParser
-
- GDALParser - Class in org.apache.tika.parser.gdal
-
- GDALParser() - Constructor for class org.apache.tika.parser.gdal.GDALParser
-
- GENERAL_EMBEDDED - Static variable in class org.apache.tika.parser.microsoft.POIFSContainerDetector
-
General embedded document type within an OLE2 container
- generateFooter(StringBuffer) - Method in class org.apache.tika.server.HTMLHelper
-
- generateHeader(StringBuffer, String) - Method in class org.apache.tika.server.HTMLHelper
-
Generates the HTML Header for the user facing page, adding
in the given title as required
- generateRSS(File) - Method in class org.apache.tika.example.RecentFiles
-
- GenericConverter - Class in org.apache.tika.xmp.convert
-
Trys to convert as much of the properties in the Metadata
map to XMP namespaces.
- GenericConverter() - Constructor for class org.apache.tika.xmp.convert.GenericConverter
-
- GENRE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The name of the genre."
- GENRES - Static variable in interface org.apache.tika.parser.mp3.ID3Tags
-
List of predefined genres.
- GeoGazetteerClient - Class in org.apache.tika.parser.geo.topic.gazetteer
-
- GeoGazetteerClient(String) - Constructor for class org.apache.tika.parser.geo.topic.gazetteer.GeoGazetteerClient
-
Pass URL on which lucene-geo-gazetteer is available - eg.
- GeoGazetteerClient(GeoParserConfig) - Constructor for class org.apache.tika.parser.geo.topic.gazetteer.GeoGazetteerClient
-
- Geographic - Interface in org.apache.tika.metadata
-
Geographic schema.
- GeographicInformationParser - Class in org.apache.tika.parser.geoinfo
-
- GeographicInformationParser() - Constructor for class org.apache.tika.parser.geoinfo.GeographicInformationParser
-
- geoInfoType - Static variable in class org.apache.tika.parser.geoinfo.GeographicInformationParser
-
- GeoParser - Class in org.apache.tika.parser.geo.topic
-
- GeoParser() - Constructor for class org.apache.tika.parser.geo.topic.GeoParser
-
- GeoParserConfig - Class in org.apache.tika.parser.geo.topic
-
- GeoParserConfig() - Constructor for class org.apache.tika.parser.geo.topic.GeoParserConfig
-
- GeoTag - Class in org.apache.tika.parser.geo.topic
-
- GeoTag() - Constructor for class org.apache.tika.parser.geo.topic.GeoTag
-
- get(InputStream) - Static method in class org.apache.tika.io.TaggedInputStream
-
Casts or wraps the given stream to a TaggedInputStream instance.
- get(InputStream, TemporaryResources) - Static method in class org.apache.tika.io.TikaInputStream
-
Casts or wraps the given stream to a TikaInputStream instance.
- get(InputStream) - Static method in class org.apache.tika.io.TikaInputStream
-
Casts or wraps the given stream to a TikaInputStream instance.
- get(byte[]) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the given array of bytes.
- get(byte[], Metadata) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the given array of bytes.
- get(Path) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the file at the given path.
- get(Path, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the file at the given path.
- get(File) - Static method in class org.apache.tika.io.TikaInputStream
-
- get(File, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
-
- get(Blob) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the given database BLOB.
- get(Blob, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the given database BLOB.
- get(URI) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the resource at the given URI.
- get(URI, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the resource at the given URI.
- get(URL) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the resource at the given URL.
- get(URL, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the resource at the given URL.
- get(String) - Method in class org.apache.tika.metadata.Metadata
-
Get the value associated to a metadata name.
- get(Property) - Method in class org.apache.tika.metadata.Metadata
-
Returns the value (if any) of the identified metadata property.
- get(String) - Static method in class org.apache.tika.metadata.Property
-
Retrieve the property object that corresponds to the given key
- get(Class<T>) - Method in class org.apache.tika.parser.ParseContext
-
Returns the object in this context that implements the given interface.
- get(Class<T>, T) - Method in class org.apache.tika.parser.ParseContext
-
Returns the object in this context that implements the given interface,
or the given default value if such an object is not found.
- get() - Method in enum org.apache.tika.parser.strings.StringsEncoding
-
- get(String) - Method in class org.apache.tika.xmp.XMPMetadata
-
Returns the value of a simple property or the first one of an array.
- get(Property) - Method in class org.apache.tika.xmp.XMPMetadata
-
- get7BitsInt(byte[], int) - Static method in class org.apache.tika.parser.mp3.ID3v2Frame
-
AKA a Synchsafe integer.
- getAccessChecker() - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
- getAcronym() - Method in class org.apache.tika.mime.MimeType
-
Returns an acronym for this mime type.
- getAdded() - Method in class org.apache.tika.batch.FileResourceCrawler
-
- getAdded() - Method in class org.apache.tika.batch.ParallelFileProcessingResult
-
- getAdditionalNamespaces() - Method in class org.apache.tika.xmp.convert.AbstractConverter
-
Every Converter has to provide information about namespaces that are used additionally to the
core set of XMP namespaces.
- getAdditionalNamespaces() - Method in class org.apache.tika.xmp.convert.GenericConverter
-
- getAdditionalNamespaces() - Method in class org.apache.tika.xmp.convert.MSOfficeBinaryConverter
-
- getAdditionalNamespaces() - Method in class org.apache.tika.xmp.convert.MSOfficeXMLConverter
-
- getAdditionalNamespaces() - Method in class org.apache.tika.xmp.convert.OpenDocumentConverter
-
- getAdditionalNamespaces() - Method in class org.apache.tika.xmp.convert.RTFConverter
-
- getAdmin1Code() - Method in class org.apache.tika.parser.geo.topic.gazetteer.Location
-
- getAdmin2Code() - Method in class org.apache.tika.parser.geo.topic.gazetteer.Location
-
- getAeDescriptorPath() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
Returns the path to XML descriptor for AnalysisEngine.
- getAlbum() - Method in class org.apache.tika.parser.mp3.CompositeTagHandler
-
- getAlbum() - Method in interface org.apache.tika.parser.mp3.ID3Tags
-
- getAlbum() - Method in class org.apache.tika.parser.mp3.ID3v1Handler
-
- getAlbum() - Method in class org.apache.tika.parser.mp3.ID3v22Handler
-
- getAlbum() - Method in class org.apache.tika.parser.mp3.ID3v23Handler
-
- getAlbum() - Method in class org.apache.tika.parser.mp3.ID3v24Handler
-
- getAlbumArtist() - Method in class org.apache.tika.parser.mp3.CompositeTagHandler
-
- getAlbumArtist() - Method in interface org.apache.tika.parser.mp3.ID3Tags
-
The Artist for the overall album / compilation of albums
- getAlbumArtist() - Method in class org.apache.tika.parser.mp3.ID3v1Handler
-
ID3v1 doesn't have album-wide artists,
so returns null;
- getAlbumArtist() - Method in class org.apache.tika.parser.mp3.ID3v22Handler
-
- getAlbumArtist() - Method in class org.apache.tika.parser.mp3.ID3v23Handler
-
- getAlbumArtist() - Method in class org.apache.tika.parser.mp3.ID3v24Handler
-
- getAliases(MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Returns the set of known aliases of the given canonical media type.
- getAlignedLenTable() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getAlignedTreeTable() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getAllComponentParsers() - Method in class org.apache.tika.parser.CompositeParser
-
Returns all parsers registered with the Composite Parser,
including ones which may not currently be active.
- getAllComponentParsers() - Method in class org.apache.tika.parser.DefaultParser
-
- getAllDetectableCharsets() - Static method in class org.apache.tika.parser.txt.CharsetDetector
-
Get the names of all char sets that can be recognized by the char set detector.
- getAllNameEntitiesfromInput(InputStream) - Method in class org.apache.tika.parser.geo.topic.NameEntityExtractor
-
- getAllTagHandlers(InputStream, ContentHandler) - Static method in class org.apache.tika.parser.mp3.Mp3Parser
-
Scans the MP3 frames for ID3 tags, and creates ID3Tag Handlers
for each supported set of tags.
- getAnalysisEngine(String, String, String) - Static method in class org.apache.tika.parser.ctakes.CTAKESUtils
-
Returns a new UIMA Analysis Engine (AE).
- getAnnotationProperty(IdentifiedAnnotation, CTAKESAnnotationProperty) - Static method in class org.apache.tika.parser.ctakes.CTAKESUtils
-
Returns the annotation value based on the given annotation type.
- getAnnotationProps() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
- getAnnotationPropsAsString() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
Returns a string containing a comma-separated list of
CTAKESAnnotationProperty
names that will be included into cTAKES metadata.
- getApiKey() - Method in class org.apache.tika.language.translate.YandexTranslator
-
Get the API Key in use for client authentication
- getArtist() - Method in class org.apache.tika.parser.mp3.CompositeTagHandler
-
- getArtist() - Method in interface org.apache.tika.parser.mp3.ID3Tags
-
The Artist for the track
- getArtist() - Method in class org.apache.tika.parser.mp3.ID3v1Handler
-
- getArtist() - Method in class org.apache.tika.parser.mp3.ID3v22Handler
-
- getArtist() - Method in class org.apache.tika.parser.mp3.ID3v23Handler
-
- getArtist() - Method in class org.apache.tika.parser.mp3.ID3v24Handler
-
- getAttributesMapping() - Method in class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
-
- getAverageCharTolerance() - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
- getBaseType() - Method in class org.apache.tika.mime.MediaType
-
Returns the base form of the MediaType, excluding
any parameters, such as "text/plain" for
"text/plain; charset=utf-8"
- getBestNameEntity() - Method in class org.apache.tika.parser.geo.topic.NameEntityExtractor
-
- getBigInteger(int) - Method in class org.apache.tika.parser.chm.lzx.ChmSection
-
- getBitRate() - Method in class org.apache.tika.parser.mp3.AudioFrame
-
Get the bit rate in bit per second.
- getBitsPerPixel() - Method in class org.apache.tika.parser.image.ICNSType
-
- getBlock_len() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns block's length
- getBlockAddress() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcResetTable
-
Returns block addresses
- getBlockCount() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcResetTable
-
Gets a block count
- getBlockidx_intvl() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns block index interval
- getBlockLen() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcResetTable
-
Gets a block length
- getBlockLength() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getBlockNext() - Method in class org.apache.tika.parser.chm.accessor.ChmPmglHeader
-
- getBlockNumber() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxBlock
-
- getBlockPrev() - Method in class org.apache.tika.parser.chm.accessor.ChmPmglHeader
-
- getBlockRemaining() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getBlockType() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getBoolean(String, Boolean) - Static method in class org.apache.tika.util.PropsUtil
-
Parses v.
- getByte() - Method in class org.apache.tika.parser.chm.lzx.ChmSection
-
- getByteCount() - Method in class org.apache.tika.io.CountingInputStream
-
The number of bytes that have passed through this stream.
- getCause() - Method in exception org.apache.tika.io.TaggedIOException
-
Returns the wrapped exception.
- getCause() - Method in exception org.apache.tika.sax.TaggedSAXException
-
Returns the wrapped exception.
- getCauseForTermination() - Method in class org.apache.tika.batch.ParallelFileProcessingResult
-
- getCenter() - Method in class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator.HeaderFooterFromString
-
- getChannels() - Method in class org.apache.tika.parser.mp3.AudioFrame
-
Get the number of channels (1=mono, 2=stereo)
- getCharset() - Method in class org.apache.tika.detect.AutoDetectReader
-
- getChildTypes(MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Returns the set of known children of the given canonical media type
- getChmBlockInfo(DirectoryListingEntry, int, ChmLzxcControlData, ChmBlockInfo) - Method in class org.apache.tika.parser.chm.lzx.ChmBlockInfo
-
Returns an information related to the chmBlockInfo
- getChmBlockInfo() - Static method in class org.apache.tika.parser.chm.lzx.ChmBlockInfo
-
- getChmBlockInfoInstance(DirectoryListingEntry, int, ChmLzxcControlData) - Static method in class org.apache.tika.parser.chm.lzx.ChmBlockInfo
-
- getChmBlockSegment(byte[], ChmLzxcResetTable, int, int, int) - Static method in class org.apache.tika.parser.chm.core.ChmCommons
-
- getChmDirList() - Method in class org.apache.tika.parser.chm.core.ChmExtractor
-
- getChmDirList() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getChmItsfHeader() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getChmItspHeader() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getChmLzxcControlData() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getChmLzxcResetTable() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getChoices() - Method in class org.apache.tika.metadata.Property
-
Returns the (immutable) set of choices for the values of this property.
- getClassName() - Method in enum org.apache.tika.parser.ctakes.CTAKESSerializer
-
- getCommand() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Gets the command to be run.
- getCommand() - Method in class org.apache.tika.parser.external.ExternalParser
-
- getCommand() - Method in class org.apache.tika.parser.gdal.GDALParser
-
- getCommandAppendOperator() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Gets the operator to append rather than replace a value for the command
line tool, i.e.
- getCommandAssignmentDelimeter() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Gets the delimiter for multiple assignments for the command line tool,
i.e.
- getCommandAssignmentOperator() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Gets the assignment operator for the command line tool, i.e.
- getCommandMetadataSegments(Metadata) - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Constructs a collection of command line arguments responsible for setting
individual metadata fields based on the given metadata
.
- getComment(byte[], int, int) - Static method in class org.apache.tika.parser.mp3.ID3v2Frame
-
Builds up the ID3 comment, by parsing and extracting
the comment string parts from the given data.
- getComments() - Method in class org.apache.tika.parser.mp3.CompositeTagHandler
-
- getComments() - Method in interface org.apache.tika.parser.mp3.ID3Tags
-
Retrieves the comments, if any.
- getComments() - Method in class org.apache.tika.parser.mp3.ID3v1Handler
-
- getComments() - Method in class org.apache.tika.parser.mp3.ID3v22Handler
-
- getComments() - Method in class org.apache.tika.parser.mp3.ID3v23Handler
-
- getComments() - Method in class org.apache.tika.parser.mp3.ID3v24Handler
-
- getCompilation() - Method in class org.apache.tika.parser.mp3.CompositeTagHandler
-
- getCompilation() - Method in interface org.apache.tika.parser.mp3.ID3Tags
-
- getCompilation() - Method in class org.apache.tika.parser.mp3.ID3v1Handler
-
ID3v1 doesn't have compilations,
so returns null;
- getCompilation() - Method in class org.apache.tika.parser.mp3.ID3v22Handler
-
ID3v22 doesn't have compilations,
so returns null;
- getCompilation() - Method in class org.apache.tika.parser.mp3.ID3v23Handler
-
- getCompilation() - Method in class org.apache.tika.parser.mp3.ID3v24Handler
-
- getComposer() - Method in class org.apache.tika.parser.mp3.CompositeTagHandler
-
- getComposer() - Method in interface org.apache.tika.parser.mp3.ID3Tags
-
- getComposer() - Method in class org.apache.tika.parser.mp3.ID3v1Handler
-
ID3v1 doesn't have composers,
so returns null;
- getComposer() - Method in class org.apache.tika.parser.mp3.ID3v22Handler
-
- getComposer() - Method in class org.apache.tika.parser.mp3.ID3v23Handler
-
- getComposer() - Method in class org.apache.tika.parser.mp3.ID3v24Handler
-
- getCompressedLen() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcResetTable
-
Gets compressed length
- getConfidence() - Method in class org.apache.tika.language.detect.LanguageResult
-
- getConfidence() - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Get an indication of the confidence in the charset detected.
- getConfig() - Static method in class org.apache.tika.server.resource.TikaResource
-
- getConsidered() - Method in class org.apache.tika.batch.FileResourceCrawler
-
- getConsidered() - Method in class org.apache.tika.batch.ParallelFileProcessingResult
-
Returns the number of file resources considered.
- getConsumed() - Method in class org.apache.tika.batch.ParallelFileProcessingResult
-
- getConsumers() - Method in class org.apache.tika.batch.ConsumersManager
-
Get the consumers
- getConsumersManagerMaxMillis() - Method in class org.apache.tika.batch.ConsumersManager
-
BatchProcess
will throw an exception
if the ConsumersManager doesn't complete init() or shutdown()
within this amount of time.
- getContent() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxBlock
-
- getContent(int, int) - Method in class org.apache.tika.parser.chm.lzx.ChmLzxBlock
-
- getContent(int) - Method in class org.apache.tika.parser.chm.lzx.ChmLzxBlock
-
- getContentHandler(ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.example.PrescriptionParser
-
- getContentHandler(ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.dif.DIFParser
-
- getContentHandler(ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.odf.OpenDocumentMetaParser
-
- getContentHandler(ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.xml.DcXMLParser
-
- getContentHandler(ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.xml.FictionBookParser
-
- getContentHandler(ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.xml.XMLParser
-
- getContentLanguage() - Method in class org.apache.tika.example.ImportContextImpl
-
- getContentLength() - Method in class org.apache.tika.example.ImportContextImpl
-
- getContentLength() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxBlock
-
- getContentParser() - Method in class org.apache.tika.parser.epub.EpubParser
-
- getContentParser() - Method in class org.apache.tika.parser.odf.OpenDocumentParser
-
- getControlDataIndex() - Method in class org.apache.tika.parser.chm.accessor.ChmDirectoryListingSet
-
Returns control data index that located in List
- getConverter(String) - Static method in class org.apache.tika.xmp.convert.TikaToXMP
-
Retrieve a specific converter according to the mimetype
- getCount() - Method in class org.apache.tika.io.CountingInputStream
-
The number of bytes that have passed through this stream.
- getCount() - Method in class org.apache.tika.language.LanguageProfile
-
Deprecated.
- getCount(String) - Method in class org.apache.tika.language.LanguageProfile
-
Deprecated.
- getCountryCode() - Method in class org.apache.tika.parser.geo.topic.gazetteer.Location
-
- getCurrentFile() - Method in class org.apache.tika.batch.FileResourceConsumer
-
Returns the name and start time of a file that is currently being processed.
- getData() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getData() - Method in class org.apache.tika.parser.chm.lzx.ChmSection
-
- getData() - Method in class org.apache.tika.parser.mp3.ID3v2Frame
-
- getDataOffset() - Method in class org.apache.tika.parser.chm.accessor.ChmDirectoryListingSet
-
Returns data offset
- getDataOffset() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns data offset
- getDate(Property) - Method in class org.apache.tika.metadata.Metadata
-
Returns the value of the identified Date based metadata property.
- getDate(Property) - Method in class org.apache.tika.xmp.XMPMetadata
-
- getDecorationName() - Method in class org.apache.tika.parser.ctakes.CTAKESParser
-
- getDecorationName() - Method in class org.apache.tika.parser.ParserDecorator
-
- getDectorsHTML() - Method in class org.apache.tika.server.resource.TikaDetectors
-
- getDefaultConfig() - Static method in class org.apache.tika.config.TikaConfig
-
Provides a default configuration (TikaConfig).
- getDefaultDetector(MimeTypes, ServiceLoader) - Static method in class org.apache.tika.config.TikaConfig
-
- getDefaultLanguageDetector() - Static method in class org.apache.tika.language.detect.LanguageDetector
-
- getDefaultMimeTypes() - Static method in class org.apache.tika.mime.MimeTypes
-
Get the default MimeTypes.
- getDefaultMimeTypes(ClassLoader) - Static method in class org.apache.tika.mime.MimeTypes
-
Get the default MimeTypes.
- getDefaultNumConsumers() - Static method in class org.apache.tika.batch.builders.AbstractConsumersBuilder
-
- getDefaultRegistry() - Static method in class org.apache.tika.mime.MediaTypeRegistry
-
Returns the built-in media type registry included in Tika.
- getDelegateParser(ParseContext) - Method in class org.apache.tika.parser.DelegatingParser
-
Returns the parser instance to which parsing tasks should be delegated.
- getDescription() - Method in class org.apache.tika.mime.MimeType
-
Returns the description of this media type.
- getDescription() - Method in class org.apache.tika.parser.mp3.ID3Tags.ID3Comment
-
Gets the description, if present
- getDetector() - Method in class org.apache.tika.config.TikaConfig
-
Returns the configured detector instance.
- getDetector() - Method in class org.apache.tika.example.ImportContextImpl
-
- getDetector() - Method in class org.apache.tika.language.detect.LanguageHandler
-
Returns the language detector used by this content handler.
- getDetector() - Method in class org.apache.tika.language.detect.LanguageWriter
-
Returns the language detector used by this writer.
- getDetector() - Method in class org.apache.tika.parser.AutoDetectParser
-
Returns the type detector used by this parser to auto-detect the type
of a document.
- getDetector(Parser) - Static method in class org.apache.tika.server.resource.TikaResource
-
- getDetector() - Method in class org.apache.tika.Tika
-
Returns the detector instance used by this facade.
- getDetectors() - Method in class org.apache.tika.detect.CompositeDetector
-
Returns the component detectors.
- getDetectors() - Method in class org.apache.tika.detect.DefaultDetector
-
- getDetectors() - Method in class org.apache.tika.detect.DefaultProbDetector
-
- getDetectorsJSON() - Method in class org.apache.tika.server.resource.TikaDetectors
-
- getDetectorsPlain() - Method in class org.apache.tika.server.resource.TikaDetectors
-
- getDir_uuid() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns directory uuid
- getDirectoryListingEntryList() - Method in class org.apache.tika.parser.chm.accessor.ChmDirectoryListingSet
-
Returns chm directory listing entry list
- getDirLen() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns directory length
- getDirOffset() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns directory offset
- getDisc() - Method in class org.apache.tika.parser.mp3.CompositeTagHandler
-
- getDisc() - Method in interface org.apache.tika.parser.mp3.ID3Tags
-
The number of the disc this belongs to, within the set
- getDisc() - Method in class org.apache.tika.parser.mp3.ID3v1Handler
-
ID3v1 doesn't have disc numbers,
so returns null;
- getDisc() - Method in class org.apache.tika.parser.mp3.ID3v22Handler
-
- getDisc() - Method in class org.apache.tika.parser.mp3.ID3v23Handler
-
- getDisc() - Method in class org.apache.tika.parser.mp3.ID3v24Handler
-
- getDocument() - Method in class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
-
- getDocument() - Method in interface org.apache.tika.parser.microsoft.ooxml.OOXMLExtractor
-
Returns the opened document.
- getDocumentBuilder() - Method in class org.apache.tika.parser.ParseContext
-
Returns the DOM builder specified in this parsing context.
- getDuration() - Method in class org.apache.tika.parser.mp3.AudioFrame
-
Returns the duration in milliseconds.
- getEnableAutoSpace() - Method in class org.apache.tika.parser.pdf.PDFParser
-
- getEnableAutoSpace() - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
- getEncint() - Method in class org.apache.tika.parser.chm.lzx.ChmSection
-
- getEncoding() - Method in class org.apache.tika.example.ImportContextImpl
-
- getEncoding() - Method in class org.apache.tika.parser.strings.StringsConfig
-
Returns the character encoding of the strings that are to be found.
- getEndBlock() - Method in class org.apache.tika.parser.chm.lzx.ChmBlockInfo
-
Returns the end block index
- getEndDocumentWasCalled() - Method in class org.apache.tika.sax.EndDocumentShieldingContentHandler
-
- getEndOffset() - Method in class org.apache.tika.parser.chm.lzx.ChmBlockInfo
-
Returns the end offset index
- getEntityTypes() - Method in class org.apache.tika.parser.ner.corenlp.CoreNLPNERecogniser
-
Gets set of entity types recognised by this recogniser
- getEntityTypes() - Method in class org.apache.tika.parser.ner.grobid.GrobidNERecogniser
-
Gets set of entity types recognised by this recogniser
- getEntityTypes() - Method in class org.apache.tika.parser.ner.mitie.MITIENERecogniser
-
Gets set of entity types recognised by this recogniser
- getEntityTypes() - Method in interface org.apache.tika.parser.ner.NERecogniser
-
gets a set of entity types whose names are recognisable by this
- getEntityTypes() - Method in class org.apache.tika.parser.ner.nltk.NLTKNERecogniser
-
Gets set of entity types recognised by this recogniser
- getEntityTypes() - Method in class org.apache.tika.parser.ner.opennlp.OpenNLPNameFinder
-
- getEntityTypes() - Method in class org.apache.tika.parser.ner.opennlp.OpenNLPNERecogniser
-
- getEntityTypes() - Method in class org.apache.tika.parser.ner.regex.RegexNERecogniser
-
- getEntryType() - Method in class org.apache.tika.parser.chm.accessor.DirectoryListingEntry
-
Returns ChmCommons.EntryType (COMPRESSED or UNCOMPRESSED)
- getErrors() - Static method in class org.apache.tika.language.LanguageIdentifier
-
Deprecated.
Returns a string of error messages related to initializing langauge profiles
- getExecutorService() - Method in class org.apache.tika.config.TikaConfig
-
- getExitStatus() - Method in class org.apache.tika.batch.ParallelFileProcessingResult
-
- getExtendedHeader() - Method in class org.apache.tika.parser.mp3.ID3v2Frame
-
- getExtension() - Method in class org.apache.tika.mime.MimeType
-
Returns the preferred file extension of this type, or an empty string
if no extensions are known.
- getExtension() - Method in enum org.apache.tika.parser.microsoft.OfficeParser.POIFSDocumentType
-
- getExtensions() - Method in class org.apache.tika.mime.MimeType
-
Returns the list of all known file extensions of this media type.
- getExtractAcroFormContent() - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
- getExtractAnnotationText() - Method in class org.apache.tika.parser.pdf.PDFParser
-
- getExtractAnnotationText() - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
- getExtractInlineImages() - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
- getExtractUniqueInlineImagesOnly() - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
- getFallback() - Method in class org.apache.tika.parser.CompositeParser
-
Returns the fallback parser.
- getFile() - Method in class org.apache.tika.io.TikaInputStream
-
- getFile(String, File) - Static method in class org.apache.tika.util.PropsUtil
-
Deprecated.
- getFileChannel() - Method in class org.apache.tika.io.TikaInputStream
-
- getFilePath() - Method in class org.apache.tika.parser.strings.FileConfig
-
Returns the "file" installation folder.
- getFileProg() - Static method in class org.apache.tika.parser.strings.StringsParser
-
- getFilteredStackTrace(Throwable) - Static method in class org.apache.tika.utils.ExceptionUtils
-
Simple util to get stack trace.
- getFlags() - Method in class org.apache.tika.parser.mp3.ID3v2Frame
-
- getFormat() - Method in class org.apache.tika.language.translate.YandexTranslator
-
Retrieve the current text format setting.
- getFormattedNumber(Paragraph) - Method in class org.apache.tika.parser.microsoft.ListManager
-
Get the formatted number for a given paragraph
- getFormattedNumber(XWPFParagraph) - Method in class org.apache.tika.parser.microsoft.ooxml.XWPFListManager
-
- getFramesRead() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getFreeSpace() - Method in class org.apache.tika.parser.chm.accessor.ChmPmgiHeader
-
Returns pmgi free space
- getFreeSpace() - Method in class org.apache.tika.parser.chm.accessor.ChmPmglHeader
-
- getGazetteerRestEndpoint() - Method in class org.apache.tika.parser.geo.topic.GeoParserConfig
-
- getGenre() - Method in class org.apache.tika.parser.mp3.CompositeTagHandler
-
- getGenre() - Method in interface org.apache.tika.parser.mp3.ID3Tags
-
- getGenre() - Method in class org.apache.tika.parser.mp3.ID3v1Handler
-
- getGenre() - Method in class org.apache.tika.parser.mp3.ID3v22Handler
-
- getGenre() - Method in class org.apache.tika.parser.mp3.ID3v23Handler
-
- getGenre() - Method in class org.apache.tika.parser.mp3.ID3v24Handler
-
- getHadStarted() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getHeader_len() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns header length
- getHeaderLen() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns itsf header length
- getHeight() - Method in class org.apache.tika.parser.image.ICNSType
-
- getHTML(InputStream, HttpHeaders, UriInfo) - Method in class org.apache.tika.server.resource.TikaResource
-
- getHTMLFromMultipart(Attachment, UriInfo) - Method in class org.apache.tika.server.resource.TikaResource
-
- getIfXFAExtractOnlyXFA() - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
- getImportRoot() - Method in class org.apache.tika.example.ImportContextImpl
-
- getIndex_depth() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns an index depth
- getIndex_head() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns an index head
- getIndex_root() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns index root
- getIndexOfContent() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getIndexOfResetData() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getIndexOfResetTable() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getIniBlock() - Method in class org.apache.tika.parser.chm.lzx.ChmBlockInfo
-
Returns an initial block index
- getInputStream(FileResource) - Method in class org.apache.tika.batch.fs.AbstractFSConsumer
-
- getInputStream() - Method in class org.apache.tika.example.ImportContextImpl
-
Returns a new InputStream
to the temporary file created
during instanciation or null
, if this context does not
provide a stream.
- getInstance() - Static method in class org.apache.tika.parser.ner.regex.RegexNERecogniser
-
- getInt(Property) - Method in class org.apache.tika.metadata.Metadata
-
Returns the value of the identified Integer based metadata property.
- getInt(byte[]) - Static method in class org.apache.tika.parser.mp3.ID3v2Frame
-
- getInt(byte[], int) - Static method in class org.apache.tika.parser.mp3.ID3v2Frame
-
- getInt(String, Integer) - Static method in class org.apache.tika.util.PropsUtil
-
Parses v.
- getInt(String, Map<String, String>, Node) - Static method in class org.apache.tika.util.XMLDOMUtil
-
Get an int value.
- getInt(Property) - Method in class org.apache.tika.xmp.XMPMetadata
-
- getInt2(byte[], int) - Static method in class org.apache.tika.parser.mp3.ID3v2Frame
-
- getInt3(byte[], int) - Static method in class org.apache.tika.parser.mp3.ID3v2Frame
-
- getIntBE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE int value from the beginning of a byte array
- getIntBE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE int value from a byte array
- getIntelCurrentPossition() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getIntelFileSize() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getIntelState() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getIntLE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE int value from the beginning of a byte array
- getIntLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE int value from a byte array
- getIOListener() - Method in class org.apache.tika.example.ImportContextImpl
-
- getJavaCommand() - Method in class org.apache.tika.fork.ForkParser
-
- getJavaCommandAsList() - Method in class org.apache.tika.fork.ForkParser
-
Returns the command used to start the forked server process.
- getJCas(AnalysisEngine) - Static method in class org.apache.tika.parser.ctakes.CTAKESUtils
-
Returns a new JCas () appropriate for the given Analysis Engine.
- getJustFileName(String) - Method in class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
-
- getKey() - Static method in class org.apache.tika.example.Pharmacy
-
- getLang_id() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns language id
- getLangId() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns language ID
- getLanguage() - Method in class org.apache.tika.language.detect.LanguageHandler
-
Returns the detected language based on text handled thus far.
- getLanguage() - Method in class org.apache.tika.language.detect.LanguageResult
-
- getLanguage() - Method in class org.apache.tika.language.detect.LanguageWriter
-
Returns the detected language based on text written thus far.
- getLanguage() - Method in class org.apache.tika.language.LanguageIdentifier
-
Deprecated.
Gets the identified language
- getLanguage() - Method in class org.apache.tika.language.ProfilingHandler
-
Deprecated.
Returns the language that best matches the current state of the
language profile.
- getLanguage() - Method in class org.apache.tika.language.ProfilingWriter
-
Deprecated.
Returns the language that best matches the current state of the
language profile.
- getLanguage(long) - Static method in class org.apache.tika.parser.chm.core.ChmCommons
-
Returns textual representation of LangID
- getLanguage() - Method in class org.apache.tika.parser.mp3.ID3Tags.ID3Comment
-
Gets the language, if present
- getLanguage() - Method in class org.apache.tika.parser.ocr.TesseractOCRConfig
-
- getLanguage() - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Get the ISO code for the language of the detected charset.
- getLanguageDetectors() - Static method in class org.apache.tika.language.detect.LanguageDetector
-
- getLanguageDetectors(ServiceLoader) - Static method in class org.apache.tika.language.detect.LanguageDetector
-
- getLastModified() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns last modified date of the chm file
- getLatitude() - Method in class org.apache.tika.parser.geo.topic.gazetteer.Location
-
- getLayer() - Method in class org.apache.tika.parser.mp3.AudioFrame
-
Get the audio layer code.
- getLeft() - Method in class org.apache.tika.parser.chm.lzx.ChmSection
-
- getLeft() - Method in class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator.HeaderFooterFromString
-
- getLength() - Method in class org.apache.tika.detect.MagicDetector
-
- getLength() - Method in class org.apache.tika.io.TikaInputStream
-
Returns the length (in bytes) of this stream.
- getLength() - Method in class org.apache.tika.parser.chm.accessor.DirectoryListingEntry
-
- getLength() - Method in class org.apache.tika.parser.mp3.AudioFrame
-
Returns the frame length in bytes.
- getLength() - Method in class org.apache.tika.parser.mp3.ID3v2Frame
-
- getLengthTreeLengtsTable() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getLengthTreeTable() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getLinks() - Method in class org.apache.tika.mime.MimeType
-
Get a list of links to help document this mime type
- getLinks() - Method in class org.apache.tika.sax.LinkContentHandler
-
Returns the list of collected links.
- getLoadErrorHandler() - Method in class org.apache.tika.config.ServiceLoader
-
Returns the load error handler used by this loader.
- getLocations(List<String>) - Method in class org.apache.tika.parser.geo.topic.gazetteer.GeoGazetteerClient
-
Calls API of lucene-geo-gazetteer to search location name in gazetteer.
- getLong(String, Long) - Static method in class org.apache.tika.util.PropsUtil
-
Parses v.
- getLong(String, Map<String, String>, Node) - Static method in class org.apache.tika.util.XMLDOMUtil
-
Get a long value.
- getLongitude() - Method in class org.apache.tika.parser.geo.topic.gazetteer.Location
-
- getLongLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE long value from a byte array
- getLzxBlockLength() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getLzxBlockOffset() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getLzxBlocksCache() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getMacroLanguage(String) - Static method in class org.apache.tika.language.detect.LanguageNames
-
If language is a specific variant of a macro language (e.g.
- getMainDocumentParts() - Method in class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
-
Return a list of the main parts of the document, used
when searching for embedded resources.
- getMainDocumentParts() - Method in class org.apache.tika.parser.microsoft.ooxml.POIXMLTextExtractorDecorator
-
- getMainDocumentParts() - Method in class org.apache.tika.parser.microsoft.ooxml.XSLFPowerPointExtractorDecorator
-
In PowerPoint files, slides have things embedded in them,
and slide drawings which have the images
- getMainDocumentParts() - Method in class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator
-
In Excel files, sheets have things embedded in them,
and sheet drawings which have the images
- getMainDocumentParts() - Method in class org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator
-
Word documents are simple, they only have the one
main part
- getMainTreeElements() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getMainTreeLengtsTable() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getMainTreeTable() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getMajorVersion() - Method in class org.apache.tika.parser.mp3.ID3v2Frame
-
- getMappedTagName() - Method in class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
-
- getMatchType() - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Return flags indicating what it was about the input data
that caused this charset to be considered as a possible match.
- getMaxBytesForEmbeddedObject() - Static method in class org.apache.tika.parser.rtf.RTFParser
-
- getMaxFileSizeToOcr() - Method in class org.apache.tika.parser.ocr.TesseractOCRConfig
-
- getMaximumCompressionRatio() - Method in class org.apache.tika.sax.SecureContentHandler
-
Returns the maximum compression ratio.
- getMaximumDepth() - Method in class org.apache.tika.sax.SecureContentHandler
-
Returns the maximum XML element nesting level.
- getMaximumPackageEntryDepth() - Method in class org.apache.tika.sax.SecureContentHandler
-
Returns the maximum package entry nesting level.
- getMaxStringLength() - Method in class org.apache.tika.Tika
-
Returns the maximum length of strings returned by the
parseToString methods.
- getMediaTypeRegistry() - Method in class org.apache.tika.config.TikaConfig
-
- getMediaTypeRegistry() - Method in class org.apache.tika.mime.MimeTypes
-
- getMediaTypeRegistry() - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
-
- getMediaTypeRegistry() - Method in class org.apache.tika.parser.CompositeParser
-
Returns the media type registry used to infer type relationships.
- getMediaTypes() - Method in class org.apache.tika.server.resource.TikaMimeTypes
-
- getMessage() - Method in class org.apache.tika.server.resource.TikaResource
-
- getMet(URL) - Static method in class org.apache.tika.example.DisplayMetInstance
-
- getMetadata() - Method in interface org.apache.tika.batch.FileResource
-
This gets the metadata available before the parsing of the file.
- getMetadata() - Method in class org.apache.tika.batch.fs.FSFileResource
-
- getMetadata() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
Returns an array of metadata whose values will be analyzed using cTAKES.
- getMetadata() - Method in class org.apache.tika.parser.ctakes.CTAKESContentHandler
-
Returns metadata that includes cTAKES annotations.
- getMetadata() - Method in class org.apache.tika.parser.RecursiveParserWrapper
-
The first element in the returned list represents the
data from the outer container file.
- getMetadata() - Method in class org.apache.tika.server.MetadataList
-
- getMetadata(InputStream, HttpHeaders, UriInfo) - Method in class org.apache.tika.server.resource.MetadataResource
-
- getMetadata(InputStream, HttpHeaders, UriInfo, String) - Method in class org.apache.tika.server.resource.RecursiveMetadataResource
-
Returns an InputStream that can be deserialized as a list of
Metadata
objects.
- getMetadataAsString() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
Returns a string containing a comma-separated list of metadata whose values will be analyzed using cTAKES.
- getMetadataCommandArguments() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Gets the map of Metadata keys to command line parameters.
- getMetadataExtractionPatterns() - Method in class org.apache.tika.parser.external.ExternalParser
-
- getMetadataExtractor() - Method in class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
-
- getMetadataExtractor() - Method in interface org.apache.tika.parser.microsoft.ooxml.OOXMLExtractor
-
POIXMLTextExtractor.getMetadataTextExtractor()
not yet supported
for OOXML by POI.
- getMetadataField(InputStream, HttpHeaders, UriInfo, String) - Method in class org.apache.tika.server.resource.MetadataResource
-
Get a specific metadata field.
- getMetadataFromMultipart(Attachment, UriInfo) - Method in class org.apache.tika.server.resource.MetadataResource
-
- getMetadataFromMultipart(Attachment, UriInfo, String) - Method in class org.apache.tika.server.resource.RecursiveMetadataResource
-
Returns an InputStream that can be deserialized as a list of
Metadata
objects.
- getMetaParser() - Method in class org.apache.tika.parser.epub.EpubParser
-
- getMetaParser() - Method in class org.apache.tika.parser.odf.OpenDocumentParser
-
- getMimeRepository() - Method in class org.apache.tika.config.TikaConfig
-
- getMimeType() - Method in class org.apache.tika.example.ImportContextImpl
-
- getMimeType(String) - Method in class org.apache.tika.mime.MimeTypes
-
- getMimeType(File) - Method in class org.apache.tika.mime.MimeTypes
-
- getMimeTypesHTML() - Method in class org.apache.tika.server.resource.TikaMimeTypes
-
- getMimeTypesJSON() - Method in class org.apache.tika.server.resource.TikaMimeTypes
-
- getMimeTypesPlain() - Method in class org.apache.tika.server.resource.TikaMimeTypes
-
- getMinFileSizeToOcr() - Method in class org.apache.tika.parser.ocr.TesseractOCRConfig
-
- getMinLength() - Method in class org.apache.tika.detect.TrainedModelDetector
-
- getMinLength() - Method in class org.apache.tika.mime.MimeTypes
-
Return the minimum length of data to provide to analyzing methods based
on the document's content in order to check all the known MimeTypes.
- getMinLength() - Method in class org.apache.tika.parser.strings.StringsConfig
-
Returns the minimum sequence length (characters) to print.
- getMinorVersion() - Method in class org.apache.tika.parser.mp3.ID3v2Frame
-
- getMinSize() - Method in class org.apache.tika.parser.strings.Latin1StringsParser
-
Returns the minimum size of a character sequence to be extracted.
- getModificationTime() - Method in class org.apache.tika.example.ImportContextImpl
-
- getMSB() - Method in class org.apache.tika.parser.executable.MachineMetadata.Endian
-
- getName(String) - Static method in class org.apache.tika.io.FilenameUtils
-
This is a duplication of the algorithm and functionality
available in commons io FilenameUtils.
- getName() - Method in class org.apache.tika.language.LanguageProfilerBuilder
-
Deprecated.
- getName() - Method in class org.apache.tika.metadata.Property
-
- getName() - Method in class org.apache.tika.mime.MimeType
-
Returns the name of this media type.
- getName() - Method in class org.apache.tika.parser.chm.accessor.DirectoryListingEntry
-
Returns an entry name
- getName() - Method in enum org.apache.tika.parser.ctakes.CTAKESAnnotationProperty
-
- getName() - Method in class org.apache.tika.parser.executable.MachineMetadata.Endian
-
- getName() - Method in class org.apache.tika.parser.geo.topic.gazetteer.Location
-
- getName() - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Get the name of the detected charset.
- getNameLength() - Method in class org.apache.tika.parser.chm.accessor.DirectoryListingEntry
-
Returns an entry name length
- getNames(Metadata) - Method in class org.apache.tika.metadata.serialization.JsonMetadataSerializer
-
Override to get a custom sort order
or to filter names.
- getNamespace() - Method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
-
- getNamespacePrefix(String) - Static method in class org.apache.tika.xmp.XMPMetadata
-
Obtain the prefix for a registered namespace URI.
- getNamespaces() - Static method in class org.apache.tika.xmp.XMPMetadata
-
- getNamespaceURI(String) - Static method in class org.apache.tika.xmp.XMPMetadata
-
Obtain the URI for a registered namespace prefix.
- getNerModelUrl() - Method in class org.apache.tika.parser.geo.topic.GeoParserConfig
-
- getNewContentHandler() - Method in class org.apache.tika.sax.BasicContentHandlerFactory
-
- getNewContentHandler(OutputStream, String) - Method in class org.apache.tika.sax.BasicContentHandlerFactory
-
- getNewContentHandler() - Method in interface org.apache.tika.sax.ContentHandlerFactory
-
- getNewContentHandler(OutputStream, String) - Method in interface org.apache.tika.sax.ContentHandlerFactory
-
- getNum_blocks() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns number of blocks
- getNumberHandledExceptions() - Method in class org.apache.tika.batch.ParallelFileProcessingResult
-
- getNumberOfLevels() - Method in class org.apache.tika.parser.microsoft.AbstractListManager.ParagraphLevelCounter
-
- getNumConsumers(Map<String, String>) - Static method in class org.apache.tika.batch.builders.BatchProcessBuilder
-
numConsumers is needed by both the crawler and the consumers.
- getNumHandledExceptions() - Method in class org.apache.tika.batch.FileResourceConsumer
-
- getNumOfHidden() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- getNumOfInputs() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- getNumOfOutputs() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- getNumResourcesConsumed() - Method in class org.apache.tika.batch.FileResourceConsumer
-
- getNumRestarts() - Method in class org.apache.tika.batch.BatchProcessDriverCLI
-
- getNumTranslationPairs() - Method in class org.apache.tika.language.translate.CachedTranslator
-
Get the number of different source/target translation pairs this CachedTranslator
currently has in its cache.
- getNumTranslationsFor(String, String) - Method in class org.apache.tika.language.translate.CachedTranslator
-
Get the number of different translations from the source language to the target language
this CachedTranslator has in its cache.
- getOffset() - Method in class org.apache.tika.parser.chm.accessor.DirectoryListingEntry
-
- getOpenContainer() - Method in class org.apache.tika.io.TikaInputStream
-
Returns the open container object, such as a
POIFS FileSystem in the event of an OLE2
document being detected and processed by
the OLE2 detector.
- getOutputEncoding() - Method in class org.apache.tika.batch.fs.BasicTikaFSConsumer
-
- getOutputEncoding() - Method in class org.apache.tika.batch.fs.RecursiveParserWrapperFSConsumer
-
- getOutputFile(File, String, FSUtil.HANDLE_EXISTING, String) - Static method in class org.apache.tika.batch.fs.FSUtil
-
Deprecated.
- getOutputPath(Path, String, FSUtil.HANDLE_EXISTING, String) - Static method in class org.apache.tika.batch.fs.FSUtil
-
Given an output root and an initial relative path,
return the output file according to the HANDLE_EXISTING strategy
In the most basic use case, given a root directory "input",
a file's relative path "dir1/dir2/fileA.docx", and an output directory
"output", the output file would be "output/dir1/dir2/fileA.docx."
If HANDLE_EXISTING is set to OVERWRITE, this will not check to see if the output already exists,
and the returned file could overwrite an existing file!!!
If HANDLE_EXISTING is set to RENAME, this will try to increment a counter at the end of
the file name (fileA(2).docx) until there is a file name that doesn't exist.
- getOutputStream(OutputStreamFactory, FileResource) - Method in class org.apache.tika.batch.fs.AbstractFSConsumer
-
Use this for consistent logging of exceptions.
- getOutputStream(Metadata) - Method in class org.apache.tika.batch.fs.FSOutputStreamFactory
-
This tries to create a file based on the
FSUtil.HANDLE_EXISTING
value that was passed in during initialization.
- getOutputStream(Metadata) - Method in interface org.apache.tika.batch.OutputStreamFactory
-
- getOutputStream() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
- getOutputThreshold() - Method in class org.apache.tika.sax.SecureContentHandler
-
Returns the configured output threshold.
- getPageSegMode() - Method in class org.apache.tika.parser.ocr.TesseractOCRConfig
-
- getParameters() - Method in class org.apache.tika.mime.MediaType
-
Returns an immutable sorted map of the parameters of this media type.
- getParams() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- getParser(TikaConfig) - Method in class org.apache.tika.batch.AutoDetectParserFactory
-
- getParser(TikaConfig) - Method in class org.apache.tika.batch.DigestingAutoDetectParserFactory
-
- getParser(TikaConfig) - Method in class org.apache.tika.batch.ParserFactory
-
- getParser(MediaType) - Method in class org.apache.tika.config.TikaConfig
-
- getParser() - Method in class org.apache.tika.config.TikaConfig
-
Returns the configured parser instance.
- getParser(Metadata) - Method in class org.apache.tika.parser.CompositeParser
-
Returns the parser that best matches the given metadata.
- getParser(Metadata, ParseContext) - Method in class org.apache.tika.parser.CompositeParser
-
- getParser() - Method in class org.apache.tika.Tika
-
Returns the parser instance used by this facade.
- getParserDetailsHTML() - Method in class org.apache.tika.server.resource.TikaParsers
-
- getParserDetailsJSON() - Method in class org.apache.tika.server.resource.TikaParsers
-
- getParserDetailssPlain() - Method in class org.apache.tika.server.resource.TikaParsers
-
- getParseRecursively() - Method in class org.apache.tika.batch.ParserFactory
-
- getParsers(ParseContext) - Method in class org.apache.tika.parser.CompositeParser
-
- getParsers() - Method in class org.apache.tika.parser.CompositeParser
-
Returns the component parsers.
- getParsers(ParseContext) - Method in class org.apache.tika.parser.DefaultParser
-
- getParsersHTML() - Method in class org.apache.tika.server.resource.TikaParsers
-
- getParsersHTML(boolean) - Method in class org.apache.tika.server.resource.TikaParsers
-
- getParsersJSON() - Method in class org.apache.tika.server.resource.TikaParsers
-
- getParsersJSON(boolean) - Method in class org.apache.tika.server.resource.TikaParsers
-
- getParsersPlain() - Method in class org.apache.tika.server.resource.TikaParsers
-
- getParsersPlain(boolean) - Method in class org.apache.tika.server.resource.TikaParsers
-
- getPart() - Method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
-
- getPassword(Metadata) - Method in interface org.apache.tika.parser.PasswordProvider
-
Looks up the password for a document with the given metadata,
and returns it for the Parser.
- getPath() - Method in class org.apache.tika.io.TikaInputStream
-
- getPath(String, Path) - Static method in class org.apache.tika.util.PropsUtil
-
Parses v.
- getPDFParserConfig() - Method in class org.apache.tika.parser.pdf.PDFParser
-
- getPoolSize() - Method in class org.apache.tika.fork.ForkParser
-
Returns the size of the process pool.
- getPosition() - Method in class org.apache.tika.io.NullInputStream
-
Return the current position.
- getPosition() - Method in class org.apache.tika.io.TikaInputStream
-
Returns the current position within the stream.
- getPrefixes() - Static method in class org.apache.tika.xmp.XMPMetadata
-
- getPrevContent() - Method in class org.apache.tika.parser.chm.lzx.ChmSection
-
- getPrimaryProperty() - Method in class org.apache.tika.metadata.Property
-
Gets the primary property for a composite property
- getProfile() - Method in class org.apache.tika.language.ProfilingHandler
-
Deprecated.
Returns the language profile being built by this content handler.
- getProfile() - Method in class org.apache.tika.language.ProfilingWriter
-
Deprecated.
Returns the language profile being built by this writer.
- getProperties(String) - Static method in class org.apache.tika.metadata.Property
-
- getProperty(Object) - Method in class org.apache.tika.example.ImportContextImpl
-
- getPropertyType(String) - Static method in class org.apache.tika.metadata.Property
-
Get the type of a property
- getPropertyType() - Method in class org.apache.tika.metadata.Property
-
- getQNameAsString(QName) - Static method in class org.apache.tika.sax.ElementMappingContentHandler
-
- getR0() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getR1() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getR2() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getRawScore() - Method in class org.apache.tika.language.detect.LanguageResult
-
- getReader(InputStream, String) - Method in class org.apache.tika.parser.txt.CharsetDetector
-
Autodetect the charset of an inputStream, and return a Java Reader
to access the converted input data.
- getReader() - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Create a java.io.Reader for reading the Unicode character data corresponding
to the original byte data supplied to the Charset detect operation.
- getRegisteredMimeType(String) - Method in class org.apache.tika.mime.MimeTypes
-
Returns the registered, normalised media type with the given name (or alias).
- getRel() - Method in class org.apache.tika.sax.Link
-
- getResetInterval() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcControlData
-
Returns reset interval
- getResetTableIndex() - Method in class org.apache.tika.parser.chm.accessor.ChmDirectoryListingSet
-
Return index of reset table
- getResource(Class<T>) - Method in class org.apache.tika.io.TemporaryResources
-
Returns the latest of the tracked resources that implements or
extends the given interface or class.
- getResourceAsStream(String) - Method in class org.apache.tika.config.ServiceLoader
-
Returns an input stream for reading the specified resource from the
configured class loader.
- getResourceId() - Method in interface org.apache.tika.batch.FileResource
-
This is only used in logging to identify which file
may have caused problems.
- getResourceId() - Method in class org.apache.tika.batch.fs.FSFileResource
-
- getRight() - Method in class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator.HeaderFooterFromString
-
- getRoughCountExceptions() - Method in class org.apache.tika.batch.StatusReporter
-
This returns a rough (unsynchronized) count of caught/handled exceptions.
- getRSSFooters() - Method in class org.apache.tika.example.RecentFiles
-
- getRSSHeaders() - Method in class org.apache.tika.example.RecentFiles
-
- getRSSItem(Document) - Method in class org.apache.tika.example.RecentFiles
-
- getSampleRate() - Method in class org.apache.tika.parser.mp3.AudioFrame
-
Get the sampling rate, in Hz
- getSAXParser() - Method in class org.apache.tika.parser.ParseContext
-
Returns the SAX parser specified in this parsing context.
- getSAXParserFactory() - Method in class org.apache.tika.parser.ParseContext
-
Returns the SAX parser factory specified in this parsing context.
- getSecondaryExtractProperties() - Method in class org.apache.tika.metadata.Property
-
Gets the secondary properties for a composite property
- getSeparatorChar() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
Returns the separator character used for annotation properties.
- getSerializerType() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
Returns the type of cTAKES (UIMA) serializer used to write the CAS.
- getServiceClass(Class<T>, String) - Method in class org.apache.tika.config.ServiceLoader
-
Loads and returns the named service class that's expected to implement
the given interface.
- getServiceLoader() - Method in class org.apache.tika.config.TikaConfig
-
- getShortBE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE short value from the beginning of a byte array
- getShortBE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE short value from a byte array
- getShortLE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE short value from the beginning of a byte array
- getShortLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE short value from a byte array
- getSignature() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns a signature of itsf header
- getSignature() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns a signature of the header
- getSignature() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcControlData
-
Returns a signature of control data block
- getSignature() - Method in class org.apache.tika.parser.chm.accessor.ChmPmgiHeader
-
Returns pmgi signature if exists
- getSignature() - Method in class org.apache.tika.parser.chm.accessor.ChmPmglHeader
-
- getSimilarity(LanguageProfilerBuilder) - Method in class org.apache.tika.language.LanguageProfilerBuilder
-
Deprecated.
Calculates a score how well NGramProfiles match each other
- getSize() - Method in class org.apache.tika.io.NullInputStream
-
- getSize() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcControlData
-
Returns a size of control data
- getSize() - Method in class org.apache.tika.parser.mp3.ID3v2Frame.RawTag
-
- getSize(Metadata, Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.CSVMessageBodyWriter
-
- getSize(Metadata, Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.JSONMessageBodyWriter
-
- getSize(MetadataList, Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.MetadataListMessageBodyWriter
-
- getSize(Map<String, byte[]>, Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.TarWriter
-
- getSize(Metadata, Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.TextMessageBodyWriter
-
- getSize(Metadata, Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.XMPMessageBodyWriter
-
- getSize(Map<String, byte[]>, Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.ZipWriter
-
- getSize() - Method in class org.apache.tika.utils.RereadableInputStream
-
Returns the number of bytes read from the original stream.
- getSortByPosition() - Method in class org.apache.tika.parser.pdf.PDFParser
-
- getSortByPosition() - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
- getSorted() - Method in class org.apache.tika.language.LanguageProfilerBuilder
-
Deprecated.
Returns a sorted list of ngrams (sort done by 1.
- getSpacingTolerance() - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
- getStackTrace(Throwable) - Static method in class org.apache.tika.utils.ExceptionUtils
-
Get the full stacktrace as a string
- getStartBlock() - Method in class org.apache.tika.parser.chm.lzx.ChmBlockInfo
-
Returns the start block index
- getStartIndex() - Method in class org.apache.tika.parser.chm.core.ChmWrapper
-
- getStartOffset() - Method in class org.apache.tika.parser.chm.lzx.ChmBlockInfo
-
Returns the start offset index
- getState() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxBlock
-
- getStream_uuid() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns stream uuid
- getString(byte[], int, int) - Static method in class org.apache.tika.parser.mp3.ID3v2Frame
-
Returns the String at the given
offset and length.
- getString(byte[], String) - Method in class org.apache.tika.parser.txt.CharsetDetector
-
Autodetect the charset of an inputStream, and return a String
containing the converted input data.
- getString() - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Create a Java String from Unicode character data corresponding
to the original byte data supplied to the Charset detect operation.
- getString(int) - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Create a Java String from Unicode character data corresponding
to the original byte data supplied to the Charset detect operation.
- getString(String, String) - Static method in class org.apache.tika.util.PropsUtil
-
Parses v.
- getStringsPath() - Method in class org.apache.tika.parser.strings.StringsConfig
-
Returns the "strings" installation folder.
- getStringsProg() - Static method in class org.apache.tika.parser.strings.StringsParser
-
- getStyleClass() - Method in class org.apache.tika.parser.microsoft.WordExtractor.TagAndStyle
-
- getSubtype() - Method in class org.apache.tika.mime.MediaType
-
Return the Sub-Type of the MediaType,
such as "plain" for "text/plain"
- getSuffix(InputStream, int) - Static method in class org.apache.tika.parser.mp3.LyricsHandler
-
Reads and returns the last length
bytes from the
given stream.
- getSupertype(MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Returns the supertype of the given type.
- getSupportedEmbedTypes(ParseContext) - Method in interface org.apache.tika.embedder.Embedder
-
Returns the set of media types supported by this embedder when used with
the given parse context.
- getSupportedEmbedTypes(ParseContext) - Method in class org.apache.tika.embedder.ExternalEmbedder
-
- getSupportedEmbedTypes() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
- getSupportedLanguages() - Static method in class org.apache.tika.language.LanguageIdentifier
-
Deprecated.
Returns what languages are supported for language identification
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.example.DirListParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.example.EncryptedPrescriptionParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.example.PrescriptionParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.fork.ForkParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.asm.ClassParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.audio.AudioParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.audio.MidiParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.chm.ChmParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.code.SourceCodeParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.CompositeParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.crypto.Pkcs7Parser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.CryptoParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.DelegatingParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.dif.DIFParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.dwg.DWGParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.EmptyParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.envi.EnviHeaderParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.epub.EpubContentParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.epub.EpubParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.ErrorParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.executable.ExecutableParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.external.ExternalParser
-
- getSupportedTypes() - Method in class org.apache.tika.parser.external.ExternalParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.feed.FeedParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.font.AdobeFontMetricParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.font.TrueTypeParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.gdal.GDALParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.geo.topic.GeoParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.geoinfo.GeographicInformationParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.grib.GribParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.hdf.HDFParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.html.HtmlParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.image.BPGParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.image.ICNSParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.image.ImageParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.image.PSDParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.image.TiffParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.image.WebPParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.isatab.ISArchiveParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.iwork.IWorkPackageParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.jdbc.SQLite3Parser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.journal.JournalParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.jpeg.JpegParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.mail.RFC822Parser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.mat.MatParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.mbox.MboxParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.mbox.OutlookPSTParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.microsoft.JackcessParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.microsoft.OfficeParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.microsoft.OldExcelParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.microsoft.ooxml.OOXMLParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.microsoft.TNEFParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.mp3.Mp3Parser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.mp4.MP4Parser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.ner.NamedEntityParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.netcdf.NetCDFParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.NetworkParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.ocr.TesseractOCRParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.odf.OpenDocumentContentParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.odf.OpenDocumentParser
-
- getSupportedTypes(ParseContext) - Method in interface org.apache.tika.parser.Parser
-
Returns the set of media types supported by this parser when used
with the given parse context.
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.ParserDecorator
-
Delegates the method call to the decorated parser.
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.pdf.PDFParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.pkg.CompressorParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.pkg.PackageParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.pkg.RarParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.pot.PooledTimeSeriesParser
-
Returns the set of media types supported by this parser when used with the
given parse context.
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.prt.PRTParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.RecursiveParserWrapper
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.rtf.RTFParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.strings.Latin1StringsParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.strings.StringsParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.txt.TXTParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.video.FLVParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.xml.FictionBookParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.xml.XMLParser
-
- getSuppressDuplicateOverlappingText() - Method in class org.apache.tika.parser.pdf.PDFParser
-
- getSuppressDuplicateOverlappingText() - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
- getSwath() - Method in class org.apache.tika.parser.chm.lzx.ChmSection
-
- getSyncBits(int) - Method in class org.apache.tika.parser.chm.lzx.ChmSection
-
- getSystem_uuid() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns system uuid
- getSystemId() - Method in class org.apache.tika.example.ImportContextImpl
-
- getTableOffset() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcResetTable
-
Gets a table offset
- getTag() - Method in exception org.apache.tika.io.TaggedIOException
-
Returns the object reference used as the tag this exception.
- getTag() - Method in class org.apache.tika.parser.microsoft.WordExtractor.TagAndStyle
-
- getTag() - Method in exception org.apache.tika.sax.TaggedSAXException
-
Returns the object reference used as the tag this exception.
- getTagsPresent() - Method in class org.apache.tika.parser.mp3.CompositeTagHandler
-
- getTagsPresent() - Method in interface org.apache.tika.parser.mp3.ID3Tags
-
Does the file contain this kind of tags?
- getTagsPresent() - Method in class org.apache.tika.parser.mp3.ID3v1Handler
-
- getTagsPresent() - Method in class org.apache.tika.parser.mp3.ID3v22Handler
-
- getTagsPresent() - Method in class org.apache.tika.parser.mp3.ID3v23Handler
-
- getTagsPresent() - Method in class org.apache.tika.parser.mp3.ID3v24Handler
-
- getTagString(byte[], int, int) - Static method in class org.apache.tika.parser.mp3.ID3v2Frame
-
Returns the (possibly null padded) String at the given offset and
length.
- getTail() - Method in class org.apache.tika.io.TailStream
-
Returns an array with the last data read from the underlying stream.
- getTessdataPath() - Method in class org.apache.tika.parser.ocr.TesseractOCRConfig
-
- getTesseractPath() - Method in class org.apache.tika.parser.ocr.TesseractOCRConfig
-
- getText() - Method in class org.apache.tika.parser.mp3.ID3Tags.ID3Comment
-
Gets the text, if present
- getText() - Method in class org.apache.tika.sax.Link
-
- getText(InputStream, HttpHeaders, UriInfo) - Method in class org.apache.tika.server.resource.TikaResource
-
- getTextDocument() - Method in class org.apache.tika.parser.html.BoilerpipeContentHandler
-
Retrieves the built TextDocument
- getTextFromMultipart(Attachment, UriInfo) - Method in class org.apache.tika.server.resource.TikaResource
-
- getTimeout() - Method in class org.apache.tika.parser.ocr.TesseractOCRConfig
-
- getTimeout() - Method in class org.apache.tika.parser.strings.StringsConfig
-
Returns the maximum time (in seconds) to wait for the "strings" command
to terminate.
- getTitle() - Method in class org.apache.tika.parser.mp3.CompositeTagHandler
-
- getTitle() - Method in interface org.apache.tika.parser.mp3.ID3Tags
-
- getTitle() - Method in class org.apache.tika.parser.mp3.ID3v1Handler
-
- getTitle() - Method in class org.apache.tika.parser.mp3.ID3v22Handler
-
- getTitle() - Method in class org.apache.tika.parser.mp3.ID3v23Handler
-
- getTitle() - Method in class org.apache.tika.parser.mp3.ID3v24Handler
-
- getTitle() - Method in class org.apache.tika.sax.Link
-
- getTotal() - Method in class org.apache.tika.parser.chm.lzx.ChmSection
-
- getTrackingMetadata() - Method in class org.apache.tika.parser.mbox.MboxParser
-
- getTrackNumber() - Method in class org.apache.tika.parser.mp3.CompositeTagHandler
-
- getTrackNumber() - Method in interface org.apache.tika.parser.mp3.ID3Tags
-
The number of the track within the album / recording
- getTrackNumber() - Method in class org.apache.tika.parser.mp3.ID3v1Handler
-
- getTrackNumber() - Method in class org.apache.tika.parser.mp3.ID3v22Handler
-
- getTrackNumber() - Method in class org.apache.tika.parser.mp3.ID3v23Handler
-
- getTrackNumber() - Method in class org.apache.tika.parser.mp3.ID3v24Handler
-
- getTranslator() - Method in class org.apache.tika.config.TikaConfig
-
Returns the configured translator instance.
- getTranslator() - Method in class org.apache.tika.language.translate.CachedTranslator
-
- getTranslator() - Method in class org.apache.tika.language.translate.DefaultTranslator
-
Returns the current translator
- getTranslator() - Method in class org.apache.tika.Tika
-
Returns the translator instance used by this facade.
- getTranslators() - Method in class org.apache.tika.language.translate.DefaultTranslator
-
Returns all available translators
- getType() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- getType() - Method in class org.apache.tika.mime.MediaType
-
Return the Type of the MediaType, such as
"text" for "text/plain"
- getType() - Method in class org.apache.tika.mime.MimeType
-
Returns the normalized media type name.
- getType() - Method in class org.apache.tika.parser.image.ICNSType
-
- getType() - Method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
-
- getType() - Method in enum org.apache.tika.parser.microsoft.OfficeParser.POIFSDocumentType
-
- getType() - Method in class org.apache.tika.sax.BasicContentHandlerFactory
-
- getType() - Method in class org.apache.tika.sax.Link
-
- getTypes() - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Returns the set of all known canonical media types.
- getUByte(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
get the unsigned value of a byte.
- getUIntBE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE unsigned int value from a byte array
- getUIntBE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE unsigned int value from a byte array
- getUIntLE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE unsigned int value from a byte array
- getUIntLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE unsigned int value from a byte array
- getUMLSPass() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
Returns the UMLS password.
- getUMLSUser() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
Returns the UMLS username.
- getUncompressedLen() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcResetTable
-
Gets uncompressed length
- getUniformTypeIdentifier() - Method in class org.apache.tika.mime.MimeType
-
Get the UTI for this mime type.
- getUnknown() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcResetTable
-
Gets unknown
- getUnknown0008() - Method in class org.apache.tika.parser.chm.accessor.ChmPmglHeader
-
- getUnknown_000c() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns unknown_00c value
- getUnknown_000c() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns 000c unknown bytes
- getUnknown_0024() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns 0024 unknown bytes
- getUnknown_002c() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns 002c unknown bytes
- getUnknown_0044() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns 0044 unknown bytes
- getUnknown_18() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcControlData
-
Returns unknown 18 bytes
- getUnknownLen() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns unknown length
- getUnknownOffset() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns unknown offset
- getUri() - Method in class org.apache.tika.sax.Link
-
- getUserInterrupted() - Method in class org.apache.tika.batch.BatchProcessDriverCLI
-
- getUShortBE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE unsigned short value from the beginning of a byte array
- getUShortBE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE unsigned short value from a byte array
- getUShortLE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE unsigned short value from the beginning of a byte array
- getUShortLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE unsigned short value from a byte array
- getValues(Property) - Method in class org.apache.tika.metadata.Metadata
-
Get the values associated to a metadata name.
- getValues(String) - Method in class org.apache.tika.metadata.Metadata
-
Get the values associated to a metadata name.
- getValues(Property) - Method in class org.apache.tika.xmp.XMPMetadata
-
- getValues(String) - Method in class org.apache.tika.xmp.XMPMetadata
-
Returns the value of a simple property or all if the property is an array and the elements
are of simple type.
- getValueType() - Method in class org.apache.tika.metadata.Property
-
- getVersion() - Method in class org.apache.tika.parser.chm.accessor.ChmItsfHeader
-
Returns itsf header version
- getVersion() - Method in class org.apache.tika.parser.chm.accessor.ChmItspHeader
-
Returns version of itsp header
- getVersion() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcControlData
-
Returns a version of control data block
- getVersion() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcResetTable
-
Returns the version
- getVersion() - Method in class org.apache.tika.parser.mp3.AudioFrame
-
- getVersion() - Method in class org.apache.tika.server.resource.TikaVersion
-
- getVersionCode() - Method in class org.apache.tika.parser.mp3.AudioFrame
-
Get the version code.
- getWelcomeHTML() - Method in class org.apache.tika.server.resource.TikaWelcome
-
- getWelcomePlain() - Method in class org.apache.tika.server.resource.TikaWelcome
-
- getWidth() - Method in class org.apache.tika.parser.image.ICNSType
-
- getWindow() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getWindowPosition() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getWindowSize() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcControlData
-
Returns a window size
- getWindowSize(int) - Static method in class org.apache.tika.parser.chm.core.ChmCommons
-
LZX supports window sizes of 2^15 (32Kb) through 2^21 (2Mb) Returns X,
i.e 2^X
- getWindowSize() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- getWindowsPerReset() - Method in class org.apache.tika.parser.chm.accessor.ChmLzxcControlData
-
Returns windows per reset
- getWrappedParser() - Method in class org.apache.tika.parser.ParserDecorator
-
Gets the parser wrapped by this ParserDecorator
- getXHTML(ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
-
- getXHTML(ContentHandler, Metadata, ParseContext) - Method in interface org.apache.tika.parser.microsoft.ooxml.OOXMLExtractor
-
Parses the document into a sequence of XHTML SAX events sent to the
given content handler.
- getXHTML(ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator
-
- getXML(InputStream, HttpHeaders, UriInfo) - Method in class org.apache.tika.server.resource.TikaResource
-
- getXMLFromMultipart(Attachment, UriInfo) - Method in class org.apache.tika.server.resource.TikaResource
-
- getXMLifiedLogMsg(String, String, String...) - Method in class org.apache.tika.batch.FileResourceConsumer
-
- getXMLifiedLogMsg(String, String, Throwable, String...) - Method in class org.apache.tika.batch.FileResourceConsumer
-
Use this for structured output that captures resourceId and other attributes.
- getXMLInputFactory() - Method in class org.apache.tika.parser.ParseContext
-
Returns the StAX input factory specified in this parsing context.
- getXMLReader() - Method in class org.apache.tika.parser.ParseContext
-
Returns the XMLReader specified in this parsing context.
- getXMPData() - Method in class org.apache.tika.xmp.XMPMetadata
-
Provides direct access to the XMP data model, in case a client prefers to work directly on it
instead of using the Metadata API
- getXMPMeta() - Method in class org.apache.tika.xmp.convert.AbstractConverter
-
- getYear() - Method in class org.apache.tika.parser.mp3.CompositeTagHandler
-
- getYear() - Method in interface org.apache.tika.parser.mp3.ID3Tags
-
- getYear() - Method in class org.apache.tika.parser.mp3.ID3v1Handler
-
- getYear() - Method in class org.apache.tika.parser.mp3.ID3v22Handler
-
- getYear() - Method in class org.apache.tika.parser.mp3.ID3v23Handler
-
- getYear() - Method in class org.apache.tika.parser.mp3.ID3v24Handler
-
- GLOB_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- GoogleTranslator - Class in org.apache.tika.language.translate
-
- GoogleTranslator() - Constructor for class org.apache.tika.language.translate.GoogleTranslator
-
- GrabPhoneNumbersExample - Class in org.apache.tika.example
-
- GrabPhoneNumbersExample() - Constructor for class org.apache.tika.example.GrabPhoneNumbersExample
-
- GREETING - Static variable in class org.apache.tika.server.resource.TikaResource
-
- GRIB_MIME_TYPE - Static variable in class org.apache.tika.parser.grib.GribParser
-
- GribParser - Class in org.apache.tika.parser.grib
-
- GribParser() - Constructor for class org.apache.tika.parser.grib.GribParser
-
- GrobidNERecogniser - Class in org.apache.tika.parser.ner.grobid
-
- GrobidNERecogniser() - Constructor for class org.apache.tika.parser.ner.grobid.GrobidNERecogniser
-
- GrobidRESTParser - Class in org.apache.tika.parser.journal
-
- GrobidRESTParser() - Constructor for class org.apache.tika.parser.journal.GrobidRESTParser
-
- ICNS_1024x1024_2X_JPEG_PNG_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_128x128_24BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_128x128_2X_JPEG_PNG_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_128x128_8BIT_MASK - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_128x128_JPEG_PNG_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_16x12_1BIT_IMAGE_AND_MASK - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_16x12_4BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_16x12_8BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_16x16_1BIT_IMAGE_AND_MASK - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_16x16_24BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_16x16_2X_JPEG_PNG_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_16x16_4BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_16x16_8BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_16x16_8BIT_MASK - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_16x16_JPEG_PNG_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_256x256_2X_JPEG_PNG_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_256x256_JPEG_PNG_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_32x32_1BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_32x32_1BIT_IMAGE_AND_MASK - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_32x32_24BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_32x32_2X_JPEG_PNG_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_32x32_4BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_32x32_8BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_32x32_8BIT_MASK - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_32x32_JPEG_PNG_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_48x48_1BIT_IMAGE_AND_MASK - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_48x48_24BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_48x48_4BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_48x48_8BIT_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_48x48_8BIT_MASK - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_512x512_JPEG_PNG_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_64x64_JPEG_PNG_IMAGE - Static variable in class org.apache.tika.parser.image.ICNSType
-
- ICNS_MIME_TYPE - Static variable in class org.apache.tika.parser.image.ICNSParser
-
- ICNSParser - Class in org.apache.tika.parser.image
-
A basic parser class for Apple ICNS icon files
- ICNSParser() - Constructor for class org.apache.tika.parser.image.ICNSParser
-
- ICNSType - Class in org.apache.tika.parser.image
-
Holds details on Apple ICNS icons
- IContentHandlerFactoryBuilder - Interface in org.apache.tika.batch.builders
-
- ICrawlerBuilder - Interface in org.apache.tika.batch.builders
-
- Icu4jEncodingDetector - Class in org.apache.tika.parser.txt
-
- Icu4jEncodingDetector() - Constructor for class org.apache.tika.parser.txt.Icu4jEncodingDetector
-
- id - Variable in class org.apache.tika.parser.rtf.ListDescriptor
-
- ID3Comment(String) - Constructor for class org.apache.tika.parser.mp3.ID3Tags.ID3Comment
-
Creates an ID3 v1 style comment tag
- ID3Comment(String, String, String) - Constructor for class org.apache.tika.parser.mp3.ID3Tags.ID3Comment
-
Creates an ID3 v2 style comment tag
- ID3Tags - Interface in org.apache.tika.parser.mp3
-
Interface that defines the common interface for ID3 tag parsers,
such as ID3v1 and ID3v2.3.
- ID3Tags.ID3Comment - Class in org.apache.tika.parser.mp3
-
Represents a comments in ID3 (especially ID3 v2), where are
made up of several parts
- ID3TagsAndAudio() - Constructor for class org.apache.tika.parser.mp3.Mp3Parser.ID3TagsAndAudio
-
- ID3v1Handler - Class in org.apache.tika.parser.mp3
-
This is used to parse ID3 Version 1 Tag information from an MP3 file,
if available.
- ID3v1Handler(InputStream, ContentHandler) - Constructor for class org.apache.tika.parser.mp3.ID3v1Handler
-
- ID3v1Handler(byte[]) - Constructor for class org.apache.tika.parser.mp3.ID3v1Handler
-
Creates from the last 128 bytes of a stream.
- ID3v22Handler - Class in org.apache.tika.parser.mp3
-
This is used to parse ID3 Version 2.2 Tag information from an MP3 file,
if available.
- ID3v22Handler(ID3v2Frame) - Constructor for class org.apache.tika.parser.mp3.ID3v22Handler
-
- ID3v23Handler - Class in org.apache.tika.parser.mp3
-
This is used to parse ID3 Version 2.3 Tag information from an MP3 file,
if available.
- ID3v23Handler(ID3v2Frame) - Constructor for class org.apache.tika.parser.mp3.ID3v23Handler
-
- ID3v24Handler - Class in org.apache.tika.parser.mp3
-
This is used to parse ID3 Version 2.4 Tag information from an MP3 file,
if available.
- ID3v24Handler(ID3v2Frame) - Constructor for class org.apache.tika.parser.mp3.ID3v24Handler
-
- ID3v2Frame - Class in org.apache.tika.parser.mp3
-
A frame of ID3v2 data, which is then passed to a handler to
be turned into useful data.
- ID3v2Frame.RawTag - Class in org.apache.tika.parser.mp3
-
- ID3v2Frame.RawTagIterator - Class in org.apache.tika.parser.mp3
-
Iterates over id3v2 raw tags.
- ID3v2Frame.TextEncoding - Class in org.apache.tika.parser.mp3
-
- ID_PROPERTY - Static variable in class org.apache.tika.language.translate.MicrosoftTranslator
-
- IDENTIFIER - Static variable in interface org.apache.tika.metadata.DublinCore
-
Recommended best practice is to identify the resource by means of
a string or number conforming to a formal identification system.
- IDENTIFIER - Static variable in class org.apache.tika.metadata.Metadata
-
- IDENTIFIER - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- IDENTIFIER - Static variable in interface org.apache.tika.metadata.XMP
-
An unordered array of text strings that unambiguously identify the resource
within a given context.
- identifyEndpoints() - Method in class org.apache.tika.server.resource.TikaWelcome
-
- identifyStaticServiceProviders(Class<T>) - Method in class org.apache.tika.config.ServiceLoader
-
Returns the defined static service providers of the given type, without
attempting to load them.
- IdentityHtmlMapper - Class in org.apache.tika.parser.html
-
Alternative HTML mapping rules that pass the input HTML as-is without any
modifications.
- IdentityHtmlMapper() - Constructor for class org.apache.tika.parser.html.IdentityHtmlMapper
-
- IFileProcessorFutureResult - Interface in org.apache.tika.batch
-
stub interface to allow for different result types from different processors
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.parser.dif.DIFContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator.XSSFSheetInterestingPartsCapturer
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.parser.xml.ElementMetadataHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.DIFContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.LinkContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.SafeContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.SecureContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.TeeContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.TextContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.ToTextContentHandler
-
Writes the given ignorable characters to the given character stream.
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.WriteOutContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
-
- IGNORE - Static variable in interface org.apache.tika.config.LoadErrorHandler
-
Strategy that simply ignores all problems.
- image(String) - Static method in class org.apache.tika.mime.MediaType
-
- IMAGE_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
-
Deprecated.
- IMAGE_COUNT - Static variable in interface org.apache.tika.metadata.Office
-
The number of Images in the document
- IMAGE_CREATOR - Static variable in interface org.apache.tika.metadata.IPTC
-
Creator or creators of the image.
- IMAGE_CREATOR_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
The ID of the creator or creators of the image.
- IMAGE_CREATOR_ID_WRONG_CASE - Static variable in interface org.apache.tika.metadata.IPTC
-
- IMAGE_CREATOR_NAME - Static variable in interface org.apache.tika.metadata.IPTC
-
The name of the creator or creators of the image.
- IMAGE_LENGTH - Static variable in interface org.apache.tika.metadata.TIFF
-
"Image height in pixels."
- IMAGE_REGISTRY_ENTRY - Static variable in interface org.apache.tika.metadata.IPTC
-
Both a Registry Item Id and a Registry Organisation Id to record any
registration of this item with a registry.
- IMAGE_SUPPLIER - Static variable in interface org.apache.tika.metadata.IPTC
-
Identifies the most recent supplier of the item, who is not necessarily
its owner or creator.
- IMAGE_SUPPLIER_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
Identifies the most recent supplier of the item, who is not necessarily
its owner or creator.
- IMAGE_SUPPLIER_ID_WRONG_CASE - Static variable in interface org.apache.tika.metadata.IPTC
-
- IMAGE_SUPPLIER_IMAGE_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
Optional identifier assigned by the Image Supplier to the image.
- IMAGE_SUPPLIER_NAME - Static variable in interface org.apache.tika.metadata.IPTC
-
Identifies the most recent supplier of the item, who is not necessarily
its owner or creator.
- IMAGE_WIDTH - Static variable in interface org.apache.tika.metadata.TIFF
-
"Image width in pixels."
- ImageMetadataExtractor - Class in org.apache.tika.parser.image
-
Uses the
Metadata Extractor library
to read EXIF and IPTC image metadata and map to Tika fields.
- ImageMetadataExtractor(Metadata) - Constructor for class org.apache.tika.parser.image.ImageMetadataExtractor
-
- ImageMetadataExtractor(Metadata, ImageMetadataExtractor.DirectoryHandler...) - Constructor for class org.apache.tika.parser.image.ImageMetadataExtractor
-
- ImageParser - Class in org.apache.tika.parser.image
-
- ImageParser() - Constructor for class org.apache.tika.parser.image.ImageParser
-
- ImportContextImpl - Class in org.apache.tika.example
-
ImportContextImpl
...
- ImportContextImpl(Item, String, InputContext, InputStream, IOListener, Detector) - Constructor for class org.apache.tika.example.ImportContextImpl
-
Creates a new item import context.
- increaseFramesRead() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxState
-
- incrementHandledExceptions() - Method in class org.apache.tika.batch.FileResourceConsumer
-
Make sure to call this appropriately!
- incrementLevel(int, AbstractListManager.LevelTuple[]) - Method in class org.apache.tika.parser.microsoft.AbstractListManager.ParagraphLevelCounter
-
Apply this to every numbered paragraph in order.
- indexContentSpecificMet(File) - Method in class org.apache.tika.example.MetadataAwareLuceneIndexer
-
- indexDocument(File) - Method in class org.apache.tika.example.LuceneIndexer
-
- indexDocument(File) - Method in class org.apache.tika.example.LuceneIndexerExtended
-
- indexOf(byte[], byte[]) - Static method in class org.apache.tika.parser.chm.core.ChmCommons
-
Searches some pattern in byte[]
- indexOf(List<DirectoryListingEntry>, String) - Static method in class org.apache.tika.parser.chm.core.ChmCommons
-
Searches for some pattern in the directory listing entry list
- indexOfResetTableBlock(byte[], byte[]) - Static method in class org.apache.tika.parser.chm.core.ChmCommons
-
Returns an index of the reset table
- indexWithDublinCore(File) - Method in class org.apache.tika.example.MetadataAwareLuceneIndexer
-
- informCompleted(boolean) - Method in class org.apache.tika.example.ImportContextImpl
-
- init() - Method in class org.apache.tika.batch.ConsumersManager
-
This is called by BatchProcess before submitting the threads
- init() - Method in class org.apache.tika.batch.fs.FSConsumersManager
-
- init(DataInputStream, DataOutputStream) - Method in interface org.apache.tika.fork.ForkProxy
-
- init(TikaConfig, DigestingParser.Digester) - Static method in class org.apache.tika.server.resource.TikaResource
-
- INITIAL_AUTHOR - Static variable in interface org.apache.tika.metadata.Office
-
Name of the initial creator/author of a document
- initialize(URL) - Method in class org.apache.tika.parser.geo.topic.GeoParser
-
Initializes this parser
- initProfiles() - Static method in class org.apache.tika.language.LanguageIdentifier
-
Deprecated.
Builds the language profiles.
- initProfiles(Map<String, LanguageProfile>) - Static method in class org.apache.tika.language.LanguageIdentifier
-
Deprecated.
Initializes the language profiles from a user supplied initialized Map.
- INPUT_FILE_TOKEN - Static variable in class org.apache.tika.parser.external.ExternalParser
-
The token, which if present in the Command string, will
be replaced with the input filename.
- inputFilterEnabled() - Method in class org.apache.tika.parser.txt.CharsetDetector
-
Test whether or not input filtering is enabled.
- INSTANCE - Static variable in class org.apache.tika.detect.EmptyDetector
-
Singleton instance of this class.
- INSTANCE - Static variable in class org.apache.tika.parser.EmptyParser
-
Singleton instance of this class.
- INSTANCE - Static variable in class org.apache.tika.parser.ErrorParser
-
Singleton instance of this class.
- INSTANCE - Static variable in class org.apache.tika.parser.html.DefaultHtmlMapper
-
- INSTANCE - Static variable in class org.apache.tika.parser.html.IdentityHtmlMapper
-
- INSTANCE - Static variable in class org.apache.tika.sax.xpath.AttributeMatcher
-
- INSTANCE - Static variable in class org.apache.tika.sax.xpath.ElementMatcher
-
- INSTANCE - Static variable in class org.apache.tika.sax.xpath.NodeMatcher
-
- INSTANCE - Static variable in class org.apache.tika.sax.xpath.TextMatcher
-
- INSTANCEID - Static variable in interface org.apache.tika.metadata.XMPMM
-
An identifier for a specific incarnation of a resource, updated
each time a file is saved.
- inStartElement - Variable in class org.apache.tika.sax.ToXMLContentHandler
-
- INSTITUTION - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- INSTRUCTIONS - Static variable in interface org.apache.tika.metadata.IPTC
-
Any of a number of instructions from the provider or creator to the
receiver of the item.
- INSTRUCTIONS - Static variable in interface org.apache.tika.metadata.Photoshop
-
- INSTRUMENT - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The musical instrument."
- intelE8Decoding() - Method in class org.apache.tika.parser.chm.lzx.ChmLzxBlock
-
- INTELLECTUAL_GENRE - Static variable in interface org.apache.tika.metadata.IPTC
-
Describes the nature, intellectual, artistic or journalistic
characteristic of a item, not specifically its content.
- internalBoolean(String) - Static method in class org.apache.tika.metadata.Property
-
- internalClosedChoise(String, String...) - Static method in class org.apache.tika.metadata.Property
-
- internalDate(String) - Static method in class org.apache.tika.metadata.Property
-
- internalInteger(String) - Static method in class org.apache.tika.metadata.Property
-
- internalIntegerSequence(String) - Static method in class org.apache.tika.metadata.Property
-
- internalOpenChoise(String, String...) - Static method in class org.apache.tika.metadata.Property
-
- internalRational(String) - Static method in class org.apache.tika.metadata.Property
-
- internalReal(String) - Static method in class org.apache.tika.metadata.Property
-
- internalText(String) - Static method in class org.apache.tika.metadata.Property
-
- internalTextBag(String) - Static method in class org.apache.tika.metadata.Property
-
- internalURI(String) - Static method in class org.apache.tika.metadata.Property
-
- InterruptableParsingExample - Class in org.apache.tika.example
-
This example demonstrates how to interrupt document parsing if
some condition is met.
- InterruptableParsingExample() - Constructor for class org.apache.tika.example.InterruptableParsingExample
-
- Interrupter - Class in org.apache.tika.batch
-
Class that waits for input on System.in.
- Interrupter() - Constructor for class org.apache.tika.batch.Interrupter
-
- InterrupterBuilder - Class in org.apache.tika.batch.builders
-
Builds an Interrupter
- InterrupterBuilder() - Constructor for class org.apache.tika.batch.builders.InterrupterBuilder
-
- InterrupterFutureResult - Class in org.apache.tika.batch
-
- InterrupterFutureResult() - Constructor for class org.apache.tika.batch.InterrupterFutureResult
-
- IO_IS - Static variable in class org.apache.tika.batch.FileResourceConsumer
-
- IO_OS - Static variable in class org.apache.tika.batch.FileResourceConsumer
-
- IOExceptionWithCause - Exception in org.apache.tika.io
-
Subclasses IOException with the
Throwable
constructors missing before Java 6.
- IOExceptionWithCause(String, Throwable) - Constructor for exception org.apache.tika.io.IOExceptionWithCause
-
Constructs a new instance with the given message and cause.
- IOExceptionWithCause(Throwable) - Constructor for exception org.apache.tika.io.IOExceptionWithCause
-
Constructs a new instance with the given cause.
- IOUtils - Class in org.apache.tika.io
-
General IO stream manipulation utilities.
- IOUtils() - Constructor for class org.apache.tika.io.IOUtils
-
Instances should NOT be constructed in standard programming.
- IParserFactoryBuilder - Interface in org.apache.tika.batch.builders
-
- IPTC - Interface in org.apache.tika.metadata
-
IPTC photo metadata schema.
- IPTC_LAST_EDITED - Static variable in interface org.apache.tika.metadata.IPTC
-
The date and optionally time when any of the IPTC photo metadata fields
has been last edited
- IptcAnpaParser - Class in org.apache.tika.parser.iptc
-
Parser for IPTC ANPA New Wire Feeds
- IptcAnpaParser() - Constructor for class org.apache.tika.parser.iptc.IptcAnpaParser
-
- isActive() - Method in class org.apache.tika.batch.FileResourceCrawler
-
If the crawler stops for any reason, it is no longer active.
- isAnchor() - Method in class org.apache.tika.sax.Link
-
- ISArchiveParser - Class in org.apache.tika.parser.isatab
-
- ISArchiveParser() - Constructor for class org.apache.tika.parser.isatab.ISArchiveParser
-
Default constructor.
- ISArchiveParser(String) - Constructor for class org.apache.tika.parser.isatab.ISArchiveParser
-
Constructor that accepts the pathname of ISArchive folder.
- ISATabUtils - Class in org.apache.tika.parser.isatab
-
- ISATabUtils() - Constructor for class org.apache.tika.parser.isatab.ISATabUtils
-
- isAudioHeader(int, int, int, int) - Static method in class org.apache.tika.parser.mp3.AudioFrame
-
Does this appear to be a 4 byte audio frame header?
- isAvailable() - Method in class org.apache.tika.language.translate.CachedTranslator
-
- isAvailable() - Method in class org.apache.tika.language.translate.DefaultTranslator
-
- isAvailable() - Method in class org.apache.tika.language.translate.EmptyTranslator
-
- isAvailable() - Method in class org.apache.tika.language.translate.GoogleTranslator
-
- isAvailable() - Method in class org.apache.tika.language.translate.Lingo24Translator
-
- isAvailable() - Method in class org.apache.tika.language.translate.MicrosoftTranslator
-
Check whether this instance has a working property file and its keys are not the defaults.
- isAvailable() - Method in class org.apache.tika.language.translate.MosesTranslator
-
- isAvailable() - Method in interface org.apache.tika.language.translate.Translator
-
- isAvailable() - Method in class org.apache.tika.language.translate.YandexTranslator
-
- isAvailable() - Method in class org.apache.tika.parser.geo.topic.GeoParser
-
- isAvailable() - Method in class org.apache.tika.parser.ner.corenlp.CoreNLPNERecogniser
-
- isAvailable() - Method in class org.apache.tika.parser.ner.grobid.GrobidNERecogniser
-
- isAvailable() - Method in class org.apache.tika.parser.ner.mitie.MITIENERecogniser
-
- isAvailable() - Method in interface org.apache.tika.parser.ner.NERecogniser
-
checks if this Named Entity recogniser is available for service
- isAvailable() - Method in class org.apache.tika.parser.ner.nltk.NLTKNERecogniser
-
- isAvailable() - Method in class org.apache.tika.parser.ner.opennlp.OpenNLPNameFinder
-
- isAvailable() - Method in class org.apache.tika.parser.ner.opennlp.OpenNLPNERecogniser
-
- isAvailable() - Method in class org.apache.tika.parser.ner.regex.RegexNERecogniser
-
- isCatchIntermediateIOExceptions() - Method in class org.apache.tika.parser.pdf.PDFParserConfig
-
- isCauseOf(IOException) - Method in class org.apache.tika.io.TaggedInputStream
-
Tests if the given exception was caused by this stream.
- isCauseOf(SAXException) - Method in class org.apache.tika.sax.TaggedContentHandler
-
Tests if the given exception was caused by this handler.
- isCompleted() - Method in class org.apache.tika.example.ImportContextImpl
-
- isConverterAvailable(String) - Static method in class org.apache.tika.xmp.convert.TikaToXMP
-
Check if there is a converter available which allows to convert the Tika metadata to XMP
- isDiscardElement(String) - Method in class org.apache.tika.parser.html.DefaultHtmlMapper
-
- isDiscardElement(String) - Method in interface org.apache.tika.parser.html.HtmlMapper
-
Checks whether all content within the given HTML element should be
discarded instead of including it in the parse output.
- isDiscardElement(String) - Method in class org.apache.tika.parser.html.HtmlParser
-
- isDiscardElement(String) - Method in class org.apache.tika.parser.html.IdentityHtmlMapper
-
- isDynamic() - Method in class org.apache.tika.config.ServiceLoader
-
Returns if the service loader is static or dynamic
- isEmpty(String) - Static method in class org.apache.tika.parser.chm.core.ChmCommons
-
- isExternal() - Method in class org.apache.tika.metadata.Property
-
- isExtractorFinished() - Method in class org.apache.tika.example.LazyTextExtractorField
-
Checks whether the text extraction task has finished.
- isHeading() - Method in class org.apache.tika.parser.microsoft.WordExtractor.TagAndStyle
-
- isIframe() - Method in class org.apache.tika.sax.Link
-
- isImage() - Method in class org.apache.tika.sax.Link
-
- isIncludeMarkup() - Method in class org.apache.tika.parser.html.BoilerpipeContentHandler
-
- isInstanceOf(MediaType, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Checks whether the given media type equals the given base type or
is a specialization of it.
- isInstanceOf(String, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Parses and normalises the given media type string and checks whether
the result equals the given base type or is a specialization of it.
- isInternal() - Method in class org.apache.tika.metadata.Property
-
- isInvalid(int) - Method in class org.apache.tika.sax.SafeContentHandler
-
Checks whether the given Unicode character is an invalid XML character
and should be replaced for output.
- isLanguage(String) - Method in class org.apache.tika.language.detect.LanguageResult
-
Return true if the target language matches the detected language.
- isLink() - Method in class org.apache.tika.sax.Link
-
- isListenForAllRecords() - Method in class org.apache.tika.parser.microsoft.ExcelExtractor
-
Returns true
if this parser is configured to listen
for all records instead of just the specified few.
- isMacroLanguage(String) - Static method in class org.apache.tika.language.detect.LanguageNames
-
- isMatchingElement(String, String) - Method in class org.apache.tika.parser.xml.ElementMetadataHandler
-
- isMatchingParentElement(String, String) - Method in class org.apache.tika.parser.xml.ElementMetadataHandler
-
- isMetadataField(String) - Static method in class org.apache.tika.parser.image.MetadataFields
-
- isMetadataField(Property) - Static method in class org.apache.tika.parser.image.MetadataFields
-
- isMimetype() - Method in class org.apache.tika.parser.strings.FileConfig
-
Returns true
if the mime option is enabled.
- isMixedLanguages() - Method in class org.apache.tika.language.detect.LanguageDetector
-
- isMostlyAscii() - Method in class org.apache.tika.detect.TextStatistics
-
Checks whether at least one byte was seen and that the bytes that
were seen were mostly plain text (i.e.
- isMSB() - Method in class org.apache.tika.parser.executable.MachineMetadata.Endian
-
- isMultiValued(Property) - Method in class org.apache.tika.metadata.Metadata
-
Returns true if named value is multivalued.
- isMultiValued(String) - Method in class org.apache.tika.metadata.Metadata
-
Returns true if named value is multivalued.
- isMultiValued(Property) - Method in class org.apache.tika.xmp.XMPMetadata
-
- isMultiValued(String) - Method in class org.apache.tika.xmp.XMPMetadata
-
Checks if the named property is an array.
- isMultiValuePermitted() - Method in class org.apache.tika.metadata.Property
-
Is the PropertyType one which accepts multiple values?
- ISO_SPEED_RATINGS - Static variable in interface org.apache.tika.metadata.TIFF
-
"ISO Speed and ISO Latitude of the input device as specified in ISO 12232"
- isPrettyPrint() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
Returns true
if formatted output is enabled, false
otherwise.
- isQueueEmpty() - Method in class org.apache.tika.batch.FileResourceCrawler
-
Use sparingly.
- isQuoteAssignmentValues() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Gets whether or not to quote assignment values, i.e.
- isReasonablyCertain() - Method in class org.apache.tika.language.detect.LanguageResult
-
- isReasonablyCertain() - Method in class org.apache.tika.language.LanguageIdentifier
-
Deprecated.
Tries to judge whether the identification is certain enough
to be trusted.
- ISREGEX_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- isScript() - Method in class org.apache.tika.sax.Link
-
- isSerialize() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
Returns true
if CAS serialization is enabled, false
otherwise.
- isShortText() - Method in class org.apache.tika.language.detect.LanguageDetector
-
- isSpecializationOf(MediaType, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Checks whether the given media type a is a specialization of a more
generic type b.
- isStillActive() - Method in class org.apache.tika.batch.FileResourceConsumer
-
Returns whether or not the consumer is still could process
a file or is still processing a file (ACTIVELY_CONSUMING or ASKED_TO_SHUTDOWN)
- isStyle - Variable in class org.apache.tika.parser.rtf.ListDescriptor
-
- isSupported(TikaInputStream) - Method in interface org.apache.tika.extractor.ContainerExtractor
-
Is this Container Extractor able to process the
supplied container?
- isSupported(TikaInputStream) - Method in class org.apache.tika.extractor.ParserContainerExtractor
-
- isSupported(String) - Static method in class org.apache.tika.utils.CharsetUtils
-
Safely return whether is supported, without throwing exceptions
- isText() - Method in class org.apache.tika.parser.ctakes.CTAKESConfig
-
Returns true
if content text analysis is enabled false
otherwise.
- isTikaInputStream(InputStream) - Static method in class org.apache.tika.io.TikaInputStream
-
Checks whether the given stream is a TikaInputStream instance.
- isTracking() - Method in class org.apache.tika.parser.mbox.MboxParser
-
- isUnknown() - Method in class org.apache.tika.language.detect.LanguageResult
-
- isUnordered(int) - Method in class org.apache.tika.parser.rtf.ListDescriptor
-
- isValid(String) - Static method in class org.apache.tika.mime.MimeType
-
Checks that the given string is a valid Internet media type name
based on rules from RFC 2054 section 5.3.
- isWriteable(Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.CSVMessageBodyWriter
-
- isWriteable(Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.JSONMessageBodyWriter
-
- isWriteable(Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.MetadataListMessageBodyWriter
-
- isWriteable(Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.TarWriter
-
- isWriteable(Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.TextMessageBodyWriter
-
- isWriteable(Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.XMPMessageBodyWriter
-
- isWriteable(Class<?>, Type, Annotation[], MediaType) - Method in class org.apache.tika.server.writer.ZipWriter
-
- isWriteLimitReached(Throwable) - Method in class org.apache.tika.sax.WriteOutContentHandler
-
Checks whether the given exception (or any of it's root causes) was
thrown by this handler as a signal of reaching the write limit.
- ITikaToXMPConverter - Interface in org.apache.tika.xmp.convert
-
Interface for the specific Metadata
to XMP converters
- ITSF - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- ITSP - Static variable in class org.apache.tika.parser.chm.core.ChmConstants
-
- IWORK_COMMON_ENTRY - Static variable in class org.apache.tika.parser.iwork.IWorkPackageParser
-
All iWork files contain one of these, so we can detect based on it
- IWORK_CONTENT_ENTRIES - Static variable in class org.apache.tika.parser.iwork.IWorkPackageParser
-
Which files within an iWork file contain the actual content?
- IWorkPackageParser - Class in org.apache.tika.parser.iwork
-
A parser for the IWork container files.
- IWorkPackageParser() - Constructor for class org.apache.tika.parser.iwork.IWorkPackageParser
-
- IWorkPackageParser.IWORKDocumentType - Enum in org.apache.tika.parser.iwork
-