Package org.apache.tika.metadata
Interface Office
public interface Office
Office Document properties collection. These properties apply to
Office / Productivity Documents of all forms, including (but not limited
to) MS Office and OpenDocument formats.
This is a logical collection of properties, which may be drawn from a
few different external definitions.
- Since: Apache Tika 1.2
Field Summary
Modifier and Type | Field | Description
static final Property | AUTHOR | Name of the principal author(s) of a document
static final Property | CHARACTER_COUNT | The number of Characters in the document
static final Property | CHARACTER_COUNT_WITH_SPACES | The number of Characters in the document, including spaces
static final Property | CREATION_DATE | When was the document created?
static final Property | IMAGE_COUNT | The number of Images in the document
static final Property | INITIAL_AUTHOR | Name of the initial creator/author of a document
static final Property | KEYWORDS | Keywords pertaining to a document.
static final Property | LAST_AUTHOR | Name of the last (most recent) author of a document
static final Property | LINE_COUNT | The number of lines in the document
static final Property | MAPI_FROM_REPRESENTING_EMAIL |
static final Property | MAPI_FROM_REPRESENTING_NAME |
static final Property | MAPI_MESSAGE_CLASS | MAPI message class.
static final Property | MAPI_MESSAGE_CLIENT_SUBMIT_TIME |
static final Property | MAPI_SENT_BY_SERVER_TYPE |
static final String | NAMESPACE_URI_DOC_META |
static final Property | OBJECT_COUNT | The number of Objects in the document.
static final Property | OCX_NAME |
static final Property | PAGE_COUNT | The number of Pages in the (paged) document
static final Property | PARAGRAPH_COUNT | The number of individual Paragraphs in the document
static final String | PREFIX_DOC_META |
static final Property | PRINT_DATE | When was the document last printed?
static final Property | PROG_ID | Embedded files may have a "progID" associated with them, such as Word.Document.12 or AcroExch.Document.DC
static final Property | SAVE_DATE | When was the document last saved?
static final Property | SLIDE_COUNT | The number of Slides in the (presentation) document
static final Property | TABLE_COUNT | The number of Tables in the document
static final String | USER_DEFINED_METADATA_NAME_PREFIX | Prefix attached to the key names of user-defined metadata entries in the document.
static final Property | WORD_COUNT | The number of Words in the document
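The count properties above distinguish characters with and without spaces, and words. These values are normally statistics stored in the document by the application that wrote it, so the following is only a minimal sketch of what the three numbers mean, not how Tika obtains them. The class and method names are hypothetical, and the whitespace-based counting rules are an assumption; real office applications may count differently.

```java
// Hypothetical illustration of the difference between CHARACTER_COUNT,
// CHARACTER_COUNT_WITH_SPACES and WORD_COUNT. Not part of Apache Tika.
class CountSketch {

    // Counts code points that are not whitespace (assumed meaning of CHARACTER_COUNT).
    static int characters(String text) {
        int count = 0;
        for (int i = 0; i < text.length(); ) {
            int codePoint = text.codePointAt(i);
            if (!Character.isWhitespace(codePoint)) {
                count++;
            }
            i += Character.charCount(codePoint);
        }
        return count;
    }

    // Counts every code point, whitespace included
    // (assumed meaning of CHARACTER_COUNT_WITH_SPACES).
    static int charactersWithSpaces(String text) {
        return text.codePointCount(0, text.length());
    }

    // Counts whitespace-separated tokens (assumed meaning of WORD_COUNT).
    static int words(String text) {
        String trimmed = text.trim();
        return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
    }

    public static void main(String[] args) {
        String text = "Hello Tika world";
        System.out.println(characters(text));           // prints 14
        System.out.println(charactersWithSpaces(text)); // prints 16
        System.out.println(words(text));                // prints 3
    }
}
```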
Field Details
- NAMESPACE_URI_DOC_META
- PREFIX_DOC_META
- USER_DEFINED_METADATA_NAME_PREFIX
Prefix attached to the key names of user-defined metadata entries in the document. For example, <meta:user-defined meta:name="Info1">Text1</meta:user-defined> becomes custom:Info1=Text1.
- KEYWORDS
Keywords pertaining to a document. Also populates DublinCore.SUBJECT.
- INITIAL_AUTHOR
Name of the initial creator/author of a document
- LAST_AUTHOR
Name of the last (most recent) author of a document
- AUTHOR
Name of the principal author(s) of a document
- CREATION_DATE
When was the document created?
- SAVE_DATE
When was the document last saved?
- PRINT_DATE
When was the document last printed?
- SLIDE_COUNT
The number of Slides in the (presentation) document
- PAGE_COUNT
The number of Pages in the (paged) document
- PARAGRAPH_COUNT
The number of individual Paragraphs in the document
- LINE_COUNT
The number of lines in the document
- WORD_COUNT
The number of Words in the document
- CHARACTER_COUNT
The number of Characters in the document
- CHARACTER_COUNT_WITH_SPACES
The number of Characters in the document, including spaces
- TABLE_COUNT
The number of Tables in the document
- IMAGE_COUNT
The number of Images in the document
- OBJECT_COUNT
The number of Objects in the document. These are typically non-Image resources embedded in the document, such as other documents or non-Image media.
- MAPI_MESSAGE_CLASS
MAPI message class. What type of .msg/MAPI file is it?
- MAPI_SENT_BY_SERVER_TYPE
- MAPI_FROM_REPRESENTING_NAME
- MAPI_FROM_REPRESENTING_EMAIL
- MAPI_MESSAGE_CLIENT_SUBMIT_TIME
- PROG_ID
Embedded files may have a "progID" associated with them, such as Word.Document.12 or AcroExch.Document.DC
- OCX_NAME
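The mapping described under USER_DEFINED_METADATA_NAME_PREFIX, where a <meta:user-defined> element becomes a custom:Info1=Text1 entry, can be sketched with the JDK's DOM parser alone. This is a hedged illustration, not Tika's implementation: the class name, the wrapper root element and the urn:example:meta namespace URI are hypothetical, and the prefix value "custom:" is an assumption inferred from the example in the field description.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical sketch of prefixing user-defined metadata keys. Not part of Apache Tika.
class UserDefinedMetadataSketch {

    // Assumed prefix value, inferred from the "custom:Info1=Text1" example.
    static final String ASSUMED_PREFIX = "custom:";

    // Collects every <meta:user-defined meta:name="..."> element as prefix+name -> text.
    static Map<String, String> extract(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            NodeList nodes = doc.getElementsByTagName("meta:user-defined");
            Map<String, String> entries = new LinkedHashMap<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                Element element = (Element) nodes.item(i);
                String key = ASSUMED_PREFIX + element.getAttribute("meta:name");
                entries.put(key, element.getTextContent());
            }
            return entries;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse metadata XML", e);
        }
    }

    public static void main(String[] args) {
        // The root element and namespace URI are placeholders for this sketch.
        String xml = "<root xmlns:meta=\"urn:example:meta\">"
                + "<meta:user-defined meta:name=\"Info1\">Text1</meta:user-defined>"
                + "</root>";
        // prints custom:Info1=Text1
        extract(xml).forEach((key, value) -> System.out.println(key + "=" + value));
    }
}
```

Keeping user-defined entries under their own prefix avoids collisions between arbitrary user-chosen names and the fixed property names listed in this interface.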