Tutorials

The Tutorials for PDFBox provide a quick start into some of the most common uses cases for PDFBox as part of a PDF application. In addition the Cookbook provides a wide range of samples including sample code.

Tutorial Description
Bookmarks PDF Bookmarks allow for a quick navigation within documents similar to a Table of Contents. The Tutorial covers how to read bookmarks as well as how to generate them.
File References A PDF file can contain references to external files as well as files embedded in the PDF itself. The Tutorial covers how handle such references as well as how to embed a file in a PDF document.
Highlighting PDF allows for text to be highlighted. This can be useful for example to highlight results of a search. There are different ways to approach this covered in the tutorial.
Metadata PDF documents can have metadata associated with the document or certain objects within the document. As an example metadata can be used to store the author of a document as well as copyright information for an image embedded in the document. The Tutorial covers how to read and generate such metadata.
Text Extraction One of the main features of PDFBox is its ability to quickly and accurately extract text from a variety of PDF documents. The tutorial covers different approaches to handle that task.