History of Changes
- Introduction and explanation of symbols
- Version 0.7.4-dev (Planned release:unknown)
- Version 0.7.3 (10/12/2006)
- Version 0.7.2 (09/11/2005)
- Version 0.7.1 (04/10/2005)
- Version 0.7.0 (1/22/2005)
- Version 0.6.7 (10/09/2004)
- Version 0.6.6 (07/20/2004)
- Version 0.6.5 (03/08/2004)
- Version 0.6.4 (11/02/2003)
- Version 0.6.3 (09/13/2003)
- Version 0.6.2 (4/18/2003)
- Version 0.6.1 (3/9/2003)
- Version 0.6.0 (3/5/2003)
- Version 0.5.6 (11/28/2002)
- Version 0.5.5 (10/03/2002)
- Version 0.5.4 (09/17/2002)
- Version 0.5.3 (09/13/2002)
- Version 0.5.2 (09/06/2002)
- Version 0.5.1 (09/04/2002)
- Version 0.5.0 (08/31/2002)
- Version 0.4.1 (07/25/2002)
- Version 0.4.0 (07/23/2002)
- Version 0.3.0 (07/09/2002)
- Version 0.2.0 (06/03/2002)
- Version 0.1.0 (05/25/2002)
Introduction and explanation of symbols
Changes are sorted by "false" and then chronologically with the most recent at the top. These symbols denote the various action types:=add,
=fix,
=remove,
=update
Version 0.7.4-dev (Planned release:unknown)
Changes to the Code Base
-
[ 1648374 ] Unexpected end of ZLIB input stream Committed by BJL.
-
[ 1806912 ] Access to stream arrays Committed by BJL.
-
[ 1793593 ] PDFStreamEngine doesn't handle rotation of -90. Committed by BJL.
-
[ 1792084 ] spot colors' getTintTransform needs quick fix Committed by BJL.
-
Added dependency on JAI Committed by BJL.
-
[ 1666960 ] Patch for any kind of tiff based pdf. Committed by BJL.
-
Fixed issue with "Could not read all parameters to decode image" Committed by BJL.
-
[ 1726446 ] Fixing merge w&w/o acroform Committed by BJL.
-
[ 1745133 ] OutOfMemoryError exception during text Committed by BJL.
-
Upgraded to lucene 2.2.0 Committed by BJL.
-
[ 1672991 ] Specify non default printer in silentPrint mode. Committed by BJL.
-
[ 1607074 ] Portrait and Landscape PageFormat Fix Committed by BJL.
-
***API Change:Renamed PDPageContentStream.drawImage to drawXObject*** Committed by BJL.
-
[ 1600672 ] Painting Form XObjects Committed by BJL.
-
Added integration with JempBox library Committed by BJL.
-
Upgraded to IKVM 0.32.0.0 Committed by BJL.
-
[ 1674903 ] InvalidCastException in write2file for images Committed by BJL.
-
[ 1057560 ] Overlay incorrectly orders overlay pages Committed by BJL.
-
Added ability to get/set document level JavaScript. Committed by BJL.
-
Fixed issue with PNG encoding when using LZW/Flate filters Committed by BJL.
-
[ 1562539 ] Implemented "B" operator for PageDrawer Committed by BJL.
-
[ 1562539 ] PageDrawer now uses the correct color to display text Committed by BJL.
-
Upgraded to junit4.3 Committed by BJL.
-
Upgraded to Lucene-2.1.0 Committed by BJL.
Version 0.7.3 (10/12/2006)
Changes to the Code Base
-
Upgraded to Checkstyle 4.2 Committed by BJL.
-
Upgraded to IKVM 0.30.0.0 Committed by BJL.
-
[ 1546399 ] Use get/set functions for separators in PDFTextStripper Committed by BJL.
-
PDDocument.silentPrint() to print without prompting for a printer Committed by BJL.
-
[ 1544118 ] Bug in PDFont.getCodeFromArray Committed by BJL.
-
[ 1529835 ] Add COSFloat.setValue() Committed by BJL.
-
[ 1492555 ] PDChoiceField dead loop Committed by BJL.
-
[ 1499521 ] NPE PDAppearance.convertToMultiLine Committed by BJL.
-
[ 1522007 ] Error converting date Committed by BJL.
-
Upgraded to Lucene 2.0.0 Committed by BJL.
-
[ 1451164 ] Problems filling combo and radio form fields Committed by BJL.
-
Upgraded to lucene 1.9.1 Committed by BJL.
-
[ 1023133 ] Support PDF Functions Committed by BJL.
-
Added command line org.pdfbox.PDFMerger Committed by BJL.
-
***API Change*** Promoted AppendDoc from example to util package, renamed to PDFMergerUtility. Committed by BJL.
-
Upgraded to IKVM-0.24.0.1 Committed by BJL.
-
[ 1391952 ] Problem extracting embedded attachments Committed by BJL.
-
[ 1249607 ] Fixed issue with broken PDFs that contain multiple endobj Committed by BJL.
-
[ 1153174 ] Added documentation for PDFHighlighter Committed by BJL.
-
Removed log4j dependency Committed by BJL.
-
[ 974661 ] getKids() Null Pointer Exception when parsing pdf Committed by BJL.
-
Added better support for CJK encoding Committed by BJL.
-
Changed signature of PDFPageContentStream.drawImage to take float arguments instead of int Committed by BJL.
-
Fixed issue where form xobjects where not being drawn in the viewer Committed by BJL.
-
Changed signature to PDDocumentCatalog.OpenAction to be an PDDestinationOrAction instead of just action. Committed by BJL.
-
Added tolerance to text extraction sorting where text on a line was not at the same exact y coordinate but very close Committed by BJL.
-
[ 1327133 ] Printing with form data Committed by BJL.
-
Fixed issue with DateConverter that was trying to parse an empty string Committed by BJL.
-
[ 1324846 ] appending text to PDPageContentStream messes up fonts Committed by BJL.
-
Added new example ReplaceURLs to show how to replace a clickable URL in a PDF Committed by BJL.
-
Implemented annotation drawing Committed by BJL.
-
Implemented EndPath and StrokeAndClosePath operators Committed by BJL.
-
Move text extraction permission checking from PDFTextStripper to ExtractText Committed by BJL.
-
Added support for more annotations, thanks to a contribution from Paul King Committed by BJL.
-
Created new FontBox project to hold all font library code Committed by BJL.
-
Fixed issue where only the first page was sent to the printer Committed by BJL.
-
Now automatically sets the page orientation when printing Committed by BJL.
Changes to Documentation
-
Upgraded to Apache Forrest 0.8-dev Committed by BJL.
Version 0.7.2 (09/11/2005)
Changes to the Code Base
-
Upgraded to IKVM-0.20.0.0 Committed by BJL.
-
Added support to get annotations from a page and to create a RubberStamp annotation Committed by BJL.
-
Added PDDocument.print() to send the PDF to a printer. Committed by BJL.
-
[ 1276623 ] NullPointerException in PageDrawer:241 when extractin images Committed by BJL.
-
Allow creation of PDJpeg from a BufferedImage, thanks to contribution from Paul King Committed by BJL.
-
Removed PDTiff in favor of PDCcitt Committed by BJL.
-
PDFBox no longer requires log4j!! Committed by BJL.
-
New class to allow you to specify 'named' regions where text is to be extracted. Committed by BJL.
-
[ 1261555 ] Unexpected end of ZLIB input stream when stream has a zero length Committed by BJL.
-
[ 1226665 ] ImportXFDF giving NPE error Committed by BJL.
-
renamed COSDictionary.setItem( String, boolean ) to COSDictionary.setBoolean( String, boolean ) Committed by BJL.
-
Added sorting parameter to PDFTextStripper Committed by BJL.
-
Fixed issues with PDF encryption Committed by BJL.
-
Better date support, added support for PDFs that use non standard dates, support for time zone offsets Committed by BJL.
-
FlateFilter-class now supports PNG-Predictors for decoding the imagedata, thanks to a contribution from Marcel Kammer Committed by BJL.
-
Added support for extracting tiff images, thanks to a contribution from Marcel Kammer Committed by BJL.
-
Added PDDocument.removePage to remove PDF pages Committed by BJL.
-
Fixed issue when creating a COSString with a UTF 16 string Committed by BJL.
-
Committed patch for type 1 PFB font parser(special thanks to Michael Niedermair) Committed by BJL.
-
Committed patch for PNG predictors (special thanks Erik Martino) Committed by BJL.
-
[ 1227428 ] failure of getMediaBox Committed by BJL.
-
[ 1227426 ] null pointer in PDFToImage(ColorModel is null) Committed by BJL.
-
[ 1207113 ] Enhancement: runtime accessible version Committed by BJL.
-
[ 1213320 ] setFfFlag() of PDField not working correctly Committed by BJL.
-
[ 1215945 ] Error in COSString.writePDF() - fixed escape sequences Committed by BJL.
-
[ 1198912 ] COSName with escaped characters not parsed correctly Committed by BJL.
-
Fixed issue where resources were not being cleared in PDFStreamEngine Committed by BJL.
-
[ 1165686 ] Expected int type parse error Committed by BJL.
-
[ 1182825 ] Wrong handling of signed/unsigned byte/int in TTF parsing Committed by BJL.
-
[ 1182892 ] PDFHighlight.setHighlightColor was removed because it is not implemented by adobe Committed by BJL.
Version 0.7.1 (04/10/2005)
Changes to the Code Base
-
[ 1170068 ] text field is not found Committed by BJL.
-
fixed NPE issue where an image did not have any applied filters Committed by BJL.
-
Fixed issue where extra spaces were being added during text extraction for type3 fonts Committed by BJL.
-
[ 1119420 ] Extract and Update the Meta-Information as XML Committed by BJL.
-
[ 1119410 ] Extract text in/between bookmarks Committed by BJL.
-
[ 1164476 ] XFDFImport should fail with non XFDF document Committed by BJL.
-
[ 1119408 ] Support named target for Bookmark extraction. Committed by BJL.
-
Created Resources/PDFBox_External_Fonts.properties to create a mapping for non-embedded fonts Committed by BJL.
-
**API Change** Renamed PDField.getName() to PDField.getPartialName(), added method getFullyQualifiedName() Committed by BJL.
-
**API Change** Renamed PDWidget to PDAnnotationWidget for naming consistency Committed by BJL.
-
Text is now extracted from embedded form xobjects. Committed by BJL.
-
Deployed site to new hosting vendor. Committed by BJL.
-
committed code for PDFHighlighter to highlight words in a PDF document. Committed by BJL.
-
Added command line application org.pdfbox.PDFToImage Committed by BJL.
-
Implemented runlength decoding Committed by BJL.
-
Added patch from Jorge Hernández Sellés to append content streams to existing page. Committed by BJL.
-
**API Change**renamed package from pdmodel.graphics.image to pdmodel.graphics.xobject Committed by BJL.
-
**API Change**Removed PDRadioButton, should use PDCheckbox instead Committed by BJL.
-
**API Change**COSStream now extends COSDictionary instead of containing a dictionary Committed by BJL.
-
[ 1021241 ] Text extraction should follow PDF article divisions Committed by BJL.
-
Added implementation for PDF page articles Committed by BJL.
-
Created TextToPDF command line application Committed by BJL.
-
Created ImageToPDF example Committed by BJL.
-
fixed parsing of header where a trailing % exists Committed by BJL.
-
[ 1110029 ] Character ">" not quoted in COSName::writePDF Committed by BJL.
Version 0.7.0 (1/22/2005)
Changes to the Code Base
-
committed [ 1097913 ] Enhance LucenePDFDocument streams(thanks to Olivier Parent) Committed by BJL.
-
Added implementation for PDF Bookmarks Committed by BJL.
-
Added implementation for PDF Destinations Committed by BJL.
-
Updated website for better format for documentation Committed by BJL.
-
Now ExportFDF and ExportXFDF will default output files to pdfname.fdf and pdfname.xfdf Committed by BJL.
-
[ 1046278 ] ClassCastException when doing FDF/XFDF Committed by BJL.
-
ExtractText now allows you to extract text if you decrypt with the owner password Committed by BJL.
-
Added PDF 1.5 Object Stream support Committed by BJL.
-
Added pdmodel.common.PDStream to represent COSStream Committed by BJL.
-
changed PDPage.getContents to use PDStream instead of COSStream Committed by BJL.
-
Updated LucenePDFDocument Javadoc to tell which Lucene fields it populates Committed by BJL.
-
moved HelloWorld example from persistence to pdmodel and updated to use new PD Model features Committed by BJL.
-
Refactored PDFStreamEngine based on contributions from Christophe Huault Committed by BJL.
-
This class no longer uses a gigantic if/else statement for all of the operators they are defined as properties when instantiating the class Committed by BJL.
-
Updated AFM resources to be ones released on Adobe's site, include AFM license as well Committed by BJL.
-
Added ability to embed TTF fonts, only WinAnsiEncoding is supported at this time Committed by BJL.
-
Added ability to extract images, thanks to contributions by Brigitte Mathiak Committed by BJL.
-
COSWriter now generates the document id if it does not already exist Committed by BJL.
-
improved performance for text extraction Committed by BJL.
-
[ 1058693 ] TextPosition does not take account of tz operator Committed by BJL.
-
upgraded to log4j-1.2.9 Committed by BJL.
-
include package-list for javadocs Committed by BJL.
-
[ 1037145 ] Infinite loop in PDFParser.parseObject Committed by BJL.
-
fixed error where spaces before integers was causing parse errors Committed by BJL.
Version 0.6.7 (10/09/2004)
Changes to the Code Base
-
Revamped the way character spacing and font information is obtained Committed by BJL.
-
Improved location information about a character drawn on the screen. Committed by BJL.
-
Changed the PDFStreamEngine.showString to showCharacter to support the newly improved location information. This will now only show one character at a time. Committed by BJL.
-
Fixed bug in PDDocument.isOwnerPassword and isUserPassword that was using the wrong length for the encryption key Committed by BJL.
-
Upgraded to ant 1.6.2 Committed by BJL.
-
Upgraded to checkstyle-3.4 Committed by BJL.
-
Upgraded to JUnit-3.8.1 Committed by BJL.
-
Upgraded to lucene-1.4.2 Committed by BJL.
-
Integrated patch(1016603) for issue 943319 to fix parsing of open office documents Committed by BJL.
-
Patch:985347 No longer throw exception for "No 'ToUnicode' and no 'Encoding' for Font" Committed by BJL.
-
Patch:996191 Fixed case statement with missing break Committed by BJL.
-
Patch:996781 Fixed null pointer exception in acroform fields Committed by BJL.
-
Renamed DecryptDocument to DocumentEncryption to support encryption and decryption Committed by BJL.
-
Added load/save/encrypt/decrypt convenience methods on the PDDocument class Committed by BJL.
-
COSWriter now attempts to keep object numbers from parsed documents and writes 'free' entries in the xref if necessary Committed by BJL.
-
Added the ability to set the word separator on the PDFTextStripper Committed by BJL.
-
Fixed issue where PDFBox would throw an IOException if a PDF was incorrectly missing an endobj tag Committed by BJL.
-
Fixed 918220 where PDFBox would freeze when parsing certain cmap files Committed by BJL.
-
Added initial colorspace support Committed by BJL.
-
Fixed issue where AppendDoc was throwing ClassCastException Committed by BJL.
-
Fixed 1013163 Can't parse filters that use filter abbreviation Committed by BJL.
-
Fixed 1011244 Where encrypting then decrypting was causing a problem Committed by BJL.
-
renamed TextPosition.getWidth to TextPosition.getCombinedHorizontalDisplacement to better reflect its actual value Committed by BJL.
-
Fixed 919215 PDFBox now support stream replacement Committed by BJL.
-
Fixed 955043 Added support for 'ETenms-B5-H' encoding Committed by BJL.
-
Fixed 996050 Class Cast exception when importing Committed by BJL.
-
Added support for Font descriptors Committed by BJL.
-
Fixed spacing issues when doing textfield FDF import Committed by BJL.
-
Fixed 1017175 Large number converted when re-written Committed by BJL.
-
Fixed 1029873 PDFBox now allows for multiple xref sections Committed by BJL.
-
Added support for document Viewer Preferences Committed by BJL.
-
Made currentDocument and pdfDocument protected in util.Splitter to allow easier subclassing Committed by BJL.
-
Fixed 1034427 After Splitting page orientation is lost Committed by BJL.
-
Added the following command line applications Committed by BJL.
Version 0.6.6 (07/20/2004)
Changes to the Code Base
-
Improved support for setting of checkbox fields(FDF import) Committed by BJL.
-
Added the org.pdfbox.PDFSplit utility to split a single document into many documents Committed by BJL.
-
PDFBox now ignore the Length field that is associated with a stream, it has been found to be wrong in some documents Committed by BJL.
-
Fixed bug when writing out PDF documents and the document contained an non alphabetic character such as ( or ) Committed by BJL.
-
Fixed bug in PDFont where dictionary encodings where not being processed correctly Committed by BJL.
-
Fixed bug in COSDocument.isEncrypted which was comparing COSNull to the wrong object Committed by BJL.
-
Integrated patch for supporting multiple lines in the appearance stream Committed by BJL.
-
Upgraded to lucene-1.4-final Committed by BJL.
-
org.pdfbox.ExtractText now uses the system encoding as the default encoding instead of ISO-8859-1 Committed by BJL.
Version 0.6.5 (03/08/2004)
Changes to the Code Base
-
Fixed bug in revision 3 encryption algorithm Committed by BJL.
-
added support for CIDFontType0 glyph widths, which fixed issue with spaces being during text extraction Committed by BJL.
-
Fixed infinite loop when parsing a corrupt content stream Committed by BJL.
-
Add characterspacing + wordspacing when determining the width of a space character Committed by BJL.
-
Added support for more font types Committed by BJL.
-
refactored the pdmodel.interactive package, form fields use object delegation instead of inheritance for the widget, see PDField.getWidget and PDField.getKids Committed by BJL.
-
Fixed bug where an inheritable cropbox would cause stackoverflow exception Committed by BJL.
-
Changed usage of PDField/PDWidget to look like object delegation instead of inheritance by adding a PDField.getWidget instead of extending PDWidget Committed by BJL.
-
refactored interactive package, this will break any existing code that uses the PDField/PDAnnotation classes. You will need to adjust your package names!! Committed by BJL.
-
Now uses StandardEncoding as the default encoding Committed by BJL.
-
Bug in AppendDoc example that did not take into account groups of pages Committed by BJL.
-
PDFont now also tries the bootstrap classloader when loading AFM resources Committed by BJL.
-
added -startPage and -endPage command line options to org.pdfbox.ExtractText Committed by BJL.
-
Added support for corrupt PDFs with garbage before the header Committed by BJL.
-
Fixed bug where there was whitespace instead of garbage characters in front of the first object Committed by BJL.
-
performance improvements for the Matrix implementation Committed by BJL.
-
upgraded to lucene 1.3 Committed by BJL.
-
fixed bug in cmap parser for cmap files that all ended in 'def' Committed by BJL.
-
Removed createObject method from COSDocument, COSWriter will handle all object references for you Committed by BJL.
-
Updated AppendDoc to use PDDocument instead of COSDocument and a couple bug fixes Committed by BJL.
-
PDFParser now closes the document if there were parse errors Committed by BJL.
-
TextPosition now has the PDFont that is associated with the piece of text Committed by BJL.
-
Added initial version of org.pdfbox.PDFViewer, a GUI application to view the internal structure of a PDF document. This can be used for debugging purposes at this time but may end up being a Adobe Reader like application if there is enough interest Committed by BJL.
-
Changed COSNumber/COSInteger/COSFloat interface to have both intValue and longValue Committed by BJL.
-
Added methods isUserPassword & isOwnerPassword to PDDocument Committed by BJL.
-
Added cmap files for CJK languages, please give me some feedback Committed by BJL.
Version 0.6.4 (11/02/2003)
Changes to the Code Base
-
Fixed bug which caused infinite loop Committed by BJL.
-
Fixed bug in encoding where DictionaryEncoding kept a reference instead of making a copy leading to encoding problems Committed by BJL.
-
Added PDFTextStripper.(get|set)PageSeparator, which will allow the user to output a string after every page Committed by BJL.
-
refactored text stripping code to separate the logic processing of PDF operators and the logic of extracting text Committed by BJL.
-
ran findbugs on source code and fixed a couple minor issues Committed by BJL.
-
Refactored font functionality to PDFont, some API methods are no longer available in COSObject Committed by BJL.
-
changed name of org.pdfbox.Main to org.pdfbox.ExtractText Committed by BJL.
-
added contribution of org.pdfbox.Overlay from Mario Ivankovits Committed by BJL.
-
added log.isDebugEnabled checks to log4j calls Committed by BJL.
-
added better escaping when writing COSNames Committed by BJL.
-
fixed bug where encryption dictionary is sometimes set to COSNull instead of not being present Committed by BJL.
Version 0.6.3 (09/13/2003)
Changes to the Code Base
-
Now contains the ability to import/set FDF data thanks to a contribution from Stefan Uldum Grinsted Committed by BJL.
-
No longer throw an error when stream is not followed by 0A or 0D0A to allow more PDFs to be parsed Committed by BJL.
-
Added -encoding argument to org.pdfbox.Main to control the encoding of the output Committed by BJL.
-
Remove Prev entry from trailer if it exists because PDFBox automatically clears all old entries, only an issue when modifying/saving an existing PDF document Committed by BJL.
-
Fixed bug in master password encryption algorithm for Revision 3 encrypted documents Committed by BJL.
-
COSString no longer uses UTF-8 when encoding the byte array Committed by BJL.
-
Added PDDocument.getPageCount() Committed by BJL.
-
Fixed bug in PDFEncryption where Committed by BJL.
-
Now enforces text extraction permissions Committed by BJL.
Version 0.6.2 (4/18/2003)
Changes to the Code Base
-
Modified build so that build.properties settings are no longer required Committed by BJL.
-
Added required libraries to CVS Committed by BJL.
-
Added log4j logging Committed by BJL.
-
Significant text extraction work Committed by BJL.
-
Added automatic handling of files encrypted with the empty password Committed by BJL.
-
Added automated tests and test data for text extraction Committed by BJL.
-
Removed unimplemented decoders from filters test Committed by BJL.
-
Fixed several LZW decode bugs introduced after 0.5.6 Committed by BJL.
-
Fixed bugs relating to processing out of spec PDF's with bad # escaping in the name ("java.io.IOException: Error: expected hex number" bug) Committed by BJL.
-
Fixed Lucene UID generation bug Committed by BJL.
-
Fixed GetFontWidths null pointer exception bug Committed by BJL.
Version 0.6.1 (3/9/2003)
Changes to the Code Base
-
Fixed bug in parsing stream objects which led to "Unexpected end of ZLIB input stream" Committed by BJL.
-
Changed license from LGPL to BSD to allow pdfbox to be used easily in Apache projects Committed by BJL.
Version 0.6.0 (3/5/2003)
Changes to the Code Base
-
Massive improvements to memory footprint Committed by BJL.
-
Must call close() on the COSDocument(LucenePDFDocument does this for you) Committed by BJL.
-
Really fixed the bug where small documents were not being indexed Committed by BJL.
-
Fixed bug where no whitespace existed between obj and start of object. Exception in thread "main" java.io.IOException: expected='obj' actual='obj<</Pro Committed by BJL.
-
Fixed issue with spacing where textLineMatrix was not being copied properly Committed by BJL.
-
Fixed 'bug' where parsing would fail with some pdfs with double endobj definitions Committed by BJL.
-
Added PDF document summary fields to the lucene document Committed by BJL.
Version 0.5.6 (11/28/2002)
Changes to the Code Base
-
Fixed bug in LucenePDFDocument where stream was not being closed and small documents were not being indexed Committed by BJL.
-
Fixed a spacing issue for some PDF documents Committed by BJL.
-
Fixed error while parsing the version number Committed by BJL.
-
Fixed NullPointer in persistence example Committed by BJL.
-
Create example lucene IndexFiles class which models the demo from lucene Committed by BJL.
-
Fixed bug where garbage at the end of file caused an infinite loop Committed by BJL.
-
Fixed bug in parsing boolean values with stuff at the end like "true>>" Committed by BJL.
Version 0.5.5 (10/03/2002)
Changes to the Code Base
-
Added example of printing document signature Committed by BJL.
-
Added example to print out form fields values Committed by BJL.
-
Fixed bug when appending documents Committed by BJL.
-
Various other bug fixes Committed by BJL.
Version 0.5.4 (09/17/2002)
Changes to the Code Base
-
Fixed bug in text output where '?' instead of the proper character Committed by BJL.
-
Fixed bug where sections of text were not being output at all Committed by BJL.
Version 0.5.3 (09/13/2002)
Version 0.5.2 (09/06/2002)
Changes to the Code Base
-
Fixed bug where FDF documents could not be appended to PDF Documents Committed by BJL.
-
Catch all NumberFormatExceptions and wrap them with IOExceptions Committed by BJL.
Version 0.5.1 (09/04/2002)
Changes to the Code Base
-
Now supports unicode for the document summary Committed by BJL.
-
Better support for Type0 fonts Committed by BJL.
-
Fixed bug with an empty LZW stream Committed by BJL.
-
Fixed parsing error for ID operator Committed by BJL.
Version 0.5.0 (08/31/2002)
Changes to the Code Base
-
Now supports unicode for the document summary Committed by BJL.
-
Better support for Type0 fonts Committed by BJL.
-
Fixed bug with an empty LZW stream Committed by BJL.
-
Fixed parsing error for ID operator Committed by BJL.
Version 0.4.1 (07/25/2002)
Changes to the Code Base
-
Fixed bug where .notdef was being output as document text Committed by BJL.
Version 0.4.0 (07/23/2002)
Changes to the Code Base
-
Added extract text ant task Committed by BJL.
-
Implemented AFM(Adobe Font Metrics) resource loading Committed by BJL.
-
Fixed numerous bugs submitted by users Committed by BJL.
-
Changed project from pdfparser to pdfbox to better reflect future needs Committed by BJL.
Version 0.3.0 (07/09/2002)
Changes to the Code Base
-
Added indexer for the lucene project Committed by BJL.
-
Initial implementation of PDF encryption(not working yet) Committed by BJL.
Version 0.2.0 (06/03/2002)
Changes to the Code Base
-
Added support for the various encodings Committed by BJL.
-
Improved the accuracy of the text output Committed by BJL.