pdftohtml 0.10
==============
First public release
pdftohtml 0.20
==============
-added complex document extraction
-added image extraction
-added pdftohtml script
pdftohtml 0.21
==============
-added support for colored fonts
-bugfixes
pdftohtml 0.22
==============
-fixed Digital C++ compilation bug
-fixed -e bug
-other bugfixes
pdftohtml 0.30 (test version)
==============
-added extraction of PDF vector drawings
-HTML output compatible with MI 4.0
(using the patches by Claus Hulstrom and Zoltan Hawryluk)
-output for XML post-processing (patch by Anjo Anjewierden)
-zoom option
-other bugfixes
pdftohtml 0.31 (test version)
==============
-bugfixes
pdftohtml 0.32 (*test* version)
==============
-updated to use xpdf 1.0
pdftohtml 0.32b
===============
-experimental ability to specify output encoding (UTF-8 might work)
-ability to specify user/master password
-fixed core dump on documents with type 3 fonts only
-fixed bug with inline images not being handled properly
pdftohtml 0.33
==============
-Ghostscript is executed from the program itself, no need for external script
-ps output (for complex mode) is produced by pdftohtml, got rid of pdftops
-several XML-related fixes
-several memory leaks fixed (should not crash on large files now)
pdftohtml 0.33a
===============
-bug fixes
-join multiple <b></b>'s and <i></i>'s together
pdftohtml 0.34
==============
-command line option to extract hidden text
-parts of hrefs (links) are joined together (just like <b> and <i>)
-paragraphs joined together
-fixed a crash
-updated Xpdf to 1.01
-can produce noframes (single html file) output for complex mode
pdftohtml 0.35
===============
-somewhat better HTML 4.01 compliance
-added meta author, keywords and date
-added mechanism for correct charsets
-fixed bug with links pointing nowhere for '-c -noframes'
-fixed bug with page numbering
-fixed bug with collapsing text for '-c -noframes -i'
-updated Xpdf to 2.01 (security fix among others)
pdftohtml 0.36
=====================
-added support for different Ghostscript output devices (-dev)
-updated Xpdf to 2.02 (security fix)
-a couple of fixes and tweaks for better output
-ported duplicate lines elimination from pdftotext (does not affect complex mode)
-fixed bug which caused bold to spread from one sentence to the entire document
-support for document outlines (patch by Nicolas Pitre)