File: attachments.rst

package info (click to toggle)
pikepdf 6.0.0%2Bdfsg-1
  • links: PTS, VCS
  • area: main
  • in suites: bookworm
  • size: 6,600 kB
  • sloc: python: 8,440; cpp: 4,645; makefile: 309; sh: 47
file content (107 lines) | stat: -rw-r--r-- 3,429 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
.. _attachments:

Attaching files to a PDF
************************

.. versionadded:: 3.0

You can attach (or if you prefer, embed) any file to a PDF, including
other PDFs. As a quick example, let's attach pikepdf's README.md file
to one of its test files.

.. ipython::

    In [1]: from pikepdf import Pdf, AttachedFileSpec, Name, Dictionary, Array

    In [1]: from pathlib import Path

    In [1]: pdf = Pdf.open('../tests/resources/fourpages.pdf')

    In [1]: filespec = AttachedFileSpec.from_filepath(pdf, Path('../README.md'))

    In [1]: pdf.attachments['README.md'] = filespec

    In [1]: pdf.attachments

This creates an attached file named ``README.md``, which holds the data in ``filespec``.
Now we can retrieve the data.

.. ipython::

    In [1]: pdf.attachments['README.md']

    In [1]: file = pdf.attachments['README.md'].get_file()

    In [1]: file.read_bytes()[:50]

If the data used to create an attachment is in memory:

.. ipython::

    In [1]: memfilespec = AttachedFileSpec(pdf, b'Some text', mime_type='text/plain')

    In [1]: pdf.attachments['plain.txt'] = memfilespec


General notes on attached files
-------------------------------

* If the main PDF is encrypted, any embedded files will be encrypted with the same
  encryption settings.

* PDF viewers tend to display attachment filenames in alphabetical order. Use prefixes
  if you want to control the display order.

* The ``AttachedFileSpec`` will capture all of the data when created, so the file object
  used to create the data can be closed.

* Each attachment is a :class:`pikepdf.AttachedFileSpec`. An attachment usually contains only
  one :class:`pikepdf.AttachedFile`, but might contain multiple objects of this
  type. Usually, multiple versions are used to provide different versions of the
  same file for alternate platforms, such as Windows and macOS versions of a file.
  Newer PDFs rarely provide multiple versions.

How to find attachments in a PDF viewer
---------------------------------------

Your PDF viewer should have an attachments panel that shows available attachments.

.. figure:: /images/acrobat-attachments.png
  :alt: Screenshot of attachments panel in Acrobat DC on Windows

  Attachments in Adobe Acrobat DC.

Attachments added to ``Pdf.attachments`` will be shown here.

You may find it useful to set ``pdf.root.PageMode = Name.UseAttachments``. This
tells the PDF viewer to open a pane that lists all attachments in the PDF. Note
that it is up to the PDF viewer to implement and honor this request.

Creating attachment annotations
-------------------------------

You can also create PDF Annotations and Actions that contain attached files.

Here is an example of an annotation that displays an icon. Clicking the icon
prompt the user to view the attached document.

.. ipython::

  In [1]: pdf = Pdf.open('../tests/resources/fourpages.pdf')

  In [1]: filespec = AttachedFileSpec.from_filepath(pdf, Path('../README.md'))

  In [1]: pushpin = Dictionary(
     ...:     Type=Name.Annot,
     ...:     Subtype=Name.FileAttachment,
     ...:     Name=Name.GraphPushPin,
     ...:     FS=filespec.obj,
     ...:     Rect=[2*72, 9*72, 3*72, 10*72],
     ...: )

  In [1]: pdf.pages[0].Annots = pdf.make_indirect(Array([
     ...:     pushpin
     ...: ]))

Files that are referenced as Annotations and Actions do not need to be added
to ``Pdf.attachments``. If they are, the file will be attached twice.