File: quickstart.cli.rst

package info (click to toggle)
pdf2docx 0.5.8-1
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 4,632 kB
  • sloc: python: 6,692; makefile: 68
file content (81 lines) | stat: -rw-r--r-- 1,717 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
Command Line Interface
===========================

::

  $ pdf2docx --help

  NAME
      pdf2docx - Command line interface for pdf2docx.

  SYNOPSIS
      pdf2docx COMMAND | -

  DESCRIPTION
      Command line interface for pdf2docx.

  COMMANDS
      COMMAND is one of the following:

      convert
        Convert pdf file to docx file.

      debug
        Convert one PDF page and plot layout information for debugging.

      table
        Extract table content from pdf pages.


By range of pages
-----------------------

Specify pages range by ``--start`` (from the first page if omitted) and 
``--end`` (to the last page if omitted). 

.. note:: 
  The page index is zero-based by default, but can turn it off by 
  ``--zero_based_index=False``, i.e. the first page index starts from 1.

Convert all pages::

  $ pdf2docx convert test.pdf test.docx

Convert pages from the second to the end::

  $ pdf2docx convert test.pdf test.docx --start=1


Convert pages from the first to the third (index=2)::

  $ pdf2docx convert test.pdf test.docx --end=3


Convert second and third pages::

  $ pdf2docx convert test.pdf test.docx --start=1 --end=3

Convert the first and second pages with zero-based index turn off::

  $ pdf2docx convert test.pdf test.docx --start=1 --end=3 --zero_based_index=False



By page numbers
-----------------------

Convert the first, third and 5th pages::

  $ pdf2docx convert test.pdf test.docx --pages=0,2,4


Multi-Processing
--------------------------

Turn on multi-processing with default count of CPU::

  $ pdf2docx convert test.pdf test.docx --multi_processing=True

Specify the count of CPUs::

  $ pdf2docx convert test.pdf test.docx --multi_processing=True --cpu_count=4