Source: ocrmypdf
Maintainer: Debian Python Team <team+python@tracker.debian.org>
Uploaders: Anton Gladky <gladk@debian.org>
Section: graphics
Priority: optional
Build-Depends:
 debhelper-compat (= 13),
 dh-exec,
 dh-python,
 python3-all,
 python3-hatchling,
 python3-hatch-vcs,
 python3-img2pdf (>= 0.3.0),
 python3-pikepdf (>= 8.14.0),
 python3-pil,
 python3-pycparser,
 python3-pytest-runner,
 python3-reportlab,
 python3-setuptools,
 python3-setuptools-scm,
 python3-sphinx (>= 1.4.8),
 python3-sphinx-issues,
 python3-sphinx-rtd-theme,
 python3-pdfminer (>= 20181108+dfsg-3),
 pybuild-plugin-pyproject
Standards-Version: 4.1.2
Homepage: https://github.com/jbarlow83/OCRmyPDF
Vcs-Browser: https://salsa.debian.org/debian/ocrmypdf
Vcs-Git: https://salsa.debian.org/debian/ocrmypdf.git

Package: ocrmypdf
Architecture: all
Depends:
 ghostscript (>= 9.18~dfsg~),
 icc-profiles-free,
 python3-pdfminer (>= 20181108+dfsg-3),
# ^ an unversioned pdfminer dep is included in ${python3:Depends} but ocrmypdf needs the new fork
 python3-pil,
 python3-reportlab,
 python3-pikepdf (>= 8.14.0),
 python3-pluggy,
 python3-coloredlogs,
 python3-hypothesis,
 tesseract-ocr (>= 4.0.0),
 zlib1g,
 ${misc:Depends},
 ${python3:Depends},
Recommends:
 unpaper,
 pngquant,
 jbig2
Suggests:
 ocrmypdf-doc,
 python3-watchdog,
 img2pdf,
Description: add an OCR text layer to PDF files
 OCRmyPDF adds an OCR text layer to a regular PDF containing only
 images and produces a searchable PDF/A file.
 .
 It uses the Tesseract OCR engine and so supports all the languages
 that Tesseract does.
 .
 Some other main features:
 .
  * Places OCR text accurately below the image to ease copy / paste
  * Keeps the exact resolution of the original embedded images
  * When possible, inserts OCR information as a lossless operation
    without rendering vector information
  * Keeps file size about the same
  * If requested, deskews and/or cleans the image before performing OCR
  * Validates input and output files
  * Provides debug mode to enable easy verification of the OCR results
  * Processes pages in parallel when more than one CPU core is
    available
  * Battle-tested on thousands of PDFs, with a test suite and
    continuous integration

Package: ocrmypdf-doc
Section: doc
Architecture: all
Multi-Arch: foreign
Depends:
 ${misc:Depends},
 ${sphinxdoc:Depends},
Built-Using: ${sphinxdoc:Built-Using}
Description: add an OCR text layer to PDF files - documentation
 OCRmyPDF adds an OCR text layer to a regular PDF containing only
 images and produces a searchable PDF/A file.
 .
 It uses the Tesseract OCR engine and so supports all the languages
 that Tesseract does.
 .
 This package includes OCRmyPDF's HTML documentation.