## Overview
*pdfgrep* is a tool to search text in PDF files. It works similarly to *grep*.
## Features
- Grep compatible: pdfgrep tries to be compatible with GNU grep,
where it makes sense. Many of your favorite grep options are
supported (such as `-r`, `-i`, `-n` or `-c`).
- Search many PDFs at once, even recursively in directories
- Regular expressions: POSIX or PCRE
- Colored output
- Support for password protected PDFs
For complete documentation, please consult the [manpage].
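
The grep-style options listed above can be combined as in the following sketch. The wrapper names `search_pdfs` and `count_matches` are hypothetical and exist only for illustration; the `pdfgrep` flags themselves are the ones named in the feature list.

```shell
# Hypothetical helper: case-insensitive, recursive search with page numbers.
search_pdfs() {
    # -r: recurse into directories
    # -i: ignore case
    # -n: prefix each match with its page number
    pdfgrep -r -i -n "$1" "$2"
}

# Hypothetical helper: print only the number of matches in a file.
count_matches() {
    # -c: print a count instead of the matching lines
    pdfgrep -c "$1" "$2"
}
```

For example, `search_pdfs pattern ./papers` would search every PDF below a (hypothetical) `./papers` directory, and `count_matches pattern report.pdf` would report how often `pattern` occurs in `report.pdf`.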
## Example
    $ pdfgrep --max-count 1 --context 1 --with-filename --page-number pattern rabin-karp.pdf
    rabin-karp.pdf-1-randomized
    rabin-karp.pdf:1:pattern-matching
    rabin-karp.pdf-1-algorithms
## Dependencies
- poppler-cpp (poppler >= 0.14) (http://poppler.freedesktop.org/)
- libgcrypt (https://www.gnu.org/software/libgcrypt/)
- optionally libpcre2 (http://www.pcre.org/)
## Building
To build `pdfgrep`, use the standard procedure:

    ./configure
    make
    sudo make install

If you're building from the git repository, you also have to run
`./autogen.sh` first.
Depending on your OS, you need to have `-dev` or `-devel` packages
installed for all dependencies.
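
Put together, a build from a git checkout might look like the sketch below. The function name `build_from_git` is hypothetical; it assumes the current directory is the top of the source tree and that the development packages for all dependencies are already installed.

```shell
# Hypothetical helper: build pdfgrep from a git checkout.
# Any arguments are passed through to ./configure unchanged.
build_from_git() {
    # autogen.sh generates ./configure; release tarballs do not need this step
    ./autogen.sh &&
    ./configure "$@" &&
    make
}
```

After a successful build, `sudo make install` installs the result as described above.
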
The `./configure` script accepts many options to customize the
build process, the most important of which are:
- `--with-unac`: Build with experimental libunac support and add
the `--unac` flag to pdfgrep that strips all accents from
characters, making it possible to find the character 'ä' by
searching 'a'.
- `--with-{zsh,bash}-completion`: Configure installation directory
for shell completion files.
- `--without-libpcre`: Disable support for Perl-compatible regular
expressions.
- `--disable-doc`: Disable manpage generation.
To uninstall, run `sudo make uninstall`.
See `./configure --help` for more info or read the (very extensive)
`INSTALL` file in the source.
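
As an illustration of how the options above combine, the following sketch configures a reduced build without PCRE support and without the manpage. The function name `configure_minimal` is hypothetical; the two flags are the ones documented in the list above.

```shell
# Hypothetical helper: configure a reduced build of pdfgrep.
# Extra arguments are appended to the ./configure call.
configure_minimal() {
    # --without-libpcre: no Perl-compatible regular expressions
    # --disable-doc: skip manpage generation
    ./configure --without-libpcre --disable-doc "$@"
}
```

Run it from the top of the source tree, then continue with `make` as usual.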
## Download
Tarballs for releases are available at https://pdfgrep.org/download.html
The development version is available as a git repository at
https://gitlab.com/pdfgrep/pdfgrep
## Contact
General questions, suggestions, bug reports, patches or anything else
can be sent to the [mailing list](mailto:pdfgrep-users@pdfgrep.org).
You can also use the [issue tracker] for bug reports or create a
[merge request] on GitLab, if you prefer that over mailing lists.
[manpage]: https://pdfgrep.org/doc.html
[issue tracker]: https://gitlab.com/pdfgrep/pdfgrep/issues
[merge request]: https://gitlab.com/pdfgrep/pdfgrep/merge_requests