File: catdoc.1

package info (click to toggle)
catdoc 0.33-3
  • links: PTS
  • area: main
  • in suites: hamm
  • size: 52 kB
  • ctags: 31
  • sloc: ansic: 261; tcl: 196; makefile: 34
file content (63 lines) | stat: -rw-r--r-- 1,816 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
.TH catdoc 1  "Version 0.32" "MS-Word reader"
.SH NAME
catdoc \- reads MS-Word file and puts its content as plain text on standard output
.SH SYNOPSIS

.BR catdoc " [-aswth] files..."

.SH DESCRIPTION

.B catdoc
behaves much like
.BR cat (1)
but reads MS-Word file and produces human-readable text on standard output.
Optionally it can use 
.BR latex (1)
escape sequences for characters which have special meaning for LaTeX.
It also makes some effort to recognize MS-Word tables, although it never
tries to write correct headers for LaTeX tabular environment.
.PP
.B catdoc
can be invoked as filter, if you supply "-" instead of filename, but it is
probably useless. It could be removed in future versions, because true
parsing of Word files (fast saves, footnotes) requires seekable output.

.SH OPTIONS
.TP 8
.B -a
- converts non-standard printable char into readable form (default).
Separates table columns with TAB
.TP 8
.B -t
- converts all printable chars, which have special meaning for 
.BR LaTeX (1)
into appropriate control sequences. Separates table columns by 
.BR &.
.TP 8
.B -w
disables word wrapping. By default 
.B catdoc
output is splitted into lines not longer than 72 characters and paragraphs
are separated by a blank line. With this option each paragraph is one
long line. 
.TP 8
.B -s 
exits with non-zero exit code, if MS-Word signature is not found
before first printable paragraph, producing no output.
.TP 8
.B -h
- displays brief usage message and exits
.PP
All options affect only files, specified 
.I after
them in command line.
 
.SH BUGS

Can produce garbage, if file contains embedded illustrations. Doesn't handle
fast-saves properly. Prints footnotes as separate paragraphs at the end of
file, instead of producing correct LaTeX commands. 

.SH AUTHOR

V.B.Wagner <vitus@fe.msk.su>