File: encoding.texi

package info (click to toggle)
a2ps 1%3A4.15.7-5
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 18,716 kB
  • sloc: ansic: 44,830; sh: 11,625; lex: 1,851; perl: 708; yacc: 698; makefile: 494; lisp: 396; ada: 263; objc: 189; f90: 109; ml: 85; sql: 74; pascal: 57; modula3: 33; haskell: 32; sed: 30; java: 29; python: 24
file content (135 lines) | stat: -rw-r--r-- 4,774 bytes parent folder | download | duplicates (4)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
The known encodings are:
@deftp {Encoding} {ASCII} (@file{ascii.edf})
US-ASCII.
@end deftp

@deftp {Encoding} {EUC-JP} (@file{euc-jp.edf})
The EUC-JP encoding is a 8-bit character set widely used in Japan.
@end deftp

@deftp {Encoding} {HPRoman} (@file{hp.edf})
The 8 bits Roman encoding for HP.
@end deftp

@deftp {Encoding} {IBM-CP437} (@file{ibm-cp437.edf})
This encoding is meant to be used for PC files with drawing lines.
@end deftp

@deftp {Encoding} {IBM-CP850} (@file{ibm-cp850.edf})
Several characters may be missing, especially Greek letters and some
mathematical symbols.
@end deftp

@deftp {Encoding} {ISO-8859-1} (@file{iso1.edf})
The ISO-8859-1 character set, often simply referred to as Latin 1,
covers most West European languages, such as French, Spanish, Catalan,
Basque, Portuguese, Italian, Albanian, Rhaeto-Romanic, Dutch, German,
Danish, Swedish, Norwegian, Finnish, Faroese, Icelandic, Irish,
Scottish, and English, incidentally also Afrikaans and Swahili, thus
in effect also the entire American continent, Australia and the
southern two-thirds of Africa. The lack of the ligatures Dutch IJ,
French OE and ,,German`` quotation marks is considered tolerable.

The lack of the new C=-resembling Euro currency symbol U+20AC has
opened the discussion of a new Latin0.
@end deftp

@deftp {Encoding} {ISO-8859-2} (@file{iso2.edf})
The Latin 2 character set supports the Slavic languages of Central
Europe which use the Latin alphabet. The ISO-8859-2 set is used for
the following languages: Czech, Croat, German, Hungarian, Polish,
Romanian, Slovak and Slovenian.

Support is provided thanks to Ogonkify.
@end deftp

@deftp {Encoding} {ISO-8859-3} (@file{iso3.edf})
This character set is used for Esperanto, Galician, Maltese and Turkish. 

Support is provided thanks to Ogonkify.
@end deftp

@deftp {Encoding} {ISO-8859-4} (@file{iso4.edf})
Some letters were added to the ISO-8859-4 to support languages such as
Estonian, Latvian and Lithuanian. It is an incomplete precursor of the
Latin 6 set.

Support is provided thanks to Ogonkify.
@end deftp

@deftp {Encoding} {ISO-8859-5} (@file{iso5.edf})
The ISO-8859-5 set is used for various forms of the Cyrillic
alphabet. It supports Bulgarian, Byelorussian, Macedonian, Serbian and
Ukrainian.

The Cyrillic alphabet was created by St. Cyril in the 9th century from
the upper case letters of the Greek alphabet. The more ancient
Glagolithic (from the ancient Slav glagol, which means "word"), was
created for certain dialects from the lower case Greek letters. These
characters are still used by Dalmatian Catholics in their liturgical
books. The kings of France were sworn in at Reims using a Gospel in
Glagolithic characters attributed to St. Jerome.

Note that Russians seem to prefer the KOI8-R character set to the ISO
set for computer purposes. KOI8-R is composed using the lower half
(the first 128 characters) of the corresponding American ASCII
character set.
@end deftp

@deftp {Encoding} {ISO-8859-7} (@file{iso7.edf})
ISO-8859-7 was formerly known as ELOT-928 or ECMA-118:1986.  It is
meant for modern Greek.
@end deftp

@deftp {Encoding} {ISO-8859-9} (@file{iso9.edf})
The ISO 8859-9 set, or Latin 5, replaces the rarely used Icelandic
letters from Latin 1 with Turkish letters.

Support is provided thanks to Ogonkify.
@end deftp

@deftp {Encoding} {ISO-8859-10} (@file{iso10.edf})
Latin 6 (or ISO-8859-10) adds the last letters from Greenlandic and
Lapp which were missing in Latin 4, and thereby covers all
Scandinavia.

Support is provided thanks to Ogonkify.
@end deftp

@deftp {Encoding} {ISO-8859-13} (@file{iso13.edf})
Latin7 (ISO-8859-13) is going to cover the Baltic Rim and re-establish
the Latvian (lv) support lost in Latin6 and may introduce the local
quotation marks.

Support is provided thanks to Ogonkify.
@end deftp

@deftp {Encoding} {ISO-8859-15} (@file{iso15.edf})
The new Latin9 nicknamed Latin0 aims to update Latin1 by replacing
some less needed symbols (some fractions and accents) with forgotten
French and Finnish letters and placing the U+20AC Euro sign in the
cell of the former international currency sign.

Support of the Euro symbol is provided thanks to Ogonkify.
@end deftp

@deftp {Encoding} {KOI8} (@file{koi8.edf})
KOI-8 is a subset of ISO-IR-111 that can be used in Serbia, Belarus
etc.
@end deftp

@deftp {Encoding} {MS-CP1250} (@file{ms-cp1250.edf})
Microsoft's CP-1250 encoding (aka CeP).
@end deftp

@deftp {Encoding} {MS-CP1251} (@file{ms-cp1251.edf})
Microsoft CP1251 is encoding used in Microsoft Windows for Cyrillic
languages
@end deftp

@deftp {Encoding} {Macintosh} (@file{mac.edf})
For the Macintosh encoding.  The support is not sufficient, and a lot
of characters may be missing at the end of the job (especially Greek
letters).
@end deftp