.\" Copyright (c) 1996 Eric S. Raymond 
.\" and Andries Brouwer 
.\" Chinese Version Copyright Scorpio,BitBIRD  www.linuxforum.net, 2000
.\" 
.\" This is free documentation; you can redistribute it and/or
.\" modify it under the terms of the GNU General Public License as
.\" published by the Free Software Foundation; either version 2 of
.\" the License, or (at your option) any later version.
.\"
.\" This is combined from many sources, including notes by aeb and
.\" research by esr. Portions derive from a writeup by Ramon Czybora.
.\"
.TH CHARSETS 7 "November 5th, 1996" "Linux" "Linux Programmer's Manual"
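.SH NAME
charsets \- programmer's view of character sets and internationalization
.SH DESCRIPTION
Linux is an international operating system.
Various of its utilities and device drivers (including the console driver)
support multilingual character sets, including Latin-alphabet letters with
diacritical marks, accents, and ligatures, and entire non-Latin alphabets
such as Greek, Cyrillic, Arabic, and Hebrew.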
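.LP
This manual page presents a programmer's-eye view of different
character-set standards and how they fit together on Linux.
Standards discussed include ASCII, ISO 8859, KOI8-R, Unicode,
ISO 2022 and ISO 4873.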
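.SH ASCII
ASCII (American Standard Code for Information Interchange) is the original
7-bit character set, originally designed for American English.
It is currently described by the ECMA-6 standard.
.LP
An ASCII variant that replaces the American crosshatch/octothorpe/hash
pound symbol with the British pound-sterling symbol is used in Great Britain.
When needed, the American and British variants may be distinguished as
"US ASCII" and "UK ASCII".
.LP
As Linux was written for hardware designed in the US, it natively
supports US ASCII.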
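.SH ISO 8859
ISO 8859 is a series of 10 8-bit character sets, all of which have US ASCII
in their low (7-bit) half, invisible control characters in positions
128 to 159, and 96 fixed-width graphics in positions 160-255.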
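.LP
The layout just described can be sketched in a few lines of Python.
This is only an illustration: the function name
.B iso8859_region
and the sample bytes are assumptions made for the example and are not part
of any standard or library.
.nf
```python
def iso8859_region(byte):
    """Classify a byte value (0-255) by its role in an ISO 8859 set."""
    if not 0 <= byte <= 255:
        raise ValueError("not a byte value: %r" % (byte,))
    if byte < 128:
        return "ascii"      # low half, identical to US ASCII
    if byte < 160:
        return "control"    # positions 128-159, invisible controls
    return "graphic"        # positions 160-255, 96 printable glyphs


# 0x41 is "A", 0x85 is a control code, 0xE9 is e-acute in Latin-1.
sample = bytes([0x41, 0x85, 0xE9])
regions = [iso8859_region(b) for b in sample]

# Python's "latin-1" codec maps each byte to the equal code point.
code_points = [ord(ch) for ch in sample.decode("latin-1")]
```
.fi
.LP
The three regions are the same for every part of ISO 8859; only the
meaning of the 96 graphic positions changes from one part to the next.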
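.LP
Of these, the most important is ISO 8859-1 (Latin-1).
It is natively supported in the Linux console driver,
fairly well supported in X11R6, and is the base character set of HTML.
.LP
Console support for the other 8859 character sets is available under
Linux through user-mode utilities (such as
.BR setfont (8))
that modify keyboard bindings and the EGA graphics table
and employ the "user mapping" font table in the console driver.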
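.LP
Here are brief descriptions of each set:
.TP
8859-1 (Latin-1)
Latin-1 covers most Western European languages such as Albanian, Catalan,
Danish, Dutch, English, Faroese, Finnish, French, German, Galician,
Irish, Icelandic, Italian, Norwegian, Portuguese, Spanish, and Swedish.
The lack of the ligatures Dutch ij and French oe, and of old-style
,,German`` quotation marks, is tolerable.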
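.TP
8859-2 (Latin-2)
Latin-2 supports most Latin-written Slavic and Central European languages:
Croatian, Czech, German, Hungarian, Polish, Rumanian, Slovak, and Slovene.
.TP
8859-3 (Latin-3)
Latin-3 is popular with authors of Esperanto, Galician, Maltese, and Turkish.
.TP
8859-4 (Latin-4)
Latin-4 introduced letters for Estonian, Latvian, and Lithuanian.
It is essentially obsolete; see 8859-10 (Latin-6).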
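.TP
8859-5
Cyrillic letters supporting Bulgarian, Byelorussian, Macedonian,
Russian, Serbian and Ukrainian.
Ukrainians read the letter `ghe' with downstroke as `heh' and would need
a ghe with upstroke to write a correct ghe.
See the discussion of KOI8-R below.
.TP
8859-6
Supports Arabic.
The 8859-6 glyph table is a fixed font of separate letter forms,
but a proper display engine should combine these
using the proper initial, medial, and final forms.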
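.TP
8859-7
Supports Modern Greek.
.TP
8859-8
Supports Hebrew.
.TP
8859-9 (Latin-5)
This is a variant of Latin-1 that replaces rarely used Icelandic letters
with Turkish ones.
.TP
8859-10 (Latin-6)
Latin-6 adds the last Inuit (Greenlandic) and Sami (Lappish) letters that
were missing in Latin-4, so that the entire Nordic area is covered.
RFC 1345 listed a preliminary and different `latin6'.
Skolt Sami still needs a few more accents than these.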
.TP
8859-13 (Latin-7)
.TP
8859-14 (Latin-8)
.TP
8859-15
Adds the Euro sign and the French and Finnish letters that were missing
in Latin-1.
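.SH KOI8-R
KOI8-R is a non-ISO character set popular in Russia.
The lower half is US ASCII;
the upper half is a Cyrillic character set somewhat better designed
than ISO 8859-5.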
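.LP
As a small illustration, the sketch below encodes the same three Cyrillic
letters in both sets using Python's standard
.B koi8_r
and
.B iso8859_5
codecs.
The sample word and the variable names are assumptions chosen for the
example; the letters are built from code points so that the listing itself
stays plain ASCII.
.nf
```python
# Cyrillic small letters em, i, er, written as code points.
word = chr(0x043C) + chr(0x0438) + chr(0x0440)

koi8 = word.encode("koi8_r")
iso5 = word.encode("iso8859_5")

# ASCII text comes out the same in both sets.
ascii_same = "Linux".encode("koi8_r") == "Linux".encode("iso8859_5")

# Cyrillic letters land in the upper half of both sets,
# but at different byte values.
upper_half = all(b >= 0x80 for b in koi8 + iso5)
differs = koi8 != iso5
```
.fi
.LP
Because the two sets disagree in the upper half, Cyrillic text must carry
or imply its encoding; only the ASCII portion is safe to read without it.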
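.LP
Console support for KOI8-R is available under Linux through user-mode
utilities that modify keyboard bindings and the EGA graphics table,
and employ the "user mapping" font table in the console driver.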
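.SH UNICODE
Unicode (ISO 10646) is a standard which aims to unambiguously represent
every known glyph in every human language.
Unicode's native encoding is 32-bit (older versions used 16 bits).
Information on Unicode is available at <http://www.unicode.com>.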
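.LP
Linux represents Unicode using the 8-bit Unicode Transfer Format (UTF-8).
UTF-8 is a variable-length encoding of Unicode.
It uses 1 byte to code 7 bits, 2 bytes for 11 bits, 3 bytes for 16 bits,
4 bytes for 21 bits, 5 bytes for 26 bits, and 6 bytes for 31 bits.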
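.LP
The byte counts listed above can be turned into a lookup, sketched here in
Python.
The names
.B PAYLOAD_BITS
and
.B utf8_length
are assumptions made for this example.
The sketch follows the 31-bit scheme described on this page; note that the
later RFC 3629 definition of UTF-8 stops at 4 bytes.
.nf
```python
# Payload bits carried by a sequence of 1 to 6 bytes.
PAYLOAD_BITS = {1: 7, 2: 11, 3: 16, 4: 21, 5: 26, 6: 31}


def utf8_length(code_point):
    """Return how many bytes the 31-bit UTF-8 scheme needs."""
    if code_point < 0:
        raise ValueError("code point must not be negative")
    for nbytes in sorted(PAYLOAD_BITS):
        if code_point < (1 << PAYLOAD_BITS[nbytes]):
            return nbytes
    raise ValueError("code point does not fit in 31 bits")


lengths = [utf8_length(cp) for cp in (0x41, 0xE9, 0x20AC, 0x10000)]
```
.fi
.LP
For code points that Python itself can encode, the result agrees with
the length of
.BR "chr(cp).encode(""utf-8"")" .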
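.LP
Let 0, 1, x stand for a zero, a one, or an arbitrary bit.
A byte 0xxxxxxx stands for the Unicode 00000000 0xxxxxxx,
which codes the same symbol as the ASCII 0xxxxxxx.
Thus ASCII goes unchanged into UTF-8, and people using only ASCII
do not notice any change: not in code, and not in file size.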
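.LP
A byte 110xxxxx is the start of a 2-byte code, and
110xxxxx 10yyyyyy is assembled into 00000xxx xxyyyyyy.
A byte 1110xxxx is the start of a 3-byte code, and
1110xxxx 10yyyyyy 10zzzzzz is assembled into xxxxyyyy yyzzzzzz.
(When UTF-8 is used to code the 31-bit ISO 10646,
this progression continues up to 6-byte codes.)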
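.LP
The bit layouts above translate directly into shifts and masks.
The following Python sketch covers only the 1-, 2- and 3-byte forms;
the function name
.B utf8_encode_bmp
is an assumption made for this example, and the sketch does not reject
surrogate code points the way a complete encoder would.
.nf
```python
def utf8_encode_bmp(code_point):
    """Encode a code point below 0x10000 using the layouts above."""
    if code_point < 0:
        raise ValueError("code point must not be negative")
    if code_point < 0x80:
        return bytes([code_point])                       # 0xxxxxxx
    if code_point < 0x800:
        return bytes([0xC0 | (code_point >> 6),          # 110xxxxx
                      0x80 | (code_point & 0x3F)])       # 10yyyyyy
    if code_point < 0x10000:
        return bytes([0xE0 | (code_point >> 12),         # 1110xxxx
                      0x80 | ((code_point >> 6) & 0x3F), # 10yyyyyy
                      0x80 | (code_point & 0x3F)])       # 10zzzzzz
    raise ValueError("needs more than 3 bytes")


euro = utf8_encode_bmp(0x20AC)   # the Euro sign, a 3-byte code
```
.fi
.LP
Comparing the result with Python's built-in
.B utf-8
codec for a few code points is a quick way to check the masks.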
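.LP
For ISO-8859-1 users this means that characters with the high bit set
are now coded with two bytes.
This tends to expand ordinary text files by one or two percent.
There are no conversion problems, however, since the Unicode value of
an ISO-8859-1 symbol equals its ISO-8859-1 value
(extended by eight leading zero bits).
For Japanese users this means that the 16-bit codes now in common use
will take three bytes, and extensive mapping tables are required.
Many Japanese therefore prefer ISO 2022.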
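.LP
A brief Python sketch of the ISO-8859-1 case follows.
The helper name
.B latin1_to_utf8
and the sample text are assumptions made for the example.
.nf
```python
def latin1_to_utf8(data):
    """Recode ISO-8859-1 bytes as UTF-8."""
    # Each Latin-1 byte value is also its Unicode code point, so the
    # decode step needs no mapping table.
    return data.decode("latin-1").encode("utf-8")


latin1_text = bytes([0x63, 0x61, 0x66, 0xE9])   # "caf" plus e-acute
utf8_text = latin1_to_utf8(latin1_text)
growth = len(utf8_text) - len(latin1_text)

# A kanji from the 16-bit range (U+65E5) needs three UTF-8 bytes.
kanji_length = len(chr(0x65E5).encode("utf-8"))
```
.fi
.LP
Only the byte with the high bit set grows; the three ASCII bytes are
copied through unchanged.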
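.LP
Note that UTF-8 is self-synchronizing: 10xxxxxx is a tail,
and any other byte is the head of a code.
Note that the only way ASCII bytes occur in a UTF-8 stream is as themselves.
In particular, there are no embedded NULs or '/'s
that form part of some larger code.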
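.LP
Self-synchronization means a reader dropped at an arbitrary offset can find
the nearest character boundary by skipping tail bytes.
The Python sketch below shows one way to do that; the names
.B is_tail
and
.B char_start
are assumptions made for the example, and the input is assumed to be
well-formed UTF-8.
.nf
```python
def is_tail(byte):
    """True for a continuation byte of the form 10xxxxxx."""
    return (byte & 0xC0) == 0x80


def char_start(data, pos):
    """Back up from pos to the head byte of the code containing it."""
    while pos > 0 and is_tail(data[pos]):
        pos -= 1
    return pos


# "/" (U+002F), Euro sign (U+20AC), e-acute (U+00E9)
data = (chr(0x2F) + chr(0x20AC) + chr(0xE9)).encode("utf-8")
heads = [i for i, b in enumerate(data) if not is_tail(b)]
slash_count = sum(1 for b in data if b == 0x2F)
```
.fi
.LP
The slash byte appears exactly once in the sample, as itself; no multibyte
code contains a byte below 0x80.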
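.LP
Since ASCII, and in particular NUL and '/', are unchanged,
the kernel does not notice that UTF-8 is being used.
It does not care at all what the bytes it is handling stand for.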
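.LP
Rendering of Unicode data streams is typically handled through
`subfont' tables which map a subset of Unicode to glyphs.
Internally the kernel uses Unicode to describe the subfont
loaded in video RAM.
This means that in UTF-8 mode one can use a character set
with 512 different symbols.
This is not enough for Japanese, Chinese and Korean,
but it is enough for most other purposes.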
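.SH "ISO 2022 AND ISO 4873"
The ISO 2022 and 4873 standards describe a font-control model
based on VT100 practice.
This model is (partially) supported by the Linux kernel and by
.BR xterm (1).
It is popular in Japan and Korea.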
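.LP
There are 4 graphic character sets, called G0, G1, G2 and G3.
One of them is the current character set for codes with
the high bit zero (initially G0), and one of them is the current
character set for codes with the high bit one (initially G1).
Each graphic character set has 94 or 96 characters
and is essentially a 7-bit character set.
It uses codes either 040-0177 (041-0176) or 0240-0377 (0241-0376).
G0 always has size 94 and uses codes 041-0176.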
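.LP
Switching between character sets is done using the shift functions
^N (SO or LS1), ^O (SI or LS0), ESC n (LS2), ESC o (LS3),
ESC N (SS2), ESC O (SS3), ESC ~ (LS1R), ESC } (LS2R), ESC | (LS3R).
The function LS\fIn\fP makes character set G\fIn\fP the current one
for codes with the high bit zero.
The function LS\fIn\fPR makes character set G\fIn\fP the current one
for codes with the high bit one.
The function SS\fIn\fP makes character set G\fIn\fP (\fIn\fP=2 or 3)
the current one for the next character only
(regardless of the value of its high-order bit).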
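.LP
The locking shifts can be modelled as a small table plus a state update,
sketched here in Python.
The labels "GL" and "GR" for the low and high halves come from ISO 2022
terminology rather than from this page, and the names
.B LOCKING_SHIFTS
and
.B apply_shifts
are assumptions made for the example.
The single shifts SS2 and SS3 are left out because they affect only the
next character.
.nf
```python
ESC = 0x1B

# name -> (byte sequence, affected half, index n of the set Gn)
LOCKING_SHIFTS = {
    "LS0":  (bytes([0x0F]), "GL", 0),             # ^O (SI)
    "LS1":  (bytes([0x0E]), "GL", 1),             # ^N (SO)
    "LS2":  (bytes([ESC, ord("n")]), "GL", 2),
    "LS3":  (bytes([ESC, ord("o")]), "GL", 3),
    "LS1R": (bytes([ESC, ord("~")]), "GR", 1),
    "LS2R": (bytes([ESC, ord("}")]), "GR", 2),
    "LS3R": (bytes([ESC, ord("|")]), "GR", 3),
}


def apply_shifts(names, state=None):
    """Return the sets invoked into each half after the given shifts."""
    # Initially G0 serves the low half and G1 the high half.
    state = dict(state or {"GL": 0, "GR": 1})
    for name in names:
        _, half, index = LOCKING_SHIFTS[name]
        state[half] = index
    return state


after = apply_shifts(["LS2", "LS3R"])
```
.fi
.LP
A locking shift stays in effect until another shift replaces it, which is
why the state is carried from one call to the next.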
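.LP
A 94-character set is designated as G\fIn\fP character set
by an escape sequence ESC ( xx (for G0), ESC ) xx (for G1),
ESC * xx (for G2), or ESC + xx (for G3), where xx is a symbol
or a pair of symbols found in the ISO 2375 International
Register of Coded Character Sets.
For example, ESC ( @ selects the ISO 646 character set as G0,
ESC ( A selects the UK standard character set
(with pound instead of number sign),
ESC ( B selects ASCII (with dollar instead of currency sign),
ESC ( M selects a character set for African languages,
ESC ( ! A selects the Cuban character set, and so on.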
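.LP
A 96-character set is designated as G\fIn\fP character set
by an escape sequence ESC - xx (for G1), ESC . xx (for G2)
or ESC / xx (for G3).
For example, ESC - G selects the Hebrew alphabet as G1.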
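.LP
A multibyte character set is designated as G\fIn\fP character set
by an escape sequence ESC $ xx or ESC $ ( xx (for G0),
ESC $ ) xx (for G1), ESC $ * xx (for G2), or ESC $ + xx (for G3).
For example, ESC $ ( C selects the Korean character set for G0.
The Japanese character set selected by ESC $ B has a more
recent version selected by ESC & @ ESC $ B.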
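.LP
The three families of designation sequences share one pattern, which the
Python sketch below assembles.
The function name
.B designate
and its
.B kind
argument are assumptions made for the example.
For a multibyte set in G0 it always emits the longer ESC $ ( xx form.
The last lines use Python's standard
.B iso2022_jp
codec to show designations in real output.
.nf
```python
ESC = bytes([0x1B])

# Intermediate byte that names the target set G0..G3.
INTERMEDIATE_94 = {0: b"(", 1: b")", 2: b"*", 3: b"+"}
INTERMEDIATE_96 = {1: b"-", 2: b".", 3: b"/"}   # no 96-set form for G0


def designate(gset, final, kind="94"):
    """Build the escape sequence that designates a set as G<gset>."""
    if kind == "94":
        return ESC + INTERMEDIATE_94[gset] + final
    if kind == "96":
        return ESC + INTERMEDIATE_96[gset] + final
    if kind == "multibyte":
        return ESC + b"$" + INTERMEDIATE_94[gset] + final
    raise ValueError("unknown kind: %r" % (kind,))


ascii_g0 = designate(0, b"B")                   # ESC ( B
korean_g0 = designate(0, b"C", "multibyte")     # ESC $ ( C

# One kanji (U+65E5) encoded as ISO-2022-JP: the codec switches to
# the Japanese set with ESC $ B and back to ASCII with ESC ( B.
sample = chr(0x65E5).encode("iso2022_jp")
```
.fi
.LP
Inspecting
.B sample
shows the short ESC $ B form at the start and the ASCII designation at
the end, with the two-byte kanji code in between.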
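.LP
ISO 4873 stipulates a narrower use of character sets, where G0
is fixed (always ASCII), so that G1, G2 and G3
can only be invoked for codes with the high-order bit set.
In particular, ^N and ^O are not used anymore, ESC ( xx
can be used only with xx=B, and ESC ) xx, ESC * xx, ESC + xx
are equivalent to ESC - xx, ESC . xx, ESC / xx, respectively.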
.SH "SEE ALSO"
.BR console (4),
.BR console_ioctl (4),
.BR console_codes (4),
.BR ascii (7),
.BR iso_8859_1 (7),
.BR unicode (7),
.BR utf-8 (7)
.br
.SH "[CHINESE VERSION MAINTAINER]"
Scorpio, E-mail: rawk@chinese.com
.SH "[CHINESE VERSION LAST UPDATED]"
2000/10/30
.br
.B China Linux Forum man page translation project: www.cmpp.net/