1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217
|
.\" Copyright (c) 1996 Eric S. Raymond
.\" and Andries Brouwer
.\" Chinese Version Copyright Scorpio,BitBIRD www.linuxforum.net, 2000
.\"
.\" This is free documentation; you can redistribute it and/or
.\" modify it under the terms of the GNU General Public License as
.\" published by the Free Software Foundation; either version 2 of
.\" the License, or (at your option) any later version.
.\"
.\" This is combined from many sources, including notes by aeb and
.\" research by esr. Portions derive from a writeup by Ramon Czybora.
.\"
.TH CHARSETS 7 "November 5th, 1996" "Linux" "Linux Programmer's Manual"
.SH
charsets \- ԱַʻĹ۵
.SH
Linux һԵIJϵͳĸָʵó
(̨ ) ֶ֧Եַ
иӷŵĸַ(ĸ),
ȫĸϣŴ˹
ϣ )
.LP
ֲԳԱ۹ȥַͬԼ
Linux еһġ۵ı ASCIIISO 8859KOI8-R
UnicodeISO 2022 ISO 4873
.SH ASCII
ASCII (,Ϣ()()) 7-bitַ,
ԭΪʽӢƵġǰ ECMA-6
.LP
Ӣʹһ ASCIIı壨ǣӢֵķŴ
crosshatch/octothorpe/hash İֵţ;Ҫʱ
ģţӢı壨ţ"US ASCII""UK ASCII"
Ϊ
.LP
Ϊ Linux ΪƵӲд, ֧ US ASCII
.SH ISO 8859
ISO 8859 һϵ 10 -bit ַ, ASCII ĵλ (7 -bit ),
128 159 ΧڵIJɼַ 96 ͼΣַ 160-255
LP
ЩַУҪ ISO 8859-1 ( Latin-1 )
ͱ Linux ̨֧֣
X11R6 ֵ֧Ҳܺã HTML Ļַ
.LP
Linux ¿̨Ҳ֧ 8859 ַ
ͨûģʽʵó(
.BR setfont ( 8 ))
ļ̰ EGA ͼα
Լп̨еġuser mappingûӰ䣩
.LP
ÿϼ̵
.TP
8859-1 (Latin-1)
Latin-1 Ǵŷԣ簢, ̩, ,
,Ӣ,Ⱥ,,,,,,,
Ų䡣ȱٺ ij֣ij֣
oeoe֣;ɷ',,' ``ģǿԵġ
.TP
8859-2 (Latin-2)
Latin-2 ִ֧д˹ŷԣ
, ݿ, , , ǣ˹工ˣ
˹ǡ
.TP
8859-3 (Latin-3)
Latin-3 , , , ܻӭģԣ
.TP
8859-4 (Latin-4)
Latin-4 ˰ɳάǣַ ʵϹʱ;
μ 8859-10 (Latin-6 )
.TP
8859-5
Ŵ˹ĸֱ֧, ˹,, , άڿ
ڿ˶ʵ`geh'Ϊ`heh',ͣҪôʵ ghe
дȷghe.μģڣKOI8-R ۡ
עЩдϰҲôҪɣϣĽͲҪ
˸Ϳˣ
.TP
8859-6
ְ֧ 8859-6 ͱǷַʽһ̶ֹ壬һ
ʾӦЩʹúʵĴףмĸʽ
.TP
8859-7
ִ֧ϣ
.TP
8859-8
֧ϣ
.TP
8859-9 (Latin-5)
Latin-1 һֱ壬һЩַõı
.TP
8859-10 (Latin-6)
Latin 6 ĩŦ(룺last Inuit Ҳ֪ǷǶԵ) ()
Sami ( ) Щ Lattin 4 ȱٵģŷַ
RFC 1345 г˳ĺͲͬġ latin 6 " Skolt Sami ȻЩҪ
š
.TP
8859-13 (Latin-7)
.TP
8859-14 (Latin-8)
.TP
8859-15
ŷźͷ֣ Latin-1 ȱ©ġ
.SH KOI8-R
KOI8-R ڶеһ ISO ַ°벿 US ASCII;
ϲDZ ISO 8859-5 ƵĸõĹ˹ַ
.LP
̨Ϊ֧ KOI8-R ַ Linux £
ûģʽʵóļ̰ EGA ͼα
Լڿ̨ʹuser mappingûӳ䣩
.SH UNICODEͳ[]һ,[˫]ַֽ
Unicode ISO 10646 ) һĿر
ÿеÿַ֪Unicode ı 32 λ
( Щİ汾ʹ 16 λ ) Unicode
һЩϢ<http://www.unicode.com>á
.LP
Linux ʹãλ Unicode תƸʽ (UTF-8 ) ʾ Unicode
UTF-8 ǿɱ䳤 Unicode 롣ʹãֽڸ 7 bit
룬ʹãֽڸ bit 룬
ʹãֽڸ bit 룬ʹãֽڸ bit 룬ʹãֽڸ
bit 룬ʹãֽڸ bit
.LP
0,1 , x 㣬һλֽ0xxxxxxx Unicode 00000000 0xxxxxxx
ź ASCII 0xxxxxxx ķһ
ASCII ûиΪ UTF-8ֻ ASCII ˲עκα仯
ڴ룬ҲļС
.LP
ֽ 110xxxxx һ2 ֽڴĿʼ
110xxxxx 10yyyyyy װ 00000xxx xxyyyyyy
ֽ 1110xxxx һ ֽڴĿʼ
1110xxxx 10yyyyyy 10zzzzzz װ xxxxyyyy yyzzzzzz
UTF-8 ʹ 31-bit ISO 10646 룬ôͻ
6 ֽڱ룩
.LP
ISO-8859-1 ûԣζŴλַֽڡ
ͨıļٷֵ㡣ûб任,
Ϊ Unicode ISO-8859-1 ŵֵǵ ISO-8859-1 ֵ
( 8 ǰǰ) ûζԭõ 16 λ뽫
ռ 3 ֽڣһҪչӳձ˱Ƚϲ
ISO 2022
.LP
ע UTF-8 ͬģ 10xxxxxx һβ, κ
ֽDZͷASCII ֽڳ UTF-8 ΨһĿ
ΪԼ֡ر, NULs " /'s ǶЩȽϴıС
.LP
Ϊе ASCIIر, NUL '/', ûб仯, ں˲ע
ʹ UTF-8ںڴֽڴʲô
.LP
Unicode ijͨͨ" subfont "
Unicode һӼַӳ䡣ںڲʹ Unicode
װʾڴ subfontζ UTF-8 еһģʽ
ʹ 512 ͬķšͳ˵Dzģ
˴;
.SH ISO 2022 AND ISO 4873
ISO 2022 4873 һ VT100 ʵֵģͣ
Linux ں˺ xterm (1) ( ) ֧ģ͡
ձͺС
.LP
4 ͼεַΪ G0 G1 G2 G3
֮һǵǰĸλΪ ıַ( G0 ),֮
һǵǰĸλΪıַ( G1 )ÿͼεַ
94 96 ַ ʵһ 7-bitַ
ʹ 040-0177 ( 041-0176 ) 0240-0377 ( 0241-0376 )
еһG0 СΪ 94ʹ 041-0176 ֮ı롣
.LP
ַ֮лתshift functions
^N (SO LS1), ^O (SI LS0), ESC n (LS2), ESC o (LS3),
ESC N (SS2), ESC O (SS3), ESC ~ (LS1R), ESC } (LS2R), ESC | (LS3R).
LS\fIn\fP ַG\fIn\fPΪǰַڸλΪı롣
LS\fIn\fPR ַ G\fIn\fPΪǰַڸλΪı롣
SS\fIn\fP ַG\fIn\fP (\fIn\fP=2 or 3) Ϊǰַ
ֻһַ
ĸλֵʲô
.LP
94 ַļ G\fIn\f ַһ
ESC ( xx G0ESC ) xx G1
ESC * xx G2ESC + xx G3ȴģ xx һ
ISO 2375 עַеһԷš
磬ESC ( @ ѡ ISO 646 ַΪGO
ESC ( A ѡ UK ַ(ðּǺ), ESC ( B ѡ ASCII (
Ԫͨ), ESC ( M Ϊѡһַ ESC ( ! A
ѡŰַ, ȵ. ȵ.
.LP
94 ַļ G\fIn\f ַһ
ESC - xx G1, ESC . xx G2
ESC / xx G3ȱʾ
, ESC - G ѡϣĸΪ G1.
.LP
ֽڵַ G\fIn\fP ַһ
ESC $ xx ESC $ ( xx G0
ESC $ ) xx G1ESC $ * xx G2ESC $ + xx G3ʾ
, ESC $ ( C Ϊ G0ѡַ.
ձַ ESC $ Bѡ
ٽİ汾ESC & @ ESC $ Bѡ.
.LP
ISO 4873 涨һΧȽխʹַ G0ǹ̶ ( ASCII),
G1, G2 G3ֻܱڸߴλ뼯
ǣʹ ^N ^OESC ( xx
xx=B, ESC ) xx, ESC * xx, ESC + xx
ֱȼ ESC - xx, ESC . xx, ESC / xx
.SH ο
.BR console (4),
.BR console_ioctl (4),
.BR console_codes (4),
.BR ascii (7),
.BR iso_8859_1 (7),
.BR unicode (7),
.BR utf-8 (7)
.br
.SH "[İά]"
Scorpio E-mail:rawk@chinese.com
.SH "[ݸ]"
2000/10/30
.br
.B й Linux ̳ man ֲҳƻ:www.cmpp.net/
|