File: README.iso

=========================================================================
    This is the distribution of an enhanced less.
    It supports ISO 2022 code extension techniques and Japanese codes.
    Please report any problems with it to the author at jam@pobox.com.
    See http://www.pobox.com/~jam/less/ for the latest info.
=========================================================================


**Features**

 - Supports ISO 2022 code extension techniques in 7 bits and 8 bits.
 - Understands all of the following codes.
   - ISO 2022 level 3.
   - A special sequence, ESC 2/4[$] 2/12[,] FE, which is used in MULE.
   - Japanese codes, UJIS and SJIS.
 - Outputs the following codes.
   - iso8 (CTEXT, which is defined in X11)
   - iso7 with only GZDM4, GZD4, G1DM4, G1DM6, G1D4, G1D6, SI and SO
     escape sequences.
   - Japanese codes, UJIS and SJIS.
 - Automatic decompression of compressed files.

**Problems**

 - To search, less compares characters after converting them into
   internal codes.  If the converted codes contain gaps, the regular
   expression library gets confused.  For example, "う3柑a", which
   contains an escape sequence and JIS X0201 Japanese (Katakana)
   codes in UJIS, causes problems.
 - If you have a regular expression library that can detect
   character sets, you do not get this problem.
 - Alternatively, you can avoid the problem by searching without the
   regular expression library: type '^R'.

**Enhancements**

 - The JLESSCHARSET environment variable chooses the codes for input
   and output.  For example, if you set "japanese-iso7", less reads
   all Japanese codes and iso7, and writes iso7.  If you set "ujis",
   less reads and writes only UJIS.  Currently you can use the
   following values.
   - Same as LESSCHARSET: ascii, latin1, dos, koi8-r, next
   - Others: iso7, iso8, ujis-iso7, euc-iso7, sjis-iso7, jis,
	     ujis-jis, euc-jis, sjis-jis, ujis, euc, jis-ujis,
	     jis-euc, sjis, jis-sjis, japanese, japanese-iso7,
	     japanese-jis, japanese-ujis, japanese-euc, japanese-sjis
 - The following values of JLESSCHARSET set the input and output
   codes separately.  For example, if you set "ujis-sjis", less reads
   only UJIS and writes only SJIS.
   - Values: ujis-sjis, euc-sjis, sjis-ujis, sjis-euc
 - The JLESSPLANESET environment variable chooses the plane of iso8
   and iso7.  Please read the manual for details.
 - The JLESSKEYCHARSET environment variable chooses the codes for
   input from the keyboard.  It accepts all values of the
   JLESSCHARSET environment variable.
 - Sometimes less may get the automatic handling of Japanese codes
   wrong.  The user can change its state with the '@' key.  The
   function name for this is "rotate-right".
 - If you have a regular expression library with character set
   detection, invoke the configure script with the "--with-cs-regex"
   option to use it.  Then compile less.
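
 The environment variables above could be set from a POSIX shell as
 in the following sketch.  The helper name with_charset, the binary
 name "jless" and the file name are assumptions made for
 illustration; the reading of "japanese-ujis" as "accept every
 Japanese code, write UJIS" is inferred by analogy with the
 "japanese-iso7" example.

```shell
# Hypothetical helper (not part of jless): run one command with
# JLESSCHARSET set for that invocation only.
with_charset() {
    charset=$1
    shift
    env JLESSCHARSET="$charset" "$@"
}

# Session-wide setting for a UJIS terminal (assumed meaning: read all
# Japanese codes, write UJIS).
JLESSCHARSET=japanese-ujis
export JLESSCHARSET

# Keyboard input accepts the same set of values as JLESSCHARSET.
JLESSKEYCHARSET=ujis
export JLESSKEYCHARSET

# Example use, assuming the binary is installed as "jless":
#   with_charset ujis-sjis jless notes.txt
```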

**History of the Japanized and ISO 2022-ized less**

 - About Japanization of less
   - Kato-san posted a diff that gives less the ability to handle a
     special escape sequence.
   - Kameyama-san posted a patch for UJIS code.
   - Shinoda-san posted a less.exe for SJIS code.
   - Tachi-san posted two patches for JIS and SJIS code.
   - Saito-san posted a kanji conversion filter for less.
   - Ohta-san posted another less that converts roma-ji into kana.
   - Yamakita-san posted another patch to convert them into kanji.
   - Yoshida-san posted two patches of less-73 for UJIS and SJIS code.
   - Yoshida-san posted two patches of less-97 for UJIS and SJIS code.
   - Kato-san posted a patch of less-97(UJIS) for JIS code.
   - Ishii-san posted a patch of less-97(UJIS or SJIS) for JIS code.
   - Tanaka-san posted a modified less-123 for SJIS code.
   - Kawakami-san posted a patch of it for JIS code.
   - I changed less-177 for all Japanese codes.
   - Kusakabe-san changed less-178 by using patches for less-177.
   - Senda-san added a printing function for Japanese codes to less-177.
   - Sakai-san changed less-200 by using patches for less-178.
   - I changed less-205 for all Japanese codes and ISO 2022 codes.
   - Ushine-san changed less-321 by using patches for less-290.
   - I changed less-330 by using patches for less-290.  It is named
     iso202.
   - I removed some bugs and made jless-330-iso203.
   - I re-wrote jless and made jless-330-iso208.
   - I released patches to correct bugs.

 - About jless
   - I put two unofficial patches with KANJI searching enhancements
     for the ISO 2022-ized less-237 on the FTP site.
   - I put an unofficial patch on the FTP site that makes
     jless-330-iso210 use the KANJI regular expression library of
     gawk-2.15.6-mb1.04.  It works with iso212 and later.
   - I put a less.hlp file translated into Japanese there as well.
     It works with iso212 and later.

**FAQ**

 Q: When I try to view a Japanese text, less shows only inverse
    characters.  How do I view it?
 A: Set the JLESSCHARSET environment variable to "japanese".
    Alternatively, set the LANG environment variable to "ja_JP" if
    your computer supports it.

 Q: When I try to view a Japanese text, less shows only strange
    characters such as '$'.  How do I view it?
 A: Can your terminal or terminal emulator display Japanese?  If so,
    you probably have to adjust the output of less by setting the
    JLESSCHARSET environment variable to "japanese-ujis" or
    "japanese-sjis".

 Q: When I try to view compressed text, less doesn't work.  How do I
    view it?
 A: less uses the name of a file to check whether it is compressed,
    and uses a "zcat" program to uncompress it.  So the name of the
    compressed file has to end with ".z", ".gz" or ".Z", and the
    "zcat" program has to be able to uncompress it.
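
    The name check described in this answer can be illustrated with a
    short POSIX shell sketch.  It is not the code used inside less;
    the function names is_compressed_name and show_file are
    hypothetical, and the sketch assumes a "zcat" on your PATH that
    understands the file.

```shell
# Hypothetical helper: succeed only if the name ends with one of the
# suffixes named in the answer above (".z", ".gz" or ".Z").
is_compressed_name() {
    case $1 in
        *.z|*.gz|*.Z) return 0 ;;
        *)            return 1 ;;
    esac
}

# Print a file, passing it through zcat only when the name looks
# compressed; everything else is printed unchanged.
show_file() {
    if is_compressed_name "$1"; then
        zcat "$1"
    else
        cat "$1"
    fi
}
```

    If a compressed file has a different suffix, renaming it so that
    it ends with one of these suffixes should let the check succeed.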

 Q: How do I set the default value of the JLESSCHARSET environment
    variable?
 A: You can use the DEFCHARSET declaration in "defines.h".  If your
    computer has a LANG property, you have to set HAVE_LOCALE to 0 in
    "defines.h".  Note: run the "configure" program before you edit
    the file.

 Q: Sometimes my terminal shows only non-ASCII characters.
 A: Some terminal emulators have bugs in their ISO 2022 handling.
    Tell less to display only JIS by setting the JLESSCHARSET
    environment variable to "japanese-jis".

 Q: Sometimes less shows wrong Japanese characters when I view
    SJIS-coded files.
 A: The automatic Japanese code handling routine of less is not
    perfect.  Please type '^L' to redisplay the text.  If that does
    not help, please type '@' to switch to manual handling.

    Alternatively, if you run less with the -Z option, less gives
    priority to SJIS over UJIS in the automatic handling routine.
    You can change the default priority and invert the meaning of the
    -Z option by compiling with the SJIS_PRE declaration.

    Or, if you know that the file is SJIS before you run less, set
    the JLESSCHARSET environment variable to "sjis" to disable the
    automatic handling routine.

 Q: How do I type strings in SJIS?
 A: Set the JLESSKEYCHARSET environment variable to "sjis-jis".
    Alternatively, set the DEFKEYCHARSET declaration in "defines.h"
    to "sjis-jis".

 Q: Can I use regular expressions with Japanese?
 A: There are two ways.  You can use a multibyte regular expression
    library, or you can use a regular expression library that can
    detect character sets.

    Please read "http://www.poboxes.com/~jam/less/".

 Q: When I search with Kanji, less behaves strangely.
 A: less sets the MSB of every byte of each non-ASCII character to
    reduce mismatches between an ASCII character and a non-ASCII
    character that have the same value.  But your regular expression
    library might not support characters with the MSB set.  In that
    case, invoke the configure script with the "--disable-msb" option
    to disable this.  Then compile less.
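
    The rebuild described in this answer might look like the
    following sketch.  It assumes a POSIX shell, that "make" is the
    build command, and that you run it from the top of the unpacked
    source tree; the function name rebuild_jless is hypothetical.

```shell
# Hypothetical wrapper: re-run configure with the given options and,
# only if that succeeds, rebuild.
rebuild_jless() {
    ./configure "$@" && make
}

# Example (run inside the source tree):
#   rebuild_jless --disable-msb
```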

 Q: How do I input a search pattern in JIS?
 A: Normally less uses the "\e$" sequence.  Write the following in
    your "~/.lesskey" file and run the "lesskey" program to cancel
    that.

#line-edit
\e$	insert ^V\e$

 Q: How do I use emacs-like key bindings?
 A: Write the following in your "~/.lesskey" file and run the
    "lesskey" program.

#line-edit
^A	home
^E	end
^P	up
^N	down
^F	right
^B	left
\ef	word-right
^D	delete
\ed	word-delete

 Q: os.c does not compile on NEWS-OS 4.0C.
 A: Add the following before "#include <setjmp.h>".

#define LANGUAGE_C   1

**Conclusion**

 - Please mail "jam@pobox.com" if you have a problem or a suggestion.
 - I will keep all patches for jless, including contributed ones, at
   "ftp://ftp.aist-nara.ac.jp/pub/personal/kazusi-m/less".  If you
   make any patches, please mail them to me.


Local Variables:
mode: indented-text
fill-column: 70
End: