=========================================================================
This is the distribution of enhanced less.
It supports ISO 2022 code extension techniques and Japanese codes.
Please report any problems with it to the author at jam@pobox.com.
See http://www.pobox.com/~jam/less/ for the latest info.
=========================================================================
**Features**
- Supports ISO 2022 code extension techniques in 7 bits and 8 bits.
- Understands all of the following codes:
  - ISO 2022 level 3.
  - A special sequence, ESC 2/4[$] 2/12[,] FE, which is used in MULE.
  - Japanese codes, UJIS and SJIS.
- Outputs the following codes:
  - iso8 (CTEXT, which is defined in X11)
  - iso7 with only the GZDM4, GZD4, G1DM4, G1DM6, G1D4, G1D6, SI and SO
    escape sequences.
  - Japanese codes, UJIS and SJIS.
- Automatic decompression of compressed files.
**Problems**
- To search, less compares characters after converting them into its
  internal codes. If the converted codes contain gaps, the regular
  expression library gets confused. For example, "う3柑a", which
  contains an escape sequence and JIS X0201 Japanese (Katakana) codes
  in UJIS, causes problems.
  - If you have a regular expression library that can detect
    character sets, you will not hit this problem.
  - Alternatively, typing '^R' searches without the regular
    expression library, which avoids the problem.
**Enhancements**
- The JLESSCHARSET environment variable chooses the codes for input
  and output. For example, if you set "japanese-iso7", less reads all
  Japanese codes and iso7, and writes iso7. If you set "ujis", less
  reads and writes only UJIS. Currently you can use the following
  values:
  - Same as LESSCHARSET: ascii, latin1, dos, koi8-r, next
  - Others: iso7, iso8, ujis-iso7, euc-iso7, sjis-iso7, jis,
    ujis-jis, euc-jis, sjis-jis, ujis, euc, jis-ujis,
    jis-euc, sjis, jis-sjis, japanese, japanese-iso7,
    japanese-jis, japanese-ujis, japanese-euc, japanese-sjis
- The following values of JLESSCHARSET set the input and output codes
  separately. For example, if you set "ujis-sjis", less reads only
  UJIS and writes only SJIS.
  - Values: ujis-sjis, euc-sjis, sjis-ujis, sjis-euc
- The JLESSPLANESET environment variable chooses the planes of iso8
  and iso7. Please read the manual for details.
- The JLESSKEYCHARSET environment variable chooses the codes for
  keyboard input. It accepts every value that the JLESSCHARSET
  environment variable accepts.
- Sometimes less gets the automatic handling of Japanese codes wrong.
  The user can change its state with the '@' key. The function name
  for this key is "rotate-right".
- If you have a regular expression library with character set
  detection, invoke the configure script with the "--with-cs-regex"
  option to use it, then compile less.
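
To make the separate input and output codes concrete, the following
Python sketch shows the kind of conversion implied by a value such as
"ujis-sjis" described above: bytes read as UJIS (EUC-JP) are written
out as SJIS. It relies on Python's standard "euc_jp" and "shift_jis"
codecs and is only an illustration, not how jless itself is
implemented. The function name ujis_to_sjis is hypothetical.

```python
def ujis_to_sjis(data: bytes) -> bytes:
    """Re-encode UJIS (EUC-JP) bytes as SJIS (Shift_JIS)."""
    text = data.decode("euc_jp")      # input side: read only UJIS
    return text.encode("shift_jis")   # output side: write only SJIS


# "日本語" encoded as UJIS, then converted to SJIS
ujis_bytes = "日本語".encode("euc_jp")
sjis_bytes = ujis_to_sjis(ujis_bytes)
print(ujis_bytes.hex(), "->", sjis_bytes.hex())
```

The same text has different byte values in the two codes, which is
why the input and output sides have to be chosen to match the file
and the terminal respectively.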
**History about Japanized and ISO 2022-ized less**
- About Japanization of less
  - Kato-san posted a diff which gave less the ability to handle a
    special escape sequence.
  - Kameyama-san posted a patch for UJIS code.
  - Shinoda-san posted a less.exe for SJIS code.
  - Tachi-san posted two patches for JIS and SJIS code.
  - Saito-san posted a kanji conversion filter for less.
  - Ohta-san posted another less which converts roma-ji into kana.
  - Yamakita-san posted another patch to convert them into kanji.
  - Yoshida-san posted two patches to less-73 for UJIS and SJIS code.
  - Yoshida-san posted two patches to less-97 for UJIS and SJIS code.
  - Kato-san posted a patch to less-97 (UJIS) for JIS code.
  - Ishii-san posted a patch to less-97 (UJIS or SJIS) for JIS code.
  - Tanaka-san posted a modified less-123 for SJIS code.
  - Kawakami-san posted a patch to it for JIS code.
  - I changed less-177 for all Japanese codes.
  - Kusakabe-san changed less-178 by using the patches for less-177.
  - Senda-san added a printing function for Japanese code to less-177.
  - Sakai-san changed less-200 by using the patches for less-178.
  - I changed less-205 for all Japanese codes and ISO 2022 codes.
  - Ushine-san changed less-321 by using the patches for less-290.
  - I changed less-330 by using the patches for less-290. It is named
    iso202.
  - I removed some bugs and made jless-330-iso203.
  - I rewrote jless and made jless-330-iso208.
  - I released patches to correct bugs.
- About jless
  - I put 2 unofficial patches with KANJI searching enhancements for
    the ISO 2022-ized less-237 on ftp.
  - I put an unofficial patch on ftp that lets jless-330-iso210 use
    the KANJI regular expression library of gawk-2.15.6-mb1.04. It
    will work with iso212 and later.
  - I put a less.hlp file which is translated into Japanese on ftp.
    It will work with iso212 and later.
**FAQ**
Q: When I try to view Japanese text, less shows only inverse
characters. How do I view it?
A: Set the JLESSCHARSET environment variable to "japanese".
Alternatively, set the LANG environment variable to "ja_JP" if your
computer supports it.
Q: When I try to view Japanese text, less shows only strange
characters such as '$'. How do I view it?
A: Can your terminal or terminal emulator display Japanese? If so,
you probably have to adjust the output of less by setting the
JLESSCHARSET environment variable to "japanese-ujis" or
"japanese-sjis".
Q: When I try to view compressed text, less doesn't work. How do I
view it?
A: To decide whether a file is compressed, less checks its name, and
it uses a "zcat" program to uncompress it. So the name of the
compressed file has to end with ".z", ".gz" or ".Z", and your "zcat"
program has to be able to uncompress it.
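
The name-based check described in this answer can be sketched in
Python as follows. This is only an illustration of the rule, not the
code jless uses; the names COMPRESSED_SUFFIXES and looks_compressed
are hypothetical, and treating the comparison as case-sensitive is an
assumption based on the answer listing ".z" and ".Z" separately.

```python
# Suffixes listed in the answer above (assumed to be case-sensitive).
COMPRESSED_SUFFIXES = (".z", ".gz", ".Z")


def looks_compressed(filename: str) -> bool:
    """Return True if the file name ends with one of the listed suffixes."""
    return filename.endswith(COMPRESSED_SUFFIXES)


for name in ("notes.txt.gz", "notes.txt.Z", "notes.txt"):
    print(name, looks_compressed(name))
```

A compressed file saved under a name without one of these suffixes
would not be recognized by a check of this kind, which matches the
symptom in the question.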
Q: How do I set the default value of the JLESSCHARSET environment
variable?
A: You can use the DEFCHARSET declaration in "defines.h". If your
computer has a LANG property, you also have to set HAVE_LOCALE to 0
in "defines.h". Note: run the "configure" program before you edit
the file.
Q: Sometimes my terminal shows only non-ASCII characters.
A: Some terminal emulators have bugs in their handling of ISO 2022.
Tell less to display only JIS by setting the JLESSCHARSET
environment variable to "japanese-jis".
Q: Sometimes less shows wrong Japanese characters when I try to view
SJIS-coded files.
A: The automatic Japanese code handling routine of less is not
perfect. Please type '^L' to redisplay the text. If that doesn't
help, please type '@' to switch to manual handling.
In addition, if you run less with the -Z option, the automatic
handling routine gives SJIS priority over UJIS. You can change the
default priority, and invert the meaning of the -Z option, by
compiling with the SJIS_PRE declaration.
Or, if you know the file is SJIS before you run less, set the
JLESSCHARSET environment variable to "sjis" to disable the automatic
handling routine.
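
A short Python sketch may help show why automatic detection cannot
always be right: the same bytes can be valid in both UJIS and SJIS.
The example uses Python's standard "euc_jp" and "shift_jis" codecs
and a byte string chosen for illustration; it says nothing about how
the detection routine in jless actually works.

```python
data = b"\xa4\xa2"  # two bytes with no other context

as_ujis = data.decode("euc_jp")     # one hiragana character
as_sjis = data.decode("shift_jis")  # two half-width katakana punctuation marks

print(repr(as_ujis), len(as_ujis))
print(repr(as_sjis), len(as_sjis))
```

Both decodings succeed, so a detector has to guess from context or
from a stated priority, which is the role of the -Z option and the
"sjis" setting mentioned above.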
Q: How do I type strings in SJIS?
A: Set the JLESSKEYCHARSET environment variable to "sjis-jis".
Alternatively, set the DEFKEYCHARSET declaration in "defines.h" to
"sjis-jis".
Q: Can I use regular expressions with Japanese?
A: There are 2 ways. You can use a multibyte regular expression
library, or you can use a regular expression library that can detect
character sets.
Please read "http://www.poboxes.com/~jam/less/".
Q: When I search with Kanji, less behaves strangely.
A: To reduce false matches between an ASCII character and a
non-ASCII character that have the same byte value, less sets the MSB
of every byte of all non-ASCII characters. But your regular
expression library might not support characters with the MSB set. In
that case, invoke the configure script with the "--disable-msb"
option to disable this, then compile less.
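
The bit operation mentioned in this answer can be illustrated with a
small Python sketch. It only shows what "setting the MSB" does to a
7-bit two-byte code; the internal representation used by jless is not
described here, and the helper name set_msb is hypothetical.

```python
def set_msb(code: bytes) -> bytes:
    """Set the most significant bit of every byte."""
    return bytes(b | 0x80 for b in code)


# 0x24 0x22 is a 7-bit two-byte JIS code, but the same bytes are also
# the plain ASCII characters '$' and '"'.
seven_bit = b'$"'
eight_bit = set_msb(seven_bit)

print(seven_bit.hex(), "->", eight_bit.hex())
```

After the operation no byte of the Japanese character falls in the
ASCII range, so an ASCII pattern cannot match it by accident, but the
regular expression library must then accept bytes of 0x80 and above.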
Q: How do I input a search pattern in JIS?
A: Normally less itself uses the "\e$" sequence. To cancel that,
write the following in your "~/.lesskey" file and run the "lesskey"
program.
    #line-edit
    \e$ insert ^V\e$
Q: How do I use emacs-like key bindings?
A: Write the following in your "~/.lesskey" file and run the
"lesskey" program.
    #line-edit
    ^A home
    ^E end
    ^P up
    ^N down
    ^F right
    ^B left
    \ef word-right
    ^D delete
    \ed word-delete
Q: os.c does not compile on NEWS-OS 4.0C.
A: Add the following before "#include <setjmp.h>".
    #define LANGUAGE_C 1
**Conclusion**
- Please send mail to "jam@pobox.com" if you have a problem or a
  suggestion.
- I will keep all patches, including contributed ones, for jless at
  "ftp://ftp.aist-nara.ac.jp/pub/personal/kazusi-m/less". If you make
  any patches, please mail them to me.
Local Variables:
mode: indented-text
fill-column: 70
End: