1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125
|
trscripts
---------
This package gives you two shell scripts: trcs and trbdf. They are
free software. You are welcome to distribute and use them and this
package under certain conditions, see the file GPL for copyright
conditions. But please note that there is NO WARRANTY.
1. trcs
trcs is a program for translating of simple text files from one
codeset to other. When exact translation is impossible it tries to
make approximations. For example pseudographic characters are
approximated by ASCII characters in codesets that have no
pseudographic characters.
Run the script `gen-trcs' to generate the `trcs' program in this way:
./gen_trcs csets fallbacks charmaps >trcs
chmod 755 trcs
Try ./trcs --help for information on how to use `trcs'.
2. trbdf
trbdf is a program for translating of BDF fonts from one encoding to
other. Symbols that are missing in the source font are approximated by
existing symbols when possible.
Run the script `gen-trbdf' to generate the `trbdf' program in this way:
./gen_trbdf xcsets fallbacks charmaps adobegliphlist >trbdf
chmod 755 trbdf
Try ./trbdf --help for information on how to use `trbdf'.
3. Configuration files
The file `csets' contains known codesets in the following format:
first field is the name of codeset, second contains `|'-separated list
of aliases (in case there are none it is just `|' character). After
the second field there may be some comment.
The file `xcsets' contains three-field records. First field is the
name of the encoding, second and third field are "charset registry" and
"charset encoding" respectively.
The file `fallbacks' contains fallbacks. The record in the form
00B7 2022 221A
means that the unicode U+00B7 can be approximated with unicode U+2022
and if necessary with unicode U+221A.
The directory `charmaps' contains two-column character maps from respective
codeset to unicode.
4. How to build and install
Edit Makefile if you want. Then run `make'. This produces the two
shell scripts `trcs' and `trbdf'. Then as root you can run `make
install' to install them and their man-pages in /usr/local/bin and
/usr/local/man respectivly. `make uninstall' removes them from this
directory.
5. History
1.0 (4 Mar 2000)
-- first version of trscripts
1.1 (3 Jun 2000)
-- trbdf supports unicode and uses fallbacks.
-- Fallbacks are improved.
-- Added script `gen_fallbacks' for authomatical generation of
fallbacks.
-- Added man-pages for `trcs' and `trbdf'.
1.2 (8 Jun 2000)
-- Added encodings ECMA-Cyrillic (ISO-IR-111) and KOI8-C.
-- Charmaps are correct about codes 0x00-0x1f. trcs ignores symbols with
such codes, but trbdf uses them.
1.3 (4 Okt 2000)
-- New option for `trbdf': --foundry.
1.4 (25 Jun 2001)
-- trbdf: The canonical Adobe gliph names are used always in outputed
fonts.
-- trbdf: Changes to work faster.
-- Makefile: $(charmaps) is dinamically computed by $(wildcard ..)
-- charmaps/unicode: Unexisting codes are removed.
1.5 (28 Jan 2002)
-- New encodings: PT154, PT254 and Mac-Ukr.
-- A few additions to fallbacks.
-- New encodings for trbdf: ISO 8859-1,2,15.
-- Some codes below 0x1f has been added to fallbacks in order to
allow pseudographics in translated fonts (termsym), thanks to
Dimitar Zhekov.
1.6 (20 Apr 2002)
-- Added fallback for the Tugrik sign.
1.7 (2 Jun 2002)
-- New encodings for trbdf: IBM850, IBM860 IBM863, IBM865.
-- Added fallbacks `epsilon' by `belongs to' and `phi' by `empty
set'.
1.8 (24 Oct 2002)
-- New encoding for trcs and trbdf: ISO-8859-16. New encoding for
trbdf: IBM852.
-- Added fallbacks for latin leters t and s with comma below.
1.9 (10 Nov 2002)
-- Fixed syntax error in csets which caused trcs not to work.
-- New encodings for trbdf: ISO-8859-9 and CP1254.
1.10 (19 Dec 2002)
-- Added new encoding for trbdf (unicode_small) which doesn't
contain character codes after U+2fff.
-- The generated fonts always contain DEFAULT_CHAR 32.
1.11 (7 Jun 2003)
-- Termsymbols with codes below 0x1F are used by trbdf as fallbacks
for some pseudographic symbols.
1.12 (30 Okt 2003)
-- New option for trbdf: --no-fallback.
1.13 (16 Nov 2003)
-- trbdf tries to approximate U+2010 (hyphen) even with --no-fallback.
|