1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966
|
Fri Nov 19 18:35:20 2004 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile: Use ``install.sh'' instead of ``cp -pd'' because
option ``-d'' is not portable (report from an anonymous person).
* install.sh: New file.
Sun Jul 25 21:06:40 2004 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_decode): Fix Debian Bug report logs - #260430.
(mb_conv_for_specific_ces): ditto.
Tue Aug 19 18:59:16 2003 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile: Fix typo ``mee'' --> ``me''.
Tue Aug 19 18:58:33 2003 Tsutomu OKADA <tsutomu.okada@furuno.co.jp>
* Makefile: Fix compilation failure on Cygwin environment.
Thu Aug 14 08:58:29 2003 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile: Fix typo ``definisions'' --> ``definitions''.
* ucs-to-iso-lib.pl (unihan): Calculation of universal codepoints
for JIS X 0213 characters was broken.
Wed Apr 23 18:33:56 2003 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbconv.c (doit): ``istream_eof'' should be cleared (otherwise,
when multiple files were give on command line, line oriented
operations ignore files other than 1st one).
(doit): Follow convention that, when multiple files are given on
comand line, file names are prevented to lines in output.
Sat Apr 20 12:18:50 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* uirx.c (uirx_parse_end): Closing epsilon transition was added in
wrong manner.
Tue Feb 19 02:08:20 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbio.c (mb_new_pos_info): Re-allocation of vector of
``mb_pos_info_t'' did corrupt application using ``mb_info_t''
struct returned by this function.
(mb_new_pos_info): ``mb_pos_info_t'' struct was not initialized
correctly.
Mon Feb 18 03:40:08 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_conv_for_specific_ces): Add check whehter ``ces''
is NULL or not.
Sun Jan 27 02:40:31 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb.h: Definition of ``mb_conv_t'' is modified.
* mb2iso.c (mb_conv_for_specific_ces): New API to impliments CES
specific conversion function.
Sat Jan 26 19:53:50 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbfind.c (mb_{mem,str}_to_wstr): New APIs for block conversion
of multi-octet string to wide string.
Mon Jan 14 02:02:43 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* CCS conversion functions are now passed sequence of wide
characters instead of a single wide character.
Fri Jan 11 02:00:29 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_encode): Read point in buffer of ``mb_info_t''
struct was not updated porperly.
Thu Jan 10 22:12:58 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_decode): New API to provide reverse mapping of
``mb_encode()''.
* iso2mb.c (mb_encode): New API to encode input from a stream into
wide character sequence or into ``x-moe-internal'' encoded octet
sequence.
* Bit width of internal wide character is changed to 24.
Following after this change, ``x-moe-internal'' has become 4
octets (at most) encoding.
Thu Dec 13 01:32:57 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* btri.c (main): Add support for case sensitive string table.
Thu Dec 6 00:29:01 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: UTF-16{,BE,LE} are removed from autodetection
candidates (including CESs incompatible with US-ASCII is harmful).
* mbinit.c: UTF-16{,BE,LE} are included in autodetection
candidates.
* altmalloc.{c,h}: Obsolete files were included.
* iso2mb.c (mb_cs_detector_read, mb_getmem, mb_getline):
Completely broken.
Wed Dec 5 02:17:15 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_mkunbound_cs_detector): Clear flag
``MB_FLAG_UNKNOWNCS''.
(mb_cs_detector_read): Clear flag ``MB_FLAG_UNKNOWNCS'' when
detection finishes.
(mb_bind_cs_detector): Set flag ``MB_FLAG_UNKNOWNCS''.
Thu Nov 29 14:24:37 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso.pl: Stop to permit line-breaking after a character
other than a kind of space or a CJK character.
Thu Nov 8 14:25:09 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* altmalloc.c (alt_{set_atomic_allocater,call_malloc_atomic}): New
APIs.
Thu Aug 30 19:32:00 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile: Options ``v'' (to increase verbosity) were removed
from some commands (base on report from Ambrose Li
<acli@ada.dhs.org>).
Fri Jun 15 22:42:24 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* eaw_ucswidth.pl: Data format of EastAsianWidth.txt of Unicode
3.1 was not processed properly.
Wed Jun 13 21:51:31 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbfind.c: New width table ``eaw_a2w_ucswidth'' of Unicode
characters based on EastAsianWidth.txt is introduced, and original
table ``eaw_ucswidth'' is renamed to ``eaw_a2n_ucswidth'' (based
on report from Hideyuki SHIRAI <shirai@rdmg.mgcs.mei.co.jp>).
* eaw_ucswidth.pl: Criterion whether a codepoint should be
registered or not, was wrong.
Tue Jun 5 01:43:25 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbcesdefs.pl: New CES ``TIS-620''.
Sat Jun 2 00:57:50 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_iso2022_SSL_encoder): Mapping between final octet
of 7bit single shift sequence and intermediate buffer was wrong
(report from Akihiro Arisawa <ari@mbf.sphere.ne.jp>).
Tue May 29 13:50:00 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_fetch_wchar): When compiled with GCC, the body was
not included in libmoe, which might cause compilation failure of
programms using libmoe (report from Kenzi Cano
<kc@furukawa.ch.kagu.sut.ac.jp>).
Mon May 28 02:05:28 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso-lib.pl (make_map): Regexp. to check a mapping table
of which encoding a file contains, was too strict, and fails to
find windows-1253 <-> Unicode mapping table.
Mon May 21 13:19:14 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Update maping tables for ``jis0208-to-ucs-extra'' and
``ucs-to-jis0208-extra'' based on the update from Ambrose Li
<acli@ada.dhs.org>.
Sun May 13 13:41:55 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbfind.c (mb_mem_to_wchar): Octet at the beginning of string was
returned unless multiple octet character was detected.
Thu Apr 26 23:07:58 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile: Add description about required changes for Solaris 7
(information from Hideyuki SHIRAI <shirai@rdmg.mgcs.mei.co.jp>).
Wed Apr 25 21:08:08 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_info2mb): Terminating NUL character was not stored.
(mb_info2mb): Characters were not stored into correct place.
Tue Apr 17 00:29:11 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_vmem2iso_setup): ``mb_init_w()'' was used instead
of ``mb_vinit_w()''.
* ucs-to-iso-lib.pl (unihan): Failed to find line for CNS 11643 in
Unihan.txt.
Mon Apr 16 13:44:28 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso-lib.pl (unihan): New mapping table entry generator
using Unihan.txt of Unicode 3.1.
* mb2iso.c (mb_vmem2b64): ``free()'' was used instead of
``alt_call_free()''.
Thu Apr 12 23:45:16 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_store_esc_for_char_internal): Escape sequence to
identify UTF-16{,BE,LE} was corrupt.
Thu Mar 15 12:16:22 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* wcrx.c (wcrx_compile_group): Structure representing meta
character ``.'' was destroyed once used, which caused segmentation
fault when the structure used next (report from Kazuhiko
<kazuhiko@ring.gr.jp>).
* uirx.c (uirx_match): NFA remains unchanged when match have
failed.
Wed Mar 14 18:00:02 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* uirx.h: Introduce configurable type ``uirx_wc_t'' (after
suggestion by Hironori Sakamoto <h-saka@lsi.nec.co.jp>).
Sun Mar 11 11:56:46 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Add new converters ``jis0208-to-ucs-extra'' and
``ucs-to-jis0208-extra'' using the mapping table from Ambrose Li
<acli@ada.dhs.org>.
* wcrx.c: Character class is compiled into one leaf in parse tree.
Sat Mar 10 12:13:23 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* uirx.c (uirx_match): Member ``posflag'' of structure
``uirx_nfa_t'' was not initialized correctly.
Sun Mar 4 14:06:52 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* wcrx.c: Remove codes using binary tri.
(wcrx_compile_group): Epsilon transition at end of group was not
properly processed when closing parenthesis was explicity
specified.
Wed Feb 28 01:27:11 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbenclib.pl: {En,de}coder table for internal multiple octet
encoding was wrong.
* btri.c (btri_uint_optimize): Calculation of max. key value of
subtree was wrong.
Tue Feb 27 02:59:24 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* wcrx.c (wcrx_compile_class_macro_negate_class_body,
wcrx_compile_class_negate_macro_negate_class_body): Both
``bof_char'' and ``eof_char'' are always excluded.
* Add support for case insensitive regular expression search.
* mbfind.c (mb_mem_to_wchar): Position of first octet was not
detected correctly.
Mon Feb 26 23:21:45 2001 Fumitoshi UKAI <ukai@debian.or.jp>
* mb2iso.c (mb_vprintf): Fix paramter type mismatch (``NULL'' vs.
``va_list'')
Mon Feb 26 01:06:29 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* btri.c (main): Add new source type 'u' to generate
``butri_uint_tab_t'' table.
* New structure ``btri_uint_opt_tab_t'' to generate packed binary
tri with key of unsigned int type. Members ``count'' and ``max''
of ``btri_uint_tab_t'' are removed.
* Bit width of wide character is expanded to 32 again.
* Structure of structures related to regular expression are
changed in accordance with time when their values are determined --
at matching, at parsing, or never changed.
Sun Feb 25 01:31:33 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* btri.c (str2uintkey): Accept string of C's character literal
form as key.
* mbconv.c (chomp): Recognize CRLF as end of line as well as LF
and remove them from each line.
Sat Feb 24 22:13:06 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Support for regular expression is re-implemented as generic API.
Thu Feb 22 21:37:05 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Add support for regular expression over unsigned int.
Wed Feb 14 11:21:23 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso-lib.pl (uni2gbk2k): Some mapping of GBK might not be
registered.
Tue Feb 13 22:11:46 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* libmoe.shtml: Updated after implementation.
* mbenclib.pl: Character map ``CL'' stop to recognize codes such
as NUL which do not apprear in usuall text stream.
* Add support for UTF-16.
Sun Feb 11 02:09:08 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: ``cn-gb'' in candidates of CES auto detection is
changed to ``x-gb-18030-2000''.
* mbcesdefs.pl: Add new CES ``GBK'' and ``GBK2K''.
Sat Feb 10 02:41:12 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Some mapping tables are re-implemented using binary tri again.
* Dense sub trees of binary tri are converted to array.
* ucs-to-iso-lib.pl (uni2jis): Did not register entry for
``HALFWIDTH KATAKANA MIDDLE DOT'' because it had been assumed that
all lines are composed of single tab seprated columns.
Wed Feb 7 04:25:56 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Stop special treatment of mapping of JIS double octet
characters.
* Some mapping tables are re-implemented using array.
Tue Feb 6 22:18:50 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Fix mapping table of BIG5 into Unicode based on information from
Ambrose Li <acli@ada.dhs.org>.
Sun Feb 4 02:57:45 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Bit width of wide character is restricted to 24.
* Internal multiple octet encoding has been changed to that of
fixed length with name ``x-moe-internal''.
Sat Feb 3 02:23:56 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* btri.c (btri_pack_uint_tab): New API to generate packed binary
tri with key of unsigned int type.
Sat Jan 27 02:03:26 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb.h (MB_NCPROPS): Introduce new macro which shows number of
properties.
Fri Jan 26 01:21:11 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_force_flush_buffer): Fix posibility to lose part of
data in buffer.
Thu Jan 25 02:49:24 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb.h (mb_mem_to_wchar_internal): Resembled to ``mbtowc()'' in
XPG5.
Tue Jan 23 00:00:51 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Remove codes to check failure of binary search in case the
search never fail.
* Makefile: Introduce extra part of version string to add
auxiliary infomation such as prerelease version.
* mb.h (mb_store_esc_for_char): Re-implemented as macro.
(mb_fetch_wchar): Defined as inline function if GNU C is used.
* btri.c (btri_map_max_smaller and btri_map_min_larger): More
strict check whether retrieval should be continued.
* Prefix ``ISO2MB_'' is change to ``MOE_'' or ``MB_''.
Mon Jan 22 22:30:23 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* btri.h: Prototypes of ``btri_add_uint_n_to_1()'' and
``btri_add_uint_n_to_n()'' were missing.
* Simplify decoder from internal encoding.
Sun Jan 21 13:36:34 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* libmoe.shtml: Add description about developement version.
* iso2mb.c (mb_cs_try_detect): Destroy contents in input buffer of
original ``mb_info_t'' structure (report from Kazuhiko
<kazuhiko@ring.gr.jp>).
* mbenclib.pl (make_cm_matrix): Encoder tables were not correctly
made for NON-ISO 2022 encodings (report from Kazuhiko
<kazuhiko@ring.gr.jp>).
Sat Jan 20 02:20:10 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_cs_try_detect): Wide character weight table is
re-implemented using binary try.
* mbwcweight.pl: Range of katakana codepoints in EUC encoding was
wrong.
* EUC_JISX0213_PACKED: New CES.
* Binary tri is re-implemented as ususal PATRICIA tree.
Thu Jan 18 21:13:16 2001 Kazuhiko <kazuhiko@ring.gr.jp>
* Makefile: Semicolons are added to tail of last statements in
brace blocks.
* btri.c: Includes ``errno.h''.
Tue Jan 16 00:37:10 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_conv_from_jisx0213_internal and
mb_conv_to_jisx0213): New converters.
Sun Jan 14 00:44:04 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* btri.c (btri_copy): New implimentation.
* btri.h: API names are changed:
btri_uc_cmp() --> btri_uchar_cmp(),
btri_uc_ci_cmp() --> btri_uchar_ci_cmp().
* btri.c: New API's: btri_uc_cmp(), btri_uc_ci_cmp(), and
btri_uint_cmp().
* mbfind.c: API names are changed:
mb_wcv_width() --> mb_wmem_width(),
mb_wcstr_width() --> mb_wstr_width().
New API's are added:
mb_str_width_n(), mb_wstr_width_n().
Sat Jan 13 03:46:39 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_store_char): Return value was length of decoded
multiple octet form, which should be length of ``fake'' UTF-8
encoding form.
* iso2mb.c (mb_escfun_cs_96): Processing of escape sequence to
designate 96 set was broken.
Fri Jan 12 13:15:39 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Stop implicit invocation of user defined encoding conversion.
* mbcesdefs.pl: Range of acceptable codepoints for ``x-ctext'' was
wrong.
* mb.h (mb_str_to_wchar_internal): 1st octet of 2 octet utf-8
encoded character was not processed correctly.
* ucs-to-iso.pl: Initialization data of default converter table
was wrong.
Properties of some punctuation characters in JISC6226 or in
JISX0208 are not recorded.
Thu Jan 11 03:31:38 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_cs_try_detect_from_mem): New API to detect CES from
byte sequence on main memory.
* mbenclib.pl (make_named_encoder): Codepoint of 2^7 set character
was calcuated as 2^15 set character.
* mb.h (MB_SBC_LOWER): Was assigned with ``MB_96_LOWER'' which
should be ``MB_96_UPPER''.
Wed Jan 10 19:30:07 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb.h (MB_UTF8_TO_WCHAR): Accept single octet.
(mb_wchar_to_badutf8): Comparison is performed in ``binary search
order''.
(mb_str_to_wchar_internal): Wide character is processed by
MB_UTF8_TO_WCHAR
* btri.c (btri_copy): Was completely broken.
* iso2mb.c (mb_vmem2mb_setup): Order of intialization of members
was wrong.
* mbfind.c (mb_str_to_wchar): ``*p_s'' was not updated correctly.
Wed Jan 10 00:08:04 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Change all the prefix ``mb_bt'' of binary tri related objects to
``bt''.
* mb.h: Prototype of ``mb_flush_auxbuf()'' is added.
Prototype of ``mb_cs_try_detect()'' is added.
Tue Jan 9 21:55:39 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbbtri.h: Prototype of ``mb_btri_copy()'' is added.
* mb2iso.c (mb_flush_auxbuf): Separate codes to flush auxbuf from
``mb_flush''.
Mon Jan 8 13:26:49 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile (INSTLIBEXEC): ``mk_btri'' is installed into
``${DESTDIR}${LIBEXECDIR}''.
* mb.h (MB_SBC_ESC_ENC and MB_DBC_ESC_ENC): Change arithmetic
minus operation to logical and.
Sun Jan 7 03:46:28 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* The function ``mb_char_dec()'' is renamed to
``mb_mem_to_wchar_internal()'' and implemented as macro.
* Some functions are implemented as macros
(``mb_wchar_to_{badutf8,utf8,mbc}()'').
* New API (``mb_str_to_wchar()'', ``mb_str_to_prop()'').
* Some APIs are renamed (``mb_find_wchar()'' -->
``mb_mem_to_wchar()'', ``mb_mbc_prop()'' -->
``mb_mem_to_prop()'').
Sat Jan 6 01:15:31 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_escfun_cs_utf8): Was broken.
* mbfind.c (mb_wchar_prop): Wrong property might be returned for
Unicode.
* mb2iso.c (mb_conv_f2h and mb_conv_h2f): New converters between
halfwidth and fullwidth characters.
(mb_store_wchar): Converters of (mb_info_t *) is called before
those of (mb_ces_t *).
* mbbtri.c (mb_bt_search): Went down to right half of each node
even if it had no valid value.
* mb.h (MB_WORD_UCS_DEC): Decoder of UCS was broken.
* mb2iso.c (mb_conv_to_ucs): Return value was garabage when
conversion failed.
* iso2mb.c (mb_encode_to_wchar): Conversion is now applied to
characters from input stream.
* Treatment of ASCII was different from what I had intended.
* Change treatment of JIS multiple octet characters.
* mbbtri.c (mb_btri_add_uint_n_to_n and mb_btri_add_uint_n_to_1):
Calculation of key range was wrong.
Fri Jan 5 01:41:01 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbbtri.c (output_bt): counting of necessary words for leaf node
was wrong.
(mb_btri_uint_optimize): criterion whether 2 subnodes may be
joined into one, was wrong.
* mbconv.c (opt_help): Shows library version.
* Remove an obsolete header file.
* Improve setup of LD_LIBRARY_PATH in test script.
* Fix some mistates in documentation.
* Start of new project ``libmoe''.
Thu Dec 7 01:25:02 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* cprop.pl: ``MB_CPROP_MAY_BREAK'' flag was not set for HIRAGANA,
KATAKANA, KANA, FULLWIDTH, or IDEOGRAPHIC punctuation character.
Tue Nov 28 01:51:43 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso-lib.pl (ucs2isov): Add sequence to designate right
half of iso-8859-1 to G1 after other character set designated.
Mon Nov 27 21:06:36 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* preconv.h and notascii.h: Re-generated.
* ucs-to-iso-lib.pl: Fix some typo.
* ucs-to-iso-lib.pl (ucs2isov): Some fixes to follow after changes
of JIS related mapping table.
Sun Nov 26 00:59:03 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile (big_headers): New target to generate big headers.
* mbfind.c (mb_charprop): Wrong values were returned for
characters registered property table.
* mbbtri.pl (bt_optimize): Criterion whether two leaf nodes may be
combined was wrong.
Sat Nov 25 13:21:04 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* jismap.pl: New Perl script to make a mapping table of JIS
codepoints into registeration numbers, separated from
ucs-to-iso.pl.
* notascii.pl: New Perl script to make a mapping table of
domestic-ascii characters which are not mapped to U+0021 --
U+007E, separated from ucs-to-iso.pl.
* mbbtri.pl (bt_make_c_header): Inser file name and date at
beginning of file to notify cvs that the file is really modified.
* mbbtri.c (mb_bt_search): Wrong value was generated from multiple
value node again.
* Makefile: Separate target ``install'' into ``install-bin'',
``install-lib'', and ``install-man''.
Sat Nov 25 13:18:05 2000 Kazuhiko <kazuhiko@ring.gr.jp>
* Makefile (DESTDIR): New variable to override top of install
directories regardless of re-definition of individual ones.
Sat Nov 25 02:08:16 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbbtri.c (mb_bt_search): Wrong value was generated from multiple
value node.
* mbbtri.pl (bt_encode): Computation of mask for key was wrong.
* Compress conversion table between Unicode and others.
Thu Nov 23 02:57:44 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso.pl: Mapping table of codepoint into JIS registration
number was corrupt (report from Fumitoshi UKAI
<ukai@debian.or.jp>).
* Makefile: Include files are now installed into
${PREFIX}/include/iso2mb (suggestion from Fumitoshi UKAI
<ukai@debian.or.jp>).
* Mapping tables between Unicode and other kind characters are
modified to fix JIS X 0208 round trip problem (report from
Fumitoshi UKAI <ukai@debian.or.jp>).
Mon Nov 20 00:10:25 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: Converter sequence bound to charset using ISO 2022
registered character set include ``mb_conv_allg0sl'' as first
entry.
* ucs-to-iso-lib.pl (is94x94): Add special treatment of Hangul.
Sun Nov 19 06:12:05 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso-lib.pl (is94x94): Return true value for Non-ISO 2022
multi octet character set.
* mbfind.c (mb_charprop): More accurate check whether CJK or not.
* mb.h: ``mb_char_t'' and of ``mb_echar_t'' are merged into one
structure ``mb_char_t''.
* mbfind.c (mb_charprop): Property of US-ASCII was wrong.
* mb.h: Members of ``mb_char_t'' and of ``mb_echar_t'' other than
code are defined as ``unsigned char''.
* Makefile (install): Allwasy copy files regardless of modified
date of files.
* cprop.pl: All ideographic character (in Uncode sense) may break
line.
Sat Nov 18 00:15:41 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* New types ``mb_char_t'' and ``mb_echar_t'' for characters and
attributes.
* Add new charset "x-fake-utf-8".
* mbinit.c: Position of entry named ``euc-jisx0213'' of charset
table was wrong.
* Add new language ``cjk''.
* kr.c (mb_cs_judge_kr): Range of sencod octet of Johab encoding
was wrong.
* mbconv.c: New option ``--which'' to display charset name of each
input stream.
Fri Nov 17 01:19:16 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* kr.c (mb_cs_judge_kr): Weights of characters was wrong.
* mb.h: Stop to swap final octets 0x40 and 0x42 of 94x94 set in
``fake'' UTF-8.
* mbinit.c: Fix order of entries in converter table.
Stop to hard-code number of candidates of charset in autodetector
table.
* kr.c, kr.h, ucs-to-iso-lib.pl, and mb128.h: Add support for
Unified Hangul.
* kr.c (mb_cs_judge_kr): Improve autodetection of encoding.
Thu Nov 16 00:12:35 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbconv.pod: Add description about new converters.
Add description about language specifications.
* cn.c, ja.c, and kr.c: Charset remains ``UNKNOWN'' if no
character with MSB set presents.
* mbinit.c: Number of encoding candidates was wrong for
``chinese''.
* cn.c (mb_conv_ucs_to_gb): ``mb_conv_ucs_to_cn'' is divided into
``mb_conv_ucs_to_cn'' and ``mb_conv_ucs_to_gb''.
(mb_conv_euc_tw): Final octet was not set correctly.
* mbinit.c: Add new converter ``euc-tw''.
* cn.c: Rename ``mb_conv_euctw'' to ``mb_conv_euc_tw''.
* Makefile (TAB_Hs): Add ``preconv.h''.
* preconv.h: Generate separately from other table headers.
Wed Nov 15 01:33:09 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* cn.c: Add support for ``EUC-TW''.
(mb_cs_setup_cn): ``utf-8'' was returned for ``x-euc-tw''.
(mb_conv_euctw): Characters in CNS 11643 plane 1, are converted to
ISO 2022 encoding.
* Makefile (kr.o): Add dependency upon ``hangul.h''.
* cn.c (mb_cs_judge_cn): Indexing variable was not updated
correctly.
* kr.c (mb_cs_judge_kr): Indexing variable was not updated
correctly.
(mb_johab_dec): Generate wrong code for johab code 0x8441 -
0xD3A0.
Sun Nov 12 12:57:22 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Improve autodetection of character encoding for cn and kr.
* Improve autodetection of character encoding.
Fri Nov 10 02:15:07 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbfind.c (mb_is_cjk): New API to see if Unicode character is CJK
character.
* cprop.pl: Modified to generate binary search table for CJK.
Thu Nov 9 13:00:31 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile: Fix dependence on some headers.
* Property of punctuation characters are revised.
Wed Nov 8 21:37:16 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Restructuring of Unicode <-> ISO 2022 mapping mainly to fix
problem that there is no mapping information between Unicode and
JIS C 6226-1978 (report from Kazuhiko <kazuhiko@ring.gr.jp>).
Sat Nov 4 21:12:37 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_fetch_char): There was possiblity to fail to
recognize Unicode after conversion applied.
Fri Nov 3 00:37:26 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: Converters bound to charsets normalize characters
according to Unicode unification.
* ucs-to-iso-lib.pl (set_map): Generate mapping of Shift_JIS and
Shift_JISX0213 into Unicode.
Thu Nov 2 00:25:31 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbfind.c (mb_mbc_proc): New API to look up character property
from string.
* cprop.pl: Add new property for space character.
* mbfind.c (mb_charprop): New API to look up character property.
Wed Nov 1 03:23:08 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Remove codes assuming ``char'' is unsinged.
Tue Oct 31 20:11:37 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb128misc.c (mb_conv_ms_latin1): New converter to convert
characters of code point 0x80-0x9F into Microsoft Windows Codepage
1252.
Wed Oct 25 06:03:47 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile (PREFIX): Everthing is installed into subdirectories of
the directory specified by this macro.
* iso2mb.c (mb_fetch_char): Fix mis-encoding of single octet
character set when designated G2 or G3 and invoked to GR by
single-shift.
Mon Oct 23 06:57:21 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c (mb_charset_to_esc): Member ``cs2esc'' of ``*info'' is
initialized with ``NULL''.
* iso2mb.c (mb_conv_charset): New converter to apply converters in
structure pointed by member ``cs2esc'' in ``mb_info_t''.
* iso2mb.c (mb_conv_ascii): Table search is performed only for 94
set.
* Makefile (${Os}): Add missed depedence on ``domestic-ascii.h''.
* domestic-ascii.pl: Broken domestic ascii table was made by
variable scope problem.
===== libiso2mb-0.7.1 <-- libiso2mb-0.7.0 =====
Sat Oct 21 12:57:09 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c (mb_namev_to_flag): Table of conversion names was
specified as binary search table of flag names.
* mbconv.c (main): Output stream ``ostream'' was always overridden
by stdout.
Fri Oct 20 13:47:43 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: Fix mistake in conditional compilation of
``to_mb128misc[]''. Add missed converter for charset ``cn-big5''.
* iso2mb.c (mb_conv_ascii): New converter from domestic ascii to
us-ascii.
Thu Oct 19 13:40:02 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: Automatic setup of converters for output stream based
on charset.
Wed Oct 18 03:40:44 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbconv.pod: Upated after C source.
* ucs.c (mb_ucs_to_isoc): Stop to generate non ISO 2022 registered
character.
Tue Oct 17 15:36:53 2000 Kazuhiko <kazuhiko@ring.gr.jp>
* mbconv.c and Makefile: Fix of compilation and install problem.
Tue Oct 17 02:44:43 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbio.c: Remove codes assuming ``fpos_t'' is arithmetic.
Fri Oct 13 17:08:12 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Improve JIS X 0213 support.
Thu Oct 12 13:37:46 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbconv.pod: Manual page of sample application is provided.
* mbconv.c: Add new option to display help message.
* mbinit.c: Fix conv_tab, flag_tab, and default_cs2esc.
===== libiso2mb-0.7.0 <-- libiso2mb-0.6.3 =====
Wed Oct 11 23:24:06 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* kr.c: Add support for Johab encoding. Add automatic detection
of charset among EUC-kr, Johab, and UTF-8.
* ja.c (mb_cs_judge_ja): Add support for automatic UTF-8
detection.
* cn.c: Add automatic detection of charset among CN-GB, Big Five,
and UTF-8.
Tue Oct 10 11:30:52 2000 Kiyokazu SUTO <suto@merry.xmath.ous.ac.jp>
* Separate Big Five to CNS 11643 conversion code from
mb_conv_allg0sl().
===== libiso2mb-0.6.3 <-- libiso2mb-0.6.2 =====
Mon Oct 9 00:12:14 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb128misc.c (mb_misc_enc): Fix encoding buf.
* Add support for Microsoft Windows Codepage 1250 -- 1258.
===== libiso2mb-0.6.2 <-- libiso2mb-0.6.1 =====
Sun Oct 8 21:39:50 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_set_localized_ascii_table): New API to disable (or
change behavior of) localized ASCII normalization.
* ucs-to-iso-lib.pl: Add entry for "FULLWIDTH REVERSE SOLIDUS" to
mapping table between Unicode and ISO 2022 registered characters.
* iso2mb.c (mb_nonutf8_enc): Fix bug when encoding 94 set
character into "fake" UTF-8.
===== libiso2mb-0.6.1 <-- libiso2mb-0.6.0 =====
Sun Oct 8 00:50:22 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ja.c (mb_sjis_dec): Fix bug of Shift_JIS decoder.
* ttywidth.pl: Get data from EastAsianWidth.txt.
===== libiso2mb-0.6.0 <-- libiso2mb-0.5.0 =====
Fri Oct 6 03:33:40 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_utf8_escenc): Add new flag to avoid conflict
between 7bit shingle shift and [kx]term's escape sequence.
Thu Oct 5 16:42:05 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Interpret ISO 2022 escape sequence even if UTF-8 mode.
* mbfind.c: Add new API to setup table of character width.
Wed Oct 4 21:10:49 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Modular implementation of encodings which are neither UTF-8 nor
ISO 2022.
===== libiso2mb-0.5.0 <-- libiso2mb-0.4.19 =====
Mon Oct 2 23:36:07 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: Add new APIs to setup charset table dynamically.
Mon Oct 2 13:55:46 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: Add charset ``x-euc-jp''.
|