1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012
|
/* chartab.h -- character table module.
Copyright (C) 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011, 2012
National Institute of Advanced Industrial Science and Technology (AIST)
Registration Number H15PRO112
This file is part of the m17n library.
The m17n library is free software; you can redistribute it and/or
modify it under the terms of the GNU Lesser General Public License
as published by the Free Software Foundation; either version 2.1 of
the License, or (at your option) any later version.
The m17n library is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public
License along with the m17n library; if not, write to the Free
Software Foundation, Inc., 51 Franklin Street, Fifth Floor,
Boston, MA 02110-1301 USA. */
/***en
@addtogroup m17nChartable
@brief Chartable objects and API for them.
The m17n library supports enormous number of characters. Thus, if
attributes of each character are to be stored in a simple array,
such an array would be impractically big. The attributes usually
used, however, are often assigned only to a range of characters.
Even when all characters have attributes, characters of
consecutive character code tend to have the same attribute values.
The m17n library utilizes this tendency to store characters and
their attribute values efficiently in an object called @e
Chartable. Although a chartable object is not a simple array,
application programs can handle a chartable as if it is an array.
Attribute values of a character can be obtained by accessing a
Chartable for the attribute with the character code of the
specified character.
A chartable is a managed object. */
/***ja
@addtogroup m17nChartable 文字テーブル
@brief 文字テーブルとそれに関する API.
m17n ライブラリが扱う文字の空間は広大であるため、文字毎の情報を単純な配列に格納しようとすると、その配列は巨大になりすぎ、非実用的である。
しかし通常必要となる文字についての情報は、ある特定の範囲の文字にのみ付いていることが多い。
全文字に関して情報がある場合にも、連続した文字コードを持つ文字は同じ情報を持つことが多い。
このような傾向を利用して文字とその付加情報を効率的に格納するために、
m17n ライブラリは @e 文字テーブル (chartable) と呼ぶオブジェクトを用いる。
文字テーブルは配列ではないが、アプリケーションプログラムは文字テーブルを配列の一種として扱うことができる。
ある文字についての特定の情報は、その情報を持つ文字テーブルをその文字のコードで引くこと
で得られる。
文字テーブルは管理下オブジェクトである。 */
/*=*/
#if !defined (FOR_DOXYGEN) || defined (DOXYGEN_INTERNAL_MODULE)
/*** @addtogroup m17nInternal
@{ */
#include <config.h>
#include <stdlib.h>
#include <string.h>
#include <limits.h>
#include <ctype.h>
#include <stdio.h>
#include "m17n.h"
#include "m17n-misc.h"
#include "internal.h"
#include "symbol.h"
static M17NObjectArray chartable_table;
/*** Maximum depth of char-table. */
#define CHAR_TAB_MAX_DEPTH 3
/** @name Define: Number of characters covered by char-table of each level.
@{ */
/** BITs for number of characters covered by char-table of each
level. */
#if MCHAR_MAX < 0x400000
#define SUB_BITS_0 22 /* i.e. 0x400000 chars */
#define SUB_BITS_1 16 /* i.e. 0x10000 chars */
#define SUB_BITS_2 12 /* i.e. 0x1000 chars */
#define SUB_BITS_3 7 /* i.e. 0x80 chars */
#else /* MCHAR_MAX >= 0x400000 */
#define SUB_BITS_0 31
#define SUB_BITS_1 24
#define SUB_BITS_2 16
#define SUB_BITS_3 8
#endif
/** @} */
/** How many characters a char-table covers at each level. */
static const int chartab_chars[] =
{ (1 << SUB_BITS_0),
(1 << SUB_BITS_1),
(1 << SUB_BITS_2),
(1 << SUB_BITS_3) };
/** How many slots a char-table has at each level. */
static const int chartab_slots[] =
{ (1 << (SUB_BITS_0 - SUB_BITS_1)),
(1 << (SUB_BITS_1 - SUB_BITS_2)),
(1 << (SUB_BITS_2 - SUB_BITS_3)),
(1 << SUB_BITS_3) };
/** Mask bits to obtain the valid bits from a character code for looking
up a char-table of each level. */
static const int chartab_mask[] =
{ (int) ((((unsigned) 1) << SUB_BITS_0) - 1),
(1 << SUB_BITS_1) - 1,
(1 << SUB_BITS_2) - 1,
(1 << SUB_BITS_3) - 1 };
/** Bit-shifting counts to obtain a valid index from a character code
for looking up a char-table of each level. */
static const int chartab_shift[] =
{ SUB_BITS_1, SUB_BITS_2, SUB_BITS_3, 0 };
/** Index for looking up character C in a char-table at DEPTH. */
#define SUB_IDX(depth, c) \
(((c) & chartab_mask[depth]) >> chartab_shift[depth])
/** Structure of sub char-table. */
typedef struct MSubCharTable MSubCharTable;
struct MSubCharTable
{
#if SUB_BITS_0 > 24
/* The depth of the table; 0, 1, 2, or 3. */
int depth;
/* The minimum character covered by the table. */
int min_char;
#else /* SUB_BITS_0 <= 24 */
/* The value is ((<depth> << 24) | <min_char>). */
int depth_min_char;
#endif /* SUB_BITS_0 <= 24 */
/** The default value of characters covered by the table. */
void *default_value;
/** For a table of bottom level, array of values. For a non-bottom
table, array of sub char-tables. It may be NULL if all
characters covered by the table has <default_value>. */
union {
void **values;
MSubCharTable *tables;
} contents;
};
#if SUB_BITS_0 > 24
#define TABLE_DEPTH(table) ((table)->depth)
#define TABLE_MIN_CHAR(table) ((table)->min_char)
#define SET_DEPTH_MIN_CHAR(table, DEPTH, MIN_CHAR) \
((table)->depth = (DEPTH), (table)->min_char = (MIN_CHAR))
#else /* SUB_BITS_0 <= 24 */
#define TABLE_DEPTH(table) ((table)->depth_min_char >> 24)
#define TABLE_MIN_CHAR(table) ((table)->depth_min_char & 0xFFFFFF)
#define SET_DEPTH_MIN_CHAR(table, DEPTH, MIN_CHAR) \
((table)->depth_min_char = ((DEPTH) << 24) | (MIN_CHAR))
#endif /* SUB_BITS_0 <= 24 */
/** Structure of char-table. */
struct MCharTable
{
/** Common header for a managed object. */
M17NObject control;
/** Key of the table. */
MSymbol key;
/** The minimum and maximum characters covered by the table. */
int min_char, max_char;
MSubCharTable subtable;
};
/* Local functions. */
/** Allocate and initialize an array of sub-tables for sub char-table
TABLE. It is assumed that TABLE_DEPTH (TABLE) <
CHAR_TAB_MAX_DEPTH.*/
static void
make_sub_tables (MSubCharTable *table, int managedp)
{
int depth = TABLE_DEPTH (table);
int min_char = TABLE_MIN_CHAR (table);
int slots = chartab_slots[depth];
int chars = chartab_chars[depth + 1];
MSubCharTable *tables;
int i;
MTABLE_MALLOC (tables, slots, MERROR_CHARTABLE);
for (i = 0; i < slots; i++, min_char += chars)
{
SET_DEPTH_MIN_CHAR (tables + i, depth + 1, min_char);
tables[i].default_value = table->default_value;
tables[i].contents.tables = NULL;
}
if (managedp && table->default_value)
M17N_OBJECT_REF_NTIMES (tables->default_value, slots);
table->contents.tables = tables;
}
/** Allocate and initialize an array of values for sub char-table
TABLE. It is assumed that TABLE_DEPTH (TABLE) ==
CHAR_TAB_MAX_DEPTH. */
static void
make_sub_values (MSubCharTable *table, int managedp)
{
int slots = chartab_slots[CHAR_TAB_MAX_DEPTH];
void **values;
int i;
MTABLE_MALLOC (values, slots, MERROR_CHARTABLE);
for (i = 0; i < slots; i++)
values[i] = table->default_value;
if (managedp && table->default_value)
M17N_OBJECT_REF_NTIMES (table->default_value, slots);
table->contents.values = values;
}
/** Free contents of sub char-table TABLE and the default value of
TABLE. Free also the sub-tables recursively. */
static void
free_sub_tables (MSubCharTable *table, int managedp)
{
int depth = TABLE_DEPTH (table);
int slots = chartab_slots[depth];
if (table->contents.tables)
{
if (depth < CHAR_TAB_MAX_DEPTH)
{
while (slots--)
free_sub_tables (table->contents.tables + slots, managedp);
free (table->contents.tables);
}
else
{
if (managedp)
while (slots--)
{
if (table->contents.values[slots])
M17N_OBJECT_UNREF (table->contents.values[slots]);
}
free (table->contents.values);
}
table->contents.tables = NULL;
}
if (managedp && table->default_value)
M17N_OBJECT_UNREF (table->default_value);
}
/** In sub char-table TABLE, set value VAL for characters of the range
FROM and TO. */
static void
set_chartable_range (MSubCharTable *table, int from, int to, void *val,
int managedp)
{
int depth = TABLE_DEPTH (table);
int min_char = TABLE_MIN_CHAR (table);
int max_char = min_char + (chartab_chars[depth] - 1);
int i;
if (max_char < 0 || max_char > MCHAR_MAX)
max_char = MCHAR_MAX;
if (from < min_char)
from = min_char;
if (to > max_char)
to = max_char;
if (from == min_char && to == max_char)
{
free_sub_tables (table, managedp);
if (managedp && val)
M17N_OBJECT_REF (val);
table->default_value = val;
return;
}
if (depth < CHAR_TAB_MAX_DEPTH)
{
if (! table->contents.tables)
make_sub_tables (table, managedp);
i = SUB_IDX (depth, from);
table = table->contents.tables + i;
while (i < chartab_slots[depth] && TABLE_MIN_CHAR (table) <= to)
{
set_chartable_range (table, from, to, val, managedp);
table++, i++;
}
}
else
{
int idx_from = SUB_IDX (depth, from);
int idx_to = SUB_IDX (depth, to);
if (! table->contents.values)
make_sub_values (table, managedp);
for (i = idx_from; i <= idx_to; i++)
{
if (managedp && table->contents.values[i])
M17N_OBJECT_UNREF (table->contents.values[i]);
table->contents.values[i] = val;
}
if (managedp && val)
M17N_OBJECT_REF_NTIMES (val, (idx_to - idx_from + 1));
}
}
/** Lookup the sub char-table TABLE for the character C. If NEXT_C is
not NULL, set *NEXT_C to the next interesting character to lookup
for. If DEFAULT_P is zero, the next interesting character is what
possibly has the different value than C. Otherwise, the next
interesting character is what possibly has the default value (if C
has a value deferent from the default value) or has a value
different from the default value (if C has the default value). */
static void *
lookup_chartable (MSubCharTable *table, int c, int *next_c, int default_p)
{
int depth = TABLE_DEPTH (table);
void *val;
void *default_value = table->default_value;
int idx;
while (1)
{
if (! table->contents.tables)
{
if (next_c)
*next_c = TABLE_MIN_CHAR (table) + chartab_chars[depth];
return table->default_value;
}
if (depth == CHAR_TAB_MAX_DEPTH)
break;
table = table->contents.tables + SUB_IDX (depth, c);
depth++;
}
idx = SUB_IDX (depth, c);
val = table->contents.values[idx];
if (next_c)
{
int max_char = TABLE_MIN_CHAR (table) + (chartab_chars[depth] - 1);
if (max_char < 0 || max_char > MCHAR_MAX)
max_char = MCHAR_MAX;
if (default_p && val != default_value)
{
do { c++, idx++; }
while (c >= 0 && c <= max_char
&& table->contents.values[idx] != default_value);
}
else
{
do { c++, idx++; }
while (c >= 0 && c <= max_char
&& table->contents.values[idx] == val);
}
*next_c = c;
}
return val;
}
/** Call FUNC for characters in sub char-table TABLE. Ignore such
characters that has a value IGNORE. FUNC is called with four
arguments; FROM, TO, VAL, and ARG (same as FUNC_ARG). If
DEFAULT_P is zero, FROM and TO are range of characters that has
the same value VAL. Otherwise, FROM and TO are range of
characters that has the different value than the default value of
TABLE. */
static void
map_chartable (MSubCharTable *table, void *ignore, int default_p,
void (*func) (int, int, void *, void *),
void *func_arg)
{
void *current;
int from = 0;
int c, next_c;
current = lookup_chartable (table, 0, &next_c, default_p);
c = next_c;
while (c >= 0 && c <= MCHAR_MAX)
{
void *next = lookup_chartable (table, c, &next_c, default_p);
if (current != next)
{
if (current != ignore)
(*func) (from, c - 1, current, func_arg);
current = next;
from = c;
}
c = next_c;
}
if (from <= MCHAR_MAX && current != ignore)
(*func) (from, MCHAR_MAX, current, func_arg);
}
/* Return the smallest character whose value is not DEFAULT_VALUE in
TABLE. If all characters in TABLE have DEFAULT_VALUE, return
-1. */
static int
chartab_min_non_default_char (MSubCharTable *table, void *default_value)
{
int depth = TABLE_DEPTH (table);
int slots;
int i, c;
if (!table->contents.tables)
return (default_value == table->default_value
? -1 : TABLE_MIN_CHAR (table));
slots = chartab_slots[depth];
if (depth == CHAR_TAB_MAX_DEPTH)
{
for (i = 0; i < slots; i++)
if (table->contents.values[i] != default_value)
return (TABLE_MIN_CHAR (table) + i);
}
else
{
for (i = 0; i < slots; i++)
if ((c = chartab_min_non_default_char (table->contents.tables + i,
default_value))
>= 0)
return c;
}
return -1;
}
/* Return the largest character whose value is not DEFAULT_VALUE in
TABLE. If all characters in TABLE have DEFAULT_VALUE, return
-1. */
static int
chartab_max_non_default_char (MSubCharTable *table, void *default_value)
{
int depth = TABLE_DEPTH (table);
int slots;
int i, c;
if (!table->contents.tables)
return (default_value == table->default_value
? -1 : TABLE_MIN_CHAR (table) + chartab_chars[depth] - 1);
slots = chartab_slots[depth];
if (depth == CHAR_TAB_MAX_DEPTH)
{
for (i = slots - 1; i >= 0; i--)
if (table->contents.values[i] != default_value)
return (TABLE_MIN_CHAR (table) + i);
}
else
{
for (i = slots - 1; i >= 0; i--)
if ((c = chartab_max_non_default_char (table->contents.tables + i,
default_value))
>= 0)
return c;
}
return -1;
}
static void
free_chartable (void *object)
{
MCharTable *table = (MCharTable *) object;
int managedp = table->key != Mnil && table->key->managing_key;
if (table->subtable.contents.tables)
{
int i;
for (i = 0; i < chartab_slots[0]; i++)
free_sub_tables (table->subtable.contents.tables + i, managedp);
free (table->subtable.contents.tables);
if (managedp && table->subtable.default_value)
M17N_OBJECT_UNREF (table->subtable.default_value);
}
M17N_OBJECT_UNREGISTER (chartable_table, table);
free (object);
}
#include <stdio.h>
/* Support function of mdebug_dump_chartab. */
static void
dump_sub_chartab (MSubCharTable *table, void *default_value,
MSymbol key, int indent)
{
int depth = TABLE_DEPTH (table);
int min_char = TABLE_MIN_CHAR (table);
int max_char = min_char + (chartab_chars[depth] - 1);
char *prefix = (char *) alloca (indent + 1);
int i;
if (max_char < 0 || max_char > MCHAR_MAX)
max_char = MCHAR_MAX;
memset (prefix, 32, indent);
prefix[indent] = 0;
if (! table->contents.tables && table->default_value == default_value)
return;
fprintf (mdebug__output, "\n%s(sub%d (U+%04X U+%04X) ",
prefix, depth, min_char, max_char);
if (key == Msymbol)
{
if (table->default_value)
fprintf (mdebug__output, "(default %s)",
((MSymbol) table->default_value)->name);
else
fprintf (mdebug__output, "(default nil)");
}
else
fprintf (mdebug__output, "(default #x%p)", table->default_value);
default_value = table->default_value;
if (table->contents.tables)
{
if (depth < CHAR_TAB_MAX_DEPTH)
for (i = 0; i < chartab_slots[depth]; i++)
dump_sub_chartab (table->contents.tables + i, default_value,
key, indent + 2);
else
for (i = 0; i < chartab_slots[depth]; i++, min_char++)
{
void **val = table->contents.values + i;
if (val == default_value)
continue;
default_value = *val;
fprintf (mdebug__output, "\n%s (U+%04X", prefix, min_char);
while (i + 1 < chartab_slots[depth]
&& val[1] == default_value)
i++, val++, min_char++;
fprintf (mdebug__output, "-U+%04X ", min_char);
if (key == Msymbol)
{
if (default_value)
fprintf (mdebug__output, "%s)",
((MSymbol) default_value)->name);
else
fprintf (mdebug__output, "nil)");
}
else
fprintf (mdebug__output, " #x%p)", default_value);
}
}
fprintf (mdebug__output, ")");
}
/* Internal API */
int
mchartable__init ()
{
M17N_OBJECT_ADD_ARRAY (chartable_table, "Chartable");
return 0;
}
void
mchartable__fini ()
{
}
void *
mchartable__lookup (MCharTable *table, int c, int *next_c, int default_p)
{
return lookup_chartable (&table->subtable, c, next_c, default_p);
}
/*** @} */
#endif /* !FOR_DOXYGEN || DOXYGEN_INTERNAL_MODULE */
/* External API */
/*** @addtogroup m17nChartable
@{ */
/***en
@brief Symbol whose name is "char-table".
The symbol @c Mchar_table has the name <tt>"char-table"</tt>. */
/***ja
@brief "char-table" という名前を持つシンボル.
シンボル @c Mchar_table は名前 <tt>"char-table"</tt> を持つ。
*/
MSymbol Mchar_table;
/*=*/
/***en
@brief Create a new chartable.
The mchartable () function creates a new chartable object with
symbol $KEY and the default value $DEFAULT_VALUE. If $KEY is a
managing key, the elements of the table (including the default
value) are managed objects or NULL.
@return
If the operation was successful, mchartable () returns a pointer
to the created chartable. Otherwise it returns @c NULL and
assigns an error code to the external variable #merror_code. */
/***ja
@brief 新しい文字テーブルを作る.
関数 mchartable () はキーが $KEY で要素のデフォルト値が
$DEFAULT_VALUE である新しい文字テーブルを作る。もし $KEY
が管理キーであれば、このテーブルの要素は(デフォルト値を含めて)管理下オブジェクトか
NULL のいずれかである。
@return
処理が成功すれば mchartable () は作成された文字テーブルへのポインタを返す。
失敗した場合は @c NULL を返し、外部変数 #merror_code にエラーコードを設定する。 */
MCharTable *
mchartable (MSymbol key, void *default_value)
{
MCharTable *table;
M17N_OBJECT (table, free_chartable, MERROR_CHARTABLE);
M17N_OBJECT_REGISTER (chartable_table, table);
table->key = key;
table->min_char = -1;
table->max_char = -1;
SET_DEPTH_MIN_CHAR (&table->subtable, 0, 0);
table->subtable.default_value = default_value;
if (key != Mnil && key->managing_key && default_value)
M17N_OBJECT_REF (default_value);
table->subtable.contents.tables = NULL;
return table;
}
/*=*/
/***en
@brief Return the minimum character whose value is set in a chartabe.
The mchartable_min_char () function return the minimum character
whose value is set in chartable $TABLE. No character is set its
value, the function returns -1.
*/
int
mchartable_min_char (MCharTable *table)
{
return table->min_char;
}
/*=*/
/***en
@brief Return the maximum character whose value is set in a chartabe.
The mchartable_max_char () function return the maximum character
whose value is set in chartable $TABLE. No character is set its
value, the function returns -1.
*/
int
mchartable_max_char (MCharTable *table)
{
return table->max_char;
}
/*=*/
/***en
@brief Return the assigned value of a character in a chartable.
The mchartable_lookup () function returns the value assigned to
character $C in chartable $TABLE. If no value has been set for $C
explicitly, the default value of $TABLE is returned. If $C is not
a valid character, mchartable_lookup () returns @c NULL and
assigns an error code to the external variable #merror_code. */
/***ja
@brief 文字テーブル中で文字に割り当てられた値を返す.
関数 mchartable_lookup () は文字テーブル $TABLE 中で文字 $C
に割り当てられた値を返す。$C に対する明示的な値がなければ、$TABLE
のデフォルト値を返す。$C が妥当な文字でなければ、mchartable_lookup () は
@c NULL を返し、外部変数 #merror_code にエラーコードを設定する。 */
/***
@errors
@c MERROR_CHAR
@seealso
mchartable_set () */
void *
mchartable_lookup (MCharTable *table, int c)
{
M_CHECK_CHAR (c, NULL);
if (c < table->min_char || c > table->max_char)
return table->subtable.default_value;
return lookup_chartable (&table->subtable, c, NULL, 0);
}
/*=*/
/***en
@brief Assign a value to a character in a chartable.
The mchartable_set () function sets the value of character $C in
chartable $TABLE to $VAL.
@return
If the operation was successful, mchartable_set () returns 0.
Otherwise it returns -1 and assigns an error code to the external
variable #merror_code. */
/***ja
@brief 文字テーブル中での文字の値を設定する.
関数 mchartable_set () は、文字テーブル $TABLE 中の文字 $C
に値 $VAL を割り当てる。
@return
処理が成功すれば、mchartable_set () は 0 を返す。そうでなければ -1
を返し、外部変数 #merror_code にエラーコードを設定する。 */
/***
@errors
@c MERROR_CHAR
@seealso
mchartable_lookup (), mchartable_set_range () */
int
mchartable_set (MCharTable *table, int c, void *val)
{
int managedp = table->key != Mnil && table->key->managing_key;
MSubCharTable *sub = &table->subtable;
int i;
M_CHECK_CHAR (c, -1);
if (table->max_char < 0)
table->min_char = table->max_char = c;
else
{
if (c < table->min_char)
table->min_char = c;
else if (c > table->max_char)
table->max_char = c;
}
for (i = 0; i < CHAR_TAB_MAX_DEPTH; i++)
{
if (! sub->contents.tables)
{
if (sub->default_value == val)
return 0;
make_sub_tables (sub, managedp);
}
sub = sub->contents.tables + SUB_IDX (i, c);
}
if (! sub->contents.values)
{
if (sub->default_value == val)
return 0;
make_sub_values (sub, managedp);
}
sub->contents.values[SUB_IDX (3, c)] = val;
if (managedp && val)
M17N_OBJECT_REF (val);
return 0;
}
/*=*/
/***en
@brief Assign a value to the characters in the specified range.
The mchartable_set_range () function assigns value $VAL to the
characters from $FROM to $TO (both inclusive) in chartable $TABLE.
@return
If the operation was successful, mchartable_set_range () returns
0. Otherwise it returns -1 and assigns an error code to the
external variable #merror_code. If $FROM is greater than $TO,
mchartable_set_range () returns immediately without an error. */
/***ja
@brief 指定範囲の文字に値を設定する.
関数 mchartable_set_range () は、文字テーブル $TABLE 中の $FROM
から $TO まで(両端を含む)の文字に、値として $VAL を設定する。
@return
処理が成功すれば mchartable_set_range () は 0 を返す。そうでなければ
-1 を返し、外部変数 #merror_code にエラーコードを設定する。$FROM が
$TO より大きいときには、 mchartable_set_range ()
は何もせず、エラーも起こさない。 */
/***
@errors
@c MERROR_CHAR
@seealso
mchartable_set () */
int
mchartable_set_range (MCharTable *table, int from, int to, void *val)
{
int managedp = table->key != Mnil && table->key->managing_key;
M_CHECK_CHAR (from, -1);
M_CHECK_CHAR (to, -1);
if (from > to)
return 0;
if (table->max_char < 0)
table->min_char = from, table->max_char = to;
else{
if (from < table->min_char)
table->min_char = from;
if (to > table->max_char)
table->max_char = to;
}
set_chartable_range (&table->subtable, from, to, val, managedp);
return 0;
}
/*=*/
/***en
@brief Search for characters that have non-default value.
The mchartable_range () function searches chartable $TABLE for the
first and the last character codes that do not have the default
value of $TABLE, and set $FROM and $TO to them, respectively. If
all characters have the default value, both $FROM and $TO are set
to -1. */
/***ja
@brief 値がデフォルトと異なる文字を探す.
関数 mchartable_range () は文字テーブル $TABLE 中で、$TABLE
のデフォルト値以外の値を持つ最初と最後の文字を探し、それぞれを $FROM
と $TO に設定する。すべての文字が値としてデフォルト値をとっている場合には
$FROM と $TO を -1に設定する。 */
void
mchartable_range (MCharTable *table, int *from, int *to)
{
*from = chartab_min_non_default_char (&table->subtable,
table->subtable.default_value);
if (*from == -1)
*to = -1;
else
*to = chartab_max_non_default_char (&table->subtable,
table->subtable.default_value);
}
/*=*/
/***en
@brief Call a function for characters in a chartable.
The mchartable_map () function calls function $FUNC for characters
in chartable $TABLE. No function call occurs for characters that
have value $IGNORE in $TABLE. Comparison of $IGNORE and character
value is done with the operator @c ==. Be careful when you use
string literals or pointers.
Instead of calling $FUNC for each character, mchartable_map ()
tries to optimize the number of function calls, i.e. it makes a
single function call for a chunk of characters when those
consecutive characters have the same value.
No matter how long the character chunk is, $FUNC is called with
four arguments; $FROM, $TO, $VAL, and $ARG. $FROM and $TO (both
inclusive) defines the range of characters that have value $VAL.
$ARG is the same as $FUNC_ARG.
@return
This function always returns 0. */
/***ja
@brief 文字テーブル中の文字に対して指定の関数を呼ぶ.
関数 mchartable_map () は、文字テーブル $TABLE 中の文字に対して関数
$FUNC を呼ぶ。ただし$TABLE 中でも値が $IGNORE
である文字については関数呼び出しを行なわない。$IGNORE と文字の値の比較は
@c == で行なうので、文字列リテラルやポインタを使う際には注意を要する。
mchartable_map () は、一文字ごとに $FUNC
を呼ぶのではなく、関数呼び出しの回数を最適化しようとする。
すなわち、連続した文字が同じ値を持っていた場合には、その文字のまとまり全体について一度の関数呼び出
ししか行なわない。
文字のまとまりの大きさにかかわらず、$FUNC は $FROM, $TO, $VAL, $ARG
の4引数で呼ばれる。$FROM と $TO (両端を含む)は $VAL
を値として持つ文字の範囲を示し、$ARG は $FUNC_ARG そのものである。
@return
この関数は常に0を返す。 */
int
mchartable_map (MCharTable *table, void *ignore,
void (*func) (int, int, void *, void *),
void *func_arg)
{
map_chartable (&table->subtable, ignore, 0, func, func_arg);
return 0;
}
/*=*/
/*** @} */
/*** @addtogroup m17nDebug */
/*=*/
/*** @{ */
/***en
@brief Dump a chartable.
The mdebug_dump_chartab () function prints a chartable $TABLE in a
human readable way to the stderr or to what specified by the
environment variable MDEBUG_OUTPUT_FILE. $INDENT specifies how
many columns to indent the lines but the first one.
@return
This function returns $TABLE. */
/***ja
@brief 文字テーブルをダンプする.
関数 mdebug_dump_chartab () は文字テーブル $TABLE を標準エラー出力
もしくは環境変数 MDEBUG_DUMP_FONT で指定されたファイルに人間に可読
な形で印刷する。$INDENT は2行目以降のインデントを指定する。
@return
この関数は $TABLE を返す。 */
MCharTable *
mdebug_dump_chartab (MCharTable *table, int indent)
{
fprintf (mdebug__output, "(chartab (U+%04X U+%04X)",
table->min_char, table->max_char);
dump_sub_chartab (&table->subtable, table->subtable.default_value,
table->key, indent + 2);
fprintf (mdebug__output, ")");
return table;
}
/*** @} */
/*
Local Variables:
coding: utf-8
End:
*/
|