1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 1032 1033 1034 1035 1036 1037 1038 1039 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057 1058 1059 1060 1061 1062 1063 1064 1065 1066 1067 1068 1069 1070 1071 1072 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085
|
<!DOCTYPE html>
<html class="client-nojs" lang="en" dir="ltr">
<head>
<meta charset="UTF-8"/>
<title>KANJIDIC Project - EDRDG Wiki</title>
<script>document.documentElement.className="client-js";RLCONF={"wgBreakFrames":false,"wgSeparatorTransformTable":["",""],"wgDigitTransformTable":["",""],"wgDefaultDateFormat":"dmy","wgMonthNames":["","January","February","March","April","May","June","July","August","September","October","November","December"],"wgRequestId":"927888317893c72ff07f9799","wgCSPNonce":false,"wgCanonicalNamespace":"","wgCanonicalSpecialPageName":false,"wgNamespaceNumber":0,"wgPageName":"KANJIDIC_Project","wgTitle":"KANJIDIC Project","wgCurRevisionId":954,"wgRevisionId":954,"wgArticleId":283,"wgIsArticle":true,"wgIsRedirect":false,"wgAction":"view","wgUserName":null,"wgUserGroups":["*"],"wgCategories":[],"wgPageContentLanguage":"en","wgPageContentModel":"wikitext","wgRelevantPageName":"KANJIDIC_Project","wgRelevantArticleId":283,"wgIsProbablyEditable":false,"wgRelevantPageIsProbablyEditable":false,"wgRestrictionEdit":[],"wgRestrictionMove":[],"wgVector2022PreviewPages":[]};RLSTATE={"site.styles":"ready","user.styles":
"ready","user":"ready","user.options":"loading","skins.vector.styles.legacy":"ready","jquery.tablesorter.styles":"ready"};RLPAGEMODULES=["site","mediawiki.page.ready","jquery.tablesorter","mediawiki.toc","skins.vector.legacy.js"];</script>
<script>(RLQ=window.RLQ||[]).push(function(){mw.loader.implement("user.options@12s5i",function($,jQuery,require,module){mw.user.tokens.set({"patrolToken":"+\\","watchToken":"+\\","csrfToken":"+\\"});});});</script>
<link rel="stylesheet" href="/wiki/load.php?lang=en&modules=jquery.tablesorter.styles%7Cskins.vector.styles.legacy&only=styles&skin=vector"/>
<script async="" src="/wiki/load.php?lang=en&modules=startup&only=scripts&raw=1&skin=vector"></script>
<meta name="generator" content="MediaWiki 1.39.1"/>
<meta name="format-detection" content="telephone=no"/>
<meta name="viewport" content="width=1000"/>
<link rel="icon" href="/favicon.ico"/>
<link rel="search" type="application/opensearchdescription+xml" href="/wiki/opensearch_desc.php" title="EDRDG Wiki (en)"/>
<link rel="EditURI" type="application/rsd+xml" href="https://www.edrdg.org/wiki/api.php?action=rsd"/>
<link rel="alternate" type="application/atom+xml" title="EDRDG Wiki Atom feed" href="/wiki/index.php?title=Special:RecentChanges&feed=atom"/>
</head>
<body class="mediawiki ltr sitedir-ltr mw-hide-empty-elt ns-0 ns-subject page-KANJIDIC_Project rootpage-KANJIDIC_Project skin-vector action-view skin-vector-legacy vector-feature-language-in-header-enabled vector-feature-language-in-main-page-header-disabled vector-feature-language-alert-in-sidebar-disabled vector-feature-sticky-header-disabled vector-feature-sticky-header-edit-disabled vector-feature-table-of-contents-disabled vector-feature-visual-enhancement-next-disabled"><div id="mw-page-base" class="noprint"></div>
<div id="mw-head-base" class="noprint"></div>
<div id="content" class="mw-body" role="main">
<a id="top"></a>
<div id="siteNotice"></div>
<div class="mw-indicators">
</div>
<h1 id="firstHeading" class="firstHeading mw-first-heading"><span class="mw-page-title-main">KANJIDIC Project</span></h1>
<div id="bodyContent" class="vector-body">
<div id="siteSub" class="noprint">From EDRDG Wiki</div>
<div id="contentSub"></div>
<div id="contentSub2"></div>
<div id="jump-to-nav"></div>
<a class="mw-jump-link" href="#mw-head">Jump to navigation</a>
<a class="mw-jump-link" href="#searchInput">Jump to search</a>
<div id="mw-content-text" class="mw-body-content mw-content-ltr" lang="en" dir="ltr"><div class="mw-parser-output"><div id="toc" class="toc" role="navigation" aria-labelledby="mw-toc-heading"><input type="checkbox" role="button" id="toctogglecheckbox" class="toctogglecheckbox" style="display:none" /><div class="toctitle" lang="en" dir="ltr"><h2 id="mw-toc-heading">Contents</h2><span class="toctogglespan"><label class="toctogglelabel" for="toctogglecheckbox"></label></span></div>
<ul>
<li class="toclevel-1 tocsection-1"><a href="#The_KANJIDIC_Project"><span class="tocnumber">1</span> <span class="toctext">The KANJIDIC Project</span></a>
<ul>
<li class="toclevel-2 tocsection-2"><a href="#Introduction"><span class="tocnumber">1.1</span> <span class="toctext">Introduction</span></a></li>
<li class="toclevel-2 tocsection-3"><a href="#Content_&_Format"><span class="tocnumber">1.2</span> <span class="toctext">Content & Format</span></a></li>
<li class="toclevel-2 tocsection-4"><a href="#Radical_and_Stroke_Counting_Rules"><span class="tocnumber">1.3</span> <span class="toctext">Radical and Stroke Counting Rules</span></a>
<ul>
<li class="toclevel-3 tocsection-5"><a href="#Radicals"><span class="tocnumber">1.3.1</span> <span class="toctext">Radicals</span></a></li>
<li class="toclevel-3 tocsection-6"><a href="#Other_Stroke_Patterns"><span class="tocnumber">1.3.2</span> <span class="toctext">Other Stroke Patterns</span></a></li>
</ul>
</li>
<li class="toclevel-2 tocsection-7"><a href="#Kanji_Dictionary_Search_Codes"><span class="tocnumber">1.4</span> <span class="toctext">Kanji Dictionary Search Codes</span></a>
<ul>
<li class="toclevel-3 tocsection-8"><a href="#SKIP_Codes"><span class="tocnumber">1.4.1</span> <span class="toctext">SKIP Codes</span></a></li>
<li class="toclevel-3 tocsection-9"><a href="#De_Roo_Codes"><span class="tocnumber">1.4.2</span> <span class="toctext">De Roo Codes</span></a></li>
<li class="toclevel-3 tocsection-10"><a href="#Four_Corner_Codes"><span class="tocnumber">1.4.3</span> <span class="toctext">Four Corner Codes</span></a></li>
</ul>
</li>
<li class="toclevel-2 tocsection-11"><a href="#Proposing_Changes"><span class="tocnumber">1.5</span> <span class="toctext">Proposing Changes</span></a></li>
<li class="toclevel-2 tocsection-12"><a href="#Kanji_Information_Sites"><span class="tocnumber">1.6</span> <span class="toctext">Kanji Information Sites</span></a></li>
<li class="toclevel-2 tocsection-13"><a href="#Legacy_Documentation"><span class="tocnumber">1.7</span> <span class="toctext">Legacy Documentation</span></a></li>
<li class="toclevel-2 tocsection-14"><a href="#Copyright_and_Permissions"><span class="tocnumber">1.8</span> <span class="toctext">Copyright and Permissions</span></a></li>
<li class="toclevel-2 tocsection-15"><a href="#History"><span class="tocnumber">1.9</span> <span class="toctext">History</span></a></li>
</ul>
</li>
</ul>
</div>
<h1><span class="mw-headline" id="The_KANJIDIC_Project">The KANJIDIC Project</span></h1>
<p><i>(Note that this page in the process of being rewritten, so be patient with any aspects that seems incomplete.)</i>
</p>
<h2><span class="mw-headline" id="Introduction">Introduction</span></h2>
<p>The KANJIDIC project, which began in 1991, has the goal of compiling and distributing comprehensive information on the kanji used in Japanese text processing. It covers the 13,108 kanji in three main Japanese standards:
</p>
<ul><li><a rel="nofollow" class="external text" href="https://en.wikipedia.org/wiki/JIS_X_0208">JIS X 0208-1998</a>, which includes 6,355 kanji.</li>
<li><a rel="nofollow" class="external text" href="https://en.wikipedia.org/wiki/JIS_X_0212">JIS X 0212-1990</a>, which includes extra 5,801 kanji</li>
<li><a rel="nofollow" class="external text" href="https://en.wikipedia.org/wiki/JIS_X_0213">JIS X 0213-2012</a>, which extends JIS X 0208, overlaps with some of JIS X 0212, and adds 952 additional kanji.</li></ul>
<p>Three data files are distributed by this project:
</p>
<ul><li>the KANJIDIC2 file, which is in XML format and <a rel="nofollow" class="external text" href="https://en.wikipedia.org/wiki/UTF-8">Unicode/UTF-8</a> coding, and contains information about all 13,108 kanji. (<a rel="nofollow" class="external text" href="http://www.edrdg.org/kanjidic/kanjidic2.xml.gz">download</a>)</li>
<li>the KANJIDIC file, which in in <a rel="nofollow" class="external text" href="https://en.wikipedia.org/wiki/Extended_Unix_Code#EUC-JP">EUC-JP</a> coding and covers the 6,355 kanji in JIS X 0208. (<a rel="nofollow" class="external text" href="http://www.edrdg.org/kanjidic/kanjidic.gz">download</a>)</li>
<li>the KANJD212 file, which also is in EUC-JP coding and covers the 5,801 kanji in JIS X 0212. (<a rel="nofollow" class="external text" href="http://www.edrdg.org/kanjidic/kanjd212.gz">download</a>)</li></ul>
<h2><span id="Content_.26_Format"></span><span class="mw-headline" id="Content_&_Format">Content & Format</span></h2>
<p>The database and distributed data files contain an entry for each of the kanji, with each entry containing a number of fields of data about the kanji. The data is described in the following table. The format of the distributed files as as follows:
</p>
<ul><li>the KANJIDIC and KANJD212 files are text files with one line per kanji and the information fields separated by spaces. The format of each line is:
<ul><li>the kanji itself followed by the hexadecimal form of the JIS <i>ku-ten</i> coding, e.g. "亜 3021" (the decimal <i>ku-ten</i> code is 16-01);</li>
<li>information fields beginning with one or two-letter codes as per the table below. For example "S10" indicates a stroke count of 10;</li>
<li>the Japanese readings of the kanji. ON readings (音読み) are generally in <i>katakana</i> and KUN readings (訓読み) in <i>hiragana</i>. An exception is the set of <i>kokuji</i> for measurements such as centimetres, where the reading is in <i>katakana</i>. Hyphens are used to indicate prefixes/suffixes, and '.' indicates the portion of the reading that is <i>okurigana</i>. There may be several classes of reading fields, with ordinary readings first, followed by members of the other classes, if any. The current other classes, and their tagging, are:
<ul><li>where the kanji has special <i>nanori</i> (i.e. name) readings, these are preceded the marker "T1";</li>
<li>where the kanji is a radical, and the radical name is not already a reading, the radical name is preceded the marker "T2".</li></ul></li>
<li>the meanings (usually in English). Each field begins with an open brace '{' and ends at the next close brace '}'.</li></ul></li>
<li>the KANJIDIC2 file is in XML and is structured according to its <a rel="nofollow" class="external text" href="http://www.edrdg.org/kanjidic/kanjidic2_dtdh.html">DTD</a> (Document Type Definition). The DTD contains extensive annotations and is intended to be the primary documentation for the file. This <a rel="nofollow" class="external text" href="http://www.edrdg.org/kanjidic/kd2examph.html">sample</a> illustrates the structure of a typical entry. Information fields are grouped by type within entities such as <dic_number> and <query_code>, with specific values indicated by an attribute code. For example the kanji 亜 has the number 43 in the original Nelson kanji dictionary and 81 in the New Nelson. This is recorded in the XML file as:<br /></li></ul>
<dl><dd><dic_number>
<dl><dd><dic_ref dr_type="nelson_c">43</dic_ref></dd>
<dd><dic_ref dr_type="nelson_n">81</dic_ref></dd>
<dd>....</dd></dl></dd>
<dd></dic_number></dd></dl>
<table class="wikitable sortable">
<caption>Kanjidic Information Fields
</caption>
<tbody><tr>
<th>Field
</th>
<th>Kanjidic Code<br />(if any)
</th>
<th>Group Entity
</th>
<th>Entity plus Attribute(s)<br />(if any)
</th>
<th>Comment
</th></tr>
<tr>
<td>Kanji
</td>
<td>none
</td>
<td>literal
</td>
<td>
</td>
<td>
</td></tr>
<tr>
<td>JIS code-point
</td>
<td>none
</td>
<td>codepoint
</td>
<td>cp_value cp_type="jis208" (or "jis212" or "jis213")
</td>
<td>e.g. 亜 is "3021" in KANJIDIC and<br />"1-16-01" in KANJIDIC2
</td></tr>
<tr>
<td>Unicode code-point
</td>
<td>U
</td>
<td>codepoint
</td>
<td>cp_value cp_type="ucs"
</td>
<td>
</td></tr>
<tr>
<td>Radical (Classical) (See Note 1 below)
</td>
<td>B/C
</td>
<td>radical
</td>
<td>rad_value rad_type="classical"
</td>
<td>Where Nelson uses the classical radical this has a "B" code, otherwise it has a "C" code
</td></tr>
<tr>
<td>Radical (Nelson)
</td>
<td>B
</td>
<td>radical
</td>
<td>rad_value rad_type="nelson_c"
</td>
<td>
</td></tr>
<tr>
<td>Grade
</td>
<td>G
</td>
<td>misc
</td>
<td>grade
</td>
<td>The "grade" of the kanji. <br />- G1 to G6 indicates the grade level as specified by the Japanese Ministry of Education for kanji that are to be taught in elementary school (1026 Kanji). These are sometimes called the <i>kyōiku</i> (education) kanji and are part of the set of <i>jōyō</i> (daily use) kanji;<br />- G8 indicates the remaining <i>jōyō</i> kanji that are to be taught in secondary school (additional 1,110 Kanji). Note that 1,106 of the G8 kanji are in the KANJIDIC file, a further two are in the KANJD212 file and the remaining two are only in the KANJIDIC2 XML file; <br />- G9 and G10 indicate <i>jinmeiyō</i> ("for use in names") kanji which in addition to the <i>jōyō</i> kanji are approved for use in family name registers and other official documents. G9 (649 kanji, of which 640 are in KANJIDIC) indicates the kanji is a "regular" name kanji, and G10 (212 kanji of which 130 are in KANJIDIC) indicates the kanji is a variant of a <i>jōyō</i> kanji.
</td></tr>
<tr>
<td>Stroke count
</td>
<td>S
</td>
<td>misc
</td>
<td>stroke_count
</td>
<td>The stroke count of the kanji. If more than one, the first is considered the accepted count, while subsequent ones are common miscounts. (See the section later in this document on counting strokes for some of the rules applied especially to radicals.)
</td></tr>
<tr>
<td>Frequency-of-use ranking
</td>
<td>F
</td>
<td>misc
</td>
<td>freq
</td>
<td>The 2,501 most-used characters have a ranking which expresses the relative frequency of occurrence of a character in modern Japanese. The data is based on an analysis of word frequencies in the Mainichi Shimbun over 4 years by Alexandre Girardi. Note: (a) these frequencies are biased towards words and kanji used in newspaper articles, and (b) the relative frequencies for the last few hundred kanji so graded is quite imprecise.
</td></tr>
<tr>
<td>Variant JIS 0208 kanji
</td>
<td>XJ0
</td>
<td>misc
</td>
<td>variant var_type="jis208"
</td>
<td>Code-point of a similar or related kanji. (In the kanjidic file the JIS hex code is used and in the XML file the equivalent "1-nn-nn" kuten code is used.)
</td></tr>
<tr>
<td>Variant JIS 0212 kanji
</td>
<td>XJ1
</td>
<td>misc
</td>
<td>variant var_type="jis212"
</td>
<td>Code-point of a similar or related kanji. (In the kanjidic file the JIS hex code is used and in the XML file the equivalent "1-nn-nn" kuten code is used.)
</td></tr>
<tr>
<td>Variant JIS 0213 kanji
</td>
<td>XJ2
</td>
<td>misc
</td>
<td>variant var_type="jis213"
</td>
<td>Code-point of a similar or related kanji. (In the kanjidic file the plane number (P: 1 or 2) plus the JIS hex code is used, and in the XML file the equivalent "P-nn-nn" kuten code is used.)
</td></tr>
<tr>
<td>Variant kanji (De Roo index)
</td>
<td>XJD
</td>
<td>misc
</td>
<td>variant var_type="deroo"
</td>
<td>Code-point of a similar or related kanji.
</td></tr>
<tr>
<td>Variant kanji (NJECD index)
</td>
<td>XH
</td>
<td>misc
</td>
<td>variant var_type="halpern_njecd"
</td>
<td>Code-point of a similar or related kanji.
</td></tr>
<tr>
<td>Variant kanji (S&H index)
</td>
<td>XI
</td>
<td>misc
</td>
<td>variant var_type="s_h"
</td>
<td>Code-point of a similar or related kanji.
</td></tr>
<tr>
<td>Variant kanji (Nelson index)
</td>
<td>XN
</td>
<td>misc
</td>
<td>variant var_type="nelson_c"
</td>
<td>Code-point of a similar or related kanji.
</td></tr>
<tr>
<td>Variant kanji (O'Neill index)
</td>
<td>XO
</td>
<td>misc
</td>
<td>variant var_type="oneill"
</td>
<td>Code-point of a similar or related kanji.
</td></tr>
<tr>
<td>Radical name(s)
</td>
<td>none
</td>
<td>misc
</td>
<td>rad_name
</td>
<td>The name of the radical in <i>hiragana</i>. In the KANJIDIC edition these are placed after the readings and preceded by the "T2" tag.
</td></tr>
<tr>
<td>JLPT Level
</td>
<td>J
</td>
<td>misc
</td>
<td>jlpt
</td>
<td>The pre-2010 level of the Japanese Language Proficiency Test (JLPT) in which the kanji occurs (1-4). Note that the JLPT test levels changed in 2010, with a new 5-level system (N1 to N5) being introduced. No official kanji lists are available for the new levels. The new levels are regarded as being similar to the old levels except that the old level 2 is now divided between N2 and N3, and the old levels 3 and 4 are now N4 and N5.
</td></tr>
<tr>
<td>Nelson (Classic) number
</td>
<td>N
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="nelson_c"
</td>
<td>The index number in "The Modern Reader's Japanese-English Character Dictionary", edited by Andrew Nelson. If not present, the character is not in Nelson, or is considered to be a non-standard version, in which case it may have a variant. Note that many kanji glyphs currently used are what Nelson described as "non-standard".
</td></tr>
<tr>
<td>Nelson (New) number
</td>
<td>V
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="nelson_n"
</td>
<td>The index number in "The New Nelson Japanese-English Character Dictionary", edited by John Haig.
</td></tr>
<tr>
<td>NJECD number
</td>
<td>H
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="halpern_njecd"
</td>
<td>The index number in the "New Japanese-English Character Dictionary" (1990), edited by Jack Halpern.
</td></tr>
<tr>
<td>Kodansha Kanji Dictionary number
</td>
<td>DP
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="halpern_kkd"
</td>
<td>The index numbers used by Jack Halpern in the "Kodansha Kanji Dictionary" (2013), which is the revised version of the "New Japanese-English Kanji Dictionary" of 1990.
</td></tr>
<tr>
<td>Kanji Learners Dictionary number
</td>
<td>DK
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="halpern_kkld"
</td>
<td>The index numbers used by Jack Halpern in the "Kanji Learners Dictionary", published by Kodansha in 1999.
</td></tr>
<tr>
<td>Kanji Learners Dictionary number (2nd ed)
</td>
<td>DL
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="halpern_kkld_2ed"
</td>
<td>The index numbers used by Jack Halpern in the 2nd edition of the "Kanji Learners Dictionary", published by Kodansha in 2013.
</td></tr>
<tr>
<td>Remembering The Kanji number
</td>
<td>L
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="heisig"
</td>
<td>The index number used in "Remembering The Kanji" by James Heisig.
</td></tr>
<tr>
<td>Remembering The Kanji number (6th ed)
</td>
<td>DN
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="heisig6"
</td>
<td>The index number used in "Remembering The Kanji, 6th Edition" by James Heisig.
</td></tr>
<tr>
<td>Gakken number
</td>
<td>K
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="gakken"
</td>
<td>The index number in the Gakken Kanji Dictionary ("A New Dictionary of Kanji Usage"). Some of the numbers relate to the list at the back of the book, jouyou kanji not contained in the dictionary, and various historical tables at the end.
</td></tr>
<tr>
<td>O'Neill's Japanese Names number
</td>
<td>O
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="oneill_names"
</td>
<td>The index number in "Japanese Names", by P.G. O'Neill. (Weatherhill, 1972) (Note: some of the numbers end with 'A'.)
</td></tr>
<tr>
<td>O'Neill's Essential Kanji number
</td>
<td>DO
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="oneill_kk"
</td>
<td>The index numbers used in P.G. O'Neill's "Essential Kanji".
</td></tr>
<tr>
<td>Morohashi number
</td>
<td>MN/MP
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="moro" m_vol m_page
</td>
<td>The index number and volume.page respectively of the kanji in the 13-volume Morohashi Daikanwajiten. A terminal `P` in the number, e.g. 4879P, indicates that it is 4879' in the original. In some 500 cases, the number is terminated with an `X`, to indicate that the kanji in Morohashi has a close, but not identical, glyph to the form in the JIS X 0208 standard.<br />In the XML the volume and page are attribute values.
</td></tr>
<tr>
<td>Henshall number
</td>
<td>E
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="henshall"
</td>
<td>The index number used in "A Guide To Remembering Japanese Characters" by Kenneth G. Henshall.
</td></tr>
<tr>
<td>Kanji & Kana number
</td>
<td>IN
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="sh_kk"
</td>
<td>The index number used in Spahn & Hadamitzky's "Kanji & Kana", 2nd edition (Tuttle).
</td></tr>
<tr>
<td>Kanji & Kana number (2011 ed)
</td>
<td>DA
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="sh_kk2"
</td>
<td>The index number used in 2011 edition of Spahn & Hadamitzky's "Kanji & Kana".
</td></tr>
<tr>
<td>Sakade number
</td>
<td>DS
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="sakade"
</td>
<td>The index numbers used in the early editions of "A Guide To Reading and Writing Japanese", edited by Florence Sakade.
</td></tr>
<tr>
<td>Japanese Kanji Flashcards number
</td>
<td>DF
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="jf_cards"
</td>
<td>The index numbers used in the "Japanese Kanji Flashcards", by Max Hodges and Tomoko Okazaki (White Rabbit Press).
</td></tr>
<tr>
<td>Henshall Guide number
</td>
<td>DH
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="henshall3"
</td>
<td>The index numbers used in the 3rd edition of "A Guide To Reading and Writing Japanese" edited by Ken Henshall et al.
</td></tr>
<tr>
<td>Tuttle Kanji Cards number
</td>
<td>DT
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="tutt_cards"
</td>
<td>The index numbers used in the Tuttle Kanji Cards, compiled by Alexander Kask.
</td></tr>
<tr>
<td>Crowley number
</td>
<td>DC
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="crowley"
</td>
<td>The index numbers used in "The Kanji Way to Japanese Language Power" by Dale Crowley.
</td></tr>
<tr>
<td>Kanji in Context number
</td>
<td>DJ
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="kanji_in_context"
</td>
<td>The index numbers used in the "Kanji in Context" by Nishiguchi and Kono.
</td></tr>
<tr>
<td>Kodansha Compact Kanji Guide number
</td>
<td>DG
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="kodansha_compact"
</td>
<td>The index numbers used in the "Kodansha Compact Kanji Guide".
</td></tr>
<tr>
<td>Japanese For Busy People number
</td>
<td>DB
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="busy_people"
</td>
<td>The index numbers used in "Japanese For Busy People" vols I-III, published by the AJLT. The codes are the volume.chapter.
</td></tr>
<tr>
<td>Maniette number
</td>
<td>DM
</td>
<td>dic_number
</td>
<td>dic_ref dr_type="maniette"
</td>
<td>The numbers in Yves Maniette's "Les Kanjis dans la tête", the French adaptation of Heisig's "Remembering The Kanji".
</td></tr>
<tr>
<td>SKIP code
</td>
<td>P
</td>
<td>query_code
</td>
<td>q_code qc_type="skip"
</td>
<td>The SKIP (System of Kanji Indexing by Patterns) developed by Jack Halpern. The code is of the form "l-m-n". See <a href="#SKIP_Codes">SKIP Codes</a> section for more information.
</td></tr>
<tr>
<td>S&H descriptor
</td>
<td>I
</td>
<td>query_code
</td>
<td>q_code qc_type="sh_desc"
</td>
<td>The index code in "The Kanji Dictionary" (Tuttle 1996), by Spahn & Hadamitzky. It is the form nxnn.n, e.g. 3k11.2, where the kanji has 3 strokes in the identifying radical, it is radical "k" in the S&H classification system, there are 11 other strokes, and it is the 2nd kanji in the 3k11 sequence.
</td></tr>
<tr>
<td>Four Corner code
</td>
<td>Q
</td>
<td>query_code
</td>
<td>q_code qc_type="four_corner"
</td>
<td>The Four Corner code for the kanji. See the <a href="#Four_Corner_Codes">Four Corner codes</a> section for more information.
</td></tr>
<tr>
<td>De Roo code
</td>
<td>DR
</td>
<td>query_code
</td>
<td>q_code qc_type="deroo"
</td>
<td>The codes developed by Father Joseph De Roo, and published in his book "2001 Kanji" (Bonjinsha). See the <a href="#De_Roo_Codes">De Roo Codes</a> section for more information.
</td></tr>
<tr>
<td>Misclassification code
</td>
<td>ZPP
</td>
<td>query_code
</td>
<td>q_code qc_type="skip" skip_misclass="posn"
</td>
<td>SKIP misclassification by position.
</td></tr>
<tr>
<td>Misclassification code
</td>
<td>ZSP
</td>
<td>query_code
</td>
<td>q_code qc_type="skip" skip_misclass="stroke_count"
</td>
<td>SKIP misclassification by stroke count.
</td></tr>
<tr>
<td>Misclassification code
</td>
<td>ZBP
</td>
<td>query_code
</td>
<td>q_code qc_type="skip" skip_misclass="stroke_and_posn"
</td>
<td>SKIP misclassification by both position and stroke count.
</td></tr>
<tr>
<td>Misclassification code
</td>
<td>ZRP
</td>
<td>query_code
</td>
<td>q_code qc_type="skip" skip_misclass="stroke_diff"
</td>
<td>SKIP misclassification by differing opinions on stroke counts.
</td></tr>
<tr>
<td>Chinese reading
</td>
<td>Y
</td>
<td>rmgroup
</td>
<td>reading r_type="pinyin"
</td>
<td>The PinYin (Chinese) reading of the kanji.
</td></tr>
<tr>
<td>Korean reading (romanized)
</td>
<td>W
</td>
<td>rmgroup
</td>
<td>reading r_type="korean_r"
</td>
<td>The Korean reading of the kanji in the (Republic of Korea) Ministry of Education style.
</td></tr>
<tr>
<td>Korean reading (hangul)
</td>
<td>not included
</td>
<td>rmgroup
</td>
<td>reading r_type="korean_h"
</td>
<td>The Korean reading of the kanji in the hangul script.
</td></tr>
<tr>
<td>Vietnamese reading (chữ quốc ngữ)
</td>
<td>not included
</td>
<td>rmgroup
</td>
<td>reading r_type="vietnam"
</td>
<td>The Vietnamese reading of the kanji in chữ quốc ngữ.
</td></tr>
<tr>
<td>Japanese on reading (<i>katakana</i>)
</td>
<td>none
</td>
<td>rmgroup
</td>
<td>reading r_type="ja_on"
</td>
<td>In the KANJIDIC edition the readings are placed between the information fields and the meanings.
</td></tr>
<tr>
<td>Japanese kun reading (<i>usu. hiragana</i>)
</td>
<td>none
</td>
<td>rmgroup
</td>
<td>reading r_type="ja_kun"
</td>
<td>
</td></tr>
<tr>
<td>Meanings
</td>
<td>none
</td>
<td>rmgroup
</td>
<td>meaning m_lang="xx"
</td>
<td>The kanji meaning(s). For languages other than English the m_lang attribute is used with two-letter ISO 639-1 language codes. In the KANJIDIC edition the meanings are placed at the end of the line.
</td></tr>
<tr>
<td>Name reading(s) (<i>hiragana</i>)
</td>
<td>T1
</td>
<td>
</td>
<td>nanori
</td>
<td>The readings only associated with named-entities. In the KANJIDIC edition the first of these is preceded by the "T1" tag.
</td></tr></tbody></table>
<p>Note 1: For the sake of consistency the classical radical is the one indicated in the JIS漢字字典 (日本規格協会).
</p>
<h2><span class="mw-headline" id="Radical_and_Stroke_Counting_Rules">Radical and Stroke Counting Rules</span></h2>
<p>These rules apply to:
</p>
<ol><li>the stroke-counts themselves;</li>
<li>the stroke counts in the SKIP codes. Where this results in a SKIP which differs from that in the NJECD, or in the non-NJECD SKIPs provided by Jack Halpern, the Jack Halpern version is included prefixed with "ZR".</li></ol>
<h3><span class="mw-headline" id="Radicals">Radicals</span></h3>
<p>The radicals listed below are ones where there are differing approaches to the counting of radicals in the various references. The stroke counting in this file does not strictly follow any reference, but tends to be more aligned to Halpern.
</p>
<ol><li>B54 ENNYOU - 廴. Traditionally counted as 3 strokes, but more recently often counted as 2. S&H count this as 2; Nelson, Halpern, Koujien, etc, count it is 3. I treat it as 3.</li>
<li>B97 URI - 瓜. Traditionally counted as 5 strokes, as the middle portion looks like a katakana ム. Modern glyphs invariably make it look like 6 strokes. Nelson says it is 5 strokes. Halpern does too, but then counts the shape as 6 in other kanji. Koujien says 6, as do S&H. I treat it as 6.</li>
<li>B113 SHIMESU e.g. 礼, is counted as 4 strokes in that form, and 5 strokes in its older form, 祀 <a rel="nofollow" class="external text" href="http://www.edrdg.org/~jwb/U7940old.png">(image)</a>. 18 kanji are in the 4-stroke form and 20 are in the 5-stroke form. (Nelson and S&H count it as 4; Halpern counts it as 4 or 5. [See Note 1.])</li>
<li>B131 SHIN/KERAI 臣. Counted as 7 (Nelson counts it as 6, Halpern as 7 (in the book), and S&H as both for different kanji.)</li>
<li>B136 MAI ASHI 舛. Counted as 7 (traditionally counted as 6, in accordance with the older writing of `ヰ'. Nelson counts as 6, S&H as 7, and Halpern as 7 for 常用 and 人名用漢字 and 6 for the rest.) Note this is also applied to counting 絳 and for kanji with the 韋 pattern.</li>
<li>B140 KUSA-KANMURI e.g. 苛 always counted as 3 strokes (Halpern counts this 4 strokes for the (mostly level 2) kanji where the older form is often printed.) Note that this has been carried through to kanji where this element is not the indexing radical, such as 朦.</li>
<li>B162 SHIN-NYUU e.g. 遙 or 逢 counted as 3 or 4 strokes. (Nelson and S&H count it as 2 strokes, and Halpern as either 3 or 4.) [See Note 1 below.]</li>
<li>B163 OOZATOZUKIRI & B170 KOZATO-HEN 邦 and 阡 always counted as 3 strokes (Nelson and S&H count it as 2, Halpern as 3.) This also applies where it appears mid-kanji, such as in 橢.</li>
<li>B184 SHOKU HEN 食, 飢, etc.is counted as 8 strokes in the 飢 form, and as 9 strokes in the 飭 and 餐 forms. (Nelson and S&H count it as 8 strokes, and Halpern as 8 or 9.) [See Note 1. below.]</li>
<li>B199 MUGI 麦 always counted as 7 strokes, except for 麥 & 麩 where it is counted as 11. (Nelson and Halpern do the same, and S&H avoid treating it as a radical, but count it as 12 in the remainder.)</li>
<li>The ROO or OI radical (老) has a variant consisting of the top 4 strokes. For example, it is in 者. Traditionally, this variant had an extra dot, and was counted as 5 strokes. I'm counting it as 4 throughout.</li></ol>
<h3><span class="mw-headline" id="Other_Stroke_Patterns">Other Stroke Patterns</span></h3>
<ol><li>While the pattern 臼 is a 6-stroke radical, the top half of 叟 is made up of three distinct parts totalling 8 strokes. Note that this also is the case with 嫂, 溲, 艘 and 痩 despite the simplification in the JIS glyphs.</li>
<li>牙 (KIBA HEN) is a problem. It is classically counted as 4 strokes, but these days has a flick that makes it effectively 5. Halpern, Nelson and S&H usually have it as 5 strokes, so I'm standardizing on that.</li>
<li>Another little horror is 旡 (MU or NASHI), which is classically counted as 4 strokes. The most common variant has 5 strokes, but looks like 6. Halpern, S&H and the Classical Nelson count this as 4 strokes, and the New Nelson as 5. I'm making it 5 too.</li>
<li>The JUU or ASHIATO radical is at the bottom of 禽 and 禺. It is traditionally counted as 5 strokes, although sometimes it looks like 4. I'm using 5 throughout.</li>
<li>A related shape is ム, as in 瓜, 孤, 弧, etc. This is sometimes counted as two strokes (both Nelsons) and sometimes as three strokes (Halpern, S&H). Classically it is regarded as two strokes. I am using 6 strokes for 瓜.</li>
<li>The pattern to the left of 敝, which appears in several kanji, e.g. 幣 and 瞥, has 8 strokes. (There are 3 strokes at the top as in 尚.)</li>
<li>The "east" pattern (東) has 8 strokes. There is an older form in which there are two strokes in the box (柬). It is counted as 8 strokes here in the 東 form (e.g. 諌) and 9 in the 柬 form, as in 諫.</li>
<li>The pattern at the bottom of 雋 is counted as 4 strokes in modern dictionaries, although traditionally it was 5.</li>
<li>The pattern 巻, which appears in several kanji, is counted as 9 strokes. Several dictionaries count it as either 8 or 9.</li>
<li>The pattern on the left of 収 is variously handled as 2 strokes or 3 strokes. As more recent dictionaries make it 4, I will do so too.</li>
<li>The 攵 pattern has 3 and 4-stroke versions, and sometimes the glyphs can be confusing as to which is used. In the 緻 kanji, for example, it is traditionally counted as 3, but Spahn & Hadamitzky count it as 4 and the Nelsons include both.</li></ol>
<p>Note: The JIS X 0208-1990 standard does not formally specify the precise glyphs used for kanji, however the glyphs it uses in the published version have become de facto standards for many font compilations. In the published standard, for several kanji, e.g. 辿/迚, 礼/祀, 飢/飭, the JIS level one kanji use the simpler form, and the Level 2 kanji use the older more complex form. Just to make matters worse, many fonts for JIS X 0208 kanji are based on the bit-maps specified in JIS X 9051-1984 standard, which defines the 16x16 patterns for JIS X 0208-1983 characters. According to Ken Lunde: "This standard was not very good, and JSA is no longer supporting it." Anyway, JIS X 9051-1984 had the simpler form for all these bushu in both Levels 1 and 2, as well as having simplifications of kanji like 濾. Thus, as the font foundries have freedom to choose whichever glyphs they like, what you see on your screen may well not agree with these rules. All the rules in this appendix relate to the glyphs as published in the JIS X 0208-1990 standard, and as appearing in font compilations based on them.
</p>
<h2><span class="mw-headline" id="Kanji_Dictionary_Search_Codes">Kanji Dictionary Search Codes</span></h2>
<h3><span class="mw-headline" id="SKIP_Codes">SKIP Codes</span></h3>
<p>The System of Kanji Indexing by Patterns (SKIP) is a scheme for the classification and rapid retrieval of Chinese characters on the basis of geometrical patterns. Developed by Jack Halpern, it first appeared in the New Japanese-English Character Dictionary (Kenkyusha, Tokyo 1990; NTC, Chicago 1993), and in successor publications such as the "Kanji Learners Dictionary" (Kodansha 1999,2011) and the "Kodansha Kanji Dictionary" (2013). A description of the coding system is <a rel="nofollow" class="external text" href="http://www.edrdg.org/wwwjdic/SKIP.html">available</a>.
</p><p>As examples, 割 has a SKIP code of 1-10-2, indicating it is divided into left-right portions with 10 strokes at the left and 2 at the right. 度 has a SKIP code of 度 indicating it has a 3-stroke enclosure with 6 strokes inside it.
</p>
<h3><span class="mw-headline" id="De_Roo_Codes">De Roo Codes</span></h3>
<p>The De Roo codes were developed by Father Joseph De Roo, and published in his book "2001 Kanji" (Bonjinsha). They are based on the shapes observed at the top and bottom of the character. A <a rel="nofollow" class="external text" href="http://www.edrdg.org/wwwjdic/deroo.html">detailed description</a> is available.
</p><p>As an example, 亜 has a code of 3273 indicating that the top of the kanji is pattern number 32 (兀) and the bottom pattern number 73 (horizontal line with two vertical strokes above it.
</p>
<h3><span class="mw-headline" id="Four_Corner_Codes">Four Corner Codes</span></h3>
<p>The Four Corner coding system was invented by Wang Chen in 1928, it has since then been widely used in dictionaries in China and Japan for classifying kanji and hanzi. In China it is losing popularity in favour of Pinyin ordering. Some Japanese dictionaries, such as the Morohashi Daikanwajiten have a Four Corner Index.An <a rel="nofollow" class="external text" href="http://www.edrdg.org/wwwjdic/FOURCORNER.html">overview</a> of the coding system is available.
In some cases a character may have two of these codes, as it is can be little ambiguous, and Morohashi has some kanji coded differently from their traditional Chinese codes.
The coding system indexes characters according to the shapes at the corners.
</p>
<h2><span class="mw-headline" id="Proposing_Changes">Proposing Changes</span></h2>
<p>There is currently no online access to the database the holds the KANJIDIC contents (the information is mostly quite static.) Anyone wishing to propose a change to the data for a kanji, e.g. add or change a reading will need to email Jim Breen at jimbreen@gmail.com.
</p>
<h2><span class="mw-headline" id="Kanji_Information_Sites">Kanji Information Sites</span></h2>
<p><i>(Being expanded)</i>
</p>
<ul><li>Jim's <a rel="nofollow" class="external text" href="http://nihongo.monash.edu/kanjiinfo.html">Kanji Information Page</a>.</li>
<li>The <a rel="nofollow" class="external text" href="https://kanjialive.com/">Kanji alive</a> site at the University of Chicago.</li>
<li>The <a rel="nofollow" class="external text" href="https://www.kanjipedia.jp/">Kanjipedia</a> sit (mostly in Japanese).</li></ul>
<h2><span class="mw-headline" id="Legacy_Documentation">Legacy Documentation</span></h2>
<p>The current Wiki page was compiled from several older documents, which are no longer being maintained. They are still available for historical purposes. They are:
</p>
<ul><li>a basic home page about <a rel="nofollow" class="external text" href="http://www.edrdg.org/kanjidic/kanjd2index_legacy.html">KANJIDIC2</a>;</li>
<li>an overview page about the <a rel="nofollow" class="external text" href="http://www.edrdg.org/kanjidic/kanjidic2_ov_legacy.html">KANJIDIC2 structure</a>;</li>
<li>an overview page about <a rel="nofollow" class="external text" href="http://www.edrdg.org/kanjidic/kanjidic_legacy.html">KANJIDIC and KANJD212</a>;</li>
<li>the original <a rel="nofollow" class="external text" href="http://www.edrdg.org/kanjidic/kanjidic_doc_legacy.html">KANJIDIC</a> documentation;</li>
<li>the original <a rel="nofollow" class="external text" href="http://www.edrdg.org/kanjidic/kanjd212_doc_legacy.html">KANJD212</a> documentation.</li></ul>
<h2><span class="mw-headline" id="Copyright_and_Permissions">Copyright and Permissions</span></h2>
<p>The KANJIDIC project files are released under a Creative Commons Attribution-ShareAlike Licence (V4.0). See the <a rel="nofollow" class="external text" href="https://www.edrdg.org/edrdg/licence.html">EDRDG General Dictionary Licence Statement</a> for details.
</p><p>For the most part the information provided in the project's files is in the public domain. Information relating to the sequence numbers of kanji in published dictionaries is not considered to be subject to copyright. Descriptor and other search codes are considered to be the intellectusl policy of the developers. With regard to the codes included in the KANJIDIC files:
</p>
<ul><li>in 2014 the SKIP codes were placed by Jack Halpern under a under a CC-SA licence. See <a rel="nofollow" class="external text" href="http://www.kanji.org/kanji/dictionaries/skip_permission.htm">this page</a> for his announcement. It is now under a <a rel="nofollow" class="external text" href="https://creativecommons.org/licenses/by-nc-sa/4.0/">Creative Commons Attribution-Noncommercial-Share Alike 4.0 Unported Licence</a>.</li>
<li>Fr De Roo provided written permission for the De Roo codes to be included in KANJIDIC.</li>
<li>the Spahn and Hadamitzky descriptor codes were kindly supplied by Mark Spahn for inclusion in KANJIDIC.</li></ul>
<h2><span class="mw-headline" id="History">History</span></h2>
<p><i>(some comments by Jim Breen)</i>
</p><p>KANJIDIC began around 1991 as two files: jis1detl.lst and jis2detl.lst, which were later merged into a single file.
</p><p>The first file was compiled initially from the file "kinfo.dat" supplied by Stephen Chung, who in turn compiled his file from a file prepared by Mike Erickson. I originally added about 1900 "meanings" by James Heisig keyed in by Kevin Moore from the book "Remembering The Kanji". I later added the meanings from Rik Smoody's files, compiled when he was working for Sony in Japan. These appear to have been based on Nelson.
</p><p>The second file was compiled from a complete JIS2 list with Bushu and stroke counts kindly supplied to me by Jon Crossley, to which I added Nelson numbers, yomikata and meanings extracted from Rik Smoody's file.
</p><p>Theresa Martin was an early assister with this file, particularly with tracking down and correcting many mistranscribed yomikata (the old zu/dzu, oo/ou, ji/dji, etc. problems).
</p><p>Jeffrey Friedl did a major overhaul in September-October 1992, in which he added the original frequency rankings, Halpern codes, SKIP patterns, updated the grading ("G" fields) to reflect the modern Jouyou lists, corrected radical numbers, corrected stroke counts and readings to fall in line with modern usage.
</p><p>Magnus Halldorsson corrected some erroneous Halpern numbers, and provided them for a lot of the radicals. He provided the list of Heisig indices, which he originally compiled himself, then verified and expanded using lists from Richard Walters and Antti Karttunen. He also passed on to me the list of Gakken indices compiled by Antti Karttunen.
</p><p>Lee Collins provided the Unicode mappings.
</p><p>Iain Sinclair has provided the yomikata, meanings and S&H indices of many of the obscure JIS2 kanji.
</p><p>Christian Wittern, a Sinologist working at Kyoto University, sent me a monster file prepared by Dr Urs App from Hanazono College. From this I have extracted the Four Corner and Morohashi information. Christian also provided the original Pinyin details, which were later replaced. I am very grateful for these significant contributions.
</p><p>In March 1994 the Morohashi indices were proof-read and corrected by Christian.
</p><p>Alfredo Pinochet supplied all the Henshall numbers.
</p><p>Ingar Holst has provided considerable assistance in regularizing the Bnnn and Cnnn radical classifications to remove some errors that were in the original JIS2 file, and to make it all conform to Nelson's classification.
</p><p>In mid-1993 I withdrew the SKIP codes from the distributed file as it appeared that their presence violated Jack Halpern's copyright on these codes. Jeffrey Friedl contacted Jack about this, and Jack obtained permission from his publisher for the codes to be included subject (initially) to copyright and usage restrictions. In March 1994 the Halpern indices and SKIP codes were checked against an extract from Jack's files, and the "Z" mis-classification codes added, again from his files. Jack has also made a lot of useful comments and suggestions about the content and format of the file. I am most grateful to Jack for his permission and assistance, and also to Jeffrey for making the contact.
</p><p>In May 1995, a number of updates took place. Jeffrey Friedl established contact with James Heisig, and obtained a further set of his indices. I contacted Mark Spahn (via the "honyaku" mailing list) and he kindly provided most of the missing S&H descriptors, and Jack Halpern released to me the SKIP codes of the kanji not in the New Japanese-English Character Dictionary. For all this material I am most grateful.
</p><p>In August 1995, I added the O'Neill index numbers. These were compiled by Jenny Nazak, David Rosenfeld and myself. Thanks to Jenny & David for their assistance.
</p><p>In January and February 1996 the Morohashi numbers were checked thoroughly against two important sources: a file of Unicode-Morohashi data (Uni2Dict) which was prepared by Koichi Yasuoka from the allocation in the JIS X 0221 standard, and the review draft of the proposed revision of the JIS X 0208 standard, which was prepared by the INSTAC Committee, and made available in a text file, thus enabling comparisons. All the mismatches between the three files were examined against the Morohashi text, and extensive corrections made to all three files. I am grateful to Koichi Yasuoka and Masayuki Toyoshima for their considerable assistance in this task.
</p><p>In March 1996 the Korean readings were added. They were provided by Dr Charles Muller, then of of Toyo Gakuen University, to whom I am most grateful. Chuck's compilation of Korean readings is extremely thorough and scholarly, and I am pleased to be able to incorporate them.
</p><p>In April 1996 the readings of all the kanji were compared with those in the JIS X 0208 draft, and a number of corrections and additions made.
</p><p>In May 1996 I carried out a "unification" of the readings of the KANJIDIC and KANJD212 files, wherein all the readings of the "itaiji" were brought into line. The identification of these itaiji was drawn from a file posted to the fj.kanji group by Taichi Kawabata (kawabata@is.s.u-tokyo.ac.jp), which was compiled at the ETL from the itaiji identification in the JIS X 0208 and JIS X 0212 standards. I corrected a few errors, and added some extra sets which were indicated in the JIS X 0208-1996 draft.
</p><p>In July 1996 the Pinyin details were completely replaced by a new set. The original Pinyin were from an earlier compilation by Christian Wittern, and and contained many errors. Two more reliable sources had become available: the Uni2Pinyin file compiled by Koichi Yasuoka, which is based in part on the TONEPY.tit by Yongguang Zhang; and the PYCHAR set of readings of Big5 hanzi compiled by Christian Wittern. The Pinyin currently in the KANJIDIC file is a combination of the two, following the order in the Uni2Pinyin file.
</p><p>In August 1996 I corrected a few more missing and erroneous Nelson numbers, using a massive Nelson list prepared by Wolfgang Cronrath. He also flagged the kokuji, so I added these to the readings fields as "{(kokuji)}".
</p><p>Also in August 1996 I deleted the handful of former "XJxxxx" cross-references, and replaced them with a much more comprehensive set, so that they now represent all the recognized "itaiji". The file I used for this was the corrected itaiji file mentioned above.
</p><p>In April 1997 I corrected a large number of bushu codes. Many of these had been identified as errors by Jean-Luc Leger who analyzed and examined all the Nelson bushu. I also identified and added a large number of missing Cnnn codes.
</p><p>Also in April 1997 I added the S&H "Kanji & Kana" indices. These had been keyed by Olivier Galibert (Olivier.Galibert@mines.u-nancy.fr). (There must be an outbreak of kanji interest on Nancy.)
</p><p>In February 1998, the long-awaited inclusion of the "New Nelson" numbers took place. I had been waiting for the editor of the New Nelson, John Haig, to supply a list (as he had agreed some years before), but in the meantime, Jean-Luc Leger keyed a list, so they are now available.
</p><p>Also between December 1997 and February 1998 a large number of Level 2 kanji had their stroke counts corrected to bring them into line with the counting principles used in the Level 1 kanji. This usually aligned the counts with those used in the New Nelson and in S&H. Appendix E of this document was amended to reflect this. The leg-work in tracking this material down was done by Wolfgang Cronrath.
</p><p>During December 1998 & Jan 1999 I updated the stroke counts of many of the Level 2 kanji, using an analysis of them carried out by Wolfgang Cronrath. I also added the De Roo codes, which had been keyed by Jasmin Blanchette, who also typed the explanatory material. I contacted Fr De Roo in Tokyo who readily agreed to the inclusion of the codes.
</p><p>The extension of the S&H Kana & Kanji numbers to the 2nd edition was done by Enrique Sanchez Rosa.
</p><p>The Hangul versions of the Korean readings (which only appear in the XML version) were provided by Francis Bond and Kyonghee Paik.
</p><p>I did the Tuttle card numbers myself.
</p><p>James Rose provided the numbers from Crowley's "The Kanji Way to Japanese Language Power", Sakade's "A Guide To Reading and Writing Japanese", and also for that book's 3rd Edition edited by Henshall, Seeley & De Groot.
</p><p>The "Kodansha's compact Kanji guide" codes were provided by Richard Fremmerlid.
</p><p>The "Kanji in Context" codes were provided by Randy Foreman.
</p><p>The Spanish kanji meanings (which appear in the XML format, and may also appear in special versions of KANJIDIC) were compiled by Francisco Gutierrez and provided by Gabriel Sanroman.
</p><p>Alain Thierion translated the meanings of the kanji into French, and also provided the Maniette numbers.
</p><p>Andrew Slater provided updates to the JLPT numbers, and additional numbers for the Japanese Flashcards series.
</p>
<!--
NewPP limit report
Cached time: 20240805054918
Cache expiry: 86400
Reduced expiry: false
Complications: [show‐toc]
CPU time usage: 0.043 seconds
Real time usage: 0.043 seconds
Preprocessor visited node count: 47/1000000
Post‐expand include size: 0/2097152 bytes
Template argument size: 0/2097152 bytes
Highest expansion depth: 2/100
Expensive parser function count: 0/100
Unstrip recursion depth: 0/20
Unstrip post‐expand size: 0/5000000 bytes
-->
<!--
Transclusion expansion time report (%,ms,calls,template)
100.00% 0.000 1 -total
-->
<!-- Saved in parser cache with key wikidb-mediawiki-:pcache:idhash:283-0!canonical and timestamp 20240805054918 and revision id 954.
-->
</div>
<div class="printfooter" data-nosnippet="">Retrieved from "<a dir="ltr" href="https://www.edrdg.org/wiki/index.php?title=KANJIDIC_Project&oldid=954">https://www.edrdg.org/wiki/index.php?title=KANJIDIC_Project&oldid=954</a>"</div></div>
<div id="catlinks" class="catlinks catlinks-allhidden" data-mw="interface"></div>
</div>
</div>
<div id="mw-navigation">
<h2>Navigation menu</h2>
<div id="mw-head">
<nav id="p-personal" class="vector-menu mw-portlet mw-portlet-personal vector-user-menu-legacy" aria-labelledby="p-personal-label" role="navigation" >
<h3
id="p-personal-label"
class="vector-menu-heading "
>
<span class="vector-menu-heading-label">Personal tools</span>
</h3>
<div class="vector-menu-content">
<ul class="vector-menu-content-list"><li id="pt-login" class="mw-list-item"><a href="/wiki/index.php?title=Special:UserLogin&returnto=KANJIDIC+Project" title="You are encouraged to log in; however, it is not mandatory [o]" accesskey="o"><span>Log in</span></a></li></ul>
</div>
</nav>
<div id="left-navigation">
<nav id="p-namespaces" class="vector-menu mw-portlet mw-portlet-namespaces vector-menu-tabs vector-menu-tabs-legacy" aria-labelledby="p-namespaces-label" role="navigation" >
<h3
id="p-namespaces-label"
class="vector-menu-heading "
>
<span class="vector-menu-heading-label">Namespaces</span>
</h3>
<div class="vector-menu-content">
<ul class="vector-menu-content-list"><li id="ca-nstab-main" class="selected mw-list-item"><a href="/wiki/index.php/KANJIDIC_Project" title="View the content page [c]" accesskey="c"><span>Page</span></a></li><li id="ca-talk" class="new mw-list-item"><a href="/wiki/index.php?title=Talk:KANJIDIC_Project&action=edit&redlink=1" rel="discussion" title="Discussion about the content page (page does not exist) [t]" accesskey="t"><span>Discussion</span></a></li></ul>
</div>
</nav>
<nav id="p-variants" class="vector-menu mw-portlet mw-portlet-variants emptyPortlet vector-menu-dropdown" aria-labelledby="p-variants-label" role="navigation" >
<input type="checkbox"
id="p-variants-checkbox"
role="button"
aria-haspopup="true"
data-event-name="ui.dropdown-p-variants"
class="vector-menu-checkbox"
aria-labelledby="p-variants-label"
/>
<label
id="p-variants-label"
aria-label="Change language variant"
class="vector-menu-heading "
>
<span class="vector-menu-heading-label">English</span>
</label>
<div class="vector-menu-content">
<ul class="vector-menu-content-list"></ul>
</div>
</nav>
</div>
<div id="right-navigation">
<nav id="p-views" class="vector-menu mw-portlet mw-portlet-views vector-menu-tabs vector-menu-tabs-legacy" aria-labelledby="p-views-label" role="navigation" >
<h3
id="p-views-label"
class="vector-menu-heading "
>
<span class="vector-menu-heading-label">Views</span>
</h3>
<div class="vector-menu-content">
<ul class="vector-menu-content-list"><li id="ca-view" class="selected mw-list-item"><a href="/wiki/index.php/KANJIDIC_Project"><span>Read</span></a></li><li id="ca-viewsource" class="mw-list-item"><a href="/wiki/index.php?title=KANJIDIC_Project&action=edit" title="This page is protected. You can view its source [e]" accesskey="e"><span>View source</span></a></li><li id="ca-history" class="mw-list-item"><a href="/wiki/index.php?title=KANJIDIC_Project&action=history" title="Past revisions of this page [h]" accesskey="h"><span>View history</span></a></li></ul>
</div>
</nav>
<nav id="p-cactions" class="vector-menu mw-portlet mw-portlet-cactions emptyPortlet vector-menu-dropdown" aria-labelledby="p-cactions-label" role="navigation" title="More options" >
<input type="checkbox"
id="p-cactions-checkbox"
role="button"
aria-haspopup="true"
data-event-name="ui.dropdown-p-cactions"
class="vector-menu-checkbox"
aria-labelledby="p-cactions-label"
/>
<label
id="p-cactions-label"
class="vector-menu-heading "
>
<span class="vector-menu-heading-label">More</span>
</label>
<div class="vector-menu-content">
<ul class="vector-menu-content-list"></ul>
</div>
</nav>
<div id="p-search" role="search" class="vector-search-box-vue vector-search-box-show-thumbnail vector-search-box-auto-expand-width vector-search-box">
<div>
<h3 >
<label for="searchInput">Search</label>
</h3>
<form action="/wiki/index.php" id="searchform"
class="vector-search-box-form">
<div id="simpleSearch"
class="vector-search-box-inner"
data-search-loc="header-navigation">
<input class="vector-search-box-input"
type="search" name="search" placeholder="Search EDRDG Wiki" aria-label="Search EDRDG Wiki" autocapitalize="sentences" title="Search EDRDG Wiki [f]" accesskey="f" id="searchInput"
>
<input type="hidden" name="title" value="Special:Search">
<input id="mw-searchButton"
class="searchButton mw-fallbackSearchButton" type="submit" name="fulltext" title="Search the pages for this text" value="Search">
<input id="searchButton"
class="searchButton" type="submit" name="go" title="Go to a page with this exact name if it exists" value="Go">
</div>
</form>
</div>
</div>
</div>
</div>
<div id="mw-panel">
<div id="p-logo" role="banner">
<a class="mw-wiki-logo" href="/wiki/index.php/Main_Page"
title="Visit the main page"></a>
</div>
<nav id="p-navigation" class="vector-menu mw-portlet mw-portlet-navigation vector-menu-portal portal" aria-labelledby="p-navigation-label" role="navigation" >
<h3
id="p-navigation-label"
class="vector-menu-heading "
>
<span class="vector-menu-heading-label">Navigation</span>
</h3>
<div class="vector-menu-content">
<ul class="vector-menu-content-list"><li id="n-mainpage-description" class="mw-list-item"><a href="/wiki/index.php/Main_Page" title="Visit the main page [z]" accesskey="z"><span>Main page</span></a></li><li id="n-recentchanges" class="mw-list-item"><a href="/wiki/index.php/Special:RecentChanges" title="A list of recent changes in the wiki [r]" accesskey="r"><span>Recent changes</span></a></li><li id="n-randompage" class="mw-list-item"><a href="/wiki/index.php/Special:Random" title="Load a random page [x]" accesskey="x"><span>Random page</span></a></li><li id="n-help-mediawiki" class="mw-list-item"><a href="https://www.mediawiki.org/wiki/Special:MyLanguage/Help:Contents"><span>Help about MediaWiki</span></a></li></ul>
</div>
</nav>
<nav id="p-tb" class="vector-menu mw-portlet mw-portlet-tb vector-menu-portal portal" aria-labelledby="p-tb-label" role="navigation" >
<h3
id="p-tb-label"
class="vector-menu-heading "
>
<span class="vector-menu-heading-label">Tools</span>
</h3>
<div class="vector-menu-content">
<ul class="vector-menu-content-list"><li id="t-whatlinkshere" class="mw-list-item"><a href="/wiki/index.php/Special:WhatLinksHere/KANJIDIC_Project" title="A list of all wiki pages that link here [j]" accesskey="j"><span>What links here</span></a></li><li id="t-recentchangeslinked" class="mw-list-item"><a href="/wiki/index.php/Special:RecentChangesLinked/KANJIDIC_Project" rel="nofollow" title="Recent changes in pages linked from this page [k]" accesskey="k"><span>Related changes</span></a></li><li id="t-specialpages" class="mw-list-item"><a href="/wiki/index.php/Special:SpecialPages" title="A list of all special pages [q]" accesskey="q"><span>Special pages</span></a></li><li id="t-print" class="mw-list-item"><a href="javascript:print();" rel="alternate" title="Printable version of this page [p]" accesskey="p"><span>Printable version</span></a></li><li id="t-permalink" class="mw-list-item"><a href="/wiki/index.php?title=KANJIDIC_Project&oldid=954" title="Permanent link to this revision of this page"><span>Permanent link</span></a></li><li id="t-info" class="mw-list-item"><a href="/wiki/index.php?title=KANJIDIC_Project&action=info" title="More information about this page"><span>Page information</span></a></li></ul>
</div>
</nav>
</div>
</div>
<footer id="footer" class="mw-footer" role="contentinfo" >
<ul id="footer-info">
<li id="footer-info-lastmod"> This page was last edited on 22 January 2023, at 10:26.</li>
</ul>
<ul id="footer-places">
<li id="footer-places-privacy"><a href="/wiki/index.php/EDRDG_Wiki:Privacy_policy">Privacy policy</a></li>
<li id="footer-places-about"><a href="/wiki/index.php/EDRDG_Wiki:About">About EDRDG Wiki</a></li>
<li id="footer-places-disclaimer"><a href="/wiki/index.php/EDRDG_Wiki:General_disclaimer">Disclaimers</a></li>
</ul>
<ul id="footer-icons" class="noprint">
<li id="footer-poweredbyico"><a href="https://www.mediawiki.org/"><img src="/wiki/resources/assets/poweredby_mediawiki_88x31.png" alt="Powered by MediaWiki" srcset="/wiki/resources/assets/poweredby_mediawiki_132x47.png 1.5x, /wiki/resources/assets/poweredby_mediawiki_176x62.png 2x" width="88" height="31" loading="lazy"/></a></li>
</ul>
</footer>
<script>(RLQ=window.RLQ||[]).push(function(){mw.config.set({"wgPageParseReport":{"limitreport":{"cputime":"0.043","walltime":"0.043","ppvisitednodes":{"value":47,"limit":1000000},"postexpandincludesize":{"value":0,"limit":2097152},"templateargumentsize":{"value":0,"limit":2097152},"expansiondepth":{"value":2,"limit":100},"expensivefunctioncount":{"value":0,"limit":100},"unstrip-depth":{"value":0,"limit":20},"unstrip-size":{"value":0,"limit":5000000},"timingprofile":["100.00% 0.000 1 -total"]},"cachereport":{"timestamp":"20240805054918","ttl":86400,"transientcontent":false}}});mw.config.set({"wgBackendResponseTime":599});});</script>
</body>
</html>
|