1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153
|
use strict;
use utf8;
BEGIN { binmode STDOUT, ':utf8'; binmode STDERR, ':utf8'; }
use Test::More;
use Net::IDN::Punycode ':all';
BEGIN {
plan skip_all => 'no XS version' if eval {
\&Net::IDN::Punycode::encode_punycode ==
\&Net::IDN::Punycode::PP::encode_punycode; }
}
use Test::NoWarnings;
our @idna = (
["Arabic (Egyptian)",
"\x{0644}\x{064A}\x{0647}\x{0645}\x{0627}\x{0628}\x{062A}\x{0643}".
"\x{0644}\x{0645}\x{0648}\x{0634}\x{0639}\x{0631}\x{0628}\x{064A}\x{061F}",
"egbpdaj6bu4bxfgehfvwxn", 0, 0, 1, 1 ],
["Chinese (simplified)",
"\x{4ED6}\x{4EEC}\x{4E3A}\x{4EC0}\x{4E48}\x{4E0D}\x{8BF4}\x{4E2D}".
"\x{6587}",
"ihqwcrb4cv8a8dqg056pqjye", 0, 0, 1, 1 ],
["Chinese (traditional)",
"\x{4ED6}\x{5011}\x{7232}\x{4EC0}\x{9EBD}\x{4E0D}\x{8AAA}\x{4E2D}".
"\x{6587}",
"ihqwctvzc91f659drss3x8bo0yb", 0, 0, 1, 1 ],
["Czech",
"\x{0050}\x{0072}\x{006F}\x{010D}\x{0070}\x{0072}\x{006F}\x{0073}".
"\x{0074}\x{011B}\x{006E}\x{0065}\x{006D}\x{006C}\x{0075}\x{0076}\x{00ED}".
"\x{010D}\x{0065}\x{0073}\x{006B}\x{0079}",
"Proprostnemluvesky-uyb24dma41a", 0, 0, 1, 1 ],
["Hebrew",
"\x{05DC}\x{05DE}\x{05D4}\x{05D4}\x{05DD}\x{05E4}\x{05E9}\x{05D5}".
"\x{05D8}\x{05DC}\x{05D0}\x{05DE}\x{05D3}\x{05D1}\x{05E8}\x{05D9}\x{05DD}".
"\x{05E2}\x{05D1}\x{05E8}\x{05D9}\x{05EA}",
"4dbcagdahymbxekheh6e0a7fei0b", 0, 0, 1, 1 ],
["Hindi (Devanagari)",
"\x{092F}\x{0939}\x{0932}\x{094B}\x{0917}\x{0939}\x{093F}\x{0928}".
"\x{094D}\x{0926}\x{0940}\x{0915}\x{094D}\x{092F}\x{094B}\x{0902}\x{0928}".
"\x{0939}\x{0940}\x{0902}\x{092C}\x{094B}\x{0932}\x{0938}\x{0915}\x{0924}".
"\x{0947}\x{0939}\x{0948}\x{0902}",
"i1baa7eci9glrd9b2ae1bj0hfcgg6iyaf8o0a1dig0cd", 0, 0, 1 ],
["Japanese (kanji and hiragana)",
"\x{306A}\x{305C}\x{307F}\x{3093}\x{306A}\x{65E5}\x{672C}\x{8A9E}".
"\x{3092}\x{8A71}\x{3057}\x{3066}\x{304F}\x{308C}\x{306A}\x{3044}\x{306E}".
"\x{304B}",
"n8jok5ay5dzabd5bym9f0cm5685rrjetr6pdxa", 0, 0, 1 ],
["Russian (Cyrillic)",
"\x{043F}\x{043E}\x{0447}\x{0435}\x{043C}\x{0443}\x{0436}\x{0435}".
"\x{043E}\x{043D}\x{0438}\x{043D}\x{0435}\x{0433}\x{043E}\x{0432}\x{043E}".
"\x{0440}\x{044F}\x{0442}\x{043F}\x{043E}\x{0440}\x{0443}\x{0441}\x{0441}".
"\x{043A}\x{0438}",
"b1abfaaepdrnnbgefbadotcwatmq2g4l", 0, 0, 1, 1 ],
["Spanish",
"\x{0050}\x{006F}\x{0072}\x{0071}\x{0075}\x{00E9}\x{006E}\x{006F}".
"\x{0070}\x{0075}\x{0065}\x{0064}\x{0065}\x{006E}\x{0073}\x{0069}\x{006D}".
"\x{0070}\x{006C}\x{0065}\x{006D}\x{0065}\x{006E}\x{0074}\x{0065}\x{0068}".
"\x{0061}\x{0062}\x{006C}\x{0061}\x{0072}\x{0065}\x{006E}\x{0045}\x{0073}".
"\x{0070}\x{0061}\x{00F1}\x{006F}\x{006C}",
"PorqunopuedensimplementehablarenEspaol-fmd56a", 0, 0, 1 ],
["Vietnamese",
"\x{0054}\x{1EA1}\x{0069}\x{0073}\x{0061}\x{006F}\x{0068}\x{1ECD}".
"\x{006B}\x{0068}\x{00F4}\x{006E}\x{0067}\x{0074}\x{0068}\x{1EC3}\x{0063}".
"\x{0068}\x{1EC9}\x{006E}\x{00F3}\x{0069}\x{0074}\x{0069}\x{1EBF}\x{006E}".
"\x{0067}\x{0056}\x{0069}\x{1EC7}\x{0074}",
"TisaohkhngthchnitingVit-kjcr8268qyxafd2f1b9g", 0, 0, 1 ],
["Japanese",
"\x{0033}\x{5E74}\x{0042}\x{7D44}\x{91D1}\x{516B}\x{5148}\x{751F}",
"3B-ww4c5e180e575a65lsy2b", 0, 0, 1, 1 ],
["Japanese",
"\x{5B89}\x{5BA4}\x{5948}\x{7F8E}\x{6075}\x{002D}\x{0077}\x{0069}".
"\x{0074}\x{0068}\x{002D}\x{0053}\x{0055}\x{0050}\x{0045}\x{0052}\x{002D}".
"\x{004D}\x{004F}\x{004E}\x{004B}\x{0045}\x{0059}\x{0053}",
"-with-SUPER-MONKEYS-pc58ag80a8qai00g7n9n", 0, 0, 1 ],
["Japanese",
"\x{0048}\x{0065}\x{006C}\x{006C}\x{006F}\x{002D}\x{0041}\x{006E}".
"\x{006F}\x{0074}\x{0068}\x{0065}\x{0072}\x{002D}\x{0057}\x{0061}\x{0079}".
"\x{002D}\x{305D}\x{308C}\x{305E}\x{308C}\x{306E}\x{5834}\x{6240}",
"Hello-Another-Way--fc4qua05auwb3674vfr0b", 0, 0, 1 ],
["Japanese",
"\x{3072}\x{3068}\x{3064}\x{5C4B}\x{6839}\x{306E}\x{4E0B}\x{0032}",
"2-u9tlzr9756bt3uc0v", 0, 0, 1, 1 ],
["Japanese",
"\x{004D}\x{0061}\x{006A}\x{0069}\x{3067}\x{004B}\x{006F}\x{0069}".
"\x{3059}\x{308B}\x{0035}\x{79D2}\x{524D}",
"MajiKoi5-783gue6qz075azm5e", 0, 0, 1, 1 ],
["Japanese",
"\x{30D1}\x{30D5}\x{30A3}\x{30FC}\x{0064}\x{0065}\x{30EB}\x{30F3}".
"\x{30D0}",
"de-jg4avhby1noc0d", 0, 0, 1, 1 ],
["Japanese",
"\x{305D}\x{306E}\x{30B9}\x{30D4}\x{30FC}\x{30C9}\x{3067}",
"d9juau41awczczp", 0, 0, 1, 1 ],
["Greek",
"\x{03b5}\x{03bb}\x{03bb}\x{03b7}\x{03bd}\x{03b9}\x{03ba}\x{03ac}",
"hxargifdar", 0, 0, 1, 1 ],
["Maltese (Malti)",
"\x{0062}\x{006f}\x{006e}\x{0121}\x{0075}\x{0073}\x{0061}\x{0127}".
"\x{0127}\x{0061}",
"bonusaa-5bb1da", 0, 0, 1, 1 ],
["Russian (Cyrillic)",
"\x{043f}\x{043e}\x{0447}\x{0435}\x{043c}\x{0443}\x{0436}\x{0435}".
"\x{043e}\x{043d}\x{0438}\x{043d}\x{0435}\x{0433}\x{043e}\x{0432}\x{043e}".
"\x{0440}\x{044f}\x{0442}\x{043f}\x{043e}\x{0440}\x{0443}\x{0441}\x{0441}".
"\x{043a}\x{0438}",
"b1abfaaepdrnnbgefbadotcwatmq2g4l", 0, 0, 1, 1 ],
## Test vector from IdnaTest.txt
["Hebrew (combining)",
"\x{05D0}\x{0308}",
"ssa73l",
],
['U+094D',
"a\x{094D}b",
"ab-fsf",
],
);
my $tests = 2 * (scalar @idna);
plan tests => 1 + $tests;
foreach my $test (@idna)
{
my ($comment,$in,$out,$allowunassigned,$usestd3asciirules,$toascii,$tounicode) = @{$test};
is(encode_punycode($in), $out, $comment.' (encode_punycode)');
is(decode_punycode($out), $in, $comment.' (decode_punycode)');
}
# Test vectors extracted from:
#
# Nameprep and IDNA Test Vectors
# draft-josefsson-idn-test-vectors
#
# Copyright (C) The Internet Society (2003). All Rights Reserved.
#
# This document and translations of it may be copied and furnished
# to others, and derivative works that comment on or otherwise
# explain it or assist in its implementation may be prepared,
# copied, published and distributed, in whole or in part, without
# restriction of any kind, provided that the above copyright
# notice and this paragraph are included on all such copies and
# derivative works. However, this document itself may not be
# modified in any way, such as by removing the copyright notice or
# references to the Internet Society or other Internet
# organizations, except as needed for the purpose of developing
# Internet standards in which case the procedures for copyrights
# defined in the Internet Standards process must be followed, or
# as required to translate it into languages other than English.
|