1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 1032 1033 1034 1035 1036 1037 1038 1039 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057 1058 1059 1060 1061 1062 1063 1064 1065 1066 1067 1068 1069 1070 1071 1072 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085 1086 1087 1088 1089 1090 1091 1092 1093 1094 1095 1096 1097 1098 1099 1100 1101 1102 1103 1104 1105 1106 1107 1108 1109 1110 1111 1112 1113 1114 1115 1116 1117 1118 1119 1120 1121 1122 1123 1124 1125 1126 1127 1128 1129 1130 1131 1132 1133 1134 1135 1136 1137 1138 1139 1140 1141 1142 1143 1144 1145 1146 1147 1148 1149 1150 1151 1152 1153 1154 1155 1156 1157 1158 1159 1160 1161 1162 1163 1164 1165 1166 1167 1168 1169 1170 1171 1172 1173 1174 1175 1176 1177 1178 1179 1180 1181 1182 1183 1184 1185 1186 1187 1188 1189 1190 1191 1192 1193 1194 1195 1196 1197 1198 1199 1200 1201 1202 1203 1204 1205 1206 1207 1208 1209 1210 1211 1212 1213 1214 1215 1216 1217 1218 1219 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239 1240 1241 1242 1243 1244 1245 1246 1247 1248 1249 1250 1251 1252 1253 1254 1255 1256 1257 1258 1259 1260 1261 1262 1263 1264 1265 1266 1267 1268 1269 1270 1271 1272 1273 1274 1275 1276 1277 1278 1279 1280 1281 1282 1283 1284 1285 1286 1287 1288 1289 1290 1291 1292 1293 1294 1295 1296 1297 1298 1299 1300 1301 1302 1303 1304 1305 1306 1307 1308 1309 1310 1311 1312 1313 1314 1315 1316 1317 1318 1319 1320 1321 1322 1323 1324 1325 1326 1327 1328 1329 1330 1331 1332 1333 1334 1335 1336 1337 1338 1339 1340 1341 1342 1343 1344 1345 1346 1347 1348 1349 1350 1351 1352 1353 1354 1355 1356 1357 1358 1359 1360 1361 1362 1363 1364 1365 1366 1367 1368 1369 1370 1371 1372 1373 1374 1375 1376 1377 1378 1379 1380 1381 1382 1383 1384 1385 1386 1387 1388 1389 1390 1391 1392 1393 1394 1395 1396 1397 1398 1399 1400 1401 1402 1403 1404 1405 1406 1407 1408 1409 1410 1411 1412 1413 1414 1415 1416 1417 1418 1419 1420 1421 1422 1423 1424 1425 1426 1427 1428 1429 1430 1431 1432 1433 1434 1435 1436 1437 1438 1439 1440 1441 1442 1443 1444 1445 1446 1447 1448 1449 1450 1451 1452 1453 1454 1455 1456 1457 1458 1459 1460 1461 1462 1463 1464 1465 1466 1467 1468 1469 1470 1471 1472 1473 1474 1475 1476 1477 1478 1479 1480 1481 1482 1483 1484 1485 1486 1487 1488 1489 1490 1491 1492 1493 1494 1495 1496 1497 1498 1499 1500 1501 1502 1503 1504 1505 1506 1507 1508 1509 1510 1511 1512 1513 1514 1515 1516 1517 1518 1519 1520 1521 1522 1523 1524 1525 1526 1527 1528 1529 1530 1531 1532 1533 1534 1535 1536 1537 1538 1539 1540 1541 1542 1543 1544 1545 1546 1547 1548 1549 1550 1551 1552 1553 1554 1555 1556 1557 1558 1559 1560 1561 1562 1563 1564 1565 1566 1567 1568 1569 1570 1571 1572 1573 1574 1575 1576 1577 1578 1579 1580 1581 1582 1583 1584 1585 1586 1587 1588 1589 1590 1591 1592 1593 1594 1595 1596 1597 1598 1599 1600 1601 1602 1603 1604 1605 1606 1607 1608 1609 1610 1611 1612 1613 1614 1615 1616 1617 1618 1619 1620 1621 1622 1623 1624 1625 1626 1627 1628 1629 1630 1631 1632 1633 1634 1635 1636 1637 1638 1639 1640 1641 1642 1643 1644 1645 1646 1647 1648 1649 1650 1651 1652 1653 1654 1655 1656 1657 1658 1659 1660 1661 1662 1663 1664 1665 1666 1667 1668 1669 1670 1671 1672 1673 1674 1675 1676 1677 1678 1679 1680 1681 1682 1683 1684 1685 1686 1687 1688 1689 1690 1691 1692 1693 1694 1695 1696 1697 1698 1699 1700 1701 1702 1703 1704 1705 1706 1707 1708 1709 1710 1711 1712 1713 1714 1715 1716 1717 1718 1719 1720 1721 1722 1723 1724 1725 1726 1727 1728 1729 1730 1731 1732 1733 1734 1735 1736 1737 1738 1739 1740 1741 1742 1743 1744 1745 1746 1747 1748 1749 1750 1751 1752 1753 1754 1755 1756 1757 1758 1759 1760 1761 1762 1763 1764 1765 1766 1767 1768 1769 1770 1771 1772 1773 1774 1775 1776 1777 1778 1779 1780 1781 1782 1783 1784 1785 1786 1787 1788 1789 1790 1791 1792 1793 1794 1795 1796 1797 1798 1799 1800 1801 1802 1803 1804 1805 1806 1807 1808 1809 1810 1811 1812 1813 1814 1815 1816 1817 1818 1819 1820 1821 1822 1823 1824 1825 1826 1827 1828 1829 1830 1831 1832 1833 1834 1835 1836 1837 1838 1839 1840 1841 1842 1843 1844 1845 1846 1847 1848 1849 1850 1851 1852 1853 1854 1855 1856 1857 1858 1859 1860 1861 1862 1863 1864 1865 1866 1867 1868 1869 1870 1871 1872 1873 1874 1875 1876 1877 1878 1879 1880 1881 1882 1883 1884 1885 1886 1887 1888 1889 1890 1891 1892 1893 1894 1895 1896 1897 1898 1899 1900 1901 1902 1903 1904 1905 1906 1907 1908 1909 1910 1911 1912 1913 1914 1915 1916 1917 1918 1919 1920 1921 1922 1923 1924 1925 1926 1927 1928 1929 1930 1931 1932 1933 1934 1935 1936 1937 1938 1939 1940 1941 1942 1943 1944 1945 1946 1947 1948 1949 1950 1951 1952 1953 1954 1955 1956 1957 1958 1959 1960 1961 1962 1963 1964 1965 1966 1967 1968 1969 1970 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 2025 2026 2027 2028 2029 2030 2031 2032 2033 2034 2035 2036 2037 2038 2039 2040 2041 2042 2043 2044 2045 2046 2047 2048 2049 2050 2051 2052 2053 2054 2055 2056 2057 2058 2059 2060 2061 2062 2063 2064 2065 2066 2067 2068 2069 2070 2071 2072 2073 2074 2075 2076 2077 2078 2079 2080 2081 2082 2083 2084 2085 2086 2087 2088 2089 2090 2091 2092 2093 2094 2095 2096 2097 2098 2099 2100 2101 2102 2103 2104 2105 2106 2107 2108 2109 2110 2111 2112 2113 2114 2115 2116 2117 2118 2119 2120 2121 2122 2123 2124 2125 2126 2127 2128 2129 2130 2131 2132 2133 2134 2135 2136 2137 2138 2139 2140 2141 2142 2143 2144 2145 2146 2147 2148 2149 2150 2151 2152 2153 2154 2155 2156 2157 2158 2159 2160 2161 2162 2163 2164 2165 2166 2167 2168 2169 2170 2171 2172 2173 2174 2175 2176 2177 2178 2179 2180 2181 2182 2183 2184 2185 2186 2187 2188 2189 2190 2191 2192 2193 2194 2195 2196 2197 2198 2199 2200 2201 2202 2203 2204 2205 2206 2207 2208 2209 2210 2211 2212 2213 2214 2215 2216 2217 2218 2219 2220 2221 2222 2223 2224 2225 2226 2227 2228 2229 2230 2231 2232 2233 2234 2235 2236 2237 2238 2239 2240 2241 2242 2243 2244 2245 2246 2247 2248 2249 2250 2251 2252 2253 2254 2255 2256 2257 2258 2259 2260 2261 2262 2263 2264 2265 2266 2267 2268 2269 2270 2271 2272 2273 2274 2275 2276 2277 2278 2279 2280 2281 2282 2283 2284 2285 2286 2287 2288 2289 2290 2291 2292 2293 2294 2295 2296 2297 2298 2299 2300 2301 2302 2303 2304 2305 2306 2307 2308 2309 2310 2311 2312 2313 2314 2315 2316 2317 2318 2319 2320 2321 2322 2323 2324 2325 2326 2327 2328 2329 2330 2331 2332 2333 2334 2335 2336 2337 2338 2339 2340 2341 2342 2343 2344 2345 2346 2347 2348 2349 2350 2351 2352 2353 2354 2355 2356 2357 2358 2359 2360 2361 2362 2363 2364 2365 2366 2367 2368 2369 2370 2371 2372 2373 2374 2375 2376 2377 2378 2379 2380 2381 2382 2383 2384 2385 2386 2387 2388 2389 2390 2391 2392 2393 2394 2395 2396 2397 2398 2399 2400 2401 2402 2403 2404 2405 2406 2407 2408 2409 2410 2411 2412 2413 2414 2415 2416 2417 2418 2419 2420 2421 2422 2423 2424 2425 2426 2427 2428 2429 2430 2431 2432 2433 2434 2435 2436 2437 2438 2439 2440 2441 2442 2443 2444 2445 2446 2447 2448 2449 2450 2451 2452 2453 2454 2455 2456 2457 2458 2459 2460 2461 2462 2463 2464 2465 2466 2467 2468 2469 2470 2471 2472 2473 2474 2475 2476 2477 2478 2479 2480 2481 2482 2483 2484 2485 2486 2487 2488 2489 2490 2491 2492 2493 2494 2495 2496 2497 2498 2499 2500 2501 2502 2503 2504 2505 2506 2507 2508 2509 2510 2511 2512 2513 2514 2515 2516 2517 2518 2519 2520 2521 2522 2523 2524 2525 2526 2527 2528 2529 2530 2531 2532 2533 2534 2535 2536 2537 2538 2539 2540 2541 2542 2543 2544 2545 2546 2547 2548 2549 2550 2551 2552 2553 2554 2555 2556 2557 2558 2559 2560 2561 2562 2563 2564 2565 2566 2567 2568 2569 2570 2571 2572 2573 2574 2575 2576 2577 2578 2579 2580 2581 2582 2583 2584 2585 2586 2587 2588 2589 2590 2591 2592 2593 2594 2595 2596 2597 2598 2599 2600 2601 2602 2603 2604 2605 2606 2607 2608 2609 2610 2611 2612 2613 2614 2615 2616 2617 2618 2619 2620 2621 2622 2623 2624 2625 2626 2627 2628 2629 2630 2631 2632 2633 2634 2635 2636 2637 2638 2639 2640 2641 2642 2643 2644 2645 2646 2647 2648 2649 2650 2651 2652 2653 2654 2655 2656 2657 2658 2659 2660 2661 2662 2663 2664 2665 2666 2667 2668 2669 2670 2671 2672 2673 2674 2675 2676 2677 2678 2679 2680 2681 2682 2683 2684 2685 2686 2687 2688 2689 2690 2691 2692 2693 2694 2695 2696 2697 2698 2699 2700 2701 2702 2703 2704 2705 2706 2707 2708 2709 2710 2711 2712 2713 2714 2715 2716 2717 2718 2719 2720 2721 2722 2723 2724 2725 2726 2727 2728 2729 2730 2731 2732 2733 2734 2735 2736 2737 2738 2739 2740 2741 2742 2743 2744 2745 2746 2747 2748 2749 2750 2751 2752 2753 2754 2755 2756 2757 2758 2759 2760 2761 2762 2763 2764 2765 2766 2767 2768 2769 2770 2771 2772 2773 2774 2775 2776 2777 2778 2779 2780 2781 2782 2783 2784 2785 2786 2787 2788 2789 2790 2791 2792 2793 2794 2795 2796 2797 2798 2799 2800 2801 2802 2803 2804 2805 2806 2807 2808 2809 2810 2811 2812 2813 2814 2815 2816 2817 2818 2819 2820 2821 2822 2823 2824 2825 2826 2827 2828 2829 2830 2831 2832 2833 2834 2835 2836 2837 2838 2839 2840 2841 2842 2843 2844 2845 2846 2847 2848 2849 2850 2851 2852 2853 2854 2855 2856 2857 2858 2859 2860 2861 2862 2863 2864 2865 2866 2867 2868 2869 2870 2871 2872 2873 2874 2875 2876 2877 2878 2879 2880 2881 2882 2883 2884 2885 2886 2887 2888 2889 2890 2891 2892 2893 2894 2895 2896 2897 2898 2899 2900 2901 2902 2903 2904 2905 2906 2907 2908 2909 2910 2911 2912 2913 2914 2915 2916 2917 2918 2919 2920 2921 2922 2923 2924 2925 2926 2927 2928 2929 2930 2931 2932 2933 2934 2935 2936 2937 2938 2939 2940 2941 2942 2943 2944 2945 2946 2947 2948 2949 2950 2951 2952 2953 2954 2955 2956 2957 2958 2959 2960 2961 2962 2963 2964 2965 2966 2967 2968 2969 2970 2971 2972 2973 2974 2975 2976 2977 2978 2979 2980 2981 2982 2983 2984 2985 2986 2987 2988 2989 2990 2991 2992 2993 2994 2995 2996 2997 2998 2999 3000 3001 3002 3003 3004 3005 3006 3007 3008 3009 3010 3011 3012 3013 3014 3015 3016 3017 3018 3019 3020 3021 3022 3023 3024 3025 3026 3027 3028 3029 3030 3031 3032 3033 3034 3035 3036 3037 3038 3039 3040 3041 3042 3043 3044 3045 3046 3047 3048 3049 3050 3051 3052 3053 3054 3055 3056 3057 3058 3059 3060 3061 3062 3063 3064 3065 3066 3067 3068 3069 3070 3071 3072 3073 3074 3075 3076 3077 3078 3079 3080 3081 3082 3083 3084 3085 3086 3087 3088 3089 3090 3091 3092 3093 3094 3095 3096 3097 3098 3099 3100 3101 3102 3103 3104 3105 3106 3107 3108 3109 3110 3111 3112 3113 3114 3115 3116 3117 3118 3119 3120 3121 3122 3123 3124 3125 3126 3127 3128 3129 3130 3131 3132 3133 3134 3135 3136 3137 3138 3139 3140 3141 3142 3143 3144 3145 3146 3147 3148 3149 3150 3151 3152 3153 3154 3155 3156 3157 3158 3159 3160 3161 3162 3163 3164 3165 3166 3167 3168 3169 3170 3171 3172 3173 3174 3175 3176 3177 3178 3179 3180 3181 3182 3183 3184 3185 3186 3187 3188 3189 3190 3191 3192 3193 3194 3195 3196 3197 3198 3199 3200 3201 3202 3203 3204 3205 3206 3207 3208 3209 3210 3211 3212 3213 3214 3215 3216 3217 3218 3219 3220 3221 3222 3223 3224 3225 3226 3227 3228 3229 3230 3231 3232 3233 3234 3235 3236 3237 3238 3239 3240 3241 3242 3243 3244 3245 3246 3247 3248 3249 3250 3251 3252 3253 3254 3255 3256 3257 3258 3259 3260 3261 3262 3263 3264 3265 3266 3267 3268 3269 3270 3271 3272 3273 3274 3275 3276 3277 3278 3279 3280 3281 3282 3283 3284 3285 3286 3287 3288 3289 3290 3291 3292 3293 3294 3295 3296 3297 3298 3299 3300 3301 3302 3303 3304 3305 3306 3307 3308 3309 3310 3311 3312 3313 3314 3315 3316 3317 3318 3319 3320 3321 3322 3323 3324 3325 3326 3327 3328 3329 3330 3331 3332 3333 3334 3335 3336 3337 3338 3339 3340 3341 3342 3343 3344 3345 3346 3347 3348 3349 3350 3351 3352 3353 3354 3355 3356 3357 3358 3359 3360 3361 3362 3363 3364 3365 3366 3367 3368 3369 3370 3371 3372 3373 3374 3375 3376 3377 3378 3379 3380 3381 3382 3383 3384 3385 3386 3387 3388 3389 3390 3391 3392 3393 3394 3395 3396 3397 3398 3399 3400 3401 3402 3403 3404 3405 3406 3407 3408 3409 3410 3411 3412 3413 3414 3415 3416 3417 3418 3419 3420 3421 3422 3423 3424 3425 3426 3427 3428 3429 3430 3431 3432 3433 3434 3435 3436 3437 3438 3439 3440 3441 3442 3443 3444 3445 3446 3447 3448 3449 3450 3451 3452 3453 3454 3455 3456 3457 3458 3459 3460 3461 3462 3463 3464 3465 3466 3467 3468 3469 3470 3471 3472 3473 3474 3475 3476 3477 3478 3479 3480 3481 3482 3483 3484 3485 3486 3487 3488 3489 3490 3491 3492 3493 3494 3495 3496 3497 3498 3499 3500 3501 3502 3503 3504 3505 3506 3507 3508 3509 3510 3511 3512 3513 3514 3515 3516 3517 3518 3519 3520 3521 3522 3523 3524 3525 3526 3527 3528 3529 3530 3531 3532 3533 3534 3535 3536 3537 3538 3539 3540 3541 3542 3543 3544 3545 3546 3547 3548 3549 3550 3551 3552 3553 3554 3555 3556 3557 3558 3559 3560 3561 3562 3563 3564 3565 3566 3567 3568 3569 3570 3571 3572 3573 3574 3575 3576 3577 3578 3579 3580 3581 3582 3583 3584 3585 3586 3587 3588 3589 3590 3591 3592 3593 3594 3595 3596 3597 3598 3599 3600 3601 3602 3603 3604 3605 3606 3607 3608 3609 3610 3611 3612 3613 3614 3615 3616 3617 3618 3619 3620 3621 3622 3623 3624 3625 3626 3627 3628 3629 3630 3631 3632 3633 3634 3635 3636 3637 3638 3639 3640 3641 3642 3643 3644 3645 3646 3647 3648 3649 3650 3651 3652 3653 3654 3655 3656 3657 3658 3659 3660 3661 3662 3663 3664 3665 3666 3667 3668 3669 3670 3671 3672 3673 3674 3675 3676 3677 3678 3679 3680 3681 3682 3683 3684 3685 3686 3687 3688 3689 3690 3691 3692 3693 3694 3695 3696 3697 3698 3699 3700 3701 3702 3703 3704 3705 3706 3707 3708 3709 3710 3711 3712 3713 3714 3715 3716 3717 3718 3719 3720 3721 3722 3723 3724 3725 3726 3727 3728 3729 3730 3731 3732 3733 3734 3735 3736 3737 3738 3739 3740 3741 3742 3743 3744 3745 3746 3747 3748 3749 3750 3751 3752 3753 3754 3755 3756 3757 3758 3759 3760 3761 3762 3763 3764 3765 3766 3767 3768 3769 3770 3771 3772 3773 3774 3775 3776 3777 3778 3779 3780 3781 3782 3783 3784 3785 3786 3787 3788 3789 3790 3791 3792 3793 3794 3795 3796 3797 3798 3799 3800 3801 3802 3803 3804 3805 3806 3807 3808 3809 3810 3811 3812 3813 3814 3815 3816 3817 3818 3819 3820 3821 3822 3823 3824 3825 3826 3827 3828 3829 3830 3831 3832 3833 3834 3835 3836 3837 3838 3839 3840 3841 3842 3843 3844 3845 3846 3847 3848 3849 3850 3851 3852 3853 3854 3855 3856 3857 3858 3859 3860 3861 3862 3863 3864 3865 3866 3867 3868 3869 3870 3871 3872 3873 3874 3875 3876 3877 3878 3879 3880 3881 3882 3883 3884 3885 3886 3887 3888 3889 3890 3891 3892 3893 3894 3895 3896 3897 3898 3899 3900 3901 3902 3903 3904 3905 3906 3907 3908 3909 3910 3911 3912 3913 3914 3915 3916 3917 3918 3919 3920 3921 3922 3923 3924 3925 3926 3927 3928 3929 3930 3931 3932 3933 3934 3935 3936 3937 3938 3939 3940 3941 3942 3943 3944 3945 3946 3947 3948 3949 3950 3951 3952 3953 3954 3955 3956 3957 3958 3959 3960 3961 3962 3963 3964 3965 3966 3967 3968 3969 3970 3971 3972 3973 3974 3975 3976 3977 3978 3979 3980 3981 3982 3983 3984 3985 3986 3987 3988 3989 3990 3991 3992 3993 3994 3995 3996 3997 3998 3999 4000 4001 4002 4003 4004 4005 4006 4007 4008 4009 4010 4011 4012 4013 4014 4015 4016 4017 4018 4019 4020 4021 4022 4023 4024 4025 4026 4027 4028 4029 4030 4031 4032 4033 4034 4035 4036 4037 4038 4039 4040 4041 4042 4043 4044 4045 4046 4047 4048 4049 4050 4051 4052 4053 4054 4055 4056 4057 4058 4059 4060 4061 4062 4063 4064 4065 4066 4067 4068 4069 4070 4071 4072 4073 4074 4075 4076 4077 4078 4079 4080 4081 4082 4083 4084 4085 4086 4087 4088 4089 4090 4091 4092 4093 4094 4095 4096 4097 4098 4099 4100 4101 4102 4103 4104 4105 4106 4107 4108 4109 4110 4111 4112 4113 4114 4115 4116 4117 4118 4119 4120 4121 4122 4123 4124 4125 4126 4127 4128 4129 4130 4131 4132 4133 4134 4135 4136 4137 4138 4139 4140 4141 4142 4143 4144 4145 4146 4147 4148 4149 4150 4151 4152 4153 4154 4155 4156 4157 4158 4159 4160 4161 4162 4163 4164 4165 4166 4167 4168 4169 4170 4171 4172 4173 4174 4175 4176 4177 4178 4179 4180 4181 4182 4183 4184 4185 4186 4187 4188 4189 4190 4191 4192 4193 4194 4195 4196 4197 4198 4199 4200 4201 4202 4203 4204 4205 4206 4207 4208 4209 4210 4211 4212 4213 4214 4215 4216 4217 4218 4219 4220 4221 4222 4223 4224 4225 4226 4227 4228 4229 4230 4231 4232 4233 4234 4235 4236 4237 4238 4239 4240 4241 4242 4243 4244 4245 4246 4247 4248 4249 4250 4251 4252 4253 4254 4255 4256 4257 4258 4259 4260 4261 4262 4263 4264 4265 4266 4267 4268 4269 4270 4271 4272 4273 4274 4275 4276 4277 4278 4279 4280 4281 4282 4283 4284 4285 4286 4287 4288 4289 4290 4291 4292 4293 4294 4295 4296 4297 4298 4299 4300 4301 4302 4303 4304 4305 4306 4307 4308 4309 4310 4311 4312 4313 4314 4315 4316 4317 4318 4319 4320 4321 4322 4323 4324 4325 4326 4327 4328 4329 4330 4331 4332 4333 4334 4335 4336 4337 4338 4339 4340 4341 4342 4343 4344 4345 4346 4347 4348 4349 4350 4351 4352 4353 4354 4355 4356 4357 4358 4359 4360 4361 4362 4363 4364 4365 4366 4367 4368 4369 4370 4371 4372 4373 4374 4375 4376 4377 4378 4379 4380 4381 4382 4383 4384 4385 4386 4387 4388 4389 4390 4391 4392 4393 4394 4395 4396 4397 4398 4399 4400 4401 4402 4403 4404 4405 4406 4407 4408 4409 4410 4411 4412 4413 4414 4415 4416 4417 4418 4419 4420 4421 4422 4423 4424 4425 4426 4427 4428 4429 4430 4431 4432 4433 4434 4435 4436 4437 4438 4439 4440 4441 4442 4443 4444 4445 4446 4447 4448 4449 4450 4451 4452 4453 4454 4455 4456 4457 4458 4459 4460 4461 4462 4463 4464 4465 4466 4467 4468 4469 4470 4471 4472 4473 4474 4475 4476 4477 4478 4479 4480 4481 4482 4483 4484 4485 4486 4487 4488 4489 4490 4491 4492 4493 4494 4495 4496 4497 4498 4499 4500 4501 4502 4503 4504 4505 4506 4507 4508 4509 4510 4511 4512 4513 4514 4515 4516 4517 4518 4519 4520 4521 4522 4523 4524 4525 4526 4527 4528 4529 4530 4531 4532 4533 4534 4535 4536 4537 4538 4539 4540 4541 4542 4543 4544 4545 4546 4547 4548 4549 4550 4551 4552 4553 4554 4555 4556 4557 4558 4559 4560 4561 4562 4563 4564 4565 4566 4567 4568 4569 4570 4571 4572 4573 4574 4575 4576 4577 4578 4579 4580 4581 4582 4583 4584 4585 4586 4587 4588 4589 4590 4591 4592 4593 4594 4595 4596 4597 4598 4599 4600 4601 4602 4603 4604 4605 4606 4607 4608 4609 4610 4611 4612 4613 4614 4615 4616 4617 4618 4619 4620 4621 4622 4623 4624 4625 4626 4627 4628 4629 4630 4631 4632 4633 4634 4635 4636 4637 4638 4639 4640 4641 4642 4643 4644 4645 4646 4647 4648 4649 4650 4651 4652 4653 4654 4655 4656 4657 4658 4659 4660 4661 4662 4663 4664 4665 4666 4667 4668 4669 4670 4671 4672 4673 4674 4675 4676 4677 4678 4679 4680 4681 4682 4683 4684 4685 4686 4687 4688 4689 4690 4691 4692 4693 4694 4695 4696 4697 4698 4699 4700 4701 4702 4703 4704 4705 4706 4707 4708 4709 4710 4711 4712 4713 4714 4715 4716 4717 4718 4719 4720 4721 4722 4723 4724 4725 4726 4727 4728 4729 4730 4731 4732 4733 4734 4735 4736 4737 4738 4739 4740 4741 4742 4743 4744 4745 4746 4747 4748 4749 4750 4751 4752 4753 4754 4755 4756 4757 4758 4759 4760 4761 4762 4763 4764 4765 4766 4767 4768 4769 4770 4771 4772 4773 4774 4775 4776 4777 4778 4779 4780 4781 4782 4783 4784 4785 4786 4787 4788 4789 4790 4791 4792 4793 4794 4795 4796 4797 4798 4799 4800 4801 4802 4803 4804 4805 4806 4807 4808 4809 4810 4811 4812 4813 4814 4815 4816 4817 4818 4819 4820 4821 4822 4823 4824 4825 4826 4827 4828 4829 4830 4831 4832 4833 4834 4835 4836 4837 4838 4839 4840 4841 4842 4843 4844 4845 4846 4847 4848 4849 4850 4851 4852 4853 4854 4855 4856 4857 4858 4859 4860 4861 4862 4863 4864 4865 4866 4867 4868 4869 4870 4871 4872 4873 4874 4875 4876 4877 4878 4879 4880 4881 4882 4883 4884 4885 4886 4887 4888 4889 4890 4891 4892 4893 4894 4895 4896 4897 4898 4899 4900 4901 4902 4903 4904 4905 4906 4907 4908 4909 4910 4911 4912 4913 4914 4915 4916 4917 4918 4919 4920 4921 4922 4923 4924 4925 4926 4927 4928 4929 4930 4931 4932 4933 4934 4935 4936 4937 4938 4939 4940 4941 4942 4943 4944 4945 4946 4947 4948 4949 4950 4951 4952 4953 4954 4955 4956 4957 4958 4959 4960 4961 4962 4963 4964 4965 4966 4967 4968 4969 4970 4971 4972 4973 4974 4975 4976 4977 4978 4979 4980 4981 4982 4983 4984 4985 4986 4987 4988 4989 4990 4991 4992 4993 4994 4995 4996 4997 4998 4999 5000 5001 5002 5003 5004 5005 5006 5007 5008 5009 5010 5011 5012 5013 5014 5015 5016 5017 5018 5019 5020 5021 5022 5023 5024 5025 5026 5027 5028 5029 5030 5031 5032 5033 5034 5035 5036 5037 5038 5039 5040 5041 5042 5043 5044 5045 5046 5047 5048 5049 5050 5051 5052 5053 5054 5055 5056 5057 5058 5059 5060 5061 5062 5063 5064 5065 5066 5067 5068 5069 5070 5071 5072 5073 5074 5075 5076 5077 5078 5079 5080 5081 5082 5083 5084 5085 5086 5087 5088 5089 5090 5091 5092 5093 5094 5095 5096 5097 5098 5099 5100 5101 5102 5103 5104 5105 5106 5107 5108 5109 5110 5111 5112 5113 5114 5115 5116 5117 5118 5119 5120 5121 5122 5123 5124 5125 5126 5127 5128 5129 5130 5131 5132 5133 5134 5135 5136 5137 5138 5139 5140 5141 5142 5143 5144 5145 5146 5147 5148 5149 5150 5151 5152 5153 5154 5155 5156 5157 5158 5159 5160 5161 5162 5163 5164 5165 5166 5167 5168 5169 5170 5171 5172 5173 5174 5175 5176 5177 5178 5179 5180 5181 5182 5183 5184 5185 5186 5187 5188 5189 5190 5191 5192 5193 5194 5195 5196 5197 5198 5199 5200 5201 5202 5203 5204 5205 5206 5207 5208 5209 5210 5211 5212 5213 5214 5215 5216 5217 5218 5219 5220 5221 5222 5223 5224 5225 5226 5227 5228 5229 5230 5231 5232 5233 5234 5235 5236 5237 5238 5239 5240 5241 5242 5243 5244 5245 5246 5247 5248 5249 5250 5251 5252 5253 5254 5255 5256 5257 5258 5259 5260 5261 5262 5263 5264 5265 5266 5267 5268 5269 5270 5271 5272 5273 5274 5275 5276 5277 5278 5279 5280 5281 5282 5283 5284 5285 5286 5287 5288 5289 5290 5291 5292 5293 5294 5295 5296 5297 5298 5299 5300 5301 5302 5303 5304 5305 5306 5307 5308 5309 5310 5311 5312 5313 5314 5315 5316 5317 5318 5319 5320 5321 5322 5323 5324 5325 5326 5327 5328 5329 5330 5331 5332 5333 5334 5335 5336 5337 5338 5339 5340 5341 5342 5343 5344 5345 5346 5347 5348 5349 5350 5351 5352 5353 5354 5355 5356 5357 5358 5359 5360 5361 5362 5363 5364 5365 5366 5367 5368 5369 5370 5371 5372 5373 5374 5375 5376 5377 5378 5379 5380 5381 5382 5383 5384 5385 5386 5387 5388 5389 5390 5391 5392 5393 5394 5395 5396 5397 5398 5399 5400 5401 5402 5403 5404 5405 5406 5407 5408 5409 5410 5411 5412 5413 5414 5415 5416 5417 5418 5419 5420 5421 5422 5423 5424 5425 5426 5427 5428 5429 5430 5431 5432 5433 5434 5435 5436 5437 5438 5439 5440 5441 5442 5443 5444 5445 5446 5447 5448 5449 5450 5451 5452 5453 5454 5455 5456 5457 5458 5459 5460 5461 5462 5463 5464 5465 5466 5467 5468 5469 5470 5471 5472 5473 5474 5475 5476 5477 5478 5479 5480 5481 5482 5483 5484 5485 5486 5487 5488 5489 5490 5491 5492 5493 5494 5495 5496 5497 5498 5499 5500 5501 5502 5503 5504 5505 5506 5507 5508 5509 5510 5511 5512 5513 5514 5515 5516 5517 5518 5519 5520 5521 5522 5523 5524 5525 5526 5527 5528 5529 5530 5531 5532 5533 5534 5535 5536 5537 5538 5539 5540 5541 5542 5543 5544 5545 5546 5547 5548 5549 5550 5551 5552 5553 5554 5555 5556 5557 5558 5559 5560 5561 5562 5563 5564 5565 5566 5567 5568 5569 5570 5571 5572 5573 5574 5575 5576 5577 5578 5579 5580 5581 5582 5583 5584 5585 5586 5587 5588 5589 5590 5591 5592 5593 5594 5595 5596 5597 5598 5599 5600 5601 5602 5603 5604 5605 5606 5607 5608 5609 5610 5611 5612 5613 5614 5615 5616 5617 5618 5619 5620 5621 5622 5623 5624 5625 5626 5627 5628 5629 5630 5631 5632 5633 5634 5635 5636 5637 5638 5639 5640 5641 5642 5643 5644 5645 5646 5647 5648 5649 5650 5651 5652 5653 5654 5655 5656 5657 5658 5659 5660 5661 5662 5663 5664 5665 5666 5667 5668 5669 5670 5671 5672 5673 5674 5675 5676 5677 5678 5679 5680 5681 5682 5683 5684 5685 5686 5687 5688 5689 5690 5691 5692 5693 5694 5695 5696 5697 5698 5699 5700 5701 5702 5703 5704 5705 5706 5707 5708 5709 5710 5711 5712 5713 5714 5715 5716 5717 5718 5719 5720 5721 5722 5723 5724 5725 5726 5727 5728 5729 5730 5731 5732 5733 5734 5735 5736 5737 5738 5739 5740 5741 5742 5743 5744 5745 5746 5747 5748 5749 5750 5751 5752 5753 5754 5755 5756 5757 5758 5759 5760 5761 5762 5763 5764 5765 5766 5767 5768 5769 5770 5771 5772 5773 5774 5775 5776 5777 5778 5779 5780 5781 5782 5783 5784 5785 5786 5787 5788 5789 5790 5791 5792 5793 5794 5795 5796 5797 5798 5799 5800 5801 5802 5803 5804 5805 5806 5807 5808 5809 5810 5811 5812 5813 5814 5815 5816 5817 5818 5819 5820 5821 5822 5823 5824 5825 5826 5827 5828 5829 5830 5831 5832 5833 5834 5835 5836 5837 5838 5839 5840 5841 5842 5843 5844 5845 5846 5847 5848 5849 5850 5851 5852 5853 5854 5855 5856 5857 5858 5859 5860 5861 5862 5863 5864 5865 5866 5867 5868 5869 5870 5871 5872 5873 5874 5875 5876 5877 5878 5879 5880 5881 5882 5883 5884 5885 5886 5887 5888 5889 5890 5891 5892 5893 5894 5895 5896 5897 5898 5899 5900 5901 5902 5903 5904 5905 5906 5907 5908 5909 5910 5911 5912 5913 5914 5915 5916 5917 5918 5919 5920 5921 5922 5923 5924 5925 5926 5927 5928 5929 5930 5931 5932 5933 5934 5935 5936 5937 5938 5939 5940 5941 5942 5943 5944 5945 5946 5947 5948 5949 5950 5951 5952 5953 5954 5955 5956 5957 5958 5959 5960 5961 5962 5963 5964 5965 5966 5967 5968 5969 5970 5971 5972 5973 5974 5975 5976 5977 5978 5979 5980 5981 5982 5983 5984 5985 5986 5987 5988 5989 5990 5991 5992 5993 5994 5995 5996 5997 5998 5999 6000 6001 6002 6003 6004 6005 6006 6007 6008 6009 6010 6011 6012 6013 6014 6015 6016 6017 6018 6019 6020 6021 6022 6023 6024 6025 6026 6027 6028 6029 6030 6031 6032 6033 6034 6035 6036 6037 6038 6039 6040 6041 6042 6043 6044 6045 6046 6047 6048 6049 6050 6051 6052 6053 6054 6055 6056 6057 6058 6059 6060 6061 6062 6063 6064 6065 6066 6067 6068 6069 6070 6071 6072 6073 6074 6075 6076 6077 6078 6079 6080 6081 6082 6083 6084 6085 6086 6087 6088 6089 6090 6091 6092 6093 6094 6095 6096 6097 6098 6099 6100 6101 6102 6103 6104 6105 6106 6107 6108 6109 6110 6111 6112 6113 6114 6115 6116 6117 6118 6119 6120 6121 6122 6123 6124 6125 6126 6127 6128 6129 6130 6131 6132 6133 6134 6135 6136 6137 6138 6139 6140 6141 6142 6143 6144 6145 6146 6147 6148 6149 6150 6151 6152 6153 6154 6155 6156 6157 6158 6159 6160 6161 6162 6163 6164 6165 6166 6167 6168 6169 6170 6171 6172 6173 6174 6175 6176 6177 6178 6179 6180 6181 6182 6183 6184 6185 6186 6187 6188 6189 6190 6191 6192 6193 6194 6195 6196 6197 6198 6199 6200 6201 6202 6203 6204 6205 6206 6207 6208 6209 6210 6211 6212 6213 6214 6215 6216 6217 6218 6219 6220 6221 6222 6223 6224 6225 6226 6227 6228 6229 6230 6231 6232 6233 6234 6235 6236 6237 6238 6239 6240 6241 6242 6243 6244 6245 6246 6247 6248 6249 6250 6251 6252 6253 6254 6255 6256 6257 6258 6259 6260 6261 6262 6263 6264 6265 6266 6267 6268 6269 6270 6271 6272 6273 6274 6275 6276 6277 6278 6279 6280 6281 6282 6283 6284 6285 6286 6287 6288 6289 6290 6291 6292 6293 6294 6295 6296 6297 6298 6299 6300 6301 6302 6303 6304 6305 6306 6307 6308 6309 6310 6311 6312 6313 6314 6315 6316 6317 6318 6319 6320 6321 6322 6323 6324 6325 6326 6327 6328 6329 6330 6331 6332 6333 6334 6335 6336 6337 6338 6339 6340 6341 6342 6343 6344 6345 6346 6347 6348 6349 6350 6351 6352 6353 6354 6355 6356 6357 6358 6359 6360 6361 6362 6363 6364 6365 6366 6367 6368 6369 6370 6371 6372 6373 6374 6375 6376 6377 6378 6379 6380 6381 6382 6383 6384 6385 6386 6387 6388 6389 6390 6391 6392 6393 6394 6395 6396 6397 6398 6399 6400 6401 6402 6403 6404 6405 6406 6407 6408 6409 6410 6411 6412 6413 6414 6415 6416 6417 6418 6419 6420 6421 6422 6423 6424 6425 6426 6427 6428 6429 6430 6431 6432 6433 6434 6435 6436 6437 6438 6439 6440 6441 6442 6443 6444 6445 6446 6447 6448 6449 6450 6451 6452 6453 6454 6455 6456 6457 6458 6459 6460 6461 6462 6463 6464 6465 6466 6467 6468 6469 6470 6471 6472 6473 6474 6475 6476 6477 6478 6479 6480 6481 6482 6483 6484 6485 6486 6487 6488 6489 6490 6491 6492 6493 6494 6495 6496 6497 6498 6499 6500 6501 6502 6503 6504 6505 6506 6507 6508 6509 6510 6511 6512 6513 6514 6515 6516 6517 6518 6519 6520 6521 6522 6523 6524 6525 6526 6527 6528 6529 6530 6531 6532 6533 6534 6535 6536 6537 6538 6539 6540 6541 6542 6543 6544 6545 6546 6547 6548 6549 6550 6551 6552 6553 6554 6555 6556 6557 6558 6559 6560 6561 6562 6563 6564 6565 6566 6567 6568 6569 6570 6571 6572 6573 6574 6575 6576 6577 6578 6579 6580 6581 6582 6583 6584 6585 6586 6587 6588 6589 6590 6591 6592 6593 6594 6595 6596 6597 6598 6599 6600 6601 6602 6603 6604 6605 6606 6607 6608 6609 6610 6611 6612 6613 6614 6615 6616 6617 6618 6619 6620 6621 6622 6623 6624 6625 6626 6627 6628 6629 6630 6631 6632 6633 6634 6635 6636 6637 6638 6639 6640 6641 6642 6643 6644 6645 6646 6647 6648 6649 6650 6651 6652 6653 6654 6655 6656 6657 6658 6659 6660 6661 6662 6663 6664 6665 6666 6667 6668 6669 6670 6671 6672 6673 6674 6675 6676 6677 6678 6679 6680 6681 6682 6683 6684 6685 6686 6687 6688 6689 6690 6691 6692 6693 6694 6695 6696 6697 6698 6699 6700 6701 6702 6703 6704 6705 6706 6707 6708 6709 6710 6711 6712 6713 6714 6715 6716 6717 6718 6719 6720 6721 6722 6723 6724 6725 6726 6727 6728 6729 6730 6731 6732 6733 6734 6735 6736 6737 6738 6739 6740 6741 6742 6743 6744 6745 6746 6747 6748 6749 6750 6751 6752 6753 6754 6755 6756 6757 6758 6759 6760 6761 6762 6763 6764 6765 6766 6767 6768 6769 6770 6771 6772 6773 6774 6775 6776 6777 6778 6779 6780 6781 6782 6783 6784 6785 6786 6787 6788 6789 6790 6791 6792 6793 6794 6795 6796 6797 6798 6799 6800 6801 6802 6803 6804 6805 6806 6807 6808 6809 6810 6811 6812 6813 6814 6815 6816 6817 6818 6819 6820 6821 6822 6823 6824 6825 6826 6827 6828 6829 6830 6831 6832 6833 6834 6835 6836 6837 6838 6839 6840 6841 6842 6843 6844 6845 6846 6847 6848 6849 6850 6851 6852 6853 6854 6855 6856 6857 6858 6859 6860 6861 6862 6863 6864 6865 6866 6867 6868 6869 6870 6871 6872 6873 6874 6875 6876 6877 6878 6879 6880 6881 6882 6883 6884 6885 6886 6887 6888 6889 6890 6891 6892 6893 6894 6895 6896 6897 6898 6899 6900 6901 6902 6903 6904 6905 6906 6907 6908 6909 6910 6911 6912 6913 6914 6915 6916 6917 6918 6919 6920 6921 6922 6923 6924 6925 6926 6927 6928 6929 6930 6931 6932 6933 6934 6935 6936 6937 6938 6939 6940 6941 6942 6943 6944 6945 6946 6947 6948 6949 6950 6951 6952 6953 6954 6955 6956 6957 6958 6959 6960 6961 6962 6963 6964 6965 6966 6967 6968 6969 6970 6971 6972 6973 6974 6975 6976 6977 6978 6979 6980 6981 6982 6983 6984 6985 6986 6987 6988 6989 6990 6991 6992 6993 6994 6995 6996 6997 6998 6999 7000 7001 7002 7003 7004 7005 7006 7007 7008 7009 7010 7011 7012 7013 7014 7015 7016 7017 7018 7019 7020 7021 7022 7023 7024 7025 7026 7027 7028 7029 7030 7031 7032 7033 7034 7035 7036 7037 7038 7039 7040 7041 7042 7043 7044 7045 7046 7047 7048 7049 7050 7051 7052 7053 7054 7055 7056 7057 7058 7059 7060 7061 7062 7063 7064 7065 7066 7067 7068 7069 7070 7071 7072 7073 7074 7075 7076 7077 7078 7079 7080 7081 7082 7083 7084 7085 7086 7087 7088 7089 7090 7091 7092 7093 7094 7095 7096 7097 7098 7099 7100 7101 7102 7103 7104 7105 7106 7107 7108 7109 7110 7111 7112 7113 7114 7115 7116 7117 7118 7119 7120 7121 7122 7123 7124 7125 7126 7127 7128 7129 7130 7131 7132 7133 7134 7135 7136 7137 7138 7139 7140 7141 7142 7143 7144 7145 7146 7147 7148 7149 7150 7151 7152 7153 7154 7155 7156 7157 7158 7159 7160 7161 7162 7163 7164 7165 7166 7167 7168 7169 7170 7171 7172 7173 7174 7175 7176 7177 7178 7179 7180 7181 7182 7183 7184 7185 7186 7187 7188 7189 7190 7191 7192 7193 7194 7195 7196 7197 7198 7199 7200 7201 7202 7203 7204 7205 7206 7207 7208 7209 7210 7211 7212 7213 7214 7215 7216 7217 7218 7219 7220 7221 7222 7223 7224 7225 7226 7227 7228 7229 7230 7231 7232 7233 7234 7235 7236 7237 7238 7239 7240 7241 7242 7243 7244 7245 7246 7247 7248 7249 7250 7251 7252 7253 7254 7255 7256 7257 7258 7259 7260 7261 7262 7263 7264 7265 7266 7267 7268 7269 7270 7271 7272 7273 7274 7275 7276 7277 7278 7279 7280 7281 7282 7283 7284 7285 7286 7287 7288 7289 7290 7291 7292 7293 7294 7295 7296 7297 7298 7299 7300 7301 7302 7303 7304 7305 7306 7307 7308 7309 7310 7311 7312 7313 7314 7315 7316 7317 7318 7319 7320 7321 7322 7323 7324 7325 7326 7327 7328 7329 7330 7331 7332 7333 7334 7335 7336 7337 7338 7339 7340 7341 7342 7343 7344 7345 7346 7347 7348 7349 7350 7351 7352 7353 7354 7355 7356 7357 7358 7359 7360 7361 7362 7363 7364 7365 7366 7367 7368 7369 7370 7371 7372 7373 7374 7375 7376 7377 7378 7379 7380 7381 7382 7383 7384 7385 7386 7387 7388 7389 7390 7391 7392 7393 7394 7395 7396 7397 7398 7399 7400 7401 7402 7403 7404 7405 7406 7407 7408 7409 7410 7411 7412 7413 7414 7415 7416 7417 7418 7419 7420 7421 7422 7423 7424 7425 7426 7427 7428 7429 7430 7431 7432 7433 7434 7435 7436 7437 7438 7439 7440 7441 7442 7443 7444 7445 7446 7447 7448 7449 7450 7451 7452 7453 7454 7455 7456 7457 7458 7459 7460 7461 7462 7463 7464 7465 7466 7467 7468 7469 7470 7471 7472 7473 7474 7475 7476 7477 7478 7479 7480 7481 7482 7483 7484 7485 7486 7487 7488 7489 7490 7491 7492 7493 7494 7495 7496 7497 7498 7499 7500 7501 7502 7503 7504 7505 7506 7507 7508 7509 7510 7511 7512 7513 7514 7515 7516 7517 7518 7519 7520 7521 7522 7523 7524 7525 7526 7527 7528 7529 7530 7531 7532 7533 7534 7535 7536 7537 7538 7539 7540 7541 7542 7543 7544 7545 7546 7547 7548 7549 7550 7551 7552 7553 7554 7555 7556 7557 7558 7559 7560 7561 7562 7563 7564 7565 7566 7567 7568 7569 7570 7571 7572 7573 7574 7575 7576 7577 7578 7579 7580 7581 7582 7583 7584 7585 7586 7587 7588 7589 7590 7591 7592 7593 7594 7595 7596 7597 7598 7599 7600 7601 7602 7603 7604 7605 7606 7607 7608 7609 7610 7611 7612 7613 7614 7615 7616 7617 7618 7619 7620 7621 7622 7623 7624 7625 7626 7627 7628 7629 7630 7631 7632 7633 7634 7635 7636 7637 7638 7639 7640 7641 7642 7643 7644 7645 7646 7647 7648 7649 7650 7651 7652 7653 7654 7655 7656 7657 7658 7659 7660 7661 7662 7663 7664 7665 7666 7667 7668 7669 7670 7671 7672 7673 7674 7675 7676 7677 7678 7679 7680 7681 7682 7683 7684 7685 7686 7687 7688 7689 7690 7691 7692 7693 7694 7695 7696 7697 7698 7699 7700 7701 7702 7703 7704 7705 7706 7707 7708 7709 7710 7711 7712 7713 7714 7715 7716 7717 7718 7719 7720 7721 7722 7723 7724 7725 7726 7727 7728 7729 7730 7731 7732 7733 7734 7735 7736 7737 7738 7739 7740 7741 7742 7743 7744 7745 7746 7747 7748 7749 7750 7751 7752 7753 7754 7755 7756 7757 7758 7759 7760 7761 7762 7763 7764 7765 7766 7767 7768 7769 7770 7771 7772 7773 7774 7775 7776 7777 7778 7779 7780 7781 7782 7783 7784 7785 7786 7787 7788 7789 7790 7791 7792 7793 7794 7795 7796 7797 7798 7799 7800 7801 7802 7803 7804 7805 7806 7807 7808 7809 7810 7811 7812 7813 7814 7815 7816 7817 7818 7819 7820 7821 7822 7823 7824 7825 7826 7827 7828 7829 7830 7831 7832 7833 7834 7835 7836 7837 7838 7839 7840 7841 7842 7843 7844 7845 7846 7847 7848 7849 7850 7851 7852 7853 7854 7855 7856 7857 7858 7859 7860 7861 7862 7863 7864 7865 7866 7867 7868 7869 7870 7871 7872 7873 7874 7875 7876 7877 7878 7879 7880 7881 7882 7883 7884 7885 7886 7887 7888 7889 7890 7891 7892 7893 7894 7895 7896 7897 7898 7899 7900 7901 7902 7903 7904 7905 7906 7907 7908 7909 7910 7911 7912 7913 7914 7915 7916 7917 7918 7919 7920 7921 7922 7923 7924 7925 7926 7927 7928 7929 7930 7931 7932 7933 7934 7935 7936 7937 7938 7939 7940 7941 7942 7943 7944 7945 7946 7947 7948 7949 7950 7951 7952 7953 7954 7955 7956 7957 7958 7959 7960 7961 7962 7963 7964 7965 7966 7967 7968 7969 7970 7971 7972 7973 7974 7975 7976 7977 7978 7979 7980 7981 7982 7983 7984 7985 7986 7987 7988 7989 7990 7991 7992 7993 7994 7995 7996 7997 7998 7999 8000 8001 8002 8003 8004 8005 8006 8007 8008 8009 8010 8011 8012 8013 8014 8015 8016 8017 8018 8019 8020 8021 8022 8023 8024 8025 8026 8027 8028 8029 8030 8031 8032 8033 8034 8035 8036 8037 8038 8039 8040 8041 8042 8043 8044 8045 8046 8047 8048 8049 8050 8051 8052 8053 8054 8055 8056 8057 8058 8059 8060 8061 8062 8063 8064 8065 8066 8067 8068 8069 8070 8071 8072 8073 8074 8075 8076 8077 8078 8079 8080 8081 8082 8083 8084 8085 8086 8087 8088 8089 8090 8091 8092 8093 8094 8095 8096 8097 8098 8099 8100 8101 8102 8103 8104 8105 8106 8107 8108 8109 8110 8111 8112 8113 8114 8115 8116 8117 8118 8119 8120 8121 8122 8123 8124 8125 8126 8127 8128 8129 8130 8131 8132 8133 8134 8135 8136 8137 8138 8139 8140 8141 8142 8143 8144 8145 8146 8147 8148 8149 8150 8151 8152 8153 8154 8155 8156 8157 8158 8159 8160 8161 8162 8163 8164 8165 8166 8167 8168 8169 8170 8171 8172 8173 8174 8175 8176 8177 8178 8179 8180 8181 8182 8183 8184 8185 8186 8187 8188 8189 8190 8191 8192 8193 8194 8195 8196 8197 8198 8199 8200 8201 8202 8203 8204 8205 8206 8207 8208 8209 8210 8211 8212 8213 8214 8215 8216 8217 8218 8219 8220 8221 8222 8223 8224 8225 8226 8227 8228 8229 8230 8231 8232 8233 8234 8235 8236 8237 8238 8239 8240 8241 8242 8243 8244 8245 8246 8247 8248 8249 8250 8251 8252 8253 8254 8255 8256 8257 8258 8259 8260 8261 8262 8263 8264 8265 8266 8267 8268 8269 8270 8271 8272 8273 8274 8275 8276 8277 8278 8279 8280 8281 8282 8283 8284 8285 8286 8287 8288 8289 8290 8291 8292 8293 8294 8295 8296 8297 8298 8299 8300 8301 8302 8303 8304 8305 8306 8307 8308 8309 8310 8311 8312 8313 8314 8315 8316 8317 8318 8319 8320 8321 8322 8323 8324 8325 8326 8327 8328 8329 8330 8331 8332 8333 8334 8335 8336 8337 8338 8339 8340 8341 8342 8343 8344 8345 8346 8347 8348 8349 8350 8351 8352 8353 8354 8355 8356 8357 8358 8359 8360 8361 8362 8363 8364 8365 8366 8367 8368 8369 8370 8371 8372 8373 8374 8375 8376 8377 8378 8379 8380 8381 8382 8383 8384 8385 8386 8387 8388 8389 8390 8391 8392 8393 8394 8395 8396 8397 8398 8399 8400 8401 8402 8403 8404 8405 8406 8407 8408 8409 8410 8411 8412 8413 8414 8415 8416 8417 8418 8419 8420 8421 8422 8423 8424 8425 8426 8427 8428 8429 8430 8431 8432 8433 8434 8435 8436 8437 8438 8439 8440 8441 8442 8443 8444 8445 8446 8447 8448 8449 8450 8451 8452 8453 8454 8455 8456 8457 8458 8459 8460 8461 8462 8463 8464 8465 8466 8467 8468 8469 8470 8471 8472 8473 8474 8475 8476 8477 8478 8479 8480 8481 8482 8483 8484 8485 8486 8487 8488 8489 8490 8491 8492 8493 8494 8495 8496 8497 8498 8499 8500 8501 8502 8503 8504 8505 8506 8507 8508 8509 8510 8511 8512 8513 8514 8515 8516 8517 8518 8519 8520 8521 8522 8523 8524 8525 8526 8527 8528 8529 8530 8531 8532 8533 8534 8535 8536 8537 8538 8539 8540 8541 8542 8543 8544 8545 8546 8547 8548 8549 8550 8551 8552 8553 8554 8555 8556 8557 8558 8559 8560 8561 8562 8563 8564 8565 8566 8567 8568 8569 8570 8571 8572 8573 8574 8575 8576 8577 8578 8579 8580 8581 8582 8583 8584 8585 8586 8587 8588 8589 8590 8591 8592 8593 8594 8595 8596 8597 8598 8599 8600 8601 8602 8603 8604 8605 8606 8607 8608 8609 8610 8611 8612 8613 8614 8615 8616 8617 8618 8619 8620 8621 8622 8623 8624 8625 8626 8627 8628 8629 8630 8631 8632 8633 8634 8635 8636 8637 8638 8639 8640 8641 8642 8643 8644 8645 8646 8647 8648 8649 8650 8651 8652 8653 8654 8655 8656 8657 8658 8659 8660 8661 8662 8663 8664 8665 8666 8667 8668 8669 8670 8671 8672 8673 8674 8675 8676 8677 8678 8679 8680 8681 8682 8683 8684 8685 8686 8687 8688 8689 8690 8691 8692 8693 8694 8695 8696 8697 8698 8699 8700 8701 8702 8703 8704 8705 8706 8707 8708 8709 8710 8711 8712 8713 8714 8715 8716 8717 8718 8719 8720 8721 8722 8723 8724 8725 8726 8727 8728 8729 8730 8731 8732 8733 8734 8735 8736 8737 8738 8739 8740 8741 8742 8743 8744 8745 8746 8747 8748 8749 8750 8751 8752 8753 8754 8755 8756 8757 8758 8759 8760 8761 8762 8763 8764 8765 8766 8767 8768 8769 8770 8771 8772 8773 8774 8775 8776 8777 8778 8779 8780 8781 8782 8783 8784 8785 8786 8787 8788 8789 8790 8791 8792 8793 8794 8795 8796 8797 8798 8799 8800 8801 8802 8803 8804 8805 8806 8807 8808 8809 8810 8811 8812 8813 8814 8815 8816 8817 8818 8819 8820 8821 8822 8823 8824 8825 8826 8827 8828 8829 8830 8831 8832 8833 8834 8835 8836 8837 8838 8839 8840 8841 8842 8843 8844 8845 8846 8847 8848 8849 8850 8851 8852 8853 8854 8855 8856 8857 8858 8859 8860 8861 8862 8863 8864 8865 8866 8867 8868 8869 8870 8871 8872 8873 8874 8875 8876 8877 8878 8879 8880 8881 8882 8883 8884 8885 8886 8887 8888 8889 8890 8891 8892 8893 8894 8895 8896 8897 8898 8899 8900 8901 8902 8903 8904 8905 8906 8907 8908 8909 8910 8911 8912 8913 8914 8915 8916 8917 8918 8919 8920 8921 8922 8923 8924 8925 8926 8927 8928 8929 8930 8931 8932 8933 8934 8935 8936 8937 8938 8939 8940 8941 8942 8943 8944 8945 8946 8947 8948 8949 8950 8951 8952 8953 8954 8955 8956 8957 8958 8959 8960 8961 8962 8963 8964 8965 8966 8967 8968 8969 8970 8971 8972 8973 8974 8975 8976 8977 8978 8979 8980 8981 8982 8983 8984 8985 8986 8987 8988 8989 8990 8991 8992 8993 8994 8995 8996 8997 8998 8999 9000 9001 9002 9003 9004 9005 9006 9007 9008 9009 9010 9011 9012 9013 9014 9015 9016 9017 9018 9019 9020 9021 9022 9023 9024 9025 9026 9027 9028 9029 9030 9031 9032 9033 9034 9035 9036 9037 9038 9039 9040 9041 9042 9043 9044 9045 9046 9047 9048 9049 9050 9051 9052 9053 9054 9055 9056 9057 9058 9059 9060 9061 9062 9063 9064 9065 9066 9067 9068 9069 9070 9071 9072 9073 9074 9075 9076 9077 9078 9079 9080 9081 9082 9083 9084 9085 9086 9087 9088 9089 9090 9091 9092 9093 9094 9095 9096 9097 9098 9099 9100 9101 9102 9103 9104 9105 9106 9107 9108 9109 9110 9111 9112 9113 9114 9115 9116 9117 9118 9119 9120 9121 9122 9123 9124 9125 9126 9127 9128 9129 9130 9131 9132 9133 9134 9135 9136 9137 9138 9139 9140 9141 9142 9143 9144 9145 9146 9147 9148 9149 9150 9151 9152 9153 9154 9155 9156 9157 9158 9159 9160 9161 9162 9163 9164 9165 9166 9167 9168 9169 9170 9171 9172 9173 9174 9175 9176 9177 9178 9179 9180 9181 9182 9183 9184 9185 9186 9187 9188 9189 9190 9191 9192 9193 9194 9195 9196 9197 9198 9199 9200 9201 9202 9203 9204 9205 9206 9207 9208 9209 9210 9211 9212 9213 9214 9215 9216 9217 9218 9219 9220 9221 9222 9223 9224 9225 9226 9227 9228 9229 9230 9231 9232 9233 9234 9235 9236 9237 9238 9239 9240 9241 9242 9243 9244 9245 9246 9247 9248 9249 9250 9251 9252 9253 9254 9255 9256 9257 9258 9259 9260 9261 9262 9263 9264 9265 9266 9267 9268 9269 9270 9271 9272 9273 9274 9275 9276 9277 9278 9279 9280 9281 9282 9283 9284 9285 9286 9287 9288 9289 9290 9291 9292 9293 9294 9295 9296 9297 9298 9299 9300 9301 9302 9303 9304 9305 9306 9307 9308 9309 9310 9311 9312 9313 9314 9315 9316 9317 9318 9319 9320 9321 9322 9323 9324 9325 9326 9327 9328 9329 9330 9331 9332 9333 9334 9335 9336 9337 9338 9339 9340 9341 9342 9343 9344 9345 9346 9347 9348 9349 9350 9351 9352 9353 9354 9355 9356 9357 9358 9359 9360 9361 9362 9363 9364 9365 9366 9367 9368 9369 9370 9371 9372 9373 9374 9375 9376 9377 9378 9379 9380 9381 9382 9383 9384 9385 9386 9387 9388 9389 9390 9391 9392 9393 9394 9395 9396 9397 9398 9399 9400 9401 9402 9403 9404 9405 9406 9407 9408 9409 9410 9411 9412 9413 9414 9415 9416 9417 9418 9419 9420 9421 9422 9423 9424 9425 9426 9427 9428 9429 9430 9431 9432 9433 9434 9435 9436 9437 9438 9439 9440 9441 9442 9443 9444 9445 9446 9447 9448 9449 9450 9451 9452 9453 9454 9455 9456 9457 9458 9459 9460 9461 9462 9463 9464 9465 9466 9467 9468 9469 9470 9471 9472 9473 9474 9475 9476 9477 9478 9479 9480 9481 9482 9483 9484 9485 9486 9487 9488 9489 9490 9491 9492 9493 9494 9495 9496 9497 9498 9499 9500 9501 9502 9503 9504 9505 9506 9507 9508 9509 9510 9511 9512 9513 9514 9515 9516 9517 9518 9519 9520 9521 9522 9523 9524 9525 9526 9527 9528 9529 9530 9531 9532 9533 9534 9535 9536 9537 9538 9539 9540 9541 9542 9543 9544 9545 9546 9547 9548 9549 9550 9551 9552 9553 9554 9555 9556 9557 9558 9559 9560 9561 9562 9563 9564 9565 9566 9567 9568 9569 9570 9571 9572 9573 9574 9575 9576 9577 9578 9579 9580 9581 9582 9583 9584 9585 9586 9587 9588 9589 9590 9591 9592 9593 9594 9595 9596 9597 9598 9599 9600 9601 9602 9603 9604 9605 9606 9607 9608 9609 9610 9611 9612 9613 9614 9615 9616 9617 9618 9619 9620 9621 9622 9623 9624 9625 9626 9627 9628 9629 9630 9631 9632 9633 9634 9635 9636 9637 9638 9639 9640 9641 9642 9643 9644 9645 9646 9647 9648 9649 9650 9651 9652 9653 9654 9655 9656 9657 9658 9659 9660 9661 9662 9663 9664 9665 9666 9667 9668 9669 9670 9671 9672 9673 9674 9675 9676 9677 9678 9679 9680 9681 9682 9683 9684 9685 9686 9687 9688 9689 9690 9691 9692 9693 9694 9695 9696 9697 9698 9699 9700 9701 9702 9703 9704 9705 9706 9707 9708 9709 9710 9711 9712 9713 9714 9715 9716 9717 9718 9719 9720 9721 9722 9723 9724 9725 9726 9727 9728 9729 9730 9731 9732 9733 9734 9735 9736 9737 9738 9739 9740 9741 9742 9743 9744 9745 9746 9747 9748 9749 9750 9751 9752 9753 9754 9755 9756 9757 9758 9759 9760 9761 9762 9763 9764 9765 9766 9767 9768 9769 9770 9771 9772 9773 9774 9775 9776 9777 9778 9779 9780 9781 9782 9783 9784 9785 9786 9787 9788 9789 9790 9791 9792 9793 9794 9795 9796 9797 9798 9799 9800 9801 9802 9803 9804 9805 9806 9807 9808 9809 9810 9811 9812 9813 9814 9815 9816 9817 9818 9819 9820 9821 9822 9823 9824 9825 9826 9827 9828 9829 9830 9831 9832 9833 9834 9835 9836 9837 9838 9839 9840 9841 9842 9843 9844 9845 9846 9847 9848 9849 9850 9851 9852 9853 9854 9855 9856 9857 9858 9859 9860 9861 9862 9863 9864 9865 9866 9867 9868 9869 9870 9871 9872 9873 9874 9875 9876 9877 9878 9879 9880 9881 9882 9883 9884 9885 9886 9887 9888 9889 9890 9891 9892 9893 9894 9895 9896 9897 9898 9899 9900 9901 9902 9903 9904 9905 9906 9907 9908 9909 9910 9911 9912 9913 9914 9915 9916 9917 9918 9919 9920 9921 9922 9923 9924 9925 9926 9927 9928 9929 9930 9931 9932 9933 9934 9935 9936 9937 9938 9939 9940 9941 9942 9943 9944 9945 9946 9947 9948 9949 9950 9951 9952 9953 9954 9955 9956 9957 9958 9959 9960 9961 9962 9963 9964 9965 9966 9967 9968 9969 9970 9971 9972 9973 9974 9975 9976 9977 9978 9979 9980 9981 9982 9983 9984 9985 9986 9987 9988 9989 9990 9991 9992 9993 9994 9995 9996 9997 9998 9999 10000 10001 10002 10003 10004 10005 10006 10007 10008 10009 10010 10011 10012 10013 10014 10015 10016 10017 10018 10019 10020 10021 10022 10023 10024 10025 10026 10027 10028 10029 10030 10031 10032 10033 10034 10035 10036 10037 10038 10039 10040 10041 10042 10043 10044 10045 10046 10047 10048 10049 10050 10051 10052 10053 10054 10055 10056 10057 10058 10059 10060 10061 10062 10063 10064 10065 10066 10067 10068 10069 10070 10071 10072 10073 10074 10075 10076 10077 10078 10079 10080 10081 10082 10083 10084 10085 10086 10087 10088 10089 10090 10091 10092 10093 10094 10095 10096 10097 10098 10099 10100 10101 10102 10103 10104 10105 10106 10107 10108 10109 10110 10111 10112 10113 10114 10115 10116 10117 10118 10119 10120 10121 10122 10123 10124 10125 10126 10127 10128 10129 10130 10131 10132 10133 10134 10135 10136 10137 10138 10139 10140 10141 10142 10143 10144 10145 10146 10147 10148 10149 10150 10151 10152 10153 10154 10155 10156 10157 10158 10159 10160 10161 10162 10163 10164 10165 10166 10167 10168 10169 10170 10171 10172 10173 10174 10175 10176 10177 10178 10179 10180 10181 10182 10183 10184 10185 10186 10187 10188 10189 10190 10191 10192 10193 10194 10195 10196 10197 10198 10199 10200 10201 10202 10203 10204 10205 10206 10207 10208 10209 10210 10211 10212 10213 10214 10215 10216 10217 10218 10219 10220 10221 10222 10223 10224 10225 10226 10227 10228 10229 10230 10231 10232 10233 10234 10235 10236 10237 10238 10239 10240 10241 10242 10243 10244 10245 10246 10247 10248 10249 10250 10251 10252 10253 10254 10255 10256 10257 10258 10259 10260 10261 10262 10263 10264 10265 10266 10267 10268 10269 10270 10271 10272 10273 10274 10275 10276 10277 10278 10279 10280 10281 10282 10283 10284 10285 10286 10287 10288 10289 10290 10291 10292 10293 10294 10295 10296 10297 10298 10299 10300 10301 10302 10303 10304 10305 10306 10307 10308 10309 10310 10311 10312 10313 10314 10315 10316 10317 10318 10319 10320 10321 10322 10323 10324 10325 10326 10327 10328 10329 10330 10331 10332 10333 10334 10335 10336 10337 10338 10339 10340 10341 10342 10343 10344 10345 10346 10347 10348 10349 10350 10351 10352 10353 10354 10355 10356 10357 10358 10359 10360 10361 10362 10363 10364 10365 10366 10367 10368 10369 10370 10371 10372 10373 10374 10375 10376 10377 10378 10379 10380 10381 10382 10383 10384 10385 10386 10387 10388 10389 10390 10391 10392 10393 10394 10395 10396 10397 10398 10399 10400 10401 10402 10403 10404 10405 10406 10407 10408 10409 10410 10411 10412 10413 10414 10415 10416 10417 10418 10419 10420 10421 10422 10423 10424 10425 10426 10427 10428 10429 10430 10431 10432 10433 10434 10435 10436 10437 10438 10439 10440 10441 10442 10443 10444 10445 10446 10447 10448 10449 10450 10451 10452 10453 10454 10455 10456 10457 10458 10459 10460 10461 10462 10463 10464 10465 10466 10467 10468 10469 10470 10471 10472 10473 10474 10475 10476 10477 10478 10479 10480 10481 10482 10483 10484 10485 10486 10487 10488 10489 10490 10491 10492 10493 10494 10495 10496 10497 10498 10499 10500 10501 10502 10503 10504 10505 10506 10507 10508 10509 10510 10511 10512 10513 10514 10515 10516 10517 10518 10519 10520 10521 10522 10523 10524 10525 10526 10527 10528 10529 10530 10531 10532 10533 10534 10535 10536 10537 10538 10539 10540 10541 10542 10543 10544 10545 10546 10547 10548 10549 10550 10551 10552 10553 10554 10555 10556 10557 10558 10559 10560 10561 10562 10563 10564 10565 10566 10567 10568 10569 10570 10571 10572 10573 10574 10575 10576 10577 10578 10579 10580 10581 10582 10583 10584 10585 10586 10587 10588 10589 10590 10591 10592 10593 10594 10595 10596 10597 10598 10599 10600 10601 10602 10603 10604 10605 10606 10607 10608 10609 10610 10611 10612 10613 10614 10615 10616 10617 10618 10619 10620 10621 10622 10623 10624 10625 10626 10627 10628 10629 10630 10631 10632 10633 10634 10635 10636 10637 10638 10639 10640 10641 10642 10643 10644 10645 10646 10647 10648 10649 10650 10651 10652 10653 10654 10655 10656 10657 10658 10659 10660 10661 10662 10663 10664 10665 10666 10667 10668 10669 10670 10671 10672 10673 10674 10675 10676 10677 10678 10679 10680 10681 10682 10683 10684 10685 10686 10687 10688 10689 10690 10691 10692 10693 10694 10695 10696 10697 10698 10699 10700 10701 10702 10703 10704 10705 10706 10707 10708 10709 10710 10711 10712 10713 10714 10715 10716 10717 10718 10719 10720 10721 10722 10723 10724 10725 10726 10727 10728 10729 10730 10731 10732 10733 10734 10735 10736 10737 10738 10739 10740 10741 10742 10743 10744 10745 10746 10747 10748 10749 10750 10751 10752 10753 10754 10755 10756 10757 10758 10759 10760 10761 10762 10763 10764 10765 10766 10767 10768 10769 10770 10771 10772 10773 10774 10775 10776 10777 10778 10779 10780 10781 10782 10783 10784 10785 10786 10787 10788 10789 10790 10791 10792 10793 10794 10795 10796 10797 10798 10799 10800 10801 10802 10803 10804 10805 10806 10807 10808 10809 10810 10811 10812 10813 10814 10815 10816 10817 10818 10819 10820 10821 10822 10823 10824 10825 10826 10827 10828 10829 10830 10831 10832 10833 10834 10835 10836 10837 10838 10839 10840 10841 10842 10843 10844 10845 10846 10847 10848 10849 10850 10851 10852 10853 10854 10855 10856 10857 10858 10859 10860 10861 10862 10863 10864 10865 10866 10867 10868 10869 10870 10871 10872 10873 10874 10875 10876 10877 10878 10879 10880 10881 10882 10883 10884 10885 10886 10887 10888 10889 10890 10891 10892 10893 10894 10895 10896 10897 10898 10899 10900 10901 10902 10903 10904 10905 10906 10907 10908 10909 10910 10911 10912 10913 10914 10915 10916 10917 10918 10919 10920 10921 10922 10923 10924 10925 10926 10927 10928 10929 10930 10931 10932 10933 10934 10935 10936 10937 10938 10939 10940 10941 10942 10943 10944 10945 10946 10947 10948 10949 10950 10951 10952 10953 10954 10955 10956 10957 10958 10959 10960 10961 10962 10963 10964 10965 10966 10967 10968 10969 10970 10971 10972 10973 10974 10975 10976 10977 10978 10979 10980 10981 10982 10983 10984 10985 10986 10987 10988 10989 10990 10991 10992 10993 10994 10995 10996 10997 10998 10999 11000 11001 11002 11003 11004 11005 11006 11007 11008 11009 11010 11011 11012 11013 11014 11015 11016 11017 11018 11019 11020 11021 11022 11023 11024 11025 11026 11027 11028 11029 11030 11031 11032 11033 11034 11035 11036 11037 11038 11039 11040 11041 11042 11043 11044 11045 11046 11047 11048 11049 11050 11051 11052 11053 11054 11055 11056 11057 11058 11059 11060 11061 11062 11063 11064 11065 11066 11067 11068 11069 11070 11071 11072 11073 11074 11075 11076 11077 11078 11079 11080 11081 11082 11083 11084 11085 11086 11087 11088 11089 11090 11091 11092 11093 11094 11095 11096 11097 11098 11099 11100 11101 11102 11103 11104 11105 11106 11107 11108 11109 11110 11111 11112 11113 11114 11115 11116 11117 11118 11119 11120 11121 11122 11123 11124 11125 11126 11127 11128 11129 11130 11131 11132 11133 11134 11135 11136 11137 11138 11139 11140 11141 11142 11143 11144 11145 11146 11147 11148 11149 11150 11151 11152 11153 11154 11155 11156 11157 11158 11159 11160 11161 11162 11163 11164 11165 11166 11167 11168 11169 11170 11171 11172 11173 11174 11175 11176 11177 11178 11179 11180 11181 11182 11183 11184 11185 11186 11187 11188 11189 11190 11191 11192 11193 11194 11195 11196 11197 11198 11199 11200 11201 11202 11203 11204 11205 11206 11207 11208 11209 11210 11211 11212 11213 11214 11215 11216 11217 11218 11219 11220 11221 11222 11223 11224 11225 11226 11227 11228 11229 11230 11231 11232 11233 11234 11235 11236 11237 11238 11239 11240 11241 11242 11243 11244 11245 11246 11247 11248 11249 11250 11251 11252 11253 11254 11255 11256 11257 11258 11259 11260 11261 11262 11263 11264 11265 11266 11267 11268 11269 11270 11271 11272 11273 11274 11275 11276 11277 11278 11279 11280 11281 11282 11283 11284 11285 11286 11287 11288 11289 11290 11291 11292 11293 11294 11295 11296 11297 11298 11299 11300 11301 11302 11303 11304 11305 11306 11307 11308 11309 11310 11311 11312 11313 11314 11315 11316 11317 11318 11319 11320 11321 11322 11323 11324 11325 11326 11327 11328 11329 11330 11331 11332 11333 11334 11335 11336 11337 11338 11339 11340 11341 11342 11343 11344 11345 11346 11347 11348 11349 11350 11351 11352 11353 11354 11355 11356 11357 11358 11359 11360 11361 11362 11363 11364 11365 11366 11367 11368 11369 11370 11371 11372 11373 11374 11375 11376 11377 11378 11379 11380 11381 11382 11383 11384 11385 11386 11387 11388 11389 11390 11391 11392 11393 11394 11395 11396 11397 11398 11399 11400 11401 11402 11403 11404 11405 11406 11407 11408 11409 11410 11411 11412 11413 11414 11415 11416 11417 11418 11419 11420 11421 11422 11423 11424 11425 11426 11427 11428 11429 11430 11431 11432 11433 11434 11435 11436 11437 11438 11439 11440 11441 11442 11443 11444 11445 11446 11447 11448 11449 11450 11451 11452 11453 11454 11455 11456 11457 11458 11459 11460 11461 11462 11463 11464 11465 11466 11467 11468 11469 11470 11471 11472 11473 11474 11475 11476 11477 11478 11479 11480 11481 11482 11483 11484 11485 11486 11487 11488 11489 11490 11491 11492 11493 11494 11495 11496 11497 11498 11499 11500 11501 11502 11503 11504 11505 11506 11507 11508 11509 11510 11511 11512 11513 11514 11515 11516 11517 11518 11519 11520 11521 11522 11523 11524 11525 11526 11527 11528 11529 11530 11531 11532 11533 11534 11535 11536 11537 11538 11539 11540 11541 11542 11543 11544 11545 11546 11547 11548 11549 11550 11551 11552 11553 11554 11555 11556 11557 11558 11559 11560 11561 11562 11563 11564 11565 11566 11567 11568 11569 11570 11571 11572 11573 11574 11575 11576 11577 11578 11579 11580 11581 11582 11583 11584 11585 11586 11587 11588 11589 11590 11591 11592 11593 11594 11595 11596 11597 11598 11599 11600 11601 11602 11603 11604 11605 11606 11607 11608 11609 11610 11611 11612 11613 11614 11615 11616 11617 11618 11619 11620 11621 11622 11623 11624 11625 11626 11627 11628 11629 11630 11631 11632 11633 11634 11635 11636 11637 11638 11639 11640 11641 11642 11643 11644 11645 11646 11647 11648 11649 11650 11651 11652 11653 11654 11655 11656 11657 11658 11659 11660 11661 11662 11663 11664 11665 11666 11667 11668 11669 11670 11671 11672 11673 11674 11675 11676 11677 11678 11679 11680 11681 11682 11683 11684 11685 11686 11687 11688 11689 11690 11691 11692 11693 11694 11695 11696 11697 11698 11699 11700 11701 11702 11703 11704 11705 11706 11707 11708 11709 11710 11711 11712 11713 11714 11715 11716 11717 11718 11719 11720 11721 11722 11723 11724 11725 11726 11727 11728 11729 11730 11731 11732 11733 11734 11735 11736 11737 11738 11739 11740 11741 11742 11743 11744 11745 11746 11747 11748 11749 11750 11751 11752 11753 11754 11755 11756 11757 11758 11759 11760 11761 11762 11763 11764 11765 11766 11767 11768 11769 11770 11771 11772 11773 11774 11775 11776 11777 11778 11779 11780 11781 11782 11783 11784 11785 11786 11787 11788 11789 11790 11791 11792 11793 11794 11795 11796 11797 11798 11799 11800 11801 11802 11803 11804 11805 11806 11807 11808 11809 11810 11811 11812 11813 11814 11815 11816 11817 11818 11819 11820 11821 11822 11823 11824 11825 11826 11827 11828 11829 11830 11831 11832 11833 11834 11835 11836 11837 11838 11839 11840 11841 11842 11843 11844 11845 11846 11847 11848 11849 11850 11851 11852 11853 11854 11855 11856 11857 11858 11859 11860 11861 11862 11863 11864 11865 11866 11867 11868 11869 11870 11871 11872 11873 11874 11875 11876 11877 11878 11879 11880 11881 11882 11883 11884 11885 11886 11887 11888 11889 11890 11891 11892 11893 11894 11895 11896 11897 11898 11899 11900 11901 11902 11903 11904 11905 11906 11907 11908 11909 11910 11911 11912 11913 11914 11915 11916 11917 11918 11919 11920 11921 11922 11923 11924 11925 11926 11927 11928 11929 11930 11931 11932 11933 11934 11935 11936 11937 11938 11939 11940 11941 11942 11943 11944 11945 11946 11947 11948 11949 11950 11951 11952 11953 11954 11955 11956 11957 11958 11959 11960 11961 11962 11963 11964 11965 11966 11967 11968 11969 11970 11971 11972 11973 11974 11975 11976 11977 11978 11979 11980 11981 11982 11983 11984 11985 11986 11987 11988 11989 11990 11991 11992 11993 11994 11995 11996 11997 11998 11999 12000 12001 12002 12003 12004 12005 12006 12007 12008 12009 12010 12011 12012 12013 12014 12015 12016 12017 12018 12019 12020 12021 12022 12023 12024 12025 12026 12027 12028 12029 12030 12031 12032 12033 12034 12035 12036 12037 12038 12039 12040 12041 12042 12043 12044 12045 12046 12047 12048 12049 12050 12051 12052 12053 12054 12055 12056 12057 12058 12059 12060 12061 12062 12063 12064 12065 12066 12067 12068 12069 12070 12071 12072 12073 12074 12075 12076 12077 12078 12079 12080 12081 12082 12083 12084 12085 12086 12087 12088 12089 12090 12091 12092 12093 12094 12095 12096 12097 12098 12099 12100 12101 12102 12103 12104 12105 12106 12107 12108 12109 12110 12111 12112 12113 12114 12115 12116 12117 12118 12119 12120 12121 12122 12123 12124 12125 12126 12127 12128 12129 12130 12131 12132 12133 12134 12135 12136 12137 12138 12139 12140 12141 12142 12143 12144 12145 12146 12147 12148 12149 12150 12151 12152 12153 12154 12155 12156 12157 12158 12159 12160 12161 12162 12163 12164 12165 12166 12167 12168 12169 12170 12171 12172 12173 12174 12175 12176 12177 12178 12179 12180 12181 12182 12183 12184 12185 12186 12187 12188 12189 12190 12191 12192 12193 12194 12195 12196 12197 12198 12199 12200 12201 12202 12203 12204 12205 12206 12207 12208 12209 12210 12211 12212 12213 12214 12215 12216 12217 12218 12219 12220 12221 12222 12223 12224 12225 12226 12227 12228 12229 12230 12231 12232 12233 12234 12235 12236 12237 12238 12239 12240 12241 12242 12243 12244 12245 12246 12247 12248 12249 12250 12251 12252 12253 12254 12255 12256 12257 12258 12259 12260 12261 12262 12263 12264 12265 12266 12267 12268 12269 12270 12271 12272 12273 12274 12275 12276 12277 12278 12279 12280 12281 12282 12283 12284 12285 12286 12287 12288 12289 12290 12291 12292 12293 12294 12295 12296 12297 12298 12299 12300 12301 12302 12303 12304 12305 12306 12307 12308 12309 12310 12311 12312 12313 12314 12315 12316 12317 12318 12319 12320 12321 12322 12323 12324 12325 12326 12327 12328 12329 12330 12331 12332 12333 12334 12335 12336 12337 12338 12339 12340 12341 12342 12343 12344 12345 12346 12347 12348 12349 12350 12351 12352 12353 12354 12355 12356 12357 12358 12359 12360 12361 12362 12363 12364 12365 12366 12367 12368 12369 12370 12371 12372 12373 12374 12375 12376 12377 12378 12379 12380 12381 12382 12383 12384 12385 12386 12387 12388 12389 12390 12391 12392 12393 12394 12395 12396 12397 12398 12399 12400 12401 12402 12403 12404 12405 12406 12407 12408 12409 12410 12411 12412 12413 12414 12415 12416 12417 12418 12419 12420 12421 12422 12423 12424 12425 12426 12427 12428 12429 12430 12431 12432 12433 12434 12435 12436 12437 12438 12439 12440 12441 12442 12443 12444 12445 12446 12447 12448 12449 12450 12451 12452 12453 12454 12455 12456 12457 12458 12459 12460 12461 12462 12463 12464 12465 12466 12467 12468 12469 12470 12471 12472 12473 12474 12475 12476 12477 12478 12479 12480 12481 12482 12483 12484 12485 12486 12487 12488 12489 12490 12491 12492 12493 12494 12495 12496 12497 12498 12499 12500 12501 12502 12503 12504 12505 12506 12507 12508 12509 12510 12511 12512 12513 12514 12515 12516 12517 12518 12519 12520 12521 12522 12523 12524 12525 12526 12527 12528 12529 12530 12531 12532 12533 12534 12535 12536 12537 12538 12539 12540 12541 12542 12543 12544 12545 12546 12547 12548 12549 12550 12551 12552 12553 12554 12555 12556 12557 12558 12559 12560 12561 12562 12563 12564 12565 12566 12567 12568 12569 12570 12571 12572 12573 12574 12575 12576 12577 12578 12579 12580 12581 12582 12583 12584 12585 12586 12587 12588 12589 12590 12591 12592 12593 12594 12595 12596 12597 12598 12599 12600 12601 12602 12603 12604 12605 12606 12607 12608 12609 12610 12611 12612 12613 12614 12615 12616 12617 12618 12619 12620 12621 12622 12623 12624 12625 12626 12627 12628 12629 12630 12631 12632 12633 12634 12635 12636 12637 12638 12639 12640 12641 12642 12643 12644 12645 12646 12647 12648 12649 12650 12651 12652 12653 12654 12655 12656 12657 12658 12659 12660 12661 12662 12663 12664 12665 12666 12667 12668 12669 12670 12671 12672 12673 12674 12675 12676 12677 12678 12679 12680 12681 12682 12683 12684 12685 12686 12687 12688 12689 12690 12691 12692 12693 12694 12695 12696 12697 12698 12699 12700 12701 12702 12703 12704 12705 12706 12707 12708 12709 12710 12711 12712 12713 12714 12715 12716 12717 12718 12719 12720 12721 12722 12723 12724 12725 12726 12727 12728 12729 12730 12731 12732 12733 12734 12735 12736 12737 12738 12739 12740 12741 12742 12743 12744 12745 12746 12747 12748 12749 12750 12751 12752 12753 12754 12755 12756 12757 12758 12759 12760 12761 12762 12763 12764 12765 12766 12767 12768 12769 12770 12771 12772 12773 12774 12775 12776 12777 12778 12779 12780 12781 12782 12783 12784 12785 12786 12787 12788 12789 12790 12791 12792 12793 12794 12795 12796 12797 12798 12799 12800 12801 12802 12803 12804 12805 12806 12807 12808 12809 12810 12811 12812 12813 12814 12815 12816 12817 12818 12819 12820 12821 12822 12823 12824 12825 12826 12827 12828 12829 12830 12831 12832 12833 12834 12835 12836 12837 12838 12839 12840 12841 12842 12843 12844 12845 12846 12847 12848 12849 12850 12851 12852 12853 12854 12855 12856 12857 12858 12859 12860 12861 12862 12863 12864 12865 12866 12867 12868 12869 12870 12871 12872 12873 12874 12875 12876 12877 12878 12879 12880 12881 12882 12883 12884 12885 12886 12887 12888 12889 12890 12891 12892 12893 12894 12895 12896 12897 12898 12899 12900 12901 12902 12903 12904 12905 12906 12907 12908 12909 12910 12911 12912 12913 12914 12915 12916 12917 12918 12919 12920 12921 12922 12923 12924 12925 12926 12927 12928 12929 12930 12931 12932 12933 12934 12935 12936 12937 12938 12939 12940 12941 12942 12943 12944 12945 12946 12947 12948 12949 12950 12951 12952 12953 12954 12955 12956 12957 12958 12959 12960 12961 12962 12963 12964 12965 12966 12967 12968 12969 12970 12971 12972 12973 12974 12975 12976 12977 12978 12979 12980 12981 12982 12983 12984 12985 12986 12987 12988 12989 12990 12991 12992 12993 12994 12995 12996 12997 12998 12999 13000 13001 13002 13003 13004 13005 13006 13007 13008 13009 13010 13011 13012 13013 13014 13015 13016 13017 13018 13019 13020 13021 13022 13023 13024 13025 13026 13027 13028 13029 13030 13031 13032 13033 13034 13035 13036 13037 13038 13039 13040 13041 13042 13043 13044 13045 13046 13047 13048 13049 13050 13051 13052 13053 13054 13055 13056 13057 13058 13059 13060 13061 13062 13063 13064 13065 13066 13067 13068 13069 13070 13071 13072 13073 13074 13075 13076 13077 13078 13079 13080 13081 13082 13083 13084 13085 13086 13087 13088 13089 13090 13091 13092 13093 13094 13095 13096 13097 13098 13099 13100 13101 13102 13103 13104 13105 13106 13107 13108 13109 13110 13111 13112 13113 13114 13115 13116 13117 13118 13119 13120 13121 13122 13123 13124 13125 13126 13127 13128 13129 13130 13131 13132 13133 13134 13135 13136 13137 13138 13139 13140 13141 13142 13143 13144 13145 13146 13147 13148 13149 13150 13151 13152 13153 13154 13155 13156 13157 13158 13159 13160 13161 13162 13163 13164 13165 13166 13167 13168 13169 13170 13171 13172 13173 13174 13175 13176 13177 13178 13179 13180 13181 13182 13183 13184 13185 13186 13187 13188 13189 13190 13191 13192 13193 13194 13195 13196 13197 13198 13199 13200 13201 13202 13203 13204 13205 13206 13207 13208 13209 13210 13211 13212 13213 13214 13215 13216 13217 13218 13219 13220 13221 13222 13223 13224 13225 13226 13227 13228 13229 13230 13231 13232 13233 13234 13235 13236 13237 13238 13239 13240 13241 13242 13243 13244 13245 13246 13247 13248 13249 13250 13251 13252 13253 13254 13255 13256 13257 13258 13259 13260 13261 13262 13263 13264 13265 13266 13267 13268 13269 13270 13271 13272 13273 13274 13275 13276 13277 13278 13279 13280 13281 13282 13283 13284 13285 13286 13287 13288 13289 13290 13291 13292 13293 13294 13295 13296 13297 13298 13299 13300 13301 13302 13303 13304 13305 13306 13307 13308 13309 13310 13311 13312 13313 13314 13315 13316 13317 13318 13319 13320 13321 13322 13323 13324 13325 13326 13327 13328 13329 13330 13331 13332 13333 13334 13335 13336 13337 13338 13339 13340 13341 13342 13343 13344 13345 13346 13347 13348 13349 13350 13351 13352 13353 13354 13355 13356 13357 13358 13359 13360 13361 13362 13363 13364 13365 13366 13367 13368 13369 13370 13371 13372 13373 13374 13375 13376 13377 13378 13379 13380 13381 13382 13383 13384 13385 13386 13387 13388 13389 13390 13391 13392 13393 13394 13395 13396 13397 13398 13399 13400 13401 13402 13403 13404 13405 13406 13407 13408 13409 13410 13411 13412 13413 13414 13415 13416 13417 13418 13419 13420 13421 13422 13423 13424 13425 13426 13427 13428 13429 13430 13431 13432 13433 13434 13435 13436 13437 13438 13439 13440 13441 13442 13443 13444 13445 13446 13447 13448 13449 13450 13451 13452 13453 13454 13455 13456 13457 13458 13459 13460 13461 13462 13463 13464 13465 13466 13467 13468 13469 13470 13471 13472 13473 13474 13475 13476 13477 13478 13479 13480 13481 13482 13483 13484 13485 13486 13487 13488 13489 13490 13491 13492 13493 13494 13495 13496 13497 13498 13499 13500 13501 13502 13503 13504 13505 13506 13507 13508 13509 13510 13511 13512 13513 13514 13515 13516 13517 13518 13519 13520 13521 13522 13523 13524 13525 13526 13527 13528 13529 13530 13531 13532 13533 13534 13535 13536 13537 13538 13539 13540 13541 13542 13543 13544 13545 13546 13547 13548 13549 13550 13551 13552 13553 13554 13555 13556 13557 13558 13559 13560 13561 13562 13563 13564 13565 13566 13567 13568 13569 13570 13571 13572 13573 13574 13575 13576 13577 13578 13579 13580 13581 13582 13583 13584 13585 13586 13587 13588 13589 13590 13591 13592 13593 13594 13595 13596 13597 13598 13599 13600 13601 13602 13603 13604 13605 13606 13607 13608 13609 13610 13611 13612 13613 13614 13615 13616 13617 13618 13619 13620 13621 13622 13623 13624 13625 13626 13627 13628 13629 13630 13631 13632 13633 13634 13635 13636 13637 13638 13639 13640 13641 13642 13643 13644 13645 13646 13647 13648 13649 13650 13651 13652 13653 13654 13655 13656 13657 13658 13659 13660 13661 13662 13663 13664 13665 13666 13667 13668 13669 13670 13671 13672 13673 13674 13675 13676 13677 13678 13679 13680 13681 13682 13683 13684 13685 13686 13687 13688 13689 13690 13691 13692 13693 13694 13695 13696 13697 13698 13699 13700 13701 13702 13703 13704 13705 13706 13707 13708 13709 13710 13711 13712 13713 13714 13715 13716 13717 13718 13719 13720 13721 13722 13723 13724 13725 13726 13727 13728 13729 13730 13731 13732 13733 13734 13735 13736 13737 13738 13739 13740 13741 13742 13743 13744 13745 13746 13747 13748 13749 13750 13751 13752 13753 13754 13755 13756 13757 13758 13759 13760 13761 13762 13763 13764 13765 13766 13767 13768 13769 13770 13771 13772 13773 13774 13775 13776 13777 13778 13779 13780 13781 13782 13783 13784 13785 13786 13787 13788 13789 13790 13791 13792 13793 13794 13795 13796 13797 13798 13799 13800 13801 13802 13803 13804 13805 13806 13807 13808 13809 13810 13811 13812 13813 13814 13815 13816 13817 13818 13819 13820 13821 13822 13823 13824 13825 13826 13827 13828 13829 13830 13831 13832 13833 13834 13835 13836 13837 13838 13839 13840 13841 13842 13843 13844 13845 13846 13847 13848 13849 13850 13851 13852 13853 13854 13855 13856 13857 13858 13859 13860 13861 13862 13863 13864 13865 13866 13867 13868 13869 13870 13871 13872 13873 13874 13875 13876 13877 13878 13879 13880 13881 13882 13883 13884 13885 13886 13887 13888 13889 13890 13891 13892 13893 13894 13895 13896 13897 13898 13899 13900 13901 13902 13903 13904 13905 13906 13907 13908 13909 13910 13911 13912 13913 13914 13915 13916 13917 13918 13919 13920 13921 13922 13923 13924 13925 13926 13927 13928 13929 13930 13931 13932 13933 13934 13935 13936 13937 13938 13939 13940 13941 13942 13943 13944 13945 13946 13947 13948 13949 13950 13951 13952 13953 13954 13955 13956 13957 13958 13959 13960 13961 13962 13963 13964 13965 13966 13967 13968 13969 13970 13971 13972 13973 13974 13975 13976 13977 13978 13979 13980 13981 13982 13983 13984 13985 13986 13987 13988 13989 13990 13991 13992 13993 13994 13995 13996 13997 13998 13999 14000 14001 14002 14003 14004 14005 14006 14007 14008 14009 14010 14011 14012 14013 14014 14015 14016 14017 14018 14019 14020 14021 14022 14023 14024 14025 14026 14027 14028 14029 14030 14031 14032 14033 14034 14035 14036 14037 14038 14039 14040 14041 14042 14043 14044 14045 14046 14047 14048 14049 14050 14051 14052 14053 14054 14055 14056 14057 14058 14059 14060 14061 14062 14063 14064 14065 14066 14067 14068 14069 14070 14071 14072 14073 14074 14075 14076 14077 14078 14079 14080 14081 14082 14083 14084 14085 14086 14087 14088 14089 14090 14091 14092 14093 14094 14095 14096 14097 14098 14099 14100 14101 14102 14103 14104 14105 14106 14107 14108 14109 14110 14111 14112 14113 14114 14115 14116 14117 14118 14119 14120 14121 14122 14123 14124 14125 14126 14127 14128 14129 14130 14131 14132 14133 14134 14135 14136 14137 14138 14139 14140 14141 14142 14143 14144 14145 14146 14147 14148 14149 14150 14151 14152 14153 14154 14155 14156 14157 14158 14159 14160 14161 14162 14163 14164 14165 14166 14167 14168 14169 14170 14171 14172 14173 14174 14175 14176 14177 14178 14179 14180 14181 14182 14183 14184 14185 14186 14187 14188 14189 14190 14191 14192 14193 14194 14195 14196 14197 14198 14199 14200 14201 14202 14203 14204 14205 14206 14207 14208 14209 14210 14211 14212 14213 14214 14215 14216 14217 14218 14219 14220 14221 14222 14223 14224 14225 14226 14227 14228 14229 14230 14231 14232 14233 14234 14235 14236 14237 14238 14239 14240 14241 14242 14243 14244 14245 14246 14247 14248 14249 14250 14251 14252 14253 14254 14255 14256 14257 14258 14259 14260 14261 14262 14263 14264 14265 14266 14267 14268 14269 14270 14271 14272 14273 14274 14275 14276 14277 14278 14279 14280 14281 14282 14283 14284 14285 14286 14287 14288 14289 14290 14291 14292 14293 14294 14295 14296 14297 14298 14299 14300 14301 14302 14303 14304 14305 14306 14307 14308 14309 14310 14311 14312 14313 14314 14315 14316 14317 14318 14319 14320 14321 14322 14323 14324 14325 14326 14327 14328 14329 14330 14331 14332 14333 14334 14335 14336 14337 14338 14339 14340 14341 14342 14343 14344 14345 14346 14347 14348 14349 14350 14351 14352 14353 14354 14355 14356 14357 14358 14359 14360 14361 14362 14363 14364 14365 14366 14367 14368 14369 14370 14371 14372 14373 14374 14375 14376 14377 14378 14379 14380 14381 14382 14383 14384 14385 14386 14387 14388 14389 14390 14391 14392 14393 14394 14395 14396 14397 14398 14399 14400 14401 14402 14403 14404 14405 14406 14407 14408 14409 14410 14411 14412 14413 14414 14415 14416 14417 14418 14419 14420 14421 14422 14423 14424 14425 14426 14427 14428 14429 14430 14431 14432 14433 14434 14435 14436 14437 14438 14439 14440 14441 14442 14443 14444 14445 14446 14447 14448 14449 14450 14451 14452 14453 14454 14455 14456 14457 14458 14459 14460 14461 14462 14463 14464 14465 14466 14467 14468 14469 14470 14471 14472 14473 14474 14475 14476 14477 14478 14479 14480 14481 14482 14483 14484 14485 14486 14487 14488 14489 14490 14491 14492 14493 14494 14495 14496 14497 14498 14499 14500 14501 14502 14503 14504 14505 14506 14507 14508 14509 14510 14511 14512 14513 14514 14515 14516 14517 14518 14519 14520 14521 14522 14523 14524 14525 14526 14527 14528 14529 14530 14531 14532 14533 14534 14535 14536 14537 14538 14539 14540 14541 14542 14543 14544 14545 14546 14547 14548 14549 14550 14551 14552 14553 14554 14555 14556 14557 14558 14559 14560 14561 14562 14563 14564 14565 14566 14567 14568 14569 14570 14571 14572 14573 14574 14575 14576 14577 14578 14579 14580 14581 14582 14583 14584 14585 14586 14587 14588 14589 14590 14591 14592 14593 14594 14595 14596 14597 14598 14599 14600 14601 14602 14603 14604 14605 14606 14607 14608 14609 14610 14611 14612 14613 14614 14615 14616 14617 14618 14619 14620 14621 14622 14623 14624 14625 14626 14627 14628 14629 14630 14631 14632 14633 14634 14635 14636 14637 14638 14639 14640 14641 14642 14643 14644 14645 14646 14647 14648 14649 14650 14651 14652 14653 14654 14655 14656 14657 14658 14659 14660 14661 14662 14663 14664 14665 14666 14667 14668 14669 14670 14671 14672 14673 14674 14675 14676 14677 14678 14679 14680 14681 14682 14683 14684 14685 14686 14687 14688 14689 14690 14691 14692 14693 14694 14695 14696 14697 14698 14699 14700 14701 14702 14703 14704 14705 14706 14707 14708 14709 14710 14711 14712 14713 14714 14715 14716 14717 14718 14719 14720 14721 14722 14723 14724 14725 14726 14727 14728 14729 14730 14731 14732 14733 14734 14735 14736 14737 14738 14739 14740 14741 14742 14743 14744 14745 14746 14747 14748 14749 14750 14751 14752 14753 14754 14755 14756 14757 14758 14759 14760 14761 14762 14763 14764 14765 14766 14767 14768 14769 14770 14771 14772 14773 14774 14775 14776 14777 14778 14779 14780 14781 14782 14783 14784 14785 14786 14787 14788 14789 14790 14791 14792 14793 14794 14795 14796 14797 14798 14799 14800 14801 14802 14803 14804 14805 14806 14807 14808 14809 14810 14811 14812 14813 14814 14815 14816 14817 14818 14819 14820 14821 14822 14823 14824 14825 14826 14827 14828 14829 14830 14831 14832 14833 14834 14835 14836 14837 14838 14839 14840 14841 14842 14843 14844 14845 14846 14847 14848 14849 14850 14851 14852 14853 14854 14855 14856 14857 14858 14859 14860 14861 14862 14863 14864 14865 14866 14867 14868 14869 14870 14871 14872 14873 14874 14875 14876 14877 14878 14879 14880 14881 14882 14883 14884 14885 14886 14887 14888 14889 14890 14891 14892 14893 14894 14895 14896 14897 14898 14899 14900 14901 14902 14903 14904 14905 14906 14907 14908 14909 14910 14911 14912 14913 14914 14915 14916 14917 14918 14919 14920 14921 14922 14923 14924 14925 14926 14927 14928 14929 14930 14931 14932 14933 14934 14935 14936 14937 14938 14939 14940 14941 14942 14943 14944 14945 14946 14947 14948 14949 14950 14951 14952 14953 14954 14955 14956 14957 14958 14959 14960 14961 14962 14963 14964 14965 14966 14967 14968 14969 14970 14971 14972 14973 14974 14975 14976 14977 14978 14979 14980 14981 14982 14983 14984 14985 14986 14987 14988 14989 14990 14991 14992 14993 14994 14995 14996 14997 14998 14999 15000 15001 15002 15003 15004 15005 15006 15007 15008 15009 15010 15011 15012 15013 15014 15015 15016 15017 15018 15019 15020 15021 15022 15023 15024 15025 15026 15027 15028 15029 15030 15031 15032 15033 15034 15035 15036 15037 15038 15039 15040 15041 15042 15043 15044 15045 15046 15047 15048 15049 15050 15051 15052 15053 15054 15055 15056 15057 15058 15059 15060 15061 15062 15063 15064 15065 15066 15067 15068 15069 15070 15071 15072 15073 15074 15075 15076 15077 15078 15079 15080 15081 15082 15083 15084 15085 15086 15087 15088 15089 15090 15091 15092 15093 15094 15095 15096 15097 15098 15099 15100 15101 15102 15103 15104 15105 15106 15107 15108 15109 15110 15111 15112 15113 15114 15115 15116 15117 15118 15119 15120 15121 15122 15123 15124 15125 15126 15127 15128 15129 15130 15131 15132 15133 15134 15135 15136 15137 15138 15139 15140 15141 15142 15143 15144 15145 15146 15147 15148 15149 15150 15151 15152 15153 15154 15155 15156 15157 15158 15159 15160 15161 15162 15163 15164 15165 15166 15167 15168 15169 15170 15171 15172 15173 15174 15175 15176 15177 15178 15179 15180 15181 15182 15183 15184 15185 15186 15187 15188 15189 15190 15191 15192 15193 15194 15195 15196 15197 15198 15199 15200 15201 15202 15203 15204 15205 15206 15207 15208 15209 15210 15211 15212 15213 15214 15215 15216 15217 15218 15219 15220 15221 15222 15223 15224 15225 15226 15227 15228 15229 15230 15231 15232 15233 15234 15235 15236 15237 15238 15239 15240 15241 15242 15243 15244 15245 15246 15247 15248 15249 15250 15251 15252 15253 15254 15255 15256 15257 15258 15259 15260 15261 15262 15263 15264 15265 15266 15267 15268 15269 15270 15271 15272 15273 15274 15275 15276 15277 15278 15279 15280 15281 15282 15283 15284 15285 15286 15287 15288 15289 15290 15291 15292 15293 15294 15295 15296 15297 15298 15299 15300 15301 15302 15303 15304 15305 15306 15307 15308 15309 15310 15311 15312 15313 15314 15315 15316 15317 15318 15319 15320 15321 15322 15323 15324 15325 15326 15327 15328 15329 15330 15331 15332 15333 15334 15335 15336 15337 15338 15339 15340 15341 15342 15343 15344 15345 15346 15347 15348 15349 15350 15351 15352 15353 15354 15355 15356 15357 15358 15359 15360 15361 15362 15363 15364 15365 15366 15367 15368 15369 15370 15371 15372 15373 15374 15375 15376 15377 15378 15379 15380 15381 15382 15383 15384 15385 15386 15387 15388 15389 15390 15391 15392 15393 15394 15395 15396 15397 15398 15399 15400 15401 15402 15403 15404 15405 15406 15407 15408 15409 15410 15411 15412 15413 15414 15415 15416 15417 15418 15419 15420 15421 15422 15423 15424 15425 15426 15427 15428 15429 15430 15431 15432 15433 15434 15435 15436 15437 15438 15439 15440 15441 15442 15443 15444 15445 15446 15447 15448 15449 15450 15451 15452 15453 15454 15455 15456 15457 15458 15459 15460 15461 15462 15463 15464 15465 15466 15467 15468 15469 15470 15471 15472 15473 15474 15475 15476 15477 15478 15479 15480 15481 15482 15483 15484 15485 15486 15487 15488 15489 15490 15491 15492 15493 15494 15495 15496 15497 15498 15499 15500 15501 15502 15503 15504 15505 15506 15507 15508 15509 15510 15511 15512 15513 15514 15515 15516 15517 15518 15519 15520 15521 15522 15523 15524 15525 15526 15527 15528 15529 15530 15531 15532 15533 15534 15535 15536 15537 15538 15539 15540 15541 15542 15543 15544 15545 15546 15547 15548 15549 15550 15551 15552 15553 15554 15555 15556 15557 15558 15559 15560 15561 15562 15563 15564 15565 15566 15567 15568 15569 15570 15571 15572 15573 15574 15575 15576 15577 15578 15579 15580 15581 15582 15583 15584 15585 15586 15587 15588 15589 15590 15591 15592 15593 15594 15595 15596 15597 15598 15599 15600 15601 15602 15603 15604 15605 15606 15607 15608 15609 15610 15611 15612 15613 15614 15615 15616 15617 15618 15619 15620 15621 15622 15623 15624 15625 15626 15627 15628 15629 15630 15631 15632 15633 15634 15635 15636 15637 15638 15639 15640 15641 15642 15643 15644 15645 15646 15647 15648 15649 15650 15651 15652 15653 15654 15655 15656 15657 15658 15659 15660 15661 15662 15663 15664 15665 15666 15667 15668 15669 15670 15671 15672 15673 15674 15675 15676 15677 15678 15679 15680 15681 15682 15683 15684 15685 15686 15687 15688 15689 15690 15691 15692 15693 15694 15695 15696 15697 15698 15699 15700 15701 15702 15703 15704 15705 15706 15707 15708 15709 15710 15711 15712 15713 15714 15715 15716 15717 15718 15719 15720 15721 15722 15723 15724 15725 15726 15727 15728 15729 15730 15731 15732 15733 15734 15735 15736 15737 15738 15739 15740 15741 15742 15743 15744 15745 15746 15747 15748 15749 15750 15751 15752 15753 15754 15755 15756 15757 15758 15759 15760 15761 15762 15763 15764 15765 15766 15767 15768 15769 15770 15771 15772 15773 15774 15775 15776 15777 15778 15779 15780 15781 15782 15783 15784 15785 15786 15787 15788 15789 15790 15791 15792 15793 15794 15795 15796 15797 15798 15799 15800 15801 15802 15803 15804 15805 15806 15807 15808 15809 15810 15811 15812 15813 15814 15815 15816 15817 15818 15819 15820 15821 15822 15823 15824 15825 15826 15827 15828 15829 15830 15831 15832 15833 15834 15835 15836 15837 15838 15839 15840 15841 15842 15843 15844 15845 15846 15847 15848 15849 15850 15851 15852 15853 15854 15855 15856 15857 15858 15859 15860 15861 15862 15863 15864 15865 15866 15867 15868 15869 15870 15871 15872 15873 15874 15875 15876 15877 15878 15879 15880 15881 15882 15883 15884 15885 15886 15887 15888 15889 15890 15891 15892 15893 15894 15895 15896 15897 15898 15899 15900 15901 15902 15903 15904 15905 15906 15907 15908 15909 15910 15911 15912 15913 15914 15915 15916 15917 15918 15919 15920 15921 15922 15923 15924 15925 15926 15927 15928 15929 15930 15931 15932 15933 15934 15935 15936 15937 15938 15939 15940 15941 15942 15943 15944 15945 15946 15947 15948 15949 15950 15951 15952 15953 15954 15955 15956 15957 15958 15959 15960 15961 15962 15963 15964 15965 15966 15967 15968 15969 15970 15971 15972 15973 15974 15975 15976 15977 15978 15979 15980 15981 15982 15983 15984 15985 15986 15987 15988 15989 15990 15991 15992 15993 15994 15995 15996 15997 15998 15999 16000 16001 16002 16003 16004 16005 16006 16007 16008 16009 16010 16011 16012 16013 16014 16015 16016 16017 16018 16019 16020 16021 16022 16023 16024 16025 16026 16027 16028 16029 16030 16031 16032 16033 16034 16035 16036 16037 16038 16039 16040 16041 16042 16043 16044 16045 16046 16047 16048 16049 16050 16051 16052 16053 16054 16055 16056 16057 16058 16059 16060 16061 16062 16063 16064 16065 16066 16067 16068 16069 16070 16071 16072 16073 16074 16075 16076 16077 16078 16079 16080 16081 16082 16083 16084 16085 16086 16087 16088 16089 16090 16091 16092 16093 16094 16095 16096 16097 16098 16099 16100 16101 16102 16103 16104 16105 16106 16107 16108 16109 16110 16111 16112 16113 16114 16115 16116 16117 16118 16119 16120 16121 16122 16123 16124 16125 16126 16127 16128 16129 16130 16131 16132 16133 16134 16135 16136 16137 16138 16139 16140 16141 16142 16143 16144 16145 16146 16147 16148 16149 16150 16151 16152 16153 16154 16155 16156 16157 16158 16159 16160 16161 16162 16163 16164 16165 16166 16167 16168 16169 16170 16171 16172 16173 16174 16175 16176 16177 16178 16179 16180 16181 16182 16183 16184 16185 16186 16187 16188 16189 16190 16191 16192 16193 16194 16195 16196 16197 16198 16199 16200 16201 16202 16203 16204 16205 16206 16207 16208 16209 16210 16211 16212 16213 16214 16215 16216 16217 16218 16219 16220 16221 16222 16223 16224 16225 16226 16227 16228 16229 16230 16231 16232 16233 16234 16235 16236 16237 16238 16239 16240 16241 16242 16243 16244 16245 16246 16247 16248 16249 16250 16251 16252 16253 16254 16255 16256 16257 16258 16259 16260 16261 16262 16263 16264 16265 16266 16267 16268 16269 16270 16271 16272 16273 16274 16275 16276 16277 16278 16279 16280 16281 16282 16283 16284 16285 16286 16287 16288 16289 16290 16291 16292 16293 16294 16295 16296 16297 16298 16299 16300 16301 16302 16303 16304 16305 16306 16307 16308 16309 16310 16311 16312 16313 16314 16315 16316 16317 16318 16319 16320 16321 16322 16323 16324 16325 16326 16327 16328 16329 16330 16331 16332 16333 16334 16335 16336 16337 16338 16339 16340 16341 16342 16343 16344 16345 16346 16347 16348 16349 16350 16351 16352 16353 16354 16355 16356 16357 16358 16359 16360 16361 16362 16363 16364 16365 16366 16367 16368 16369 16370 16371 16372 16373 16374 16375 16376 16377 16378 16379 16380 16381 16382 16383 16384 16385 16386 16387 16388 16389 16390 16391 16392 16393 16394 16395 16396 16397 16398 16399 16400 16401 16402 16403 16404 16405 16406 16407 16408 16409 16410 16411 16412 16413 16414 16415 16416 16417 16418 16419 16420 16421 16422 16423 16424 16425 16426 16427 16428 16429 16430 16431 16432 16433 16434 16435 16436 16437 16438 16439 16440 16441 16442 16443 16444 16445 16446 16447 16448 16449 16450 16451 16452 16453 16454 16455 16456 16457 16458 16459 16460 16461 16462 16463 16464 16465 16466 16467 16468 16469 16470 16471 16472 16473 16474 16475 16476 16477 16478 16479 16480 16481 16482 16483 16484 16485 16486 16487 16488 16489 16490 16491 16492 16493 16494 16495 16496 16497 16498 16499 16500 16501 16502 16503 16504 16505 16506 16507 16508 16509 16510 16511 16512 16513 16514 16515 16516 16517 16518 16519 16520 16521 16522 16523 16524 16525 16526 16527 16528 16529 16530 16531 16532 16533 16534 16535 16536 16537 16538 16539 16540 16541 16542 16543 16544 16545 16546 16547 16548 16549 16550 16551 16552 16553 16554 16555 16556 16557 16558 16559 16560 16561 16562 16563 16564 16565 16566 16567 16568 16569 16570 16571 16572 16573 16574 16575 16576 16577 16578 16579 16580 16581 16582 16583 16584 16585 16586 16587 16588 16589 16590 16591 16592 16593 16594 16595 16596 16597 16598 16599 16600 16601 16602 16603 16604 16605 16606 16607 16608 16609 16610 16611 16612 16613 16614 16615 16616 16617 16618 16619 16620 16621 16622 16623 16624 16625 16626 16627 16628 16629 16630 16631 16632 16633 16634 16635 16636 16637 16638 16639 16640 16641 16642 16643 16644 16645 16646 16647 16648 16649 16650 16651 16652 16653 16654 16655 16656 16657 16658 16659 16660 16661 16662 16663 16664 16665 16666 16667 16668 16669 16670 16671 16672 16673 16674 16675 16676 16677 16678 16679 16680 16681 16682 16683 16684 16685 16686 16687 16688 16689 16690 16691 16692 16693 16694 16695 16696 16697 16698 16699 16700 16701 16702 16703 16704 16705 16706 16707 16708 16709 16710 16711 16712 16713 16714 16715 16716 16717 16718 16719 16720 16721 16722 16723 16724 16725 16726 16727 16728 16729 16730 16731 16732 16733 16734 16735 16736 16737 16738 16739 16740 16741 16742 16743 16744 16745 16746 16747 16748 16749 16750 16751 16752 16753 16754 16755 16756 16757 16758 16759 16760 16761 16762 16763 16764 16765 16766 16767 16768 16769 16770 16771 16772 16773 16774 16775 16776 16777 16778 16779 16780 16781 16782 16783 16784 16785 16786 16787 16788 16789 16790 16791 16792 16793 16794 16795 16796 16797 16798 16799 16800 16801 16802 16803 16804 16805 16806 16807 16808 16809 16810 16811 16812 16813 16814 16815 16816 16817 16818 16819 16820 16821 16822 16823 16824 16825 16826 16827 16828 16829 16830 16831 16832 16833 16834 16835 16836 16837 16838 16839 16840 16841 16842 16843 16844 16845 16846 16847 16848 16849 16850 16851 16852 16853 16854 16855 16856 16857 16858 16859 16860 16861 16862 16863 16864 16865 16866 16867 16868 16869 16870 16871 16872 16873 16874 16875 16876 16877 16878 16879 16880 16881 16882 16883 16884 16885 16886 16887 16888 16889 16890 16891 16892 16893 16894 16895 16896 16897 16898 16899 16900 16901 16902 16903 16904 16905 16906 16907 16908 16909 16910 16911 16912 16913 16914 16915 16916 16917 16918 16919 16920 16921 16922 16923 16924 16925 16926 16927 16928 16929 16930 16931 16932 16933 16934 16935 16936 16937 16938 16939 16940 16941 16942 16943 16944 16945 16946 16947 16948 16949 16950 16951 16952 16953 16954 16955 16956 16957 16958 16959 16960 16961 16962 16963 16964 16965 16966 16967 16968 16969 16970 16971 16972 16973 16974 16975 16976 16977 16978 16979 16980 16981 16982 16983 16984 16985 16986 16987 16988 16989 16990 16991 16992 16993 16994 16995 16996 16997 16998 16999 17000 17001 17002 17003 17004 17005 17006 17007 17008 17009 17010 17011 17012 17013 17014 17015 17016 17017 17018 17019 17020 17021 17022 17023 17024 17025 17026 17027 17028 17029 17030 17031 17032 17033 17034 17035 17036 17037 17038 17039 17040 17041 17042 17043 17044 17045 17046 17047 17048 17049 17050 17051 17052 17053 17054 17055 17056 17057 17058 17059 17060 17061 17062 17063 17064 17065 17066 17067 17068 17069 17070 17071 17072 17073 17074 17075 17076 17077 17078 17079 17080 17081 17082 17083 17084 17085 17086 17087 17088 17089 17090 17091 17092 17093 17094 17095 17096 17097 17098 17099 17100 17101 17102 17103 17104 17105 17106 17107 17108 17109 17110 17111 17112 17113 17114 17115 17116 17117 17118 17119 17120 17121 17122 17123 17124 17125 17126 17127 17128 17129 17130 17131 17132 17133 17134 17135 17136 17137 17138 17139 17140 17141 17142 17143 17144 17145 17146 17147 17148 17149 17150 17151 17152 17153 17154 17155 17156 17157 17158 17159 17160 17161 17162 17163 17164 17165 17166 17167 17168 17169 17170 17171 17172 17173 17174 17175 17176 17177 17178 17179 17180 17181 17182 17183 17184 17185 17186 17187 17188 17189 17190 17191 17192 17193 17194 17195 17196 17197 17198 17199 17200 17201 17202 17203 17204 17205 17206 17207 17208 17209 17210 17211 17212 17213 17214 17215 17216 17217 17218 17219 17220 17221 17222 17223 17224 17225 17226 17227 17228 17229 17230 17231 17232 17233 17234 17235 17236 17237 17238 17239 17240 17241 17242 17243 17244 17245 17246 17247 17248 17249 17250 17251 17252 17253 17254 17255 17256 17257 17258 17259 17260 17261 17262 17263 17264 17265 17266 17267 17268 17269 17270 17271 17272 17273 17274 17275 17276 17277 17278 17279 17280 17281 17282 17283 17284 17285 17286 17287 17288 17289 17290 17291 17292 17293 17294 17295 17296 17297 17298 17299 17300 17301 17302 17303 17304 17305 17306 17307 17308 17309 17310 17311 17312 17313 17314 17315 17316 17317 17318 17319 17320 17321 17322 17323 17324 17325 17326 17327 17328 17329 17330 17331 17332 17333 17334 17335 17336 17337 17338 17339 17340 17341 17342 17343 17344 17345 17346 17347 17348 17349 17350 17351 17352 17353 17354 17355 17356 17357 17358 17359 17360 17361 17362 17363 17364 17365 17366 17367 17368 17369 17370 17371 17372 17373 17374 17375 17376 17377 17378 17379 17380 17381 17382 17383 17384 17385 17386 17387 17388 17389 17390 17391 17392 17393 17394 17395 17396 17397 17398 17399 17400 17401 17402 17403 17404 17405 17406 17407 17408 17409 17410 17411 17412 17413 17414 17415 17416 17417 17418 17419 17420 17421 17422 17423 17424 17425 17426 17427 17428 17429 17430 17431 17432 17433 17434 17435 17436 17437 17438 17439 17440 17441 17442 17443 17444 17445 17446 17447 17448 17449 17450 17451 17452 17453 17454 17455 17456 17457 17458 17459 17460 17461 17462 17463 17464 17465 17466 17467 17468 17469 17470 17471 17472 17473 17474 17475 17476 17477 17478 17479 17480 17481 17482 17483 17484 17485 17486 17487 17488 17489 17490 17491 17492 17493 17494 17495 17496 17497 17498 17499 17500 17501 17502 17503 17504 17505 17506 17507 17508 17509 17510 17511 17512 17513 17514 17515 17516 17517 17518 17519 17520 17521 17522 17523 17524 17525 17526 17527 17528 17529 17530 17531 17532 17533 17534 17535 17536 17537 17538 17539 17540 17541 17542 17543 17544 17545 17546 17547 17548 17549 17550 17551 17552 17553 17554 17555 17556 17557 17558 17559 17560 17561 17562 17563 17564 17565 17566 17567 17568 17569 17570 17571 17572 17573 17574 17575 17576 17577 17578 17579 17580 17581 17582 17583 17584 17585 17586 17587 17588 17589 17590 17591 17592 17593 17594 17595 17596 17597 17598 17599 17600 17601 17602 17603 17604 17605 17606 17607 17608 17609 17610 17611 17612 17613 17614 17615 17616 17617 17618 17619 17620 17621 17622 17623 17624 17625 17626 17627 17628 17629 17630 17631 17632 17633 17634 17635 17636 17637 17638 17639 17640 17641 17642 17643 17644 17645 17646 17647 17648 17649 17650 17651 17652 17653 17654 17655 17656 17657 17658 17659 17660 17661 17662 17663 17664 17665 17666 17667 17668 17669 17670 17671 17672 17673 17674 17675 17676 17677 17678 17679 17680 17681 17682 17683 17684 17685 17686 17687 17688 17689 17690 17691 17692 17693 17694 17695 17696 17697 17698 17699 17700 17701 17702 17703 17704 17705 17706 17707 17708 17709 17710 17711 17712 17713 17714 17715 17716 17717 17718 17719 17720 17721 17722 17723 17724 17725 17726 17727 17728 17729 17730 17731 17732 17733 17734 17735 17736 17737 17738 17739 17740 17741 17742 17743 17744 17745 17746 17747 17748 17749 17750 17751 17752 17753 17754 17755 17756 17757 17758 17759 17760 17761 17762 17763 17764 17765 17766 17767 17768 17769 17770 17771 17772 17773 17774 17775 17776 17777 17778 17779 17780 17781 17782 17783 17784 17785 17786 17787 17788 17789 17790 17791 17792 17793 17794 17795 17796 17797 17798 17799 17800 17801 17802 17803 17804 17805 17806 17807 17808 17809 17810 17811 17812 17813 17814 17815 17816 17817 17818 17819 17820 17821 17822 17823 17824 17825 17826 17827 17828 17829 17830 17831 17832 17833 17834 17835 17836 17837 17838 17839 17840 17841 17842 17843 17844 17845 17846 17847 17848 17849 17850 17851 17852 17853 17854 17855 17856 17857 17858 17859 17860 17861 17862 17863 17864 17865 17866 17867 17868 17869 17870 17871 17872 17873 17874 17875 17876 17877 17878 17879 17880 17881 17882 17883 17884 17885 17886 17887 17888 17889 17890 17891 17892 17893 17894 17895 17896 17897 17898 17899 17900 17901 17902 17903 17904 17905 17906 17907 17908 17909 17910 17911 17912 17913 17914 17915 17916 17917 17918 17919 17920 17921 17922 17923 17924 17925 17926 17927 17928 17929 17930 17931 17932 17933 17934 17935 17936 17937 17938 17939 17940 17941 17942 17943 17944 17945 17946 17947 17948 17949 17950 17951 17952 17953 17954 17955 17956 17957 17958 17959 17960 17961 17962 17963 17964 17965 17966 17967 17968 17969 17970 17971 17972 17973 17974 17975 17976 17977 17978 17979 17980 17981 17982 17983 17984 17985 17986 17987 17988 17989 17990 17991 17992 17993 17994 17995 17996 17997 17998 17999 18000 18001 18002 18003 18004 18005 18006 18007 18008 18009 18010 18011 18012 18013 18014 18015 18016 18017 18018 18019 18020 18021 18022 18023 18024 18025 18026 18027 18028 18029 18030 18031 18032 18033 18034 18035 18036 18037 18038 18039 18040 18041 18042 18043 18044 18045 18046 18047 18048 18049 18050 18051 18052 18053 18054 18055 18056 18057 18058 18059 18060 18061 18062 18063 18064 18065 18066 18067 18068 18069 18070 18071 18072 18073 18074 18075 18076 18077 18078 18079 18080 18081 18082 18083 18084 18085 18086 18087 18088 18089 18090 18091 18092 18093 18094 18095 18096 18097 18098 18099 18100 18101 18102 18103 18104 18105 18106 18107 18108 18109 18110 18111 18112 18113 18114 18115 18116 18117 18118 18119 18120 18121 18122 18123 18124 18125 18126 18127 18128 18129 18130 18131 18132 18133 18134 18135 18136 18137 18138 18139 18140 18141 18142 18143 18144 18145 18146 18147 18148 18149 18150 18151 18152 18153 18154 18155 18156 18157 18158 18159 18160 18161 18162 18163 18164 18165 18166 18167 18168 18169 18170 18171 18172 18173 18174 18175 18176 18177 18178 18179 18180 18181 18182 18183 18184 18185 18186 18187 18188 18189 18190 18191 18192 18193 18194 18195 18196 18197 18198 18199 18200 18201 18202 18203 18204 18205 18206 18207 18208 18209 18210 18211 18212 18213 18214 18215 18216 18217 18218 18219 18220 18221 18222 18223 18224 18225 18226 18227 18228 18229 18230 18231 18232 18233 18234 18235 18236 18237 18238 18239 18240 18241 18242 18243 18244 18245 18246 18247 18248 18249 18250 18251 18252 18253 18254 18255 18256 18257 18258 18259 18260 18261 18262 18263 18264 18265 18266 18267 18268 18269 18270 18271 18272 18273 18274 18275 18276 18277 18278 18279 18280 18281 18282 18283 18284 18285 18286 18287 18288 18289 18290 18291 18292 18293 18294 18295 18296 18297 18298 18299 18300 18301 18302 18303 18304 18305 18306 18307 18308 18309 18310 18311 18312 18313 18314 18315 18316 18317 18318 18319 18320 18321 18322 18323 18324 18325 18326 18327 18328 18329 18330 18331 18332 18333 18334 18335 18336 18337 18338 18339 18340 18341 18342 18343 18344 18345 18346 18347 18348 18349 18350 18351 18352 18353 18354 18355 18356 18357 18358 18359 18360 18361 18362 18363 18364 18365 18366 18367 18368 18369 18370 18371 18372 18373 18374 18375 18376 18377 18378 18379 18380 18381 18382 18383 18384 18385 18386 18387 18388 18389 18390 18391 18392 18393 18394 18395 18396 18397 18398 18399 18400 18401 18402 18403 18404 18405 18406 18407 18408 18409 18410 18411 18412 18413 18414 18415 18416 18417 18418 18419 18420 18421 18422 18423 18424 18425 18426 18427 18428 18429 18430 18431 18432 18433 18434 18435 18436 18437 18438 18439 18440 18441 18442 18443 18444 18445 18446 18447 18448 18449 18450 18451 18452 18453 18454 18455 18456 18457 18458 18459 18460 18461 18462 18463 18464 18465 18466 18467 18468 18469 18470 18471 18472 18473 18474 18475 18476 18477 18478 18479 18480 18481 18482 18483 18484 18485 18486 18487 18488 18489 18490 18491 18492 18493 18494 18495 18496 18497 18498 18499 18500 18501 18502 18503 18504 18505 18506 18507 18508 18509 18510 18511 18512 18513 18514 18515 18516 18517 18518 18519 18520 18521 18522 18523 18524 18525 18526 18527 18528 18529 18530 18531 18532 18533 18534 18535 18536 18537 18538 18539 18540 18541 18542 18543 18544 18545 18546 18547 18548 18549 18550 18551 18552 18553 18554 18555 18556 18557 18558 18559 18560 18561 18562 18563 18564 18565 18566 18567 18568 18569 18570 18571 18572 18573 18574 18575 18576 18577 18578 18579 18580 18581 18582 18583 18584 18585 18586 18587 18588 18589 18590 18591 18592 18593 18594 18595 18596 18597 18598 18599 18600 18601 18602 18603 18604 18605 18606 18607 18608 18609 18610 18611 18612 18613 18614 18615 18616 18617 18618 18619 18620 18621 18622 18623 18624 18625 18626 18627 18628 18629 18630 18631 18632 18633 18634 18635 18636 18637 18638 18639 18640 18641 18642 18643 18644 18645 18646 18647 18648 18649 18650 18651 18652 18653 18654 18655 18656 18657 18658 18659 18660 18661 18662 18663 18664 18665 18666 18667 18668 18669 18670 18671 18672 18673 18674 18675 18676 18677 18678 18679 18680 18681 18682 18683 18684 18685 18686 18687 18688 18689 18690 18691 18692 18693 18694 18695 18696 18697 18698 18699 18700 18701 18702 18703 18704 18705 18706 18707 18708 18709 18710 18711 18712 18713 18714 18715 18716 18717 18718 18719 18720 18721 18722 18723 18724 18725 18726 18727 18728 18729 18730 18731 18732 18733 18734 18735 18736 18737 18738 18739 18740 18741 18742 18743 18744 18745 18746 18747 18748 18749 18750 18751 18752 18753 18754 18755 18756 18757 18758 18759 18760 18761 18762 18763 18764 18765 18766 18767 18768 18769 18770 18771 18772 18773 18774 18775 18776 18777 18778 18779 18780 18781 18782 18783 18784 18785 18786 18787 18788 18789 18790 18791 18792 18793 18794 18795 18796 18797 18798 18799 18800 18801 18802 18803 18804 18805 18806 18807 18808 18809 18810 18811 18812 18813 18814 18815 18816 18817 18818 18819 18820 18821 18822 18823 18824 18825 18826 18827 18828 18829 18830 18831 18832 18833 18834 18835 18836 18837 18838 18839 18840 18841 18842 18843 18844 18845 18846 18847 18848 18849 18850 18851 18852 18853 18854 18855 18856 18857 18858 18859 18860 18861 18862 18863 18864 18865 18866 18867 18868 18869 18870 18871 18872 18873 18874 18875 18876 18877 18878 18879 18880 18881 18882 18883 18884 18885 18886 18887 18888 18889 18890 18891 18892 18893 18894 18895 18896 18897 18898 18899 18900 18901 18902 18903 18904 18905 18906 18907 18908 18909 18910 18911 18912 18913 18914 18915 18916 18917 18918 18919 18920 18921 18922 18923 18924 18925 18926 18927 18928 18929 18930 18931 18932 18933 18934 18935 18936 18937 18938 18939 18940 18941 18942 18943 18944 18945 18946 18947 18948 18949 18950 18951 18952 18953 18954 18955 18956 18957 18958 18959 18960 18961 18962 18963 18964 18965 18966 18967 18968 18969 18970 18971 18972 18973 18974 18975 18976 18977 18978 18979 18980 18981 18982 18983 18984 18985 18986 18987 18988 18989 18990 18991 18992 18993 18994 18995 18996 18997 18998 18999 19000 19001 19002 19003 19004 19005 19006 19007 19008 19009 19010 19011 19012 19013 19014 19015 19016 19017 19018 19019 19020 19021 19022 19023 19024 19025 19026 19027 19028 19029 19030 19031 19032 19033 19034 19035 19036 19037 19038 19039 19040 19041 19042 19043 19044 19045 19046 19047 19048 19049 19050 19051 19052 19053 19054 19055 19056 19057 19058 19059 19060 19061 19062 19063 19064 19065 19066 19067 19068 19069 19070 19071 19072 19073 19074 19075 19076 19077 19078 19079 19080 19081 19082 19083 19084 19085 19086 19087 19088 19089 19090 19091 19092 19093 19094 19095 19096 19097 19098 19099 19100 19101 19102 19103 19104 19105 19106 19107 19108 19109 19110 19111 19112 19113 19114 19115 19116 19117 19118 19119 19120 19121 19122 19123 19124 19125 19126 19127 19128 19129 19130 19131 19132 19133 19134 19135 19136 19137 19138 19139 19140 19141 19142 19143 19144 19145 19146 19147 19148 19149 19150 19151 19152 19153 19154 19155 19156 19157 19158 19159 19160 19161 19162 19163 19164 19165 19166 19167 19168 19169 19170 19171 19172 19173 19174 19175 19176 19177 19178 19179 19180 19181 19182 19183 19184 19185 19186 19187 19188 19189 19190 19191 19192 19193 19194 19195 19196 19197 19198 19199 19200 19201 19202 19203 19204 19205 19206 19207 19208 19209 19210 19211 19212 19213 19214 19215 19216 19217 19218 19219 19220 19221 19222 19223 19224 19225 19226 19227 19228 19229 19230 19231 19232 19233 19234 19235 19236 19237 19238 19239 19240 19241 19242 19243 19244 19245 19246 19247 19248 19249 19250 19251 19252 19253 19254 19255 19256 19257 19258 19259 19260 19261 19262 19263 19264 19265 19266 19267 19268 19269 19270 19271 19272 19273 19274 19275 19276 19277 19278 19279 19280 19281 19282 19283 19284 19285 19286 19287 19288 19289 19290 19291 19292 19293 19294 19295 19296 19297 19298 19299 19300 19301 19302 19303 19304 19305 19306 19307 19308 19309 19310 19311 19312 19313 19314 19315 19316 19317 19318 19319 19320 19321 19322 19323 19324 19325 19326 19327 19328 19329 19330 19331 19332 19333 19334 19335 19336 19337 19338 19339 19340 19341 19342 19343 19344 19345 19346 19347 19348 19349 19350 19351 19352 19353 19354 19355 19356 19357 19358 19359 19360 19361 19362 19363 19364 19365 19366 19367 19368 19369 19370 19371 19372 19373 19374 19375 19376 19377 19378 19379 19380 19381 19382 19383 19384 19385 19386 19387 19388 19389 19390 19391 19392 19393 19394 19395 19396 19397 19398 19399 19400 19401 19402 19403 19404 19405 19406 19407 19408 19409 19410 19411 19412 19413 19414 19415 19416 19417 19418 19419 19420 19421 19422 19423 19424 19425 19426 19427 19428 19429 19430 19431 19432 19433 19434 19435 19436 19437 19438 19439 19440 19441 19442 19443 19444 19445 19446 19447 19448 19449 19450 19451 19452 19453 19454 19455 19456 19457 19458 19459 19460 19461 19462 19463 19464 19465 19466 19467 19468 19469 19470 19471 19472 19473 19474 19475 19476 19477 19478 19479 19480 19481 19482 19483 19484 19485 19486 19487 19488 19489 19490 19491 19492 19493 19494 19495 19496 19497 19498 19499 19500 19501 19502 19503 19504 19505 19506 19507 19508 19509 19510 19511 19512 19513 19514 19515 19516 19517 19518 19519 19520 19521 19522 19523 19524 19525 19526 19527 19528 19529 19530 19531 19532 19533 19534 19535 19536 19537 19538 19539 19540 19541 19542 19543 19544 19545 19546 19547 19548 19549 19550 19551 19552 19553 19554 19555 19556 19557 19558 19559 19560 19561 19562 19563 19564 19565 19566 19567 19568 19569 19570 19571 19572 19573 19574 19575 19576 19577 19578 19579 19580 19581 19582 19583 19584 19585 19586 19587 19588 19589 19590 19591 19592 19593 19594 19595 19596 19597 19598 19599 19600 19601 19602 19603 19604 19605 19606 19607 19608 19609 19610 19611 19612 19613 19614 19615 19616 19617 19618 19619 19620 19621 19622 19623 19624 19625 19626 19627 19628 19629 19630 19631 19632 19633 19634 19635 19636 19637 19638 19639 19640 19641 19642 19643 19644 19645 19646 19647 19648 19649 19650 19651 19652 19653 19654 19655 19656 19657 19658 19659 19660 19661 19662 19663 19664 19665 19666 19667 19668 19669 19670 19671 19672 19673 19674 19675 19676 19677 19678 19679 19680 19681 19682 19683 19684 19685 19686 19687 19688 19689 19690 19691 19692 19693 19694 19695 19696 19697 19698 19699 19700 19701 19702 19703 19704 19705 19706 19707 19708 19709 19710 19711 19712 19713 19714 19715 19716 19717 19718 19719 19720 19721 19722 19723 19724 19725 19726 19727 19728 19729 19730 19731 19732 19733 19734 19735 19736 19737 19738 19739 19740 19741 19742 19743 19744 19745 19746 19747 19748 19749 19750 19751 19752 19753 19754 19755 19756 19757 19758 19759 19760 19761 19762 19763 19764 19765 19766 19767 19768 19769 19770 19771 19772 19773 19774 19775 19776 19777 19778 19779 19780 19781 19782 19783 19784 19785 19786 19787 19788 19789 19790 19791 19792 19793 19794 19795 19796 19797 19798 19799 19800 19801 19802 19803 19804 19805 19806 19807 19808 19809 19810 19811 19812 19813 19814 19815 19816 19817 19818 19819 19820 19821 19822 19823 19824 19825 19826 19827 19828 19829 19830 19831 19832 19833 19834 19835 19836 19837 19838 19839 19840 19841 19842 19843 19844 19845 19846 19847 19848 19849 19850 19851 19852 19853 19854 19855 19856 19857 19858 19859 19860 19861 19862 19863 19864 19865 19866 19867 19868 19869 19870 19871 19872 19873 19874 19875 19876 19877 19878 19879 19880 19881 19882 19883 19884 19885 19886 19887 19888 19889 19890 19891 19892 19893 19894 19895 19896 19897 19898 19899 19900 19901 19902 19903 19904 19905 19906 19907 19908 19909 19910 19911 19912 19913 19914 19915 19916 19917 19918 19919 19920 19921 19922 19923 19924 19925 19926 19927 19928 19929 19930 19931 19932 19933 19934 19935 19936 19937 19938 19939 19940 19941 19942 19943 19944 19945 19946 19947 19948 19949 19950 19951 19952 19953 19954 19955 19956 19957 19958 19959 19960 19961 19962 19963 19964 19965 19966 19967 19968 19969 19970 19971 19972 19973 19974 19975 19976 19977 19978 19979 19980 19981 19982 19983 19984 19985 19986 19987 19988 19989 19990 19991 19992 19993 19994 19995 19996 19997 19998 19999 20000 20001 20002 20003 20004 20005 20006 20007 20008 20009 20010 20011 20012 20013 20014 20015 20016 20017 20018 20019 20020 20021 20022 20023 20024 20025 20026 20027 20028 20029 20030 20031 20032 20033 20034 20035 20036 20037 20038 20039 20040 20041 20042 20043 20044 20045 20046 20047 20048 20049 20050 20051 20052 20053 20054 20055 20056 20057 20058 20059 20060 20061 20062 20063 20064 20065 20066 20067 20068 20069 20070 20071 20072 20073 20074 20075 20076 20077 20078 20079 20080 20081 20082 20083 20084 20085 20086 20087 20088 20089 20090 20091 20092 20093 20094 20095 20096 20097 20098 20099 20100 20101 20102 20103 20104 20105 20106 20107 20108 20109 20110 20111 20112 20113 20114 20115 20116 20117 20118 20119 20120 20121 20122 20123 20124 20125 20126 20127 20128 20129 20130 20131 20132 20133 20134 20135 20136 20137 20138 20139 20140 20141 20142 20143 20144 20145 20146 20147 20148 20149 20150 20151 20152 20153 20154 20155 20156 20157 20158 20159 20160 20161 20162 20163 20164 20165 20166 20167 20168 20169 20170 20171 20172 20173 20174 20175 20176 20177 20178 20179 20180 20181 20182 20183 20184 20185 20186 20187 20188 20189 20190 20191 20192 20193 20194 20195 20196 20197 20198 20199 20200 20201 20202 20203 20204 20205 20206 20207 20208 20209 20210 20211 20212 20213 20214 20215 20216 20217 20218 20219 20220 20221 20222 20223 20224 20225 20226 20227 20228 20229 20230 20231 20232 20233 20234 20235 20236 20237 20238 20239 20240 20241 20242 20243 20244 20245 20246 20247 20248 20249 20250 20251 20252 20253 20254 20255 20256 20257 20258 20259 20260 20261 20262 20263 20264 20265 20266 20267 20268 20269 20270 20271 20272 20273 20274 20275 20276 20277 20278 20279 20280 20281 20282 20283 20284 20285 20286 20287 20288 20289 20290 20291 20292 20293 20294 20295 20296 20297 20298 20299 20300 20301 20302 20303 20304 20305 20306 20307 20308 20309 20310 20311 20312 20313 20314 20315 20316 20317 20318 20319 20320 20321 20322 20323 20324 20325 20326 20327 20328 20329 20330 20331 20332 20333 20334 20335 20336 20337 20338 20339 20340 20341 20342 20343 20344 20345 20346 20347 20348 20349 20350 20351 20352 20353 20354 20355 20356 20357 20358 20359 20360 20361 20362 20363 20364 20365 20366 20367 20368 20369 20370 20371 20372 20373 20374 20375 20376 20377 20378 20379 20380 20381 20382 20383 20384 20385 20386 20387 20388 20389 20390 20391 20392 20393 20394 20395 20396 20397 20398 20399 20400 20401 20402 20403 20404 20405 20406 20407 20408 20409 20410 20411 20412 20413 20414 20415 20416 20417 20418 20419 20420 20421 20422 20423 20424 20425 20426 20427 20428 20429 20430 20431 20432 20433 20434 20435 20436 20437 20438 20439 20440 20441 20442 20443 20444 20445 20446 20447 20448 20449 20450 20451 20452 20453 20454 20455 20456 20457 20458 20459 20460 20461 20462 20463 20464 20465 20466 20467 20468 20469 20470 20471 20472 20473 20474 20475 20476 20477 20478 20479 20480 20481 20482 20483 20484 20485 20486 20487 20488 20489 20490 20491 20492 20493 20494 20495 20496 20497 20498 20499 20500 20501 20502 20503 20504 20505 20506 20507 20508 20509 20510 20511 20512 20513 20514 20515 20516 20517 20518 20519 20520 20521 20522 20523 20524 20525 20526 20527 20528 20529 20530 20531 20532 20533 20534 20535 20536 20537 20538 20539 20540 20541 20542 20543 20544 20545 20546 20547 20548 20549 20550 20551 20552 20553 20554 20555 20556 20557 20558 20559 20560 20561 20562 20563 20564 20565 20566 20567 20568 20569 20570 20571 20572 20573 20574 20575 20576 20577 20578 20579 20580 20581 20582 20583 20584 20585 20586 20587 20588 20589 20590 20591 20592 20593 20594 20595 20596 20597 20598 20599 20600 20601 20602 20603 20604 20605 20606 20607 20608 20609 20610 20611 20612 20613 20614 20615 20616 20617 20618 20619 20620 20621 20622 20623 20624 20625 20626 20627 20628 20629 20630 20631 20632 20633 20634 20635 20636 20637 20638 20639 20640 20641 20642 20643 20644 20645 20646 20647 20648 20649 20650 20651 20652 20653 20654 20655 20656 20657 20658 20659 20660 20661 20662 20663 20664 20665 20666 20667 20668 20669 20670 20671 20672 20673 20674 20675 20676 20677 20678 20679 20680 20681 20682 20683 20684 20685 20686 20687 20688 20689 20690 20691 20692 20693 20694 20695 20696 20697 20698 20699 20700 20701 20702 20703 20704 20705 20706 20707 20708 20709 20710 20711 20712 20713 20714 20715 20716 20717 20718 20719 20720 20721 20722 20723 20724 20725 20726 20727 20728 20729 20730 20731 20732 20733 20734 20735 20736 20737 20738 20739 20740 20741 20742 20743 20744 20745 20746 20747 20748 20749 20750 20751 20752 20753 20754 20755 20756 20757 20758 20759 20760 20761 20762 20763 20764 20765 20766 20767 20768 20769 20770 20771 20772 20773 20774 20775 20776 20777 20778 20779 20780 20781 20782 20783 20784 20785 20786 20787 20788 20789 20790 20791 20792 20793 20794 20795 20796 20797 20798 20799 20800 20801 20802 20803 20804 20805 20806 20807 20808 20809 20810 20811 20812 20813 20814 20815 20816 20817 20818 20819 20820 20821 20822 20823 20824 20825 20826 20827 20828 20829 20830 20831 20832 20833 20834 20835 20836 20837 20838 20839 20840 20841 20842 20843 20844 20845 20846 20847 20848 20849 20850 20851 20852 20853 20854 20855 20856 20857 20858 20859 20860 20861 20862 20863 20864 20865 20866 20867 20868 20869 20870 20871 20872 20873 20874 20875 20876 20877 20878 20879 20880 20881 20882 20883 20884 20885 20886 20887 20888 20889 20890 20891 20892 20893 20894 20895 20896 20897 20898 20899 20900 20901 20902 20903 20904 20905 20906 20907 20908 20909 20910 20911 20912 20913 20914 20915 20916 20917 20918 20919 20920 20921 20922 20923 20924 20925 20926 20927 20928 20929 20930 20931 20932 20933 20934 20935 20936 20937 20938 20939 20940 20941 20942 20943 20944 20945 20946 20947 20948 20949 20950 20951 20952 20953 20954 20955 20956 20957 20958 20959 20960 20961 20962 20963 20964 20965 20966 20967 20968 20969 20970 20971 20972 20973 20974 20975 20976 20977 20978 20979 20980 20981 20982 20983 20984 20985 20986 20987 20988 20989 20990 20991 20992 20993 20994 20995 20996 20997 20998 20999 21000 21001 21002 21003 21004 21005 21006 21007 21008 21009 21010 21011 21012 21013 21014 21015 21016 21017 21018 21019 21020 21021 21022 21023 21024 21025 21026 21027 21028 21029 21030 21031 21032 21033 21034 21035 21036 21037 21038 21039 21040 21041 21042 21043 21044 21045 21046 21047 21048 21049 21050 21051 21052 21053 21054 21055 21056 21057 21058 21059 21060 21061 21062 21063 21064 21065 21066 21067 21068 21069 21070 21071 21072 21073 21074 21075 21076 21077 21078 21079 21080 21081 21082 21083 21084 21085 21086 21087 21088 21089 21090 21091 21092 21093 21094 21095 21096 21097 21098 21099 21100 21101 21102 21103 21104 21105 21106 21107 21108 21109 21110 21111 21112 21113 21114 21115 21116 21117 21118 21119 21120 21121 21122 21123 21124 21125 21126 21127 21128 21129 21130 21131 21132 21133 21134 21135 21136 21137 21138 21139 21140 21141 21142 21143 21144 21145 21146 21147 21148 21149 21150 21151 21152 21153 21154 21155 21156 21157 21158 21159 21160 21161 21162 21163 21164 21165 21166 21167 21168 21169 21170 21171 21172 21173 21174 21175 21176 21177 21178 21179 21180 21181 21182 21183 21184 21185 21186 21187 21188 21189 21190 21191 21192 21193 21194 21195 21196 21197 21198 21199 21200 21201 21202 21203 21204 21205 21206 21207 21208 21209 21210 21211 21212 21213 21214 21215 21216 21217 21218 21219 21220 21221 21222 21223 21224 21225 21226 21227 21228 21229 21230 21231 21232 21233 21234 21235 21236 21237 21238 21239 21240 21241 21242 21243 21244 21245 21246 21247 21248 21249 21250 21251 21252 21253 21254 21255 21256 21257 21258 21259 21260 21261 21262 21263 21264 21265 21266 21267 21268 21269 21270 21271 21272 21273 21274 21275 21276 21277 21278 21279 21280 21281 21282 21283 21284 21285 21286 21287 21288 21289 21290 21291 21292 21293 21294 21295 21296 21297 21298 21299 21300 21301 21302 21303 21304 21305 21306 21307 21308 21309 21310 21311 21312 21313 21314 21315 21316 21317 21318 21319 21320 21321 21322 21323 21324 21325 21326 21327 21328 21329 21330 21331 21332 21333 21334 21335 21336 21337 21338 21339 21340 21341 21342 21343 21344 21345 21346 21347 21348 21349 21350 21351 21352 21353 21354 21355 21356 21357 21358 21359 21360 21361 21362 21363 21364 21365 21366 21367 21368 21369 21370 21371 21372 21373 21374 21375 21376 21377 21378 21379 21380 21381 21382 21383 21384 21385 21386 21387 21388 21389 21390 21391 21392 21393 21394 21395 21396 21397 21398 21399 21400 21401 21402 21403 21404 21405 21406 21407 21408 21409 21410 21411 21412 21413 21414 21415 21416 21417 21418 21419 21420 21421 21422 21423 21424 21425 21426 21427 21428 21429 21430 21431 21432 21433 21434 21435 21436 21437 21438 21439 21440 21441 21442 21443 21444 21445 21446 21447 21448 21449 21450 21451 21452 21453 21454 21455 21456 21457 21458 21459 21460 21461 21462 21463 21464 21465 21466 21467 21468 21469 21470 21471 21472 21473 21474 21475 21476 21477 21478 21479 21480 21481 21482 21483 21484 21485 21486 21487 21488 21489 21490 21491 21492 21493 21494 21495 21496 21497 21498 21499 21500 21501 21502 21503 21504 21505 21506 21507 21508 21509 21510 21511 21512 21513 21514 21515 21516 21517 21518 21519 21520 21521 21522 21523 21524 21525 21526 21527 21528 21529 21530 21531 21532 21533 21534 21535 21536 21537 21538 21539 21540 21541 21542 21543 21544 21545 21546 21547 21548 21549 21550 21551 21552 21553 21554 21555 21556 21557 21558 21559 21560 21561 21562 21563 21564 21565 21566 21567 21568 21569 21570 21571 21572 21573 21574 21575 21576 21577 21578 21579 21580 21581 21582 21583 21584 21585 21586 21587 21588 21589 21590 21591 21592 21593 21594 21595 21596 21597 21598 21599 21600 21601 21602 21603 21604 21605 21606 21607 21608 21609 21610 21611 21612 21613 21614 21615 21616 21617 21618 21619 21620 21621 21622 21623 21624 21625 21626 21627 21628 21629 21630 21631 21632 21633 21634 21635 21636 21637 21638 21639 21640 21641 21642 21643 21644 21645 21646 21647 21648 21649 21650 21651 21652 21653 21654 21655 21656 21657 21658 21659 21660 21661 21662 21663 21664 21665 21666 21667 21668 21669 21670 21671 21672 21673 21674 21675 21676 21677 21678 21679 21680 21681 21682 21683 21684 21685 21686 21687 21688 21689 21690 21691 21692 21693 21694 21695 21696 21697 21698 21699 21700 21701 21702 21703 21704 21705 21706 21707 21708 21709 21710 21711 21712 21713 21714 21715 21716 21717 21718 21719 21720 21721 21722 21723 21724 21725 21726 21727 21728 21729 21730 21731 21732 21733 21734 21735 21736 21737 21738 21739 21740 21741 21742 21743 21744 21745 21746 21747 21748 21749 21750 21751 21752 21753 21754 21755 21756 21757 21758 21759 21760 21761 21762 21763 21764 21765 21766 21767 21768 21769 21770 21771 21772 21773 21774 21775 21776 21777 21778 21779 21780 21781 21782 21783 21784 21785 21786 21787 21788 21789 21790 21791 21792 21793 21794 21795 21796 21797 21798 21799 21800 21801 21802 21803 21804 21805 21806 21807 21808 21809 21810 21811 21812 21813 21814 21815 21816 21817 21818 21819 21820 21821 21822 21823 21824 21825 21826 21827 21828 21829 21830 21831 21832 21833 21834 21835 21836 21837 21838 21839 21840 21841 21842 21843 21844 21845 21846 21847 21848 21849 21850 21851 21852 21853 21854 21855 21856 21857 21858 21859 21860 21861 21862 21863 21864 21865 21866 21867 21868 21869 21870 21871 21872 21873 21874 21875 21876 21877 21878 21879 21880 21881 21882 21883 21884 21885 21886 21887 21888 21889 21890 21891 21892 21893 21894 21895 21896 21897 21898 21899 21900 21901 21902 21903 21904 21905 21906 21907 21908 21909 21910 21911 21912 21913 21914 21915 21916 21917 21918 21919 21920 21921 21922 21923 21924 21925 21926 21927 21928 21929 21930 21931 21932 21933 21934 21935 21936 21937 21938 21939 21940 21941 21942 21943 21944 21945 21946 21947 21948 21949 21950 21951 21952 21953 21954 21955 21956 21957 21958 21959 21960 21961 21962 21963 21964 21965 21966 21967 21968 21969 21970 21971 21972 21973 21974 21975 21976 21977 21978 21979 21980 21981 21982 21983 21984 21985 21986 21987 21988 21989 21990 21991 21992 21993 21994 21995 21996 21997 21998 21999 22000 22001 22002 22003 22004 22005 22006 22007 22008 22009 22010 22011 22012 22013 22014 22015 22016 22017 22018 22019 22020 22021 22022 22023 22024 22025 22026 22027 22028 22029 22030 22031 22032 22033 22034 22035 22036 22037 22038 22039 22040 22041 22042 22043 22044 22045 22046 22047 22048 22049 22050 22051 22052 22053 22054 22055 22056 22057 22058 22059 22060 22061 22062 22063 22064 22065 22066 22067 22068 22069 22070 22071 22072 22073 22074 22075 22076 22077 22078 22079 22080 22081 22082 22083 22084 22085 22086 22087 22088 22089 22090 22091 22092 22093 22094 22095 22096 22097 22098 22099 22100 22101 22102 22103 22104 22105 22106 22107 22108 22109 22110 22111 22112 22113 22114 22115 22116 22117 22118 22119 22120 22121 22122 22123 22124 22125 22126 22127 22128 22129 22130 22131 22132 22133 22134 22135 22136 22137 22138 22139 22140 22141 22142 22143 22144 22145 22146 22147 22148 22149 22150 22151 22152 22153 22154 22155 22156 22157 22158 22159 22160 22161 22162 22163 22164 22165 22166 22167 22168 22169 22170 22171 22172 22173 22174 22175 22176 22177 22178 22179 22180 22181 22182 22183 22184 22185 22186 22187 22188 22189 22190 22191 22192 22193 22194 22195 22196 22197 22198 22199 22200 22201 22202 22203 22204 22205 22206 22207 22208 22209 22210 22211 22212 22213 22214 22215 22216 22217 22218 22219 22220 22221 22222 22223 22224 22225 22226 22227 22228 22229 22230 22231 22232 22233 22234 22235 22236 22237 22238 22239 22240 22241 22242 22243 22244 22245 22246 22247 22248 22249 22250 22251 22252 22253 22254 22255 22256 22257 22258 22259 22260 22261 22262 22263 22264 22265 22266 22267 22268 22269 22270 22271 22272 22273 22274 22275 22276 22277 22278 22279 22280 22281 22282 22283 22284 22285 22286 22287 22288 22289 22290 22291 22292 22293 22294 22295 22296 22297 22298 22299 22300 22301 22302 22303 22304 22305 22306 22307 22308 22309 22310 22311 22312 22313 22314 22315 22316 22317 22318 22319 22320 22321 22322 22323 22324 22325 22326 22327 22328 22329 22330 22331 22332 22333 22334 22335 22336 22337 22338 22339 22340 22341 22342 22343 22344 22345 22346 22347 22348 22349 22350 22351 22352 22353 22354 22355 22356 22357 22358 22359 22360 22361 22362 22363 22364 22365 22366 22367 22368 22369 22370 22371 22372 22373 22374 22375 22376 22377 22378 22379 22380 22381 22382 22383 22384 22385 22386 22387 22388 22389 22390 22391 22392 22393 22394 22395 22396 22397 22398 22399 22400 22401 22402 22403 22404 22405 22406 22407 22408 22409 22410 22411 22412 22413 22414 22415 22416 22417 22418 22419 22420 22421 22422 22423 22424 22425 22426 22427 22428 22429 22430 22431 22432 22433 22434 22435 22436 22437 22438 22439 22440 22441 22442 22443 22444 22445 22446 22447 22448 22449 22450 22451 22452 22453 22454 22455 22456 22457 22458 22459 22460 22461 22462 22463 22464 22465 22466 22467 22468 22469 22470 22471 22472 22473 22474 22475 22476 22477 22478 22479 22480 22481 22482 22483 22484 22485 22486 22487 22488 22489 22490 22491 22492 22493 22494 22495 22496 22497 22498 22499 22500 22501 22502 22503 22504 22505 22506 22507 22508 22509 22510 22511 22512 22513 22514 22515 22516 22517 22518 22519 22520 22521 22522 22523 22524 22525 22526 22527 22528 22529 22530 22531 22532 22533 22534 22535 22536 22537 22538 22539 22540 22541 22542 22543 22544 22545 22546 22547 22548 22549 22550 22551 22552 22553 22554 22555 22556 22557 22558 22559 22560 22561 22562 22563 22564 22565 22566 22567 22568 22569 22570 22571 22572 22573 22574 22575 22576 22577 22578 22579 22580 22581 22582 22583 22584 22585 22586 22587 22588 22589 22590 22591 22592 22593 22594 22595 22596 22597 22598 22599 22600 22601 22602 22603 22604 22605 22606 22607 22608 22609 22610 22611 22612 22613 22614 22615 22616 22617 22618 22619 22620 22621 22622 22623 22624 22625 22626 22627 22628 22629 22630 22631 22632 22633 22634 22635 22636 22637 22638 22639 22640 22641 22642 22643 22644 22645 22646 22647 22648 22649 22650 22651 22652 22653 22654 22655 22656 22657 22658 22659 22660 22661 22662 22663 22664 22665 22666 22667 22668 22669 22670 22671 22672 22673 22674 22675 22676 22677 22678 22679 22680 22681 22682 22683 22684 22685 22686 22687 22688 22689 22690 22691 22692 22693 22694 22695 22696 22697 22698 22699 22700 22701 22702 22703 22704 22705 22706 22707 22708 22709 22710 22711 22712 22713 22714 22715 22716 22717 22718 22719 22720 22721 22722 22723 22724 22725 22726 22727 22728 22729 22730 22731 22732 22733 22734 22735 22736 22737 22738 22739 22740 22741 22742 22743 22744 22745 22746 22747 22748 22749 22750 22751 22752 22753 22754 22755 22756 22757 22758 22759 22760 22761 22762 22763 22764 22765 22766 22767 22768 22769 22770 22771 22772 22773 22774 22775 22776 22777 22778 22779 22780 22781 22782 22783 22784 22785 22786 22787 22788 22789 22790 22791 22792 22793 22794 22795 22796 22797 22798 22799 22800 22801 22802 22803 22804 22805 22806 22807 22808 22809 22810 22811 22812 22813 22814 22815 22816 22817 22818 22819 22820 22821 22822 22823 22824 22825 22826 22827 22828 22829 22830 22831 22832 22833 22834 22835 22836 22837 22838 22839 22840 22841 22842 22843 22844 22845 22846 22847 22848 22849 22850 22851 22852 22853 22854 22855 22856 22857 22858 22859 22860 22861 22862 22863 22864 22865 22866 22867 22868 22869 22870 22871 22872 22873 22874 22875 22876 22877 22878 22879 22880 22881 22882 22883 22884 22885 22886 22887 22888 22889 22890 22891 22892 22893 22894 22895 22896 22897 22898 22899 22900 22901 22902 22903 22904 22905 22906 22907 22908 22909 22910 22911 22912 22913 22914 22915 22916 22917 22918 22919 22920 22921 22922 22923 22924 22925 22926 22927 22928 22929 22930 22931 22932 22933 22934 22935 22936 22937 22938 22939 22940 22941 22942 22943 22944 22945 22946 22947 22948 22949 22950 22951 22952 22953 22954 22955 22956 22957 22958 22959 22960 22961 22962 22963 22964 22965 22966 22967 22968 22969 22970 22971 22972 22973 22974 22975 22976 22977 22978 22979 22980 22981 22982 22983 22984 22985 22986 22987 22988 22989 22990 22991 22992 22993 22994 22995 22996 22997 22998 22999 23000 23001 23002 23003 23004 23005 23006 23007 23008 23009 23010 23011 23012 23013 23014 23015 23016 23017 23018 23019 23020 23021 23022 23023 23024 23025 23026 23027 23028 23029 23030 23031 23032 23033 23034 23035 23036 23037 23038 23039 23040 23041 23042 23043 23044 23045 23046 23047 23048 23049 23050 23051 23052 23053 23054 23055 23056 23057 23058 23059 23060 23061 23062 23063 23064 23065 23066 23067 23068 23069 23070 23071 23072 23073 23074 23075 23076 23077 23078 23079 23080 23081 23082 23083 23084 23085 23086 23087 23088 23089 23090 23091 23092 23093 23094 23095 23096 23097 23098 23099 23100 23101 23102 23103 23104 23105 23106 23107 23108 23109 23110 23111 23112 23113 23114 23115 23116 23117 23118 23119 23120 23121 23122 23123 23124 23125 23126 23127 23128 23129 23130 23131 23132 23133 23134 23135 23136 23137 23138 23139 23140 23141 23142 23143 23144 23145 23146 23147 23148 23149 23150 23151 23152 23153 23154 23155 23156 23157 23158 23159 23160 23161 23162 23163 23164 23165 23166 23167 23168 23169 23170 23171 23172 23173 23174 23175 23176 23177 23178 23179 23180 23181 23182 23183 23184 23185 23186 23187 23188 23189 23190 23191 23192 23193 23194 23195 23196 23197 23198 23199 23200 23201 23202 23203 23204 23205 23206 23207 23208 23209 23210 23211 23212 23213 23214 23215 23216 23217 23218 23219 23220 23221 23222 23223 23224 23225 23226 23227 23228 23229 23230 23231 23232 23233 23234 23235 23236 23237 23238 23239 23240 23241 23242 23243 23244 23245 23246 23247 23248 23249 23250 23251 23252 23253 23254 23255 23256 23257 23258 23259 23260 23261 23262 23263 23264 23265 23266 23267 23268 23269 23270 23271 23272 23273 23274 23275 23276 23277 23278 23279 23280 23281 23282 23283 23284 23285 23286 23287 23288 23289 23290 23291 23292 23293 23294 23295 23296 23297 23298 23299 23300 23301 23302 23303 23304 23305 23306 23307 23308 23309 23310 23311 23312 23313 23314 23315 23316 23317 23318 23319 23320 23321 23322 23323 23324 23325 23326 23327 23328 23329 23330 23331 23332 23333 23334 23335 23336 23337 23338 23339 23340 23341 23342 23343 23344 23345 23346 23347 23348 23349 23350 23351 23352 23353 23354 23355 23356 23357 23358 23359 23360 23361 23362 23363 23364 23365 23366 23367 23368 23369 23370 23371 23372 23373 23374 23375 23376 23377 23378 23379 23380 23381 23382 23383 23384 23385 23386 23387 23388 23389 23390 23391 23392 23393 23394 23395 23396 23397 23398 23399 23400 23401 23402 23403 23404 23405 23406 23407 23408 23409 23410 23411 23412 23413 23414 23415 23416 23417 23418 23419 23420 23421 23422 23423 23424 23425 23426 23427 23428 23429 23430 23431 23432 23433 23434 23435 23436 23437 23438 23439 23440 23441 23442 23443 23444 23445 23446 23447 23448 23449 23450 23451 23452 23453 23454 23455 23456 23457 23458 23459 23460 23461 23462 23463 23464 23465 23466 23467 23468 23469 23470 23471 23472 23473 23474 23475 23476 23477 23478 23479 23480 23481 23482 23483 23484 23485 23486 23487 23488 23489 23490 23491 23492 23493 23494 23495 23496 23497 23498 23499 23500 23501 23502 23503 23504 23505 23506 23507 23508 23509 23510 23511 23512 23513 23514 23515 23516 23517 23518 23519 23520 23521 23522 23523 23524 23525 23526 23527 23528 23529 23530 23531 23532 23533 23534 23535 23536 23537 23538 23539 23540 23541 23542 23543 23544 23545 23546 23547 23548 23549 23550 23551 23552 23553 23554 23555 23556 23557 23558 23559 23560 23561 23562 23563 23564 23565 23566 23567 23568 23569 23570 23571 23572 23573 23574 23575 23576 23577 23578 23579 23580 23581 23582 23583 23584 23585 23586 23587 23588 23589 23590 23591 23592 23593 23594 23595 23596 23597 23598 23599 23600 23601 23602 23603 23604 23605 23606 23607 23608 23609 23610 23611 23612 23613 23614 23615 23616 23617 23618 23619 23620 23621 23622 23623 23624 23625 23626 23627 23628 23629 23630 23631 23632 23633 23634 23635 23636 23637 23638 23639 23640 23641 23642 23643 23644 23645 23646 23647 23648 23649 23650 23651 23652 23653 23654 23655 23656 23657 23658 23659 23660 23661 23662 23663 23664 23665 23666 23667 23668 23669 23670 23671 23672 23673 23674 23675 23676 23677 23678 23679 23680 23681 23682 23683 23684 23685 23686 23687 23688 23689 23690 23691 23692 23693 23694 23695 23696 23697 23698 23699 23700 23701 23702 23703 23704 23705 23706 23707 23708 23709 23710 23711 23712 23713 23714 23715 23716 23717 23718 23719 23720 23721 23722 23723 23724 23725 23726 23727 23728 23729 23730 23731 23732 23733 23734 23735 23736 23737 23738 23739 23740 23741 23742 23743 23744 23745 23746 23747 23748 23749 23750 23751 23752 23753 23754 23755 23756 23757 23758 23759 23760 23761 23762 23763 23764 23765 23766 23767 23768 23769 23770 23771 23772 23773 23774 23775 23776 23777 23778 23779 23780 23781 23782 23783 23784 23785 23786 23787 23788 23789 23790 23791 23792 23793 23794 23795 23796 23797 23798 23799 23800 23801 23802 23803 23804 23805 23806 23807 23808 23809 23810 23811 23812 23813 23814 23815 23816 23817 23818 23819 23820 23821 23822 23823 23824 23825 23826 23827 23828 23829 23830 23831 23832 23833 23834 23835 23836 23837 23838 23839 23840 23841 23842 23843 23844 23845 23846 23847 23848 23849 23850 23851 23852 23853 23854 23855 23856 23857 23858 23859 23860 23861 23862 23863 23864 23865 23866 23867 23868 23869 23870 23871 23872 23873 23874 23875 23876 23877 23878 23879 23880 23881 23882 23883 23884 23885 23886 23887 23888 23889 23890 23891 23892 23893 23894 23895 23896 23897 23898 23899 23900 23901 23902 23903 23904 23905 23906 23907 23908 23909 23910 23911 23912 23913 23914 23915 23916 23917 23918 23919 23920 23921 23922 23923 23924 23925 23926 23927 23928 23929 23930 23931 23932 23933 23934 23935 23936 23937 23938 23939 23940 23941 23942 23943 23944 23945 23946 23947 23948 23949 23950 23951 23952 23953 23954 23955 23956 23957 23958 23959 23960 23961 23962 23963 23964 23965 23966 23967 23968 23969 23970 23971 23972 23973 23974 23975 23976 23977 23978 23979 23980 23981 23982 23983 23984 23985 23986 23987 23988 23989 23990 23991 23992 23993 23994 23995 23996 23997 23998 23999 24000 24001 24002 24003 24004 24005 24006 24007 24008 24009 24010 24011 24012 24013 24014 24015 24016 24017 24018 24019 24020 24021 24022 24023 24024 24025 24026 24027 24028 24029 24030 24031 24032 24033 24034 24035 24036 24037 24038 24039 24040 24041 24042 24043 24044 24045 24046 24047 24048 24049 24050 24051 24052 24053 24054 24055 24056 24057 24058 24059 24060 24061 24062 24063 24064 24065 24066 24067 24068 24069 24070 24071 24072 24073 24074 24075 24076 24077 24078 24079 24080 24081 24082 24083 24084 24085 24086 24087 24088 24089 24090 24091 24092 24093 24094 24095 24096 24097 24098 24099 24100 24101 24102 24103 24104 24105 24106 24107 24108 24109 24110 24111 24112 24113 24114 24115 24116 24117 24118 24119 24120 24121 24122 24123 24124 24125 24126 24127 24128 24129 24130 24131 24132 24133 24134 24135 24136 24137 24138 24139 24140 24141 24142 24143 24144 24145 24146 24147 24148 24149 24150 24151 24152 24153 24154 24155 24156 24157 24158 24159 24160 24161 24162 24163 24164 24165 24166 24167 24168 24169 24170 24171 24172 24173 24174 24175 24176 24177 24178 24179 24180 24181 24182 24183 24184 24185 24186 24187 24188 24189 24190 24191 24192 24193 24194 24195 24196 24197 24198 24199 24200 24201 24202 24203 24204 24205 24206 24207 24208 24209 24210 24211 24212 24213 24214 24215 24216 24217 24218 24219 24220 24221 24222 24223 24224 24225 24226 24227 24228 24229 24230 24231 24232 24233 24234 24235 24236 24237 24238 24239 24240 24241 24242 24243 24244 24245 24246 24247 24248 24249 24250 24251 24252 24253 24254 24255 24256 24257 24258 24259 24260 24261 24262 24263 24264 24265 24266 24267 24268 24269 24270 24271 24272 24273 24274 24275 24276 24277 24278 24279 24280 24281 24282 24283 24284 24285 24286 24287 24288 24289 24290 24291 24292 24293 24294 24295 24296 24297 24298 24299 24300 24301 24302 24303 24304 24305 24306 24307 24308 24309 24310 24311 24312 24313 24314 24315 24316 24317 24318 24319 24320 24321 24322 24323 24324 24325 24326 24327 24328 24329 24330 24331 24332 24333 24334 24335 24336 24337 24338 24339 24340 24341 24342 24343 24344 24345 24346 24347 24348 24349 24350 24351 24352 24353 24354 24355 24356 24357 24358 24359 24360 24361 24362 24363 24364 24365 24366 24367 24368 24369 24370 24371 24372 24373 24374 24375 24376 24377 24378 24379 24380 24381 24382 24383 24384 24385 24386 24387 24388 24389 24390 24391 24392 24393 24394 24395 24396 24397 24398 24399 24400 24401 24402 24403 24404 24405 24406 24407 24408 24409 24410 24411 24412 24413 24414 24415 24416 24417 24418 24419 24420 24421 24422 24423 24424 24425 24426 24427 24428 24429 24430 24431 24432 24433 24434 24435 24436 24437 24438 24439 24440 24441 24442 24443 24444 24445 24446 24447 24448 24449 24450 24451 24452 24453 24454 24455 24456 24457 24458 24459 24460 24461 24462 24463 24464 24465 24466 24467 24468 24469 24470 24471 24472 24473 24474 24475 24476 24477 24478 24479 24480 24481 24482 24483 24484 24485 24486 24487 24488 24489 24490 24491 24492 24493 24494 24495 24496 24497 24498 24499 24500 24501 24502 24503 24504 24505 24506 24507 24508 24509 24510 24511 24512 24513 24514 24515 24516 24517 24518 24519 24520 24521 24522 24523 24524 24525 24526 24527 24528 24529 24530 24531 24532 24533 24534 24535 24536 24537 24538 24539 24540 24541 24542 24543 24544 24545 24546 24547 24548 24549 24550 24551 24552 24553 24554 24555 24556 24557 24558 24559 24560 24561 24562 24563 24564 24565 24566 24567 24568 24569 24570 24571 24572 24573 24574 24575 24576 24577 24578 24579 24580 24581 24582 24583 24584 24585 24586 24587 24588 24589 24590 24591 24592 24593 24594 24595 24596 24597 24598 24599 24600 24601 24602 24603 24604 24605 24606 24607 24608 24609 24610 24611 24612 24613 24614 24615 24616 24617 24618 24619 24620 24621 24622 24623 24624 24625 24626 24627 24628 24629 24630 24631 24632 24633 24634 24635 24636 24637 24638 24639 24640 24641 24642 24643 24644 24645 24646 24647 24648 24649 24650 24651 24652 24653 24654 24655 24656 24657 24658 24659 24660 24661 24662 24663 24664 24665 24666 24667 24668 24669 24670 24671 24672 24673 24674 24675 24676 24677 24678 24679 24680 24681 24682 24683 24684 24685 24686 24687 24688 24689 24690 24691 24692 24693 24694 24695 24696 24697 24698 24699 24700 24701 24702 24703 24704 24705 24706 24707 24708 24709 24710 24711 24712 24713 24714 24715 24716 24717 24718 24719 24720 24721 24722 24723 24724 24725 24726 24727 24728 24729 24730 24731 24732 24733 24734 24735 24736 24737 24738 24739 24740 24741 24742 24743 24744 24745 24746 24747 24748 24749 24750 24751 24752 24753 24754 24755 24756 24757 24758 24759 24760 24761 24762 24763 24764 24765 24766 24767 24768 24769 24770 24771 24772 24773 24774 24775 24776 24777 24778 24779 24780 24781 24782 24783 24784 24785 24786 24787 24788 24789 24790 24791 24792 24793 24794 24795 24796 24797 24798 24799 24800 24801 24802 24803 24804 24805 24806 24807 24808 24809 24810 24811 24812 24813 24814 24815 24816 24817 24818 24819 24820 24821 24822 24823 24824 24825 24826 24827 24828 24829 24830 24831 24832 24833 24834 24835 24836 24837 24838 24839 24840 24841 24842 24843 24844 24845 24846 24847 24848 24849 24850 24851 24852 24853 24854 24855 24856 24857 24858 24859 24860 24861 24862 24863 24864 24865 24866 24867 24868 24869 24870 24871 24872 24873 24874 24875 24876 24877 24878 24879 24880 24881 24882 24883 24884 24885 24886 24887 24888 24889 24890 24891 24892 24893 24894 24895 24896 24897 24898 24899 24900 24901 24902 24903 24904 24905 24906 24907 24908 24909 24910 24911 24912 24913 24914 24915 24916 24917 24918 24919 24920 24921 24922 24923 24924 24925 24926 24927 24928 24929 24930 24931 24932 24933 24934 24935 24936 24937 24938 24939 24940 24941 24942 24943 24944 24945 24946 24947 24948 24949 24950 24951 24952 24953 24954 24955 24956 24957 24958 24959 24960 24961 24962 24963 24964 24965 24966 24967 24968 24969 24970 24971 24972 24973 24974 24975 24976 24977 24978 24979 24980 24981 24982 24983 24984 24985 24986 24987 24988 24989 24990 24991 24992 24993 24994 24995 24996 24997 24998 24999 25000 25001 25002 25003 25004 25005 25006 25007 25008 25009 25010 25011 25012 25013 25014 25015 25016 25017 25018 25019 25020 25021 25022 25023 25024 25025 25026 25027 25028 25029 25030 25031 25032 25033 25034 25035 25036 25037 25038 25039 25040 25041 25042 25043 25044 25045 25046 25047 25048 25049 25050 25051 25052 25053 25054 25055 25056 25057 25058 25059 25060 25061 25062 25063 25064 25065 25066 25067 25068 25069 25070 25071 25072 25073 25074 25075 25076 25077 25078 25079 25080 25081 25082 25083 25084 25085 25086 25087 25088 25089 25090 25091 25092 25093 25094 25095 25096 25097 25098 25099 25100 25101 25102 25103 25104 25105 25106 25107 25108 25109 25110 25111 25112 25113 25114 25115 25116 25117 25118 25119 25120 25121 25122 25123 25124 25125 25126 25127 25128 25129 25130 25131 25132 25133 25134 25135 25136 25137 25138 25139 25140 25141 25142 25143 25144 25145 25146 25147 25148 25149 25150 25151 25152 25153 25154 25155 25156 25157 25158 25159 25160 25161 25162 25163 25164 25165 25166 25167 25168 25169 25170 25171 25172 25173 25174 25175 25176 25177 25178 25179 25180 25181 25182 25183 25184 25185 25186 25187 25188 25189 25190 25191 25192 25193 25194 25195 25196 25197 25198 25199 25200 25201 25202 25203 25204 25205 25206 25207 25208 25209 25210 25211 25212 25213 25214 25215 25216 25217 25218 25219 25220 25221 25222 25223 25224 25225 25226 25227 25228 25229 25230 25231 25232 25233 25234 25235 25236 25237 25238 25239 25240 25241 25242 25243 25244 25245 25246 25247 25248 25249 25250 25251 25252 25253 25254 25255 25256 25257 25258 25259 25260 25261 25262 25263 25264 25265 25266 25267 25268 25269 25270 25271 25272 25273 25274 25275 25276 25277 25278 25279 25280 25281 25282 25283 25284 25285 25286 25287 25288 25289 25290 25291 25292 25293 25294 25295 25296 25297 25298 25299 25300 25301 25302 25303 25304 25305 25306 25307 25308 25309 25310 25311 25312 25313 25314 25315 25316 25317 25318 25319 25320 25321 25322 25323 25324 25325 25326 25327 25328 25329 25330 25331 25332 25333 25334 25335 25336 25337 25338 25339 25340 25341 25342 25343 25344 25345 25346 25347 25348 25349 25350 25351 25352 25353 25354 25355 25356 25357 25358 25359 25360 25361 25362 25363 25364 25365 25366 25367 25368 25369 25370 25371 25372 25373 25374 25375 25376 25377 25378 25379 25380 25381 25382 25383 25384 25385 25386 25387 25388 25389 25390 25391 25392 25393 25394 25395 25396 25397 25398 25399 25400 25401 25402 25403 25404 25405 25406 25407 25408 25409 25410 25411 25412 25413 25414 25415 25416 25417 25418 25419 25420 25421 25422 25423 25424 25425 25426 25427 25428 25429 25430 25431 25432 25433 25434 25435 25436 25437 25438 25439 25440 25441 25442 25443 25444 25445 25446 25447 25448 25449 25450 25451 25452 25453 25454 25455 25456 25457 25458 25459 25460 25461 25462 25463 25464 25465 25466 25467 25468 25469 25470 25471 25472 25473 25474 25475 25476 25477 25478 25479 25480 25481 25482 25483 25484 25485 25486 25487 25488 25489 25490 25491 25492 25493 25494 25495 25496 25497 25498 25499 25500 25501 25502 25503 25504 25505 25506 25507 25508 25509 25510 25511 25512 25513 25514 25515 25516 25517 25518 25519 25520 25521 25522 25523 25524 25525 25526 25527 25528 25529 25530 25531 25532 25533 25534 25535 25536 25537 25538 25539 25540 25541 25542 25543 25544 25545 25546 25547 25548 25549 25550 25551 25552 25553 25554 25555 25556 25557 25558 25559 25560 25561 25562 25563 25564 25565 25566 25567 25568 25569 25570 25571 25572 25573 25574 25575 25576 25577 25578 25579 25580 25581 25582 25583 25584 25585 25586 25587 25588 25589 25590 25591 25592 25593 25594 25595 25596 25597 25598 25599 25600 25601 25602 25603 25604 25605 25606 25607 25608 25609 25610 25611 25612 25613 25614 25615 25616 25617 25618 25619 25620 25621 25622 25623 25624 25625 25626 25627 25628 25629 25630 25631 25632 25633 25634 25635 25636 25637 25638 25639 25640 25641 25642 25643 25644 25645 25646 25647 25648 25649 25650 25651 25652 25653 25654 25655 25656 25657 25658 25659 25660 25661 25662 25663 25664 25665 25666 25667 25668 25669 25670 25671 25672 25673 25674 25675 25676 25677 25678 25679 25680 25681 25682 25683 25684 25685 25686 25687 25688 25689 25690 25691 25692 25693 25694 25695 25696 25697 25698 25699 25700 25701 25702 25703 25704 25705 25706 25707 25708 25709 25710 25711 25712 25713 25714 25715 25716 25717 25718 25719 25720 25721 25722 25723 25724 25725 25726 25727 25728 25729 25730 25731 25732 25733 25734 25735 25736 25737 25738 25739 25740 25741 25742 25743 25744 25745 25746 25747 25748 25749 25750 25751 25752 25753 25754 25755 25756 25757 25758 25759 25760 25761 25762 25763 25764 25765 25766 25767 25768 25769 25770 25771 25772 25773 25774 25775 25776 25777 25778 25779 25780 25781 25782 25783 25784 25785 25786 25787 25788 25789 25790 25791 25792 25793 25794 25795 25796 25797 25798 25799 25800 25801 25802 25803 25804 25805 25806 25807 25808 25809 25810 25811 25812 25813 25814 25815 25816 25817 25818 25819 25820 25821 25822 25823 25824 25825 25826 25827 25828 25829 25830 25831 25832 25833 25834 25835 25836 25837 25838 25839 25840 25841 25842 25843 25844 25845 25846 25847 25848 25849 25850 25851 25852 25853 25854 25855 25856 25857 25858 25859 25860 25861 25862 25863 25864 25865 25866 25867 25868 25869 25870 25871 25872 25873 25874 25875 25876 25877 25878 25879 25880 25881 25882 25883 25884 25885 25886 25887 25888 25889 25890 25891 25892 25893 25894 25895 25896 25897 25898 25899 25900 25901 25902 25903 25904 25905 25906 25907 25908 25909 25910 25911 25912 25913 25914 25915 25916 25917 25918 25919 25920 25921 25922 25923 25924 25925 25926 25927 25928 25929 25930 25931 25932 25933 25934 25935 25936 25937 25938 25939 25940 25941 25942 25943 25944 25945 25946 25947 25948 25949 25950 25951 25952 25953 25954 25955 25956 25957 25958 25959 25960 25961 25962 25963 25964 25965 25966 25967 25968 25969 25970 25971 25972 25973 25974 25975 25976 25977 25978 25979 25980 25981 25982 25983 25984 25985 25986 25987 25988 25989 25990 25991 25992 25993 25994 25995 25996 25997 25998 25999 26000 26001 26002 26003 26004 26005 26006 26007 26008 26009 26010 26011 26012 26013 26014 26015 26016 26017 26018 26019 26020 26021 26022 26023 26024 26025 26026 26027 26028 26029 26030 26031 26032 26033 26034 26035 26036 26037 26038 26039 26040 26041 26042 26043 26044 26045 26046 26047 26048 26049 26050 26051 26052 26053 26054 26055 26056 26057 26058 26059 26060 26061 26062 26063 26064 26065 26066 26067 26068 26069 26070 26071 26072 26073 26074 26075 26076 26077 26078 26079 26080 26081 26082 26083 26084 26085 26086 26087 26088 26089 26090 26091 26092 26093 26094 26095 26096 26097 26098 26099 26100 26101 26102 26103 26104 26105 26106 26107 26108 26109 26110 26111 26112 26113 26114 26115 26116 26117 26118 26119 26120 26121 26122 26123 26124 26125 26126 26127 26128 26129 26130 26131 26132 26133 26134 26135 26136 26137 26138 26139 26140 26141 26142 26143 26144 26145 26146 26147 26148 26149 26150 26151 26152 26153 26154 26155 26156 26157 26158 26159 26160 26161 26162 26163 26164 26165 26166 26167 26168 26169 26170 26171 26172 26173 26174 26175 26176 26177 26178 26179 26180 26181 26182 26183 26184 26185 26186 26187 26188 26189 26190 26191 26192 26193 26194 26195 26196 26197 26198 26199 26200 26201 26202 26203 26204 26205 26206 26207 26208 26209 26210 26211 26212 26213 26214 26215 26216 26217 26218 26219 26220 26221 26222 26223 26224 26225 26226 26227 26228 26229 26230 26231 26232 26233 26234 26235 26236 26237 26238 26239 26240 26241 26242 26243 26244 26245 26246 26247 26248 26249 26250 26251 26252 26253 26254 26255 26256 26257 26258 26259 26260 26261 26262 26263 26264 26265 26266 26267 26268 26269 26270 26271 26272 26273 26274 26275 26276 26277 26278 26279 26280 26281 26282 26283 26284 26285 26286 26287 26288 26289 26290 26291 26292 26293 26294 26295 26296 26297 26298 26299 26300 26301 26302 26303 26304 26305 26306 26307 26308 26309 26310 26311 26312 26313 26314 26315 26316 26317 26318 26319 26320 26321 26322 26323 26324 26325 26326 26327 26328 26329 26330 26331 26332 26333 26334 26335 26336 26337 26338 26339 26340 26341 26342 26343 26344 26345 26346 26347 26348 26349 26350 26351 26352 26353 26354 26355 26356 26357 26358 26359 26360 26361 26362 26363 26364 26365 26366 26367 26368 26369 26370 26371 26372 26373 26374 26375 26376 26377 26378 26379 26380 26381 26382 26383 26384 26385 26386 26387 26388 26389 26390 26391 26392 26393 26394 26395 26396 26397 26398 26399 26400 26401 26402 26403 26404 26405 26406 26407 26408 26409 26410 26411 26412 26413 26414 26415 26416 26417 26418 26419 26420 26421 26422 26423 26424 26425 26426 26427 26428 26429 26430 26431 26432 26433 26434 26435 26436 26437 26438 26439 26440 26441 26442 26443 26444 26445 26446 26447 26448 26449 26450 26451 26452 26453 26454 26455 26456 26457 26458 26459 26460 26461 26462 26463 26464 26465 26466 26467 26468 26469 26470 26471 26472 26473 26474 26475 26476 26477 26478 26479 26480 26481 26482 26483 26484 26485 26486 26487 26488 26489 26490 26491 26492 26493 26494 26495 26496 26497 26498 26499 26500 26501 26502 26503 26504 26505 26506 26507 26508 26509 26510 26511 26512 26513 26514 26515 26516 26517 26518 26519 26520 26521 26522 26523 26524 26525 26526 26527 26528 26529 26530 26531 26532 26533 26534 26535 26536 26537 26538 26539 26540 26541 26542 26543 26544 26545 26546 26547 26548 26549 26550 26551 26552 26553 26554 26555 26556 26557 26558 26559 26560 26561 26562 26563 26564 26565 26566 26567 26568 26569 26570 26571 26572 26573 26574 26575 26576 26577 26578 26579 26580 26581 26582 26583 26584 26585 26586 26587 26588 26589 26590 26591 26592 26593 26594 26595 26596 26597 26598 26599 26600 26601 26602 26603 26604 26605 26606 26607 26608 26609 26610 26611 26612 26613 26614 26615 26616 26617 26618 26619 26620 26621 26622 26623 26624 26625 26626 26627 26628 26629 26630 26631 26632 26633 26634 26635 26636 26637 26638 26639 26640 26641 26642 26643 26644 26645 26646 26647 26648 26649 26650 26651 26652 26653 26654 26655 26656 26657 26658 26659 26660 26661 26662 26663 26664 26665 26666 26667 26668 26669 26670 26671 26672 26673 26674 26675 26676 26677 26678 26679 26680 26681 26682 26683 26684 26685 26686 26687 26688 26689 26690 26691 26692 26693 26694 26695 26696 26697 26698 26699 26700 26701 26702 26703 26704 26705 26706 26707 26708 26709 26710 26711 26712 26713 26714 26715 26716 26717 26718 26719 26720 26721 26722 26723 26724 26725 26726 26727 26728 26729 26730 26731 26732 26733 26734 26735 26736 26737 26738 26739 26740 26741 26742 26743 26744 26745 26746 26747 26748 26749 26750 26751 26752 26753 26754 26755 26756 26757 26758 26759 26760 26761 26762 26763 26764 26765 26766 26767 26768 26769 26770 26771 26772 26773 26774 26775 26776 26777 26778 26779 26780 26781 26782 26783 26784 26785 26786 26787 26788 26789 26790 26791 26792 26793 26794 26795 26796 26797 26798 26799 26800 26801 26802 26803 26804 26805 26806 26807 26808 26809 26810 26811 26812 26813 26814 26815 26816 26817 26818 26819 26820 26821 26822 26823 26824 26825 26826 26827 26828 26829 26830 26831 26832 26833 26834 26835 26836 26837 26838 26839 26840 26841 26842 26843 26844 26845 26846 26847 26848 26849 26850 26851 26852 26853 26854 26855 26856 26857 26858 26859 26860 26861 26862 26863 26864 26865 26866 26867 26868 26869 26870 26871 26872 26873 26874 26875 26876 26877 26878 26879 26880 26881 26882 26883 26884 26885 26886 26887 26888 26889 26890 26891 26892 26893 26894 26895 26896 26897 26898 26899 26900 26901 26902 26903 26904 26905 26906 26907 26908 26909 26910 26911 26912 26913 26914 26915 26916 26917 26918 26919 26920 26921 26922 26923 26924 26925 26926 26927 26928 26929 26930 26931 26932 26933 26934 26935 26936 26937 26938 26939 26940 26941 26942 26943 26944 26945 26946 26947 26948 26949 26950 26951 26952 26953 26954 26955 26956 26957 26958 26959 26960 26961 26962 26963 26964 26965 26966 26967 26968 26969 26970 26971 26972 26973 26974 26975 26976 26977 26978 26979 26980 26981 26982 26983 26984 26985 26986 26987 26988 26989 26990 26991 26992 26993 26994 26995 26996 26997 26998 26999 27000 27001 27002 27003 27004 27005 27006 27007 27008 27009 27010 27011 27012 27013 27014 27015 27016 27017 27018 27019 27020 27021 27022 27023 27024 27025 27026 27027 27028 27029 27030 27031 27032 27033 27034 27035 27036 27037 27038 27039 27040 27041 27042 27043 27044 27045 27046 27047 27048 27049 27050 27051 27052 27053 27054 27055 27056 27057 27058 27059 27060 27061 27062 27063 27064 27065 27066 27067 27068 27069 27070 27071 27072 27073 27074 27075 27076 27077 27078 27079 27080 27081 27082 27083 27084 27085 27086 27087 27088 27089 27090 27091 27092 27093 27094 27095 27096 27097 27098 27099 27100 27101 27102 27103 27104 27105 27106 27107 27108 27109 27110 27111 27112 27113 27114 27115 27116 27117 27118 27119 27120 27121 27122 27123 27124 27125 27126 27127 27128 27129 27130 27131 27132 27133 27134 27135 27136 27137 27138 27139 27140 27141 27142 27143 27144 27145 27146 27147 27148 27149 27150 27151 27152 27153 27154 27155 27156 27157 27158 27159 27160 27161 27162 27163 27164 27165 27166 27167 27168 27169 27170 27171 27172 27173 27174 27175 27176 27177 27178 27179 27180 27181 27182 27183 27184 27185 27186 27187 27188 27189 27190 27191 27192 27193 27194 27195 27196 27197 27198 27199 27200 27201 27202 27203 27204 27205 27206 27207 27208 27209 27210 27211 27212 27213 27214 27215 27216 27217 27218 27219 27220 27221 27222 27223 27224 27225 27226 27227 27228 27229 27230 27231 27232 27233 27234 27235 27236 27237 27238 27239 27240 27241 27242 27243 27244 27245 27246 27247 27248 27249 27250 27251 27252 27253 27254 27255 27256 27257 27258 27259 27260 27261 27262 27263 27264 27265 27266 27267 27268 27269 27270 27271 27272 27273 27274 27275 27276 27277 27278 27279 27280 27281 27282 27283 27284 27285 27286 27287 27288 27289 27290 27291 27292 27293 27294 27295 27296 27297 27298 27299 27300 27301 27302 27303 27304 27305 27306 27307 27308 27309 27310 27311 27312 27313 27314 27315 27316 27317 27318 27319 27320 27321 27322 27323 27324 27325 27326 27327 27328 27329 27330 27331 27332 27333 27334 27335 27336 27337 27338 27339 27340 27341 27342 27343 27344 27345 27346 27347 27348 27349 27350 27351 27352 27353 27354 27355 27356 27357 27358 27359 27360 27361 27362 27363 27364 27365 27366 27367 27368 27369 27370 27371 27372 27373 27374 27375 27376 27377 27378 27379 27380 27381 27382 27383 27384 27385 27386 27387 27388 27389 27390 27391 27392 27393 27394 27395 27396 27397 27398 27399 27400 27401 27402 27403 27404 27405 27406 27407 27408 27409 27410 27411 27412 27413 27414 27415 27416 27417 27418 27419 27420 27421 27422 27423 27424 27425 27426 27427 27428 27429 27430 27431 27432 27433 27434 27435 27436 27437 27438 27439 27440 27441 27442 27443 27444 27445 27446 27447 27448 27449 27450 27451 27452 27453 27454 27455 27456 27457 27458 27459 27460 27461 27462 27463 27464 27465 27466 27467 27468 27469 27470 27471 27472 27473 27474 27475 27476 27477 27478 27479 27480 27481 27482 27483 27484 27485 27486 27487 27488 27489 27490 27491 27492 27493 27494 27495 27496 27497 27498 27499 27500 27501 27502 27503 27504 27505 27506 27507 27508 27509 27510 27511 27512 27513 27514 27515 27516 27517 27518 27519 27520 27521 27522 27523 27524 27525 27526 27527 27528 27529 27530 27531 27532 27533 27534 27535 27536 27537 27538 27539 27540 27541 27542 27543 27544 27545 27546 27547 27548 27549 27550 27551 27552 27553 27554 27555 27556 27557 27558 27559 27560 27561 27562 27563 27564 27565 27566 27567 27568 27569 27570 27571 27572 27573 27574 27575 27576 27577 27578 27579 27580 27581 27582 27583 27584 27585 27586 27587 27588 27589 27590 27591 27592 27593 27594 27595 27596 27597 27598 27599 27600 27601 27602 27603 27604 27605 27606 27607 27608 27609 27610 27611 27612 27613 27614 27615 27616 27617 27618 27619 27620 27621 27622 27623 27624 27625 27626 27627 27628 27629 27630 27631 27632 27633 27634 27635 27636 27637 27638 27639 27640 27641 27642 27643 27644 27645 27646 27647 27648 27649 27650 27651 27652 27653 27654 27655 27656 27657 27658 27659 27660 27661 27662 27663 27664 27665 27666 27667 27668 27669 27670 27671 27672 27673 27674 27675 27676 27677 27678 27679 27680 27681 27682 27683 27684 27685 27686 27687 27688 27689 27690 27691 27692 27693 27694 27695 27696 27697 27698 27699 27700 27701 27702 27703 27704 27705 27706 27707 27708 27709 27710 27711 27712 27713 27714 27715 27716 27717 27718 27719 27720 27721 27722 27723 27724 27725 27726 27727 27728 27729 27730 27731 27732 27733 27734 27735 27736 27737 27738 27739 27740 27741 27742 27743 27744 27745 27746 27747 27748 27749 27750 27751 27752 27753 27754 27755 27756 27757 27758 27759 27760 27761 27762 27763 27764 27765 27766 27767 27768 27769 27770 27771 27772 27773 27774 27775 27776 27777 27778 27779 27780 27781 27782 27783 27784 27785 27786 27787 27788 27789 27790 27791 27792 27793 27794 27795 27796 27797 27798 27799 27800 27801 27802 27803 27804 27805 27806 27807 27808 27809 27810 27811 27812 27813 27814 27815 27816 27817 27818 27819 27820 27821 27822 27823 27824 27825 27826 27827 27828 27829 27830 27831 27832 27833 27834 27835 27836 27837 27838 27839 27840 27841 27842 27843 27844 27845 27846 27847 27848 27849 27850 27851 27852 27853 27854 27855 27856 27857 27858 27859 27860 27861 27862 27863 27864 27865 27866 27867 27868 27869 27870 27871 27872 27873 27874 27875 27876 27877 27878 27879 27880 27881 27882 27883 27884 27885 27886 27887 27888 27889 27890 27891 27892 27893 27894 27895 27896 27897 27898 27899 27900 27901 27902 27903 27904 27905 27906 27907 27908 27909 27910 27911 27912 27913 27914 27915 27916 27917 27918 27919 27920 27921 27922 27923 27924 27925 27926 27927 27928 27929 27930 27931 27932 27933 27934 27935 27936 27937 27938 27939 27940 27941 27942 27943 27944 27945 27946 27947 27948 27949 27950 27951 27952 27953 27954 27955 27956 27957 27958 27959 27960 27961 27962 27963 27964 27965 27966 27967 27968 27969 27970 27971 27972 27973 27974 27975 27976 27977 27978 27979 27980 27981 27982 27983 27984 27985 27986 27987 27988 27989 27990 27991 27992 27993 27994 27995 27996 27997 27998 27999 28000 28001 28002 28003 28004 28005 28006 28007 28008 28009 28010 28011 28012 28013 28014 28015 28016 28017 28018 28019 28020 28021 28022 28023 28024 28025 28026 28027 28028 28029 28030 28031 28032 28033 28034 28035 28036 28037 28038 28039 28040 28041 28042 28043 28044 28045 28046 28047 28048 28049 28050 28051 28052 28053 28054 28055 28056 28057 28058 28059 28060 28061 28062 28063 28064 28065 28066 28067 28068 28069 28070 28071 28072 28073 28074 28075 28076 28077 28078 28079 28080 28081 28082 28083 28084 28085 28086 28087 28088 28089 28090 28091 28092 28093 28094 28095 28096 28097 28098 28099 28100 28101 28102 28103 28104 28105 28106 28107 28108 28109 28110 28111 28112 28113 28114 28115 28116 28117 28118 28119 28120 28121 28122 28123 28124 28125 28126 28127 28128 28129 28130 28131 28132 28133 28134 28135 28136 28137 28138 28139 28140 28141 28142 28143 28144 28145 28146 28147 28148 28149 28150 28151 28152 28153 28154 28155 28156 28157 28158 28159 28160 28161 28162 28163 28164 28165 28166 28167 28168 28169 28170 28171 28172 28173 28174 28175 28176 28177 28178 28179 28180 28181 28182 28183 28184 28185 28186 28187 28188 28189 28190 28191 28192 28193 28194 28195 28196 28197 28198 28199 28200 28201 28202 28203 28204 28205 28206 28207 28208 28209 28210 28211 28212 28213 28214 28215 28216 28217 28218 28219 28220 28221 28222 28223 28224 28225 28226 28227 28228 28229 28230 28231 28232 28233 28234 28235 28236 28237 28238 28239 28240 28241 28242 28243 28244 28245 28246 28247 28248 28249 28250 28251 28252 28253 28254 28255 28256 28257 28258 28259 28260 28261 28262 28263 28264 28265 28266 28267 28268 28269 28270 28271 28272 28273 28274 28275 28276 28277 28278 28279 28280 28281 28282 28283 28284 28285 28286 28287 28288 28289 28290 28291 28292 28293 28294 28295 28296 28297 28298 28299 28300 28301 28302 28303 28304 28305 28306 28307 28308 28309 28310 28311 28312 28313 28314 28315 28316 28317 28318 28319 28320 28321 28322 28323 28324 28325 28326 28327 28328 28329 28330 28331 28332 28333 28334 28335 28336 28337 28338 28339 28340 28341 28342 28343 28344 28345 28346 28347 28348 28349 28350 28351 28352 28353 28354 28355 28356 28357 28358 28359 28360 28361 28362 28363 28364 28365 28366 28367 28368 28369 28370 28371 28372 28373 28374 28375 28376 28377 28378 28379 28380 28381 28382 28383 28384 28385 28386 28387 28388 28389 28390 28391 28392 28393 28394 28395 28396 28397 28398 28399 28400 28401 28402 28403 28404 28405 28406 28407 28408 28409 28410 28411 28412 28413 28414 28415 28416 28417 28418 28419 28420 28421 28422 28423 28424 28425 28426 28427 28428 28429 28430 28431 28432 28433 28434 28435 28436 28437 28438 28439 28440 28441 28442 28443 28444 28445 28446 28447 28448 28449 28450 28451 28452 28453 28454 28455 28456 28457 28458 28459 28460 28461 28462 28463 28464 28465 28466 28467 28468 28469 28470 28471 28472 28473 28474 28475 28476 28477 28478 28479 28480 28481 28482 28483 28484 28485 28486 28487 28488 28489 28490 28491 28492 28493 28494 28495 28496 28497 28498 28499 28500 28501 28502 28503 28504 28505 28506 28507 28508 28509 28510 28511 28512 28513 28514 28515 28516 28517 28518 28519 28520 28521 28522 28523 28524 28525 28526 28527 28528 28529 28530 28531 28532 28533 28534 28535 28536 28537 28538 28539 28540 28541 28542 28543 28544 28545 28546 28547 28548 28549 28550 28551 28552 28553 28554 28555 28556 28557 28558 28559 28560 28561 28562 28563 28564 28565 28566 28567 28568 28569 28570 28571 28572 28573 28574 28575 28576 28577 28578 28579 28580 28581 28582 28583 28584 28585 28586 28587 28588 28589 28590 28591 28592 28593 28594 28595 28596 28597 28598 28599 28600 28601 28602 28603 28604 28605 28606 28607 28608 28609 28610 28611 28612 28613 28614 28615 28616 28617 28618 28619 28620 28621 28622 28623 28624 28625 28626 28627 28628 28629 28630 28631 28632 28633 28634 28635 28636 28637 28638 28639 28640 28641 28642 28643 28644 28645 28646 28647 28648 28649 28650 28651 28652 28653 28654 28655 28656 28657 28658 28659 28660 28661 28662 28663 28664 28665 28666 28667 28668 28669 28670 28671 28672 28673 28674 28675 28676 28677 28678 28679 28680 28681 28682 28683 28684 28685 28686 28687 28688 28689 28690 28691 28692 28693 28694 28695 28696 28697 28698 28699 28700 28701 28702 28703 28704 28705 28706 28707 28708 28709 28710 28711 28712 28713 28714 28715 28716 28717 28718 28719 28720 28721 28722 28723 28724 28725 28726 28727 28728 28729 28730 28731 28732 28733 28734 28735 28736 28737 28738 28739 28740 28741 28742 28743 28744 28745 28746 28747 28748 28749 28750 28751 28752 28753 28754 28755 28756 28757 28758 28759 28760 28761 28762 28763 28764 28765 28766 28767 28768 28769 28770 28771 28772 28773 28774 28775 28776 28777 28778 28779 28780 28781 28782 28783 28784 28785 28786 28787 28788 28789 28790 28791 28792 28793 28794 28795 28796 28797 28798 28799 28800 28801 28802 28803 28804 28805 28806 28807 28808 28809 28810 28811 28812 28813 28814 28815 28816 28817 28818 28819 28820 28821 28822 28823 28824 28825 28826 28827 28828 28829 28830 28831 28832 28833 28834 28835 28836 28837 28838 28839 28840 28841 28842 28843 28844 28845 28846 28847 28848 28849 28850 28851 28852 28853 28854 28855 28856 28857 28858 28859 28860 28861 28862 28863 28864 28865 28866 28867 28868 28869 28870 28871 28872 28873 28874 28875 28876 28877 28878 28879 28880 28881 28882 28883 28884 28885 28886 28887 28888 28889 28890 28891 28892 28893 28894 28895 28896 28897 28898 28899 28900 28901 28902 28903 28904 28905 28906 28907 28908 28909 28910 28911 28912 28913 28914 28915 28916 28917 28918 28919 28920 28921 28922 28923 28924 28925 28926 28927 28928 28929 28930 28931 28932 28933 28934 28935 28936 28937 28938 28939 28940 28941 28942 28943 28944 28945 28946 28947 28948 28949 28950 28951 28952 28953 28954 28955 28956 28957 28958 28959 28960 28961 28962 28963 28964 28965 28966 28967 28968 28969 28970 28971 28972 28973 28974 28975 28976 28977 28978 28979 28980 28981 28982 28983 28984 28985 28986 28987 28988 28989 28990 28991 28992 28993 28994 28995 28996 28997 28998 28999 29000 29001 29002 29003 29004 29005 29006 29007 29008 29009 29010 29011 29012 29013 29014 29015 29016 29017 29018 29019 29020 29021 29022 29023 29024 29025 29026 29027 29028 29029 29030 29031 29032 29033 29034 29035 29036 29037 29038 29039 29040 29041 29042 29043 29044 29045 29046 29047 29048 29049 29050 29051 29052 29053 29054 29055 29056 29057 29058 29059 29060 29061 29062 29063 29064 29065 29066 29067 29068 29069 29070 29071 29072 29073 29074 29075 29076 29077 29078 29079 29080 29081 29082 29083 29084 29085 29086 29087 29088 29089 29090 29091 29092 29093 29094 29095 29096 29097 29098 29099 29100 29101 29102 29103 29104 29105 29106 29107 29108 29109 29110 29111 29112 29113 29114 29115 29116 29117 29118 29119 29120 29121 29122 29123 29124 29125 29126 29127 29128 29129 29130 29131 29132 29133 29134 29135 29136 29137 29138 29139 29140 29141 29142 29143 29144 29145 29146 29147 29148 29149 29150 29151 29152 29153 29154 29155 29156 29157 29158 29159 29160 29161 29162 29163 29164 29165 29166 29167 29168 29169 29170 29171 29172 29173 29174 29175 29176 29177 29178 29179 29180 29181 29182 29183 29184 29185 29186 29187 29188 29189 29190 29191 29192 29193 29194 29195 29196 29197 29198 29199 29200 29201 29202 29203 29204 29205 29206 29207 29208 29209 29210 29211 29212 29213 29214 29215 29216 29217 29218 29219 29220 29221 29222 29223 29224 29225 29226 29227 29228 29229 29230 29231 29232 29233 29234 29235 29236 29237 29238 29239 29240 29241 29242 29243 29244 29245 29246 29247 29248 29249 29250 29251 29252 29253 29254 29255 29256 29257 29258 29259 29260 29261 29262 29263 29264 29265 29266 29267 29268 29269 29270 29271 29272 29273 29274 29275 29276 29277 29278 29279 29280 29281 29282 29283 29284 29285 29286 29287 29288 29289 29290 29291 29292 29293 29294 29295 29296 29297 29298 29299 29300 29301 29302 29303 29304 29305 29306 29307 29308 29309 29310 29311 29312 29313 29314 29315 29316 29317 29318 29319 29320 29321 29322 29323 29324 29325 29326 29327 29328 29329 29330 29331 29332 29333 29334 29335 29336 29337 29338 29339 29340 29341 29342 29343 29344 29345 29346 29347 29348 29349 29350 29351 29352 29353 29354 29355 29356 29357 29358 29359 29360 29361 29362 29363 29364 29365 29366 29367 29368 29369 29370 29371 29372 29373 29374 29375 29376 29377 29378 29379 29380 29381 29382 29383 29384 29385 29386 29387 29388 29389 29390 29391 29392 29393 29394 29395 29396 29397 29398 29399 29400 29401 29402 29403 29404 29405 29406 29407 29408 29409 29410 29411 29412 29413 29414 29415 29416 29417 29418 29419 29420 29421 29422 29423 29424 29425 29426 29427 29428 29429 29430 29431 29432 29433 29434 29435 29436 29437 29438 29439 29440 29441 29442 29443 29444 29445 29446 29447 29448 29449 29450 29451 29452 29453 29454 29455 29456 29457 29458 29459 29460 29461 29462 29463 29464 29465 29466 29467 29468 29469 29470 29471 29472 29473 29474 29475 29476 29477 29478 29479 29480 29481 29482 29483 29484 29485 29486 29487 29488 29489 29490 29491 29492 29493 29494 29495 29496 29497 29498 29499 29500 29501 29502 29503 29504 29505 29506 29507 29508 29509 29510 29511 29512 29513 29514 29515 29516 29517 29518 29519 29520 29521 29522 29523 29524 29525 29526 29527 29528 29529 29530 29531 29532 29533 29534 29535 29536 29537 29538 29539 29540 29541 29542 29543 29544 29545 29546 29547 29548 29549 29550 29551 29552 29553 29554 29555 29556 29557 29558 29559 29560 29561 29562 29563 29564 29565 29566 29567 29568 29569 29570 29571 29572 29573 29574 29575 29576 29577 29578 29579 29580 29581 29582 29583 29584 29585 29586 29587 29588 29589 29590 29591 29592 29593 29594 29595 29596 29597 29598 29599 29600 29601 29602 29603 29604 29605 29606 29607 29608 29609 29610 29611 29612 29613 29614 29615 29616 29617 29618 29619 29620 29621 29622 29623 29624 29625 29626 29627 29628 29629 29630 29631 29632 29633 29634 29635 29636 29637 29638 29639 29640 29641 29642 29643 29644 29645 29646 29647 29648 29649 29650 29651 29652 29653 29654 29655 29656 29657 29658 29659 29660 29661 29662 29663 29664 29665 29666 29667 29668 29669 29670 29671 29672 29673 29674 29675 29676 29677 29678 29679 29680 29681 29682 29683 29684 29685 29686 29687 29688 29689 29690 29691 29692 29693 29694 29695 29696 29697 29698 29699 29700 29701 29702 29703 29704 29705 29706 29707 29708 29709 29710 29711 29712 29713 29714 29715 29716 29717 29718 29719 29720 29721 29722 29723 29724 29725 29726 29727 29728 29729 29730 29731 29732 29733 29734 29735 29736 29737 29738 29739 29740 29741 29742 29743 29744 29745 29746 29747 29748 29749 29750 29751 29752 29753 29754 29755 29756 29757 29758 29759 29760 29761 29762 29763 29764 29765 29766 29767 29768 29769 29770 29771 29772 29773 29774 29775 29776 29777 29778 29779 29780 29781 29782 29783 29784 29785 29786 29787 29788 29789 29790 29791 29792 29793 29794 29795 29796 29797 29798 29799 29800 29801 29802 29803 29804 29805 29806 29807 29808 29809 29810 29811 29812 29813 29814 29815 29816 29817 29818 29819 29820 29821 29822 29823 29824 29825 29826 29827 29828 29829 29830 29831 29832 29833 29834 29835 29836 29837 29838 29839 29840 29841 29842 29843 29844 29845 29846 29847 29848 29849 29850 29851 29852 29853 29854 29855 29856 29857 29858 29859 29860 29861 29862 29863 29864 29865 29866 29867 29868 29869 29870 29871 29872 29873 29874 29875 29876 29877 29878 29879 29880 29881 29882 29883 29884 29885 29886 29887 29888 29889 29890 29891 29892 29893 29894 29895 29896 29897 29898 29899 29900 29901 29902 29903 29904 29905 29906 29907 29908 29909 29910 29911 29912 29913 29914 29915 29916 29917 29918 29919 29920 29921 29922 29923 29924 29925 29926 29927 29928 29929 29930 29931 29932 29933 29934 29935 29936 29937 29938 29939 29940 29941 29942 29943 29944 29945 29946 29947 29948 29949 29950 29951 29952 29953 29954 29955 29956 29957 29958 29959 29960 29961 29962 29963 29964 29965 29966 29967 29968 29969 29970 29971 29972 29973 29974 29975 29976 29977 29978 29979 29980 29981 29982 29983 29984 29985 29986 29987 29988 29989 29990 29991 29992 29993 29994 29995 29996 29997 29998 29999 30000 30001 30002 30003 30004 30005 30006 30007 30008 30009 30010 30011 30012 30013 30014 30015 30016 30017 30018 30019 30020 30021 30022 30023 30024 30025 30026 30027 30028 30029 30030 30031 30032 30033 30034 30035 30036 30037 30038 30039 30040 30041 30042 30043 30044 30045 30046 30047 30048 30049 30050 30051 30052 30053 30054 30055 30056 30057 30058 30059 30060 30061 30062 30063 30064 30065 30066 30067 30068 30069 30070 30071 30072 30073 30074 30075 30076 30077 30078 30079 30080 30081 30082 30083 30084 30085 30086 30087 30088 30089 30090 30091 30092 30093 30094 30095 30096 30097 30098 30099 30100 30101 30102 30103 30104 30105 30106 30107 30108 30109 30110 30111 30112 30113 30114 30115 30116 30117 30118 30119 30120 30121 30122 30123 30124 30125 30126 30127 30128 30129 30130 30131 30132 30133 30134 30135 30136 30137 30138 30139 30140 30141 30142 30143 30144 30145 30146 30147 30148 30149 30150 30151 30152 30153 30154 30155 30156 30157 30158 30159 30160 30161 30162 30163 30164 30165 30166 30167 30168 30169 30170 30171 30172 30173 30174 30175 30176 30177 30178 30179 30180 30181 30182 30183 30184 30185 30186 30187 30188 30189 30190 30191 30192 30193 30194 30195 30196 30197 30198 30199 30200 30201 30202 30203 30204 30205 30206 30207 30208 30209 30210 30211 30212 30213 30214 30215 30216 30217 30218 30219 30220 30221 30222 30223 30224 30225 30226 30227 30228 30229 30230 30231 30232 30233 30234 30235 30236 30237 30238 30239 30240 30241 30242 30243 30244 30245 30246 30247 30248 30249 30250 30251 30252 30253 30254 30255 30256 30257 30258 30259 30260 30261 30262 30263 30264 30265 30266 30267 30268 30269 30270 30271 30272 30273 30274 30275 30276 30277 30278 30279 30280 30281 30282 30283 30284 30285 30286 30287 30288 30289 30290 30291 30292 30293 30294 30295 30296 30297 30298 30299 30300 30301 30302 30303 30304 30305 30306 30307 30308 30309 30310 30311 30312 30313 30314 30315 30316 30317 30318 30319 30320 30321 30322 30323 30324 30325 30326 30327 30328 30329 30330 30331 30332 30333 30334 30335 30336 30337 30338 30339 30340 30341 30342 30343 30344 30345 30346 30347 30348 30349 30350 30351 30352 30353 30354 30355 30356 30357 30358 30359 30360 30361 30362 30363 30364 30365 30366 30367 30368 30369 30370 30371 30372 30373 30374 30375 30376 30377 30378 30379 30380 30381 30382 30383 30384 30385 30386 30387 30388 30389 30390 30391 30392 30393 30394 30395 30396 30397 30398 30399 30400 30401 30402 30403 30404 30405 30406 30407 30408 30409 30410 30411 30412 30413 30414 30415 30416 30417 30418 30419 30420 30421 30422 30423 30424 30425 30426 30427 30428 30429 30430 30431 30432 30433 30434 30435 30436 30437 30438 30439 30440 30441 30442 30443 30444 30445 30446 30447 30448 30449 30450 30451 30452 30453 30454 30455 30456 30457 30458 30459 30460 30461 30462 30463 30464 30465 30466 30467 30468 30469 30470 30471 30472 30473 30474 30475 30476 30477 30478 30479 30480 30481 30482 30483 30484 30485 30486 30487 30488 30489 30490 30491 30492 30493 30494 30495 30496 30497 30498 30499 30500 30501 30502 30503 30504 30505 30506 30507 30508 30509 30510 30511 30512 30513 30514 30515 30516 30517 30518 30519 30520 30521 30522 30523 30524 30525 30526 30527 30528 30529 30530 30531 30532 30533 30534 30535 30536 30537 30538 30539 30540 30541 30542 30543 30544 30545 30546 30547 30548 30549 30550 30551 30552 30553 30554 30555 30556 30557 30558 30559 30560 30561 30562 30563 30564 30565 30566 30567 30568 30569 30570 30571 30572 30573 30574 30575 30576 30577 30578 30579 30580 30581 30582 30583 30584 30585 30586 30587 30588 30589 30590 30591 30592 30593 30594 30595 30596 30597 30598 30599 30600 30601 30602 30603 30604 30605 30606 30607 30608 30609 30610 30611 30612 30613 30614 30615 30616 30617 30618 30619 30620 30621 30622 30623 30624 30625 30626 30627 30628 30629 30630 30631 30632 30633 30634 30635 30636 30637 30638 30639 30640 30641 30642 30643 30644 30645 30646 30647 30648 30649 30650 30651 30652 30653 30654 30655 30656 30657 30658 30659 30660 30661 30662 30663 30664 30665 30666 30667 30668 30669 30670 30671 30672 30673 30674 30675 30676 30677 30678 30679 30680 30681 30682 30683 30684 30685 30686 30687 30688 30689 30690 30691 30692 30693 30694 30695 30696 30697 30698 30699 30700 30701 30702 30703 30704 30705 30706 30707 30708 30709 30710 30711 30712 30713 30714 30715 30716 30717 30718 30719 30720 30721 30722 30723 30724 30725 30726 30727 30728 30729 30730 30731 30732 30733 30734 30735 30736 30737 30738 30739 30740 30741 30742 30743 30744 30745 30746 30747 30748 30749 30750 30751 30752 30753 30754 30755 30756 30757 30758 30759 30760 30761 30762 30763 30764 30765 30766 30767 30768 30769 30770 30771 30772 30773 30774 30775 30776 30777 30778 30779 30780 30781 30782 30783 30784 30785 30786 30787 30788 30789 30790 30791 30792 30793 30794 30795 30796 30797 30798 30799 30800 30801 30802 30803 30804 30805 30806 30807 30808 30809 30810 30811 30812 30813 30814 30815 30816 30817 30818 30819 30820 30821 30822 30823 30824 30825 30826 30827 30828 30829 30830 30831 30832 30833 30834 30835 30836 30837 30838 30839 30840 30841 30842 30843 30844 30845 30846 30847 30848 30849 30850 30851 30852 30853 30854 30855 30856 30857 30858 30859 30860 30861 30862 30863 30864 30865 30866 30867 30868 30869 30870 30871 30872 30873 30874 30875 30876 30877 30878 30879 30880 30881 30882 30883 30884 30885 30886 30887 30888 30889 30890 30891 30892 30893 30894 30895 30896 30897 30898 30899 30900 30901 30902 30903 30904 30905 30906 30907 30908 30909 30910 30911 30912 30913 30914 30915 30916 30917 30918 30919 30920 30921 30922 30923 30924 30925 30926 30927 30928 30929 30930 30931 30932 30933 30934 30935 30936 30937 30938 30939 30940 30941 30942 30943 30944 30945 30946 30947 30948 30949 30950 30951 30952 30953 30954 30955 30956 30957 30958 30959 30960 30961 30962 30963 30964 30965 30966 30967 30968 30969 30970 30971 30972 30973 30974 30975 30976 30977 30978 30979 30980 30981 30982 30983 30984 30985 30986 30987 30988 30989 30990 30991 30992 30993 30994 30995 30996 30997 30998 30999 31000 31001 31002 31003 31004 31005 31006 31007 31008 31009 31010 31011 31012 31013 31014 31015 31016 31017 31018 31019 31020 31021 31022 31023 31024 31025 31026 31027 31028 31029 31030 31031 31032 31033 31034 31035 31036 31037 31038 31039 31040 31041 31042 31043 31044 31045 31046 31047 31048 31049 31050 31051 31052 31053 31054 31055 31056 31057 31058 31059 31060 31061 31062 31063 31064 31065 31066 31067 31068 31069 31070 31071 31072 31073 31074 31075 31076 31077 31078 31079 31080 31081 31082 31083 31084 31085 31086 31087 31088 31089 31090 31091 31092 31093 31094 31095 31096 31097 31098 31099 31100 31101 31102 31103 31104 31105 31106 31107 31108 31109 31110 31111 31112 31113 31114 31115 31116 31117 31118 31119 31120 31121 31122 31123 31124 31125 31126 31127 31128 31129 31130 31131 31132 31133 31134 31135 31136 31137 31138 31139 31140 31141 31142 31143 31144 31145 31146 31147 31148 31149 31150 31151 31152 31153 31154 31155 31156 31157 31158 31159 31160 31161 31162 31163 31164 31165 31166 31167 31168 31169 31170 31171 31172 31173 31174 31175 31176 31177 31178 31179 31180 31181 31182 31183 31184 31185 31186 31187 31188 31189 31190 31191 31192 31193 31194 31195 31196 31197 31198 31199 31200 31201 31202 31203 31204 31205 31206 31207 31208 31209 31210 31211 31212 31213 31214 31215 31216 31217 31218 31219 31220 31221 31222 31223 31224 31225 31226 31227 31228 31229 31230 31231 31232 31233 31234 31235 31236 31237 31238 31239 31240 31241 31242 31243 31244 31245 31246 31247 31248 31249 31250 31251 31252 31253 31254 31255 31256 31257 31258 31259 31260 31261 31262 31263 31264 31265 31266 31267 31268 31269 31270 31271 31272 31273 31274 31275 31276 31277 31278 31279 31280 31281 31282 31283 31284 31285 31286 31287 31288 31289 31290 31291 31292 31293 31294 31295 31296 31297 31298 31299 31300 31301 31302 31303 31304 31305 31306 31307 31308 31309 31310 31311 31312 31313 31314 31315 31316 31317 31318 31319 31320 31321 31322 31323 31324 31325 31326 31327 31328 31329 31330 31331 31332 31333 31334 31335 31336 31337 31338 31339 31340 31341 31342 31343 31344 31345 31346 31347 31348 31349 31350 31351 31352 31353 31354 31355 31356 31357 31358 31359 31360 31361 31362 31363 31364
|
/* Permission is hereby granted, free of charge, to any person
* obtaining a copy of this software and associated documentation
* files (the "Software"), to deal in the Software without
* restriction, including without limitation the rights to use, copy,
* modify, merge, publish, distribute, sublicense, and/or sell copies
* of the Software, and to permit persons to whom the Software is
* furnished to do so, subject to the following conditions:
*
* The above copyright notice and this permission notice shall be
* included in all copies or substantial portions of the Software.
*
* THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
* EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
* MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
* NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS
* BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN
* ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN
* CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
* SOFTWARE.
*
* Copyright:
* 2020 Evan Nemerson <evan@nemerson.com>
*/
#define SIMDE_TESTS_CURRENT_ISAX svml
#include <test/x86/avx512/test-avx512.h>
#include <simde/x86/svml.h>
static int
test_simde_mm_acos_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.35)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 1.21)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 1.54)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( 1.84), SIMDE_FLOAT32_C( 1.08)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.57)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( 1.49), SIMDE_FLOAT32_C( 0.96)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.70)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( 2.61), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.80)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.92)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 2.03), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 2.74)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.66)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 1.78), SIMDE_FLOAT32_C( 2.94), SIMDE_FLOAT32_C( 2.29)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.69)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 0.81)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_acos_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_acos_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.35)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 2.42), SIMDE_FLOAT64_C( 1.21)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.76), SIMDE_FLOAT64_C( 1.53)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.88), SIMDE_FLOAT64_C( 1.54)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.67)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.05), SIMDE_FLOAT64_C( 0.84)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.84), SIMDE_FLOAT64_C( 1.08)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.42)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( 1.14)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.57)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.49), SIMDE_FLOAT64_C( 0.96)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.69)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 2.32), SIMDE_FLOAT64_C( 2.33)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_acos_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_acos_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.35)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 1.54),
SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 1.53),
SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 1.21)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.69),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.42),
SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 2.33),
SIMDE_FLOAT32_C( 1.49), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 1.14),
SIMDE_FLOAT32_C( 1.84), SIMDE_FLOAT32_C( 1.08)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.92),
SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.86),
SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.70)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.03), SIMDE_FLOAT32_C( 1.54),
SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 2.74),
SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( 2.61),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.80)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21),
SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.66)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( 2.04),
SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 1.78),
SIMDE_FLOAT32_C( 2.94), SIMDE_FLOAT32_C( 2.29)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.44),
SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.84)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 1.12),
SIMDE_FLOAT32_C( 2.19), SIMDE_FLOAT32_C( 1.18),
SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 0.14),
SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 0.57)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.26),
SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.03)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.97), SIMDE_FLOAT32_C( 1.16),
SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 1.22),
SIMDE_FLOAT32_C( 1.01), SIMDE_FLOAT32_C( 1.83),
SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 1.60)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.94),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.40)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.77), SIMDE_FLOAT32_C( 1.65),
SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 2.79),
SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 2.45),
SIMDE_FLOAT32_C( 2.15), SIMDE_FLOAT32_C( 1.16)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.25)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 1.72), SIMDE_FLOAT32_C( 0.61),
SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 1.32)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_acos_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_acos_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.35)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.76), SIMDE_FLOAT64_C( 1.53),
SIMDE_FLOAT64_C( 2.42), SIMDE_FLOAT64_C( 1.21)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.67),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.05), SIMDE_FLOAT64_C( 0.84),
SIMDE_FLOAT64_C( 1.88), SIMDE_FLOAT64_C( 1.54)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( 1.14),
SIMDE_FLOAT64_C( 1.84), SIMDE_FLOAT64_C( 1.08)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.69),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.57)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.32), SIMDE_FLOAT64_C( 2.33),
SIMDE_FLOAT64_C( 1.49), SIMDE_FLOAT64_C( 0.96)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.86),
SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 0.70)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.89), SIMDE_FLOAT64_C( 2.61),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.80)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.38), SIMDE_FLOAT64_C( -0.92)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.03), SIMDE_FLOAT64_C( 1.54),
SIMDE_FLOAT64_C( 1.96), SIMDE_FLOAT64_C( 2.74)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.98), SIMDE_FLOAT64_C( -0.66)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.31), SIMDE_FLOAT64_C( 1.78),
SIMDE_FLOAT64_C( 2.94), SIMDE_FLOAT64_C( 2.29)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.45),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.69)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.39), SIMDE_FLOAT64_C( 2.04),
SIMDE_FLOAT64_C( 1.34), SIMDE_FLOAT64_C( 0.81)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_acos_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_acos_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.35)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( 1.49), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( 1.84), SIMDE_FLOAT32_C( 1.08),
SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 1.54),
SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 1.21)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.92),
SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.70)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 1.78), SIMDE_FLOAT32_C( 2.94), SIMDE_FLOAT32_C( 2.29),
SIMDE_FLOAT32_C( 2.03), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 2.74),
SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( 2.61), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.80)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.84)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.97), SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 1.22),
SIMDE_FLOAT32_C( 1.01), SIMDE_FLOAT32_C( 1.83), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 1.60),
SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 1.12), SIMDE_FLOAT32_C( 2.19), SIMDE_FLOAT32_C( 1.18),
SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 0.57)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.25),
SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.94),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.40)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 1.72), SIMDE_FLOAT32_C( 0.61),
SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 1.32),
SIMDE_FLOAT32_C( 1.77), SIMDE_FLOAT32_C( 1.65), SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 2.79),
SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 2.15), SIMDE_FLOAT32_C( 1.16)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.53),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.17)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( 1.22),
SIMDE_FLOAT32_C( 2.63), SIMDE_FLOAT32_C( 2.50), SIMDE_FLOAT32_C( 1.91), SIMDE_FLOAT32_C( 2.13),
SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.85),
SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.45), SIMDE_FLOAT32_C( 1.74)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.59),
SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.62),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( -0.91),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.74)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 1.59), SIMDE_FLOAT32_C( 2.20),
SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 0.90),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 1.43), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 2.71),
SIMDE_FLOAT32_C( 1.23), SIMDE_FLOAT32_C( 1.47), SIMDE_FLOAT32_C( 1.09), SIMDE_FLOAT32_C( 2.40)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.98),
SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.10)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 2.18), SIMDE_FLOAT32_C( 3.14),
SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( 2.45),
SIMDE_FLOAT32_C( 2.19), SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.20),
SIMDE_FLOAT32_C( 2.43), SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 1.67)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.13),
SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.70)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.03), SIMDE_FLOAT32_C( 1.94), SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 1.60),
SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 1.91), SIMDE_FLOAT32_C( 1.70),
SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 1.17),
SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 2.35)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_acos_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_acos_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 0.70),
SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.35)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.98),
SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.42),
SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.27),
SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( -0.75)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 1.49), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 1.84),
SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 0.35)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.55),
SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.03)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.26),
SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.99)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.74), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( 1.32), SIMDE_FLOAT32_C( 1.65), SIMDE_FLOAT32_C( 2.79), SIMDE_FLOAT32_C( 2.45),
SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 1.83),
SIMDE_FLOAT32_C( 1.60), SIMDE_FLOAT32_C( 1.12), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.14)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( 0.10),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.80),
SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.54)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( -0.87),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( -0.07)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( 1.23),
SIMDE_FLOAT32_C( 1.09), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( -0.80),
SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.54)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -0.34),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( -0.76)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.98)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 3.14),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 0.20)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.18),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.13),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.44)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( -0.38),
SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( 0.48)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 1.41),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 1.13), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 1.46),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 2.79), SIMDE_FLOAT32_C( 0.44)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( -0.81), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( -0.78),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.07),
SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.43)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.72),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.49)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 2.40), SIMDE_FLOAT32_C( -0.78),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 1.41),
SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( 1.30), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.43)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.08),
SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( -0.89)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.43),
SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.58),
SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( 0.09)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 1.45), SIMDE_FLOAT32_C( 2.19),
SIMDE_FLOAT32_C( 1.61), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 1.48)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.82),
SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( -0.42),
SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -0.20)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -0.35),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( -0.25),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.85)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 1.93),
SIMDE_FLOAT32_C( 1.51), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( -0.42),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.55)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_acos_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_acos_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.67),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.35)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.05), SIMDE_FLOAT64_C( 0.84),
SIMDE_FLOAT64_C( 1.88), SIMDE_FLOAT64_C( 1.54),
SIMDE_FLOAT64_C( 1.76), SIMDE_FLOAT64_C( 1.53),
SIMDE_FLOAT64_C( 2.42), SIMDE_FLOAT64_C( 1.21)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.69),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.32), SIMDE_FLOAT64_C( 2.33),
SIMDE_FLOAT64_C( 1.49), SIMDE_FLOAT64_C( 0.96),
SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( 1.14),
SIMDE_FLOAT64_C( 1.84), SIMDE_FLOAT64_C( 1.08)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.38), SIMDE_FLOAT64_C( -0.92),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.86),
SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 0.70)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.03), SIMDE_FLOAT64_C( 1.54),
SIMDE_FLOAT64_C( 1.96), SIMDE_FLOAT64_C( 2.74),
SIMDE_FLOAT64_C( 1.89), SIMDE_FLOAT64_C( 2.61),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.80)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.45),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.69),
SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.98), SIMDE_FLOAT64_C( -0.66)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.39), SIMDE_FLOAT64_C( 2.04),
SIMDE_FLOAT64_C( 1.34), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( 1.31), SIMDE_FLOAT64_C( 1.78),
SIMDE_FLOAT64_C( 2.94), SIMDE_FLOAT64_C( 2.29)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( 0.38),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.84)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.45), SIMDE_FLOAT64_C( 1.12),
SIMDE_FLOAT64_C( 2.19), SIMDE_FLOAT64_C( 1.18),
SIMDE_FLOAT64_C( 2.45), SIMDE_FLOAT64_C( 0.14),
SIMDE_FLOAT64_C( 1.54), SIMDE_FLOAT64_C( 0.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.34),
SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( -0.26),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -0.03)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.97), SIMDE_FLOAT64_C( 1.16),
SIMDE_FLOAT64_C( 0.85), SIMDE_FLOAT64_C( 1.22),
SIMDE_FLOAT64_C( 1.01), SIMDE_FLOAT64_C( 1.83),
SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 1.60)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.20), SIMDE_FLOAT64_C( -0.08),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.94),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( 0.40)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.77), SIMDE_FLOAT64_C( 1.65),
SIMDE_FLOAT64_C( 1.22), SIMDE_FLOAT64_C( 2.79),
SIMDE_FLOAT64_C( 2.42), SIMDE_FLOAT64_C( 2.45),
SIMDE_FLOAT64_C( 2.15), SIMDE_FLOAT64_C( 1.16)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 0.68),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( 0.60),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 0.25)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( 1.72), SIMDE_FLOAT64_C( 0.61),
SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.93),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 1.32)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_acos_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_acos_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.35)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( 0.08),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( -0.27),
SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( -0.30),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( -0.75)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.32), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( 1.05), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( 1.76), SIMDE_FLOAT64_C( 2.42)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.23),
SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.98),
SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( -0.38),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.42)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( -0.45),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.66), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -0.86)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( 2.04),
SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( -0.98),
SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 1.54),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( 2.61)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.26),
SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.99)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( -0.39),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.53),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( -0.77)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.15), SIMDE_FLOAT64_C( 1.97),
SIMDE_FLOAT64_C( 0.85), SIMDE_FLOAT64_C( 1.01),
SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 2.45),
SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 2.45)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.91),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( -0.20),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.75)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( -0.17),
SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( 0.25),
SIMDE_FLOAT64_C( -0.08), SIMDE_FLOAT64_C( -0.94)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( 1.74),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.61),
SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 1.32),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 2.79)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( -0.74),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.34),
SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( -0.53),
SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.66)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( 0.46),
SIMDE_FLOAT64_C( -0.18), SIMDE_FLOAT64_C( 0.32),
SIMDE_FLOAT64_C( -0.87), SIMDE_FLOAT64_C( -0.33),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.56)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.23), SIMDE_FLOAT64_C( -0.74),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 1.25),
SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( -0.53),
SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.98)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.02),
SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( 0.51),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.98)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( 0.84),
SIMDE_FLOAT64_C( -0.59), SIMDE_FLOAT64_C( 0.73),
SIMDE_FLOAT64_C( 0.62), SIMDE_FLOAT64_C( 0.14)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 1.14),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.02),
SIMDE_FLOAT64_C( 2.20), SIMDE_FLOAT64_C( 0.51),
SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( 1.43)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( -0.30),
SIMDE_FLOAT64_C( -0.70), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 0.92),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( -0.07)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 0.01),
SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( -0.57), SIMDE_FLOAT64_C( -0.34),
SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( -0.58)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 1.56),
SIMDE_FLOAT64_C( -0.70), SIMDE_FLOAT64_C( 2.45),
SIMDE_FLOAT64_C( 2.18), SIMDE_FLOAT64_C( 1.92),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 2.19)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 0.94),
SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( -0.44),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.93),
SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( -0.18)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.78), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( -0.20),
SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( -0.03),
SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( -0.13)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.47), SIMDE_FLOAT64_C( 1.12),
SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( 1.77),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 1.60),
SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( 1.70)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_acos_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_acosh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 3.69), SIMDE_FLOAT32_C( 4.43), SIMDE_FLOAT32_C( 1.81), SIMDE_FLOAT32_C( 5.44)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 2.17), SIMDE_FLOAT32_C( 1.20), SIMDE_FLOAT32_C( 2.38)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 5.94), SIMDE_FLOAT32_C( 6.51), SIMDE_FLOAT32_C( 3.32), SIMDE_FLOAT32_C( 4.41)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 2.47), SIMDE_FLOAT32_C( 2.56), SIMDE_FLOAT32_C( 1.87), SIMDE_FLOAT32_C( 2.16)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 7.02), SIMDE_FLOAT32_C( 5.69), SIMDE_FLOAT32_C( 3.41), SIMDE_FLOAT32_C( 5.84)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 2.64), SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 1.90), SIMDE_FLOAT32_C( 2.45)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 2.06), SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( 4.58), SIMDE_FLOAT32_C( 6.19)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 2.20), SIMDE_FLOAT32_C( 2.51)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( 1.46), SIMDE_FLOAT32_C( 2.92), SIMDE_FLOAT32_C( 6.60)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 2.57)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 2.83), SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 1.25)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 2.16), SIMDE_FLOAT32_C( 1.77), SIMDE_FLOAT32_C( 0.69)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 5.16), SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 2.12)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 1.38)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 4.89), SIMDE_FLOAT32_C( 2.81), SIMDE_FLOAT32_C( 5.07), SIMDE_FLOAT32_C( 6.57)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 2.27), SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 2.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_acosh_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_acosh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 1.81), SIMDE_FLOAT64_C( 5.44)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.20), SIMDE_FLOAT64_C( 2.38)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 3.69), SIMDE_FLOAT64_C( 4.43)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.98), SIMDE_FLOAT64_C( 2.17)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 3.32), SIMDE_FLOAT64_C( 4.41)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.87), SIMDE_FLOAT64_C( 2.16)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 5.94), SIMDE_FLOAT64_C( 6.51)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 2.47), SIMDE_FLOAT64_C( 2.56)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 3.41), SIMDE_FLOAT64_C( 5.84)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.90), SIMDE_FLOAT64_C( 2.45)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 7.02), SIMDE_FLOAT64_C( 5.69)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 2.64), SIMDE_FLOAT64_C( 2.42)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 4.58), SIMDE_FLOAT64_C( 6.19)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 2.20), SIMDE_FLOAT64_C( 2.51)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 2.06), SIMDE_FLOAT64_C( 2.04)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.35), SIMDE_FLOAT64_C( 1.34)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_acosh_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_acosh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 5.94), SIMDE_FLOAT32_C( 6.51),
SIMDE_FLOAT32_C( 3.32), SIMDE_FLOAT32_C( 4.41),
SIMDE_FLOAT32_C( 3.69), SIMDE_FLOAT32_C( 4.43),
SIMDE_FLOAT32_C( 1.81), SIMDE_FLOAT32_C( 5.44)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.47), SIMDE_FLOAT32_C( 2.56),
SIMDE_FLOAT32_C( 1.87), SIMDE_FLOAT32_C( 2.16),
SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 2.17),
SIMDE_FLOAT32_C( 1.20), SIMDE_FLOAT32_C( 2.38)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.06), SIMDE_FLOAT32_C( 2.04),
SIMDE_FLOAT32_C( 4.58), SIMDE_FLOAT32_C( 6.19),
SIMDE_FLOAT32_C( 7.02), SIMDE_FLOAT32_C( 5.69),
SIMDE_FLOAT32_C( 3.41), SIMDE_FLOAT32_C( 5.84)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 1.34),
SIMDE_FLOAT32_C( 2.20), SIMDE_FLOAT32_C( 2.51),
SIMDE_FLOAT32_C( 2.64), SIMDE_FLOAT32_C( 2.42),
SIMDE_FLOAT32_C( 1.90), SIMDE_FLOAT32_C( 2.45)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.83), SIMDE_FLOAT32_C( 4.39),
SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 1.25),
SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( 1.46),
SIMDE_FLOAT32_C( 2.92), SIMDE_FLOAT32_C( 6.60)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 2.16),
SIMDE_FLOAT32_C( 1.77), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 2.57)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 4.89), SIMDE_FLOAT32_C( 2.81),
SIMDE_FLOAT32_C( 5.07), SIMDE_FLOAT32_C( 6.57),
SIMDE_FLOAT32_C( 5.16), SIMDE_FLOAT32_C( 3.60),
SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 2.12)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.27), SIMDE_FLOAT32_C( 1.69),
SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 2.57),
SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 1.95),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 1.38)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 5.76),
SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 5.56),
SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 7.58),
SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 7.08)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 2.44),
SIMDE_FLOAT32_C( 1.51), SIMDE_FLOAT32_C( 2.40),
SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 2.71),
SIMDE_FLOAT32_C( 2.16), SIMDE_FLOAT32_C( 2.65)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.02), SIMDE_FLOAT32_C( 5.61),
SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 5.42),
SIMDE_FLOAT32_C( 6.06), SIMDE_FLOAT32_C( 3.43),
SIMDE_FLOAT32_C( 6.88), SIMDE_FLOAT32_C( 4.20)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.77), SIMDE_FLOAT32_C( 2.41),
SIMDE_FLOAT32_C( 2.55), SIMDE_FLOAT32_C( 2.37),
SIMDE_FLOAT32_C( 2.49), SIMDE_FLOAT32_C( 1.90),
SIMDE_FLOAT32_C( 2.62), SIMDE_FLOAT32_C( 2.11)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.63), SIMDE_FLOAT32_C( 4.03),
SIMDE_FLOAT32_C( 5.41), SIMDE_FLOAT32_C( 1.18),
SIMDE_FLOAT32_C( 1.83), SIMDE_FLOAT32_C( 1.77),
SIMDE_FLOAT32_C( 2.47), SIMDE_FLOAT32_C( 5.62)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 2.07),
SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 1.21), SIMDE_FLOAT32_C( 1.17),
SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( 2.41)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 5.85), SIMDE_FLOAT32_C( 6.54),
SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 7.00),
SIMDE_FLOAT32_C( 7.30), SIMDE_FLOAT32_C( 6.28),
SIMDE_FLOAT32_C( 6.91), SIMDE_FLOAT32_C( 5.14)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 2.57),
SIMDE_FLOAT32_C( 2.01), SIMDE_FLOAT32_C( 2.63),
SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( 2.52),
SIMDE_FLOAT32_C( 2.62), SIMDE_FLOAT32_C( 2.32)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_acosh_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_acosh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.69), SIMDE_FLOAT64_C( 4.43),
SIMDE_FLOAT64_C( 1.81), SIMDE_FLOAT64_C( 5.44)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.98), SIMDE_FLOAT64_C( 2.17),
SIMDE_FLOAT64_C( 1.20), SIMDE_FLOAT64_C( 2.38)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 5.94), SIMDE_FLOAT64_C( 6.51),
SIMDE_FLOAT64_C( 3.32), SIMDE_FLOAT64_C( 4.41)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.47), SIMDE_FLOAT64_C( 2.56),
SIMDE_FLOAT64_C( 1.87), SIMDE_FLOAT64_C( 2.16)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 7.02), SIMDE_FLOAT64_C( 5.69),
SIMDE_FLOAT64_C( 3.41), SIMDE_FLOAT64_C( 5.84)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.64), SIMDE_FLOAT64_C( 2.42),
SIMDE_FLOAT64_C( 1.90), SIMDE_FLOAT64_C( 2.45)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.06), SIMDE_FLOAT64_C( 2.04),
SIMDE_FLOAT64_C( 4.58), SIMDE_FLOAT64_C( 6.19)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.35), SIMDE_FLOAT64_C( 1.34),
SIMDE_FLOAT64_C( 2.20), SIMDE_FLOAT64_C( 2.51)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.29), SIMDE_FLOAT64_C( 1.46),
SIMDE_FLOAT64_C( 2.92), SIMDE_FLOAT64_C( 6.60)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.86), SIMDE_FLOAT64_C( 0.93),
SIMDE_FLOAT64_C( 1.73), SIMDE_FLOAT64_C( 2.57)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.83), SIMDE_FLOAT64_C( 4.39),
SIMDE_FLOAT64_C( 3.03), SIMDE_FLOAT64_C( 1.25)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.70), SIMDE_FLOAT64_C( 2.16),
SIMDE_FLOAT64_C( 1.77), SIMDE_FLOAT64_C( 0.69)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 5.16), SIMDE_FLOAT64_C( 3.60),
SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( 2.12)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.32), SIMDE_FLOAT64_C( 1.95),
SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 1.38)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 4.89), SIMDE_FLOAT64_C( 2.81),
SIMDE_FLOAT64_C( 5.07), SIMDE_FLOAT64_C( 6.57)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.27), SIMDE_FLOAT64_C( 1.69),
SIMDE_FLOAT64_C( 2.31), SIMDE_FLOAT64_C( 2.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_acosh_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_acosh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.06), SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( 4.58), SIMDE_FLOAT32_C( 6.19),
SIMDE_FLOAT32_C( 7.02), SIMDE_FLOAT32_C( 5.69), SIMDE_FLOAT32_C( 3.41), SIMDE_FLOAT32_C( 5.84),
SIMDE_FLOAT32_C( 5.94), SIMDE_FLOAT32_C( 6.51), SIMDE_FLOAT32_C( 3.32), SIMDE_FLOAT32_C( 4.41),
SIMDE_FLOAT32_C( 3.69), SIMDE_FLOAT32_C( 4.43), SIMDE_FLOAT32_C( 1.81), SIMDE_FLOAT32_C( 5.44)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 2.20), SIMDE_FLOAT32_C( 2.51),
SIMDE_FLOAT32_C( 2.64), SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 1.90), SIMDE_FLOAT32_C( 2.45),
SIMDE_FLOAT32_C( 2.47), SIMDE_FLOAT32_C( 2.56), SIMDE_FLOAT32_C( 1.87), SIMDE_FLOAT32_C( 2.16),
SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 2.17), SIMDE_FLOAT32_C( 1.20), SIMDE_FLOAT32_C( 2.38)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.89), SIMDE_FLOAT32_C( 2.81), SIMDE_FLOAT32_C( 5.07), SIMDE_FLOAT32_C( 6.57),
SIMDE_FLOAT32_C( 5.16), SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 2.12),
SIMDE_FLOAT32_C( 2.83), SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 1.25),
SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( 1.46), SIMDE_FLOAT32_C( 2.92), SIMDE_FLOAT32_C( 6.60)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.27), SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 2.57),
SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 1.38),
SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 2.16), SIMDE_FLOAT32_C( 1.77), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 2.57)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.02), SIMDE_FLOAT32_C( 5.61), SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 5.42),
SIMDE_FLOAT32_C( 6.06), SIMDE_FLOAT32_C( 3.43), SIMDE_FLOAT32_C( 6.88), SIMDE_FLOAT32_C( 4.20),
SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 5.76), SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 5.56),
SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 7.58), SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 7.08)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.77), SIMDE_FLOAT32_C( 2.41), SIMDE_FLOAT32_C( 2.55), SIMDE_FLOAT32_C( 2.37),
SIMDE_FLOAT32_C( 2.49), SIMDE_FLOAT32_C( 1.90), SIMDE_FLOAT32_C( 2.62), SIMDE_FLOAT32_C( 2.11),
SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 2.44), SIMDE_FLOAT32_C( 1.51), SIMDE_FLOAT32_C( 2.40),
SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 2.71), SIMDE_FLOAT32_C( 2.16), SIMDE_FLOAT32_C( 2.65)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.85), SIMDE_FLOAT32_C( 6.54), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 7.00),
SIMDE_FLOAT32_C( 7.30), SIMDE_FLOAT32_C( 6.28), SIMDE_FLOAT32_C( 6.91), SIMDE_FLOAT32_C( 5.14),
SIMDE_FLOAT32_C( 3.63), SIMDE_FLOAT32_C( 4.03), SIMDE_FLOAT32_C( 5.41), SIMDE_FLOAT32_C( 1.18),
SIMDE_FLOAT32_C( 1.83), SIMDE_FLOAT32_C( 1.77), SIMDE_FLOAT32_C( 2.47), SIMDE_FLOAT32_C( 5.62)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 2.57), SIMDE_FLOAT32_C( 2.01), SIMDE_FLOAT32_C( 2.63),
SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( 2.52), SIMDE_FLOAT32_C( 2.62), SIMDE_FLOAT32_C( 2.32),
SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 2.07), SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 1.21), SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( 2.41)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.71), SIMDE_FLOAT32_C( 6.80), SIMDE_FLOAT32_C( 5.37), SIMDE_FLOAT32_C( 5.43),
SIMDE_FLOAT32_C( 1.41), SIMDE_FLOAT32_C( 1.67), SIMDE_FLOAT32_C( 3.22), SIMDE_FLOAT32_C( 2.56),
SIMDE_FLOAT32_C( 3.67), SIMDE_FLOAT32_C( 1.59), SIMDE_FLOAT32_C( 6.15), SIMDE_FLOAT32_C( 6.46),
SIMDE_FLOAT32_C( 4.07), SIMDE_FLOAT32_C( 6.09), SIMDE_FLOAT32_C( 4.70), SIMDE_FLOAT32_C( 3.73)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 2.60), SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 2.38),
SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 1.10), SIMDE_FLOAT32_C( 1.84), SIMDE_FLOAT32_C( 1.59),
SIMDE_FLOAT32_C( 1.97), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 2.50), SIMDE_FLOAT32_C( 2.55),
SIMDE_FLOAT32_C( 2.08), SIMDE_FLOAT32_C( 2.49), SIMDE_FLOAT32_C( 2.23), SIMDE_FLOAT32_C( 1.99)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 6.58), SIMDE_FLOAT32_C( 7.07), SIMDE_FLOAT32_C( 4.23), SIMDE_FLOAT32_C( 2.35),
SIMDE_FLOAT32_C( 2.82), SIMDE_FLOAT32_C( 6.71), SIMDE_FLOAT32_C( 5.97), SIMDE_FLOAT32_C( 6.36),
SIMDE_FLOAT32_C( 7.04), SIMDE_FLOAT32_C( 4.76), SIMDE_FLOAT32_C( 7.53), SIMDE_FLOAT32_C( 1.31),
SIMDE_FLOAT32_C( 5.39), SIMDE_FLOAT32_C( 4.63), SIMDE_FLOAT32_C( 5.83), SIMDE_FLOAT32_C( 1.86)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.57), SIMDE_FLOAT32_C( 2.64), SIMDE_FLOAT32_C( 2.12), SIMDE_FLOAT32_C( 1.50),
SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 2.59), SIMDE_FLOAT32_C( 2.47), SIMDE_FLOAT32_C( 2.54),
SIMDE_FLOAT32_C( 2.64), SIMDE_FLOAT32_C( 2.24), SIMDE_FLOAT32_C( 2.71), SIMDE_FLOAT32_C( 0.77),
SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 2.21), SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 1.23)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 7.01), SIMDE_FLOAT32_C( 2.41), SIMDE_FLOAT32_C( 1.01),
SIMDE_FLOAT32_C( 3.19), SIMDE_FLOAT32_C( 7.35), SIMDE_FLOAT32_C( 5.27), SIMDE_FLOAT32_C( 1.77),
SIMDE_FLOAT32_C( 2.40), SIMDE_FLOAT32_C( 4.08), SIMDE_FLOAT32_C( 6.64), SIMDE_FLOAT32_C( 7.53),
SIMDE_FLOAT32_C( 1.80), SIMDE_FLOAT32_C( 5.70), SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 3.99)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 2.64), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 0.14),
SIMDE_FLOAT32_C( 1.83), SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( 2.35), SIMDE_FLOAT32_C( 1.17),
SIMDE_FLOAT32_C( 1.52), SIMDE_FLOAT32_C( 2.08), SIMDE_FLOAT32_C( 2.58), SIMDE_FLOAT32_C( 2.71),
SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 2.43), SIMDE_FLOAT32_C( 2.16), SIMDE_FLOAT32_C( 2.06)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.85), SIMDE_FLOAT32_C( 3.11), SIMDE_FLOAT32_C( 1.82), SIMDE_FLOAT32_C( 4.19),
SIMDE_FLOAT32_C( 7.38), SIMDE_FLOAT32_C( 4.32), SIMDE_FLOAT32_C( 3.22), SIMDE_FLOAT32_C( 3.89),
SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( 4.43), SIMDE_FLOAT32_C( 5.99), SIMDE_FLOAT32_C( 5.60),
SIMDE_FLOAT32_C( 4.35), SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( 7.32), SIMDE_FLOAT32_C( 2.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.71), SIMDE_FLOAT32_C( 1.80), SIMDE_FLOAT32_C( 1.21), SIMDE_FLOAT32_C( 2.11),
SIMDE_FLOAT32_C( 2.69), SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 1.84), SIMDE_FLOAT32_C( 2.03),
SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 2.17), SIMDE_FLOAT32_C( 2.48), SIMDE_FLOAT32_C( 2.41),
SIMDE_FLOAT32_C( 2.15), SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( 1.32)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_acosh_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_acosh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.81), SIMDE_FLOAT32_C( 6.57), SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 2.12),
SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( 1.46), SIMDE_FLOAT32_C( 6.60),
SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( 6.19), SIMDE_FLOAT32_C( 5.69), SIMDE_FLOAT32_C( 5.84),
SIMDE_FLOAT32_C( 6.51), SIMDE_FLOAT32_C( 4.41), SIMDE_FLOAT32_C( 4.43), SIMDE_FLOAT32_C( 5.44)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.89), SIMDE_FLOAT32_C( 5.07), SIMDE_FLOAT32_C( 5.16), SIMDE_FLOAT32_C( 1.08),
SIMDE_FLOAT32_C( 2.83), SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( 2.92),
SIMDE_FLOAT32_C( 2.06), SIMDE_FLOAT32_C( 4.58), SIMDE_FLOAT32_C( 7.02), SIMDE_FLOAT32_C( 3.41),
SIMDE_FLOAT32_C( 5.94), SIMDE_FLOAT32_C( 3.32), SIMDE_FLOAT32_C( 3.69), SIMDE_FLOAT32_C( 1.81)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.27), SIMDE_FLOAT32_C( 6.57), SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 2.12),
SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( 1.46), SIMDE_FLOAT32_C( 1.73),
SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 2.20), SIMDE_FLOAT32_C( 2.64), SIMDE_FLOAT32_C( 1.90),
SIMDE_FLOAT32_C( 2.47), SIMDE_FLOAT32_C( 4.41), SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 5.44)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.85), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 7.30), SIMDE_FLOAT32_C( 6.91),
SIMDE_FLOAT32_C( 3.63), SIMDE_FLOAT32_C( 5.41), SIMDE_FLOAT32_C( 1.83), SIMDE_FLOAT32_C( 2.47),
SIMDE_FLOAT32_C( 3.02), SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 6.06), SIMDE_FLOAT32_C( 6.88),
SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 4.39)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.73), SIMDE_FLOAT32_C( 6.54), SIMDE_FLOAT32_C( 7.00), SIMDE_FLOAT32_C( 6.28),
SIMDE_FLOAT32_C( 5.14), SIMDE_FLOAT32_C( 4.03), SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 1.77),
SIMDE_FLOAT32_C( 5.62), SIMDE_FLOAT32_C( 5.61), SIMDE_FLOAT32_C( 5.42), SIMDE_FLOAT32_C( 3.43),
SIMDE_FLOAT32_C( 4.20), SIMDE_FLOAT32_C( 5.76), SIMDE_FLOAT32_C( 5.56), SIMDE_FLOAT32_C( 7.58)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 7.30), SIMDE_FLOAT32_C( 6.91),
SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 2.07), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 1.17),
SIMDE_FLOAT32_C( 2.41), SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 1.90),
SIMDE_FLOAT32_C( 2.11), SIMDE_FLOAT32_C( 2.44), SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 2.71)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.99), SIMDE_FLOAT32_C( 7.07), SIMDE_FLOAT32_C( 2.35), SIMDE_FLOAT32_C( 6.71),
SIMDE_FLOAT32_C( 6.36), SIMDE_FLOAT32_C( 4.76), SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 4.63),
SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( 6.80), SIMDE_FLOAT32_C( 5.43), SIMDE_FLOAT32_C( 1.67),
SIMDE_FLOAT32_C( 2.56), SIMDE_FLOAT32_C( 1.59), SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 6.09)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 6.58), SIMDE_FLOAT32_C( 4.23), SIMDE_FLOAT32_C( 2.82),
SIMDE_FLOAT32_C( 5.97), SIMDE_FLOAT32_C( 7.04), SIMDE_FLOAT32_C( 7.53), SIMDE_FLOAT32_C( 5.39),
SIMDE_FLOAT32_C( 5.83), SIMDE_FLOAT32_C( 3.71), SIMDE_FLOAT32_C( 5.37), SIMDE_FLOAT32_C( 1.41),
SIMDE_FLOAT32_C( 3.22), SIMDE_FLOAT32_C( 3.67), SIMDE_FLOAT32_C( 6.15), SIMDE_FLOAT32_C( 4.07)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.99), SIMDE_FLOAT32_C( 2.57), SIMDE_FLOAT32_C( 2.35), SIMDE_FLOAT32_C( 6.71),
SIMDE_FLOAT32_C( 6.36), SIMDE_FLOAT32_C( 4.76), SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 2.37),
SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 6.80), SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 1.67),
SIMDE_FLOAT32_C( 2.56), SIMDE_FLOAT32_C( 1.97), SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 6.09)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.15), SIMDE_FLOAT32_C( 2.85), SIMDE_FLOAT32_C( 1.82), SIMDE_FLOAT32_C( 7.38),
SIMDE_FLOAT32_C( 3.22), SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( 5.99), SIMDE_FLOAT32_C( 4.35),
SIMDE_FLOAT32_C( 7.32), SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 2.41), SIMDE_FLOAT32_C( 3.19),
SIMDE_FLOAT32_C( 5.27), SIMDE_FLOAT32_C( 2.40), SIMDE_FLOAT32_C( 6.64), SIMDE_FLOAT32_C( 1.80)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.26), SIMDE_FLOAT32_C( 3.65), SIMDE_FLOAT32_C( 3.11), SIMDE_FLOAT32_C( 4.19),
SIMDE_FLOAT32_C( 4.32), SIMDE_FLOAT32_C( 3.89), SIMDE_FLOAT32_C( 4.43), SIMDE_FLOAT32_C( 5.60),
SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 7.01), SIMDE_FLOAT32_C( 1.01),
SIMDE_FLOAT32_C( 7.35), SIMDE_FLOAT32_C( 1.77), SIMDE_FLOAT32_C( 4.08), SIMDE_FLOAT32_C( 7.53)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.15), SIMDE_FLOAT32_C( 2.85), SIMDE_FLOAT32_C( 1.82), SIMDE_FLOAT32_C( 7.38),
SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( 5.99), SIMDE_FLOAT32_C( 4.35),
SIMDE_FLOAT32_C( 7.32), SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 2.64), SIMDE_FLOAT32_C( 0.14),
SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( 2.40), SIMDE_FLOAT32_C( 2.08), SIMDE_FLOAT32_C( 2.71)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.25), SIMDE_FLOAT32_C( 1.87), SIMDE_FLOAT32_C( 3.26), SIMDE_FLOAT32_C( 4.89),
SIMDE_FLOAT32_C( 5.44), SIMDE_FLOAT32_C( 7.23), SIMDE_FLOAT32_C( 7.32), SIMDE_FLOAT32_C( 4.74),
SIMDE_FLOAT32_C( 5.90), SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 1.77),
SIMDE_FLOAT32_C( 5.03), SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 1.74), SIMDE_FLOAT32_C( 5.75)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.94), SIMDE_FLOAT32_C( 7.32), SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 4.83),
SIMDE_FLOAT32_C( 1.46), SIMDE_FLOAT32_C( 5.71), SIMDE_FLOAT32_C( 7.38), SIMDE_FLOAT32_C( 4.66),
SIMDE_FLOAT32_C( 7.03), SIMDE_FLOAT32_C( 4.05), SIMDE_FLOAT32_C( 5.08), SIMDE_FLOAT32_C( 3.05),
SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 2.24), SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 5.87)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.25), SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( 3.26), SIMDE_FLOAT32_C( 2.26),
SIMDE_FLOAT32_C( 5.44), SIMDE_FLOAT32_C( 2.43), SIMDE_FLOAT32_C( 2.69), SIMDE_FLOAT32_C( 2.22),
SIMDE_FLOAT32_C( 5.90), SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 1.77),
SIMDE_FLOAT32_C( 5.03), SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 5.75)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.21), SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 4.07), SIMDE_FLOAT32_C( 1.71),
SIMDE_FLOAT32_C( 4.61), SIMDE_FLOAT32_C( 4.98), SIMDE_FLOAT32_C( 7.05), SIMDE_FLOAT32_C( 4.08),
SIMDE_FLOAT32_C( 3.36), SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 3.25), SIMDE_FLOAT32_C( 6.89),
SIMDE_FLOAT32_C( 2.22), SIMDE_FLOAT32_C( 6.14), SIMDE_FLOAT32_C( 5.75), SIMDE_FLOAT32_C( 5.73)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.12), SIMDE_FLOAT32_C( 2.96), SIMDE_FLOAT32_C( 1.85), SIMDE_FLOAT32_C( 1.78),
SIMDE_FLOAT32_C( 6.91), SIMDE_FLOAT32_C( 4.32), SIMDE_FLOAT32_C( 1.60), SIMDE_FLOAT32_C( 4.83),
SIMDE_FLOAT32_C( 6.21), SIMDE_FLOAT32_C( 4.26), SIMDE_FLOAT32_C( 3.28), SIMDE_FLOAT32_C( 1.93),
SIMDE_FLOAT32_C( 5.40), SIMDE_FLOAT32_C( 5.21), SIMDE_FLOAT32_C( 1.27), SIMDE_FLOAT32_C( 2.68)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.21), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 1.23), SIMDE_FLOAT32_C( 1.71),
SIMDE_FLOAT32_C( 2.62), SIMDE_FLOAT32_C( 4.98), SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( 2.26),
SIMDE_FLOAT32_C( 3.36), SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 3.25), SIMDE_FLOAT32_C( 6.89),
SIMDE_FLOAT32_C( 2.22), SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( 5.75), SIMDE_FLOAT32_C( 5.73)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 6.05), SIMDE_FLOAT32_C( 4.22), SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( 6.41),
SIMDE_FLOAT32_C( 5.79), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 4.65), SIMDE_FLOAT32_C( 6.25),
SIMDE_FLOAT32_C( 4.40), SIMDE_FLOAT32_C( 6.40), SIMDE_FLOAT32_C( 4.02), SIMDE_FLOAT32_C( 4.56),
SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 6.31), SIMDE_FLOAT32_C( 5.60), SIMDE_FLOAT32_C( 1.37)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.36), SIMDE_FLOAT32_C( 6.97), SIMDE_FLOAT32_C( 4.78), SIMDE_FLOAT32_C( 2.89),
SIMDE_FLOAT32_C( 5.32), SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( 2.79), SIMDE_FLOAT32_C( 6.54),
SIMDE_FLOAT32_C( 4.52), SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( 4.69), SIMDE_FLOAT32_C( 2.40),
SIMDE_FLOAT32_C( 4.17), SIMDE_FLOAT32_C( 3.47), SIMDE_FLOAT32_C( 4.12), SIMDE_FLOAT32_C( 4.60)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 6.05), SIMDE_FLOAT32_C( 4.22), SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( 6.41),
SIMDE_FLOAT32_C( 5.79), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 2.57),
SIMDE_FLOAT32_C( 2.19), SIMDE_FLOAT32_C( 6.40), SIMDE_FLOAT32_C( 2.23), SIMDE_FLOAT32_C( 1.52),
SIMDE_FLOAT32_C( 2.11), SIMDE_FLOAT32_C( 6.31), SIMDE_FLOAT32_C( 5.60), SIMDE_FLOAT32_C( 2.21)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 5.39), SIMDE_FLOAT32_C( 2.67), SIMDE_FLOAT32_C( 7.01),
SIMDE_FLOAT32_C( 7.46), SIMDE_FLOAT32_C( 7.45), SIMDE_FLOAT32_C( 7.02), SIMDE_FLOAT32_C( 1.61),
SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 1.90), SIMDE_FLOAT32_C( 2.91),
SIMDE_FLOAT32_C( 4.63), SIMDE_FLOAT32_C( 4.64), SIMDE_FLOAT32_C( 5.75), SIMDE_FLOAT32_C( 3.63)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.52), SIMDE_FLOAT32_C( 4.25), SIMDE_FLOAT32_C( 7.02), SIMDE_FLOAT32_C( 6.92),
SIMDE_FLOAT32_C( 1.87), SIMDE_FLOAT32_C( 3.28), SIMDE_FLOAT32_C( 6.71), SIMDE_FLOAT32_C( 3.14),
SIMDE_FLOAT32_C( 4.50), SIMDE_FLOAT32_C( 4.66), SIMDE_FLOAT32_C( 6.66), SIMDE_FLOAT32_C( 3.47),
SIMDE_FLOAT32_C( 7.42), SIMDE_FLOAT32_C( 5.49), SIMDE_FLOAT32_C( 4.26), SIMDE_FLOAT32_C( 7.11)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 5.39), SIMDE_FLOAT32_C( 2.64), SIMDE_FLOAT32_C( 2.62),
SIMDE_FLOAT32_C( 7.46), SIMDE_FLOAT32_C( 7.45), SIMDE_FLOAT32_C( 7.02), SIMDE_FLOAT32_C( 1.81),
SIMDE_FLOAT32_C( 2.18), SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 2.58), SIMDE_FLOAT32_C( 2.91),
SIMDE_FLOAT32_C( 2.69), SIMDE_FLOAT32_C( 4.64), SIMDE_FLOAT32_C( 5.75), SIMDE_FLOAT32_C( 2.65)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_acosh_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_acosh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.94), SIMDE_FLOAT64_C( 6.51),
SIMDE_FLOAT64_C( 3.32), SIMDE_FLOAT64_C( 4.41),
SIMDE_FLOAT64_C( 3.69), SIMDE_FLOAT64_C( 4.43),
SIMDE_FLOAT64_C( 1.81), SIMDE_FLOAT64_C( 5.44)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.47), SIMDE_FLOAT64_C( 2.56),
SIMDE_FLOAT64_C( 1.87), SIMDE_FLOAT64_C( 2.16),
SIMDE_FLOAT64_C( 1.98), SIMDE_FLOAT64_C( 2.17),
SIMDE_FLOAT64_C( 1.20), SIMDE_FLOAT64_C( 2.38)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.06), SIMDE_FLOAT64_C( 2.04),
SIMDE_FLOAT64_C( 4.58), SIMDE_FLOAT64_C( 6.19),
SIMDE_FLOAT64_C( 7.02), SIMDE_FLOAT64_C( 5.69),
SIMDE_FLOAT64_C( 3.41), SIMDE_FLOAT64_C( 5.84)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.35), SIMDE_FLOAT64_C( 1.34),
SIMDE_FLOAT64_C( 2.20), SIMDE_FLOAT64_C( 2.51),
SIMDE_FLOAT64_C( 2.64), SIMDE_FLOAT64_C( 2.42),
SIMDE_FLOAT64_C( 1.90), SIMDE_FLOAT64_C( 2.45)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.83), SIMDE_FLOAT64_C( 4.39),
SIMDE_FLOAT64_C( 3.03), SIMDE_FLOAT64_C( 1.25),
SIMDE_FLOAT64_C( 3.29), SIMDE_FLOAT64_C( 1.46),
SIMDE_FLOAT64_C( 2.92), SIMDE_FLOAT64_C( 6.60)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.70), SIMDE_FLOAT64_C( 2.16),
SIMDE_FLOAT64_C( 1.77), SIMDE_FLOAT64_C( 0.69),
SIMDE_FLOAT64_C( 1.86), SIMDE_FLOAT64_C( 0.93),
SIMDE_FLOAT64_C( 1.73), SIMDE_FLOAT64_C( 2.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 4.89), SIMDE_FLOAT64_C( 2.81),
SIMDE_FLOAT64_C( 5.07), SIMDE_FLOAT64_C( 6.57),
SIMDE_FLOAT64_C( 5.16), SIMDE_FLOAT64_C( 3.60),
SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( 2.12)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.27), SIMDE_FLOAT64_C( 1.69),
SIMDE_FLOAT64_C( 2.31), SIMDE_FLOAT64_C( 2.57),
SIMDE_FLOAT64_C( 2.32), SIMDE_FLOAT64_C( 1.95),
SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 1.38)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.76), SIMDE_FLOAT64_C( 5.76),
SIMDE_FLOAT64_C( 2.37), SIMDE_FLOAT64_C( 5.56),
SIMDE_FLOAT64_C( 1.76), SIMDE_FLOAT64_C( 7.58),
SIMDE_FLOAT64_C( 4.39), SIMDE_FLOAT64_C( 7.08)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.17), SIMDE_FLOAT64_C( 2.44),
SIMDE_FLOAT64_C( 1.51), SIMDE_FLOAT64_C( 2.40),
SIMDE_FLOAT64_C( 1.17), SIMDE_FLOAT64_C( 2.71),
SIMDE_FLOAT64_C( 2.16), SIMDE_FLOAT64_C( 2.65)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.02), SIMDE_FLOAT64_C( 5.61),
SIMDE_FLOAT64_C( 6.46), SIMDE_FLOAT64_C( 5.42),
SIMDE_FLOAT64_C( 6.06), SIMDE_FLOAT64_C( 3.43),
SIMDE_FLOAT64_C( 6.88), SIMDE_FLOAT64_C( 4.20)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.77), SIMDE_FLOAT64_C( 2.41),
SIMDE_FLOAT64_C( 2.55), SIMDE_FLOAT64_C( 2.37),
SIMDE_FLOAT64_C( 2.49), SIMDE_FLOAT64_C( 1.90),
SIMDE_FLOAT64_C( 2.62), SIMDE_FLOAT64_C( 2.11)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.63), SIMDE_FLOAT64_C( 4.03),
SIMDE_FLOAT64_C( 5.41), SIMDE_FLOAT64_C( 1.18),
SIMDE_FLOAT64_C( 1.83), SIMDE_FLOAT64_C( 1.77),
SIMDE_FLOAT64_C( 2.47), SIMDE_FLOAT64_C( 5.62)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.96), SIMDE_FLOAT64_C( 2.07),
SIMDE_FLOAT64_C( 2.37), SIMDE_FLOAT64_C( 0.59),
SIMDE_FLOAT64_C( 1.21), SIMDE_FLOAT64_C( 1.17),
SIMDE_FLOAT64_C( 1.55), SIMDE_FLOAT64_C( 2.41)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.85), SIMDE_FLOAT64_C( 6.54),
SIMDE_FLOAT64_C( 3.81), SIMDE_FLOAT64_C( 7.00),
SIMDE_FLOAT64_C( 7.30), SIMDE_FLOAT64_C( 6.28),
SIMDE_FLOAT64_C( 6.91), SIMDE_FLOAT64_C( 5.14)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.45), SIMDE_FLOAT64_C( 2.57),
SIMDE_FLOAT64_C( 2.01), SIMDE_FLOAT64_C( 2.63),
SIMDE_FLOAT64_C( 2.68), SIMDE_FLOAT64_C( 2.52),
SIMDE_FLOAT64_C( 2.62), SIMDE_FLOAT64_C( 2.32)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_acosh_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_acosh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.04), SIMDE_FLOAT64_C( 6.19),
SIMDE_FLOAT64_C( 5.69), SIMDE_FLOAT64_C( 5.84),
SIMDE_FLOAT64_C( 6.51), SIMDE_FLOAT64_C( 4.41),
SIMDE_FLOAT64_C( 4.43), SIMDE_FLOAT64_C( 5.44)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.06), SIMDE_FLOAT64_C( 4.58),
SIMDE_FLOAT64_C( 7.02), SIMDE_FLOAT64_C( 3.41),
SIMDE_FLOAT64_C( 5.94), SIMDE_FLOAT64_C( 3.32),
SIMDE_FLOAT64_C( 3.69), SIMDE_FLOAT64_C( 1.81)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.35), SIMDE_FLOAT64_C( 6.19),
SIMDE_FLOAT64_C( 5.69), SIMDE_FLOAT64_C( 5.84),
SIMDE_FLOAT64_C( 2.47), SIMDE_FLOAT64_C( 4.41),
SIMDE_FLOAT64_C( 1.98), SIMDE_FLOAT64_C( 1.20)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 4.89), SIMDE_FLOAT64_C( 5.07),
SIMDE_FLOAT64_C( 5.16), SIMDE_FLOAT64_C( 1.08),
SIMDE_FLOAT64_C( 2.83), SIMDE_FLOAT64_C( 3.03),
SIMDE_FLOAT64_C( 3.29), SIMDE_FLOAT64_C( 2.92)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7.08), SIMDE_FLOAT64_C( 2.81),
SIMDE_FLOAT64_C( 6.57), SIMDE_FLOAT64_C( 3.60),
SIMDE_FLOAT64_C( 2.12), SIMDE_FLOAT64_C( 4.39),
SIMDE_FLOAT64_C( 1.25), SIMDE_FLOAT64_C( 1.46)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.65), SIMDE_FLOAT64_C( 1.69),
SIMDE_FLOAT64_C( 2.57), SIMDE_FLOAT64_C( 1.08),
SIMDE_FLOAT64_C( 2.83), SIMDE_FLOAT64_C( 2.16),
SIMDE_FLOAT64_C( 3.29), SIMDE_FLOAT64_C( 0.93)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.62), SIMDE_FLOAT64_C( 5.61),
SIMDE_FLOAT64_C( 5.42), SIMDE_FLOAT64_C( 3.43),
SIMDE_FLOAT64_C( 4.20), SIMDE_FLOAT64_C( 5.76),
SIMDE_FLOAT64_C( 5.56), SIMDE_FLOAT64_C( 7.58)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.47), SIMDE_FLOAT64_C( 3.02),
SIMDE_FLOAT64_C( 6.46), SIMDE_FLOAT64_C( 6.06),
SIMDE_FLOAT64_C( 6.88), SIMDE_FLOAT64_C( 1.76),
SIMDE_FLOAT64_C( 2.37), SIMDE_FLOAT64_C( 1.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.55), SIMDE_FLOAT64_C( 1.77),
SIMDE_FLOAT64_C( 2.55), SIMDE_FLOAT64_C( 2.49),
SIMDE_FLOAT64_C( 2.62), SIMDE_FLOAT64_C( 1.17),
SIMDE_FLOAT64_C( 5.56), SIMDE_FLOAT64_C( 1.17)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 4.70), SIMDE_FLOAT64_C( 5.85),
SIMDE_FLOAT64_C( 3.81), SIMDE_FLOAT64_C( 7.30),
SIMDE_FLOAT64_C( 6.91), SIMDE_FLOAT64_C( 3.63),
SIMDE_FLOAT64_C( 5.41), SIMDE_FLOAT64_C( 1.83)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 6.09), SIMDE_FLOAT64_C( 3.73),
SIMDE_FLOAT64_C( 6.54), SIMDE_FLOAT64_C( 7.00),
SIMDE_FLOAT64_C( 6.28), SIMDE_FLOAT64_C( 5.14),
SIMDE_FLOAT64_C( 4.03), SIMDE_FLOAT64_C( 1.18)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 4.70), SIMDE_FLOAT64_C( 1.99),
SIMDE_FLOAT64_C( 3.81), SIMDE_FLOAT64_C( 2.63),
SIMDE_FLOAT64_C( 2.52), SIMDE_FLOAT64_C( 2.32),
SIMDE_FLOAT64_C( 5.41), SIMDE_FLOAT64_C( 0.59)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 4.63), SIMDE_FLOAT64_C( 1.86),
SIMDE_FLOAT64_C( 6.80), SIMDE_FLOAT64_C( 5.43),
SIMDE_FLOAT64_C( 1.67), SIMDE_FLOAT64_C( 2.56),
SIMDE_FLOAT64_C( 1.59), SIMDE_FLOAT64_C( 6.46)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.39), SIMDE_FLOAT64_C( 5.83),
SIMDE_FLOAT64_C( 3.71), SIMDE_FLOAT64_C( 5.37),
SIMDE_FLOAT64_C( 1.41), SIMDE_FLOAT64_C( 3.22),
SIMDE_FLOAT64_C( 3.67), SIMDE_FLOAT64_C( 6.15)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.37), SIMDE_FLOAT64_C( 1.86),
SIMDE_FLOAT64_C( 6.80), SIMDE_FLOAT64_C( 2.37),
SIMDE_FLOAT64_C( 1.67), SIMDE_FLOAT64_C( 2.56),
SIMDE_FLOAT64_C( 1.59), SIMDE_FLOAT64_C( 2.50)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.80), SIMDE_FLOAT64_C( 4.39),
SIMDE_FLOAT64_C( 6.58), SIMDE_FLOAT64_C( 4.23),
SIMDE_FLOAT64_C( 2.82), SIMDE_FLOAT64_C( 5.97),
SIMDE_FLOAT64_C( 7.04), SIMDE_FLOAT64_C( 7.53)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7.53), SIMDE_FLOAT64_C( 5.70),
SIMDE_FLOAT64_C( 3.99), SIMDE_FLOAT64_C( 7.07),
SIMDE_FLOAT64_C( 2.35), SIMDE_FLOAT64_C( 6.71),
SIMDE_FLOAT64_C( 6.36), SIMDE_FLOAT64_C( 4.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.80), SIMDE_FLOAT64_C( 2.43),
SIMDE_FLOAT64_C( 6.58), SIMDE_FLOAT64_C( 4.23),
SIMDE_FLOAT64_C( 1.50), SIMDE_FLOAT64_C( 5.97),
SIMDE_FLOAT64_C( 2.54), SIMDE_FLOAT64_C( 2.24)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.60), SIMDE_FLOAT64_C( 3.29),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 7.01),
SIMDE_FLOAT64_C( 1.01), SIMDE_FLOAT64_C( 7.35),
SIMDE_FLOAT64_C( 1.77), SIMDE_FLOAT64_C( 4.08)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.99), SIMDE_FLOAT64_C( 4.35),
SIMDE_FLOAT64_C( 7.32), SIMDE_FLOAT64_C( 1.76),
SIMDE_FLOAT64_C( 2.41), SIMDE_FLOAT64_C( 3.19),
SIMDE_FLOAT64_C( 5.27), SIMDE_FLOAT64_C( 2.40)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.60), SIMDE_FLOAT64_C( 2.15),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 1.17),
SIMDE_FLOAT64_C( 1.53), SIMDE_FLOAT64_C( 1.83),
SIMDE_FLOAT64_C( 1.77), SIMDE_FLOAT64_C( 1.52)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.87), SIMDE_FLOAT64_C( 7.39),
SIMDE_FLOAT64_C( 3.15), SIMDE_FLOAT64_C( 2.85),
SIMDE_FLOAT64_C( 1.82), SIMDE_FLOAT64_C( 7.38),
SIMDE_FLOAT64_C( 3.22), SIMDE_FLOAT64_C( 3.70)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.74), SIMDE_FLOAT64_C( 5.75),
SIMDE_FLOAT64_C( 7.26), SIMDE_FLOAT64_C( 3.65),
SIMDE_FLOAT64_C( 3.11), SIMDE_FLOAT64_C( 4.19),
SIMDE_FLOAT64_C( 4.32), SIMDE_FLOAT64_C( 3.89)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.15), SIMDE_FLOAT64_C( 2.43),
SIMDE_FLOAT64_C( 3.15), SIMDE_FLOAT64_C( 1.97),
SIMDE_FLOAT64_C( 1.82), SIMDE_FLOAT64_C( 2.11),
SIMDE_FLOAT64_C( 3.22), SIMDE_FLOAT64_C( 2.03)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_acosh_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_asin_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.35)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.36)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.49)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.57)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.61)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.70)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( -1.04), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.78)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.92)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( -1.17)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.66)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -1.37), SIMDE_FLOAT32_C( -0.72)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.69)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.76)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_asin_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_asin_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.35)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( 0.36)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.67)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 0.73)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.49)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.42)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.43)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.57)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.61)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.69)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.76)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_asin_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_asin_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.35)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.36)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.69),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.42),
SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.61),
SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.43),
SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.49)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.92),
SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.86),
SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.70)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( -1.17),
SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( -1.04),
SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.78)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21),
SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.66)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.47),
SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21),
SIMDE_FLOAT32_C( -1.37), SIMDE_FLOAT32_C( -0.72)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.44),
SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.84)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 0.46),
SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 1.43),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 1.00)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.26),
SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.03)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 0.41),
SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( -0.26),
SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( -0.03)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.94),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.40)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( -1.22),
SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( -0.88),
SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.41)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.25)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.75),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.25)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_asin_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_asin_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.35)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( 0.36)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.67),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 0.73),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.43),
SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.49)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.69),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.57)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.76),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.61)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.86),
SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 0.70)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -1.04),
SIMDE_FLOAT64_C( -0.43), SIMDE_FLOAT64_C( 0.78)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.38), SIMDE_FLOAT64_C( -0.92)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.46), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( -1.17)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.98), SIMDE_FLOAT64_C( -0.66)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -1.37), SIMDE_FLOAT64_C( -0.72)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.45),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.69)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.47),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.76)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_asin_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_asin_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.35)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.61),
SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.49),
SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.36)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.92),
SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.70)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -1.37), SIMDE_FLOAT32_C( -0.72),
SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( -1.17),
SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( -1.04), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.78)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.84)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 1.43), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 1.00)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.25),
SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.94),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.40)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.25),
SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( -1.22),
SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.41)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.53),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.17)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( -1.06), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( -0.56),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.72),
SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.17)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.59),
SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.62),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( -0.91),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.74)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.63),
SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 1.37), SIMDE_FLOAT32_C( -1.14),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.83)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.98),
SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.10)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -0.88),
SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 1.37),
SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.10)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.13),
SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.70)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( -0.13),
SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( -0.78)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_asin_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_asin_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 0.70),
SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.35)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.98),
SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.42),
SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.27),
SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( -0.75)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.43),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( -0.27),
SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.35)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.55),
SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.03)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.26),
SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.99)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -1.22), SIMDE_FLOAT32_C( -0.88),
SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( -0.26),
SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 1.43)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( 0.10),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.80),
SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.54)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( -0.87),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( -0.07)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -0.80),
SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.54)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -0.34),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( -0.76)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.98)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 1.37)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.18),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.13),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.44)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( -0.38),
SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( 0.48)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -1.22), SIMDE_FLOAT32_C( 0.44)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( -0.81), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( -0.78),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.07),
SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.43)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.72),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.49)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( -0.78),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.43)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.08),
SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( -0.89)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.43),
SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.58),
SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( 0.09)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( 0.75),
SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.62),
SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.09)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.82),
SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( -0.42),
SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -0.20)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -0.35),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( -0.25),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.85)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.91),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.36),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( -0.42),
SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 1.02)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_asin_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_asin_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.67),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.35)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 0.73),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( 0.36)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.69),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.76),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.61),
SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.43),
SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.49)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.38), SIMDE_FLOAT64_C( -0.92),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.86),
SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 0.70)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.46), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( -1.17),
SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -1.04),
SIMDE_FLOAT64_C( -0.43), SIMDE_FLOAT64_C( 0.78)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.45),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.69),
SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.98), SIMDE_FLOAT64_C( -0.66)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.47),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.76),
SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -1.37), SIMDE_FLOAT64_C( -0.72)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( 0.38),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.84)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.88), SIMDE_FLOAT64_C( 0.46),
SIMDE_FLOAT64_C( -0.62), SIMDE_FLOAT64_C( 0.39),
SIMDE_FLOAT64_C( -0.88), SIMDE_FLOAT64_C( 1.43),
SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 1.00)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.34),
SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( -0.26),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -0.03)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.40), SIMDE_FLOAT64_C( 0.41),
SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( 0.35),
SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( -0.26),
SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( -0.03)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.20), SIMDE_FLOAT64_C( -0.08),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.94),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( 0.40)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.20), SIMDE_FLOAT64_C( -0.08),
SIMDE_FLOAT64_C( 0.35), SIMDE_FLOAT64_C( -1.22),
SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( -0.88),
SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( 0.41)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 0.68),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( 0.60),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 0.25)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( 0.75),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.96),
SIMDE_FLOAT64_C( 1.14), SIMDE_FLOAT64_C( 0.64),
SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( 0.25)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_asin_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_asin_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.35)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( 0.08),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( -0.27),
SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( -0.30),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( -0.75)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( -0.85)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.23),
SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.98),
SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( -0.38),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.42)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( -0.45),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.66), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -0.86)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -0.47),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( -0.98),
SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -1.04)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.26),
SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.99)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( -0.39),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.53),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( -0.77)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( -0.40),
SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( 0.56),
SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( -0.88),
SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( -0.88)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.91),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( -0.20),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.75)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( -0.17),
SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( 0.25),
SIMDE_FLOAT64_C( -0.08), SIMDE_FLOAT64_C( -0.94)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( -0.17),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.96),
SIMDE_FLOAT64_C( 0.64), SIMDE_FLOAT64_C( 0.25),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -1.22)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( -0.74),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.34),
SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( -0.53),
SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.66)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( 0.46),
SIMDE_FLOAT64_C( -0.18), SIMDE_FLOAT64_C( 0.32),
SIMDE_FLOAT64_C( -0.87), SIMDE_FLOAT64_C( -0.33),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.56)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.74),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.33),
SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( -0.53),
SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.59)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.02),
SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( 0.51),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.98)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( 0.84),
SIMDE_FLOAT64_C( -0.59), SIMDE_FLOAT64_C( 0.73),
SIMDE_FLOAT64_C( 0.62), SIMDE_FLOAT64_C( 0.14)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.43),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.02),
SIMDE_FLOAT64_C( -0.63), SIMDE_FLOAT64_C( 0.51),
SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.14)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( -0.30),
SIMDE_FLOAT64_C( -0.70), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 0.92),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( -0.07)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 0.01),
SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( -0.57), SIMDE_FLOAT64_C( -0.34),
SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( -0.58)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.01),
SIMDE_FLOAT64_C( -0.70), SIMDE_FLOAT64_C( -0.88),
SIMDE_FLOAT64_C( -0.61), SIMDE_FLOAT64_C( -0.35),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( -0.62)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 0.94),
SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( -0.44),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.93),
SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( -0.18)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.78), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( -0.20),
SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( -0.03),
SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( -0.13)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.89), SIMDE_FLOAT64_C( 0.46),
SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( -0.20),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.03),
SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( -0.13)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_asin_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_asinh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -5.92), SIMDE_FLOAT32_C( 4.36), SIMDE_FLOAT32_C( -7.32), SIMDE_FLOAT32_C( 6.54)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 6.90), SIMDE_FLOAT32_C( 7.20), SIMDE_FLOAT32_C( -6.39), SIMDE_FLOAT32_C( 4.22)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 7.41), SIMDE_FLOAT32_C( 6.74), SIMDE_FLOAT32_C( -6.29), SIMDE_FLOAT32_C( 6.84)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -7.21), SIMDE_FLOAT32_C( -7.22), SIMDE_FLOAT32_C( 5.13), SIMDE_FLOAT32_C( 7.04)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -6.41), SIMDE_FLOAT32_C( -7.45), SIMDE_FLOAT32_C( -6.73), SIMDE_FLOAT32_C( 7.24)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -6.79), SIMDE_FLOAT32_C( 4.04), SIMDE_FLOAT32_C( -6.64), SIMDE_FLOAT32_C( -7.52)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 6.26), SIMDE_FLOAT32_C( -6.05), SIMDE_FLOAT32_C( -7.58), SIMDE_FLOAT32_C( -7.19)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 5.88), SIMDE_FLOAT32_C( -6.80), SIMDE_FLOAT32_C( 6.15), SIMDE_FLOAT32_C( 7.23)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_asinh_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_asinh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -7.32), SIMDE_FLOAT64_C( 6.54)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -5.92), SIMDE_FLOAT64_C( 4.36)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -6.39), SIMDE_FLOAT64_C( 4.22)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 6.90), SIMDE_FLOAT64_C( 7.20)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -6.29), SIMDE_FLOAT64_C( 6.84)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 7.41), SIMDE_FLOAT64_C( 6.74)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 5.13), SIMDE_FLOAT64_C( 7.04)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -7.21), SIMDE_FLOAT64_C( -7.22)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_asinh_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_asinh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24),
SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01),
SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 6.90), SIMDE_FLOAT32_C( 7.20),
SIMDE_FLOAT32_C( -6.39), SIMDE_FLOAT32_C( 4.22),
SIMDE_FLOAT32_C( -5.92), SIMDE_FLOAT32_C( 4.36),
SIMDE_FLOAT32_C( -7.32), SIMDE_FLOAT32_C( 6.54)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13),
SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21),
SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -7.21), SIMDE_FLOAT32_C( -7.22),
SIMDE_FLOAT32_C( 5.13), SIMDE_FLOAT32_C( 7.04),
SIMDE_FLOAT32_C( 7.41), SIMDE_FLOAT32_C( 6.74),
SIMDE_FLOAT32_C( -6.29), SIMDE_FLOAT32_C( 6.84)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47),
SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95),
SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -6.79), SIMDE_FLOAT32_C( 4.04),
SIMDE_FLOAT32_C( -6.64), SIMDE_FLOAT32_C( -7.52),
SIMDE_FLOAT32_C( -6.41), SIMDE_FLOAT32_C( -7.45),
SIMDE_FLOAT32_C( -6.73), SIMDE_FLOAT32_C( 7.24)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67),
SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54),
SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 5.88), SIMDE_FLOAT32_C( -6.80),
SIMDE_FLOAT32_C( 6.15), SIMDE_FLOAT32_C( 7.23),
SIMDE_FLOAT32_C( 6.26), SIMDE_FLOAT32_C( -6.05),
SIMDE_FLOAT32_C( -7.58), SIMDE_FLOAT32_C( -7.19)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48),
SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90),
SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -7.34), SIMDE_FLOAT32_C( 6.79),
SIMDE_FLOAT32_C( -7.06), SIMDE_FLOAT32_C( 6.63),
SIMDE_FLOAT32_C( -7.34), SIMDE_FLOAT32_C( 7.59),
SIMDE_FLOAT32_C( 4.03), SIMDE_FLOAT32_C( 7.43)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92),
SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -6.65), SIMDE_FLOAT32_C( 6.67),
SIMDE_FLOAT32_C( 7.18), SIMDE_FLOAT32_C( 6.52),
SIMDE_FLOAT32_C( 6.97), SIMDE_FLOAT32_C( -6.27),
SIMDE_FLOAT32_C( 7.35), SIMDE_FLOAT32_C( -4.12)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73),
SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -6.01), SIMDE_FLOAT32_C( -5.08),
SIMDE_FLOAT32_C( 6.51), SIMDE_FLOAT32_C( -7.54),
SIMDE_FLOAT32_C( -7.31), SIMDE_FLOAT32_C( -7.34),
SIMDE_FLOAT32_C( -7.01), SIMDE_FLOAT32_C( 6.68)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02),
SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 7.22),
SIMDE_FLOAT32_C( -5.70), SIMDE_FLOAT32_C( 7.40),
SIMDE_FLOAT32_C( 7.51), SIMDE_FLOAT32_C( 7.09),
SIMDE_FLOAT32_C( 7.37), SIMDE_FLOAT32_C( 6.23)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_asinh_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_asinh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -5.92), SIMDE_FLOAT64_C( 4.36),
SIMDE_FLOAT64_C( -7.32), SIMDE_FLOAT64_C( 6.54)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 6.90), SIMDE_FLOAT64_C( 7.20),
SIMDE_FLOAT64_C( -6.39), SIMDE_FLOAT64_C( 4.22)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 7.41), SIMDE_FLOAT64_C( 6.74),
SIMDE_FLOAT64_C( -6.29), SIMDE_FLOAT64_C( 6.84)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -7.21), SIMDE_FLOAT64_C( -7.22),
SIMDE_FLOAT64_C( 5.13), SIMDE_FLOAT64_C( 7.04)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -6.41), SIMDE_FLOAT64_C( -7.45),
SIMDE_FLOAT64_C( -6.73), SIMDE_FLOAT64_C( 7.24)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -6.79), SIMDE_FLOAT64_C( 4.04),
SIMDE_FLOAT64_C( -6.64), SIMDE_FLOAT64_C( -7.52)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 6.26), SIMDE_FLOAT64_C( -6.05),
SIMDE_FLOAT64_C( -7.58), SIMDE_FLOAT64_C( -7.19)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 5.88), SIMDE_FLOAT64_C( -6.80),
SIMDE_FLOAT64_C( 6.15), SIMDE_FLOAT64_C( 7.23)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_asinh_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_asinh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -7.21), SIMDE_FLOAT32_C( -7.22), SIMDE_FLOAT32_C( 5.13), SIMDE_FLOAT32_C( 7.04),
SIMDE_FLOAT32_C( 7.41), SIMDE_FLOAT32_C( 6.74), SIMDE_FLOAT32_C( -6.29), SIMDE_FLOAT32_C( 6.84),
SIMDE_FLOAT32_C( 6.90), SIMDE_FLOAT32_C( 7.20), SIMDE_FLOAT32_C( -6.39), SIMDE_FLOAT32_C( 4.22),
SIMDE_FLOAT32_C( -5.92), SIMDE_FLOAT32_C( 4.36), SIMDE_FLOAT32_C( -7.32), SIMDE_FLOAT32_C( 6.54)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.88), SIMDE_FLOAT32_C( -6.80), SIMDE_FLOAT32_C( 6.15), SIMDE_FLOAT32_C( 7.23),
SIMDE_FLOAT32_C( 6.26), SIMDE_FLOAT32_C( -6.05), SIMDE_FLOAT32_C( -7.58), SIMDE_FLOAT32_C( -7.19),
SIMDE_FLOAT32_C( -6.79), SIMDE_FLOAT32_C( 4.04), SIMDE_FLOAT32_C( -6.64), SIMDE_FLOAT32_C( -7.52),
SIMDE_FLOAT32_C( -6.41), SIMDE_FLOAT32_C( -7.45), SIMDE_FLOAT32_C( -6.73), SIMDE_FLOAT32_C( 7.24)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99), SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90), SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -6.65), SIMDE_FLOAT32_C( 6.67), SIMDE_FLOAT32_C( 7.18), SIMDE_FLOAT32_C( 6.52),
SIMDE_FLOAT32_C( 6.97), SIMDE_FLOAT32_C( -6.27), SIMDE_FLOAT32_C( 7.35), SIMDE_FLOAT32_C( -4.12),
SIMDE_FLOAT32_C( -7.34), SIMDE_FLOAT32_C( 6.79), SIMDE_FLOAT32_C( -7.06), SIMDE_FLOAT32_C( 6.63),
SIMDE_FLOAT32_C( -7.34), SIMDE_FLOAT32_C( 7.59), SIMDE_FLOAT32_C( 4.03), SIMDE_FLOAT32_C( 7.43)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47), SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23), SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 7.22), SIMDE_FLOAT32_C( -5.70), SIMDE_FLOAT32_C( 7.40),
SIMDE_FLOAT32_C( 7.51), SIMDE_FLOAT32_C( 7.09), SIMDE_FLOAT32_C( 7.37), SIMDE_FLOAT32_C( 6.23),
SIMDE_FLOAT32_C( -6.01), SIMDE_FLOAT32_C( -5.08), SIMDE_FLOAT32_C( 6.51), SIMDE_FLOAT32_C( -7.54),
SIMDE_FLOAT32_C( -7.31), SIMDE_FLOAT32_C( -7.34), SIMDE_FLOAT32_C( -7.01), SIMDE_FLOAT32_C( 6.68)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( 343.48),
SIMDE_FLOAT32_C( -874.31), SIMDE_FLOAT32_C( -797.92), SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -525.83),
SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( 655.67),
SIMDE_FLOAT32_C( -70.91), SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( 120.65), SIMDE_FLOAT32_C( -171.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -5.88), SIMDE_FLOAT32_C( 7.32), SIMDE_FLOAT32_C( 6.48), SIMDE_FLOAT32_C( 6.53),
SIMDE_FLOAT32_C( -7.47), SIMDE_FLOAT32_C( -7.38), SIMDE_FLOAT32_C( -6.49), SIMDE_FLOAT32_C( -6.96),
SIMDE_FLOAT32_C( -5.95), SIMDE_FLOAT32_C( -7.41), SIMDE_FLOAT32_C( 7.02), SIMDE_FLOAT32_C( 7.18),
SIMDE_FLOAT32_C( -4.95), SIMDE_FLOAT32_C( 6.99), SIMDE_FLOAT32_C( 5.49), SIMDE_FLOAT32_C( -5.84)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -591.56),
SIMDE_FLOAT32_C( -448.89), SIMDE_FLOAT32_C( 731.49), SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 623.70),
SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( -906.16),
SIMDE_FLOAT32_C( 331.34), SIMDE_FLOAT32_C( 99.93), SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -738.19)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.23), SIMDE_FLOAT32_C( 7.43), SIMDE_FLOAT32_C( -3.74), SIMDE_FLOAT32_C( -7.08),
SIMDE_FLOAT32_C( -6.80), SIMDE_FLOAT32_C( 7.29), SIMDE_FLOAT32_C( 6.92), SIMDE_FLOAT32_C( 7.13),
SIMDE_FLOAT32_C( 7.42), SIMDE_FLOAT32_C( 5.64), SIMDE_FLOAT32_C( 7.58), SIMDE_FLOAT32_C( -7.50),
SIMDE_FLOAT32_C( 6.50), SIMDE_FLOAT32_C( 5.30), SIMDE_FLOAT32_C( 6.83), SIMDE_FLOAT32_C( -7.30)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( -337.60), SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -768.12),
SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( 977.49),
SIMDE_FLOAT32_C( -756.42), SIMDE_FLOAT32_C( 424.81), SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( -95.15)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -7.34), SIMDE_FLOAT32_C( 7.40), SIMDE_FLOAT32_C( -7.05), SIMDE_FLOAT32_C( -7.60),
SIMDE_FLOAT32_C( -6.52), SIMDE_FLOAT32_C( 7.52), SIMDE_FLOAT32_C( 6.38), SIMDE_FLOAT32_C( -7.34),
SIMDE_FLOAT32_C( -7.05), SIMDE_FLOAT32_C( -4.91), SIMDE_FLOAT32_C( 7.26), SIMDE_FLOAT32_C( 7.58),
SIMDE_FLOAT32_C( -7.32), SIMDE_FLOAT32_C( 6.74), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( -5.25)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 932.66), SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -125.20),
SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( 14.34), SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -696.69)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -6.78), SIMDE_FLOAT32_C( -6.58), SIMDE_FLOAT32_C( -7.32), SIMDE_FLOAT32_C( -4.21),
SIMDE_FLOAT32_C( 7.53), SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( -6.48), SIMDE_FLOAT32_C( -5.52),
SIMDE_FLOAT32_C( -5.90), SIMDE_FLOAT32_C( 4.38), SIMDE_FLOAT32_C( 6.93), SIMDE_FLOAT32_C( 6.67),
SIMDE_FLOAT32_C( 3.36), SIMDE_FLOAT32_C( -6.41), SIMDE_FLOAT32_C( 7.51), SIMDE_FLOAT32_C( -7.24)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_asinh_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_asinh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 696.87),
SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( 346.63)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -976.55),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -417.54),
SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( -754.38)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.88), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( 6.26), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -6.73),
SIMDE_FLOAT32_C( -7.21), SIMDE_FLOAT32_C( 5.13), SIMDE_FLOAT32_C( 7.41), SIMDE_FLOAT32_C( -6.29),
SIMDE_FLOAT32_C( 6.90), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( -5.92), SIMDE_FLOAT32_C( 346.63)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -554.19),
SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( 780.64),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 28.08)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -171.51), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( 818.66), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 254.31), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( -944.78), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( 398.82), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 339.21), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( -30.79), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( 380.46), SIMDE_FLOAT32_C( 993.90)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -5.84), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( 6.23), SIMDE_FLOAT32_C( -5.08), SIMDE_FLOAT32_C( -7.54), SIMDE_FLOAT32_C( -7.34),
SIMDE_FLOAT32_C( 6.68), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 6.52), SIMDE_FLOAT32_C( -6.27),
SIMDE_FLOAT32_C( -4.12), SIMDE_FLOAT32_C( 6.79), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 7.59)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 99.93),
SIMDE_FLOAT32_C( -738.19), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 343.48), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -448.89),
SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( 331.34),
SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( -874.31),
SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( -70.91)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( 7.23), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 6.50),
SIMDE_FLOAT32_C( 6.83), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 6.48), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -5.95), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -337.60),
SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( -756.42)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 897.27), SIMDE_FLOAT32_C( -197.89), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -125.20), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( -696.69), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( -768.12), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 977.49)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( 7.40), SIMDE_FLOAT32_C( -7.60),
SIMDE_FLOAT32_C( 7.52), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( -4.91), SIMDE_FLOAT32_C( 7.58)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -737.13), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 177.92),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 888.71), SIMDE_FLOAT32_C( 915.71), SIMDE_FLOAT32_C( 133.52),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( -775.04), SIMDE_FLOAT32_C( 440.64)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 496.57), SIMDE_FLOAT32_C( 915.19), SIMDE_FLOAT32_C( -718.40), SIMDE_FLOAT32_C( 159.97),
SIMDE_FLOAT32_C( -861.01), SIMDE_FLOAT32_C( 426.61), SIMDE_FLOAT32_C( 932.11), SIMDE_FLOAT32_C( 110.36),
SIMDE_FLOAT32_C( 826.84), SIMDE_FLOAT32_C( -76.75), SIMDE_FLOAT32_C( 237.58), SIMDE_FLOAT32_C( -378.50),
SIMDE_FLOAT32_C( -601.68), SIMDE_FLOAT32_C( -623.50), SIMDE_FLOAT32_C( -942.47), SIMDE_FLOAT32_C( 475.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( 7.51), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 5.77),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 6.75), SIMDE_FLOAT32_C( 7.53), SIMDE_FLOAT32_C( 5.40),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( -7.54), SIMDE_FLOAT32_C( 440.64)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -807.28), SIMDE_FLOAT32_C( -70.05), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 92.52), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( 834.60), SIMDE_FLOAT32_C( -65.60),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 556.35), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -964.25), SIMDE_FLOAT32_C( -406.33), SIMDE_FLOAT32_C( -743.66), SIMDE_FLOAT32_C( -764.58),
SIMDE_FLOAT32_C( 789.89), SIMDE_FLOAT32_C( 4.83), SIMDE_FLOAT32_C( -818.54), SIMDE_FLOAT32_C( 161.06),
SIMDE_FLOAT32_C( 579.25), SIMDE_FLOAT32_C( -11.78), SIMDE_FLOAT32_C( -308.52), SIMDE_FLOAT32_C( -719.57),
SIMDE_FLOAT32_C( 334.00), SIMDE_FLOAT32_C( 274.71), SIMDE_FLOAT32_C( -916.82), SIMDE_FLOAT32_C( -490.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -6.70), SIMDE_FLOAT32_C( -7.30), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 7.37), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( -7.40), SIMDE_FLOAT32_C( 5.77),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 6.31), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 105.79), SIMDE_FLOAT32_C( 590.10),
SIMDE_FLOAT32_C( 30.91), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -84.00), SIMDE_FLOAT32_C( 80.04),
SIMDE_FLOAT32_C( -709.46), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -889.11)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 18.75), SIMDE_FLOAT32_C( 809.05), SIMDE_FLOAT32_C( 144.05), SIMDE_FLOAT32_C( -427.72),
SIMDE_FLOAT32_C( 308.28), SIMDE_FLOAT32_C( -177.05), SIMDE_FLOAT32_C( -457.77), SIMDE_FLOAT32_C( 678.24),
SIMDE_FLOAT32_C( 66.05), SIMDE_FLOAT32_C( -267.71), SIMDE_FLOAT32_C( 117.28), SIMDE_FLOAT32_C( -576.80),
SIMDE_FLOAT32_C( -38.39), SIMDE_FLOAT32_C( -250.14), SIMDE_FLOAT32_C( -53.92), SIMDE_FLOAT32_C( 91.94)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( -6.82), SIMDE_FLOAT32_C( 7.21),
SIMDE_FLOAT32_C( 4.88), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( 5.46), SIMDE_FLOAT32_C( -7.05),
SIMDE_FLOAT32_C( -4.34), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( 5.21)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( -493.41), SIMDE_FLOAT32_C( 822.72),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( -816.27),
SIMDE_FLOAT32_C( -209.34), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -728.70), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( 100.32), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( -204.33)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -841.43), SIMDE_FLOAT32_C( -14.16), SIMDE_FLOAT32_C( 824.88), SIMDE_FLOAT32_C( 793.63),
SIMDE_FLOAT32_C( -736.75), SIMDE_FLOAT32_C( -310.57), SIMDE_FLOAT32_C( 728.87), SIMDE_FLOAT32_C( -350.72),
SIMDE_FLOAT32_C( 60.89), SIMDE_FLOAT32_C( 109.81), SIMDE_FLOAT32_C( 715.94), SIMDE_FLOAT32_C( -250.60),
SIMDE_FLOAT32_C( 944.14), SIMDE_FLOAT32_C( 361.85), SIMDE_FLOAT32_C( -13.07), SIMDE_FLOAT32_C( 852.60)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( 7.41), SIMDE_FLOAT32_C( 7.37),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( -6.55),
SIMDE_FLOAT32_C( 4.80), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( 7.54), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( 7.44)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_asinh_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_asinh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 6.90), SIMDE_FLOAT64_C( 7.20),
SIMDE_FLOAT64_C( -6.39), SIMDE_FLOAT64_C( 4.22),
SIMDE_FLOAT64_C( -5.92), SIMDE_FLOAT64_C( 4.36),
SIMDE_FLOAT64_C( -7.32), SIMDE_FLOAT64_C( 6.54)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -7.21), SIMDE_FLOAT64_C( -7.22),
SIMDE_FLOAT64_C( 5.13), SIMDE_FLOAT64_C( 7.04),
SIMDE_FLOAT64_C( 7.41), SIMDE_FLOAT64_C( 6.74),
SIMDE_FLOAT64_C( -6.29), SIMDE_FLOAT64_C( 6.84)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -6.79), SIMDE_FLOAT64_C( 4.04),
SIMDE_FLOAT64_C( -6.64), SIMDE_FLOAT64_C( -7.52),
SIMDE_FLOAT64_C( -6.41), SIMDE_FLOAT64_C( -7.45),
SIMDE_FLOAT64_C( -6.73), SIMDE_FLOAT64_C( 7.24)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.88), SIMDE_FLOAT64_C( -6.80),
SIMDE_FLOAT64_C( 6.15), SIMDE_FLOAT64_C( 7.23),
SIMDE_FLOAT64_C( 6.26), SIMDE_FLOAT64_C( -6.05),
SIMDE_FLOAT64_C( -7.58), SIMDE_FLOAT64_C( -7.19)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -770.35), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( 380.46),
SIMDE_FLOAT64_C( -770.72), SIMDE_FLOAT64_C( 993.90),
SIMDE_FLOAT64_C( 28.08), SIMDE_FLOAT64_C( 841.21)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -7.34), SIMDE_FLOAT64_C( 6.79),
SIMDE_FLOAT64_C( -7.06), SIMDE_FLOAT64_C( 6.63),
SIMDE_FLOAT64_C( -7.34), SIMDE_FLOAT64_C( 7.59),
SIMDE_FLOAT64_C( 4.03), SIMDE_FLOAT64_C( 7.43)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -387.90), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 339.21),
SIMDE_FLOAT64_C( 532.35), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -30.79)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -6.65), SIMDE_FLOAT64_C( 6.67),
SIMDE_FLOAT64_C( 7.18), SIMDE_FLOAT64_C( 6.52),
SIMDE_FLOAT64_C( 6.97), SIMDE_FLOAT64_C( -6.27),
SIMDE_FLOAT64_C( 7.35), SIMDE_FLOAT64_C( -4.12)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -203.65), SIMDE_FLOAT64_C( -80.73),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -944.78),
SIMDE_FLOAT64_C( -747.59), SIMDE_FLOAT64_C( -767.23),
SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( 398.82)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -6.01), SIMDE_FLOAT64_C( -5.08),
SIMDE_FLOAT64_C( 6.51), SIMDE_FLOAT64_C( -7.54),
SIMDE_FLOAT64_C( -7.31), SIMDE_FLOAT64_C( -7.34),
SIMDE_FLOAT64_C( -7.01), SIMDE_FLOAT64_C( 6.68)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 469.66), SIMDE_FLOAT64_C( 680.02),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 600.47),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( 254.31)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 6.85), SIMDE_FLOAT64_C( 7.22),
SIMDE_FLOAT64_C( -5.70), SIMDE_FLOAT64_C( 7.40),
SIMDE_FLOAT64_C( 7.51), SIMDE_FLOAT64_C( 7.09),
SIMDE_FLOAT64_C( 7.37), SIMDE_FLOAT64_C( 6.23)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_asinh_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_asinh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -686.13), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 670.24), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( 39.01), SIMDE_FLOAT64_C( 346.63)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 84.77),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( -269.45),
SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( -297.45),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( -754.38)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -7.21), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 6.90), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -5.92), SIMDE_FLOAT64_C( -7.32)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( 233.37),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -384.03),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -417.54)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 841.21), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 687.09), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -660.80), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -923.64), SIMDE_FLOAT64_C( -860.95)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7.43), SIMDE_FLOAT64_C( -6.80),
SIMDE_FLOAT64_C( 7.23), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 4.04),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -7.45)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 398.82), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 339.21), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( -30.79), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( 993.90)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( -387.90),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 532.35),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -770.35),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( -770.72)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -7.01), SIMDE_FLOAT64_C( -6.65),
SIMDE_FLOAT64_C( 7.18), SIMDE_FLOAT64_C( 6.97),
SIMDE_FLOAT64_C( 7.35), SIMDE_FLOAT64_C( -7.34),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( -7.34)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( 469.66),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 910.03),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( -203.65),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -747.59)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 543.35), SIMDE_FLOAT64_C( -171.51),
SIMDE_FLOAT64_C( 680.02), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 600.47), SIMDE_FLOAT64_C( 254.31),
SIMDE_FLOAT64_C( -80.73), SIMDE_FLOAT64_C( -944.78)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( -5.84),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 7.40),
SIMDE_FLOAT64_C( 7.09), SIMDE_FLOAT64_C( 6.23),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -7.54)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 99.93), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 343.48),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 655.67)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 331.34), SIMDE_FLOAT64_C( 462.95),
SIMDE_FLOAT64_C( -178.99), SIMDE_FLOAT64_C( 324.62),
SIMDE_FLOAT64_C( -874.31), SIMDE_FLOAT64_C( -328.54),
SIMDE_FLOAT64_C( -192.31), SIMDE_FLOAT64_C( 561.36)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 6.50), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 6.48),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 7.02)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 27.25),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -448.89), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 831.02), SIMDE_FLOAT64_C( 977.36)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 977.49), SIMDE_FLOAT64_C( 424.81),
SIMDE_FLOAT64_C( -95.15), SIMDE_FLOAT64_C( 840.65),
SIMDE_FLOAT64_C( -591.56), SIMDE_FLOAT64_C( 731.49),
SIMDE_FLOAT64_C( 623.70), SIMDE_FLOAT64_C( 140.67)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 6.74),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -7.08), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 7.13), SIMDE_FLOAT64_C( 5.64)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( -304.73),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( 822.06),
SIMDE_FLOAT64_C( -997.63), SIMDE_FLOAT64_C( 923.64),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -67.64)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 510.85), SIMDE_FLOAT64_C( 14.34),
SIMDE_FLOAT64_C( 916.26), SIMDE_FLOAT64_C( -769.09),
SIMDE_FLOAT64_C( -573.81), SIMDE_FLOAT64_C( -337.60),
SIMDE_FLOAT64_C( 293.64), SIMDE_FLOAT64_C( -576.22)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( 3.36),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( -7.34),
SIMDE_FLOAT64_C( -7.05), SIMDE_FLOAT64_C( -6.52),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -7.05)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 475.51), SIMDE_FLOAT64_C( 936.65),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -438.19),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( 932.66),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -182.45)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -775.04), SIMDE_FLOAT64_C( 440.64),
SIMDE_FLOAT64_C( 897.27), SIMDE_FLOAT64_C( -197.89),
SIMDE_FLOAT64_C( -359.76), SIMDE_FLOAT64_C( -33.67),
SIMDE_FLOAT64_C( 7.27), SIMDE_FLOAT64_C( -125.20)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -7.35), SIMDE_FLOAT64_C( 6.78),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -5.98),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( -4.21),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -5.52)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_asinh_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_atan_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.54)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_atan_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_atan_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.55)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.54)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.56), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_atan_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_atan_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24),
SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01),
SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.54),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.55),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13),
SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21),
SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47),
SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95),
SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.54),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67),
SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54),
SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48),
SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90),
SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92),
SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.54)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73),
SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.56),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02),
SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_atan_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_atan_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.55),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.54)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( 1.56), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.54),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_atan_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_atan_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.54),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99), SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90), SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.54),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47), SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23), SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( 343.48),
SIMDE_FLOAT32_C( -874.31), SIMDE_FLOAT32_C( -797.92), SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -525.83),
SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( 655.67),
SIMDE_FLOAT32_C( -70.91), SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( 120.65), SIMDE_FLOAT32_C( -171.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( -1.56)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -591.56),
SIMDE_FLOAT32_C( -448.89), SIMDE_FLOAT32_C( 731.49), SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 623.70),
SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( -906.16),
SIMDE_FLOAT32_C( 331.34), SIMDE_FLOAT32_C( 99.93), SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -738.19)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.52), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( -337.60), SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -768.12),
SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( 977.49),
SIMDE_FLOAT32_C( -756.42), SIMDE_FLOAT32_C( 424.81), SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( -95.15)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( -1.56)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 932.66), SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -125.20),
SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( 14.34), SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -696.69)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.54),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.43), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.56),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_atan_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_atan_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 696.87),
SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( 346.63)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -976.55),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -417.54),
SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( -754.38)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 346.63)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -554.19),
SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( 780.64),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 28.08)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -171.51), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( 818.66), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 254.31), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( -944.78), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( 398.82), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 339.21), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( -30.79), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( 380.46), SIMDE_FLOAT32_C( 993.90)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.54), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 99.93),
SIMDE_FLOAT32_C( -738.19), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 343.48), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -448.89),
SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( 331.34),
SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( -874.31),
SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( -70.91)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -337.60),
SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( -756.42)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 897.27), SIMDE_FLOAT32_C( -197.89), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -125.20), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( -696.69), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( -768.12), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 977.49)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( 1.43), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( 1.57)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -737.13), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 177.92),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 888.71), SIMDE_FLOAT32_C( 915.71), SIMDE_FLOAT32_C( 133.52),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( -775.04), SIMDE_FLOAT32_C( 440.64)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 496.57), SIMDE_FLOAT32_C( 915.19), SIMDE_FLOAT32_C( -718.40), SIMDE_FLOAT32_C( 159.97),
SIMDE_FLOAT32_C( -861.01), SIMDE_FLOAT32_C( 426.61), SIMDE_FLOAT32_C( 932.11), SIMDE_FLOAT32_C( 110.36),
SIMDE_FLOAT32_C( 826.84), SIMDE_FLOAT32_C( -76.75), SIMDE_FLOAT32_C( 237.58), SIMDE_FLOAT32_C( -378.50),
SIMDE_FLOAT32_C( -601.68), SIMDE_FLOAT32_C( -623.50), SIMDE_FLOAT32_C( -942.47), SIMDE_FLOAT32_C( 475.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 1.56),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.56),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 440.64)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -807.28), SIMDE_FLOAT32_C( -70.05), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 92.52), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( 834.60), SIMDE_FLOAT32_C( -65.60),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 556.35), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -964.25), SIMDE_FLOAT32_C( -406.33), SIMDE_FLOAT32_C( -743.66), SIMDE_FLOAT32_C( -764.58),
SIMDE_FLOAT32_C( 789.89), SIMDE_FLOAT32_C( 4.83), SIMDE_FLOAT32_C( -818.54), SIMDE_FLOAT32_C( 161.06),
SIMDE_FLOAT32_C( 579.25), SIMDE_FLOAT32_C( -11.78), SIMDE_FLOAT32_C( -308.52), SIMDE_FLOAT32_C( -719.57),
SIMDE_FLOAT32_C( 334.00), SIMDE_FLOAT32_C( 274.71), SIMDE_FLOAT32_C( -916.82), SIMDE_FLOAT32_C( -490.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.56),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 105.79), SIMDE_FLOAT32_C( 590.10),
SIMDE_FLOAT32_C( 30.91), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -84.00), SIMDE_FLOAT32_C( 80.04),
SIMDE_FLOAT32_C( -709.46), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -889.11)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 18.75), SIMDE_FLOAT32_C( 809.05), SIMDE_FLOAT32_C( 144.05), SIMDE_FLOAT32_C( -427.72),
SIMDE_FLOAT32_C( 308.28), SIMDE_FLOAT32_C( -177.05), SIMDE_FLOAT32_C( -457.77), SIMDE_FLOAT32_C( 678.24),
SIMDE_FLOAT32_C( 66.05), SIMDE_FLOAT32_C( -267.71), SIMDE_FLOAT32_C( 117.28), SIMDE_FLOAT32_C( -576.80),
SIMDE_FLOAT32_C( -38.39), SIMDE_FLOAT32_C( -250.14), SIMDE_FLOAT32_C( -53.92), SIMDE_FLOAT32_C( 91.94)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.54), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( 1.56)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( -493.41), SIMDE_FLOAT32_C( 822.72),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( -816.27),
SIMDE_FLOAT32_C( -209.34), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -728.70), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( 100.32), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( -204.33)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -841.43), SIMDE_FLOAT32_C( -14.16), SIMDE_FLOAT32_C( 824.88), SIMDE_FLOAT32_C( 793.63),
SIMDE_FLOAT32_C( -736.75), SIMDE_FLOAT32_C( -310.57), SIMDE_FLOAT32_C( 728.87), SIMDE_FLOAT32_C( -350.72),
SIMDE_FLOAT32_C( 60.89), SIMDE_FLOAT32_C( 109.81), SIMDE_FLOAT32_C( 715.94), SIMDE_FLOAT32_C( -250.60),
SIMDE_FLOAT32_C( 944.14), SIMDE_FLOAT32_C( 361.85), SIMDE_FLOAT32_C( -13.07), SIMDE_FLOAT32_C( 852.60)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( 1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_atan_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_atan_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.54),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.55),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( 1.56), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.54),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -770.35), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( 380.46),
SIMDE_FLOAT64_C( -770.72), SIMDE_FLOAT64_C( 993.90),
SIMDE_FLOAT64_C( 28.08), SIMDE_FLOAT64_C( 841.21)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 1.54), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -387.90), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 339.21),
SIMDE_FLOAT64_C( 532.35), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -30.79)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( -1.54)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -203.65), SIMDE_FLOAT64_C( -80.73),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -944.78),
SIMDE_FLOAT64_C( -747.59), SIMDE_FLOAT64_C( -767.23),
SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( 398.82)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.56),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 469.66), SIMDE_FLOAT64_C( 680.02),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 600.47),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( 254.31)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( -1.56), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_atan_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_atan_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -686.13), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 670.24), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( 39.01), SIMDE_FLOAT64_C( 346.63)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 84.77),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( -269.45),
SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( -297.45),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( -754.38)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( 233.37),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -384.03),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -417.54)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 841.21), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 687.09), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -660.80), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -923.64), SIMDE_FLOAT64_C( -860.95)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 1.54),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -1.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 398.82), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 339.21), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( -30.79), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( 993.90)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( -387.90),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 532.35),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -770.35),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( -770.72)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( -1.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( 469.66),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 910.03),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( -203.65),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -747.59)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 543.35), SIMDE_FLOAT64_C( -171.51),
SIMDE_FLOAT64_C( 680.02), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 600.47), SIMDE_FLOAT64_C( 254.31),
SIMDE_FLOAT64_C( -80.73), SIMDE_FLOAT64_C( -944.78)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( -1.56),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -1.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 99.93), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 343.48),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 655.67)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 331.34), SIMDE_FLOAT64_C( 462.95),
SIMDE_FLOAT64_C( -178.99), SIMDE_FLOAT64_C( 324.62),
SIMDE_FLOAT64_C( -874.31), SIMDE_FLOAT64_C( -328.54),
SIMDE_FLOAT64_C( -192.31), SIMDE_FLOAT64_C( 561.36)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 1.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 27.25),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -448.89), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 831.02), SIMDE_FLOAT64_C( 977.36)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 977.49), SIMDE_FLOAT64_C( 424.81),
SIMDE_FLOAT64_C( -95.15), SIMDE_FLOAT64_C( 840.65),
SIMDE_FLOAT64_C( -591.56), SIMDE_FLOAT64_C( 731.49),
SIMDE_FLOAT64_C( 623.70), SIMDE_FLOAT64_C( 140.67)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.56)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( -304.73),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( 822.06),
SIMDE_FLOAT64_C( -997.63), SIMDE_FLOAT64_C( 923.64),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -67.64)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 510.85), SIMDE_FLOAT64_C( 14.34),
SIMDE_FLOAT64_C( 916.26), SIMDE_FLOAT64_C( -769.09),
SIMDE_FLOAT64_C( -573.81), SIMDE_FLOAT64_C( -337.60),
SIMDE_FLOAT64_C( 293.64), SIMDE_FLOAT64_C( -576.22)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( 1.50),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -1.57)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 475.51), SIMDE_FLOAT64_C( 936.65),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -438.19),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( 932.66),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -182.45)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -775.04), SIMDE_FLOAT64_C( 440.64),
SIMDE_FLOAT64_C( 897.27), SIMDE_FLOAT64_C( -197.89),
SIMDE_FLOAT64_C( -359.76), SIMDE_FLOAT64_C( -33.67),
SIMDE_FLOAT64_C( 7.27), SIMDE_FLOAT64_C( -125.20)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -1.57),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( -1.54),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -1.56)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_atan_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_atan2_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 b;
simde__m128 r;
} test_vec[9] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( 346.63)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( -754.38)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 2.94), SIMDE_FLOAT32_C( 2.71)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -2.35), SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 2.09)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -2.35), SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 2.09)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 696.87)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -417.54)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 3.08), SIMDE_FLOAT32_C( -1.96), SIMDE_FLOAT32_C( -1.91), SIMDE_FLOAT32_C( 2.11)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -660.80)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -976.55)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -1.19), SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -2.55)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( 380.46), SIMDE_FLOAT32_C( 993.90), SIMDE_FLOAT32_C( 841.21)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 28.08)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 2.62), SIMDE_FLOAT32_C( 2.56), SIMDE_FLOAT32_C( 2.23), SIMDE_FLOAT32_C( 1.54)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 339.21), SIMDE_FLOAT32_C( -263.99), SIMDE_FLOAT32_C( -30.79)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( 780.64)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 2.35), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -0.04)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( -944.78), SIMDE_FLOAT32_C( -767.23), SIMDE_FLOAT32_C( 398.82)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -554.19)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -2.76), SIMDE_FLOAT32_C( -1.23), SIMDE_FLOAT32_C( -2.34), SIMDE_FLOAT32_C( 2.52)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 593.11), SIMDE_FLOAT32_C( 480.49), SIMDE_FLOAT32_C( -877.19), SIMDE_FLOAT32_C( -326.68)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_atan2_ps(test_vec[i].a, test_vec[i].b);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_atan2_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d b;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 39.01), SIMDE_FLOAT64_C( 346.63)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( -754.38)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 2.94), SIMDE_FLOAT64_C( 2.71)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 670.24), SIMDE_FLOAT64_C( 34.06)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( -297.45)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 3.03)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( -269.45)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 2.09)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -686.13), SIMDE_FLOAT64_C( 571.46)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 84.77)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -2.35), SIMDE_FLOAT64_C( 1.42)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -860.95), SIMDE_FLOAT64_C( 696.87)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -417.54)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -1.91), SIMDE_FLOAT64_C( 2.11)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 28.47), SIMDE_FLOAT64_C( -923.64)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -384.03)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 3.08), SIMDE_FLOAT64_C( -1.96)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -212.54), SIMDE_FLOAT64_C( -660.80)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -976.55)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -2.55)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_atan2_pd(test_vec[i].a, test_vec[i].b);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_atan2_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 b;
simde__m256 r;
} test_vec[9] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( 346.63)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( -297.45),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( -754.38)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -2.35), SIMDE_FLOAT32_C( 1.42),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 2.09),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 3.03),
SIMDE_FLOAT32_C( 2.94), SIMDE_FLOAT32_C( 2.71)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 696.87)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 233.37),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -976.55),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -384.03),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -417.54)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -1.19), SIMDE_FLOAT32_C( 1.24),
SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -2.55),
SIMDE_FLOAT32_C( 3.08), SIMDE_FLOAT32_C( -1.96),
SIMDE_FLOAT32_C( -1.91), SIMDE_FLOAT32_C( 2.11)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( -263.99), SIMDE_FLOAT32_C( -30.79),
SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( 993.90), SIMDE_FLOAT32_C( 841.21)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 655.87),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( 780.64),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( -583.60),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 28.08)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.35), SIMDE_FLOAT32_C( 0.48),
SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -0.04),
SIMDE_FLOAT32_C( 2.62), SIMDE_FLOAT32_C( 2.56),
SIMDE_FLOAT32_C( 2.23), SIMDE_FLOAT32_C( 1.54)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 600.47), SIMDE_FLOAT32_C( 254.31),
SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -767.23), SIMDE_FLOAT32_C( 398.82)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( -148.69),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( 336.73),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -554.19)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 1.75),
SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.31),
SIMDE_FLOAT32_C( -2.76), SIMDE_FLOAT32_C( -1.23),
SIMDE_FLOAT32_C( -2.34), SIMDE_FLOAT32_C( 2.52)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 343.48),
SIMDE_FLOAT32_C( -797.92), SIMDE_FLOAT32_C( -525.83),
SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 655.67),
SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( -171.51)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 324.62),
SIMDE_FLOAT32_C( -874.31), SIMDE_FLOAT32_C( -328.54),
SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( 561.36),
SIMDE_FLOAT32_C( -70.91), SIMDE_FLOAT32_C( 120.65)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.80), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( -2.40), SIMDE_FLOAT32_C( -2.13),
SIMDE_FLOAT32_C( -1.80), SIMDE_FLOAT32_C( 0.86),
SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( -0.96)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -591.56),
SIMDE_FLOAT32_C( 731.49), SIMDE_FLOAT32_C( 623.70),
SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16),
SIMDE_FLOAT32_C( 99.93), SIMDE_FLOAT32_C( -738.19)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( -21.09),
SIMDE_FLOAT32_C( -448.89), SIMDE_FLOAT32_C( 505.79),
SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 977.36),
SIMDE_FLOAT32_C( 331.34), SIMDE_FLOAT32_C( 462.95)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( -1.61),
SIMDE_FLOAT32_C( 2.12), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( -0.75),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -1.01)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( -768.12),
SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 977.49),
SIMDE_FLOAT32_C( 424.81), SIMDE_FLOAT32_C( -95.15)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -573.81),
SIMDE_FLOAT32_C( -337.60), SIMDE_FLOAT32_C( 293.64),
SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 710.38),
SIMDE_FLOAT32_C( -756.42), SIMDE_FLOAT32_C( 27.25)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( -2.09),
SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( -1.21),
SIMDE_FLOAT32_C( -3.02), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( 2.63), SIMDE_FLOAT32_C( -1.29)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -125.20),
SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( -696.69)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43),
SIMDE_FLOAT32_C( 932.66), SIMDE_FLOAT32_C( -327.22),
SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85),
SIMDE_FLOAT32_C( 14.34), SIMDE_FLOAT32_C( 916.26)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -2.45), SIMDE_FLOAT32_C( -3.10),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -2.78),
SIMDE_FLOAT32_C( 2.93), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( -1.52), SIMDE_FLOAT32_C( -0.65)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24),
SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01),
SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_atan2_ps(test_vec[i].a, test_vec[i].b);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_atan2_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d b;
simde__m256d r;
} test_vec[9] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 670.24), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( 39.01), SIMDE_FLOAT64_C( 346.63)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( -297.45),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( -754.38)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 3.03),
SIMDE_FLOAT64_C( 2.94), SIMDE_FLOAT64_C( 2.71)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -686.13), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 84.77),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( -269.45)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -2.35), SIMDE_FLOAT64_C( 1.42),
SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 2.09)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 28.47), SIMDE_FLOAT64_C( -923.64),
SIMDE_FLOAT64_C( -860.95), SIMDE_FLOAT64_C( 696.87)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -384.03),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -417.54)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.08), SIMDE_FLOAT64_C( -1.96),
SIMDE_FLOAT64_C( -1.91), SIMDE_FLOAT64_C( 2.11)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -450.67), SIMDE_FLOAT64_C( 687.09),
SIMDE_FLOAT64_C( -212.54), SIMDE_FLOAT64_C( -660.80)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( 233.37),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -976.55)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -1.19), SIMDE_FLOAT64_C( 1.24),
SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -2.55)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 443.48), SIMDE_FLOAT64_C( 380.46),
SIMDE_FLOAT64_C( 993.90), SIMDE_FLOAT64_C( 841.21)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -770.35), SIMDE_FLOAT64_C( -583.60),
SIMDE_FLOAT64_C( -770.72), SIMDE_FLOAT64_C( 28.08)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.62), SIMDE_FLOAT64_C( 2.56),
SIMDE_FLOAT64_C( 2.23), SIMDE_FLOAT64_C( 1.54)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 395.92), SIMDE_FLOAT64_C( 339.21),
SIMDE_FLOAT64_C( -263.99), SIMDE_FLOAT64_C( -30.79)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -387.90), SIMDE_FLOAT64_C( 655.87),
SIMDE_FLOAT64_C( 532.35), SIMDE_FLOAT64_C( 780.64)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.35), SIMDE_FLOAT64_C( 0.48),
SIMDE_FLOAT64_C( -0.46), SIMDE_FLOAT64_C( -0.04)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -80.73), SIMDE_FLOAT64_C( -944.78),
SIMDE_FLOAT64_C( -767.23), SIMDE_FLOAT64_C( 398.82)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -203.65), SIMDE_FLOAT64_C( 336.73),
SIMDE_FLOAT64_C( -747.59), SIMDE_FLOAT64_C( -554.19)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -2.76), SIMDE_FLOAT64_C( -1.23),
SIMDE_FLOAT64_C( -2.34), SIMDE_FLOAT64_C( 2.52)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 680.02), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 600.47), SIMDE_FLOAT64_C( 254.31)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 469.66), SIMDE_FLOAT64_C( -148.69),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 791.23)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.97), SIMDE_FLOAT64_C( 1.75),
SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.31)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_atan2_pd(test_vec[i].a, test_vec[i].b);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_atan2_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 b;
simde__m512 r;
} test_vec[9] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 696.87),
SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( 346.63)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -976.55),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -417.54),
SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( -754.38)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.19), SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -2.55),
SIMDE_FLOAT32_C( 3.08), SIMDE_FLOAT32_C( -1.96), SIMDE_FLOAT32_C( -1.91), SIMDE_FLOAT32_C( 2.11),
SIMDE_FLOAT32_C( -2.35), SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 2.09),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 2.94), SIMDE_FLOAT32_C( 2.71)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( 818.66), SIMDE_FLOAT32_C( 600.47), SIMDE_FLOAT32_C( 254.31),
SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( -944.78), SIMDE_FLOAT32_C( -767.23), SIMDE_FLOAT32_C( 398.82),
SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 339.21), SIMDE_FLOAT32_C( -263.99), SIMDE_FLOAT32_C( -30.79),
SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( 380.46), SIMDE_FLOAT32_C( 993.90), SIMDE_FLOAT32_C( 841.21)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -554.19),
SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( 780.64),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 28.08)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.31),
SIMDE_FLOAT32_C( -2.76), SIMDE_FLOAT32_C( -1.23), SIMDE_FLOAT32_C( -2.34), SIMDE_FLOAT32_C( 2.52),
SIMDE_FLOAT32_C( 2.35), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -0.04),
SIMDE_FLOAT32_C( 2.62), SIMDE_FLOAT32_C( 2.56), SIMDE_FLOAT32_C( 2.23), SIMDE_FLOAT32_C( 1.54)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49), SIMDE_FLOAT32_C( 623.70),
SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 99.93), SIMDE_FLOAT32_C( -738.19),
SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 343.48), SIMDE_FLOAT32_C( -797.92), SIMDE_FLOAT32_C( -525.83),
SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( -171.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -448.89), SIMDE_FLOAT32_C( 505.79),
SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( 331.34), SIMDE_FLOAT32_C( 462.95),
SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( -874.31), SIMDE_FLOAT32_C( -328.54),
SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( -70.91), SIMDE_FLOAT32_C( 120.65)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( -1.61), SIMDE_FLOAT32_C( 2.12), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -1.01),
SIMDE_FLOAT32_C( 1.80), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -2.40), SIMDE_FLOAT32_C( -2.13),
SIMDE_FLOAT32_C( -1.80), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( -0.96)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -33.67), SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -125.20),
SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 394.67), SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( -696.69),
SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -997.63), SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( -768.12),
SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 977.49), SIMDE_FLOAT32_C( 424.81), SIMDE_FLOAT32_C( -95.15)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66), SIMDE_FLOAT32_C( -327.22),
SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34), SIMDE_FLOAT32_C( 916.26),
SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -337.60), SIMDE_FLOAT32_C( 293.64),
SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( -756.42), SIMDE_FLOAT32_C( 27.25)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -2.45), SIMDE_FLOAT32_C( -3.10), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -2.78),
SIMDE_FLOAT32_C( 2.93), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( -1.52), SIMDE_FLOAT32_C( -0.65),
SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( -2.09), SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( -1.21),
SIMDE_FLOAT32_C( -3.02), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 2.63), SIMDE_FLOAT32_C( -1.29)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 177.92), SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 888.71),
SIMDE_FLOAT32_C( 915.71), SIMDE_FLOAT32_C( 133.52), SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06),
SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93), SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36),
SIMDE_FLOAT32_C( -775.04), SIMDE_FLOAT32_C( 440.64), SIMDE_FLOAT32_C( 897.27), SIMDE_FLOAT32_C( -197.89)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -718.40), SIMDE_FLOAT32_C( 159.97), SIMDE_FLOAT32_C( -861.01), SIMDE_FLOAT32_C( 426.61),
SIMDE_FLOAT32_C( 932.11), SIMDE_FLOAT32_C( 110.36), SIMDE_FLOAT32_C( 826.84), SIMDE_FLOAT32_C( -76.75),
SIMDE_FLOAT32_C( 237.58), SIMDE_FLOAT32_C( -378.50), SIMDE_FLOAT32_C( -601.68), SIMDE_FLOAT32_C( -623.50),
SIMDE_FLOAT32_C( -942.47), SIMDE_FLOAT32_C( 475.51), SIMDE_FLOAT32_C( 936.65), SIMDE_FLOAT32_C( -348.70)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -2.73), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 2.76), SIMDE_FLOAT32_C( 1.12),
SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -1.70),
SIMDE_FLOAT32_C( -1.28), SIMDE_FLOAT32_C( -2.03), SIMDE_FLOAT32_C( 2.79), SIMDE_FLOAT32_C( -2.24),
SIMDE_FLOAT32_C( -2.45), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( -2.63)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -764.58), SIMDE_FLOAT32_C( 789.89), SIMDE_FLOAT32_C( 4.83), SIMDE_FLOAT32_C( -818.54),
SIMDE_FLOAT32_C( 161.06), SIMDE_FLOAT32_C( 579.25), SIMDE_FLOAT32_C( -11.78), SIMDE_FLOAT32_C( -308.52),
SIMDE_FLOAT32_C( -719.57), SIMDE_FLOAT32_C( 334.00), SIMDE_FLOAT32_C( 274.71), SIMDE_FLOAT32_C( -916.82),
SIMDE_FLOAT32_C( -490.00), SIMDE_FLOAT32_C( -799.40), SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -737.13)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -70.05), SIMDE_FLOAT32_C( -784.34), SIMDE_FLOAT32_C( 92.52), SIMDE_FLOAT32_C( 206.60),
SIMDE_FLOAT32_C( 834.60), SIMDE_FLOAT32_C( -65.60), SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86),
SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48), SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 556.35),
SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03), SIMDE_FLOAT32_C( 496.57), SIMDE_FLOAT32_C( 915.19)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.66), SIMDE_FLOAT32_C( 2.35), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( -1.32),
SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 1.68), SIMDE_FLOAT32_C( -3.10), SIMDE_FLOAT32_C( -2.17),
SIMDE_FLOAT32_C( -1.99), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 2.73), SIMDE_FLOAT32_C( -1.03),
SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -1.07), SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( -0.68)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 638.94), SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 105.79),
SIMDE_FLOAT32_C( 590.10), SIMDE_FLOAT32_C( 30.91), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -84.00),
SIMDE_FLOAT32_C( 80.04), SIMDE_FLOAT32_C( -709.46), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58),
SIMDE_FLOAT32_C( -889.11), SIMDE_FLOAT32_C( -964.25), SIMDE_FLOAT32_C( -406.33), SIMDE_FLOAT32_C( -743.66)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -427.72), SIMDE_FLOAT32_C( 308.28), SIMDE_FLOAT32_C( -177.05), SIMDE_FLOAT32_C( -457.77),
SIMDE_FLOAT32_C( 678.24), SIMDE_FLOAT32_C( 66.05), SIMDE_FLOAT32_C( -267.71), SIMDE_FLOAT32_C( 117.28),
SIMDE_FLOAT32_C( -576.80), SIMDE_FLOAT32_C( -38.39), SIMDE_FLOAT32_C( -250.14), SIMDE_FLOAT32_C( -53.92),
SIMDE_FLOAT32_C( 91.94), SIMDE_FLOAT32_C( -78.84), SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -807.28)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.16), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -1.80), SIMDE_FLOAT32_C( 2.91),
SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 1.97), SIMDE_FLOAT32_C( -0.62),
SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( -1.62), SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 1.71),
SIMDE_FLOAT32_C( -1.47), SIMDE_FLOAT32_C( -1.65), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( -2.40)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -736.75), SIMDE_FLOAT32_C( -310.57), SIMDE_FLOAT32_C( 728.87), SIMDE_FLOAT32_C( -350.72),
SIMDE_FLOAT32_C( 60.89), SIMDE_FLOAT32_C( 109.81), SIMDE_FLOAT32_C( 715.94), SIMDE_FLOAT32_C( -250.60),
SIMDE_FLOAT32_C( 944.14), SIMDE_FLOAT32_C( 361.85), SIMDE_FLOAT32_C( -13.07), SIMDE_FLOAT32_C( 852.60),
SIMDE_FLOAT32_C( -440.06), SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 822.72), SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49),
SIMDE_FLOAT32_C( -816.27), SIMDE_FLOAT32_C( -209.34), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -728.70),
SIMDE_FLOAT32_C( -420.06), SIMDE_FLOAT32_C( 100.32), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77),
SIMDE_FLOAT32_C( -204.33), SIMDE_FLOAT32_C( 18.75), SIMDE_FLOAT32_C( 809.05), SIMDE_FLOAT32_C( 144.05)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( -0.40),
SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 2.66), SIMDE_FLOAT32_C( 2.49), SIMDE_FLOAT32_C( -2.81),
SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 1.30), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 1.09),
SIMDE_FLOAT32_C( -2.01), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( -1.42)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_atan2_ps(test_vec[i].a, test_vec[i].b);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_atan2_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 b;
simde__m512 r;
} test_vec[9] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( -263.99), SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 346.63)),
UINT16_C(25611),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -30.79), SIMDE_FLOAT32_C( -583.60),
SIMDE_FLOAT32_C( 993.90), SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -976.55),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( 696.87), SIMDE_FLOAT32_C( 84.77),
SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( -754.38)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 339.21), SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( 443.48),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 841.21), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( -212.54),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( -686.13),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 467.76), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 39.01)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( -1.52)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -448.89), SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 977.36),
SIMDE_FLOAT32_C( 99.93), SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 343.48), SIMDE_FLOAT32_C( -328.54),
SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( -70.91), SIMDE_FLOAT32_C( -171.51), SIMDE_FLOAT32_C( -148.69),
SIMDE_FLOAT32_C( 600.47), SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -944.78), SIMDE_FLOAT32_C( -554.19)),
UINT16_C(63749),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 140.67),
SIMDE_FLOAT32_C( 331.34), SIMDE_FLOAT32_C( -738.19), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 120.65), SIMDE_FLOAT32_C( 680.02),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 254.31), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -767.23)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( 731.49), SIMDE_FLOAT32_C( 831.02),
SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( -874.31),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( 469.66),
SIMDE_FLOAT32_C( 818.66), SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( -747.59)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.71), SIMDE_FLOAT32_C( -1.61), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.17),
SIMDE_FLOAT32_C( 2.79), SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 343.48), SIMDE_FLOAT32_C( -2.40),
SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( -70.91), SIMDE_FLOAT32_C( -171.51), SIMDE_FLOAT32_C( -148.69),
SIMDE_FLOAT32_C( 600.47), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -944.78), SIMDE_FLOAT32_C( -2.34)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 237.58), SIMDE_FLOAT32_C( -765.93), SIMDE_FLOAT32_C( -623.50), SIMDE_FLOAT32_C( -775.04),
SIMDE_FLOAT32_C( 936.65), SIMDE_FLOAT32_C( -197.89), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 7.27),
SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 394.67), SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( 822.06),
SIMDE_FLOAT32_C( -337.60), SIMDE_FLOAT32_C( -768.12), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( 424.81)),
UINT16_C(23119),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -378.50), SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -942.47),
SIMDE_FLOAT32_C( 440.64), SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( -125.20), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( -769.09),
SIMDE_FLOAT32_C( -997.63), SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( -756.42)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -76.75), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -601.68), SIMDE_FLOAT32_C( -788.36),
SIMDE_FLOAT32_C( 475.51), SIMDE_FLOAT32_C( 897.27), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 14.34), SIMDE_FLOAT32_C( -696.69),
SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 977.49)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 237.58), SIMDE_FLOAT32_C( -2.70), SIMDE_FLOAT32_C( -623.50), SIMDE_FLOAT32_C( -2.27),
SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( -197.89), SIMDE_FLOAT32_C( -2.45), SIMDE_FLOAT32_C( 7.27),
SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 1.49), SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( 822.06),
SIMDE_FLOAT32_C( -2.09), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -3.02), SIMDE_FLOAT32_C( -0.66)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -743.66), SIMDE_FLOAT32_C( -784.34), SIMDE_FLOAT32_C( 4.83), SIMDE_FLOAT32_C( 834.60),
SIMDE_FLOAT32_C( 579.25), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -719.57), SIMDE_FLOAT32_C( -628.82),
SIMDE_FLOAT32_C( -916.82), SIMDE_FLOAT32_C( 434.03), SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -718.40),
SIMDE_FLOAT32_C( 177.92), SIMDE_FLOAT32_C( 426.61), SIMDE_FLOAT32_C( 915.71), SIMDE_FLOAT32_C( 826.84)),
UINT16_C(57786),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -807.28), SIMDE_FLOAT32_C( -764.58), SIMDE_FLOAT32_C( 92.52), SIMDE_FLOAT32_C( -818.54),
SIMDE_FLOAT32_C( -65.60), SIMDE_FLOAT32_C( -11.78), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 334.00),
SIMDE_FLOAT32_C( 556.35), SIMDE_FLOAT32_C( -490.00), SIMDE_FLOAT32_C( 496.57), SIMDE_FLOAT32_C( -737.13),
SIMDE_FLOAT32_C( 159.97), SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 932.11), SIMDE_FLOAT32_C( 133.52)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -406.33), SIMDE_FLOAT32_C( -70.05), SIMDE_FLOAT32_C( 789.89), SIMDE_FLOAT32_C( 206.60),
SIMDE_FLOAT32_C( 161.06), SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -308.52), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( 274.71), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( -799.40), SIMDE_FLOAT32_C( 915.19),
SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( -861.01), SIMDE_FLOAT32_C( 888.71), SIMDE_FLOAT32_C( 110.36)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -2.04), SIMDE_FLOAT32_C( -1.66), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 834.60),
SIMDE_FLOAT32_C( 579.25), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -719.57), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( 434.03), SIMDE_FLOAT32_C( 2.59), SIMDE_FLOAT32_C( -0.68),
SIMDE_FLOAT32_C( 2.67), SIMDE_FLOAT32_C( 426.61), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 826.84)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -728.70), SIMDE_FLOAT32_C( 944.14), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 852.60),
SIMDE_FLOAT32_C( 18.75), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -427.72), SIMDE_FLOAT32_C( 450.90),
SIMDE_FLOAT32_C( -457.77), SIMDE_FLOAT32_C( 590.10), SIMDE_FLOAT32_C( -267.71), SIMDE_FLOAT32_C( -84.00),
SIMDE_FLOAT32_C( -38.39), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 91.94), SIMDE_FLOAT32_C( -964.25)),
UINT16_C(25589),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 715.94), SIMDE_FLOAT32_C( -420.06), SIMDE_FLOAT32_C( 361.85), SIMDE_FLOAT32_C( 439.77),
SIMDE_FLOAT32_C( -440.06), SIMDE_FLOAT32_C( 809.05), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 308.28),
SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 678.24), SIMDE_FLOAT32_C( 30.91), SIMDE_FLOAT32_C( 117.28),
SIMDE_FLOAT32_C( 80.04), SIMDE_FLOAT32_C( -250.14), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -78.84)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -250.60), SIMDE_FLOAT32_C( 100.32), SIMDE_FLOAT32_C( -13.07),
SIMDE_FLOAT32_C( -204.33), SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( 144.05), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( -177.05), SIMDE_FLOAT32_C( 105.79), SIMDE_FLOAT32_C( 66.05), SIMDE_FLOAT32_C( 635.35),
SIMDE_FLOAT32_C( -576.80), SIMDE_FLOAT32_C( -709.46), SIMDE_FLOAT32_C( -53.92), SIMDE_FLOAT32_C( -889.11)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -728.70), SIMDE_FLOAT32_C( -2.11), SIMDE_FLOAT32_C( 1.30), SIMDE_FLOAT32_C( 852.60),
SIMDE_FLOAT32_C( 18.75), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -1.42), SIMDE_FLOAT32_C( 0.45),
SIMDE_FLOAT32_C( -1.80), SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.18),
SIMDE_FLOAT32_C( -38.39), SIMDE_FLOAT32_C( -2.80), SIMDE_FLOAT32_C( 91.94), SIMDE_FLOAT32_C( -3.05)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 195.04), SIMDE_FLOAT32_C( 266.59), SIMDE_FLOAT32_C( 227.06), SIMDE_FLOAT32_C( 410.49),
SIMDE_FLOAT32_C( -523.93), SIMDE_FLOAT32_C( 762.39), SIMDE_FLOAT32_C( 112.81), SIMDE_FLOAT32_C( 686.52),
SIMDE_FLOAT32_C( 719.98), SIMDE_FLOAT32_C( 766.36), SIMDE_FLOAT32_C( -14.16), SIMDE_FLOAT32_C( -493.41),
SIMDE_FLOAT32_C( -736.75), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( -350.72), SIMDE_FLOAT32_C( -209.34)),
UINT16_C(43196),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -658.72), SIMDE_FLOAT32_C( -177.76), SIMDE_FLOAT32_C( -265.00), SIMDE_FLOAT32_C( -554.31),
SIMDE_FLOAT32_C( 533.87), SIMDE_FLOAT32_C( 51.67), SIMDE_FLOAT32_C( -492.25), SIMDE_FLOAT32_C( 777.74),
SIMDE_FLOAT32_C( 793.81), SIMDE_FLOAT32_C( 15.12), SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 824.88),
SIMDE_FLOAT32_C( 822.72), SIMDE_FLOAT32_C( -310.57), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( 60.89)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -371.53), SIMDE_FLOAT32_C( 353.46), SIMDE_FLOAT32_C( -605.99), SIMDE_FLOAT32_C( -513.13),
SIMDE_FLOAT32_C( -390.22), SIMDE_FLOAT32_C( -973.72), SIMDE_FLOAT32_C( -469.41), SIMDE_FLOAT32_C( 31.72),
SIMDE_FLOAT32_C( -35.27), SIMDE_FLOAT32_C( -851.21), SIMDE_FLOAT32_C( -841.43), SIMDE_FLOAT32_C( 330.43),
SIMDE_FLOAT32_C( 793.63), SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 728.87), SIMDE_FLOAT32_C( -816.27)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -2.08), SIMDE_FLOAT32_C( 266.59), SIMDE_FLOAT32_C( -2.73), SIMDE_FLOAT32_C( 410.49),
SIMDE_FLOAT32_C( 2.20), SIMDE_FLOAT32_C( 762.39), SIMDE_FLOAT32_C( 112.81), SIMDE_FLOAT32_C( 686.52),
SIMDE_FLOAT32_C( 1.62), SIMDE_FLOAT32_C( 766.36), SIMDE_FLOAT32_C( -2.39), SIMDE_FLOAT32_C( 1.19),
SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -350.72), SIMDE_FLOAT32_C( -209.34)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -858.24), SIMDE_FLOAT32_C( -559.04), SIMDE_FLOAT32_C( -867.90), SIMDE_FLOAT32_C( -91.47),
SIMDE_FLOAT32_C( -996.53), SIMDE_FLOAT32_C( 7.89), SIMDE_FLOAT32_C( 519.91), SIMDE_FLOAT32_C( -788.90),
SIMDE_FLOAT32_C( 494.45), SIMDE_FLOAT32_C( 338.97), SIMDE_FLOAT32_C( 858.03), SIMDE_FLOAT32_C( -607.40),
SIMDE_FLOAT32_C( 289.29), SIMDE_FLOAT32_C( 618.46), SIMDE_FLOAT32_C( 413.47), SIMDE_FLOAT32_C( -978.77)),
UINT16_C( 4768),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 740.49), SIMDE_FLOAT32_C( -751.81), SIMDE_FLOAT32_C( 13.69), SIMDE_FLOAT32_C( 786.36),
SIMDE_FLOAT32_C( -616.97), SIMDE_FLOAT32_C( 500.34), SIMDE_FLOAT32_C( -906.43), SIMDE_FLOAT32_C( 690.06),
SIMDE_FLOAT32_C( -252.06), SIMDE_FLOAT32_C( 828.60), SIMDE_FLOAT32_C( -203.59), SIMDE_FLOAT32_C( 933.39),
SIMDE_FLOAT32_C( -10.85), SIMDE_FLOAT32_C( -429.78), SIMDE_FLOAT32_C( 190.25), SIMDE_FLOAT32_C( 546.67)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -969.00), SIMDE_FLOAT32_C( 251.09), SIMDE_FLOAT32_C( 109.97), SIMDE_FLOAT32_C( 792.28),
SIMDE_FLOAT32_C( -643.59), SIMDE_FLOAT32_C( 926.98), SIMDE_FLOAT32_C( -815.02), SIMDE_FLOAT32_C( 181.20),
SIMDE_FLOAT32_C( -206.24), SIMDE_FLOAT32_C( 378.12), SIMDE_FLOAT32_C( -36.10), SIMDE_FLOAT32_C( -538.28),
SIMDE_FLOAT32_C( 894.04), SIMDE_FLOAT32_C( 72.41), SIMDE_FLOAT32_C( 681.48), SIMDE_FLOAT32_C( 677.82)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -858.24), SIMDE_FLOAT32_C( -559.04), SIMDE_FLOAT32_C( -867.90), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( -996.53), SIMDE_FLOAT32_C( 7.89), SIMDE_FLOAT32_C( -2.30), SIMDE_FLOAT32_C( -788.90),
SIMDE_FLOAT32_C( -2.26), SIMDE_FLOAT32_C( 338.97), SIMDE_FLOAT32_C( -1.75), SIMDE_FLOAT32_C( -607.40),
SIMDE_FLOAT32_C( 289.29), SIMDE_FLOAT32_C( 618.46), SIMDE_FLOAT32_C( 413.47), SIMDE_FLOAT32_C( -978.77)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -69.61), SIMDE_FLOAT32_C( -548.92), SIMDE_FLOAT32_C( 625.99), SIMDE_FLOAT32_C( 381.43),
SIMDE_FLOAT32_C( 949.66), SIMDE_FLOAT32_C( -196.91), SIMDE_FLOAT32_C( 28.28), SIMDE_FLOAT32_C( -181.88),
SIMDE_FLOAT32_C( 536.29), SIMDE_FLOAT32_C( -985.19), SIMDE_FLOAT32_C( 77.09), SIMDE_FLOAT32_C( 315.82),
SIMDE_FLOAT32_C( 11.44), SIMDE_FLOAT32_C( -742.19), SIMDE_FLOAT32_C( 808.07), SIMDE_FLOAT32_C( -406.94)),
UINT16_C(49835),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -137.31), SIMDE_FLOAT32_C( -142.23), SIMDE_FLOAT32_C( 35.44), SIMDE_FLOAT32_C( -260.69),
SIMDE_FLOAT32_C( -868.51), SIMDE_FLOAT32_C( -878.61), SIMDE_FLOAT32_C( 777.12), SIMDE_FLOAT32_C( 132.77),
SIMDE_FLOAT32_C( -396.93), SIMDE_FLOAT32_C( 836.29), SIMDE_FLOAT32_C( -770.09), SIMDE_FLOAT32_C( 911.50),
SIMDE_FLOAT32_C( 393.21), SIMDE_FLOAT32_C( -291.56), SIMDE_FLOAT32_C( 446.83), SIMDE_FLOAT32_C( 802.68)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -821.75), SIMDE_FLOAT32_C( -892.28), SIMDE_FLOAT32_C( -852.69), SIMDE_FLOAT32_C( 9.54),
SIMDE_FLOAT32_C( -850.83), SIMDE_FLOAT32_C( 144.77), SIMDE_FLOAT32_C( 932.71), SIMDE_FLOAT32_C( -565.94),
SIMDE_FLOAT32_C( -821.82), SIMDE_FLOAT32_C( -929.08), SIMDE_FLOAT32_C( -624.00), SIMDE_FLOAT32_C( -595.23),
SIMDE_FLOAT32_C( 666.07), SIMDE_FLOAT32_C( -246.97), SIMDE_FLOAT32_C( -517.48), SIMDE_FLOAT32_C( 645.83)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -2.98), SIMDE_FLOAT32_C( -2.98), SIMDE_FLOAT32_C( 625.99), SIMDE_FLOAT32_C( 381.43),
SIMDE_FLOAT32_C( 949.66), SIMDE_FLOAT32_C( -196.91), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( -181.88),
SIMDE_FLOAT32_C( -2.69), SIMDE_FLOAT32_C( -985.19), SIMDE_FLOAT32_C( -2.25), SIMDE_FLOAT32_C( 315.82),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -742.19), SIMDE_FLOAT32_C( 2.43), SIMDE_FLOAT32_C( 0.89)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 696.87),
SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( 346.63)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -976.55),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -417.54),
SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( -754.38)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( 346.63)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_atan2_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a, test_vec[i].b);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_atan2_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d b;
simde__m512d r;
} test_vec[9] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -686.13), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 670.24), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( 39.01), SIMDE_FLOAT64_C( 346.63)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 84.77),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( -269.45),
SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( -297.45),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( -754.38)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -2.35), SIMDE_FLOAT64_C( 1.42),
SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 2.09),
SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 3.03),
SIMDE_FLOAT64_C( 2.94), SIMDE_FLOAT64_C( 2.71)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -450.67), SIMDE_FLOAT64_C( 687.09),
SIMDE_FLOAT64_C( -212.54), SIMDE_FLOAT64_C( -660.80),
SIMDE_FLOAT64_C( 28.47), SIMDE_FLOAT64_C( -923.64),
SIMDE_FLOAT64_C( -860.95), SIMDE_FLOAT64_C( 696.87)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( 233.37),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -384.03),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -417.54)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.19), SIMDE_FLOAT64_C( 1.24),
SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -2.55),
SIMDE_FLOAT64_C( 3.08), SIMDE_FLOAT64_C( -1.96),
SIMDE_FLOAT64_C( -1.91), SIMDE_FLOAT64_C( 2.11)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 395.92), SIMDE_FLOAT64_C( 339.21),
SIMDE_FLOAT64_C( -263.99), SIMDE_FLOAT64_C( -30.79),
SIMDE_FLOAT64_C( 443.48), SIMDE_FLOAT64_C( 380.46),
SIMDE_FLOAT64_C( 993.90), SIMDE_FLOAT64_C( 841.21)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -387.90), SIMDE_FLOAT64_C( 655.87),
SIMDE_FLOAT64_C( 532.35), SIMDE_FLOAT64_C( 780.64),
SIMDE_FLOAT64_C( -770.35), SIMDE_FLOAT64_C( -583.60),
SIMDE_FLOAT64_C( -770.72), SIMDE_FLOAT64_C( 28.08)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.35), SIMDE_FLOAT64_C( 0.48),
SIMDE_FLOAT64_C( -0.46), SIMDE_FLOAT64_C( -0.04),
SIMDE_FLOAT64_C( 2.62), SIMDE_FLOAT64_C( 2.56),
SIMDE_FLOAT64_C( 2.23), SIMDE_FLOAT64_C( 1.54)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 680.02), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 600.47), SIMDE_FLOAT64_C( 254.31),
SIMDE_FLOAT64_C( -80.73), SIMDE_FLOAT64_C( -944.78),
SIMDE_FLOAT64_C( -767.23), SIMDE_FLOAT64_C( 398.82)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 469.66), SIMDE_FLOAT64_C( -148.69),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 791.23),
SIMDE_FLOAT64_C( -203.65), SIMDE_FLOAT64_C( 336.73),
SIMDE_FLOAT64_C( -747.59), SIMDE_FLOAT64_C( -554.19)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.97), SIMDE_FLOAT64_C( 1.75),
SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.31),
SIMDE_FLOAT64_C( -2.76), SIMDE_FLOAT64_C( -1.23),
SIMDE_FLOAT64_C( -2.34), SIMDE_FLOAT64_C( 2.52)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 343.48),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 655.67),
SIMDE_FLOAT64_C( 543.35), SIMDE_FLOAT64_C( -171.51)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -178.99), SIMDE_FLOAT64_C( 324.62),
SIMDE_FLOAT64_C( -874.31), SIMDE_FLOAT64_C( -328.54),
SIMDE_FLOAT64_C( -192.31), SIMDE_FLOAT64_C( 561.36),
SIMDE_FLOAT64_C( -70.91), SIMDE_FLOAT64_C( 120.65)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.80), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( -2.40), SIMDE_FLOAT64_C( -2.13),
SIMDE_FLOAT64_C( -1.80), SIMDE_FLOAT64_C( 0.86),
SIMDE_FLOAT64_C( 1.70), SIMDE_FLOAT64_C( -0.96)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 840.65), SIMDE_FLOAT64_C( -591.56),
SIMDE_FLOAT64_C( 731.49), SIMDE_FLOAT64_C( 623.70),
SIMDE_FLOAT64_C( 140.67), SIMDE_FLOAT64_C( -906.16),
SIMDE_FLOAT64_C( 99.93), SIMDE_FLOAT64_C( -738.19)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -448.89), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 831.02), SIMDE_FLOAT64_C( 977.36),
SIMDE_FLOAT64_C( 331.34), SIMDE_FLOAT64_C( 462.95)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.88), SIMDE_FLOAT64_C( -1.61),
SIMDE_FLOAT64_C( 2.12), SIMDE_FLOAT64_C( 0.89),
SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( -0.75),
SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( -1.01)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 822.06), SIMDE_FLOAT64_C( -997.63),
SIMDE_FLOAT64_C( 923.64), SIMDE_FLOAT64_C( -768.12),
SIMDE_FLOAT64_C( -67.64), SIMDE_FLOAT64_C( 977.49),
SIMDE_FLOAT64_C( 424.81), SIMDE_FLOAT64_C( -95.15)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -769.09), SIMDE_FLOAT64_C( -573.81),
SIMDE_FLOAT64_C( -337.60), SIMDE_FLOAT64_C( 293.64),
SIMDE_FLOAT64_C( -576.22), SIMDE_FLOAT64_C( 710.38),
SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 27.25)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.32), SIMDE_FLOAT64_C( -2.09),
SIMDE_FLOAT64_C( 1.92), SIMDE_FLOAT64_C( -1.21),
SIMDE_FLOAT64_C( -3.02), SIMDE_FLOAT64_C( 0.94),
SIMDE_FLOAT64_C( 2.63), SIMDE_FLOAT64_C( -1.29)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -359.76), SIMDE_FLOAT64_C( -33.67),
SIMDE_FLOAT64_C( 7.27), SIMDE_FLOAT64_C( -125.20),
SIMDE_FLOAT64_C( 39.93), SIMDE_FLOAT64_C( 394.67),
SIMDE_FLOAT64_C( -304.73), SIMDE_FLOAT64_C( -696.69)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -438.19), SIMDE_FLOAT64_C( -752.43),
SIMDE_FLOAT64_C( 932.66), SIMDE_FLOAT64_C( -327.22),
SIMDE_FLOAT64_C( -182.45), SIMDE_FLOAT64_C( 510.85),
SIMDE_FLOAT64_C( 14.34), SIMDE_FLOAT64_C( 916.26)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -2.45), SIMDE_FLOAT64_C( -3.10),
SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( -2.78),
SIMDE_FLOAT64_C( 2.93), SIMDE_FLOAT64_C( 0.66),
SIMDE_FLOAT64_C( -1.52), SIMDE_FLOAT64_C( -0.65)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 469.66), SIMDE_FLOAT64_C( 680.02),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 600.47),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( 254.31)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_atan2_pd(test_vec[i].a, test_vec[i].b);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_atan2_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d b;
simde__m512d r;
} test_vec[9] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 346.63)),
UINT8_C(212),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 28.47), SIMDE_FLOAT64_C( -305.07),
SIMDE_FLOAT64_C( 696.87), SIMDE_FLOAT64_C( 84.77),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 497.31),
SIMDE_FLOAT64_C( 34.06), SIMDE_FLOAT64_C( -754.38)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -923.64),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 39.01)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.08), SIMDE_FLOAT64_C( -2.82),
SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 3.02),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 346.63)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 395.92), SIMDE_FLOAT64_C( 532.35),
SIMDE_FLOAT64_C( -30.79), SIMDE_FLOAT64_C( -583.60),
SIMDE_FLOAT64_C( 993.90), SIMDE_FLOAT64_C( 178.20),
SIMDE_FLOAT64_C( 687.09), SIMDE_FLOAT64_C( -976.55)),
UINT8_C(126),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -387.90), SIMDE_FLOAT64_C( 339.21),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( -770.72), SIMDE_FLOAT64_C( 841.21),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( -212.54)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 398.82), SIMDE_FLOAT64_C( 655.87),
SIMDE_FLOAT64_C( -263.99), SIMDE_FLOAT64_C( -770.35),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( 28.08),
SIMDE_FLOAT64_C( -450.67), SIMDE_FLOAT64_C( 261.31)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 395.92), SIMDE_FLOAT64_C( 0.48),
SIMDE_FLOAT64_C( 1.90), SIMDE_FLOAT64_C( 2.62),
SIMDE_FLOAT64_C( -1.11), SIMDE_FLOAT64_C( 1.54),
SIMDE_FLOAT64_C( 2.66), SIMDE_FLOAT64_C( -976.55)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -192.31), SIMDE_FLOAT64_C( 655.67),
SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( 680.02),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 254.31),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -767.23)),
UINT8_C( 39),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -525.83), SIMDE_FLOAT64_C( 561.36),
SIMDE_FLOAT64_C( 543.35), SIMDE_FLOAT64_C( 469.66),
SIMDE_FLOAT64_C( 818.66), SIMDE_FLOAT64_C( 791.23),
SIMDE_FLOAT64_C( -80.73), SIMDE_FLOAT64_C( -747.59)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -328.54), SIMDE_FLOAT64_C( -822.65),
SIMDE_FLOAT64_C( -70.91), SIMDE_FLOAT64_C( -171.51),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 600.47),
SIMDE_FLOAT64_C( -203.65), SIMDE_FLOAT64_C( -944.78)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -192.31), SIMDE_FLOAT64_C( 655.67),
SIMDE_FLOAT64_C( 1.70), SIMDE_FLOAT64_C( 680.02),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 0.92),
SIMDE_FLOAT64_C( -2.76), SIMDE_FLOAT64_C( -2.47)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -95.15), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( 731.49), SIMDE_FLOAT64_C( 831.02),
SIMDE_FLOAT64_C( -906.16), SIMDE_FLOAT64_C( 462.95),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( -874.31)),
UINT8_C( 45),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 27.25), SIMDE_FLOAT64_C( 840.65),
SIMDE_FLOAT64_C( -448.89), SIMDE_FLOAT64_C( 623.70),
SIMDE_FLOAT64_C( 977.36), SIMDE_FLOAT64_C( 99.93),
SIMDE_FLOAT64_C( -178.99), SIMDE_FLOAT64_C( 343.48)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 424.81), SIMDE_FLOAT64_C( 690.12),
SIMDE_FLOAT64_C( -591.56), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 140.67), SIMDE_FLOAT64_C( 331.34),
SIMDE_FLOAT64_C( -738.19), SIMDE_FLOAT64_C( 324.62)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -95.15), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -2.49), SIMDE_FLOAT64_C( 831.02),
SIMDE_FLOAT64_C( 1.43), SIMDE_FLOAT64_C( 0.29),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 0.81)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( 39.93),
SIMDE_FLOAT64_C( 14.34), SIMDE_FLOAT64_C( -696.69),
SIMDE_FLOAT64_C( -573.81), SIMDE_FLOAT64_C( 923.64),
SIMDE_FLOAT64_C( -576.22), SIMDE_FLOAT64_C( 977.49)),
UINT8_C(108),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7.27), SIMDE_FLOAT64_C( -182.45),
SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( 916.26),
SIMDE_FLOAT64_C( 822.06), SIMDE_FLOAT64_C( -337.60),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( 710.38)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 932.66), SIMDE_FLOAT64_C( -125.20),
SIMDE_FLOAT64_C( 510.85), SIMDE_FLOAT64_C( -304.73),
SIMDE_FLOAT64_C( -769.09), SIMDE_FLOAT64_C( -997.63),
SIMDE_FLOAT64_C( 293.64), SIMDE_FLOAT64_C( -67.64)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -2.17),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( -696.69),
SIMDE_FLOAT64_C( 2.32), SIMDE_FLOAT64_C( -2.82),
SIMDE_FLOAT64_C( -576.22), SIMDE_FLOAT64_C( 977.49)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 484.94), SIMDE_FLOAT64_C( 237.58),
SIMDE_FLOAT64_C( -765.93), SIMDE_FLOAT64_C( -623.50),
SIMDE_FLOAT64_C( -775.04), SIMDE_FLOAT64_C( 936.65),
SIMDE_FLOAT64_C( -197.89), SIMDE_FLOAT64_C( -752.43)),
UINT8_C(214),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 826.84), SIMDE_FLOAT64_C( -598.06),
SIMDE_FLOAT64_C( -378.50), SIMDE_FLOAT64_C( 221.37),
SIMDE_FLOAT64_C( -942.47), SIMDE_FLOAT64_C( 440.64),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -359.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 133.52), SIMDE_FLOAT64_C( -76.75),
SIMDE_FLOAT64_C( -791.07), SIMDE_FLOAT64_C( -601.68),
SIMDE_FLOAT64_C( -788.36), SIMDE_FLOAT64_C( 475.51),
SIMDE_FLOAT64_C( 897.27), SIMDE_FLOAT64_C( -438.19)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.41), SIMDE_FLOAT64_C( -1.70),
SIMDE_FLOAT64_C( -765.93), SIMDE_FLOAT64_C( 2.79),
SIMDE_FLOAT64_C( -775.04), SIMDE_FLOAT64_C( 0.75),
SIMDE_FLOAT64_C( -0.37), SIMDE_FLOAT64_C( -752.43)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -628.82), SIMDE_FLOAT64_C( -916.82),
SIMDE_FLOAT64_C( 434.03), SIMDE_FLOAT64_C( -15.61),
SIMDE_FLOAT64_C( -718.40), SIMDE_FLOAT64_C( 177.92),
SIMDE_FLOAT64_C( 426.61), SIMDE_FLOAT64_C( 915.71)),
UINT8_C( 31),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 334.00), SIMDE_FLOAT64_C( 556.35),
SIMDE_FLOAT64_C( -490.00), SIMDE_FLOAT64_C( 496.57),
SIMDE_FLOAT64_C( -737.13), SIMDE_FLOAT64_C( 159.97),
SIMDE_FLOAT64_C( 345.93), SIMDE_FLOAT64_C( 932.11)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 783.48), SIMDE_FLOAT64_C( 274.71),
SIMDE_FLOAT64_C( 439.43), SIMDE_FLOAT64_C( -799.40),
SIMDE_FLOAT64_C( 915.19), SIMDE_FLOAT64_C( -314.93),
SIMDE_FLOAT64_C( -861.01), SIMDE_FLOAT64_C( 888.71)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -628.82), SIMDE_FLOAT64_C( -916.82),
SIMDE_FLOAT64_C( 434.03), SIMDE_FLOAT64_C( 2.59),
SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( 2.67),
SIMDE_FLOAT64_C( 2.76), SIMDE_FLOAT64_C( 0.81)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -964.25), SIMDE_FLOAT64_C( -807.28),
SIMDE_FLOAT64_C( -764.58), SIMDE_FLOAT64_C( 92.52),
SIMDE_FLOAT64_C( -818.54), SIMDE_FLOAT64_C( -65.60),
SIMDE_FLOAT64_C( -11.78), SIMDE_FLOAT64_C( -318.38)),
UINT8_C( 46),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -78.84), SIMDE_FLOAT64_C( -406.33),
SIMDE_FLOAT64_C( -70.05), SIMDE_FLOAT64_C( 789.89),
SIMDE_FLOAT64_C( 206.60), SIMDE_FLOAT64_C( 161.06),
SIMDE_FLOAT64_C( -286.07), SIMDE_FLOAT64_C( -308.52)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -889.11), SIMDE_FLOAT64_C( 883.05),
SIMDE_FLOAT64_C( -743.66), SIMDE_FLOAT64_C( -784.34),
SIMDE_FLOAT64_C( 4.83), SIMDE_FLOAT64_C( 834.60),
SIMDE_FLOAT64_C( 579.25), SIMDE_FLOAT64_C( -212.86)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -964.25), SIMDE_FLOAT64_C( -807.28),
SIMDE_FLOAT64_C( -3.05), SIMDE_FLOAT64_C( 92.52),
SIMDE_FLOAT64_C( 1.55), SIMDE_FLOAT64_C( 0.19),
SIMDE_FLOAT64_C( -0.46), SIMDE_FLOAT64_C( -318.38)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -686.00), SIMDE_FLOAT64_C( 571.00),
SIMDE_FLOAT64_C( 422.00), SIMDE_FLOAT64_C( 468.00),
SIMDE_FLOAT64_C( 670.00), SIMDE_FLOAT64_C( 34.00),
SIMDE_FLOAT64_C( 39.00), SIMDE_FLOAT64_C( 347.00)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.00), SIMDE_FLOAT64_C( 85.00),
SIMDE_FLOAT64_C( 826.00), SIMDE_FLOAT64_C( -269.00),
SIMDE_FLOAT64_C( 497.00), SIMDE_FLOAT64_C( -297.00),
SIMDE_FLOAT64_C( -186.00), SIMDE_FLOAT64_C( -754.00)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( 571.00),
SIMDE_FLOAT64_C( 422.00), SIMDE_FLOAT64_C( 468.00),
SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( 34.00),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -1.57)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_atan2_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a, test_vec[i].b);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_atanh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.35)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.37)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.03)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.51)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.57)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.65)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.70)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( -1.29), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.87)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.92)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( -1.59)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.66)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -2.30), SIMDE_FLOAT32_C( -0.79)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.69)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.85)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_atanh_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_atanh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.35)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.97), SIMDE_FLOAT64_C( 0.37)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( 0.03)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.67)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.81)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( 0.51)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.42)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.19), SIMDE_FLOAT64_C( 0.45)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.57)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.65)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.69)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.83), SIMDE_FLOAT64_C( -0.85)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_atanh_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_atanh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.35)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.37)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.69),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.42),
SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( -0.85),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.65),
SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 0.45),
SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.51)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.92),
SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.86),
SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.70)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( -1.59),
SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( -1.29),
SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.87)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21),
SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.66)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.48),
SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.85),
SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.21),
SIMDE_FLOAT32_C( -2.30), SIMDE_FLOAT32_C( -0.79)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.44),
SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.84)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -1.02), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( -1.02), SIMDE_FLOAT32_C( 2.65),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 1.22)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.26),
SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.03)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 0.42),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.27),
SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( -0.03)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.94),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.40)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( -1.74),
SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( -1.02),
SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( 0.42)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.25)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.83),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 1.16),
SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 1.07), SIMDE_FLOAT32_C( 0.26)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_atanh_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_atanh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.35)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.97), SIMDE_FLOAT64_C( 0.37)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.67),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( 0.03)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.19), SIMDE_FLOAT64_C( 0.45),
SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( 0.51)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.69),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.57)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.83), SIMDE_FLOAT64_C( -0.85),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.65)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.86),
SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 0.70)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -1.29),
SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( 0.87)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.38), SIMDE_FLOAT64_C( -0.92)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.47), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.40), SIMDE_FLOAT64_C( -1.59)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.98), SIMDE_FLOAT64_C( -0.66)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -2.30), SIMDE_FLOAT64_C( -0.79)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.45),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.69)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.48),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.85)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_atanh_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_atanh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.52),
SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.67)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 1.05),
SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.58),
SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.81)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.17),
SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.85)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 1.22),
SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.17),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 1.26)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.48),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.92)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 0.52),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.85),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 2.65), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 1.59)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.91),
SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.63),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.70)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 1.53),
SIMDE_FLOAT32_C( 1.83), SIMDE_FLOAT32_C( 1.10), SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 0.74),
SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.87)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.24),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.83),
SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.41)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 1.38), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.24),
SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( 1.19),
SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 1.02), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.44)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.20),
SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.13)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 1.59), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.20),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 1.29), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 1.13),
SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 2.65), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.13)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.12),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.45)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.12),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 1.26), SIMDE_FLOAT32_C( 2.65),
SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.48)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.48),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.44),
SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.70),
SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.15)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.52),
SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.87),
SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.15)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_atanh_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_atanh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.17),
SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.85),
SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.67)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.36),
SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.12)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.17),
SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.30),
SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.67)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.22),
SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.51)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.80),
SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.12),
SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.37),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.99)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.12),
SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 2.65)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.86),
SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.55),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.10),
SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.77)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.28),
SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.46)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.86),
SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.10),
SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.77)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.51),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.12)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.48),
SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.70),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.99)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.51),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 2.65)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.12),
SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.72)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.58),
SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.55),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.31),
SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.74)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.62),
SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.12),
SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.72)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.72)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.12),
SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.58),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.14),
SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.25)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.72)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.54),
SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.06)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.21),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.54)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 1.22),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.21),
SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.60)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 0.91),
SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.09),
SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.40)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.90),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.32),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.37),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.92)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 1.47),
SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 1.29), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( 2.09), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 1.59)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_atanh_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_atanh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.67),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.35)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.97), SIMDE_FLOAT64_C( 0.37)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.69),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.83), SIMDE_FLOAT64_C( -0.85),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.65),
SIMDE_FLOAT64_C( 1.19), SIMDE_FLOAT64_C( 0.45),
SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( 0.51)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.38), SIMDE_FLOAT64_C( -0.92),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.86),
SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 0.70)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.47), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.40), SIMDE_FLOAT64_C( -1.59),
SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -1.29),
SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( 0.87)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.45),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.69),
SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.98), SIMDE_FLOAT64_C( -0.66)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.48),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.85),
SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -2.30), SIMDE_FLOAT64_C( -0.79)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( 0.38),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.84)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.02), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( -0.66), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( -1.02), SIMDE_FLOAT64_C( 2.65),
SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 1.22)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.34),
SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( -0.26),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -0.03)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.41), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 0.35),
SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( -0.27),
SIMDE_FLOAT64_C( 1.05), SIMDE_FLOAT64_C( -0.03)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.20), SIMDE_FLOAT64_C( -0.08),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.94),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( 0.40)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.20), SIMDE_FLOAT64_C( -0.08),
SIMDE_FLOAT64_C( 0.35), SIMDE_FLOAT64_C( -1.74),
SIMDE_FLOAT64_C( -0.97), SIMDE_FLOAT64_C( -1.02),
SIMDE_FLOAT64_C( -0.62), SIMDE_FLOAT64_C( 0.42)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 0.68),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( 0.60),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 0.25)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 0.83),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 1.16),
SIMDE_FLOAT64_C( 1.53), SIMDE_FLOAT64_C( 0.69),
SIMDE_FLOAT64_C( 1.07), SIMDE_FLOAT64_C( 0.26)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_atanh_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_atanh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.35)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( 0.08),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( -0.27),
SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( -0.30),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( -0.75)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.83), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( -0.97)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.23),
SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.98),
SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( -0.38),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.42)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( -0.45),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.66), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -0.86)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.22), SIMDE_FLOAT64_C( -0.48),
SIMDE_FLOAT64_C( 0.85), SIMDE_FLOAT64_C( -0.98),
SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -1.29)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.26),
SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.99)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( -0.39),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.53),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( -0.77)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.62), SIMDE_FLOAT64_C( -0.41),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 0.59),
SIMDE_FLOAT64_C( 1.05), SIMDE_FLOAT64_C( -1.02),
SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( -1.02)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.91),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( -0.20),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.75)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( -0.17),
SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( 0.25),
SIMDE_FLOAT64_C( -0.08), SIMDE_FLOAT64_C( -0.94)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( -0.17),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 1.16),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( 0.26),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -1.74)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( -0.74),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.34),
SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( -0.53),
SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.66)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( 0.46),
SIMDE_FLOAT64_C( -0.18), SIMDE_FLOAT64_C( 0.32),
SIMDE_FLOAT64_C( -0.87), SIMDE_FLOAT64_C( -0.33),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.56)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.74),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.33),
SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( -0.53),
SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.63)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.02),
SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( 0.51),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.98)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( 0.84),
SIMDE_FLOAT64_C( -0.59), SIMDE_FLOAT64_C( 0.73),
SIMDE_FLOAT64_C( 0.62), SIMDE_FLOAT64_C( 0.14)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.45),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.02),
SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( 0.51),
SIMDE_FLOAT64_C( 0.73), SIMDE_FLOAT64_C( 0.14)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( -0.30),
SIMDE_FLOAT64_C( -0.70), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 0.92),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( -0.07)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 0.01),
SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( -0.57), SIMDE_FLOAT64_C( -0.34),
SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( -0.58)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.01),
SIMDE_FLOAT64_C( -0.70), SIMDE_FLOAT64_C( -1.02),
SIMDE_FLOAT64_C( -0.65), SIMDE_FLOAT64_C( -0.35),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( -0.66)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 0.94),
SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( -0.44),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.93),
SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( -0.18)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( -0.20),
SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( -0.03),
SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( -0.13)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.02), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( -0.20),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.03),
SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( -0.13)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_atanh_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_cdfnorm_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -993.83), SIMDE_FLOAT32_C( 92.27), SIMDE_FLOAT32_C( 208.35), SIMDE_FLOAT32_C( 761.44) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( -963.46), SIMDE_FLOAT32_C( 429.93), SIMDE_FLOAT32_C( 318.99), SIMDE_FLOAT32_C( 532.75) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( 677.31), SIMDE_FLOAT32_C( -552.55), SIMDE_FLOAT32_C( 344.89), SIMDE_FLOAT32_C( -275.73) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -396.40), SIMDE_FLOAT32_C( 319.50), SIMDE_FLOAT32_C( 348.88), SIMDE_FLOAT32_C( -732.73) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 638.44), SIMDE_FLOAT32_C( -14.16), SIMDE_FLOAT32_C( -165.87), SIMDE_FLOAT32_C( 843.45) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( -841.80), SIMDE_FLOAT32_C( -382.17), SIMDE_FLOAT32_C( -889.98), SIMDE_FLOAT32_C( 238.69) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( -193.56), SIMDE_FLOAT32_C( 381.13), SIMDE_FLOAT32_C( -623.80), SIMDE_FLOAT32_C( -46.41) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 798.25), SIMDE_FLOAT32_C( -366.96), SIMDE_FLOAT32_C( 249.70), SIMDE_FLOAT32_C( 804.43) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_cdfnorm_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_cdfnorm_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -954.47), SIMDE_FLOAT64_C( -900.72) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 375.82), SIMDE_FLOAT64_C( 323.80) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( -882.15), SIMDE_FLOAT64_C( -872.83) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -880.22), SIMDE_FLOAT64_C( 404.86) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( 587.17), SIMDE_FLOAT64_C( 674.97) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( -509.08), SIMDE_FLOAT64_C( -152.91) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -296.61), SIMDE_FLOAT64_C( 576.29) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( -858.64), SIMDE_FLOAT64_C( -995.64) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_cdfnorm_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_cdfnorm_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 818.12), SIMDE_FLOAT32_C( 842.04), SIMDE_FLOAT32_C( -990.82), SIMDE_FLOAT32_C( -180.40),
SIMDE_FLOAT32_C( -703.48), SIMDE_FLOAT32_C( -658.67), SIMDE_FLOAT32_C( -675.01), SIMDE_FLOAT32_C( -213.67) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -312.75), SIMDE_FLOAT32_C( -440.95), SIMDE_FLOAT32_C( 40.83), SIMDE_FLOAT32_C( -601.56),
SIMDE_FLOAT32_C( 516.51), SIMDE_FLOAT32_C( 64.68), SIMDE_FLOAT32_C( 765.54), SIMDE_FLOAT32_C( 383.86) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( -264.08), SIMDE_FLOAT32_C( -961.69), SIMDE_FLOAT32_C( 776.59), SIMDE_FLOAT32_C( -476.70),
SIMDE_FLOAT32_C( 398.19), SIMDE_FLOAT32_C( 561.61), SIMDE_FLOAT32_C( -253.27), SIMDE_FLOAT32_C( 994.83) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( -614.21), SIMDE_FLOAT32_C( 933.12), SIMDE_FLOAT32_C( 521.15), SIMDE_FLOAT32_C( 87.99),
SIMDE_FLOAT32_C( 511.16), SIMDE_FLOAT32_C( 278.58), SIMDE_FLOAT32_C( -327.57), SIMDE_FLOAT32_C( 329.28) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( 120.61), SIMDE_FLOAT32_C( -318.39), SIMDE_FLOAT32_C( -851.12), SIMDE_FLOAT32_C( 417.13),
SIMDE_FLOAT32_C( 22.95), SIMDE_FLOAT32_C( -526.13), SIMDE_FLOAT32_C( -796.54), SIMDE_FLOAT32_C( 710.20) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( 32.92), SIMDE_FLOAT32_C( 244.29), SIMDE_FLOAT32_C( -891.36), SIMDE_FLOAT32_C( -450.57),
SIMDE_FLOAT32_C( -691.03), SIMDE_FLOAT32_C( 874.17), SIMDE_FLOAT32_C( 933.29), SIMDE_FLOAT32_C( 44.89) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( 912.48), SIMDE_FLOAT32_C( 709.88), SIMDE_FLOAT32_C( 568.19), SIMDE_FLOAT32_C( 310.67),
SIMDE_FLOAT32_C( 271.49), SIMDE_FLOAT32_C( -685.08), SIMDE_FLOAT32_C( 305.50), SIMDE_FLOAT32_C( 657.28) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( -751.96), SIMDE_FLOAT32_C( -173.35), SIMDE_FLOAT32_C( -254.73), SIMDE_FLOAT32_C( 759.20),
SIMDE_FLOAT32_C( -894.77), SIMDE_FLOAT32_C( 417.70), SIMDE_FLOAT32_C( 88.48), SIMDE_FLOAT32_C( 225.84) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_cdfnorm_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_cdfnorm_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -924.75), SIMDE_FLOAT64_C( -974.37), SIMDE_FLOAT64_C( -748.27), SIMDE_FLOAT64_C( -367.36) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -632.95), SIMDE_FLOAT64_C( 220.99), SIMDE_FLOAT64_C( 820.62), SIMDE_FLOAT64_C( -652.24) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -811.15), SIMDE_FLOAT64_C( -815.96), SIMDE_FLOAT64_C( 903.78), SIMDE_FLOAT64_C( 978.99) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( -359.97), SIMDE_FLOAT64_C( -262.68), SIMDE_FLOAT64_C( -977.31), SIMDE_FLOAT64_C( -241.69) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 96.53), SIMDE_FLOAT64_C( 838.57), SIMDE_FLOAT64_C( 179.14), SIMDE_FLOAT64_C( 108.78) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( -69.02), SIMDE_FLOAT64_C( -39.14), SIMDE_FLOAT64_C( 24.34), SIMDE_FLOAT64_C( -579.34) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 73.79), SIMDE_FLOAT64_C( 99.84), SIMDE_FLOAT64_C( 430.49), SIMDE_FLOAT64_C( 713.26) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( -127.22), SIMDE_FLOAT64_C( -439.34), SIMDE_FLOAT64_C( -849.37), SIMDE_FLOAT64_C( -51.97) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_cdfnorm_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_cdfnorm_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -171.83), SIMDE_FLOAT32_C( -16.40), SIMDE_FLOAT32_C( -352.71), SIMDE_FLOAT32_C( -355.76),
SIMDE_FLOAT32_C( -532.92), SIMDE_FLOAT32_C( -657.24), SIMDE_FLOAT32_C( -31.51), SIMDE_FLOAT32_C( -403.96),
SIMDE_FLOAT32_C( 10.99), SIMDE_FLOAT32_C( -120.77), SIMDE_FLOAT32_C( 317.51), SIMDE_FLOAT32_C( 262.42),
SIMDE_FLOAT32_C( 830.85), SIMDE_FLOAT32_C( -503.76), SIMDE_FLOAT32_C( 762.65), SIMDE_FLOAT32_C( -301.62) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 330.53), SIMDE_FLOAT32_C( 478.14), SIMDE_FLOAT32_C( -836.82), SIMDE_FLOAT32_C( 378.71),
SIMDE_FLOAT32_C( 784.61), SIMDE_FLOAT32_C( 602.57), SIMDE_FLOAT32_C( 441.59), SIMDE_FLOAT32_C( -912.33),
SIMDE_FLOAT32_C( -474.27), SIMDE_FLOAT32_C( 991.91), SIMDE_FLOAT32_C( 893.21), SIMDE_FLOAT32_C( 55.17),
SIMDE_FLOAT32_C( -251.62), SIMDE_FLOAT32_C( 632.38), SIMDE_FLOAT32_C( 573.89), SIMDE_FLOAT32_C( 576.55) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( -384.02), SIMDE_FLOAT32_C( -778.82), SIMDE_FLOAT32_C( -779.21), SIMDE_FLOAT32_C( 83.07),
SIMDE_FLOAT32_C( -436.06), SIMDE_FLOAT32_C( 189.28), SIMDE_FLOAT32_C( 679.10), SIMDE_FLOAT32_C( 574.93),
SIMDE_FLOAT32_C( -931.49), SIMDE_FLOAT32_C( -3.39), SIMDE_FLOAT32_C( -162.65), SIMDE_FLOAT32_C( 899.36),
SIMDE_FLOAT32_C( 492.85), SIMDE_FLOAT32_C( -399.99), SIMDE_FLOAT32_C( -402.27), SIMDE_FLOAT32_C( -176.62) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -921.85), SIMDE_FLOAT32_C( -239.09), SIMDE_FLOAT32_C( -797.90), SIMDE_FLOAT32_C( 862.75),
SIMDE_FLOAT32_C( -636.52), SIMDE_FLOAT32_C( 643.69), SIMDE_FLOAT32_C( 950.42), SIMDE_FLOAT32_C( -110.78),
SIMDE_FLOAT32_C( 635.59), SIMDE_FLOAT32_C( 843.63), SIMDE_FLOAT32_C( 944.39), SIMDE_FLOAT32_C( -616.03),
SIMDE_FLOAT32_C( 476.02), SIMDE_FLOAT32_C( 518.27), SIMDE_FLOAT32_C( 960.52), SIMDE_FLOAT32_C( -908.00) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 739.45), SIMDE_FLOAT32_C( -818.69), SIMDE_FLOAT32_C( 175.06), SIMDE_FLOAT32_C( -696.61),
SIMDE_FLOAT32_C( 370.60), SIMDE_FLOAT32_C( -145.84), SIMDE_FLOAT32_C( 878.31), SIMDE_FLOAT32_C( 439.11),
SIMDE_FLOAT32_C( 850.77), SIMDE_FLOAT32_C( -284.33), SIMDE_FLOAT32_C( 338.47), SIMDE_FLOAT32_C( 343.62),
SIMDE_FLOAT32_C( 315.67), SIMDE_FLOAT32_C( 936.20), SIMDE_FLOAT32_C( -832.99), SIMDE_FLOAT32_C( 393.82) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( -302.88), SIMDE_FLOAT32_C( -630.90), SIMDE_FLOAT32_C( 256.57), SIMDE_FLOAT32_C( 60.60),
SIMDE_FLOAT32_C( -987.21), SIMDE_FLOAT32_C( 206.99), SIMDE_FLOAT32_C( 949.82), SIMDE_FLOAT32_C( 648.38),
SIMDE_FLOAT32_C( 50.62), SIMDE_FLOAT32_C( 894.21), SIMDE_FLOAT32_C( -967.65), SIMDE_FLOAT32_C( -473.36),
SIMDE_FLOAT32_C( 412.48), SIMDE_FLOAT32_C( 992.88), SIMDE_FLOAT32_C( -381.36), SIMDE_FLOAT32_C( 151.93) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( -825.81), SIMDE_FLOAT32_C( 793.70), SIMDE_FLOAT32_C( 455.32), SIMDE_FLOAT32_C( 544.79),
SIMDE_FLOAT32_C( -352.14), SIMDE_FLOAT32_C( 333.63), SIMDE_FLOAT32_C( -16.10), SIMDE_FLOAT32_C( -501.36),
SIMDE_FLOAT32_C( -950.70), SIMDE_FLOAT32_C( -677.63), SIMDE_FLOAT32_C( 842.26), SIMDE_FLOAT32_C( 364.97),
SIMDE_FLOAT32_C( -741.43), SIMDE_FLOAT32_C( -990.74), SIMDE_FLOAT32_C( -241.21), SIMDE_FLOAT32_C( -44.31) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -621.64), SIMDE_FLOAT32_C( -984.64), SIMDE_FLOAT32_C( -983.70), SIMDE_FLOAT32_C( -608.85),
SIMDE_FLOAT32_C( 222.35), SIMDE_FLOAT32_C( 966.12), SIMDE_FLOAT32_C( -960.47), SIMDE_FLOAT32_C( -727.02),
SIMDE_FLOAT32_C( 860.32), SIMDE_FLOAT32_C( -928.11), SIMDE_FLOAT32_C( -200.38), SIMDE_FLOAT32_C( 272.80),
SIMDE_FLOAT32_C( -935.24), SIMDE_FLOAT32_C( 418.26), SIMDE_FLOAT32_C( -575.27), SIMDE_FLOAT32_C( -761.04) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_cdfnorm_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_cdfnorm_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 742.28), SIMDE_FLOAT32_C( -10.25), SIMDE_FLOAT32_C( -827.23), SIMDE_FLOAT32_C( 995.37),
SIMDE_FLOAT32_C( 256.37), SIMDE_FLOAT32_C( 283.72), SIMDE_FLOAT32_C( -388.62), SIMDE_FLOAT32_C( -979.71),
SIMDE_FLOAT32_C( -680.17), SIMDE_FLOAT32_C( -749.87), SIMDE_FLOAT32_C( -71.05), SIMDE_FLOAT32_C( -60.71),
SIMDE_FLOAT32_C( -405.48), SIMDE_FLOAT32_C( 786.24), SIMDE_FLOAT32_C( -561.14), SIMDE_FLOAT32_C( 561.28) },
UINT8_C(133),
{ SIMDE_FLOAT32_C( 409.19), SIMDE_FLOAT32_C( -492.65), SIMDE_FLOAT32_C( 57.95), SIMDE_FLOAT32_C( -250.60),
SIMDE_FLOAT32_C( -403.16), SIMDE_FLOAT32_C( 437.65), SIMDE_FLOAT32_C( 509.49), SIMDE_FLOAT32_C( -69.63),
SIMDE_FLOAT32_C( 308.33), SIMDE_FLOAT32_C( 780.29), SIMDE_FLOAT32_C( -943.64), SIMDE_FLOAT32_C( 322.23),
SIMDE_FLOAT32_C( 242.19), SIMDE_FLOAT32_C( 643.12), SIMDE_FLOAT32_C( 64.51), SIMDE_FLOAT32_C( -768.06) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -10.25), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 995.37),
SIMDE_FLOAT32_C( 256.37), SIMDE_FLOAT32_C( 283.72), SIMDE_FLOAT32_C( -388.62), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -680.17), SIMDE_FLOAT32_C( -749.87), SIMDE_FLOAT32_C( -71.05), SIMDE_FLOAT32_C( -60.71),
SIMDE_FLOAT32_C( -405.48), SIMDE_FLOAT32_C( 786.24), SIMDE_FLOAT32_C( -561.14), SIMDE_FLOAT32_C( 561.28) } },
{ { SIMDE_FLOAT32_C( 815.89), SIMDE_FLOAT32_C( 59.87), SIMDE_FLOAT32_C( 488.31), SIMDE_FLOAT32_C( 99.61),
SIMDE_FLOAT32_C( 671.25), SIMDE_FLOAT32_C( 508.61), SIMDE_FLOAT32_C( 419.45), SIMDE_FLOAT32_C( 921.38),
SIMDE_FLOAT32_C( -562.45), SIMDE_FLOAT32_C( -641.27), SIMDE_FLOAT32_C( -484.11), SIMDE_FLOAT32_C( -776.21),
SIMDE_FLOAT32_C( -202.41), SIMDE_FLOAT32_C( -922.83), SIMDE_FLOAT32_C( -317.45), SIMDE_FLOAT32_C( -793.22) },
UINT8_C(110),
{ SIMDE_FLOAT32_C( 740.50), SIMDE_FLOAT32_C( -43.82), SIMDE_FLOAT32_C( 181.36), SIMDE_FLOAT32_C( 178.15),
SIMDE_FLOAT32_C( -534.33), SIMDE_FLOAT32_C( -888.27), SIMDE_FLOAT32_C( -513.52), SIMDE_FLOAT32_C( -754.04),
SIMDE_FLOAT32_C( -831.91), SIMDE_FLOAT32_C( 808.71), SIMDE_FLOAT32_C( 488.15), SIMDE_FLOAT32_C( 811.21),
SIMDE_FLOAT32_C( -126.78), SIMDE_FLOAT32_C( 720.09), SIMDE_FLOAT32_C( 627.10), SIMDE_FLOAT32_C( 933.09) },
{ SIMDE_FLOAT32_C( 815.89), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 671.25), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 921.38),
SIMDE_FLOAT32_C( -562.45), SIMDE_FLOAT32_C( -641.27), SIMDE_FLOAT32_C( -484.11), SIMDE_FLOAT32_C( -776.21),
SIMDE_FLOAT32_C( -202.41), SIMDE_FLOAT32_C( -922.83), SIMDE_FLOAT32_C( -317.45), SIMDE_FLOAT32_C( -793.22) } },
{ { SIMDE_FLOAT32_C( 208.40), SIMDE_FLOAT32_C( -273.28), SIMDE_FLOAT32_C( 604.34), SIMDE_FLOAT32_C( -282.99),
SIMDE_FLOAT32_C( -853.84), SIMDE_FLOAT32_C( 525.72), SIMDE_FLOAT32_C( 154.57), SIMDE_FLOAT32_C( -495.10),
SIMDE_FLOAT32_C( -958.39), SIMDE_FLOAT32_C( 378.36), SIMDE_FLOAT32_C( 302.49), SIMDE_FLOAT32_C( -881.22),
SIMDE_FLOAT32_C( -939.09), SIMDE_FLOAT32_C( 509.27), SIMDE_FLOAT32_C( -296.70), SIMDE_FLOAT32_C( 801.40) },
UINT8_C(108),
{ SIMDE_FLOAT32_C( 884.66), SIMDE_FLOAT32_C( -20.45), SIMDE_FLOAT32_C( -68.88), SIMDE_FLOAT32_C( 996.39),
SIMDE_FLOAT32_C( 466.03), SIMDE_FLOAT32_C( 177.08), SIMDE_FLOAT32_C( -835.52), SIMDE_FLOAT32_C( 274.74),
SIMDE_FLOAT32_C( -334.77), SIMDE_FLOAT32_C( 975.69), SIMDE_FLOAT32_C( -852.04), SIMDE_FLOAT32_C( -614.68),
SIMDE_FLOAT32_C( 602.80), SIMDE_FLOAT32_C( -918.95), SIMDE_FLOAT32_C( 593.73), SIMDE_FLOAT32_C( -670.48) },
{ SIMDE_FLOAT32_C( 208.40), SIMDE_FLOAT32_C( -273.28), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -853.84), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -495.10),
SIMDE_FLOAT32_C( -958.39), SIMDE_FLOAT32_C( 378.36), SIMDE_FLOAT32_C( 302.49), SIMDE_FLOAT32_C( -881.22),
SIMDE_FLOAT32_C( -939.09), SIMDE_FLOAT32_C( 509.27), SIMDE_FLOAT32_C( -296.70), SIMDE_FLOAT32_C( 801.40) } },
{ { SIMDE_FLOAT32_C( 685.39), SIMDE_FLOAT32_C( -689.26), SIMDE_FLOAT32_C( -524.32), SIMDE_FLOAT32_C( 211.10),
SIMDE_FLOAT32_C( 465.30), SIMDE_FLOAT32_C( -19.43), SIMDE_FLOAT32_C( 252.72), SIMDE_FLOAT32_C( -156.34),
SIMDE_FLOAT32_C( -716.94), SIMDE_FLOAT32_C( 371.50), SIMDE_FLOAT32_C( -95.43), SIMDE_FLOAT32_C( 792.33),
SIMDE_FLOAT32_C( -925.20), SIMDE_FLOAT32_C( -294.03), SIMDE_FLOAT32_C( -742.21), SIMDE_FLOAT32_C( 959.46) },
UINT8_C(216),
{ SIMDE_FLOAT32_C( 188.91), SIMDE_FLOAT32_C( 955.85), SIMDE_FLOAT32_C( 151.56), SIMDE_FLOAT32_C( -634.01),
SIMDE_FLOAT32_C( -879.66), SIMDE_FLOAT32_C( -573.70), SIMDE_FLOAT32_C( 31.23), SIMDE_FLOAT32_C( -903.97),
SIMDE_FLOAT32_C( -425.74), SIMDE_FLOAT32_C( 416.55), SIMDE_FLOAT32_C( 698.83), SIMDE_FLOAT32_C( -344.69),
SIMDE_FLOAT32_C( 10.28), SIMDE_FLOAT32_C( -971.65), SIMDE_FLOAT32_C( -659.31), SIMDE_FLOAT32_C( 321.02) },
{ SIMDE_FLOAT32_C( 685.39), SIMDE_FLOAT32_C( -689.26), SIMDE_FLOAT32_C( -524.32), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -19.43), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -716.94), SIMDE_FLOAT32_C( 371.50), SIMDE_FLOAT32_C( -95.43), SIMDE_FLOAT32_C( 792.33),
SIMDE_FLOAT32_C( -925.20), SIMDE_FLOAT32_C( -294.03), SIMDE_FLOAT32_C( -742.21), SIMDE_FLOAT32_C( 959.46) } },
{ { SIMDE_FLOAT32_C( -495.97), SIMDE_FLOAT32_C( 551.80), SIMDE_FLOAT32_C( -213.68), SIMDE_FLOAT32_C( 484.60),
SIMDE_FLOAT32_C( -195.49), SIMDE_FLOAT32_C( 629.98), SIMDE_FLOAT32_C( 767.66), SIMDE_FLOAT32_C( -823.99),
SIMDE_FLOAT32_C( -465.45), SIMDE_FLOAT32_C( 560.00), SIMDE_FLOAT32_C( -749.18), SIMDE_FLOAT32_C( 240.52),
SIMDE_FLOAT32_C( 817.78), SIMDE_FLOAT32_C( -789.72), SIMDE_FLOAT32_C( -73.95), SIMDE_FLOAT32_C( 6.69) },
UINT8_C(202),
{ SIMDE_FLOAT32_C( -922.39), SIMDE_FLOAT32_C( 372.68), SIMDE_FLOAT32_C( -713.53), SIMDE_FLOAT32_C( -496.09),
SIMDE_FLOAT32_C( -596.09), SIMDE_FLOAT32_C( -617.49), SIMDE_FLOAT32_C( 78.17), SIMDE_FLOAT32_C( 820.46),
SIMDE_FLOAT32_C( -918.66), SIMDE_FLOAT32_C( 733.47), SIMDE_FLOAT32_C( -169.26), SIMDE_FLOAT32_C( -890.32),
SIMDE_FLOAT32_C( -925.83), SIMDE_FLOAT32_C( -848.24), SIMDE_FLOAT32_C( -386.29), SIMDE_FLOAT32_C( 625.96) },
{ SIMDE_FLOAT32_C( -495.97), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -213.68), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -195.49), SIMDE_FLOAT32_C( 629.98), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -465.45), SIMDE_FLOAT32_C( 560.00), SIMDE_FLOAT32_C( -749.18), SIMDE_FLOAT32_C( 240.52),
SIMDE_FLOAT32_C( 817.78), SIMDE_FLOAT32_C( -789.72), SIMDE_FLOAT32_C( -73.95), SIMDE_FLOAT32_C( 6.69) } },
{ { SIMDE_FLOAT32_C( -61.91), SIMDE_FLOAT32_C( -901.69), SIMDE_FLOAT32_C( -569.52), SIMDE_FLOAT32_C( -431.93),
SIMDE_FLOAT32_C( 865.97), SIMDE_FLOAT32_C( -393.51), SIMDE_FLOAT32_C( 102.62), SIMDE_FLOAT32_C( 425.97),
SIMDE_FLOAT32_C( -142.69), SIMDE_FLOAT32_C( -656.86), SIMDE_FLOAT32_C( 243.75), SIMDE_FLOAT32_C( 67.59),
SIMDE_FLOAT32_C( 269.19), SIMDE_FLOAT32_C( -749.56), SIMDE_FLOAT32_C( 233.72), SIMDE_FLOAT32_C( 346.79) },
UINT8_C(117),
{ SIMDE_FLOAT32_C( 520.19), SIMDE_FLOAT32_C( 850.70), SIMDE_FLOAT32_C( -972.96), SIMDE_FLOAT32_C( 902.70),
SIMDE_FLOAT32_C( -71.13), SIMDE_FLOAT32_C( 847.50), SIMDE_FLOAT32_C( 984.04), SIMDE_FLOAT32_C( -337.66),
SIMDE_FLOAT32_C( -321.75), SIMDE_FLOAT32_C( -906.28), SIMDE_FLOAT32_C( -263.49), SIMDE_FLOAT32_C( -169.99),
SIMDE_FLOAT32_C( -292.57), SIMDE_FLOAT32_C( -637.53), SIMDE_FLOAT32_C( 768.10), SIMDE_FLOAT32_C( -194.26) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -901.69), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -431.93),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 425.97),
SIMDE_FLOAT32_C( -142.69), SIMDE_FLOAT32_C( -656.86), SIMDE_FLOAT32_C( 243.75), SIMDE_FLOAT32_C( 67.59),
SIMDE_FLOAT32_C( 269.19), SIMDE_FLOAT32_C( -749.56), SIMDE_FLOAT32_C( 233.72), SIMDE_FLOAT32_C( 346.79) } },
{ { SIMDE_FLOAT32_C( -207.05), SIMDE_FLOAT32_C( -663.84), SIMDE_FLOAT32_C( -328.29), SIMDE_FLOAT32_C( 399.44),
SIMDE_FLOAT32_C( 438.78), SIMDE_FLOAT32_C( -902.33), SIMDE_FLOAT32_C( -743.25), SIMDE_FLOAT32_C( 781.93),
SIMDE_FLOAT32_C( 341.42), SIMDE_FLOAT32_C( 324.33), SIMDE_FLOAT32_C( 51.11), SIMDE_FLOAT32_C( 591.87),
SIMDE_FLOAT32_C( -441.94), SIMDE_FLOAT32_C( -602.09), SIMDE_FLOAT32_C( 214.99), SIMDE_FLOAT32_C( -921.75) },
UINT8_MAX,
{ SIMDE_FLOAT32_C( 242.04), SIMDE_FLOAT32_C( 980.95), SIMDE_FLOAT32_C( 177.48), SIMDE_FLOAT32_C( 89.54),
SIMDE_FLOAT32_C( 964.99), SIMDE_FLOAT32_C( 839.82), SIMDE_FLOAT32_C( 767.79), SIMDE_FLOAT32_C( -941.29),
SIMDE_FLOAT32_C( -423.68), SIMDE_FLOAT32_C( -402.20), SIMDE_FLOAT32_C( -233.86), SIMDE_FLOAT32_C( -61.21),
SIMDE_FLOAT32_C( -634.11), SIMDE_FLOAT32_C( 571.87), SIMDE_FLOAT32_C( 731.74), SIMDE_FLOAT32_C( -297.94) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 341.42), SIMDE_FLOAT32_C( 324.33), SIMDE_FLOAT32_C( 51.11), SIMDE_FLOAT32_C( 591.87),
SIMDE_FLOAT32_C( -441.94), SIMDE_FLOAT32_C( -602.09), SIMDE_FLOAT32_C( 214.99), SIMDE_FLOAT32_C( -921.75) } },
{ { SIMDE_FLOAT32_C( -756.42), SIMDE_FLOAT32_C( 131.18), SIMDE_FLOAT32_C( -859.16), SIMDE_FLOAT32_C( -658.75),
SIMDE_FLOAT32_C( 387.93), SIMDE_FLOAT32_C( 922.77), SIMDE_FLOAT32_C( 682.68), SIMDE_FLOAT32_C( -287.73),
SIMDE_FLOAT32_C( -26.12), SIMDE_FLOAT32_C( 274.55), SIMDE_FLOAT32_C( 270.32), SIMDE_FLOAT32_C( 371.79),
SIMDE_FLOAT32_C( -510.46), SIMDE_FLOAT32_C( 348.57), SIMDE_FLOAT32_C( 620.40), SIMDE_FLOAT32_C( 731.58) },
UINT8_C(111),
{ SIMDE_FLOAT32_C( -202.12), SIMDE_FLOAT32_C( -178.88), SIMDE_FLOAT32_C( 294.51), SIMDE_FLOAT32_C( -362.30),
SIMDE_FLOAT32_C( -411.10), SIMDE_FLOAT32_C( 353.22), SIMDE_FLOAT32_C( 214.02), SIMDE_FLOAT32_C( 186.70),
SIMDE_FLOAT32_C( -880.64), SIMDE_FLOAT32_C( -847.18), SIMDE_FLOAT32_C( 552.59), SIMDE_FLOAT32_C( 691.24),
SIMDE_FLOAT32_C( 884.56), SIMDE_FLOAT32_C( -745.35), SIMDE_FLOAT32_C( 934.82), SIMDE_FLOAT32_C( 15.74) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 387.93), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -287.73),
SIMDE_FLOAT32_C( -26.12), SIMDE_FLOAT32_C( 274.55), SIMDE_FLOAT32_C( 270.32), SIMDE_FLOAT32_C( 371.79),
SIMDE_FLOAT32_C( -510.46), SIMDE_FLOAT32_C( 348.57), SIMDE_FLOAT32_C( 620.40), SIMDE_FLOAT32_C( 731.58) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_cdfnorm_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_cdfnorm_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 515.78), SIMDE_FLOAT64_C( -190.13), SIMDE_FLOAT64_C( -905.08), SIMDE_FLOAT64_C( 734.43),
SIMDE_FLOAT64_C( -737.45), SIMDE_FLOAT64_C( 98.47), SIMDE_FLOAT64_C( -95.41), SIMDE_FLOAT64_C( -675.32) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -274.83), SIMDE_FLOAT64_C( 838.86), SIMDE_FLOAT64_C( -796.42), SIMDE_FLOAT64_C( 478.49),
SIMDE_FLOAT64_C( 554.96), SIMDE_FLOAT64_C( -640.77), SIMDE_FLOAT64_C( -29.13), SIMDE_FLOAT64_C( -94.09) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 398.68), SIMDE_FLOAT64_C( 316.09), SIMDE_FLOAT64_C( 332.14), SIMDE_FLOAT64_C( 590.41),
SIMDE_FLOAT64_C( -417.40), SIMDE_FLOAT64_C( -789.19), SIMDE_FLOAT64_C( -493.08), SIMDE_FLOAT64_C( 967.90) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( -877.90), SIMDE_FLOAT64_C( 49.76), SIMDE_FLOAT64_C( 604.59), SIMDE_FLOAT64_C( -550.52),
SIMDE_FLOAT64_C( -548.72), SIMDE_FLOAT64_C( 124.59), SIMDE_FLOAT64_C( 499.19), SIMDE_FLOAT64_C( 967.06) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( 934.46), SIMDE_FLOAT64_C( 594.11), SIMDE_FLOAT64_C( 701.49), SIMDE_FLOAT64_C( -802.98),
SIMDE_FLOAT64_C( -307.42), SIMDE_FLOAT64_C( -393.92), SIMDE_FLOAT64_C( -478.30), SIMDE_FLOAT64_C( 417.75) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( -555.06), SIMDE_FLOAT64_C( -274.72), SIMDE_FLOAT64_C( -103.76), SIMDE_FLOAT64_C( 999.90),
SIMDE_FLOAT64_C( 84.51), SIMDE_FLOAT64_C( 867.11), SIMDE_FLOAT64_C( -94.19), SIMDE_FLOAT64_C( -516.80) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 183.20), SIMDE_FLOAT64_C( -762.05), SIMDE_FLOAT64_C( -926.39), SIMDE_FLOAT64_C( 765.80),
SIMDE_FLOAT64_C( -551.23), SIMDE_FLOAT64_C( -419.47), SIMDE_FLOAT64_C( 733.70), SIMDE_FLOAT64_C( -429.13) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 630.29), SIMDE_FLOAT64_C( 338.28), SIMDE_FLOAT64_C( 20.35), SIMDE_FLOAT64_C( -918.43),
SIMDE_FLOAT64_C( -537.13), SIMDE_FLOAT64_C( -480.46), SIMDE_FLOAT64_C( -951.37), SIMDE_FLOAT64_C( -602.66) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_cdfnorm_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_cdfnorm_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -488.95), SIMDE_FLOAT64_C( 602.82), SIMDE_FLOAT64_C( 180.74), SIMDE_FLOAT64_C( -325.95),
SIMDE_FLOAT64_C( -721.92), SIMDE_FLOAT64_C( 512.04), SIMDE_FLOAT64_C( 182.27), SIMDE_FLOAT64_C( -392.39) },
UINT8_C( 25),
{ SIMDE_FLOAT64_C( -174.69), SIMDE_FLOAT64_C( 219.93), SIMDE_FLOAT64_C( 649.77), SIMDE_FLOAT64_C( -892.75),
SIMDE_FLOAT64_C( -136.71), SIMDE_FLOAT64_C( -906.14), SIMDE_FLOAT64_C( 643.57), SIMDE_FLOAT64_C( 669.62) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 602.82), SIMDE_FLOAT64_C( 180.74), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 512.04), SIMDE_FLOAT64_C( 182.27), SIMDE_FLOAT64_C( -392.39) } },
{ { SIMDE_FLOAT64_C( -655.46), SIMDE_FLOAT64_C( 837.15), SIMDE_FLOAT64_C( 772.04), SIMDE_FLOAT64_C( 272.82),
SIMDE_FLOAT64_C( 490.61), SIMDE_FLOAT64_C( 38.88), SIMDE_FLOAT64_C( -668.93), SIMDE_FLOAT64_C( -501.66) },
UINT8_C(232),
{ SIMDE_FLOAT64_C( -130.58), SIMDE_FLOAT64_C( 219.17), SIMDE_FLOAT64_C( 309.61), SIMDE_FLOAT64_C( -572.70),
SIMDE_FLOAT64_C( 851.68), SIMDE_FLOAT64_C( 820.66), SIMDE_FLOAT64_C( -969.88), SIMDE_FLOAT64_C( 32.42) },
{ SIMDE_FLOAT64_C( -655.46), SIMDE_FLOAT64_C( 837.15), SIMDE_FLOAT64_C( 772.04), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 490.61), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( -505.29), SIMDE_FLOAT64_C( -691.80), SIMDE_FLOAT64_C( -455.53), SIMDE_FLOAT64_C( 676.98),
SIMDE_FLOAT64_C( -84.19), SIMDE_FLOAT64_C( -340.34), SIMDE_FLOAT64_C( -497.71), SIMDE_FLOAT64_C( -864.27) },
UINT8_C(183),
{ SIMDE_FLOAT64_C( -390.46), SIMDE_FLOAT64_C( -0.97), SIMDE_FLOAT64_C( -596.71), SIMDE_FLOAT64_C( -746.89),
SIMDE_FLOAT64_C( -331.35), SIMDE_FLOAT64_C( -252.17), SIMDE_FLOAT64_C( -909.75), SIMDE_FLOAT64_C( -559.31) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 676.98),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -497.71), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -979.36), SIMDE_FLOAT64_C( 580.86), SIMDE_FLOAT64_C( 479.57), SIMDE_FLOAT64_C( -648.29),
SIMDE_FLOAT64_C( -920.80), SIMDE_FLOAT64_C( 377.46), SIMDE_FLOAT64_C( 221.14), SIMDE_FLOAT64_C( 298.38) },
UINT8_C(194),
{ SIMDE_FLOAT64_C( 648.44), SIMDE_FLOAT64_C( 150.06), SIMDE_FLOAT64_C( -492.27), SIMDE_FLOAT64_C( 678.56),
SIMDE_FLOAT64_C( -817.52), SIMDE_FLOAT64_C( 2.44), SIMDE_FLOAT64_C( 986.76), SIMDE_FLOAT64_C( -273.05) },
{ SIMDE_FLOAT64_C( -979.36), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 479.57), SIMDE_FLOAT64_C( -648.29),
SIMDE_FLOAT64_C( -920.80), SIMDE_FLOAT64_C( 377.46), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -320.57), SIMDE_FLOAT64_C( -97.43), SIMDE_FLOAT64_C( 386.61), SIMDE_FLOAT64_C( 181.71),
SIMDE_FLOAT64_C( 38.30), SIMDE_FLOAT64_C( 696.05), SIMDE_FLOAT64_C( 791.25), SIMDE_FLOAT64_C( -962.67) },
UINT8_C(160),
{ SIMDE_FLOAT64_C( -955.64), SIMDE_FLOAT64_C( -294.02), SIMDE_FLOAT64_C( -152.83), SIMDE_FLOAT64_C( -865.39),
SIMDE_FLOAT64_C( 146.67), SIMDE_FLOAT64_C( -132.19), SIMDE_FLOAT64_C( 715.47), SIMDE_FLOAT64_C( -373.76) },
{ SIMDE_FLOAT64_C( -320.57), SIMDE_FLOAT64_C( -97.43), SIMDE_FLOAT64_C( 386.61), SIMDE_FLOAT64_C( 181.71),
SIMDE_FLOAT64_C( 38.30), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 791.25), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 219.52), SIMDE_FLOAT64_C( 794.68), SIMDE_FLOAT64_C( -996.30), SIMDE_FLOAT64_C( -559.34),
SIMDE_FLOAT64_C( 93.05), SIMDE_FLOAT64_C( -309.23), SIMDE_FLOAT64_C( -910.90), SIMDE_FLOAT64_C( -756.89) },
UINT8_C( 25),
{ SIMDE_FLOAT64_C( 767.66), SIMDE_FLOAT64_C( -574.40), SIMDE_FLOAT64_C( -799.05), SIMDE_FLOAT64_C( 754.42),
SIMDE_FLOAT64_C( 152.54), SIMDE_FLOAT64_C( -119.63), SIMDE_FLOAT64_C( -343.01), SIMDE_FLOAT64_C( -460.84) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 794.68), SIMDE_FLOAT64_C( -996.30), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -309.23), SIMDE_FLOAT64_C( -910.90), SIMDE_FLOAT64_C( -756.89) } },
{ { SIMDE_FLOAT64_C( -937.91), SIMDE_FLOAT64_C( 695.30), SIMDE_FLOAT64_C( -764.79), SIMDE_FLOAT64_C( 853.34),
SIMDE_FLOAT64_C( 732.63), SIMDE_FLOAT64_C( -665.45), SIMDE_FLOAT64_C( 897.70), SIMDE_FLOAT64_C( -561.39) },
UINT8_C(185),
{ SIMDE_FLOAT64_C( -967.69), SIMDE_FLOAT64_C( 585.27), SIMDE_FLOAT64_C( -950.48), SIMDE_FLOAT64_C( 747.78),
SIMDE_FLOAT64_C( -788.49), SIMDE_FLOAT64_C( 269.05), SIMDE_FLOAT64_C( 542.46), SIMDE_FLOAT64_C( -784.79) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 695.30), SIMDE_FLOAT64_C( -764.79), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 897.70), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 709.71), SIMDE_FLOAT64_C( -364.49), SIMDE_FLOAT64_C( -94.02), SIMDE_FLOAT64_C( 798.81),
SIMDE_FLOAT64_C( -121.37), SIMDE_FLOAT64_C( -895.52), SIMDE_FLOAT64_C( 566.47), SIMDE_FLOAT64_C( 304.22) },
UINT8_C(190),
{ SIMDE_FLOAT64_C( 320.89), SIMDE_FLOAT64_C( -543.23), SIMDE_FLOAT64_C( 185.80), SIMDE_FLOAT64_C( 977.88),
SIMDE_FLOAT64_C( -4.07), SIMDE_FLOAT64_C( 247.88), SIMDE_FLOAT64_C( 673.18), SIMDE_FLOAT64_C( 231.13) },
{ SIMDE_FLOAT64_C( 709.71), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 566.47), SIMDE_FLOAT64_C( 1.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_cdfnorm_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_cdfnorminv_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.90) },
{ SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 1.28) } },
{ { SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.57) },
{ SIMDE_FLOAT32_C( -1.41), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.18) } },
{ { SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.19) },
{ SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.88) } },
{ { SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.53) },
{ SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( -2.05), SIMDE_FLOAT32_C( 0.08) } },
{ { SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.99) },
{ SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( 2.33) } },
{ { SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.81) },
{ SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 0.88) } },
{ { SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.92) },
{ SIMDE_FLOAT32_C( -1.34), SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 1.41) } },
{ { SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.84) },
{ SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.99) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_cdfnorminv_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_cdfnorminv_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 0.77) },
{ SIMDE_FLOAT64_C( 0.77), SIMDE_FLOAT64_C( 0.74) } },
{ { SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 0.34) },
{ SIMDE_FLOAT64_C( 1.48), SIMDE_FLOAT64_C( -0.41) } },
{ { SIMDE_FLOAT64_C( 0.02), SIMDE_FLOAT64_C( 0.32) },
{ SIMDE_FLOAT64_C( -2.05), SIMDE_FLOAT64_C( -0.47) } },
{ { SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( 0.80) },
{ SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 0.84) } },
{ { SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.03) },
{ SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( -1.88) } },
{ { SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( 0.02) },
{ SIMDE_FLOAT64_C( 0.28), SIMDE_FLOAT64_C( -2.05) } },
{ { SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.81) },
{ SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 0.88) } },
{ { SIMDE_FLOAT64_C( 0.77), SIMDE_FLOAT64_C( 0.04) },
{ SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( -1.75) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_cdfnorminv_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_cdfnorminv_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.19) },
{ SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.28),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.88) } },
{ { SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 0.37),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.16) },
{ SIMDE_FLOAT32_C( -1.55), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( -0.33),
SIMDE_FLOAT32_C( -0.95), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.99) } },
{ { SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.74),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.62) },
{ SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( -1.55), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( 0.31) } },
{ { SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.97),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.22) },
{ SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -1.41), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 1.88),
SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.77) } },
{ { SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.55) },
{ SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -1.48), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.25),
SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.13) } },
{ { SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.32) },
{ SIMDE_FLOAT32_C( -0.95), SIMDE_FLOAT32_C( -1.88), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( -0.41),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.47) } },
{ { SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.75) },
{ SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -1.55), SIMDE_FLOAT32_C( 1.23),
SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( -1.75), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.67) } },
{ { SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.85) },
{ SIMDE_FLOAT32_C( -1.17), SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.71),
SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( 1.04) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
// simde__m256 b = simde_mm256_loadu_ps(test_vec[i].b);
simde__m256 r = simde_mm256_cdfnorminv_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_cdfnorminv_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( 0.82), SIMDE_FLOAT64_C( 0.90) },
{ SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 1.28) } },
{ { SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.88) },
{ SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 1.17) } },
{ { SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.78) },
{ SIMDE_FLOAT64_C( -0.74), SIMDE_FLOAT64_C( -0.25), SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( 0.77) } },
{ { SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 0.63), SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.44) },
{ SIMDE_FLOAT64_C( -0.67), SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( 0.71), SIMDE_FLOAT64_C( -0.15) } },
{ { SIMDE_FLOAT64_C( 0.41), SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.53) },
{ SIMDE_FLOAT64_C( -0.23), SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( 0.08) } },
{ { SIMDE_FLOAT64_C( 0.36), SIMDE_FLOAT64_C( 0.41), SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.63) },
{ SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( -0.23), SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( 0.33) } },
{ { SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( 0.94), SIMDE_FLOAT64_C( 0.41) },
{ SIMDE_FLOAT64_C( -0.52), SIMDE_FLOAT64_C( 1.34), SIMDE_FLOAT64_C( 1.55), SIMDE_FLOAT64_C( -0.23) } },
{ { SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 0.82), SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 0.67) },
{ SIMDE_FLOAT64_C( -1.34), SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.44) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_cdfnorminv_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_cdfnorminv_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.80),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.81) },
{ SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.41),
SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -1.13), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 2.33),
SIMDE_FLOAT32_C( -1.64), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 0.88) } },
{ { SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.13),
SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.21),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.91),
SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.08) },
{ SIMDE_FLOAT32_C( -1.28), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -1.13),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( -0.81),
SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( 1.34),
SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -1.41) } },
{ { SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.21),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.19),
SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.56),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.01) },
{ SIMDE_FLOAT32_C( 1.13), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.81),
SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -2.33), SIMDE_FLOAT32_C( -0.88),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( 0.15),
SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( -1.88), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -2.33) } },
{ { SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.91),
SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.15),
SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.09),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.34) },
{ SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( -1.13), SIMDE_FLOAT32_C( 1.34),
SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( -1.75), SIMDE_FLOAT32_C( -1.17), SIMDE_FLOAT32_C( -1.04),
SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( -1.55), SIMDE_FLOAT32_C( -1.34),
SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( -0.95), SIMDE_FLOAT32_C( -0.41) } },
{ { SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.98),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.04) },
{ SIMDE_FLOAT32_C( -2.33), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 2.05),
SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( -1.88), SIMDE_FLOAT32_C( -0.99),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( -0.95), SIMDE_FLOAT32_C( -1.75) } },
{ { SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.71),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.07),
SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.22) },
{ SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 0.55),
SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( -1.48), SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 0.41),
SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 1.41), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( -1.48),
SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( -0.77) } },
{ { SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.46),
SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.77),
SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.60) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.10),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( 0.74),
SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( -1.55),
SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -1.28), SIMDE_FLOAT32_C( -1.23), SIMDE_FLOAT32_C( 0.25) } },
{ { SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 0.75),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.74),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.28),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.56) },
{ SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( -1.55), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( -1.13), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( -1.41), SIMDE_FLOAT32_C( -1.04), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( -0.58),
SIMDE_FLOAT32_C( -1.55), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 0.15) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_cdfnorminv_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_cdfnorminv_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.18),
SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.74),
SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.03) },
UINT8_C(249),
{ SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.22),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.21),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.63),
SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.81) },
{ SIMDE_FLOAT32_C( -1.08), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( -2.33), SIMDE_FLOAT32_C( -1.88), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.81),
SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.03) } },
{ { SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.77),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.20),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.65) },
UINT8_C(209),
{ SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.26),
SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.00) },
{ SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.77),
SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -0.64),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.65) } },
{ { SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.92),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.65),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.10) },
UINT8_C(123),
{ SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.32),
SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.10) },
{ SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -0.47),
SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( 0.65),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.10) } },
{ { SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.95) },
UINT8_C( 43),
{ SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.50),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.19),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.44),
SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.36) },
{ SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( -1.41), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.95) } },
{ { SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.52),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.70) },
UINT8_C( 66),
{ SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.51),
SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.46),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.72) },
{ SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.52),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.70) } },
{ { SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.02) },
UINT8_C(157),
{ SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.50),
SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 0.71),
SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.39) },
{ SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 1.41), SIMDE_FLOAT32_C( 2.33),
SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.02) } },
{ { SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.19),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.52),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.95) },
UINT8_C( 65),
{ SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.27),
SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.36),
SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.41) },
{ SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.19),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( -1.08), SIMDE_FLOAT32_C( 0.52),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.95) } },
{ { SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.26),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.09),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.52) },
UINT8_C(240),
{ SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.12),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.26) },
{ SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 1.41), SIMDE_FLOAT32_C( -1.08), SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 0.77),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.09),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.52) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_cdfnorminv_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_cdfnorminv_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 0.48),
SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.88), SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( 0.89) },
{ SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( 0.77), SIMDE_FLOAT64_C( -0.05),
SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( 1.17), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 1.23) } },
{ { SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.28), SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( 0.87),
SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.51) },
{ SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( 0.88), SIMDE_FLOAT64_C( 1.13),
SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 2.05), SIMDE_FLOAT64_C( -0.41), SIMDE_FLOAT64_C( 0.03) } },
{ { SIMDE_FLOAT64_C( 0.85), SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 0.79),
SIMDE_FLOAT64_C( 0.28), SIMDE_FLOAT64_C( 0.73), SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( 0.96) },
{ SIMDE_FLOAT64_C( 1.04), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( 1.64), SIMDE_FLOAT64_C( 1.75) } },
{ { SIMDE_FLOAT64_C( 0.86), SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.84),
SIMDE_FLOAT64_C( 0.62), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( 0.95) },
{ SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( -0.08), SIMDE_FLOAT64_C( -1.88), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( -1.75), SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 1.64) } },
{ { SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( 0.28), SIMDE_FLOAT64_C( 0.71), SIMDE_FLOAT64_C( 0.83) },
{ SIMDE_FLOAT64_C( -0.20), SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.92),
SIMDE_FLOAT64_C( -0.64), SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.95) } },
{ { SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 0.70), SIMDE_FLOAT64_C( 0.89),
SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.35) },
{ SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 1.23),
SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( -1.75), SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( -0.39) } },
{ { SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( 0.84),
SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 0.77) },
{ SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -1.04), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 1.34), SIMDE_FLOAT64_C( -1.28), SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( 0.74) } },
{ { SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( 0.82), SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( 0.20),
SIMDE_FLOAT64_C( 0.86), SIMDE_FLOAT64_C( 0.21), SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( 0.28) },
{ SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.28), SIMDE_FLOAT64_C( -0.84),
SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( -0.81), SIMDE_FLOAT64_C( -1.04), SIMDE_FLOAT64_C( -0.58) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_cdfnorminv_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_cdfnorminv_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.13),
SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( 0.36), SIMDE_FLOAT64_C( 0.19) },
UINT8_C( 53),
{ SIMDE_FLOAT64_C( 0.28), SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( 0.36), SIMDE_FLOAT64_C( 0.21),
SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( 0.24) },
{ SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( 0.13),
SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -1.23), SIMDE_FLOAT64_C( 0.36), SIMDE_FLOAT64_C( 0.19) } },
{ { SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( 0.16),
SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( 0.21), SIMDE_FLOAT64_C( 0.47) },
UINT8_C( 92),
{ SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( 0.21), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 0.19), SIMDE_FLOAT64_C( 0.46) },
{ SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( -0.81), SIMDE_FLOAT64_C( -0.08),
SIMDE_FLOAT64_C( -0.18), SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( -0.88), SIMDE_FLOAT64_C( 0.47) } },
{ { SIMDE_FLOAT64_C( 0.28), SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 0.14),
SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( 0.18) },
UINT8_C(232),
{ SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( 0.19), SIMDE_FLOAT64_C( 0.19),
SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.39) },
{ SIMDE_FLOAT64_C( 0.28), SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -0.88),
SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( -0.41), SIMDE_FLOAT64_C( -0.28) } },
{ { SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.17),
SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( 0.25) },
UINT8_C(135),
{ SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( 0.39),
SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.35), SIMDE_FLOAT64_C( 0.15) },
{ SIMDE_FLOAT64_C( -0.52), SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -0.13), SIMDE_FLOAT64_C( 0.17),
SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( -1.04) } },
{ { SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 0.36), SIMDE_FLOAT64_C( 0.49),
SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 0.41), SIMDE_FLOAT64_C( 0.25) },
UINT8_C(111),
{ SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( 0.41), SIMDE_FLOAT64_C( 0.34),
SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 0.31) },
{ SIMDE_FLOAT64_C( -1.28), SIMDE_FLOAT64_C( -1.04), SIMDE_FLOAT64_C( -0.23), SIMDE_FLOAT64_C( -0.41),
SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.25) } },
{ { SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( 0.41), SIMDE_FLOAT64_C( 0.27),
SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 0.34) },
UINT8_C( 67),
{ SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( 0.22),
SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( 0.29) },
{ SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( 0.41), SIMDE_FLOAT64_C( 0.27),
SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( -1.17), SIMDE_FLOAT64_C( 0.34) } },
{ { SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.37),
SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.30) },
UINT8_C(205),
{ SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.16),
SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.36) },
{ SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -0.18), SIMDE_FLOAT64_C( -0.99),
SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -0.36) } },
{ { SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 0.11),
SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.16), SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.46) },
UINT8_C( 64),
{ SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.12),
SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( 0.31) },
{ SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 0.11),
SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.16), SIMDE_FLOAT64_C( -1.13), SIMDE_FLOAT64_C( 0.46) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_cdfnorminv_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_cexp_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 8.03), SIMDE_FLOAT32_C( 6.08), SIMDE_FLOAT32_C( 9.10) },
{ SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 2.90), SIMDE_FLOAT32_C( -414.18), SIMDE_FLOAT32_C( 139.46) } },
{ { SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( 5.24), SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 4.31) },
{ SIMDE_FLOAT32_C( 1.81), SIMDE_FLOAT32_C( -3.11), SIMDE_FLOAT32_C( -14.33), SIMDE_FLOAT32_C( -33.68) } },
{ { SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( 6.66), SIMDE_FLOAT32_C( 7.44) },
{ SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 1.15), SIMDE_FLOAT32_C( 313.98), SIMDE_FLOAT32_C( 714.61) } },
{ { SIMDE_FLOAT32_C( 5.32), SIMDE_FLOAT32_C( 1.63), SIMDE_FLOAT32_C( 3.75), SIMDE_FLOAT32_C( 1.94) },
{ SIMDE_FLOAT32_C( -12.09), SIMDE_FLOAT32_C( 204.03), SIMDE_FLOAT32_C( -15.34), SIMDE_FLOAT32_C( 39.66) } },
{ { SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 4.84), SIMDE_FLOAT32_C( 7.08), SIMDE_FLOAT32_C( 8.24) },
{ SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -2.54), SIMDE_FLOAT32_C( -447.27), SIMDE_FLOAT32_C( 1100.55) } },
{ { SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 7.57), SIMDE_FLOAT32_C( 8.46), SIMDE_FLOAT32_C( 6.20) },
{ SIMDE_FLOAT32_C( 6.04), SIMDE_FLOAT32_C( 20.68), SIMDE_FLOAT32_C( 4705.73), SIMDE_FLOAT32_C( -392.35) } },
{ { SIMDE_FLOAT32_C( 6.65), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 5.84), SIMDE_FLOAT32_C( 9.64) },
{ SIMDE_FLOAT32_C( 596.01), SIMDE_FLOAT32_C( 491.91), SIMDE_FLOAT32_C( -335.85), SIMDE_FLOAT32_C( -73.42) } },
{ { SIMDE_FLOAT32_C( 5.18), SIMDE_FLOAT32_C( 4.56), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 6.26) },
{ SIMDE_FLOAT32_C( -26.97), SIMDE_FLOAT32_C( -175.62), SIMDE_FLOAT32_C( 2.83), SIMDE_FLOAT32_C( -0.07) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_cexp_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_cexp_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 5.22), SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 9.44),
SIMDE_FLOAT32_C( 1.49), SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 9.55), SIMDE_FLOAT32_C( 7.98) },
{ SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( -1.33), SIMDE_FLOAT32_C( -3.86), SIMDE_FLOAT32_C( -0.06),
SIMDE_FLOAT32_C( -1.81), SIMDE_FLOAT32_C( 4.05), SIMDE_FLOAT32_C( -1765.21), SIMDE_FLOAT32_C( 13933.33) } },
{ { SIMDE_FLOAT32_C( 9.68), SIMDE_FLOAT32_C( 3.77), SIMDE_FLOAT32_C( 6.40), SIMDE_FLOAT32_C( 1.44),
SIMDE_FLOAT32_C( 7.91), SIMDE_FLOAT32_C( 7.80), SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 3.48) },
{ SIMDE_FLOAT32_C(-12938.99), SIMDE_FLOAT32_C( -9402.48), SIMDE_FLOAT32_C( 78.49), SIMDE_FLOAT32_C( 596.70),
SIMDE_FLOAT32_C( 147.00), SIMDE_FLOAT32_C( 2720.42), SIMDE_FLOAT32_C( -3.20), SIMDE_FLOAT32_C( -1.12) } },
{ { SIMDE_FLOAT32_C( 2.89), SIMDE_FLOAT32_C( 8.55), SIMDE_FLOAT32_C( 4.24), SIMDE_FLOAT32_C( 4.12),
SIMDE_FLOAT32_C( 7.15), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 6.80), SIMDE_FLOAT32_C( 3.92) },
{ SIMDE_FLOAT32_C( -11.54), SIMDE_FLOAT32_C( 13.81), SIMDE_FLOAT32_C( -38.75), SIMDE_FLOAT32_C( -57.58),
SIMDE_FLOAT32_C( 761.70), SIMDE_FLOAT32_C( 1021.35), SIMDE_FLOAT32_C( -639.30), SIMDE_FLOAT32_C( -630.42) } },
{ { SIMDE_FLOAT32_C( 4.44), SIMDE_FLOAT32_C( 7.17), SIMDE_FLOAT32_C( 7.74), SIMDE_FLOAT32_C( 2.32),
SIMDE_FLOAT32_C( 3.91), SIMDE_FLOAT32_C( 7.33), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 4.33) },
{ SIMDE_FLOAT32_C( 53.57), SIMDE_FLOAT32_C( 65.71), SIMDE_FLOAT32_C( -1565.39), SIMDE_FLOAT32_C( 1683.01),
SIMDE_FLOAT32_C( 24.97), SIMDE_FLOAT32_C( 43.20), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( -1.12) } },
{ { SIMDE_FLOAT32_C( 2.55), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 3.76), SIMDE_FLOAT32_C( 4.04),
SIMDE_FLOAT32_C( 3.54), SIMDE_FLOAT32_C( 3.32), SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( 3.21) },
{ SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 12.80), SIMDE_FLOAT32_C( -26.75), SIMDE_FLOAT32_C( -33.60),
SIMDE_FLOAT32_C( -33.92), SIMDE_FLOAT32_C( -6.12), SIMDE_FLOAT32_C( -7.52), SIMDE_FLOAT32_C( -0.52) } },
{ { SIMDE_FLOAT32_C( 7.08), SIMDE_FLOAT32_C( 8.42), SIMDE_FLOAT32_C( 4.66), SIMDE_FLOAT32_C( 4.99),
SIMDE_FLOAT32_C( 6.22), SIMDE_FLOAT32_C( 5.87), SIMDE_FLOAT32_C( 8.47), SIMDE_FLOAT32_C( 9.11) },
{ SIMDE_FLOAT32_C( -637.08), SIMDE_FLOAT32_C( 1002.70), SIMDE_FLOAT32_C( 28.95), SIMDE_FLOAT32_C( -101.59),
SIMDE_FLOAT32_C( 460.40), SIMDE_FLOAT32_C( -201.85), SIMDE_FLOAT32_C( -4535.17), SIMDE_FLOAT32_C( 1476.67) } },
{ { SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 2.71), SIMDE_FLOAT32_C( 3.23), SIMDE_FLOAT32_C( 1.57),
SIMDE_FLOAT32_C( 3.64), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 5.49), SIMDE_FLOAT32_C( 8.08) },
{ SIMDE_FLOAT32_C( -75.48), SIMDE_FLOAT32_C( 34.76), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 25.28),
SIMDE_FLOAT32_C( 38.07), SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( -54.29), SIMDE_FLOAT32_C( 236.10) } },
{ { SIMDE_FLOAT32_C( 7.19), SIMDE_FLOAT32_C( 3.23), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 1.10),
SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 5.43), SIMDE_FLOAT32_C( 3.11) },
{ SIMDE_FLOAT32_C( -1320.92), SIMDE_FLOAT32_C( -117.08), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 1.33),
SIMDE_FLOAT32_C( 1.45), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -228.04), SIMDE_FLOAT32_C( 7.21) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_cexp_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_clog_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 467.27), SIMDE_FLOAT32_C( -810.49), SIMDE_FLOAT32_C( -408.53), SIMDE_FLOAT32_C( -463.46) },
{ SIMDE_FLOAT32_C( 6.84), SIMDE_FLOAT32_C( -1.05), SIMDE_FLOAT32_C( 6.43), SIMDE_FLOAT32_C( -2.29) } },
{ { SIMDE_FLOAT32_C( -597.00), SIMDE_FLOAT32_C( 144.37), SIMDE_FLOAT32_C( 819.91), SIMDE_FLOAT32_C( 258.51) },
{ SIMDE_FLOAT32_C( 6.42), SIMDE_FLOAT32_C( 2.90), SIMDE_FLOAT32_C( 6.76), SIMDE_FLOAT32_C( 0.31) } },
{ { SIMDE_FLOAT32_C( -690.61), SIMDE_FLOAT32_C( -496.03), SIMDE_FLOAT32_C( -379.26), SIMDE_FLOAT32_C( 822.50) },
{ SIMDE_FLOAT32_C( 6.75), SIMDE_FLOAT32_C( -2.52), SIMDE_FLOAT32_C( 6.81), SIMDE_FLOAT32_C( 2.00) } },
{ { SIMDE_FLOAT32_C( 369.47), SIMDE_FLOAT32_C( 917.67), SIMDE_FLOAT32_C( 917.67), SIMDE_FLOAT32_C( 649.13) },
{ SIMDE_FLOAT32_C( 6.90), SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 7.02), SIMDE_FLOAT32_C( 0.62) } },
{ { SIMDE_FLOAT32_C( -165.00), SIMDE_FLOAT32_C( -18.10), SIMDE_FLOAT32_C( 943.19), SIMDE_FLOAT32_C( 635.72) },
{ SIMDE_FLOAT32_C( 5.11), SIMDE_FLOAT32_C( -3.03), SIMDE_FLOAT32_C( 7.04), SIMDE_FLOAT32_C( 0.59) } },
{ { SIMDE_FLOAT32_C( -21.66), SIMDE_FLOAT32_C( 494.23), SIMDE_FLOAT32_C( -734.58), SIMDE_FLOAT32_C( 417.20) },
{ SIMDE_FLOAT32_C( 6.20), SIMDE_FLOAT32_C( 1.61), SIMDE_FLOAT32_C( 6.74), SIMDE_FLOAT32_C( 2.63) } },
{ { SIMDE_FLOAT32_C( 812.64), SIMDE_FLOAT32_C( -983.61), SIMDE_FLOAT32_C( 15.40), SIMDE_FLOAT32_C( 505.51) },
{ SIMDE_FLOAT32_C( 7.15), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 6.23), SIMDE_FLOAT32_C( 1.54) } },
{ { SIMDE_FLOAT32_C( -497.22), SIMDE_FLOAT32_C( 590.38), SIMDE_FLOAT32_C( 600.11), SIMDE_FLOAT32_C( 970.05) },
{ SIMDE_FLOAT32_C( 6.65), SIMDE_FLOAT32_C( 2.27), SIMDE_FLOAT32_C( 7.04), SIMDE_FLOAT32_C( 1.02) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_clog_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_clog_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 312.27), SIMDE_FLOAT32_C( 505.55), SIMDE_FLOAT32_C( 862.46), SIMDE_FLOAT32_C( 31.99),
SIMDE_FLOAT32_C( 800.53), SIMDE_FLOAT32_C( 181.00), SIMDE_FLOAT32_C( 161.95), SIMDE_FLOAT32_C( -71.19) },
{ SIMDE_FLOAT32_C( 6.39), SIMDE_FLOAT32_C( 1.02), SIMDE_FLOAT32_C( 6.76), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 6.71), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 5.18), SIMDE_FLOAT32_C( -0.41) } },
{ { SIMDE_FLOAT32_C( 183.06), SIMDE_FLOAT32_C( 131.57), SIMDE_FLOAT32_C( 568.96), SIMDE_FLOAT32_C( 107.92),
SIMDE_FLOAT32_C( 898.15), SIMDE_FLOAT32_C( 154.17), SIMDE_FLOAT32_C( 262.39), SIMDE_FLOAT32_C( 850.07) },
{ SIMDE_FLOAT32_C( 5.42), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 6.36), SIMDE_FLOAT32_C( 0.19),
SIMDE_FLOAT32_C( 6.81), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 6.79), SIMDE_FLOAT32_C( 1.27) } },
{ { SIMDE_FLOAT32_C( 459.40), SIMDE_FLOAT32_C( 479.25), SIMDE_FLOAT32_C( 503.31), SIMDE_FLOAT32_C( 451.65),
SIMDE_FLOAT32_C( 353.11), SIMDE_FLOAT32_C( 438.44), SIMDE_FLOAT32_C( 777.37), SIMDE_FLOAT32_C( 20.59) },
{ SIMDE_FLOAT32_C( 6.50), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 6.52), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 6.33), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 6.66), SIMDE_FLOAT32_C( 0.03) } },
{ { SIMDE_FLOAT32_C( -35.16), SIMDE_FLOAT32_C( 449.22), SIMDE_FLOAT32_C( -48.41), SIMDE_FLOAT32_C( 925.44),
SIMDE_FLOAT32_C( 309.83), SIMDE_FLOAT32_C( 130.15), SIMDE_FLOAT32_C( 38.89), SIMDE_FLOAT32_C( 722.10) },
{ SIMDE_FLOAT32_C( 6.11), SIMDE_FLOAT32_C( 1.65), SIMDE_FLOAT32_C( 6.83), SIMDE_FLOAT32_C( 1.62),
SIMDE_FLOAT32_C( 5.82), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 6.58), SIMDE_FLOAT32_C( 1.52) } },
{ { SIMDE_FLOAT32_C( 735.70), SIMDE_FLOAT32_C( -98.65), SIMDE_FLOAT32_C( 854.09), SIMDE_FLOAT32_C( 536.23),
SIMDE_FLOAT32_C( 182.34), SIMDE_FLOAT32_C( 16.04), SIMDE_FLOAT32_C( 565.04), SIMDE_FLOAT32_C( 465.40) },
{ SIMDE_FLOAT32_C( 6.61), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 6.92), SIMDE_FLOAT32_C( 0.56),
SIMDE_FLOAT32_C( 5.21), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 6.60), SIMDE_FLOAT32_C( 0.69) } },
{ { SIMDE_FLOAT32_C( 247.61), SIMDE_FLOAT32_C( 134.00), SIMDE_FLOAT32_C( 673.33), SIMDE_FLOAT32_C( 145.76),
SIMDE_FLOAT32_C( 388.17), SIMDE_FLOAT32_C( -64.29), SIMDE_FLOAT32_C( -4.17), SIMDE_FLOAT32_C( 947.57) },
{ SIMDE_FLOAT32_C( 5.64), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 6.54), SIMDE_FLOAT32_C( 0.21),
SIMDE_FLOAT32_C( 5.97), SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 1.58) } },
{ { SIMDE_FLOAT32_C( 514.96), SIMDE_FLOAT32_C( 599.14), SIMDE_FLOAT32_C( 399.22), SIMDE_FLOAT32_C( 968.07),
SIMDE_FLOAT32_C( 37.59), SIMDE_FLOAT32_C( 176.60), SIMDE_FLOAT32_C( -11.35), SIMDE_FLOAT32_C( 102.43) },
{ SIMDE_FLOAT32_C( 6.67), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 6.95), SIMDE_FLOAT32_C( 1.18),
SIMDE_FLOAT32_C( 5.20), SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 4.64), SIMDE_FLOAT32_C( 1.68) } },
{ { SIMDE_FLOAT32_C( 725.82), SIMDE_FLOAT32_C( 40.24), SIMDE_FLOAT32_C( 27.87), SIMDE_FLOAT32_C( 35.65),
SIMDE_FLOAT32_C( 270.39), SIMDE_FLOAT32_C( 166.76), SIMDE_FLOAT32_C( 857.75), SIMDE_FLOAT32_C( 6.09) },
{ SIMDE_FLOAT32_C( 6.59), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 0.91),
SIMDE_FLOAT32_C( 5.76), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 6.75), SIMDE_FLOAT32_C( 0.01) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_clog_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_csqrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 467.84), SIMDE_FLOAT32_C( 803.96), SIMDE_FLOAT32_C( 261.38), SIMDE_FLOAT32_C( -142.34) },
{ SIMDE_FLOAT32_C( 26.44), SIMDE_FLOAT32_C( 15.20), SIMDE_FLOAT32_C( 16.72), SIMDE_FLOAT32_C( -4.26) } },
{ { SIMDE_FLOAT32_C( 742.87), SIMDE_FLOAT32_C( 79.67), SIMDE_FLOAT32_C( 840.90), SIMDE_FLOAT32_C( -323.18) },
{ SIMDE_FLOAT32_C( 27.29), SIMDE_FLOAT32_C( 1.46), SIMDE_FLOAT32_C( 29.51), SIMDE_FLOAT32_C( -5.48) } },
{ { SIMDE_FLOAT32_C( -240.48), SIMDE_FLOAT32_C( -541.73), SIMDE_FLOAT32_C( 989.55), SIMDE_FLOAT32_C( 570.06) },
{ SIMDE_FLOAT32_C( 13.27), SIMDE_FLOAT32_C( -20.41), SIMDE_FLOAT32_C( 32.65), SIMDE_FLOAT32_C( 8.73) } },
{ { SIMDE_FLOAT32_C( 83.09), SIMDE_FLOAT32_C( -1.32), SIMDE_FLOAT32_C( 106.90), SIMDE_FLOAT32_C( -376.28) },
{ SIMDE_FLOAT32_C( 9.12), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 15.78), SIMDE_FLOAT32_C( -11.92) } },
{ { SIMDE_FLOAT32_C( -403.08), SIMDE_FLOAT32_C( 970.42), SIMDE_FLOAT32_C( -962.81), SIMDE_FLOAT32_C( 736.64) },
{ SIMDE_FLOAT32_C( 18.00), SIMDE_FLOAT32_C( 26.96), SIMDE_FLOAT32_C( 11.17), SIMDE_FLOAT32_C( 32.98) } },
{ { SIMDE_FLOAT32_C( 711.24), SIMDE_FLOAT32_C( -757.45), SIMDE_FLOAT32_C( 634.59), SIMDE_FLOAT32_C( -16.19) },
{ SIMDE_FLOAT32_C( 29.58), SIMDE_FLOAT32_C( -12.80), SIMDE_FLOAT32_C( 25.19), SIMDE_FLOAT32_C( -0.32) } },
{ { SIMDE_FLOAT32_C( 81.29), SIMDE_FLOAT32_C( -815.58), SIMDE_FLOAT32_C( -317.77), SIMDE_FLOAT32_C( -90.40) },
{ SIMDE_FLOAT32_C( 21.22), SIMDE_FLOAT32_C( -19.21), SIMDE_FLOAT32_C( 2.51), SIMDE_FLOAT32_C( -18.00) } },
{ { SIMDE_FLOAT32_C( -84.58), SIMDE_FLOAT32_C( 322.77), SIMDE_FLOAT32_C( 454.95), SIMDE_FLOAT32_C( -616.74) },
{ SIMDE_FLOAT32_C( 11.16), SIMDE_FLOAT32_C( 14.46), SIMDE_FLOAT32_C( 24.71), SIMDE_FLOAT32_C( -12.48) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_csqrt_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_csqrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 374.45), SIMDE_FLOAT32_C( -986.58), SIMDE_FLOAT32_C( -44.44), SIMDE_FLOAT32_C( -160.79),
SIMDE_FLOAT32_C( -660.98), SIMDE_FLOAT32_C( -996.70), SIMDE_FLOAT32_C( -22.70), SIMDE_FLOAT32_C( -74.73) },
{ SIMDE_FLOAT32_C( 26.74), SIMDE_FLOAT32_C( -18.45), SIMDE_FLOAT32_C( 7.82), SIMDE_FLOAT32_C( -10.28),
SIMDE_FLOAT32_C( 16.36), SIMDE_FLOAT32_C( -30.47), SIMDE_FLOAT32_C( 5.26), SIMDE_FLOAT32_C( -7.10) } },
{ { SIMDE_FLOAT32_C( -335.08), SIMDE_FLOAT32_C( -387.45), SIMDE_FLOAT32_C( 992.50), SIMDE_FLOAT32_C( 334.99),
SIMDE_FLOAT32_C( -373.08), SIMDE_FLOAT32_C( -939.30), SIMDE_FLOAT32_C( 219.57), SIMDE_FLOAT32_C( -565.96) },
{ SIMDE_FLOAT32_C( 9.41), SIMDE_FLOAT32_C( -20.58), SIMDE_FLOAT32_C( 31.94), SIMDE_FLOAT32_C( 5.24),
SIMDE_FLOAT32_C( 17.85), SIMDE_FLOAT32_C( -26.30), SIMDE_FLOAT32_C( 20.33), SIMDE_FLOAT32_C( -13.92) } },
{ { SIMDE_FLOAT32_C( 626.25), SIMDE_FLOAT32_C( -390.81), SIMDE_FLOAT32_C( 653.44), SIMDE_FLOAT32_C( 423.64),
SIMDE_FLOAT32_C( 320.72), SIMDE_FLOAT32_C( 749.19), SIMDE_FLOAT32_C( -605.94), SIMDE_FLOAT32_C( 183.09) },
{ SIMDE_FLOAT32_C( 26.12), SIMDE_FLOAT32_C( -7.48), SIMDE_FLOAT32_C( 26.76), SIMDE_FLOAT32_C( 7.92),
SIMDE_FLOAT32_C( 23.83), SIMDE_FLOAT32_C( 15.72), SIMDE_FLOAT32_C( 3.68), SIMDE_FLOAT32_C( 24.89) } },
{ { SIMDE_FLOAT32_C( 911.79), SIMDE_FLOAT32_C( 134.97), SIMDE_FLOAT32_C( -550.62), SIMDE_FLOAT32_C( -842.16),
SIMDE_FLOAT32_C( 650.87), SIMDE_FLOAT32_C( -128.95), SIMDE_FLOAT32_C( 295.76), SIMDE_FLOAT32_C( 25.32) },
{ SIMDE_FLOAT32_C( 30.28), SIMDE_FLOAT32_C( 2.23), SIMDE_FLOAT32_C( 15.09), SIMDE_FLOAT32_C( -27.90),
SIMDE_FLOAT32_C( 25.64), SIMDE_FLOAT32_C( -2.52), SIMDE_FLOAT32_C( 17.21), SIMDE_FLOAT32_C( 0.74) } },
{ { SIMDE_FLOAT32_C( -115.53), SIMDE_FLOAT32_C( -748.68), SIMDE_FLOAT32_C( 864.53), SIMDE_FLOAT32_C( 223.49),
SIMDE_FLOAT32_C( -745.38), SIMDE_FLOAT32_C( -158.17), SIMDE_FLOAT32_C( -851.24), SIMDE_FLOAT32_C( -80.46) },
{ SIMDE_FLOAT32_C( 17.92), SIMDE_FLOAT32_C( -20.89), SIMDE_FLOAT32_C( 29.64), SIMDE_FLOAT32_C( 3.77),
SIMDE_FLOAT32_C( 2.88), SIMDE_FLOAT32_C( -27.45), SIMDE_FLOAT32_C( 1.38), SIMDE_FLOAT32_C( -29.21) } },
{ { SIMDE_FLOAT32_C( 454.37), SIMDE_FLOAT32_C( -858.75), SIMDE_FLOAT32_C( -745.47), SIMDE_FLOAT32_C( -918.71),
SIMDE_FLOAT32_C( -798.04), SIMDE_FLOAT32_C( 474.10), SIMDE_FLOAT32_C( -484.67), SIMDE_FLOAT32_C( 828.20) },
{ SIMDE_FLOAT32_C( 26.70), SIMDE_FLOAT32_C( -16.08), SIMDE_FLOAT32_C( 14.79), SIMDE_FLOAT32_C( -31.05),
SIMDE_FLOAT32_C( 8.07), SIMDE_FLOAT32_C( 29.38), SIMDE_FLOAT32_C( 15.41), SIMDE_FLOAT32_C( 26.87) } },
{ { SIMDE_FLOAT32_C( -916.70), SIMDE_FLOAT32_C( -831.23), SIMDE_FLOAT32_C( 251.85), SIMDE_FLOAT32_C( 404.02),
SIMDE_FLOAT32_C( 917.96), SIMDE_FLOAT32_C( 645.91), SIMDE_FLOAT32_C( -412.89), SIMDE_FLOAT32_C( 829.74) },
{ SIMDE_FLOAT32_C( 12.66), SIMDE_FLOAT32_C( -32.82), SIMDE_FLOAT32_C( 19.08), SIMDE_FLOAT32_C( 10.59),
SIMDE_FLOAT32_C( 31.94), SIMDE_FLOAT32_C( 10.11), SIMDE_FLOAT32_C( 16.03), SIMDE_FLOAT32_C( 25.88) } },
{ { SIMDE_FLOAT32_C( -219.12), SIMDE_FLOAT32_C( 36.49), SIMDE_FLOAT32_C( 987.58), SIMDE_FLOAT32_C( -568.25),
SIMDE_FLOAT32_C( 907.54), SIMDE_FLOAT32_C( 283.34), SIMDE_FLOAT32_C( 457.07), SIMDE_FLOAT32_C( -207.99) },
{ SIMDE_FLOAT32_C( 1.23), SIMDE_FLOAT32_C( 14.85), SIMDE_FLOAT32_C( 32.61), SIMDE_FLOAT32_C( -8.71),
SIMDE_FLOAT32_C( 30.48), SIMDE_FLOAT32_C( 4.65), SIMDE_FLOAT32_C( 21.90), SIMDE_FLOAT32_C( -4.75) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_csqrt_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_cos_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.49)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.88)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( -0.94)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.95)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( 0.85)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 1.00)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 0.48)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( -0.61)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_cos_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_cos_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.49)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.66), SIMDE_FLOAT64_C( 0.26)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.54), SIMDE_FLOAT64_C( -0.88)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( -0.47)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( -0.94)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.33)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 0.95)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.30)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_cos_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_cos_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24),
SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01),
SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.47),
SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.88),
SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( 0.26),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.49)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13),
SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21),
SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.30),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.95),
SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( -0.94)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47),
SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95),
SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.98),
SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( 0.85)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67),
SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54),
SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.15),
SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( -0.61),
SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.46),
SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 0.48)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48),
SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90),
SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.87),
SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( -0.95),
SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.74)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92),
SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.81)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73),
SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.58),
SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.67),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( -0.99)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02),
SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.13),
SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -0.27),
SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( -0.91),
SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( -0.99)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_cos_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_cos_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.66), SIMDE_FLOAT64_C( 0.26),
SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.49)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( -0.47),
SIMDE_FLOAT64_C( -0.54), SIMDE_FLOAT64_C( -0.88)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.33),
SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( -0.94)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.30),
SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 0.95)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.94), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( -0.96), SIMDE_FLOAT64_C( 0.85)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( -0.98),
SIMDE_FLOAT64_C( 0.73), SIMDE_FLOAT64_C( 1.00)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( 0.46),
SIMDE_FLOAT64_C( -0.88), SIMDE_FLOAT64_C( 0.48)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.64), SIMDE_FLOAT64_C( -0.15),
SIMDE_FLOAT64_C( 0.63), SIMDE_FLOAT64_C( -0.61)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_cos_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_cbrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -233.95), SIMDE_FLOAT32_C( 484.12), SIMDE_FLOAT32_C( -627.08), SIMDE_FLOAT32_C( -978.93) },
{ SIMDE_FLOAT32_C( -6.16), SIMDE_FLOAT32_C( 7.85), SIMDE_FLOAT32_C( -8.56), SIMDE_FLOAT32_C( -9.93) } },
{ { SIMDE_FLOAT32_C( -749.83), SIMDE_FLOAT32_C( 484.28), SIMDE_FLOAT32_C( 749.02), SIMDE_FLOAT32_C( 850.44) },
{ SIMDE_FLOAT32_C( -9.08), SIMDE_FLOAT32_C( 7.85), SIMDE_FLOAT32_C( 9.08), SIMDE_FLOAT32_C( 9.47) } },
{ { SIMDE_FLOAT32_C( -517.39), SIMDE_FLOAT32_C( -725.46), SIMDE_FLOAT32_C( -558.90), SIMDE_FLOAT32_C( -267.33) },
{ SIMDE_FLOAT32_C( -8.03), SIMDE_FLOAT32_C( -8.99), SIMDE_FLOAT32_C( -8.24), SIMDE_FLOAT32_C( -6.44) } },
{ { SIMDE_FLOAT32_C( 569.35), SIMDE_FLOAT32_C( 995.62), SIMDE_FLOAT32_C( 709.27), SIMDE_FLOAT32_C( -107.57) },
{ SIMDE_FLOAT32_C( 8.29), SIMDE_FLOAT32_C( 9.99), SIMDE_FLOAT32_C( 8.92), SIMDE_FLOAT32_C( -4.76) } },
{ { SIMDE_FLOAT32_C( 350.06), SIMDE_FLOAT32_C( 89.99), SIMDE_FLOAT32_C( 267.98), SIMDE_FLOAT32_C( -152.18) },
{ SIMDE_FLOAT32_C( 7.05), SIMDE_FLOAT32_C( 4.48), SIMDE_FLOAT32_C( 6.45), SIMDE_FLOAT32_C( -5.34) } },
{ { SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 317.87), SIMDE_FLOAT32_C( -435.79), SIMDE_FLOAT32_C( -295.24) },
{ SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 6.82), SIMDE_FLOAT32_C( -7.58), SIMDE_FLOAT32_C( -6.66) } },
{ { SIMDE_FLOAT32_C( 382.46), SIMDE_FLOAT32_C( 327.49), SIMDE_FLOAT32_C( -186.96), SIMDE_FLOAT32_C( 913.54) },
{ SIMDE_FLOAT32_C( 7.26), SIMDE_FLOAT32_C( 6.89), SIMDE_FLOAT32_C( -5.72), SIMDE_FLOAT32_C( 9.70) } },
{ { SIMDE_FLOAT32_C( 619.00), SIMDE_FLOAT32_C( 936.03), SIMDE_FLOAT32_C( 27.91), SIMDE_FLOAT32_C( -614.95) },
{ SIMDE_FLOAT32_C( 8.52), SIMDE_FLOAT32_C( 9.78), SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( -8.50) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_cbrt_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_cbrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -517.18), SIMDE_FLOAT64_C( 744.08) },
{ SIMDE_FLOAT64_C( -8.03), SIMDE_FLOAT64_C( 9.06) } },
{ { SIMDE_FLOAT64_C( 664.94), SIMDE_FLOAT64_C( 255.05) },
{ SIMDE_FLOAT64_C( 8.73), SIMDE_FLOAT64_C( 6.34) } },
{ { SIMDE_FLOAT64_C( 38.42), SIMDE_FLOAT64_C( 432.02) },
{ SIMDE_FLOAT64_C( 3.37), SIMDE_FLOAT64_C( 7.56) } },
{ { SIMDE_FLOAT64_C( -843.35), SIMDE_FLOAT64_C( -957.81) },
{ SIMDE_FLOAT64_C( -9.45), SIMDE_FLOAT64_C( -9.86) } },
{ { SIMDE_FLOAT64_C( -560.27), SIMDE_FLOAT64_C( 292.64) },
{ SIMDE_FLOAT64_C( -8.24), SIMDE_FLOAT64_C( 6.64) } },
{ { SIMDE_FLOAT64_C( 329.56), SIMDE_FLOAT64_C( 633.90) },
{ SIMDE_FLOAT64_C( 6.91), SIMDE_FLOAT64_C( 8.59) } },
{ { SIMDE_FLOAT64_C( -774.56), SIMDE_FLOAT64_C( 892.85) },
{ SIMDE_FLOAT64_C( -9.18), SIMDE_FLOAT64_C( 9.63) } },
{ { SIMDE_FLOAT64_C( 705.03), SIMDE_FLOAT64_C( -332.78) },
{ SIMDE_FLOAT64_C( 8.90), SIMDE_FLOAT64_C( -6.93) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_cbrt_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_cbrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 14.66), SIMDE_FLOAT32_C( -346.78), SIMDE_FLOAT32_C( 608.16), SIMDE_FLOAT32_C( -175.40),
SIMDE_FLOAT32_C( -696.64), SIMDE_FLOAT32_C( -645.46), SIMDE_FLOAT32_C( -765.98), SIMDE_FLOAT32_C( 391.25) },
{ SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( -7.03), SIMDE_FLOAT32_C( 8.47), SIMDE_FLOAT32_C( -5.60),
SIMDE_FLOAT32_C( -8.86), SIMDE_FLOAT32_C( -8.64), SIMDE_FLOAT32_C( -9.15), SIMDE_FLOAT32_C( 7.31) } },
{ { SIMDE_FLOAT32_C( -27.85), SIMDE_FLOAT32_C( 887.61), SIMDE_FLOAT32_C( -720.32), SIMDE_FLOAT32_C( -702.24),
SIMDE_FLOAT32_C( -320.58), SIMDE_FLOAT32_C( -360.38), SIMDE_FLOAT32_C( -53.29), SIMDE_FLOAT32_C( 251.62) },
{ SIMDE_FLOAT32_C( -3.03), SIMDE_FLOAT32_C( 9.61), SIMDE_FLOAT32_C( -8.96), SIMDE_FLOAT32_C( -8.89),
SIMDE_FLOAT32_C( -6.84), SIMDE_FLOAT32_C( -7.12), SIMDE_FLOAT32_C( -3.76), SIMDE_FLOAT32_C( 6.31) } },
{ { SIMDE_FLOAT32_C( 677.19), SIMDE_FLOAT32_C( 865.20), SIMDE_FLOAT32_C( -346.98), SIMDE_FLOAT32_C( -605.62),
SIMDE_FLOAT32_C( -498.20), SIMDE_FLOAT32_C( 696.85), SIMDE_FLOAT32_C( -203.22), SIMDE_FLOAT32_C( -909.19) },
{ SIMDE_FLOAT32_C( 8.78), SIMDE_FLOAT32_C( 9.53), SIMDE_FLOAT32_C( -7.03), SIMDE_FLOAT32_C( -8.46),
SIMDE_FLOAT32_C( -7.93), SIMDE_FLOAT32_C( 8.87), SIMDE_FLOAT32_C( -5.88), SIMDE_FLOAT32_C( -9.69) } },
{ { SIMDE_FLOAT32_C( 46.70), SIMDE_FLOAT32_C( -557.66), SIMDE_FLOAT32_C( -327.34), SIMDE_FLOAT32_C( -489.40),
SIMDE_FLOAT32_C( -78.90), SIMDE_FLOAT32_C( -843.63), SIMDE_FLOAT32_C( -527.77), SIMDE_FLOAT32_C( 935.75) },
{ SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( -8.23), SIMDE_FLOAT32_C( -6.89), SIMDE_FLOAT32_C( -7.88),
SIMDE_FLOAT32_C( -4.29), SIMDE_FLOAT32_C( -9.45), SIMDE_FLOAT32_C( -8.08), SIMDE_FLOAT32_C( 9.78) } },
{ { SIMDE_FLOAT32_C( -190.41), SIMDE_FLOAT32_C( -919.61), SIMDE_FLOAT32_C( -239.64), SIMDE_FLOAT32_C( 112.95),
SIMDE_FLOAT32_C( -565.07), SIMDE_FLOAT32_C( -5.63), SIMDE_FLOAT32_C( -495.80), SIMDE_FLOAT32_C( 407.08) },
{ SIMDE_FLOAT32_C( -5.75), SIMDE_FLOAT32_C( -9.72), SIMDE_FLOAT32_C( -6.21), SIMDE_FLOAT32_C( 4.83),
SIMDE_FLOAT32_C( -8.27), SIMDE_FLOAT32_C( -1.78), SIMDE_FLOAT32_C( -7.91), SIMDE_FLOAT32_C( 7.41) } },
{ { SIMDE_FLOAT32_C( -118.02), SIMDE_FLOAT32_C( -216.12), SIMDE_FLOAT32_C( 704.84), SIMDE_FLOAT32_C( 561.40),
SIMDE_FLOAT32_C( 423.50), SIMDE_FLOAT32_C( -348.46), SIMDE_FLOAT32_C( -186.97), SIMDE_FLOAT32_C( 100.69) },
{ SIMDE_FLOAT32_C( -4.91), SIMDE_FLOAT32_C( -6.00), SIMDE_FLOAT32_C( 8.90), SIMDE_FLOAT32_C( 8.25),
SIMDE_FLOAT32_C( 7.51), SIMDE_FLOAT32_C( -7.04), SIMDE_FLOAT32_C( -5.72), SIMDE_FLOAT32_C( 4.65) } },
{ { SIMDE_FLOAT32_C( -483.26), SIMDE_FLOAT32_C( 466.05), SIMDE_FLOAT32_C( 495.07), SIMDE_FLOAT32_C( 18.54),
SIMDE_FLOAT32_C( 162.90), SIMDE_FLOAT32_C( -708.15), SIMDE_FLOAT32_C( 109.34), SIMDE_FLOAT32_C( -790.40) },
{ SIMDE_FLOAT32_C( -7.85), SIMDE_FLOAT32_C( 7.75), SIMDE_FLOAT32_C( 7.91), SIMDE_FLOAT32_C( 2.65),
SIMDE_FLOAT32_C( 5.46), SIMDE_FLOAT32_C( -8.91), SIMDE_FLOAT32_C( 4.78), SIMDE_FLOAT32_C( -9.25) } },
{ { SIMDE_FLOAT32_C( -265.81), SIMDE_FLOAT32_C( 782.01), SIMDE_FLOAT32_C( -279.80), SIMDE_FLOAT32_C( 655.29),
SIMDE_FLOAT32_C( 938.38), SIMDE_FLOAT32_C( 192.43), SIMDE_FLOAT32_C( 591.04), SIMDE_FLOAT32_C( -252.03) },
{ SIMDE_FLOAT32_C( -6.43), SIMDE_FLOAT32_C( 9.21), SIMDE_FLOAT32_C( -6.54), SIMDE_FLOAT32_C( 8.69),
SIMDE_FLOAT32_C( 9.79), SIMDE_FLOAT32_C( 5.77), SIMDE_FLOAT32_C( 8.39), SIMDE_FLOAT32_C( -6.32) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_cbrt_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_cbrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 286.65), SIMDE_FLOAT64_C( -385.66), SIMDE_FLOAT64_C( 84.84), SIMDE_FLOAT64_C( 116.45) },
{ SIMDE_FLOAT64_C( 6.59), SIMDE_FLOAT64_C( -7.28), SIMDE_FLOAT64_C( 4.39), SIMDE_FLOAT64_C( 4.88) } },
{ { SIMDE_FLOAT64_C( 443.79), SIMDE_FLOAT64_C( 321.91), SIMDE_FLOAT64_C( -219.08), SIMDE_FLOAT64_C( -924.57) },
{ SIMDE_FLOAT64_C( 7.63), SIMDE_FLOAT64_C( 6.85), SIMDE_FLOAT64_C( -6.03), SIMDE_FLOAT64_C( -9.74) } },
{ { SIMDE_FLOAT64_C( 745.74), SIMDE_FLOAT64_C( 694.64), SIMDE_FLOAT64_C( 266.38), SIMDE_FLOAT64_C( 138.63) },
{ SIMDE_FLOAT64_C( 9.07), SIMDE_FLOAT64_C( 8.86), SIMDE_FLOAT64_C( 6.43), SIMDE_FLOAT64_C( 5.18) } },
{ { SIMDE_FLOAT64_C( 417.51), SIMDE_FLOAT64_C( 27.01), SIMDE_FLOAT64_C( -921.58), SIMDE_FLOAT64_C( 56.73) },
{ SIMDE_FLOAT64_C( 7.47), SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( -9.73), SIMDE_FLOAT64_C( 3.84) } },
{ { SIMDE_FLOAT64_C( 568.89), SIMDE_FLOAT64_C( 355.21), SIMDE_FLOAT64_C( -243.68), SIMDE_FLOAT64_C( 232.84) },
{ SIMDE_FLOAT64_C( 8.29), SIMDE_FLOAT64_C( 7.08), SIMDE_FLOAT64_C( -6.25), SIMDE_FLOAT64_C( 6.15) } },
{ { SIMDE_FLOAT64_C( -964.92), SIMDE_FLOAT64_C( -649.34), SIMDE_FLOAT64_C( -100.47), SIMDE_FLOAT64_C( -303.39) },
{ SIMDE_FLOAT64_C( -9.88), SIMDE_FLOAT64_C( -8.66), SIMDE_FLOAT64_C( -4.65), SIMDE_FLOAT64_C( -6.72) } },
{ { SIMDE_FLOAT64_C( -56.31), SIMDE_FLOAT64_C( -696.56), SIMDE_FLOAT64_C( -500.81), SIMDE_FLOAT64_C( 866.34) },
{ SIMDE_FLOAT64_C( -3.83), SIMDE_FLOAT64_C( -8.86), SIMDE_FLOAT64_C( -7.94), SIMDE_FLOAT64_C( 9.53) } },
{ { SIMDE_FLOAT64_C( 560.33), SIMDE_FLOAT64_C( 808.06), SIMDE_FLOAT64_C( 566.38), SIMDE_FLOAT64_C( -153.02) },
{ SIMDE_FLOAT64_C( 8.24), SIMDE_FLOAT64_C( 9.31), SIMDE_FLOAT64_C( 8.27), SIMDE_FLOAT64_C( -5.35) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_cbrt_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_cbrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -13.67), SIMDE_FLOAT32_C( -56.84), SIMDE_FLOAT32_C( -51.43), SIMDE_FLOAT32_C( 570.17),
SIMDE_FLOAT32_C( 282.97), SIMDE_FLOAT32_C( -935.16), SIMDE_FLOAT32_C( 302.89), SIMDE_FLOAT32_C( -720.37),
SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -98.04), SIMDE_FLOAT32_C( -1.14), SIMDE_FLOAT32_C( -24.91),
SIMDE_FLOAT32_C( 315.22), SIMDE_FLOAT32_C( -790.04), SIMDE_FLOAT32_C( -92.39), SIMDE_FLOAT32_C( -624.42) },
{ SIMDE_FLOAT32_C( -2.39), SIMDE_FLOAT32_C( -3.84), SIMDE_FLOAT32_C( -3.72), SIMDE_FLOAT32_C( 8.29),
SIMDE_FLOAT32_C( 6.57), SIMDE_FLOAT32_C( -9.78), SIMDE_FLOAT32_C( 6.72), SIMDE_FLOAT32_C( -8.96),
SIMDE_FLOAT32_C( -6.89), SIMDE_FLOAT32_C( -4.61), SIMDE_FLOAT32_C( -1.05), SIMDE_FLOAT32_C( -2.92),
SIMDE_FLOAT32_C( 6.81), SIMDE_FLOAT32_C( -9.24), SIMDE_FLOAT32_C( -4.52), SIMDE_FLOAT32_C( -8.55) } },
{ { SIMDE_FLOAT32_C( 534.24), SIMDE_FLOAT32_C( 480.60), SIMDE_FLOAT32_C( -464.10), SIMDE_FLOAT32_C( 924.79),
SIMDE_FLOAT32_C( 691.98), SIMDE_FLOAT32_C( 368.05), SIMDE_FLOAT32_C( 181.75), SIMDE_FLOAT32_C( 967.37),
SIMDE_FLOAT32_C( -837.71), SIMDE_FLOAT32_C( -61.77), SIMDE_FLOAT32_C( -702.36), SIMDE_FLOAT32_C( 76.18),
SIMDE_FLOAT32_C( 549.27), SIMDE_FLOAT32_C( 36.35), SIMDE_FLOAT32_C( -116.93), SIMDE_FLOAT32_C( -464.40) },
{ SIMDE_FLOAT32_C( 8.11), SIMDE_FLOAT32_C( 7.83), SIMDE_FLOAT32_C( -7.74), SIMDE_FLOAT32_C( 9.74),
SIMDE_FLOAT32_C( 8.84), SIMDE_FLOAT32_C( 7.17), SIMDE_FLOAT32_C( 5.66), SIMDE_FLOAT32_C( 9.89),
SIMDE_FLOAT32_C( -9.43), SIMDE_FLOAT32_C( -3.95), SIMDE_FLOAT32_C( -8.89), SIMDE_FLOAT32_C( 4.24),
SIMDE_FLOAT32_C( 8.19), SIMDE_FLOAT32_C( 3.31), SIMDE_FLOAT32_C( -4.89), SIMDE_FLOAT32_C( -7.74) } },
{ { SIMDE_FLOAT32_C( 979.51), SIMDE_FLOAT32_C( 831.64), SIMDE_FLOAT32_C( -894.23), SIMDE_FLOAT32_C( 262.49),
SIMDE_FLOAT32_C( 896.48), SIMDE_FLOAT32_C( 408.65), SIMDE_FLOAT32_C( 542.11), SIMDE_FLOAT32_C( -430.74),
SIMDE_FLOAT32_C( -689.38), SIMDE_FLOAT32_C( -459.03), SIMDE_FLOAT32_C( 544.35), SIMDE_FLOAT32_C( 625.84),
SIMDE_FLOAT32_C( -249.07), SIMDE_FLOAT32_C( -548.04), SIMDE_FLOAT32_C( -998.58), SIMDE_FLOAT32_C( -714.83) },
{ SIMDE_FLOAT32_C( 9.93), SIMDE_FLOAT32_C( 9.40), SIMDE_FLOAT32_C( -9.63), SIMDE_FLOAT32_C( 6.40),
SIMDE_FLOAT32_C( 9.64), SIMDE_FLOAT32_C( 7.42), SIMDE_FLOAT32_C( 8.15), SIMDE_FLOAT32_C( -7.55),
SIMDE_FLOAT32_C( -8.83), SIMDE_FLOAT32_C( -7.71), SIMDE_FLOAT32_C( 8.17), SIMDE_FLOAT32_C( 8.55),
SIMDE_FLOAT32_C( -6.29), SIMDE_FLOAT32_C( -8.18), SIMDE_FLOAT32_C( -10.00), SIMDE_FLOAT32_C( -8.94) } },
{ { SIMDE_FLOAT32_C( 932.56), SIMDE_FLOAT32_C( -462.68), SIMDE_FLOAT32_C( -790.04), SIMDE_FLOAT32_C( 624.53),
SIMDE_FLOAT32_C( 905.37), SIMDE_FLOAT32_C( 391.72), SIMDE_FLOAT32_C( 591.90), SIMDE_FLOAT32_C( -932.34),
SIMDE_FLOAT32_C( -670.05), SIMDE_FLOAT32_C( 889.54), SIMDE_FLOAT32_C( 143.84), SIMDE_FLOAT32_C( 879.22),
SIMDE_FLOAT32_C( -74.11), SIMDE_FLOAT32_C( -973.09), SIMDE_FLOAT32_C( -585.18), SIMDE_FLOAT32_C( -94.60) },
{ SIMDE_FLOAT32_C( 9.77), SIMDE_FLOAT32_C( -7.73), SIMDE_FLOAT32_C( -9.24), SIMDE_FLOAT32_C( 8.55),
SIMDE_FLOAT32_C( 9.67), SIMDE_FLOAT32_C( 7.32), SIMDE_FLOAT32_C( 8.40), SIMDE_FLOAT32_C( -9.77),
SIMDE_FLOAT32_C( -8.75), SIMDE_FLOAT32_C( 9.62), SIMDE_FLOAT32_C( 5.24), SIMDE_FLOAT32_C( 9.58),
SIMDE_FLOAT32_C( -4.20), SIMDE_FLOAT32_C( -9.91), SIMDE_FLOAT32_C( -8.36), SIMDE_FLOAT32_C( -4.56) } },
{ { SIMDE_FLOAT32_C( 858.55), SIMDE_FLOAT32_C( -479.41), SIMDE_FLOAT32_C( -832.11), SIMDE_FLOAT32_C( 755.02),
SIMDE_FLOAT32_C( 929.24), SIMDE_FLOAT32_C( 710.00), SIMDE_FLOAT32_C( -675.72), SIMDE_FLOAT32_C( -760.15),
SIMDE_FLOAT32_C( -749.03), SIMDE_FLOAT32_C( 868.63), SIMDE_FLOAT32_C( 865.69), SIMDE_FLOAT32_C( 1.90),
SIMDE_FLOAT32_C( -679.42), SIMDE_FLOAT32_C( 867.11), SIMDE_FLOAT32_C( 287.07), SIMDE_FLOAT32_C( -746.86) },
{ SIMDE_FLOAT32_C( 9.50), SIMDE_FLOAT32_C( -7.83), SIMDE_FLOAT32_C( -9.41), SIMDE_FLOAT32_C( 9.11),
SIMDE_FLOAT32_C( 9.76), SIMDE_FLOAT32_C( 8.92), SIMDE_FLOAT32_C( -8.78), SIMDE_FLOAT32_C( -9.13),
SIMDE_FLOAT32_C( -9.08), SIMDE_FLOAT32_C( 9.54), SIMDE_FLOAT32_C( 9.53), SIMDE_FLOAT32_C( 1.24),
SIMDE_FLOAT32_C( -8.79), SIMDE_FLOAT32_C( 9.54), SIMDE_FLOAT32_C( 6.60), SIMDE_FLOAT32_C( -9.07) } },
{ { SIMDE_FLOAT32_C( -595.56), SIMDE_FLOAT32_C( 497.03), SIMDE_FLOAT32_C( 877.67), SIMDE_FLOAT32_C( -690.19),
SIMDE_FLOAT32_C( -111.25), SIMDE_FLOAT32_C( 469.57), SIMDE_FLOAT32_C( -622.53), SIMDE_FLOAT32_C( 218.70),
SIMDE_FLOAT32_C( 359.11), SIMDE_FLOAT32_C( 521.31), SIMDE_FLOAT32_C( 97.92), SIMDE_FLOAT32_C( -714.99),
SIMDE_FLOAT32_C( 548.22), SIMDE_FLOAT32_C( 512.74), SIMDE_FLOAT32_C( 190.41), SIMDE_FLOAT32_C( 406.77) },
{ SIMDE_FLOAT32_C( -8.41), SIMDE_FLOAT32_C( 7.92), SIMDE_FLOAT32_C( 9.57), SIMDE_FLOAT32_C( -8.84),
SIMDE_FLOAT32_C( -4.81), SIMDE_FLOAT32_C( 7.77), SIMDE_FLOAT32_C( -8.54), SIMDE_FLOAT32_C( 6.02),
SIMDE_FLOAT32_C( 7.11), SIMDE_FLOAT32_C( 8.05), SIMDE_FLOAT32_C( 4.61), SIMDE_FLOAT32_C( -8.94),
SIMDE_FLOAT32_C( 8.18), SIMDE_FLOAT32_C( 8.00), SIMDE_FLOAT32_C( 5.75), SIMDE_FLOAT32_C( 7.41) } },
{ { SIMDE_FLOAT32_C( -966.68), SIMDE_FLOAT32_C( 358.30), SIMDE_FLOAT32_C( 161.79), SIMDE_FLOAT32_C( 962.56),
SIMDE_FLOAT32_C( 68.29), SIMDE_FLOAT32_C( 486.07), SIMDE_FLOAT32_C( -797.58), SIMDE_FLOAT32_C( 319.26),
SIMDE_FLOAT32_C( 354.70), SIMDE_FLOAT32_C( -931.89), SIMDE_FLOAT32_C( -678.84), SIMDE_FLOAT32_C( 675.28),
SIMDE_FLOAT32_C( 935.22), SIMDE_FLOAT32_C( 608.23), SIMDE_FLOAT32_C( 928.43), SIMDE_FLOAT32_C( -660.34) },
{ SIMDE_FLOAT32_C( -9.89), SIMDE_FLOAT32_C( 7.10), SIMDE_FLOAT32_C( 5.45), SIMDE_FLOAT32_C( 9.87),
SIMDE_FLOAT32_C( 4.09), SIMDE_FLOAT32_C( 7.86), SIMDE_FLOAT32_C( -9.27), SIMDE_FLOAT32_C( 6.83),
SIMDE_FLOAT32_C( 7.08), SIMDE_FLOAT32_C( -9.77), SIMDE_FLOAT32_C( -8.79), SIMDE_FLOAT32_C( 8.77),
SIMDE_FLOAT32_C( 9.78), SIMDE_FLOAT32_C( 8.47), SIMDE_FLOAT32_C( 9.76), SIMDE_FLOAT32_C( -8.71) } },
{ { SIMDE_FLOAT32_C( 105.27), SIMDE_FLOAT32_C( 806.10), SIMDE_FLOAT32_C( -350.53), SIMDE_FLOAT32_C( 994.02),
SIMDE_FLOAT32_C( 275.67), SIMDE_FLOAT32_C( 26.95), SIMDE_FLOAT32_C( 212.72), SIMDE_FLOAT32_C( -365.21),
SIMDE_FLOAT32_C( -451.74), SIMDE_FLOAT32_C( -689.36), SIMDE_FLOAT32_C( -80.21), SIMDE_FLOAT32_C( -903.52),
SIMDE_FLOAT32_C( 823.38), SIMDE_FLOAT32_C( -889.80), SIMDE_FLOAT32_C( 503.25), SIMDE_FLOAT32_C( 856.70) },
{ SIMDE_FLOAT32_C( 4.72), SIMDE_FLOAT32_C( 9.31), SIMDE_FLOAT32_C( -7.05), SIMDE_FLOAT32_C( 9.98),
SIMDE_FLOAT32_C( 6.51), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 5.97), SIMDE_FLOAT32_C( -7.15),
SIMDE_FLOAT32_C( -7.67), SIMDE_FLOAT32_C( -8.83), SIMDE_FLOAT32_C( -4.31), SIMDE_FLOAT32_C( -9.67),
SIMDE_FLOAT32_C( 9.37), SIMDE_FLOAT32_C( -9.62), SIMDE_FLOAT32_C( 7.95), SIMDE_FLOAT32_C( 9.50) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_cbrt_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_cbrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -399.60), SIMDE_FLOAT64_C( 73.77), SIMDE_FLOAT64_C( 137.22), SIMDE_FLOAT64_C( -676.98),
SIMDE_FLOAT64_C( -304.40), SIMDE_FLOAT64_C( -35.84), SIMDE_FLOAT64_C( -298.40), SIMDE_FLOAT64_C( -909.21) },
{ SIMDE_FLOAT64_C( -7.37), SIMDE_FLOAT64_C( 4.19), SIMDE_FLOAT64_C( 5.16), SIMDE_FLOAT64_C( -8.78),
SIMDE_FLOAT64_C( -6.73), SIMDE_FLOAT64_C( -3.30), SIMDE_FLOAT64_C( -6.68), SIMDE_FLOAT64_C( -9.69) } },
{ { SIMDE_FLOAT64_C( -369.66), SIMDE_FLOAT64_C( -37.98), SIMDE_FLOAT64_C( 225.69), SIMDE_FLOAT64_C( 708.35),
SIMDE_FLOAT64_C( 411.81), SIMDE_FLOAT64_C( -32.59), SIMDE_FLOAT64_C( 605.95), SIMDE_FLOAT64_C( -309.62) },
{ SIMDE_FLOAT64_C( -7.18), SIMDE_FLOAT64_C( -3.36), SIMDE_FLOAT64_C( 6.09), SIMDE_FLOAT64_C( 8.91),
SIMDE_FLOAT64_C( 7.44), SIMDE_FLOAT64_C( -3.19), SIMDE_FLOAT64_C( 8.46), SIMDE_FLOAT64_C( -6.77) } },
{ { SIMDE_FLOAT64_C( 644.51), SIMDE_FLOAT64_C( -178.16), SIMDE_FLOAT64_C( -305.15), SIMDE_FLOAT64_C( 654.50),
SIMDE_FLOAT64_C( -229.06), SIMDE_FLOAT64_C( -577.20), SIMDE_FLOAT64_C( 549.91), SIMDE_FLOAT64_C( -450.26) },
{ SIMDE_FLOAT64_C( 8.64), SIMDE_FLOAT64_C( -5.63), SIMDE_FLOAT64_C( -6.73), SIMDE_FLOAT64_C( 8.68),
SIMDE_FLOAT64_C( -6.12), SIMDE_FLOAT64_C( -8.33), SIMDE_FLOAT64_C( 8.19), SIMDE_FLOAT64_C( -7.66) } },
{ { SIMDE_FLOAT64_C( 336.68), SIMDE_FLOAT64_C( -367.59), SIMDE_FLOAT64_C( 113.01), SIMDE_FLOAT64_C( -952.73),
SIMDE_FLOAT64_C( 958.03), SIMDE_FLOAT64_C( 319.98), SIMDE_FLOAT64_C( -626.30), SIMDE_FLOAT64_C( -441.56) },
{ SIMDE_FLOAT64_C( 6.96), SIMDE_FLOAT64_C( -7.16), SIMDE_FLOAT64_C( 4.83), SIMDE_FLOAT64_C( -9.84),
SIMDE_FLOAT64_C( 9.86), SIMDE_FLOAT64_C( 6.84), SIMDE_FLOAT64_C( -8.56), SIMDE_FLOAT64_C( -7.61) } },
{ { SIMDE_FLOAT64_C( -606.25), SIMDE_FLOAT64_C( 510.93), SIMDE_FLOAT64_C( -118.54), SIMDE_FLOAT64_C( 89.36),
SIMDE_FLOAT64_C( -524.91), SIMDE_FLOAT64_C( 583.06), SIMDE_FLOAT64_C( 180.15), SIMDE_FLOAT64_C( 105.43) },
{ SIMDE_FLOAT64_C( -8.46), SIMDE_FLOAT64_C( 7.99), SIMDE_FLOAT64_C( -4.91), SIMDE_FLOAT64_C( 4.47),
SIMDE_FLOAT64_C( -8.07), SIMDE_FLOAT64_C( 8.35), SIMDE_FLOAT64_C( 5.65), SIMDE_FLOAT64_C( 4.72) } },
{ { SIMDE_FLOAT64_C( -454.92), SIMDE_FLOAT64_C( -594.16), SIMDE_FLOAT64_C( -186.22), SIMDE_FLOAT64_C( 956.89),
SIMDE_FLOAT64_C( 373.25), SIMDE_FLOAT64_C( -580.27), SIMDE_FLOAT64_C( -352.73), SIMDE_FLOAT64_C( 17.77) },
{ SIMDE_FLOAT64_C( -7.69), SIMDE_FLOAT64_C( -8.41), SIMDE_FLOAT64_C( -5.71), SIMDE_FLOAT64_C( 9.85),
SIMDE_FLOAT64_C( 7.20), SIMDE_FLOAT64_C( -8.34), SIMDE_FLOAT64_C( -7.07), SIMDE_FLOAT64_C( 2.61) } },
{ { SIMDE_FLOAT64_C( 241.57), SIMDE_FLOAT64_C( 342.12), SIMDE_FLOAT64_C( -327.73), SIMDE_FLOAT64_C( -987.48),
SIMDE_FLOAT64_C( 764.92), SIMDE_FLOAT64_C( -777.82), SIMDE_FLOAT64_C( -437.75), SIMDE_FLOAT64_C( 101.60) },
{ SIMDE_FLOAT64_C( 6.23), SIMDE_FLOAT64_C( 6.99), SIMDE_FLOAT64_C( -6.89), SIMDE_FLOAT64_C( -9.96),
SIMDE_FLOAT64_C( 9.15), SIMDE_FLOAT64_C( -9.20), SIMDE_FLOAT64_C( -7.59), SIMDE_FLOAT64_C( 4.67) } },
{ { SIMDE_FLOAT64_C( -145.41), SIMDE_FLOAT64_C( 675.27), SIMDE_FLOAT64_C( 148.87), SIMDE_FLOAT64_C( -187.38),
SIMDE_FLOAT64_C( -4.75), SIMDE_FLOAT64_C( 522.57), SIMDE_FLOAT64_C( 371.06), SIMDE_FLOAT64_C( 389.00) },
{ SIMDE_FLOAT64_C( -5.26), SIMDE_FLOAT64_C( 8.77), SIMDE_FLOAT64_C( 5.30), SIMDE_FLOAT64_C( -5.72),
SIMDE_FLOAT64_C( -1.68), SIMDE_FLOAT64_C( 8.05), SIMDE_FLOAT64_C( 7.19), SIMDE_FLOAT64_C( 7.30) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_cbrt_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_cbrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 98.98), SIMDE_FLOAT32_C( 913.15), SIMDE_FLOAT32_C( 690.53), SIMDE_FLOAT32_C( -536.23),
SIMDE_FLOAT32_C( -147.17), SIMDE_FLOAT32_C( 971.04), SIMDE_FLOAT32_C( -365.05), SIMDE_FLOAT32_C( 728.65),
SIMDE_FLOAT32_C( 340.02), SIMDE_FLOAT32_C( -288.54), SIMDE_FLOAT32_C( 298.61), SIMDE_FLOAT32_C( -421.40),
SIMDE_FLOAT32_C( 62.04), SIMDE_FLOAT32_C( 962.27), SIMDE_FLOAT32_C( -847.19), SIMDE_FLOAT32_C( -983.83) },
UINT8_C( 93),
{ SIMDE_FLOAT32_C( -474.95), SIMDE_FLOAT32_C( -938.65), SIMDE_FLOAT32_C( -941.09), SIMDE_FLOAT32_C( 980.71),
SIMDE_FLOAT32_C( -613.85), SIMDE_FLOAT32_C( 304.68), SIMDE_FLOAT32_C( -395.19), SIMDE_FLOAT32_C( -357.37),
SIMDE_FLOAT32_C( 667.44), SIMDE_FLOAT32_C( 353.93), SIMDE_FLOAT32_C( 659.42), SIMDE_FLOAT32_C( -91.27),
SIMDE_FLOAT32_C( -203.61), SIMDE_FLOAT32_C( -908.72), SIMDE_FLOAT32_C( -992.29), SIMDE_FLOAT32_C( -290.45) },
{ SIMDE_FLOAT32_C( -7.80), SIMDE_FLOAT32_C( 913.15), SIMDE_FLOAT32_C( -9.80), SIMDE_FLOAT32_C( 9.94),
SIMDE_FLOAT32_C( -8.50), SIMDE_FLOAT32_C( 971.04), SIMDE_FLOAT32_C( -7.34), SIMDE_FLOAT32_C( 728.65),
SIMDE_FLOAT32_C( 340.02), SIMDE_FLOAT32_C( -288.54), SIMDE_FLOAT32_C( 298.61), SIMDE_FLOAT32_C( -421.40),
SIMDE_FLOAT32_C( 62.04), SIMDE_FLOAT32_C( 962.27), SIMDE_FLOAT32_C( -847.19), SIMDE_FLOAT32_C( -983.83) } },
{ { SIMDE_FLOAT32_C( 781.81), SIMDE_FLOAT32_C( -528.52), SIMDE_FLOAT32_C( 562.38), SIMDE_FLOAT32_C( 752.86),
SIMDE_FLOAT32_C( 106.43), SIMDE_FLOAT32_C( 291.03), SIMDE_FLOAT32_C( 92.88), SIMDE_FLOAT32_C( 817.89),
SIMDE_FLOAT32_C( -410.36), SIMDE_FLOAT32_C( 671.48), SIMDE_FLOAT32_C( -120.07), SIMDE_FLOAT32_C( -448.09),
SIMDE_FLOAT32_C( 824.29), SIMDE_FLOAT32_C( -103.90), SIMDE_FLOAT32_C( -767.52), SIMDE_FLOAT32_C( -650.66) },
UINT8_C( 13),
{ SIMDE_FLOAT32_C( -708.61), SIMDE_FLOAT32_C( -669.94), SIMDE_FLOAT32_C( 343.60), SIMDE_FLOAT32_C( 596.08),
SIMDE_FLOAT32_C( -65.13), SIMDE_FLOAT32_C( 986.24), SIMDE_FLOAT32_C( 263.52), SIMDE_FLOAT32_C( -711.20),
SIMDE_FLOAT32_C( 645.65), SIMDE_FLOAT32_C( -827.76), SIMDE_FLOAT32_C( 85.19), SIMDE_FLOAT32_C( 736.94),
SIMDE_FLOAT32_C( -820.04), SIMDE_FLOAT32_C( 794.74), SIMDE_FLOAT32_C( 518.75), SIMDE_FLOAT32_C( -348.56) },
{ SIMDE_FLOAT32_C( -8.92), SIMDE_FLOAT32_C( -528.52), SIMDE_FLOAT32_C( 7.00), SIMDE_FLOAT32_C( 8.42),
SIMDE_FLOAT32_C( 106.43), SIMDE_FLOAT32_C( 291.03), SIMDE_FLOAT32_C( 92.88), SIMDE_FLOAT32_C( 817.89),
SIMDE_FLOAT32_C( -410.36), SIMDE_FLOAT32_C( 671.48), SIMDE_FLOAT32_C( -120.07), SIMDE_FLOAT32_C( -448.09),
SIMDE_FLOAT32_C( 824.29), SIMDE_FLOAT32_C( -103.90), SIMDE_FLOAT32_C( -767.52), SIMDE_FLOAT32_C( -650.66) } },
{ { SIMDE_FLOAT32_C( 357.12), SIMDE_FLOAT32_C( 271.61), SIMDE_FLOAT32_C( 757.87), SIMDE_FLOAT32_C( -351.85),
SIMDE_FLOAT32_C( -635.52), SIMDE_FLOAT32_C( 575.76), SIMDE_FLOAT32_C( 237.78), SIMDE_FLOAT32_C( -964.04),
SIMDE_FLOAT32_C( -544.31), SIMDE_FLOAT32_C( 789.69), SIMDE_FLOAT32_C( 860.25), SIMDE_FLOAT32_C( 351.79),
SIMDE_FLOAT32_C( -977.83), SIMDE_FLOAT32_C( -790.40), SIMDE_FLOAT32_C( -690.76), SIMDE_FLOAT32_C( -686.43) },
UINT8_C( 57),
{ SIMDE_FLOAT32_C( 652.85), SIMDE_FLOAT32_C( 909.64), SIMDE_FLOAT32_C( 474.52), SIMDE_FLOAT32_C( 639.08),
SIMDE_FLOAT32_C( 173.16), SIMDE_FLOAT32_C( 763.32), SIMDE_FLOAT32_C( 284.74), SIMDE_FLOAT32_C( 345.41),
SIMDE_FLOAT32_C( -151.49), SIMDE_FLOAT32_C( 21.68), SIMDE_FLOAT32_C( 525.36), SIMDE_FLOAT32_C( -356.75),
SIMDE_FLOAT32_C( -459.57), SIMDE_FLOAT32_C( -823.20), SIMDE_FLOAT32_C( -999.64), SIMDE_FLOAT32_C( 812.03) },
{ SIMDE_FLOAT32_C( 8.68), SIMDE_FLOAT32_C( 271.61), SIMDE_FLOAT32_C( 757.87), SIMDE_FLOAT32_C( 8.61),
SIMDE_FLOAT32_C( 5.57), SIMDE_FLOAT32_C( 9.14), SIMDE_FLOAT32_C( 237.78), SIMDE_FLOAT32_C( -964.04),
SIMDE_FLOAT32_C( -544.31), SIMDE_FLOAT32_C( 789.69), SIMDE_FLOAT32_C( 860.25), SIMDE_FLOAT32_C( 351.79),
SIMDE_FLOAT32_C( -977.83), SIMDE_FLOAT32_C( -790.40), SIMDE_FLOAT32_C( -690.76), SIMDE_FLOAT32_C( -686.43) } },
{ { SIMDE_FLOAT32_C( 934.67), SIMDE_FLOAT32_C( -351.49), SIMDE_FLOAT32_C( -823.49), SIMDE_FLOAT32_C( 510.43),
SIMDE_FLOAT32_C( 886.29), SIMDE_FLOAT32_C( -787.53), SIMDE_FLOAT32_C( 966.12), SIMDE_FLOAT32_C( 675.98),
SIMDE_FLOAT32_C( -927.28), SIMDE_FLOAT32_C( 317.91), SIMDE_FLOAT32_C( 698.16), SIMDE_FLOAT32_C( -717.68),
SIMDE_FLOAT32_C( 627.15), SIMDE_FLOAT32_C( -988.28), SIMDE_FLOAT32_C( -178.03), SIMDE_FLOAT32_C( 279.99) },
UINT8_C( 81),
{ SIMDE_FLOAT32_C( -703.51), SIMDE_FLOAT32_C( -80.92), SIMDE_FLOAT32_C( 94.53), SIMDE_FLOAT32_C( -940.19),
SIMDE_FLOAT32_C( -796.18), SIMDE_FLOAT32_C( -560.07), SIMDE_FLOAT32_C( -91.68), SIMDE_FLOAT32_C( 225.49),
SIMDE_FLOAT32_C( 965.29), SIMDE_FLOAT32_C( 551.56), SIMDE_FLOAT32_C( 765.92), SIMDE_FLOAT32_C( -857.91),
SIMDE_FLOAT32_C( 551.93), SIMDE_FLOAT32_C( 577.95), SIMDE_FLOAT32_C( -923.23), SIMDE_FLOAT32_C( -799.56) },
{ SIMDE_FLOAT32_C( -8.89), SIMDE_FLOAT32_C( -351.49), SIMDE_FLOAT32_C( -823.49), SIMDE_FLOAT32_C( 510.43),
SIMDE_FLOAT32_C( -9.27), SIMDE_FLOAT32_C( -787.53), SIMDE_FLOAT32_C( -4.51), SIMDE_FLOAT32_C( 675.98),
SIMDE_FLOAT32_C( -927.28), SIMDE_FLOAT32_C( 317.91), SIMDE_FLOAT32_C( 698.16), SIMDE_FLOAT32_C( -717.68),
SIMDE_FLOAT32_C( 627.15), SIMDE_FLOAT32_C( -988.28), SIMDE_FLOAT32_C( -178.03), SIMDE_FLOAT32_C( 279.99) } },
{ { SIMDE_FLOAT32_C( 754.46), SIMDE_FLOAT32_C( 587.20), SIMDE_FLOAT32_C( -913.27), SIMDE_FLOAT32_C( 966.93),
SIMDE_FLOAT32_C( 553.32), SIMDE_FLOAT32_C( 762.71), SIMDE_FLOAT32_C( -960.34), SIMDE_FLOAT32_C( -128.78),
SIMDE_FLOAT32_C( 460.87), SIMDE_FLOAT32_C( -678.02), SIMDE_FLOAT32_C( -501.63), SIMDE_FLOAT32_C( 472.59),
SIMDE_FLOAT32_C( 143.95), SIMDE_FLOAT32_C( 778.36), SIMDE_FLOAT32_C( 393.95), SIMDE_FLOAT32_C( 440.44) },
UINT8_C(131),
{ SIMDE_FLOAT32_C( -511.52), SIMDE_FLOAT32_C( 500.25), SIMDE_FLOAT32_C( -98.74), SIMDE_FLOAT32_C( -71.59),
SIMDE_FLOAT32_C( -591.44), SIMDE_FLOAT32_C( -873.25), SIMDE_FLOAT32_C( -106.29), SIMDE_FLOAT32_C( 960.13),
SIMDE_FLOAT32_C( 892.67), SIMDE_FLOAT32_C( 35.80), SIMDE_FLOAT32_C( 512.05), SIMDE_FLOAT32_C( 470.62),
SIMDE_FLOAT32_C( 112.57), SIMDE_FLOAT32_C( 712.49), SIMDE_FLOAT32_C( 225.08), SIMDE_FLOAT32_C( -300.23) },
{ SIMDE_FLOAT32_C( -8.00), SIMDE_FLOAT32_C( 7.94), SIMDE_FLOAT32_C( -913.27), SIMDE_FLOAT32_C( 966.93),
SIMDE_FLOAT32_C( 553.32), SIMDE_FLOAT32_C( 762.71), SIMDE_FLOAT32_C( -960.34), SIMDE_FLOAT32_C( 9.87),
SIMDE_FLOAT32_C( 460.87), SIMDE_FLOAT32_C( -678.02), SIMDE_FLOAT32_C( -501.63), SIMDE_FLOAT32_C( 472.59),
SIMDE_FLOAT32_C( 143.95), SIMDE_FLOAT32_C( 778.36), SIMDE_FLOAT32_C( 393.95), SIMDE_FLOAT32_C( 440.44) } },
{ { SIMDE_FLOAT32_C( 799.22), SIMDE_FLOAT32_C( 192.01), SIMDE_FLOAT32_C( -746.92), SIMDE_FLOAT32_C( 561.93),
SIMDE_FLOAT32_C( 231.67), SIMDE_FLOAT32_C( 124.30), SIMDE_FLOAT32_C( 22.80), SIMDE_FLOAT32_C( 553.64),
SIMDE_FLOAT32_C( 622.67), SIMDE_FLOAT32_C( -504.61), SIMDE_FLOAT32_C( -302.41), SIMDE_FLOAT32_C( 401.04),
SIMDE_FLOAT32_C( 889.34), SIMDE_FLOAT32_C( -861.97), SIMDE_FLOAT32_C( -901.52), SIMDE_FLOAT32_C( -622.17) },
UINT8_C( 8),
{ SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 306.24), SIMDE_FLOAT32_C( -953.16), SIMDE_FLOAT32_C( 126.49),
SIMDE_FLOAT32_C( -800.06), SIMDE_FLOAT32_C( -993.04), SIMDE_FLOAT32_C( 19.16), SIMDE_FLOAT32_C( 235.74),
SIMDE_FLOAT32_C( 519.02), SIMDE_FLOAT32_C( -510.22), SIMDE_FLOAT32_C( -651.69), SIMDE_FLOAT32_C( 231.50),
SIMDE_FLOAT32_C( 714.86), SIMDE_FLOAT32_C( 48.08), SIMDE_FLOAT32_C( 30.72), SIMDE_FLOAT32_C( -93.13) },
{ SIMDE_FLOAT32_C( 799.22), SIMDE_FLOAT32_C( 192.01), SIMDE_FLOAT32_C( -746.92), SIMDE_FLOAT32_C( 5.02),
SIMDE_FLOAT32_C( 231.67), SIMDE_FLOAT32_C( 124.30), SIMDE_FLOAT32_C( 22.80), SIMDE_FLOAT32_C( 553.64),
SIMDE_FLOAT32_C( 622.67), SIMDE_FLOAT32_C( -504.61), SIMDE_FLOAT32_C( -302.41), SIMDE_FLOAT32_C( 401.04),
SIMDE_FLOAT32_C( 889.34), SIMDE_FLOAT32_C( -861.97), SIMDE_FLOAT32_C( -901.52), SIMDE_FLOAT32_C( -622.17) } },
{ { SIMDE_FLOAT32_C( 301.16), SIMDE_FLOAT32_C( -407.35), SIMDE_FLOAT32_C( -861.46), SIMDE_FLOAT32_C( -574.54),
SIMDE_FLOAT32_C( 615.45), SIMDE_FLOAT32_C( 692.19), SIMDE_FLOAT32_C( -951.86), SIMDE_FLOAT32_C( -889.16),
SIMDE_FLOAT32_C( -610.22), SIMDE_FLOAT32_C( 449.17), SIMDE_FLOAT32_C( -999.81), SIMDE_FLOAT32_C( -472.20),
SIMDE_FLOAT32_C( 547.65), SIMDE_FLOAT32_C( -621.98), SIMDE_FLOAT32_C( -833.92), SIMDE_FLOAT32_C( -452.61) },
UINT8_C( 61),
{ SIMDE_FLOAT32_C( -787.08), SIMDE_FLOAT32_C( 673.88), SIMDE_FLOAT32_C( 884.20), SIMDE_FLOAT32_C( -780.12),
SIMDE_FLOAT32_C( -306.96), SIMDE_FLOAT32_C( 119.94), SIMDE_FLOAT32_C( 738.89), SIMDE_FLOAT32_C( 182.83),
SIMDE_FLOAT32_C( 468.25), SIMDE_FLOAT32_C( -29.60), SIMDE_FLOAT32_C( -102.31), SIMDE_FLOAT32_C( -483.67),
SIMDE_FLOAT32_C( -998.88), SIMDE_FLOAT32_C( 804.56), SIMDE_FLOAT32_C( 817.49), SIMDE_FLOAT32_C( -406.23) },
{ SIMDE_FLOAT32_C( -9.23), SIMDE_FLOAT32_C( -407.35), SIMDE_FLOAT32_C( 9.60), SIMDE_FLOAT32_C( -9.21),
SIMDE_FLOAT32_C( -6.75), SIMDE_FLOAT32_C( 4.93), SIMDE_FLOAT32_C( -951.86), SIMDE_FLOAT32_C( -889.16),
SIMDE_FLOAT32_C( -610.22), SIMDE_FLOAT32_C( 449.17), SIMDE_FLOAT32_C( -999.81), SIMDE_FLOAT32_C( -472.20),
SIMDE_FLOAT32_C( 547.65), SIMDE_FLOAT32_C( -621.98), SIMDE_FLOAT32_C( -833.92), SIMDE_FLOAT32_C( -452.61) } },
{ { SIMDE_FLOAT32_C( 943.11), SIMDE_FLOAT32_C( -757.05), SIMDE_FLOAT32_C( -790.77), SIMDE_FLOAT32_C( 635.29),
SIMDE_FLOAT32_C( -708.91), SIMDE_FLOAT32_C( -679.93), SIMDE_FLOAT32_C( -974.93), SIMDE_FLOAT32_C( 740.26),
SIMDE_FLOAT32_C( -679.74), SIMDE_FLOAT32_C( -447.13), SIMDE_FLOAT32_C( 287.91), SIMDE_FLOAT32_C( -301.72),
SIMDE_FLOAT32_C( -281.05), SIMDE_FLOAT32_C( 835.30), SIMDE_FLOAT32_C( -617.47), SIMDE_FLOAT32_C( -68.13) },
UINT8_C(116),
{ SIMDE_FLOAT32_C( -733.27), SIMDE_FLOAT32_C( 151.75), SIMDE_FLOAT32_C( -797.77), SIMDE_FLOAT32_C( 386.67),
SIMDE_FLOAT32_C( -109.36), SIMDE_FLOAT32_C( 385.06), SIMDE_FLOAT32_C( -145.07), SIMDE_FLOAT32_C( 861.04),
SIMDE_FLOAT32_C( -717.26), SIMDE_FLOAT32_C( 371.26), SIMDE_FLOAT32_C( 862.16), SIMDE_FLOAT32_C( -912.69),
SIMDE_FLOAT32_C( 188.75), SIMDE_FLOAT32_C( -544.07), SIMDE_FLOAT32_C( -969.58), SIMDE_FLOAT32_C( 431.70) },
{ SIMDE_FLOAT32_C( 943.11), SIMDE_FLOAT32_C( -757.05), SIMDE_FLOAT32_C( -9.27), SIMDE_FLOAT32_C( 635.29),
SIMDE_FLOAT32_C( -4.78), SIMDE_FLOAT32_C( 7.28), SIMDE_FLOAT32_C( -5.25), SIMDE_FLOAT32_C( 740.26),
SIMDE_FLOAT32_C( -679.74), SIMDE_FLOAT32_C( -447.13), SIMDE_FLOAT32_C( 287.91), SIMDE_FLOAT32_C( -301.72),
SIMDE_FLOAT32_C( -281.05), SIMDE_FLOAT32_C( 835.30), SIMDE_FLOAT32_C( -617.47), SIMDE_FLOAT32_C( -68.13) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_cbrt_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_cbrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -759.76), SIMDE_FLOAT64_C( 815.00), SIMDE_FLOAT64_C( -816.92), SIMDE_FLOAT64_C( 967.48),
SIMDE_FLOAT64_C( -635.21), SIMDE_FLOAT64_C( 789.99), SIMDE_FLOAT64_C( -526.03), SIMDE_FLOAT64_C( -914.28) },
UINT8_C( 48),
{ SIMDE_FLOAT64_C( 53.18), SIMDE_FLOAT64_C( 207.38), SIMDE_FLOAT64_C( -889.97), SIMDE_FLOAT64_C( -694.52),
SIMDE_FLOAT64_C( 45.75), SIMDE_FLOAT64_C( 94.09), SIMDE_FLOAT64_C( -391.74), SIMDE_FLOAT64_C( 959.63) },
{ SIMDE_FLOAT64_C( -759.76), SIMDE_FLOAT64_C( 815.00), SIMDE_FLOAT64_C( -816.92), SIMDE_FLOAT64_C( 967.48),
SIMDE_FLOAT64_C( 3.58), SIMDE_FLOAT64_C( 4.55), SIMDE_FLOAT64_C( -526.03), SIMDE_FLOAT64_C( -914.28) } },
{ { SIMDE_FLOAT64_C( 378.72), SIMDE_FLOAT64_C( -982.35), SIMDE_FLOAT64_C( -413.18), SIMDE_FLOAT64_C( 706.92),
SIMDE_FLOAT64_C( 679.73), SIMDE_FLOAT64_C( 156.25), SIMDE_FLOAT64_C( 267.05), SIMDE_FLOAT64_C( -563.13) },
UINT8_C( 62),
{ SIMDE_FLOAT64_C( -595.59), SIMDE_FLOAT64_C( -667.14), SIMDE_FLOAT64_C( -678.76), SIMDE_FLOAT64_C( -24.40),
SIMDE_FLOAT64_C( 817.42), SIMDE_FLOAT64_C( -438.52), SIMDE_FLOAT64_C( -209.40), SIMDE_FLOAT64_C( -999.49) },
{ SIMDE_FLOAT64_C( 378.72), SIMDE_FLOAT64_C( -8.74), SIMDE_FLOAT64_C( -8.79), SIMDE_FLOAT64_C( -2.90),
SIMDE_FLOAT64_C( 9.35), SIMDE_FLOAT64_C( -7.60), SIMDE_FLOAT64_C( 267.05), SIMDE_FLOAT64_C( -563.13) } },
{ { SIMDE_FLOAT64_C( -471.03), SIMDE_FLOAT64_C( 155.40), SIMDE_FLOAT64_C( 790.50), SIMDE_FLOAT64_C( 2.94),
SIMDE_FLOAT64_C( 241.12), SIMDE_FLOAT64_C( 295.11), SIMDE_FLOAT64_C( -943.89), SIMDE_FLOAT64_C( -551.50) },
UINT8_C(142),
{ SIMDE_FLOAT64_C( -638.40), SIMDE_FLOAT64_C( 494.25), SIMDE_FLOAT64_C( -500.77), SIMDE_FLOAT64_C( -30.15),
SIMDE_FLOAT64_C( 453.88), SIMDE_FLOAT64_C( 877.94), SIMDE_FLOAT64_C( -12.50), SIMDE_FLOAT64_C( -959.30) },
{ SIMDE_FLOAT64_C( -471.03), SIMDE_FLOAT64_C( 7.91), SIMDE_FLOAT64_C( -7.94), SIMDE_FLOAT64_C( -3.11),
SIMDE_FLOAT64_C( 241.12), SIMDE_FLOAT64_C( 295.11), SIMDE_FLOAT64_C( -943.89), SIMDE_FLOAT64_C( -9.86) } },
{ { SIMDE_FLOAT64_C( 584.87), SIMDE_FLOAT64_C( -332.77), SIMDE_FLOAT64_C( 196.95), SIMDE_FLOAT64_C( -148.09),
SIMDE_FLOAT64_C( 104.11), SIMDE_FLOAT64_C( -809.90), SIMDE_FLOAT64_C( 256.33), SIMDE_FLOAT64_C( 436.96) },
UINT8_C(231),
{ SIMDE_FLOAT64_C( -768.07), SIMDE_FLOAT64_C( 254.39), SIMDE_FLOAT64_C( 72.83), SIMDE_FLOAT64_C( 22.53),
SIMDE_FLOAT64_C( 254.89), SIMDE_FLOAT64_C( 601.79), SIMDE_FLOAT64_C( -822.07), SIMDE_FLOAT64_C( 45.39) },
{ SIMDE_FLOAT64_C( -9.16), SIMDE_FLOAT64_C( 6.34), SIMDE_FLOAT64_C( 4.18), SIMDE_FLOAT64_C( -148.09),
SIMDE_FLOAT64_C( 104.11), SIMDE_FLOAT64_C( 8.44), SIMDE_FLOAT64_C( -9.37), SIMDE_FLOAT64_C( 3.57) } },
{ { SIMDE_FLOAT64_C( -395.27), SIMDE_FLOAT64_C( 419.05), SIMDE_FLOAT64_C( -659.50), SIMDE_FLOAT64_C( -339.16),
SIMDE_FLOAT64_C( 867.55), SIMDE_FLOAT64_C( 745.64), SIMDE_FLOAT64_C( 22.44), SIMDE_FLOAT64_C( 361.79) },
UINT8_C( 20),
{ SIMDE_FLOAT64_C( 992.29), SIMDE_FLOAT64_C( -184.33), SIMDE_FLOAT64_C( -877.19), SIMDE_FLOAT64_C( -20.21),
SIMDE_FLOAT64_C( -143.62), SIMDE_FLOAT64_C( 707.68), SIMDE_FLOAT64_C( 647.03), SIMDE_FLOAT64_C( -946.67) },
{ SIMDE_FLOAT64_C( -395.27), SIMDE_FLOAT64_C( 419.05), SIMDE_FLOAT64_C( -9.57), SIMDE_FLOAT64_C( -339.16),
SIMDE_FLOAT64_C( -5.24), SIMDE_FLOAT64_C( 745.64), SIMDE_FLOAT64_C( 22.44), SIMDE_FLOAT64_C( 361.79) } },
{ { SIMDE_FLOAT64_C( -440.41), SIMDE_FLOAT64_C( -248.87), SIMDE_FLOAT64_C( -756.57), SIMDE_FLOAT64_C( 815.92),
SIMDE_FLOAT64_C( -811.90), SIMDE_FLOAT64_C( -245.23), SIMDE_FLOAT64_C( -952.16), SIMDE_FLOAT64_C( 442.48) },
UINT8_C( 34),
{ SIMDE_FLOAT64_C( 70.37), SIMDE_FLOAT64_C( -302.63), SIMDE_FLOAT64_C( 429.40), SIMDE_FLOAT64_C( 248.30),
SIMDE_FLOAT64_C( 742.77), SIMDE_FLOAT64_C( -965.87), SIMDE_FLOAT64_C( -332.65), SIMDE_FLOAT64_C( -916.73) },
{ SIMDE_FLOAT64_C( -440.41), SIMDE_FLOAT64_C( -6.71), SIMDE_FLOAT64_C( -756.57), SIMDE_FLOAT64_C( 815.92),
SIMDE_FLOAT64_C( -811.90), SIMDE_FLOAT64_C( -9.88), SIMDE_FLOAT64_C( -952.16), SIMDE_FLOAT64_C( 442.48) } },
{ { SIMDE_FLOAT64_C( -305.03), SIMDE_FLOAT64_C( -465.11), SIMDE_FLOAT64_C( 828.91), SIMDE_FLOAT64_C( 717.41),
SIMDE_FLOAT64_C( 896.69), SIMDE_FLOAT64_C( -926.23), SIMDE_FLOAT64_C( 709.70), SIMDE_FLOAT64_C( -287.64) },
UINT8_C( 68),
{ SIMDE_FLOAT64_C( -310.50), SIMDE_FLOAT64_C( 568.74), SIMDE_FLOAT64_C( 904.26), SIMDE_FLOAT64_C( -663.47),
SIMDE_FLOAT64_C( 622.07), SIMDE_FLOAT64_C( -536.15), SIMDE_FLOAT64_C( 87.66), SIMDE_FLOAT64_C( 865.50) },
{ SIMDE_FLOAT64_C( -305.03), SIMDE_FLOAT64_C( -465.11), SIMDE_FLOAT64_C( 9.67), SIMDE_FLOAT64_C( 717.41),
SIMDE_FLOAT64_C( 896.69), SIMDE_FLOAT64_C( -926.23), SIMDE_FLOAT64_C( 4.44), SIMDE_FLOAT64_C( -287.64) } },
{ { SIMDE_FLOAT64_C( -720.23), SIMDE_FLOAT64_C( 275.76), SIMDE_FLOAT64_C( -379.73), SIMDE_FLOAT64_C( -672.39),
SIMDE_FLOAT64_C( -281.76), SIMDE_FLOAT64_C( -552.12), SIMDE_FLOAT64_C( 397.98), SIMDE_FLOAT64_C( 415.61) },
UINT8_C(204),
{ SIMDE_FLOAT64_C( -353.72), SIMDE_FLOAT64_C( 158.38), SIMDE_FLOAT64_C( 911.40), SIMDE_FLOAT64_C( 313.63),
SIMDE_FLOAT64_C( 241.65), SIMDE_FLOAT64_C( -393.63), SIMDE_FLOAT64_C( 848.52), SIMDE_FLOAT64_C( 70.56) },
{ SIMDE_FLOAT64_C( -720.23), SIMDE_FLOAT64_C( 275.76), SIMDE_FLOAT64_C( 9.70), SIMDE_FLOAT64_C( 6.79),
SIMDE_FLOAT64_C( -281.76), SIMDE_FLOAT64_C( -552.12), SIMDE_FLOAT64_C( 9.47), SIMDE_FLOAT64_C( 4.13) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_cbrt_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_cos_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.95),
SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( -0.94),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.88),
SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.49)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( -0.61),
SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 0.48),
SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( 0.85)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99), SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90), SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( -0.95),
SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.74)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47), SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23), SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -0.27),
SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( -0.99),
SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.67),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( -0.99)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( 343.48),
SIMDE_FLOAT32_C( -874.31), SIMDE_FLOAT32_C( -797.92), SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -525.83),
SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( 655.67),
SIMDE_FLOAT32_C( -70.91), SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( 120.65), SIMDE_FLOAT32_C( -171.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -0.50),
SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( -0.38),
SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( -0.60),
SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( -0.29)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -591.56),
SIMDE_FLOAT32_C( -448.89), SIMDE_FLOAT32_C( 731.49), SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 623.70),
SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( -906.16),
SIMDE_FLOAT32_C( 331.34), SIMDE_FLOAT32_C( 99.93), SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -738.19)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -0.09),
SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( -0.95), SIMDE_FLOAT32_C( 0.19),
SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( -1.00)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( -337.60), SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -768.12),
SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( 977.49),
SIMDE_FLOAT32_C( -756.42), SIMDE_FLOAT32_C( 424.81), SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( -95.15)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.17),
SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( -0.00),
SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.90),
SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.62)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 932.66), SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -125.20),
SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( 14.34), SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -696.69)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( -0.63),
SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.74)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_cos_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_cos_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 696.87),
SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( 346.63)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -976.55),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -417.54),
SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( -754.38)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -0.96),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.75),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( 346.63)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -554.19),
SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( 780.64),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 28.08)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -171.51), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( 818.66), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 254.31), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( -944.78), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( 398.82), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 339.21), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( -30.79), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( 380.46), SIMDE_FLOAT32_C( 993.90)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 0.40)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 99.93),
SIMDE_FLOAT32_C( -738.19), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 343.48), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -448.89),
SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( 331.34),
SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( -874.31),
SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( -70.91)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( -0.10),
SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -337.60),
SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( -756.42)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 897.27), SIMDE_FLOAT32_C( -197.89), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -125.20), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( -696.69), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( -768.12), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 977.49)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.17),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( -0.90)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -737.13), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 177.92),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 888.71), SIMDE_FLOAT32_C( 915.71), SIMDE_FLOAT32_C( 133.52),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( -775.04), SIMDE_FLOAT32_C( 440.64)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 496.57), SIMDE_FLOAT32_C( 915.19), SIMDE_FLOAT32_C( -718.40), SIMDE_FLOAT32_C( 159.97),
SIMDE_FLOAT32_C( -861.01), SIMDE_FLOAT32_C( 426.61), SIMDE_FLOAT32_C( 932.11), SIMDE_FLOAT32_C( 110.36),
SIMDE_FLOAT32_C( 826.84), SIMDE_FLOAT32_C( -76.75), SIMDE_FLOAT32_C( 237.58), SIMDE_FLOAT32_C( -378.50),
SIMDE_FLOAT32_C( -601.68), SIMDE_FLOAT32_C( -623.50), SIMDE_FLOAT32_C( -942.47), SIMDE_FLOAT32_C( 475.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( -0.97),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( -0.92),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 440.64)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -807.28), SIMDE_FLOAT32_C( -70.05), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 92.52), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( 834.60), SIMDE_FLOAT32_C( -65.60),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 556.35), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -964.25), SIMDE_FLOAT32_C( -406.33), SIMDE_FLOAT32_C( -743.66), SIMDE_FLOAT32_C( -764.58),
SIMDE_FLOAT32_C( 789.89), SIMDE_FLOAT32_C( 4.83), SIMDE_FLOAT32_C( -818.54), SIMDE_FLOAT32_C( 161.06),
SIMDE_FLOAT32_C( 579.25), SIMDE_FLOAT32_C( -11.78), SIMDE_FLOAT32_C( -308.52), SIMDE_FLOAT32_C( -719.57),
SIMDE_FLOAT32_C( 334.00), SIMDE_FLOAT32_C( 274.71), SIMDE_FLOAT32_C( -916.82), SIMDE_FLOAT32_C( -490.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.67),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 105.79), SIMDE_FLOAT32_C( 590.10),
SIMDE_FLOAT32_C( 30.91), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -84.00), SIMDE_FLOAT32_C( 80.04),
SIMDE_FLOAT32_C( -709.46), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -889.11)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 18.75), SIMDE_FLOAT32_C( 809.05), SIMDE_FLOAT32_C( 144.05), SIMDE_FLOAT32_C( -427.72),
SIMDE_FLOAT32_C( 308.28), SIMDE_FLOAT32_C( -177.05), SIMDE_FLOAT32_C( -457.77), SIMDE_FLOAT32_C( 678.24),
SIMDE_FLOAT32_C( 66.05), SIMDE_FLOAT32_C( -267.71), SIMDE_FLOAT32_C( 117.28), SIMDE_FLOAT32_C( -576.80),
SIMDE_FLOAT32_C( -38.39), SIMDE_FLOAT32_C( -250.14), SIMDE_FLOAT32_C( -53.92), SIMDE_FLOAT32_C( 91.94)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( 0.31),
SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -0.67)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( -493.41), SIMDE_FLOAT32_C( 822.72),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( -816.27),
SIMDE_FLOAT32_C( -209.34), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -728.70), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( 100.32), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( -204.33)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -841.43), SIMDE_FLOAT32_C( -14.16), SIMDE_FLOAT32_C( 824.88), SIMDE_FLOAT32_C( 793.63),
SIMDE_FLOAT32_C( -736.75), SIMDE_FLOAT32_C( -310.57), SIMDE_FLOAT32_C( 728.87), SIMDE_FLOAT32_C( -350.72),
SIMDE_FLOAT32_C( 60.89), SIMDE_FLOAT32_C( 109.81), SIMDE_FLOAT32_C( 715.94), SIMDE_FLOAT32_C( -250.60),
SIMDE_FLOAT32_C( 944.14), SIMDE_FLOAT32_C( 361.85), SIMDE_FLOAT32_C( -13.07), SIMDE_FLOAT32_C( 852.60)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.37),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( 0.42),
SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( -0.34)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_cos_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_cos_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( -0.47),
SIMDE_FLOAT64_C( -0.54), SIMDE_FLOAT64_C( -0.88),
SIMDE_FLOAT64_C( -0.66), SIMDE_FLOAT64_C( 0.26),
SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.49)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.30),
SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 0.95),
SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.33),
SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( -0.94)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( -0.98),
SIMDE_FLOAT64_C( 0.73), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( -0.94), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( -0.96), SIMDE_FLOAT64_C( 0.85)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.64), SIMDE_FLOAT64_C( -0.15),
SIMDE_FLOAT64_C( 0.63), SIMDE_FLOAT64_C( -0.61),
SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( 0.46),
SIMDE_FLOAT64_C( -0.88), SIMDE_FLOAT64_C( 0.48)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -770.35), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( 380.46),
SIMDE_FLOAT64_C( -770.72), SIMDE_FLOAT64_C( 993.90),
SIMDE_FLOAT64_C( 28.08), SIMDE_FLOAT64_C( 841.21)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.79), SIMDE_FLOAT64_C( -0.87),
SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( -0.95),
SIMDE_FLOAT64_C( -0.51), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( -0.98), SIMDE_FLOAT64_C( 0.74)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -387.90), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 339.21),
SIMDE_FLOAT64_C( 532.35), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -30.79)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.81)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -203.65), SIMDE_FLOAT64_C( -80.73),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -944.78),
SIMDE_FLOAT64_C( -747.59), SIMDE_FLOAT64_C( -767.23),
SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( 398.82)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( 0.58),
SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( -0.67),
SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 0.78),
SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( -0.99)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 469.66), SIMDE_FLOAT64_C( 680.02),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 600.47),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( 254.31)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( 0.13),
SIMDE_FLOAT64_C( -0.51), SIMDE_FLOAT64_C( -0.27),
SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( -0.91),
SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( -0.99)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_cos_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_cos_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -686.13), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 670.24), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( 39.01), SIMDE_FLOAT64_C( 346.63)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 84.77),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( -269.45),
SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( -297.45),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( -754.38)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -0.66), SIMDE_FLOAT64_C( 0.92)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( 233.37),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -384.03),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -417.54)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 841.21), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 687.09), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -660.80), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -923.64), SIMDE_FLOAT64_C( -860.95)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( -0.15),
SIMDE_FLOAT64_C( -0.61), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -0.98),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( 0.99)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 398.82), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 339.21), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( -30.79), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( 993.90)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( -387.90),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 532.35),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -770.35),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( -770.72)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( -0.09),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.15),
SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( -0.79),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( -0.51)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( 469.66),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 910.03),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( -203.65),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -747.59)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 543.35), SIMDE_FLOAT64_C( -171.51),
SIMDE_FLOAT64_C( 680.02), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 600.47), SIMDE_FLOAT64_C( 254.31),
SIMDE_FLOAT64_C( -80.73), SIMDE_FLOAT64_C( -944.78)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( -0.29),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( -0.27),
SIMDE_FLOAT64_C( -0.91), SIMDE_FLOAT64_C( -0.99),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -0.67)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 99.93), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 343.48),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 655.67)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 331.34), SIMDE_FLOAT64_C( 462.95),
SIMDE_FLOAT64_C( -178.99), SIMDE_FLOAT64_C( 324.62),
SIMDE_FLOAT64_C( -874.31), SIMDE_FLOAT64_C( -328.54),
SIMDE_FLOAT64_C( -192.31), SIMDE_FLOAT64_C( 561.36)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( -0.51),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( -0.55)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 27.25),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -448.89), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 831.02), SIMDE_FLOAT64_C( 977.36)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 977.49), SIMDE_FLOAT64_C( 424.81),
SIMDE_FLOAT64_C( -95.15), SIMDE_FLOAT64_C( 840.65),
SIMDE_FLOAT64_C( -591.56), SIMDE_FLOAT64_C( 731.49),
SIMDE_FLOAT64_C( 623.70), SIMDE_FLOAT64_C( 140.67)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( -0.76)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( -304.73),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( 822.06),
SIMDE_FLOAT64_C( -997.63), SIMDE_FLOAT64_C( 923.64),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -67.64)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 510.85), SIMDE_FLOAT64_C( 14.34),
SIMDE_FLOAT64_C( 916.26), SIMDE_FLOAT64_C( -769.09),
SIMDE_FLOAT64_C( -573.81), SIMDE_FLOAT64_C( -337.60),
SIMDE_FLOAT64_C( 293.64), SIMDE_FLOAT64_C( -576.22)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( -0.20),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( -0.83),
SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( -0.12),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -0.26)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 475.51), SIMDE_FLOAT64_C( 936.65),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -438.19),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( 932.66),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -182.45)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -775.04), SIMDE_FLOAT64_C( 440.64),
SIMDE_FLOAT64_C( 897.27), SIMDE_FLOAT64_C( -197.89),
SIMDE_FLOAT64_C( -359.76), SIMDE_FLOAT64_C( -33.67),
SIMDE_FLOAT64_C( 7.27), SIMDE_FLOAT64_C( -125.20)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.60), SIMDE_FLOAT64_C( 0.68),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -1.00),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( -0.63),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( 0.89)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_cos_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_cosd_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.97)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.83)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -0.31)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( -0.85)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.92)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( -0.92)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( 0.51)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 0.84)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_cosd_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_cosd_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.97)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( 0.78)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.83)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.74), SIMDE_FLOAT64_C( 0.65)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( -0.31)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( -0.85)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( 0.83)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_cosd_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_cosd_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24),
SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01),
SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 0.65),
SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.83),
SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.97)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13),
SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21),
SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.83),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( -0.85),
SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -0.31)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47),
SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95),
SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.88),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( -0.92),
SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -0.78),
SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.92)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67),
SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54),
SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.84),
SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( 0.51)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48),
SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90),
SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.07),
SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( -0.52)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92),
SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.10),
SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.86)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73),
SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.71),
SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.78)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02),
SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 0.77),
SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( -0.15),
SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.49),
SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( -0.27)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_cosd_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_cosd_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( 0.78),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.97)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.74), SIMDE_FLOAT64_C( 0.65),
SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.83)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( -0.31)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( 0.83),
SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( -0.85)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( -0.78),
SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( 0.92)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 0.88),
SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( -0.92)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( -0.84),
SIMDE_FLOAT64_C( -0.23), SIMDE_FLOAT64_C( 0.51)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -0.01),
SIMDE_FLOAT64_C( -0.60), SIMDE_FLOAT64_C( 0.84)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_cosd_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_cosd_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( -0.85),
SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -0.31),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.83),
SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.97)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( 0.51),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( -0.92),
SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.92)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99), SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90), SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.86),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( -0.52)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47), SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23), SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( -0.15),
SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( -0.27),
SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.71),
SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.78)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( 343.48),
SIMDE_FLOAT32_C( -874.31), SIMDE_FLOAT32_C( -797.92), SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -525.83),
SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( 655.67),
SIMDE_FLOAT32_C( -70.91), SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( 120.65), SIMDE_FLOAT32_C( -171.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( -0.97),
SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.43),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -0.99)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -591.56),
SIMDE_FLOAT32_C( -448.89), SIMDE_FLOAT32_C( 731.49), SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 623.70),
SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( -906.16),
SIMDE_FLOAT32_C( 331.34), SIMDE_FLOAT32_C( 99.93), SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -738.19)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.62),
SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( -0.11),
SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( -0.99),
SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( 0.95)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( -337.60), SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -768.12),
SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( 977.49),
SIMDE_FLOAT32_C( -756.42), SIMDE_FLOAT32_C( 424.81), SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( -95.15)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( 0.13),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( -0.81), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.22),
SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( -0.09)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 932.66), SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -125.20),
SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( 14.34), SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -696.69)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.83),
SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.58),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( 0.92)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_cosd_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_cosd_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 696.87),
SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( 346.63)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -976.55),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -417.54),
SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( -754.38)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 0.54),
SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 346.63)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -554.19),
SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( 780.64),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 28.08)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -171.51), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( 818.66), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 254.31), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( -944.78), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( 398.82), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 339.21), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( -30.79), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( 380.46), SIMDE_FLOAT32_C( 993.90)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.10),
SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 0.07)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 99.93),
SIMDE_FLOAT32_C( -738.19), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 343.48), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -448.89),
SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( 331.34),
SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( -874.31),
SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( -70.91)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 0.88),
SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -337.60),
SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( -756.42)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 897.27), SIMDE_FLOAT32_C( -197.89), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -125.20), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( -696.69), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( -768.12), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 977.49)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( 0.13),
SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( -0.22)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -737.13), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 177.92),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 888.71), SIMDE_FLOAT32_C( 915.71), SIMDE_FLOAT32_C( 133.52),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( -775.04), SIMDE_FLOAT32_C( 440.64)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 496.57), SIMDE_FLOAT32_C( 915.19), SIMDE_FLOAT32_C( -718.40), SIMDE_FLOAT32_C( 159.97),
SIMDE_FLOAT32_C( -861.01), SIMDE_FLOAT32_C( 426.61), SIMDE_FLOAT32_C( 932.11), SIMDE_FLOAT32_C( 110.36),
SIMDE_FLOAT32_C( 826.84), SIMDE_FLOAT32_C( -76.75), SIMDE_FLOAT32_C( 237.58), SIMDE_FLOAT32_C( -378.50),
SIMDE_FLOAT32_C( -601.68), SIMDE_FLOAT32_C( -623.50), SIMDE_FLOAT32_C( -942.47), SIMDE_FLOAT32_C( 475.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( -0.94),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( -0.35),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 440.64)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -807.28), SIMDE_FLOAT32_C( -70.05), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 92.52), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( 834.60), SIMDE_FLOAT32_C( -65.60),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 556.35), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -964.25), SIMDE_FLOAT32_C( -406.33), SIMDE_FLOAT32_C( -743.66), SIMDE_FLOAT32_C( -764.58),
SIMDE_FLOAT32_C( 789.89), SIMDE_FLOAT32_C( 4.83), SIMDE_FLOAT32_C( -818.54), SIMDE_FLOAT32_C( 161.06),
SIMDE_FLOAT32_C( 579.25), SIMDE_FLOAT32_C( -11.78), SIMDE_FLOAT32_C( -308.52), SIMDE_FLOAT32_C( -719.57),
SIMDE_FLOAT32_C( 334.00), SIMDE_FLOAT32_C( 274.71), SIMDE_FLOAT32_C( -916.82), SIMDE_FLOAT32_C( -490.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.95),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 105.79), SIMDE_FLOAT32_C( 590.10),
SIMDE_FLOAT32_C( 30.91), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -84.00), SIMDE_FLOAT32_C( 80.04),
SIMDE_FLOAT32_C( -709.46), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -889.11)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 18.75), SIMDE_FLOAT32_C( 809.05), SIMDE_FLOAT32_C( 144.05), SIMDE_FLOAT32_C( -427.72),
SIMDE_FLOAT32_C( 308.28), SIMDE_FLOAT32_C( -177.05), SIMDE_FLOAT32_C( -457.77), SIMDE_FLOAT32_C( 678.24),
SIMDE_FLOAT32_C( 66.05), SIMDE_FLOAT32_C( -267.71), SIMDE_FLOAT32_C( 117.28), SIMDE_FLOAT32_C( -576.80),
SIMDE_FLOAT32_C( -38.39), SIMDE_FLOAT32_C( -250.14), SIMDE_FLOAT32_C( -53.92), SIMDE_FLOAT32_C( 91.94)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( 0.75),
SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -0.80),
SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -0.03)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( -493.41), SIMDE_FLOAT32_C( 822.72),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( -816.27),
SIMDE_FLOAT32_C( -209.34), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -728.70), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( 100.32), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( -204.33)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -841.43), SIMDE_FLOAT32_C( -14.16), SIMDE_FLOAT32_C( 824.88), SIMDE_FLOAT32_C( 793.63),
SIMDE_FLOAT32_C( -736.75), SIMDE_FLOAT32_C( -310.57), SIMDE_FLOAT32_C( 728.87), SIMDE_FLOAT32_C( -350.72),
SIMDE_FLOAT32_C( 60.89), SIMDE_FLOAT32_C( 109.81), SIMDE_FLOAT32_C( 715.94), SIMDE_FLOAT32_C( -250.60),
SIMDE_FLOAT32_C( 944.14), SIMDE_FLOAT32_C( 361.85), SIMDE_FLOAT32_C( -13.07), SIMDE_FLOAT32_C( 852.60)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 0.28),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( -0.68)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_cosd_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_cosd_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.74), SIMDE_FLOAT64_C( 0.65),
SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.83),
SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( 0.78),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.97)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( 0.83),
SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( -0.85),
SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( -0.31)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 0.88),
SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( -0.92),
SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( -0.78),
SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( 0.92)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -0.01),
SIMDE_FLOAT64_C( -0.60), SIMDE_FLOAT64_C( 0.84),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( -0.84),
SIMDE_FLOAT64_C( -0.23), SIMDE_FLOAT64_C( 0.51)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -770.35), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( 380.46),
SIMDE_FLOAT64_C( -770.72), SIMDE_FLOAT64_C( 993.90),
SIMDE_FLOAT64_C( 28.08), SIMDE_FLOAT64_C( 841.21)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.64), SIMDE_FLOAT64_C( 0.11),
SIMDE_FLOAT64_C( -0.72), SIMDE_FLOAT64_C( 0.94),
SIMDE_FLOAT64_C( 0.63), SIMDE_FLOAT64_C( 0.07),
SIMDE_FLOAT64_C( 0.88), SIMDE_FLOAT64_C( -0.52)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -387.90), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 339.21),
SIMDE_FLOAT64_C( 532.35), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -30.79)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.88), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 0.93),
SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( -0.10),
SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( 0.86)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -203.65), SIMDE_FLOAT64_C( -80.73),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -944.78),
SIMDE_FLOAT64_C( -747.59), SIMDE_FLOAT64_C( -767.23),
SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( 398.82)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( 0.16),
SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( -0.71),
SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( 0.68),
SIMDE_FLOAT64_C( -0.97), SIMDE_FLOAT64_C( 0.78)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 469.66), SIMDE_FLOAT64_C( 680.02),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 600.47),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( 254.31)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.34), SIMDE_FLOAT64_C( 0.77),
SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( -0.15),
SIMDE_FLOAT64_C( -0.98), SIMDE_FLOAT64_C( -0.49),
SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( -0.27)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_cosd_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_cosd_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -686.13), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 670.24), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( 39.01), SIMDE_FLOAT64_C( 346.63)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 84.77),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( -269.45),
SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( -297.45),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( -754.38)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( -0.74), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( 0.83)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( 233.37),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -384.03),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -417.54)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 841.21), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 687.09), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -660.80), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -923.64), SIMDE_FLOAT64_C( -860.95)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.52), SIMDE_FLOAT64_C( -0.01),
SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 0.88),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -0.78)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 398.82), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 339.21), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( -30.79), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( 993.90)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( -387.90),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 532.35),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -770.35),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( -770.72)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.97), SIMDE_FLOAT64_C( 0.88),
SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( -0.99),
SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( 0.64),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( 0.63)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( 469.66),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 910.03),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( -203.65),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -747.59)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 543.35), SIMDE_FLOAT64_C( -171.51),
SIMDE_FLOAT64_C( 680.02), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 600.47), SIMDE_FLOAT64_C( 254.31),
SIMDE_FLOAT64_C( -80.73), SIMDE_FLOAT64_C( -944.78)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( -0.99),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( -0.15),
SIMDE_FLOAT64_C( -0.49), SIMDE_FLOAT64_C( -0.27),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -0.71)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 99.93), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 343.48),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 655.67)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 331.34), SIMDE_FLOAT64_C( 462.95),
SIMDE_FLOAT64_C( -178.99), SIMDE_FLOAT64_C( 324.62),
SIMDE_FLOAT64_C( -874.31), SIMDE_FLOAT64_C( -328.54),
SIMDE_FLOAT64_C( -192.31), SIMDE_FLOAT64_C( 561.36)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.88), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( -0.93)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 27.25),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -448.89), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 831.02), SIMDE_FLOAT64_C( 977.36)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 977.49), SIMDE_FLOAT64_C( 424.81),
SIMDE_FLOAT64_C( -95.15), SIMDE_FLOAT64_C( 840.65),
SIMDE_FLOAT64_C( -591.56), SIMDE_FLOAT64_C( 731.49),
SIMDE_FLOAT64_C( 623.70), SIMDE_FLOAT64_C( 140.67)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 0.43),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -0.62), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( -0.77)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( -304.73),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( 822.06),
SIMDE_FLOAT64_C( -997.63), SIMDE_FLOAT64_C( 923.64),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -67.64)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 510.85), SIMDE_FLOAT64_C( 14.34),
SIMDE_FLOAT64_C( 916.26), SIMDE_FLOAT64_C( -769.09),
SIMDE_FLOAT64_C( -573.81), SIMDE_FLOAT64_C( -337.60),
SIMDE_FLOAT64_C( 293.64), SIMDE_FLOAT64_C( -576.22)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( 0.97),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( 0.65),
SIMDE_FLOAT64_C( -0.83), SIMDE_FLOAT64_C( 0.92),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -0.81)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 475.51), SIMDE_FLOAT64_C( 936.65),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -438.19),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( 932.66),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -182.45)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -775.04), SIMDE_FLOAT64_C( 440.64),
SIMDE_FLOAT64_C( 897.27), SIMDE_FLOAT64_C( -197.89),
SIMDE_FLOAT64_C( -359.76), SIMDE_FLOAT64_C( -33.67),
SIMDE_FLOAT64_C( 7.27), SIMDE_FLOAT64_C( -125.20)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( 0.16),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -0.95),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( 0.83),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -0.58)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_cosd_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_cosh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 3.48), SIMDE_FLOAT32_C( 4.71), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 6.41)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 16.25), SIMDE_FLOAT32_C( 55.53), SIMDE_FLOAT32_C( 1.06), SIMDE_FLOAT32_C( 303.95)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 7.24), SIMDE_FLOAT32_C( 8.19), SIMDE_FLOAT32_C( 2.86), SIMDE_FLOAT32_C( 4.69)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 697.05), SIMDE_FLOAT32_C( 1802.36), SIMDE_FLOAT32_C( 8.76), SIMDE_FLOAT32_C( 54.43)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 9.04), SIMDE_FLOAT32_C( 6.82), SIMDE_FLOAT32_C( 3.02), SIMDE_FLOAT32_C( 7.07)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 4216.89), SIMDE_FLOAT32_C( 457.99), SIMDE_FLOAT32_C( 10.27), SIMDE_FLOAT32_C( 588.07)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 4.97), SIMDE_FLOAT32_C( 7.64)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( 72.02), SIMDE_FLOAT32_C( 1039.87)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 2.82), SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( 2.20), SIMDE_FLOAT32_C( 8.33)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 8.42), SIMDE_FLOAT32_C( 1.03), SIMDE_FLOAT32_C( 4.57), SIMDE_FLOAT32_C( 2073.21)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( 4.66), SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( -0.58)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 52.82), SIMDE_FLOAT32_C( 5.50), SIMDE_FLOAT32_C( 1.17)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 5.94), SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( 0.87)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 189.97), SIMDE_FLOAT32_C( 13.99), SIMDE_FLOAT32_C( 1.40), SIMDE_FLOAT32_C( 1.40)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 5.48), SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( 5.78), SIMDE_FLOAT32_C( 8.28)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 119.93), SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( 161.88), SIMDE_FLOAT32_C( 1972.10)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_cosh_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_cosh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.26), SIMDE_FLOAT64_C( 3.04)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.03), SIMDE_FLOAT64_C( 10.48)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 1.44), SIMDE_FLOAT64_C( 2.12)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 2.23), SIMDE_FLOAT64_C( 4.23)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 1.11), SIMDE_FLOAT64_C( 2.10)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.68), SIMDE_FLOAT64_C( 4.14)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 3.49), SIMDE_FLOAT64_C( 4.01)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 16.41), SIMDE_FLOAT64_C( 27.58)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 1.19), SIMDE_FLOAT64_C( 3.40)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.80), SIMDE_FLOAT64_C( 15.00)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 4.48), SIMDE_FLOAT64_C( 3.27)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 44.12), SIMDE_FLOAT64_C( 13.17)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 2.25), SIMDE_FLOAT64_C( 3.71)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 4.80), SIMDE_FLOAT64_C( 20.44)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( -0.06)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_cosh_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_cosh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 7.24), SIMDE_FLOAT32_C( 8.19),
SIMDE_FLOAT32_C( 2.86), SIMDE_FLOAT32_C( 4.69),
SIMDE_FLOAT32_C( 3.48), SIMDE_FLOAT32_C( 4.71),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 6.41)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 697.05), SIMDE_FLOAT32_C( 1802.36),
SIMDE_FLOAT32_C( 8.76), SIMDE_FLOAT32_C( 54.43),
SIMDE_FLOAT32_C( 16.25), SIMDE_FLOAT32_C( 55.53),
SIMDE_FLOAT32_C( 1.06), SIMDE_FLOAT32_C( 303.95)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 4.97), SIMDE_FLOAT32_C( 7.64),
SIMDE_FLOAT32_C( 9.04), SIMDE_FLOAT32_C( 6.82),
SIMDE_FLOAT32_C( 3.02), SIMDE_FLOAT32_C( 7.07)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 1.28),
SIMDE_FLOAT32_C( 72.02), SIMDE_FLOAT32_C( 1039.87),
SIMDE_FLOAT32_C( 4216.89), SIMDE_FLOAT32_C( 457.99),
SIMDE_FLOAT32_C( 10.27), SIMDE_FLOAT32_C( 588.07)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( 4.66),
SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( -0.58),
SIMDE_FLOAT32_C( 2.82), SIMDE_FLOAT32_C( -0.24),
SIMDE_FLOAT32_C( 2.20), SIMDE_FLOAT32_C( 8.33)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 52.82),
SIMDE_FLOAT32_C( 5.50), SIMDE_FLOAT32_C( 1.17),
SIMDE_FLOAT32_C( 8.42), SIMDE_FLOAT32_C( 1.03),
SIMDE_FLOAT32_C( 4.57), SIMDE_FLOAT32_C( 2073.21)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 5.48), SIMDE_FLOAT32_C( 2.02),
SIMDE_FLOAT32_C( 5.78), SIMDE_FLOAT32_C( 8.28),
SIMDE_FLOAT32_C( 5.94), SIMDE_FLOAT32_C( 3.33),
SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( 0.87)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 119.93), SIMDE_FLOAT32_C( 3.84),
SIMDE_FLOAT32_C( 161.88), SIMDE_FLOAT32_C( 1972.10),
SIMDE_FLOAT32_C( 189.97), SIMDE_FLOAT32_C( 13.99),
SIMDE_FLOAT32_C( 1.40), SIMDE_FLOAT32_C( 1.40)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 6.94),
SIMDE_FLOAT32_C( 1.29), SIMDE_FLOAT32_C( 6.59),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 9.97),
SIMDE_FLOAT32_C( 4.65), SIMDE_FLOAT32_C( 9.13)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.03), SIMDE_FLOAT32_C( 516.39),
SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 363.89),
SIMDE_FLOAT32_C( 1.03), SIMDE_FLOAT32_C( 10687.75),
SIMDE_FLOAT32_C( 52.30), SIMDE_FLOAT32_C( 4614.01)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 6.68),
SIMDE_FLOAT32_C( 8.11), SIMDE_FLOAT32_C( 6.37),
SIMDE_FLOAT32_C( 7.43), SIMDE_FLOAT32_C( 3.05),
SIMDE_FLOAT32_C( 8.79), SIMDE_FLOAT32_C( 4.33)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 5.40), SIMDE_FLOAT32_C( 398.16),
SIMDE_FLOAT32_C( 1663.79), SIMDE_FLOAT32_C( 292.03),
SIMDE_FLOAT32_C( 842.90), SIMDE_FLOAT32_C( 10.58),
SIMDE_FLOAT32_C( 3284.12), SIMDE_FLOAT32_C( 37.98)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.38), SIMDE_FLOAT32_C( 4.06),
SIMDE_FLOAT32_C( 6.35), SIMDE_FLOAT32_C( -0.70),
SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.28),
SIMDE_FLOAT32_C( 1.45), SIMDE_FLOAT32_C( 6.69)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 14.70), SIMDE_FLOAT32_C( 29.00),
SIMDE_FLOAT32_C( 286.25), SIMDE_FLOAT32_C( 1.26),
SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 1.04),
SIMDE_FLOAT32_C( 2.25), SIMDE_FLOAT32_C( 402.16)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 7.08), SIMDE_FLOAT32_C( 8.24),
SIMDE_FLOAT32_C( 3.68), SIMDE_FLOAT32_C( 9.00),
SIMDE_FLOAT32_C( 9.51), SIMDE_FLOAT32_C( 7.80),
SIMDE_FLOAT32_C( 8.85), SIMDE_FLOAT32_C( 5.90)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 593.98), SIMDE_FLOAT32_C( 1894.77),
SIMDE_FLOAT32_C( 19.84), SIMDE_FLOAT32_C( 4051.54),
SIMDE_FLOAT32_C( 6747.00), SIMDE_FLOAT32_C( 1220.30),
SIMDE_FLOAT32_C( 3487.20), SIMDE_FLOAT32_C( 182.52)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_cosh_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_cosh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.44), SIMDE_FLOAT64_C( 2.12),
SIMDE_FLOAT64_C( -0.26), SIMDE_FLOAT64_C( 3.04)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.23), SIMDE_FLOAT64_C( 4.23),
SIMDE_FLOAT64_C( 1.03), SIMDE_FLOAT64_C( 10.48)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.49), SIMDE_FLOAT64_C( 4.01),
SIMDE_FLOAT64_C( 1.11), SIMDE_FLOAT64_C( 2.10)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 16.41), SIMDE_FLOAT64_C( 27.58),
SIMDE_FLOAT64_C( 1.68), SIMDE_FLOAT64_C( 4.14)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 4.48), SIMDE_FLOAT64_C( 3.27),
SIMDE_FLOAT64_C( 1.19), SIMDE_FLOAT64_C( 3.40)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 44.12), SIMDE_FLOAT64_C( 13.17),
SIMDE_FLOAT64_C( 1.80), SIMDE_FLOAT64_C( 15.00)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( -0.06),
SIMDE_FLOAT64_C( 2.25), SIMDE_FLOAT64_C( 3.71)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 4.80), SIMDE_FLOAT64_C( 20.44)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( -0.58),
SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( 4.09)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.64), SIMDE_FLOAT64_C( 1.17),
SIMDE_FLOAT64_C( 1.29), SIMDE_FLOAT64_C( 29.88)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 2.09),
SIMDE_FLOAT64_C( 0.85), SIMDE_FLOAT64_C( -0.77)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.23), SIMDE_FLOAT64_C( 4.10),
SIMDE_FLOAT64_C( 1.38), SIMDE_FLOAT64_C( 1.31)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.78), SIMDE_FLOAT64_C( 1.36),
SIMDE_FLOAT64_C( -0.93), SIMDE_FLOAT64_C( 0.02)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 8.09), SIMDE_FLOAT64_C( 2.08),
SIMDE_FLOAT64_C( 1.46), SIMDE_FLOAT64_C( 1.00)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.53), SIMDE_FLOAT64_C( 0.65),
SIMDE_FLOAT64_C( 2.70), SIMDE_FLOAT64_C( 4.06)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 6.32), SIMDE_FLOAT64_C( 1.22),
SIMDE_FLOAT64_C( 7.47), SIMDE_FLOAT64_C( 29.00)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_cosh_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_cosh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 4.97), SIMDE_FLOAT32_C( 7.64),
SIMDE_FLOAT32_C( 9.04), SIMDE_FLOAT32_C( 6.82), SIMDE_FLOAT32_C( 3.02), SIMDE_FLOAT32_C( 7.07),
SIMDE_FLOAT32_C( 7.24), SIMDE_FLOAT32_C( 8.19), SIMDE_FLOAT32_C( 2.86), SIMDE_FLOAT32_C( 4.69),
SIMDE_FLOAT32_C( 3.48), SIMDE_FLOAT32_C( 4.71), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 6.41)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( 72.02), SIMDE_FLOAT32_C( 1039.87),
SIMDE_FLOAT32_C( 4216.89), SIMDE_FLOAT32_C( 457.99), SIMDE_FLOAT32_C( 10.27), SIMDE_FLOAT32_C( 588.07),
SIMDE_FLOAT32_C( 697.05), SIMDE_FLOAT32_C( 1802.36), SIMDE_FLOAT32_C( 8.76), SIMDE_FLOAT32_C( 54.43),
SIMDE_FLOAT32_C( 16.25), SIMDE_FLOAT32_C( 55.53), SIMDE_FLOAT32_C( 1.06), SIMDE_FLOAT32_C( 303.95)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.48), SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( 5.78), SIMDE_FLOAT32_C( 8.28),
SIMDE_FLOAT32_C( 5.94), SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( 0.87),
SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( 4.66), SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( -0.58),
SIMDE_FLOAT32_C( 2.82), SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( 2.20), SIMDE_FLOAT32_C( 8.33)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 119.93), SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( 161.88), SIMDE_FLOAT32_C( 1972.10),
SIMDE_FLOAT32_C( 189.97), SIMDE_FLOAT32_C( 13.99), SIMDE_FLOAT32_C( 1.40), SIMDE_FLOAT32_C( 1.40),
SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 52.82), SIMDE_FLOAT32_C( 5.50), SIMDE_FLOAT32_C( 1.17),
SIMDE_FLOAT32_C( 8.42), SIMDE_FLOAT32_C( 1.03), SIMDE_FLOAT32_C( 4.57), SIMDE_FLOAT32_C( 2073.21)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 6.68), SIMDE_FLOAT32_C( 8.11), SIMDE_FLOAT32_C( 6.37),
SIMDE_FLOAT32_C( 7.43), SIMDE_FLOAT32_C( 3.05), SIMDE_FLOAT32_C( 8.79), SIMDE_FLOAT32_C( 4.33),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 6.94), SIMDE_FLOAT32_C( 1.29), SIMDE_FLOAT32_C( 6.59),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 9.97), SIMDE_FLOAT32_C( 4.65), SIMDE_FLOAT32_C( 9.13)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.40), SIMDE_FLOAT32_C( 398.16), SIMDE_FLOAT32_C( 1663.79), SIMDE_FLOAT32_C( 292.03),
SIMDE_FLOAT32_C( 842.90), SIMDE_FLOAT32_C( 10.58), SIMDE_FLOAT32_C( 3284.12), SIMDE_FLOAT32_C( 37.98),
SIMDE_FLOAT32_C( 1.03), SIMDE_FLOAT32_C( 516.39), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 363.89),
SIMDE_FLOAT32_C( 1.03), SIMDE_FLOAT32_C( 10687.75), SIMDE_FLOAT32_C( 52.30), SIMDE_FLOAT32_C( 4614.01)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.08), SIMDE_FLOAT32_C( 8.24), SIMDE_FLOAT32_C( 3.68), SIMDE_FLOAT32_C( 9.00),
SIMDE_FLOAT32_C( 9.51), SIMDE_FLOAT32_C( 7.80), SIMDE_FLOAT32_C( 8.85), SIMDE_FLOAT32_C( 5.90),
SIMDE_FLOAT32_C( 3.38), SIMDE_FLOAT32_C( 4.06), SIMDE_FLOAT32_C( 6.35), SIMDE_FLOAT32_C( -0.70),
SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 1.45), SIMDE_FLOAT32_C( 6.69)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 593.98), SIMDE_FLOAT32_C( 1894.77), SIMDE_FLOAT32_C( 19.84), SIMDE_FLOAT32_C( 4051.54),
SIMDE_FLOAT32_C( 6747.00), SIMDE_FLOAT32_C( 1220.30), SIMDE_FLOAT32_C( 3487.20), SIMDE_FLOAT32_C( 182.52),
SIMDE_FLOAT32_C( 14.70), SIMDE_FLOAT32_C( 29.00), SIMDE_FLOAT32_C( 286.25), SIMDE_FLOAT32_C( 1.26),
SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 2.25), SIMDE_FLOAT32_C( 402.16)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.52), SIMDE_FLOAT32_C( 8.67), SIMDE_FLOAT32_C( 6.29), SIMDE_FLOAT32_C( 6.39),
SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 2.69), SIMDE_FLOAT32_C( 1.61),
SIMDE_FLOAT32_C( 3.44), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( 7.59), SIMDE_FLOAT32_C( 8.11),
SIMDE_FLOAT32_C( 4.11), SIMDE_FLOAT32_C( 7.49), SIMDE_FLOAT32_C( 5.16), SIMDE_FLOAT32_C( 3.56)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 16.91), SIMDE_FLOAT32_C( 2912.75), SIMDE_FLOAT32_C( 269.58), SIMDE_FLOAT32_C( 297.93),
SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( 1.01), SIMDE_FLOAT32_C( 7.40), SIMDE_FLOAT32_C( 2.60),
SIMDE_FLOAT32_C( 15.61), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 989.16), SIMDE_FLOAT32_C( 1663.79),
SIMDE_FLOAT32_C( 30.48), SIMDE_FLOAT32_C( 895.03), SIMDE_FLOAT32_C( 87.09), SIMDE_FLOAT32_C( 17.60)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 8.30), SIMDE_FLOAT32_C( 9.12), SIMDE_FLOAT32_C( 4.38), SIMDE_FLOAT32_C( 1.25),
SIMDE_FLOAT32_C( 2.03), SIMDE_FLOAT32_C( 8.52), SIMDE_FLOAT32_C( 7.28), SIMDE_FLOAT32_C( 7.93),
SIMDE_FLOAT32_C( 9.07), SIMDE_FLOAT32_C( 5.27), SIMDE_FLOAT32_C( 9.88), SIMDE_FLOAT32_C( -0.48),
SIMDE_FLOAT32_C( 6.32), SIMDE_FLOAT32_C( 5.05), SIMDE_FLOAT32_C( 7.05), SIMDE_FLOAT32_C( 0.44)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2011.94), SIMDE_FLOAT32_C( 4568.10), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 1.89),
SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 2507.03), SIMDE_FLOAT32_C( 725.49), SIMDE_FLOAT32_C( 1389.71),
SIMDE_FLOAT32_C( 4345.31), SIMDE_FLOAT32_C( 97.21), SIMDE_FLOAT32_C( 9767.86), SIMDE_FLOAT32_C( 1.12),
SIMDE_FLOAT32_C( 277.79), SIMDE_FLOAT32_C( 78.01), SIMDE_FLOAT32_C( 576.43), SIMDE_FLOAT32_C( 1.10)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 9.02), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( -0.99),
SIMDE_FLOAT32_C( 2.64), SIMDE_FLOAT32_C( 9.58), SIMDE_FLOAT32_C( 6.12), SIMDE_FLOAT32_C( 0.28),
SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( 4.13), SIMDE_FLOAT32_C( 8.41), SIMDE_FLOAT32_C( 9.88),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 6.84), SIMDE_FLOAT32_C( 4.65), SIMDE_FLOAT32_C( 3.98)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 4133.39), SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( 1.53),
SIMDE_FLOAT32_C( 7.04), SIMDE_FLOAT32_C( 7236.21), SIMDE_FLOAT32_C( 227.43), SIMDE_FLOAT32_C( 1.04),
SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( 31.10), SIMDE_FLOAT32_C( 2245.88), SIMDE_FLOAT32_C( 9767.86),
SIMDE_FLOAT32_C( 1.06), SIMDE_FLOAT32_C( 467.25), SIMDE_FLOAT32_C( 52.30), SIMDE_FLOAT32_C( 26.77)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.09), SIMDE_FLOAT32_C( 2.52), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 4.31),
SIMDE_FLOAT32_C( 9.63), SIMDE_FLOAT32_C( 4.54), SIMDE_FLOAT32_C( 2.70), SIMDE_FLOAT32_C( 3.81),
SIMDE_FLOAT32_C( 3.50), SIMDE_FLOAT32_C( 4.72), SIMDE_FLOAT32_C( 7.31), SIMDE_FLOAT32_C( 6.67),
SIMDE_FLOAT32_C( 4.58), SIMDE_FLOAT32_C( 2.82), SIMDE_FLOAT32_C( 9.54), SIMDE_FLOAT32_C( 0.67)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.10), SIMDE_FLOAT32_C( 6.25), SIMDE_FLOAT32_C( 1.07), SIMDE_FLOAT32_C( 37.23),
SIMDE_FLOAT32_C( 7607.22), SIMDE_FLOAT32_C( 46.85), SIMDE_FLOAT32_C( 7.47), SIMDE_FLOAT32_C( 22.59),
SIMDE_FLOAT32_C( 16.57), SIMDE_FLOAT32_C( 56.09), SIMDE_FLOAT32_C( 747.59), SIMDE_FLOAT32_C( 394.20),
SIMDE_FLOAT32_C( 48.76), SIMDE_FLOAT32_C( 8.42), SIMDE_FLOAT32_C( 6952.47), SIMDE_FLOAT32_C( 1.23)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_cosh_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_cosh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( 8.28), SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 0.87),
SIMDE_FLOAT32_C( 4.66), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( 8.33),
SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 7.64), SIMDE_FLOAT32_C( 6.82), SIMDE_FLOAT32_C( 7.07),
SIMDE_FLOAT32_C( 8.19), SIMDE_FLOAT32_C( 4.69), SIMDE_FLOAT32_C( 4.71), SIMDE_FLOAT32_C( 6.41)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.48), SIMDE_FLOAT32_C( 5.78), SIMDE_FLOAT32_C( 5.94), SIMDE_FLOAT32_C( -0.87),
SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( 2.82), SIMDE_FLOAT32_C( 2.20),
SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 4.97), SIMDE_FLOAT32_C( 9.04), SIMDE_FLOAT32_C( 3.02),
SIMDE_FLOAT32_C( 7.24), SIMDE_FLOAT32_C( 2.86), SIMDE_FLOAT32_C( 3.48), SIMDE_FLOAT32_C( 0.35)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 119.93), SIMDE_FLOAT32_C( 8.28), SIMDE_FLOAT32_C( 189.97), SIMDE_FLOAT32_C( 0.87),
SIMDE_FLOAT32_C( 4.66), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( 4.57),
SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 72.02), SIMDE_FLOAT32_C( 4216.89), SIMDE_FLOAT32_C( 10.27),
SIMDE_FLOAT32_C( 697.05), SIMDE_FLOAT32_C( 4.69), SIMDE_FLOAT32_C( 16.25), SIMDE_FLOAT32_C( 6.41)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.08), SIMDE_FLOAT32_C( 3.68), SIMDE_FLOAT32_C( 9.51), SIMDE_FLOAT32_C( 8.85),
SIMDE_FLOAT32_C( 3.38), SIMDE_FLOAT32_C( 6.35), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 1.45),
SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 8.11), SIMDE_FLOAT32_C( 7.43), SIMDE_FLOAT32_C( 8.79),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 1.29), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 4.65)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.56), SIMDE_FLOAT32_C( 8.24), SIMDE_FLOAT32_C( 9.00), SIMDE_FLOAT32_C( 7.80),
SIMDE_FLOAT32_C( 5.90), SIMDE_FLOAT32_C( 4.06), SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( 0.28),
SIMDE_FLOAT32_C( 6.69), SIMDE_FLOAT32_C( 6.68), SIMDE_FLOAT32_C( 6.37), SIMDE_FLOAT32_C( 3.05),
SIMDE_FLOAT32_C( 4.33), SIMDE_FLOAT32_C( 6.94), SIMDE_FLOAT32_C( 6.59), SIMDE_FLOAT32_C( 9.97)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 17.60), SIMDE_FLOAT32_C( 3.68), SIMDE_FLOAT32_C( 9.51), SIMDE_FLOAT32_C( 8.85),
SIMDE_FLOAT32_C( 182.52), SIMDE_FLOAT32_C( 29.00), SIMDE_FLOAT32_C( 1.26), SIMDE_FLOAT32_C( 1.04),
SIMDE_FLOAT32_C( 402.16), SIMDE_FLOAT32_C( 8.11), SIMDE_FLOAT32_C( 292.03), SIMDE_FLOAT32_C( 10.58),
SIMDE_FLOAT32_C( 37.98), SIMDE_FLOAT32_C( 516.39), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 10687.75)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.98), SIMDE_FLOAT32_C( 9.12), SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( 8.52),
SIMDE_FLOAT32_C( 7.93), SIMDE_FLOAT32_C( 5.27), SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( 5.05),
SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 8.67), SIMDE_FLOAT32_C( 6.39), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( 1.61), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( 8.11), SIMDE_FLOAT32_C( 7.49)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.65), SIMDE_FLOAT32_C( 8.30), SIMDE_FLOAT32_C( 4.38), SIMDE_FLOAT32_C( 2.03),
SIMDE_FLOAT32_C( 7.28), SIMDE_FLOAT32_C( 9.07), SIMDE_FLOAT32_C( 9.88), SIMDE_FLOAT32_C( 6.32),
SIMDE_FLOAT32_C( 7.05), SIMDE_FLOAT32_C( 3.52), SIMDE_FLOAT32_C( 6.29), SIMDE_FLOAT32_C( -0.31),
SIMDE_FLOAT32_C( 2.69), SIMDE_FLOAT32_C( 3.44), SIMDE_FLOAT32_C( 7.59), SIMDE_FLOAT32_C( 4.11)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.98), SIMDE_FLOAT32_C( 2011.94), SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( 8.52),
SIMDE_FLOAT32_C( 7.93), SIMDE_FLOAT32_C( 5.27), SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( 277.79),
SIMDE_FLOAT32_C( 576.43), SIMDE_FLOAT32_C( 8.67), SIMDE_FLOAT32_C( 269.58), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( 1.61), SIMDE_FLOAT32_C( 15.61), SIMDE_FLOAT32_C( 8.11), SIMDE_FLOAT32_C( 7.49)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.58), SIMDE_FLOAT32_C( 2.09), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 9.63),
SIMDE_FLOAT32_C( 2.70), SIMDE_FLOAT32_C( 3.50), SIMDE_FLOAT32_C( 7.31), SIMDE_FLOAT32_C( 4.58),
SIMDE_FLOAT32_C( 9.54), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 2.64),
SIMDE_FLOAT32_C( 6.12), SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( 8.41), SIMDE_FLOAT32_C( 0.34)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 9.43), SIMDE_FLOAT32_C( 3.41), SIMDE_FLOAT32_C( 2.52), SIMDE_FLOAT32_C( 4.31),
SIMDE_FLOAT32_C( 4.54), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 4.72), SIMDE_FLOAT32_C( 6.67),
SIMDE_FLOAT32_C( 2.82), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 9.02), SIMDE_FLOAT32_C( -0.99),
SIMDE_FLOAT32_C( 9.58), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 4.13), SIMDE_FLOAT32_C( 9.88)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.58), SIMDE_FLOAT32_C( 2.09), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 9.63),
SIMDE_FLOAT32_C( 46.85), SIMDE_FLOAT32_C( 3.50), SIMDE_FLOAT32_C( 7.31), SIMDE_FLOAT32_C( 4.58),
SIMDE_FLOAT32_C( 9.54), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 4133.39), SIMDE_FLOAT32_C( 1.53),
SIMDE_FLOAT32_C( 7236.21), SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( 31.10), SIMDE_FLOAT32_C( 9767.86)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.41), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 2.77), SIMDE_FLOAT32_C( 5.48),
SIMDE_FLOAT32_C( 6.40), SIMDE_FLOAT32_C( 9.39), SIMDE_FLOAT32_C( 9.54), SIMDE_FLOAT32_C( 5.23),
SIMDE_FLOAT32_C( 7.17), SIMDE_FLOAT32_C( 1.21), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( 5.72), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 6.92)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.23), SIMDE_FLOAT32_C( 9.53), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 5.38),
SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 9.63), SIMDE_FLOAT32_C( 5.11),
SIMDE_FLOAT32_C( 9.05), SIMDE_FLOAT32_C( 4.08), SIMDE_FLOAT32_C( 5.81), SIMDE_FLOAT32_C( 2.42),
SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 1.07), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( 7.12)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.41), SIMDE_FLOAT32_C( 6883.29), SIMDE_FLOAT32_C( 2.77), SIMDE_FLOAT32_C( 108.51),
SIMDE_FLOAT32_C( 6.40), SIMDE_FLOAT32_C( 471.94), SIMDE_FLOAT32_C( 7607.22), SIMDE_FLOAT32_C( 82.84),
SIMDE_FLOAT32_C( 7.17), SIMDE_FLOAT32_C( 1.21), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( 5.72), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( 6.92)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 9.36), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 4.11), SIMDE_FLOAT32_C( 0.19),
SIMDE_FLOAT32_C( 5.01), SIMDE_FLOAT32_C( 5.64), SIMDE_FLOAT32_C( 9.09), SIMDE_FLOAT32_C( 4.14),
SIMDE_FLOAT32_C( 2.93), SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 2.75), SIMDE_FLOAT32_C( 8.81),
SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 7.56), SIMDE_FLOAT32_C( 6.92), SIMDE_FLOAT32_C( 6.89)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( 2.27), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( 8.84), SIMDE_FLOAT32_C( 4.53), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 5.39),
SIMDE_FLOAT32_C( 7.69), SIMDE_FLOAT32_C( 4.44), SIMDE_FLOAT32_C( 2.80), SIMDE_FLOAT32_C( 0.54),
SIMDE_FLOAT32_C( 6.34), SIMDE_FLOAT32_C( 6.01), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( 1.81)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 9.36), SIMDE_FLOAT32_C( 4.89), SIMDE_FLOAT32_C( 1.09), SIMDE_FLOAT32_C( 0.19),
SIMDE_FLOAT32_C( 3452.50), SIMDE_FLOAT32_C( 5.64), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 109.60),
SIMDE_FLOAT32_C( 2.93), SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 2.75), SIMDE_FLOAT32_C( 8.81),
SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 203.74), SIMDE_FLOAT32_C( 6.92), SIMDE_FLOAT32_C( 6.89)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.41), SIMDE_FLOAT32_C( 4.36), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 8.01),
SIMDE_FLOAT32_C( 6.98), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 5.08), SIMDE_FLOAT32_C( 7.75),
SIMDE_FLOAT32_C( 4.67), SIMDE_FLOAT32_C( 7.99), SIMDE_FLOAT32_C( 4.04), SIMDE_FLOAT32_C( 4.94),
SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 7.84), SIMDE_FLOAT32_C( 6.67), SIMDE_FLOAT32_C( -0.39)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.60), SIMDE_FLOAT32_C( 8.95), SIMDE_FLOAT32_C( 5.29), SIMDE_FLOAT32_C( 2.15),
SIMDE_FLOAT32_C( 6.20), SIMDE_FLOAT32_C( 3.53), SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 8.23),
SIMDE_FLOAT32_C( 4.86), SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 5.15), SIMDE_FLOAT32_C( 1.33),
SIMDE_FLOAT32_C( 4.29), SIMDE_FLOAT32_C( 3.12), SIMDE_FLOAT32_C( 4.20), SIMDE_FLOAT32_C( 5.01)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.41), SIMDE_FLOAT32_C( 4.36), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 8.01),
SIMDE_FLOAT32_C( 6.98), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 3.69), SIMDE_FLOAT32_C( 1875.92),
SIMDE_FLOAT32_C( 64.52), SIMDE_FLOAT32_C( 7.99), SIMDE_FLOAT32_C( 86.22), SIMDE_FLOAT32_C( 2.02),
SIMDE_FLOAT32_C( 36.49), SIMDE_FLOAT32_C( 7.84), SIMDE_FLOAT32_C( 6.67), SIMDE_FLOAT32_C( 74.96)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 6.32), SIMDE_FLOAT32_C( 1.79), SIMDE_FLOAT32_C( 9.02),
SIMDE_FLOAT32_C( 9.76), SIMDE_FLOAT32_C( 9.75), SIMDE_FLOAT32_C( 9.04), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 3.35), SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 2.19),
SIMDE_FLOAT32_C( 5.05), SIMDE_FLOAT32_C( 5.07), SIMDE_FLOAT32_C( 6.92), SIMDE_FLOAT32_C( 3.38)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 9.04), SIMDE_FLOAT32_C( 8.86),
SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 2.79), SIMDE_FLOAT32_C( 8.51), SIMDE_FLOAT32_C( 2.57),
SIMDE_FLOAT32_C( 4.83), SIMDE_FLOAT32_C( 5.10), SIMDE_FLOAT32_C( 8.44), SIMDE_FLOAT32_C( 3.12),
SIMDE_FLOAT32_C( 9.69), SIMDE_FLOAT32_C( 6.49), SIMDE_FLOAT32_C( 4.43), SIMDE_FLOAT32_C( 9.19)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 6.32), SIMDE_FLOAT32_C( 4216.89), SIMDE_FLOAT32_C( 3522.24),
SIMDE_FLOAT32_C( 9.76), SIMDE_FLOAT32_C( 9.75), SIMDE_FLOAT32_C( 9.04), SIMDE_FLOAT32_C( 6.57),
SIMDE_FLOAT32_C( 62.61), SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( 2314.28), SIMDE_FLOAT32_C( 2.19),
SIMDE_FLOAT32_C( 8077.62), SIMDE_FLOAT32_C( 5.07), SIMDE_FLOAT32_C( 6.92), SIMDE_FLOAT32_C( 4899.32)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_cosh_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_cosh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.49), SIMDE_FLOAT64_C( 4.01),
SIMDE_FLOAT64_C( 1.11), SIMDE_FLOAT64_C( 2.10),
SIMDE_FLOAT64_C( 1.44), SIMDE_FLOAT64_C( 2.12),
SIMDE_FLOAT64_C( -0.26), SIMDE_FLOAT64_C( 3.04)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 16.41), SIMDE_FLOAT64_C( 27.58),
SIMDE_FLOAT64_C( 1.68), SIMDE_FLOAT64_C( 4.14),
SIMDE_FLOAT64_C( 2.23), SIMDE_FLOAT64_C( 4.23),
SIMDE_FLOAT64_C( 1.03), SIMDE_FLOAT64_C( 10.48)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( -0.06),
SIMDE_FLOAT64_C( 2.25), SIMDE_FLOAT64_C( 3.71),
SIMDE_FLOAT64_C( 4.48), SIMDE_FLOAT64_C( 3.27),
SIMDE_FLOAT64_C( 1.19), SIMDE_FLOAT64_C( 3.40)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 4.80), SIMDE_FLOAT64_C( 20.44),
SIMDE_FLOAT64_C( 44.12), SIMDE_FLOAT64_C( 13.17),
SIMDE_FLOAT64_C( 1.80), SIMDE_FLOAT64_C( 15.00)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 2.09),
SIMDE_FLOAT64_C( 0.85), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( -0.58),
SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( 4.09)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.23), SIMDE_FLOAT64_C( 4.10),
SIMDE_FLOAT64_C( 1.38), SIMDE_FLOAT64_C( 1.31),
SIMDE_FLOAT64_C( 1.64), SIMDE_FLOAT64_C( 1.17),
SIMDE_FLOAT64_C( 1.29), SIMDE_FLOAT64_C( 29.88)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.53), SIMDE_FLOAT64_C( 0.65),
SIMDE_FLOAT64_C( 2.70), SIMDE_FLOAT64_C( 4.06),
SIMDE_FLOAT64_C( 2.78), SIMDE_FLOAT64_C( 1.36),
SIMDE_FLOAT64_C( -0.93), SIMDE_FLOAT64_C( 0.02)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 6.32), SIMDE_FLOAT64_C( 1.22),
SIMDE_FLOAT64_C( 7.47), SIMDE_FLOAT64_C( 29.00),
SIMDE_FLOAT64_C( 8.09), SIMDE_FLOAT64_C( 2.08),
SIMDE_FLOAT64_C( 1.46), SIMDE_FLOAT64_C( 1.00)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( 3.33),
SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 3.14),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( 4.98),
SIMDE_FLOAT64_C( 2.08), SIMDE_FLOAT64_C( 4.52)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.05), SIMDE_FLOAT64_C( 13.99),
SIMDE_FLOAT64_C( 1.03), SIMDE_FLOAT64_C( 11.57),
SIMDE_FLOAT64_C( 1.05), SIMDE_FLOAT64_C( 72.74),
SIMDE_FLOAT64_C( 4.06), SIMDE_FLOAT64_C( 45.92)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( 3.19),
SIMDE_FLOAT64_C( 3.97), SIMDE_FLOAT64_C( 3.02),
SIMDE_FLOAT64_C( 3.60), SIMDE_FLOAT64_C( 1.21),
SIMDE_FLOAT64_C( 4.34), SIMDE_FLOAT64_C( 1.91)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.37), SIMDE_FLOAT64_C( 12.16),
SIMDE_FLOAT64_C( 26.50), SIMDE_FLOAT64_C( 10.27),
SIMDE_FLOAT64_C( 18.31), SIMDE_FLOAT64_C( 1.83),
SIMDE_FLOAT64_C( 38.36), SIMDE_FLOAT64_C( 3.45)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.39), SIMDE_FLOAT64_C( 1.76),
SIMDE_FLOAT64_C( 3.01), SIMDE_FLOAT64_C( -0.83),
SIMDE_FLOAT64_C( -0.24), SIMDE_FLOAT64_C( -0.30),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 3.20)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.13), SIMDE_FLOAT64_C( 2.99),
SIMDE_FLOAT64_C( 10.17), SIMDE_FLOAT64_C( 1.36),
SIMDE_FLOAT64_C( 1.03), SIMDE_FLOAT64_C( 1.05),
SIMDE_FLOAT64_C( 1.06), SIMDE_FLOAT64_C( 12.29)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.41), SIMDE_FLOAT64_C( 4.04),
SIMDE_FLOAT64_C( 1.55), SIMDE_FLOAT64_C( 4.46),
SIMDE_FLOAT64_C( 4.73), SIMDE_FLOAT64_C( 3.80),
SIMDE_FLOAT64_C( 4.37), SIMDE_FLOAT64_C( 2.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 15.15), SIMDE_FLOAT64_C( 28.42),
SIMDE_FLOAT64_C( 2.46), SIMDE_FLOAT64_C( 43.25),
SIMDE_FLOAT64_C( 56.65), SIMDE_FLOAT64_C( 22.36),
SIMDE_FLOAT64_C( 39.53), SIMDE_FLOAT64_C( 7.93)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_cosh_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_cosh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.06), SIMDE_FLOAT64_C( 3.71),
SIMDE_FLOAT64_C( 3.27), SIMDE_FLOAT64_C( 3.40),
SIMDE_FLOAT64_C( 4.01), SIMDE_FLOAT64_C( 2.10),
SIMDE_FLOAT64_C( 2.12), SIMDE_FLOAT64_C( 3.04)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( 2.25),
SIMDE_FLOAT64_C( 4.48), SIMDE_FLOAT64_C( 1.19),
SIMDE_FLOAT64_C( 3.49), SIMDE_FLOAT64_C( 1.11),
SIMDE_FLOAT64_C( 1.44), SIMDE_FLOAT64_C( -0.26)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 3.71),
SIMDE_FLOAT64_C( 3.27), SIMDE_FLOAT64_C( 3.40),
SIMDE_FLOAT64_C( 16.41), SIMDE_FLOAT64_C( 2.10),
SIMDE_FLOAT64_C( 2.23), SIMDE_FLOAT64_C( 1.03)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.53), SIMDE_FLOAT64_C( 2.70),
SIMDE_FLOAT64_C( 2.78), SIMDE_FLOAT64_C( -0.93),
SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.85),
SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( 0.75)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 4.52), SIMDE_FLOAT64_C( 0.65),
SIMDE_FLOAT64_C( 4.06), SIMDE_FLOAT64_C( 1.36),
SIMDE_FLOAT64_C( 0.02), SIMDE_FLOAT64_C( 2.09),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( -0.58)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 45.92), SIMDE_FLOAT64_C( 1.22),
SIMDE_FLOAT64_C( 29.00), SIMDE_FLOAT64_C( -0.93),
SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 4.10),
SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( 1.17)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.20), SIMDE_FLOAT64_C( 3.19),
SIMDE_FLOAT64_C( 3.02), SIMDE_FLOAT64_C( 1.21),
SIMDE_FLOAT64_C( 1.91), SIMDE_FLOAT64_C( 3.33),
SIMDE_FLOAT64_C( 3.14), SIMDE_FLOAT64_C( 4.98)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.84),
SIMDE_FLOAT64_C( 3.97), SIMDE_FLOAT64_C( 3.60),
SIMDE_FLOAT64_C( 4.34), SIMDE_FLOAT64_C( -0.31),
SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( -0.31)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.06), SIMDE_FLOAT64_C( 1.37),
SIMDE_FLOAT64_C( 26.50), SIMDE_FLOAT64_C( 18.31),
SIMDE_FLOAT64_C( 38.36), SIMDE_FLOAT64_C( 1.05),
SIMDE_FLOAT64_C( 3.14), SIMDE_FLOAT64_C( 1.05)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.36), SIMDE_FLOAT64_C( 3.41),
SIMDE_FLOAT64_C( 1.55), SIMDE_FLOAT64_C( 4.73),
SIMDE_FLOAT64_C( 4.37), SIMDE_FLOAT64_C( 1.39),
SIMDE_FLOAT64_C( 3.01), SIMDE_FLOAT64_C( -0.24)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.63), SIMDE_FLOAT64_C( 1.49),
SIMDE_FLOAT64_C( 4.04), SIMDE_FLOAT64_C( 4.46),
SIMDE_FLOAT64_C( 3.80), SIMDE_FLOAT64_C( 2.76),
SIMDE_FLOAT64_C( 1.76), SIMDE_FLOAT64_C( -0.83)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.36), SIMDE_FLOAT64_C( 2.33),
SIMDE_FLOAT64_C( 1.55), SIMDE_FLOAT64_C( 43.25),
SIMDE_FLOAT64_C( 22.36), SIMDE_FLOAT64_C( 7.93),
SIMDE_FLOAT64_C( 3.01), SIMDE_FLOAT64_C( 1.36)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.30), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( 4.28), SIMDE_FLOAT64_C( 3.03),
SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.47), SIMDE_FLOAT64_C( 3.97)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.99), SIMDE_FLOAT64_C( 3.39),
SIMDE_FLOAT64_C( 1.46), SIMDE_FLOAT64_C( 2.97),
SIMDE_FLOAT64_C( -0.62), SIMDE_FLOAT64_C( 1.01),
SIMDE_FLOAT64_C( 1.42), SIMDE_FLOAT64_C( 3.68)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 9.97), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( 4.28), SIMDE_FLOAT64_C( 9.77),
SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.47), SIMDE_FLOAT64_C( 19.84)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 2.08),
SIMDE_FLOAT64_C( 4.07), SIMDE_FLOAT64_C( 1.94),
SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( 3.52),
SIMDE_FLOAT64_C( 4.49), SIMDE_FLOAT64_C( 4.93)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 4.93), SIMDE_FLOAT64_C( 3.27),
SIMDE_FLOAT64_C( 1.71), SIMDE_FLOAT64_C( 4.52),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 4.19),
SIMDE_FLOAT64_C( 3.87), SIMDE_FLOAT64_C( 2.42)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 13.17),
SIMDE_FLOAT64_C( 4.07), SIMDE_FLOAT64_C( 1.94),
SIMDE_FLOAT64_C( 1.03), SIMDE_FLOAT64_C( 3.52),
SIMDE_FLOAT64_C( 23.98), SIMDE_FLOAT64_C( 5.67)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.18), SIMDE_FLOAT64_C( 1.09),
SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( 4.47),
SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( 4.77),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 1.80)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.53), SIMDE_FLOAT64_C( 2.04),
SIMDE_FLOAT64_C( 4.75), SIMDE_FLOAT64_C( -0.31),
SIMDE_FLOAT64_C( 0.28), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 2.88), SIMDE_FLOAT64_C( 0.27)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.18), SIMDE_FLOAT64_C( 3.91),
SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( 1.05),
SIMDE_FLOAT64_C( 1.04), SIMDE_FLOAT64_C( 1.53),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 1.04)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.43), SIMDE_FLOAT64_C( 4.81),
SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( 0.69),
SIMDE_FLOAT64_C( -0.26), SIMDE_FLOAT64_C( 4.80),
SIMDE_FLOAT64_C( 1.02), SIMDE_FLOAT64_C( 1.45)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( 3.32),
SIMDE_FLOAT64_C( 4.69), SIMDE_FLOAT64_C( 1.41),
SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 1.90),
SIMDE_FLOAT64_C( 2.02), SIMDE_FLOAT64_C( 1.62)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.05), SIMDE_FLOAT64_C( 13.85),
SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( 2.17),
SIMDE_FLOAT64_C( -0.26), SIMDE_FLOAT64_C( 3.42),
SIMDE_FLOAT64_C( 1.02), SIMDE_FLOAT64_C( 2.63)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_cosh_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_x_mm_deg2rad_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 954.59), SIMDE_FLOAT32_C( -212.53), SIMDE_FLOAT32_C( -73.32), SIMDE_FLOAT32_C( -280.66) },
{ SIMDE_FLOAT32_C( 16.66), SIMDE_FLOAT32_C( -3.71), SIMDE_FLOAT32_C( -1.28), SIMDE_FLOAT32_C( -4.90) } },
{ { SIMDE_FLOAT32_C( 908.48), SIMDE_FLOAT32_C( 789.59), SIMDE_FLOAT32_C( 675.09), SIMDE_FLOAT32_C( 164.25) },
{ SIMDE_FLOAT32_C( 15.86), SIMDE_FLOAT32_C( 13.78), SIMDE_FLOAT32_C( 11.78), SIMDE_FLOAT32_C( 2.87) } },
{ { SIMDE_FLOAT32_C( 515.80), SIMDE_FLOAT32_C( -965.27), SIMDE_FLOAT32_C( 659.44), SIMDE_FLOAT32_C( -806.83) },
{ SIMDE_FLOAT32_C( 9.00), SIMDE_FLOAT32_C( -16.85), SIMDE_FLOAT32_C( 11.51), SIMDE_FLOAT32_C( -14.08) } },
{ { SIMDE_FLOAT32_C( -402.30), SIMDE_FLOAT32_C( 576.73), SIMDE_FLOAT32_C( -978.47), SIMDE_FLOAT32_C( 782.95) },
{ SIMDE_FLOAT32_C( -7.02), SIMDE_FLOAT32_C( 10.07), SIMDE_FLOAT32_C( -17.08), SIMDE_FLOAT32_C( 13.67) } },
{ { SIMDE_FLOAT32_C( -948.47), SIMDE_FLOAT32_C( 987.01), SIMDE_FLOAT32_C( 630.41), SIMDE_FLOAT32_C( -637.23) },
{ SIMDE_FLOAT32_C( -16.55), SIMDE_FLOAT32_C( 17.23), SIMDE_FLOAT32_C( 11.00), SIMDE_FLOAT32_C( -11.12) } },
{ { SIMDE_FLOAT32_C( 66.92), SIMDE_FLOAT32_C( 674.00), SIMDE_FLOAT32_C( -52.88), SIMDE_FLOAT32_C( -732.15) },
{ SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 11.76), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -12.78) } },
{ { SIMDE_FLOAT32_C( 750.47), SIMDE_FLOAT32_C( -906.63), SIMDE_FLOAT32_C( 205.33), SIMDE_FLOAT32_C( -941.95) },
{ SIMDE_FLOAT32_C( 13.10), SIMDE_FLOAT32_C( -15.82), SIMDE_FLOAT32_C( 3.58), SIMDE_FLOAT32_C( -16.44) } },
{ { SIMDE_FLOAT32_C( 705.35), SIMDE_FLOAT32_C( 774.66), SIMDE_FLOAT32_C( -289.06), SIMDE_FLOAT32_C( -214.64) },
{ SIMDE_FLOAT32_C( 12.31), SIMDE_FLOAT32_C( 13.52), SIMDE_FLOAT32_C( -5.05), SIMDE_FLOAT32_C( -3.75) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_x_mm_deg2rad_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_x_mm_deg2rad_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -666.18), SIMDE_FLOAT64_C( -415.97) },
{ SIMDE_FLOAT64_C( -11.63), SIMDE_FLOAT64_C( -7.26) } },
{ { SIMDE_FLOAT64_C( 793.43), SIMDE_FLOAT64_C( -853.65) },
{ SIMDE_FLOAT64_C( 13.85), SIMDE_FLOAT64_C( -14.90) } },
{ { SIMDE_FLOAT64_C( 738.56), SIMDE_FLOAT64_C( 967.23) },
{ SIMDE_FLOAT64_C( 12.89), SIMDE_FLOAT64_C( 16.88) } },
{ { SIMDE_FLOAT64_C( 309.17), SIMDE_FLOAT64_C( 265.53) },
{ SIMDE_FLOAT64_C( 5.40), SIMDE_FLOAT64_C( 4.63) } },
{ { SIMDE_FLOAT64_C( 844.47), SIMDE_FLOAT64_C( 938.60) },
{ SIMDE_FLOAT64_C( 14.74), SIMDE_FLOAT64_C( 16.38) } },
{ { SIMDE_FLOAT64_C( -902.86), SIMDE_FLOAT64_C( -334.71) },
{ SIMDE_FLOAT64_C( -15.76), SIMDE_FLOAT64_C( -5.84) } },
{ { SIMDE_FLOAT64_C( 582.46), SIMDE_FLOAT64_C( -651.74) },
{ SIMDE_FLOAT64_C( 10.17), SIMDE_FLOAT64_C( -11.38) } },
{ { SIMDE_FLOAT64_C( 196.36), SIMDE_FLOAT64_C( 200.15) },
{ SIMDE_FLOAT64_C( 3.43), SIMDE_FLOAT64_C( 3.49) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_x_mm_deg2rad_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_x_mm256_deg2rad_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 442.73), SIMDE_FLOAT32_C( -968.72), SIMDE_FLOAT32_C( 679.13), SIMDE_FLOAT32_C( 114.21),
SIMDE_FLOAT32_C( -467.66), SIMDE_FLOAT32_C( -37.81), SIMDE_FLOAT32_C( 579.12), SIMDE_FLOAT32_C( -687.98) },
{ SIMDE_FLOAT32_C( 7.73), SIMDE_FLOAT32_C( -16.91), SIMDE_FLOAT32_C( 11.85), SIMDE_FLOAT32_C( 1.99),
SIMDE_FLOAT32_C( -8.16), SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( 10.11), SIMDE_FLOAT32_C( -12.01) } },
{ { SIMDE_FLOAT32_C( -896.03), SIMDE_FLOAT32_C( 496.82), SIMDE_FLOAT32_C( 46.75), SIMDE_FLOAT32_C( -189.63),
SIMDE_FLOAT32_C( 888.19), SIMDE_FLOAT32_C( -178.85), SIMDE_FLOAT32_C( 106.49), SIMDE_FLOAT32_C( -266.59) },
{ SIMDE_FLOAT32_C( -15.64), SIMDE_FLOAT32_C( 8.67), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -3.31),
SIMDE_FLOAT32_C( 15.50), SIMDE_FLOAT32_C( -3.12), SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( -4.65) } },
{ { SIMDE_FLOAT32_C( -577.36), SIMDE_FLOAT32_C( 319.48), SIMDE_FLOAT32_C( -568.91), SIMDE_FLOAT32_C( 369.60),
SIMDE_FLOAT32_C( -195.78), SIMDE_FLOAT32_C( -445.13), SIMDE_FLOAT32_C( 676.76), SIMDE_FLOAT32_C( 270.74) },
{ SIMDE_FLOAT32_C( -10.08), SIMDE_FLOAT32_C( 5.58), SIMDE_FLOAT32_C( -9.93), SIMDE_FLOAT32_C( 6.45),
SIMDE_FLOAT32_C( -3.42), SIMDE_FLOAT32_C( -7.77), SIMDE_FLOAT32_C( 11.81), SIMDE_FLOAT32_C( 4.73) } },
{ { SIMDE_FLOAT32_C( 386.69), SIMDE_FLOAT32_C( -818.31), SIMDE_FLOAT32_C( 697.61), SIMDE_FLOAT32_C( 731.13),
SIMDE_FLOAT32_C( 89.36), SIMDE_FLOAT32_C( -163.03), SIMDE_FLOAT32_C( 9.17), SIMDE_FLOAT32_C( 76.19) },
{ SIMDE_FLOAT32_C( 6.75), SIMDE_FLOAT32_C( -14.28), SIMDE_FLOAT32_C( 12.18), SIMDE_FLOAT32_C( 12.76),
SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( -2.85), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 1.33) } },
{ { SIMDE_FLOAT32_C( 522.23), SIMDE_FLOAT32_C( -876.19), SIMDE_FLOAT32_C( -206.90), SIMDE_FLOAT32_C( 647.79),
SIMDE_FLOAT32_C( -633.72), SIMDE_FLOAT32_C( -908.37), SIMDE_FLOAT32_C( 944.64), SIMDE_FLOAT32_C( 520.31) },
{ SIMDE_FLOAT32_C( 9.11), SIMDE_FLOAT32_C( -15.29), SIMDE_FLOAT32_C( -3.61), SIMDE_FLOAT32_C( 11.31),
SIMDE_FLOAT32_C( -11.06), SIMDE_FLOAT32_C( -15.85), SIMDE_FLOAT32_C( 16.49), SIMDE_FLOAT32_C( 9.08) } },
{ { SIMDE_FLOAT32_C( 907.89), SIMDE_FLOAT32_C( 849.63), SIMDE_FLOAT32_C( -208.12), SIMDE_FLOAT32_C( 68.74),
SIMDE_FLOAT32_C( -670.75), SIMDE_FLOAT32_C( 677.18), SIMDE_FLOAT32_C( -644.75), SIMDE_FLOAT32_C( -292.10) },
{ SIMDE_FLOAT32_C( 15.85), SIMDE_FLOAT32_C( 14.83), SIMDE_FLOAT32_C( -3.63), SIMDE_FLOAT32_C( 1.20),
SIMDE_FLOAT32_C( -11.71), SIMDE_FLOAT32_C( 11.82), SIMDE_FLOAT32_C( -11.25), SIMDE_FLOAT32_C( -5.10) } },
{ { SIMDE_FLOAT32_C( 675.40), SIMDE_FLOAT32_C( -616.47), SIMDE_FLOAT32_C( 962.11), SIMDE_FLOAT32_C( 134.41),
SIMDE_FLOAT32_C( -905.98), SIMDE_FLOAT32_C( -860.48), SIMDE_FLOAT32_C( -24.28), SIMDE_FLOAT32_C( -121.44) },
{ SIMDE_FLOAT32_C( 11.79), SIMDE_FLOAT32_C( -10.76), SIMDE_FLOAT32_C( 16.79), SIMDE_FLOAT32_C( 2.35),
SIMDE_FLOAT32_C( -15.81), SIMDE_FLOAT32_C( -15.02), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( -2.12) } },
{ { SIMDE_FLOAT32_C( -960.63), SIMDE_FLOAT32_C( 687.26), SIMDE_FLOAT32_C( 788.74), SIMDE_FLOAT32_C( 386.45),
SIMDE_FLOAT32_C( -901.72), SIMDE_FLOAT32_C( 856.65), SIMDE_FLOAT32_C( -345.73), SIMDE_FLOAT32_C( -616.97) },
{ SIMDE_FLOAT32_C( -16.77), SIMDE_FLOAT32_C( 11.99), SIMDE_FLOAT32_C( 13.77), SIMDE_FLOAT32_C( 6.74),
SIMDE_FLOAT32_C( -15.74), SIMDE_FLOAT32_C( 14.95), SIMDE_FLOAT32_C( -6.03), SIMDE_FLOAT32_C( -10.77) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_x_mm256_deg2rad_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_x_mm256_deg2rad_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -467.83), SIMDE_FLOAT64_C( -838.03), SIMDE_FLOAT64_C( -852.25), SIMDE_FLOAT64_C( 261.37) },
{ SIMDE_FLOAT64_C( -8.17), SIMDE_FLOAT64_C( -14.63), SIMDE_FLOAT64_C( -14.87), SIMDE_FLOAT64_C( 4.56) } },
{ { SIMDE_FLOAT64_C( 838.67), SIMDE_FLOAT64_C( -424.12), SIMDE_FLOAT64_C( -236.36), SIMDE_FLOAT64_C( -471.04) },
{ SIMDE_FLOAT64_C( 14.64), SIMDE_FLOAT64_C( -7.40), SIMDE_FLOAT64_C( -4.13), SIMDE_FLOAT64_C( -8.22) } },
{ { SIMDE_FLOAT64_C( -834.32), SIMDE_FLOAT64_C( -357.08), SIMDE_FLOAT64_C( 596.48), SIMDE_FLOAT64_C( 991.10) },
{ SIMDE_FLOAT64_C( -14.56), SIMDE_FLOAT64_C( -6.23), SIMDE_FLOAT64_C( 10.41), SIMDE_FLOAT64_C( 17.30) } },
{ { SIMDE_FLOAT64_C( -638.79), SIMDE_FLOAT64_C( -95.57), SIMDE_FLOAT64_C( -262.62), SIMDE_FLOAT64_C( 117.35) },
{ SIMDE_FLOAT64_C( -11.15), SIMDE_FLOAT64_C( -1.67), SIMDE_FLOAT64_C( -4.58), SIMDE_FLOAT64_C( 2.05) } },
{ { SIMDE_FLOAT64_C( 253.25), SIMDE_FLOAT64_C( 332.14), SIMDE_FLOAT64_C( 311.92), SIMDE_FLOAT64_C( 451.40) },
{ SIMDE_FLOAT64_C( 4.42), SIMDE_FLOAT64_C( 5.80), SIMDE_FLOAT64_C( 5.44), SIMDE_FLOAT64_C( 7.88) } },
{ { SIMDE_FLOAT64_C( 635.16), SIMDE_FLOAT64_C( -795.05), SIMDE_FLOAT64_C( -458.24), SIMDE_FLOAT64_C( 422.17) },
{ SIMDE_FLOAT64_C( 11.09), SIMDE_FLOAT64_C( -13.88), SIMDE_FLOAT64_C( -8.00), SIMDE_FLOAT64_C( 7.37) } },
{ { SIMDE_FLOAT64_C( -505.84), SIMDE_FLOAT64_C( 400.55), SIMDE_FLOAT64_C( 54.12), SIMDE_FLOAT64_C( -409.93) },
{ SIMDE_FLOAT64_C( -8.83), SIMDE_FLOAT64_C( 6.99), SIMDE_FLOAT64_C( 0.94), SIMDE_FLOAT64_C( -7.15) } },
{ { SIMDE_FLOAT64_C( 241.03), SIMDE_FLOAT64_C( -950.08), SIMDE_FLOAT64_C( 5.55), SIMDE_FLOAT64_C( -683.44) },
{ SIMDE_FLOAT64_C( 4.21), SIMDE_FLOAT64_C( -16.58), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( -11.93) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_x_mm256_deg2rad_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_x_mm512_deg2rad_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -204.97), SIMDE_FLOAT32_C( -943.14), SIMDE_FLOAT32_C( 662.36), SIMDE_FLOAT32_C( 286.89),
SIMDE_FLOAT32_C( -272.57), SIMDE_FLOAT32_C( 978.11), SIMDE_FLOAT32_C( -911.94), SIMDE_FLOAT32_C( -924.18),
SIMDE_FLOAT32_C( -626.92), SIMDE_FLOAT32_C( -721.73), SIMDE_FLOAT32_C( -41.73), SIMDE_FLOAT32_C( 615.09),
SIMDE_FLOAT32_C( -253.85), SIMDE_FLOAT32_C( -484.20), SIMDE_FLOAT32_C( 130.81), SIMDE_FLOAT32_C( 548.86) },
{ SIMDE_FLOAT32_C( -3.58), SIMDE_FLOAT32_C( -16.46), SIMDE_FLOAT32_C( 11.56), SIMDE_FLOAT32_C( 5.01),
SIMDE_FLOAT32_C( -4.76), SIMDE_FLOAT32_C( 17.07), SIMDE_FLOAT32_C( -15.92), SIMDE_FLOAT32_C( -16.13),
SIMDE_FLOAT32_C( -10.94), SIMDE_FLOAT32_C( -12.60), SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( 10.74),
SIMDE_FLOAT32_C( -4.43), SIMDE_FLOAT32_C( -8.45), SIMDE_FLOAT32_C( 2.28), SIMDE_FLOAT32_C( 9.58) } },
{ { SIMDE_FLOAT32_C( 759.71), SIMDE_FLOAT32_C( 445.37), SIMDE_FLOAT32_C( -639.90), SIMDE_FLOAT32_C( -816.54),
SIMDE_FLOAT32_C( 349.70), SIMDE_FLOAT32_C( -526.35), SIMDE_FLOAT32_C( -291.02), SIMDE_FLOAT32_C( 855.10),
SIMDE_FLOAT32_C( -382.23), SIMDE_FLOAT32_C( -58.28), SIMDE_FLOAT32_C( 435.56), SIMDE_FLOAT32_C( 388.92),
SIMDE_FLOAT32_C( 616.34), SIMDE_FLOAT32_C( 879.74), SIMDE_FLOAT32_C( -205.65), SIMDE_FLOAT32_C( -284.03) },
{ SIMDE_FLOAT32_C( 13.26), SIMDE_FLOAT32_C( 7.77), SIMDE_FLOAT32_C( -11.17), SIMDE_FLOAT32_C( -14.25),
SIMDE_FLOAT32_C( 6.10), SIMDE_FLOAT32_C( -9.19), SIMDE_FLOAT32_C( -5.08), SIMDE_FLOAT32_C( 14.92),
SIMDE_FLOAT32_C( -6.67), SIMDE_FLOAT32_C( -1.02), SIMDE_FLOAT32_C( 7.60), SIMDE_FLOAT32_C( 6.79),
SIMDE_FLOAT32_C( 10.76), SIMDE_FLOAT32_C( 15.35), SIMDE_FLOAT32_C( -3.59), SIMDE_FLOAT32_C( -4.96) } },
{ { SIMDE_FLOAT32_C( 252.00), SIMDE_FLOAT32_C( -672.50), SIMDE_FLOAT32_C( -750.03), SIMDE_FLOAT32_C( 219.53),
SIMDE_FLOAT32_C( -348.40), SIMDE_FLOAT32_C( 510.16), SIMDE_FLOAT32_C( 308.72), SIMDE_FLOAT32_C( 669.84),
SIMDE_FLOAT32_C( 1.09), SIMDE_FLOAT32_C( 327.67), SIMDE_FLOAT32_C( -780.79), SIMDE_FLOAT32_C( -790.56),
SIMDE_FLOAT32_C( 999.19), SIMDE_FLOAT32_C( -674.94), SIMDE_FLOAT32_C( 338.16), SIMDE_FLOAT32_C( -623.42) },
{ SIMDE_FLOAT32_C( 4.40), SIMDE_FLOAT32_C( -11.74), SIMDE_FLOAT32_C( -13.09), SIMDE_FLOAT32_C( 3.83),
SIMDE_FLOAT32_C( -6.08), SIMDE_FLOAT32_C( 8.90), SIMDE_FLOAT32_C( 5.39), SIMDE_FLOAT32_C( 11.69),
SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 5.72), SIMDE_FLOAT32_C( -13.63), SIMDE_FLOAT32_C( -13.80),
SIMDE_FLOAT32_C( 17.44), SIMDE_FLOAT32_C( -11.78), SIMDE_FLOAT32_C( 5.90), SIMDE_FLOAT32_C( -10.88) } },
{ { SIMDE_FLOAT32_C( 210.99), SIMDE_FLOAT32_C( 133.74), SIMDE_FLOAT32_C( -196.68), SIMDE_FLOAT32_C( 412.53),
SIMDE_FLOAT32_C( -531.14), SIMDE_FLOAT32_C( -816.95), SIMDE_FLOAT32_C( -550.15), SIMDE_FLOAT32_C( -344.98),
SIMDE_FLOAT32_C( -32.75), SIMDE_FLOAT32_C( -439.61), SIMDE_FLOAT32_C( -503.00), SIMDE_FLOAT32_C( 19.70),
SIMDE_FLOAT32_C( -850.81), SIMDE_FLOAT32_C( 392.70), SIMDE_FLOAT32_C( 36.21), SIMDE_FLOAT32_C( 667.59) },
{ SIMDE_FLOAT32_C( 3.68), SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( -3.43), SIMDE_FLOAT32_C( 7.20),
SIMDE_FLOAT32_C( -9.27), SIMDE_FLOAT32_C( -14.26), SIMDE_FLOAT32_C( -9.60), SIMDE_FLOAT32_C( -6.02),
SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -7.67), SIMDE_FLOAT32_C( -8.78), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( -14.85), SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 11.65) } },
{ { SIMDE_FLOAT32_C( 226.81), SIMDE_FLOAT32_C( -68.31), SIMDE_FLOAT32_C( -92.58), SIMDE_FLOAT32_C( 1.70),
SIMDE_FLOAT32_C( 617.13), SIMDE_FLOAT32_C( 53.88), SIMDE_FLOAT32_C( -383.79), SIMDE_FLOAT32_C( -333.97),
SIMDE_FLOAT32_C( 936.36), SIMDE_FLOAT32_C( -516.23), SIMDE_FLOAT32_C( -313.77), SIMDE_FLOAT32_C( 516.09),
SIMDE_FLOAT32_C( -12.76), SIMDE_FLOAT32_C( -491.30), SIMDE_FLOAT32_C( 729.84), SIMDE_FLOAT32_C( 483.88) },
{ SIMDE_FLOAT32_C( 3.96), SIMDE_FLOAT32_C( -1.19), SIMDE_FLOAT32_C( -1.62), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 10.77), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( -6.70), SIMDE_FLOAT32_C( -5.83),
SIMDE_FLOAT32_C( 16.34), SIMDE_FLOAT32_C( -9.01), SIMDE_FLOAT32_C( -5.48), SIMDE_FLOAT32_C( 9.01),
SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( -8.57), SIMDE_FLOAT32_C( 12.74), SIMDE_FLOAT32_C( 8.45) } },
{ { SIMDE_FLOAT32_C( 619.03), SIMDE_FLOAT32_C( -43.28), SIMDE_FLOAT32_C( 522.00), SIMDE_FLOAT32_C( -713.37),
SIMDE_FLOAT32_C( 394.03), SIMDE_FLOAT32_C( 425.58), SIMDE_FLOAT32_C( 710.40), SIMDE_FLOAT32_C( -291.67),
SIMDE_FLOAT32_C( -116.91), SIMDE_FLOAT32_C( -890.48), SIMDE_FLOAT32_C( -316.42), SIMDE_FLOAT32_C( -26.59),
SIMDE_FLOAT32_C( -918.69), SIMDE_FLOAT32_C( -397.83), SIMDE_FLOAT32_C( -284.98), SIMDE_FLOAT32_C( 339.56) },
{ SIMDE_FLOAT32_C( 10.80), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 9.11), SIMDE_FLOAT32_C( -12.45),
SIMDE_FLOAT32_C( 6.88), SIMDE_FLOAT32_C( 7.43), SIMDE_FLOAT32_C( 12.40), SIMDE_FLOAT32_C( -5.09),
SIMDE_FLOAT32_C( -2.04), SIMDE_FLOAT32_C( -15.54), SIMDE_FLOAT32_C( -5.52), SIMDE_FLOAT32_C( -0.46),
SIMDE_FLOAT32_C( -16.03), SIMDE_FLOAT32_C( -6.94), SIMDE_FLOAT32_C( -4.97), SIMDE_FLOAT32_C( 5.93) } },
{ { SIMDE_FLOAT32_C( -935.68), SIMDE_FLOAT32_C( 109.78), SIMDE_FLOAT32_C( -972.99), SIMDE_FLOAT32_C( 894.31),
SIMDE_FLOAT32_C( 633.79), SIMDE_FLOAT32_C( 41.84), SIMDE_FLOAT32_C( -852.93), SIMDE_FLOAT32_C( 776.08),
SIMDE_FLOAT32_C( -443.88), SIMDE_FLOAT32_C( -301.71), SIMDE_FLOAT32_C( -808.76), SIMDE_FLOAT32_C( -785.15),
SIMDE_FLOAT32_C( -67.76), SIMDE_FLOAT32_C( -895.91), SIMDE_FLOAT32_C( 478.10), SIMDE_FLOAT32_C( -636.03) },
{ SIMDE_FLOAT32_C( -16.33), SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( -16.98), SIMDE_FLOAT32_C( 15.61),
SIMDE_FLOAT32_C( 11.06), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -14.89), SIMDE_FLOAT32_C( 13.55),
SIMDE_FLOAT32_C( -7.75), SIMDE_FLOAT32_C( -5.27), SIMDE_FLOAT32_C( -14.12), SIMDE_FLOAT32_C( -13.70),
SIMDE_FLOAT32_C( -1.18), SIMDE_FLOAT32_C( -15.64), SIMDE_FLOAT32_C( 8.34), SIMDE_FLOAT32_C( -11.10) } },
{ { SIMDE_FLOAT32_C( 320.10), SIMDE_FLOAT32_C( 3.69), SIMDE_FLOAT32_C( -21.63), SIMDE_FLOAT32_C( 500.34),
SIMDE_FLOAT32_C( -733.82), SIMDE_FLOAT32_C( 741.17), SIMDE_FLOAT32_C( 921.80), SIMDE_FLOAT32_C( 676.47),
SIMDE_FLOAT32_C( -545.48), SIMDE_FLOAT32_C( 136.48), SIMDE_FLOAT32_C( -243.90), SIMDE_FLOAT32_C( 744.83),
SIMDE_FLOAT32_C( 297.50), SIMDE_FLOAT32_C( 109.44), SIMDE_FLOAT32_C( -667.13), SIMDE_FLOAT32_C( -475.76) },
{ SIMDE_FLOAT32_C( 5.59), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( 8.73),
SIMDE_FLOAT32_C( -12.81), SIMDE_FLOAT32_C( 12.94), SIMDE_FLOAT32_C( 16.09), SIMDE_FLOAT32_C( 11.81),
SIMDE_FLOAT32_C( -9.52), SIMDE_FLOAT32_C( 2.38), SIMDE_FLOAT32_C( -4.26), SIMDE_FLOAT32_C( 13.00),
SIMDE_FLOAT32_C( 5.19), SIMDE_FLOAT32_C( 1.91), SIMDE_FLOAT32_C( -11.64), SIMDE_FLOAT32_C( -8.30) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_x_mm512_deg2rad_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_x_mm512_deg2rad_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 984.73), SIMDE_FLOAT64_C( 383.87), SIMDE_FLOAT64_C( -286.43), SIMDE_FLOAT64_C( 18.78),
SIMDE_FLOAT64_C( -399.99), SIMDE_FLOAT64_C( -675.58), SIMDE_FLOAT64_C( -438.55), SIMDE_FLOAT64_C( -737.71) },
{ SIMDE_FLOAT64_C( 17.19), SIMDE_FLOAT64_C( 6.70), SIMDE_FLOAT64_C( -5.00), SIMDE_FLOAT64_C( 0.33),
SIMDE_FLOAT64_C( -6.98), SIMDE_FLOAT64_C( -11.79), SIMDE_FLOAT64_C( -7.65), SIMDE_FLOAT64_C( -12.88) } },
{ { SIMDE_FLOAT64_C( -671.93), SIMDE_FLOAT64_C( 826.99), SIMDE_FLOAT64_C( -830.65), SIMDE_FLOAT64_C( -694.10),
SIMDE_FLOAT64_C( 255.50), SIMDE_FLOAT64_C( 118.40), SIMDE_FLOAT64_C( -39.28), SIMDE_FLOAT64_C( -160.67) },
{ SIMDE_FLOAT64_C( -11.73), SIMDE_FLOAT64_C( 14.43), SIMDE_FLOAT64_C( -14.50), SIMDE_FLOAT64_C( -12.11),
SIMDE_FLOAT64_C( 4.46), SIMDE_FLOAT64_C( 2.07), SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( -2.80) } },
{ { SIMDE_FLOAT64_C( -422.40), SIMDE_FLOAT64_C( 720.88), SIMDE_FLOAT64_C( -179.50), SIMDE_FLOAT64_C( -877.62),
SIMDE_FLOAT64_C( -132.27), SIMDE_FLOAT64_C( 998.68), SIMDE_FLOAT64_C( 784.22), SIMDE_FLOAT64_C( 465.33) },
{ SIMDE_FLOAT64_C( -7.37), SIMDE_FLOAT64_C( 12.58), SIMDE_FLOAT64_C( -3.13), SIMDE_FLOAT64_C( -15.32),
SIMDE_FLOAT64_C( -2.31), SIMDE_FLOAT64_C( 17.43), SIMDE_FLOAT64_C( 13.69), SIMDE_FLOAT64_C( 8.12) } },
{ { SIMDE_FLOAT64_C( 844.52), SIMDE_FLOAT64_C( -91.48), SIMDE_FLOAT64_C( 575.23), SIMDE_FLOAT64_C( -167.13),
SIMDE_FLOAT64_C( -906.69), SIMDE_FLOAT64_C( -808.01), SIMDE_FLOAT64_C( -191.68), SIMDE_FLOAT64_C( 439.44) },
{ SIMDE_FLOAT64_C( 14.74), SIMDE_FLOAT64_C( -1.60), SIMDE_FLOAT64_C( 10.04), SIMDE_FLOAT64_C( -2.92),
SIMDE_FLOAT64_C( -15.82), SIMDE_FLOAT64_C( -14.10), SIMDE_FLOAT64_C( -3.35), SIMDE_FLOAT64_C( 7.67) } },
{ { SIMDE_FLOAT64_C( -327.12), SIMDE_FLOAT64_C( 74.58), SIMDE_FLOAT64_C( -612.17), SIMDE_FLOAT64_C( -701.50),
SIMDE_FLOAT64_C( -128.00), SIMDE_FLOAT64_C( 625.20), SIMDE_FLOAT64_C( -218.65), SIMDE_FLOAT64_C( -917.42) },
{ SIMDE_FLOAT64_C( -5.71), SIMDE_FLOAT64_C( 1.30), SIMDE_FLOAT64_C( -10.68), SIMDE_FLOAT64_C( -12.24),
SIMDE_FLOAT64_C( -2.23), SIMDE_FLOAT64_C( 10.91), SIMDE_FLOAT64_C( -3.82), SIMDE_FLOAT64_C( -16.01) } },
{ { SIMDE_FLOAT64_C( -997.92), SIMDE_FLOAT64_C( -38.58), SIMDE_FLOAT64_C( -337.38), SIMDE_FLOAT64_C( -285.85),
SIMDE_FLOAT64_C( -318.88), SIMDE_FLOAT64_C( 574.80), SIMDE_FLOAT64_C( 587.94), SIMDE_FLOAT64_C( -489.48) },
{ SIMDE_FLOAT64_C( -17.42), SIMDE_FLOAT64_C( -0.67), SIMDE_FLOAT64_C( -5.89), SIMDE_FLOAT64_C( -4.99),
SIMDE_FLOAT64_C( -5.57), SIMDE_FLOAT64_C( 10.03), SIMDE_FLOAT64_C( 10.26), SIMDE_FLOAT64_C( -8.54) } },
{ { SIMDE_FLOAT64_C( -699.61), SIMDE_FLOAT64_C( -288.00), SIMDE_FLOAT64_C( -454.37), SIMDE_FLOAT64_C( -597.58),
SIMDE_FLOAT64_C( 496.99), SIMDE_FLOAT64_C( 888.51), SIMDE_FLOAT64_C( -818.76), SIMDE_FLOAT64_C( -819.32) },
{ SIMDE_FLOAT64_C( -12.21), SIMDE_FLOAT64_C( -5.03), SIMDE_FLOAT64_C( -7.93), SIMDE_FLOAT64_C( -10.43),
SIMDE_FLOAT64_C( 8.67), SIMDE_FLOAT64_C( 15.51), SIMDE_FLOAT64_C( -14.29), SIMDE_FLOAT64_C( -14.30) } },
{ { SIMDE_FLOAT64_C( -315.95), SIMDE_FLOAT64_C( -109.61), SIMDE_FLOAT64_C( -186.03), SIMDE_FLOAT64_C( -677.21),
SIMDE_FLOAT64_C( 98.17), SIMDE_FLOAT64_C( -43.95), SIMDE_FLOAT64_C( -639.89), SIMDE_FLOAT64_C( -591.44) },
{ SIMDE_FLOAT64_C( -5.51), SIMDE_FLOAT64_C( -1.91), SIMDE_FLOAT64_C( -3.25), SIMDE_FLOAT64_C( -11.82),
SIMDE_FLOAT64_C( 1.71), SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( -11.17), SIMDE_FLOAT64_C( -10.32) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_x_mm512_deg2rad_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_div_epi8(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_mm_set_epi8(INT8_C( 80), INT8_C( 26), INT8_C( -96), INT8_C( 63),
INT8_C( 84), INT8_C( 0), INT8_C( 86), INT8_C( -92),
INT8_C( 19), INT8_C( 73), INT8_C( 49), INT8_C( 84),
INT8_C( 93), INT8_C( -26), INT8_C( 48), INT8_C( -85)),
simde_mm_set_epi8(INT8_C( 4), INT8_C( 4), INT8_C( 3), INT8_C( 27),
INT8_C( 44), INT8_C( 48), INT8_C( 3), INT8_C( 53),
INT8_C( 11), INT8_C( 6), INT8_C( 2), INT8_C( 14),
INT8_C( 89), INT8_C( 10), INT8_C( 3), INT8_C( 1)),
simde_mm_set_epi8(INT8_C( 20), INT8_C( 6), INT8_C( -32), INT8_C( 2),
INT8_C( 1), INT8_C( 0), INT8_C( 28), INT8_C( -1),
INT8_C( 1), INT8_C( 12), INT8_C( 24), INT8_C( 6),
INT8_C( 1), INT8_C( -2), INT8_C( 16), INT8_C( -85)) },
{ simde_mm_set_epi8(INT8_C( -53), INT8_C(-123), INT8_C( 83), INT8_C( 82),
INT8_C( -17), INT8_C( 32), INT8_C( -32), INT8_C( 68),
INT8_C( -20), INT8_C( 5), INT8_C( -1), INT8_C( -23),
INT8_C( 118), INT8_C(-101), INT8_C( 53), INT8_C( 4)),
simde_mm_set_epi8(INT8_C( 9), INT8_C( 1), INT8_C( -68), INT8_C( 1),
INT8_C( 1), INT8_C( 1), INT8_C( 22), INT8_C( 17),
INT8_C( 4), INT8_C( 8), INT8_C( 6), INT8_C( 10),
INT8_C( 55), INT8_C( 3), INT8_C( 14), INT8_C( 14)),
simde_mm_set_epi8(INT8_C( -5), INT8_C(-123), INT8_C( -1), INT8_C( 82),
INT8_C( -17), INT8_C( 32), INT8_C( -1), INT8_C( 4),
INT8_C( -5), INT8_C( 0), INT8_C( 0), INT8_C( -2),
INT8_C( 2), INT8_C( -33), INT8_C( 3), INT8_C( 0)) },
{ simde_mm_set_epi8(INT8_C( 122), INT8_C( 103), INT8_C( 28), INT8_C(-102),
INT8_C( -41), INT8_C(-105), INT8_C( -14), INT8_C(-120),
INT8_C( -71), INT8_C( 84), INT8_C( 90), INT8_C( 8),
INT8_C( 84), INT8_C( 120), INT8_C( -59), INT8_C( 9)),
simde_mm_set_epi8(INT8_C( 59), INT8_C( -21), INT8_C( 22), INT8_C( 53),
INT8_C( 22), INT8_C( 3), INT8_C( 5), INT8_C( 6),
INT8_C( 2), INT8_C( 21), INT8_C( 3), INT8_C( 3),
INT8_C( 2), INT8_C( 10), INT8_C( 10), INT8_C( 3)),
simde_mm_set_epi8(INT8_C( 2), INT8_C( -4), INT8_C( 1), INT8_C( -1),
INT8_C( -1), INT8_C( -35), INT8_C( -2), INT8_C( -20),
INT8_C( -35), INT8_C( 4), INT8_C( 30), INT8_C( 2),
INT8_C( 42), INT8_C( 12), INT8_C( -5), INT8_C( 3)) },
{ simde_mm_set_epi8(INT8_C( 121), INT8_C( -15), INT8_C(-123), INT8_C( 80),
INT8_C( 43), INT8_C( 58), INT8_C( 119), INT8_C( -49),
INT8_C( 107), INT8_C( -94), INT8_C( 51), INT8_C(-118),
INT8_C( 68), INT8_C( 112), INT8_C( -56), INT8_C(-103)),
simde_mm_set_epi8(INT8_C( 44), INT8_C( 13), INT8_C( 14), INT8_C( 8),
INT8_C( -24), INT8_C( 77), INT8_C( 118), INT8_C( 21),
INT8_C( 1), INT8_C( -34), INT8_C( 2), INT8_C( 29),
INT8_C( 14), INT8_C( 53), INT8_C( 1), INT8_C( 54)),
simde_mm_set_epi8(INT8_C( 2), INT8_C( -1), INT8_C( -8), INT8_C( 10),
INT8_C( -1), INT8_C( 0), INT8_C( 1), INT8_C( -2),
INT8_C( 107), INT8_C( 2), INT8_C( 25), INT8_C( -4),
INT8_C( 4), INT8_C( 2), INT8_C( -56), INT8_C( -1)) },
{ simde_mm_set_epi8(INT8_C( -42), INT8_C( 14), INT8_C(-113), INT8_C( 62),
INT8_C( -34), INT8_C( -16), INT8_C(-103), INT8_C(-122),
INT8_C(-128), INT8_C( -77), INT8_C( -15), INT8_C( -38),
INT8_C( 87), INT8_C( -72), INT8_C( 57), INT8_C( -40)),
simde_mm_set_epi8(INT8_C( 30), INT8_C( 124), INT8_C( -94), INT8_C( 4),
INT8_C( 46), INT8_C( 11), INT8_C( 3), INT8_C( -54),
INT8_C( 11), INT8_C( 8), INT8_C(-114), INT8_C( 3),
INT8_C( 6), INT8_C( 1), INT8_C(-121), INT8_C( 4)),
simde_mm_set_epi8(INT8_C( -1), INT8_C( 0), INT8_C( 1), INT8_C( 15),
INT8_C( 0), INT8_C( -1), INT8_C( -34), INT8_C( 2),
INT8_C( -11), INT8_C( -9), INT8_C( 0), INT8_C( -12),
INT8_C( 14), INT8_C( -72), INT8_C( 0), INT8_C( -10)) },
{ simde_mm_set_epi8(INT8_C( -13), INT8_C( -82), INT8_C( 64), INT8_C( -67),
INT8_C(-120), INT8_C( 26), INT8_C(-105), INT8_C( 40),
INT8_C( 59), INT8_C( -83), INT8_C( 64), INT8_C( -39),
INT8_C( 99), INT8_C( -73), INT8_C( -97), INT8_C( -1)),
simde_mm_set_epi8(INT8_C( -27), INT8_C( 114), INT8_C(-109), INT8_C( 8),
INT8_C( 12), INT8_C( 4), INT8_C( 2), INT8_C( 2),
INT8_C( 3), INT8_C( 11), INT8_C( 3), INT8_C( 11),
INT8_C( 82), INT8_C( 14), INT8_C( 120), INT8_C(-107)),
simde_mm_set_epi8(INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( -8),
INT8_C( -10), INT8_C( 6), INT8_C( -52), INT8_C( 20),
INT8_C( 19), INT8_C( -7), INT8_C( 21), INT8_C( -3),
INT8_C( 1), INT8_C( -5), INT8_C( 0), INT8_C( 0)) },
{ simde_mm_set_epi8(INT8_C( -57), INT8_C( 53), INT8_C( 114), INT8_C( -35),
INT8_C( -22), INT8_C( -59), INT8_C( 52), INT8_C( 113),
INT8_C( 25), INT8_C( 16), INT8_C( -8), INT8_C( -67),
INT8_C( 7), INT8_C( -33), INT8_C( 51), INT8_C( 118)),
simde_mm_set_epi8(INT8_C( 14), INT8_C( 15), INT8_C( 24), INT8_C( 83),
INT8_C( 4), INT8_C( 45), INT8_C( 4), INT8_C( 34),
INT8_C( 9), INT8_C( 19), INT8_C( 4), INT8_C( 11),
INT8_C( 8), INT8_C( 14), INT8_C( 102), INT8_C( -88)),
simde_mm_set_epi8(INT8_C( -4), INT8_C( 3), INT8_C( 4), INT8_C( 0),
INT8_C( -5), INT8_C( -1), INT8_C( 13), INT8_C( 3),
INT8_C( 2), INT8_C( 0), INT8_C( -2), INT8_C( -6),
INT8_C( 0), INT8_C( -2), INT8_C( 0), INT8_C( -1)) },
{ simde_mm_set_epi8(INT8_C( -69), INT8_C( 57), INT8_C( 3), INT8_C( 127),
INT8_C( -28), INT8_C( -47), INT8_C(-127), INT8_C( -14),
INT8_C( -28), INT8_C( 68), INT8_C( -27), INT8_C( -44),
INT8_C( -16), INT8_C( 1), INT8_C( -44), INT8_C( 112)),
simde_mm_set_epi8(INT8_C( 57), INT8_C( 1), INT8_C( -43), INT8_C( 103),
INT8_C( 4), INT8_C( 1), INT8_C( 2), INT8_C( 96),
INT8_C( 9), INT8_C( 57), INT8_C( 54), INT8_C( 105),
INT8_C( 1), INT8_C( 31), INT8_C( -85), INT8_C( 104)),
simde_mm_set_epi8(INT8_C( -1), INT8_C( 57), INT8_C( 0), INT8_C( 1),
INT8_C( -7), INT8_C( -47), INT8_C( -63), INT8_C( 0),
INT8_C( -3), INT8_C( 1), INT8_C( 0), INT8_C( 0),
INT8_C( -16), INT8_C( 0), INT8_C( 0), INT8_C( 1)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_div_epi8(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_i8(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_div_epi16(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_mm_set_epi16(INT16_C( 7569), INT16_C(-21774), INT16_C( 5125), INT16_C( 21356),
INT16_C( 9222), INT16_C( 7511), INT16_C(-21561), INT16_C( 29102)),
simde_mm_set_epi16(INT16_C( 6450), INT16_C( -2), INT16_C( 190), INT16_C( -44),
INT16_C( -3), INT16_C( -9), INT16_C( -911), INT16_C( 3)),
simde_mm_set_epi16(INT16_C( 1), INT16_C( 10887), INT16_C( 26), INT16_C( -485),
INT16_C( -3074), INT16_C( -834), INT16_C( 23), INT16_C( 9700)) },
{ simde_mm_set_epi16(INT16_C( 14790), INT16_C(-17845), INT16_C( 12471), INT16_C( 16666),
INT16_C( -4541), INT16_C( 18926), INT16_C( 4112), INT16_C( 26905)),
simde_mm_set_epi16(INT16_C( -1), INT16_C( -8), INT16_C( 15), INT16_C( -16),
INT16_C( -1), INT16_C( -28), INT16_C( -3387), INT16_C( -5)),
simde_mm_set_epi16(INT16_C(-14790), INT16_C( 2230), INT16_C( 831), INT16_C( -1041),
INT16_C( 4541), INT16_C( -675), INT16_C( -1), INT16_C( -5381)) },
{ simde_mm_set_epi16(INT16_C( 24700), INT16_C( 18820), INT16_C( -6493), INT16_C(-11852),
INT16_C( 7293), INT16_C( 18330), INT16_C(-13423), INT16_C( 30834)),
simde_mm_set_epi16(INT16_C( 9411), INT16_C( -2), INT16_C( -2), INT16_C( -10),
INT16_C( 942), INT16_C( 5062), INT16_C( 3712), INT16_C(-24297)),
simde_mm_set_epi16(INT16_C( 2), INT16_C( -9410), INT16_C( 3246), INT16_C( 1185),
INT16_C( 7), INT16_C( 3), INT16_C( -3), INT16_C( -1)) },
{ simde_mm_set_epi16(INT16_C( -8188), INT16_C( -5752), INT16_C( -6400), INT16_C(-18754),
INT16_C( 26203), INT16_C( 11990), INT16_C( 27655), INT16_C( 30479)),
simde_mm_set_epi16(INT16_C( -2891), INT16_C( -9), INT16_C( 1), INT16_C( 24),
INT16_C( 1410), INT16_C( -7348), INT16_C( 56), INT16_C( -8)),
simde_mm_set_epi16(INT16_C( 2), INT16_C( 639), INT16_C( -6400), INT16_C( -781),
INT16_C( 18), INT16_C( -1), INT16_C( 493), INT16_C( -3809)) },
{ simde_mm_set_epi16(INT16_C( 27464), INT16_C( 30742), INT16_C(-17463), INT16_C( 5584),
INT16_C( 16882), INT16_C(-13221), INT16_C(-30009), INT16_C( 27529)),
simde_mm_set_epi16(INT16_C( 92), INT16_C( -245), INT16_C( 87), INT16_C( 2027),
INT16_C( -218), INT16_C( 181), INT16_C( 1), INT16_C( -448)),
simde_mm_set_epi16(INT16_C( 298), INT16_C( -125), INT16_C( -200), INT16_C( 2),
INT16_C( -77), INT16_C( -73), INT16_C(-30009), INT16_C( -61)) },
{ simde_mm_set_epi16(INT16_C(-28312), INT16_C( -6464), INT16_C( 7438), INT16_C(-24771),
INT16_C( 27969), INT16_C( 18884), INT16_C( 17235), INT16_C( 31019)),
simde_mm_set_epi16(INT16_C( -3989), INT16_C( 8), INT16_C( -1), INT16_C( -27),
INT16_C( 53), INT16_C( -58), INT16_C( 2274), INT16_C( -9)),
simde_mm_set_epi16(INT16_C( 7), INT16_C( -808), INT16_C( -7438), INT16_C( 917),
INT16_C( 527), INT16_C( -325), INT16_C( 7), INT16_C( -3446)) },
{ simde_mm_set_epi16(INT16_C(-31090), INT16_C( 20346), INT16_C( 14276), INT16_C(-27653),
INT16_C( 19203), INT16_C(-24798), INT16_C(-17826), INT16_C( 16379)),
simde_mm_set_epi16(INT16_C( 3), INT16_C( 8), INT16_C( -60), INT16_C( 14),
INT16_C( -435), INT16_C( -1), INT16_C( -395), INT16_C( -1532)),
simde_mm_set_epi16(INT16_C(-10363), INT16_C( 2543), INT16_C( -237), INT16_C( -1975),
INT16_C( -44), INT16_C( 24798), INT16_C( 45), INT16_C( -10)) },
{ simde_mm_set_epi16(INT16_C( -4012), INT16_C( 17981), INT16_C( 26341), INT16_C(-11451),
INT16_C(-22746), INT16_C(-13246), INT16_C( -6273), INT16_C( 15936)),
simde_mm_set_epi16(INT16_C( -5), INT16_C( 325), INT16_C( 10), INT16_C( -2018),
INT16_C(-26192), INT16_C( -15), INT16_C( -29), INT16_C( 2009)),
simde_mm_set_epi16(INT16_C( 802), INT16_C( 55), INT16_C( 2634), INT16_C( 5),
INT16_C( 0), INT16_C( 883), INT16_C( 216), INT16_C( 7)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_div_epi16(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_i16(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_div_epi32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_mm_set_epi32(INT32_C(-2101284579), INT32_C( 1788896628), INT32_C( 742774378), INT32_C( -512831871)),
simde_mm_set_epi32(INT32_C( -173), INT32_C( -20613654), INT32_C( 28772), INT32_C( 118)),
simde_mm_set_epi32(INT32_C( 12146153), INT32_C( -86), INT32_C( 25815), INT32_C( -4346032)) },
{ simde_mm_set_epi32(INT32_C( 505370509), INT32_C( -307733024), INT32_C( -192358019), INT32_C( -299231491)),
simde_mm_set_epi32(INT32_C( 34268), INT32_C( -6), INT32_C( 6850), INT32_C( 1214711)),
simde_mm_set_epi32(INT32_C( 14747), INT32_C( 51288837), INT32_C( -28081), INT32_C( -246)) },
{ simde_mm_set_epi32(INT32_C(-1154189768), INT32_C( 94538029), INT32_C( 423884488), INT32_C( 1619435962)),
simde_mm_set_epi32(INT32_C( -565), INT32_C( -128659), INT32_C( -59), INT32_C( -208397178)),
simde_mm_set_epi32(INT32_C( 2042813), INT32_C( -734), INT32_C( -7184482), INT32_C( -7)) },
{ simde_mm_set_epi32(INT32_C(-1938127942), INT32_C( -553846699), INT32_C( 685427224), INT32_C( -86375451)),
simde_mm_set_epi32(INT32_C( 1223981911), INT32_C( -108113), INT32_C( 3), INT32_C( -3698)),
simde_mm_set_epi32(INT32_C( -1), INT32_C( 5122), INT32_C( 228475741), INT32_C( 23357)) },
{ simde_mm_set_epi32(INT32_C(-1690889220), INT32_C( -667367235), INT32_C( 1220206139), INT32_C(-1217543723)),
simde_mm_set_epi32(INT32_C( 299), INT32_C( 7724), INT32_C( -1), INT32_C( 173051558)),
simde_mm_set_epi32(INT32_C( -5655147), INT32_C( -86401), INT32_C(-1220206139), INT32_C( -7)) },
{ simde_mm_set_epi32(INT32_C( 93323521), INT32_C( 1996592708), INT32_C( 2087305602), INT32_C( 27568495)),
simde_mm_set_epi32(INT32_C( -2), INT32_C( 15626723), INT32_C( 1507), INT32_C( 5412)),
simde_mm_set_epi32(INT32_C( -46661760), INT32_C( 127), INT32_C( 1385073), INT32_C( 5093)) },
{ simde_mm_set_epi32(INT32_C( 1825211631), INT32_C( 1750705004), INT32_C( 1935103134), INT32_C(-1042289581)),
simde_mm_set_epi32(INT32_C( -20153), INT32_C( -109992928), INT32_C( -4), INT32_C( 3)),
simde_mm_set_epi32(INT32_C( -90567), INT32_C( -15), INT32_C( -483775783), INT32_C( -347429860)) },
{ simde_mm_set_epi32(INT32_C( -836927167), INT32_C(-2031963629), INT32_C( 1244477192), INT32_C( 662038781)),
simde_mm_set_epi32(INT32_C( -226), INT32_C( 320), INT32_C( 17085036), INT32_C( -883)),
simde_mm_set_epi32(INT32_C( 3703217), INT32_C( -6349886), INT32_C( 72), INT32_C( -749760)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_div_epi32(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_i32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_div_epi64(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_mm_set_epi64x(INT64_C(-8762915026342605517), INT64_C( 6327019035084041530)),
simde_mm_set_epi64x(INT64_C( 1040172869250133860), INT64_C( -3393154419)),
simde_mm_set_epi64x(INT64_C( -8), INT64_C( -1864642233)) },
{ simde_mm_set_epi64x(INT64_C( 7086115847005357544), INT64_C( 7169462887889416879)),
simde_mm_set_epi64x(INT64_C( -402272), INT64_C( -6362438)),
simde_mm_set_epi64x(INT64_C( -17615235082246), INT64_C( -1126842082844)) },
{ simde_mm_set_epi64x(INT64_C( 3227829673356714047), INT64_C( 5122063021698718134)),
simde_mm_set_epi64x(INT64_C( 290796), INT64_C( -647054)),
simde_mm_set_epi64x(INT64_C( 11099979619240), INT64_C( -7915974588981)) },
{ simde_mm_set_epi64x(INT64_C( -712959233727550094), INT64_C( 8175697730423622547)),
simde_mm_set_epi64x(INT64_C( -114108996), INT64_C( 727492806)),
simde_mm_set_epi64x(INT64_C( 6248054568), INT64_C( 11238183612)) },
{ simde_mm_set_epi64x(INT64_C( 7475816922473172733), INT64_C(-1631503293395556188)),
simde_mm_set_epi64x(INT64_C( 5), INT64_C( -24770378177)),
simde_mm_set_epi64x(INT64_C( 1495163384494634546), INT64_C( 65865094)) },
{ simde_mm_set_epi64x(INT64_C(-7220293124938945390), INT64_C( 5345879758546587877)),
simde_mm_set_epi64x(INT64_C( -716), INT64_C( 1692902)),
simde_mm_set_epi64x(INT64_C( 10084208275054393), INT64_C( 3157819979270)) },
{ simde_mm_set_epi64x(INT64_C(-2100788141468237692), INT64_C( 1869244361192362281)),
simde_mm_set_epi64x(INT64_C( -1), INT64_C( 27867346395)),
simde_mm_set_epi64x(INT64_C( 2100788141468237692), INT64_C( 67076510)) },
{ simde_mm_set_epi64x(INT64_C(-4218200756000910912), INT64_C( 8429274423139369867)),
simde_mm_set_epi64x(INT64_C( 25), INT64_C( -63869567732)),
simde_mm_set_epi64x(INT64_C( -168728030240036436), INT64_C( -131976381)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_div_epi64(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_i64(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_div_epu8(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_x_mm_set_epu8(UINT8_C( 15), UINT8_C( 75), UINT8_C(224), UINT8_C(156),
UINT8_C( 1), UINT8_C( 34), UINT8_C( 35), UINT8_C(127),
UINT8_C(127), UINT8_C(120), UINT8_C(177), UINT8_C( 31),
UINT8_C(136), UINT8_C(180), UINT8_C(141), UINT8_C(206)),
simde_x_mm_set_epu8(UINT8_C( 45), UINT8_C( 8), UINT8_C( 9), UINT8_C( 13),
UINT8_C(246), UINT8_C( 1), UINT8_C( 15), UINT8_C( 2),
UINT8_C(152), UINT8_C( 45), UINT8_C( 56), UINT8_C( 26),
UINT8_C( 1), UINT8_C( 1), UINT8_C( 16), UINT8_C( 15)),
simde_x_mm_set_epu8(UINT8_C( 0), UINT8_C( 9), UINT8_C( 24), UINT8_C( 12),
UINT8_C( 0), UINT8_C( 34), UINT8_C( 2), UINT8_C( 63),
UINT8_C( 0), UINT8_C( 2), UINT8_C( 3), UINT8_C( 1),
UINT8_C(136), UINT8_C(180), UINT8_C( 8), UINT8_C( 13)) },
{ simde_x_mm_set_epu8(UINT8_C( 75), UINT8_C(233), UINT8_C(186), UINT8_C(216),
UINT8_C(224), UINT8_C( 45), UINT8_C( 40), UINT8_C(134),
UINT8_C( 1), UINT8_C( 47), UINT8_C( 23), UINT8_C(119),
UINT8_C(229), UINT8_C(107), UINT8_C(175), UINT8_C( 79)),
simde_x_mm_set_epu8(UINT8_C( 9), UINT8_C( 12), UINT8_C( 46), UINT8_C( 39),
UINT8_C( 11), UINT8_C( 15), UINT8_C( 32), UINT8_C( 13),
UINT8_C( 21), UINT8_C(239), UINT8_C( 5), UINT8_C( 2),
UINT8_C( 1), UINT8_C( 26), UINT8_C(182), UINT8_C( 29)),
simde_x_mm_set_epu8(UINT8_C( 8), UINT8_C( 19), UINT8_C( 4), UINT8_C( 5),
UINT8_C( 20), UINT8_C( 3), UINT8_C( 1), UINT8_C( 10),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 4), UINT8_C( 59),
UINT8_C(229), UINT8_C( 4), UINT8_C( 0), UINT8_C( 2)) },
{ simde_x_mm_set_epu8(UINT8_C( 75), UINT8_C(109), UINT8_C( 28), UINT8_C(204),
UINT8_C( 53), UINT8_C(255), UINT8_C(143), UINT8_C(254),
UINT8_C( 82), UINT8_C(109), UINT8_C(205), UINT8_C( 21),
UINT8_C( 16), UINT8_C( 18), UINT8_C(221), UINT8_C(119)),
simde_x_mm_set_epu8(UINT8_C(210), UINT8_C(108), UINT8_C( 89), UINT8_C( 21),
UINT8_C(154), UINT8_C( 52), UINT8_C( 17), UINT8_C( 8),
UINT8_C( 90), UINT8_C( 6), UINT8_C( 1), UINT8_C( 5),
UINT8_C( 1), UINT8_C(201), UINT8_C( 23), UINT8_C( 2)),
simde_x_mm_set_epu8(UINT8_C( 0), UINT8_C( 1), UINT8_C( 0), UINT8_C( 9),
UINT8_C( 0), UINT8_C( 4), UINT8_C( 8), UINT8_C( 31),
UINT8_C( 0), UINT8_C( 18), UINT8_C(205), UINT8_C( 4),
UINT8_C( 16), UINT8_C( 0), UINT8_C( 9), UINT8_C( 59)) },
{ simde_x_mm_set_epu8(UINT8_C( 23), UINT8_C(229), UINT8_C(200), UINT8_C( 62),
UINT8_C(169), UINT8_C(116), UINT8_C(131), UINT8_C(205),
UINT8_C(117), UINT8_C( 49), UINT8_C(130), UINT8_C( 21),
UINT8_C( 91), UINT8_C(138), UINT8_C(101), UINT8_C(205)),
simde_x_mm_set_epu8(UINT8_C( 43), UINT8_C( 65), UINT8_C( 28), UINT8_C( 61),
UINT8_C( 12), UINT8_C( 4), UINT8_C( 37), UINT8_C( 4),
UINT8_C(237), UINT8_C( 25), UINT8_C( 38), UINT8_C( 15),
UINT8_C( 9), UINT8_C( 6), UINT8_C(140), UINT8_C( 10)),
simde_x_mm_set_epu8(UINT8_C( 0), UINT8_C( 3), UINT8_C( 7), UINT8_C( 1),
UINT8_C( 14), UINT8_C( 29), UINT8_C( 3), UINT8_C( 51),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 3), UINT8_C( 1),
UINT8_C( 10), UINT8_C( 23), UINT8_C( 0), UINT8_C( 20)) },
{ simde_x_mm_set_epu8(UINT8_C(140), UINT8_C(170), UINT8_C(150), UINT8_C(208),
UINT8_C( 64), UINT8_C( 6), UINT8_C(116), UINT8_C(102),
UINT8_C(200), UINT8_C(110), UINT8_C(136), UINT8_C(125),
UINT8_C(201), UINT8_C( 22), UINT8_C(166), UINT8_C(235)),
simde_x_mm_set_epu8(UINT8_C( 1), UINT8_C( 7), UINT8_C( 23), UINT8_C( 2),
UINT8_C( 12), UINT8_C(103), UINT8_C( 24), UINT8_C( 18),
UINT8_C(234), UINT8_C( 11), UINT8_C( 6), UINT8_C( 2),
UINT8_C( 5), UINT8_C( 34), UINT8_C( 60), UINT8_C( 13)),
simde_x_mm_set_epu8(UINT8_C(140), UINT8_C( 24), UINT8_C( 6), UINT8_C(104),
UINT8_C( 5), UINT8_C( 0), UINT8_C( 4), UINT8_C( 5),
UINT8_C( 0), UINT8_C( 10), UINT8_C( 22), UINT8_C( 62),
UINT8_C( 40), UINT8_C( 0), UINT8_C( 2), UINT8_C( 18)) },
{ simde_x_mm_set_epu8(UINT8_C(143), UINT8_C( 77), UINT8_C(114), UINT8_C( 66),
UINT8_C( 82), UINT8_C(133), UINT8_C( 93), UINT8_C(122),
UINT8_C(225), UINT8_C(230), UINT8_C(202), UINT8_C(147),
UINT8_C(170), UINT8_C(252), UINT8_C(163), UINT8_C(161)),
simde_x_mm_set_epu8(UINT8_C( 5), UINT8_C( 8), UINT8_C( 15), UINT8_C( 99),
UINT8_C( 10), UINT8_C( 4), UINT8_C( 1), UINT8_C( 1),
UINT8_C( 15), UINT8_C( 21), UINT8_C( 3), UINT8_C( 1),
UINT8_C( 2), UINT8_C( 18), UINT8_C( 18), UINT8_C( 2)),
simde_x_mm_set_epu8(UINT8_C( 28), UINT8_C( 9), UINT8_C( 7), UINT8_C( 0),
UINT8_C( 8), UINT8_C( 33), UINT8_C( 93), UINT8_C(122),
UINT8_C( 15), UINT8_C( 10), UINT8_C( 67), UINT8_C(147),
UINT8_C( 85), UINT8_C( 14), UINT8_C( 9), UINT8_C( 80)) },
{ simde_x_mm_set_epu8(UINT8_C(125), UINT8_C(134), UINT8_C(114), UINT8_C( 16),
UINT8_C(101), UINT8_C( 75), UINT8_C( 71), UINT8_C(136),
UINT8_C(137), UINT8_C(104), UINT8_C(249), UINT8_C(115),
UINT8_C(110), UINT8_C(132), UINT8_C(229), UINT8_C( 48)),
simde_x_mm_set_epu8(UINT8_C( 69), UINT8_C( 11), UINT8_C( 3), UINT8_C( 2),
UINT8_C( 2), UINT8_C( 21), UINT8_C( 3), UINT8_C( 1),
UINT8_C( 5), UINT8_C( 1), UINT8_C( 3), UINT8_C( 2),
UINT8_C( 1), UINT8_C(163), UINT8_C( 1), UINT8_C( 2)),
simde_x_mm_set_epu8(UINT8_C( 1), UINT8_C( 12), UINT8_C( 38), UINT8_C( 8),
UINT8_C( 50), UINT8_C( 3), UINT8_C( 23), UINT8_C(136),
UINT8_C( 27), UINT8_C(104), UINT8_C( 83), UINT8_C( 57),
UINT8_C(110), UINT8_C( 0), UINT8_C(229), UINT8_C( 24)) },
{ simde_x_mm_set_epu8(UINT8_C( 72), UINT8_C(139), UINT8_C(120), UINT8_C(127),
UINT8_C(102), UINT8_C(165), UINT8_C( 82), UINT8_C( 63),
UINT8_C(192), UINT8_C( 18), UINT8_C(103), UINT8_C(151),
UINT8_C( 81), UINT8_C(222), UINT8_C(212), UINT8_C( 1)),
simde_x_mm_set_epu8(UINT8_C( 7), UINT8_C( 26), UINT8_C( 32), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 1), UINT8_C( 3), UINT8_C( 2),
UINT8_C( 65), UINT8_C( 24), UINT8_C( 1), UINT8_C( 97),
UINT8_C( 14), UINT8_C( 8), UINT8_C( 89), UINT8_C( 11)),
simde_x_mm_set_epu8(UINT8_C( 10), UINT8_C( 5), UINT8_C( 3), UINT8_C(127),
UINT8_C(102), UINT8_C(165), UINT8_C( 27), UINT8_C( 31),
UINT8_C( 2), UINT8_C( 0), UINT8_C(103), UINT8_C( 1),
UINT8_C( 5), UINT8_C( 27), UINT8_C( 2), UINT8_C( 0)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_div_epu8(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_u8(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_div_epu16(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_x_mm_set_epu16(UINT16_C(27566), UINT16_C(40504), UINT16_C( 4629), UINT16_C(53715),
UINT16_C( 9716), UINT16_C( 9411), UINT16_C(47476), UINT16_C(41385)),
simde_x_mm_set_epu16(UINT16_C( 13), UINT16_C( 6506), UINT16_C( 2031), UINT16_C( 2041),
UINT16_C( 41), UINT16_C( 3089), UINT16_C( 4707), UINT16_C( 3)),
simde_x_mm_set_epu16(UINT16_C( 2120), UINT16_C( 6), UINT16_C( 2), UINT16_C( 26),
UINT16_C( 236), UINT16_C( 3), UINT16_C( 10), UINT16_C(13795)) },
{ simde_x_mm_set_epu16(UINT16_C( 9353), UINT16_C( 761), UINT16_C( 3256), UINT16_C(15648),
UINT16_C(54529), UINT16_C(37909), UINT16_C( 6524), UINT16_C(24806)),
simde_x_mm_set_epu16(UINT16_C(17088), UINT16_C( 3660), UINT16_C( 3), UINT16_C( 9),
UINT16_C( 186), UINT16_C( 2), UINT16_C( 7), UINT16_C( 1856)),
simde_x_mm_set_epu16(UINT16_C( 0), UINT16_C( 0), UINT16_C( 1085), UINT16_C( 1738),
UINT16_C( 293), UINT16_C(18954), UINT16_C( 932), UINT16_C( 13)) },
{ simde_x_mm_set_epu16(UINT16_C(19795), UINT16_C(45332), UINT16_C(60579), UINT16_C(32327),
UINT16_C(25905), UINT16_C(63671), UINT16_C( 930), UINT16_C(32017)),
simde_x_mm_set_epu16(UINT16_C( 8), UINT16_C(30488), UINT16_C( 26), UINT16_C( 3397),
UINT16_C( 1518), UINT16_C( 2), UINT16_C( 20), UINT16_C( 6)),
simde_x_mm_set_epu16(UINT16_C( 2474), UINT16_C( 1), UINT16_C( 2329), UINT16_C( 9),
UINT16_C( 17), UINT16_C(31835), UINT16_C( 46), UINT16_C( 5336)) },
{ simde_x_mm_set_epu16(UINT16_C(29801), UINT16_C(62435), UINT16_C(31106), UINT16_C(58247),
UINT16_C(47275), UINT16_C(34875), UINT16_C(63847), UINT16_C( 8602)),
simde_x_mm_set_epu16(UINT16_C( 5), UINT16_C( 1), UINT16_C( 842), UINT16_C( 1634),
UINT16_C( 11), UINT16_C( 25), UINT16_C( 3640), UINT16_C( 932)),
simde_x_mm_set_epu16(UINT16_C( 5960), UINT16_C(62435), UINT16_C( 36), UINT16_C( 35),
UINT16_C( 4297), UINT16_C( 1395), UINT16_C( 17), UINT16_C( 9)) },
{ simde_x_mm_set_epu16(UINT16_C(41564), UINT16_C(16940), UINT16_C(39647), UINT16_C(59460),
UINT16_C(17425), UINT16_C(59711), UINT16_C(30880), UINT16_C(42139)),
simde_x_mm_set_epu16(UINT16_C(25139), UINT16_C( 3416), UINT16_C( 43), UINT16_C( 6),
UINT16_C( 4), UINT16_C( 1256), UINT16_C( 60), UINT16_C( 129)),
simde_x_mm_set_epu16(UINT16_C( 1), UINT16_C( 4), UINT16_C( 922), UINT16_C( 9910),
UINT16_C( 4356), UINT16_C( 47), UINT16_C( 514), UINT16_C( 326)) },
{ simde_x_mm_set_epu16(UINT16_C(39593), UINT16_C(41522), UINT16_C(58894), UINT16_C( 6383),
UINT16_C(39956), UINT16_C( 2820), UINT16_C(20260), UINT16_C(57360)),
simde_x_mm_set_epu16(UINT16_C( 1), UINT16_C(10468), UINT16_C( 2), UINT16_C( 79),
UINT16_C( 5), UINT16_C( 1166), UINT16_C( 2), UINT16_C( 3)),
simde_x_mm_set_epu16(UINT16_C(39593), UINT16_C( 3), UINT16_C(29447), UINT16_C( 80),
UINT16_C( 7991), UINT16_C( 2), UINT16_C(10130), UINT16_C(19120)) },
{ simde_x_mm_set_epu16(UINT16_C(58633), UINT16_C(30014), UINT16_C(57061), UINT16_C(60439),
UINT16_C(22536), UINT16_C(20868), UINT16_C(20870), UINT16_C(13916)),
simde_x_mm_set_epu16(UINT16_C( 15), UINT16_C( 490), UINT16_C( 2338), UINT16_C( 64),
UINT16_C( 876), UINT16_C( 706), UINT16_C( 65), UINT16_C( 320)),
simde_x_mm_set_epu16(UINT16_C( 3908), UINT16_C( 61), UINT16_C( 24), UINT16_C( 944),
UINT16_C( 25), UINT16_C( 29), UINT16_C( 321), UINT16_C( 43)) },
{ simde_x_mm_set_epu16(UINT16_C( 6697), UINT16_C(21906), UINT16_C(59582), UINT16_C(44845),
UINT16_C(35883), UINT16_C(64682), UINT16_C(55100), UINT16_C(57711)),
simde_x_mm_set_epu16(UINT16_C( 7058), UINT16_C( 10), UINT16_C(60566), UINT16_C( 1),
UINT16_C( 1), UINT16_C( 872), UINT16_C( 109), UINT16_C( 1)),
simde_x_mm_set_epu16(UINT16_C( 0), UINT16_C( 2190), UINT16_C( 0), UINT16_C(44845),
UINT16_C(35883), UINT16_C( 74), UINT16_C( 505), UINT16_C(57711)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_div_epu16(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_u16(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_div_epu32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_x_mm_set_epu32(UINT32_C(3152261024), UINT32_C(2598586578), UINT32_C(1610828679), UINT32_C(3536337768)),
simde_x_mm_set_epu32(UINT32_C( 14157), UINT32_C( 947), UINT32_C(1043337665), UINT32_C( 97937)),
simde_x_mm_set_epu32(UINT32_C( 222664), UINT32_C( 2744019), UINT32_C( 1), UINT32_C( 36108)) },
{ simde_x_mm_set_epu32(UINT32_C( 75140339), UINT32_C(1941562012), UINT32_C( 857740081), UINT32_C(1336535286)),
simde_x_mm_set_epu32(UINT32_C( 22), UINT32_C( 1682), UINT32_C( 11), UINT32_C( 2)),
simde_x_mm_set_epu32(UINT32_C( 3415469), UINT32_C( 1154317), UINT32_C( 77976371), UINT32_C( 668267643)) },
{ simde_x_mm_set_epu32(UINT32_C( 948661264), UINT32_C(1195769225), UINT32_C( 694120276), UINT32_C(3517239447)),
simde_x_mm_set_epu32(UINT32_C( 3949), UINT32_C( 275), UINT32_C( 12430067), UINT32_C( 15794)),
simde_x_mm_set_epu32(UINT32_C( 240228), UINT32_C( 4348251), UINT32_C( 55), UINT32_C( 222694)) },
{ simde_x_mm_set_epu32(UINT32_C(3023938951), UINT32_C(4109050401), UINT32_C( 287757059), UINT32_C(2648669825)),
simde_x_mm_set_epu32(UINT32_C( 57756), UINT32_C( 40), UINT32_C(1080216164), UINT32_C( 173312)),
simde_x_mm_set_epu32(UINT32_C( 52357), UINT32_C( 102726260), UINT32_C( 0), UINT32_C( 15282)) },
{ simde_x_mm_set_epu32(UINT32_C( 864299658), UINT32_C(2427378437), UINT32_C( 823539242), UINT32_C(1758563044)),
simde_x_mm_set_epu32(UINT32_C( 225), UINT32_C( 75), UINT32_C( 11529), UINT32_C( 119418298)),
simde_x_mm_set_epu32(UINT32_C( 3841331), UINT32_C( 32365045), UINT32_C( 71431), UINT32_C( 14)) },
{ simde_x_mm_set_epu32(UINT32_C(2662820398), UINT32_C(1208068616), UINT32_C(2158211537), UINT32_C(3417661837)),
simde_x_mm_set_epu32(UINT32_C( 2367), UINT32_C( 126619), UINT32_C( 55203), UINT32_C( 155)),
simde_x_mm_set_epu32(UINT32_C( 1124976), UINT32_C( 9540), UINT32_C( 39095), UINT32_C( 22049431)) },
{ simde_x_mm_set_epu32(UINT32_C(1097247740), UINT32_C(3448507951), UINT32_C(4106436665), UINT32_C(3017338787)),
simde_x_mm_set_epu32(UINT32_C( 61963115), UINT32_C( 238397327), UINT32_C( 245318), UINT32_C( 3312135)),
simde_x_mm_set_epu32(UINT32_C( 17), UINT32_C( 14), UINT32_C( 16739), UINT32_C( 910)) },
{ simde_x_mm_set_epu32(UINT32_C(3006363325), UINT32_C(2983927188), UINT32_C(2177891039), UINT32_C(1117727917)),
simde_x_mm_set_epu32(UINT32_C( 24), UINT32_C( 12), UINT32_C(1067413818), UINT32_C( 206)),
simde_x_mm_set_epu32(UINT32_C( 125265138), UINT32_C( 248660599), UINT32_C( 2), UINT32_C( 5425863)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_div_epu32(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_u32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_div_epu64(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_x_mm_set_epu64x(UINT64_C(14823946846053138543), UINT64_C( 2773213006356142856)),
simde_x_mm_set_epu64x(UINT64_C( 22806630538915743), UINT64_C( 1295)),
simde_x_mm_set_epu64x(UINT64_C( 649), UINT64_C( 2141477224985438)) },
{ simde_x_mm_set_epu64x(UINT64_C(16338394746286416599), UINT64_C( 4395568244008230294)),
simde_x_mm_set_epu64x(UINT64_C( 1610), UINT64_C( 68247035008)),
simde_x_mm_set_epu64x(UINT64_C( 10148071270985351), UINT64_C( 64406728)) },
{ simde_x_mm_set_epu64x(UINT64_C( 6431957656146818365), UINT64_C(14710883493083458909)),
simde_x_mm_set_epu64x(UINT64_C( 2399266305377), UINT64_C( 16092627197291141)),
simde_x_mm_set_epu64x(UINT64_C( 2680801), UINT64_C( 914)) },
{ simde_x_mm_set_epu64x(UINT64_C( 7920700281052633117), UINT64_C(15482760419196872328)),
simde_x_mm_set_epu64x(UINT64_C( 45928957131), UINT64_C( 837231)),
simde_x_mm_set_epu64x(UINT64_C( 172455478), UINT64_C( 18492817895176)) },
{ simde_x_mm_set_epu64x(UINT64_C( 230158309193392347), UINT64_C(18390356791266391163)),
simde_x_mm_set_epu64x(UINT64_C( 2253), UINT64_C( 1691141090999)),
simde_x_mm_set_epu64x(UINT64_C( 102156373365908), UINT64_C( 10874525)) },
{ simde_x_mm_set_epu64x(UINT64_C(12307531484633875995), UINT64_C(16695234188854570094)),
simde_x_mm_set_epu64x(UINT64_C( 131150029), UINT64_C( 516657134296053652)),
simde_x_mm_set_epu64x(UINT64_C( 93843147260), UINT64_C( 32)) },
{ simde_x_mm_set_epu64x(UINT64_C(11764896934406933200), UINT64_C(18439918542668248477)),
simde_x_mm_set_epu64x(UINT64_C( 306481550847), UINT64_C( 776223621938168297)),
simde_x_mm_set_epu64x(UINT64_C( 38386966), UINT64_C( 23)) },
{ simde_x_mm_set_epu64x(UINT64_C(15338454595408931369), UINT64_C(14530768559531423502)),
simde_x_mm_set_epu64x(UINT64_C( 3408), UINT64_C( 2)),
simde_x_mm_set_epu64x(UINT64_C( 4500720245131728), UINT64_C( 7265384279765711751)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_div_epu64(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_u64(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_div_epi8(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_mm256_set_epi8(INT8_C( -27), INT8_C( 46), INT8_C(-122), INT8_C( 87),
INT8_C( 34), INT8_C( -53), INT8_C( 64), INT8_C( -70),
INT8_C( 25), INT8_C( -17), INT8_C( 56), INT8_C( 3),
INT8_C( -75), INT8_C( -17), INT8_C( -12), INT8_C( 60),
INT8_C( 100), INT8_C( -7), INT8_C(-102), INT8_C( -6),
INT8_C( -10), INT8_C(-111), INT8_C( 106), INT8_C( -43),
INT8_C( -28), INT8_C( -46), INT8_C( 42), INT8_C( -58),
INT8_C( 85), INT8_C( -33), INT8_C(-106), INT8_C(-106)),
simde_mm256_set_epi8(INT8_C( 1), INT8_C( 4), INT8_C( -31), INT8_C( 6),
INT8_C( 13), INT8_C( 15), INT8_C( 20), INT8_C( 3),
INT8_C( -77), INT8_C( 32), INT8_C( 5), INT8_C( 55),
INT8_C( 5), INT8_C( 1), INT8_C( 16), INT8_C( 49),
INT8_C( 43), INT8_C( 83), INT8_C( 5), INT8_C( 16),
INT8_C( 34), INT8_C( 20), INT8_C( 2), INT8_C( 13),
INT8_C( 8), INT8_C( 2), INT8_C( 90), INT8_C( 2),
INT8_C( 23), INT8_C( 12), INT8_C( 2), INT8_C( 5)),
simde_mm256_set_epi8(INT8_C( -27), INT8_C( 11), INT8_C( 3), INT8_C( 14),
INT8_C( 2), INT8_C( -3), INT8_C( 3), INT8_C( -23),
INT8_C( 0), INT8_C( 0), INT8_C( 11), INT8_C( 0),
INT8_C( -15), INT8_C( -17), INT8_C( 0), INT8_C( 1),
INT8_C( 2), INT8_C( 0), INT8_C( -20), INT8_C( 0),
INT8_C( 0), INT8_C( -5), INT8_C( 53), INT8_C( -3),
INT8_C( -3), INT8_C( -23), INT8_C( 0), INT8_C( -29),
INT8_C( 3), INT8_C( -2), INT8_C( -53), INT8_C( -21)) },
{ simde_mm256_set_epi8(INT8_C( 64), INT8_C(-114), INT8_C( 66), INT8_C( -73),
INT8_C( -80), INT8_C( 97), INT8_C( 103), INT8_C( -46),
INT8_C( -83), INT8_C( 104), INT8_C( 22), INT8_C( -39),
INT8_C( 114), INT8_C( -82), INT8_C( 83), INT8_C( 122),
INT8_C( 1), INT8_C( 51), INT8_C( 75), INT8_C(-100),
INT8_C( 17), INT8_C( 37), INT8_C( 53), INT8_C( -57),
INT8_C( 121), INT8_C( -35), INT8_C( 108), INT8_C( -68),
INT8_C( 25), INT8_C( -78), INT8_C( -54), INT8_C(-104)),
simde_mm256_set_epi8(INT8_C( 91), INT8_C( 10), INT8_C( -96), INT8_C( 14),
INT8_C( 21), INT8_C( 23), INT8_C( 1), INT8_C( 8),
INT8_C( 9), INT8_C( 2), INT8_C( 8), INT8_C( 30),
INT8_C( 1), INT8_C( -75), INT8_C( 15), INT8_C( 1),
INT8_C( 27), INT8_C( 5), INT8_C( 104), INT8_C( 48),
INT8_C( 11), INT8_C( 4), INT8_C( 31), INT8_C( 3),
INT8_C( 20), INT8_C( 118), INT8_C( 1), INT8_C( 18),
INT8_C( 1), INT8_C( 22), INT8_C( 20), INT8_C( 33)),
simde_mm256_set_epi8(INT8_C( 0), INT8_C( -11), INT8_C( 0), INT8_C( -5),
INT8_C( -3), INT8_C( 4), INT8_C( 103), INT8_C( -5),
INT8_C( -9), INT8_C( 52), INT8_C( 2), INT8_C( -1),
INT8_C( 114), INT8_C( 1), INT8_C( 5), INT8_C( 122),
INT8_C( 0), INT8_C( 10), INT8_C( 0), INT8_C( -2),
INT8_C( 1), INT8_C( 9), INT8_C( 1), INT8_C( -19),
INT8_C( 6), INT8_C( 0), INT8_C( 108), INT8_C( -3),
INT8_C( 25), INT8_C( -3), INT8_C( -2), INT8_C( -3)) },
{ simde_mm256_set_epi8(INT8_C( 123), INT8_C( 92), INT8_C( -58), INT8_C( 47),
INT8_C( 51), INT8_C( 47), INT8_C( 69), INT8_C( 12),
INT8_C( 68), INT8_C( -99), INT8_C( 76), INT8_C( 32),
INT8_C( 85), INT8_C( -81), INT8_C( -3), INT8_C( -4),
INT8_C( -35), INT8_C( -48), INT8_C( 17), INT8_C( -73),
INT8_C( 109), INT8_C( 88), INT8_C( -56), INT8_C( -99),
INT8_C(-114), INT8_C( 127), INT8_C( 26), INT8_C( -29),
INT8_C( -48), INT8_C( -28), INT8_C( 93), INT8_C( -85)),
simde_mm256_set_epi8(INT8_C( 86), INT8_C( 12), INT8_C( 90), INT8_C( 46),
INT8_C( 10), INT8_C( 18), INT8_C( 1), INT8_C( 58),
INT8_C( -94), INT8_C( 4), INT8_C( 2), INT8_C( 1),
INT8_C( 20), INT8_C( 20), INT8_C( 1), INT8_C( 10),
INT8_C( 4), INT8_C( 13), INT8_C( 1), INT8_C( 1),
INT8_C( 1), INT8_C( 3), INT8_C( 16), INT8_C( 4),
INT8_C( 4), INT8_C( 2), INT8_C( 8), INT8_C( -96),
INT8_C( 1), INT8_C( 5), INT8_C( -98), INT8_C( 11)),
simde_mm256_set_epi8(INT8_C( 1), INT8_C( 7), INT8_C( 0), INT8_C( 1),
INT8_C( 5), INT8_C( 2), INT8_C( 69), INT8_C( 0),
INT8_C( 0), INT8_C( -24), INT8_C( 38), INT8_C( 32),
INT8_C( 4), INT8_C( -4), INT8_C( -3), INT8_C( 0),
INT8_C( -8), INT8_C( -3), INT8_C( 17), INT8_C( -73),
INT8_C( 109), INT8_C( 29), INT8_C( -3), INT8_C( -24),
INT8_C( -28), INT8_C( 63), INT8_C( 3), INT8_C( 0),
INT8_C( -48), INT8_C( -5), INT8_C( 0), INT8_C( -7)) },
{ simde_mm256_set_epi8(INT8_C( -83), INT8_C( 8), INT8_C( 39), INT8_C( 32),
INT8_C( -68), INT8_C( 0), INT8_C( 93), INT8_C( 7),
INT8_C( -26), INT8_C( -37), INT8_C( 3), INT8_C( -23),
INT8_C( 38), INT8_C( -61), INT8_C( 87), INT8_C( 32),
INT8_C( 65), INT8_C( 24), INT8_C( -17), INT8_C( -19),
INT8_C( 113), INT8_C( -25), INT8_C( 58), INT8_C( 4),
INT8_C(-127), INT8_C( 41), INT8_C( -74), INT8_C( 113),
INT8_C( 49), INT8_C( -39), INT8_C( -48), INT8_C( 114)),
simde_mm256_set_epi8(INT8_C(-102), INT8_C( 1), INT8_C( 22), INT8_C( 1),
INT8_C( 15), INT8_C( 2), INT8_C( 19), INT8_C( 69),
INT8_C( 1), INT8_C( 49), INT8_C( 66), INT8_C( 2),
INT8_C( 1), INT8_C( 2), INT8_C( 10), INT8_C( 8),
INT8_C( 1), INT8_C( 1), INT8_C( 4), INT8_C( 66),
INT8_C( 11), INT8_C( 22), INT8_C(-126), INT8_C( 49),
INT8_C( 1), INT8_C( 38), INT8_C( 1), INT8_C( 3),
INT8_C( 7), INT8_C( 3), INT8_C( 21), INT8_C( 21)),
simde_mm256_set_epi8(INT8_C( 0), INT8_C( 8), INT8_C( 1), INT8_C( 32),
INT8_C( -4), INT8_C( 0), INT8_C( 4), INT8_C( 0),
INT8_C( -26), INT8_C( 0), INT8_C( 0), INT8_C( -11),
INT8_C( 38), INT8_C( -30), INT8_C( 8), INT8_C( 4),
INT8_C( 65), INT8_C( 24), INT8_C( -4), INT8_C( 0),
INT8_C( 10), INT8_C( -1), INT8_C( 0), INT8_C( 0),
INT8_C(-127), INT8_C( 1), INT8_C( -74), INT8_C( 37),
INT8_C( 7), INT8_C( -13), INT8_C( -2), INT8_C( 5)) },
{ simde_mm256_set_epi8(INT8_C( 66), INT8_C( 127), INT8_C( 41), INT8_C(-124),
INT8_C( -90), INT8_C( 28), INT8_C(-118), INT8_C( 18),
INT8_C( 79), INT8_C( 17), INT8_C( 126), INT8_C( -43),
INT8_C( -78), INT8_C( 78), INT8_C( 76), INT8_C( 46),
INT8_C( 60), INT8_C(-126), INT8_C( -41), INT8_C( -77),
INT8_C( -62), INT8_C(-116), INT8_C(-115), INT8_C( 55),
INT8_C( 19), INT8_C( 104), INT8_C(-104), INT8_C( -29),
INT8_C( 54), INT8_C(-118), INT8_C( -40), INT8_C( -58)),
simde_mm256_set_epi8(INT8_C( 3), INT8_C( 53), INT8_C( 28), INT8_C( -96),
INT8_C( 1), INT8_C( 91), INT8_C( 7), INT8_C( 1),
INT8_C( 29), INT8_C( 30), INT8_C( 1), INT8_C( 10),
INT8_C( 1), INT8_C( 36), INT8_C( 7), INT8_C( 1),
INT8_C(-101), INT8_C( 5), INT8_C( 13), INT8_C( 5),
INT8_C( 85), INT8_C( 11), INT8_C( 34), INT8_C( 48),
INT8_C( 17), INT8_C( 42), INT8_C( 3), INT8_C( 87),
INT8_C( 1), INT8_C( 2), INT8_C( 74), INT8_C( 8)),
simde_mm256_set_epi8(INT8_C( 22), INT8_C( 2), INT8_C( 1), INT8_C( 1),
INT8_C( -90), INT8_C( 0), INT8_C( -16), INT8_C( 18),
INT8_C( 2), INT8_C( 0), INT8_C( 126), INT8_C( -4),
INT8_C( -78), INT8_C( 2), INT8_C( 10), INT8_C( 46),
INT8_C( 0), INT8_C( -25), INT8_C( -3), INT8_C( -15),
INT8_C( 0), INT8_C( -10), INT8_C( -3), INT8_C( 1),
INT8_C( 1), INT8_C( 2), INT8_C( -34), INT8_C( 0),
INT8_C( 54), INT8_C( -59), INT8_C( 0), INT8_C( -7)) },
{ simde_mm256_set_epi8(INT8_C( 79), INT8_C( -60), INT8_C( 106), INT8_C( -93),
INT8_C(-111), INT8_C( 118), INT8_C( -87), INT8_C( -78),
INT8_C( -28), INT8_C( 107), INT8_C( -12), INT8_C( -54),
INT8_C( 101), INT8_C( -62), INT8_C( 4), INT8_C( -51),
INT8_C( -90), INT8_C(-114), INT8_C( 14), INT8_C( 124),
INT8_C( -67), INT8_C( 47), INT8_C( 41), INT8_C( 37),
INT8_C( 126), INT8_C( -20), INT8_C( 119), INT8_C( 105),
INT8_C( -17), INT8_C( 95), INT8_C( -41), INT8_C( 19)),
simde_mm256_set_epi8(INT8_C( -34), INT8_C( 4), INT8_C( 32), INT8_C( 1),
INT8_C( 4), INT8_C( 10), INT8_C( 7), INT8_C( 5),
INT8_C( 120), INT8_C( 1), INT8_C( 1), INT8_C( 1),
INT8_C( 26), INT8_C( 6), INT8_C( 44), INT8_C( 2),
INT8_C( 55), INT8_C( 14), INT8_C( 4), INT8_C( 41),
INT8_C( 41), INT8_C( 6), INT8_C( 10), INT8_C( 7),
INT8_C( 7), INT8_C( 21), INT8_C( 126), INT8_C( 59),
INT8_C( 13), INT8_C( 8), INT8_C( 2), INT8_C( 6)),
simde_mm256_set_epi8(INT8_C( -2), INT8_C( -15), INT8_C( 3), INT8_C( -93),
INT8_C( -27), INT8_C( 11), INT8_C( -12), INT8_C( -15),
INT8_C( 0), INT8_C( 107), INT8_C( -12), INT8_C( -54),
INT8_C( 3), INT8_C( -10), INT8_C( 0), INT8_C( -25),
INT8_C( -1), INT8_C( -8), INT8_C( 3), INT8_C( 3),
INT8_C( -1), INT8_C( 7), INT8_C( 4), INT8_C( 5),
INT8_C( 18), INT8_C( 0), INT8_C( 0), INT8_C( 1),
INT8_C( -1), INT8_C( 11), INT8_C( -20), INT8_C( 3)) },
{ simde_mm256_set_epi8(INT8_C( -48), INT8_C( -29), INT8_C( 23), INT8_C( 39),
INT8_C( 106), INT8_C( -37), INT8_C( 1), INT8_C( 62),
INT8_C( -21), INT8_C( -4), INT8_C( -92), INT8_C( -12),
INT8_C( 78), INT8_C( -93), INT8_C( 36), INT8_C( -10),
INT8_C( -84), INT8_C( 102), INT8_C( 9), INT8_C( 70),
INT8_C( -16), INT8_C( -90), INT8_C( 82), INT8_C(-124),
INT8_C( -78), INT8_C( 58), INT8_C( 35), INT8_C( 108),
INT8_C(-105), INT8_C( -72), INT8_C( -16), INT8_C(-103)),
simde_mm256_set_epi8(INT8_C( 2), INT8_C( 4), INT8_C( 28), INT8_C( 120),
INT8_C( 1), INT8_C( 5), INT8_C( 2), INT8_C( 61),
INT8_C( 1), INT8_C( 33), INT8_C( 110), INT8_C( 1),
INT8_C( 102), INT8_C( 3), INT8_C( 3), INT8_C( 1),
INT8_C( 1), INT8_C( 26), INT8_C( 11), INT8_C( 7),
INT8_C( 75), INT8_C( 3), INT8_C( 5), INT8_C( 19),
INT8_C( 3), INT8_C( -26), INT8_C( 56), INT8_C( 5),
INT8_C( 7), INT8_C( 6), INT8_C( 2), INT8_C( 5)),
simde_mm256_set_epi8(INT8_C( -24), INT8_C( -7), INT8_C( 0), INT8_C( 0),
INT8_C( 106), INT8_C( -7), INT8_C( 0), INT8_C( 1),
INT8_C( -21), INT8_C( 0), INT8_C( 0), INT8_C( -12),
INT8_C( 0), INT8_C( -31), INT8_C( 12), INT8_C( -10),
INT8_C( -84), INT8_C( 3), INT8_C( 0), INT8_C( 10),
INT8_C( 0), INT8_C( -30), INT8_C( 16), INT8_C( -6),
INT8_C( -26), INT8_C( -2), INT8_C( 0), INT8_C( 21),
INT8_C( -15), INT8_C( -12), INT8_C( -8), INT8_C( -20)) },
{ simde_mm256_set_epi8(INT8_C( 110), INT8_C( 56), INT8_C(-120), INT8_C( -32),
INT8_C( -22), INT8_C( 97), INT8_C( -56), INT8_C( 55),
INT8_C( -90), INT8_C( 33), INT8_C( 92), INT8_C( 89),
INT8_C(-107), INT8_C( 55), INT8_C( -50), INT8_C( -88),
INT8_C( 35), INT8_C( 21), INT8_C( 54), INT8_C( 26),
INT8_C(-122), INT8_C( 103), INT8_C( 76), INT8_C( 38),
INT8_C(-110), INT8_C( 11), INT8_C( 26), INT8_C( -11),
INT8_C( 0), INT8_C( 3), INT8_C( 30), INT8_C( 59)),
simde_mm256_set_epi8(INT8_C( -31), INT8_C( -83), INT8_C( 101), INT8_C( 17),
INT8_C( 8), INT8_C( 15), INT8_C( 2), INT8_C( 7),
INT8_C( 37), INT8_C( 84), INT8_C( -52), INT8_C( 25),
INT8_C( 42), INT8_C( -27), INT8_C( 1), INT8_C( 10),
INT8_C( 7), INT8_C( 37), INT8_C( 54), INT8_C( 31),
INT8_C( 54), INT8_C( 62), INT8_C( 11), INT8_C( 54),
INT8_C( 43), INT8_C( 1), INT8_C( 4), INT8_C( 5),
INT8_C( 93), INT8_C( 124), INT8_C( 2), INT8_C( 3)),
simde_mm256_set_epi8(INT8_C( -3), INT8_C( 0), INT8_C( -1), INT8_C( -1),
INT8_C( -2), INT8_C( 6), INT8_C( -28), INT8_C( 7),
INT8_C( -2), INT8_C( 0), INT8_C( -1), INT8_C( 3),
INT8_C( -2), INT8_C( -2), INT8_C( -50), INT8_C( -8),
INT8_C( 5), INT8_C( 0), INT8_C( 1), INT8_C( 0),
INT8_C( -2), INT8_C( 1), INT8_C( 6), INT8_C( 0),
INT8_C( -2), INT8_C( 11), INT8_C( 6), INT8_C( -2),
INT8_C( 0), INT8_C( 0), INT8_C( 15), INT8_C( 19)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_div_epi8(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_i8(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_div_epi16(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_mm256_set_epi16(INT16_C(-29867), INT16_C( 9314), INT16_C( 7980), INT16_C( 8102),
INT16_C(-24663), INT16_C( 4367), INT16_C(-15443), INT16_C( -5657),
INT16_C(-20080), INT16_C(-10092), INT16_C(-31734), INT16_C( 6262),
INT16_C( 3510), INT16_C(-31811), INT16_C( -4053), INT16_C( -6124)),
simde_mm256_set_epi16(INT16_C( 1), INT16_C( 1438), INT16_C( -9), INT16_C( 435),
INT16_C( -11), INT16_C( 2), INT16_C( -496), INT16_C( 10321),
INT16_C( -1000), INT16_C( -27), INT16_C( -4), INT16_C( 453),
INT16_C( -2), INT16_C( 19741), INT16_C( -615), INT16_C( -3265)),
simde_mm256_set_epi16(INT16_C(-29867), INT16_C( 6), INT16_C( -886), INT16_C( 18),
INT16_C( 2242), INT16_C( 2183), INT16_C( 31), INT16_C( 0),
INT16_C( 20), INT16_C( 373), INT16_C( 7933), INT16_C( 13),
INT16_C( -1755), INT16_C( -1), INT16_C( 6), INT16_C( 1)) },
{ simde_mm256_set_epi16(INT16_C( -6800), INT16_C( 13259), INT16_C( -2233), INT16_C( 1354),
INT16_C( -8106), INT16_C(-17039), INT16_C( 9504), INT16_C( 22255),
INT16_C( 12402), INT16_C( -2677), INT16_C( 4463), INT16_C( 28303),
INT16_C(-12322), INT16_C(-19201), INT16_C( 30668), INT16_C( 15284)),
simde_mm256_set_epi16(INT16_C( 16270), INT16_C(-26534), INT16_C( -13), INT16_C( -20),
INT16_C( -12), INT16_C( -182), INT16_C( -13), INT16_C( -2),
INT16_C( 399), INT16_C( -245), INT16_C( -1), INT16_C( -1),
INT16_C( -3), INT16_C( 59), INT16_C( 11), INT16_C( -9799)),
simde_mm256_set_epi16(INT16_C( 0), INT16_C( 0), INT16_C( 171), INT16_C( -67),
INT16_C( 675), INT16_C( 93), INT16_C( -731), INT16_C(-11127),
INT16_C( 31), INT16_C( 10), INT16_C( -4463), INT16_C(-28303),
INT16_C( 4107), INT16_C( -325), INT16_C( 2788), INT16_C( -1)) },
{ simde_mm256_set_epi16(INT16_C( 23535), INT16_C( 10930), INT16_C( 30193), INT16_C( -8194),
INT16_C( -8688), INT16_C( 2183), INT16_C(-14596), INT16_C(-28144),
INT16_C(-10670), INT16_C( 1107), INT16_C( 31427), INT16_C( -7322),
INT16_C( 17038), INT16_C(-32679), INT16_C( 23368), INT16_C(-24524)),
simde_mm256_set_epi16(INT16_C( 19), INT16_C( -388), INT16_C( -1), INT16_C( -2261),
INT16_C( -7651), INT16_C( 1639), INT16_C( -50), INT16_C( -2059),
INT16_C( -25), INT16_C( -57), INT16_C( -952), INT16_C( 17),
INT16_C( -4528), INT16_C( -764), INT16_C( -925), INT16_C( -20)),
simde_mm256_set_epi16(INT16_C( 1238), INT16_C( -28), INT16_C(-30193), INT16_C( 3),
INT16_C( 1), INT16_C( 1), INT16_C( 291), INT16_C( 13),
INT16_C( 426), INT16_C( -19), INT16_C( -33), INT16_C( -430),
INT16_C( -3), INT16_C( 42), INT16_C( -25), INT16_C( 1226)) },
{ simde_mm256_set_epi16(INT16_C( 22767), INT16_C( 28543), INT16_C(-30401), INT16_C( 25623),
INT16_C( 2206), INT16_C(-16640), INT16_C(-13607), INT16_C(-30899),
INT16_C( -2384), INT16_C( -1714), INT16_C( 12691), INT16_C( 9427),
INT16_C( 11864), INT16_C( 29526), INT16_C( 8259), INT16_C( 6808)),
simde_mm256_set_epi16(INT16_C( 15244), INT16_C( 1), INT16_C( -1), INT16_C( -3),
INT16_C( -18), INT16_C( -10), INT16_C(-15299), INT16_C( -824),
INT16_C( 2005), INT16_C( 471), INT16_C( 2069), INT16_C( 204),
INT16_C( 25), INT16_C( -13), INT16_C( -3), INT16_C( 11)),
simde_mm256_set_epi16(INT16_C( 1), INT16_C( 28543), INT16_C( 30401), INT16_C( -8541),
INT16_C( -122), INT16_C( 1664), INT16_C( 0), INT16_C( 37),
INT16_C( -1), INT16_C( -3), INT16_C( 6), INT16_C( 46),
INT16_C( 474), INT16_C( -2271), INT16_C( -2753), INT16_C( 618)) },
{ simde_mm256_set_epi16(INT16_C(-16585), INT16_C(-25277), INT16_C( -4139), INT16_C(-27065),
INT16_C(-28777), INT16_C( -9487), INT16_C(-18713), INT16_C(-30387),
INT16_C(-14811), INT16_C( 24102), INT16_C(-10162), INT16_C( 7921),
INT16_C( 29417), INT16_C( 15464), INT16_C( 24785), INT16_C( -1285)),
simde_mm256_set_epi16(INT16_C( -121), INT16_C( 328), INT16_C( 10), INT16_C( -385),
INT16_C( -1), INT16_C( 4), INT16_C( 388), INT16_C( -1),
INT16_C( 1), INT16_C( 4863), INT16_C( -499), INT16_C( 3),
INT16_C( -226), INT16_C(-15244), INT16_C( 5), INT16_C( -5)),
simde_mm256_set_epi16(INT16_C( 137), INT16_C( -77), INT16_C( -413), INT16_C( 70),
INT16_C( 28777), INT16_C( -2371), INT16_C( -48), INT16_C( 30387),
INT16_C(-14811), INT16_C( 4), INT16_C( 20), INT16_C( 2640),
INT16_C( -130), INT16_C( -1), INT16_C( 4957), INT16_C( 257)) },
{ simde_mm256_set_epi16(INT16_C( -8831), INT16_C(-12421), INT16_C( 28092), INT16_C(-15215),
INT16_C( 5495), INT16_C( 15560), INT16_C( 8747), INT16_C( 22186),
INT16_C(-22634), INT16_C(-23262), INT16_C( 360), INT16_C(-18340),
INT16_C(-15939), INT16_C(-18429), INT16_C(-10641), INT16_C(-25953)),
simde_mm256_set_epi16(INT16_C( 6646), INT16_C( -440), INT16_C( 5), INT16_C( 9),
INT16_C( 5230), INT16_C( 14027), INT16_C( -115), INT16_C( -1),
INT16_C( -118), INT16_C( -466), INT16_C( -288), INT16_C( -9),
INT16_C( 114), INT16_C( -2656), INT16_C( -2539), INT16_C( 1803)),
simde_mm256_set_epi16(INT16_C( -1), INT16_C( 28), INT16_C( 5618), INT16_C( -1690),
INT16_C( 1), INT16_C( 1), INT16_C( -76), INT16_C(-22186),
INT16_C( 191), INT16_C( 49), INT16_C( -1), INT16_C( 2037),
INT16_C( -139), INT16_C( 6), INT16_C( 4), INT16_C( -14)) },
{ simde_mm256_set_epi16(INT16_C( 2118), INT16_C( 26269), INT16_C( 31059), INT16_C( 17912),
INT16_C(-28141), INT16_C( 5202), INT16_C( 30957), INT16_C(-32121),
INT16_C( -2609), INT16_C(-12316), INT16_C(-10959), INT16_C( 17018),
INT16_C( 4376), INT16_C( 1963), INT16_C( 14912), INT16_C( 8031)),
simde_mm256_set_epi16(INT16_C( -2197), INT16_C( 11), INT16_C( -18), INT16_C( -3745),
INT16_C( -1), INT16_C( -3), INT16_C( 4), INT16_C( 3362),
INT16_C( -1965), INT16_C( 2), INT16_C( 574), INT16_C( 1347),
INT16_C( -888), INT16_C( -15), INT16_C( 1260), INT16_C( -640)),
simde_mm256_set_epi16(INT16_C( 0), INT16_C( 2388), INT16_C( -1725), INT16_C( -4),
INT16_C( 28141), INT16_C( -1734), INT16_C( 7739), INT16_C( -9),
INT16_C( 1), INT16_C( -6158), INT16_C( -19), INT16_C( 12),
INT16_C( -4), INT16_C( -130), INT16_C( 11), INT16_C( -12)) },
{ simde_mm256_set_epi16(INT16_C(-28159), INT16_C( 7162), INT16_C(-24830), INT16_C( 4589),
INT16_C( 7038), INT16_C( 3178), INT16_C( 4246), INT16_C( -8357),
INT16_C( -4695), INT16_C( -9928), INT16_C( -5517), INT16_C(-27023),
INT16_C( 18843), INT16_C( 726), INT16_C( 30135), INT16_C( -4871)),
simde_mm256_set_epi16(INT16_C( -48), INT16_C( 767), INT16_C( 10), INT16_C( 14),
INT16_C( -2039), INT16_C( -2), INT16_C( -53), INT16_C( -1),
INT16_C( -1865), INT16_C( -5344), INT16_C( 63), INT16_C( -505),
INT16_C( 2993), INT16_C(-14674), INT16_C( 3), INT16_C( -2)),
simde_mm256_set_epi16(INT16_C( 586), INT16_C( 9), INT16_C( -2483), INT16_C( 327),
INT16_C( -3), INT16_C( -1589), INT16_C( -80), INT16_C( 8357),
INT16_C( 2), INT16_C( 1), INT16_C( -87), INT16_C( 53),
INT16_C( 6), INT16_C( 0), INT16_C( 10045), INT16_C( 2435)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_div_epi16(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_i16(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_div_epi32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_mm256_set_epi32(INT32_C( 1220357195), INT32_C( 1053623553), INT32_C( 1487300768), INT32_C(-1113593972),
INT32_C( -270466921), INT32_C( 1339961381), INT32_C( 586340423), INT32_C( 1641199948)),
simde_mm256_set_epi32(INT32_C( 119685834), INT32_C( 18), INT32_C( 13175516), INT32_C( 2634495),
INT32_C( 17), INT32_C( 43789), INT32_C( -89), INT32_C( 14)),
simde_mm256_set_epi32(INT32_C( 10), INT32_C( 58534641), INT32_C( 112), INT32_C( -422),
INT32_C( -15909818), INT32_C( 30600), INT32_C( -6588094), INT32_C( 117228567)) },
{ simde_mm256_set_epi32(INT32_C( 1446174898), INT32_C( 1812297946), INT32_C(-2020316623), INT32_C( 843765864),
INT32_C(-1892632155), INT32_C( -473868741), INT32_C( -150363910), INT32_C(-1673359813)),
simde_mm256_set_epi32(INT32_C( 2569135), INT32_C( 8168), INT32_C( -4111977), INT32_C( -322),
INT32_C( -34091386), INT32_C( 6306), INT32_C( 363174), INT32_C( -37460)),
simde_mm256_set_epi32(INT32_C( 562), INT32_C( 221877), INT32_C( 491), INT32_C( -2620390),
INT32_C( 55), INT32_C( -75145), INT32_C( -414), INT32_C( 44670)) },
{ simde_mm256_set_epi32(INT32_C( 1015973964), INT32_C( -637033789), INT32_C(-1269659180), INT32_C(-1847076164),
INT32_C( 841308417), INT32_C(-1365136816), INT32_C( -621262370), INT32_C( -734285761)),
simde_mm256_set_epi32(INT32_C( -1597720), INT32_C( 192391), INT32_C( 2145556), INT32_C( -4054),
INT32_C( -1), INT32_C( 63753), INT32_C( 24015328), INT32_C( 267)),
simde_mm256_set_epi32(INT32_C( -635), INT32_C( -3311), INT32_C( -591), INT32_C( 455618),
INT32_C( -841308417), INT32_C( -21412), INT32_C( -25), INT32_C( -2750133)) },
{ simde_mm256_set_epi32(INT32_C( 55709148), INT32_C( 1036348942), INT32_C( 1622954205), INT32_C( 1464937075),
INT32_C( 309602207), INT32_C( 765487752), INT32_C(-1883826060), INT32_C( 396580110)),
simde_mm256_set_epi32(INT32_C( 81348), INT32_C( 130432), INT32_C( -2896201), INT32_C( 130033),
INT32_C( 2659), INT32_C( 12656), INT32_C( -49), INT32_C( -3976)),
simde_mm256_set_epi32(INT32_C( 684), INT32_C( 7945), INT32_C( -560), INT32_C( 11265),
INT32_C( 116435), INT32_C( 60484), INT32_C( 38445429), INT32_C( -99743)) },
{ simde_mm256_set_epi32(INT32_C( -679308904), INT32_C( 1402916027), INT32_C( -568259373), INT32_C( -151984025),
INT32_C(-1276596492), INT32_C( 897258790), INT32_C( 1125465930), INT32_C(-1843912592)),
simde_mm256_set_epi32(INT32_C( -32), INT32_C( -3810), INT32_C( -77), INT32_C( -56604),
INT32_C( 2670), INT32_C( -7949), INT32_C( 3200), INT32_C( 22045)),
simde_mm256_set_epi32(INT32_C( 21228403), INT32_C( -368219), INT32_C( 7379991), INT32_C( 2685),
INT32_C( -478126), INT32_C( -112876), INT32_C( 351708), INT32_C( -83643)) },
{ simde_mm256_set_epi32(INT32_C(-2128829075), INT32_C( -944286219), INT32_C(-1801390937), INT32_C( 1597729863),
INT32_C( -919883082), INT32_C( 243529930), INT32_C(-1346833089), INT32_C( -703593878)),
simde_mm256_set_epi32(INT32_C( -702474), INT32_C( -505), INT32_C( -33538370), INT32_C( 98),
INT32_C( -989384), INT32_C( -3405840), INT32_C( 1441037), INT32_C( 13)),
simde_mm256_set_epi32(INT32_C( 3030), INT32_C( 1869873), INT32_C( 53), INT32_C( 16303365),
INT32_C( 929), INT32_C( -71), INT32_C( -934), INT32_C( -54122606)) },
{ simde_mm256_set_epi32(INT32_C( 2104898600), INT32_C( 1858378377), INT32_C( 427610695), INT32_C( 1702051599),
INT32_C( 1832473397), INT32_C( 333005662), INT32_C( 2145787203), INT32_C(-1223503753)),
simde_mm256_set_epi32(INT32_C( -558822192), INT32_C( -1119473), INT32_C( 71), INT32_C( -1),
INT32_C( 83208), INT32_C( -24), INT32_C( 490), INT32_C( 1423105)),
simde_mm256_set_epi32(INT32_C( -3), INT32_C( -1660), INT32_C( 6022685), INT32_C(-1702051599),
INT32_C( 22022), INT32_C( -13875235), INT32_C( 4379157), INT32_C( -859)) },
{ simde_mm256_set_epi32(INT32_C( 1485879823), INT32_C( -139198096), INT32_C( 325243915), INT32_C( 1406493107),
INT32_C( 631640676), INT32_C( -221831503), INT32_C(-1100348538), INT32_C(-1615759789)),
simde_mm256_set_epi32(INT32_C( -5), INT32_C( 6019751), INT32_C( 240957918), INT32_C( -11512),
INT32_C( 598), INT32_C( -2086), INT32_C( -398), INT32_C( 57524929)),
simde_mm256_set_epi32(INT32_C( -297175964), INT32_C( -23), INT32_C( 1), INT32_C( -122176),
INT32_C( 1056255), INT32_C( 106343), INT32_C( 2764694), INT32_C( -28)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_div_epi32(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_i32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_div_epi64(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_mm256_set_epi64x(INT64_C(-3334573923423752375), INT64_C( 5523377417165557950),
INT64_C( 8907494989684855351), INT64_C(-7237415305059575746)),
simde_mm256_set_epi64x(INT64_C( -9171626596647), INT64_C( -528646059918),
INT64_C( -547414), INT64_C( -408)),
simde_mm256_set_epi64x(INT64_C( 363574), INT64_C( -10448157),
INT64_C( -16271953201205), INT64_C( 17738763002596999)) },
{ simde_mm256_set_epi64x(INT64_C( 1061533355853207499), INT64_C(-6945701440990101118),
INT64_C( 2574461366811200995), INT64_C( 5644549884645175906)),
simde_mm256_set_epi64x(INT64_C( -7767261), INT64_C( 10),
INT64_C( 703320391), INT64_C( 12482)),
simde_mm256_set_epi64x(INT64_C( -136667656185), INT64_C( -694570144099010111),
INT64_C( 3660438968), INT64_C( 452215180631723)) },
{ simde_mm256_set_epi64x(INT64_C( 6574854431853233270), INT64_C(-4435882974713226150),
INT64_C(-7281891715377237835), INT64_C( 5757222003030846963)),
simde_mm256_set_epi64x(INT64_C( -6789037658203169), INT64_C( -17570),
INT64_C( 13607885161437703), INT64_C( -3435095)),
simde_mm256_set_epi64x(INT64_C( -968), INT64_C( 252469150524372),
INT64_C( -535), INT64_C( -1676000810175)) },
{ simde_mm256_set_epi64x(INT64_C( 8744553519166698091), INT64_C( 1287292031192317940),
INT64_C( 3174243940922689145), INT64_C( 1491394686146555130)),
simde_mm256_set_epi64x(INT64_C( 4922490686897444762), INT64_C( 39224412374),
INT64_C( 408105256075342), INT64_C( -123591096713)),
simde_mm256_set_epi64x(INT64_C( 1), INT64_C( 32818644),
INT64_C( 7778), INT64_C( -12067169)) },
{ simde_mm256_set_epi64x(INT64_C( 7799483112595335323), INT64_C(-7884857912053188380),
INT64_C( 7107489308993436793), INT64_C( 8695475100908985079)),
simde_mm256_set_epi64x(INT64_C( 87), INT64_C( 9826793),
INT64_C( -161255109), INT64_C( -1858599442623445)),
simde_mm256_set_epi64x(INT64_C( 89649231179256727), INT64_C( -802383637474),
INT64_C( -44076056585), INT64_C( -4678)) },
{ simde_mm256_set_epi64x(INT64_C(-7825910496387937639), INT64_C( -900763466419687908),
INT64_C(-4456690812175475739), INT64_C(-5053240277275181299)),
simde_mm256_set_epi64x(INT64_C( -6606649764768), INT64_C( -57398),
INT64_C( -568604113828926107), INT64_C( 4737239)),
simde_mm256_set_epi64x(INT64_C( 1184550), INT64_C( 15693290121950),
INT64_C( 7), INT64_C( -1066705791553)) },
{ simde_mm256_set_epi64x(INT64_C(-3221953081539923764), INT64_C(-1956032791701614517),
INT64_C( 7374977017813000944), INT64_C( 1124803906659433418)),
simde_mm256_set_epi64x(INT64_C( -339969907608416876), INT64_C( -15370),
INT64_C( -1321351535), INT64_C( -7)),
simde_mm256_set_epi64x(INT64_C( 9), INT64_C( 127263031340378),
INT64_C( -5581389072), INT64_C( -160686272379919059)) },
{ simde_mm256_set_epi64x(INT64_C( 2535418176622027197), INT64_C(-1425521063377864898),
INT64_C( 5027060343823160394), INT64_C(-2416798548878703366)),
simde_mm256_set_epi64x(INT64_C( -250), INT64_C( 51),
INT64_C( 3355), INT64_C( 22043462023905)),
simde_mm256_set_epi64x(INT64_C( -10141672706488108), INT64_C( -27951393399565978),
INT64_C( 1498378641974116), INT64_C( -109637)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_div_epi64(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_i64(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_div_epu8(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_x_mm256_set_epu8(UINT8_C(236), UINT8_C(194), UINT8_C(120), UINT8_C( 0),
UINT8_C(238), UINT8_C(197), UINT8_C(223), UINT8_C( 50),
UINT8_C(177), UINT8_C( 51), UINT8_C( 14), UINT8_C(208),
UINT8_C(118), UINT8_C(136), UINT8_C(234), UINT8_C(162),
UINT8_C( 34), UINT8_C(152), UINT8_C( 32), UINT8_C( 62),
UINT8_C( 35), UINT8_C(101), UINT8_C( 72), UINT8_C( 2),
UINT8_C(139), UINT8_C(150), UINT8_C(255), UINT8_C( 2),
UINT8_C( 37), UINT8_C(232), UINT8_C( 3), UINT8_C(210)),
simde_x_mm256_set_epu8(UINT8_C(218), UINT8_C( 43), UINT8_C( 2), UINT8_C( 2),
UINT8_C( 29), UINT8_C( 90), UINT8_C( 30), UINT8_C( 31),
UINT8_C( 20), UINT8_C( 1), UINT8_C( 24), UINT8_C( 92),
UINT8_C( 3), UINT8_C( 1), UINT8_C( 33), UINT8_C( 6),
UINT8_C( 14), UINT8_C( 38), UINT8_C( 5), UINT8_C( 4),
UINT8_C( 13), UINT8_C( 2), UINT8_C( 11), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 25), UINT8_C(242), UINT8_C( 3),
UINT8_C( 12), UINT8_C( 59), UINT8_C( 75), UINT8_C(192)),
simde_x_mm256_set_epu8(UINT8_C( 1), UINT8_C( 4), UINT8_C( 60), UINT8_C( 0),
UINT8_C( 8), UINT8_C( 2), UINT8_C( 7), UINT8_C( 1),
UINT8_C( 8), UINT8_C( 51), UINT8_C( 0), UINT8_C( 2),
UINT8_C( 39), UINT8_C(136), UINT8_C( 7), UINT8_C( 27),
UINT8_C( 2), UINT8_C( 4), UINT8_C( 6), UINT8_C( 15),
UINT8_C( 2), UINT8_C( 50), UINT8_C( 6), UINT8_C( 2),
UINT8_C(139), UINT8_C( 6), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 3), UINT8_C( 3), UINT8_C( 0), UINT8_C( 1)) },
{ simde_x_mm256_set_epu8(UINT8_C(223), UINT8_C(136), UINT8_C(181), UINT8_C(189),
UINT8_C(144), UINT8_C(162), UINT8_C( 60), UINT8_C(122),
UINT8_C(180), UINT8_C(157), UINT8_C(255), UINT8_C( 4),
UINT8_C(248), UINT8_C( 71), UINT8_C( 45), UINT8_C(231),
UINT8_C(108), UINT8_C(100), UINT8_C( 13), UINT8_C(181),
UINT8_C(158), UINT8_C(251), UINT8_C(141), UINT8_C( 49),
UINT8_C(175), UINT8_C( 90), UINT8_C(251), UINT8_C( 13),
UINT8_C(151), UINT8_C(233), UINT8_C(181), UINT8_C(181)),
simde_x_mm256_set_epu8(UINT8_C( 2), UINT8_C( 7), UINT8_C( 2), UINT8_C( 7),
UINT8_C( 6), UINT8_C( 23), UINT8_C( 1), UINT8_C( 22),
UINT8_C( 9), UINT8_C( 21), UINT8_C( 6), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 27), UINT8_C( 1), UINT8_C(254),
UINT8_C( 30), UINT8_C( 92), UINT8_C( 8), UINT8_C( 13),
UINT8_C( 7), UINT8_C( 4), UINT8_C( 29), UINT8_C( 24),
UINT8_C( 1), UINT8_C( 15), UINT8_C( 31), UINT8_C( 1),
UINT8_C(190), UINT8_C( 1), UINT8_C( 20), UINT8_C( 8)),
simde_x_mm256_set_epu8(UINT8_C(111), UINT8_C( 19), UINT8_C( 90), UINT8_C( 27),
UINT8_C( 24), UINT8_C( 7), UINT8_C( 60), UINT8_C( 5),
UINT8_C( 20), UINT8_C( 7), UINT8_C( 42), UINT8_C( 4),
UINT8_C(248), UINT8_C( 2), UINT8_C( 45), UINT8_C( 0),
UINT8_C( 3), UINT8_C( 1), UINT8_C( 1), UINT8_C( 13),
UINT8_C( 22), UINT8_C( 62), UINT8_C( 4), UINT8_C( 2),
UINT8_C(175), UINT8_C( 6), UINT8_C( 8), UINT8_C( 13),
UINT8_C( 0), UINT8_C(233), UINT8_C( 9), UINT8_C( 22)) },
{ simde_x_mm256_set_epu8(UINT8_C(162), UINT8_C( 7), UINT8_C(145), UINT8_C(154),
UINT8_C(168), UINT8_C(175), UINT8_C( 61), UINT8_C( 3),
UINT8_C( 93), UINT8_C( 6), UINT8_C(114), UINT8_C( 59),
UINT8_C( 17), UINT8_C(165), UINT8_C(240), UINT8_C(189),
UINT8_C(201), UINT8_C( 90), UINT8_C( 72), UINT8_C( 56),
UINT8_C( 98), UINT8_C(155), UINT8_C( 93), UINT8_C(190),
UINT8_C( 59), UINT8_C(174), UINT8_C(136), UINT8_C( 6),
UINT8_C(153), UINT8_C(172), UINT8_C(102), UINT8_C(120)),
simde_x_mm256_set_epu8(UINT8_C(110), UINT8_C( 41), UINT8_C( 3), UINT8_C( 12),
UINT8_C(210), UINT8_C( 1), UINT8_C( 5), UINT8_C( 6),
UINT8_C( 47), UINT8_C( 58), UINT8_C( 48), UINT8_C( 20),
UINT8_C(109), UINT8_C( 3), UINT8_C( 34), UINT8_C( 3),
UINT8_C( 8), UINT8_C( 5), UINT8_C( 3), UINT8_C( 1),
UINT8_C( 20), UINT8_C( 14), UINT8_C( 1), UINT8_C( 6),
UINT8_C( 15), UINT8_C( 3), UINT8_C( 95), UINT8_C( 1),
UINT8_C( 4), UINT8_C( 1), UINT8_C( 7), UINT8_C( 1)),
simde_x_mm256_set_epu8(UINT8_C( 1), UINT8_C( 0), UINT8_C( 48), UINT8_C( 12),
UINT8_C( 0), UINT8_C(175), UINT8_C( 12), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 2), UINT8_C( 2),
UINT8_C( 0), UINT8_C( 55), UINT8_C( 7), UINT8_C( 63),
UINT8_C( 25), UINT8_C( 18), UINT8_C( 24), UINT8_C( 56),
UINT8_C( 4), UINT8_C( 11), UINT8_C( 93), UINT8_C( 31),
UINT8_C( 3), UINT8_C( 58), UINT8_C( 1), UINT8_C( 6),
UINT8_C( 38), UINT8_C(172), UINT8_C( 14), UINT8_C(120)) },
{ simde_x_mm256_set_epu8(UINT8_C( 3), UINT8_C( 62), UINT8_C(201), UINT8_C( 91),
UINT8_C( 81), UINT8_C(108), UINT8_C(219), UINT8_C(124),
UINT8_C(107), UINT8_C(229), UINT8_C(194), UINT8_C( 6),
UINT8_C(247), UINT8_C(122), UINT8_C( 69), UINT8_C(216),
UINT8_C(192), UINT8_C(132), UINT8_C( 14), UINT8_C(210),
UINT8_C(242), UINT8_C(228), UINT8_C( 76), UINT8_C(247),
UINT8_C(164), UINT8_C(249), UINT8_C(124), UINT8_C(200),
UINT8_C(141), UINT8_C(206), UINT8_C(142), UINT8_C(235)),
simde_x_mm256_set_epu8(UINT8_C(182), UINT8_C( 3), UINT8_C( 13), UINT8_C( 91),
UINT8_C( 12), UINT8_C( 10), UINT8_C( 1), UINT8_C( 3),
UINT8_C( 4), UINT8_C( 8), UINT8_C( 93), UINT8_C( 1),
UINT8_C( 2), UINT8_C( 38), UINT8_C( 3), UINT8_C(172),
UINT8_C( 38), UINT8_C( 15), UINT8_C( 55), UINT8_C( 26),
UINT8_C( 4), UINT8_C( 16), UINT8_C( 28), UINT8_C( 54),
UINT8_C( 21), UINT8_C( 30), UINT8_C( 3), UINT8_C( 39),
UINT8_C( 14), UINT8_C(171), UINT8_C( 2), UINT8_C( 4)),
simde_x_mm256_set_epu8(UINT8_C( 0), UINT8_C( 20), UINT8_C( 15), UINT8_C( 1),
UINT8_C( 6), UINT8_C( 10), UINT8_C(219), UINT8_C( 41),
UINT8_C( 26), UINT8_C( 28), UINT8_C( 2), UINT8_C( 6),
UINT8_C(123), UINT8_C( 3), UINT8_C( 23), UINT8_C( 1),
UINT8_C( 5), UINT8_C( 8), UINT8_C( 0), UINT8_C( 8),
UINT8_C( 60), UINT8_C( 14), UINT8_C( 2), UINT8_C( 4),
UINT8_C( 7), UINT8_C( 8), UINT8_C( 41), UINT8_C( 5),
UINT8_C( 10), UINT8_C( 1), UINT8_C( 71), UINT8_C( 58)) },
{ simde_x_mm256_set_epu8(UINT8_C(168), UINT8_C( 0), UINT8_C(141), UINT8_C(215),
UINT8_C( 23), UINT8_C(105), UINT8_C(153), UINT8_C(228),
UINT8_C(144), UINT8_C(204), UINT8_C(214), UINT8_C(202),
UINT8_C(227), UINT8_C(255), UINT8_C( 22), UINT8_C(115),
UINT8_C(131), UINT8_C(142), UINT8_C( 73), UINT8_C(133),
UINT8_C( 47), UINT8_C(243), UINT8_C(254), UINT8_C(234),
UINT8_C( 91), UINT8_C(217), UINT8_C(119), UINT8_C(247),
UINT8_C(245), UINT8_C( 31), UINT8_C( 46), UINT8_C( 19)),
simde_x_mm256_set_epu8(UINT8_C( 1), UINT8_C(248), UINT8_C( 3), UINT8_C( 9),
UINT8_C( 3), UINT8_C( 87), UINT8_C(117), UINT8_C( 58),
UINT8_C( 18), UINT8_C( 9), UINT8_C( 7), UINT8_C( 77),
UINT8_C( 11), UINT8_C( 11), UINT8_C( 28), UINT8_C( 49),
UINT8_C( 64), UINT8_C( 46), UINT8_C( 5), UINT8_C( 1),
UINT8_C(115), UINT8_C( 2), UINT8_C( 1), UINT8_C( 1),
UINT8_C( 86), UINT8_C( 10), UINT8_C( 3), UINT8_C( 12),
UINT8_C( 49), UINT8_C(155), UINT8_C( 1), UINT8_C( 3)),
simde_x_mm256_set_epu8(UINT8_C(168), UINT8_C( 0), UINT8_C( 47), UINT8_C( 23),
UINT8_C( 7), UINT8_C( 1), UINT8_C( 1), UINT8_C( 3),
UINT8_C( 8), UINT8_C( 22), UINT8_C( 30), UINT8_C( 2),
UINT8_C( 20), UINT8_C( 23), UINT8_C( 0), UINT8_C( 2),
UINT8_C( 2), UINT8_C( 3), UINT8_C( 14), UINT8_C(133),
UINT8_C( 0), UINT8_C(121), UINT8_C(254), UINT8_C(234),
UINT8_C( 1), UINT8_C( 21), UINT8_C( 39), UINT8_C( 20),
UINT8_C( 5), UINT8_C( 0), UINT8_C( 46), UINT8_C( 6)) },
{ simde_x_mm256_set_epu8(UINT8_C(163), UINT8_C(117), UINT8_C( 13), UINT8_C( 71),
UINT8_C(173), UINT8_C(230), UINT8_C(206), UINT8_C( 2),
UINT8_C( 15), UINT8_C(252), UINT8_C( 14), UINT8_C(197),
UINT8_C(249), UINT8_C(198), UINT8_C( 30), UINT8_C(180),
UINT8_C(128), UINT8_C( 78), UINT8_C(184), UINT8_C(254),
UINT8_C(184), UINT8_C(231), UINT8_C(238), UINT8_C( 30),
UINT8_C(194), UINT8_C( 37), UINT8_C(226), UINT8_C( 86),
UINT8_C(140), UINT8_C( 24), UINT8_C(144), UINT8_C( 16)),
simde_x_mm256_set_epu8(UINT8_C( 48), UINT8_C( 1), UINT8_C( 7), UINT8_C( 6),
UINT8_C(119), UINT8_C( 41), UINT8_C(111), UINT8_C( 8),
UINT8_C(135), UINT8_C( 2), UINT8_C( 23), UINT8_C( 1),
UINT8_C( 88), UINT8_C( 15), UINT8_C( 65), UINT8_C( 79),
UINT8_C( 29), UINT8_C( 5), UINT8_C( 5), UINT8_C( 6),
UINT8_C( 44), UINT8_C( 21), UINT8_C( 2), UINT8_C( 3),
UINT8_C( 15), UINT8_C( 1), UINT8_C( 3), UINT8_C( 3),
UINT8_C( 1), UINT8_C( 10), UINT8_C( 1), UINT8_C( 55)),
simde_x_mm256_set_epu8(UINT8_C( 3), UINT8_C(117), UINT8_C( 1), UINT8_C( 11),
UINT8_C( 1), UINT8_C( 5), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 0), UINT8_C(126), UINT8_C( 0), UINT8_C(197),
UINT8_C( 2), UINT8_C( 13), UINT8_C( 0), UINT8_C( 2),
UINT8_C( 4), UINT8_C( 15), UINT8_C( 36), UINT8_C( 42),
UINT8_C( 4), UINT8_C( 11), UINT8_C(119), UINT8_C( 10),
UINT8_C( 12), UINT8_C( 37), UINT8_C( 75), UINT8_C( 28),
UINT8_C(140), UINT8_C( 2), UINT8_C(144), UINT8_C( 0)) },
{ simde_x_mm256_set_epu8(UINT8_C(239), UINT8_C(204), UINT8_C( 51), UINT8_C(246),
UINT8_C( 77), UINT8_C(149), UINT8_C( 40), UINT8_C( 86),
UINT8_C( 29), UINT8_C( 8), UINT8_C(140), UINT8_C(202),
UINT8_C(138), UINT8_C(208), UINT8_C(142), UINT8_C( 95),
UINT8_C(247), UINT8_C(102), UINT8_C( 63), UINT8_C(232),
UINT8_C(115), UINT8_C(187), UINT8_C(122), UINT8_C(179),
UINT8_C( 81), UINT8_C(192), UINT8_C( 47), UINT8_C( 34),
UINT8_C( 24), UINT8_C(133), UINT8_C( 98), UINT8_C(208)),
simde_x_mm256_set_epu8(UINT8_C( 11), UINT8_C( 8), UINT8_C( 2), UINT8_C( 10),
UINT8_C( 3), UINT8_C( 7), UINT8_C( 38), UINT8_C( 21),
UINT8_C(247), UINT8_C( 14), UINT8_C( 4), UINT8_C( 3),
UINT8_C( 85), UINT8_C( 59), UINT8_C( 41), UINT8_C( 1),
UINT8_C( 1), UINT8_C(250), UINT8_C( 1), UINT8_C( 2),
UINT8_C( 6), UINT8_C( 8), UINT8_C( 6), UINT8_C( 40),
UINT8_C(136), UINT8_C( 10), UINT8_C( 29), UINT8_C( 7),
UINT8_C( 36), UINT8_C( 8), UINT8_C( 1), UINT8_C( 7)),
simde_x_mm256_set_epu8(UINT8_C( 21), UINT8_C( 25), UINT8_C( 25), UINT8_C( 24),
UINT8_C( 25), UINT8_C( 21), UINT8_C( 1), UINT8_C( 4),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 35), UINT8_C( 67),
UINT8_C( 1), UINT8_C( 3), UINT8_C( 3), UINT8_C( 95),
UINT8_C(247), UINT8_C( 0), UINT8_C( 63), UINT8_C(116),
UINT8_C( 19), UINT8_C( 23), UINT8_C( 20), UINT8_C( 4),
UINT8_C( 0), UINT8_C( 19), UINT8_C( 1), UINT8_C( 4),
UINT8_C( 0), UINT8_C( 16), UINT8_C( 98), UINT8_C( 29)) },
{ simde_x_mm256_set_epu8(UINT8_C(179), UINT8_C(197), UINT8_C(124), UINT8_C(228),
UINT8_C(210), UINT8_C(205), UINT8_C(251), UINT8_C( 37),
UINT8_C( 37), UINT8_C( 57), UINT8_C( 27), UINT8_C( 38),
UINT8_C( 13), UINT8_C(212), UINT8_C(201), UINT8_C(125),
UINT8_C( 84), UINT8_C(229), UINT8_C( 76), UINT8_C(128),
UINT8_C(139), UINT8_C(203), UINT8_C(238), UINT8_C(218),
UINT8_C( 40), UINT8_C( 95), UINT8_C(243), UINT8_C(110),
UINT8_C( 74), UINT8_C( 0), UINT8_C(215), UINT8_C( 43)),
simde_x_mm256_set_epu8(UINT8_C( 2), UINT8_C( 2), UINT8_C( 4), UINT8_C( 5),
UINT8_C( 7), UINT8_C( 2), UINT8_C(195), UINT8_C( 2),
UINT8_C( 30), UINT8_C( 1), UINT8_C( 9), UINT8_C( 24),
UINT8_C( 6), UINT8_C( 7), UINT8_C( 28), UINT8_C( 58),
UINT8_C( 3), UINT8_C( 77), UINT8_C( 90), UINT8_C( 51),
UINT8_C( 13), UINT8_C( 12), UINT8_C( 7), UINT8_C( 91),
UINT8_C(243), UINT8_C( 40), UINT8_C( 1), UINT8_C( 45),
UINT8_C( 77), UINT8_C( 45), UINT8_C( 60), UINT8_C( 3)),
simde_x_mm256_set_epu8(UINT8_C( 89), UINT8_C( 98), UINT8_C( 31), UINT8_C( 45),
UINT8_C( 30), UINT8_C(102), UINT8_C( 1), UINT8_C( 18),
UINT8_C( 1), UINT8_C( 57), UINT8_C( 3), UINT8_C( 1),
UINT8_C( 2), UINT8_C( 30), UINT8_C( 7), UINT8_C( 2),
UINT8_C( 28), UINT8_C( 2), UINT8_C( 0), UINT8_C( 2),
UINT8_C( 10), UINT8_C( 16), UINT8_C( 34), UINT8_C( 2),
UINT8_C( 0), UINT8_C( 2), UINT8_C(243), UINT8_C( 2),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 3), UINT8_C( 14)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_div_epu8(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_u8(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_div_epu16(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_x_mm256_set_epu16(UINT16_C( 50042), UINT16_C( 33648), UINT16_C( 7535), UINT16_C( 12279),
UINT16_C( 36071), UINT16_C( 18107), UINT16_C( 48674), UINT16_C( 48206),
UINT16_C( 9011), UINT16_C( 45275), UINT16_C( 7845), UINT16_C( 54048),
UINT16_C( 27322), UINT16_C( 31657), UINT16_C( 43497), UINT16_C( 33598)),
simde_x_mm256_set_epu16(UINT16_C( 12011), UINT16_C( 249), UINT16_C( 5), UINT16_C( 2),
UINT16_C( 1870), UINT16_C( 2904), UINT16_C( 1530), UINT16_C( 42479),
UINT16_C( 63442), UINT16_C( 1039), UINT16_C( 54), UINT16_C( 1),
UINT16_C( 98), UINT16_C( 7948), UINT16_C( 2053), UINT16_C( 29)),
simde_x_mm256_set_epu16(UINT16_C( 4), UINT16_C( 135), UINT16_C( 1507), UINT16_C( 6139),
UINT16_C( 19), UINT16_C( 6), UINT16_C( 31), UINT16_C( 1),
UINT16_C( 0), UINT16_C( 43), UINT16_C( 145), UINT16_C( 54048),
UINT16_C( 278), UINT16_C( 3), UINT16_C( 21), UINT16_C( 1158)) },
{ simde_x_mm256_set_epu16(UINT16_C( 31411), UINT16_C( 55001), UINT16_C( 38051), UINT16_C( 20389),
UINT16_C( 61351), UINT16_C( 22045), UINT16_C( 61939), UINT16_C( 10168),
UINT16_C( 65482), UINT16_C( 32951), UINT16_C( 59114), UINT16_C( 9472),
UINT16_C( 21787), UINT16_C( 1387), UINT16_C( 60519), UINT16_C( 39038)),
simde_x_mm256_set_epu16(UINT16_C( 11771), UINT16_C( 1), UINT16_C( 490), UINT16_C( 32408),
UINT16_C( 2225), UINT16_C( 134), UINT16_C( 13968), UINT16_C( 1),
UINT16_C( 387), UINT16_C( 14591), UINT16_C( 24), UINT16_C( 46),
UINT16_C( 8450), UINT16_C( 1053), UINT16_C( 908), UINT16_C( 5686)),
simde_x_mm256_set_epu16(UINT16_C( 2), UINT16_C( 55001), UINT16_C( 77), UINT16_C( 0),
UINT16_C( 27), UINT16_C( 164), UINT16_C( 4), UINT16_C( 10168),
UINT16_C( 169), UINT16_C( 2), UINT16_C( 2463), UINT16_C( 205),
UINT16_C( 2), UINT16_C( 1), UINT16_C( 66), UINT16_C( 6)) },
{ simde_x_mm256_set_epu16(UINT16_C( 22899), UINT16_C( 630), UINT16_C( 34558), UINT16_C( 7884),
UINT16_C( 39724), UINT16_C( 33230), UINT16_C( 54475), UINT16_C( 22805),
UINT16_C( 61755), UINT16_C( 34661), UINT16_C( 28373), UINT16_C( 58279),
UINT16_C( 22187), UINT16_C( 56981), UINT16_C( 43877), UINT16_C( 3469)),
simde_x_mm256_set_epu16(UINT16_C( 12306), UINT16_C( 182), UINT16_C( 29239), UINT16_C( 4194),
UINT16_C( 818), UINT16_C( 16), UINT16_C( 5), UINT16_C( 38),
UINT16_C( 42688), UINT16_C( 8), UINT16_C( 1), UINT16_C( 96),
UINT16_C( 3), UINT16_C( 1), UINT16_C( 508), UINT16_C( 1)),
simde_x_mm256_set_epu16(UINT16_C( 1), UINT16_C( 3), UINT16_C( 1), UINT16_C( 1),
UINT16_C( 48), UINT16_C( 2076), UINT16_C( 10895), UINT16_C( 600),
UINT16_C( 1), UINT16_C( 4332), UINT16_C( 28373), UINT16_C( 607),
UINT16_C( 7395), UINT16_C( 56981), UINT16_C( 86), UINT16_C( 3469)) },
{ simde_x_mm256_set_epu16(UINT16_C( 29363), UINT16_C( 50584), UINT16_C( 56168), UINT16_C( 44370),
UINT16_C( 62910), UINT16_C( 23255), UINT16_C( 39479), UINT16_C( 21044),
UINT16_C( 7491), UINT16_C( 25737), UINT16_C( 6938), UINT16_C( 40142),
UINT16_C( 22210), UINT16_C( 63545), UINT16_C( 33358), UINT16_C( 9014)),
simde_x_mm256_set_epu16(UINT16_C( 61), UINT16_C( 274), UINT16_C( 365), UINT16_C( 58937),
UINT16_C( 2), UINT16_C( 172), UINT16_C( 432), UINT16_C( 2),
UINT16_C( 957), UINT16_C( 351), UINT16_C( 18), UINT16_C( 12717),
UINT16_C( 4), UINT16_C( 417), UINT16_C( 1), UINT16_C( 10550)),
simde_x_mm256_set_epu16(UINT16_C( 481), UINT16_C( 184), UINT16_C( 153), UINT16_C( 0),
UINT16_C( 31455), UINT16_C( 135), UINT16_C( 91), UINT16_C( 10522),
UINT16_C( 7), UINT16_C( 73), UINT16_C( 385), UINT16_C( 3),
UINT16_C( 5552), UINT16_C( 152), UINT16_C( 33358), UINT16_C( 0)) },
{ simde_x_mm256_set_epu16(UINT16_C( 22208), UINT16_C( 58940), UINT16_C( 24739), UINT16_C( 29405),
UINT16_C( 9863), UINT16_C( 41917), UINT16_C( 30045), UINT16_C( 40634),
UINT16_C( 50211), UINT16_C( 4668), UINT16_C( 42314), UINT16_C( 29370),
UINT16_C( 57744), UINT16_C( 37787), UINT16_C( 17171), UINT16_C( 34222)),
simde_x_mm256_set_epu16(UINT16_C( 4256), UINT16_C( 23971), UINT16_C( 171), UINT16_C( 12),
UINT16_C( 8070), UINT16_C( 2906), UINT16_C( 22), UINT16_C( 107),
UINT16_C( 3), UINT16_C( 1), UINT16_C( 28355), UINT16_C( 2210),
UINT16_C( 1), UINT16_C( 1161), UINT16_C( 613), UINT16_C( 51426)),
simde_x_mm256_set_epu16(UINT16_C( 5), UINT16_C( 2), UINT16_C( 144), UINT16_C( 2450),
UINT16_C( 1), UINT16_C( 14), UINT16_C( 1365), UINT16_C( 379),
UINT16_C( 16737), UINT16_C( 4668), UINT16_C( 1), UINT16_C( 13),
UINT16_C( 57744), UINT16_C( 32), UINT16_C( 28), UINT16_C( 0)) },
{ simde_x_mm256_set_epu16(UINT16_C( 9143), UINT16_C( 55963), UINT16_C( 46820), UINT16_C( 55354),
UINT16_C( 21540), UINT16_C( 21596), UINT16_C( 49435), UINT16_C( 42142),
UINT16_C( 28170), UINT16_C( 3714), UINT16_C( 39462), UINT16_C( 28043),
UINT16_C( 45359), UINT16_C( 22609), UINT16_C( 55149), UINT16_C( 21886)),
simde_x_mm256_set_epu16(UINT16_C( 3121), UINT16_C( 103), UINT16_C( 1), UINT16_C( 283),
UINT16_C( 201), UINT16_C( 53), UINT16_C( 25996), UINT16_C( 3169),
UINT16_C( 1), UINT16_C( 2), UINT16_C( 38), UINT16_C( 24),
UINT16_C( 55), UINT16_C( 25444), UINT16_C( 5182), UINT16_C( 9)),
simde_x_mm256_set_epu16(UINT16_C( 2), UINT16_C( 543), UINT16_C( 46820), UINT16_C( 195),
UINT16_C( 107), UINT16_C( 407), UINT16_C( 1), UINT16_C( 13),
UINT16_C( 28170), UINT16_C( 1857), UINT16_C( 1038), UINT16_C( 1168),
UINT16_C( 824), UINT16_C( 0), UINT16_C( 10), UINT16_C( 2431)) },
{ simde_x_mm256_set_epu16(UINT16_C( 51894), UINT16_C( 1840), UINT16_C( 33552), UINT16_C( 50070),
UINT16_C( 16848), UINT16_C( 13340), UINT16_C( 25356), UINT16_C( 34016),
UINT16_C( 61275), UINT16_C( 22886), UINT16_C( 28292), UINT16_C( 37845),
UINT16_C( 1481), UINT16_C( 559), UINT16_C( 12899), UINT16_C( 38851)),
simde_x_mm256_set_epu16(UINT16_C( 16266), UINT16_C( 376), UINT16_C( 62048), UINT16_C( 8),
UINT16_C( 53), UINT16_C( 1573), UINT16_C( 8), UINT16_C( 212),
UINT16_C( 15505), UINT16_C( 1), UINT16_C( 10), UINT16_C( 2744),
UINT16_C( 2), UINT16_C( 5), UINT16_C( 4478), UINT16_C( 12656)),
simde_x_mm256_set_epu16(UINT16_C( 3), UINT16_C( 4), UINT16_C( 0), UINT16_C( 6258),
UINT16_C( 317), UINT16_C( 8), UINT16_C( 3169), UINT16_C( 160),
UINT16_C( 3), UINT16_C( 22886), UINT16_C( 2829), UINT16_C( 13),
UINT16_C( 740), UINT16_C( 111), UINT16_C( 2), UINT16_C( 3)) },
{ simde_x_mm256_set_epu16(UINT16_C( 40946), UINT16_C( 11832), UINT16_C( 52869), UINT16_C( 41324),
UINT16_C( 41064), UINT16_C( 57085), UINT16_C( 14204), UINT16_C( 23869),
UINT16_C( 30467), UINT16_C( 20149), UINT16_C( 58844), UINT16_C( 49602),
UINT16_C( 36092), UINT16_C( 39146), UINT16_C( 62840), UINT16_C( 19573)),
simde_x_mm256_set_epu16(UINT16_C( 7725), UINT16_C( 5897), UINT16_C( 81), UINT16_C( 199),
UINT16_C( 33008), UINT16_C( 55443), UINT16_C( 925), UINT16_C( 4043),
UINT16_C( 362), UINT16_C( 156), UINT16_C( 2592), UINT16_C( 29),
UINT16_C( 213), UINT16_C( 14), UINT16_C( 39), UINT16_C( 178)),
simde_x_mm256_set_epu16(UINT16_C( 5), UINT16_C( 2), UINT16_C( 652), UINT16_C( 207),
UINT16_C( 1), UINT16_C( 1), UINT16_C( 15), UINT16_C( 5),
UINT16_C( 84), UINT16_C( 129), UINT16_C( 22), UINT16_C( 1710),
UINT16_C( 169), UINT16_C( 2796), UINT16_C( 1611), UINT16_C( 109)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_div_epu16(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_u16(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_div_epu32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_x_mm256_set_epu32(UINT32_C( 621216267), UINT32_C(2973447507), UINT32_C(1814279233), UINT32_C(3673557536),
UINT32_C(4015780858), UINT32_C(1070914538), UINT32_C(2707640519), UINT32_C(3041291274)),
simde_x_mm256_set_epu32(UINT32_C( 122731), UINT32_C( 51630147), UINT32_C( 152670), UINT32_C( 7731229),
UINT32_C( 711400), UINT32_C( 1744981), UINT32_C( 164943127), UINT32_C( 169494)),
simde_x_mm256_set_epu32(UINT32_C( 5061), UINT32_C( 57), UINT32_C( 11883), UINT32_C( 475),
UINT32_C( 5644), UINT32_C( 613), UINT32_C( 16), UINT32_C( 17943)) },
{ simde_x_mm256_set_epu32(UINT32_C(1084014678), UINT32_C(1666523830), UINT32_C(3454667769), UINT32_C(4029614313),
UINT32_C(3425016021), UINT32_C(2449839571), UINT32_C(1601532569), UINT32_C(1519388398)),
simde_x_mm256_set_epu32(UINT32_C( 130157), UINT32_C( 5585515), UINT32_C( 62691231), UINT32_C( 37123),
UINT32_C( 2515600), UINT32_C( 106484982), UINT32_C(4168501606), UINT32_C( 2781814)),
simde_x_mm256_set_epu32(UINT32_C( 8328), UINT32_C( 298), UINT32_C( 55), UINT32_C( 108547),
UINT32_C( 1361), UINT32_C( 23), UINT32_C( 0), UINT32_C( 546)) },
{ simde_x_mm256_set_epu32(UINT32_C(2187853776), UINT32_C( 131263503), UINT32_C( 20338031), UINT32_C(3062800456),
UINT32_C(1802896354), UINT32_C( 22231847), UINT32_C(3438214155), UINT32_C(1776513196)),
simde_x_mm256_set_epu32(UINT32_C( 28353115), UINT32_C( 92496104), UINT32_C( 15335526), UINT32_C( 99105532),
UINT32_C( 5905009), UINT32_C( 27824), UINT32_C( 28986), UINT32_C( 12459911)),
simde_x_mm256_set_epu32(UINT32_C( 77), UINT32_C( 1), UINT32_C( 1), UINT32_C( 30),
UINT32_C( 305), UINT32_C( 799), UINT32_C( 118616), UINT32_C( 142)) },
{ simde_x_mm256_set_epu32(UINT32_C( 524596333), UINT32_C(3965897825), UINT32_C(1593754725), UINT32_C( 694203496),
UINT32_C(1917650066), UINT32_C(2692610113), UINT32_C(1620259645), UINT32_C( 607116294)),
simde_x_mm256_set_epu32(UINT32_C( 29757558), UINT32_C( 80117), UINT32_C( 412054571), UINT32_C( 878110),
UINT32_C(4124070325), UINT32_C( 8250706), UINT32_C( 7930575), UINT32_C( 51813)),
simde_x_mm256_set_epu32(UINT32_C( 17), UINT32_C( 49501), UINT32_C( 3), UINT32_C( 790),
UINT32_C( 0), UINT32_C( 326), UINT32_C( 204), UINT32_C( 11717)) },
{ simde_x_mm256_set_epu32(UINT32_C( 625862951), UINT32_C( 793130310), UINT32_C(2489185635), UINT32_C(2468815203),
UINT32_C(3079066921), UINT32_C( 802958712), UINT32_C(1537818066), UINT32_C(1678295724)),
simde_x_mm256_set_epu32(UINT32_C( 8259237), UINT32_C( 229091), UINT32_C( 7899398), UINT32_C( 41009690),
UINT32_C( 26030333), UINT32_C( 228627), UINT32_C(1200021710), UINT32_C( 186204)),
simde_x_mm256_set_epu32(UINT32_C( 75), UINT32_C( 3462), UINT32_C( 315), UINT32_C( 60),
UINT32_C( 118), UINT32_C( 3512), UINT32_C( 1), UINT32_C( 9013)) },
{ simde_x_mm256_set_epu32(UINT32_C(3334078645), UINT32_C(2226952893), UINT32_C(1901933944), UINT32_C(3456551705),
UINT32_C(3394846076), UINT32_C(2592342753), UINT32_C(1822000161), UINT32_C(3060682219)),
simde_x_mm256_set_epu32(UINT32_C( 55529), UINT32_C( 95077), UINT32_C( 61849330), UINT32_C( 77269),
UINT32_C( 181901), UINT32_C( 66287), UINT32_C( 46407), UINT32_C( 1962)),
simde_x_mm256_set_epu32(UINT32_C( 60042), UINT32_C( 23422), UINT32_C( 30), UINT32_C( 44734),
UINT32_C( 18663), UINT32_C( 39107), UINT32_C( 39261), UINT32_C( 1559980)) },
{ simde_x_mm256_set_epu32(UINT32_C(2418478797), UINT32_C(3856569345), UINT32_C(2562700829), UINT32_C(2670510577),
UINT32_C(3958231909), UINT32_C(3386864730), UINT32_C(2249491002), UINT32_C( 367242130)),
simde_x_mm256_set_epu32(UINT32_C( 106591767), UINT32_C( 591565864), UINT32_C( 241208), UINT32_C( 384474),
UINT32_C( 63569588), UINT32_C(1007016971), UINT32_C( 701090048), UINT32_C( 4482965)),
simde_x_mm256_set_epu32(UINT32_C( 22), UINT32_C( 6), UINT32_C( 10624), UINT32_C( 6945),
UINT32_C( 62), UINT32_C( 3), UINT32_C( 3), UINT32_C( 81)) },
{ simde_x_mm256_set_epu32(UINT32_C(3497551851), UINT32_C(3538232808), UINT32_C(3581222707), UINT32_C(2092274030),
UINT32_C(1202922035), UINT32_C(3381143079), UINT32_C(1645890362), UINT32_C(2497764821)),
simde_x_mm256_set_epu32(UINT32_C( 7255461), UINT32_C( 387871), UINT32_C( 216379987), UINT32_C( 1108325),
UINT32_C( 9779926), UINT32_C( 265173482), UINT32_C( 305369), UINT32_C(1628979148)),
simde_x_mm256_set_epu32(UINT32_C( 482), UINT32_C( 9122), UINT32_C( 16), UINT32_C( 1887),
UINT32_C( 122), UINT32_C( 12), UINT32_C( 5389), UINT32_C( 1)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_div_epu32(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_u32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_div_epu64(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_x_mm256_set_epu64x(UINT64_C(10385902570114433083), UINT64_C(14228451038995253976),
UINT64_C( 3524803476344021799), UINT64_C( 9008088981795720991)),
simde_x_mm256_set_epu64x(UINT64_C( 11435629647830), UINT64_C( 134705148152),
UINT64_C( 1685), UINT64_C( 72468903699)),
simde_x_mm256_set_epu64x(UINT64_C(18446744073708846728), UINT64_C(18446744073678236607),
UINT64_C( 2091871499313959), UINT64_C( 124302818)) },
{ simde_x_mm256_set_epu64x(UINT64_C( 2776707612149100363), UINT64_C(15446686956822865619),
UINT64_C( 8116027459326381863), UINT64_C(10577862568627142107)),
simde_x_mm256_set_epu64x(UINT64_C( 160900), UINT64_C( 876),
UINT64_C( 6656645), UINT64_C( 198)),
simde_x_mm256_set_epu64x(UINT64_C( 17257349982281), UINT64_C(18443319350973379601),
UINT64_C( 1219236936824), UINT64_C(18407002247926307124)) },
{ simde_x_mm256_set_epu64x(UINT64_C(17966513918331168112), UINT64_C(15404442576328540960),
UINT64_C( 1544001744444053712), UINT64_C(12311626015854130554)),
simde_x_mm256_set_epu64x(UINT64_C( 73453582701), UINT64_C( 2241703492778),
UINT64_C( 149), UINT64_C( 1898802076338580)),
simde_x_mm256_set_epu64x(UINT64_C(18446744073703013744), UINT64_C(18446744073708194478),
UINT64_C( 10362427815060763), UINT64_C(18446744073709548385)) },
{ simde_x_mm256_set_epu64x(UINT64_C( 4996618049503500636), UINT64_C( 3587306346705364576),
UINT64_C( 1416661578746677042), UINT64_C(18012200189266188151)),
simde_x_mm256_set_epu64x(UINT64_C( 9141117518131), UINT64_C( 259684114065326460),
UINT64_C( 3735868918), UINT64_C( 13028085907926)),
simde_x_mm256_set_epu64x(UINT64_C( 546609), UINT64_C( 13),
UINT64_C( 379205376), UINT64_C(18446744073709518262)) },
{ simde_x_mm256_set_epu64x(UINT64_C(17900245410321819662), UINT64_C( 86463307544105486),
UINT64_C( 7004808110937624000), UINT64_C( 5352056724630121100)),
simde_x_mm256_set_epu64x(UINT64_C( 574976069), UINT64_C( 26168849408611714),
UINT64_C( 479458176), UINT64_C( 85883846687)),
simde_x_mm256_set_epu64x(UINT64_C(18446744072759079601), UINT64_C( 3),
UINT64_C( 14609841820), UINT64_C( 62317384)) },
{ simde_x_mm256_set_epu64x(UINT64_C(18191047755947595201), UINT64_C(11274709867061747164),
UINT64_C( 4957427800472277352), UINT64_C( 2636046644056480855)),
simde_x_mm256_set_epu64x(UINT64_C( 455513034), UINT64_C( 4176708352330988763),
UINT64_C( 255407), UINT64_C( 77468887445572755)),
simde_x_mm256_set_epu64x(UINT64_C(18446744073148214621), UINT64_C(18446744073709551615),
UINT64_C( 19409913590748), UINT64_C( 34)) },
{ simde_x_mm256_set_epu64x(UINT64_C(17236629464649076584), UINT64_C( 6716520602983844465),
UINT64_C(12794135593178656259), UINT64_C( 3865374743078695737)),
simde_x_mm256_set_epu64x(UINT64_C( 13893724010244), UINT64_C( 1),
UINT64_C( 142890905), UINT64_C( 135073488234)),
simde_x_mm256_set_epu64x(UINT64_C(18446744073709464519), UINT64_C( 6716520602983844465),
UINT64_C(18446744034150641408), UINT64_C( 28616827)) },
{ simde_x_mm256_set_epu64x(UINT64_C( 3248934010021333275), UINT64_C( 8464322280604302303),
UINT64_C(10783963704762759650), UINT64_C(14288989654597257942)),
simde_x_mm256_set_epu64x(UINT64_C( 37187973814779), UINT64_C( 988730192),
UINT64_C( 9409064941619), UINT64_C( 554649997)),
simde_x_mm256_set_epu64x(UINT64_C( 87365), UINT64_C( 8560800862),
UINT64_C(18446744073708737212), UINT64_C(18446744066213374853)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_div_epi64(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_i64(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_div_epi8(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_mm512_set_epi8(INT8_C( 114), INT8_C( 89), INT8_C( 1), INT8_C( 122),
INT8_C( 12), INT8_C( 107), INT8_C( 92), INT8_C(-102),
INT8_C( -63), INT8_C( 120), INT8_C( 107), INT8_C( -43),
INT8_C(-119), INT8_C( -10), INT8_C( 98), INT8_C( -26),
INT8_C( 122), INT8_C( 1), INT8_C( -83), INT8_C( 43),
INT8_C( 82), INT8_C( -59), INT8_C( -43), INT8_C( -10),
INT8_C( 77), INT8_C( -22), INT8_C( -72), INT8_C( -94),
INT8_C( 75), INT8_C( -23), INT8_C( -92), INT8_C( -69),
INT8_C( 108), INT8_C( 26), INT8_C( 71), INT8_C( -21),
INT8_C( 15), INT8_C( 107), INT8_C(-112), INT8_C( -22),
INT8_C( -24), INT8_C( 35), INT8_C( 87), INT8_C( 75),
INT8_C( 27), INT8_C( -73), INT8_C( 9), INT8_C( -72),
INT8_C( 35), INT8_C( -9), INT8_C( -68), INT8_C( 73),
INT8_C( -61), INT8_C( 118), INT8_C( 78), INT8_C( -20),
INT8_C( -42), INT8_C( -19), INT8_C(-125), INT8_C( 51),
INT8_C( -14), INT8_C( 17), INT8_C( -24), INT8_C( -72)),
simde_mm512_set_epi8(INT8_C( 14), INT8_C(-123), INT8_C( 73), INT8_C( -6),
INT8_C( -78), INT8_C( -38), INT8_C( -82), INT8_C( -80),
INT8_C( 31), INT8_C( -9), INT8_C( 35), INT8_C(-110),
INT8_C( -7), INT8_C( 74), INT8_C( -30), INT8_C( 100),
INT8_C( 10), INT8_C( 23), INT8_C( -11), INT8_C( 90),
INT8_C( 71), INT8_C(-126), INT8_C( -11), INT8_C( -5),
INT8_C( 26), INT8_C( 58), INT8_C(-123), INT8_C( 125),
INT8_C(-104), INT8_C( 39), INT8_C( 75), INT8_C( 69),
INT8_C( 5), INT8_C(-119), INT8_C( 20), INT8_C( 6),
INT8_C( -18), INT8_C( -87), INT8_C( 95), INT8_C( 24),
INT8_C( 15), INT8_C( -48), INT8_C( -40), INT8_C( 79),
INT8_C(-107), INT8_C( -73), INT8_C(-108), INT8_C( -43),
INT8_C( 53), INT8_C( -95), INT8_C( 75), INT8_C(-123),
INT8_C( 61), INT8_C( 28), INT8_C( 20), INT8_C( -5),
INT8_C(-127), INT8_C( -90), INT8_C( 94), INT8_C( -61),
INT8_C( 91), INT8_C( -70), INT8_C(-111), INT8_C( 30)),
simde_mm512_set_epi8(INT8_C( 8), INT8_C( 0), INT8_C( 0), INT8_C( -20),
INT8_C( 0), INT8_C( -2), INT8_C( -1), INT8_C( 1),
INT8_C( -2), INT8_C( -13), INT8_C( 3), INT8_C( 0),
INT8_C( 17), INT8_C( 0), INT8_C( -3), INT8_C( 0),
INT8_C( 12), INT8_C( 0), INT8_C( 7), INT8_C( 0),
INT8_C( 1), INT8_C( 0), INT8_C( 3), INT8_C( 2),
INT8_C( 2), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( -1), INT8_C( -1),
INT8_C( 21), INT8_C( 0), INT8_C( 3), INT8_C( -3),
INT8_C( 0), INT8_C( -1), INT8_C( -1), INT8_C( 0),
INT8_C( -1), INT8_C( 0), INT8_C( -2), INT8_C( 0),
INT8_C( 0), INT8_C( 1), INT8_C( 0), INT8_C( 1),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( -1), INT8_C( 4), INT8_C( 3), INT8_C( 4),
INT8_C( 0), INT8_C( 0), INT8_C( -1), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( -2)) },
{ simde_mm512_set_epi8(INT8_C( 12), INT8_C( -52), INT8_C( -7), INT8_C( 17),
INT8_C(-122), INT8_C( 53), INT8_C( -15), INT8_C(-121),
INT8_C( -47), INT8_C(-109), INT8_C( -20), INT8_C( -5),
INT8_C( -34), INT8_C( 6), INT8_C( 3), INT8_C( -49),
INT8_C( 63), INT8_C( 48), INT8_C( -18), INT8_C( 117),
INT8_C( -63), INT8_C( 63), INT8_C( 77), INT8_C( -90),
INT8_C( -12), INT8_C( 83), INT8_C( 69), INT8_C( 113),
INT8_C( 28), INT8_C( 104), INT8_C( -69), INT8_C( -69),
INT8_C(-128), INT8_C( 96), INT8_C( 18), INT8_C( 9),
INT8_C( 99), INT8_C(-100), INT8_C( -63), INT8_C( 74),
INT8_C( -69), INT8_C( 22), INT8_C( 126), INT8_C( 62),
INT8_C( 46), INT8_C( 88), INT8_C( 24), INT8_C( 21),
INT8_C( 121), INT8_C( 64), INT8_C( 24), INT8_C(-125),
INT8_C(-125), INT8_C( -56), INT8_C( -13), INT8_C( 51),
INT8_C( 53), INT8_C( -41), INT8_C( -85), INT8_C(-121),
INT8_C( -44), INT8_C( -43), INT8_C( -24), INT8_C( 102)),
simde_mm512_set_epi8(INT8_C( 109), INT8_C(-119), INT8_C( 12), INT8_C( 72),
INT8_C( -36), INT8_C(-115), INT8_C( 98), INT8_C(-110),
INT8_C( 58), INT8_C( -6), INT8_C( -54), INT8_C( 39),
INT8_C( -42), INT8_C( -8), INT8_C( -77), INT8_C( -22),
INT8_C( -49), INT8_C( 4), INT8_C( 119), INT8_C( 82),
INT8_C( 112), INT8_C( 3), INT8_C( 74), INT8_C( 94),
INT8_C( -27), INT8_C( 90), INT8_C( 17), INT8_C( 13),
INT8_C( 5), INT8_C( 89), INT8_C(-121), INT8_C( 56),
INT8_C( 46), INT8_C( -66), INT8_C( 124), INT8_C( -23),
INT8_C( 38), INT8_C( 53), INT8_C( 18), INT8_C( -68),
INT8_C( -6), INT8_C( -62), INT8_C( -9), INT8_C( 11),
INT8_C( -6), INT8_C( 56), INT8_C( -81), INT8_C( 41),
INT8_C( 112), INT8_C( 58), INT8_C( -21), INT8_C( 108),
INT8_C( 17), INT8_C( 40), INT8_C( 4), INT8_C( 80),
INT8_C( 75), INT8_C( 35), INT8_C( 80), INT8_C( -85),
INT8_C( 88), INT8_C( -11), INT8_C( 23), INT8_C( 51)),
simde_mm512_set_epi8(INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 3), INT8_C( 0), INT8_C( 0), INT8_C( 1),
INT8_C( 0), INT8_C( 18), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 2),
INT8_C( -1), INT8_C( 12), INT8_C( 0), INT8_C( 1),
INT8_C( 0), INT8_C( 21), INT8_C( 1), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( 4), INT8_C( 8),
INT8_C( 5), INT8_C( 1), INT8_C( 0), INT8_C( -1),
INT8_C( -2), INT8_C( -1), INT8_C( 0), INT8_C( 0),
INT8_C( 2), INT8_C( -1), INT8_C( -3), INT8_C( -1),
INT8_C( 11), INT8_C( 0), INT8_C( -14), INT8_C( 5),
INT8_C( -7), INT8_C( 1), INT8_C( 0), INT8_C( 0),
INT8_C( 1), INT8_C( 1), INT8_C( -1), INT8_C( -1),
INT8_C( -7), INT8_C( -1), INT8_C( -3), INT8_C( 0),
INT8_C( 0), INT8_C( -1), INT8_C( -1), INT8_C( 1),
INT8_C( 0), INT8_C( 3), INT8_C( -1), INT8_C( 2)) },
{ simde_mm512_set_epi8(INT8_C(-111), INT8_C( -3), INT8_C( 110), INT8_C( -96),
INT8_C( 117), INT8_C( -29), INT8_C(-127), INT8_C( 101),
INT8_C(-120), INT8_C( 11), INT8_C( 87), INT8_C( 17),
INT8_C(-108), INT8_C( 87), INT8_C( 4), INT8_C( -21),
INT8_C( 98), INT8_C( 2), INT8_C( -60), INT8_C( -28),
INT8_C( 66), INT8_C(-109), INT8_C( 8), INT8_C( -58),
INT8_C( 13), INT8_C( -66), INT8_C( -49), INT8_C( 93),
INT8_C(-119), INT8_C( 58), INT8_C( 30), INT8_C( 10),
INT8_C( -11), INT8_C( 78), INT8_C( 76), INT8_C( 108),
INT8_C( -34), INT8_C( -94), INT8_C( -77), INT8_C(-122),
INT8_C( 37), INT8_C( -32), INT8_C( -97), INT8_C( 121),
INT8_C( -95), INT8_C( -80), INT8_C( -87), INT8_C( -89),
INT8_C( -4), INT8_C( 115), INT8_C( -42), INT8_C( -55),
INT8_C( 95), INT8_C( -63), INT8_C( 31), INT8_C( -74),
INT8_C( -45), INT8_C( 119), INT8_C( 57), INT8_C( -52),
INT8_C( -69), INT8_C(-123), INT8_C( 106), INT8_C( 119)),
simde_mm512_set_epi8(INT8_C( -74), INT8_C( -32), INT8_C( 89), INT8_C( 50),
INT8_C(-105), INT8_C( 85), INT8_C( -71), INT8_C( 105),
INT8_C( -37), INT8_C( -78), INT8_C(-107), INT8_C( -67),
INT8_C( 9), INT8_C( 2), INT8_C( 83), INT8_C( 67),
INT8_C( 25), INT8_C(-103), INT8_C( -90), INT8_C( 30),
INT8_C( 69), INT8_C(-127), INT8_C( 114), INT8_C( -99),
INT8_C( -97), INT8_C( -52), INT8_C( 120), INT8_C( 78),
INT8_C( 97), INT8_C( 124), INT8_C( 31), INT8_C( 72),
INT8_C( -6), INT8_C( 19), INT8_C( -4), INT8_C( -65),
INT8_C( 107), INT8_C( -15), INT8_C(-116), INT8_C( -13),
INT8_C( 106), INT8_C( -71), INT8_C( -14), INT8_C( -87),
INT8_C(-122), INT8_C( -59), INT8_C( -65), INT8_C( -58),
INT8_C( -26), INT8_C( 55), INT8_C( 28), INT8_C( -31),
INT8_C( -20), INT8_C( -40), INT8_C( -47), INT8_C( 58),
INT8_C( -3), INT8_C( 67), INT8_C( -47), INT8_C( 93),
INT8_C( -77), INT8_C( 21), INT8_C( 49), INT8_C( -54)),
simde_mm512_set_epi8(INT8_C( 1), INT8_C( 0), INT8_C( 1), INT8_C( -1),
INT8_C( -1), INT8_C( 0), INT8_C( 1), INT8_C( 0),
INT8_C( 3), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( -12), INT8_C( 43), INT8_C( 0), INT8_C( 0),
INT8_C( 3), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( 1), INT8_C( 0), INT8_C( 1),
INT8_C( -1), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 1), INT8_C( 4), INT8_C( -19), INT8_C( -1),
INT8_C( 0), INT8_C( 6), INT8_C( 0), INT8_C( 9),
INT8_C( 0), INT8_C( 0), INT8_C( 6), INT8_C( -1),
INT8_C( 0), INT8_C( 1), INT8_C( 1), INT8_C( 1),
INT8_C( 0), INT8_C( 2), INT8_C( -1), INT8_C( 1),
INT8_C( -4), INT8_C( 1), INT8_C( 0), INT8_C( -1),
INT8_C( 15), INT8_C( 1), INT8_C( -1), INT8_C( 0),
INT8_C( 0), INT8_C( -5), INT8_C( 2), INT8_C( -2)) },
{ simde_mm512_set_epi8(INT8_C( -91), INT8_C( 110), INT8_C( 126), INT8_C( 44),
INT8_C( 21), INT8_C( -84), INT8_C( 100), INT8_C( -15),
INT8_C( -61), INT8_C( -53), INT8_C( 75), INT8_C( -30),
INT8_C( -56), INT8_C( -86), INT8_C( 52), INT8_C( 108),
INT8_C( 96), INT8_C( 6), INT8_C(-100), INT8_C(-109),
INT8_C( -7), INT8_C( -22), INT8_C( 109), INT8_C( 124),
INT8_C( 85), INT8_C( 53), INT8_C( -45), INT8_C( 122),
INT8_C( 7), INT8_C( -21), INT8_C(-123), INT8_C( 4),
INT8_C( 3), INT8_C( 94), INT8_C(-127), INT8_C( 73),
INT8_C( 65), INT8_C( -69), INT8_C( -91), INT8_C(-115),
INT8_C( 117), INT8_C(-104), INT8_C( 66), INT8_C( 79),
INT8_C( -63), INT8_C(-115), INT8_C( -77), INT8_C( -89),
INT8_C(-113), INT8_C( 34), INT8_C( 100), INT8_C( 96),
INT8_C(-101), INT8_C( -34), INT8_C( 64), INT8_C( -59),
INT8_C( -53), INT8_C( 87), INT8_C( 48), INT8_C( 95),
INT8_C( -53), INT8_C( 61), INT8_C( 63), INT8_C( 106)),
simde_mm512_set_epi8(INT8_C( -1), INT8_C( 95), INT8_C( 91), INT8_C( 117),
INT8_C( 15), INT8_C( -50), INT8_C( -39), INT8_C( 74),
INT8_C( 36), INT8_C( 100), INT8_C( -62), INT8_C(-111),
INT8_C( 9), INT8_C( 41), INT8_C( 36), INT8_C( -21),
INT8_C( 71), INT8_C( -85), INT8_C( 120), INT8_C( -33),
INT8_C( 125), INT8_C( 38), INT8_C(-127), INT8_C( 39),
INT8_C( 28), INT8_C(-118), INT8_C( 31), INT8_C( 92),
INT8_C( 22), INT8_C( 48), INT8_C( 122), INT8_C( -6),
INT8_C( 107), INT8_C(-101), INT8_C( 14), INT8_C( -17),
INT8_C( 26), INT8_C( -4), INT8_C( -71), INT8_C( 13),
INT8_C( -39), INT8_C( -26), INT8_C( -37), INT8_C( 110),
INT8_C( 36), INT8_C( 78), INT8_C( -24), INT8_C( -52),
INT8_C(-117), INT8_C( -27), INT8_C( 113), INT8_C(-111),
INT8_C( -59), INT8_C( 38), INT8_C( -10), INT8_C( -53),
INT8_C( 110), INT8_C( 62), INT8_C( -4), INT8_C( 19),
INT8_C( -15), INT8_C( 42), INT8_C( 122), INT8_C( 105)),
simde_mm512_set_epi8(INT8_C( 91), INT8_C( 1), INT8_C( 1), INT8_C( 0),
INT8_C( 1), INT8_C( 1), INT8_C( -2), INT8_C( 0),
INT8_C( -1), INT8_C( 0), INT8_C( -1), INT8_C( 0),
INT8_C( -6), INT8_C( -2), INT8_C( 1), INT8_C( -5),
INT8_C( 1), INT8_C( 0), INT8_C( 0), INT8_C( 3),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 3),
INT8_C( 3), INT8_C( 0), INT8_C( -1), INT8_C( 1),
INT8_C( 0), INT8_C( 0), INT8_C( -1), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( -9), INT8_C( -4),
INT8_C( 2), INT8_C( 17), INT8_C( 1), INT8_C( -8),
INT8_C( -3), INT8_C( 4), INT8_C( -1), INT8_C( 0),
INT8_C( -1), INT8_C( -1), INT8_C( 3), INT8_C( 1),
INT8_C( 0), INT8_C( -1), INT8_C( 0), INT8_C( 0),
INT8_C( 1), INT8_C( 0), INT8_C( -6), INT8_C( 1),
INT8_C( 0), INT8_C( 1), INT8_C( -12), INT8_C( 5),
INT8_C( 3), INT8_C( 1), INT8_C( 0), INT8_C( 1)) },
{ simde_mm512_set_epi8(INT8_C( -55), INT8_C( -14), INT8_C( 9), INT8_C(-109),
INT8_C( 77), INT8_C( -36), INT8_C( 82), INT8_C( -60),
INT8_C( -11), INT8_C( 52), INT8_C( 95), INT8_C( 118),
INT8_C( 124), INT8_C( 103), INT8_C( 108), INT8_C( 5),
INT8_C( -7), INT8_C( 55), INT8_C( 1), INT8_C( -90),
INT8_C( 89), INT8_C( 106), INT8_C( -80), INT8_C(-113),
INT8_C( -97), INT8_C( 113), INT8_C( 100), INT8_C( 9),
INT8_C( 122), INT8_C( -51), INT8_C(-121), INT8_C( 78),
INT8_C(-100), INT8_C( 26), INT8_C( -23), INT8_C( -89),
INT8_C( 20), INT8_C( 19), INT8_C( -91), INT8_C( -38),
INT8_C( -59), INT8_C( 10), INT8_C(-121), INT8_C( -30),
INT8_C( 79), INT8_C( 49), INT8_C( 104), INT8_C( 55),
INT8_C( 2), INT8_C( -2), INT8_C( -24), INT8_C( -48),
INT8_C( -25), INT8_C( -39), INT8_C( 89), INT8_C( 19),
INT8_C( -33), INT8_C( 101), INT8_C( 31), INT8_C( -59),
INT8_C(-123), INT8_C( 38), INT8_C( 124), INT8_C( 108)),
simde_mm512_set_epi8(INT8_C( -47), INT8_C( -85), INT8_C( 13), INT8_C( -86),
INT8_C( 92), INT8_C( 23), INT8_C( 69), INT8_C( -53),
INT8_C( 11), INT8_C( -74), INT8_C( 93), INT8_C( 45),
INT8_C( 123), INT8_C( -37), INT8_C( 6), INT8_C( -51),
INT8_C( 52), INT8_C( -77), INT8_C( -79), INT8_C( -50),
INT8_C( -32), INT8_C( 4), INT8_C( -47), INT8_C( -53),
INT8_C( -18), INT8_C( -18), INT8_C( 115), INT8_C( 117),
INT8_C( -67), INT8_C( -53), INT8_C( -72), INT8_C( 83),
INT8_C( -37), INT8_C( 34), INT8_C( 127), INT8_C( -10),
INT8_C( 126), INT8_C( -99), INT8_C(-106), INT8_C( 33),
INT8_C( 106), INT8_C( -41), INT8_C( -43), INT8_C( -4),
INT8_C(-104), INT8_C( 77), INT8_C(-107), INT8_C( -78),
INT8_C( 126), INT8_C( 37), INT8_C(-124), INT8_C( -92),
INT8_C( -30), INT8_C( -11), INT8_C( -49), INT8_C( 22),
INT8_C( 41), INT8_C( 82), INT8_C( -75), INT8_C( 81),
INT8_C( 39), INT8_C( -91), INT8_C( 65), INT8_C( -12)),
simde_mm512_set_epi8(INT8_C( 1), INT8_C( 0), INT8_C( 0), INT8_C( 1),
INT8_C( 0), INT8_C( -1), INT8_C( 1), INT8_C( 1),
INT8_C( -1), INT8_C( 0), INT8_C( 1), INT8_C( 2),
INT8_C( 1), INT8_C( -2), INT8_C( 18), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 1),
INT8_C( -2), INT8_C( 26), INT8_C( 1), INT8_C( 2),
INT8_C( 5), INT8_C( -6), INT8_C( 0), INT8_C( 0),
INT8_C( -1), INT8_C( 0), INT8_C( 1), INT8_C( 0),
INT8_C( 2), INT8_C( 0), INT8_C( 0), INT8_C( 8),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( -1),
INT8_C( 0), INT8_C( 0), INT8_C( 2), INT8_C( 7),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( 3), INT8_C( -1), INT8_C( 0),
INT8_C( 0), INT8_C( 1), INT8_C( 0), INT8_C( 0),
INT8_C( -3), INT8_C( 0), INT8_C( 1), INT8_C( -9)) },
{ simde_mm512_set_epi8(INT8_C( 101), INT8_C( 62), INT8_C( -23), INT8_C( 48),
INT8_C( 118), INT8_C( 51), INT8_C( -2), INT8_C(-103),
INT8_C( 110), INT8_C( -27), INT8_C( 109), INT8_C( 60),
INT8_C( 81), INT8_C( 82), INT8_C( 61), INT8_C( -96),
INT8_C( -57), INT8_C( 116), INT8_C( -5), INT8_C( 0),
INT8_C( 28), INT8_C( 71), INT8_C( -24), INT8_C( 46),
INT8_C( -73), INT8_C( 2), INT8_C( -88), INT8_C( 76),
INT8_C( 95), INT8_C( -58), INT8_C( 94), INT8_C( 46),
INT8_C( 20), INT8_C( 112), INT8_C( -69), INT8_C( 111),
INT8_C( -44), INT8_C( -74), INT8_C( -18), INT8_C( 53),
INT8_C( 127), INT8_C( 36), INT8_C( 79), INT8_C( -48),
INT8_C( 114), INT8_C( 84), INT8_C( 65), INT8_C(-112),
INT8_C(-112), INT8_C( 23), INT8_C( 37), INT8_C( 63),
INT8_C( -88), INT8_C( -57), INT8_C( 100), INT8_C( 121),
INT8_C( 97), INT8_C( 122), INT8_C( 12), INT8_C( -79),
INT8_C( 47), INT8_C( 60), INT8_C( -36), INT8_C( -83)),
simde_mm512_set_epi8(INT8_C( -6), INT8_C( 53), INT8_C( 88), INT8_C( -36),
INT8_C( 96), INT8_C( 32), INT8_C( 77), INT8_C( 2),
INT8_C( -8), INT8_C( -42), INT8_C( -69), INT8_C( 40),
INT8_C( -69), INT8_C( 97), INT8_C( 30), INT8_C( 102),
INT8_C( -84), INT8_C( -54), INT8_C(-126), INT8_C( 91),
INT8_C( 69), INT8_C( 35), INT8_C( 100), INT8_C(-118),
INT8_C( -93), INT8_C( 108), INT8_C( 21), INT8_C( -16),
INT8_C( 32), INT8_C( 106), INT8_C( -36), INT8_C( -46),
INT8_C( -28), INT8_C( -81), INT8_C( 80), INT8_C( 14),
INT8_C( -78), INT8_C( 3), INT8_C( 82), INT8_C(-104),
INT8_C( 13), INT8_C( -56), INT8_C(-106), INT8_C( 89),
INT8_C( -24), INT8_C( 42), INT8_C( 41), INT8_C( 68),
INT8_C( -88), INT8_C(-107), INT8_C( -36), INT8_C( 52),
INT8_C( 32), INT8_C( -59), INT8_C( -33), INT8_C( 120),
INT8_C( 47), INT8_C(-127), INT8_C( 64), INT8_C( 114),
INT8_C( 107), INT8_C( -75), INT8_C( 127), INT8_C( 23)),
simde_mm512_set_epi8(INT8_C( -16), INT8_C( 1), INT8_C( 0), INT8_C( -1),
INT8_C( 1), INT8_C( 1), INT8_C( 0), INT8_C( -51),
INT8_C( -13), INT8_C( 0), INT8_C( -1), INT8_C( 1),
INT8_C( -1), INT8_C( 0), INT8_C( 2), INT8_C( 0),
INT8_C( 0), INT8_C( -2), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( 2), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( -4), INT8_C( -4),
INT8_C( 2), INT8_C( 0), INT8_C( -2), INT8_C( -1),
INT8_C( 0), INT8_C( -1), INT8_C( 0), INT8_C( 7),
INT8_C( 0), INT8_C( -24), INT8_C( 0), INT8_C( 0),
INT8_C( 9), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( -4), INT8_C( 2), INT8_C( 1), INT8_C( -1),
INT8_C( 1), INT8_C( 0), INT8_C( -1), INT8_C( 1),
INT8_C( -2), INT8_C( 0), INT8_C( -3), INT8_C( 1),
INT8_C( 2), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( -3)) },
{ simde_mm512_set_epi8(INT8_C( 106), INT8_C( -71), INT8_C( 61), INT8_C( 19),
INT8_C( 29), INT8_C( 79), INT8_C( 45), INT8_C( 94),
INT8_C(-112), INT8_C( 60), INT8_C( 2), INT8_C( 77),
INT8_C( 30), INT8_C( -34), INT8_C( 102), INT8_C( 43),
INT8_C( -87), INT8_C( 52), INT8_C(-104), INT8_C( -8),
INT8_C(-103), INT8_C( 79), INT8_C( -22), INT8_C( 31),
INT8_C( 11), INT8_C( 124), INT8_C( 70), INT8_C( -64),
INT8_C( -91), INT8_C( 88), INT8_C( -70), INT8_C( -61),
INT8_C( -84), INT8_C(-108), INT8_C( -57), INT8_C( 13),
INT8_C( -58), INT8_C( -7), INT8_C( 39), INT8_C( 66),
INT8_C( 50), INT8_C( -61), INT8_C( -9), INT8_C( -41),
INT8_C( 25), INT8_C( -31), INT8_C( 64), INT8_C( 18),
INT8_C( 73), INT8_C( 60), INT8_C( -53), INT8_C( 42),
INT8_C( -1), INT8_C( 50), INT8_C( 95), INT8_C( 78),
INT8_C( 39), INT8_C( -9), INT8_C(-121), INT8_C( -72),
INT8_C( 48), INT8_C( 20), INT8_C( 76), INT8_C( -48)),
simde_mm512_set_epi8(INT8_C( 12), INT8_C( 55), INT8_C(-111), INT8_C( -85),
INT8_C( -94), INT8_C( -11), INT8_C( 57), INT8_C( 93),
INT8_C( 32), INT8_C( 57), INT8_C( 61), INT8_C( -21),
INT8_C(-102), INT8_C( 75), INT8_C( -15), INT8_C(-114),
INT8_C( 26), INT8_C( 71), INT8_C(-127), INT8_C( -52),
INT8_C( -57), INT8_C( -26), INT8_C( -36), INT8_C( -4),
INT8_C( -7), INT8_C( 40), INT8_C( 60), INT8_C( 82),
INT8_C( 6), INT8_C( -12), INT8_C( 52), INT8_C( -37),
INT8_C( -96), INT8_C(-117), INT8_C( 104), INT8_C( -99),
INT8_C( -1), INT8_C( 95), INT8_C( 81), INT8_C( -70),
INT8_C( -22), INT8_C( -86), INT8_C( 114), INT8_C( -43),
INT8_C(-120), INT8_C( 109), INT8_C( -86), INT8_C( -33),
INT8_C( -23), INT8_C( 69), INT8_C( -80), INT8_C( 61),
INT8_C( -35), INT8_C( 107), INT8_C( -31), INT8_C( 11),
INT8_C( -45), INT8_C( 125), INT8_C( -53), INT8_C( -7),
INT8_C( 88), INT8_C(-111), INT8_C( 86), INT8_C(-105)),
simde_mm512_set_epi8(INT8_C( 8), INT8_C( -1), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( -7), INT8_C( 0), INT8_C( 1),
INT8_C( -3), INT8_C( 1), INT8_C( 0), INT8_C( -3),
INT8_C( 0), INT8_C( 0), INT8_C( -6), INT8_C( 0),
INT8_C( -3), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 1), INT8_C( -3), INT8_C( 0), INT8_C( -7),
INT8_C( -1), INT8_C( 3), INT8_C( 1), INT8_C( 0),
INT8_C( -15), INT8_C( -7), INT8_C( -1), INT8_C( 1),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 58), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( -2), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( -3), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( -3), INT8_C( 7),
INT8_C( 0), INT8_C( 0), INT8_C( 2), INT8_C( 10),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 0)) },
{ simde_mm512_set_epi8(INT8_C( 102), INT8_C( 35), INT8_C( 43), INT8_C( -33),
INT8_C( -74), INT8_C( 81), INT8_C( 81), INT8_C( 115),
INT8_C( -81), INT8_C( 72), INT8_C(-127), INT8_C( 118),
INT8_C(-113), INT8_C( 106), INT8_C( 25), INT8_C( 84),
INT8_C( -82), INT8_C( 58), INT8_C( 13), INT8_C( -38),
INT8_C( -3), INT8_C( 104), INT8_C( 85), INT8_C(-112),
INT8_C( -4), INT8_C( 52), INT8_C( -2), INT8_C( -64),
INT8_C( -23), INT8_C( 5), INT8_C( 33), INT8_C( -11),
INT8_C( 116), INT8_C( 110), INT8_C( 21), INT8_C( 84),
INT8_C( 42), INT8_C( 77), INT8_C( 25), INT8_C( 68),
INT8_C( 71), INT8_C( 60), INT8_C( -51), INT8_C( -46),
INT8_C( -1), INT8_C( -12), INT8_C( 88), INT8_C( 19),
INT8_C( -70), INT8_C( 27), INT8_C( -6), INT8_C( 61),
INT8_C( -48), INT8_C( 119), INT8_C(-107), INT8_C(-115),
INT8_C( 90), INT8_C( 64), INT8_C( 19), INT8_C( 64),
INT8_C( -19), INT8_C( -7), INT8_C( 40), INT8_C( -68)),
simde_mm512_set_epi8(INT8_C( 66), INT8_C( 58), INT8_C( 74), INT8_C( -51),
INT8_C( -69), INT8_C( -59), INT8_C( 84), INT8_C( 27),
INT8_C( 43), INT8_C( -40), INT8_C( -56), INT8_C( 125),
INT8_C( 1), INT8_C( 92), INT8_C( -82), INT8_C( 49),
INT8_C( -14), INT8_C( 14), INT8_C( 52), INT8_C( -25),
INT8_C( 47), INT8_C( -55), INT8_C( -54), INT8_C( -50),
INT8_C( -40), INT8_C(-118), INT8_C( 97), INT8_C( -86),
INT8_C( 93), INT8_C( 116), INT8_C( -54), INT8_C(-127),
INT8_C( 17), INT8_C( -57), INT8_C( -81), INT8_C( -49),
INT8_C( 73), INT8_C( 79), INT8_C( -43), INT8_C( 61),
INT8_C( -14), INT8_C( 18), INT8_C( 125), INT8_C( -11),
INT8_C( -70), INT8_C( 81), INT8_C(-107), INT8_C( -13),
INT8_C( -75), INT8_C( 46), INT8_C( 17), INT8_C( -39),
INT8_C( -35), INT8_C( 57), INT8_C( -8), INT8_C( -62),
INT8_C( -61), INT8_C( 118), INT8_C( -33), INT8_C( 116),
INT8_C( -5), INT8_C( 120), INT8_C( 126), INT8_C( -48)),
simde_mm512_set_epi8(INT8_C( 1), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 1), INT8_C( -1), INT8_C( 0), INT8_C( 4),
INT8_C( -1), INT8_C( -1), INT8_C( 2), INT8_C( 0),
INT8_C(-113), INT8_C( 1), INT8_C( 0), INT8_C( 1),
INT8_C( 5), INT8_C( 4), INT8_C( 0), INT8_C( 1),
INT8_C( 0), INT8_C( -1), INT8_C( -1), INT8_C( 2),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 6), INT8_C( -1), INT8_C( 0), INT8_C( -1),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( 1),
INT8_C( -5), INT8_C( 3), INT8_C( 0), INT8_C( 4),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( -1),
INT8_C( 0), INT8_C( 0), INT8_C( 0), INT8_C( -1),
INT8_C( 1), INT8_C( 2), INT8_C( 13), INT8_C( 1),
INT8_C( -1), INT8_C( 0), INT8_C( 0), INT8_C( 0),
INT8_C( 3), INT8_C( 0), INT8_C( 0), INT8_C( 1)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_div_epi8(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_i8(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_div_epi16(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_mm512_set_epi16(INT16_C(-20040), INT16_C( 8356), INT16_C(-32332), INT16_C( 10333),
INT16_C( -5915), INT16_C( 26879), INT16_C( 2532), INT16_C( 21861),
INT16_C(-27724), INT16_C(-13980), INT16_C(-30566), INT16_C(-12851),
INT16_C( 30608), INT16_C( 27665), INT16_C( 548), INT16_C( 7224),
INT16_C(-23312), INT16_C( -9410), INT16_C( 2838), INT16_C(-28448),
INT16_C( 30003), INT16_C(-15914), INT16_C(-27549), INT16_C( 6027),
INT16_C( 28687), INT16_C(-19881), INT16_C( 5735), INT16_C( 9519),
INT16_C( -3746), INT16_C(-25453), INT16_C(-16345), INT16_C(-27291)),
simde_mm512_set_epi16(INT16_C( 4335), INT16_C( -8694), INT16_C( 20589), INT16_C( -2761),
INT16_C( -3216), INT16_C(-24783), INT16_C(-17777), INT16_C( -501),
INT16_C( 25504), INT16_C( 26559), INT16_C( 27843), INT16_C( 31769),
INT16_C(-18807), INT16_C( 5762), INT16_C(-26736), INT16_C( 14349),
INT16_C(-15519), INT16_C( 4924), INT16_C(-19685), INT16_C( 31074),
INT16_C(-20201), INT16_C( -4452), INT16_C( 11125), INT16_C( 19762),
INT16_C(-31890), INT16_C(-20519), INT16_C(-27796), INT16_C( 4844),
INT16_C( 1980), INT16_C(-25222), INT16_C(-27366), INT16_C( 20455)),
simde_mm512_set_epi16(INT16_C( -4), INT16_C( 0), INT16_C( -1), INT16_C( -3),
INT16_C( 1), INT16_C( -1), INT16_C( 0), INT16_C( -43),
INT16_C( -1), INT16_C( 0), INT16_C( -1), INT16_C( 0),
INT16_C( -1), INT16_C( 4), INT16_C( 0), INT16_C( 0),
INT16_C( 1), INT16_C( -1), INT16_C( 0), INT16_C( 0),
INT16_C( -1), INT16_C( 3), INT16_C( -2), INT16_C( 0),
INT16_C( 0), INT16_C( 0), INT16_C( 0), INT16_C( 1),
INT16_C( -1), INT16_C( 1), INT16_C( 0), INT16_C( -1)) },
{ simde_mm512_set_epi16(INT16_C( 30542), INT16_C(-21686), INT16_C(-12987), INT16_C(-10637),
INT16_C( -1601), INT16_C(-28302), INT16_C( 15211), INT16_C(-14111),
INT16_C( 25976), INT16_C( 21242), INT16_C(-23929), INT16_C(-19059),
INT16_C(-25081), INT16_C( 5942), INT16_C(-21376), INT16_C( 4770),
INT16_C( -1129), INT16_C(-19990), INT16_C( 26476), INT16_C(-29290),
INT16_C(-16617), INT16_C(-24641), INT16_C( 13060), INT16_C(-26392),
INT16_C(-31122), INT16_C( 1166), INT16_C(-13169), INT16_C( 10959),
INT16_C( 3043), INT16_C(-24353), INT16_C(-25618), INT16_C( 3998)),
simde_mm512_set_epi16(INT16_C( 8697), INT16_C( 4862), INT16_C(-26319), INT16_C(-11370),
INT16_C( 4314), INT16_C(-16926), INT16_C( 26882), INT16_C( 8784),
INT16_C(-23412), INT16_C( 6784), INT16_C( 27807), INT16_C( 29358),
INT16_C( 28774), INT16_C( -1248), INT16_C( 14871), INT16_C( 4639),
INT16_C( 17536), INT16_C( -3921), INT16_C(-31860), INT16_C( 18313),
INT16_C( 13025), INT16_C(-15494), INT16_C( -6838), INT16_C(-31563),
INT16_C( 10488), INT16_C( 29317), INT16_C( 5913), INT16_C( -5447),
INT16_C( 11124), INT16_C(-18588), INT16_C(-20055), INT16_C( 31068)),
simde_mm512_set_epi16(INT16_C( 3), INT16_C( -4), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( 1), INT16_C( 0), INT16_C( -1),
INT16_C( -1), INT16_C( 3), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( -4), INT16_C( -1), INT16_C( 1),
INT16_C( 0), INT16_C( 5), INT16_C( 0), INT16_C( -1),
INT16_C( -1), INT16_C( 1), INT16_C( -1), INT16_C( 0),
INT16_C( -2), INT16_C( 0), INT16_C( -2), INT16_C( -2),
INT16_C( 0), INT16_C( 1), INT16_C( 1), INT16_C( 0)) },
{ simde_mm512_set_epi16(INT16_C( 10506), INT16_C( 27276), INT16_C( 10689), INT16_C( 7669),
INT16_C( -9146), INT16_C(-17193), INT16_C( 7411), INT16_C( 5177),
INT16_C( 18940), INT16_C(-16405), INT16_C( 3246), INT16_C( 3104),
INT16_C( -7140), INT16_C( 31568), INT16_C( -2399), INT16_C(-28909),
INT16_C( 26564), INT16_C(-28507), INT16_C( 3797), INT16_C( -9359),
INT16_C(-12946), INT16_C( 18074), INT16_C( -6465), INT16_C( 3679),
INT16_C( 17483), INT16_C( -5905), INT16_C( 3591), INT16_C(-20227),
INT16_C( -6079), INT16_C( -1639), INT16_C(-29076), INT16_C( 29393)),
simde_mm512_set_epi16(INT16_C( 11630), INT16_C( 9206), INT16_C(-15696), INT16_C( 3180),
INT16_C( 12868), INT16_C(-30976), INT16_C( -5774), INT16_C(-11992),
INT16_C(-18085), INT16_C( 32470), INT16_C( 17470), INT16_C(-31399),
INT16_C( 9368), INT16_C( 3571), INT16_C( 7161), INT16_C(-27278),
INT16_C( 9802), INT16_C( 20270), INT16_C(-19501), INT16_C( 19621),
INT16_C( 14613), INT16_C( -6394), INT16_C( -6716), INT16_C( -8239),
INT16_C(-25839), INT16_C( 28062), INT16_C( -8851), INT16_C(-12431),
INT16_C( -8955), INT16_C( -676), INT16_C( 10256), INT16_C( 15625)),
simde_mm512_set_epi16(INT16_C( 0), INT16_C( 2), INT16_C( 0), INT16_C( 2),
INT16_C( 0), INT16_C( 0), INT16_C( -1), INT16_C( 0),
INT16_C( -1), INT16_C( 0), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( 8), INT16_C( 0), INT16_C( 1),
INT16_C( 2), INT16_C( -1), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( -2), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( 0), INT16_C( 0), INT16_C( 1),
INT16_C( 0), INT16_C( 2), INT16_C( -2), INT16_C( 1)) },
{ simde_mm512_set_epi16(INT16_C( 14453), INT16_C(-27323), INT16_C( 14069), INT16_C(-15038),
INT16_C( 29890), INT16_C(-32496), INT16_C( -8033), INT16_C( 2034),
INT16_C( 28252), INT16_C(-12993), INT16_C(-12172), INT16_C( 21268),
INT16_C(-19693), INT16_C( -3590), INT16_C( -7723), INT16_C(-15496),
INT16_C( -5494), INT16_C( 10297), INT16_C( 10325), INT16_C( 32003),
INT16_C(-11357), INT16_C( 14609), INT16_C(-13537), INT16_C( 17128),
INT16_C( 6812), INT16_C( 32194), INT16_C( 287), INT16_C( 5824),
INT16_C( 13352), INT16_C(-19334), INT16_C( 8294), INT16_C(-20267)),
simde_mm512_set_epi16(INT16_C(-10192), INT16_C(-26586), INT16_C( 32452), INT16_C( 4989),
INT16_C(-13693), INT16_C(-13838), INT16_C( 2151), INT16_C( 31183),
INT16_C(-12217), INT16_C( 28038), INT16_C( 27497), INT16_C(-25404),
INT16_C(-25184), INT16_C(-12134), INT16_C( 25347), INT16_C( -5075),
INT16_C( 19038), INT16_C( 9321), INT16_C(-20974), INT16_C( 22487),
INT16_C( -3253), INT16_C(-14033), INT16_C( 24624), INT16_C( 14772),
INT16_C( 16067), INT16_C(-16101), INT16_C( 12034), INT16_C( 11420),
INT16_C(-30652), INT16_C(-30195), INT16_C(-10496), INT16_C( 32407)),
simde_mm512_set_epi16(INT16_C( -1), INT16_C( 1), INT16_C( 0), INT16_C( -3),
INT16_C( -2), INT16_C( 2), INT16_C( -3), INT16_C( 0),
INT16_C( -2), INT16_C( 0), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( 0), INT16_C( 0), INT16_C( 3),
INT16_C( 0), INT16_C( 1), INT16_C( 0), INT16_C( 1),
INT16_C( 3), INT16_C( -1), INT16_C( 0), INT16_C( 1),
INT16_C( 0), INT16_C( -1), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( 0), INT16_C( 0), INT16_C( 0)) },
{ simde_mm512_set_epi16(INT16_C(-12762), INT16_C( -143), INT16_C( 24201), INT16_C( 27500),
INT16_C(-21606), INT16_C(-10954), INT16_C( 30460), INT16_C( 28331),
INT16_C(-22171), INT16_C(-30589), INT16_C( 16765), INT16_C(-17393),
INT16_C( 31673), INT16_C( 13306), INT16_C( -8624), INT16_C( -3653),
INT16_C(-23812), INT16_C( 2378), INT16_C( -6069), INT16_C( -8645),
INT16_C( 9750), INT16_C( 6252), INT16_C(-30407), INT16_C(-28082),
INT16_C(-14686), INT16_C( -5840), INT16_C( 24502), INT16_C( 12329),
INT16_C( -5959), INT16_C(-16932), INT16_C( -4867), INT16_C( 10388)),
simde_mm512_set_epi16(INT16_C(-30203), INT16_C(-31292), INT16_C( 7054), INT16_C( 31766),
INT16_C(-23643), INT16_C( -7634), INT16_C( 23958), INT16_C(-19164),
INT16_C( 32358), INT16_C( 32485), INT16_C( -8137), INT16_C( 2854),
INT16_C( 443), INT16_C( 3757), INT16_C(-31602), INT16_C( 26770),
INT16_C( 1434), INT16_C(-26880), INT16_C(-13137), INT16_C(-25600),
INT16_C( 3310), INT16_C( 31739), INT16_C( 22782), INT16_C( 27721),
INT16_C(-28215), INT16_C( 10286), INT16_C( 11994), INT16_C(-23317),
INT16_C(-11843), INT16_C( 6466), INT16_C( 8900), INT16_C( 11867)),
simde_mm512_set_epi16(INT16_C( 0), INT16_C( 0), INT16_C( 3), INT16_C( 0),
INT16_C( 0), INT16_C( 1), INT16_C( 1), INT16_C( -1),
INT16_C( 0), INT16_C( 0), INT16_C( -2), INT16_C( -6),
INT16_C( 71), INT16_C( 3), INT16_C( 0), INT16_C( 0),
INT16_C( -16), INT16_C( 0), INT16_C( 0), INT16_C( 0),
INT16_C( 2), INT16_C( 0), INT16_C( -1), INT16_C( -1),
INT16_C( 0), INT16_C( 0), INT16_C( 2), INT16_C( 0),
INT16_C( 0), INT16_C( -2), INT16_C( 0), INT16_C( 0)) },
{ simde_mm512_set_epi16(INT16_C(-29408), INT16_C( 7369), INT16_C( -5051), INT16_C( 7942),
INT16_C( 18019), INT16_C(-25065), INT16_C( -8302), INT16_C( 17011),
INT16_C( 2762), INT16_C( 27559), INT16_C( 18647), INT16_C( 22035),
INT16_C(-10618), INT16_C( -3223), INT16_C( 25352), INT16_C(-32696),
INT16_C( -1859), INT16_C(-20090), INT16_C( 18297), INT16_C(-27701),
INT16_C(-31478), INT16_C(-13300), INT16_C(-15493), INT16_C(-16792),
INT16_C(-23954), INT16_C(-14239), INT16_C(-15716), INT16_C( 12103),
INT16_C(-30330), INT16_C( -2111), INT16_C(-26781), INT16_C( 25851)),
simde_mm512_set_epi16(INT16_C( 11252), INT16_C(-25669), INT16_C(-31001), INT16_C( 13518),
INT16_C( 30845), INT16_C(-14200), INT16_C(-30880), INT16_C( 22795),
INT16_C(-15552), INT16_C( -1554), INT16_C( 29162), INT16_C( -8371),
INT16_C( 5731), INT16_C( 22086), INT16_C( 7870), INT16_C(-26229),
INT16_C( 19406), INT16_C(-22832), INT16_C(-14386), INT16_C( 22375),
INT16_C( -8274), INT16_C( -9174), INT16_C(-24184), INT16_C( 24847),
INT16_C( 26808), INT16_C( -2235), INT16_C( 4293), INT16_C(-30072),
INT16_C( 23713), INT16_C( 20910), INT16_C( 6378), INT16_C(-18450)),
simde_mm512_set_epi16(INT16_C( -2), INT16_C( 0), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( 1), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( -17), INT16_C( 0), INT16_C( -2),
INT16_C( -1), INT16_C( 0), INT16_C( 3), INT16_C( 1),
INT16_C( 0), INT16_C( 0), INT16_C( -1), INT16_C( -1),
INT16_C( 3), INT16_C( 1), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( 6), INT16_C( -3), INT16_C( 0),
INT16_C( -1), INT16_C( 0), INT16_C( -4), INT16_C( -1)) },
{ simde_mm512_set_epi16(INT16_C( -8644), INT16_C( 4438), INT16_C( 1025), INT16_C(-26642),
INT16_C( 18378), INT16_C(-13976), INT16_C( 21110), INT16_C( 14955),
INT16_C( 2525), INT16_C(-19773), INT16_C( 28133), INT16_C(-32693),
INT16_C( 12259), INT16_C(-21141), INT16_C(-27294), INT16_C( 16198),
INT16_C( -2640), INT16_C( 31144), INT16_C(-15827), INT16_C( 20747),
INT16_C(-19791), INT16_C( 30374), INT16_C( -9055), INT16_C(-20334),
INT16_C( 28339), INT16_C( 29800), INT16_C( 32312), INT16_C(-19316),
INT16_C(-15043), INT16_C(-27434), INT16_C( 29424), INT16_C(-25521)),
simde_mm512_set_epi16(INT16_C(-24272), INT16_C( -9025), INT16_C(-17538), INT16_C(-13789),
INT16_C( 3646), INT16_C( 17578), INT16_C( -9614), INT16_C(-11054),
INT16_C( 23757), INT16_C( -5736), INT16_C( 8067), INT16_C( 10531),
INT16_C(-24488), INT16_C( 16639), INT16_C(-22179), INT16_C( -8704),
INT16_C( -927), INT16_C(-31517), INT16_C( 10091), INT16_C( 19448),
INT16_C( 12069), INT16_C( 8742), INT16_C( 16653), INT16_C( 31958),
INT16_C(-18440), INT16_C(-30513), INT16_C( -3426), INT16_C( -7330),
INT16_C( 24804), INT16_C( 18228), INT16_C( 16072), INT16_C(-15326)),
simde_mm512_set_epi16(INT16_C( 0), INT16_C( 0), INT16_C( 0), INT16_C( 1),
INT16_C( 5), INT16_C( 0), INT16_C( -2), INT16_C( -1),
INT16_C( 0), INT16_C( 3), INT16_C( 3), INT16_C( -3),
INT16_C( 0), INT16_C( -1), INT16_C( 1), INT16_C( -1),
INT16_C( 2), INT16_C( 0), INT16_C( -1), INT16_C( 1),
INT16_C( -1), INT16_C( 3), INT16_C( 0), INT16_C( 0),
INT16_C( -1), INT16_C( 0), INT16_C( -9), INT16_C( 2),
INT16_C( 0), INT16_C( -1), INT16_C( 1), INT16_C( 1)) },
{ simde_mm512_set_epi16(INT16_C( 23232), INT16_C(-29257), INT16_C( 1254), INT16_C( -9317),
INT16_C(-20336), INT16_C( 10081), INT16_C( 18681), INT16_C( 12677),
INT16_C( 17973), INT16_C(-10276), INT16_C(-23503), INT16_C( 18772),
INT16_C( 8312), INT16_C( 15138), INT16_C( -9415), INT16_C(-23183),
INT16_C( 4065), INT16_C( 14928), INT16_C( -9505), INT16_C( -3213),
INT16_C( -8135), INT16_C(-17864), INT16_C(-23451), INT16_C( -2372),
INT16_C( 14548), INT16_C(-10992), INT16_C( 6282), INT16_C(-22066),
INT16_C(-11858), INT16_C( 14867), INT16_C( -6173), INT16_C( 24146)),
simde_mm512_set_epi16(INT16_C(-20244), INT16_C( 14874), INT16_C( 7829), INT16_C( 32218),
INT16_C( 17818), INT16_C( 309), INT16_C( 27668), INT16_C( 9211),
INT16_C( 15166), INT16_C( 4076), INT16_C( 28109), INT16_C(-30601),
INT16_C( 4803), INT16_C(-19074), INT16_C(-23287), INT16_C(-27917),
INT16_C( 7634), INT16_C(-13255), INT16_C( 14290), INT16_C( -8590),
INT16_C(-11602), INT16_C( 9361), INT16_C(-18559), INT16_C( 3976),
INT16_C( 20763), INT16_C( 17266), INT16_C( 8709), INT16_C(-30498),
INT16_C( 31994), INT16_C(-17983), INT16_C( 25233), INT16_C( 29991)),
simde_mm512_set_epi16(INT16_C( -1), INT16_C( -1), INT16_C( 0), INT16_C( 0),
INT16_C( -1), INT16_C( 32), INT16_C( 0), INT16_C( 1),
INT16_C( 1), INT16_C( -2), INT16_C( 0), INT16_C( 0),
INT16_C( 1), INT16_C( 0), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( -1), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( -1), INT16_C( 1), INT16_C( 0),
INT16_C( 0), INT16_C( 0), INT16_C( 0), INT16_C( 0),
INT16_C( 0), INT16_C( 0), INT16_C( 0), INT16_C( 0)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_div_epi16(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_i16(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_div_epi32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_mm512_set_epi32(INT32_C(-1425964510), INT32_C( 1884851068), INT32_C( -245085200), INT32_C( 312441627),
INT32_C( 1361020823), INT32_C( -269027644), INT32_C( 2046290516), INT32_C( 253262419),
INT32_C(-1435031175), INT32_C( -983397284), INT32_C( 1158205006), INT32_C( 2142968427),
INT32_C( -610621785), INT32_C(-1874018384), INT32_C( 408084487), INT32_C( 314643093)),
simde_mm512_set_epi32(INT32_C(-1816447538), INT32_C( 1352799684), INT32_C( 437452333), INT32_C(-2106809533),
INT32_C( 850823800), INT32_C(-1580883911), INT32_C(-2115707304), INT32_C( 1577531711),
INT32_C( 801246884), INT32_C( 59025302), INT32_C( 905783489), INT32_C(-1645941779),
INT32_C( 962943312), INT32_C( 2128170875), INT32_C(-1348448230), INT32_C( -975134432)),
simde_mm512_set_epi32(INT32_C( 0), INT32_C( 1), INT32_C( 0), INT32_C( 0),
INT32_C( 1), INT32_C( 0), INT32_C( 0), INT32_C( 0),
INT32_C( -1), INT32_C( -16), INT32_C( 1), INT32_C( -1),
INT32_C( 0), INT32_C( 0), INT32_C( 0), INT32_C( 0)) },
{ simde_mm512_set_epi32(INT32_C( 1427225802), INT32_C(-1035302594), INT32_C( -199744603), INT32_C( 1376388625),
INT32_C(-2114897409), INT32_C( 1679349706), INT32_C(-1031333846), INT32_C(-1198347443),
INT32_C( -637748341), INT32_C( 1314591131), INT32_C( 282479090), INT32_C( 1660196054),
INT32_C(-1167126507), INT32_C(-1998854068), INT32_C( 933881032), INT32_C( -624384653)),
simde_mm512_set_epi32(INT32_C( 1612321322), INT32_C( 2051698478), INT32_C( 1596883036), INT32_C(-1369467325),
INT32_C( 1851004364), INT32_C( 1092388812), INT32_C( 828772877), INT32_C( -259189725),
INT32_C( -849691191), INT32_C(-1191458488), INT32_C( 801339023), INT32_C( -104328386),
INT32_C( 757083857), INT32_C(-1236967236), INT32_C( -850146114), INT32_C( 1258625824)),
simde_mm512_set_epi32(INT32_C( 0), INT32_C( 0), INT32_C( 0), INT32_C( -1),
INT32_C( -1), INT32_C( 1), INT32_C( -1), INT32_C( 4),
INT32_C( 0), INT32_C( -1), INT32_C( 0), INT32_C( -15),
INT32_C( -1), INT32_C( 1), INT32_C( -1), INT32_C( 0)) },
{ simde_mm512_set_epi32(INT32_C( 237418199), INT32_C( -70579339), INT32_C(-2042257710), INT32_C( 1462546998),
INT32_C( -202189538), INT32_C(-1353367648), INT32_C( 304511606), INT32_C( -539003093),
INT32_C( 1923205305), INT32_C( 464427515), INT32_C( -694421636), INT32_C(-1729085762),
INT32_C( 1377800186), INT32_C( -626233146), INT32_C(-2090091895), INT32_C( 1314335058)),
simde_mm512_set_epi32(INT32_C( 38009422), INT32_C( -855531694), INT32_C( 1096529400), INT32_C( 740723389),
INT32_C( -703601695), INT32_C(-1082310854), INT32_C( 120520136), INT32_C( 494300544),
INT32_C(-1280011607), INT32_C(-1943894617), INT32_C( -321878744), INT32_C( -690430536),
INT32_C( 1135419008), INT32_C( 1818004981), INT32_C( 1471877533), INT32_C( 559240384)),
simde_mm512_set_epi32(INT32_C( 6), INT32_C( 0), INT32_C( -1), INT32_C( 1),
INT32_C( 0), INT32_C( 1), INT32_C( 2), INT32_C( -1),
INT32_C( -1), INT32_C( 0), INT32_C( 2), INT32_C( 2),
INT32_C( 1), INT32_C( 0), INT32_C( -1), INT32_C( 2)) },
{ simde_mm512_set_epi32(INT32_C(-1724745069), INT32_C( 1135206576), INT32_C( 1179583658), INT32_C(-1966673560),
INT32_C( 876279100), INT32_C( -587502732), INT32_C( -149418425), INT32_C( -921830900),
INT32_C( 17215575), INT32_C(-1719497158), INT32_C(-1349196793), INT32_C( 1245762398),
INT32_C( 813297065), INT32_C( -835921648), INT32_C(-1975778091), INT32_C( 2110087211)),
simde_mm512_set_epi32(INT32_C(-1421142882), INT32_C( -720107087), INT32_C( -533473336), INT32_C(-1235553858),
INT32_C( 1997884077), INT32_C(-1507361050), INT32_C( 21786729), INT32_C( 743816821),
INT32_C( 150690827), INT32_C(-1210873139), INT32_C( 1036977320), INT32_C( -399295069),
INT32_C(-1569884506), INT32_C( -616191901), INT32_C(-1839631465), INT32_C( -912247900)),
simde_mm512_set_epi32(INT32_C( 1), INT32_C( -1), INT32_C( -2), INT32_C( 1),
INT32_C( 0), INT32_C( 0), INT32_C( -6), INT32_C( -1),
INT32_C( 0), INT32_C( 1), INT32_C( -1), INT32_C( -3),
INT32_C( 0), INT32_C( 1), INT32_C( 1), INT32_C( -2)) },
{ simde_mm512_set_epi32(INT32_C( -788754092), INT32_C( 1871593252), INT32_C(-1494005905), INT32_C(-1673341020),
INT32_C( -802349852), INT32_C( 1483795222), INT32_C( -482009835), INT32_C( -91245467),
INT32_C( 1580169915), INT32_C( 692091070), INT32_C( 1863695169), INT32_C( -863865867),
INT32_C(-1394651654), INT32_C( -860864123), INT32_C( 684761994), INT32_C(-1721896503)),
simde_mm512_set_epi32(INT32_C(-1337054377), INT32_C( 66234694), INT32_C(-1856118156), INT32_C(-1127800230),
INT32_C( 814009506), INT32_C(-2034345199), INT32_C( 1765405247), INT32_C(-1048066647),
INT32_C( -423083536), INT32_C(-1848382006), INT32_C( -152706477), INT32_C(-1375856509),
INT32_C( -23675804), INT32_C( -242644348), INT32_C( 1836148713), INT32_C( -17324905)),
simde_mm512_set_epi32(INT32_C( 0), INT32_C( 28), INT32_C( 0), INT32_C( 1),
INT32_C( 0), INT32_C( 0), INT32_C( 0), INT32_C( 0),
INT32_C( -3), INT32_C( 0), INT32_C( -12), INT32_C( 0),
INT32_C( 58), INT32_C( 3), INT32_C( 0), INT32_C( 99)) },
{ simde_mm512_set_epi32(INT32_C( -463247298), INT32_C( -951467140), INT32_C( 1433027324), INT32_C(-1349535490),
INT32_C( -916446608), INT32_C(-1679952824), INT32_C( 515026148), INT32_C( -79374441),
INT32_C(-1055204414), INT32_C( 1214763982), INT32_C( -351626877), INT32_C( 427209663),
INT32_C( 1651021910), INT32_C( -181051643), INT32_C(-1481830173), INT32_C( 1285378207)),
simde_mm512_set_epi32(INT32_C( -895026020), INT32_C(-2124493776), INT32_C( -806312731), INT32_C( 721610054),
INT32_C( 677519448), INT32_C( 1470235459), INT32_C(-2123699180), INT32_C( 883454038),
INT32_C(-2020088518), INT32_C( -300465294), INT32_C( 1493254397), INT32_C( 2062995345),
INT32_C( -10095941), INT32_C(-1400374264), INT32_C( 1068728589), INT32_C( 234142625)),
simde_mm512_set_epi32(INT32_C( 0), INT32_C( 0), INT32_C( -1), INT32_C( -1),
INT32_C( -1), INT32_C( -1), INT32_C( 0), INT32_C( 0),
INT32_C( 0), INT32_C( -4), INT32_C( 0), INT32_C( 0),
INT32_C( -163), INT32_C( 0), INT32_C( -1), INT32_C( 5)) },
{ simde_mm512_set_epi32(INT32_C( -939190848), INT32_C(-2083825761), INT32_C( 2014997186), INT32_C( 790185633),
INT32_C(-1507225536), INT32_C( -384122450), INT32_C(-1588213257), INT32_C(-1040817544),
INT32_C( 1965628193), INT32_C(-2067530457), INT32_C( 1204204418), INT32_C( -39160501),
INT32_C( -605764870), INT32_C( 561973657), INT32_C( 1912174450), INT32_C( 1415728252)),
simde_mm512_set_epi32(INT32_C( -927506034), INT32_C( 155586444), INT32_C( -406884871), INT32_C( -252994257),
INT32_C( 1219028873), INT32_C(-1972688074), INT32_C( -597390303), INT32_C( 291669377),
INT32_C( -695882735), INT32_C( 879590202), INT32_C( 1348714758), INT32_C( 1712617745),
INT32_C( -236530514), INT32_C( 1880792230), INT32_C( 1810070042), INT32_C(-1599785869)),
simde_mm512_set_epi32(INT32_C( 1), INT32_C( -13), INT32_C( -4), INT32_C( -3),
INT32_C( -1), INT32_C( 0), INT32_C( 2), INT32_C( -3),
INT32_C( -2), INT32_C( -2), INT32_C( 0), INT32_C( 0),
INT32_C( 2), INT32_C( 0), INT32_C( 1), INT32_C( 0)) },
{ simde_mm512_set_epi32(INT32_C(-1601700614), INT32_C( 1985924496), INT32_C( -342633815), INT32_C(-2007999861),
INT32_C( 297828713), INT32_C( 1383645848), INT32_C(-2056044415), INT32_C( 373512753),
INT32_C( -26545593), INT32_C( -328575199), INT32_C( -462276628), INT32_C( 1976153041),
INT32_C( 1430984961), INT32_C(-1934079238), INT32_C( 399344654), INT32_C( 1569206763)),
simde_mm512_set_epi32(INT32_C( 102595444), INT32_C( 731375272), INT32_C(-1673993680), INT32_C( -406822977),
INT32_C( -578959028), INT32_C( 1173139127), INT32_C(-1295304556), INT32_C( 955166905),
INT32_C( 270270084), INT32_C( 134608446), INT32_C( -519669996), INT32_C( -265658570),
INT32_C(-1584344142), INT32_C( 1279036686), INT32_C(-1076842770), INT32_C( -44502324)),
simde_mm512_set_epi32(INT32_C( -15), INT32_C( 2), INT32_C( 0), INT32_C( 4),
INT32_C( 0), INT32_C( 1), INT32_C( 1), INT32_C( 0),
INT32_C( 0), INT32_C( -2), INT32_C( 0), INT32_C( -7),
INT32_C( 0), INT32_C( -1), INT32_C( 0), INT32_C( -35)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_div_epi32(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_i32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_mask_div_epi32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i src;
simde__mmask16 k;
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_mm512_set_epi32(INT32_C( 691121094), INT32_C( 674034227), INT32_C(-1965434887), INT32_C( -920286947),
INT32_C( -374673026), INT32_C(-1240805178), INT32_C( 1568850865), INT32_C(-1142977539),
INT32_C(-1079516608), INT32_C( -708153743), INT32_C( 1508722402), INT32_C(-2074345640),
INT32_C( 1747596798), INT32_C(-2063703989), INT32_C( 527472553), INT32_C(-1403096998)),
UINT16_C(63371),
simde_mm512_set_epi32(INT32_C( -341007878), INT32_C(-1764810870), INT32_C( 1179683687), INT32_C(-1646326602),
INT32_C( -671967289), INT32_C(-1586327268), INT32_C( 1691051285), INT32_C( 50347892),
INT32_C( 728425428), INT32_C( 1192263444), INT32_C(-2086343723), INT32_C( 1322777130),
INT32_C( 163989560), INT32_C( 1492341726), INT32_C( 298608154), INT32_C( 1250819173)),
simde_mm512_set_epi32(INT32_C(-1291033589), INT32_C( 1314482530), INT32_C(-1297250617), INT32_C( -739008036),
INT32_C(-1419039999), INT32_C(-1004264650), INT32_C( 1580565751), INT32_C( -471064457),
INT32_C( 2081361826), INT32_C( 493161721), INT32_C(-1195115819), INT32_C( 894221337),
INT32_C(-1330460172), INT32_C( 492373082), INT32_C( -13096811), INT32_C(-2087181083)),
simde_mm512_set_epi32(INT32_C( 0), INT32_C( -1), INT32_C( 0), INT32_C( 2),
INT32_C( -374673026), INT32_C( 1), INT32_C( 1), INT32_C( 0),
INT32_C( 0), INT32_C( -708153743), INT32_C( 1508722402), INT32_C(-2074345640),
INT32_C( 0), INT32_C(-2063703989), INT32_C( -22), INT32_C( 0)) },
{ simde_mm512_set_epi32(INT32_C( 1779168063), INT32_C(-1138893231), INT32_C( -687161637), INT32_C( 1828175063),
INT32_C( -389420023), INT32_C( -193211433), INT32_C( -857989172), INT32_C( -448329300),
INT32_C(-1601364212), INT32_C( 1710148738), INT32_C( 1974123080), INT32_C(-1424367196),
INT32_C( 118588227), INT32_C( 542053192), INT32_C( 499863549), INT32_C( 957375358)),
UINT16_C(36797),
simde_mm512_set_epi32(INT32_C(-1153303869), INT32_C( 562234020), INT32_C( 1763100483), INT32_C( -518004559),
INT32_C(-1450358898), INT32_C(-1409866198), INT32_C( 269910347), INT32_C( 433971495),
INT32_C( 1441956227), INT32_C( 1018271575), INT32_C( 1734496959), INT32_C( 380846712),
INT32_C( -941967689), INT32_C( -739443621), INT32_C( 1995198557), INT32_C( -980655097)),
simde_mm512_set_epi32(INT32_C(-2088961787), INT32_C( 1943141679), INT32_C( -665465241), INT32_C( -342195833),
INT32_C( 2102184556), INT32_C( 877111492), INT32_C( 1183491905), INT32_C( -576610979),
INT32_C(-1061316197), INT32_C( -808097400), INT32_C( -362876916), INT32_C(-1845390533),
INT32_C( -48621016), INT32_C( 201516689), INT32_C(-1435930720), INT32_C(-1932876068)),
simde_mm512_set_epi32(INT32_C( 0), INT32_C(-1138893231), INT32_C( -687161637), INT32_C( 1828175063),
INT32_C( 0), INT32_C( -1), INT32_C( 0), INT32_C( 0),
INT32_C( -1), INT32_C( 1710148738), INT32_C( -4), INT32_C( 0),
INT32_C( 19), INT32_C( -3), INT32_C( 499863549), INT32_C( 0)) },
{ simde_mm512_set_epi32(INT32_C( -179829877), INT32_C( 651362699), INT32_C( 495870887), INT32_C( -382126427),
INT32_C( 915244711), INT32_C( 5081424), INT32_C( 1422501384), INT32_C( -163979724),
INT32_C(-1516900265), INT32_C( 497965579), INT32_C( 910061584), INT32_C( 2002226944),
INT32_C( -621963189), INT32_C( -48343218), INT32_C( 523093293), INT32_C(-1235205724)),
UINT16_C(46902),
simde_mm512_set_epi32(INT32_C( -220620904), INT32_C( 1398655610), INT32_C( 1722520923), INT32_C( 1206471293),
INT32_C( 1374915518), INT32_C( 531653117), INT32_C( 2075187308), INT32_C( -144618549),
INT32_C(-2131865715), INT32_C( 1444783055), INT32_C( 1878625233), INT32_C( 1755684145),
INT32_C(-2061726371), INT32_C(-1050443653), INT32_C(-1299940555), INT32_C(-2116696545)),
simde_mm512_set_epi32(INT32_C(-1106093489), INT32_C( 1982658188), INT32_C( 863153207), INT32_C(-1637276628),
INT32_C( 448681074), INT32_C( 1334667053), INT32_C( 502667641), INT32_C( 855395764),
INT32_C(-1672092948), INT32_C( 808531712), INT32_C( 454488139), INT32_C( 123547093),
INT32_C( 483090439), INT32_C(-1126329757), INT32_C(-1201220189), INT32_C( -136050629)),
simde_mm512_set_epi32(INT32_C( 0), INT32_C( 651362699), INT32_C( 1), INT32_C( 0),
INT32_C( 915244711), INT32_C( 0), INT32_C( 4), INT32_C( 0),
INT32_C(-1516900265), INT32_C( 497965579), INT32_C( 4), INT32_C( 14),
INT32_C( -621963189), INT32_C( 0), INT32_C( 1), INT32_C(-1235205724)) },
{ simde_mm512_set_epi32(INT32_C( 2113970745), INT32_C( -182128842), INT32_C( 564512596), INT32_C( 604721400),
INT32_C( 1471174399), INT32_C(-1803940708), INT32_C(-1765392929), INT32_C( 298473775),
INT32_C(-1404600737), INT32_C(-1231334921), INT32_C( -238983338), INT32_C( -145797796),
INT32_C( -181019162), INT32_C(-1910480170), INT32_C(-1860760170), INT32_C( -371855625)),
UINT16_C(38914),
simde_mm512_set_epi32(INT32_C( 1533151625), INT32_C( 2122196136), INT32_C( 1690360675), INT32_C( 1484935627),
INT32_C( 1463758672), INT32_C( 602211615), INT32_C( -464964305), INT32_C(-1430226195),
INT32_C( 797104998), INT32_C(-1557543977), INT32_C( -952737410), INT32_C( 178625368),
INT32_C(-1203806300), INT32_C( 1095216728), INT32_C(-1215405554), INT32_C( 430790402)),
simde_mm512_set_epi32(INT32_C( -251141702), INT32_C( 1274901810), INT32_C( 413860084), INT32_C( 550494320),
INT32_C( 1997049765), INT32_C( 505563651), INT32_C( 463125220), INT32_C( -451213519),
INT32_C(-1948793453), INT32_C(-2137102362), INT32_C(-1703809327), INT32_C( 389679318),
INT32_C( -355192167), INT32_C(-1801602389), INT32_C( 2006619059), INT32_C( -903558132)),
simde_mm512_set_epi32(INT32_C( -6), INT32_C( -182128842), INT32_C( 564512596), INT32_C( 2),
INT32_C( 0), INT32_C(-1803940708), INT32_C(-1765392929), INT32_C( 298473775),
INT32_C(-1404600737), INT32_C(-1231334921), INT32_C( -238983338), INT32_C( -145797796),
INT32_C( -181019162), INT32_C(-1910480170), INT32_C( 0), INT32_C( -371855625)) },
{ simde_mm512_set_epi32(INT32_C( 1572579389), INT32_C( -783078337), INT32_C(-1895621282), INT32_C( 1967093325),
INT32_C( 908815803), INT32_C(-1975591270), INT32_C( 2065037155), INT32_C( 623932649),
INT32_C( 1610322797), INT32_C( -842122991), INT32_C( 2031682359), INT32_C(-1300130353),
INT32_C(-1950048210), INT32_C( 238137788), INT32_C( 1978166020), INT32_C( 76768592)),
UINT16_C( 883),
simde_mm512_set_epi32(INT32_C(-1010119490), INT32_C( -410070063), INT32_C( 2094036024), INT32_C(-1838133114),
INT32_C( 69201629), INT32_C( 1228958503), INT32_C( -775379327), INT32_C(-1485462767),
INT32_C(-1179177847), INT32_C( 1767270276), INT32_C( 490610321), INT32_C( 1164436618),
INT32_C(-1920297499), INT32_C( -690964678), INT32_C( -880248267), INT32_C(-2005634277)),
simde_mm512_set_epi32(INT32_C(-1911659531), INT32_C( 143428987), INT32_C( -610024215), INT32_C( 582607980),
INT32_C( 1609326889), INT32_C( 1245407235), INT32_C( -119962198), INT32_C(-1932052969),
INT32_C(-1370414254), INT32_C(-1925960308), INT32_C( 2119408419), INT32_C(-1203088886),
INT32_C( -316530353), INT32_C( 1708684203), INT32_C( 1202455481), INT32_C(-2107221827)),
simde_mm512_set_epi32(INT32_C( 1572579389), INT32_C( -783078337), INT32_C(-1895621282), INT32_C( 1967093325),
INT32_C( 908815803), INT32_C(-1975591270), INT32_C( 6), INT32_C( 0),
INT32_C( 1610322797), INT32_C( 0), INT32_C( 0), INT32_C( 0),
INT32_C(-1950048210), INT32_C( 238137788), INT32_C( 0), INT32_C( 0)) },
{ simde_mm512_set_epi32(INT32_C( 2117071873), INT32_C(-1437889529), INT32_C( -376074104), INT32_C( 1087893388),
INT32_C( -443183285), INT32_C( -380695552), INT32_C( 565328458), INT32_C( -93024748),
INT32_C( 1480532604), INT32_C( -97460760), INT32_C( -582247600), INT32_C( -374749470),
INT32_C( 1394313506), INT32_C( 394553965), INT32_C(-2016714120), INT32_C( 1697927724)),
UINT16_C(12254),
simde_mm512_set_epi32(INT32_C( 56443211), INT32_C(-2036514643), INT32_C( -510270824), INT32_C( 1139427205),
INT32_C( 1090384090), INT32_C(-1905231405), INT32_C(-2079359983), INT32_C( -477294891),
INT32_C( -673197028), INT32_C( 2071747620), INT32_C( -442789099), INT32_C( -601334711),
INT32_C( 319530416), INT32_C(-2115012481), INT32_C( -501730903), INT32_C( 340519338)),
simde_mm512_set_epi32(INT32_C( 1219537084), INT32_C( 1349635715), INT32_C( 732887738), INT32_C(-1728641921),
INT32_C(-1388433411), INT32_C( 1765754685), INT32_C(-1574983663), INT32_C( 846129112),
INT32_C( 1578410935), INT32_C(-1659872458), INT32_C( 1045536663), INT32_C( 957117985),
INT32_C(-1265958651), INT32_C( 1309498779), INT32_C(-1001015299), INT32_C( 1022360677)),
simde_mm512_set_epi32(INT32_C( 2117071873), INT32_C(-1437889529), INT32_C( 0), INT32_C( 1087893388),
INT32_C( 0), INT32_C( -1), INT32_C( 1), INT32_C( 0),
INT32_C( 0), INT32_C( -1), INT32_C( -582247600), INT32_C( 0),
INT32_C( 0), INT32_C( -1), INT32_C( 0), INT32_C( 1697927724)) },
{ simde_mm512_set_epi32(INT32_C( -304885978), INT32_C( 991545752), INT32_C( -143034937), INT32_C( 843112042),
INT32_C( -227554783), INT32_C( 2124182542), INT32_C(-1526246088), INT32_C(-1991977382),
INT32_C( 1224533822), INT32_C( -819361196), INT32_C( -684010252), INT32_C(-1738921185),
INT32_C(-1259570772), INT32_C( -691865929), INT32_C( -973523371), INT32_C( 45581573)),
UINT16_C(42669),
simde_mm512_set_epi32(INT32_C( -156799603), INT32_C(-1073012339), INT32_C(-2130532125), INT32_C( 397240391),
INT32_C( 200936922), INT32_C(-1030980309), INT32_C(-1758363174), INT32_C( -665586367),
INT32_C( 453331046), INT32_C( 1704580573), INT32_C( 1606190487), INT32_C(-1085658047),
INT32_C(-1335469644), INT32_C( -368070561), INT32_C(-1419559633), INT32_C( 2069966669)),
simde_mm512_set_epi32(INT32_C( 1379668640), INT32_C( 66581512), INT32_C( -557301797), INT32_C( 304428974),
INT32_C(-1608262788), INT32_C( 532978979), INT32_C( 946958552), INT32_C(-1911324669),
INT32_C(-2118093156), INT32_C( 283691898), INT32_C( -446072631), INT32_C( -458781294),
INT32_C( 1951055651), INT32_C( 765387914), INT32_C( 822559116), INT32_C( 7445617)),
simde_mm512_set_epi32(INT32_C( 0), INT32_C( 991545752), INT32_C( 3), INT32_C( 843112042),
INT32_C( -227554783), INT32_C( -1), INT32_C( -1), INT32_C(-1991977382),
INT32_C( 0), INT32_C( -819361196), INT32_C( -3), INT32_C(-1738921185),
INT32_C( 0), INT32_C( 0), INT32_C( -973523371), INT32_C( 278)) },
{ simde_mm512_set_epi32(INT32_C(-1981938926), INT32_C( 869237081), INT32_C( -190053534), INT32_C(-1469275330),
INT32_C( -717100794), INT32_C(-1303072888), INT32_C(-2122918671), INT32_C( 1617119933),
INT32_C( 1521363431), INT32_C( 553638116), INT32_C( 1036201367), INT32_C(-1187933851),
INT32_C( -412155886), INT32_C( -760582943), INT32_C( -423751457), INT32_C( 1273589632)),
UINT16_C(35103),
simde_mm512_set_epi32(INT32_C(-1836595644), INT32_C( 260676470), INT32_C( 1724614860), INT32_C( -144514633),
INT32_C( -478630580), INT32_C(-2086755061), INT32_C( 932145867), INT32_C(-1862372735),
INT32_C( 1756892633), INT32_C( 382632965), INT32_C( 1295078740), INT32_C( -995802034),
INT32_C( 152308919), INT32_C( -351555508), INT32_C( 31813624), INT32_C( 807463845)),
simde_mm512_set_epi32(INT32_C( 615301803), INT32_C( 382786341), INT32_C( 1852603705), INT32_C( 1998007730),
INT32_C( 231325888), INT32_C( 1842039329), INT32_C( 968682756), INT32_C( 316335394),
INT32_C(-2071382094), INT32_C( -803185337), INT32_C(-2126995500), INT32_C( 1587647099),
INT32_C(-1328358584), INT32_C( 320339033), INT32_C( 282380179), INT32_C( -108102092)),
simde_mm512_set_epi32(INT32_C( -2), INT32_C( 869237081), INT32_C( -190053534), INT32_C(-1469275330),
INT32_C( -2), INT32_C(-1303072888), INT32_C(-2122918671), INT32_C( -5),
INT32_C( 1521363431), INT32_C( 553638116), INT32_C( 1036201367), INT32_C( 0),
INT32_C( 0), INT32_C( -1), INT32_C( 0), INT32_C( -7)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_mask_div_epi32(test_vec[i].src, test_vec[i].k, test_vec[i].a, test_vec[i].b);
simde_assert_m512i_i32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_div_epi64(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_mm512_set_epi64(INT64_C(-7120494377185439159), INT64_C( 5095015079852768951),
INT64_C( -719755322986504865), INT64_C( 1195398499335632561),
INT64_C( 4232475372952240435), INT64_C(-1117570177728981140),
INT64_C(-4721763859644106046), INT64_C( 6636524825657073074)),
simde_mm512_set_epi64(INT64_C( 6283111750805844985), INT64_C(-7772496718970349305),
INT64_C(-6967007030435791671), INT64_C( 2761331052478409707),
INT64_C(-5439727342880208313), INT64_C(-6280010522852202514),
INT64_C(-2361957704355445009), INT64_C(-3413538286934776973)),
simde_mm512_set_epi64(INT64_C( -1), INT64_C( 0),
INT64_C( 0), INT64_C( 0),
INT64_C( 0), INT64_C( 0),
INT64_C( 1), INT64_C( -1)) },
{ simde_mm512_set_epi64(INT64_C( 7047516970419020428), INT64_C( 2488576598769637001),
INT64_C( 4233591199077735008), INT64_C( 1735409980007662056),
INT64_C(-2964306467966319268), INT64_C(-6472988581173317799),
INT64_C( 1870256929123231698), INT64_C(-5453281473672019922)),
simde_mm512_set_epi64(INT64_C(-6026337221937727695), INT64_C( 8654798725117969005),
INT64_C( 743584473088107844), INT64_C( 5114866458456107677),
INT64_C( 1917095392115883075), INT64_C( 8815346252210924017),
INT64_C(-1666651333186431127), INT64_C( 4973081304470687258)),
simde_mm512_set_epi64(INT64_C( -1), INT64_C( 0),
INT64_C( 5), INT64_C( 0),
INT64_C( -1), INT64_C( 0),
INT64_C( -1), INT64_C( -1)) },
{ simde_mm512_set_epi64(INT64_C(-1433819957247000466), INT64_C(-7270540428235491436),
INT64_C( 3506767658669433751), INT64_C(-6269164040512613371),
INT64_C(-2703740818469134807), INT64_C( 3442758576787517783),
INT64_C(-4507715808807193748), INT64_C( 4997387685805642122)),
simde_mm512_set_epi64(INT64_C(-3375611624029359751), INT64_C( 155579560497872257),
INT64_C( 4346579001240147982), INT64_C( 8478054430600792515),
INT64_C( 7917529543412977905), INT64_C( 6077094839460323156),
INT64_C(-3234198817213444484), INT64_C( 5455426772165090925)),
simde_mm512_set_epi64(INT64_C( 0), INT64_C( -46),
INT64_C( 0), INT64_C( 0),
INT64_C( 0), INT64_C( 0),
INT64_C( 1), INT64_C( 0)) },
{ simde_mm512_set_epi64(INT64_C( 5060007040297057440), INT64_C(-6547486212696877775),
INT64_C( 4083773956347780040), INT64_C(-7582952476466356489),
INT64_C( -533799245190218148), INT64_C( 6528011672062484486),
INT64_C( 8505594160370567764), INT64_C(-7955306051941505966)),
simde_mm512_set_epi64(INT64_C( 8381795236484256749), INT64_C(-8094121819208130597),
INT64_C(-4463810942012697177), INT64_C( 1695569373680370472),
INT64_C( 6457800057248167752), INT64_C( 2509734679188915375),
INT64_C(-1817858424181439867), INT64_C(-1140679629593449988)),
simde_mm512_set_epi64(INT64_C( 0), INT64_C( 0),
INT64_C( 0), INT64_C( -4),
INT64_C( 0), INT64_C( 2),
INT64_C( -4), INT64_C( 6)) },
{ simde_mm512_set_epi64(INT64_C(-3727073512330556719), INT64_C( 1145199535931310009),
INT64_C( 6618746106828964781), INT64_C( -318594899546127361),
INT64_C(-8348228873903822999), INT64_C( 6522300981577637255),
INT64_C(-2123306667443487570), INT64_C(-4210181406724347525)),
simde_mm512_set_epi64(INT64_C(-5833250200550208329), INT64_C( 8217300129052611844),
INT64_C( -649664904511148711), INT64_C( 3231016623164402124),
INT64_C( 8024018119100712605), INT64_C( 4306653136982574157),
INT64_C(-5380031023357226466), INT64_C( 2544237471105729967)),
simde_mm512_set_epi64(INT64_C( 0), INT64_C( 0),
INT64_C( -10), INT64_C( 0),
INT64_C( -1), INT64_C( 1),
INT64_C( 0), INT64_C( -1)) },
{ simde_mm512_set_epi64(INT64_C(-6427790700478275098), INT64_C(-3168480089241839861),
INT64_C(-5000559488767708993), INT64_C( 2755885615249137538),
INT64_C( -821966059249139816), INT64_C( 1089871025732147351),
INT64_C( 4566772594003817295), INT64_C(-9114574651084812253)),
simde_mm512_set_epi64(INT64_C( 1778890864282373370), INT64_C( 5911759041868723302),
INT64_C( 4553617065988887085), INT64_C( -523178035921802922),
INT64_C( 8875040781716651384), INT64_C( 2040058868339841473),
INT64_C(-2732208005963885166), INT64_C(-4435516374878659804)),
simde_mm512_set_epi64(INT64_C( -3), INT64_C( 0),
INT64_C( -1), INT64_C( -5),
INT64_C( 0), INT64_C( 0),
INT64_C( -1), INT64_C( 2)) },
{ simde_mm512_set_epi64(INT64_C( 423237589908350744), INT64_C( 2795901596537384901),
INT64_C( 1719109459006160254), INT64_C(-9093479824318774446),
INT64_C(-4511267031708830231), INT64_C(-3402553166296368495),
INT64_C( 1216620777318406949), INT64_C( -836102980820378689)),
simde_mm512_set_epi64(INT64_C( 7782115963838117574), INT64_C(-6846698536887599933),
INT64_C( 4072223690207540333), INT64_C(-1026965696159348843),
INT64_C( 4340400659569160523), INT64_C(-8299269241811916492),
INT64_C( 7360887374546597504), INT64_C(-6651085920823128052)),
simde_mm512_set_epi64(INT64_C( 0), INT64_C( 0),
INT64_C( 0), INT64_C( 8),
INT64_C( -1), INT64_C( 0),
INT64_C( 0), INT64_C( 0)) },
{ simde_mm512_set_epi64(INT64_C( 453211281016332666), INT64_C( 5434252921191502101),
INT64_C(-6060319301844209563), INT64_C(-5254139409542070482),
INT64_C(-8624885551201065882), INT64_C( 8329149627836272144),
INT64_C( 8516875663163240125), INT64_C(-4575460702098419673)),
simde_mm512_set_epi64(INT64_C(-5051260979279221837), INT64_C( 6222948671724306809),
INT64_C( 6742741209152957138), INT64_C( 5958951964162816685),
INT64_C( 2981515940173974322), INT64_C( 3752367916961311345),
INT64_C(-2840979297342041250), INT64_C(-2506264265844715430)),
simde_mm512_set_epi64(INT64_C( 0), INT64_C( 0),
INT64_C( 0), INT64_C( 0),
INT64_C( -2), INT64_C( 2),
INT64_C( -2), INT64_C( 1)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_div_epi64(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_i64(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_div_epu8(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_x_mm512_set_epu8(UINT8_C( 41), UINT8_C( 49), UINT8_C(171), UINT8_C(198),
UINT8_C( 40), UINT8_C( 44), UINT8_C(242), UINT8_C( 51),
UINT8_C(138), UINT8_C(217), UINT8_C(215), UINT8_C(249),
UINT8_C(201), UINT8_C( 37), UINT8_C(137), UINT8_C( 29),
UINT8_C(233), UINT8_C(170), UINT8_C(241), UINT8_C(126),
UINT8_C(182), UINT8_C( 10), UINT8_C(208), UINT8_C(198),
UINT8_C( 93), UINT8_C(130), UINT8_C(195), UINT8_C(177),
UINT8_C(187), UINT8_C(223), UINT8_C(139), UINT8_C(253),
UINT8_C(191), UINT8_C(167), UINT8_C(226), UINT8_C( 64),
UINT8_C(213), UINT8_C(202), UINT8_C(110), UINT8_C(113),
UINT8_C( 89), UINT8_C(237), UINT8_C( 70), UINT8_C(226),
UINT8_C(132), UINT8_C( 91), UINT8_C(255), UINT8_C( 88),
UINT8_C(104), UINT8_C( 42), UINT8_C( 53), UINT8_C(254),
UINT8_C(132), UINT8_C(254), UINT8_C( 96), UINT8_C( 75),
UINT8_C( 31), UINT8_C(112), UINT8_C(151), UINT8_C(169),
UINT8_C(172), UINT8_C( 94), UINT8_C(112), UINT8_C( 90)),
simde_x_mm512_set_epu8(UINT8_C(195), UINT8_C( 49), UINT8_C( 14), UINT8_C(170),
UINT8_C(203), UINT8_C(167), UINT8_C( 3), UINT8_C(215),
UINT8_C( 63), UINT8_C(248), UINT8_C( 55), UINT8_C(219),
UINT8_C(221), UINT8_C(135), UINT8_C( 61), UINT8_C(191),
UINT8_C(209), UINT8_C( 91), UINT8_C( 87), UINT8_C(137),
UINT8_C( 87), UINT8_C( 76), UINT8_C( 44), UINT8_C(140),
UINT8_C( 2), UINT8_C(200), UINT8_C( 36), UINT8_C(195),
UINT8_C(200), UINT8_C(125), UINT8_C(254), UINT8_C(139),
UINT8_C(226), UINT8_C( 71), UINT8_C( 92), UINT8_C(129),
UINT8_C(182), UINT8_C(119), UINT8_C(247), UINT8_C( 34),
UINT8_C(121), UINT8_C( 85), UINT8_C(153), UINT8_C(116),
UINT8_C(218), UINT8_C( 21), UINT8_C(101), UINT8_C(122),
UINT8_C( 10), UINT8_C(231), UINT8_C( 54), UINT8_C( 71),
UINT8_C(156), UINT8_C(149), UINT8_C(244), UINT8_C( 84),
UINT8_C(148), UINT8_C( 85), UINT8_C(170), UINT8_C(184),
UINT8_C( 94), UINT8_C(154), UINT8_C(229), UINT8_C( 11)),
simde_x_mm512_set_epu8(UINT8_C( 0), UINT8_C( 1), UINT8_C( 12), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 80), UINT8_C( 0),
UINT8_C( 2), UINT8_C( 0), UINT8_C( 3), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 2), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 1), UINT8_C( 2), UINT8_C( 0),
UINT8_C( 2), UINT8_C( 0), UINT8_C( 4), UINT8_C( 1),
UINT8_C( 46), UINT8_C( 0), UINT8_C( 5), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 2), UINT8_C( 2), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 1), UINT8_C( 0), UINT8_C( 3),
UINT8_C( 0), UINT8_C( 2), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 4), UINT8_C( 2), UINT8_C( 0),
UINT8_C( 10), UINT8_C( 0), UINT8_C( 0), UINT8_C( 3),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 0), UINT8_C( 8)) },
{ simde_x_mm512_set_epu8(UINT8_C(216), UINT8_C( 85), UINT8_C(206), UINT8_C(103),
UINT8_C(235), UINT8_C(154), UINT8_C(129), UINT8_C(135),
UINT8_C(125), UINT8_C( 76), UINT8_C(202), UINT8_C(108),
UINT8_C( 52), UINT8_C( 71), UINT8_C(168), UINT8_C(196),
UINT8_C( 70), UINT8_C(138), UINT8_C(167), UINT8_C( 65),
UINT8_C(221), UINT8_C(161), UINT8_C(157), UINT8_C( 93),
UINT8_C(192), UINT8_C(189), UINT8_C(153), UINT8_C(155),
UINT8_C(207), UINT8_C(213), UINT8_C(105), UINT8_C(136),
UINT8_C(234), UINT8_C( 94), UINT8_C(240), UINT8_C( 12),
UINT8_C(146), UINT8_C( 1), UINT8_C(147), UINT8_C( 59),
UINT8_C(253), UINT8_C( 26), UINT8_C( 26), UINT8_C( 40),
UINT8_C( 12), UINT8_C( 2), UINT8_C(230), UINT8_C(145),
UINT8_C(170), UINT8_C(105), UINT8_C(111), UINT8_C(160),
UINT8_C(140), UINT8_C(202), UINT8_C(166), UINT8_C(220),
UINT8_C(187), UINT8_C( 65), UINT8_C(250), UINT8_C(195),
UINT8_C( 33), UINT8_C(131), UINT8_C( 2), UINT8_C(164)),
simde_x_mm512_set_epu8(UINT8_C(120), UINT8_C(127), UINT8_C( 28), UINT8_C( 95),
UINT8_C(175), UINT8_C(223), UINT8_C(119), UINT8_C(214),
UINT8_C(220), UINT8_C(102), UINT8_C( 86), UINT8_C( 22),
UINT8_C(119), UINT8_C(207), UINT8_C( 12), UINT8_C(183),
UINT8_C(172), UINT8_C(242), UINT8_C(173), UINT8_C(249),
UINT8_C( 52), UINT8_C(108), UINT8_C(128), UINT8_C(203),
UINT8_C( 85), UINT8_C(135), UINT8_C(227), UINT8_C( 35),
UINT8_C(187), UINT8_C( 24), UINT8_C(250), UINT8_C(219),
UINT8_C(253), UINT8_C( 62), UINT8_C(125), UINT8_C(236),
UINT8_C( 75), UINT8_C( 13), UINT8_C( 79), UINT8_C( 81),
UINT8_C(177), UINT8_C(221), UINT8_C(251), UINT8_C(181),
UINT8_C(159), UINT8_C(182), UINT8_C( 11), UINT8_C( 11),
UINT8_C( 39), UINT8_C( 37), UINT8_C( 39), UINT8_C(208),
UINT8_C(136), UINT8_C(180), UINT8_C(215), UINT8_C(139),
UINT8_C(144), UINT8_C(128), UINT8_C(203), UINT8_C(206),
UINT8_C(173), UINT8_C( 36), UINT8_C(133), UINT8_C(175)),
simde_x_mm512_set_epu8(UINT8_C( 1), UINT8_C( 0), UINT8_C( 7), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 2), UINT8_C( 4),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 14), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 4), UINT8_C( 1), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 2), UINT8_C( 1), UINT8_C( 0), UINT8_C( 4),
UINT8_C( 1), UINT8_C( 8), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 20), UINT8_C( 13),
UINT8_C( 4), UINT8_C( 2), UINT8_C( 2), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 1), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 3), UINT8_C( 0), UINT8_C( 0)) },
{ simde_x_mm512_set_epu8(UINT8_C( 87), UINT8_C( 63), UINT8_C( 47), UINT8_C( 80),
UINT8_C( 35), UINT8_C(229), UINT8_C( 5), UINT8_C( 31),
UINT8_C(228), UINT8_C( 73), UINT8_C( 53), UINT8_C( 47),
UINT8_C(170), UINT8_C(192), UINT8_C(122), UINT8_C(237),
UINT8_C( 47), UINT8_C(130), UINT8_C(219), UINT8_C(102),
UINT8_C(163), UINT8_C( 41), UINT8_C(195), UINT8_C(215),
UINT8_C(199), UINT8_C( 54), UINT8_C( 97), UINT8_C(126),
UINT8_C( 10), UINT8_C(165), UINT8_C(155), UINT8_C( 88),
UINT8_C(184), UINT8_C( 63), UINT8_C( 95), UINT8_C(164),
UINT8_C( 65), UINT8_C( 71), UINT8_C(174), UINT8_C( 88),
UINT8_C(183), UINT8_C(142), UINT8_C( 98), UINT8_C( 14),
UINT8_C( 25), UINT8_C(173), UINT8_C( 87), UINT8_C( 2),
UINT8_C(191), UINT8_C(143), UINT8_C(152), UINT8_C( 2),
UINT8_C(126), UINT8_C( 0), UINT8_C(162), UINT8_C( 57),
UINT8_C(245), UINT8_C( 36), UINT8_C(239), UINT8_C( 54),
UINT8_C( 33), UINT8_C(165), UINT8_C(199), UINT8_C( 84)),
simde_x_mm512_set_epu8(UINT8_C(131), UINT8_C( 42), UINT8_C(151), UINT8_C(210),
UINT8_C( 12), UINT8_C(163), UINT8_C(138), UINT8_C(207),
UINT8_C( 43), UINT8_C( 57), UINT8_C( 61), UINT8_C( 62),
UINT8_C( 81), UINT8_C(184), UINT8_C( 6), UINT8_C( 93),
UINT8_C(167), UINT8_C( 1), UINT8_C(145), UINT8_C( 9),
UINT8_C( 4), UINT8_C( 17), UINT8_C( 10), UINT8_C(101),
UINT8_C(186), UINT8_C(181), UINT8_C(155), UINT8_C(243),
UINT8_C(189), UINT8_C(191), UINT8_C(222), UINT8_C(205),
UINT8_C( 59), UINT8_C( 26), UINT8_C(227), UINT8_C(105),
UINT8_C(237), UINT8_C(145), UINT8_C(183), UINT8_C( 79),
UINT8_C(174), UINT8_C( 60), UINT8_C(132), UINT8_C(208),
UINT8_C( 58), UINT8_C(178), UINT8_C(116), UINT8_C(240),
UINT8_C( 37), UINT8_C(131), UINT8_C(100), UINT8_C(177),
UINT8_C( 19), UINT8_C(102), UINT8_C( 81), UINT8_C( 86),
UINT8_C( 25), UINT8_C( 43), UINT8_C( 51), UINT8_C(140),
UINT8_C( 9), UINT8_C( 40), UINT8_C(227), UINT8_C( 75)),
simde_x_mm512_set_epu8(UINT8_C( 0), UINT8_C( 1), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 2), UINT8_C( 1), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 5), UINT8_C( 1), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 2), UINT8_C( 1), UINT8_C( 20), UINT8_C( 2),
UINT8_C( 0), UINT8_C(130), UINT8_C( 1), UINT8_C( 11),
UINT8_C( 40), UINT8_C( 2), UINT8_C( 19), UINT8_C( 2),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 3), UINT8_C( 2), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 2), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 5), UINT8_C( 1), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 6), UINT8_C( 0), UINT8_C( 2), UINT8_C( 0),
UINT8_C( 9), UINT8_C( 0), UINT8_C( 4), UINT8_C( 0),
UINT8_C( 3), UINT8_C( 4), UINT8_C( 0), UINT8_C( 1)) },
{ simde_x_mm512_set_epu8(UINT8_C(233), UINT8_C( 79), UINT8_C( 12), UINT8_C( 0),
UINT8_C( 33), UINT8_C(178), UINT8_C( 58), UINT8_C( 74),
UINT8_C(250), UINT8_C(116), UINT8_C(142), UINT8_C( 20),
UINT8_C( 88), UINT8_C( 63), UINT8_C( 34), UINT8_C(124),
UINT8_C(250), UINT8_C( 48), UINT8_C(221), UINT8_C(232),
UINT8_C(221), UINT8_C( 75), UINT8_C(155), UINT8_C( 80),
UINT8_C(233), UINT8_C(169), UINT8_C(198), UINT8_C(226),
UINT8_C( 83), UINT8_C( 27), UINT8_C(137), UINT8_C( 34),
UINT8_C( 23), UINT8_C(132), UINT8_C(106), UINT8_C(109),
UINT8_C(135), UINT8_C(203), UINT8_C( 98), UINT8_C(120),
UINT8_C(101), UINT8_C( 52), UINT8_C( 82), UINT8_C( 44),
UINT8_C(142), UINT8_C( 14), UINT8_C( 99), UINT8_C(245),
UINT8_C( 8), UINT8_C(140), UINT8_C(141), UINT8_C(123),
UINT8_C(219), UINT8_C(163), UINT8_C(196), UINT8_C(233),
UINT8_C( 34), UINT8_C(185), UINT8_C(228), UINT8_C(108),
UINT8_C( 95), UINT8_C(236), UINT8_C( 97), UINT8_C( 41)),
simde_x_mm512_set_epu8(UINT8_C(193), UINT8_C(230), UINT8_C( 93), UINT8_C( 23),
UINT8_C(193), UINT8_C( 52), UINT8_C(223), UINT8_C(175),
UINT8_C(205), UINT8_C( 45), UINT8_C(166), UINT8_C( 24),
UINT8_C( 71), UINT8_C(234), UINT8_C(161), UINT8_C(142),
UINT8_C(184), UINT8_C(218), UINT8_C(190), UINT8_C(212),
UINT8_C(116), UINT8_C(159), UINT8_C( 44), UINT8_C( 55),
UINT8_C(213), UINT8_C(133), UINT8_C( 60), UINT8_C( 3),
UINT8_C( 58), UINT8_C(255), UINT8_C(125), UINT8_C(189),
UINT8_C(145), UINT8_C( 88), UINT8_C( 55), UINT8_C(182),
UINT8_C( 23), UINT8_C(161), UINT8_C(133), UINT8_C( 27),
UINT8_C(125), UINT8_C(229), UINT8_C(203), UINT8_C( 45),
UINT8_C( 24), UINT8_C( 5), UINT8_C( 90), UINT8_C( 83),
UINT8_C(145), UINT8_C( 85), UINT8_C(156), UINT8_C(164),
UINT8_C(149), UINT8_C(201), UINT8_C( 48), UINT8_C(255),
UINT8_C( 41), UINT8_C( 42), UINT8_C( 94), UINT8_C(129),
UINT8_C(135), UINT8_C( 8), UINT8_C( 12), UINT8_C(203)),
simde_x_mm512_set_epu8(UINT8_C( 1), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 3), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 2), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 1), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 3), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 1), UINT8_C( 3), UINT8_C( 75),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 5), UINT8_C( 1), UINT8_C( 0), UINT8_C( 4),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 5), UINT8_C( 2), UINT8_C( 1), UINT8_C( 2),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 4), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 4), UINT8_C( 2), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 29), UINT8_C( 8), UINT8_C( 0)) },
{ simde_x_mm512_set_epu8(UINT8_C(142), UINT8_C( 19), UINT8_C(128), UINT8_C( 3),
UINT8_C(129), UINT8_C(192), UINT8_C(118), UINT8_C(156),
UINT8_C( 16), UINT8_C(232), UINT8_C(203), UINT8_C(122),
UINT8_C(229), UINT8_C(105), UINT8_C(120), UINT8_C(201),
UINT8_C(228), UINT8_C(167), UINT8_C(141), UINT8_C(146),
UINT8_C(116), UINT8_C( 74), UINT8_C(191), UINT8_C( 35),
UINT8_C( 45), UINT8_C(158), UINT8_C(228), UINT8_C(138),
UINT8_C( 49), UINT8_C( 7), UINT8_C( 65), UINT8_C(140),
UINT8_C( 0), UINT8_C(113), UINT8_C(156), UINT8_C(113),
UINT8_C(246), UINT8_C(167), UINT8_C(109), UINT8_C(141),
UINT8_C(192), UINT8_C( 11), UINT8_C( 33), UINT8_C(141),
UINT8_C(129), UINT8_C( 2), UINT8_C(168), UINT8_C(227),
UINT8_C( 23), UINT8_C(173), UINT8_C(104), UINT8_C( 71),
UINT8_C( 11), UINT8_C(250), UINT8_C( 13), UINT8_C(218),
UINT8_C(194), UINT8_C(140), UINT8_C(125), UINT8_C( 43),
UINT8_C(151), UINT8_C( 49), UINT8_C(129), UINT8_C(218)),
simde_x_mm512_set_epu8(UINT8_C( 8), UINT8_C( 25), UINT8_C(147), UINT8_C(220),
UINT8_C(173), UINT8_C(138), UINT8_C( 38), UINT8_C(150),
UINT8_C( 35), UINT8_C( 43), UINT8_C(165), UINT8_C(185),
UINT8_C( 50), UINT8_C( 64), UINT8_C(161), UINT8_C(132),
UINT8_C(162), UINT8_C( 50), UINT8_C(199), UINT8_C( 84),
UINT8_C(251), UINT8_C(200), UINT8_C(217), UINT8_C( 19),
UINT8_C(180), UINT8_C(196), UINT8_C(246), UINT8_C( 76),
UINT8_C( 55), UINT8_C(204), UINT8_C(139), UINT8_C( 75),
UINT8_C( 1), UINT8_C( 89), UINT8_C(133), UINT8_C(212),
UINT8_C(206), UINT8_C( 55), UINT8_C(204), UINT8_C(120),
UINT8_C( 37), UINT8_C(159), UINT8_C(146), UINT8_C(217),
UINT8_C(226), UINT8_C(190), UINT8_C(134), UINT8_C( 8),
UINT8_C(113), UINT8_C( 61), UINT8_C(103), UINT8_C(100),
UINT8_C( 23), UINT8_C(229), UINT8_C(146), UINT8_C( 97),
UINT8_C( 95), UINT8_C( 32), UINT8_C(136), UINT8_C( 91),
UINT8_C( 46), UINT8_C(252), UINT8_C(163), UINT8_C( 88)),
simde_x_mm512_set_epu8(UINT8_C( 17), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 3), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 5), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 4), UINT8_C( 1), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 3), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 3), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 5), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 1), UINT8_C( 28),
UINT8_C( 0), UINT8_C( 2), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 0), UINT8_C( 2),
UINT8_C( 2), UINT8_C( 4), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 3), UINT8_C( 0), UINT8_C( 0), UINT8_C( 2)) },
{ simde_x_mm512_set_epu8(UINT8_C( 46), UINT8_C( 43), UINT8_C(246), UINT8_C(157),
UINT8_C( 80), UINT8_C(154), UINT8_C( 27), UINT8_C(118),
UINT8_C(176), UINT8_C(216), UINT8_C( 46), UINT8_C(142),
UINT8_C(198), UINT8_C(248), UINT8_C( 88), UINT8_C( 29),
UINT8_C(176), UINT8_C( 25), UINT8_C(101), UINT8_C( 54),
UINT8_C(103), UINT8_C(120), UINT8_C( 94), UINT8_C( 16),
UINT8_C(197), UINT8_C(205), UINT8_C( 71), UINT8_C(246),
UINT8_C(158), UINT8_C(176), UINT8_C(218), UINT8_C( 43),
UINT8_C(235), UINT8_C(249), UINT8_C(116), UINT8_C(137),
UINT8_C( 89), UINT8_C(212), UINT8_C(132), UINT8_C( 56),
UINT8_C(230), UINT8_C(137), UINT8_C( 66), UINT8_C( 41),
UINT8_C( 44), UINT8_C( 35), UINT8_C(189), UINT8_C(155),
UINT8_C(125), UINT8_C(130), UINT8_C(123), UINT8_C(117),
UINT8_C(123), UINT8_C(127), UINT8_C(151), UINT8_C( 60),
UINT8_C(153), UINT8_C(185), UINT8_C(250), UINT8_C(100),
UINT8_C( 83), UINT8_C(112), UINT8_C( 33), UINT8_C(140)),
simde_x_mm512_set_epu8(UINT8_C( 36), UINT8_C( 33), UINT8_C( 42), UINT8_C( 75),
UINT8_C(179), UINT8_C(172), UINT8_C(126), UINT8_C(171),
UINT8_C(110), UINT8_C(150), UINT8_C(107), UINT8_C(180),
UINT8_C(134), UINT8_C( 73), UINT8_C(207), UINT8_C( 15),
UINT8_C(241), UINT8_C(103), UINT8_C(103), UINT8_C(150),
UINT8_C(103), UINT8_C( 58), UINT8_C(104), UINT8_C( 35),
UINT8_C(249), UINT8_C( 79), UINT8_C(113), UINT8_C( 97),
UINT8_C(189), UINT8_C(197), UINT8_C(174), UINT8_C(222),
UINT8_C(224), UINT8_C(104), UINT8_C(123), UINT8_C(124),
UINT8_C( 49), UINT8_C(226), UINT8_C( 37), UINT8_C( 22),
UINT8_C(105), UINT8_C(157), UINT8_C(110), UINT8_C( 52),
UINT8_C(254), UINT8_C(103), UINT8_C(162), UINT8_C(210),
UINT8_C(202), UINT8_C( 39), UINT8_C(193), UINT8_C(151),
UINT8_C(183), UINT8_C( 73), UINT8_C( 97), UINT8_C(187),
UINT8_C(102), UINT8_C(195), UINT8_C( 68), UINT8_C(190),
UINT8_C( 65), UINT8_C( 60), UINT8_C(165), UINT8_C(126)),
simde_x_mm512_set_epu8(UINT8_C( 1), UINT8_C( 1), UINT8_C( 5), UINT8_C( 2),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 1), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 3), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 2), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 2), UINT8_C( 0), UINT8_C( 2),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 2), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 3), UINT8_C( 2),
UINT8_C( 2), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 3), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 3), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 1), UINT8_C( 0), UINT8_C( 1)) },
{ simde_x_mm512_set_epu8(UINT8_C(240), UINT8_C(169), UINT8_C( 8), UINT8_C( 54),
UINT8_C( 66), UINT8_C( 99), UINT8_C( 14), UINT8_C( 32),
UINT8_C(148), UINT8_C( 92), UINT8_C(122), UINT8_C(200),
UINT8_C(192), UINT8_C(186), UINT8_C(225), UINT8_C( 52),
UINT8_C(182), UINT8_C(244), UINT8_C(253), UINT8_C(228),
UINT8_C(141), UINT8_C(228), UINT8_C(148), UINT8_C(168),
UINT8_C(231), UINT8_C(107), UINT8_C( 47), UINT8_C(205),
UINT8_C(126), UINT8_C( 7), UINT8_C(182), UINT8_C(245),
UINT8_C(165), UINT8_C(186), UINT8_C(213), UINT8_C( 84),
UINT8_C( 19), UINT8_C(131), UINT8_C( 54), UINT8_C( 13),
UINT8_C(185), UINT8_C(182), UINT8_C( 72), UINT8_C( 61),
UINT8_C(125), UINT8_C(104), UINT8_C(147), UINT8_C( 11),
UINT8_C( 89), UINT8_C(204), UINT8_C( 62), UINT8_C(163),
UINT8_C(198), UINT8_C(162), UINT8_C(205), UINT8_C( 9),
UINT8_C(182), UINT8_C(123), UINT8_C( 65), UINT8_C(208),
UINT8_C(145), UINT8_C(179), UINT8_C( 34), UINT8_C(195)),
simde_x_mm512_set_epu8(UINT8_C(141), UINT8_C(103), UINT8_C(116), UINT8_C( 12),
UINT8_C(174), UINT8_C(226), UINT8_C(193), UINT8_C(175),
UINT8_C(155), UINT8_C(174), UINT8_C( 73), UINT8_C( 6),
UINT8_C(141), UINT8_C(140), UINT8_C(254), UINT8_C(193),
UINT8_C(100), UINT8_C(151), UINT8_C( 14), UINT8_C( 19),
UINT8_C( 38), UINT8_C(115), UINT8_C(201), UINT8_C(118),
UINT8_C( 74), UINT8_C(186), UINT8_C( 89), UINT8_C(183),
UINT8_C( 65), UINT8_C(138), UINT8_C( 64), UINT8_C( 90),
UINT8_C(152), UINT8_C(241), UINT8_C(229), UINT8_C(218),
UINT8_C(126), UINT8_C( 38), UINT8_C(159), UINT8_C( 27),
UINT8_C(164), UINT8_C(199), UINT8_C( 25), UINT8_C(253),
UINT8_C(181), UINT8_C(104), UINT8_C( 6), UINT8_C(183),
UINT8_C( 36), UINT8_C(203), UINT8_C(138), UINT8_C(145),
UINT8_C(116), UINT8_C(155), UINT8_C(218), UINT8_C( 24),
UINT8_C(205), UINT8_C(238), UINT8_C(242), UINT8_C( 26),
UINT8_C(226), UINT8_C( 76), UINT8_C(226), UINT8_C(214)),
simde_x_mm512_set_epu8(UINT8_C( 1), UINT8_C( 1), UINT8_C( 0), UINT8_C( 4),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 1), UINT8_C( 33),
UINT8_C( 1), UINT8_C( 1), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 1), UINT8_C( 18), UINT8_C( 12),
UINT8_C( 3), UINT8_C( 1), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 3), UINT8_C( 0), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 2), UINT8_C( 2),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 3), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 2), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 24), UINT8_C( 0),
UINT8_C( 2), UINT8_C( 1), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 1), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 8),
UINT8_C( 0), UINT8_C( 2), UINT8_C( 0), UINT8_C( 0)) },
{ simde_x_mm512_set_epu8(UINT8_C(197), UINT8_C( 52), UINT8_C(145), UINT8_C( 20),
UINT8_C( 26), UINT8_C(178), UINT8_C(121), UINT8_C( 16),
UINT8_C( 45), UINT8_C(229), UINT8_C( 11), UINT8_C(230),
UINT8_C( 53), UINT8_C( 2), UINT8_C(234), UINT8_C( 7),
UINT8_C(207), UINT8_C(146), UINT8_C(169), UINT8_C(233),
UINT8_C(206), UINT8_C(116), UINT8_C( 55), UINT8_C(156),
UINT8_C(180), UINT8_C( 91), UINT8_C( 56), UINT8_C(146),
UINT8_C( 55), UINT8_C(137), UINT8_C(200), UINT8_C( 76),
UINT8_C( 43), UINT8_C(245), UINT8_C(138), UINT8_C( 3),
UINT8_C(213), UINT8_C(156), UINT8_C(166), UINT8_C(234),
UINT8_C(199), UINT8_C( 2), UINT8_C( 86), UINT8_C( 72),
UINT8_C( 93), UINT8_C(254), UINT8_C(190), UINT8_C(121),
UINT8_C(119), UINT8_C( 75), UINT8_C(159), UINT8_C( 76),
UINT8_C( 70), UINT8_C(218), UINT8_C( 17), UINT8_C(239),
UINT8_C( 43), UINT8_C(152), UINT8_C(222), UINT8_C( 80),
UINT8_C(197), UINT8_C(113), UINT8_C(112), UINT8_C( 81)),
simde_x_mm512_set_epu8(UINT8_C(193), UINT8_C(162), UINT8_C(178), UINT8_C( 36),
UINT8_C(178), UINT8_C( 86), UINT8_C( 79), UINT8_C(167),
UINT8_C(179), UINT8_C( 45), UINT8_C( 18), UINT8_C(231),
UINT8_C(113), UINT8_C(127), UINT8_C(211), UINT8_C(181),
UINT8_C(121), UINT8_C(171), UINT8_C( 76), UINT8_C(135),
UINT8_C( 15), UINT8_C(133), UINT8_C(247), UINT8_C( 32),
UINT8_C(181), UINT8_C(168), UINT8_C(236), UINT8_C( 99),
UINT8_C( 85), UINT8_C(151), UINT8_C( 36), UINT8_C( 99),
UINT8_C(101), UINT8_C( 42), UINT8_C( 63), UINT8_C( 96),
UINT8_C(210), UINT8_C(198), UINT8_C(202), UINT8_C(105),
UINT8_C(214), UINT8_C( 74), UINT8_C(199), UINT8_C( 17),
UINT8_C(234), UINT8_C( 22), UINT8_C(134), UINT8_C(112),
UINT8_C( 62), UINT8_C(141), UINT8_C(156), UINT8_C( 91),
UINT8_C( 99), UINT8_C( 24), UINT8_C(198), UINT8_C(131),
UINT8_C( 88), UINT8_C(136), UINT8_C( 61), UINT8_C( 94),
UINT8_C(189), UINT8_C(213), UINT8_C(249), UINT8_C(131)),
simde_x_mm512_set_epu8(UINT8_C( 1), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 2), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 5), UINT8_C( 0), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 2), UINT8_C( 1),
UINT8_C( 13), UINT8_C( 0), UINT8_C( 0), UINT8_C( 4),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 5), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 5), UINT8_C( 2), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 0), UINT8_C( 2),
UINT8_C( 0), UINT8_C( 0), UINT8_C( 0), UINT8_C( 4),
UINT8_C( 0), UINT8_C( 11), UINT8_C( 1), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 0), UINT8_C( 9), UINT8_C( 0), UINT8_C( 1),
UINT8_C( 0), UINT8_C( 1), UINT8_C( 3), UINT8_C( 0),
UINT8_C( 1), UINT8_C( 0), UINT8_C( 0), UINT8_C( 0)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_div_epu8(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_u8(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_div_epu16(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_x_mm512_set_epu16(UINT16_C( 10545), UINT16_C( 43974), UINT16_C( 10284), UINT16_C( 62003),
UINT16_C( 35545), UINT16_C( 55289), UINT16_C( 51493), UINT16_C( 35101),
UINT16_C( 59818), UINT16_C( 61822), UINT16_C( 46602), UINT16_C( 53446),
UINT16_C( 23938), UINT16_C( 50097), UINT16_C( 48095), UINT16_C( 35837),
UINT16_C( 49063), UINT16_C( 57920), UINT16_C( 54730), UINT16_C( 28273),
UINT16_C( 23021), UINT16_C( 18146), UINT16_C( 33883), UINT16_C( 65368),
UINT16_C( 26666), UINT16_C( 13822), UINT16_C( 34046), UINT16_C( 24651),
UINT16_C( 8048), UINT16_C( 38825), UINT16_C( 44126), UINT16_C( 28762)),
simde_x_mm512_set_epu16(UINT16_C( 38607), UINT16_C( 8074), UINT16_C( 18000), UINT16_C( 35687),
UINT16_C( 40415), UINT16_C( 3254), UINT16_C( 55282), UINT16_C( 38855),
UINT16_C( 41330), UINT16_C( 37148), UINT16_C( 25803), UINT16_C( 25877),
UINT16_C( 768), UINT16_C( 16244), UINT16_C( 11114), UINT16_C( 58324),
UINT16_C( 18192), UINT16_C( 32532), UINT16_C( 33700), UINT16_C( 60373),
UINT16_C( 20183), UINT16_C( 64042), UINT16_C( 2502), UINT16_C( 18488),
UINT16_C( 22771), UINT16_C( 21470), UINT16_C( 4556), UINT16_C( 26138),
UINT16_C( 19085), UINT16_C( 64613), UINT16_C( 55602), UINT16_C( 63371)),
simde_x_mm512_set_epu16(UINT16_C( 0), UINT16_C( 5), UINT16_C( 0), UINT16_C( 1),
UINT16_C( 0), UINT16_C( 16), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 1), UINT16_C( 1), UINT16_C( 1), UINT16_C( 2),
UINT16_C( 31), UINT16_C( 3), UINT16_C( 4), UINT16_C( 0),
UINT16_C( 2), UINT16_C( 1), UINT16_C( 1), UINT16_C( 0),
UINT16_C( 1), UINT16_C( 0), UINT16_C( 13), UINT16_C( 3),
UINT16_C( 1), UINT16_C( 0), UINT16_C( 7), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 0)) },
{ simde_x_mm512_set_epu16(UINT16_C( 20057), UINT16_C( 26978), UINT16_C( 45741), UINT16_C( 34503),
UINT16_C( 54259), UINT16_C( 41436), UINT16_C( 43883), UINT16_C( 11009),
UINT16_C( 50212), UINT16_C( 9014), UINT16_C( 24117), UINT16_C( 34039),
UINT16_C( 58348), UINT16_C( 8311), UINT16_C( 31759), UINT16_C( 4002),
UINT16_C( 7525), UINT16_C( 3321), UINT16_C( 47299), UINT16_C( 64213),
UINT16_C( 13644), UINT16_C( 48153), UINT16_C( 45234), UINT16_C( 51700),
UINT16_C( 7513), UINT16_C( 1114), UINT16_C( 65336), UINT16_C( 10389),
UINT16_C( 33688), UINT16_C( 9445), UINT16_C( 60332), UINT16_C( 41466)),
simde_x_mm512_set_epu16(UINT16_C( 48157), UINT16_C( 56913), UINT16_C( 55050), UINT16_C( 48859),
UINT16_C( 27895), UINT16_C( 48343), UINT16_C( 59593), UINT16_C( 60425),
UINT16_C( 62587), UINT16_C( 54231), UINT16_C( 52444), UINT16_C( 8140),
UINT16_C( 58695), UINT16_C( 2476), UINT16_C( 41101), UINT16_C( 7948),
UINT16_C( 26094), UINT16_C( 52354), UINT16_C( 30122), UINT16_C( 47688),
UINT16_C( 43801), UINT16_C( 57764), UINT16_C( 1809), UINT16_C( 33603),
UINT16_C( 8271), UINT16_C( 4936), UINT16_C( 7627), UINT16_C( 20477),
UINT16_C( 14608), UINT16_C( 25470), UINT16_C( 45836), UINT16_C( 25611)),
simde_x_mm512_set_epu16(UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 1), UINT16_C( 0), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 4),
UINT16_C( 0), UINT16_C( 3), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 1), UINT16_C( 1),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 25), UINT16_C( 1),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 8), UINT16_C( 0),
UINT16_C( 2), UINT16_C( 0), UINT16_C( 1), UINT16_C( 1)) },
{ simde_x_mm512_set_epu16(UINT16_C( 26902), UINT16_C( 51011), UINT16_C( 57631), UINT16_C( 57521),
UINT16_C( 43405), UINT16_C( 18318), UINT16_C( 44023), UINT16_C( 9770),
UINT16_C( 4118), UINT16_C( 33099), UINT16_C( 6621), UINT16_C( 57639),
UINT16_C( 22002), UINT16_C( 33155), UINT16_C( 15537), UINT16_C( 38743),
UINT16_C( 26466), UINT16_C( 21183), UINT16_C( 5811), UINT16_C( 17016),
UINT16_C( 51162), UINT16_C( 46775), UINT16_C( 54252), UINT16_C( 64603),
UINT16_C( 30444), UINT16_C( 20573), UINT16_C( 50572), UINT16_C( 25607),
UINT16_C( 36721), UINT16_C( 36797), UINT16_C( 27147), UINT16_C( 62271)),
simde_x_mm512_set_epu16(UINT16_C( 55381), UINT16_C( 52839), UINT16_C( 60314), UINT16_C( 33159),
UINT16_C( 32076), UINT16_C( 51820), UINT16_C( 13383), UINT16_C( 43204),
UINT16_C( 18058), UINT16_C( 42817), UINT16_C( 56737), UINT16_C( 40285),
UINT16_C( 49341), UINT16_C( 39323), UINT16_C( 53205), UINT16_C( 27016),
UINT16_C( 59998), UINT16_C( 61452), UINT16_C( 37377), UINT16_C( 37691),
UINT16_C( 64794), UINT16_C( 6696), UINT16_C( 3074), UINT16_C( 59025),
UINT16_C( 43625), UINT16_C( 28576), UINT16_C( 36042), UINT16_C( 42716),
UINT16_C( 47937), UINT16_C( 64195), UINT16_C( 8579), UINT16_C( 676)),
simde_x_mm512_set_epu16(UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 1),
UINT16_C( 1), UINT16_C( 0), UINT16_C( 3), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 1),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 1),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 6), UINT16_C( 17), UINT16_C( 1),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 1), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 3), UINT16_C( 92)) },
{ simde_x_mm512_set_epu16(UINT16_C( 7566), UINT16_C( 25511), UINT16_C( 59705), UINT16_C( 13989),
UINT16_C( 13965), UINT16_C( 34471), UINT16_C( 77), UINT16_C( 35152),
UINT16_C( 21705), UINT16_C( 42504), UINT16_C( 63033), UINT16_C( 56884),
UINT16_C( 42389), UINT16_C( 61527), UINT16_C( 7598), UINT16_C( 23051),
UINT16_C( 13886), UINT16_C( 28688), UINT16_C( 30551), UINT16_C( 36608),
UINT16_C( 56045), UINT16_C( 38987), UINT16_C( 64798), UINT16_C( 22350),
UINT16_C( 7981), UINT16_C( 50477), UINT16_C( 46688), UINT16_C( 16804),
UINT16_C( 33660), UINT16_C( 63749), UINT16_C( 29649), UINT16_C( 64815)),
simde_x_mm512_set_epu16(UINT16_C( 18409), UINT16_C( 19069), UINT16_C( 20979), UINT16_C( 35774),
UINT16_C( 8112), UINT16_C( 25085), UINT16_C( 31664), UINT16_C( 55404),
UINT16_C( 63329), UINT16_C( 19403), UINT16_C( 33006), UINT16_C( 20365),
UINT16_C( 22045), UINT16_C( 41935), UINT16_C( 28665), UINT16_C( 35793),
UINT16_C( 26789), UINT16_C( 40241), UINT16_C( 34076), UINT16_C( 36189),
UINT16_C( 49507), UINT16_C( 32891), UINT16_C( 45700), UINT16_C( 31541),
UINT16_C( 33237), UINT16_C( 50719), UINT16_C( 22782), UINT16_C( 46902),
UINT16_C( 62792), UINT16_C( 907), UINT16_C( 9939), UINT16_C( 395)),
simde_x_mm512_set_epu16(UINT16_C( 0), UINT16_C( 1), UINT16_C( 2), UINT16_C( 0),
UINT16_C( 1), UINT16_C( 1), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 2), UINT16_C( 1), UINT16_C( 2),
UINT16_C( 1), UINT16_C( 1), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 1),
UINT16_C( 1), UINT16_C( 1), UINT16_C( 1), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 2), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 70), UINT16_C( 2), UINT16_C( 164)) },
{ simde_x_mm512_set_epu16(UINT16_C( 40553), UINT16_C( 9260), UINT16_C( 6846), UINT16_C( 21618),
UINT16_C( 20365), UINT16_C( 26413), UINT16_C( 7670), UINT16_C( 6521),
UINT16_C( 13052), UINT16_C( 19892), UINT16_C( 40021), UINT16_C( 58092),
UINT16_C( 12337), UINT16_C( 14080), UINT16_C( 6934), UINT16_C( 61515),
UINT16_C( 1885), UINT16_C( 11733), UINT16_C( 7371), UINT16_C( 24583),
UINT16_C( 48349), UINT16_C( 37475), UINT16_C( 47206), UINT16_C( 54691),
UINT16_C( 63460), UINT16_C( 2107), UINT16_C( 62169), UINT16_C( 38808),
UINT16_C( 21341), UINT16_C( 51834), UINT16_C( 26283), UINT16_C( 38235)),
simde_x_mm512_set_epu16(UINT16_C( 9227), UINT16_C( 20728), UINT16_C( 22448), UINT16_C( 22271),
UINT16_C( 38010), UINT16_C( 3228), UINT16_C( 38598), UINT16_C( 15839),
UINT16_C( 4554), UINT16_C( 22831), UINT16_C( 44103), UINT16_C( 32351),
UINT16_C( 46747), UINT16_C( 20983), UINT16_C( 61889), UINT16_C( 26454),
UINT16_C( 63311), UINT16_C( 19804), UINT16_C( 62773), UINT16_C( 56806),
UINT16_C( 36384), UINT16_C( 25302), UINT16_C( 37143), UINT16_C( 3478),
UINT16_C( 59861), UINT16_C( 61175), UINT16_C( 48658), UINT16_C( 23119),
UINT16_C( 30252), UINT16_C( 63116), UINT16_C( 13170), UINT16_C( 44087)),
simde_x_mm512_set_epu16(UINT16_C( 4), UINT16_C( 0), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 8), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 2), UINT16_C( 0), UINT16_C( 0), UINT16_C( 1),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 2),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 1), UINT16_C( 1), UINT16_C( 1), UINT16_C( 15),
UINT16_C( 1), UINT16_C( 0), UINT16_C( 1), UINT16_C( 1),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 1), UINT16_C( 0)) },
{ simde_x_mm512_set_epu16(UINT16_C( 22335), UINT16_C( 12112), UINT16_C( 9189), UINT16_C( 1311),
UINT16_C( 58441), UINT16_C( 13615), UINT16_C( 43712), UINT16_C( 31469),
UINT16_C( 12162), UINT16_C( 56166), UINT16_C( 41769), UINT16_C( 50135),
UINT16_C( 50998), UINT16_C( 24958), UINT16_C( 2725), UINT16_C( 39768),
UINT16_C( 47167), UINT16_C( 24484), UINT16_C( 16711), UINT16_C( 44632),
UINT16_C( 46990), UINT16_C( 25102), UINT16_C( 6573), UINT16_C( 22274),
UINT16_C( 49039), UINT16_C( 38914), UINT16_C( 32256), UINT16_C( 41529),
UINT16_C( 62756), UINT16_C( 61238), UINT16_C( 8613), UINT16_C( 51028)),
simde_x_mm512_set_epu16(UINT16_C( 30472), UINT16_C( 36773), UINT16_C( 7714), UINT16_C( 18947),
UINT16_C( 7066), UINT16_C( 47844), UINT16_C( 58651), UINT16_C( 1841),
UINT16_C( 35799), UINT16_C( 50579), UINT16_C( 32926), UINT16_C( 26598),
UINT16_C( 39537), UINT16_C( 61137), UINT16_C( 5946), UINT16_C( 2262),
UINT16_C( 60116), UINT16_C( 12953), UINT16_C( 38045), UINT16_C( 47787),
UINT16_C( 30618), UINT16_C( 37811), UINT16_C( 51748), UINT16_C( 52236),
UINT16_C( 23394), UINT16_C( 2441), UINT16_C( 32382), UINT16_C( 9384),
UINT16_C( 25792), UINT16_C( 56163), UINT16_C( 22658), UINT16_C( 20939)),
simde_x_mm512_set_epu16(UINT16_C( 0), UINT16_C( 0), UINT16_C( 1), UINT16_C( 0),
UINT16_C( 8), UINT16_C( 0), UINT16_C( 0), UINT16_C( 17),
UINT16_C( 0), UINT16_C( 1), UINT16_C( 1), UINT16_C( 1),
UINT16_C( 1), UINT16_C( 0), UINT16_C( 0), UINT16_C( 17),
UINT16_C( 0), UINT16_C( 1), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 1), UINT16_C( 0), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 2), UINT16_C( 15), UINT16_C( 0), UINT16_C( 4),
UINT16_C( 2), UINT16_C( 1), UINT16_C( 0), UINT16_C( 2)) },
{ simde_x_mm512_set_epu16(UINT16_C( 13867), UINT16_C( 28091), UINT16_C( 35390), UINT16_C( 56986),
UINT16_C( 31509), UINT16_C( 63331), UINT16_C( 9520), UINT16_C( 29929),
UINT16_C( 24571), UINT16_C( 37741), UINT16_C( 52686), UINT16_C( 14609),
UINT16_C( 31001), UINT16_C( 823), UINT16_C( 45697), UINT16_C( 38351),
UINT16_C( 35780), UINT16_C( 41006), UINT16_C( 3633), UINT16_C( 45500),
UINT16_C( 30184), UINT16_C( 27396), UINT16_C( 1171), UINT16_C( 25936),
UINT16_C( 61703), UINT16_C( 57786), UINT16_C( 19453), UINT16_C( 30002),
UINT16_C( 6315), UINT16_C( 244), UINT16_C( 8399), UINT16_C( 57456)),
simde_x_mm512_set_epu16(UINT16_C( 18752), UINT16_C( 27431), UINT16_C( 53704), UINT16_C( 42625),
UINT16_C( 42869), UINT16_C( 41745), UINT16_C( 47543), UINT16_C( 11401),
UINT16_C( 26966), UINT16_C( 26500), UINT16_C( 7486), UINT16_C( 7825),
UINT16_C( 17767), UINT16_C( 58506), UINT16_C( 36234), UINT16_C( 38373),
UINT16_C( 54992), UINT16_C( 46906), UINT16_C( 52104), UINT16_C( 31285),
UINT16_C( 34932), UINT16_C( 29467), UINT16_C( 33781), UINT16_C( 883),
UINT16_C( 23995), UINT16_C( 43069), UINT16_C( 53587), UINT16_C( 11327),
UINT16_C( 36611), UINT16_C( 7518), UINT16_C( 30015), UINT16_C( 30285)),
simde_x_mm512_set_epu16(UINT16_C( 0), UINT16_C( 1), UINT16_C( 0), UINT16_C( 1),
UINT16_C( 0), UINT16_C( 1), UINT16_C( 0), UINT16_C( 2),
UINT16_C( 0), UINT16_C( 1), UINT16_C( 7), UINT16_C( 1),
UINT16_C( 1), UINT16_C( 0), UINT16_C( 1), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 1),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 29),
UINT16_C( 2), UINT16_C( 1), UINT16_C( 0), UINT16_C( 2),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 1)) },
{ simde_x_mm512_set_epu16(UINT16_C( 19003), UINT16_C( 26627), UINT16_C( 63705), UINT16_C( 34218),
UINT16_C( 36055), UINT16_C( 13847), UINT16_C( 44625), UINT16_C( 9042),
UINT16_C( 36148), UINT16_C( 11660), UINT16_C( 32339), UINT16_C( 39715),
UINT16_C( 47178), UINT16_C( 21002), UINT16_C( 60706), UINT16_C( 8527),
UINT16_C( 26072), UINT16_C( 29611), UINT16_C( 18348), UINT16_C( 953),
UINT16_C( 33382), UINT16_C( 22717), UINT16_C( 50122), UINT16_C( 52414),
UINT16_C( 59278), UINT16_C( 54225), UINT16_C( 31952), UINT16_C( 29752),
UINT16_C( 37488), UINT16_C( 20614), UINT16_C( 1055), UINT16_C( 61149)),
simde_x_mm512_set_epu16(UINT16_C( 59727), UINT16_C( 3072), UINT16_C( 8626), UINT16_C( 14922),
UINT16_C( 64116), UINT16_C( 36372), UINT16_C( 22591), UINT16_C( 8828),
UINT16_C( 64048), UINT16_C( 56808), UINT16_C( 56651), UINT16_C( 39760),
UINT16_C( 59817), UINT16_C( 50914), UINT16_C( 21275), UINT16_C( 35106),
UINT16_C( 6020), UINT16_C( 27245), UINT16_C( 34763), UINT16_C( 25208),
UINT16_C( 25908), UINT16_C( 21036), UINT16_C( 36366), UINT16_C( 25589),
UINT16_C( 2188), UINT16_C( 36219), UINT16_C( 56227), UINT16_C( 50409),
UINT16_C( 8889), UINT16_C( 58476), UINT16_C( 24556), UINT16_C( 24873)),
simde_x_mm512_set_epu16(UINT16_C( 0), UINT16_C( 8), UINT16_C( 7), UINT16_C( 2),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 1), UINT16_C( 1),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 0), UINT16_C( 0), UINT16_C( 2), UINT16_C( 0),
UINT16_C( 4), UINT16_C( 1), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 1), UINT16_C( 1), UINT16_C( 1), UINT16_C( 2),
UINT16_C( 27), UINT16_C( 1), UINT16_C( 0), UINT16_C( 0),
UINT16_C( 4), UINT16_C( 0), UINT16_C( 0), UINT16_C( 2)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_div_epu16(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_u16(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_div_epu32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_x_mm512_set_epu32(UINT32_C( 691121094), UINT32_C( 674034227), UINT32_C(2329532409), UINT32_C(3374680349),
UINT32_C(3920294270), UINT32_C(3054162118), UINT32_C(1568850865), UINT32_C(3151989757),
UINT32_C(3215450688), UINT32_C(3586813553), UINT32_C(1508722402), UINT32_C(2220621656),
UINT32_C(1747596798), UINT32_C(2231263307), UINT32_C( 527472553), UINT32_C(2891870298)),
simde_x_mm512_set_epu32(UINT32_C(2530156426), UINT32_C(1179683687), UINT32_C(2648640694), UINT32_C(3623000007),
UINT32_C(2708640028), UINT32_C(1691051285), UINT32_C( 50347892), UINT32_C( 728425428),
UINT32_C(1192263444), UINT32_C(2208623573), UINT32_C(1322777130), UINT32_C( 163989560),
UINT32_C(1492341726), UINT32_C( 298608154), UINT32_C(1250819173), UINT32_C(3643996043)),
simde_x_mm512_set_epu32(UINT32_C( 0), UINT32_C( 0), UINT32_C( 0), UINT32_C( 0),
UINT32_C( 1), UINT32_C( 1), UINT32_C( 31), UINT32_C( 4),
UINT32_C( 2), UINT32_C( 1), UINT32_C( 1), UINT32_C( 13),
UINT32_C( 1), UINT32_C( 7), UINT32_C( 0), UINT32_C( 0)) },
{ simde_x_mm512_set_epu32(UINT32_C(1314482530), UINT32_C(2997716679), UINT32_C(3555959260), UINT32_C(2875927297),
UINT32_C(3290702646), UINT32_C(1580565751), UINT32_C(3823902839), UINT32_C(2081361826),
UINT32_C( 493161721), UINT32_C(3099851477), UINT32_C( 894221337), UINT32_C(2964507124),
UINT32_C( 492373082), UINT32_C(4281870485), UINT32_C(2207786213), UINT32_C(3953959418)),
simde_x_mm512_set_epu32(UINT32_C(3156074065), UINT32_C(3607805659), UINT32_C(1828175063), UINT32_C(3905547273),
UINT32_C(4101755863), UINT32_C(3436978124), UINT32_C(3846637996), UINT32_C(2693603084),
UINT32_C(1710148738), UINT32_C(1974123080), UINT32_C(2870600100), UINT32_C( 118588227),
UINT32_C( 542053192), UINT32_C( 499863549), UINT32_C( 957375358), UINT32_C(3003933707)),
simde_x_mm512_set_epu32(UINT32_C( 0), UINT32_C( 0), UINT32_C( 1), UINT32_C( 0),
UINT32_C( 0), UINT32_C( 0), UINT32_C( 0), UINT32_C( 0),
UINT32_C( 0), UINT32_C( 1), UINT32_C( 0), UINT32_C( 24),
UINT32_C( 0), UINT32_C( 8), UINT32_C( 2), UINT32_C( 1)) },
{ simde_x_mm512_set_epu32(UINT32_C(1763100483), UINT32_C(3776962737), UINT32_C(2844608398), UINT32_C(2885101098),
UINT32_C( 269910347), UINT32_C( 433971495), UINT32_C(1441956227), UINT32_C(1018271575),
UINT32_C(1734496959), UINT32_C( 380846712), UINT32_C(3352999607), UINT32_C(3555523675),
UINT32_C(1995198557), UINT32_C(3314312199), UINT32_C(2406584253), UINT32_C(1779168063)),
simde_x_mm512_set_epu32(UINT32_C(3629502055), UINT32_C(3952771463), UINT32_C(2102184556), UINT32_C( 877111492),
UINT32_C(1183491905), UINT32_C(3718356317), UINT32_C(3233651099), UINT32_C(3486869896),
UINT32_C(3932090380), UINT32_C(2449576763), UINT32_C(4246346280), UINT32_C( 201516689),
UINT32_C(2859036576), UINT32_C(2362091228), UINT32_C(3141663427), UINT32_C( 562234020)),
simde_x_mm512_set_epu32(UINT32_C( 0), UINT32_C( 0), UINT32_C( 1), UINT32_C( 3),
UINT32_C( 0), UINT32_C( 0), UINT32_C( 0), UINT32_C( 0),
UINT32_C( 0), UINT32_C( 0), UINT32_C( 0), UINT32_C( 17),
UINT32_C( 0), UINT32_C( 1), UINT32_C( 0), UINT32_C( 3)) },
{ simde_x_mm512_set_epu32(UINT32_C( 495870887), UINT32_C(3912840869), UINT32_C( 915244711), UINT32_C( 5081424),
UINT32_C(1422501384), UINT32_C(4130987572), UINT32_C(2778067031), UINT32_C( 497965579),
UINT32_C( 910061584), UINT32_C(2002226944), UINT32_C(3673004107), UINT32_C(4246624078),
UINT32_C( 523093293), UINT32_C(3059761572), UINT32_C(2206005509), UINT32_C(1943141679)),
simde_x_mm512_set_epu32(UINT32_C(1206471293), UINT32_C(1374915518), UINT32_C( 531653117), UINT32_C(2075187308),
UINT32_C(4150348747), UINT32_C(2163101581), UINT32_C(1444783055), UINT32_C(1878625233),
UINT32_C(1755684145), UINT32_C(2233240925), UINT32_C(3244523643), UINT32_C(2995026741),
UINT32_C(2178270751), UINT32_C(1493088054), UINT32_C(4115137419), UINT32_C( 651362699)),
simde_x_mm512_set_epu32(UINT32_C( 0), UINT32_C( 2), UINT32_C( 1), UINT32_C( 0),
UINT32_C( 0), UINT32_C( 1), UINT32_C( 1), UINT32_C( 0),
UINT32_C( 0), UINT32_C( 0), UINT32_C( 1), UINT32_C( 1),
UINT32_C( 0), UINT32_C( 2), UINT32_C( 0), UINT32_C( 2)) },
{ simde_x_mm512_set_epu32(UINT32_C(2657690668), UINT32_C( 448681074), UINT32_C(1334667053), UINT32_C( 502667641),
UINT32_C( 855395764), UINT32_C(2622874348), UINT32_C( 808531712), UINT32_C( 454488139),
UINT32_C( 123547093), UINT32_C( 483090439), UINT32_C(3168637539), UINT32_C(3093747107),
UINT32_C(4158916667), UINT32_C(4074346392), UINT32_C(1398655610), UINT32_C(1722520923)),
simde_x_mm512_set_epu32(UINT32_C( 604721400), UINT32_C(1471174399), UINT32_C(2491026588), UINT32_C(2529574367),
UINT32_C( 298473775), UINT32_C(2890366559), UINT32_C(3063632375), UINT32_C(4055983958),
UINT32_C(4149169500), UINT32_C(4113948134), UINT32_C(2384487126), UINT32_C(2434207126),
UINT32_C(3923111671), UINT32_C(3188873807), UINT32_C(1982658188), UINT32_C( 863153207)),
simde_x_mm512_set_epu32(UINT32_C( 4), UINT32_C( 0), UINT32_C( 0), UINT32_C( 0),
UINT32_C( 2), UINT32_C( 0), UINT32_C( 0), UINT32_C( 0),
UINT32_C( 0), UINT32_C( 0), UINT32_C( 1), UINT32_C( 1),
UINT32_C( 1), UINT32_C( 1), UINT32_C( 0), UINT32_C( 1)) },
{ simde_x_mm512_set_epu32(UINT32_C(1463758672), UINT32_C( 602211615), UINT32_C(3830002991), UINT32_C(2864741101),
UINT32_C( 797104998), UINT32_C(2737423319), UINT32_C(3342229886), UINT32_C( 178625368),
UINT32_C(3091160996), UINT32_C(1095216728), UINT32_C(3079561742), UINT32_C( 430790402),
UINT32_C(3213858818), UINT32_C(2113970745), UINT32_C(4112838454), UINT32_C( 564512596)),
simde_x_mm512_set_epu32(UINT32_C(1997049765), UINT32_C( 505563651), UINT32_C( 463125220), UINT32_C(3843753777),
UINT32_C(2346173843), UINT32_C(2157864934), UINT32_C(2591157969), UINT32_C( 389679318),
UINT32_C(3939775129), UINT32_C(2493364907), UINT32_C(2006619059), UINT32_C(3391409164),
UINT32_C(1533151625), UINT32_C(2122196136), UINT32_C(1690360675), UINT32_C(1484935627)),
simde_x_mm512_set_epu32(UINT32_C( 0), UINT32_C( 1), UINT32_C( 8), UINT32_C( 0),
UINT32_C( 0), UINT32_C( 1), UINT32_C( 1), UINT32_C( 0),
UINT32_C( 0), UINT32_C( 0), UINT32_C( 1), UINT32_C( 0),
UINT32_C( 2), UINT32_C( 0), UINT32_C( 2), UINT32_C( 0)) },
{ simde_x_mm512_set_epu32(UINT32_C( 908815803), UINT32_C(2319376026), UINT32_C(2065037155), UINT32_C( 623932649),
UINT32_C(1610322797), UINT32_C(3452844305), UINT32_C(2031682359), UINT32_C(2994836943),
UINT32_C(2344919086), UINT32_C( 238137788), UINT32_C(1978166020), UINT32_C( 76768592),
UINT32_C(4043825594), UINT32_C(1274901810), UINT32_C( 413860084), UINT32_C( 550494320)),
simde_x_mm512_set_epu32(UINT32_C(1228958503), UINT32_C(3519587969), UINT32_C(2809504529), UINT32_C(3115789449),
UINT32_C(1767270276), UINT32_C( 490610321), UINT32_C(1164436618), UINT32_C(2374669797),
UINT32_C(3604002618), UINT32_C(3414719029), UINT32_C(2289333019), UINT32_C(2213872499),
UINT32_C(1572579389), UINT32_C(3511888959), UINT32_C(2399346014), UINT32_C(1967093325)),
simde_x_mm512_set_epu32(UINT32_C( 0), UINT32_C( 0), UINT32_C( 0), UINT32_C( 0),
UINT32_C( 0), UINT32_C( 7), UINT32_C( 1), UINT32_C( 1),
UINT32_C( 0), UINT32_C( 0), UINT32_C( 0), UINT32_C( 0),
UINT32_C( 2), UINT32_C( 0), UINT32_C( 0), UINT32_C( 0)) },
{ simde_x_mm512_set_epu32(UINT32_C(1245407235), UINT32_C(4175005098), UINT32_C(2362914327), UINT32_C(2924553042),
UINT32_C(2369006988), UINT32_C(2119408419), UINT32_C(3091878410), UINT32_C(3978436943),
UINT32_C(1708684203), UINT32_C(1202455481), UINT32_C(2187745469), UINT32_C(3284847806),
UINT32_C(3884897233), UINT32_C(2094036024), UINT32_C(2456834182), UINT32_C( 69201629)),
simde_x_mm512_set_epu32(UINT32_C(3914271744), UINT32_C( 565328458), UINT32_C(4201942548), UINT32_C(1480532604),
UINT32_C(4197506536), UINT32_C(3712719696), UINT32_C(3920217826), UINT32_C(1394313506),
UINT32_C( 394553965), UINT32_C(2278253176), UINT32_C(1697927724), UINT32_C(2383307765),
UINT32_C( 143428987), UINT32_C(3684943081), UINT32_C( 582607980), UINT32_C(1609326889)),
simde_x_mm512_set_epu32(UINT32_C( 0), UINT32_C( 7), UINT32_C( 0), UINT32_C( 1),
UINT32_C( 0), UINT32_C( 0), UINT32_C( 0), UINT32_C( 2),
UINT32_C( 4), UINT32_C( 0), UINT32_C( 1), UINT32_C( 1),
UINT32_C( 27), UINT32_C( 0), UINT32_C( 4), UINT32_C( 0)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_div_epu32(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_u32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_mask_div_epu32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i src;
simde__mmask16 k;
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_x_mm512_set_epu32(UINT32_C( 691121094), UINT32_C( 674034227), UINT32_C(2329532409), UINT32_C(3374680349),
UINT32_C(3920294270), UINT32_C(3054162118), UINT32_C(1568850865), UINT32_C(3151989757),
UINT32_C(3215450688), UINT32_C(3586813553), UINT32_C(1508722402), UINT32_C(2220621656),
UINT32_C(1747596798), UINT32_C(2231263307), UINT32_C( 527472553), UINT32_C(2891870298)),
UINT16_C(63371),
simde_x_mm512_set_epu32(UINT32_C(3953959418), UINT32_C(2530156426), UINT32_C(1179683687), UINT32_C(2648640694),
UINT32_C(3623000007), UINT32_C(2708640028), UINT32_C(1691051285), UINT32_C( 50347892),
UINT32_C( 728425428), UINT32_C(1192263444), UINT32_C(2208623573), UINT32_C(1322777130),
UINT32_C( 163989560), UINT32_C(1492341726), UINT32_C( 298608154), UINT32_C(1250819173)),
simde_x_mm512_set_epu32(UINT32_C(3003933707), UINT32_C(1314482530), UINT32_C(2997716679), UINT32_C(3555959260),
UINT32_C(2875927297), UINT32_C(3290702646), UINT32_C(1580565751), UINT32_C(3823902839),
UINT32_C(2081361826), UINT32_C( 493161721), UINT32_C(3099851477), UINT32_C( 894221337),
UINT32_C(2964507124), UINT32_C( 492373082), UINT32_C(4281870485), UINT32_C(2207786213)),
simde_x_mm512_set_epu32(UINT32_C( 1), UINT32_C( 1), UINT32_C( 0), UINT32_C( 0),
UINT32_C(3920294270), UINT32_C( 0), UINT32_C( 1), UINT32_C( 0),
UINT32_C( 0), UINT32_C(3586813553), UINT32_C(1508722402), UINT32_C(2220621656),
UINT32_C( 0), UINT32_C(2231263307), UINT32_C( 0), UINT32_C( 0)) },
{ simde_x_mm512_set_epu32(UINT32_C(1779168063), UINT32_C(3156074065), UINT32_C(3607805659), UINT32_C(1828175063),
UINT32_C(3905547273), UINT32_C(4101755863), UINT32_C(3436978124), UINT32_C(3846637996),
UINT32_C(2693603084), UINT32_C(1710148738), UINT32_C(1974123080), UINT32_C(2870600100),
UINT32_C( 118588227), UINT32_C( 542053192), UINT32_C( 499863549), UINT32_C( 957375358)),
UINT16_C(36797),
simde_x_mm512_set_epu32(UINT32_C(3141663427), UINT32_C( 562234020), UINT32_C(1763100483), UINT32_C(3776962737),
UINT32_C(2844608398), UINT32_C(2885101098), UINT32_C( 269910347), UINT32_C( 433971495),
UINT32_C(1441956227), UINT32_C(1018271575), UINT32_C(1734496959), UINT32_C( 380846712),
UINT32_C(3352999607), UINT32_C(3555523675), UINT32_C(1995198557), UINT32_C(3314312199)),
simde_x_mm512_set_epu32(UINT32_C(2206005509), UINT32_C(1943141679), UINT32_C(3629502055), UINT32_C(3952771463),
UINT32_C(2102184556), UINT32_C( 877111492), UINT32_C(1183491905), UINT32_C(3718356317),
UINT32_C(3233651099), UINT32_C(3486869896), UINT32_C(3932090380), UINT32_C(2449576763),
UINT32_C(4246346280), UINT32_C( 201516689), UINT32_C(2859036576), UINT32_C(2362091228)),
simde_x_mm512_set_epu32(UINT32_C( 1), UINT32_C(3156074065), UINT32_C(3607805659), UINT32_C(1828175063),
UINT32_C( 1), UINT32_C( 3), UINT32_C( 0), UINT32_C( 0),
UINT32_C( 0), UINT32_C(1710148738), UINT32_C( 0), UINT32_C( 0),
UINT32_C( 0), UINT32_C( 17), UINT32_C( 499863549), UINT32_C( 1)) },
{ simde_x_mm512_set_epu32(UINT32_C(4115137419), UINT32_C( 651362699), UINT32_C( 495870887), UINT32_C(3912840869),
UINT32_C( 915244711), UINT32_C( 5081424), UINT32_C(1422501384), UINT32_C(4130987572),
UINT32_C(2778067031), UINT32_C( 497965579), UINT32_C( 910061584), UINT32_C(2002226944),
UINT32_C(3673004107), UINT32_C(4246624078), UINT32_C( 523093293), UINT32_C(3059761572)),
UINT16_C(46902),
simde_x_mm512_set_epu32(UINT32_C(4074346392), UINT32_C(1398655610), UINT32_C(1722520923), UINT32_C(1206471293),
UINT32_C(1374915518), UINT32_C( 531653117), UINT32_C(2075187308), UINT32_C(4150348747),
UINT32_C(2163101581), UINT32_C(1444783055), UINT32_C(1878625233), UINT32_C(1755684145),
UINT32_C(2233240925), UINT32_C(3244523643), UINT32_C(2995026741), UINT32_C(2178270751)),
simde_x_mm512_set_epu32(UINT32_C(3188873807), UINT32_C(1982658188), UINT32_C( 863153207), UINT32_C(2657690668),
UINT32_C( 448681074), UINT32_C(1334667053), UINT32_C( 502667641), UINT32_C( 855395764),
UINT32_C(2622874348), UINT32_C( 808531712), UINT32_C( 454488139), UINT32_C( 123547093),
UINT32_C( 483090439), UINT32_C(3168637539), UINT32_C(3093747107), UINT32_C(4158916667)),
simde_x_mm512_set_epu32(UINT32_C( 1), UINT32_C( 651362699), UINT32_C( 1), UINT32_C( 0),
UINT32_C( 915244711), UINT32_C( 0), UINT32_C( 4), UINT32_C( 4),
UINT32_C(2778067031), UINT32_C( 497965579), UINT32_C( 4), UINT32_C( 14),
UINT32_C(3673004107), UINT32_C( 1), UINT32_C( 0), UINT32_C(3059761572)) },
{ simde_x_mm512_set_epu32(UINT32_C(2113970745), UINT32_C(4112838454), UINT32_C( 564512596), UINT32_C( 604721400),
UINT32_C(1471174399), UINT32_C(2491026588), UINT32_C(2529574367), UINT32_C( 298473775),
UINT32_C(2890366559), UINT32_C(3063632375), UINT32_C(4055983958), UINT32_C(4149169500),
UINT32_C(4113948134), UINT32_C(2384487126), UINT32_C(2434207126), UINT32_C(3923111671)),
UINT16_C(38914),
simde_x_mm512_set_epu32(UINT32_C(1533151625), UINT32_C(2122196136), UINT32_C(1690360675), UINT32_C(1484935627),
UINT32_C(1463758672), UINT32_C( 602211615), UINT32_C(3830002991), UINT32_C(2864741101),
UINT32_C( 797104998), UINT32_C(2737423319), UINT32_C(3342229886), UINT32_C( 178625368),
UINT32_C(3091160996), UINT32_C(1095216728), UINT32_C(3079561742), UINT32_C( 430790402)),
simde_x_mm512_set_epu32(UINT32_C(4043825594), UINT32_C(1274901810), UINT32_C( 413860084), UINT32_C( 550494320),
UINT32_C(1997049765), UINT32_C( 505563651), UINT32_C( 463125220), UINT32_C(3843753777),
UINT32_C(2346173843), UINT32_C(2157864934), UINT32_C(2591157969), UINT32_C( 389679318),
UINT32_C(3939775129), UINT32_C(2493364907), UINT32_C(2006619059), UINT32_C(3391409164)),
simde_x_mm512_set_epu32(UINT32_C( 0), UINT32_C(4112838454), UINT32_C( 564512596), UINT32_C( 2),
UINT32_C( 0), UINT32_C(2491026588), UINT32_C(2529574367), UINT32_C( 298473775),
UINT32_C(2890366559), UINT32_C(3063632375), UINT32_C(4055983958), UINT32_C(4149169500),
UINT32_C(4113948134), UINT32_C(2384487126), UINT32_C( 1), UINT32_C(3923111671)) },
{ simde_x_mm512_set_epu32(UINT32_C(1572579389), UINT32_C(3511888959), UINT32_C(2399346014), UINT32_C(1967093325),
UINT32_C( 908815803), UINT32_C(2319376026), UINT32_C(2065037155), UINT32_C( 623932649),
UINT32_C(1610322797), UINT32_C(3452844305), UINT32_C(2031682359), UINT32_C(2994836943),
UINT32_C(2344919086), UINT32_C( 238137788), UINT32_C(1978166020), UINT32_C( 76768592)),
UINT16_C( 883),
simde_x_mm512_set_epu32(UINT32_C(3284847806), UINT32_C(3884897233), UINT32_C(2094036024), UINT32_C(2456834182),
UINT32_C( 69201629), UINT32_C(1228958503), UINT32_C(3519587969), UINT32_C(2809504529),
UINT32_C(3115789449), UINT32_C(1767270276), UINT32_C( 490610321), UINT32_C(1164436618),
UINT32_C(2374669797), UINT32_C(3604002618), UINT32_C(3414719029), UINT32_C(2289333019)),
simde_x_mm512_set_epu32(UINT32_C(2383307765), UINT32_C( 143428987), UINT32_C(3684943081), UINT32_C( 582607980),
UINT32_C(1609326889), UINT32_C(1245407235), UINT32_C(4175005098), UINT32_C(2362914327),
UINT32_C(2924553042), UINT32_C(2369006988), UINT32_C(2119408419), UINT32_C(3091878410),
UINT32_C(3978436943), UINT32_C(1708684203), UINT32_C(1202455481), UINT32_C(2187745469)),
simde_x_mm512_set_epu32(UINT32_C(1572579389), UINT32_C(3511888959), UINT32_C(2399346014), UINT32_C(1967093325),
UINT32_C( 908815803), UINT32_C(2319376026), UINT32_C( 0), UINT32_C( 1),
UINT32_C(1610322797), UINT32_C( 0), UINT32_C( 0), UINT32_C( 0),
UINT32_C(2344919086), UINT32_C( 238137788), UINT32_C( 2), UINT32_C( 1)) },
{ simde_x_mm512_set_epu32(UINT32_C(2117071873), UINT32_C(2857077767), UINT32_C(3918893192), UINT32_C(1087893388),
UINT32_C(3851784011), UINT32_C(3914271744), UINT32_C( 565328458), UINT32_C(4201942548),
UINT32_C(1480532604), UINT32_C(4197506536), UINT32_C(3712719696), UINT32_C(3920217826),
UINT32_C(1394313506), UINT32_C( 394553965), UINT32_C(2278253176), UINT32_C(1697927724)),
UINT16_C(12254),
simde_x_mm512_set_epu32(UINT32_C( 56443211), UINT32_C(2258452653), UINT32_C(3784696472), UINT32_C(1139427205),
UINT32_C(1090384090), UINT32_C(2389735891), UINT32_C(2215607313), UINT32_C(3817672405),
UINT32_C(3621770268), UINT32_C(2071747620), UINT32_C(3852178197), UINT32_C(3693632585),
UINT32_C( 319530416), UINT32_C(2179954815), UINT32_C(3793236393), UINT32_C( 340519338)),
simde_x_mm512_set_epu32(UINT32_C(1219537084), UINT32_C(1349635715), UINT32_C( 732887738), UINT32_C(2566325375),
UINT32_C(2906533885), UINT32_C(1765754685), UINT32_C(2719983633), UINT32_C( 846129112),
UINT32_C(1578410935), UINT32_C(2635094838), UINT32_C(1045536663), UINT32_C( 957117985),
UINT32_C(3029008645), UINT32_C(1309498779), UINT32_C(3293951997), UINT32_C(1022360677)),
simde_x_mm512_set_epu32(UINT32_C(2117071873), UINT32_C(2857077767), UINT32_C( 5), UINT32_C(1087893388),
UINT32_C( 0), UINT32_C( 1), UINT32_C( 0), UINT32_C( 4),
UINT32_C( 2), UINT32_C( 0), UINT32_C(3712719696), UINT32_C( 3),
UINT32_C( 0), UINT32_C( 1), UINT32_C( 1), UINT32_C(1697927724)) },
{ simde_x_mm512_set_epu32(UINT32_C(3990081318), UINT32_C( 991545752), UINT32_C(4151932359), UINT32_C( 843112042),
UINT32_C(4067412513), UINT32_C(2124182542), UINT32_C(2768721208), UINT32_C(2302989914),
UINT32_C(1224533822), UINT32_C(3475606100), UINT32_C(3610957044), UINT32_C(2556046111),
UINT32_C(3035396524), UINT32_C(3603101367), UINT32_C(3321443925), UINT32_C( 45581573)),
UINT16_C(42669),
simde_x_mm512_set_epu32(UINT32_C(4138167693), UINT32_C(3221954957), UINT32_C(2164435171), UINT32_C( 397240391),
UINT32_C( 200936922), UINT32_C(3263986987), UINT32_C(2536604122), UINT32_C(3629380929),
UINT32_C( 453331046), UINT32_C(1704580573), UINT32_C(1606190487), UINT32_C(3209309249),
UINT32_C(2959497652), UINT32_C(3926896735), UINT32_C(2875407663), UINT32_C(2069966669)),
simde_x_mm512_set_epu32(UINT32_C(1379668640), UINT32_C( 66581512), UINT32_C(3737665499), UINT32_C( 304428974),
UINT32_C(2686704508), UINT32_C( 532978979), UINT32_C( 946958552), UINT32_C(2383642627),
UINT32_C(2176874140), UINT32_C( 283691898), UINT32_C(3848894665), UINT32_C(3836186002),
UINT32_C(1951055651), UINT32_C( 765387914), UINT32_C( 822559116), UINT32_C( 7445617)),
simde_x_mm512_set_epu32(UINT32_C( 2), UINT32_C( 991545752), UINT32_C( 0), UINT32_C( 843112042),
UINT32_C(4067412513), UINT32_C( 6), UINT32_C( 2), UINT32_C(2302989914),
UINT32_C( 0), UINT32_C(3475606100), UINT32_C( 0), UINT32_C(2556046111),
UINT32_C( 1), UINT32_C( 5), UINT32_C(3321443925), UINT32_C( 278)) },
{ simde_x_mm512_set_epu32(UINT32_C(2313028370), UINT32_C( 869237081), UINT32_C(4104913762), UINT32_C(2825691966),
UINT32_C(3577866502), UINT32_C(2991894408), UINT32_C(2172048625), UINT32_C(1617119933),
UINT32_C(1521363431), UINT32_C( 553638116), UINT32_C(1036201367), UINT32_C(3107033445),
UINT32_C(3882811410), UINT32_C(3534384353), UINT32_C(3871215839), UINT32_C(1273589632)),
UINT16_C(35103),
simde_x_mm512_set_epu32(UINT32_C(2458371652), UINT32_C( 260676470), UINT32_C(1724614860), UINT32_C(4150452663),
UINT32_C(3816336716), UINT32_C(2208212235), UINT32_C( 932145867), UINT32_C(2432594561),
UINT32_C(1756892633), UINT32_C( 382632965), UINT32_C(1295078740), UINT32_C(3299165262),
UINT32_C( 152308919), UINT32_C(3943411788), UINT32_C( 31813624), UINT32_C( 807463845)),
simde_x_mm512_set_epu32(UINT32_C( 615301803), UINT32_C( 382786341), UINT32_C(1852603705), UINT32_C(1998007730),
UINT32_C( 231325888), UINT32_C(1842039329), UINT32_C( 968682756), UINT32_C( 316335394),
UINT32_C(2223585202), UINT32_C(3491781959), UINT32_C(2167971796), UINT32_C(1587647099),
UINT32_C(2966608712), UINT32_C( 320339033), UINT32_C( 282380179), UINT32_C(4186865204)),
simde_x_mm512_set_epu32(UINT32_C( 3), UINT32_C( 869237081), UINT32_C(4104913762), UINT32_C(2825691966),
UINT32_C( 16), UINT32_C(2991894408), UINT32_C(2172048625), UINT32_C( 7),
UINT32_C(1521363431), UINT32_C( 553638116), UINT32_C(1036201367), UINT32_C( 2),
UINT32_C( 0), UINT32_C( 12), UINT32_C( 0), UINT32_C( 0)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_mask_div_epu32(test_vec[i].src, test_vec[i].k, test_vec[i].a, test_vec[i].b);
simde_assert_m512i_u32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_div_epu64(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_x_mm512_set_epu64(UINT64_C( 2968342496979776051), UINT64_C(10005265515001776413),
UINT64_C(16837535683400356038), UINT64_C( 6738163160628300797),
UINT64_C(13810255550447513201), UINT64_C( 6479913377553186648),
UINT64_C( 7505871096235581515), UINT64_C( 2265477367564496986)),
simde_x_mm512_set_epu64(UINT64_C(10866939104613927783), UINT64_C(11375825163207743431),
UINT64_C(11633520338587575573), UINT64_C( 216242550290965460),
UINT64_C( 5120732502404950997), UINT64_C( 5681284513410730040),
UINT64_C( 6409558907924801050), UINT64_C( 5372227444888762251)),
simde_x_mm512_set_epu64(UINT64_C( 0), UINT64_C( 0),
UINT64_C( 1), UINT64_C( 31),
UINT64_C( 2), UINT64_C( 1),
UINT64_C( 1), UINT64_C( 0)) },
{ simde_x_mm512_set_epu64(UINT64_C( 5645659480511055559), UINT64_C(15272728730484288257),
UINT64_C(14133460247011230967), UINT64_C(16423537638667915170),
UINT64_C( 2118113466433927893), UINT64_C( 3840651400764901876),
UINT64_C( 2114726288902596757), UINT64_C( 9482369585348649466)),
simde_x_mm512_set_epu64(UINT64_C(13555234896536583899), UINT64_C( 7851952110853286921),
UINT64_C(17616907291198234572), UINT64_C(16521184395064581900),
UINT64_C( 7345032902979795528), UINT64_C(12329133549512917827),
UINT64_C( 2328100732832272381), UINT64_C( 4111895855610225675)),
simde_x_mm512_set_epu64(UINT64_C( 0), UINT64_C( 1),
UINT64_C( 0), UINT64_C( 0),
UINT64_C( 0), UINT64_C( 0),
UINT64_C( 0), UINT64_C( 2)) },
{ simde_x_mm512_set_epu64(UINT64_C( 7572458917823766705), UINT64_C(12217500042222052906),
UINT64_C( 1159256113650983207), UINT64_C( 6193154838246823767),
UINT64_C( 7449607714297299576), UINT64_C(14401023659121376347),
UINT64_C( 8569312554655704071), UINT64_C(10336200663482757951)),
simde_x_mm512_set_epu64(UINT64_C(15588592630942564743), UINT64_C( 9028813919053392068),
UINT64_C( 5083059030774095197), UINT64_C(13888425720366328200),
UINT64_C(16888199589465789243), UINT64_C(18237918400292775569),
UINT64_C(12279468594349909724), UINT64_C(13493341674566517412)),
simde_x_mm512_set_epu64(UINT64_C( 0), UINT64_C( 1),
UINT64_C( 0), UINT64_C( 0),
UINT64_C( 0), UINT64_C( 0),
UINT64_C( 0), UINT64_C( 0)) },
{ simde_x_mm512_set_epu64(UINT64_C( 2129749246616352421), UINT64_C( 3930946101587052880),
UINT64_C( 6109596926925725236), UINT64_C(11931707044738783755),
UINT64_C( 3908684742628183808), UINT64_C(15775432521885308750),
UINT64_C( 2246668589251707300), UINT64_C( 9474721517893975343)),
simde_x_mm512_set_epu64(UINT64_C( 5181754748372749246), UINT64_C( 2283432752406648940),
UINT64_C(17825612137522679693), UINT64_C( 6205295972918594513),
UINT64_C( 7540605987113962845), UINT64_C(13935122940778806069),
UINT64_C( 9355601638871447350), UINT64_C(17674380633802211723)),
simde_x_mm512_set_epu64(UINT64_C( 0), UINT64_C( 1),
UINT64_C( 0), UINT64_C( 1),
UINT64_C( 0), UINT64_C( 1),
UINT64_C( 0), UINT64_C( 0)) },
{ simde_x_mm512_set_epu64(UINT64_C(11414694502393074802), UINT64_C( 5732351344186366329),
UINT64_C( 3673896834139808492), UINT64_C( 3472617261273378891),
UINT64_C( 530630724433960967), UINT64_C(13609194605976671651),
UINT64_C(17862411075628668824), UINT64_C( 6007180105039451483)),
simde_x_mm512_set_epu64(UINT64_C( 2597258637662508799), UINT64_C(10698877731456040415),
UINT64_C( 1281935105229028959), UINT64_C(13158200861647791958),
UINT64_C(17820547312174620134), UINT64_C(10241294226337238422),
UINT64_C(16849636328689785423), UINT64_C( 8515452077469772855)),
simde_x_mm512_set_epu64(UINT64_C( 4), UINT64_C( 0),
UINT64_C( 2), UINT64_C( 0),
UINT64_C( 0), UINT64_C( 1),
UINT64_C( 1), UINT64_C( 0)) },
{ simde_x_mm512_set_epu64(UINT64_C( 6286795626078602527), UINT64_C(16449737592791923437),
UINT64_C( 3423539900625568727), UINT64_C(14354768056262433624),
UINT64_C(13276435385586003544), UINT64_C(13226616968333580034),
UINT64_C(13803418519385186873), UINT64_C(17664506654225712980)),
simde_x_mm512_set_epu64(UINT64_C( 8577263429665049091), UINT64_C( 1989107677696558897),
UINT64_C(10076739928573503462), UINT64_C(11128938736014461142),
UINT64_C(16921205335142546091), UINT64_C( 8618363237326703628),
UINT64_C( 6584836091306452136), UINT64_C( 7260043819054420427)),
simde_x_mm512_set_epu64(UINT64_C( 0), UINT64_C( 8),
UINT64_C( 0), UINT64_C( 1),
UINT64_C( 0), UINT64_C( 1),
UINT64_C( 2), UINT64_C( 2)) },
{ simde_x_mm512_set_epu64(UINT64_C( 3903334154292354714), UINT64_C( 8869267046373815529),
UINT64_C( 6916283752571091217), UINT64_C( 8726009290759968207),
UINT64_C(10071350786374349244), UINT64_C( 8496158362035250512),
UINT64_C(17368098678232675634), UINT64_C( 1777515526450307184)),
simde_x_mm512_set_epu64(UINT64_C( 5278336582045705857), UINT64_C(12066730073134673033),
UINT64_C( 7590368039103504017), UINT64_C( 5001217194949514725),
UINT64_C(15479073382423099957), UINT64_C( 9832610448471819123),
UINT64_C( 6754177049630551103), UINT64_C(10305112663885051469)),
simde_x_mm512_set_epu64(UINT64_C( 0), UINT64_C( 0),
UINT64_C( 0), UINT64_C( 1),
UINT64_C( 0), UINT64_C( 0),
UINT64_C( 2), UINT64_C( 0)) },
{ simde_x_mm512_set_epu64(UINT64_C( 5348983348701791658), UINT64_C(10148639760639402834),
UINT64_C(10174807539574872867), UINT64_C(13279516658136916303),
UINT64_C( 7338742772279280569), UINT64_C( 9396295244612029630),
UINT64_C(16685506566149927992), UINT64_C(10552022463454113501)),
simde_x_mm512_set_epu64(UINT64_C(16811669128702212682), UINT64_C(18047205824811442812),
UINT64_C(18028153300578966352), UINT64_C(16837207357260532002),
UINT64_C( 1694596378460381816), UINT64_C( 7292544047935022069),
UINT64_C( 616022812148352233), UINT64_C( 2502282222097948969)),
simde_x_mm512_set_epu64(UINT64_C( 0), UINT64_C( 0),
UINT64_C( 0), UINT64_C( 0),
UINT64_C( 4), UINT64_C( 1),
UINT64_C( 27), UINT64_C( 4)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_div_epu64(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_u64(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_erf_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 449.73), SIMDE_FLOAT32_C( -898.83), SIMDE_FLOAT32_C( 193.72), SIMDE_FLOAT32_C( -793.70) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00) } },
{ { SIMDE_FLOAT32_C( 434.26), SIMDE_FLOAT32_C( 437.61), SIMDE_FLOAT32_C( -29.18), SIMDE_FLOAT32_C( -288.39) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00) } },
{ { SIMDE_FLOAT32_C( -989.93), SIMDE_FLOAT32_C( -799.36), SIMDE_FLOAT32_C( 150.13), SIMDE_FLOAT32_C( 690.23) },
{ SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( -667.63), SIMDE_FLOAT32_C( -368.07), SIMDE_FLOAT32_C( 316.47), SIMDE_FLOAT32_C( 916.61) },
{ SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( 256.26), SIMDE_FLOAT32_C( -321.94), SIMDE_FLOAT32_C( 111.81), SIMDE_FLOAT32_C( -665.54) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00) } },
{ { SIMDE_FLOAT32_C( 169.01), SIMDE_FLOAT32_C( -375.29), SIMDE_FLOAT32_C( -768.83), SIMDE_FLOAT32_C( 166.33) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( 327.83), SIMDE_FLOAT32_C( -583.11), SIMDE_FLOAT32_C( 452.18), SIMDE_FLOAT32_C( -922.36) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00) } },
{ { SIMDE_FLOAT32_C( 33.53), SIMDE_FLOAT32_C( -944.72), SIMDE_FLOAT32_C( -608.58), SIMDE_FLOAT32_C( -516.73) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_erf_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_erf_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -733.03), SIMDE_FLOAT64_C( -222.93) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( -762.35), SIMDE_FLOAT64_C( -559.95) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( -868.93), SIMDE_FLOAT64_C( -580.21) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( 299.67), SIMDE_FLOAT64_C( -439.96) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( -152.35), SIMDE_FLOAT64_C( 5.07) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( 40.68), SIMDE_FLOAT64_C( -726.52) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( 642.06), SIMDE_FLOAT64_C( -970.77) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( 563.08), SIMDE_FLOAT64_C( -718.61) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_erf_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_erf_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 374.20), SIMDE_FLOAT32_C( -943.32), SIMDE_FLOAT32_C( -503.43), SIMDE_FLOAT32_C( -980.91),
SIMDE_FLOAT32_C( 588.09), SIMDE_FLOAT32_C( 116.98), SIMDE_FLOAT32_C( 159.00), SIMDE_FLOAT32_C( 60.92) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( 517.69), SIMDE_FLOAT32_C( 565.06), SIMDE_FLOAT32_C( 410.42), SIMDE_FLOAT32_C( 802.07),
SIMDE_FLOAT32_C( -337.69), SIMDE_FLOAT32_C( 790.63), SIMDE_FLOAT32_C( 48.57), SIMDE_FLOAT32_C( 385.99) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( 695.57), SIMDE_FLOAT32_C( -950.00), SIMDE_FLOAT32_C( 565.77), SIMDE_FLOAT32_C( -123.23),
SIMDE_FLOAT32_C( 205.87), SIMDE_FLOAT32_C( -194.42), SIMDE_FLOAT32_C( 803.30), SIMDE_FLOAT32_C( -901.24) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00) } },
{ { SIMDE_FLOAT32_C( 429.62), SIMDE_FLOAT32_C( -530.89), SIMDE_FLOAT32_C( 279.94), SIMDE_FLOAT32_C( 445.55),
SIMDE_FLOAT32_C( 34.20), SIMDE_FLOAT32_C( 333.48), SIMDE_FLOAT32_C( 841.52), SIMDE_FLOAT32_C( -591.60) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00) } },
{ { SIMDE_FLOAT32_C( 390.15), SIMDE_FLOAT32_C( -661.91), SIMDE_FLOAT32_C( -572.50), SIMDE_FLOAT32_C( -21.76),
SIMDE_FLOAT32_C( 455.07), SIMDE_FLOAT32_C( 586.50), SIMDE_FLOAT32_C( -960.84), SIMDE_FLOAT32_C( -27.24) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00) } },
{ { SIMDE_FLOAT32_C( 151.56), SIMDE_FLOAT32_C( 449.58), SIMDE_FLOAT32_C( -225.17), SIMDE_FLOAT32_C( 813.87),
SIMDE_FLOAT32_C( 240.21), SIMDE_FLOAT32_C( 823.40), SIMDE_FLOAT32_C( 199.87), SIMDE_FLOAT32_C( -64.22) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00) } },
{ { SIMDE_FLOAT32_C( 873.40), SIMDE_FLOAT32_C( -234.36), SIMDE_FLOAT32_C( 812.55), SIMDE_FLOAT32_C( 79.27),
SIMDE_FLOAT32_C( 571.22), SIMDE_FLOAT32_C( 615.85), SIMDE_FLOAT32_C( 178.03), SIMDE_FLOAT32_C( 0.84) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.77) } },
{ { SIMDE_FLOAT32_C( -915.04), SIMDE_FLOAT32_C( -542.03), SIMDE_FLOAT32_C( -553.61), SIMDE_FLOAT32_C( 119.16),
SIMDE_FLOAT32_C( 791.44), SIMDE_FLOAT32_C( -712.09), SIMDE_FLOAT32_C( 527.56), SIMDE_FLOAT32_C( 181.60) },
{ SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_erf_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_erf_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -313.70), SIMDE_FLOAT64_C( 714.53), SIMDE_FLOAT64_C( 927.20), SIMDE_FLOAT64_C( -898.10) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( 921.61), SIMDE_FLOAT64_C( 406.65), SIMDE_FLOAT64_C( 519.73), SIMDE_FLOAT64_C( -550.92) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( 655.77), SIMDE_FLOAT64_C( -305.99), SIMDE_FLOAT64_C( -29.82), SIMDE_FLOAT64_C( -266.26) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( 47.11), SIMDE_FLOAT64_C( 991.16), SIMDE_FLOAT64_C( -298.84), SIMDE_FLOAT64_C( 426.24) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( -122.46), SIMDE_FLOAT64_C( 928.48), SIMDE_FLOAT64_C( -151.69), SIMDE_FLOAT64_C( -677.70) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( -184.81), SIMDE_FLOAT64_C( -799.82), SIMDE_FLOAT64_C( 978.74), SIMDE_FLOAT64_C( -554.85) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( 83.95), SIMDE_FLOAT64_C( -400.78), SIMDE_FLOAT64_C( -165.64), SIMDE_FLOAT64_C( -926.09) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( 941.89), SIMDE_FLOAT64_C( 862.77), SIMDE_FLOAT64_C( 150.41), SIMDE_FLOAT64_C( -371.81) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_erf_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_erf_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -838.40), SIMDE_FLOAT32_C( 872.70), SIMDE_FLOAT32_C( 438.38), SIMDE_FLOAT32_C( -298.62),
SIMDE_FLOAT32_C( 781.61), SIMDE_FLOAT32_C( 970.11), SIMDE_FLOAT32_C( 78.85), SIMDE_FLOAT32_C( 723.02),
SIMDE_FLOAT32_C( -818.83), SIMDE_FLOAT32_C( -579.07), SIMDE_FLOAT32_C( 251.53), SIMDE_FLOAT32_C( -753.80),
SIMDE_FLOAT32_C( 319.82), SIMDE_FLOAT32_C( 967.37), SIMDE_FLOAT32_C( 725.05), SIMDE_FLOAT32_C( 873.27) },
{ SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( -304.80), SIMDE_FLOAT32_C( 941.81), SIMDE_FLOAT32_C( -83.14), SIMDE_FLOAT32_C( -799.93),
SIMDE_FLOAT32_C( -339.09), SIMDE_FLOAT32_C( 125.84), SIMDE_FLOAT32_C( 891.08), SIMDE_FLOAT32_C( -989.54),
SIMDE_FLOAT32_C( 253.61), SIMDE_FLOAT32_C( 980.01), SIMDE_FLOAT32_C( 634.54), SIMDE_FLOAT32_C( 449.90),
SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( 271.95), SIMDE_FLOAT32_C( 654.57), SIMDE_FLOAT32_C( 624.56) },
{ SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( 144.65), SIMDE_FLOAT32_C( 92.95), SIMDE_FLOAT32_C( -674.06), SIMDE_FLOAT32_C( -73.74),
SIMDE_FLOAT32_C( 63.06), SIMDE_FLOAT32_C( 404.78), SIMDE_FLOAT32_C( -350.71), SIMDE_FLOAT32_C( 244.23),
SIMDE_FLOAT32_C( 825.71), SIMDE_FLOAT32_C( 900.82), SIMDE_FLOAT32_C( 490.43), SIMDE_FLOAT32_C( 145.53),
SIMDE_FLOAT32_C( 868.18), SIMDE_FLOAT32_C( 215.47), SIMDE_FLOAT32_C( 18.80), SIMDE_FLOAT32_C( -436.61) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00) } },
{ { SIMDE_FLOAT32_C( 157.28), SIMDE_FLOAT32_C( 935.67), SIMDE_FLOAT32_C( -236.55), SIMDE_FLOAT32_C( 818.19),
SIMDE_FLOAT32_C( 61.50), SIMDE_FLOAT32_C( -345.47), SIMDE_FLOAT32_C( 828.65), SIMDE_FLOAT32_C( -684.89),
SIMDE_FLOAT32_C( -365.46), SIMDE_FLOAT32_C( 463.19), SIMDE_FLOAT32_C( 765.01), SIMDE_FLOAT32_C( -902.51),
SIMDE_FLOAT32_C( -264.87), SIMDE_FLOAT32_C( 419.58), SIMDE_FLOAT32_C( 722.05), SIMDE_FLOAT32_C( 879.78) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( -487.47), SIMDE_FLOAT32_C( -952.01), SIMDE_FLOAT32_C( -193.96), SIMDE_FLOAT32_C( 575.59),
SIMDE_FLOAT32_C( 452.77), SIMDE_FLOAT32_C( 455.33), SIMDE_FLOAT32_C( -180.18), SIMDE_FLOAT32_C( 278.48),
SIMDE_FLOAT32_C( 356.14), SIMDE_FLOAT32_C( -689.76), SIMDE_FLOAT32_C( -575.99), SIMDE_FLOAT32_C( 224.33),
SIMDE_FLOAT32_C( 525.72), SIMDE_FLOAT32_C( 442.82), SIMDE_FLOAT32_C( 787.71), SIMDE_FLOAT32_C( -317.01) },
{ SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00) } },
{ { SIMDE_FLOAT32_C( 378.48), SIMDE_FLOAT32_C( -448.83), SIMDE_FLOAT32_C( -498.82), SIMDE_FLOAT32_C( -560.02),
SIMDE_FLOAT32_C( 205.70), SIMDE_FLOAT32_C( -670.17), SIMDE_FLOAT32_C( -244.90), SIMDE_FLOAT32_C( 840.24),
SIMDE_FLOAT32_C( 793.02), SIMDE_FLOAT32_C( -479.90), SIMDE_FLOAT32_C( 937.74), SIMDE_FLOAT32_C( -471.85),
SIMDE_FLOAT32_C( 939.68), SIMDE_FLOAT32_C( 659.79), SIMDE_FLOAT32_C( -592.07), SIMDE_FLOAT32_C( -547.79) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00) } },
{ { SIMDE_FLOAT32_C( 707.78), SIMDE_FLOAT32_C( 213.97), SIMDE_FLOAT32_C( -972.20), SIMDE_FLOAT32_C( 160.55),
SIMDE_FLOAT32_C( -330.70), SIMDE_FLOAT32_C( -152.38), SIMDE_FLOAT32_C( -560.98), SIMDE_FLOAT32_C( -974.56),
SIMDE_FLOAT32_C( 157.86), SIMDE_FLOAT32_C( -136.96), SIMDE_FLOAT32_C( 249.77), SIMDE_FLOAT32_C( -316.43),
SIMDE_FLOAT32_C( -694.15), SIMDE_FLOAT32_C( 37.48), SIMDE_FLOAT32_C( 366.57), SIMDE_FLOAT32_C( 684.33) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } },
{ { SIMDE_FLOAT32_C( 588.65), SIMDE_FLOAT32_C( 867.75), SIMDE_FLOAT32_C( -875.68), SIMDE_FLOAT32_C( -205.65),
SIMDE_FLOAT32_C( -802.42), SIMDE_FLOAT32_C( -120.59), SIMDE_FLOAT32_C( -365.41), SIMDE_FLOAT32_C( 990.60),
SIMDE_FLOAT32_C( 399.52), SIMDE_FLOAT32_C( -427.67), SIMDE_FLOAT32_C( -481.25), SIMDE_FLOAT32_C( 339.20),
SIMDE_FLOAT32_C( -767.88), SIMDE_FLOAT32_C( -73.32), SIMDE_FLOAT32_C( 791.41), SIMDE_FLOAT32_C( 939.89) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_erf_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_erf_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 60.80), SIMDE_FLOAT32_C( 224.62), SIMDE_FLOAT32_C( -945.35), SIMDE_FLOAT32_C( -219.00),
SIMDE_FLOAT32_C( 891.11), SIMDE_FLOAT32_C( 761.94), SIMDE_FLOAT32_C( 992.65), SIMDE_FLOAT32_C( 332.00),
SIMDE_FLOAT32_C( 387.85), SIMDE_FLOAT32_C( -689.44), SIMDE_FLOAT32_C( 195.76), SIMDE_FLOAT32_C( -335.77),
SIMDE_FLOAT32_C( -349.96), SIMDE_FLOAT32_C( -675.36), SIMDE_FLOAT32_C( 298.19), SIMDE_FLOAT32_C( 171.46) },
UINT8_C( 43),
{ SIMDE_FLOAT32_C( -593.03), SIMDE_FLOAT32_C( 241.03), SIMDE_FLOAT32_C( 550.96), SIMDE_FLOAT32_C( 496.03),
SIMDE_FLOAT32_C( -94.31), SIMDE_FLOAT32_C( -581.85), SIMDE_FLOAT32_C( -755.59), SIMDE_FLOAT32_C( 80.74),
SIMDE_FLOAT32_C( 755.01), SIMDE_FLOAT32_C( 520.11), SIMDE_FLOAT32_C( 62.41), SIMDE_FLOAT32_C( -580.00),
SIMDE_FLOAT32_C( 448.06), SIMDE_FLOAT32_C( -303.73), SIMDE_FLOAT32_C( 480.80), SIMDE_FLOAT32_C( -327.32) },
{ SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -945.35), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 891.11), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 992.65), SIMDE_FLOAT32_C( 332.00),
SIMDE_FLOAT32_C( 387.85), SIMDE_FLOAT32_C( -689.44), SIMDE_FLOAT32_C( 195.76), SIMDE_FLOAT32_C( -335.77),
SIMDE_FLOAT32_C( -349.96), SIMDE_FLOAT32_C( -675.36), SIMDE_FLOAT32_C( 298.19), SIMDE_FLOAT32_C( 171.46) } },
{ { SIMDE_FLOAT32_C( -249.08), SIMDE_FLOAT32_C( -738.20), SIMDE_FLOAT32_C( -436.21), SIMDE_FLOAT32_C( -487.13),
SIMDE_FLOAT32_C( -745.54), SIMDE_FLOAT32_C( 895.79), SIMDE_FLOAT32_C( 900.71), SIMDE_FLOAT32_C( -434.99),
SIMDE_FLOAT32_C( 91.55), SIMDE_FLOAT32_C( -435.06), SIMDE_FLOAT32_C( 215.05), SIMDE_FLOAT32_C( 416.20),
SIMDE_FLOAT32_C( 863.14), SIMDE_FLOAT32_C( -613.49), SIMDE_FLOAT32_C( -739.87), SIMDE_FLOAT32_C( -729.89) },
UINT8_C(228),
{ SIMDE_FLOAT32_C( 811.10), SIMDE_FLOAT32_C( 766.14), SIMDE_FLOAT32_C( -466.77), SIMDE_FLOAT32_C( -770.76),
SIMDE_FLOAT32_C( -989.45), SIMDE_FLOAT32_C( 613.97), SIMDE_FLOAT32_C( 984.25), SIMDE_FLOAT32_C( 530.66),
SIMDE_FLOAT32_C( -323.62), SIMDE_FLOAT32_C( -595.75), SIMDE_FLOAT32_C( -21.28), SIMDE_FLOAT32_C( 372.65),
SIMDE_FLOAT32_C( 885.05), SIMDE_FLOAT32_C( 651.40), SIMDE_FLOAT32_C( -876.43), SIMDE_FLOAT32_C( -853.15) },
{ SIMDE_FLOAT32_C( -249.08), SIMDE_FLOAT32_C( -738.20), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -487.13),
SIMDE_FLOAT32_C( -745.54), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 91.55), SIMDE_FLOAT32_C( -435.06), SIMDE_FLOAT32_C( 215.05), SIMDE_FLOAT32_C( 416.20),
SIMDE_FLOAT32_C( 863.14), SIMDE_FLOAT32_C( -613.49), SIMDE_FLOAT32_C( -739.87), SIMDE_FLOAT32_C( -729.89) } },
{ { SIMDE_FLOAT32_C( -784.80), SIMDE_FLOAT32_C( -363.56), SIMDE_FLOAT32_C( -598.70), SIMDE_FLOAT32_C( -889.01),
SIMDE_FLOAT32_C( -462.85), SIMDE_FLOAT32_C( -33.68), SIMDE_FLOAT32_C( 202.54), SIMDE_FLOAT32_C( 102.09),
SIMDE_FLOAT32_C( -818.63), SIMDE_FLOAT32_C( -381.26), SIMDE_FLOAT32_C( -34.77), SIMDE_FLOAT32_C( -432.12),
SIMDE_FLOAT32_C( -121.13), SIMDE_FLOAT32_C( 235.34), SIMDE_FLOAT32_C( -804.58), SIMDE_FLOAT32_C( -310.04) },
UINT8_C(218),
{ SIMDE_FLOAT32_C( -271.35), SIMDE_FLOAT32_C( -80.79), SIMDE_FLOAT32_C( 12.03), SIMDE_FLOAT32_C( -657.38),
SIMDE_FLOAT32_C( -96.55), SIMDE_FLOAT32_C( -457.32), SIMDE_FLOAT32_C( 19.00), SIMDE_FLOAT32_C( 307.70),
SIMDE_FLOAT32_C( 521.41), SIMDE_FLOAT32_C( -608.35), SIMDE_FLOAT32_C( 192.75), SIMDE_FLOAT32_C( 172.81),
SIMDE_FLOAT32_C( -484.78), SIMDE_FLOAT32_C( 339.60), SIMDE_FLOAT32_C( 388.01), SIMDE_FLOAT32_C( 151.65) },
{ SIMDE_FLOAT32_C( -784.80), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -598.70), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -33.68), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -818.63), SIMDE_FLOAT32_C( -381.26), SIMDE_FLOAT32_C( -34.77), SIMDE_FLOAT32_C( -432.12),
SIMDE_FLOAT32_C( -121.13), SIMDE_FLOAT32_C( 235.34), SIMDE_FLOAT32_C( -804.58), SIMDE_FLOAT32_C( -310.04) } },
{ { SIMDE_FLOAT32_C( 740.90), SIMDE_FLOAT32_C( 498.99), SIMDE_FLOAT32_C( 688.80), SIMDE_FLOAT32_C( -292.78),
SIMDE_FLOAT32_C( -298.47), SIMDE_FLOAT32_C( -209.10), SIMDE_FLOAT32_C( -111.42), SIMDE_FLOAT32_C( 320.27),
SIMDE_FLOAT32_C( 756.13), SIMDE_FLOAT32_C( 456.46), SIMDE_FLOAT32_C( -800.86), SIMDE_FLOAT32_C( -8.53),
SIMDE_FLOAT32_C( 651.88), SIMDE_FLOAT32_C( -110.90), SIMDE_FLOAT32_C( 992.95), SIMDE_FLOAT32_C( -619.48) },
UINT8_C(168),
{ SIMDE_FLOAT32_C( 4.98), SIMDE_FLOAT32_C( -276.86), SIMDE_FLOAT32_C( -288.24), SIMDE_FLOAT32_C( 547.66),
SIMDE_FLOAT32_C( 742.14), SIMDE_FLOAT32_C( -980.53), SIMDE_FLOAT32_C( 69.07), SIMDE_FLOAT32_C( -866.21),
SIMDE_FLOAT32_C( 212.21), SIMDE_FLOAT32_C( -758.12), SIMDE_FLOAT32_C( -351.00), SIMDE_FLOAT32_C( -448.19),
SIMDE_FLOAT32_C( 629.88), SIMDE_FLOAT32_C( 800.65), SIMDE_FLOAT32_C( -707.29), SIMDE_FLOAT32_C( 128.87) },
{ SIMDE_FLOAT32_C( 740.90), SIMDE_FLOAT32_C( 498.99), SIMDE_FLOAT32_C( 688.80), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( -298.47), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -111.42), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 756.13), SIMDE_FLOAT32_C( 456.46), SIMDE_FLOAT32_C( -800.86), SIMDE_FLOAT32_C( -8.53),
SIMDE_FLOAT32_C( 651.88), SIMDE_FLOAT32_C( -110.90), SIMDE_FLOAT32_C( 992.95), SIMDE_FLOAT32_C( -619.48) } },
{ { SIMDE_FLOAT32_C( 489.46), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 830.41), SIMDE_FLOAT32_C( -719.65),
SIMDE_FLOAT32_C( 888.51), SIMDE_FLOAT32_C( 150.68), SIMDE_FLOAT32_C( -963.52), SIMDE_FLOAT32_C( 344.97),
SIMDE_FLOAT32_C( 349.82), SIMDE_FLOAT32_C( 27.95), SIMDE_FLOAT32_C( -3.15), SIMDE_FLOAT32_C( -761.08),
SIMDE_FLOAT32_C( 20.90), SIMDE_FLOAT32_C( 377.37), SIMDE_FLOAT32_C( -952.77), SIMDE_FLOAT32_C( -974.12) },
UINT8_C( 30),
{ SIMDE_FLOAT32_C( -241.01), SIMDE_FLOAT32_C( 573.54), SIMDE_FLOAT32_C( 842.66), SIMDE_FLOAT32_C( -221.54),
SIMDE_FLOAT32_C( -357.39), SIMDE_FLOAT32_C( 976.44), SIMDE_FLOAT32_C( 990.67), SIMDE_FLOAT32_C( -115.52),
SIMDE_FLOAT32_C( -374.55), SIMDE_FLOAT32_C( -457.51), SIMDE_FLOAT32_C( -485.63), SIMDE_FLOAT32_C( -573.90),
SIMDE_FLOAT32_C( -164.80), SIMDE_FLOAT32_C( 643.24), SIMDE_FLOAT32_C( 915.55), SIMDE_FLOAT32_C( 835.12) },
{ SIMDE_FLOAT32_C( 489.46), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 150.68), SIMDE_FLOAT32_C( -963.52), SIMDE_FLOAT32_C( 344.97),
SIMDE_FLOAT32_C( 349.82), SIMDE_FLOAT32_C( 27.95), SIMDE_FLOAT32_C( -3.15), SIMDE_FLOAT32_C( -761.08),
SIMDE_FLOAT32_C( 20.90), SIMDE_FLOAT32_C( 377.37), SIMDE_FLOAT32_C( -952.77), SIMDE_FLOAT32_C( -974.12) } },
{ { SIMDE_FLOAT32_C( 473.65), SIMDE_FLOAT32_C( -804.09), SIMDE_FLOAT32_C( 723.64), SIMDE_FLOAT32_C( -375.67),
SIMDE_FLOAT32_C( -767.61), SIMDE_FLOAT32_C( 68.61), SIMDE_FLOAT32_C( 974.15), SIMDE_FLOAT32_C( 260.34),
SIMDE_FLOAT32_C( -934.54), SIMDE_FLOAT32_C( -786.93), SIMDE_FLOAT32_C( -718.76), SIMDE_FLOAT32_C( 442.83),
SIMDE_FLOAT32_C( -739.70), SIMDE_FLOAT32_C( -692.88), SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( 19.29) },
UINT8_C(120),
{ SIMDE_FLOAT32_C( 386.01), SIMDE_FLOAT32_C( 797.75), SIMDE_FLOAT32_C( -476.73), SIMDE_FLOAT32_C( 362.46),
SIMDE_FLOAT32_C( 788.43), SIMDE_FLOAT32_C( 407.75), SIMDE_FLOAT32_C( 987.90), SIMDE_FLOAT32_C( -669.09),
SIMDE_FLOAT32_C( 922.12), SIMDE_FLOAT32_C( -586.00), SIMDE_FLOAT32_C( 166.11), SIMDE_FLOAT32_C( 565.36),
SIMDE_FLOAT32_C( -670.44), SIMDE_FLOAT32_C( 1.23), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -474.54) },
{ SIMDE_FLOAT32_C( 473.65), SIMDE_FLOAT32_C( -804.09), SIMDE_FLOAT32_C( 723.64), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 260.34),
SIMDE_FLOAT32_C( -934.54), SIMDE_FLOAT32_C( -786.93), SIMDE_FLOAT32_C( -718.76), SIMDE_FLOAT32_C( 442.83),
SIMDE_FLOAT32_C( -739.70), SIMDE_FLOAT32_C( -692.88), SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( 19.29) } },
{ { SIMDE_FLOAT32_C( -275.13), SIMDE_FLOAT32_C( 663.34), SIMDE_FLOAT32_C( -242.15), SIMDE_FLOAT32_C( 793.48),
SIMDE_FLOAT32_C( 637.49), SIMDE_FLOAT32_C( -981.81), SIMDE_FLOAT32_C( 858.94), SIMDE_FLOAT32_C( 850.55),
SIMDE_FLOAT32_C( -700.57), SIMDE_FLOAT32_C( 301.77), SIMDE_FLOAT32_C( -889.15), SIMDE_FLOAT32_C( -393.45),
SIMDE_FLOAT32_C( -154.87), SIMDE_FLOAT32_C( 130.14), SIMDE_FLOAT32_C( -512.79), SIMDE_FLOAT32_C( -768.86) },
UINT8_C( 73),
{ SIMDE_FLOAT32_C( 10.48), SIMDE_FLOAT32_C( 593.59), SIMDE_FLOAT32_C( -283.68), SIMDE_FLOAT32_C( -581.77),
SIMDE_FLOAT32_C( 581.50), SIMDE_FLOAT32_C( 47.23), SIMDE_FLOAT32_C( -659.65), SIMDE_FLOAT32_C( 995.50),
SIMDE_FLOAT32_C( -786.66), SIMDE_FLOAT32_C( 905.71), SIMDE_FLOAT32_C( -674.95), SIMDE_FLOAT32_C( 214.58),
SIMDE_FLOAT32_C( -55.28), SIMDE_FLOAT32_C( -149.49), SIMDE_FLOAT32_C( 939.45), SIMDE_FLOAT32_C( -391.94) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 663.34), SIMDE_FLOAT32_C( -242.15), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 637.49), SIMDE_FLOAT32_C( -981.81), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 850.55),
SIMDE_FLOAT32_C( -700.57), SIMDE_FLOAT32_C( 301.77), SIMDE_FLOAT32_C( -889.15), SIMDE_FLOAT32_C( -393.45),
SIMDE_FLOAT32_C( -154.87), SIMDE_FLOAT32_C( 130.14), SIMDE_FLOAT32_C( -512.79), SIMDE_FLOAT32_C( -768.86) } },
{ { SIMDE_FLOAT32_C( 608.36), SIMDE_FLOAT32_C( 732.93), SIMDE_FLOAT32_C( -754.45), SIMDE_FLOAT32_C( 626.55),
SIMDE_FLOAT32_C( 591.86), SIMDE_FLOAT32_C( -903.90), SIMDE_FLOAT32_C( 925.98), SIMDE_FLOAT32_C( -106.36),
SIMDE_FLOAT32_C( -793.05), SIMDE_FLOAT32_C( -467.47), SIMDE_FLOAT32_C( 738.77), SIMDE_FLOAT32_C( 337.09),
SIMDE_FLOAT32_C( 19.74), SIMDE_FLOAT32_C( 969.90), SIMDE_FLOAT32_C( -735.01), SIMDE_FLOAT32_C( -969.78) },
UINT8_C(189),
{ SIMDE_FLOAT32_C( -18.69), SIMDE_FLOAT32_C( -551.55), SIMDE_FLOAT32_C( 144.99), SIMDE_FLOAT32_C( -971.46),
SIMDE_FLOAT32_C( -211.20), SIMDE_FLOAT32_C( 140.49), SIMDE_FLOAT32_C( -758.11), SIMDE_FLOAT32_C( -305.49),
SIMDE_FLOAT32_C( 465.54), SIMDE_FLOAT32_C( 456.46), SIMDE_FLOAT32_C( 639.24), SIMDE_FLOAT32_C( -683.94),
SIMDE_FLOAT32_C( 395.91), SIMDE_FLOAT32_C( -752.70), SIMDE_FLOAT32_C( 924.42), SIMDE_FLOAT32_C( 128.84) },
{ SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 732.93), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 925.98), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -793.05), SIMDE_FLOAT32_C( -467.47), SIMDE_FLOAT32_C( 738.77), SIMDE_FLOAT32_C( 337.09),
SIMDE_FLOAT32_C( 19.74), SIMDE_FLOAT32_C( 969.90), SIMDE_FLOAT32_C( -735.01), SIMDE_FLOAT32_C( -969.78) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_erf_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_erf_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 287.12), SIMDE_FLOAT64_C( 923.43), SIMDE_FLOAT64_C( -235.47), SIMDE_FLOAT64_C( -270.63),
SIMDE_FLOAT64_C( 872.91), SIMDE_FLOAT64_C( 62.22), SIMDE_FLOAT64_C( -259.06), SIMDE_FLOAT64_C( 509.74) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( -381.16), SIMDE_FLOAT64_C( -659.69), SIMDE_FLOAT64_C( 397.49), SIMDE_FLOAT64_C( -803.01),
SIMDE_FLOAT64_C( -467.01), SIMDE_FLOAT64_C( -777.46), SIMDE_FLOAT64_C( -995.46), SIMDE_FLOAT64_C( -455.46) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00),
SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( 412.93), SIMDE_FLOAT64_C( 31.33), SIMDE_FLOAT64_C( 675.90), SIMDE_FLOAT64_C( 842.14),
SIMDE_FLOAT64_C( 999.42), SIMDE_FLOAT64_C( -210.59), SIMDE_FLOAT64_C( 469.06), SIMDE_FLOAT64_C( -204.67) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( 194.13), SIMDE_FLOAT64_C( 752.63), SIMDE_FLOAT64_C( 950.43), SIMDE_FLOAT64_C( 627.80),
SIMDE_FLOAT64_C( 3.93), SIMDE_FLOAT64_C( -80.48), SIMDE_FLOAT64_C( -738.99), SIMDE_FLOAT64_C( -708.95) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( -157.05), SIMDE_FLOAT64_C( 25.54), SIMDE_FLOAT64_C( 20.42), SIMDE_FLOAT64_C( -284.15),
SIMDE_FLOAT64_C( -912.24), SIMDE_FLOAT64_C( 761.36), SIMDE_FLOAT64_C( -774.41), SIMDE_FLOAT64_C( -293.40) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00),
SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( -898.33), SIMDE_FLOAT64_C( 623.08), SIMDE_FLOAT64_C( -96.41), SIMDE_FLOAT64_C( -365.34),
SIMDE_FLOAT64_C( 845.62), SIMDE_FLOAT64_C( -91.87), SIMDE_FLOAT64_C( 179.19), SIMDE_FLOAT64_C( 258.55) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( 939.45), SIMDE_FLOAT64_C( -144.90), SIMDE_FLOAT64_C( 100.69), SIMDE_FLOAT64_C( 938.87),
SIMDE_FLOAT64_C( 644.51), SIMDE_FLOAT64_C( -430.25), SIMDE_FLOAT64_C( -265.80), SIMDE_FLOAT64_C( -161.37) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( -677.63), SIMDE_FLOAT64_C( -315.37), SIMDE_FLOAT64_C( -533.56), SIMDE_FLOAT64_C( 326.31),
SIMDE_FLOAT64_C( 604.15), SIMDE_FLOAT64_C( -272.55), SIMDE_FLOAT64_C( 617.36), SIMDE_FLOAT64_C( -552.90) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_erf_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_erf_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -475.71), SIMDE_FLOAT64_C( -480.68), SIMDE_FLOAT64_C( 251.56), SIMDE_FLOAT64_C( 974.57),
SIMDE_FLOAT64_C( -654.33), SIMDE_FLOAT64_C( 974.69), SIMDE_FLOAT64_C( -443.19), SIMDE_FLOAT64_C( 343.95) },
UINT8_C(224),
{ SIMDE_FLOAT64_C( -493.29), SIMDE_FLOAT64_C( -325.36), SIMDE_FLOAT64_C( -887.40), SIMDE_FLOAT64_C( -727.34),
SIMDE_FLOAT64_C( -936.73), SIMDE_FLOAT64_C( 654.69), SIMDE_FLOAT64_C( 988.04), SIMDE_FLOAT64_C( -361.17) },
{ SIMDE_FLOAT64_C( -475.71), SIMDE_FLOAT64_C( -480.68), SIMDE_FLOAT64_C( 251.56), SIMDE_FLOAT64_C( 974.57),
SIMDE_FLOAT64_C( -654.33), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( 370.27), SIMDE_FLOAT64_C( 594.68), SIMDE_FLOAT64_C( -149.62), SIMDE_FLOAT64_C( -535.38),
SIMDE_FLOAT64_C( 277.92), SIMDE_FLOAT64_C( -615.67), SIMDE_FLOAT64_C( -531.54), SIMDE_FLOAT64_C( 583.79) },
UINT8_C(113),
{ SIMDE_FLOAT64_C( -420.19), SIMDE_FLOAT64_C( -624.33), SIMDE_FLOAT64_C( -915.05), SIMDE_FLOAT64_C( -155.08),
SIMDE_FLOAT64_C( 757.99), SIMDE_FLOAT64_C( -390.77), SIMDE_FLOAT64_C( 364.24), SIMDE_FLOAT64_C( 9.55) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 594.68), SIMDE_FLOAT64_C( -149.62), SIMDE_FLOAT64_C( -535.38),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 583.79) } },
{ { SIMDE_FLOAT64_C( -416.20), SIMDE_FLOAT64_C( 709.91), SIMDE_FLOAT64_C( -15.76), SIMDE_FLOAT64_C( 140.62),
SIMDE_FLOAT64_C( 53.86), SIMDE_FLOAT64_C( -954.63), SIMDE_FLOAT64_C( 647.32), SIMDE_FLOAT64_C( 728.50) },
UINT8_C(252),
{ SIMDE_FLOAT64_C( 919.98), SIMDE_FLOAT64_C( 791.78), SIMDE_FLOAT64_C( 812.66), SIMDE_FLOAT64_C( 908.02),
SIMDE_FLOAT64_C( -569.39), SIMDE_FLOAT64_C( 182.93), SIMDE_FLOAT64_C( 502.70), SIMDE_FLOAT64_C( 280.99) },
{ SIMDE_FLOAT64_C( -416.20), SIMDE_FLOAT64_C( 709.91), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( 647.55), SIMDE_FLOAT64_C( -219.38), SIMDE_FLOAT64_C( 665.32), SIMDE_FLOAT64_C( -883.99),
SIMDE_FLOAT64_C( -635.59), SIMDE_FLOAT64_C( -276.35), SIMDE_FLOAT64_C( -304.18), SIMDE_FLOAT64_C( -259.92) },
UINT8_C( 7),
{ SIMDE_FLOAT64_C( 540.74), SIMDE_FLOAT64_C( -501.92), SIMDE_FLOAT64_C( 417.83), SIMDE_FLOAT64_C( -95.02),
SIMDE_FLOAT64_C( 507.63), SIMDE_FLOAT64_C( -998.37), SIMDE_FLOAT64_C( -385.10), SIMDE_FLOAT64_C( -508.13) },
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -883.99),
SIMDE_FLOAT64_C( -635.59), SIMDE_FLOAT64_C( -276.35), SIMDE_FLOAT64_C( -304.18), SIMDE_FLOAT64_C( -259.92) } },
{ { SIMDE_FLOAT64_C( 142.25), SIMDE_FLOAT64_C( 668.76), SIMDE_FLOAT64_C( -462.76), SIMDE_FLOAT64_C( -210.42),
SIMDE_FLOAT64_C( 397.27), SIMDE_FLOAT64_C( -304.79), SIMDE_FLOAT64_C( -290.44), SIMDE_FLOAT64_C( 189.04) },
UINT8_C(184),
{ SIMDE_FLOAT64_C( -382.42), SIMDE_FLOAT64_C( 619.65), SIMDE_FLOAT64_C( 690.79), SIMDE_FLOAT64_C( -879.72),
SIMDE_FLOAT64_C( -99.35), SIMDE_FLOAT64_C( 338.34), SIMDE_FLOAT64_C( -99.10), SIMDE_FLOAT64_C( -434.03) },
{ SIMDE_FLOAT64_C( 142.25), SIMDE_FLOAT64_C( 668.76), SIMDE_FLOAT64_C( -462.76), SIMDE_FLOAT64_C( -1.00),
SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -290.44), SIMDE_FLOAT64_C( -1.00) } },
{ { SIMDE_FLOAT64_C( 454.35), SIMDE_FLOAT64_C( 265.31), SIMDE_FLOAT64_C( 289.62), SIMDE_FLOAT64_C( -849.83),
SIMDE_FLOAT64_C( -994.61), SIMDE_FLOAT64_C( -901.78), SIMDE_FLOAT64_C( 690.91), SIMDE_FLOAT64_C( -496.53) },
UINT8_C( 88),
{ SIMDE_FLOAT64_C( -404.11), SIMDE_FLOAT64_C( -988.90), SIMDE_FLOAT64_C( 517.68), SIMDE_FLOAT64_C( 210.79),
SIMDE_FLOAT64_C( -497.03), SIMDE_FLOAT64_C( -340.06), SIMDE_FLOAT64_C( -120.45), SIMDE_FLOAT64_C( 40.21) },
{ SIMDE_FLOAT64_C( 454.35), SIMDE_FLOAT64_C( 265.31), SIMDE_FLOAT64_C( 289.62), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -901.78), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -496.53) } },
{ { SIMDE_FLOAT64_C( 449.51), SIMDE_FLOAT64_C( -723.18), SIMDE_FLOAT64_C( 735.42), SIMDE_FLOAT64_C( -840.92),
SIMDE_FLOAT64_C( 465.86), SIMDE_FLOAT64_C( -756.71), SIMDE_FLOAT64_C( -223.34), SIMDE_FLOAT64_C( 85.52) },
UINT8_C(226),
{ SIMDE_FLOAT64_C( -103.06), SIMDE_FLOAT64_C( 986.16), SIMDE_FLOAT64_C( 272.42), SIMDE_FLOAT64_C( 797.84),
SIMDE_FLOAT64_C( -447.86), SIMDE_FLOAT64_C( -273.23), SIMDE_FLOAT64_C( 63.15), SIMDE_FLOAT64_C( 841.76) },
{ SIMDE_FLOAT64_C( 449.51), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 735.42), SIMDE_FLOAT64_C( -840.92),
SIMDE_FLOAT64_C( 465.86), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00) } },
{ { SIMDE_FLOAT64_C( -123.06), SIMDE_FLOAT64_C( 68.54), SIMDE_FLOAT64_C( 939.98), SIMDE_FLOAT64_C( -432.16),
SIMDE_FLOAT64_C( 572.01), SIMDE_FLOAT64_C( 456.03), SIMDE_FLOAT64_C( 163.74), SIMDE_FLOAT64_C( 583.10) },
UINT8_C(247),
{ SIMDE_FLOAT64_C( -625.47), SIMDE_FLOAT64_C( -913.93), SIMDE_FLOAT64_C( 633.64), SIMDE_FLOAT64_C( 254.08),
SIMDE_FLOAT64_C( 126.28), SIMDE_FLOAT64_C( 83.16), SIMDE_FLOAT64_C( 530.89), SIMDE_FLOAT64_C( -138.30) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -432.16),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -1.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_erf_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_erfinv_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.29) },
{ SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.26) } },
{ { SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( 0.67) },
{ SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( 0.69) } },
{ { SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.25) },
{ SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.23) } },
{ { SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.13) },
{ SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 0.12) } },
{ { SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( 0.35) },
{ SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -1.20), SIMDE_FLOAT32_C( -1.13), SIMDE_FLOAT32_C( 0.32) } },
{ { SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( -0.27) },
{ SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.24) } },
{ { SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.47) },
{ SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( -0.44) } },
{ { SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.73) },
{ SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( 0.78) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_erfinv_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_erfinv_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( -0.59) },
{ SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( -0.58) } },
{ { SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( -0.15) },
{ SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( -0.13) } },
{ { SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.24) },
{ SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( -0.22) } },
{ { SIMDE_FLOAT64_C( -0.02), SIMDE_FLOAT64_C( 0.81) },
{ SIMDE_FLOAT64_C( -0.02), SIMDE_FLOAT64_C( 0.93) } },
{ { SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.36) },
{ SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.33) } },
{ { SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.21) },
{ SIMDE_FLOAT64_C( -0.81), SIMDE_FLOAT64_C( 0.19) } },
{ { SIMDE_FLOAT64_C( -0.67), SIMDE_FLOAT64_C( -0.11) },
{ SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( -0.10) } },
{ { SIMDE_FLOAT64_C( -0.54), SIMDE_FLOAT64_C( -0.85) },
{ SIMDE_FLOAT64_C( -0.52), SIMDE_FLOAT64_C( -1.02) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_erfinv_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_erfinv_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( -0.73),
SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.30) },
{ SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( -1.20), SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( -0.78),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 0.27) } },
{ { SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.54) },
{ SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.36),
SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.52) } },
{ { SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( -0.65),
SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( -0.20) },
{ SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -0.18) } },
{ { SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( -0.60),
SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.71) },
{ SIMDE_FLOAT32_C( -1.16), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( -0.60),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 0.75) } },
{ { SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.51),
SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( -0.26) },
{ SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 0.49),
SIMDE_FLOAT32_C( -1.10), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( -0.23) } },
{ { SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.95),
SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.64) },
{ SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( -1.04), SIMDE_FLOAT32_C( -1.39),
SIMDE_FLOAT32_C( -1.13), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( -0.65) } },
{ { SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.16) },
{ SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.10),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( -1.28), SIMDE_FLOAT32_C( 1.20), SIMDE_FLOAT32_C( 0.14) } },
{ { SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( -0.08) },
{ SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( -1.45), SIMDE_FLOAT32_C( -1.10), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( -0.07) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_erfinv_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_erfinv_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.01) },
{ SIMDE_FLOAT64_C( 0.41), SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( 0.01) } },
{ { SIMDE_FLOAT64_C( -0.06), SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( -0.30) },
{ SIMDE_FLOAT64_C( -0.05), SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( -1.02), SIMDE_FLOAT64_C( -0.27) } },
{ { SIMDE_FLOAT64_C( -0.05), SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.71), SIMDE_FLOAT64_C( 0.05) },
{ SIMDE_FLOAT64_C( -0.04), SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( 0.04) } },
{ { SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 0.86) },
{ SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 1.04) } },
{ { SIMDE_FLOAT64_C( -0.22), SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -0.94), SIMDE_FLOAT64_C( -0.31) },
{ SIMDE_FLOAT64_C( -0.20), SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( -1.33), SIMDE_FLOAT64_C( -0.28) } },
{ { SIMDE_FLOAT64_C( 0.82), SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -0.14), SIMDE_FLOAT64_C( 0.26) },
{ SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( 0.87), SIMDE_FLOAT64_C( -0.12), SIMDE_FLOAT64_C( 0.23) } },
{ { SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( 0.71), SIMDE_FLOAT64_C( -0.56) },
{ SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( -0.55) } },
{ { SIMDE_FLOAT64_C( -0.59), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.85) },
{ SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( 0.21), SIMDE_FLOAT64_C( 1.02) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_erfinv_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_erfinv_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.81), SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( 0.98),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 0.09),
SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.05) },
{ SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( -0.67),
SIMDE_FLOAT32_C( -1.10), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( 1.64),
SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( 0.08),
SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( -0.04) } },
{ { SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.33),
SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 0.85),
SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( 0.37),
SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( -0.91) },
{ SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( -1.82), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.30),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( 1.02),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -1.16), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( -1.20) } },
{ { SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( 0.86),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.21),
SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( -0.95), SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( -0.14),
SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.97) },
{ SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( 1.04),
SIMDE_FLOAT32_C( 1.82), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 1.02), SIMDE_FLOAT32_C( 0.19),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -1.39), SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( -0.12),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 1.53) } },
{ { SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.85),
SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.37),
SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( 0.58),
SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.70) },
{ SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -1.02),
SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( -1.82), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( -1.13), SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( -0.73) } },
{ { SIMDE_FLOAT32_C( -0.81), SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.81),
SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.56),
SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.03) },
{ SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( -1.13), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.93),
SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( -1.82), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( -0.83),
SIMDE_FLOAT32_C( -1.33), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.55),
SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.03) } },
{ { SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.57),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.85),
SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.14) },
{ SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( -0.56),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 1.28),
SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 1.02),
SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( -1.02), SIMDE_FLOAT32_C( 0.12) } },
{ { SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( -0.79),
SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( -0.61),
SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( -0.22),
SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( -0.85) },
{ SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -0.89),
SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( -0.61),
SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( -0.20),
SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( -1.02) } },
{ { SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.20),
SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.88),
SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.11) },
{ SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.18),
SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.44),
SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 1.10),
SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.10) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_erfinv_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_erfinv_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.41), SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( -0.46),
SIMDE_FLOAT64_C( -0.63), SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( -0.93), SIMDE_FLOAT64_C( -0.78) },
{ SIMDE_FLOAT64_C( -0.13), SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( -0.43),
SIMDE_FLOAT64_C( -0.63), SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( -1.28), SIMDE_FLOAT64_C( -0.87) } },
{ { SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( -0.02),
SIMDE_FLOAT64_C( -0.14), SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( -0.87), SIMDE_FLOAT64_C( 0.63) },
{ SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( -0.25), SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( -0.02),
SIMDE_FLOAT64_C( -0.12), SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( -1.07), SIMDE_FLOAT64_C( 0.63) } },
{ { SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( -0.49), SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.24),
SIMDE_FLOAT64_C( 0.35), SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -0.23) },
{ SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( -0.47), SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( 0.22),
SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( -0.21) } },
{ { SIMDE_FLOAT64_C( -0.23), SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.18),
SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( -0.66), SIMDE_FLOAT64_C( -0.24) },
{ SIMDE_FLOAT64_C( -0.21), SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( -0.83), SIMDE_FLOAT64_C( 0.16),
SIMDE_FLOAT64_C( 1.20), SIMDE_FLOAT64_C( -0.72), SIMDE_FLOAT64_C( -0.67), SIMDE_FLOAT64_C( -0.22) } },
{ { SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( -0.91),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.09) },
{ SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( -1.20),
SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.70), SIMDE_FLOAT64_C( -0.08) } },
{ { SIMDE_FLOAT64_C( -0.91), SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( -0.05),
SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( -0.97), SIMDE_FLOAT64_C( -0.41), SIMDE_FLOAT64_C( -0.43) },
{ SIMDE_FLOAT64_C( -1.20), SIMDE_FLOAT64_C( 1.20), SIMDE_FLOAT64_C( 1.13), SIMDE_FLOAT64_C( -0.04),
SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( -1.53), SIMDE_FLOAT64_C( -0.38), SIMDE_FLOAT64_C( -0.40) } },
{ { SIMDE_FLOAT64_C( -0.46), SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( 0.89),
SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( -0.51), SIMDE_FLOAT64_C( -0.34), SIMDE_FLOAT64_C( 0.82) },
{ SIMDE_FLOAT64_C( -0.43), SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 1.13),
SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( -0.49), SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( 0.95) } },
{ { SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( 0.21), SIMDE_FLOAT64_C( 0.35), SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( -0.07) },
{ SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 0.37),
SIMDE_FLOAT64_C( 0.19), SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( -0.06) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_erfinv_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_erfinv_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.15),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.82) },
UINT8_C(161),
{ SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( -0.81), SIMDE_FLOAT32_C( -0.18),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( -0.64),
SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( -0.03) },
{ SIMDE_FLOAT32_C( -1.28), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.56),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.15),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.82) } },
{ { SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.62),
SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 0.32),
SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.74),
SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.19) },
UINT8_C( 98),
{ SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.60),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -0.95),
SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 0.10) },
{ SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( -1.82), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.62),
SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( -1.04), SIMDE_FLOAT32_C( 0.32),
SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.74),
SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.19) } },
{ { SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( -0.38),
SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( -0.12),
SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.56),
SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -0.87) },
UINT8_C( 32),
{ SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.18),
SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.81),
SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( -0.57) },
{ SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( -0.38),
SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( -0.12),
SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.56),
SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -0.87) } },
{ { SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( -0.60),
SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.32),
SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.26),
SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.89) },
UINT8_C(177),
{ SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.58),
SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( -0.16),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( -0.81) },
{ SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( -0.60),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.26),
SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.89) } },
{ { SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( -0.95), SIMDE_FLOAT32_C( 0.90),
SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.63) },
UINT8_C( 55),
{ SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( -0.68),
SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( -0.73),
SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -0.59),
SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.18) },
{ SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 1.45), SIMDE_FLOAT32_C( 0.90),
SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.63) } },
{ { SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.43),
SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.86),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.58) },
UINT8_C( 30),
{ SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( -0.51),
SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.09),
SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( -0.06),
SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.03) },
{ SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( -0.49),
SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.86),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.58) } },
{ { SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.82),
SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( 0.84) },
UINT8_C( 89),
{ SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -0.78),
SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 0.22),
SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.65),
SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( 0.53) },
{ SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.87),
SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.82),
SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( 0.84) } },
{ { SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( -0.24),
SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 0.77),
SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.84) },
UINT8_C(239),
{ SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( 0.31),
SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( 0.05) },
{ SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( -1.33), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 1.13),
SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.28),
SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.84) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_erfinv_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_erfinv_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -0.86), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.87),
SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.70) },
UINT8_C(108),
{ SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( -0.20), SIMDE_FLOAT64_C( -0.78), SIMDE_FLOAT64_C( 0.69),
SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.50) },
{ SIMDE_FLOAT64_C( -0.86), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( -0.87), SIMDE_FLOAT64_C( 0.72),
SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( 0.70) } },
{ { SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -0.89), SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( 0.35),
SIMDE_FLOAT64_C( -0.47), SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( -0.79), SIMDE_FLOAT64_C( -0.77) },
UINT8_C(112),
{ SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( 0.54),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( -0.95), SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -0.01) },
{ SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -0.89), SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( 0.35),
SIMDE_FLOAT64_C( 0.21), SIMDE_FLOAT64_C( -1.39), SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( -0.77) } },
{ { SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( -0.06), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -0.51) },
UINT8_C(248),
{ SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( -0.96),
SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( 0.79) },
{ SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( -0.06), SIMDE_FLOAT64_C( -1.45),
SIMDE_FLOAT64_C( -0.83), SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( -1.02), SIMDE_FLOAT64_C( 0.89) } },
{ { SIMDE_FLOAT64_C( -0.97), SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.76),
SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( -0.63), SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( -0.66) },
UINT8_C( 18),
{ SIMDE_FLOAT64_C( -0.04), SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( -0.67), SIMDE_FLOAT64_C( 0.64),
SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.63), SIMDE_FLOAT64_C( 0.50) },
{ SIMDE_FLOAT64_C( -0.97), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.76),
SIMDE_FLOAT64_C( -0.41), SIMDE_FLOAT64_C( -0.63), SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( -0.66) } },
{ { SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( -0.28),
SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.55) },
UINT8_C( 45),
{ SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.80), SIMDE_FLOAT64_C( 0.28), SIMDE_FLOAT64_C( 0.60),
SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( -0.72), SIMDE_FLOAT64_C( -0.73) },
{ SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 0.60),
SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( 0.28), SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.55) } },
{ { SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.19), SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( -0.03),
SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( -0.08), SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 0.11) },
UINT8_C( 61),
{ SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.20),
SIMDE_FLOAT64_C( 0.16), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( -0.49), SIMDE_FLOAT64_C( 0.31) },
{ SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.19), SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 0.18),
SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 0.11) } },
{ { SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( -0.67), SIMDE_FLOAT64_C( -0.66), SIMDE_FLOAT64_C( 0.37),
SIMDE_FLOAT64_C( 0.88), SIMDE_FLOAT64_C( -0.06), SIMDE_FLOAT64_C( -0.17), SIMDE_FLOAT64_C( 0.67) },
UINT8_C(215),
{ SIMDE_FLOAT64_C( -0.57), SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( -0.29),
SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( -0.83) },
{ SIMDE_FLOAT64_C( -0.56), SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 0.37),
SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( -0.06), SIMDE_FLOAT64_C( 1.20), SIMDE_FLOAT64_C( -0.97) } },
{ { SIMDE_FLOAT64_C( -0.94), SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( -0.65), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( -0.79) },
UINT8_C(161),
{ SIMDE_FLOAT64_C( -0.24), SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.27),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( 0.03) },
{ SIMDE_FLOAT64_C( -0.22), SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.03) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_erfinv_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_erfc_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -315.30), SIMDE_FLOAT32_C( -413.87), SIMDE_FLOAT32_C( -345.31), SIMDE_FLOAT32_C( -228.93) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00) } },
{ { SIMDE_FLOAT32_C( 600.65), SIMDE_FLOAT32_C( -112.11), SIMDE_FLOAT32_C( -98.86), SIMDE_FLOAT32_C( 20.55) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -949.84), SIMDE_FLOAT32_C( -802.03), SIMDE_FLOAT32_C( 212.71), SIMDE_FLOAT32_C( -757.84) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00) } },
{ { SIMDE_FLOAT32_C( -651.52), SIMDE_FLOAT32_C( -363.93), SIMDE_FLOAT32_C( 876.28), SIMDE_FLOAT32_C( -203.61) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00) } },
{ { SIMDE_FLOAT32_C( 527.04), SIMDE_FLOAT32_C( 57.60), SIMDE_FLOAT32_C( -839.49), SIMDE_FLOAT32_C( 826.28) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 974.10), SIMDE_FLOAT32_C( 325.71), SIMDE_FLOAT32_C( -535.87), SIMDE_FLOAT32_C( 230.83) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 348.57), SIMDE_FLOAT32_C( 534.66), SIMDE_FLOAT32_C( 231.47), SIMDE_FLOAT32_C( 673.78) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 954.08), SIMDE_FLOAT32_C( 495.36), SIMDE_FLOAT32_C( 387.10), SIMDE_FLOAT32_C( -361.22) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_erfc_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_erfc_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -645.17), SIMDE_FLOAT64_C( 211.72) },
{ SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 715.58), SIMDE_FLOAT64_C( 471.86) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 209.41), SIMDE_FLOAT64_C( -887.34) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( -326.89), SIMDE_FLOAT64_C( 772.60) },
{ SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 574.21), SIMDE_FLOAT64_C( 504.70) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -447.93), SIMDE_FLOAT64_C( -208.36) },
{ SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( 404.62), SIMDE_FLOAT64_C( -998.91) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( -193.72), SIMDE_FLOAT64_C( 660.84) },
{ SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_erfc_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_erfc_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 496.19), SIMDE_FLOAT32_C( -675.69), SIMDE_FLOAT32_C( -153.22), SIMDE_FLOAT32_C( -88.71),
SIMDE_FLOAT32_C( 381.12), SIMDE_FLOAT32_C( -119.60), SIMDE_FLOAT32_C( 255.09), SIMDE_FLOAT32_C( -509.70) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00) } },
{ { SIMDE_FLOAT32_C( -168.05), SIMDE_FLOAT32_C( -24.56), SIMDE_FLOAT32_C( -778.51), SIMDE_FLOAT32_C( 349.90),
SIMDE_FLOAT32_C( 925.97), SIMDE_FLOAT32_C( 439.36), SIMDE_FLOAT32_C( -180.81), SIMDE_FLOAT32_C( 678.48) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -580.27), SIMDE_FLOAT32_C( -258.04), SIMDE_FLOAT32_C( -62.98), SIMDE_FLOAT32_C( -953.83),
SIMDE_FLOAT32_C( 354.49), SIMDE_FLOAT32_C( 914.71), SIMDE_FLOAT32_C( -173.05), SIMDE_FLOAT32_C( -256.98) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00) } },
{ { SIMDE_FLOAT32_C( 277.83), SIMDE_FLOAT32_C( 49.94), SIMDE_FLOAT32_C( -710.16), SIMDE_FLOAT32_C( 556.77),
SIMDE_FLOAT32_C( -300.30), SIMDE_FLOAT32_C( 375.96), SIMDE_FLOAT32_C( 468.75), SIMDE_FLOAT32_C( -804.12) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00) } },
{ { SIMDE_FLOAT32_C( 700.27), SIMDE_FLOAT32_C( -684.46), SIMDE_FLOAT32_C( 107.18), SIMDE_FLOAT32_C( 81.39),
SIMDE_FLOAT32_C( 195.94), SIMDE_FLOAT32_C( -637.73), SIMDE_FLOAT32_C( 571.69), SIMDE_FLOAT32_C( -972.11) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00) } },
{ { SIMDE_FLOAT32_C( 337.71), SIMDE_FLOAT32_C( 793.18), SIMDE_FLOAT32_C( 377.79), SIMDE_FLOAT32_C( 263.68),
SIMDE_FLOAT32_C( 232.54), SIMDE_FLOAT32_C( -803.02), SIMDE_FLOAT32_C( -57.84), SIMDE_FLOAT32_C( 652.27) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -61.06), SIMDE_FLOAT32_C( 879.18), SIMDE_FLOAT32_C( 698.44), SIMDE_FLOAT32_C( -706.57),
SIMDE_FLOAT32_C( 793.88), SIMDE_FLOAT32_C( -474.61), SIMDE_FLOAT32_C( 36.44), SIMDE_FLOAT32_C( 71.71) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 575.33), SIMDE_FLOAT32_C( 326.28), SIMDE_FLOAT32_C( -371.52), SIMDE_FLOAT32_C( -724.97),
SIMDE_FLOAT32_C( -297.76), SIMDE_FLOAT32_C( -902.77), SIMDE_FLOAT32_C( -529.09), SIMDE_FLOAT32_C( -597.49) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_erfc_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_erfc_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 461.51), SIMDE_FLOAT64_C( -571.50), SIMDE_FLOAT64_C( 241.15), SIMDE_FLOAT64_C( 521.48) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -695.16), SIMDE_FLOAT64_C( -842.41), SIMDE_FLOAT64_C( 799.26), SIMDE_FLOAT64_C( 685.42) },
{ SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 3.40), SIMDE_FLOAT64_C( -776.18), SIMDE_FLOAT64_C( -325.62), SIMDE_FLOAT64_C( 7.02) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 948.46), SIMDE_FLOAT64_C( 348.12), SIMDE_FLOAT64_C( 741.43), SIMDE_FLOAT64_C( -182.81) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( 319.42), SIMDE_FLOAT64_C( 46.64), SIMDE_FLOAT64_C( 792.19), SIMDE_FLOAT64_C( -94.82) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( -364.65), SIMDE_FLOAT64_C( -718.98), SIMDE_FLOAT64_C( 201.33), SIMDE_FLOAT64_C( 634.78) },
{ SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 348.43), SIMDE_FLOAT64_C( 374.84), SIMDE_FLOAT64_C( -48.84), SIMDE_FLOAT64_C( -910.34) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( -513.67), SIMDE_FLOAT64_C( -235.62), SIMDE_FLOAT64_C( -80.01), SIMDE_FLOAT64_C( 947.84) },
{ SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_erfc_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_erfc_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 430.03), SIMDE_FLOAT32_C( -494.11), SIMDE_FLOAT32_C( -522.83), SIMDE_FLOAT32_C( -160.68),
SIMDE_FLOAT32_C( -217.51), SIMDE_FLOAT32_C( 364.22), SIMDE_FLOAT32_C( -906.03), SIMDE_FLOAT32_C( 335.92),
SIMDE_FLOAT32_C( -779.46), SIMDE_FLOAT32_C( -248.95), SIMDE_FLOAT32_C( -22.71), SIMDE_FLOAT32_C( -802.66),
SIMDE_FLOAT32_C( -495.02), SIMDE_FLOAT32_C( -618.65), SIMDE_FLOAT32_C( -592.74), SIMDE_FLOAT32_C( 774.33) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -819.68), SIMDE_FLOAT32_C( -841.87), SIMDE_FLOAT32_C( 969.10), SIMDE_FLOAT32_C( -855.15),
SIMDE_FLOAT32_C( -473.12), SIMDE_FLOAT32_C( 203.71), SIMDE_FLOAT32_C( -640.23), SIMDE_FLOAT32_C( -593.80),
SIMDE_FLOAT32_C( -307.51), SIMDE_FLOAT32_C( 246.67), SIMDE_FLOAT32_C( -893.51), SIMDE_FLOAT32_C( 533.63),
SIMDE_FLOAT32_C( 217.68), SIMDE_FLOAT32_C( 100.04), SIMDE_FLOAT32_C( 228.82), SIMDE_FLOAT32_C( -352.29) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00) } },
{ { SIMDE_FLOAT32_C( 605.93), SIMDE_FLOAT32_C( 705.99), SIMDE_FLOAT32_C( 487.03), SIMDE_FLOAT32_C( -611.58),
SIMDE_FLOAT32_C( 70.21), SIMDE_FLOAT32_C( 581.00), SIMDE_FLOAT32_C( 724.34), SIMDE_FLOAT32_C( 290.75),
SIMDE_FLOAT32_C( -667.95), SIMDE_FLOAT32_C( -298.37), SIMDE_FLOAT32_C( 488.09), SIMDE_FLOAT32_C( -162.97),
SIMDE_FLOAT32_C( 82.98), SIMDE_FLOAT32_C( 895.36), SIMDE_FLOAT32_C( -388.63), SIMDE_FLOAT32_C( 263.30) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -946.51), SIMDE_FLOAT32_C( -419.53), SIMDE_FLOAT32_C( 408.15), SIMDE_FLOAT32_C( -419.64),
SIMDE_FLOAT32_C( 784.18), SIMDE_FLOAT32_C( 767.92), SIMDE_FLOAT32_C( -13.43), SIMDE_FLOAT32_C( -523.33),
SIMDE_FLOAT32_C( 14.59), SIMDE_FLOAT32_C( 93.06), SIMDE_FLOAT32_C( -989.70), SIMDE_FLOAT32_C( -767.74),
SIMDE_FLOAT32_C( -806.91), SIMDE_FLOAT32_C( 239.11), SIMDE_FLOAT32_C( -120.03), SIMDE_FLOAT32_C( 799.02) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -54.90), SIMDE_FLOAT32_C( -633.00), SIMDE_FLOAT32_C( -812.56), SIMDE_FLOAT32_C( -984.69),
SIMDE_FLOAT32_C( 948.00), SIMDE_FLOAT32_C( 911.78), SIMDE_FLOAT32_C( 306.06), SIMDE_FLOAT32_C( -719.95),
SIMDE_FLOAT32_C( -386.59), SIMDE_FLOAT32_C( -205.84), SIMDE_FLOAT32_C( 117.08), SIMDE_FLOAT32_C( 696.39),
SIMDE_FLOAT32_C( -310.49), SIMDE_FLOAT32_C( 728.45), SIMDE_FLOAT32_C( -40.32), SIMDE_FLOAT32_C( -257.00) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00) } },
{ { SIMDE_FLOAT32_C( -691.08), SIMDE_FLOAT32_C( -632.17), SIMDE_FLOAT32_C( 323.36), SIMDE_FLOAT32_C( -906.91),
SIMDE_FLOAT32_C( -864.25), SIMDE_FLOAT32_C( -690.07), SIMDE_FLOAT32_C( -430.23), SIMDE_FLOAT32_C( 150.34),
SIMDE_FLOAT32_C( 402.99), SIMDE_FLOAT32_C( -419.93), SIMDE_FLOAT32_C( 382.60), SIMDE_FLOAT32_C( 596.09),
SIMDE_FLOAT32_C( 819.18), SIMDE_FLOAT32_C( -737.43), SIMDE_FLOAT32_C( 395.11), SIMDE_FLOAT32_C( -235.72) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00) } },
{ { SIMDE_FLOAT32_C( -370.43), SIMDE_FLOAT32_C( 582.55), SIMDE_FLOAT32_C( -220.40), SIMDE_FLOAT32_C( -422.43),
SIMDE_FLOAT32_C( 494.33), SIMDE_FLOAT32_C( -914.34), SIMDE_FLOAT32_C( -142.39), SIMDE_FLOAT32_C( -892.26),
SIMDE_FLOAT32_C( -120.19), SIMDE_FLOAT32_C( 974.69), SIMDE_FLOAT32_C( 804.12), SIMDE_FLOAT32_C( 569.33),
SIMDE_FLOAT32_C( 703.14), SIMDE_FLOAT32_C( -236.19), SIMDE_FLOAT32_C( -687.67), SIMDE_FLOAT32_C( -987.95) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00) } },
{ { SIMDE_FLOAT32_C( 131.64), SIMDE_FLOAT32_C( 635.69), SIMDE_FLOAT32_C( -894.85), SIMDE_FLOAT32_C( 267.39),
SIMDE_FLOAT32_C( 945.62), SIMDE_FLOAT32_C( -325.08), SIMDE_FLOAT32_C( -582.27), SIMDE_FLOAT32_C( 348.62),
SIMDE_FLOAT32_C( 254.98), SIMDE_FLOAT32_C( 800.33), SIMDE_FLOAT32_C( -55.30), SIMDE_FLOAT32_C( 74.16),
SIMDE_FLOAT32_C( -937.10), SIMDE_FLOAT32_C( -660.19), SIMDE_FLOAT32_C( 838.44), SIMDE_FLOAT32_C( -307.53) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_erfc_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_erfc_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 956.61), SIMDE_FLOAT32_C( 234.13), SIMDE_FLOAT32_C( 892.38), SIMDE_FLOAT32_C( 414.62),
SIMDE_FLOAT32_C( -352.76), SIMDE_FLOAT32_C( 66.22), SIMDE_FLOAT32_C( -611.87), SIMDE_FLOAT32_C( 409.12),
SIMDE_FLOAT32_C( -59.49), SIMDE_FLOAT32_C( 561.33), SIMDE_FLOAT32_C( -922.08), SIMDE_FLOAT32_C( 538.83),
SIMDE_FLOAT32_C( -425.54), SIMDE_FLOAT32_C( -342.56), SIMDE_FLOAT32_C( -597.87), SIMDE_FLOAT32_C( 992.17) },
UINT8_C(125),
{ SIMDE_FLOAT32_C( 513.40), SIMDE_FLOAT32_C( -248.97), SIMDE_FLOAT32_C( -181.44), SIMDE_FLOAT32_C( 317.13),
SIMDE_FLOAT32_C( 267.53), SIMDE_FLOAT32_C( 935.63), SIMDE_FLOAT32_C( 584.65), SIMDE_FLOAT32_C( 221.64),
SIMDE_FLOAT32_C( -188.28), SIMDE_FLOAT32_C( 142.72), SIMDE_FLOAT32_C( 400.07), SIMDE_FLOAT32_C( 778.58),
SIMDE_FLOAT32_C( 216.90), SIMDE_FLOAT32_C( 410.27), SIMDE_FLOAT32_C( 735.18), SIMDE_FLOAT32_C( -548.98) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 234.13), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 409.12),
SIMDE_FLOAT32_C( -59.49), SIMDE_FLOAT32_C( 561.33), SIMDE_FLOAT32_C( -922.08), SIMDE_FLOAT32_C( 538.83),
SIMDE_FLOAT32_C( -425.54), SIMDE_FLOAT32_C( -342.56), SIMDE_FLOAT32_C( -597.87), SIMDE_FLOAT32_C( 992.17) } },
{ { SIMDE_FLOAT32_C( 302.65), SIMDE_FLOAT32_C( 149.80), SIMDE_FLOAT32_C( 98.26), SIMDE_FLOAT32_C( -631.12),
SIMDE_FLOAT32_C( 537.93), SIMDE_FLOAT32_C( -492.62), SIMDE_FLOAT32_C( 309.39), SIMDE_FLOAT32_C( 99.26),
SIMDE_FLOAT32_C( -414.70), SIMDE_FLOAT32_C( -151.78), SIMDE_FLOAT32_C( 673.72), SIMDE_FLOAT32_C( 242.74),
SIMDE_FLOAT32_C( 250.35), SIMDE_FLOAT32_C( 665.88), SIMDE_FLOAT32_C( 646.74), SIMDE_FLOAT32_C( -236.25) },
UINT8_C(226),
{ SIMDE_FLOAT32_C( -534.70), SIMDE_FLOAT32_C( -919.12), SIMDE_FLOAT32_C( 684.44), SIMDE_FLOAT32_C( -599.07),
SIMDE_FLOAT32_C( 665.53), SIMDE_FLOAT32_C( -93.93), SIMDE_FLOAT32_C( 212.65), SIMDE_FLOAT32_C( -191.74),
SIMDE_FLOAT32_C( -693.86), SIMDE_FLOAT32_C( -8.77), SIMDE_FLOAT32_C( -974.85), SIMDE_FLOAT32_C( 716.41),
SIMDE_FLOAT32_C( -273.59), SIMDE_FLOAT32_C( -523.82), SIMDE_FLOAT32_C( 19.06), SIMDE_FLOAT32_C( 876.21) },
{ SIMDE_FLOAT32_C( 302.65), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 98.26), SIMDE_FLOAT32_C( -631.12),
SIMDE_FLOAT32_C( 537.93), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( -414.70), SIMDE_FLOAT32_C( -151.78), SIMDE_FLOAT32_C( 673.72), SIMDE_FLOAT32_C( 242.74),
SIMDE_FLOAT32_C( 250.35), SIMDE_FLOAT32_C( 665.88), SIMDE_FLOAT32_C( 646.74), SIMDE_FLOAT32_C( -236.25) } },
{ { SIMDE_FLOAT32_C( 574.44), SIMDE_FLOAT32_C( 387.93), SIMDE_FLOAT32_C( 414.13), SIMDE_FLOAT32_C( -918.18),
SIMDE_FLOAT32_C( -302.68), SIMDE_FLOAT32_C( -486.61), SIMDE_FLOAT32_C( -332.89), SIMDE_FLOAT32_C( 545.53),
SIMDE_FLOAT32_C( -812.89), SIMDE_FLOAT32_C( 909.85), SIMDE_FLOAT32_C( -204.12), SIMDE_FLOAT32_C( 852.99),
SIMDE_FLOAT32_C( 556.59), SIMDE_FLOAT32_C( 559.63), SIMDE_FLOAT32_C( -730.10), SIMDE_FLOAT32_C( -978.11) },
UINT8_C( 0),
{ SIMDE_FLOAT32_C( 954.34), SIMDE_FLOAT32_C( -577.18), SIMDE_FLOAT32_C( 306.05), SIMDE_FLOAT32_C( -139.59),
SIMDE_FLOAT32_C( 635.48), SIMDE_FLOAT32_C( -885.69), SIMDE_FLOAT32_C( 166.55), SIMDE_FLOAT32_C( -373.29),
SIMDE_FLOAT32_C( -860.54), SIMDE_FLOAT32_C( -117.04), SIMDE_FLOAT32_C( 353.12), SIMDE_FLOAT32_C( -384.37),
SIMDE_FLOAT32_C( 902.02), SIMDE_FLOAT32_C( 229.33), SIMDE_FLOAT32_C( -809.93), SIMDE_FLOAT32_C( 289.95) },
{ SIMDE_FLOAT32_C( 574.44), SIMDE_FLOAT32_C( 387.93), SIMDE_FLOAT32_C( 414.13), SIMDE_FLOAT32_C( -918.18),
SIMDE_FLOAT32_C( -302.68), SIMDE_FLOAT32_C( -486.61), SIMDE_FLOAT32_C( -332.89), SIMDE_FLOAT32_C( 545.53),
SIMDE_FLOAT32_C( -812.89), SIMDE_FLOAT32_C( 909.85), SIMDE_FLOAT32_C( -204.12), SIMDE_FLOAT32_C( 852.99),
SIMDE_FLOAT32_C( 556.59), SIMDE_FLOAT32_C( 559.63), SIMDE_FLOAT32_C( -730.10), SIMDE_FLOAT32_C( -978.11) } },
{ { SIMDE_FLOAT32_C( -356.54), SIMDE_FLOAT32_C( -728.11), SIMDE_FLOAT32_C( 987.27), SIMDE_FLOAT32_C( 156.85),
SIMDE_FLOAT32_C( -61.00), SIMDE_FLOAT32_C( 532.80), SIMDE_FLOAT32_C( 343.96), SIMDE_FLOAT32_C( -151.15),
SIMDE_FLOAT32_C( -671.32), SIMDE_FLOAT32_C( 196.95), SIMDE_FLOAT32_C( -594.56), SIMDE_FLOAT32_C( 888.32),
SIMDE_FLOAT32_C( 466.85), SIMDE_FLOAT32_C( -572.66), SIMDE_FLOAT32_C( 528.83), SIMDE_FLOAT32_C( 421.19) },
UINT8_C(129),
{ SIMDE_FLOAT32_C( -165.12), SIMDE_FLOAT32_C( -718.39), SIMDE_FLOAT32_C( -514.36), SIMDE_FLOAT32_C( -50.81),
SIMDE_FLOAT32_C( 448.16), SIMDE_FLOAT32_C( 112.35), SIMDE_FLOAT32_C( 88.64), SIMDE_FLOAT32_C( -668.88),
SIMDE_FLOAT32_C( -534.54), SIMDE_FLOAT32_C( 704.28), SIMDE_FLOAT32_C( -766.86), SIMDE_FLOAT32_C( 694.79),
SIMDE_FLOAT32_C( 894.35), SIMDE_FLOAT32_C( 523.08), SIMDE_FLOAT32_C( -661.75), SIMDE_FLOAT32_C( -833.77) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( -728.11), SIMDE_FLOAT32_C( 987.27), SIMDE_FLOAT32_C( 156.85),
SIMDE_FLOAT32_C( -61.00), SIMDE_FLOAT32_C( 532.80), SIMDE_FLOAT32_C( 343.96), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( -671.32), SIMDE_FLOAT32_C( 196.95), SIMDE_FLOAT32_C( -594.56), SIMDE_FLOAT32_C( 888.32),
SIMDE_FLOAT32_C( 466.85), SIMDE_FLOAT32_C( -572.66), SIMDE_FLOAT32_C( 528.83), SIMDE_FLOAT32_C( 421.19) } },
{ { SIMDE_FLOAT32_C( 510.35), SIMDE_FLOAT32_C( 495.10), SIMDE_FLOAT32_C( 105.23), SIMDE_FLOAT32_C( 43.15),
SIMDE_FLOAT32_C( -160.94), SIMDE_FLOAT32_C( 954.08), SIMDE_FLOAT32_C( 371.83), SIMDE_FLOAT32_C( -963.98),
SIMDE_FLOAT32_C( -640.48), SIMDE_FLOAT32_C( 260.15), SIMDE_FLOAT32_C( 502.87), SIMDE_FLOAT32_C( -213.14),
SIMDE_FLOAT32_C( -211.02), SIMDE_FLOAT32_C( -75.94), SIMDE_FLOAT32_C( 637.02), SIMDE_FLOAT32_C( 623.86) },
UINT8_C( 36),
{ SIMDE_FLOAT32_C( -877.34), SIMDE_FLOAT32_C( -426.95), SIMDE_FLOAT32_C( -346.17), SIMDE_FLOAT32_C( 235.01),
SIMDE_FLOAT32_C( 661.70), SIMDE_FLOAT32_C( -15.05), SIMDE_FLOAT32_C( 700.47), SIMDE_FLOAT32_C( 365.98),
SIMDE_FLOAT32_C( 218.09), SIMDE_FLOAT32_C( 395.26), SIMDE_FLOAT32_C( 260.32), SIMDE_FLOAT32_C( -258.83),
SIMDE_FLOAT32_C( 733.51), SIMDE_FLOAT32_C( 426.55), SIMDE_FLOAT32_C( -748.48), SIMDE_FLOAT32_C( 228.61) },
{ SIMDE_FLOAT32_C( 510.35), SIMDE_FLOAT32_C( 495.10), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 43.15),
SIMDE_FLOAT32_C( -160.94), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 371.83), SIMDE_FLOAT32_C( -963.98),
SIMDE_FLOAT32_C( -640.48), SIMDE_FLOAT32_C( 260.15), SIMDE_FLOAT32_C( 502.87), SIMDE_FLOAT32_C( -213.14),
SIMDE_FLOAT32_C( -211.02), SIMDE_FLOAT32_C( -75.94), SIMDE_FLOAT32_C( 637.02), SIMDE_FLOAT32_C( 623.86) } },
{ { SIMDE_FLOAT32_C( -468.22), SIMDE_FLOAT32_C( 294.67), SIMDE_FLOAT32_C( -932.33), SIMDE_FLOAT32_C( -514.14),
SIMDE_FLOAT32_C( -333.50), SIMDE_FLOAT32_C( -896.31), SIMDE_FLOAT32_C( -154.62), SIMDE_FLOAT32_C( 926.65),
SIMDE_FLOAT32_C( 606.56), SIMDE_FLOAT32_C( 632.24), SIMDE_FLOAT32_C( -284.37), SIMDE_FLOAT32_C( -469.38),
SIMDE_FLOAT32_C( 269.27), SIMDE_FLOAT32_C( -660.50), SIMDE_FLOAT32_C( 736.29), SIMDE_FLOAT32_C( 391.93) },
UINT8_C(251),
{ SIMDE_FLOAT32_C( -609.88), SIMDE_FLOAT32_C( -373.06), SIMDE_FLOAT32_C( -425.75), SIMDE_FLOAT32_C( 375.07),
SIMDE_FLOAT32_C( -672.58), SIMDE_FLOAT32_C( 940.22), SIMDE_FLOAT32_C( -406.85), SIMDE_FLOAT32_C( 722.68),
SIMDE_FLOAT32_C( 200.54), SIMDE_FLOAT32_C( 334.32), SIMDE_FLOAT32_C( 456.19), SIMDE_FLOAT32_C( -372.90),
SIMDE_FLOAT32_C( 585.84), SIMDE_FLOAT32_C( -315.20), SIMDE_FLOAT32_C( 158.88), SIMDE_FLOAT32_C( -119.49) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( -932.33), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 606.56), SIMDE_FLOAT32_C( 632.24), SIMDE_FLOAT32_C( -284.37), SIMDE_FLOAT32_C( -469.38),
SIMDE_FLOAT32_C( 269.27), SIMDE_FLOAT32_C( -660.50), SIMDE_FLOAT32_C( 736.29), SIMDE_FLOAT32_C( 391.93) } },
{ { SIMDE_FLOAT32_C( -247.53), SIMDE_FLOAT32_C( 644.74), SIMDE_FLOAT32_C( 547.01), SIMDE_FLOAT32_C( -143.84),
SIMDE_FLOAT32_C( -509.87), SIMDE_FLOAT32_C( 473.66), SIMDE_FLOAT32_C( -537.28), SIMDE_FLOAT32_C( -877.63),
SIMDE_FLOAT32_C( -810.70), SIMDE_FLOAT32_C( -6.66), SIMDE_FLOAT32_C( 391.64), SIMDE_FLOAT32_C( -471.21),
SIMDE_FLOAT32_C( -270.37), SIMDE_FLOAT32_C( -216.43), SIMDE_FLOAT32_C( 441.34), SIMDE_FLOAT32_C( 119.74) },
UINT8_C(113),
{ SIMDE_FLOAT32_C( -984.41), SIMDE_FLOAT32_C( -505.19), SIMDE_FLOAT32_C( 737.93), SIMDE_FLOAT32_C( 955.81),
SIMDE_FLOAT32_C( 87.96), SIMDE_FLOAT32_C( 460.61), SIMDE_FLOAT32_C( 156.35), SIMDE_FLOAT32_C( -577.72),
SIMDE_FLOAT32_C( -83.20), SIMDE_FLOAT32_C( 783.45), SIMDE_FLOAT32_C( -991.87), SIMDE_FLOAT32_C( 601.60),
SIMDE_FLOAT32_C( -57.67), SIMDE_FLOAT32_C( -111.36), SIMDE_FLOAT32_C( -645.93), SIMDE_FLOAT32_C( -412.93) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 644.74), SIMDE_FLOAT32_C( 547.01), SIMDE_FLOAT32_C( -143.84),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -877.63),
SIMDE_FLOAT32_C( -810.70), SIMDE_FLOAT32_C( -6.66), SIMDE_FLOAT32_C( 391.64), SIMDE_FLOAT32_C( -471.21),
SIMDE_FLOAT32_C( -270.37), SIMDE_FLOAT32_C( -216.43), SIMDE_FLOAT32_C( 441.34), SIMDE_FLOAT32_C( 119.74) } },
{ { SIMDE_FLOAT32_C( -564.35), SIMDE_FLOAT32_C( 210.23), SIMDE_FLOAT32_C( 77.20), SIMDE_FLOAT32_C( 909.32),
SIMDE_FLOAT32_C( 672.96), SIMDE_FLOAT32_C( 199.57), SIMDE_FLOAT32_C( -901.39), SIMDE_FLOAT32_C( -333.70),
SIMDE_FLOAT32_C( -408.79), SIMDE_FLOAT32_C( -372.60), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 374.78),
SIMDE_FLOAT32_C( -931.26), SIMDE_FLOAT32_C( -484.33), SIMDE_FLOAT32_C( -214.70), SIMDE_FLOAT32_C( -915.67) },
UINT8_C(139),
{ SIMDE_FLOAT32_C( -476.78), SIMDE_FLOAT32_C( -959.86), SIMDE_FLOAT32_C( -901.56), SIMDE_FLOAT32_C( 983.83),
SIMDE_FLOAT32_C( 196.49), SIMDE_FLOAT32_C( -479.28), SIMDE_FLOAT32_C( -99.37), SIMDE_FLOAT32_C( -20.06),
SIMDE_FLOAT32_C( -471.16), SIMDE_FLOAT32_C( -497.78), SIMDE_FLOAT32_C( 922.27), SIMDE_FLOAT32_C( 417.48),
SIMDE_FLOAT32_C( -143.71), SIMDE_FLOAT32_C( -490.66), SIMDE_FLOAT32_C( 853.13), SIMDE_FLOAT32_C( -933.47) },
{ SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 77.20), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 672.96), SIMDE_FLOAT32_C( 199.57), SIMDE_FLOAT32_C( -901.39), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( -408.79), SIMDE_FLOAT32_C( -372.60), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 374.78),
SIMDE_FLOAT32_C( -931.26), SIMDE_FLOAT32_C( -484.33), SIMDE_FLOAT32_C( -214.70), SIMDE_FLOAT32_C( -915.67) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_erfc_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_erfc_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 926.55), SIMDE_FLOAT64_C( 763.10), SIMDE_FLOAT64_C( 6.18), SIMDE_FLOAT64_C( 453.38),
SIMDE_FLOAT64_C( 184.79), SIMDE_FLOAT64_C( 608.12), SIMDE_FLOAT64_C( 303.22), SIMDE_FLOAT64_C( 429.75) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 610.63), SIMDE_FLOAT64_C( -505.99), SIMDE_FLOAT64_C( -566.70), SIMDE_FLOAT64_C( -890.86),
SIMDE_FLOAT64_C( -469.61), SIMDE_FLOAT64_C( -65.43), SIMDE_FLOAT64_C( -190.70), SIMDE_FLOAT64_C( 797.08) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 883.79), SIMDE_FLOAT64_C( -999.64), SIMDE_FLOAT64_C( 928.39), SIMDE_FLOAT64_C( -465.63),
SIMDE_FLOAT64_C( -214.31), SIMDE_FLOAT64_C( 650.21), SIMDE_FLOAT64_C( 880.22), SIMDE_FLOAT64_C( -127.39) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( 687.46), SIMDE_FLOAT64_C( -738.40), SIMDE_FLOAT64_C( -655.58), SIMDE_FLOAT64_C( -737.41),
SIMDE_FLOAT64_C( -335.05), SIMDE_FLOAT64_C( -354.48), SIMDE_FLOAT64_C( -302.30), SIMDE_FLOAT64_C( -408.50) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( -591.38), SIMDE_FLOAT64_C( 703.88), SIMDE_FLOAT64_C( -955.11), SIMDE_FLOAT64_C( 593.41),
SIMDE_FLOAT64_C( 311.99), SIMDE_FLOAT64_C( 348.11), SIMDE_FLOAT64_C( 23.16), SIMDE_FLOAT64_C( -77.38) },
{ SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( 842.12), SIMDE_FLOAT64_C( 456.45), SIMDE_FLOAT64_C( 31.76), SIMDE_FLOAT64_C( -627.49),
SIMDE_FLOAT64_C( -608.98), SIMDE_FLOAT64_C( 841.06), SIMDE_FLOAT64_C( -830.41), SIMDE_FLOAT64_C( -725.19) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( 841.43), SIMDE_FLOAT64_C( -902.02), SIMDE_FLOAT64_C( -190.81), SIMDE_FLOAT64_C( -372.89),
SIMDE_FLOAT64_C( 748.18), SIMDE_FLOAT64_C( -310.59), SIMDE_FLOAT64_C( 499.72), SIMDE_FLOAT64_C( 435.64) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -48.99), SIMDE_FLOAT64_C( 844.14), SIMDE_FLOAT64_C( 698.23), SIMDE_FLOAT64_C( 615.96),
SIMDE_FLOAT64_C( -510.34), SIMDE_FLOAT64_C( -604.07), SIMDE_FLOAT64_C( -792.54), SIMDE_FLOAT64_C( -101.72) },
{ SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_erfc_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_erfc_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -683.28), SIMDE_FLOAT64_C( 804.20), SIMDE_FLOAT64_C( -404.66), SIMDE_FLOAT64_C( -472.79),
SIMDE_FLOAT64_C( -863.69), SIMDE_FLOAT64_C( -237.69), SIMDE_FLOAT64_C( -919.11), SIMDE_FLOAT64_C( 998.91) },
UINT8_C( 80),
{ SIMDE_FLOAT64_C( 291.91), SIMDE_FLOAT64_C( -572.21), SIMDE_FLOAT64_C( 220.68), SIMDE_FLOAT64_C( -193.99),
SIMDE_FLOAT64_C( -17.57), SIMDE_FLOAT64_C( 493.29), SIMDE_FLOAT64_C( 557.85), SIMDE_FLOAT64_C( 412.26) },
{ SIMDE_FLOAT64_C( -683.28), SIMDE_FLOAT64_C( 804.20), SIMDE_FLOAT64_C( -404.66), SIMDE_FLOAT64_C( -472.79),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( -237.69), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 998.91) } },
{ { SIMDE_FLOAT64_C( 986.63), SIMDE_FLOAT64_C( -515.33), SIMDE_FLOAT64_C( -32.91), SIMDE_FLOAT64_C( -333.09),
SIMDE_FLOAT64_C( -321.96), SIMDE_FLOAT64_C( 468.63), SIMDE_FLOAT64_C( 439.22), SIMDE_FLOAT64_C( -104.11) },
UINT8_C( 73),
{ SIMDE_FLOAT64_C( 199.74), SIMDE_FLOAT64_C( 522.47), SIMDE_FLOAT64_C( 516.01), SIMDE_FLOAT64_C( -942.26),
SIMDE_FLOAT64_C( -623.61), SIMDE_FLOAT64_C( 832.73), SIMDE_FLOAT64_C( 861.94), SIMDE_FLOAT64_C( -28.27) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -515.33), SIMDE_FLOAT64_C( -32.91), SIMDE_FLOAT64_C( 2.00),
SIMDE_FLOAT64_C( -321.96), SIMDE_FLOAT64_C( 468.63), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -104.11) } },
{ { SIMDE_FLOAT64_C( -640.06), SIMDE_FLOAT64_C( 998.25), SIMDE_FLOAT64_C( 734.04), SIMDE_FLOAT64_C( -559.17),
SIMDE_FLOAT64_C( 997.17), SIMDE_FLOAT64_C( -856.00), SIMDE_FLOAT64_C( 732.74), SIMDE_FLOAT64_C( -575.04) },
UINT8_C(158),
{ SIMDE_FLOAT64_C( -461.24), SIMDE_FLOAT64_C( 407.39), SIMDE_FLOAT64_C( -142.02), SIMDE_FLOAT64_C( -903.39),
SIMDE_FLOAT64_C( -180.35), SIMDE_FLOAT64_C( -155.40), SIMDE_FLOAT64_C( -418.72), SIMDE_FLOAT64_C( 786.74) },
{ SIMDE_FLOAT64_C( -640.06), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( -856.00), SIMDE_FLOAT64_C( 732.74), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 511.51), SIMDE_FLOAT64_C( 259.32), SIMDE_FLOAT64_C( 255.37), SIMDE_FLOAT64_C( -49.27),
SIMDE_FLOAT64_C( -844.79), SIMDE_FLOAT64_C( 939.27), SIMDE_FLOAT64_C( -849.53), SIMDE_FLOAT64_C( 677.68) },
UINT8_C(184),
{ SIMDE_FLOAT64_C( -791.79), SIMDE_FLOAT64_C( -945.93), SIMDE_FLOAT64_C( 288.01), SIMDE_FLOAT64_C( -929.85),
SIMDE_FLOAT64_C( 25.80), SIMDE_FLOAT64_C( 647.95), SIMDE_FLOAT64_C( -931.60), SIMDE_FLOAT64_C( -240.16) },
{ SIMDE_FLOAT64_C( 511.51), SIMDE_FLOAT64_C( 259.32), SIMDE_FLOAT64_C( 255.37), SIMDE_FLOAT64_C( 2.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -849.53), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( -911.22), SIMDE_FLOAT64_C( -934.43), SIMDE_FLOAT64_C( -96.16), SIMDE_FLOAT64_C( 821.52),
SIMDE_FLOAT64_C( -509.47), SIMDE_FLOAT64_C( -731.47), SIMDE_FLOAT64_C( -639.72), SIMDE_FLOAT64_C( 897.92) },
UINT8_C(176),
{ SIMDE_FLOAT64_C( -543.12), SIMDE_FLOAT64_C( -282.43), SIMDE_FLOAT64_C( 971.11), SIMDE_FLOAT64_C( 38.16),
SIMDE_FLOAT64_C( -495.70), SIMDE_FLOAT64_C( 482.61), SIMDE_FLOAT64_C( -702.52), SIMDE_FLOAT64_C( 759.67) },
{ SIMDE_FLOAT64_C( -911.22), SIMDE_FLOAT64_C( -934.43), SIMDE_FLOAT64_C( -96.16), SIMDE_FLOAT64_C( 821.52),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -639.72), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -566.66), SIMDE_FLOAT64_C( -547.31), SIMDE_FLOAT64_C( 698.94), SIMDE_FLOAT64_C( -416.19),
SIMDE_FLOAT64_C( -869.63), SIMDE_FLOAT64_C( 154.22), SIMDE_FLOAT64_C( -207.98), SIMDE_FLOAT64_C( -815.57) },
UINT8_C(142),
{ SIMDE_FLOAT64_C( -137.83), SIMDE_FLOAT64_C( 210.23), SIMDE_FLOAT64_C( -909.82), SIMDE_FLOAT64_C( -69.43),
SIMDE_FLOAT64_C( 970.07), SIMDE_FLOAT64_C( -821.05), SIMDE_FLOAT64_C( -3.87), SIMDE_FLOAT64_C( -126.08) },
{ SIMDE_FLOAT64_C( -566.66), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00),
SIMDE_FLOAT64_C( -869.63), SIMDE_FLOAT64_C( 154.22), SIMDE_FLOAT64_C( -207.98), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( -999.53), SIMDE_FLOAT64_C( 486.66), SIMDE_FLOAT64_C( 142.44), SIMDE_FLOAT64_C( -639.25),
SIMDE_FLOAT64_C( 384.58), SIMDE_FLOAT64_C( -731.05), SIMDE_FLOAT64_C( -182.37), SIMDE_FLOAT64_C( -897.86) },
UINT8_C(227),
{ SIMDE_FLOAT64_C( 855.79), SIMDE_FLOAT64_C( -393.55), SIMDE_FLOAT64_C( 722.67), SIMDE_FLOAT64_C( -846.73),
SIMDE_FLOAT64_C( -633.88), SIMDE_FLOAT64_C( -843.99), SIMDE_FLOAT64_C( -394.03), SIMDE_FLOAT64_C( -934.94) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 142.44), SIMDE_FLOAT64_C( -639.25),
SIMDE_FLOAT64_C( 384.58), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( -260.18), SIMDE_FLOAT64_C( -263.67), SIMDE_FLOAT64_C( 219.28), SIMDE_FLOAT64_C( 531.84),
SIMDE_FLOAT64_C( -79.23), SIMDE_FLOAT64_C( 661.51), SIMDE_FLOAT64_C( -605.99), SIMDE_FLOAT64_C( -869.00) },
UINT8_C( 64),
{ SIMDE_FLOAT64_C( 324.57), SIMDE_FLOAT64_C( -898.93), SIMDE_FLOAT64_C( 930.64), SIMDE_FLOAT64_C( -679.29),
SIMDE_FLOAT64_C( -25.01), SIMDE_FLOAT64_C( 931.11), SIMDE_FLOAT64_C( 807.37), SIMDE_FLOAT64_C( -882.57) },
{ SIMDE_FLOAT64_C( -260.18), SIMDE_FLOAT64_C( -263.67), SIMDE_FLOAT64_C( 219.28), SIMDE_FLOAT64_C( 531.84),
SIMDE_FLOAT64_C( -79.23), SIMDE_FLOAT64_C( 661.51), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -869.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_erfc_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_erfcinv_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( 1.16) },
{ SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( -0.14) } },
{ { SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 1.53) },
{ SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( -0.51) } },
{ { SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 1.68) },
{ SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.70) } },
{ { SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( 1.66), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 1.42) },
{ SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( 1.10), SIMDE_FLOAT32_C( -0.39) } },
{ { SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 1.20), SIMDE_FLOAT32_C( 1.51) },
{ SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( -0.49) } },
{ { SIMDE_FLOAT32_C( 1.44), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.49) },
{ SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 0.49) } },
{ { SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 1.35) },
{ SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( -1.82), SIMDE_FLOAT32_C( -0.32) } },
{ { SIMDE_FLOAT32_C( 1.94), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 1.62) },
{ SIMDE_FLOAT32_C( -1.33), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( -0.62) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_erfcinv_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_erfcinv_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.73), SIMDE_FLOAT64_C( 0.13) },
{ SIMDE_FLOAT64_C( 0.24), SIMDE_FLOAT64_C( 1.07) } },
{ { SIMDE_FLOAT64_C( 1.09), SIMDE_FLOAT64_C( 0.70) },
{ SIMDE_FLOAT64_C( -0.08), SIMDE_FLOAT64_C( 0.27) } },
{ { SIMDE_FLOAT64_C( 1.13), SIMDE_FLOAT64_C( 0.97) },
{ SIMDE_FLOAT64_C( -0.12), SIMDE_FLOAT64_C( 0.03) } },
{ { SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( 1.72) },
{ SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( -0.76) } },
{ { SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 0.82) },
{ SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( 0.16) } },
{ { SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 1.88) },
{ SIMDE_FLOAT64_C( 1.82), SIMDE_FLOAT64_C( -1.10) } },
{ { SIMDE_FLOAT64_C( 1.11), SIMDE_FLOAT64_C( 0.87) },
{ SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( 0.12) } },
{ { SIMDE_FLOAT64_C( 1.13), SIMDE_FLOAT64_C( 0.05) },
{ SIMDE_FLOAT64_C( -0.12), SIMDE_FLOAT64_C( 1.39) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_erfcinv_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_erfcinv_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 1.58), SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 0.95),
SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 1.58), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.73) },
{ SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.24) } },
{ { SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 1.80), SIMDE_FLOAT32_C( 1.88),
SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 1.44), SIMDE_FLOAT32_C( 1.26) },
{ SIMDE_FLOAT32_C( -1.28), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( -1.10),
SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( -0.23) } },
{ { SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 1.32),
SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 1.97), SIMDE_FLOAT32_C( 1.77) },
{ SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( -0.29),
SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( -1.53), SIMDE_FLOAT32_C( -0.85) } },
{ { SIMDE_FLOAT32_C( 1.71), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( 1.99),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 1.58), SIMDE_FLOAT32_C( 0.11) },
{ SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( -1.82),
SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( 1.13) } },
{ { SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 1.30), SIMDE_FLOAT32_C( 1.06), SIMDE_FLOAT32_C( 1.66),
SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 1.60), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.81) },
{ SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( -0.67),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.17) } },
{ { SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 1.05),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 1.23) },
{ SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.04),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 1.07), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( -0.21) } },
{ { SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 1.07), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.02),
SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 1.80), SIMDE_FLOAT32_C( 1.19) },
{ SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 1.64),
SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( -0.17) } },
{ { SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 1.92),
SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 1.75) },
{ SIMDE_FLOAT32_C( -1.09), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( -1.24),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( -0.81) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_erfcinv_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_erfcinv_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 1.66), SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( 1.56) },
{ SIMDE_FLOAT64_C( -0.67), SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( 1.07), SIMDE_FLOAT64_C( -0.55) } },
{ { SIMDE_FLOAT64_C( 1.89), SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( 0.46) },
{ SIMDE_FLOAT64_C( -1.13), SIMDE_FLOAT64_C( 0.80), SIMDE_FLOAT64_C( 0.36), SIMDE_FLOAT64_C( 0.52) } },
{ { SIMDE_FLOAT64_C( 1.50), SIMDE_FLOAT64_C( 1.78), SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.70) },
{ SIMDE_FLOAT64_C( -0.48), SIMDE_FLOAT64_C( -0.87), SIMDE_FLOAT64_C( 0.02), SIMDE_FLOAT64_C( 0.27) } },
{ { SIMDE_FLOAT64_C( 1.88), SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 1.75) },
{ SIMDE_FLOAT64_C( -1.10), SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( -0.81) } },
{ { SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( 1.15), SIMDE_FLOAT64_C( 0.52) },
{ SIMDE_FLOAT64_C( 0.85), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( -0.13), SIMDE_FLOAT64_C( 0.45) } },
{ { SIMDE_FLOAT64_C( 0.16), SIMDE_FLOAT64_C( 1.48), SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( 1.38) },
{ SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( 1.10), SIMDE_FLOAT64_C( -0.35) } },
{ { SIMDE_FLOAT64_C( 1.88), SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 1.09), SIMDE_FLOAT64_C( 0.47) },
{ SIMDE_FLOAT64_C( -1.09), SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( -0.08), SIMDE_FLOAT64_C( 0.51) } },
{ { SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 1.43), SIMDE_FLOAT64_C( 1.79), SIMDE_FLOAT64_C( 0.34) },
{ SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( -0.40), SIMDE_FLOAT64_C( -0.89), SIMDE_FLOAT64_C( 0.67) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_erfcinv_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_erfcinv_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 1.41), SIMDE_FLOAT32_C( 1.20),
SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( 1.84), SIMDE_FLOAT32_C( 1.37),
SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 1.44),
SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 1.79) },
{ SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.18),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.34),
SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.41),
SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.89) } },
{ { SIMDE_FLOAT32_C( 1.30), SIMDE_FLOAT32_C( 1.40), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 1.60),
SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 1.72), SIMDE_FLOAT32_C( 1.10),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 1.37), SIMDE_FLOAT32_C( 1.67), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.13) },
{ SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -0.60),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( -0.09),
SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 1.07) } },
{ { SIMDE_FLOAT32_C( 1.59), SIMDE_FLOAT32_C( 1.51), SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( 1.97),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 1.41),
SIMDE_FLOAT32_C( 1.51), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 1.79),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 1.58), SIMDE_FLOAT32_C( 0.30) },
{ SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -1.53),
SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( -0.38),
SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( -0.89),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( 0.73) } },
{ { SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( 1.90), SIMDE_FLOAT32_C( 1.78),
SIMDE_FLOAT32_C( 1.46), SIMDE_FLOAT32_C( 1.62), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 1.54),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.49),
SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.95) },
{ SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( -1.16), SIMDE_FLOAT32_C( -0.87),
SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.52),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.49),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.04) } },
{ { SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.90),
SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.63),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 1.48),
SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 1.78), SIMDE_FLOAT32_C( 0.64) },
{ SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( -1.39), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.09),
SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 1.82), SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( 0.33) } },
{ { SIMDE_FLOAT32_C( 1.12), SIMDE_FLOAT32_C( 1.68), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.58),
SIMDE_FLOAT32_C( 1.30), SIMDE_FLOAT32_C( 1.29), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( 1.84), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 1.20),
SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( 1.41), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 1.21) },
{ SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 1.13), SIMDE_FLOAT32_C( 0.75),
SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( -0.18),
SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( 1.02), SIMDE_FLOAT32_C( -0.19) } },
{ { SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 1.07), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.48),
SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 1.10), SIMDE_FLOAT32_C( 1.81),
SIMDE_FLOAT32_C( 1.59), SIMDE_FLOAT32_C( 1.52), SIMDE_FLOAT32_C( 1.29), SIMDE_FLOAT32_C( 1.79),
SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 1.07), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.65) },
{ SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( 1.10), SIMDE_FLOAT32_C( 0.50),
SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( -0.93),
SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( -0.89),
SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.32) } },
{ { SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 1.99),
SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( 1.13), SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 1.06),
SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 1.90) },
{ SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( 1.39),
SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -1.82),
SIMDE_FLOAT32_C( -1.24), SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( -0.05),
SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( -1.16) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_erfcinv_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_erfcinv_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 1.62), SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 1.18),
SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( 1.94), SIMDE_FLOAT32_C( 0.55),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 1.50),
SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( 1.30), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.58) },
UINT8_C(239),
{ SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 1.31),
SIMDE_FLOAT32_C( 1.27), SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 1.64),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 1.77),
SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 0.47) },
{ SIMDE_FLOAT32_C( -1.24), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( -0.28),
SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( -0.65),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 1.50),
SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( 1.30), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.58) } },
{ { SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( 1.08),
SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 0.97),
SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 1.18) },
UINT8_C( 23),
{ SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 1.72), SIMDE_FLOAT32_C( 1.69),
SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 1.34),
SIMDE_FLOAT32_C( 1.94), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 1.92),
SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 1.85), SIMDE_FLOAT32_C( 0.67) },
{ SIMDE_FLOAT32_C( -1.82), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 1.08),
SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 0.97),
SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 1.18) } },
{ { SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 1.32), SIMDE_FLOAT32_C( 1.88),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 1.49),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 0.96) },
UINT8_C( 91),
{ SIMDE_FLOAT32_C( 1.65), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 1.01),
SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 1.74),
SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 1.64) },
{ SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 1.32), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 1.49),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 0.96) } },
{ { SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 1.98),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 1.61),
SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 1.68),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 1.65) },
UINT8_C(144),
{ SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 1.07),
SIMDE_FLOAT32_C( 1.61), SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 1.20), SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 1.31),
SIMDE_FLOAT32_C( 1.60), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.88) },
{ SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 1.98),
SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.51),
SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 1.68),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 1.65) } },
{ { SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 1.03),
SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 1.52), SIMDE_FLOAT32_C( 1.62),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 1.20), SIMDE_FLOAT32_C( 1.61), SIMDE_FLOAT32_C( 0.53),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 1.26), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.08) },
UINT8_C(233),
{ SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 1.15), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 1.46),
SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 1.15),
SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 1.71), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 1.99),
SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 1.69) },
{ SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( -0.43),
SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( -1.82), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( -0.13),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 1.20), SIMDE_FLOAT32_C( 1.61), SIMDE_FLOAT32_C( 0.53),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 1.26), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.08) } },
{ { SIMDE_FLOAT32_C( 1.44), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( 1.85),
SIMDE_FLOAT32_C( 1.09), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 1.37), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 1.74),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 1.67), SIMDE_FLOAT32_C( 0.31) },
UINT8_C(221),
{ SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 1.77), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 1.19),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 1.88),
SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.56),
SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 0.47) },
{ SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( -0.17),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( -1.10),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 1.37), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 1.74),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 1.67), SIMDE_FLOAT32_C( 0.31) } },
{ { SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 1.85), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 1.62),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 1.48) },
UINT8_C(108),
{ SIMDE_FLOAT32_C( 1.72), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 1.40),
SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 1.38), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 1.15),
SIMDE_FLOAT32_C( 1.38), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 1.22) },
{ SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 1.85), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.10),
SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 1.62),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 1.48) } },
{ { SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( 0.19),
SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.58),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 1.93) },
UINT8_C(110),
{ SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 1.50),
SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 1.46), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 1.01),
SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 1.51), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.53),
SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 1.61), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 1.26) },
{ SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( -0.48),
SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.58),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 1.93) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_erfcinv_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_erfcinv_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 1.62), SIMDE_FLOAT64_C( 0.30),
SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 1.24), SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 1.25) },
{ SIMDE_FLOAT64_C( 1.28), SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( -0.62), SIMDE_FLOAT64_C( 0.73),
SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( -0.22), SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( -0.23) } },
{ { SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 1.75), SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 0.09),
SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 1.42), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.26) },
{ SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( -0.81), SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 1.20),
SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( 1.16), SIMDE_FLOAT64_C( 0.80) } },
{ { SIMDE_FLOAT64_C( 1.39), SIMDE_FLOAT64_C( 1.04), SIMDE_FLOAT64_C( 1.66), SIMDE_FLOAT64_C( 0.51),
SIMDE_FLOAT64_C( 1.47), SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 1.28), SIMDE_FLOAT64_C( 0.25) },
{ SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( -0.04), SIMDE_FLOAT64_C( -0.67), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( -0.25), SIMDE_FLOAT64_C( 0.81) } },
{ { SIMDE_FLOAT64_C( 1.45), SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 1.98), SIMDE_FLOAT64_C( 0.38),
SIMDE_FLOAT64_C( 1.21), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 1.28) },
{ SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( -1.64), SIMDE_FLOAT64_C( 0.62),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( -0.25) } },
{ { SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( 1.58), SIMDE_FLOAT64_C( 1.25),
SIMDE_FLOAT64_C( 1.53), SIMDE_FLOAT64_C( 1.76), SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.21) },
{ SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( -0.57), SIMDE_FLOAT64_C( -0.23),
SIMDE_FLOAT64_C( -0.51), SIMDE_FLOAT64_C( -0.83), SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 0.89) } },
{ { SIMDE_FLOAT64_C( 1.51), SIMDE_FLOAT64_C( 1.01), SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( 1.94),
SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( 1.83) },
{ SIMDE_FLOAT64_C( -0.49), SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( -1.33),
SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( -0.97) } },
{ { SIMDE_FLOAT64_C( 1.43), SIMDE_FLOAT64_C( 1.86), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.90),
SIMDE_FLOAT64_C( 0.64), SIMDE_FLOAT64_C( 1.62), SIMDE_FLOAT64_C( 1.15), SIMDE_FLOAT64_C( 0.09) },
{ SIMDE_FLOAT64_C( -0.40), SIMDE_FLOAT64_C( -1.04), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.09),
SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( -0.62), SIMDE_FLOAT64_C( -0.13), SIMDE_FLOAT64_C( 1.20) } },
{ { SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 1.13), SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 1.76),
SIMDE_FLOAT64_C( 1.48), SIMDE_FLOAT64_C( 1.14), SIMDE_FLOAT64_C( 1.04), SIMDE_FLOAT64_C( 0.21) },
{ SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( -0.12), SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( -0.83),
SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( -0.12), SIMDE_FLOAT64_C( -0.04), SIMDE_FLOAT64_C( 0.89) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_erfcinv_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_erfcinv_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 1.36), SIMDE_FLOAT64_C( 1.68),
SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( 1.10), SIMDE_FLOAT64_C( 0.08) },
UINT8_C(117),
{ SIMDE_FLOAT64_C( 0.85), SIMDE_FLOAT64_C( 1.24), SIMDE_FLOAT64_C( 1.32), SIMDE_FLOAT64_C( 0.39),
SIMDE_FLOAT64_C( 1.15), SIMDE_FLOAT64_C( 1.13), SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( 0.74) },
{ SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( 1.68),
SIMDE_FLOAT64_C( -0.13), SIMDE_FLOAT64_C( -0.12), SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( 0.08) } },
{ { SIMDE_FLOAT64_C( 1.75), SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 1.37),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 1.22), SIMDE_FLOAT64_C( 1.43), SIMDE_FLOAT64_C( 0.91) },
UINT8_C( 90),
{ SIMDE_FLOAT64_C( 0.35), SIMDE_FLOAT64_C( 1.48), SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 1.46),
SIMDE_FLOAT64_C( 1.53), SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 0.89) },
{ SIMDE_FLOAT64_C( 1.75), SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -0.43),
SIMDE_FLOAT64_C( -0.51), SIMDE_FLOAT64_C( 1.22), SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.91) } },
{ { SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 1.01), SIMDE_FLOAT64_C( 1.06), SIMDE_FLOAT64_C( 1.33),
SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( 1.27), SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( 0.33) },
UINT8_C(134),
{ SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 1.48), SIMDE_FLOAT64_C( 1.71), SIMDE_FLOAT64_C( 1.22),
SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 1.46), SIMDE_FLOAT64_C( 1.67), SIMDE_FLOAT64_C( 1.01) },
{ SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 1.33),
SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( 1.27), SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( -0.01) } },
{ { SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 0.26),
SIMDE_FLOAT64_C( 1.34), SIMDE_FLOAT64_C( 1.77), SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( 0.83) },
UINT8_C(179),
{ SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.36), SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( 0.51),
SIMDE_FLOAT64_C( 1.25), SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( 1.52), SIMDE_FLOAT64_C( 0.31) },
{ SIMDE_FLOAT64_C( 1.28), SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 0.26),
SIMDE_FLOAT64_C( -0.23), SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( 0.72) } },
{ { SIMDE_FLOAT64_C( 1.92), SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( 1.58), SIMDE_FLOAT64_C( 0.10),
SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 0.16), SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.41) },
UINT8_C(115),
{ SIMDE_FLOAT64_C( 1.87), SIMDE_FLOAT64_C( 0.63), SIMDE_FLOAT64_C( 1.34), SIMDE_FLOAT64_C( 1.54),
SIMDE_FLOAT64_C( 1.64), SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( 1.97), SIMDE_FLOAT64_C( 1.86) },
{ SIMDE_FLOAT64_C( -1.07), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 1.58), SIMDE_FLOAT64_C( 0.10),
SIMDE_FLOAT64_C( -0.65), SIMDE_FLOAT64_C( 0.97), SIMDE_FLOAT64_C( -1.53), SIMDE_FLOAT64_C( 0.41) } },
{ { SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 1.32), SIMDE_FLOAT64_C( 1.63), SIMDE_FLOAT64_C( 1.04),
SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 1.46), SIMDE_FLOAT64_C( 1.11), SIMDE_FLOAT64_C( 0.50) },
UINT8_C( 31),
{ SIMDE_FLOAT64_C( 1.62), SIMDE_FLOAT64_C( 1.75), SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 1.14),
SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 0.36), SIMDE_FLOAT64_C( 1.74), SIMDE_FLOAT64_C( 1.64) },
{ SIMDE_FLOAT64_C( -0.62), SIMDE_FLOAT64_C( -0.81), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( -0.12),
SIMDE_FLOAT64_C( 1.33), SIMDE_FLOAT64_C( 1.46), SIMDE_FLOAT64_C( 1.11), SIMDE_FLOAT64_C( 0.50) } },
{ { SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 1.79), SIMDE_FLOAT64_C( 1.11),
SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( 1.67), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 1.71) },
UINT8_C(152),
{ SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( 1.35), SIMDE_FLOAT64_C( 1.17), SIMDE_FLOAT64_C( 0.50),
SIMDE_FLOAT64_C( 1.21), SIMDE_FLOAT64_C( 1.60), SIMDE_FLOAT64_C( 1.82), SIMDE_FLOAT64_C( 0.85) },
{ SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 1.79), SIMDE_FLOAT64_C( 0.48),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 1.67), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 0.13) } },
{ { SIMDE_FLOAT64_C( 0.64), SIMDE_FLOAT64_C( 1.96), SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( 1.74),
SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 1.37), SIMDE_FLOAT64_C( 0.21) },
UINT8_C( 95),
{ SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.25),
SIMDE_FLOAT64_C( 1.91), SIMDE_FLOAT64_C( 1.38), SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 1.70) },
{ SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( -1.20), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.21) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_erfcinv_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_exp_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( -1.86), SIMDE_FLOAT32_C( 1.40), SIMDE_FLOAT32_C( 3.13) },
{ SIMDE_FLOAT32_C( 2.66), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 4.06), SIMDE_FLOAT32_C( 22.87) } },
{ { SIMDE_FLOAT32_C( -1.01), SIMDE_FLOAT32_C( -1.34), SIMDE_FLOAT32_C( 2.77), SIMDE_FLOAT32_C( -0.13) },
{ SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 15.96), SIMDE_FLOAT32_C( 0.88) } },
{ { SIMDE_FLOAT32_C( -2.37), SIMDE_FLOAT32_C( 2.01), SIMDE_FLOAT32_C( -3.83), SIMDE_FLOAT32_C( -3.05) },
{ SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 7.46), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.05) } },
{ { SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( -1.44), SIMDE_FLOAT32_C( 3.85), SIMDE_FLOAT32_C( 2.66) },
{ SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 46.99), SIMDE_FLOAT32_C( 14.30) } },
{ { SIMDE_FLOAT32_C( -1.62), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 2.09) },
{ SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 1.32), SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 8.08) } },
{ { SIMDE_FLOAT32_C( -1.46), SIMDE_FLOAT32_C( -3.87), SIMDE_FLOAT32_C( -1.51), SIMDE_FLOAT32_C( -0.90) },
{ SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.41) } },
{ { SIMDE_FLOAT32_C( -1.48), SIMDE_FLOAT32_C( 3.26), SIMDE_FLOAT32_C( 3.11), SIMDE_FLOAT32_C( 2.62) },
{ SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 26.05), SIMDE_FLOAT32_C( 22.42), SIMDE_FLOAT32_C( 13.74) } },
{ { SIMDE_FLOAT32_C( 2.92), SIMDE_FLOAT32_C( 2.52), SIMDE_FLOAT32_C( -1.27), SIMDE_FLOAT32_C( -0.09) },
{ SIMDE_FLOAT32_C( 18.54), SIMDE_FLOAT32_C( 12.43), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.91) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_exp_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_exp_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -2.66), SIMDE_FLOAT64_C( -2.80) },
{ SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.06) } },
{ { SIMDE_FLOAT64_C( -3.89), SIMDE_FLOAT64_C( -1.37) },
{ SIMDE_FLOAT64_C( 0.02), SIMDE_FLOAT64_C( 0.25) } },
{ { SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( -2.64) },
{ SIMDE_FLOAT64_C( 1.25), SIMDE_FLOAT64_C( 0.07) } },
{ { SIMDE_FLOAT64_C( -3.57), SIMDE_FLOAT64_C( -2.12) },
{ SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.12) } },
{ { SIMDE_FLOAT64_C( 1.63), SIMDE_FLOAT64_C( 1.90) },
{ SIMDE_FLOAT64_C( 5.10), SIMDE_FLOAT64_C( 6.69) } },
{ { SIMDE_FLOAT64_C( -3.29), SIMDE_FLOAT64_C( 2.38) },
{ SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 10.80) } },
{ { SIMDE_FLOAT64_C( 2.98), SIMDE_FLOAT64_C( -3.59) },
{ SIMDE_FLOAT64_C( 19.69), SIMDE_FLOAT64_C( 0.03) } },
{ { SIMDE_FLOAT64_C( 1.60), SIMDE_FLOAT64_C( 3.03) },
{ SIMDE_FLOAT64_C( 4.95), SIMDE_FLOAT64_C( 20.70) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_exp_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_exp_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 3.31),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( -1.86), SIMDE_FLOAT32_C( -0.07) },
{ SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 27.39),
SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.93) } },
{ { SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( 3.55), SIMDE_FLOAT32_C( 2.79),
SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( -2.09), SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( -2.83) },
{ SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 34.81), SIMDE_FLOAT32_C( 16.28),
SIMDE_FLOAT32_C( 1.84), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 6.82), SIMDE_FLOAT32_C( 0.06) } },
{ { SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( 3.78), SIMDE_FLOAT32_C( 1.32), SIMDE_FLOAT32_C( 0.41),
SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 3.24), SIMDE_FLOAT32_C( -3.08), SIMDE_FLOAT32_C( 1.67) },
{ SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 43.82), SIMDE_FLOAT32_C( 3.74), SIMDE_FLOAT32_C( 1.51),
SIMDE_FLOAT32_C( 36.60), SIMDE_FLOAT32_C( 25.53), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 5.31) } },
{ { SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( -0.47),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -3.11), SIMDE_FLOAT32_C( -2.54), SIMDE_FLOAT32_C( -2.91) },
{ SIMDE_FLOAT32_C( 30.57), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( 0.63),
SIMDE_FLOAT32_C( 1.60), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.05) } },
{ { SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -3.60), SIMDE_FLOAT32_C( -3.49),
SIMDE_FLOAT32_C( -1.85), SIMDE_FLOAT32_C( -1.46), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 1.26) },
{ SIMDE_FLOAT32_C( 2.29), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 3.53) } },
{ { SIMDE_FLOAT32_C( 2.29), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( -1.11),
SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( -2.02), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( -2.75) },
{ SIMDE_FLOAT32_C( 9.87), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 6.62), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 1.07), SIMDE_FLOAT32_C( 0.06) } },
{ { SIMDE_FLOAT32_C( -2.25), SIMDE_FLOAT32_C( -2.61), SIMDE_FLOAT32_C( 1.66), SIMDE_FLOAT32_C( -2.65),
SIMDE_FLOAT32_C( -3.37), SIMDE_FLOAT32_C( 2.59), SIMDE_FLOAT32_C( 3.02), SIMDE_FLOAT32_C( -3.95) },
{ SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 5.26), SIMDE_FLOAT32_C( 0.07),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 13.33), SIMDE_FLOAT32_C( 20.49), SIMDE_FLOAT32_C( 0.02) } },
{ { SIMDE_FLOAT32_C( -1.74), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 2.73),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 3.82), SIMDE_FLOAT32_C( -2.64) },
{ SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 15.33),
SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 2.83), SIMDE_FLOAT32_C( 45.60), SIMDE_FLOAT32_C( 0.07) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_exp_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_exp_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 3.29), SIMDE_FLOAT64_C( -2.77), SIMDE_FLOAT64_C( 3.69), SIMDE_FLOAT64_C( -0.61) },
{ SIMDE_FLOAT64_C( 26.84), SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 40.04), SIMDE_FLOAT64_C( 0.54) } },
{ { SIMDE_FLOAT64_C( -1.69), SIMDE_FLOAT64_C( 0.88), SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 1.60) },
{ SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 2.41), SIMDE_FLOAT64_C( 2.29), SIMDE_FLOAT64_C( 4.95) } },
{ { SIMDE_FLOAT64_C( -2.30), SIMDE_FLOAT64_C( 2.39), SIMDE_FLOAT64_C( -1.55), SIMDE_FLOAT64_C( -3.39) },
{ SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 10.91), SIMDE_FLOAT64_C( 0.21), SIMDE_FLOAT64_C( 0.03) } },
{ { SIMDE_FLOAT64_C( 3.91), SIMDE_FLOAT64_C( -3.26), SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( -1.96) },
{ SIMDE_FLOAT64_C( 49.90), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.14) } },
{ { SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 2.77), SIMDE_FLOAT64_C( -1.45), SIMDE_FLOAT64_C( -1.25) },
{ SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( 15.96), SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.29) } },
{ { SIMDE_FLOAT64_C( -1.13), SIMDE_FLOAT64_C( 2.76), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 2.44) },
{ SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( 15.80), SIMDE_FLOAT64_C( 2.69), SIMDE_FLOAT64_C( 11.47) } },
{ { SIMDE_FLOAT64_C( -1.89), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( 0.58) },
{ SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( 1.01), SIMDE_FLOAT64_C( 1.72), SIMDE_FLOAT64_C( 1.79) } },
{ { SIMDE_FLOAT64_C( 1.40), SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( 1.70), SIMDE_FLOAT64_C( 0.69) },
{ SIMDE_FLOAT64_C( 4.06), SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( 5.47), SIMDE_FLOAT64_C( 1.99) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_exp_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_exp_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -2.09), SIMDE_FLOAT32_C( -2.90),
SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( 3.88), SIMDE_FLOAT32_C( -2.98), SIMDE_FLOAT32_C( -3.94),
SIMDE_FLOAT32_C( -1.92), SIMDE_FLOAT32_C( 2.94), SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( -0.28),
SIMDE_FLOAT32_C( 2.19), SIMDE_FLOAT32_C( -2.64), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 3.50) },
{ SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 48.42), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.02),
SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 18.92), SIMDE_FLOAT32_C( 6.55), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 8.94), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 33.12) } },
{ { SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 2.28), SIMDE_FLOAT32_C( -3.90), SIMDE_FLOAT32_C( 1.18),
SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( -1.46), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( 2.83),
SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( -2.52), SIMDE_FLOAT32_C( 2.98), SIMDE_FLOAT32_C( 1.96),
SIMDE_FLOAT32_C( -2.99), SIMDE_FLOAT32_C( -2.11), SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 2.29) },
{ SIMDE_FLOAT32_C( 21.54), SIMDE_FLOAT32_C( 9.78), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 3.25),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 16.95),
SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 19.69), SIMDE_FLOAT32_C( 7.10),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 3.19), SIMDE_FLOAT32_C( 9.87) } },
{ { SIMDE_FLOAT32_C( 1.94), SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 3.39), SIMDE_FLOAT32_C( -2.62),
SIMDE_FLOAT32_C( 2.95), SIMDE_FLOAT32_C( -3.59), SIMDE_FLOAT32_C( -2.56), SIMDE_FLOAT32_C( -2.97),
SIMDE_FLOAT32_C( 3.35), SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 1.54),
SIMDE_FLOAT32_C( -3.32), SIMDE_FLOAT32_C( -3.62), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 3.75) },
{ SIMDE_FLOAT32_C( 6.96), SIMDE_FLOAT32_C( 21.54), SIMDE_FLOAT32_C( 29.67), SIMDE_FLOAT32_C( 0.07),
SIMDE_FLOAT32_C( 19.11), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 28.50), SIMDE_FLOAT32_C( 27.94), SIMDE_FLOAT32_C( 2.12), SIMDE_FLOAT32_C( 4.66),
SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 2.83), SIMDE_FLOAT32_C( 42.52) } },
{ { SIMDE_FLOAT32_C( 2.66), SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( -1.93),
SIMDE_FLOAT32_C( 3.69), SIMDE_FLOAT32_C( -3.45), SIMDE_FLOAT32_C( -3.09), SIMDE_FLOAT32_C( -0.37),
SIMDE_FLOAT32_C( -1.97), SIMDE_FLOAT32_C( 3.89), SIMDE_FLOAT32_C( -2.41), SIMDE_FLOAT32_C( -0.96),
SIMDE_FLOAT32_C( -2.21), SIMDE_FLOAT32_C( 2.75), SIMDE_FLOAT32_C( -2.67), SIMDE_FLOAT32_C( 3.72) },
{ SIMDE_FLOAT32_C( 14.30), SIMDE_FLOAT32_C( 3.13), SIMDE_FLOAT32_C( 2.56), SIMDE_FLOAT32_C( 0.15),
SIMDE_FLOAT32_C( 40.04), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 48.91), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 15.64), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 41.26) } },
{ { SIMDE_FLOAT32_C( 1.82), SIMDE_FLOAT32_C( -3.27), SIMDE_FLOAT32_C( -2.90), SIMDE_FLOAT32_C( 0.77),
SIMDE_FLOAT32_C( -2.86), SIMDE_FLOAT32_C( -1.45), SIMDE_FLOAT32_C( 1.79), SIMDE_FLOAT32_C( -3.51),
SIMDE_FLOAT32_C( -2.13), SIMDE_FLOAT32_C( -1.46), SIMDE_FLOAT32_C( 2.03), SIMDE_FLOAT32_C( -1.45),
SIMDE_FLOAT32_C( -1.08), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( -1.69), SIMDE_FLOAT32_C( -2.41) },
{ SIMDE_FLOAT32_C( 6.17), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 2.16),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 5.99), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 7.61), SIMDE_FLOAT32_C( 0.23),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.09) } },
{ { SIMDE_FLOAT32_C( -3.79), SIMDE_FLOAT32_C( 3.24), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 3.90),
SIMDE_FLOAT32_C( 3.80), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -2.17),
SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 1.13), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 2.24),
SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( 2.20), SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( -2.30) },
{ SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 25.53), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 49.40),
SIMDE_FLOAT32_C( 44.70), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 3.10), SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( 9.39),
SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 9.03), SIMDE_FLOAT32_C( 7.10), SIMDE_FLOAT32_C( 0.10) } },
{ { SIMDE_FLOAT32_C( 2.93), SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 2.46), SIMDE_FLOAT32_C( -3.93),
SIMDE_FLOAT32_C( -2.39), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -3.44), SIMDE_FLOAT32_C( -0.52),
SIMDE_FLOAT32_C( 2.80), SIMDE_FLOAT32_C( 2.60), SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( -2.28),
SIMDE_FLOAT32_C( -2.33), SIMDE_FLOAT32_C( -3.65), SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( -2.12) },
{ SIMDE_FLOAT32_C( 18.73), SIMDE_FLOAT32_C( 21.54), SIMDE_FLOAT32_C( 11.70), SIMDE_FLOAT32_C( 0.02),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 1.30), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 16.44), SIMDE_FLOAT32_C( 13.46), SIMDE_FLOAT32_C( 7.69), SIMDE_FLOAT32_C( 0.10),
SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.12) } },
{ { SIMDE_FLOAT32_C( 3.59), SIMDE_FLOAT32_C( 2.96), SIMDE_FLOAT32_C( -2.22), SIMDE_FLOAT32_C( 3.39),
SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( 1.32), SIMDE_FLOAT32_C( -2.78), SIMDE_FLOAT32_C( 3.98),
SIMDE_FLOAT32_C( -1.55), SIMDE_FLOAT32_C( 2.09), SIMDE_FLOAT32_C( 2.22), SIMDE_FLOAT32_C( 2.33),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -3.98), SIMDE_FLOAT32_C( -0.78) },
{ SIMDE_FLOAT32_C( 36.23), SIMDE_FLOAT32_C( 19.30), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 29.67),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 3.74), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 53.52),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 8.08), SIMDE_FLOAT32_C( 9.21), SIMDE_FLOAT32_C( 10.28),
SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 1.20), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.46) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_exp_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_exp_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 3.92), SIMDE_FLOAT32_C( -2.75), SIMDE_FLOAT32_C( 3.98),
SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( 2.66),
SIMDE_FLOAT32_C( 1.91), SIMDE_FLOAT32_C( 3.79), SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 0.54),
SIMDE_FLOAT32_C( 2.11), SIMDE_FLOAT32_C( -1.61), SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( 3.46) },
UINT8_C( 98),
{ SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 1.44), SIMDE_FLOAT32_C( -0.22),
SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 3.48), SIMDE_FLOAT32_C( -2.98), SIMDE_FLOAT32_C( -3.66),
SIMDE_FLOAT32_C( -2.38), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 1.62), SIMDE_FLOAT32_C( -3.73),
SIMDE_FLOAT32_C( 3.17), SIMDE_FLOAT32_C( -2.33), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 3.09) },
{ SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 8.50), SIMDE_FLOAT32_C( -2.75), SIMDE_FLOAT32_C( 3.98),
SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 32.46), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 2.66),
SIMDE_FLOAT32_C( 1.91), SIMDE_FLOAT32_C( 3.79), SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 0.54),
SIMDE_FLOAT32_C( 2.11), SIMDE_FLOAT32_C( -1.61), SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( 3.46) } },
{ { SIMDE_FLOAT32_C( -1.08), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -3.83),
SIMDE_FLOAT32_C( -2.58), SIMDE_FLOAT32_C( -1.71), SIMDE_FLOAT32_C( 2.08), SIMDE_FLOAT32_C( -2.80),
SIMDE_FLOAT32_C( -3.29), SIMDE_FLOAT32_C( -1.38), SIMDE_FLOAT32_C( 3.32), SIMDE_FLOAT32_C( -0.90),
SIMDE_FLOAT32_C( -1.54), SIMDE_FLOAT32_C( 2.77), SIMDE_FLOAT32_C( 1.45), SIMDE_FLOAT32_C( 2.08) },
UINT8_C(254),
{ SIMDE_FLOAT32_C( -1.11), SIMDE_FLOAT32_C( -2.14), SIMDE_FLOAT32_C( -2.35), SIMDE_FLOAT32_C( -1.63),
SIMDE_FLOAT32_C( -1.11), SIMDE_FLOAT32_C( -2.01), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 3.59),
SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 2.76), SIMDE_FLOAT32_C( -2.73),
SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 1.85), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( -0.66) },
{ SIMDE_FLOAT32_C( -1.08), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.20),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 36.23),
SIMDE_FLOAT32_C( -3.29), SIMDE_FLOAT32_C( -1.38), SIMDE_FLOAT32_C( 3.32), SIMDE_FLOAT32_C( -0.90),
SIMDE_FLOAT32_C( -1.54), SIMDE_FLOAT32_C( 2.77), SIMDE_FLOAT32_C( 1.45), SIMDE_FLOAT32_C( 2.08) } },
{ { SIMDE_FLOAT32_C( -2.52), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( -0.24),
SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 1.06),
SIMDE_FLOAT32_C( 1.27), SIMDE_FLOAT32_C( 3.57), SIMDE_FLOAT32_C( 3.52), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 1.02), SIMDE_FLOAT32_C( 1.60), SIMDE_FLOAT32_C( -3.04), SIMDE_FLOAT32_C( 3.91) },
UINT8_C(140),
{ SIMDE_FLOAT32_C( -1.39), SIMDE_FLOAT32_C( -1.72), SIMDE_FLOAT32_C( -1.65), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 2.27), SIMDE_FLOAT32_C( -2.06), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( -1.47),
SIMDE_FLOAT32_C( -3.30), SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( -1.67), SIMDE_FLOAT32_C( 2.55),
SIMDE_FLOAT32_C( -2.33), SIMDE_FLOAT32_C( 1.67), SIMDE_FLOAT32_C( -3.97), SIMDE_FLOAT32_C( 2.04) },
{ SIMDE_FLOAT32_C( -2.52), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 1.80),
SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.23),
SIMDE_FLOAT32_C( 1.27), SIMDE_FLOAT32_C( 3.57), SIMDE_FLOAT32_C( 3.52), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 1.02), SIMDE_FLOAT32_C( 1.60), SIMDE_FLOAT32_C( -3.04), SIMDE_FLOAT32_C( 3.91) } },
{ { SIMDE_FLOAT32_C( -1.58), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -3.52), SIMDE_FLOAT32_C( -3.63),
SIMDE_FLOAT32_C( -3.74), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 3.83),
SIMDE_FLOAT32_C( 1.07), SIMDE_FLOAT32_C( -2.32), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( -1.33),
SIMDE_FLOAT32_C( -1.36), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( -1.87), SIMDE_FLOAT32_C( 1.25) },
UINT8_C(221),
{ SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -2.16), SIMDE_FLOAT32_C( 1.30), SIMDE_FLOAT32_C( 2.41),
SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( 3.83), SIMDE_FLOAT32_C( 3.11), SIMDE_FLOAT32_C( -0.48),
SIMDE_FLOAT32_C( -1.84), SIMDE_FLOAT32_C( 1.66), SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 3.83),
SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -1.75), SIMDE_FLOAT32_C( -2.52) },
{ SIMDE_FLOAT32_C( 1.60), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( 3.67), SIMDE_FLOAT32_C( 11.13),
SIMDE_FLOAT32_C( 7.69), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 22.42), SIMDE_FLOAT32_C( 0.62),
SIMDE_FLOAT32_C( 1.07), SIMDE_FLOAT32_C( -2.32), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( -1.33),
SIMDE_FLOAT32_C( -1.36), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( -1.87), SIMDE_FLOAT32_C( 1.25) } },
{ { SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -1.37), SIMDE_FLOAT32_C( -2.26), SIMDE_FLOAT32_C( -2.75),
SIMDE_FLOAT32_C( -3.73), SIMDE_FLOAT32_C( -2.43), SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( -2.05),
SIMDE_FLOAT32_C( 2.41), SIMDE_FLOAT32_C( -3.02), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.83),
SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( -2.16), SIMDE_FLOAT32_C( -1.80), SIMDE_FLOAT32_C( 3.58) },
UINT8_C(165),
{ SIMDE_FLOAT32_C( 3.50), SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( -2.28), SIMDE_FLOAT32_C( 3.33),
SIMDE_FLOAT32_C( 1.10), SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( -2.51), SIMDE_FLOAT32_C( -1.23),
SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( -2.68), SIMDE_FLOAT32_C( -3.54), SIMDE_FLOAT32_C( 1.66),
SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( -2.06), SIMDE_FLOAT32_C( -2.63), SIMDE_FLOAT32_C( 2.20) },
{ SIMDE_FLOAT32_C( 33.12), SIMDE_FLOAT32_C( -1.37), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( -2.75),
SIMDE_FLOAT32_C( -3.73), SIMDE_FLOAT32_C( 3.46), SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( 2.41), SIMDE_FLOAT32_C( -3.02), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.83),
SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( -2.16), SIMDE_FLOAT32_C( -1.80), SIMDE_FLOAT32_C( 3.58) } },
{ { SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -1.38), SIMDE_FLOAT32_C( 2.46), SIMDE_FLOAT32_C( 1.24),
SIMDE_FLOAT32_C( -3.06), SIMDE_FLOAT32_C( -3.58), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( -2.07),
SIMDE_FLOAT32_C( 1.01), SIMDE_FLOAT32_C( 2.83), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 2.85),
SIMDE_FLOAT32_C( -2.97), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( -1.47), SIMDE_FLOAT32_C( -3.47) },
UINT8_C(155),
{ SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 3.85), SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( -2.50),
SIMDE_FLOAT32_C( -2.66), SIMDE_FLOAT32_C( -1.52), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -1.34),
SIMDE_FLOAT32_C( -1.06), SIMDE_FLOAT32_C( -2.40), SIMDE_FLOAT32_C( 2.23), SIMDE_FLOAT32_C( 0.88),
SIMDE_FLOAT32_C( -1.03), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( -3.45), SIMDE_FLOAT32_C( 1.60) },
{ SIMDE_FLOAT32_C( 1.30), SIMDE_FLOAT32_C( 46.99), SIMDE_FLOAT32_C( 2.46), SIMDE_FLOAT32_C( 0.08),
SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( -3.58), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 0.26),
SIMDE_FLOAT32_C( 1.01), SIMDE_FLOAT32_C( 2.83), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 2.85),
SIMDE_FLOAT32_C( -2.97), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( -1.47), SIMDE_FLOAT32_C( -3.47) } },
{ { SIMDE_FLOAT32_C( -1.11), SIMDE_FLOAT32_C( 1.79), SIMDE_FLOAT32_C( 2.54), SIMDE_FLOAT32_C( -0.69),
SIMDE_FLOAT32_C( -2.55), SIMDE_FLOAT32_C( -3.53), SIMDE_FLOAT32_C( -3.68), SIMDE_FLOAT32_C( -3.72),
SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 3.17), SIMDE_FLOAT32_C( -2.70), SIMDE_FLOAT32_C( -1.88),
SIMDE_FLOAT32_C( -2.30), SIMDE_FLOAT32_C( -2.17), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 1.96) },
UINT8_C(151),
{ SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 3.47), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 2.93),
SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 1.68), SIMDE_FLOAT32_C( -2.13), SIMDE_FLOAT32_C( 1.01),
SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( 2.75), SIMDE_FLOAT32_C( 3.99), SIMDE_FLOAT32_C( -3.67),
SIMDE_FLOAT32_C( 3.30), SIMDE_FLOAT32_C( 1.59), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 1.09) },
{ SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 32.14), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( -0.69),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( -3.53), SIMDE_FLOAT32_C( -3.68), SIMDE_FLOAT32_C( 2.75),
SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 3.17), SIMDE_FLOAT32_C( -2.70), SIMDE_FLOAT32_C( -1.88),
SIMDE_FLOAT32_C( -2.30), SIMDE_FLOAT32_C( -2.17), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 1.96) } },
{ { SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 2.54), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 2.84), SIMDE_FLOAT32_C( 2.82), SIMDE_FLOAT32_C( -1.91), SIMDE_FLOAT32_C( 2.01),
SIMDE_FLOAT32_C( -3.88), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( -2.04),
SIMDE_FLOAT32_C( -3.05), SIMDE_FLOAT32_C( 1.68), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 1.40) },
UINT8_C( 24),
{ SIMDE_FLOAT32_C( 2.67), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -3.44), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( 2.21), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -3.74), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( -3.40), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.86),
SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( -2.65), SIMDE_FLOAT32_C( 3.27), SIMDE_FLOAT32_C( -1.65) },
{ SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 2.54), SIMDE_FLOAT32_C( 1.42),
SIMDE_FLOAT32_C( 9.12), SIMDE_FLOAT32_C( 2.82), SIMDE_FLOAT32_C( -1.91), SIMDE_FLOAT32_C( 2.01),
SIMDE_FLOAT32_C( -3.88), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( -2.04),
SIMDE_FLOAT32_C( -3.05), SIMDE_FLOAT32_C( 1.68), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 1.40) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_exp_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_exp_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 3.06), SIMDE_FLOAT64_C( -0.78), SIMDE_FLOAT64_C( 1.53), SIMDE_FLOAT64_C( 2.94),
SIMDE_FLOAT64_C( -3.88), SIMDE_FLOAT64_C( 3.46), SIMDE_FLOAT64_C( 1.02), SIMDE_FLOAT64_C( -3.05) },
{ SIMDE_FLOAT64_C( 21.33), SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 4.62), SIMDE_FLOAT64_C( 18.92),
SIMDE_FLOAT64_C( 0.02), SIMDE_FLOAT64_C( 31.82), SIMDE_FLOAT64_C( 2.77), SIMDE_FLOAT64_C( 0.05) } },
{ { SIMDE_FLOAT64_C( 1.99), SIMDE_FLOAT64_C( -3.10), SIMDE_FLOAT64_C( 1.58), SIMDE_FLOAT64_C( 2.87),
SIMDE_FLOAT64_C( -2.25), SIMDE_FLOAT64_C( -0.61), SIMDE_FLOAT64_C( -0.12), SIMDE_FLOAT64_C( 3.71) },
{ SIMDE_FLOAT64_C( 7.32), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 4.85), SIMDE_FLOAT64_C( 17.64),
SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( 40.85) } },
{ { SIMDE_FLOAT64_C( -3.09), SIMDE_FLOAT64_C( 1.38), SIMDE_FLOAT64_C( -1.35), SIMDE_FLOAT64_C( -3.35),
SIMDE_FLOAT64_C( 2.49), SIMDE_FLOAT64_C( -1.09), SIMDE_FLOAT64_C( -3.89), SIMDE_FLOAT64_C( 0.92) },
{ SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 3.97), SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( 12.06), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.02), SIMDE_FLOAT64_C( 2.51) } },
{ { SIMDE_FLOAT64_C( -1.13), SIMDE_FLOAT64_C( -1.04), SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( 0.37),
SIMDE_FLOAT64_C( 1.47), SIMDE_FLOAT64_C( -3.30), SIMDE_FLOAT64_C( -0.41), SIMDE_FLOAT64_C( 0.53) },
{ SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( 0.35), SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( 1.45),
SIMDE_FLOAT64_C( 4.35), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 1.70) } },
{ { SIMDE_FLOAT64_C( -0.08), SIMDE_FLOAT64_C( -2.87), SIMDE_FLOAT64_C( -0.54), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -3.41), SIMDE_FLOAT64_C( -3.51), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 2.57) },
{ SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 1.04),
SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 2.69), SIMDE_FLOAT64_C( 13.07) } },
{ { SIMDE_FLOAT64_C( -2.62), SIMDE_FLOAT64_C( -1.43), SIMDE_FLOAT64_C( 1.44), SIMDE_FLOAT64_C( -0.87),
SIMDE_FLOAT64_C( 1.96), SIMDE_FLOAT64_C( -2.68), SIMDE_FLOAT64_C( -1.16), SIMDE_FLOAT64_C( 2.87) },
{ SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.24), SIMDE_FLOAT64_C( 4.22), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( 7.10), SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( 17.64) } },
{ { SIMDE_FLOAT64_C( 2.70), SIMDE_FLOAT64_C( 1.49), SIMDE_FLOAT64_C( 3.52), SIMDE_FLOAT64_C( 1.19),
SIMDE_FLOAT64_C( -3.59), SIMDE_FLOAT64_C( 3.63), SIMDE_FLOAT64_C( -1.89), SIMDE_FLOAT64_C( -0.72) },
{ SIMDE_FLOAT64_C( 14.88), SIMDE_FLOAT64_C( 4.44), SIMDE_FLOAT64_C( 33.78), SIMDE_FLOAT64_C( 3.29),
SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 37.71), SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( 0.49) } },
{ { SIMDE_FLOAT64_C( -1.41), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 3.65), SIMDE_FLOAT64_C( -3.94),
SIMDE_FLOAT64_C( 2.70), SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( -1.37) },
{ SIMDE_FLOAT64_C( 0.24), SIMDE_FLOAT64_C( 7.39), SIMDE_FLOAT64_C( 38.47), SIMDE_FLOAT64_C( 0.02),
SIMDE_FLOAT64_C( 14.88), SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 1.80), SIMDE_FLOAT64_C( 0.25) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_exp_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_exp_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -3.51), SIMDE_FLOAT64_C( -3.98), SIMDE_FLOAT64_C( 3.54), SIMDE_FLOAT64_C( -1.79),
SIMDE_FLOAT64_C( -1.83), SIMDE_FLOAT64_C( -3.73), SIMDE_FLOAT64_C( -3.51), SIMDE_FLOAT64_C( 3.71) },
UINT8_C(199),
{ SIMDE_FLOAT64_C( 2.33), SIMDE_FLOAT64_C( -1.17), SIMDE_FLOAT64_C( -1.77), SIMDE_FLOAT64_C( -2.21),
SIMDE_FLOAT64_C( 2.46), SIMDE_FLOAT64_C( 1.54), SIMDE_FLOAT64_C( -1.07), SIMDE_FLOAT64_C( -0.25) },
{ SIMDE_FLOAT64_C( 10.28), SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( -1.79),
SIMDE_FLOAT64_C( -1.83), SIMDE_FLOAT64_C( -3.73), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.78) } },
{ { SIMDE_FLOAT64_C( -2.63), SIMDE_FLOAT64_C( 1.07), SIMDE_FLOAT64_C( -1.37), SIMDE_FLOAT64_C( -0.96),
SIMDE_FLOAT64_C( -3.82), SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( -2.76), SIMDE_FLOAT64_C( -2.64) },
UINT8_C(126),
{ SIMDE_FLOAT64_C( 0.87), SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( 2.46),
SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 1.56), SIMDE_FLOAT64_C( 2.48), SIMDE_FLOAT64_C( -0.72) },
{ SIMDE_FLOAT64_C( -2.63), SIMDE_FLOAT64_C( 1.34), SIMDE_FLOAT64_C( 2.94), SIMDE_FLOAT64_C( 11.70),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 4.76), SIMDE_FLOAT64_C( 11.94), SIMDE_FLOAT64_C( -2.64) } },
{ { SIMDE_FLOAT64_C( 3.77), SIMDE_FLOAT64_C( -3.36), SIMDE_FLOAT64_C( -0.46), SIMDE_FLOAT64_C( -3.74),
SIMDE_FLOAT64_C( -3.65), SIMDE_FLOAT64_C( 1.21), SIMDE_FLOAT64_C( 2.59), SIMDE_FLOAT64_C( -0.82) },
UINT8_C( 39),
{ SIMDE_FLOAT64_C( -3.62), SIMDE_FLOAT64_C( -2.35), SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( -0.70),
SIMDE_FLOAT64_C( 1.40), SIMDE_FLOAT64_C( 2.35), SIMDE_FLOAT64_C( -3.63), SIMDE_FLOAT64_C( -3.97) },
{ SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 2.66), SIMDE_FLOAT64_C( -3.74),
SIMDE_FLOAT64_C( -3.65), SIMDE_FLOAT64_C( 10.49), SIMDE_FLOAT64_C( 2.59), SIMDE_FLOAT64_C( -0.82) } },
{ { SIMDE_FLOAT64_C( -2.61), SIMDE_FLOAT64_C( -3.45), SIMDE_FLOAT64_C( 0.02), SIMDE_FLOAT64_C( -1.37),
SIMDE_FLOAT64_C( -2.08), SIMDE_FLOAT64_C( -3.79), SIMDE_FLOAT64_C( 3.50), SIMDE_FLOAT64_C( 2.21) },
UINT8_C(165),
{ SIMDE_FLOAT64_C( 1.96), SIMDE_FLOAT64_C( -2.06), SIMDE_FLOAT64_C( -1.15), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( 1.22), SIMDE_FLOAT64_C( -1.37), SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( -3.24) },
{ SIMDE_FLOAT64_C( 7.10), SIMDE_FLOAT64_C( -3.45), SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( -1.37),
SIMDE_FLOAT64_C( -2.08), SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 3.50), SIMDE_FLOAT64_C( 0.04) } },
{ { SIMDE_FLOAT64_C( -1.11), SIMDE_FLOAT64_C( 1.43), SIMDE_FLOAT64_C( 1.97), SIMDE_FLOAT64_C( -2.52),
SIMDE_FLOAT64_C( -3.38), SIMDE_FLOAT64_C( 1.41), SIMDE_FLOAT64_C( -2.14), SIMDE_FLOAT64_C( -1.73) },
UINT8_C(202),
{ SIMDE_FLOAT64_C( 1.17), SIMDE_FLOAT64_C( 3.67), SIMDE_FLOAT64_C( -3.26), SIMDE_FLOAT64_C( 1.54),
SIMDE_FLOAT64_C( 3.70), SIMDE_FLOAT64_C( -1.87), SIMDE_FLOAT64_C( 2.10), SIMDE_FLOAT64_C( -0.28) },
{ SIMDE_FLOAT64_C( -1.11), SIMDE_FLOAT64_C( 39.25), SIMDE_FLOAT64_C( 1.97), SIMDE_FLOAT64_C( 4.66),
SIMDE_FLOAT64_C( -3.38), SIMDE_FLOAT64_C( 1.41), SIMDE_FLOAT64_C( 8.17), SIMDE_FLOAT64_C( 0.76) } },
{ { SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( -3.99), SIMDE_FLOAT64_C( -0.06), SIMDE_FLOAT64_C( 0.26),
SIMDE_FLOAT64_C( 2.22), SIMDE_FLOAT64_C( -2.77), SIMDE_FLOAT64_C( -1.78), SIMDE_FLOAT64_C( -3.84) },
UINT8_C(172),
{ SIMDE_FLOAT64_C( 2.66), SIMDE_FLOAT64_C( 1.38), SIMDE_FLOAT64_C( 2.71), SIMDE_FLOAT64_C( -0.26),
SIMDE_FLOAT64_C( 2.13), SIMDE_FLOAT64_C( -2.39), SIMDE_FLOAT64_C( -2.83), SIMDE_FLOAT64_C( 0.11) },
{ SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( -3.99), SIMDE_FLOAT64_C( 15.03), SIMDE_FLOAT64_C( 0.77),
SIMDE_FLOAT64_C( 2.22), SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( -1.78), SIMDE_FLOAT64_C( 1.12) } },
{ { SIMDE_FLOAT64_C( -0.91), SIMDE_FLOAT64_C( -2.21), SIMDE_FLOAT64_C( -2.48), SIMDE_FLOAT64_C( 0.95),
SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( -1.88), SIMDE_FLOAT64_C( -0.27) },
UINT8_C(244),
{ SIMDE_FLOAT64_C( 3.66), SIMDE_FLOAT64_C( -0.57), SIMDE_FLOAT64_C( 2.78), SIMDE_FLOAT64_C( 1.75),
SIMDE_FLOAT64_C( 3.15), SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( 1.76), SIMDE_FLOAT64_C( -0.91) },
{ SIMDE_FLOAT64_C( -0.91), SIMDE_FLOAT64_C( -2.21), SIMDE_FLOAT64_C( 16.12), SIMDE_FLOAT64_C( 0.95),
SIMDE_FLOAT64_C( 23.34), SIMDE_FLOAT64_C( 0.64), SIMDE_FLOAT64_C( 5.81), SIMDE_FLOAT64_C( 0.40) } },
{ { SIMDE_FLOAT64_C( 3.81), SIMDE_FLOAT64_C( -0.02), SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( -1.97),
SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( -3.60), SIMDE_FLOAT64_C( -3.30), SIMDE_FLOAT64_C( -2.48) },
UINT8_C(117),
{ SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 3.65), SIMDE_FLOAT64_C( -3.28), SIMDE_FLOAT64_C( 1.61),
SIMDE_FLOAT64_C( -0.24), SIMDE_FLOAT64_C( -0.20), SIMDE_FLOAT64_C( 3.40), SIMDE_FLOAT64_C( 1.27) },
{ SIMDE_FLOAT64_C( 1.55), SIMDE_FLOAT64_C( -0.02), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( -1.97),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 0.82), SIMDE_FLOAT64_C( 29.96), SIMDE_FLOAT64_C( -2.48) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_exp_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_expm1_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 6.33), SIMDE_FLOAT32_C( 1.68), SIMDE_FLOAT32_C( 8.16), SIMDE_FLOAT32_C( 5.04) },
{ SIMDE_FLOAT32_C( 560.16), SIMDE_FLOAT32_C( 4.37), SIMDE_FLOAT32_C( 3497.19), SIMDE_FLOAT32_C( 153.47) } },
{ { SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 8.63), SIMDE_FLOAT32_C( 5.23), SIMDE_FLOAT32_C( 4.43) },
{ SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 5596.08), SIMDE_FLOAT32_C( 185.79), SIMDE_FLOAT32_C( 82.93) } },
{ { SIMDE_FLOAT32_C( 7.85), SIMDE_FLOAT32_C( 6.92), SIMDE_FLOAT32_C( 2.77), SIMDE_FLOAT32_C( 5.34) },
{ SIMDE_FLOAT32_C( 2564.73), SIMDE_FLOAT32_C( 1011.32), SIMDE_FLOAT32_C( 14.96), SIMDE_FLOAT32_C( 207.51) } },
{ { SIMDE_FLOAT32_C( 6.60), SIMDE_FLOAT32_C( 1.06), SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( 0.13) },
{ SIMDE_FLOAT32_C( 734.10), SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( 6.54), SIMDE_FLOAT32_C( 0.14) } },
{ { SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 7.36), SIMDE_FLOAT32_C( 9.70), SIMDE_FLOAT32_C( 5.19) },
{ SIMDE_FLOAT32_C( 19.09), SIMDE_FLOAT32_C( 1570.84), SIMDE_FLOAT32_C( 16316.60), SIMDE_FLOAT32_C( 178.47) } },
{ { SIMDE_FLOAT32_C( 2.21), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 8.65), SIMDE_FLOAT32_C( 9.58) },
{ SIMDE_FLOAT32_C( 8.12), SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 5709.14), SIMDE_FLOAT32_C( 14471.42) } },
{ { SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( 4.96), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 7.49) },
{ SIMDE_FLOAT32_C( 39.45), SIMDE_FLOAT32_C( 141.59), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 1789.05) } },
{ { SIMDE_FLOAT32_C( 7.91), SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( 4.16), SIMDE_FLOAT32_C( 4.24) },
{ SIMDE_FLOAT32_C( 2723.39), SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( 63.07), SIMDE_FLOAT32_C( 68.41) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_expm1_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_expm1_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.24), SIMDE_FLOAT64_C( 7.18) },
{ SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( 1311.91) } },
{ { SIMDE_FLOAT64_C( 9.69), SIMDE_FLOAT64_C( 1.13) },
{ SIMDE_FLOAT64_C( 16154.24), SIMDE_FLOAT64_C( 2.10) } },
{ { SIMDE_FLOAT64_C( 6.24), SIMDE_FLOAT64_C( 8.67) },
{ SIMDE_FLOAT64_C( 511.86), SIMDE_FLOAT64_C( 5824.50) } },
{ { SIMDE_FLOAT64_C( 9.69), SIMDE_FLOAT64_C( 7.67) },
{ SIMDE_FLOAT64_C( 16154.24), SIMDE_FLOAT64_C( 2142.08) } },
{ { SIMDE_FLOAT64_C( 4.67), SIMDE_FLOAT64_C( 1.83) },
{ SIMDE_FLOAT64_C( 105.70), SIMDE_FLOAT64_C( 5.23) } },
{ { SIMDE_FLOAT64_C( 2.80), SIMDE_FLOAT64_C( 6.65) },
{ SIMDE_FLOAT64_C( 15.44), SIMDE_FLOAT64_C( 771.78) } },
{ { SIMDE_FLOAT64_C( 8.11), SIMDE_FLOAT64_C( 9.49) },
{ SIMDE_FLOAT64_C( 3326.58), SIMDE_FLOAT64_C( 13225.80) } },
{ { SIMDE_FLOAT64_C( 1.48), SIMDE_FLOAT64_C( 7.85) },
{ SIMDE_FLOAT64_C( 3.39), SIMDE_FLOAT64_C( 2564.73) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_expm1_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_expm1_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 8.24), SIMDE_FLOAT32_C( 2.23), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 4.38),
SIMDE_FLOAT32_C( 3.85), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 8.49), SIMDE_FLOAT32_C( 5.32) },
{ SIMDE_FLOAT32_C( 3788.54), SIMDE_FLOAT32_C( 8.30), SIMDE_FLOAT32_C( 44.15), SIMDE_FLOAT32_C( 78.84),
SIMDE_FLOAT32_C( 45.99), SIMDE_FLOAT32_C( 1.01), SIMDE_FLOAT32_C( 4864.86), SIMDE_FLOAT32_C( 203.38) } },
{ { SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 4.59), SIMDE_FLOAT32_C( 9.56), SIMDE_FLOAT32_C( 9.67),
SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( 3.74), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.14) },
{ SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 97.49), SIMDE_FLOAT32_C( 14184.85), SIMDE_FLOAT32_C( 15834.35),
SIMDE_FLOAT32_C( 29.57), SIMDE_FLOAT32_C( 41.10), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.15) } },
{ { SIMDE_FLOAT32_C( 6.62), SIMDE_FLOAT32_C( 4.91), SIMDE_FLOAT32_C( 3.11), SIMDE_FLOAT32_C( 8.04),
SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 9.84), SIMDE_FLOAT32_C( 7.16), SIMDE_FLOAT32_C( 7.09) },
{ SIMDE_FLOAT32_C( 748.95), SIMDE_FLOAT32_C( 134.64), SIMDE_FLOAT32_C( 21.42), SIMDE_FLOAT32_C( 3101.61),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 18768.72), SIMDE_FLOAT32_C( 1285.91), SIMDE_FLOAT32_C( 1198.91) } },
{ { SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 9.95), SIMDE_FLOAT32_C( 7.75),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 3.96), SIMDE_FLOAT32_C( 4.01), SIMDE_FLOAT32_C( 9.02) },
{ SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 5.89), SIMDE_FLOAT32_C( 20951.22), SIMDE_FLOAT32_C( 2320.57),
SIMDE_FLOAT32_C( 1.20), SIMDE_FLOAT32_C( 51.46), SIMDE_FLOAT32_C( 54.15), SIMDE_FLOAT32_C( 8265.78) } },
{ { SIMDE_FLOAT32_C( 6.19), SIMDE_FLOAT32_C( 7.82), SIMDE_FLOAT32_C( 3.40), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 8.52), SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( 5.37), SIMDE_FLOAT32_C( 9.06) },
{ SIMDE_FLOAT32_C( 486.85), SIMDE_FLOAT32_C( 2488.91), SIMDE_FLOAT32_C( 28.96), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 5013.06), SIMDE_FLOAT32_C( 5.62), SIMDE_FLOAT32_C( 213.86), SIMDE_FLOAT32_C( 8603.15) } },
{ { SIMDE_FLOAT32_C( 6.48), SIMDE_FLOAT32_C( 4.92), SIMDE_FLOAT32_C( 8.72), SIMDE_FLOAT32_C( 9.90),
SIMDE_FLOAT32_C( 8.66), SIMDE_FLOAT32_C( 8.99), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 5.28) },
{ SIMDE_FLOAT32_C( 650.97), SIMDE_FLOAT32_C( 136.00), SIMDE_FLOAT32_C( 6123.18), SIMDE_FLOAT32_C( 19929.36),
SIMDE_FLOAT32_C( 5766.53), SIMDE_FLOAT32_C( 8021.46), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 195.37) } },
{ { SIMDE_FLOAT32_C( 3.90), SIMDE_FLOAT32_C( 3.15), SIMDE_FLOAT32_C( 3.32), SIMDE_FLOAT32_C( 4.49),
SIMDE_FLOAT32_C( 2.99), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 3.12) },
{ SIMDE_FLOAT32_C( 48.40), SIMDE_FLOAT32_C( 22.34), SIMDE_FLOAT32_C( 26.66), SIMDE_FLOAT32_C( 88.12),
SIMDE_FLOAT32_C( 18.89), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 21.65) } },
{ { SIMDE_FLOAT32_C( 2.41), SIMDE_FLOAT32_C( 1.52), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 3.20),
SIMDE_FLOAT32_C( 5.48), SIMDE_FLOAT32_C( 4.88), SIMDE_FLOAT32_C( 2.22), SIMDE_FLOAT32_C( 1.67) },
{ SIMDE_FLOAT32_C( 10.13), SIMDE_FLOAT32_C( 3.57), SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( 23.53),
SIMDE_FLOAT32_C( 238.85), SIMDE_FLOAT32_C( 130.63), SIMDE_FLOAT32_C( 8.21), SIMDE_FLOAT32_C( 4.31) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_expm1_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_expm1_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 6.68), SIMDE_FLOAT64_C( 7.67), SIMDE_FLOAT64_C( 2.13), SIMDE_FLOAT64_C( 3.50) },
{ SIMDE_FLOAT64_C( 795.32), SIMDE_FLOAT64_C( 2142.08), SIMDE_FLOAT64_C( 7.41), SIMDE_FLOAT64_C( 32.12) } },
{ { SIMDE_FLOAT64_C( 4.83), SIMDE_FLOAT64_C( 1.25), SIMDE_FLOAT64_C( 4.74), SIMDE_FLOAT64_C( 8.00) },
{ SIMDE_FLOAT64_C( 124.21), SIMDE_FLOAT64_C( 2.49), SIMDE_FLOAT64_C( 113.43), SIMDE_FLOAT64_C( 2979.96) } },
{ { SIMDE_FLOAT64_C( 9.68), SIMDE_FLOAT64_C( 1.62), SIMDE_FLOAT64_C( 7.69), SIMDE_FLOAT64_C( 7.36) },
{ SIMDE_FLOAT64_C( 15993.50), SIMDE_FLOAT64_C( 4.05), SIMDE_FLOAT64_C( 2185.37), SIMDE_FLOAT64_C( 1570.84) } },
{ { SIMDE_FLOAT64_C( 8.87), SIMDE_FLOAT64_C( 3.50), SIMDE_FLOAT64_C( 7.63), SIMDE_FLOAT64_C( 8.66) },
{ SIMDE_FLOAT64_C( 7114.28), SIMDE_FLOAT64_C( 32.12), SIMDE_FLOAT64_C( 2058.05), SIMDE_FLOAT64_C( 5766.53) } },
{ { SIMDE_FLOAT64_C( 5.89), SIMDE_FLOAT64_C( 2.15), SIMDE_FLOAT64_C( 8.77), SIMDE_FLOAT64_C( 4.86) },
{ SIMDE_FLOAT64_C( 360.41), SIMDE_FLOAT64_C( 7.58), SIMDE_FLOAT64_C( 6437.17), SIMDE_FLOAT64_C( 128.02) } },
{ { SIMDE_FLOAT64_C( 2.27), SIMDE_FLOAT64_C( 7.65), SIMDE_FLOAT64_C( 5.22), SIMDE_FLOAT64_C( 9.35) },
{ SIMDE_FLOAT64_C( 8.68), SIMDE_FLOAT64_C( 2099.65), SIMDE_FLOAT64_C( 183.93), SIMDE_FLOAT64_C( 11497.82) } },
{ { SIMDE_FLOAT64_C( 3.29), SIMDE_FLOAT64_C( 3.19), SIMDE_FLOAT64_C( 2.91), SIMDE_FLOAT64_C( 3.13) },
{ SIMDE_FLOAT64_C( 25.84), SIMDE_FLOAT64_C( 23.29), SIMDE_FLOAT64_C( 17.36), SIMDE_FLOAT64_C( 21.87) } },
{ { SIMDE_FLOAT64_C( 5.79), SIMDE_FLOAT64_C( 1.89), SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 2.47) },
{ SIMDE_FLOAT64_C( 326.01), SIMDE_FLOAT64_C( 5.62), SIMDE_FLOAT64_C( 1.66), SIMDE_FLOAT64_C( 10.82) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_expm1_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_expm1_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 2.06), SIMDE_FLOAT32_C( 8.37), SIMDE_FLOAT32_C( 4.10), SIMDE_FLOAT32_C( 0.86),
SIMDE_FLOAT32_C( 5.30), SIMDE_FLOAT32_C( 6.13), SIMDE_FLOAT32_C( 1.82), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( 1.44), SIMDE_FLOAT32_C( 6.24), SIMDE_FLOAT32_C( 8.36), SIMDE_FLOAT32_C( 5.93),
SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 8.82), SIMDE_FLOAT32_C( 8.89), SIMDE_FLOAT32_C( 5.58) },
{ SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 4314.64), SIMDE_FLOAT32_C( 59.34), SIMDE_FLOAT32_C( 1.36),
SIMDE_FLOAT32_C( 199.34), SIMDE_FLOAT32_C( 458.44), SIMDE_FLOAT32_C( 5.17), SIMDE_FLOAT32_C( 0.46),
SIMDE_FLOAT32_C( 3.22), SIMDE_FLOAT32_C( 511.86), SIMDE_FLOAT32_C( 4271.69), SIMDE_FLOAT32_C( 375.15),
SIMDE_FLOAT32_C( 9.18), SIMDE_FLOAT32_C( 6767.26), SIMDE_FLOAT32_C( 7258.02), SIMDE_FLOAT32_C( 264.07) } },
{ { SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 7.28), SIMDE_FLOAT32_C( 6.53), SIMDE_FLOAT32_C( 8.60),
SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 1.74), SIMDE_FLOAT32_C( 4.08), SIMDE_FLOAT32_C( 7.80),
SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 7.90), SIMDE_FLOAT32_C( 9.34), SIMDE_FLOAT32_C( 7.60),
SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 5.46), SIMDE_FLOAT32_C( 8.74), SIMDE_FLOAT32_C( 6.01) },
{ SIMDE_FLOAT32_C( 6.10), SIMDE_FLOAT32_C( 1449.99), SIMDE_FLOAT32_C( 684.40), SIMDE_FLOAT32_C( 5430.66),
SIMDE_FLOAT32_C( 2.86), SIMDE_FLOAT32_C( 4.70), SIMDE_FLOAT32_C( 58.15), SIMDE_FLOAT32_C( 2439.60),
SIMDE_FLOAT32_C( 2.82), SIMDE_FLOAT32_C( 2696.28), SIMDE_FLOAT32_C( 11383.41), SIMDE_FLOAT32_C( 1997.20),
SIMDE_FLOAT32_C( 50.94), SIMDE_FLOAT32_C( 234.10), SIMDE_FLOAT32_C( 6246.89), SIMDE_FLOAT32_C( 406.48) } },
{ { SIMDE_FLOAT32_C( 3.83), SIMDE_FLOAT32_C( 2.84), SIMDE_FLOAT32_C( 6.87), SIMDE_FLOAT32_C( 9.14),
SIMDE_FLOAT32_C( 8.97), SIMDE_FLOAT32_C( 8.69), SIMDE_FLOAT32_C( 9.51), SIMDE_FLOAT32_C( 0.41),
SIMDE_FLOAT32_C( 4.93), SIMDE_FLOAT32_C( 7.87), SIMDE_FLOAT32_C( 6.35), SIMDE_FLOAT32_C( 7.25),
SIMDE_FLOAT32_C( 6.69), SIMDE_FLOAT32_C( 5.24), SIMDE_FLOAT32_C( 2.83), SIMDE_FLOAT32_C( 8.65) },
{ SIMDE_FLOAT32_C( 45.06), SIMDE_FLOAT32_C( 16.12), SIMDE_FLOAT32_C( 961.95), SIMDE_FLOAT32_C( 9319.77),
SIMDE_FLOAT32_C( 7862.60), SIMDE_FLOAT32_C( 5942.18), SIMDE_FLOAT32_C( 13493.00), SIMDE_FLOAT32_C( 0.51),
SIMDE_FLOAT32_C( 137.38), SIMDE_FLOAT32_C( 2616.57), SIMDE_FLOAT32_C( 571.49), SIMDE_FLOAT32_C( 1407.10),
SIMDE_FLOAT32_C( 803.32), SIMDE_FLOAT32_C( 187.67), SIMDE_FLOAT32_C( 15.95), SIMDE_FLOAT32_C( 5709.14) } },
{ { SIMDE_FLOAT32_C( 2.51), SIMDE_FLOAT32_C( 9.36), SIMDE_FLOAT32_C( 7.24), SIMDE_FLOAT32_C( 3.86),
SIMDE_FLOAT32_C( 1.10), SIMDE_FLOAT32_C( 1.32), SIMDE_FLOAT32_C( 1.66), SIMDE_FLOAT32_C( 2.44),
SIMDE_FLOAT32_C( 9.22), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 3.17),
SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 8.77), SIMDE_FLOAT32_C( 9.18), SIMDE_FLOAT32_C( 0.30) },
{ SIMDE_FLOAT32_C( 11.30), SIMDE_FLOAT32_C( 11613.38), SIMDE_FLOAT32_C( 1393.09), SIMDE_FLOAT32_C( 46.47),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 2.74), SIMDE_FLOAT32_C( 4.26), SIMDE_FLOAT32_C( 10.47),
SIMDE_FLOAT32_C( 10096.07), SIMDE_FLOAT32_C( 1.72), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 22.81),
SIMDE_FLOAT32_C( 638.06), SIMDE_FLOAT32_C( 6437.18), SIMDE_FLOAT32_C( 9700.16), SIMDE_FLOAT32_C( 0.35) } },
{ { SIMDE_FLOAT32_C( 1.62), SIMDE_FLOAT32_C( 6.06), SIMDE_FLOAT32_C( 9.43), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 4.74), SIMDE_FLOAT32_C( 8.94), SIMDE_FLOAT32_C( 1.01), SIMDE_FLOAT32_C( 9.67),
SIMDE_FLOAT32_C( 6.81), SIMDE_FLOAT32_C( 7.35), SIMDE_FLOAT32_C( 6.92), SIMDE_FLOAT32_C( 3.50),
SIMDE_FLOAT32_C( 2.59), SIMDE_FLOAT32_C( 9.75), SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 5.10) },
{ SIMDE_FLOAT32_C( 4.05), SIMDE_FLOAT32_C( 427.38), SIMDE_FLOAT32_C( 12455.53), SIMDE_FLOAT32_C( 0.80),
SIMDE_FLOAT32_C( 113.43), SIMDE_FLOAT32_C( 7630.19), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 15834.35),
SIMDE_FLOAT32_C( 905.87), SIMDE_FLOAT32_C( 1555.20), SIMDE_FLOAT32_C( 1011.32), SIMDE_FLOAT32_C( 32.12),
SIMDE_FLOAT32_C( 12.33), SIMDE_FLOAT32_C( 17153.23), SIMDE_FLOAT32_C( 7.50), SIMDE_FLOAT32_C( 163.02) } },
{ { SIMDE_FLOAT32_C( 9.11), SIMDE_FLOAT32_C( 9.39), SIMDE_FLOAT32_C( 8.97), SIMDE_FLOAT32_C( 0.21),
SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 2.64), SIMDE_FLOAT32_C( 9.92),
SIMDE_FLOAT32_C( 1.63), SIMDE_FLOAT32_C( 2.67), SIMDE_FLOAT32_C( 3.10), SIMDE_FLOAT32_C( 8.09),
SIMDE_FLOAT32_C( 1.45), SIMDE_FLOAT32_C( 2.28), SIMDE_FLOAT32_C( 8.38), SIMDE_FLOAT32_C( 3.06) },
{ SIMDE_FLOAT32_C( 9044.29), SIMDE_FLOAT32_C( 11967.10), SIMDE_FLOAT32_C( 7862.60), SIMDE_FLOAT32_C( 0.23),
SIMDE_FLOAT32_C( 1.03), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 13.01), SIMDE_FLOAT32_C( 20331.99),
SIMDE_FLOAT32_C( 4.10), SIMDE_FLOAT32_C( 13.44), SIMDE_FLOAT32_C( 21.20), SIMDE_FLOAT32_C( 3260.69),
SIMDE_FLOAT32_C( 3.26), SIMDE_FLOAT32_C( 8.78), SIMDE_FLOAT32_C( 4358.01), SIMDE_FLOAT32_C( 20.33) } },
{ { SIMDE_FLOAT32_C( 8.34), SIMDE_FLOAT32_C( 7.81), SIMDE_FLOAT32_C( 3.65), SIMDE_FLOAT32_C( 3.08),
SIMDE_FLOAT32_C( 6.75), SIMDE_FLOAT32_C( 4.66), SIMDE_FLOAT32_C( 2.75), SIMDE_FLOAT32_C( 3.56),
SIMDE_FLOAT32_C( 2.01), SIMDE_FLOAT32_C( 9.67), SIMDE_FLOAT32_C( 7.06), SIMDE_FLOAT32_C( 4.60),
SIMDE_FLOAT32_C( 9.42), SIMDE_FLOAT32_C( 9.20), SIMDE_FLOAT32_C( 9.71), SIMDE_FLOAT32_C( 8.53) },
{ SIMDE_FLOAT32_C( 4187.09), SIMDE_FLOAT32_C( 2464.13), SIMDE_FLOAT32_C( 37.47), SIMDE_FLOAT32_C( 20.76),
SIMDE_FLOAT32_C( 853.06), SIMDE_FLOAT32_C( 104.64), SIMDE_FLOAT32_C( 14.64), SIMDE_FLOAT32_C( 34.16),
SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 15834.35), SIMDE_FLOAT32_C( 1163.45), SIMDE_FLOAT32_C( 98.48),
SIMDE_FLOAT32_C( 12331.58), SIMDE_FLOAT32_C( 9896.13), SIMDE_FLOAT32_C( 16480.60), SIMDE_FLOAT32_C( 5063.44) } },
{ { SIMDE_FLOAT32_C( 8.59), SIMDE_FLOAT32_C( 8.67), SIMDE_FLOAT32_C( 8.74), SIMDE_FLOAT32_C( 9.29),
SIMDE_FLOAT32_C( 9.30), SIMDE_FLOAT32_C( 1.38), SIMDE_FLOAT32_C( 9.22), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( 4.05), SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 9.01), SIMDE_FLOAT32_C( 5.49),
SIMDE_FLOAT32_C( 4.59), SIMDE_FLOAT32_C( 7.39), SIMDE_FLOAT32_C( 8.56), SIMDE_FLOAT32_C( 2.93) },
{ SIMDE_FLOAT32_C( 5376.61), SIMDE_FLOAT32_C( 5824.50), SIMDE_FLOAT32_C( 6246.89), SIMDE_FLOAT32_C( 10828.18),
SIMDE_FLOAT32_C( 10937.02), SIMDE_FLOAT32_C( 2.97), SIMDE_FLOAT32_C( 10096.07), SIMDE_FLOAT32_C( 1.53),
SIMDE_FLOAT32_C( 56.40), SIMDE_FLOAT32_C( 9.07), SIMDE_FLOAT32_C( 8183.52), SIMDE_FLOAT32_C( 241.26),
SIMDE_FLOAT32_C( 97.49), SIMDE_FLOAT32_C( 1618.71), SIMDE_FLOAT32_C( 5217.68), SIMDE_FLOAT32_C( 17.73) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_expm1_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_expm1_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 9.71), SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 8.23), SIMDE_FLOAT32_C( 6.49), SIMDE_FLOAT32_C( 5.78),
SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 3.67), SIMDE_FLOAT32_C( 6.70), SIMDE_FLOAT32_C( 7.95),
SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 7.57), SIMDE_FLOAT32_C( 6.39), SIMDE_FLOAT32_C( 7.96) },
UINT8_C(170),
{ SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 4.36), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 5.69),
SIMDE_FLOAT32_C( 4.94), SIMDE_FLOAT32_C( 9.39), SIMDE_FLOAT32_C( 3.52), SIMDE_FLOAT32_C( 4.73),
SIMDE_FLOAT32_C( 2.21), SIMDE_FLOAT32_C( 5.91), SIMDE_FLOAT32_C( 7.57), SIMDE_FLOAT32_C( 1.54),
SIMDE_FLOAT32_C( 8.30), SIMDE_FLOAT32_C( 3.13), SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( 1.72) },
{ SIMDE_FLOAT32_C( 9.71), SIMDE_FLOAT32_C( 77.26), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 294.89),
SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 11967.10), SIMDE_FLOAT32_C( 6.49), SIMDE_FLOAT32_C( 112.30),
SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 3.67), SIMDE_FLOAT32_C( 6.70), SIMDE_FLOAT32_C( 7.95),
SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 7.57), SIMDE_FLOAT32_C( 6.39), SIMDE_FLOAT32_C( 7.96) } },
{ { SIMDE_FLOAT32_C( 3.18), SIMDE_FLOAT32_C( 2.06), SIMDE_FLOAT32_C( 1.94), SIMDE_FLOAT32_C( 1.40),
SIMDE_FLOAT32_C( 8.55), SIMDE_FLOAT32_C( 7.72), SIMDE_FLOAT32_C( 4.74), SIMDE_FLOAT32_C( 2.22),
SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 2.69), SIMDE_FLOAT32_C( 3.40), SIMDE_FLOAT32_C( 1.99),
SIMDE_FLOAT32_C( 9.08), SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( 1.07) },
UINT8_C( 91),
{ SIMDE_FLOAT32_C( 2.44), SIMDE_FLOAT32_C( 6.76), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 1.84),
SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 5.39), SIMDE_FLOAT32_C( 4.05), SIMDE_FLOAT32_C( 6.19),
SIMDE_FLOAT32_C( 2.97), SIMDE_FLOAT32_C( 5.59), SIMDE_FLOAT32_C( 4.49), SIMDE_FLOAT32_C( 6.09),
SIMDE_FLOAT32_C( 6.84), SIMDE_FLOAT32_C( 6.20), SIMDE_FLOAT32_C( 9.27), SIMDE_FLOAT32_C( 8.90) },
{ SIMDE_FLOAT32_C( 10.47), SIMDE_FLOAT32_C( 861.64), SIMDE_FLOAT32_C( 1.94), SIMDE_FLOAT32_C( 5.30),
SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 7.72), SIMDE_FLOAT32_C( 56.40), SIMDE_FLOAT32_C( 2.22),
SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 2.69), SIMDE_FLOAT32_C( 3.40), SIMDE_FLOAT32_C( 1.99),
SIMDE_FLOAT32_C( 9.08), SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( 1.07) } },
{ { SIMDE_FLOAT32_C( 8.15), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 7.45), SIMDE_FLOAT32_C( 5.87),
SIMDE_FLOAT32_C( 5.41), SIMDE_FLOAT32_C( 9.67), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 8.10),
SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 2.28), SIMDE_FLOAT32_C( 7.18), SIMDE_FLOAT32_C( 4.44),
SIMDE_FLOAT32_C( 3.39), SIMDE_FLOAT32_C( 8.25), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 5.83) },
UINT8_C( 10),
{ SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 7.67), SIMDE_FLOAT32_C( 5.29), SIMDE_FLOAT32_C( 6.22),
SIMDE_FLOAT32_C( 1.72), SIMDE_FLOAT32_C( 1.47), SIMDE_FLOAT32_C( 9.19), SIMDE_FLOAT32_C( 7.31),
SIMDE_FLOAT32_C( 5.96), SIMDE_FLOAT32_C( 5.28), SIMDE_FLOAT32_C( 4.15), SIMDE_FLOAT32_C( 2.16),
SIMDE_FLOAT32_C( 4.55), SIMDE_FLOAT32_C( 3.05), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 5.23) },
{ SIMDE_FLOAT32_C( 8.15), SIMDE_FLOAT32_C( 2142.08), SIMDE_FLOAT32_C( 7.45), SIMDE_FLOAT32_C( 501.70),
SIMDE_FLOAT32_C( 5.41), SIMDE_FLOAT32_C( 9.67), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 8.10),
SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 2.28), SIMDE_FLOAT32_C( 7.18), SIMDE_FLOAT32_C( 4.44),
SIMDE_FLOAT32_C( 3.39), SIMDE_FLOAT32_C( 8.25), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 5.83) } },
{ { SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 6.17), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.17),
SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 8.74), SIMDE_FLOAT32_C( 3.24), SIMDE_FLOAT32_C( 8.75),
SIMDE_FLOAT32_C( 5.92), SIMDE_FLOAT32_C( 7.68), SIMDE_FLOAT32_C( 2.13), SIMDE_FLOAT32_C( 4.17),
SIMDE_FLOAT32_C( 7.84), SIMDE_FLOAT32_C( 7.97), SIMDE_FLOAT32_C( 9.18), SIMDE_FLOAT32_C( 8.67) },
UINT8_C(236),
{ SIMDE_FLOAT32_C( 4.47), SIMDE_FLOAT32_C( 4.89), SIMDE_FLOAT32_C( 7.36), SIMDE_FLOAT32_C( 5.94),
SIMDE_FLOAT32_C( 4.08), SIMDE_FLOAT32_C( 4.67), SIMDE_FLOAT32_C( 1.90), SIMDE_FLOAT32_C( 9.36),
SIMDE_FLOAT32_C( 8.82), SIMDE_FLOAT32_C( 4.06), SIMDE_FLOAT32_C( 3.91), SIMDE_FLOAT32_C( 1.86),
SIMDE_FLOAT32_C( 4.37), SIMDE_FLOAT32_C( 9.13), SIMDE_FLOAT32_C( 2.36), SIMDE_FLOAT32_C( 0.55) },
{ SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 6.17), SIMDE_FLOAT32_C( 1570.84), SIMDE_FLOAT32_C( 378.93),
SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 105.70), SIMDE_FLOAT32_C( 5.69), SIMDE_FLOAT32_C( 11613.38),
SIMDE_FLOAT32_C( 5.92), SIMDE_FLOAT32_C( 7.68), SIMDE_FLOAT32_C( 2.13), SIMDE_FLOAT32_C( 4.17),
SIMDE_FLOAT32_C( 7.84), SIMDE_FLOAT32_C( 7.97), SIMDE_FLOAT32_C( 9.18), SIMDE_FLOAT32_C( 8.67) } },
{ { SIMDE_FLOAT32_C( 9.77), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 7.01), SIMDE_FLOAT32_C( 8.51),
SIMDE_FLOAT32_C( 5.77), SIMDE_FLOAT32_C( 5.76), SIMDE_FLOAT32_C( 4.43), SIMDE_FLOAT32_C( 3.45),
SIMDE_FLOAT32_C( 7.89), SIMDE_FLOAT32_C( 8.61), SIMDE_FLOAT32_C( 1.29), SIMDE_FLOAT32_C( 5.86),
SIMDE_FLOAT32_C( 7.79), SIMDE_FLOAT32_C( 9.96), SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 2.25) },
UINT8_C( 16),
{ SIMDE_FLOAT32_C( 8.86), SIMDE_FLOAT32_C( 8.20), SIMDE_FLOAT32_C( 8.92), SIMDE_FLOAT32_C( 3.53),
SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 8.28), SIMDE_FLOAT32_C( 2.35), SIMDE_FLOAT32_C( 4.16),
SIMDE_FLOAT32_C( 2.18), SIMDE_FLOAT32_C( 4.21), SIMDE_FLOAT32_C( 8.53), SIMDE_FLOAT32_C( 1.32),
SIMDE_FLOAT32_C( 6.57), SIMDE_FLOAT32_C( 9.08), SIMDE_FLOAT32_C( 1.09), SIMDE_FLOAT32_C( 9.10) },
{ SIMDE_FLOAT32_C( 9.77), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 7.01), SIMDE_FLOAT32_C( 8.51),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 5.76), SIMDE_FLOAT32_C( 4.43), SIMDE_FLOAT32_C( 3.45),
SIMDE_FLOAT32_C( 7.89), SIMDE_FLOAT32_C( 8.61), SIMDE_FLOAT32_C( 1.29), SIMDE_FLOAT32_C( 5.86),
SIMDE_FLOAT32_C( 7.79), SIMDE_FLOAT32_C( 9.96), SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 2.25) } },
{ { SIMDE_FLOAT32_C( 6.09), SIMDE_FLOAT32_C( 9.60), SIMDE_FLOAT32_C( 4.88), SIMDE_FLOAT32_C( 1.85),
SIMDE_FLOAT32_C( 4.03), SIMDE_FLOAT32_C( 8.33), SIMDE_FLOAT32_C( 9.74), SIMDE_FLOAT32_C( 2.64),
SIMDE_FLOAT32_C( 9.62), SIMDE_FLOAT32_C( 5.60), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 9.57),
SIMDE_FLOAT32_C( 7.10), SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 5.97) },
UINT8_C( 45),
{ SIMDE_FLOAT32_C( 3.34), SIMDE_FLOAT32_C( 9.50), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 1.61),
SIMDE_FLOAT32_C( 1.85), SIMDE_FLOAT32_C( 5.13), SIMDE_FLOAT32_C( 3.80), SIMDE_FLOAT32_C( 6.06),
SIMDE_FLOAT32_C( 3.66), SIMDE_FLOAT32_C( 5.11), SIMDE_FLOAT32_C( 2.63), SIMDE_FLOAT32_C( 2.74),
SIMDE_FLOAT32_C( 6.20), SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 8.83), SIMDE_FLOAT32_C( 5.80) },
{ SIMDE_FLOAT32_C( 27.22), SIMDE_FLOAT32_C( 9.60), SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 4.00),
SIMDE_FLOAT32_C( 4.03), SIMDE_FLOAT32_C( 168.02), SIMDE_FLOAT32_C( 9.74), SIMDE_FLOAT32_C( 2.64),
SIMDE_FLOAT32_C( 9.62), SIMDE_FLOAT32_C( 5.60), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 9.57),
SIMDE_FLOAT32_C( 7.10), SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 5.97) } },
{ { SIMDE_FLOAT32_C( 6.61), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 9.83), SIMDE_FLOAT32_C( 4.94),
SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 2.47), SIMDE_FLOAT32_C( 4.55), SIMDE_FLOAT32_C( 6.02),
SIMDE_FLOAT32_C( 2.90), SIMDE_FLOAT32_C( 4.12), SIMDE_FLOAT32_C( 3.12), SIMDE_FLOAT32_C( 5.58),
SIMDE_FLOAT32_C( 8.54), SIMDE_FLOAT32_C( 9.08), SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 1.88) },
UINT8_C(126),
{ SIMDE_FLOAT32_C( 7.43), SIMDE_FLOAT32_C( 3.49), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 2.56),
SIMDE_FLOAT32_C( 7.28), SIMDE_FLOAT32_C( 6.48), SIMDE_FLOAT32_C( 6.22), SIMDE_FLOAT32_C( 2.40),
SIMDE_FLOAT32_C( 9.11), SIMDE_FLOAT32_C( 8.96), SIMDE_FLOAT32_C( 8.60), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( 7.79), SIMDE_FLOAT32_C( 4.40), SIMDE_FLOAT32_C( 7.45), SIMDE_FLOAT32_C( 8.47) },
{ SIMDE_FLOAT32_C( 6.61), SIMDE_FLOAT32_C( 31.79), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 11.94),
SIMDE_FLOAT32_C( 1449.99), SIMDE_FLOAT32_C( 650.97), SIMDE_FLOAT32_C( 501.70), SIMDE_FLOAT32_C( 6.02),
SIMDE_FLOAT32_C( 2.90), SIMDE_FLOAT32_C( 4.12), SIMDE_FLOAT32_C( 3.12), SIMDE_FLOAT32_C( 5.58),
SIMDE_FLOAT32_C( 8.54), SIMDE_FLOAT32_C( 9.08), SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 1.88) } },
{ { SIMDE_FLOAT32_C( 4.23), SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( 8.88), SIMDE_FLOAT32_C( 6.71),
SIMDE_FLOAT32_C( 6.94), SIMDE_FLOAT32_C( 4.90), SIMDE_FLOAT32_C( 9.61), SIMDE_FLOAT32_C( 1.06),
SIMDE_FLOAT32_C( 8.02), SIMDE_FLOAT32_C( 5.19), SIMDE_FLOAT32_C( 9.60), SIMDE_FLOAT32_C( 7.10),
SIMDE_FLOAT32_C( 1.65), SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( 5.68), SIMDE_FLOAT32_C( 9.08) },
UINT8_C( 53),
{ SIMDE_FLOAT32_C( 6.11), SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 2.25), SIMDE_FLOAT32_C( 2.59),
SIMDE_FLOAT32_C( 7.86), SIMDE_FLOAT32_C( 4.65), SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 6.82),
SIMDE_FLOAT32_C( 3.25), SIMDE_FLOAT32_C( 2.55), SIMDE_FLOAT32_C( 4.61), SIMDE_FLOAT32_C( 7.65),
SIMDE_FLOAT32_C( 10.00), SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 2.39) },
{ SIMDE_FLOAT32_C( 449.34), SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( 8.49), SIMDE_FLOAT32_C( 6.71),
SIMDE_FLOAT32_C( 2590.52), SIMDE_FLOAT32_C( 103.58), SIMDE_FLOAT32_C( 9.61), SIMDE_FLOAT32_C( 1.06),
SIMDE_FLOAT32_C( 8.02), SIMDE_FLOAT32_C( 5.19), SIMDE_FLOAT32_C( 9.60), SIMDE_FLOAT32_C( 7.10),
SIMDE_FLOAT32_C( 1.65), SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( 5.68), SIMDE_FLOAT32_C( 9.08) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_expm1_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_expm1_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 1.40), SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 7.27),
SIMDE_FLOAT64_C( 9.13), SIMDE_FLOAT64_C( 1.31), SIMDE_FLOAT64_C( 2.56), SIMDE_FLOAT64_C( 1.21) },
{ SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 3.06), SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( 1435.55),
SIMDE_FLOAT64_C( 9227.02), SIMDE_FLOAT64_C( 2.71), SIMDE_FLOAT64_C( 11.94), SIMDE_FLOAT64_C( 2.35) } },
{ { SIMDE_FLOAT64_C( 6.72), SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 3.99), SIMDE_FLOAT64_C( 2.10),
SIMDE_FLOAT64_C( 2.80), SIMDE_FLOAT64_C( 5.43), SIMDE_FLOAT64_C( 3.71), SIMDE_FLOAT64_C( 6.65) },
{ SIMDE_FLOAT64_C( 827.82), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 53.05), SIMDE_FLOAT64_C( 7.17),
SIMDE_FLOAT64_C( 15.44), SIMDE_FLOAT64_C( 227.15), SIMDE_FLOAT64_C( 39.85), SIMDE_FLOAT64_C( 771.78) } },
{ { SIMDE_FLOAT64_C( 3.81), SIMDE_FLOAT64_C( 4.42), SIMDE_FLOAT64_C( 8.46), SIMDE_FLOAT64_C( 3.88),
SIMDE_FLOAT64_C( 7.48), SIMDE_FLOAT64_C( 9.11), SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( 5.94) },
{ SIMDE_FLOAT64_C( 44.15), SIMDE_FLOAT64_C( 82.10), SIMDE_FLOAT64_C( 4721.06), SIMDE_FLOAT64_C( 47.42),
SIMDE_FLOAT64_C( 1771.24), SIMDE_FLOAT64_C( 9044.29), SIMDE_FLOAT64_C( 0.80), SIMDE_FLOAT64_C( 378.93) } },
{ { SIMDE_FLOAT64_C( 7.31), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 9.76), SIMDE_FLOAT64_C( 8.87),
SIMDE_FLOAT64_C( 7.78), SIMDE_FLOAT64_C( 3.26), SIMDE_FLOAT64_C( 6.27), SIMDE_FLOAT64_C( 8.12) },
{ SIMDE_FLOAT64_C( 1494.18), SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( 17325.63), SIMDE_FLOAT64_C( 7114.28),
SIMDE_FLOAT64_C( 2391.27), SIMDE_FLOAT64_C( 25.05), SIMDE_FLOAT64_C( 527.48), SIMDE_FLOAT64_C( 3360.02) } },
{ { SIMDE_FLOAT64_C( 4.67), SIMDE_FLOAT64_C( 6.67), SIMDE_FLOAT64_C( 5.39), SIMDE_FLOAT64_C( 3.79),
SIMDE_FLOAT64_C( 7.97), SIMDE_FLOAT64_C( 7.95), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 4.69) },
{ SIMDE_FLOAT64_C( 105.70), SIMDE_FLOAT64_C( 787.40), SIMDE_FLOAT64_C( 218.20), SIMDE_FLOAT64_C( 43.26),
SIMDE_FLOAT64_C( 2891.86), SIMDE_FLOAT64_C( 2834.57), SIMDE_FLOAT64_C( 147.41), SIMDE_FLOAT64_C( 107.85) } },
{ { SIMDE_FLOAT64_C( 8.47), SIMDE_FLOAT64_C( 9.00), SIMDE_FLOAT64_C( 6.79), SIMDE_FLOAT64_C( 1.27),
SIMDE_FLOAT64_C( 4.42), SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( 7.92), SIMDE_FLOAT64_C( 8.23) },
{ SIMDE_FLOAT64_C( 4768.52), SIMDE_FLOAT64_C( 8102.08), SIMDE_FLOAT64_C( 887.91), SIMDE_FLOAT64_C( 2.56),
SIMDE_FLOAT64_C( 82.10), SIMDE_FLOAT64_C( 0.63), SIMDE_FLOAT64_C( 2750.77), SIMDE_FLOAT64_C( 3750.83) } },
{ { SIMDE_FLOAT64_C( 4.92), SIMDE_FLOAT64_C( 6.38), SIMDE_FLOAT64_C( 2.12), SIMDE_FLOAT64_C( 2.40),
SIMDE_FLOAT64_C( 5.49), SIMDE_FLOAT64_C( 2.70), SIMDE_FLOAT64_C( 8.35), SIMDE_FLOAT64_C( 2.80) },
{ SIMDE_FLOAT64_C( 136.00), SIMDE_FLOAT64_C( 588.93), SIMDE_FLOAT64_C( 7.33), SIMDE_FLOAT64_C( 10.02),
SIMDE_FLOAT64_C( 241.26), SIMDE_FLOAT64_C( 13.88), SIMDE_FLOAT64_C( 4229.18), SIMDE_FLOAT64_C( 15.44) } },
{ { SIMDE_FLOAT64_C( 3.27), SIMDE_FLOAT64_C( 8.10), SIMDE_FLOAT64_C( 1.67), SIMDE_FLOAT64_C( 1.04),
SIMDE_FLOAT64_C( 1.36), SIMDE_FLOAT64_C( 7.94), SIMDE_FLOAT64_C( 9.16), SIMDE_FLOAT64_C( 6.03) },
{ SIMDE_FLOAT64_C( 25.31), SIMDE_FLOAT64_C( 3293.47), SIMDE_FLOAT64_C( 4.31), SIMDE_FLOAT64_C( 1.83),
SIMDE_FLOAT64_C( 2.90), SIMDE_FLOAT64_C( 2806.36), SIMDE_FLOAT64_C( 9508.06), SIMDE_FLOAT64_C( 414.72) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_expm1_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_expm1_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 4.06), SIMDE_FLOAT64_C( 9.24), SIMDE_FLOAT64_C( 8.55), SIMDE_FLOAT64_C( 9.59),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( 2.26), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 4.06) },
UINT8_C(110),
{ SIMDE_FLOAT64_C( 5.74), SIMDE_FLOAT64_C( 3.51), SIMDE_FLOAT64_C( 5.07), SIMDE_FLOAT64_C( 6.58),
SIMDE_FLOAT64_C( 8.73), SIMDE_FLOAT64_C( 4.57), SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 7.96) },
{ SIMDE_FLOAT64_C( 4.06), SIMDE_FLOAT64_C( 32.45), SIMDE_FLOAT64_C( 158.17), SIMDE_FLOAT64_C( 719.54),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( 95.54), SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 4.06) } },
{ { SIMDE_FLOAT64_C( 1.32), SIMDE_FLOAT64_C( 1.91), SIMDE_FLOAT64_C( 7.33), SIMDE_FLOAT64_C( 4.66),
SIMDE_FLOAT64_C( 3.27), SIMDE_FLOAT64_C( 4.31), SIMDE_FLOAT64_C( 0.71), SIMDE_FLOAT64_C( 9.20) },
UINT8_C(124),
{ SIMDE_FLOAT64_C( 5.28), SIMDE_FLOAT64_C( 1.80), SIMDE_FLOAT64_C( 4.85), SIMDE_FLOAT64_C( 0.49),
SIMDE_FLOAT64_C( 1.99), SIMDE_FLOAT64_C( 8.91), SIMDE_FLOAT64_C( 9.72), SIMDE_FLOAT64_C( 0.53) },
{ SIMDE_FLOAT64_C( 1.32), SIMDE_FLOAT64_C( 1.91), SIMDE_FLOAT64_C( 126.74), SIMDE_FLOAT64_C( 0.63),
SIMDE_FLOAT64_C( 6.32), SIMDE_FLOAT64_C( 7404.66), SIMDE_FLOAT64_C( 16646.24), SIMDE_FLOAT64_C( 9.20) } },
{ { SIMDE_FLOAT64_C( 8.50), SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 2.80), SIMDE_FLOAT64_C( 9.06),
SIMDE_FLOAT64_C( 4.48), SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( 4.80), SIMDE_FLOAT64_C( 7.99) },
UINT8_C( 51),
{ SIMDE_FLOAT64_C( 1.38), SIMDE_FLOAT64_C( 6.72), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 1.85),
SIMDE_FLOAT64_C( 4.68), SIMDE_FLOAT64_C( 1.54), SIMDE_FLOAT64_C( 3.76), SIMDE_FLOAT64_C( 2.01) },
{ SIMDE_FLOAT64_C( 2.97), SIMDE_FLOAT64_C( 827.82), SIMDE_FLOAT64_C( 2.80), SIMDE_FLOAT64_C( 9.06),
SIMDE_FLOAT64_C( 106.77), SIMDE_FLOAT64_C( 3.66), SIMDE_FLOAT64_C( 4.80), SIMDE_FLOAT64_C( 7.99) } },
{ { SIMDE_FLOAT64_C( 6.20), SIMDE_FLOAT64_C( 7.03), SIMDE_FLOAT64_C( 6.32), SIMDE_FLOAT64_C( 6.91),
SIMDE_FLOAT64_C( 6.23), SIMDE_FLOAT64_C( 3.88), SIMDE_FLOAT64_C( 2.18), SIMDE_FLOAT64_C( 8.02) },
UINT8_C(179),
{ SIMDE_FLOAT64_C( 2.67), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 7.63), SIMDE_FLOAT64_C( 2.40),
SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( 6.13), SIMDE_FLOAT64_C( 2.81), SIMDE_FLOAT64_C( 3.34) },
{ SIMDE_FLOAT64_C( 13.44), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 6.32), SIMDE_FLOAT64_C( 6.91),
SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( 458.44), SIMDE_FLOAT64_C( 2.18), SIMDE_FLOAT64_C( 27.22) } },
{ { SIMDE_FLOAT64_C( 5.19), SIMDE_FLOAT64_C( 7.29), SIMDE_FLOAT64_C( 3.93), SIMDE_FLOAT64_C( 10.00),
SIMDE_FLOAT64_C( 5.28), SIMDE_FLOAT64_C( 9.58), SIMDE_FLOAT64_C( 1.38), SIMDE_FLOAT64_C( 2.00) },
UINT8_C(216),
{ SIMDE_FLOAT64_C( 3.23), SIMDE_FLOAT64_C( 6.68), SIMDE_FLOAT64_C( 1.35), SIMDE_FLOAT64_C( 6.99),
SIMDE_FLOAT64_C( 8.69), SIMDE_FLOAT64_C( 7.55), SIMDE_FLOAT64_C( 4.02), SIMDE_FLOAT64_C( 5.01) },
{ SIMDE_FLOAT64_C( 5.19), SIMDE_FLOAT64_C( 7.29), SIMDE_FLOAT64_C( 3.93), SIMDE_FLOAT64_C( 1084.72),
SIMDE_FLOAT64_C( 5942.18), SIMDE_FLOAT64_C( 9.58), SIMDE_FLOAT64_C( 54.70), SIMDE_FLOAT64_C( 148.90) } },
{ { SIMDE_FLOAT64_C( 4.45), SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 8.89), SIMDE_FLOAT64_C( 6.64),
SIMDE_FLOAT64_C( 8.27), SIMDE_FLOAT64_C( 7.61), SIMDE_FLOAT64_C( 9.31), SIMDE_FLOAT64_C( 8.28) },
UINT8_C(185),
{ SIMDE_FLOAT64_C( 1.71), SIMDE_FLOAT64_C( 8.83), SIMDE_FLOAT64_C( 1.38), SIMDE_FLOAT64_C( 4.52),
SIMDE_FLOAT64_C( 2.17), SIMDE_FLOAT64_C( 6.57), SIMDE_FLOAT64_C( 1.81), SIMDE_FLOAT64_C( 6.09) },
{ SIMDE_FLOAT64_C( 4.53), SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 8.89), SIMDE_FLOAT64_C( 90.84),
SIMDE_FLOAT64_C( 7.76), SIMDE_FLOAT64_C( 712.37), SIMDE_FLOAT64_C( 9.31), SIMDE_FLOAT64_C( 440.42) } },
{ { SIMDE_FLOAT64_C( 6.57), SIMDE_FLOAT64_C( 7.09), SIMDE_FLOAT64_C( 5.68), SIMDE_FLOAT64_C( 7.95),
SIMDE_FLOAT64_C( 9.09), SIMDE_FLOAT64_C( 5.48), SIMDE_FLOAT64_C( 1.18), SIMDE_FLOAT64_C( 5.77) },
UINT8_C(171),
{ SIMDE_FLOAT64_C( 8.17), SIMDE_FLOAT64_C( 4.46), SIMDE_FLOAT64_C( 4.37), SIMDE_FLOAT64_C( 2.19),
SIMDE_FLOAT64_C( 9.47), SIMDE_FLOAT64_C( 8.83), SIMDE_FLOAT64_C( 2.44), SIMDE_FLOAT64_C( 8.36) },
{ SIMDE_FLOAT64_C( 3532.34), SIMDE_FLOAT64_C( 85.49), SIMDE_FLOAT64_C( 5.68), SIMDE_FLOAT64_C( 7.94),
SIMDE_FLOAT64_C( 9.09), SIMDE_FLOAT64_C( 6835.29), SIMDE_FLOAT64_C( 1.18), SIMDE_FLOAT64_C( 4271.69) } },
{ { SIMDE_FLOAT64_C( 5.47), SIMDE_FLOAT64_C( 0.71), SIMDE_FLOAT64_C( 5.97), SIMDE_FLOAT64_C( 4.78),
SIMDE_FLOAT64_C( 9.00), SIMDE_FLOAT64_C( 1.22), SIMDE_FLOAT64_C( 6.48), SIMDE_FLOAT64_C( 7.82) },
UINT8_C(171),
{ SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 9.99), SIMDE_FLOAT64_C( 9.17), SIMDE_FLOAT64_C( 2.81),
SIMDE_FLOAT64_C( 6.09), SIMDE_FLOAT64_C( 5.74), SIMDE_FLOAT64_C( 9.89), SIMDE_FLOAT64_C( 1.76) },
{ SIMDE_FLOAT64_C( 1.72), SIMDE_FLOAT64_C( 21806.30), SIMDE_FLOAT64_C( 5.97), SIMDE_FLOAT64_C( 15.61),
SIMDE_FLOAT64_C( 9.00), SIMDE_FLOAT64_C( 310.06), SIMDE_FLOAT64_C( 6.48), SIMDE_FLOAT64_C( 4.81) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_expm1_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_exp2_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -2.08), SIMDE_FLOAT32_C( 1.71), SIMDE_FLOAT32_C( 2.58), SIMDE_FLOAT32_C( -1.10) },
{ SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 3.27), SIMDE_FLOAT32_C( 5.98), SIMDE_FLOAT32_C( 0.47) } },
{ { SIMDE_FLOAT32_C( 1.65), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( -2.92), SIMDE_FLOAT32_C( -3.15) },
{ SIMDE_FLOAT32_C( 3.14), SIMDE_FLOAT32_C( 1.21), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.11) } },
{ { SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -3.95), SIMDE_FLOAT32_C( -1.01) },
{ SIMDE_FLOAT32_C( 4.11), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.50) } },
{ { SIMDE_FLOAT32_C( -2.84), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -3.08), SIMDE_FLOAT32_C( 0.96) },
{ SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 1.95) } },
{ { SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -1.38), SIMDE_FLOAT32_C( -3.16), SIMDE_FLOAT32_C( 0.33) },
{ SIMDE_FLOAT32_C( 1.09), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 1.26) } },
{ { SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( -3.70), SIMDE_FLOAT32_C( -0.75) },
{ SIMDE_FLOAT32_C( 1.21), SIMDE_FLOAT32_C( 2.25), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.59) } },
{ { SIMDE_FLOAT32_C( -1.25), SIMDE_FLOAT32_C( -2.03), SIMDE_FLOAT32_C( -1.41), SIMDE_FLOAT32_C( -1.44) },
{ SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.37) } },
{ { SIMDE_FLOAT32_C( -2.57), SIMDE_FLOAT32_C( -1.64), SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( -0.66) },
{ SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 2.36), SIMDE_FLOAT32_C( 0.63) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_exp2_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_exp2_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -1.05), SIMDE_FLOAT64_C( -3.96) },
{ SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 0.06) } },
{ { SIMDE_FLOAT64_C( -3.17), SIMDE_FLOAT64_C( -0.18) },
{ SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 0.88) } },
{ { SIMDE_FLOAT64_C( 2.75), SIMDE_FLOAT64_C( -3.78) },
{ SIMDE_FLOAT64_C( 6.73), SIMDE_FLOAT64_C( 0.07) } },
{ { SIMDE_FLOAT64_C( -3.43), SIMDE_FLOAT64_C( 0.85) },
{ SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 1.80) } },
{ { SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( 1.23) },
{ SIMDE_FLOAT64_C( 1.29), SIMDE_FLOAT64_C( 2.35) } },
{ { SIMDE_FLOAT64_C( 1.92), SIMDE_FLOAT64_C( -0.38) },
{ SIMDE_FLOAT64_C( 3.78), SIMDE_FLOAT64_C( 0.77) } },
{ { SIMDE_FLOAT64_C( 3.87), SIMDE_FLOAT64_C( 2.98) },
{ SIMDE_FLOAT64_C( 14.62), SIMDE_FLOAT64_C( 7.89) } },
{ { SIMDE_FLOAT64_C( -1.16), SIMDE_FLOAT64_C( 1.76) },
{ SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( 3.39) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_exp2_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_exp2_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 2.88), SIMDE_FLOAT32_C( 3.98), SIMDE_FLOAT32_C( 2.77), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( -1.90), SIMDE_FLOAT32_C( -1.78), SIMDE_FLOAT32_C( -1.91), SIMDE_FLOAT32_C( -1.34) },
{ SIMDE_FLOAT32_C( 7.36), SIMDE_FLOAT32_C( 15.78), SIMDE_FLOAT32_C( 6.82), SIMDE_FLOAT32_C( 1.56),
SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.40) } },
{ { SIMDE_FLOAT32_C( -2.07), SIMDE_FLOAT32_C( -3.29), SIMDE_FLOAT32_C( -3.96), SIMDE_FLOAT32_C( 0.14),
SIMDE_FLOAT32_C( -3.42), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 0.63) },
{ SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 1.10),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 1.37), SIMDE_FLOAT32_C( 1.55) } },
{ { SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( -2.58), SIMDE_FLOAT32_C( 1.40),
SIMDE_FLOAT32_C( -1.19), SIMDE_FLOAT32_C( 2.84), SIMDE_FLOAT32_C( 2.74), SIMDE_FLOAT32_C( -3.03) },
{ SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 1.46), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 2.64),
SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 7.16), SIMDE_FLOAT32_C( 6.68), SIMDE_FLOAT32_C( 0.12) } },
{ { SIMDE_FLOAT32_C( 2.79), SIMDE_FLOAT32_C( -3.44), SIMDE_FLOAT32_C( -3.79), SIMDE_FLOAT32_C( 3.43),
SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( -3.35), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 2.71) },
{ SIMDE_FLOAT32_C( 6.92), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 10.78),
SIMDE_FLOAT32_C( 14.32), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 6.54) } },
{ { SIMDE_FLOAT32_C( -3.37), SIMDE_FLOAT32_C( -2.06), SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( -1.27),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 1.44), SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( 2.10) },
{ SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.41),
SIMDE_FLOAT32_C( 1.13), SIMDE_FLOAT32_C( 2.71), SIMDE_FLOAT32_C( 2.62), SIMDE_FLOAT32_C( 4.29) } },
{ { SIMDE_FLOAT32_C( 2.15), SIMDE_FLOAT32_C( 1.43), SIMDE_FLOAT32_C( -1.76), SIMDE_FLOAT32_C( 2.73),
SIMDE_FLOAT32_C( -2.98), SIMDE_FLOAT32_C( 2.69), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( 0.75) },
{ SIMDE_FLOAT32_C( 4.44), SIMDE_FLOAT32_C( 2.69), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 6.63),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 6.45), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 1.68) } },
{ { SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -1.85), SIMDE_FLOAT32_C( 2.05),
SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -3.11), SIMDE_FLOAT32_C( 3.02), SIMDE_FLOAT32_C( -1.59) },
{ SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 1.72), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 4.14),
SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 8.11), SIMDE_FLOAT32_C( 0.33) } },
{ { SIMDE_FLOAT32_C( -2.54), SIMDE_FLOAT32_C( 3.23), SIMDE_FLOAT32_C( -2.16), SIMDE_FLOAT32_C( -2.71),
SIMDE_FLOAT32_C( 3.88), SIMDE_FLOAT32_C( 1.02), SIMDE_FLOAT32_C( -4.00), SIMDE_FLOAT32_C( -3.49) },
{ SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 9.38), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.15),
SIMDE_FLOAT32_C( 14.72), SIMDE_FLOAT32_C( 2.03), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.09) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_exp2_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_exp2_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 1.66), SIMDE_FLOAT64_C( -0.38), SIMDE_FLOAT64_C( 1.40), SIMDE_FLOAT64_C( 3.84) },
{ SIMDE_FLOAT64_C( 3.16), SIMDE_FLOAT64_C( 0.77), SIMDE_FLOAT64_C( 2.64), SIMDE_FLOAT64_C( 14.32) } },
{ { SIMDE_FLOAT64_C( -2.15), SIMDE_FLOAT64_C( -0.83), SIMDE_FLOAT64_C( -2.32), SIMDE_FLOAT64_C( 1.94) },
{ SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( 3.84) } },
{ { SIMDE_FLOAT64_C( 3.43), SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -0.88), SIMDE_FLOAT64_C( 0.76) },
{ SIMDE_FLOAT64_C( 10.78), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( 1.69) } },
{ { SIMDE_FLOAT64_C( 1.69), SIMDE_FLOAT64_C( 2.74), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( -2.45) },
{ SIMDE_FLOAT64_C( 3.23), SIMDE_FLOAT64_C( 6.68), SIMDE_FLOAT64_C( 1.99), SIMDE_FLOAT64_C( 0.18) } },
{ { SIMDE_FLOAT64_C( 2.22), SIMDE_FLOAT64_C( 1.74), SIMDE_FLOAT64_C( 3.15), SIMDE_FLOAT64_C( 0.54) },
{ SIMDE_FLOAT64_C( 4.66), SIMDE_FLOAT64_C( 3.34), SIMDE_FLOAT64_C( 8.88), SIMDE_FLOAT64_C( 1.45) } },
{ { SIMDE_FLOAT64_C( 1.30), SIMDE_FLOAT64_C( -1.80), SIMDE_FLOAT64_C( 2.76), SIMDE_FLOAT64_C( -4.00) },
{ SIMDE_FLOAT64_C( 2.46), SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( 6.77), SIMDE_FLOAT64_C( 0.06) } },
{ { SIMDE_FLOAT64_C( -2.49), SIMDE_FLOAT64_C( -1.07), SIMDE_FLOAT64_C( 1.81), SIMDE_FLOAT64_C( 0.86) },
{ SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 3.51), SIMDE_FLOAT64_C( 1.82) } },
{ { SIMDE_FLOAT64_C( -2.31), SIMDE_FLOAT64_C( -2.25), SIMDE_FLOAT64_C( 2.43), SIMDE_FLOAT64_C( 3.36) },
{ SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( 0.21), SIMDE_FLOAT64_C( 5.39), SIMDE_FLOAT64_C( 10.27) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_exp2_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_exp2_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( -1.43), SIMDE_FLOAT32_C( 3.90),
SIMDE_FLOAT32_C( 2.40), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 1.74), SIMDE_FLOAT32_C( 2.25), SIMDE_FLOAT32_C( -1.28), SIMDE_FLOAT32_C( 1.46),
SIMDE_FLOAT32_C( 2.43), SIMDE_FLOAT32_C( -3.47), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( 3.90) },
{ SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 14.93),
SIMDE_FLOAT32_C( 5.28), SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 1.66),
SIMDE_FLOAT32_C( 3.34), SIMDE_FLOAT32_C( 4.76), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 2.75),
SIMDE_FLOAT32_C( 5.39), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 14.93) } },
{ { SIMDE_FLOAT32_C( -3.96), SIMDE_FLOAT32_C( 2.44), SIMDE_FLOAT32_C( -3.40), SIMDE_FLOAT32_C( -2.09),
SIMDE_FLOAT32_C( -2.19), SIMDE_FLOAT32_C( 1.78), SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( -3.80),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( -1.05), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 2.82),
SIMDE_FLOAT32_C( -1.74), SIMDE_FLOAT32_C( 2.82), SIMDE_FLOAT32_C( 3.23), SIMDE_FLOAT32_C( 2.11) },
{ SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 5.43), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.23),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 3.43), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.07),
SIMDE_FLOAT32_C( 2.97), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( 7.06),
SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 7.06), SIMDE_FLOAT32_C( 9.38), SIMDE_FLOAT32_C( 4.32) } },
{ { SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( -2.20), SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( -2.54),
SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( -2.24), SIMDE_FLOAT32_C( 2.19), SIMDE_FLOAT32_C( -0.24),
SIMDE_FLOAT32_C( -3.99), SIMDE_FLOAT32_C( -3.09), SIMDE_FLOAT32_C( -2.77), SIMDE_FLOAT32_C( 2.43),
SIMDE_FLOAT32_C( -2.56), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( -2.52) },
{ SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 4.06), SIMDE_FLOAT32_C( 0.17),
SIMDE_FLOAT32_C( 4.06), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 4.56), SIMDE_FLOAT32_C( 0.85),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 5.39),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 1.71), SIMDE_FLOAT32_C( 5.03), SIMDE_FLOAT32_C( 0.17) } },
{ { SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 2.93), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 1.02),
SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 3.30), SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( -1.72),
SIMDE_FLOAT32_C( -1.76), SIMDE_FLOAT32_C( -2.42), SIMDE_FLOAT32_C( -2.90), SIMDE_FLOAT32_C( 0.50),
SIMDE_FLOAT32_C( -3.60), SIMDE_FLOAT32_C( -3.67), SIMDE_FLOAT32_C( -1.38), SIMDE_FLOAT32_C( -0.54) },
{ SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 7.62), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 2.03),
SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 9.85), SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( 0.30),
SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 1.41),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.69) } },
{ { SIMDE_FLOAT32_C( -1.87), SIMDE_FLOAT32_C( -3.37), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -3.85),
SIMDE_FLOAT32_C( -1.61), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( -1.60),
SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( -3.17), SIMDE_FLOAT32_C( 1.48),
SIMDE_FLOAT32_C( -2.09), SIMDE_FLOAT32_C( 3.16), SIMDE_FLOAT32_C( 2.96), SIMDE_FLOAT32_C( 1.12) },
{ SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 1.91), SIMDE_FLOAT32_C( 0.07),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 1.03), SIMDE_FLOAT32_C( 2.20), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 2.79),
SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 8.94), SIMDE_FLOAT32_C( 7.78), SIMDE_FLOAT32_C( 2.17) } },
{ { SIMDE_FLOAT32_C( 2.09), SIMDE_FLOAT32_C( -1.64), SIMDE_FLOAT32_C( -1.86), SIMDE_FLOAT32_C( -1.20),
SIMDE_FLOAT32_C( -2.34), SIMDE_FLOAT32_C( 3.36), SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( -0.10),
SIMDE_FLOAT32_C( -3.05), SIMDE_FLOAT32_C( 2.18), SIMDE_FLOAT32_C( -3.59), SIMDE_FLOAT32_C( -2.65),
SIMDE_FLOAT32_C( 2.52), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -3.36) },
{ SIMDE_FLOAT32_C( 4.26), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.44),
SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 10.27), SIMDE_FLOAT32_C( 2.11), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 4.53), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 5.74), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 0.10) } },
{ { SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( -2.26), SIMDE_FLOAT32_C( -3.21), SIMDE_FLOAT32_C( 2.05),
SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( -3.55), SIMDE_FLOAT32_C( -3.10),
SIMDE_FLOAT32_C( -2.16), SIMDE_FLOAT32_C( -2.72), SIMDE_FLOAT32_C( 2.38), SIMDE_FLOAT32_C( -0.25),
SIMDE_FLOAT32_C( -3.56), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( -3.13), SIMDE_FLOAT32_C( 2.53) },
{ SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 4.14),
SIMDE_FLOAT32_C( 1.82), SIMDE_FLOAT32_C( 1.62), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.12),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 5.21), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 5.78) } },
{ { SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -2.68), SIMDE_FLOAT32_C( -2.64),
SIMDE_FLOAT32_C( -1.63), SIMDE_FLOAT32_C( 2.40), SIMDE_FLOAT32_C( 1.26), SIMDE_FLOAT32_C( -0.68),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 1.67), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.90),
SIMDE_FLOAT32_C( -3.31), SIMDE_FLOAT32_C( -2.52), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( 0.35) },
{ SIMDE_FLOAT32_C( 13.00), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 5.28), SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( 0.62),
SIMDE_FLOAT32_C( 1.51), SIMDE_FLOAT32_C( 3.18), SIMDE_FLOAT32_C( 1.59), SIMDE_FLOAT32_C( 0.54),
SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 1.27) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_exp2_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_exp2_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 1.94), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 3.64), SIMDE_FLOAT32_C( 1.77),
SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( -2.53), SIMDE_FLOAT32_C( -1.72), SIMDE_FLOAT32_C( -1.12),
SIMDE_FLOAT32_C( -3.88), SIMDE_FLOAT32_C( 1.32), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( -0.68),
SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 3.54), SIMDE_FLOAT32_C( -2.21) },
UINT8_C(173),
{ SIMDE_FLOAT32_C( -3.02), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 3.49), SIMDE_FLOAT32_C( -2.99),
SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( -3.69), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 2.30),
SIMDE_FLOAT32_C( 3.57), SIMDE_FLOAT32_C( 2.44), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( -0.19),
SIMDE_FLOAT32_C( 3.28), SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( -2.25), SIMDE_FLOAT32_C( -1.70) },
{ SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 11.24), SIMDE_FLOAT32_C( 0.13),
SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( -1.72), SIMDE_FLOAT32_C( 4.92),
SIMDE_FLOAT32_C( -3.88), SIMDE_FLOAT32_C( 1.32), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( -0.68),
SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 3.54), SIMDE_FLOAT32_C( -2.21) } },
{ { SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 3.52), SIMDE_FLOAT32_C( -3.95), SIMDE_FLOAT32_C( 2.97),
SIMDE_FLOAT32_C( -2.20), SIMDE_FLOAT32_C( -1.07), SIMDE_FLOAT32_C( 3.09), SIMDE_FLOAT32_C( 3.12),
SIMDE_FLOAT32_C( 3.97), SIMDE_FLOAT32_C( -1.59), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.77),
SIMDE_FLOAT32_C( -2.05), SIMDE_FLOAT32_C( 1.79), SIMDE_FLOAT32_C( -3.38), SIMDE_FLOAT32_C( -1.07) },
UINT8_C(225),
{ SIMDE_FLOAT32_C( -3.89), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -2.82), SIMDE_FLOAT32_C( -3.58),
SIMDE_FLOAT32_C( -3.89), SIMDE_FLOAT32_C( 3.49), SIMDE_FLOAT32_C( 3.99), SIMDE_FLOAT32_C( 2.55),
SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( 1.83), SIMDE_FLOAT32_C( -1.10),
SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( -3.87), SIMDE_FLOAT32_C( -3.60), SIMDE_FLOAT32_C( 1.07) },
{ SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 3.52), SIMDE_FLOAT32_C( -3.95), SIMDE_FLOAT32_C( 2.97),
SIMDE_FLOAT32_C( -2.20), SIMDE_FLOAT32_C( 11.24), SIMDE_FLOAT32_C( 15.89), SIMDE_FLOAT32_C( 5.86),
SIMDE_FLOAT32_C( 3.97), SIMDE_FLOAT32_C( -1.59), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.77),
SIMDE_FLOAT32_C( -2.05), SIMDE_FLOAT32_C( 1.79), SIMDE_FLOAT32_C( -3.38), SIMDE_FLOAT32_C( -1.07) } },
{ { SIMDE_FLOAT32_C( -3.82), SIMDE_FLOAT32_C( 3.38), SIMDE_FLOAT32_C( 2.88), SIMDE_FLOAT32_C( -0.89),
SIMDE_FLOAT32_C( 2.46), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -3.13),
SIMDE_FLOAT32_C( -1.99), SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( -1.18), SIMDE_FLOAT32_C( 3.80),
SIMDE_FLOAT32_C( -3.53), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( -1.07), SIMDE_FLOAT32_C( -3.42) },
UINT8_C(147),
{ SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -3.00), SIMDE_FLOAT32_C( -2.20), SIMDE_FLOAT32_C( -0.40),
SIMDE_FLOAT32_C( -3.00), SIMDE_FLOAT32_C( -3.65), SIMDE_FLOAT32_C( -3.35), SIMDE_FLOAT32_C( 0.80),
SIMDE_FLOAT32_C( 2.18), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( -1.64), SIMDE_FLOAT32_C( 2.31),
SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( 3.43), SIMDE_FLOAT32_C( 2.50), SIMDE_FLOAT32_C( -0.67) },
{ SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 2.88), SIMDE_FLOAT32_C( -0.89),
SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( 1.74),
SIMDE_FLOAT32_C( -1.99), SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( -1.18), SIMDE_FLOAT32_C( 3.80),
SIMDE_FLOAT32_C( -3.53), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( -1.07), SIMDE_FLOAT32_C( -3.42) } },
{ { SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( -2.39), SIMDE_FLOAT32_C( -2.21), SIMDE_FLOAT32_C( 0.31),
SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -1.33), SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 0.52),
SIMDE_FLOAT32_C( 1.49), SIMDE_FLOAT32_C( 2.12), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( -2.95), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 1.17) },
UINT8_C( 16),
{ SIMDE_FLOAT32_C( 2.73), SIMDE_FLOAT32_C( -3.23), SIMDE_FLOAT32_C( 3.57), SIMDE_FLOAT32_C( 3.08),
SIMDE_FLOAT32_C( -2.59), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 1.26), SIMDE_FLOAT32_C( 0.97),
SIMDE_FLOAT32_C( 2.73), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( -3.08), SIMDE_FLOAT32_C( 2.17),
SIMDE_FLOAT32_C( -1.93), SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.33) },
{ SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( -2.39), SIMDE_FLOAT32_C( -2.21), SIMDE_FLOAT32_C( 0.31),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( -1.33), SIMDE_FLOAT32_C( 2.32), SIMDE_FLOAT32_C( 0.52),
SIMDE_FLOAT32_C( 1.49), SIMDE_FLOAT32_C( 2.12), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( -2.95), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 1.17) } },
{ { SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( -3.21), SIMDE_FLOAT32_C( -3.65), SIMDE_FLOAT32_C( -3.29),
SIMDE_FLOAT32_C( 3.11), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 2.21), SIMDE_FLOAT32_C( 1.23),
SIMDE_FLOAT32_C( -2.13), SIMDE_FLOAT32_C( -2.55), SIMDE_FLOAT32_C( 2.29), SIMDE_FLOAT32_C( 3.44),
SIMDE_FLOAT32_C( 2.38), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 2.01), SIMDE_FLOAT32_C( 1.11) },
UINT8_C(254),
{ SIMDE_FLOAT32_C( 1.58), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 1.63), SIMDE_FLOAT32_C( -2.04),
SIMDE_FLOAT32_C( -2.55), SIMDE_FLOAT32_C( -1.40), SIMDE_FLOAT32_C( -3.31), SIMDE_FLOAT32_C( 1.02),
SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( 2.86), SIMDE_FLOAT32_C( 3.09), SIMDE_FLOAT32_C( 3.77),
SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( -1.23), SIMDE_FLOAT32_C( 1.81), SIMDE_FLOAT32_C( 0.13) },
{ SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( 3.10), SIMDE_FLOAT32_C( 0.24),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 2.03),
SIMDE_FLOAT32_C( -2.13), SIMDE_FLOAT32_C( -2.55), SIMDE_FLOAT32_C( 2.29), SIMDE_FLOAT32_C( 3.44),
SIMDE_FLOAT32_C( 2.38), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 2.01), SIMDE_FLOAT32_C( 1.11) } },
{ { SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 4.00),
SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -3.52), SIMDE_FLOAT32_C( -2.14), SIMDE_FLOAT32_C( 2.18),
SIMDE_FLOAT32_C( 2.77), SIMDE_FLOAT32_C( -2.70), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -1.78),
SIMDE_FLOAT32_C( 3.31), SIMDE_FLOAT32_C( -2.32), SIMDE_FLOAT32_C( 2.44), SIMDE_FLOAT32_C( 0.89) },
UINT8_C(128),
{ SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 2.85), SIMDE_FLOAT32_C( 3.32), SIMDE_FLOAT32_C( 2.67),
SIMDE_FLOAT32_C( 3.55), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -1.81), SIMDE_FLOAT32_C( 2.41),
SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -2.03), SIMDE_FLOAT32_C( -2.25), SIMDE_FLOAT32_C( 2.20),
SIMDE_FLOAT32_C( 3.78), SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( -2.68), SIMDE_FLOAT32_C( 2.31) },
{ SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 4.00),
SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -3.52), SIMDE_FLOAT32_C( -2.14), SIMDE_FLOAT32_C( 5.31),
SIMDE_FLOAT32_C( 2.77), SIMDE_FLOAT32_C( -2.70), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -1.78),
SIMDE_FLOAT32_C( 3.31), SIMDE_FLOAT32_C( -2.32), SIMDE_FLOAT32_C( 2.44), SIMDE_FLOAT32_C( 0.89) } },
{ { SIMDE_FLOAT32_C( -2.87), SIMDE_FLOAT32_C( -2.68), SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( -2.39),
SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( -2.77), SIMDE_FLOAT32_C( -3.62), SIMDE_FLOAT32_C( 0.48),
SIMDE_FLOAT32_C( 1.79), SIMDE_FLOAT32_C( -1.40), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( 3.47),
SIMDE_FLOAT32_C( -2.97), SIMDE_FLOAT32_C( -3.32), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 1.11) },
UINT8_C( 84),
{ SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( 3.09), SIMDE_FLOAT32_C( -3.00),
SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 3.94),
SIMDE_FLOAT32_C( 3.25), SIMDE_FLOAT32_C( -1.37), SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( 1.13),
SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( 2.03), SIMDE_FLOAT32_C( 2.26), SIMDE_FLOAT32_C( 1.26) },
{ SIMDE_FLOAT32_C( -2.87), SIMDE_FLOAT32_C( -2.68), SIMDE_FLOAT32_C( 8.51), SIMDE_FLOAT32_C( -2.39),
SIMDE_FLOAT32_C( 3.94), SIMDE_FLOAT32_C( -2.77), SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 0.48),
SIMDE_FLOAT32_C( 1.79), SIMDE_FLOAT32_C( -1.40), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( 3.47),
SIMDE_FLOAT32_C( -2.97), SIMDE_FLOAT32_C( -3.32), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 1.11) } },
{ { SIMDE_FLOAT32_C( -2.93), SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( -3.56), SIMDE_FLOAT32_C( -1.70),
SIMDE_FLOAT32_C( -3.75), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -3.91), SIMDE_FLOAT32_C( -1.16),
SIMDE_FLOAT32_C( -3.29), SIMDE_FLOAT32_C( 3.56), SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( -2.61),
SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( -3.02), SIMDE_FLOAT32_C( -3.07), SIMDE_FLOAT32_C( -2.45) },
UINT8_C(114),
{ SIMDE_FLOAT32_C( -3.99), SIMDE_FLOAT32_C( -1.45), SIMDE_FLOAT32_C( -1.26), SIMDE_FLOAT32_C( 1.51),
SIMDE_FLOAT32_C( 2.98), SIMDE_FLOAT32_C( -1.31), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( -2.40),
SIMDE_FLOAT32_C( -1.59), SIMDE_FLOAT32_C( -2.11), SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( -3.56),
SIMDE_FLOAT32_C( -3.85), SIMDE_FLOAT32_C( -1.19), SIMDE_FLOAT32_C( -2.49), SIMDE_FLOAT32_C( -3.98) },
{ SIMDE_FLOAT32_C( -2.93), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( -3.56), SIMDE_FLOAT32_C( -1.70),
SIMDE_FLOAT32_C( 7.89), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( -1.16),
SIMDE_FLOAT32_C( -3.29), SIMDE_FLOAT32_C( 3.56), SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( -2.61),
SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( -3.02), SIMDE_FLOAT32_C( -3.07), SIMDE_FLOAT32_C( -2.45) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_exp2_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_exp2_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 3.48), SIMDE_FLOAT64_C( -0.87), SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( -1.18),
SIMDE_FLOAT64_C( -0.93), SIMDE_FLOAT64_C( -0.41), SIMDE_FLOAT64_C( -1.58), SIMDE_FLOAT64_C( -1.72) },
{ SIMDE_FLOAT64_C( 11.16), SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 1.45), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( 0.30) } },
{ { SIMDE_FLOAT64_C( -0.88), SIMDE_FLOAT64_C( 1.04), SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( -3.45),
SIMDE_FLOAT64_C( 3.01), SIMDE_FLOAT64_C( -3.59), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 3.12) },
{ SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( 2.06), SIMDE_FLOAT64_C( 1.79), SIMDE_FLOAT64_C( 0.09),
SIMDE_FLOAT64_C( 8.06), SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 1.27), SIMDE_FLOAT64_C( 8.69) } },
{ { SIMDE_FLOAT64_C( -1.74), SIMDE_FLOAT64_C( -2.12), SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 0.52),
SIMDE_FLOAT64_C( -1.12), SIMDE_FLOAT64_C( -1.89), SIMDE_FLOAT64_C( 2.97), SIMDE_FLOAT64_C( 2.38) },
{ SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 1.06), SIMDE_FLOAT64_C( 1.43),
SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( 7.84), SIMDE_FLOAT64_C( 5.21) } },
{ { SIMDE_FLOAT64_C( 2.06), SIMDE_FLOAT64_C( 2.07), SIMDE_FLOAT64_C( -3.17), SIMDE_FLOAT64_C( 1.53),
SIMDE_FLOAT64_C( -1.34), SIMDE_FLOAT64_C( 1.50), SIMDE_FLOAT64_C( -0.18), SIMDE_FLOAT64_C( -1.86) },
{ SIMDE_FLOAT64_C( 4.17), SIMDE_FLOAT64_C( 4.20), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 2.89),
SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 2.83), SIMDE_FLOAT64_C( 0.88), SIMDE_FLOAT64_C( 0.28) } },
{ { SIMDE_FLOAT64_C( -3.38), SIMDE_FLOAT64_C( -3.65), SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( -0.31),
SIMDE_FLOAT64_C( -0.06), SIMDE_FLOAT64_C( 3.37), SIMDE_FLOAT64_C( 1.97), SIMDE_FLOAT64_C( 3.07) },
{ SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 1.95), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( 10.34), SIMDE_FLOAT64_C( 3.92), SIMDE_FLOAT64_C( 8.40) } },
{ { SIMDE_FLOAT64_C( 0.41), SIMDE_FLOAT64_C( -1.19), SIMDE_FLOAT64_C( 3.61), SIMDE_FLOAT64_C( -0.57),
SIMDE_FLOAT64_C( -0.78), SIMDE_FLOAT64_C( -0.04), SIMDE_FLOAT64_C( -1.46), SIMDE_FLOAT64_C( 1.48) },
{ SIMDE_FLOAT64_C( 1.33), SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 12.21), SIMDE_FLOAT64_C( 0.67),
SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.97), SIMDE_FLOAT64_C( 0.36), SIMDE_FLOAT64_C( 2.79) } },
{ { SIMDE_FLOAT64_C( 1.84), SIMDE_FLOAT64_C( 2.63), SIMDE_FLOAT64_C( -1.99), SIMDE_FLOAT64_C( -3.28),
SIMDE_FLOAT64_C( -3.26), SIMDE_FLOAT64_C( -3.02), SIMDE_FLOAT64_C( 3.10), SIMDE_FLOAT64_C( 2.79) },
{ SIMDE_FLOAT64_C( 3.58), SIMDE_FLOAT64_C( 6.19), SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 0.10),
SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( 8.57), SIMDE_FLOAT64_C( 6.92) } },
{ { SIMDE_FLOAT64_C( 3.05), SIMDE_FLOAT64_C( 3.93), SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( -2.28),
SIMDE_FLOAT64_C( 1.42), SIMDE_FLOAT64_C( -3.86), SIMDE_FLOAT64_C( -0.14), SIMDE_FLOAT64_C( 2.05) },
{ SIMDE_FLOAT64_C( 8.28), SIMDE_FLOAT64_C( 15.24), SIMDE_FLOAT64_C( 1.26), SIMDE_FLOAT64_C( 0.21),
SIMDE_FLOAT64_C( 2.68), SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( 4.14) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_exp2_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_exp2_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 1.08), SIMDE_FLOAT64_C( -1.78), SIMDE_FLOAT64_C( -3.94), SIMDE_FLOAT64_C( 2.91),
SIMDE_FLOAT64_C( -3.39), SIMDE_FLOAT64_C( -0.34), SIMDE_FLOAT64_C( -1.05), SIMDE_FLOAT64_C( -1.87) },
UINT8_C( 59),
{ SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 3.70), SIMDE_FLOAT64_C( -0.34), SIMDE_FLOAT64_C( -2.49),
SIMDE_FLOAT64_C( -3.69), SIMDE_FLOAT64_C( 1.16), SIMDE_FLOAT64_C( -0.71), SIMDE_FLOAT64_C( 3.16) },
{ SIMDE_FLOAT64_C( 1.39), SIMDE_FLOAT64_C( 13.00), SIMDE_FLOAT64_C( -3.94), SIMDE_FLOAT64_C( 0.18),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 2.23), SIMDE_FLOAT64_C( -1.05), SIMDE_FLOAT64_C( -1.87) } },
{ { SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( -1.70), SIMDE_FLOAT64_C( -1.78),
SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( -3.00), SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( 3.64) },
UINT8_C(181),
{ SIMDE_FLOAT64_C( -3.64), SIMDE_FLOAT64_C( 1.07), SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( -0.99),
SIMDE_FLOAT64_C( 2.92), SIMDE_FLOAT64_C( -2.83), SIMDE_FLOAT64_C( 1.23), SIMDE_FLOAT64_C( 2.98) },
{ SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 1.06), SIMDE_FLOAT64_C( -1.78),
SIMDE_FLOAT64_C( 7.57), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( 7.89) } },
{ { SIMDE_FLOAT64_C( -3.92), SIMDE_FLOAT64_C( 1.84), SIMDE_FLOAT64_C( -1.36), SIMDE_FLOAT64_C( -0.97),
SIMDE_FLOAT64_C( 3.97), SIMDE_FLOAT64_C( -2.62), SIMDE_FLOAT64_C( 3.51), SIMDE_FLOAT64_C( 3.67) },
UINT8_C( 39),
{ SIMDE_FLOAT64_C( -2.98), SIMDE_FLOAT64_C( 3.98), SIMDE_FLOAT64_C( -1.79), SIMDE_FLOAT64_C( 0.31),
SIMDE_FLOAT64_C( 3.14), SIMDE_FLOAT64_C( 2.73), SIMDE_FLOAT64_C( -2.90), SIMDE_FLOAT64_C( -2.56) },
{ SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( 15.78), SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( -0.97),
SIMDE_FLOAT64_C( 3.97), SIMDE_FLOAT64_C( 6.63), SIMDE_FLOAT64_C( 3.51), SIMDE_FLOAT64_C( 3.67) } },
{ { SIMDE_FLOAT64_C( -3.05), SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( -1.56), SIMDE_FLOAT64_C( 0.11),
SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( -3.35), SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( -2.62) },
UINT8_C(222),
{ SIMDE_FLOAT64_C( 3.48), SIMDE_FLOAT64_C( -3.70), SIMDE_FLOAT64_C( 1.91), SIMDE_FLOAT64_C( 0.71),
SIMDE_FLOAT64_C( 3.28), SIMDE_FLOAT64_C( 1.99), SIMDE_FLOAT64_C( -1.45), SIMDE_FLOAT64_C( -2.07) },
{ SIMDE_FLOAT64_C( -3.05), SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 3.76), SIMDE_FLOAT64_C( 1.64),
SIMDE_FLOAT64_C( 9.71), SIMDE_FLOAT64_C( -3.35), SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( 0.24) } },
{ { SIMDE_FLOAT64_C( -2.98), SIMDE_FLOAT64_C( -1.47), SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( -3.47),
SIMDE_FLOAT64_C( -1.80), SIMDE_FLOAT64_C( -3.64), SIMDE_FLOAT64_C( -2.45), SIMDE_FLOAT64_C( -1.83) },
UINT8_C(173),
{ SIMDE_FLOAT64_C( 1.86), SIMDE_FLOAT64_C( -2.68), SIMDE_FLOAT64_C( -2.71), SIMDE_FLOAT64_C( 2.96),
SIMDE_FLOAT64_C( -1.24), SIMDE_FLOAT64_C( -1.76), SIMDE_FLOAT64_C( -0.37), SIMDE_FLOAT64_C( 1.20) },
{ SIMDE_FLOAT64_C( 3.63), SIMDE_FLOAT64_C( -1.47), SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( 7.78),
SIMDE_FLOAT64_C( -1.80), SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( -2.45), SIMDE_FLOAT64_C( 2.30) } },
{ { SIMDE_FLOAT64_C( 2.35), SIMDE_FLOAT64_C( 3.95), SIMDE_FLOAT64_C( 1.85), SIMDE_FLOAT64_C( -1.18),
SIMDE_FLOAT64_C( -2.67), SIMDE_FLOAT64_C( -1.41), SIMDE_FLOAT64_C( -1.70), SIMDE_FLOAT64_C( -2.37) },
UINT8_C(128),
{ SIMDE_FLOAT64_C( 3.01), SIMDE_FLOAT64_C( -3.08), SIMDE_FLOAT64_C( 2.48), SIMDE_FLOAT64_C( -2.44),
SIMDE_FLOAT64_C( -1.16), SIMDE_FLOAT64_C( 3.50), SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 2.16) },
{ SIMDE_FLOAT64_C( 2.35), SIMDE_FLOAT64_C( 3.95), SIMDE_FLOAT64_C( 1.85), SIMDE_FLOAT64_C( -1.18),
SIMDE_FLOAT64_C( -2.67), SIMDE_FLOAT64_C( -1.41), SIMDE_FLOAT64_C( -1.70), SIMDE_FLOAT64_C( 4.47) } },
{ { SIMDE_FLOAT64_C( -3.97), SIMDE_FLOAT64_C( 2.28), SIMDE_FLOAT64_C( 2.51), SIMDE_FLOAT64_C( -2.42),
SIMDE_FLOAT64_C( -3.54), SIMDE_FLOAT64_C( -2.92), SIMDE_FLOAT64_C( 3.44), SIMDE_FLOAT64_C( -2.23) },
UINT8_C( 29),
{ SIMDE_FLOAT64_C( 2.39), SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( -1.97),
SIMDE_FLOAT64_C( -2.27), SIMDE_FLOAT64_C( -1.04), SIMDE_FLOAT64_C( -2.02), SIMDE_FLOAT64_C( 3.58) },
{ SIMDE_FLOAT64_C( 5.24), SIMDE_FLOAT64_C( 2.28), SIMDE_FLOAT64_C( 1.53), SIMDE_FLOAT64_C( 0.26),
SIMDE_FLOAT64_C( 0.21), SIMDE_FLOAT64_C( -2.92), SIMDE_FLOAT64_C( 3.44), SIMDE_FLOAT64_C( -2.23) } },
{ { SIMDE_FLOAT64_C( 1.78), SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( -1.84), SIMDE_FLOAT64_C( -3.92),
SIMDE_FLOAT64_C( 0.94), SIMDE_FLOAT64_C( -1.34), SIMDE_FLOAT64_C( 3.09), SIMDE_FLOAT64_C( 1.86) },
UINT8_C(207),
{ SIMDE_FLOAT64_C( -3.35), SIMDE_FLOAT64_C( -3.29), SIMDE_FLOAT64_C( -3.36), SIMDE_FLOAT64_C( 0.74),
SIMDE_FLOAT64_C( 2.86), SIMDE_FLOAT64_C( -3.33), SIMDE_FLOAT64_C( -0.98), SIMDE_FLOAT64_C( 1.37) },
{ SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 1.67),
SIMDE_FLOAT64_C( 0.94), SIMDE_FLOAT64_C( -1.34), SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 2.58) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_exp2_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_exp10_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -1.28), SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( 3.28), SIMDE_FLOAT32_C( -3.13) },
{ SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 1905.46), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 1.43), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 1.40), SIMDE_FLOAT32_C( -2.59) },
{ SIMDE_FLOAT32_C( 26.92), SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 25.12), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 1.67) },
{ SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 1071.52), SIMDE_FLOAT32_C( 46.77) } },
{ { SIMDE_FLOAT32_C( -3.68), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 1.43) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.69), SIMDE_FLOAT32_C( 33.88), SIMDE_FLOAT32_C( 26.92) } },
{ { SIMDE_FLOAT32_C( -1.86), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( -1.56) },
{ SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 1.91), SIMDE_FLOAT32_C( 5248.07), SIMDE_FLOAT32_C( 0.03) } },
{ { SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 2.62), SIMDE_FLOAT32_C( -1.43), SIMDE_FLOAT32_C( 0.99) },
{ SIMDE_FLOAT32_C( 3.55), SIMDE_FLOAT32_C( 416.87), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 9.77) } },
{ { SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 3.09), SIMDE_FLOAT32_C( 2.38), SIMDE_FLOAT32_C( -3.37) },
{ SIMDE_FLOAT32_C( 97.72), SIMDE_FLOAT32_C( 1230.27), SIMDE_FLOAT32_C( 239.88), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -2.15), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.56) },
{ SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 1.66), SIMDE_FLOAT32_C( 2.57), SIMDE_FLOAT32_C( 3.63) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_exp10_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_exp10_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 2.71), SIMDE_FLOAT64_C( -2.06) },
{ SIMDE_FLOAT64_C( 512.86), SIMDE_FLOAT64_C( 0.01) } },
{ { SIMDE_FLOAT64_C( 0.82), SIMDE_FLOAT64_C( 2.37) },
{ SIMDE_FLOAT64_C( 6.61), SIMDE_FLOAT64_C( 234.42) } },
{ { SIMDE_FLOAT64_C( -1.27), SIMDE_FLOAT64_C( -2.72) },
{ SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -0.02), SIMDE_FLOAT64_C( 1.72) },
{ SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( 52.48) } },
{ { SIMDE_FLOAT64_C( -2.59), SIMDE_FLOAT64_C( -1.62) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.02) } },
{ { SIMDE_FLOAT64_C( 1.83), SIMDE_FLOAT64_C( 3.25) },
{ SIMDE_FLOAT64_C( 67.61), SIMDE_FLOAT64_C( 1778.28) } },
{ { SIMDE_FLOAT64_C( -2.12), SIMDE_FLOAT64_C( 3.99) },
{ SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 9772.37) } },
{ { SIMDE_FLOAT64_C( -3.59), SIMDE_FLOAT64_C( 0.94) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 8.71) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_exp10_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_exp10_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -2.69), SIMDE_FLOAT32_C( 3.91), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( -0.80),
SIMDE_FLOAT32_C( 2.51), SIMDE_FLOAT32_C( -1.38), SIMDE_FLOAT32_C( -3.31), SIMDE_FLOAT32_C( -0.75) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 8128.31), SIMDE_FLOAT32_C( 4.07), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 323.59), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.18) } },
{ { SIMDE_FLOAT32_C( 3.51), SIMDE_FLOAT32_C( -3.93), SIMDE_FLOAT32_C( -3.82), SIMDE_FLOAT32_C( 1.46),
SIMDE_FLOAT32_C( -3.04), SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( -3.04), SIMDE_FLOAT32_C( -3.66) },
{ SIMDE_FLOAT32_C( 3235.94), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 28.84),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1949.84), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -2.34), SIMDE_FLOAT32_C( -3.98), SIMDE_FLOAT32_C( -1.70), SIMDE_FLOAT32_C( -1.23),
SIMDE_FLOAT32_C( -3.97), SIMDE_FLOAT32_C( -3.62), SIMDE_FLOAT32_C( 3.06), SIMDE_FLOAT32_C( -1.19) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1148.15), SIMDE_FLOAT32_C( 0.06) } },
{ { SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( -1.22), SIMDE_FLOAT32_C( 2.16), SIMDE_FLOAT32_C( 3.83),
SIMDE_FLOAT32_C( -3.41), SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( -2.66), SIMDE_FLOAT32_C( -2.09) },
{ SIMDE_FLOAT32_C( 5011.87), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 144.54), SIMDE_FLOAT32_C( 6760.83),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.01) } },
{ { SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( -2.06),
SIMDE_FLOAT32_C( -3.42), SIMDE_FLOAT32_C( 1.79), SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( -3.92) },
{ SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 89.13), SIMDE_FLOAT32_C( 12.88), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 61.66), SIMDE_FLOAT32_C( 15.49), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( 1.37), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 2.82),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 2.50), SIMDE_FLOAT32_C( 3.16), SIMDE_FLOAT32_C( 2.33) },
{ SIMDE_FLOAT32_C( 72.44), SIMDE_FLOAT32_C( 23.44), SIMDE_FLOAT32_C( 34.67), SIMDE_FLOAT32_C( 660.69),
SIMDE_FLOAT32_C( 4.57), SIMDE_FLOAT32_C( 316.23), SIMDE_FLOAT32_C( 1445.44), SIMDE_FLOAT32_C( 213.80) } },
{ { SIMDE_FLOAT32_C( 2.52), SIMDE_FLOAT32_C( -2.54), SIMDE_FLOAT32_C( -2.90), SIMDE_FLOAT32_C( 2.55),
SIMDE_FLOAT32_C( -2.16), SIMDE_FLOAT32_C( -3.84), SIMDE_FLOAT32_C( -2.64), SIMDE_FLOAT32_C( -2.46) },
{ SIMDE_FLOAT32_C( 331.13), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 354.81),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -1.06), SIMDE_FLOAT32_C( 3.52), SIMDE_FLOAT32_C( -2.64), SIMDE_FLOAT32_C( -0.47),
SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( -1.29), SIMDE_FLOAT32_C( 1.44), SIMDE_FLOAT32_C( 2.48) },
{ SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 3311.31), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 27.54), SIMDE_FLOAT32_C( 302.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_exp10_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_exp10_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -3.01), SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( -0.62) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 2.34), SIMDE_FLOAT64_C( 1.26), SIMDE_FLOAT64_C( 0.24) } },
{ { SIMDE_FLOAT64_C( 1.29), SIMDE_FLOAT64_C( 2.86), SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( -3.99) },
{ SIMDE_FLOAT64_C( 19.50), SIMDE_FLOAT64_C( 724.44), SIMDE_FLOAT64_C( 5.62), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -2.93), SIMDE_FLOAT64_C( 3.81), SIMDE_FLOAT64_C( 3.34), SIMDE_FLOAT64_C( 3.21) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 6456.54), SIMDE_FLOAT64_C( 2187.76), SIMDE_FLOAT64_C( 1621.81) } },
{ { SIMDE_FLOAT64_C( -2.76), SIMDE_FLOAT64_C( -1.49), SIMDE_FLOAT64_C( 3.76), SIMDE_FLOAT64_C( -1.66) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 5754.40), SIMDE_FLOAT64_C( 0.02) } },
{ { SIMDE_FLOAT64_C( -0.02), SIMDE_FLOAT64_C( -2.70), SIMDE_FLOAT64_C( 2.90), SIMDE_FLOAT64_C( -0.73) },
{ SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 794.33), SIMDE_FLOAT64_C( 0.19) } },
{ { SIMDE_FLOAT64_C( -1.67), SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( -2.21), SIMDE_FLOAT64_C( -3.15) },
{ SIMDE_FLOAT64_C( 0.02), SIMDE_FLOAT64_C( 1.58), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 2.30), SIMDE_FLOAT64_C( 3.98), SIMDE_FLOAT64_C( -0.86), SIMDE_FLOAT64_C( -1.96) },
{ SIMDE_FLOAT64_C( 199.53), SIMDE_FLOAT64_C( 9549.93), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.01) } },
{ { SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( 1.49) },
{ SIMDE_FLOAT64_C( 3.16), SIMDE_FLOAT64_C( 3.98), SIMDE_FLOAT64_C( 4.47), SIMDE_FLOAT64_C( 30.90) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_exp10_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_exp10_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 2.15), SIMDE_FLOAT32_C( 3.90), SIMDE_FLOAT32_C( -3.06), SIMDE_FLOAT32_C( -3.99),
SIMDE_FLOAT32_C( -1.49), SIMDE_FLOAT32_C( 3.34), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.15),
SIMDE_FLOAT32_C( 3.24), SIMDE_FLOAT32_C( 2.10), SIMDE_FLOAT32_C( -1.61), SIMDE_FLOAT32_C( -3.33),
SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( -2.51), SIMDE_FLOAT32_C( 3.50), SIMDE_FLOAT32_C( -1.30) },
{ SIMDE_FLOAT32_C( 141.25), SIMDE_FLOAT32_C( 7943.28), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 2187.76), SIMDE_FLOAT32_C( 1.20), SIMDE_FLOAT32_C( 1.41),
SIMDE_FLOAT32_C( 1737.80), SIMDE_FLOAT32_C( 125.89), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 48.98), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 3162.28), SIMDE_FLOAT32_C( 0.05) } },
{ { SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -1.13), SIMDE_FLOAT32_C( -1.51), SIMDE_FLOAT32_C( 1.48),
SIMDE_FLOAT32_C( -3.11), SIMDE_FLOAT32_C( -2.56), SIMDE_FLOAT32_C( -2.35), SIMDE_FLOAT32_C( -0.62),
SIMDE_FLOAT32_C( 2.03), SIMDE_FLOAT32_C( -1.51), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -3.88),
SIMDE_FLOAT32_C( -2.12), SIMDE_FLOAT32_C( 3.49), SIMDE_FLOAT32_C( -2.42), SIMDE_FLOAT32_C( -3.98) },
{ SIMDE_FLOAT32_C( 3.39), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 30.20),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.24),
SIMDE_FLOAT32_C( 107.15), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 6.92), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 3090.30), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 3.38), SIMDE_FLOAT32_C( -1.48), SIMDE_FLOAT32_C( -3.96), SIMDE_FLOAT32_C( -2.11),
SIMDE_FLOAT32_C( -2.14), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( -2.89),
SIMDE_FLOAT32_C( -1.78), SIMDE_FLOAT32_C( -3.57), SIMDE_FLOAT32_C( -2.23), SIMDE_FLOAT32_C( 3.90),
SIMDE_FLOAT32_C( -2.08), SIMDE_FLOAT32_C( -2.73), SIMDE_FLOAT32_C( -1.40), SIMDE_FLOAT32_C( 2.46) },
{ SIMDE_FLOAT32_C( 2398.83), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 1.29), SIMDE_FLOAT32_C( 109.65), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 7943.28),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 288.40) } },
{ { SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 1.09), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( 1.03),
SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 1.59), SIMDE_FLOAT32_C( -3.59), SIMDE_FLOAT32_C( 0.56),
SIMDE_FLOAT32_C( -3.91), SIMDE_FLOAT32_C( 1.26), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -2.04),
SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 2.26), SIMDE_FLOAT32_C( -2.02), SIMDE_FLOAT32_C( 0.13) },
{ SIMDE_FLOAT32_C( 1.41), SIMDE_FLOAT32_C( 12.30), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 10.72),
SIMDE_FLOAT32_C( 338.84), SIMDE_FLOAT32_C( 38.90), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 3.63),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 18.20), SIMDE_FLOAT32_C( 4.79), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 5.50), SIMDE_FLOAT32_C( 181.97), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 1.35) } },
{ { SIMDE_FLOAT32_C( -3.22), SIMDE_FLOAT32_C( -1.98), SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( -1.36),
SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( -3.65),
SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 1.52), SIMDE_FLOAT32_C( -3.75), SIMDE_FLOAT32_C( 2.41),
SIMDE_FLOAT32_C( 2.80), SIMDE_FLOAT32_C( -1.15), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( -1.06) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 104.71), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 138.04), SIMDE_FLOAT32_C( 1.15), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 3.09), SIMDE_FLOAT32_C( 33.11), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 257.04),
SIMDE_FLOAT32_C( 630.96), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 7.41), SIMDE_FLOAT32_C( 0.09) } },
{ { SIMDE_FLOAT32_C( 3.94), SIMDE_FLOAT32_C( -3.19), SIMDE_FLOAT32_C( 3.97), SIMDE_FLOAT32_C( 2.47),
SIMDE_FLOAT32_C( 2.41), SIMDE_FLOAT32_C( -3.62), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 2.49),
SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( -3.55), SIMDE_FLOAT32_C( -1.62),
SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( 2.51), SIMDE_FLOAT32_C( 2.74) },
{ SIMDE_FLOAT32_C( 8709.64), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 9332.54), SIMDE_FLOAT32_C( 295.12),
SIMDE_FLOAT32_C( 257.04), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 309.03),
SIMDE_FLOAT32_C( 43.65), SIMDE_FLOAT32_C( 5011.87), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.02),
SIMDE_FLOAT32_C( 91.20), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 323.59), SIMDE_FLOAT32_C( 549.54) } },
{ { SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -2.61), SIMDE_FLOAT32_C( -1.40),
SIMDE_FLOAT32_C( -3.41), SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( -1.05), SIMDE_FLOAT32_C( 1.08),
SIMDE_FLOAT32_C( -1.34), SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -2.54),
SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( -3.64), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 2.00) },
{ SIMDE_FLOAT32_C( 2.88), SIMDE_FLOAT32_C( 3.39), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 13.80), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 12.02),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 112.20), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.51), SIMDE_FLOAT32_C( 100.00) } },
{ { SIMDE_FLOAT32_C( -2.83), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 3.58),
SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 3.48), SIMDE_FLOAT32_C( 2.07), SIMDE_FLOAT32_C( -1.60),
SIMDE_FLOAT32_C( 3.19), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 1.15),
SIMDE_FLOAT32_C( -3.03), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 1.43) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.34), SIMDE_FLOAT32_C( 2.88), SIMDE_FLOAT32_C( 3801.89),
SIMDE_FLOAT32_C( 5.75), SIMDE_FLOAT32_C( 3019.95), SIMDE_FLOAT32_C( 117.49), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 1548.82), SIMDE_FLOAT32_C( 338.84), SIMDE_FLOAT32_C( 6.03), SIMDE_FLOAT32_C( 14.13),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 26.92) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_exp10_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_exp10_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( -3.09), SIMDE_FLOAT32_C( 2.30),
SIMDE_FLOAT32_C( -3.02), SIMDE_FLOAT32_C( -1.71), SIMDE_FLOAT32_C( -2.65), SIMDE_FLOAT32_C( 2.34),
SIMDE_FLOAT32_C( 3.56), SIMDE_FLOAT32_C( -1.53), SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 2.13),
SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( -3.96), SIMDE_FLOAT32_C( -3.24), SIMDE_FLOAT32_C( -2.96) },
UINT8_C( 58),
{ SIMDE_FLOAT32_C( 2.80), SIMDE_FLOAT32_C( 1.59), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -1.26),
SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -2.77),
SIMDE_FLOAT32_C( 3.35), SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( 3.33),
SIMDE_FLOAT32_C( 2.21), SIMDE_FLOAT32_C( -1.15), SIMDE_FLOAT32_C( -1.25), SIMDE_FLOAT32_C( 0.74) },
{ SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 38.90), SIMDE_FLOAT32_C( -3.09), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 6.46), SIMDE_FLOAT32_C( 56.23), SIMDE_FLOAT32_C( -2.65), SIMDE_FLOAT32_C( 2.34),
SIMDE_FLOAT32_C( 3.56), SIMDE_FLOAT32_C( -1.53), SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 2.13),
SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( -3.96), SIMDE_FLOAT32_C( -3.24), SIMDE_FLOAT32_C( -2.96) } },
{ { SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( -2.95), SIMDE_FLOAT32_C( 1.72), SIMDE_FLOAT32_C( 2.05),
SIMDE_FLOAT32_C( -1.60), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 1.61), SIMDE_FLOAT32_C( 0.87),
SIMDE_FLOAT32_C( -2.25), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( -1.15), SIMDE_FLOAT32_C( -2.21),
SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -3.54), SIMDE_FLOAT32_C( -0.71) },
UINT8_C(193),
{ SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 2.03), SIMDE_FLOAT32_C( 2.29), SIMDE_FLOAT32_C( -1.46),
SIMDE_FLOAT32_C( -1.68), SIMDE_FLOAT32_C( 3.52), SIMDE_FLOAT32_C( -2.11), SIMDE_FLOAT32_C( -3.63),
SIMDE_FLOAT32_C( 1.85), SIMDE_FLOAT32_C( -2.78), SIMDE_FLOAT32_C( 2.58), SIMDE_FLOAT32_C( -3.29),
SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 1.02) },
{ SIMDE_FLOAT32_C( 6.17), SIMDE_FLOAT32_C( -2.95), SIMDE_FLOAT32_C( 1.72), SIMDE_FLOAT32_C( 2.05),
SIMDE_FLOAT32_C( -1.60), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -2.25), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( -1.15), SIMDE_FLOAT32_C( -2.21),
SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -3.54), SIMDE_FLOAT32_C( -0.71) } },
{ { SIMDE_FLOAT32_C( -2.96), SIMDE_FLOAT32_C( -1.49), SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( 1.10),
SIMDE_FLOAT32_C( -3.88), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 2.86), SIMDE_FLOAT32_C( -0.15),
SIMDE_FLOAT32_C( 3.14), SIMDE_FLOAT32_C( -3.35), SIMDE_FLOAT32_C( -3.66), SIMDE_FLOAT32_C( -0.97),
SIMDE_FLOAT32_C( -2.89), SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 1.89) },
UINT8_C(215),
{ SIMDE_FLOAT32_C( -1.19), SIMDE_FLOAT32_C( -3.57), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -1.67),
SIMDE_FLOAT32_C( -1.68), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( -3.82), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( -1.07), SIMDE_FLOAT32_C( -3.11), SIMDE_FLOAT32_C( 3.52), SIMDE_FLOAT32_C( 2.25),
SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( 3.86) },
{ SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 1.10),
SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( 3.14), SIMDE_FLOAT32_C( -3.35), SIMDE_FLOAT32_C( -3.66), SIMDE_FLOAT32_C( -0.97),
SIMDE_FLOAT32_C( -2.89), SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 1.89) } },
{ { SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 3.98), SIMDE_FLOAT32_C( -3.74),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 3.40), SIMDE_FLOAT32_C( -0.10),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( 1.01), SIMDE_FLOAT32_C( 3.80),
SIMDE_FLOAT32_C( 2.95), SIMDE_FLOAT32_C( -1.09), SIMDE_FLOAT32_C( -2.53), SIMDE_FLOAT32_C( -2.24) },
UINT8_C(253),
{ SIMDE_FLOAT32_C( 1.45), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 1.66), SIMDE_FLOAT32_C( -2.19),
SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -2.79), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 1.16),
SIMDE_FLOAT32_C( -3.28), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.49), SIMDE_FLOAT32_C( 1.26),
SIMDE_FLOAT32_C( -1.71), SIMDE_FLOAT32_C( -1.63), SIMDE_FLOAT32_C( -2.77), SIMDE_FLOAT32_C( 2.69) },
{ SIMDE_FLOAT32_C( 28.18), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 45.71), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 5.62), SIMDE_FLOAT32_C( 14.45),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( -1.56), SIMDE_FLOAT32_C( 1.01), SIMDE_FLOAT32_C( 3.80),
SIMDE_FLOAT32_C( 2.95), SIMDE_FLOAT32_C( -1.09), SIMDE_FLOAT32_C( -2.53), SIMDE_FLOAT32_C( -2.24) } },
{ { SIMDE_FLOAT32_C( -1.65), SIMDE_FLOAT32_C( -2.52), SIMDE_FLOAT32_C( -2.06), SIMDE_FLOAT32_C( 2.18),
SIMDE_FLOAT32_C( -3.11), SIMDE_FLOAT32_C( 1.85), SIMDE_FLOAT32_C( -1.65), SIMDE_FLOAT32_C( -0.68),
SIMDE_FLOAT32_C( -1.14), SIMDE_FLOAT32_C( -1.85), SIMDE_FLOAT32_C( -1.73), SIMDE_FLOAT32_C( 1.76),
SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -2.90), SIMDE_FLOAT32_C( -2.93) },
UINT8_C(202),
{ SIMDE_FLOAT32_C( 2.76), SIMDE_FLOAT32_C( -1.12), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 3.97),
SIMDE_FLOAT32_C( 3.63), SIMDE_FLOAT32_C( -2.46), SIMDE_FLOAT32_C( -3.31), SIMDE_FLOAT32_C( -1.37),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 2.42),
SIMDE_FLOAT32_C( 3.18), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( -3.23), SIMDE_FLOAT32_C( -3.34) },
{ SIMDE_FLOAT32_C( -1.65), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( -2.06), SIMDE_FLOAT32_C( 9332.54),
SIMDE_FLOAT32_C( -3.11), SIMDE_FLOAT32_C( 1.85), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( -1.14), SIMDE_FLOAT32_C( -1.85), SIMDE_FLOAT32_C( -1.73), SIMDE_FLOAT32_C( 1.76),
SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -2.90), SIMDE_FLOAT32_C( -2.93) } },
{ { SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( 2.95), SIMDE_FLOAT32_C( -2.45), SIMDE_FLOAT32_C( -0.61),
SIMDE_FLOAT32_C( -2.70), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 2.25), SIMDE_FLOAT32_C( -0.54),
SIMDE_FLOAT32_C( 3.14), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 3.08), SIMDE_FLOAT32_C( -0.83),
SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( -3.85), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( -0.12) },
UINT8_C( 4),
{ SIMDE_FLOAT32_C( 3.68), SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( -1.34), SIMDE_FLOAT32_C( -2.78),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 1.29), SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( -1.51),
SIMDE_FLOAT32_C( -1.79), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -2.34), SIMDE_FLOAT32_C( 1.81),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -1.67), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.57) },
{ SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( 2.95), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( -0.61),
SIMDE_FLOAT32_C( -2.70), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 2.25), SIMDE_FLOAT32_C( -0.54),
SIMDE_FLOAT32_C( 3.14), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 3.08), SIMDE_FLOAT32_C( -0.83),
SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( -3.85), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( -0.12) } },
{ { SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 2.76), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -3.26),
SIMDE_FLOAT32_C( 1.01), SIMDE_FLOAT32_C( -3.81), SIMDE_FLOAT32_C( 3.89), SIMDE_FLOAT32_C( -2.98),
SIMDE_FLOAT32_C( 3.27), SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 3.42),
SIMDE_FLOAT32_C( 2.35), SIMDE_FLOAT32_C( -1.99), SIMDE_FLOAT32_C( -1.55), SIMDE_FLOAT32_C( 2.03) },
UINT8_C( 74),
{ SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( 3.25), SIMDE_FLOAT32_C( -1.61), SIMDE_FLOAT32_C( -1.60),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( -3.77),
SIMDE_FLOAT32_C( 2.54), SIMDE_FLOAT32_C( -1.58), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( -3.14),
SIMDE_FLOAT32_C( 1.78), SIMDE_FLOAT32_C( -3.87), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.53) },
{ SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 1778.28), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 1.01), SIMDE_FLOAT32_C( -3.81), SIMDE_FLOAT32_C( 3.98), SIMDE_FLOAT32_C( -2.98),
SIMDE_FLOAT32_C( 3.27), SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 3.42),
SIMDE_FLOAT32_C( 2.35), SIMDE_FLOAT32_C( -1.99), SIMDE_FLOAT32_C( -1.55), SIMDE_FLOAT32_C( 2.03) } },
{ { SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( -2.46), SIMDE_FLOAT32_C( 1.04),
SIMDE_FLOAT32_C( 1.37), SIMDE_FLOAT32_C( -1.44), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -3.58),
SIMDE_FLOAT32_C( -3.30), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 2.77), SIMDE_FLOAT32_C( -1.29),
SIMDE_FLOAT32_C( 2.18), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( -3.44), SIMDE_FLOAT32_C( -0.71) },
UINT8_C(120),
{ SIMDE_FLOAT32_C( -1.05), SIMDE_FLOAT32_C( 1.68), SIMDE_FLOAT32_C( -3.41), SIMDE_FLOAT32_C( 3.82),
SIMDE_FLOAT32_C( -1.71), SIMDE_FLOAT32_C( -3.18), SIMDE_FLOAT32_C( 2.36), SIMDE_FLOAT32_C( 0.70),
SIMDE_FLOAT32_C( 1.52), SIMDE_FLOAT32_C( 3.22), SIMDE_FLOAT32_C( -1.52), SIMDE_FLOAT32_C( 1.64),
SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( 3.01), SIMDE_FLOAT32_C( -1.50), SIMDE_FLOAT32_C( -2.56) },
{ SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( -2.46), SIMDE_FLOAT32_C( 6606.93),
SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 229.09), SIMDE_FLOAT32_C( -3.58),
SIMDE_FLOAT32_C( -3.30), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 2.77), SIMDE_FLOAT32_C( -1.29),
SIMDE_FLOAT32_C( 2.18), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( -3.44), SIMDE_FLOAT32_C( -0.71) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_exp10_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_exp10_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( -1.02), SIMDE_FLOAT64_C( -3.98),
SIMDE_FLOAT64_C( 3.95), SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( -2.47), SIMDE_FLOAT64_C( -3.25) },
{ SIMDE_FLOAT64_C( 8.51), SIMDE_FLOAT64_C( 7.94), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 8912.51), SIMDE_FLOAT64_C( 2.04), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -3.33), SIMDE_FLOAT64_C( 1.18), SIMDE_FLOAT64_C( -1.87), SIMDE_FLOAT64_C( 0.97),
SIMDE_FLOAT64_C( 2.34), SIMDE_FLOAT64_C( -3.33), SIMDE_FLOAT64_C( -0.73), SIMDE_FLOAT64_C( 2.80) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 15.14), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 9.33),
SIMDE_FLOAT64_C( 218.78), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.19), SIMDE_FLOAT64_C( 630.96) } },
{ { SIMDE_FLOAT64_C( 2.86), SIMDE_FLOAT64_C( 2.64), SIMDE_FLOAT64_C( -2.88), SIMDE_FLOAT64_C( 3.99),
SIMDE_FLOAT64_C( 2.91), SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( -2.79), SIMDE_FLOAT64_C( 3.08) },
{ SIMDE_FLOAT64_C( 724.44), SIMDE_FLOAT64_C( 436.52), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 9772.37),
SIMDE_FLOAT64_C( 812.83), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1202.26) } },
{ { SIMDE_FLOAT64_C( 3.79), SIMDE_FLOAT64_C( 1.10), SIMDE_FLOAT64_C( -2.75), SIMDE_FLOAT64_C( 2.52),
SIMDE_FLOAT64_C( 1.05), SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( -2.28), SIMDE_FLOAT64_C( -2.02) },
{ SIMDE_FLOAT64_C( 6165.95), SIMDE_FLOAT64_C( 12.59), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 331.13),
SIMDE_FLOAT64_C( 11.22), SIMDE_FLOAT64_C( 2.34), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 0.01) } },
{ { SIMDE_FLOAT64_C( -2.73), SIMDE_FLOAT64_C( 0.70), SIMDE_FLOAT64_C( -2.00), SIMDE_FLOAT64_C( -2.78),
SIMDE_FLOAT64_C( -2.99), SIMDE_FLOAT64_C( -0.48), SIMDE_FLOAT64_C( -2.02), SIMDE_FLOAT64_C( -2.32) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 5.01), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -3.30), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 2.65), SIMDE_FLOAT64_C( 3.04),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -2.08), SIMDE_FLOAT64_C( 1.84), SIMDE_FLOAT64_C( -0.36) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 1.29), SIMDE_FLOAT64_C( 446.68), SIMDE_FLOAT64_C( 1096.48),
SIMDE_FLOAT64_C( 6.03), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 69.18), SIMDE_FLOAT64_C( 0.44) } },
{ { SIMDE_FLOAT64_C( -3.45), SIMDE_FLOAT64_C( 2.96), SIMDE_FLOAT64_C( -0.37), SIMDE_FLOAT64_C( 3.46),
SIMDE_FLOAT64_C( -1.89), SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( 2.54), SIMDE_FLOAT64_C( -2.10) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 912.01), SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 2884.03),
SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 6.92), SIMDE_FLOAT64_C( 346.74), SIMDE_FLOAT64_C( 0.01) } },
{ { SIMDE_FLOAT64_C( -2.06), SIMDE_FLOAT64_C( 3.79), SIMDE_FLOAT64_C( -3.58), SIMDE_FLOAT64_C( 2.98),
SIMDE_FLOAT64_C( 0.16), SIMDE_FLOAT64_C( -1.86), SIMDE_FLOAT64_C( -3.04), SIMDE_FLOAT64_C( 1.43) },
{ SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 6165.95), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 954.99),
SIMDE_FLOAT64_C( 1.45), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 26.92) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_exp10_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_exp10_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 2.35), SIMDE_FLOAT64_C( 3.51), SIMDE_FLOAT64_C( -2.45), SIMDE_FLOAT64_C( 0.95),
SIMDE_FLOAT64_C( -2.12), SIMDE_FLOAT64_C( -1.70), SIMDE_FLOAT64_C( 3.27), SIMDE_FLOAT64_C( -3.97) },
UINT8_C( 85),
{ SIMDE_FLOAT64_C( -1.97), SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( -3.87),
SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -1.78), SIMDE_FLOAT64_C( 2.41), SIMDE_FLOAT64_C( 3.67) },
{ SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 3.51), SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 0.95),
SIMDE_FLOAT64_C( 1.51), SIMDE_FLOAT64_C( -1.70), SIMDE_FLOAT64_C( 257.04), SIMDE_FLOAT64_C( -3.97) } },
{ { SIMDE_FLOAT64_C( -1.74), SIMDE_FLOAT64_C( -3.97), SIMDE_FLOAT64_C( 3.52), SIMDE_FLOAT64_C( -3.35),
SIMDE_FLOAT64_C( -1.31), SIMDE_FLOAT64_C( 1.64), SIMDE_FLOAT64_C( 3.64), SIMDE_FLOAT64_C( 1.35) },
UINT8_C(237),
{ SIMDE_FLOAT64_C( -3.09), SIMDE_FLOAT64_C( 0.21), SIMDE_FLOAT64_C( -0.49), SIMDE_FLOAT64_C( 1.71),
SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( -2.14), SIMDE_FLOAT64_C( 1.22), SIMDE_FLOAT64_C( 1.16) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -3.97), SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( 51.29),
SIMDE_FLOAT64_C( -1.31), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 16.60), SIMDE_FLOAT64_C( 14.45) } },
{ { SIMDE_FLOAT64_C( 2.80), SIMDE_FLOAT64_C( 3.11), SIMDE_FLOAT64_C( 3.45), SIMDE_FLOAT64_C( 2.07),
SIMDE_FLOAT64_C( 3.14), SIMDE_FLOAT64_C( -1.25), SIMDE_FLOAT64_C( -3.90), SIMDE_FLOAT64_C( -0.54) },
UINT8_C(112),
{ SIMDE_FLOAT64_C( -3.77), SIMDE_FLOAT64_C( 3.65), SIMDE_FLOAT64_C( -3.35), SIMDE_FLOAT64_C( 2.64),
SIMDE_FLOAT64_C( 3.31), SIMDE_FLOAT64_C( -1.09), SIMDE_FLOAT64_C( 2.67), SIMDE_FLOAT64_C( 2.83) },
{ SIMDE_FLOAT64_C( 2.80), SIMDE_FLOAT64_C( 3.11), SIMDE_FLOAT64_C( 3.45), SIMDE_FLOAT64_C( 2.07),
SIMDE_FLOAT64_C( 2041.74), SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 467.74), SIMDE_FLOAT64_C( -0.54) } },
{ { SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( -2.64), SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( -0.80),
SIMDE_FLOAT64_C( 2.71), SIMDE_FLOAT64_C( -3.20), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -1.08) },
UINT8_C( 28),
{ SIMDE_FLOAT64_C( -2.18), SIMDE_FLOAT64_C( 2.52), SIMDE_FLOAT64_C( 2.16), SIMDE_FLOAT64_C( 3.05),
SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( 2.15), SIMDE_FLOAT64_C( -0.87) },
{ SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( -2.64), SIMDE_FLOAT64_C( 144.54), SIMDE_FLOAT64_C( 1122.02),
SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( -3.20), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -1.08) } },
{ { SIMDE_FLOAT64_C( -0.98), SIMDE_FLOAT64_C( 1.30), SIMDE_FLOAT64_C( 1.89), SIMDE_FLOAT64_C( -0.87),
SIMDE_FLOAT64_C( -3.24), SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( -0.65), SIMDE_FLOAT64_C( -3.59) },
UINT8_C(243),
{ SIMDE_FLOAT64_C( -2.01), SIMDE_FLOAT64_C( 3.72), SIMDE_FLOAT64_C( 3.87), SIMDE_FLOAT64_C( -3.34),
SIMDE_FLOAT64_C( 2.55), SIMDE_FLOAT64_C( -0.57), SIMDE_FLOAT64_C( -1.99), SIMDE_FLOAT64_C( -0.99) },
{ SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 5248.07), SIMDE_FLOAT64_C( 1.89), SIMDE_FLOAT64_C( -0.87),
SIMDE_FLOAT64_C( 354.81), SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 0.10) } },
{ { SIMDE_FLOAT64_C( 2.63), SIMDE_FLOAT64_C( -3.28), SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( -1.27),
SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( -3.89), SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( -1.84) },
UINT8_C( 79),
{ SIMDE_FLOAT64_C( -0.40), SIMDE_FLOAT64_C( 1.84), SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( -2.25),
SIMDE_FLOAT64_C( -3.02), SIMDE_FLOAT64_C( 2.26), SIMDE_FLOAT64_C( 3.05), SIMDE_FLOAT64_C( 2.87) },
{ SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 69.18), SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( 0.01),
SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( -3.89), SIMDE_FLOAT64_C( 1122.02), SIMDE_FLOAT64_C( -1.84) } },
{ { SIMDE_FLOAT64_C( -2.62), SIMDE_FLOAT64_C( 3.81), SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.73),
SIMDE_FLOAT64_C( -3.78), SIMDE_FLOAT64_C( -3.86), SIMDE_FLOAT64_C( 2.72), SIMDE_FLOAT64_C( 3.93) },
UINT8_C(113),
{ SIMDE_FLOAT64_C( 3.38), SIMDE_FLOAT64_C( 2.48), SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( -2.61),
SIMDE_FLOAT64_C( -2.50), SIMDE_FLOAT64_C( -1.93), SIMDE_FLOAT64_C( -1.89), SIMDE_FLOAT64_C( 1.31) },
{ SIMDE_FLOAT64_C( 2398.83), SIMDE_FLOAT64_C( 3.81), SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.73),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 3.93) } },
{ { SIMDE_FLOAT64_C( 0.80), SIMDE_FLOAT64_C( 1.75), SIMDE_FLOAT64_C( 1.42), SIMDE_FLOAT64_C( -2.64),
SIMDE_FLOAT64_C( 3.91), SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( 1.76) },
UINT8_C(247),
{ SIMDE_FLOAT64_C( 2.71), SIMDE_FLOAT64_C( 2.74), SIMDE_FLOAT64_C( 1.18), SIMDE_FLOAT64_C( 1.76),
SIMDE_FLOAT64_C( 1.61), SIMDE_FLOAT64_C( 2.56), SIMDE_FLOAT64_C( 1.57), SIMDE_FLOAT64_C( -3.21) },
{ SIMDE_FLOAT64_C( 512.86), SIMDE_FLOAT64_C( 549.54), SIMDE_FLOAT64_C( 15.14), SIMDE_FLOAT64_C( -2.64),
SIMDE_FLOAT64_C( 40.74), SIMDE_FLOAT64_C( 363.08), SIMDE_FLOAT64_C( 37.15), SIMDE_FLOAT64_C( 0.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_exp10_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_idivrem_epi32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i rem;
simde__m128i r;
} test_vec[8] = {
{ simde_mm_set_epi32(INT32_C( 1747596798), INT32_C(-2063703989), INT32_C( 527472553), INT32_C(-1403096998)),
simde_mm_set_epi32(INT32_C( -269879152), INT32_C( -177038436), INT32_C( 377180600), INT32_C( -518586410)),
simde_mm_set_epi32(INT32_C( 128321886), INT32_C( -116281193), INT32_C( 150291953), INT32_C( -365924178)),
simde_mm_set_epi32(INT32_C( -6), INT32_C( 11), INT32_C( 1), INT32_C( 2)) },
{ simde_mm_set_epi32(INT32_C( -374673026), INT32_C(-1240805178), INT32_C( 1568850865), INT32_C(-1142977539)),
simde_mm_set_epi32(INT32_C( 172780273), INT32_C( 168508556), INT32_C( -491358722), INT32_C( -230071737)),
simde_mm_set_epi32(INT32_C( -29112480), INT32_C( -61245286), INT32_C( 94774699), INT32_C( -222690591)),
simde_mm_set_epi32(INT32_C( -2), INT32_C( -7), INT32_C( -3), INT32_C( 4)) },
{ simde_mm_set_epi32(INT32_C( 1492341726), INT32_C( 298608154), INT32_C( 1250819173), INT32_C( -650971253)),
simde_mm_set_epi32(INT32_C( 298065861), INT32_C( -521585931), INT32_C( 330694282), INT32_C( 40997390)),
simde_mm_set_epi32(INT32_C( 2012421), INT32_C( 298608154), INT32_C( 258736327), INT32_C( -36010403)),
simde_mm_set_epi32(INT32_C( 5), INT32_C( 0), INT32_C( 3), INT32_C( -15)) },
{ simde_mm_set_epi32(INT32_C(-1586327268), INT32_C( 1691051285), INT32_C( 50347892), INT32_C( 728425428)),
simde_mm_set_epi32(INT32_C( -441202718), INT32_C( 294920921), INT32_C( -411581651), INT32_C( -167991823)),
simde_mm_set_epi32(INT32_C( -262719114), INT32_C( 216446680), INT32_C( 50347892), INT32_C( 56458136)),
simde_mm_set_epi32(INT32_C( 3), INT32_C( 5), INT32_C( 0), INT32_C( -4)) },
{ simde_mm_set_epi32(INT32_C( 492373082), INT32_C( -13096811), INT32_C(-2087181083), INT32_C( -341007878)),
simde_mm_set_epi32(INT32_C( 123290430), INT32_C( -298778955), INT32_C( 223555334), INT32_C( -332615043)),
simde_mm_set_epi32(INT32_C( 122501792), INT32_C( -13096811), INT32_C( -75183077), INT32_C( -8392835)),
simde_mm_set_epi32(INT32_C( 3), INT32_C( 0), INT32_C( -9), INT32_C( 1)) },
{ simde_mm_set_epi32(INT32_C(-1004264650), INT32_C( 1580565751), INT32_C( -471064457), INT32_C( 2081361826)),
simde_mm_set_epi32(INT32_C( 328620632), INT32_C( -324312655), INT32_C( -184752009), INT32_C( -354760000)),
simde_mm_set_epi32(INT32_C( -18402754), INT32_C( 283315131), INT32_C( -101560439), INT32_C( 307561826)),
simde_mm_set_epi32(INT32_C( -3), INT32_C( -4), INT32_C( 2), INT32_C( -5)) },
{ simde_mm_set_epi32(INT32_C( 542053192), INT32_C( 499863549), INT32_C( 957375358), INT32_C(-1291033589)),
simde_mm_set_epi32(INT32_C( 427537184), INT32_C( 493530770), INT32_C( -356091799), INT32_C( 29647056)),
simde_mm_set_epi32(INT32_C( 114516008), INT32_C( 6332779), INT32_C( 245191760), INT32_C( -16210181)),
simde_mm_set_epi32(INT32_C( 1), INT32_C( 1), INT32_C( -2), INT32_C( -43)) },
{ simde_mm_set_epi32(INT32_C( -193211433), INT32_C( -857989172), INT32_C( -448329300), INT32_C(-1601364212)),
simde_mm_set_epi32(INT32_C( -284723308), INT32_C( -171790410), INT32_C( 457043765), INT32_C( -97355006)),
simde_mm_set_epi32(INT32_C( -193211433), INT32_C( -170827532), INT32_C( -448329300), INT32_C( -43684116)),
simde_mm_set_epi32(INT32_C( 0), INT32_C( 4), INT32_C( 0), INT32_C( 16)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i rem;
simde__m128i r = simde_mm_idivrem_epi32(&rem, test_vec[i].a, test_vec[i].b);
simde_assert_m128i_i32(r, ==, test_vec[i].r);
simde_assert_m128i_i32(rem, ==, test_vec[i].rem);
}
return 0;
}
static int
test_simde_mm256_idivrem_epi32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i rem;
simde__m256i r;
} test_vec[8] = {
{ simde_mm256_set_epi32(INT32_C(-1079516608), INT32_C( -708153743), INT32_C( 1508722402), INT32_C(-2074345640),
INT32_C( 1747596798), INT32_C(-2063703989), INT32_C( 527472553), INT32_C(-1403096998)),
simde_mm256_set_epi32(INT32_C( 172780273), INT32_C( 168508556), INT32_C( -491358722), INT32_C( -230071737),
INT32_C( -93668257), INT32_C( -310201295), INT32_C( 392212716), INT32_C( -285744385)),
simde_mm256_set_epi32(INT32_C( -42834970), INT32_C( -34119519), INT32_C( 34646236), INT32_C( -3700007),
INT32_C( 61568172), INT32_C( -202496219), INT32_C( 135259837), INT32_C( -260119458)),
simde_mm256_set_epi32(INT32_C( -6), INT32_C( -4), INT32_C( -3), INT32_C( 9),
INT32_C( -18), INT32_C( 6), INT32_C( 1), INT32_C( 4)) },
{ simde_mm256_set_epi32(INT32_C( 1192263444), INT32_C(-2086343723), INT32_C( 1322777130), INT32_C( 163989560),
INT32_C( 1492341726), INT32_C( 298608154), INT32_C( 1250819173), INT32_C( -650971253)),
simde_mm256_set_epi32(INT32_C( -441202718), INT32_C( 294920921), INT32_C( -411581651), INT32_C( -167991823),
INT32_C( -396581817), INT32_C( 422762821), INT32_C( 12586973), INT32_C( 182106357)),
simde_mm256_set_epi32(INT32_C( 309858008), INT32_C( -21897276), INT32_C( 88032177), INT32_C( 163989560),
INT32_C( 302596275), INT32_C( 298608154), INT32_C( 4708846), INT32_C( -104652182)),
simde_mm256_set_epi32(INT32_C( -2), INT32_C( -7), INT32_C( -3), INT32_C( 0),
INT32_C( -3), INT32_C( 0), INT32_C( 99), INT32_C( -3)) },
{ simde_mm256_set_epi32(INT32_C( 493161721), INT32_C(-1195115819), INT32_C( 894221337), INT32_C(-1330460172),
INT32_C( 492373082), INT32_C( -13096811), INT32_C(-2087181083), INT32_C( -341007878)),
simde_mm256_set_epi32(INT32_C( 328620632), INT32_C( -324312655), INT32_C( -184752009), INT32_C( -354760000),
INT32_C( -251066163), INT32_C( 395141437), INT32_C( -117766115), INT32_C( 520340456)),
simde_mm256_set_epi32(INT32_C( 164541089), INT32_C( -222177854), INT32_C( 155213301), INT32_C( -266180172),
INT32_C( 241306919), INT32_C( -13096811), INT32_C( -85157128), INT32_C( -341007878)),
simde_mm256_set_epi32(INT32_C( 1), INT32_C( 3), INT32_C( -4), INT32_C( 3),
INT32_C( -1), INT32_C( 0), INT32_C( 17), INT32_C( 0)) },
{ simde_mm256_set_epi32(INT32_C( 1710148738), INT32_C( 1974123080), INT32_C(-1424367196), INT32_C( 118588227),
INT32_C( 542053192), INT32_C( 499863549), INT32_C( 957375358), INT32_C(-1291033589)),
simde_mm256_set_epi32(INT32_C( -284723308), INT32_C( -171790410), INT32_C( 457043765), INT32_C( -97355006),
INT32_C( -48302859), INT32_C( -214497293), INT32_C( -112082325), INT32_C( -400341053)),
simde_mm256_set_epi32(INT32_C( 1808890), INT32_C( 84428570), INT32_C( -53235901), INT32_C( 21233221),
INT32_C( 10721743), INT32_C( 70868963), INT32_C( 60716758), INT32_C( -90010430)),
simde_mm256_set_epi32(INT32_C( -6), INT32_C( -11), INT32_C( -3), INT32_C( -1),
INT32_C( -11), INT32_C( -2), INT32_C( -8), INT32_C( 3)) },
{ simde_mm256_set_epi32(INT32_C( 1734496959), INT32_C( 380846712), INT32_C( -941967689), INT32_C( -739443621),
INT32_C( 1995198557), INT32_C( -980655097), INT32_C(-1888383043), INT32_C( 1779168063)),
simde_mm256_set_epi32(INT32_C( 440775120), INT32_C( -129501140), INT32_C( -362589725), INT32_C( -352466550),
INT32_C( 67477586), INT32_C( 108492873), INT32_C( 360489056), INT32_C( 254567893)),
simde_mm256_set_epi32(INT32_C( 412171599), INT32_C( 121844432), INT32_C( -216788239), INT32_C( -34510521),
INT32_C( 38348563), INT32_C( -4219240), INT32_C( -85937763), INT32_C( 251760705)),
simde_mm256_set_epi32(INT32_C( 3), INT32_C( -2), INT32_C( 2), INT32_C( 2),
INT32_C( 29), INT32_C( -9), INT32_C( -5), INT32_C( 6)) },
{ simde_mm256_set_epi32(INT32_C( -362876916), INT32_C(-1845390533), INT32_C( -48621016), INT32_C( 201516689),
INT32_C(-1435930720), INT32_C(-1932876068), INT32_C(-1153303869), INT32_C( 562234020)),
simde_mm256_set_epi32(INT32_C( -166366311), INT32_C( -85548959), INT32_C( 525546139), INT32_C( 219277873),
INT32_C( 295872976), INT32_C( -144152745), INT32_C( -265329050), INT32_C( -202024350)),
simde_mm256_set_epi32(INT32_C( -30144294), INT32_C( -48862394), INT32_C( -48621016), INT32_C( 201516689),
INT32_C( -252438816), INT32_C( -58890383), INT32_C( -91987669), INT32_C( 158185320)),
simde_mm256_set_epi32(INT32_C( 2), INT32_C( 21), INT32_C( 0), INT32_C( 0),
INT32_C( -4), INT32_C( 13), INT32_C( 4), INT32_C( -2)) },
{ simde_mm256_set_epi32(INT32_C( 910061584), INT32_C( 2002226944), INT32_C( -621963189), INT32_C( -48343218),
INT32_C( 523093293), INT32_C(-1235205724), INT32_C(-2088961787), INT32_C( 1943141679)),
simde_mm256_set_epi32(INT32_C( 123967721), INT32_C( -95531607), INT32_C( 228811177), INT32_C( 1270356),
INT32_C( 355625346), INT32_C( -40994931), INT32_C( -379225067), INT32_C( 124491394)),
simde_mm256_set_epi32(INT32_C( 42287537), INT32_C( 91594804), INT32_C( -164340835), INT32_C( -69690),
INT32_C( 167467947), INT32_C( -5357794), INT32_C( -192836452), INT32_C( 75770769)),
simde_mm256_set_epi32(INT32_C( 7), INT32_C( -20), INT32_C( -2), INT32_C( -38),
INT32_C( 1), INT32_C( 30), INT32_C( 5), INT32_C( 15)) },
{ simde_mm256_set_epi32(INT32_C( 1755684145), INT32_C(-2061726371), INT32_C(-1050443653), INT32_C(-1299940555),
INT32_C(-2116696545), INT32_C( 1493088054), INT32_C( -179829877), INT32_C( 651362699)),
simde_mm256_set_epi32(INT32_C( 301617823), INT32_C( 343728879), INT32_C( 132913279), INT32_C( 518796827),
INT32_C( -36154638), INT32_C( -532966429), INT32_C( 361195763), INT32_C( 469656308)),
simde_mm256_set_epi32(INT32_C( 247595030), INT32_C( -343081976), INT32_C( -120050700), INT32_C( -262346901),
INT32_C( -19727541), INT32_C( 427155196), INT32_C( -179829877), INT32_C( 181706391)),
simde_mm256_set_epi32(INT32_C( 5), INT32_C( -5), INT32_C( -7), INT32_C( -2),
INT32_C( 58), INT32_C( -2), INT32_C( 0), INT32_C( 1)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i rem;
simde__m256i r = simde_mm256_idivrem_epi32(&rem, test_vec[i].a, test_vec[i].b);
simde_assert_m256i_i32(r, ==, test_vec[i].r);
simde_assert_m256i_i32(rem, ==, test_vec[i].rem);
}
return 0;
}
static int
test_simde_mm_hypot_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 b[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 492.01), SIMDE_FLOAT32_C( 211.12), SIMDE_FLOAT32_C( 12.31), SIMDE_FLOAT32_C( 870.52) },
{ SIMDE_FLOAT32_C( -363.60), SIMDE_FLOAT32_C( 789.00), SIMDE_FLOAT32_C( 397.20), SIMDE_FLOAT32_C( -757.25) },
{ SIMDE_FLOAT32_C( 611.78), SIMDE_FLOAT32_C( 816.76), SIMDE_FLOAT32_C( 397.39), SIMDE_FLOAT32_C( 1153.79) } },
{ { SIMDE_FLOAT32_C( -192.59), SIMDE_FLOAT32_C( -586.23), SIMDE_FLOAT32_C( 571.12), SIMDE_FLOAT32_C( -717.05) },
{ SIMDE_FLOAT32_C( -663.78), SIMDE_FLOAT32_C( 66.94), SIMDE_FLOAT32_C( -412.69), SIMDE_FLOAT32_C( -769.47) },
{ SIMDE_FLOAT32_C( 691.15), SIMDE_FLOAT32_C( 590.04), SIMDE_FLOAT32_C( 704.62), SIMDE_FLOAT32_C( 1051.78) } },
{ { SIMDE_FLOAT32_C( -594.99), SIMDE_FLOAT32_C( -442.39), SIMDE_FLOAT32_C( -303.17), SIMDE_FLOAT32_C( 275.57) },
{ SIMDE_FLOAT32_C( 293.68), SIMDE_FLOAT32_C( 44.26), SIMDE_FLOAT32_C( -780.93), SIMDE_FLOAT32_C( -309.10) },
{ SIMDE_FLOAT32_C( 663.52), SIMDE_FLOAT32_C( 444.60), SIMDE_FLOAT32_C( 837.71), SIMDE_FLOAT32_C( 414.10) } },
{ { SIMDE_FLOAT32_C( -878.78), SIMDE_FLOAT32_C( -647.94), SIMDE_FLOAT32_C( 445.74), SIMDE_FLOAT32_C( 697.72) },
{ SIMDE_FLOAT32_C( 98.72), SIMDE_FLOAT32_C( -787.29), SIMDE_FLOAT32_C( -3.77), SIMDE_FLOAT32_C( -409.27) },
{ SIMDE_FLOAT32_C( 884.31), SIMDE_FLOAT32_C( 1019.63), SIMDE_FLOAT32_C( 445.76), SIMDE_FLOAT32_C( 808.90) } },
{ { SIMDE_FLOAT32_C( 423.83), SIMDE_FLOAT32_C( -991.46), SIMDE_FLOAT32_C( -538.75), SIMDE_FLOAT32_C( -939.77) },
{ SIMDE_FLOAT32_C( 797.54), SIMDE_FLOAT32_C( 858.45), SIMDE_FLOAT32_C( -697.02), SIMDE_FLOAT32_C( -395.04) },
{ SIMDE_FLOAT32_C( 903.16), SIMDE_FLOAT32_C( 1311.46), SIMDE_FLOAT32_C( 880.96), SIMDE_FLOAT32_C( 1019.42) } },
{ { SIMDE_FLOAT32_C( -727.78), SIMDE_FLOAT32_C( 874.10), SIMDE_FLOAT32_C( -112.10), SIMDE_FLOAT32_C( -391.56) },
{ SIMDE_FLOAT32_C( -58.96), SIMDE_FLOAT32_C( 475.22), SIMDE_FLOAT32_C( -161.04), SIMDE_FLOAT32_C( 346.05) },
{ SIMDE_FLOAT32_C( 730.16), SIMDE_FLOAT32_C( 994.93), SIMDE_FLOAT32_C( 196.21), SIMDE_FLOAT32_C( 522.56) } },
{ { SIMDE_FLOAT32_C( -967.17), SIMDE_FLOAT32_C( 535.80), SIMDE_FLOAT32_C( -378.38), SIMDE_FLOAT32_C( 326.51) },
{ SIMDE_FLOAT32_C( -419.95), SIMDE_FLOAT32_C( -159.32), SIMDE_FLOAT32_C( -982.59), SIMDE_FLOAT32_C( -298.72) },
{ SIMDE_FLOAT32_C( 1054.41), SIMDE_FLOAT32_C( 558.99), SIMDE_FLOAT32_C( 1052.93), SIMDE_FLOAT32_C( 442.54) } },
{ { SIMDE_FLOAT32_C( 192.74), SIMDE_FLOAT32_C( 463.15), SIMDE_FLOAT32_C( -601.00), SIMDE_FLOAT32_C( -708.54) },
{ SIMDE_FLOAT32_C( 675.86), SIMDE_FLOAT32_C( 395.23), SIMDE_FLOAT32_C( -117.81), SIMDE_FLOAT32_C( 99.70) },
{ SIMDE_FLOAT32_C( 702.81), SIMDE_FLOAT32_C( 608.86), SIMDE_FLOAT32_C( 612.44), SIMDE_FLOAT32_C( 715.52) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 b = simde_mm_loadu_ps(test_vec[i].b);
simde__m128 r = simde_mm_hypot_ps(a, b);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_hypot_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 b[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -603.93), SIMDE_FLOAT64_C( 656.94) },
{ SIMDE_FLOAT64_C( 263.86), SIMDE_FLOAT64_C( -668.26) },
{ SIMDE_FLOAT64_C( 659.06), SIMDE_FLOAT64_C( 937.09) } },
{ { SIMDE_FLOAT64_C( -573.72), SIMDE_FLOAT64_C( 127.62) },
{ SIMDE_FLOAT64_C( -494.33), SIMDE_FLOAT64_C( 413.83) },
{ SIMDE_FLOAT64_C( 757.31), SIMDE_FLOAT64_C( 433.06) } },
{ { SIMDE_FLOAT64_C( 92.50), SIMDE_FLOAT64_C( 179.32) },
{ SIMDE_FLOAT64_C( -379.77), SIMDE_FLOAT64_C( 381.33) },
{ SIMDE_FLOAT64_C( 390.87), SIMDE_FLOAT64_C( 421.39) } },
{ { SIMDE_FLOAT64_C( 344.30), SIMDE_FLOAT64_C( 576.77) },
{ SIMDE_FLOAT64_C( -663.77), SIMDE_FLOAT64_C( 656.74) },
{ SIMDE_FLOAT64_C( 747.75), SIMDE_FLOAT64_C( 874.05) } },
{ { SIMDE_FLOAT64_C( 499.56), SIMDE_FLOAT64_C( 761.69) },
{ SIMDE_FLOAT64_C( -752.98), SIMDE_FLOAT64_C( -522.11) },
{ SIMDE_FLOAT64_C( 903.63), SIMDE_FLOAT64_C( 923.46) } },
{ { SIMDE_FLOAT64_C( 242.72), SIMDE_FLOAT64_C( 412.75) },
{ SIMDE_FLOAT64_C( -101.50), SIMDE_FLOAT64_C( 96.94) },
{ SIMDE_FLOAT64_C( 263.09), SIMDE_FLOAT64_C( 423.98) } },
{ { SIMDE_FLOAT64_C( -934.53), SIMDE_FLOAT64_C( -147.86) },
{ SIMDE_FLOAT64_C( -959.33), SIMDE_FLOAT64_C( 790.23) },
{ SIMDE_FLOAT64_C( 1339.28), SIMDE_FLOAT64_C( 803.94) } },
{ { SIMDE_FLOAT64_C( 239.33), SIMDE_FLOAT64_C( -100.41) },
{ SIMDE_FLOAT64_C( -270.12), SIMDE_FLOAT64_C( 635.40) },
{ SIMDE_FLOAT64_C( 360.89), SIMDE_FLOAT64_C( 643.28) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d b = simde_mm_loadu_pd(test_vec[i].b);
simde__m128d r = simde_mm_hypot_pd(a, b);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_hypot_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 b[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -777.18), SIMDE_FLOAT32_C( 159.63), SIMDE_FLOAT32_C( 756.34), SIMDE_FLOAT32_C( -76.33),
SIMDE_FLOAT32_C( 113.08), SIMDE_FLOAT32_C( 246.24), SIMDE_FLOAT32_C( 841.85), SIMDE_FLOAT32_C( -845.53) },
{ SIMDE_FLOAT32_C( -621.65), SIMDE_FLOAT32_C( 72.13), SIMDE_FLOAT32_C( 721.27), SIMDE_FLOAT32_C( -427.76),
SIMDE_FLOAT32_C( -945.55), SIMDE_FLOAT32_C( -213.25), SIMDE_FLOAT32_C( -603.55), SIMDE_FLOAT32_C( 373.40) },
{ SIMDE_FLOAT32_C( 995.22), SIMDE_FLOAT32_C( 175.17), SIMDE_FLOAT32_C( 1045.12), SIMDE_FLOAT32_C( 434.52),
SIMDE_FLOAT32_C( 952.29), SIMDE_FLOAT32_C( 325.74), SIMDE_FLOAT32_C( 1035.85), SIMDE_FLOAT32_C( 924.31) } },
{ { SIMDE_FLOAT32_C( -731.26), SIMDE_FLOAT32_C( -820.00), SIMDE_FLOAT32_C( 393.03), SIMDE_FLOAT32_C( -720.80),
SIMDE_FLOAT32_C( -923.20), SIMDE_FLOAT32_C( -65.81), SIMDE_FLOAT32_C( -541.82), SIMDE_FLOAT32_C( -812.46) },
{ SIMDE_FLOAT32_C( 833.72), SIMDE_FLOAT32_C( -217.64), SIMDE_FLOAT32_C( 806.57), SIMDE_FLOAT32_C( -582.91),
SIMDE_FLOAT32_C( 620.23), SIMDE_FLOAT32_C( -724.63), SIMDE_FLOAT32_C( 373.46), SIMDE_FLOAT32_C( 843.05) },
{ SIMDE_FLOAT32_C( 1108.98), SIMDE_FLOAT32_C( 848.39), SIMDE_FLOAT32_C( 897.23), SIMDE_FLOAT32_C( 927.00),
SIMDE_FLOAT32_C( 1112.20), SIMDE_FLOAT32_C( 727.61), SIMDE_FLOAT32_C( 658.06), SIMDE_FLOAT32_C( 1170.82) } },
{ { SIMDE_FLOAT32_C( 435.00), SIMDE_FLOAT32_C( 129.80), SIMDE_FLOAT32_C( -233.28), SIMDE_FLOAT32_C( -451.92),
SIMDE_FLOAT32_C( -623.96), SIMDE_FLOAT32_C( -391.43), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( -245.61) },
{ SIMDE_FLOAT32_C( 680.70), SIMDE_FLOAT32_C( -576.18), SIMDE_FLOAT32_C( 326.63), SIMDE_FLOAT32_C( 735.15),
SIMDE_FLOAT32_C( 210.56), SIMDE_FLOAT32_C( 723.09), SIMDE_FLOAT32_C( 108.56), SIMDE_FLOAT32_C( 479.30) },
{ SIMDE_FLOAT32_C( 807.82), SIMDE_FLOAT32_C( 590.62), SIMDE_FLOAT32_C( 401.38), SIMDE_FLOAT32_C( 862.95),
SIMDE_FLOAT32_C( 658.53), SIMDE_FLOAT32_C( 822.24), SIMDE_FLOAT32_C( 316.64), SIMDE_FLOAT32_C( 538.57) } },
{ { SIMDE_FLOAT32_C( 903.09), SIMDE_FLOAT32_C( -498.41), SIMDE_FLOAT32_C( 758.50), SIMDE_FLOAT32_C( 979.89),
SIMDE_FLOAT32_C( 435.78), SIMDE_FLOAT32_C( -783.32), SIMDE_FLOAT32_C( -832.57), SIMDE_FLOAT32_C( 269.50) },
{ SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( 973.99), SIMDE_FLOAT32_C( 686.59), SIMDE_FLOAT32_C( -380.74),
SIMDE_FLOAT32_C( -750.64), SIMDE_FLOAT32_C( 60.05), SIMDE_FLOAT32_C( -537.69), SIMDE_FLOAT32_C( 684.36) },
{ SIMDE_FLOAT32_C( 903.09), SIMDE_FLOAT32_C( 1094.11), SIMDE_FLOAT32_C( 1023.10), SIMDE_FLOAT32_C( 1051.26),
SIMDE_FLOAT32_C( 867.97), SIMDE_FLOAT32_C( 785.62), SIMDE_FLOAT32_C( 991.10), SIMDE_FLOAT32_C( 735.51) } },
{ { SIMDE_FLOAT32_C( -810.16), SIMDE_FLOAT32_C( 229.03), SIMDE_FLOAT32_C( -767.56), SIMDE_FLOAT32_C( -434.12),
SIMDE_FLOAT32_C( 837.60), SIMDE_FLOAT32_C( -65.02), SIMDE_FLOAT32_C( 320.28), SIMDE_FLOAT32_C( 518.30) },
{ SIMDE_FLOAT32_C( 358.80), SIMDE_FLOAT32_C( -353.09), SIMDE_FLOAT32_C( 253.45), SIMDE_FLOAT32_C( -430.64),
SIMDE_FLOAT32_C( -630.00), SIMDE_FLOAT32_C( -637.99), SIMDE_FLOAT32_C( -951.34), SIMDE_FLOAT32_C( -726.92) },
{ SIMDE_FLOAT32_C( 886.06), SIMDE_FLOAT32_C( 420.86), SIMDE_FLOAT32_C( 808.32), SIMDE_FLOAT32_C( 611.48),
SIMDE_FLOAT32_C( 1048.08), SIMDE_FLOAT32_C( 641.29), SIMDE_FLOAT32_C( 1003.81), SIMDE_FLOAT32_C( 892.78) } },
{ { SIMDE_FLOAT32_C( -136.40), SIMDE_FLOAT32_C( 807.17), SIMDE_FLOAT32_C( -747.03), SIMDE_FLOAT32_C( -700.62),
SIMDE_FLOAT32_C( -976.15), SIMDE_FLOAT32_C( -579.60), SIMDE_FLOAT32_C( 568.87), SIMDE_FLOAT32_C( 22.88) },
{ SIMDE_FLOAT32_C( -605.60), SIMDE_FLOAT32_C( 255.46), SIMDE_FLOAT32_C( 642.15), SIMDE_FLOAT32_C( -356.24),
SIMDE_FLOAT32_C( -684.50), SIMDE_FLOAT32_C( -895.54), SIMDE_FLOAT32_C( -671.88), SIMDE_FLOAT32_C( -494.65) },
{ SIMDE_FLOAT32_C( 620.77), SIMDE_FLOAT32_C( 846.63), SIMDE_FLOAT32_C( 985.09), SIMDE_FLOAT32_C( 785.99),
SIMDE_FLOAT32_C( 1192.23), SIMDE_FLOAT32_C( 1066.74), SIMDE_FLOAT32_C( 880.36), SIMDE_FLOAT32_C( 495.18) } },
{ { SIMDE_FLOAT32_C( 333.49), SIMDE_FLOAT32_C( -439.45), SIMDE_FLOAT32_C( 71.23), SIMDE_FLOAT32_C( 171.09),
SIMDE_FLOAT32_C( 495.54), SIMDE_FLOAT32_C( -608.49), SIMDE_FLOAT32_C( -310.61), SIMDE_FLOAT32_C( -145.66) },
{ SIMDE_FLOAT32_C( 38.42), SIMDE_FLOAT32_C( 942.84), SIMDE_FLOAT32_C( 423.70), SIMDE_FLOAT32_C( 408.42),
SIMDE_FLOAT32_C( -695.15), SIMDE_FLOAT32_C( 472.36), SIMDE_FLOAT32_C( 681.50), SIMDE_FLOAT32_C( 168.45) },
{ SIMDE_FLOAT32_C( 335.70), SIMDE_FLOAT32_C( 1040.22), SIMDE_FLOAT32_C( 429.65), SIMDE_FLOAT32_C( 442.81),
SIMDE_FLOAT32_C( 853.69), SIMDE_FLOAT32_C( 770.31), SIMDE_FLOAT32_C( 748.95), SIMDE_FLOAT32_C( 222.69) } },
{ { SIMDE_FLOAT32_C( 279.53), SIMDE_FLOAT32_C( 934.47), SIMDE_FLOAT32_C( 467.83), SIMDE_FLOAT32_C( 303.38),
SIMDE_FLOAT32_C( -645.12), SIMDE_FLOAT32_C( 36.70), SIMDE_FLOAT32_C( -673.74), SIMDE_FLOAT32_C( -250.73) },
{ SIMDE_FLOAT32_C( -707.84), SIMDE_FLOAT32_C( 968.41), SIMDE_FLOAT32_C( 393.03), SIMDE_FLOAT32_C( -392.34),
SIMDE_FLOAT32_C( -927.14), SIMDE_FLOAT32_C( 721.15), SIMDE_FLOAT32_C( 113.01), SIMDE_FLOAT32_C( 406.35) },
{ SIMDE_FLOAT32_C( 761.04), SIMDE_FLOAT32_C( 1345.75), SIMDE_FLOAT32_C( 611.01), SIMDE_FLOAT32_C( 495.95),
SIMDE_FLOAT32_C( 1129.50), SIMDE_FLOAT32_C( 722.08), SIMDE_FLOAT32_C( 683.15), SIMDE_FLOAT32_C( 477.48) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 b = simde_mm256_loadu_ps(test_vec[i].b);
simde__m256 r = simde_mm256_hypot_ps(a, b);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_hypot_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 b[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -477.45), SIMDE_FLOAT64_C( 593.44), SIMDE_FLOAT64_C( 978.32), SIMDE_FLOAT64_C( -584.34) },
{ SIMDE_FLOAT64_C( 161.70), SIMDE_FLOAT64_C( -36.27), SIMDE_FLOAT64_C( 750.17), SIMDE_FLOAT64_C( -655.19) },
{ SIMDE_FLOAT64_C( 504.09), SIMDE_FLOAT64_C( 594.55), SIMDE_FLOAT64_C( 1232.83), SIMDE_FLOAT64_C( 877.91) } },
{ { SIMDE_FLOAT64_C( -840.17), SIMDE_FLOAT64_C( -429.90), SIMDE_FLOAT64_C( 790.20), SIMDE_FLOAT64_C( -18.28) },
{ SIMDE_FLOAT64_C( 964.56), SIMDE_FLOAT64_C( 136.47), SIMDE_FLOAT64_C( 164.17), SIMDE_FLOAT64_C( 892.62) },
{ SIMDE_FLOAT64_C( 1279.16), SIMDE_FLOAT64_C( 451.04), SIMDE_FLOAT64_C( 807.07), SIMDE_FLOAT64_C( 892.81) } },
{ { SIMDE_FLOAT64_C( 115.18), SIMDE_FLOAT64_C( 353.33), SIMDE_FLOAT64_C( -41.82), SIMDE_FLOAT64_C( 836.90) },
{ SIMDE_FLOAT64_C( 325.83), SIMDE_FLOAT64_C( 174.90), SIMDE_FLOAT64_C( -541.27), SIMDE_FLOAT64_C( -977.07) },
{ SIMDE_FLOAT64_C( 345.59), SIMDE_FLOAT64_C( 394.25), SIMDE_FLOAT64_C( 542.88), SIMDE_FLOAT64_C( 1286.49) } },
{ { SIMDE_FLOAT64_C( 604.56), SIMDE_FLOAT64_C( 980.27), SIMDE_FLOAT64_C( 536.46), SIMDE_FLOAT64_C( 153.38) },
{ SIMDE_FLOAT64_C( -931.38), SIMDE_FLOAT64_C( -178.15), SIMDE_FLOAT64_C( -619.34), SIMDE_FLOAT64_C( -408.83) },
{ SIMDE_FLOAT64_C( 1110.39), SIMDE_FLOAT64_C( 996.33), SIMDE_FLOAT64_C( 819.37), SIMDE_FLOAT64_C( 436.65) } },
{ { SIMDE_FLOAT64_C( -584.72), SIMDE_FLOAT64_C( -641.02), SIMDE_FLOAT64_C( 6.83), SIMDE_FLOAT64_C( 576.98) },
{ SIMDE_FLOAT64_C( 322.71), SIMDE_FLOAT64_C( -242.99), SIMDE_FLOAT64_C( 921.80), SIMDE_FLOAT64_C( 482.53) },
{ SIMDE_FLOAT64_C( 667.86), SIMDE_FLOAT64_C( 685.53), SIMDE_FLOAT64_C( 921.83), SIMDE_FLOAT64_C( 752.16) } },
{ { SIMDE_FLOAT64_C( 327.10), SIMDE_FLOAT64_C( 712.00), SIMDE_FLOAT64_C( -535.75), SIMDE_FLOAT64_C( 291.66) },
{ SIMDE_FLOAT64_C( -151.54), SIMDE_FLOAT64_C( 628.42), SIMDE_FLOAT64_C( 184.28), SIMDE_FLOAT64_C( 963.64) },
{ SIMDE_FLOAT64_C( 360.50), SIMDE_FLOAT64_C( 949.66), SIMDE_FLOAT64_C( 566.56), SIMDE_FLOAT64_C( 1006.81) } },
{ { SIMDE_FLOAT64_C( -18.25), SIMDE_FLOAT64_C( -857.54), SIMDE_FLOAT64_C( 800.54), SIMDE_FLOAT64_C( -692.42) },
{ SIMDE_FLOAT64_C( 317.36), SIMDE_FLOAT64_C( -740.72), SIMDE_FLOAT64_C( -669.48), SIMDE_FLOAT64_C( -78.07) },
{ SIMDE_FLOAT64_C( 317.88), SIMDE_FLOAT64_C( 1133.16), SIMDE_FLOAT64_C( 1043.58), SIMDE_FLOAT64_C( 696.81) } },
{ { SIMDE_FLOAT64_C( -760.45), SIMDE_FLOAT64_C( 866.98), SIMDE_FLOAT64_C( -924.70), SIMDE_FLOAT64_C( -691.83) },
{ SIMDE_FLOAT64_C( -311.18), SIMDE_FLOAT64_C( -544.04), SIMDE_FLOAT64_C( -100.66), SIMDE_FLOAT64_C( 104.10) },
{ SIMDE_FLOAT64_C( 821.66), SIMDE_FLOAT64_C( 1023.54), SIMDE_FLOAT64_C( 930.16), SIMDE_FLOAT64_C( 699.62) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d b = simde_mm256_loadu_pd(test_vec[i].b);
simde__m256d r = simde_mm256_hypot_pd(a, b);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_hypot_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 b[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -926.16), SIMDE_FLOAT32_C( -45.76), SIMDE_FLOAT32_C( 907.70), SIMDE_FLOAT32_C( -928.37),
SIMDE_FLOAT32_C( 496.55), SIMDE_FLOAT32_C( 566.66), SIMDE_FLOAT32_C( -501.51), SIMDE_FLOAT32_C( -575.98),
SIMDE_FLOAT32_C( -281.74), SIMDE_FLOAT32_C( -821.54), SIMDE_FLOAT32_C( 198.58), SIMDE_FLOAT32_C( -379.20),
SIMDE_FLOAT32_C( 104.18), SIMDE_FLOAT32_C( -675.07), SIMDE_FLOAT32_C( -169.00), SIMDE_FLOAT32_C( 502.70) },
{ SIMDE_FLOAT32_C( 7.03), SIMDE_FLOAT32_C( -875.48), SIMDE_FLOAT32_C( -451.63), SIMDE_FLOAT32_C( -815.00),
SIMDE_FLOAT32_C( 37.83), SIMDE_FLOAT32_C( -588.92), SIMDE_FLOAT32_C( -905.87), SIMDE_FLOAT32_C( -49.63),
SIMDE_FLOAT32_C( 813.22), SIMDE_FLOAT32_C( -962.83), SIMDE_FLOAT32_C( -486.45), SIMDE_FLOAT32_C( -367.13),
SIMDE_FLOAT32_C( -242.02), SIMDE_FLOAT32_C( 475.59), SIMDE_FLOAT32_C( -31.20), SIMDE_FLOAT32_C( -168.18) },
{ SIMDE_FLOAT32_C( 926.19), SIMDE_FLOAT32_C( 876.68), SIMDE_FLOAT32_C( 1013.85), SIMDE_FLOAT32_C( 1235.35),
SIMDE_FLOAT32_C( 497.99), SIMDE_FLOAT32_C( 817.27), SIMDE_FLOAT32_C( 1035.43), SIMDE_FLOAT32_C( 578.11),
SIMDE_FLOAT32_C( 860.64), SIMDE_FLOAT32_C( 1265.69), SIMDE_FLOAT32_C( 525.42), SIMDE_FLOAT32_C( 527.80),
SIMDE_FLOAT32_C( 263.49), SIMDE_FLOAT32_C( 825.78), SIMDE_FLOAT32_C( 171.86), SIMDE_FLOAT32_C( 530.09) } },
{ { SIMDE_FLOAT32_C( -570.17), SIMDE_FLOAT32_C( -123.51), SIMDE_FLOAT32_C( -96.55), SIMDE_FLOAT32_C( 926.38),
SIMDE_FLOAT32_C( -556.85), SIMDE_FLOAT32_C( 401.94), SIMDE_FLOAT32_C( -649.60), SIMDE_FLOAT32_C( 161.41),
SIMDE_FLOAT32_C( 580.39), SIMDE_FLOAT32_C( 548.98), SIMDE_FLOAT32_C( 782.21), SIMDE_FLOAT32_C( -315.43),
SIMDE_FLOAT32_C( 873.91), SIMDE_FLOAT32_C( -386.79), SIMDE_FLOAT32_C( -812.72), SIMDE_FLOAT32_C( -119.05) },
{ SIMDE_FLOAT32_C( -262.27), SIMDE_FLOAT32_C( -264.35), SIMDE_FLOAT32_C( 65.94), SIMDE_FLOAT32_C( 775.56),
SIMDE_FLOAT32_C( 146.72), SIMDE_FLOAT32_C( 160.08), SIMDE_FLOAT32_C( -274.07), SIMDE_FLOAT32_C( -40.05),
SIMDE_FLOAT32_C( 197.24), SIMDE_FLOAT32_C( 239.47), SIMDE_FLOAT32_C( 592.82), SIMDE_FLOAT32_C( 955.23),
SIMDE_FLOAT32_C( -284.94), SIMDE_FLOAT32_C( -438.38), SIMDE_FLOAT32_C( -212.95), SIMDE_FLOAT32_C( 144.89) },
{ SIMDE_FLOAT32_C( 627.60), SIMDE_FLOAT32_C( 291.78), SIMDE_FLOAT32_C( 116.92), SIMDE_FLOAT32_C( 1208.17),
SIMDE_FLOAT32_C( 575.85), SIMDE_FLOAT32_C( 432.64), SIMDE_FLOAT32_C( 705.05), SIMDE_FLOAT32_C( 166.30),
SIMDE_FLOAT32_C( 612.99), SIMDE_FLOAT32_C( 598.94), SIMDE_FLOAT32_C( 981.47), SIMDE_FLOAT32_C( 1005.96),
SIMDE_FLOAT32_C( 919.19), SIMDE_FLOAT32_C( 584.62), SIMDE_FLOAT32_C( 840.16), SIMDE_FLOAT32_C( 187.53) } },
{ { SIMDE_FLOAT32_C( 438.11), SIMDE_FLOAT32_C( 690.50), SIMDE_FLOAT32_C( 71.27), SIMDE_FLOAT32_C( 881.27),
SIMDE_FLOAT32_C( 92.44), SIMDE_FLOAT32_C( 421.67), SIMDE_FLOAT32_C( 42.68), SIMDE_FLOAT32_C( -327.17),
SIMDE_FLOAT32_C( -29.36), SIMDE_FLOAT32_C( -175.11), SIMDE_FLOAT32_C( 357.41), SIMDE_FLOAT32_C( -155.45),
SIMDE_FLOAT32_C( 438.11), SIMDE_FLOAT32_C( 544.68), SIMDE_FLOAT32_C( 725.50), SIMDE_FLOAT32_C( -824.16) },
{ SIMDE_FLOAT32_C( -719.67), SIMDE_FLOAT32_C( -208.56), SIMDE_FLOAT32_C( 951.40), SIMDE_FLOAT32_C( 427.05),
SIMDE_FLOAT32_C( 951.52), SIMDE_FLOAT32_C( -322.67), SIMDE_FLOAT32_C( -613.00), SIMDE_FLOAT32_C( 148.76),
SIMDE_FLOAT32_C( 916.80), SIMDE_FLOAT32_C( 979.82), SIMDE_FLOAT32_C( 103.99), SIMDE_FLOAT32_C( -368.15),
SIMDE_FLOAT32_C( -458.56), SIMDE_FLOAT32_C( 891.04), SIMDE_FLOAT32_C( 776.74), SIMDE_FLOAT32_C( 979.55) },
{ SIMDE_FLOAT32_C( 842.54), SIMDE_FLOAT32_C( 721.31), SIMDE_FLOAT32_C( 954.07), SIMDE_FLOAT32_C( 979.29),
SIMDE_FLOAT32_C( 956.00), SIMDE_FLOAT32_C( 530.96), SIMDE_FLOAT32_C( 614.48), SIMDE_FLOAT32_C( 359.40),
SIMDE_FLOAT32_C( 917.27), SIMDE_FLOAT32_C( 995.34), SIMDE_FLOAT32_C( 372.23), SIMDE_FLOAT32_C( 399.62),
SIMDE_FLOAT32_C( 634.21), SIMDE_FLOAT32_C( 1044.33), SIMDE_FLOAT32_C( 1062.86), SIMDE_FLOAT32_C( 1280.14) } },
{ { SIMDE_FLOAT32_C( 581.54), SIMDE_FLOAT32_C( -151.99), SIMDE_FLOAT32_C( 860.81), SIMDE_FLOAT32_C( -326.03),
SIMDE_FLOAT32_C( -730.33), SIMDE_FLOAT32_C( -96.51), SIMDE_FLOAT32_C( 346.80), SIMDE_FLOAT32_C( 240.31),
SIMDE_FLOAT32_C( 728.39), SIMDE_FLOAT32_C( -295.79), SIMDE_FLOAT32_C( -915.13), SIMDE_FLOAT32_C( 166.50),
SIMDE_FLOAT32_C( -751.11), SIMDE_FLOAT32_C( 810.37), SIMDE_FLOAT32_C( 342.34), SIMDE_FLOAT32_C( -470.78) },
{ SIMDE_FLOAT32_C( -398.19), SIMDE_FLOAT32_C( 293.73), SIMDE_FLOAT32_C( 956.27), SIMDE_FLOAT32_C( -446.67),
SIMDE_FLOAT32_C( 971.06), SIMDE_FLOAT32_C( -656.73), SIMDE_FLOAT32_C( 702.10), SIMDE_FLOAT32_C( 887.86),
SIMDE_FLOAT32_C( -676.91), SIMDE_FLOAT32_C( -193.91), SIMDE_FLOAT32_C( -480.29), SIMDE_FLOAT32_C( -135.48),
SIMDE_FLOAT32_C( -302.88), SIMDE_FLOAT32_C( -703.55), SIMDE_FLOAT32_C( -155.93), SIMDE_FLOAT32_C( -721.34) },
{ SIMDE_FLOAT32_C( 704.80), SIMDE_FLOAT32_C( 330.72), SIMDE_FLOAT32_C( 1286.64), SIMDE_FLOAT32_C( 553.00),
SIMDE_FLOAT32_C( 1215.05), SIMDE_FLOAT32_C( 663.78), SIMDE_FLOAT32_C( 783.08), SIMDE_FLOAT32_C( 919.81),
SIMDE_FLOAT32_C( 994.36), SIMDE_FLOAT32_C( 353.68), SIMDE_FLOAT32_C( 1033.51), SIMDE_FLOAT32_C( 214.66),
SIMDE_FLOAT32_C( 809.88), SIMDE_FLOAT32_C( 1073.16), SIMDE_FLOAT32_C( 376.18), SIMDE_FLOAT32_C( 861.37) } },
{ { SIMDE_FLOAT32_C( 144.45), SIMDE_FLOAT32_C( -295.12), SIMDE_FLOAT32_C( -47.37), SIMDE_FLOAT32_C( 414.12),
SIMDE_FLOAT32_C( 608.38), SIMDE_FLOAT32_C( -700.56), SIMDE_FLOAT32_C( -345.56), SIMDE_FLOAT32_C( 336.76),
SIMDE_FLOAT32_C( 3.65), SIMDE_FLOAT32_C( -260.69), SIMDE_FLOAT32_C( -496.74), SIMDE_FLOAT32_C( 252.54),
SIMDE_FLOAT32_C( -450.32), SIMDE_FLOAT32_C( 845.60), SIMDE_FLOAT32_C( 781.76), SIMDE_FLOAT32_C( 151.49) },
{ SIMDE_FLOAT32_C( 139.33), SIMDE_FLOAT32_C( 738.03), SIMDE_FLOAT32_C( 704.82), SIMDE_FLOAT32_C( 110.39),
SIMDE_FLOAT32_C( -918.70), SIMDE_FLOAT32_C( 406.92), SIMDE_FLOAT32_C( -1.75), SIMDE_FLOAT32_C( -595.61),
SIMDE_FLOAT32_C( -787.00), SIMDE_FLOAT32_C( 517.95), SIMDE_FLOAT32_C( 268.91), SIMDE_FLOAT32_C( -89.87),
SIMDE_FLOAT32_C( 814.40), SIMDE_FLOAT32_C( -887.02), SIMDE_FLOAT32_C( 188.79), SIMDE_FLOAT32_C( -41.15) },
{ SIMDE_FLOAT32_C( 200.70), SIMDE_FLOAT32_C( 794.85), SIMDE_FLOAT32_C( 706.41), SIMDE_FLOAT32_C( 428.58),
SIMDE_FLOAT32_C( 1101.88), SIMDE_FLOAT32_C( 810.17), SIMDE_FLOAT32_C( 345.56), SIMDE_FLOAT32_C( 684.22),
SIMDE_FLOAT32_C( 787.01), SIMDE_FLOAT32_C( 579.85), SIMDE_FLOAT32_C( 564.86), SIMDE_FLOAT32_C( 268.05),
SIMDE_FLOAT32_C( 930.61), SIMDE_FLOAT32_C( 1225.50), SIMDE_FLOAT32_C( 804.23), SIMDE_FLOAT32_C( 156.98) } },
{ { SIMDE_FLOAT32_C( -182.14), SIMDE_FLOAT32_C( -858.58), SIMDE_FLOAT32_C( -627.02), SIMDE_FLOAT32_C( -573.76),
SIMDE_FLOAT32_C( -559.14), SIMDE_FLOAT32_C( 27.42), SIMDE_FLOAT32_C( 763.00), SIMDE_FLOAT32_C( 444.51),
SIMDE_FLOAT32_C( 766.72), SIMDE_FLOAT32_C( -733.74), SIMDE_FLOAT32_C( -302.95), SIMDE_FLOAT32_C( -683.60),
SIMDE_FLOAT32_C( -888.14), SIMDE_FLOAT32_C( -521.19), SIMDE_FLOAT32_C( 467.89), SIMDE_FLOAT32_C( 251.19) },
{ SIMDE_FLOAT32_C( -783.16), SIMDE_FLOAT32_C( 172.71), SIMDE_FLOAT32_C( -638.42), SIMDE_FLOAT32_C( -701.86),
SIMDE_FLOAT32_C( -420.37), SIMDE_FLOAT32_C( 359.83), SIMDE_FLOAT32_C( -297.47), SIMDE_FLOAT32_C( -207.37),
SIMDE_FLOAT32_C( -122.22), SIMDE_FLOAT32_C( 971.44), SIMDE_FLOAT32_C( 702.76), SIMDE_FLOAT32_C( -307.82),
SIMDE_FLOAT32_C( -915.59), SIMDE_FLOAT32_C( -108.45), SIMDE_FLOAT32_C( 651.04), SIMDE_FLOAT32_C( -97.72) },
{ SIMDE_FLOAT32_C( 804.06), SIMDE_FLOAT32_C( 875.78), SIMDE_FLOAT32_C( 894.84), SIMDE_FLOAT32_C( 906.54),
SIMDE_FLOAT32_C( 699.53), SIMDE_FLOAT32_C( 360.87), SIMDE_FLOAT32_C( 818.94), SIMDE_FLOAT32_C( 490.50),
SIMDE_FLOAT32_C( 776.40), SIMDE_FLOAT32_C( 1217.40), SIMDE_FLOAT32_C( 765.28), SIMDE_FLOAT32_C( 749.71),
SIMDE_FLOAT32_C( 1275.58), SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( 801.73), SIMDE_FLOAT32_C( 269.53) } },
{ { SIMDE_FLOAT32_C( 32.97), SIMDE_FLOAT32_C( -975.98), SIMDE_FLOAT32_C( 328.52), SIMDE_FLOAT32_C( 473.84),
SIMDE_FLOAT32_C( 51.43), SIMDE_FLOAT32_C( 91.52), SIMDE_FLOAT32_C( -81.65), SIMDE_FLOAT32_C( -181.85),
SIMDE_FLOAT32_C( 357.78), SIMDE_FLOAT32_C( 615.40), SIMDE_FLOAT32_C( 134.55), SIMDE_FLOAT32_C( 469.64),
SIMDE_FLOAT32_C( -905.79), SIMDE_FLOAT32_C( -397.56), SIMDE_FLOAT32_C( -279.17), SIMDE_FLOAT32_C( -688.95) },
{ SIMDE_FLOAT32_C( 775.15), SIMDE_FLOAT32_C( 82.41), SIMDE_FLOAT32_C( -390.80), SIMDE_FLOAT32_C( -645.22),
SIMDE_FLOAT32_C( -557.76), SIMDE_FLOAT32_C( 311.72), SIMDE_FLOAT32_C( 147.41), SIMDE_FLOAT32_C( 320.02),
SIMDE_FLOAT32_C( 283.16), SIMDE_FLOAT32_C( -149.83), SIMDE_FLOAT32_C( -987.80), SIMDE_FLOAT32_C( 367.57),
SIMDE_FLOAT32_C( 741.72), SIMDE_FLOAT32_C( 663.24), SIMDE_FLOAT32_C( -730.15), SIMDE_FLOAT32_C( -225.30) },
{ SIMDE_FLOAT32_C( 775.85), SIMDE_FLOAT32_C( 979.45), SIMDE_FLOAT32_C( 510.54), SIMDE_FLOAT32_C( 800.52),
SIMDE_FLOAT32_C( 560.13), SIMDE_FLOAT32_C( 324.88), SIMDE_FLOAT32_C( 168.51), SIMDE_FLOAT32_C( 368.08),
SIMDE_FLOAT32_C( 456.27), SIMDE_FLOAT32_C( 633.38), SIMDE_FLOAT32_C( 996.92), SIMDE_FLOAT32_C( 596.38),
SIMDE_FLOAT32_C( 1170.73), SIMDE_FLOAT32_C( 773.27), SIMDE_FLOAT32_C( 781.70), SIMDE_FLOAT32_C( 724.85) } },
{ { SIMDE_FLOAT32_C( 687.25), SIMDE_FLOAT32_C( 598.37), SIMDE_FLOAT32_C( -751.47), SIMDE_FLOAT32_C( -261.32),
SIMDE_FLOAT32_C( -310.12), SIMDE_FLOAT32_C( 166.88), SIMDE_FLOAT32_C( 556.84), SIMDE_FLOAT32_C( -952.33),
SIMDE_FLOAT32_C( -217.72), SIMDE_FLOAT32_C( -308.61), SIMDE_FLOAT32_C( 517.31), SIMDE_FLOAT32_C( -123.51),
SIMDE_FLOAT32_C( 293.83), SIMDE_FLOAT32_C( -761.86), SIMDE_FLOAT32_C( 187.55), SIMDE_FLOAT32_C( 68.99) },
{ SIMDE_FLOAT32_C( 320.55), SIMDE_FLOAT32_C( 796.74), SIMDE_FLOAT32_C( 423.77), SIMDE_FLOAT32_C( 762.79),
SIMDE_FLOAT32_C( 108.47), SIMDE_FLOAT32_C( -428.82), SIMDE_FLOAT32_C( 82.81), SIMDE_FLOAT32_C( -608.37),
SIMDE_FLOAT32_C( 421.35), SIMDE_FLOAT32_C( 95.01), SIMDE_FLOAT32_C( 759.20), SIMDE_FLOAT32_C( 163.07),
SIMDE_FLOAT32_C( -241.76), SIMDE_FLOAT32_C( -970.95), SIMDE_FLOAT32_C( 937.77), SIMDE_FLOAT32_C( -554.50) },
{ SIMDE_FLOAT32_C( 758.33), SIMDE_FLOAT32_C( 996.41), SIMDE_FLOAT32_C( 862.72), SIMDE_FLOAT32_C( 806.31),
SIMDE_FLOAT32_C( 328.54), SIMDE_FLOAT32_C( 460.15), SIMDE_FLOAT32_C( 562.96), SIMDE_FLOAT32_C( 1130.06),
SIMDE_FLOAT32_C( 474.28), SIMDE_FLOAT32_C( 322.90), SIMDE_FLOAT32_C( 918.69), SIMDE_FLOAT32_C( 204.56),
SIMDE_FLOAT32_C( 380.50), SIMDE_FLOAT32_C( 1234.17), SIMDE_FLOAT32_C( 956.34), SIMDE_FLOAT32_C( 558.78) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 b = simde_mm512_loadu_ps(test_vec[i].b);
simde__m512 r = simde_mm512_hypot_ps(a, b);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_hypot_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 b[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 367.16), SIMDE_FLOAT32_C( 534.12), SIMDE_FLOAT32_C( 840.53), SIMDE_FLOAT32_C( -620.21),
SIMDE_FLOAT32_C( 261.27), SIMDE_FLOAT32_C( 223.14), SIMDE_FLOAT32_C( -163.58), SIMDE_FLOAT32_C( 267.96),
SIMDE_FLOAT32_C( -882.06), SIMDE_FLOAT32_C( -703.87), SIMDE_FLOAT32_C( 527.51), SIMDE_FLOAT32_C( -734.80),
SIMDE_FLOAT32_C( -828.23), SIMDE_FLOAT32_C( -822.70), SIMDE_FLOAT32_C( -911.73), SIMDE_FLOAT32_C( 856.22) },
UINT8_C(182),
{ SIMDE_FLOAT32_C( 508.95), SIMDE_FLOAT32_C( 401.36), SIMDE_FLOAT32_C( -896.06), SIMDE_FLOAT32_C( 773.16),
SIMDE_FLOAT32_C( -9.93), SIMDE_FLOAT32_C( -389.05), SIMDE_FLOAT32_C( -811.06), SIMDE_FLOAT32_C( 179.53),
SIMDE_FLOAT32_C( -842.09), SIMDE_FLOAT32_C( 34.81), SIMDE_FLOAT32_C( -170.09), SIMDE_FLOAT32_C( 888.35),
SIMDE_FLOAT32_C( -467.85), SIMDE_FLOAT32_C( 381.00), SIMDE_FLOAT32_C( 255.51), SIMDE_FLOAT32_C( -933.73) },
{ SIMDE_FLOAT32_C( 221.53), SIMDE_FLOAT32_C( 635.30), SIMDE_FLOAT32_C( 327.54), SIMDE_FLOAT32_C( -555.33),
SIMDE_FLOAT32_C( -528.28), SIMDE_FLOAT32_C( -404.50), SIMDE_FLOAT32_C( -437.39), SIMDE_FLOAT32_C( -232.15),
SIMDE_FLOAT32_C( -876.99), SIMDE_FLOAT32_C( -172.19), SIMDE_FLOAT32_C( -60.39), SIMDE_FLOAT32_C( -699.69),
SIMDE_FLOAT32_C( -83.92), SIMDE_FLOAT32_C( -204.17), SIMDE_FLOAT32_C( 701.45), SIMDE_FLOAT32_C( -574.97) },
{ SIMDE_FLOAT32_C( 367.16), SIMDE_FLOAT32_C( 751.46), SIMDE_FLOAT32_C( 954.05), SIMDE_FLOAT32_C( -620.21),
SIMDE_FLOAT32_C( 528.37), SIMDE_FLOAT32_C( 561.23), SIMDE_FLOAT32_C( -163.58), SIMDE_FLOAT32_C( 293.47),
SIMDE_FLOAT32_C( -882.06), SIMDE_FLOAT32_C( -703.87), SIMDE_FLOAT32_C( 527.51), SIMDE_FLOAT32_C( -734.80),
SIMDE_FLOAT32_C( -828.23), SIMDE_FLOAT32_C( -822.70), SIMDE_FLOAT32_C( -911.73), SIMDE_FLOAT32_C( 856.22) } },
{ { SIMDE_FLOAT32_C( -802.80), SIMDE_FLOAT32_C( 805.39), SIMDE_FLOAT32_C( -801.81), SIMDE_FLOAT32_C( 187.27),
SIMDE_FLOAT32_C( -583.65), SIMDE_FLOAT32_C( -612.87), SIMDE_FLOAT32_C( -633.20), SIMDE_FLOAT32_C( -425.74),
SIMDE_FLOAT32_C( 421.94), SIMDE_FLOAT32_C( 196.71), SIMDE_FLOAT32_C( -537.40), SIMDE_FLOAT32_C( 954.08),
SIMDE_FLOAT32_C( -422.29), SIMDE_FLOAT32_C( 718.11), SIMDE_FLOAT32_C( -979.65), SIMDE_FLOAT32_C( 799.24) },
UINT8_C( 1),
{ SIMDE_FLOAT32_C( 347.90), SIMDE_FLOAT32_C( -756.09), SIMDE_FLOAT32_C( 825.13), SIMDE_FLOAT32_C( 943.40),
SIMDE_FLOAT32_C( -193.47), SIMDE_FLOAT32_C( -407.03), SIMDE_FLOAT32_C( -933.59), SIMDE_FLOAT32_C( 634.34),
SIMDE_FLOAT32_C( 532.59), SIMDE_FLOAT32_C( -633.28), SIMDE_FLOAT32_C( -449.58), SIMDE_FLOAT32_C( -671.58),
SIMDE_FLOAT32_C( -931.83), SIMDE_FLOAT32_C( -24.55), SIMDE_FLOAT32_C( -474.38), SIMDE_FLOAT32_C( 873.57) },
{ SIMDE_FLOAT32_C( 173.64), SIMDE_FLOAT32_C( 712.89), SIMDE_FLOAT32_C( -710.09), SIMDE_FLOAT32_C( 560.77),
SIMDE_FLOAT32_C( -920.31), SIMDE_FLOAT32_C( -135.83), SIMDE_FLOAT32_C( -17.30), SIMDE_FLOAT32_C( 276.39),
SIMDE_FLOAT32_C( 326.78), SIMDE_FLOAT32_C( -63.21), SIMDE_FLOAT32_C( 854.10), SIMDE_FLOAT32_C( 44.89),
SIMDE_FLOAT32_C( -42.86), SIMDE_FLOAT32_C( 653.34), SIMDE_FLOAT32_C( -601.70), SIMDE_FLOAT32_C( -694.96) },
{ SIMDE_FLOAT32_C( 388.83), SIMDE_FLOAT32_C( 805.39), SIMDE_FLOAT32_C( -801.81), SIMDE_FLOAT32_C( 187.27),
SIMDE_FLOAT32_C( -583.65), SIMDE_FLOAT32_C( -612.87), SIMDE_FLOAT32_C( -633.20), SIMDE_FLOAT32_C( -425.74),
SIMDE_FLOAT32_C( 421.94), SIMDE_FLOAT32_C( 196.71), SIMDE_FLOAT32_C( -537.40), SIMDE_FLOAT32_C( 954.08),
SIMDE_FLOAT32_C( -422.29), SIMDE_FLOAT32_C( 718.11), SIMDE_FLOAT32_C( -979.65), SIMDE_FLOAT32_C( 799.24) } },
{ { SIMDE_FLOAT32_C( 897.26), SIMDE_FLOAT32_C( -776.57), SIMDE_FLOAT32_C( -751.56), SIMDE_FLOAT32_C( -296.22),
SIMDE_FLOAT32_C( -183.60), SIMDE_FLOAT32_C( -685.15), SIMDE_FLOAT32_C( -661.88), SIMDE_FLOAT32_C( -651.01),
SIMDE_FLOAT32_C( -318.42), SIMDE_FLOAT32_C( -111.46), SIMDE_FLOAT32_C( -322.60), SIMDE_FLOAT32_C( -250.25),
SIMDE_FLOAT32_C( 863.99), SIMDE_FLOAT32_C( 203.02), SIMDE_FLOAT32_C( -376.68), SIMDE_FLOAT32_C( 37.62) },
UINT8_C( 54),
{ SIMDE_FLOAT32_C( -86.77), SIMDE_FLOAT32_C( -401.61), SIMDE_FLOAT32_C( -4.41), SIMDE_FLOAT32_C( 777.40),
SIMDE_FLOAT32_C( 581.09), SIMDE_FLOAT32_C( -728.01), SIMDE_FLOAT32_C( 104.18), SIMDE_FLOAT32_C( -482.12),
SIMDE_FLOAT32_C( -873.91), SIMDE_FLOAT32_C( -850.93), SIMDE_FLOAT32_C( 475.02), SIMDE_FLOAT32_C( 779.43),
SIMDE_FLOAT32_C( -452.63), SIMDE_FLOAT32_C( 780.06), SIMDE_FLOAT32_C( 676.69), SIMDE_FLOAT32_C( -229.20) },
{ SIMDE_FLOAT32_C( -971.50), SIMDE_FLOAT32_C( -619.53), SIMDE_FLOAT32_C( 587.20), SIMDE_FLOAT32_C( -656.65),
SIMDE_FLOAT32_C( -281.40), SIMDE_FLOAT32_C( 936.19), SIMDE_FLOAT32_C( 24.93), SIMDE_FLOAT32_C( 607.14),
SIMDE_FLOAT32_C( -386.41), SIMDE_FLOAT32_C( 774.68), SIMDE_FLOAT32_C( 471.12), SIMDE_FLOAT32_C( 816.61),
SIMDE_FLOAT32_C( -602.00), SIMDE_FLOAT32_C( -491.25), SIMDE_FLOAT32_C( -267.48), SIMDE_FLOAT32_C( 311.23) },
{ SIMDE_FLOAT32_C( 897.26), SIMDE_FLOAT32_C( 738.31), SIMDE_FLOAT32_C( 587.22), SIMDE_FLOAT32_C( -296.22),
SIMDE_FLOAT32_C( 645.64), SIMDE_FLOAT32_C( 1185.94), SIMDE_FLOAT32_C( -661.88), SIMDE_FLOAT32_C( -651.01),
SIMDE_FLOAT32_C( -318.42), SIMDE_FLOAT32_C( -111.46), SIMDE_FLOAT32_C( -322.60), SIMDE_FLOAT32_C( -250.25),
SIMDE_FLOAT32_C( 863.99), SIMDE_FLOAT32_C( 203.02), SIMDE_FLOAT32_C( -376.68), SIMDE_FLOAT32_C( 37.62) } },
{ { SIMDE_FLOAT32_C( 107.14), SIMDE_FLOAT32_C( 728.11), SIMDE_FLOAT32_C( 88.63), SIMDE_FLOAT32_C( -311.77),
SIMDE_FLOAT32_C( -999.90), SIMDE_FLOAT32_C( -807.18), SIMDE_FLOAT32_C( 206.11), SIMDE_FLOAT32_C( -873.82),
SIMDE_FLOAT32_C( -658.11), SIMDE_FLOAT32_C( -318.87), SIMDE_FLOAT32_C( 905.61), SIMDE_FLOAT32_C( -110.74),
SIMDE_FLOAT32_C( -538.82), SIMDE_FLOAT32_C( 582.30), SIMDE_FLOAT32_C( 660.06), SIMDE_FLOAT32_C( -510.32) },
UINT8_C(112),
{ SIMDE_FLOAT32_C( 247.26), SIMDE_FLOAT32_C( -166.97), SIMDE_FLOAT32_C( -318.63), SIMDE_FLOAT32_C( 183.45),
SIMDE_FLOAT32_C( 857.96), SIMDE_FLOAT32_C( -711.49), SIMDE_FLOAT32_C( 797.04), SIMDE_FLOAT32_C( 632.64),
SIMDE_FLOAT32_C( 759.63), SIMDE_FLOAT32_C( 613.65), SIMDE_FLOAT32_C( -969.36), SIMDE_FLOAT32_C( -731.62),
SIMDE_FLOAT32_C( -653.84), SIMDE_FLOAT32_C( 341.87), SIMDE_FLOAT32_C( 375.52), SIMDE_FLOAT32_C( -925.73) },
{ SIMDE_FLOAT32_C( -569.50), SIMDE_FLOAT32_C( -936.25), SIMDE_FLOAT32_C( -925.63), SIMDE_FLOAT32_C( -376.68),
SIMDE_FLOAT32_C( 269.87), SIMDE_FLOAT32_C( -799.45), SIMDE_FLOAT32_C( -34.80), SIMDE_FLOAT32_C( 950.99),
SIMDE_FLOAT32_C( -893.84), SIMDE_FLOAT32_C( 854.47), SIMDE_FLOAT32_C( -587.82), SIMDE_FLOAT32_C( 688.47),
SIMDE_FLOAT32_C( 514.53), SIMDE_FLOAT32_C( -98.14), SIMDE_FLOAT32_C( 651.24), SIMDE_FLOAT32_C( -238.21) },
{ SIMDE_FLOAT32_C( 107.14), SIMDE_FLOAT32_C( 728.11), SIMDE_FLOAT32_C( 88.63), SIMDE_FLOAT32_C( -311.77),
SIMDE_FLOAT32_C( 899.40), SIMDE_FLOAT32_C( 1070.20), SIMDE_FLOAT32_C( 797.80), SIMDE_FLOAT32_C( -873.82),
SIMDE_FLOAT32_C( -658.11), SIMDE_FLOAT32_C( -318.87), SIMDE_FLOAT32_C( 905.61), SIMDE_FLOAT32_C( -110.74),
SIMDE_FLOAT32_C( -538.82), SIMDE_FLOAT32_C( 582.30), SIMDE_FLOAT32_C( 660.06), SIMDE_FLOAT32_C( -510.32) } },
{ { SIMDE_FLOAT32_C( 734.89), SIMDE_FLOAT32_C( -667.39), SIMDE_FLOAT32_C( 945.23), SIMDE_FLOAT32_C( 592.85),
SIMDE_FLOAT32_C( -378.88), SIMDE_FLOAT32_C( 742.27), SIMDE_FLOAT32_C( 225.49), SIMDE_FLOAT32_C( -619.25),
SIMDE_FLOAT32_C( 355.91), SIMDE_FLOAT32_C( 256.12), SIMDE_FLOAT32_C( -350.87), SIMDE_FLOAT32_C( 702.07),
SIMDE_FLOAT32_C( -402.01), SIMDE_FLOAT32_C( -975.35), SIMDE_FLOAT32_C( 776.35), SIMDE_FLOAT32_C( 28.49) },
UINT8_C( 29),
{ SIMDE_FLOAT32_C( 850.71), SIMDE_FLOAT32_C( 651.81), SIMDE_FLOAT32_C( 358.27), SIMDE_FLOAT32_C( -948.74),
SIMDE_FLOAT32_C( -382.99), SIMDE_FLOAT32_C( 309.27), SIMDE_FLOAT32_C( -842.57), SIMDE_FLOAT32_C( -528.52),
SIMDE_FLOAT32_C( 721.45), SIMDE_FLOAT32_C( 845.89), SIMDE_FLOAT32_C( 986.00), SIMDE_FLOAT32_C( -376.69),
SIMDE_FLOAT32_C( 497.14), SIMDE_FLOAT32_C( -252.21), SIMDE_FLOAT32_C( -641.80), SIMDE_FLOAT32_C( 829.75) },
{ SIMDE_FLOAT32_C( -306.98), SIMDE_FLOAT32_C( 951.05), SIMDE_FLOAT32_C( -549.13), SIMDE_FLOAT32_C( -564.71),
SIMDE_FLOAT32_C( 176.53), SIMDE_FLOAT32_C( -168.38), SIMDE_FLOAT32_C( 791.20), SIMDE_FLOAT32_C( -567.34),
SIMDE_FLOAT32_C( 480.75), SIMDE_FLOAT32_C( 493.27), SIMDE_FLOAT32_C( 30.65), SIMDE_FLOAT32_C( 505.41),
SIMDE_FLOAT32_C( 269.62), SIMDE_FLOAT32_C( -940.86), SIMDE_FLOAT32_C( 593.82), SIMDE_FLOAT32_C( 120.33) },
{ SIMDE_FLOAT32_C( 904.40), SIMDE_FLOAT32_C( -667.39), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 1104.09),
SIMDE_FLOAT32_C( 421.72), SIMDE_FLOAT32_C( 742.27), SIMDE_FLOAT32_C( 225.49), SIMDE_FLOAT32_C( -619.25),
SIMDE_FLOAT32_C( 355.91), SIMDE_FLOAT32_C( 256.12), SIMDE_FLOAT32_C( -350.87), SIMDE_FLOAT32_C( 702.07),
SIMDE_FLOAT32_C( -402.01), SIMDE_FLOAT32_C( -975.35), SIMDE_FLOAT32_C( 776.35), SIMDE_FLOAT32_C( 28.49) } },
{ { SIMDE_FLOAT32_C( 710.95), SIMDE_FLOAT32_C( -47.91), SIMDE_FLOAT32_C( 171.59), SIMDE_FLOAT32_C( -672.04),
SIMDE_FLOAT32_C( -738.64), SIMDE_FLOAT32_C( 329.02), SIMDE_FLOAT32_C( -200.57), SIMDE_FLOAT32_C( 982.81),
SIMDE_FLOAT32_C( 174.91), SIMDE_FLOAT32_C( -214.56), SIMDE_FLOAT32_C( -393.88), SIMDE_FLOAT32_C( -327.95),
SIMDE_FLOAT32_C( 533.22), SIMDE_FLOAT32_C( -35.69), SIMDE_FLOAT32_C( -498.20), SIMDE_FLOAT32_C( -773.76) },
UINT8_C(210),
{ SIMDE_FLOAT32_C( -47.34), SIMDE_FLOAT32_C( -338.47), SIMDE_FLOAT32_C( -908.10), SIMDE_FLOAT32_C( 784.28),
SIMDE_FLOAT32_C( -547.27), SIMDE_FLOAT32_C( -475.45), SIMDE_FLOAT32_C( 265.03), SIMDE_FLOAT32_C( 946.00),
SIMDE_FLOAT32_C( 555.20), SIMDE_FLOAT32_C( -229.56), SIMDE_FLOAT32_C( 215.62), SIMDE_FLOAT32_C( 614.34),
SIMDE_FLOAT32_C( -635.74), SIMDE_FLOAT32_C( -664.05), SIMDE_FLOAT32_C( 325.29), SIMDE_FLOAT32_C( 316.35) },
{ SIMDE_FLOAT32_C( 507.54), SIMDE_FLOAT32_C( 653.24), SIMDE_FLOAT32_C( 577.71), SIMDE_FLOAT32_C( -163.44),
SIMDE_FLOAT32_C( -547.32), SIMDE_FLOAT32_C( 560.52), SIMDE_FLOAT32_C( -988.53), SIMDE_FLOAT32_C( 238.11),
SIMDE_FLOAT32_C( -833.36), SIMDE_FLOAT32_C( -316.48), SIMDE_FLOAT32_C( -228.66), SIMDE_FLOAT32_C( 130.95),
SIMDE_FLOAT32_C( 185.32), SIMDE_FLOAT32_C( -2.42), SIMDE_FLOAT32_C( -953.69), SIMDE_FLOAT32_C( -862.02) },
{ SIMDE_FLOAT32_C( 710.95), SIMDE_FLOAT32_C( 735.72), SIMDE_FLOAT32_C( 171.59), SIMDE_FLOAT32_C( -672.04),
SIMDE_FLOAT32_C( 773.99), SIMDE_FLOAT32_C( 329.02), SIMDE_FLOAT32_C( 1023.44), SIMDE_FLOAT32_C( 975.51),
SIMDE_FLOAT32_C( 174.91), SIMDE_FLOAT32_C( -214.56), SIMDE_FLOAT32_C( -393.88), SIMDE_FLOAT32_C( -327.95),
SIMDE_FLOAT32_C( 533.22), SIMDE_FLOAT32_C( -35.69), SIMDE_FLOAT32_C( -498.20), SIMDE_FLOAT32_C( -773.76) } },
{ { SIMDE_FLOAT32_C( 659.11), SIMDE_FLOAT32_C( -861.79), SIMDE_FLOAT32_C( 922.26), SIMDE_FLOAT32_C( -888.16),
SIMDE_FLOAT32_C( -337.24), SIMDE_FLOAT32_C( 187.30), SIMDE_FLOAT32_C( -942.16), SIMDE_FLOAT32_C( -782.04),
SIMDE_FLOAT32_C( 957.74), SIMDE_FLOAT32_C( 273.45), SIMDE_FLOAT32_C( 832.30), SIMDE_FLOAT32_C( -678.00),
SIMDE_FLOAT32_C( 609.40), SIMDE_FLOAT32_C( 157.59), SIMDE_FLOAT32_C( 638.35), SIMDE_FLOAT32_C( 116.94) },
UINT8_C(122),
{ SIMDE_FLOAT32_C( 216.06), SIMDE_FLOAT32_C( 953.51), SIMDE_FLOAT32_C( 263.51), SIMDE_FLOAT32_C( -223.42),
SIMDE_FLOAT32_C( 964.98), SIMDE_FLOAT32_C( -498.37), SIMDE_FLOAT32_C( -56.78), SIMDE_FLOAT32_C( -351.50),
SIMDE_FLOAT32_C( 272.97), SIMDE_FLOAT32_C( -925.83), SIMDE_FLOAT32_C( 833.82), SIMDE_FLOAT32_C( -729.45),
SIMDE_FLOAT32_C( -879.52), SIMDE_FLOAT32_C( 971.80), SIMDE_FLOAT32_C( 929.66), SIMDE_FLOAT32_C( -741.31) },
{ SIMDE_FLOAT32_C( 894.07), SIMDE_FLOAT32_C( -958.51), SIMDE_FLOAT32_C( -78.55), SIMDE_FLOAT32_C( 81.37),
SIMDE_FLOAT32_C( -900.67), SIMDE_FLOAT32_C( 139.42), SIMDE_FLOAT32_C( 39.11), SIMDE_FLOAT32_C( 372.78),
SIMDE_FLOAT32_C( -28.28), SIMDE_FLOAT32_C( 361.11), SIMDE_FLOAT32_C( -17.81), SIMDE_FLOAT32_C( -870.69),
SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -900.87), SIMDE_FLOAT32_C( -59.85), SIMDE_FLOAT32_C( -784.48) },
{ SIMDE_FLOAT32_C( 659.11), SIMDE_FLOAT32_C( 1352.01), SIMDE_FLOAT32_C( 922.26), SIMDE_FLOAT32_C( 237.78),
SIMDE_FLOAT32_C( 1320.00), SIMDE_FLOAT32_C( 517.50), SIMDE_FLOAT32_C( 68.95), SIMDE_FLOAT32_C( -782.04),
SIMDE_FLOAT32_C( 957.74), SIMDE_FLOAT32_C( 273.45), SIMDE_FLOAT32_C( 832.30), SIMDE_FLOAT32_C( -678.00),
SIMDE_FLOAT32_C( 609.40), SIMDE_FLOAT32_C( 157.59), SIMDE_FLOAT32_C( 638.35), SIMDE_FLOAT32_C( 116.94) } },
{ { SIMDE_FLOAT32_C( -947.37), SIMDE_FLOAT32_C( -796.34), SIMDE_FLOAT32_C( -7.90), SIMDE_FLOAT32_C( -982.39),
SIMDE_FLOAT32_C( -294.71), SIMDE_FLOAT32_C( 935.32), SIMDE_FLOAT32_C( -333.88), SIMDE_FLOAT32_C( 978.25),
SIMDE_FLOAT32_C( -990.51), SIMDE_FLOAT32_C( -500.06), SIMDE_FLOAT32_C( -751.20), SIMDE_FLOAT32_C( -870.03),
SIMDE_FLOAT32_C( -528.26), SIMDE_FLOAT32_C( -821.55), SIMDE_FLOAT32_C( -611.34), SIMDE_FLOAT32_C( -634.19) },
UINT8_C(234),
{ SIMDE_FLOAT32_C( 310.12), SIMDE_FLOAT32_C( 447.18), SIMDE_FLOAT32_C( -680.72), SIMDE_FLOAT32_C( -550.47),
SIMDE_FLOAT32_C( -513.72), SIMDE_FLOAT32_C( 692.06), SIMDE_FLOAT32_C( 421.25), SIMDE_FLOAT32_C( 847.39),
SIMDE_FLOAT32_C( -325.76), SIMDE_FLOAT32_C( 550.57), SIMDE_FLOAT32_C( -153.15), SIMDE_FLOAT32_C( -226.63),
SIMDE_FLOAT32_C( -509.29), SIMDE_FLOAT32_C( 62.37), SIMDE_FLOAT32_C( -173.99), SIMDE_FLOAT32_C( -305.63) },
{ SIMDE_FLOAT32_C( -945.53), SIMDE_FLOAT32_C( -156.38), SIMDE_FLOAT32_C( 399.66), SIMDE_FLOAT32_C( 989.79),
SIMDE_FLOAT32_C( 509.74), SIMDE_FLOAT32_C( 377.91), SIMDE_FLOAT32_C( 999.28), SIMDE_FLOAT32_C( -990.32),
SIMDE_FLOAT32_C( 626.71), SIMDE_FLOAT32_C( -870.75), SIMDE_FLOAT32_C( -518.58), SIMDE_FLOAT32_C( 805.16),
SIMDE_FLOAT32_C( -482.08), SIMDE_FLOAT32_C( -152.77), SIMDE_FLOAT32_C( -974.89), SIMDE_FLOAT32_C( 828.03) },
{ SIMDE_FLOAT32_C( -947.37), SIMDE_FLOAT32_C( 473.73), SIMDE_FLOAT32_C( -7.90), SIMDE_FLOAT32_C( 1132.56),
SIMDE_FLOAT32_C( -294.71), SIMDE_FLOAT32_C( 788.52), SIMDE_FLOAT32_C( 1084.44), SIMDE_FLOAT32_C( 1303.38),
SIMDE_FLOAT32_C( -990.51), SIMDE_FLOAT32_C( -500.06), SIMDE_FLOAT32_C( -751.20), SIMDE_FLOAT32_C( -870.03),
SIMDE_FLOAT32_C( -528.26), SIMDE_FLOAT32_C( -821.55), SIMDE_FLOAT32_C( -611.34), SIMDE_FLOAT32_C( -634.19) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 b = simde_mm512_loadu_ps(test_vec[i].b);
simde__m512 r = simde_mm512_mask_hypot_ps(src, test_vec[i].k, a, b);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_hypot_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 b[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 275.20), SIMDE_FLOAT64_C( 366.24), SIMDE_FLOAT64_C( 966.11), SIMDE_FLOAT64_C( -937.96),
SIMDE_FLOAT64_C( 570.22), SIMDE_FLOAT64_C( -7.21), SIMDE_FLOAT64_C( 612.58), SIMDE_FLOAT64_C( -184.69) },
{ SIMDE_FLOAT64_C( -503.58), SIMDE_FLOAT64_C( 256.83), SIMDE_FLOAT64_C( 80.98), SIMDE_FLOAT64_C( -364.25),
SIMDE_FLOAT64_C( 598.02), SIMDE_FLOAT64_C( -961.08), SIMDE_FLOAT64_C( 560.19), SIMDE_FLOAT64_C( -553.76) },
{ SIMDE_FLOAT64_C( 573.87), SIMDE_FLOAT64_C( 447.32), SIMDE_FLOAT64_C( 969.50), SIMDE_FLOAT64_C( 1006.20),
SIMDE_FLOAT64_C( 826.30), SIMDE_FLOAT64_C( 961.11), SIMDE_FLOAT64_C( 830.10), SIMDE_FLOAT64_C( 583.75) } },
{ { SIMDE_FLOAT64_C( -373.20), SIMDE_FLOAT64_C( 625.48), SIMDE_FLOAT64_C( 871.64), SIMDE_FLOAT64_C( -503.55),
SIMDE_FLOAT64_C( -900.28), SIMDE_FLOAT64_C( 58.59), SIMDE_FLOAT64_C( -493.99), SIMDE_FLOAT64_C( 103.21) },
{ SIMDE_FLOAT64_C( 916.41), SIMDE_FLOAT64_C( 70.36), SIMDE_FLOAT64_C( -720.02), SIMDE_FLOAT64_C( -164.66),
SIMDE_FLOAT64_C( 487.58), SIMDE_FLOAT64_C( -677.71), SIMDE_FLOAT64_C( -865.62), SIMDE_FLOAT64_C( -237.21) },
{ SIMDE_FLOAT64_C( 989.49), SIMDE_FLOAT64_C( 629.42), SIMDE_FLOAT64_C( 1130.57), SIMDE_FLOAT64_C( 529.79),
SIMDE_FLOAT64_C( 1023.84), SIMDE_FLOAT64_C( 680.24), SIMDE_FLOAT64_C( 996.66), SIMDE_FLOAT64_C( 258.69) } },
{ { SIMDE_FLOAT64_C( 688.53), SIMDE_FLOAT64_C( -899.51), SIMDE_FLOAT64_C( -175.18), SIMDE_FLOAT64_C( 258.75),
SIMDE_FLOAT64_C( 93.28), SIMDE_FLOAT64_C( -562.60), SIMDE_FLOAT64_C( -925.94), SIMDE_FLOAT64_C( 589.69) },
{ SIMDE_FLOAT64_C( 694.23), SIMDE_FLOAT64_C( 155.04), SIMDE_FLOAT64_C( -774.56), SIMDE_FLOAT64_C( 292.25),
SIMDE_FLOAT64_C( 193.96), SIMDE_FLOAT64_C( 785.64), SIMDE_FLOAT64_C( 738.49), SIMDE_FLOAT64_C( 820.76) },
{ SIMDE_FLOAT64_C( 977.77), SIMDE_FLOAT64_C( 912.77), SIMDE_FLOAT64_C( 794.12), SIMDE_FLOAT64_C( 390.34),
SIMDE_FLOAT64_C( 215.22), SIMDE_FLOAT64_C( 966.31), SIMDE_FLOAT64_C( 1184.37), SIMDE_FLOAT64_C( 1010.63) } },
{ { SIMDE_FLOAT64_C( 411.12), SIMDE_FLOAT64_C( 610.13), SIMDE_FLOAT64_C( -682.79), SIMDE_FLOAT64_C( 510.84),
SIMDE_FLOAT64_C( -331.28), SIMDE_FLOAT64_C( -176.78), SIMDE_FLOAT64_C( -385.95), SIMDE_FLOAT64_C( -414.87) },
{ SIMDE_FLOAT64_C( 893.58), SIMDE_FLOAT64_C( -105.97), SIMDE_FLOAT64_C( 420.47), SIMDE_FLOAT64_C( 381.16),
SIMDE_FLOAT64_C( 216.32), SIMDE_FLOAT64_C( 554.85), SIMDE_FLOAT64_C( -856.05), SIMDE_FLOAT64_C( -95.14) },
{ SIMDE_FLOAT64_C( 983.62), SIMDE_FLOAT64_C( 619.26), SIMDE_FLOAT64_C( 801.87), SIMDE_FLOAT64_C( 637.37),
SIMDE_FLOAT64_C( 395.65), SIMDE_FLOAT64_C( 582.33), SIMDE_FLOAT64_C( 939.03), SIMDE_FLOAT64_C( 425.64) } },
{ { SIMDE_FLOAT64_C( 655.34), SIMDE_FLOAT64_C( -31.23), SIMDE_FLOAT64_C( -836.39), SIMDE_FLOAT64_C( -251.38),
SIMDE_FLOAT64_C( 406.17), SIMDE_FLOAT64_C( -762.33), SIMDE_FLOAT64_C( -661.69), SIMDE_FLOAT64_C( 100.40) },
{ SIMDE_FLOAT64_C( 392.71), SIMDE_FLOAT64_C( -436.24), SIMDE_FLOAT64_C( -607.35), SIMDE_FLOAT64_C( -413.33),
SIMDE_FLOAT64_C( -650.61), SIMDE_FLOAT64_C( -868.86), SIMDE_FLOAT64_C( -592.57), SIMDE_FLOAT64_C( 760.51) },
{ SIMDE_FLOAT64_C( 764.00), SIMDE_FLOAT64_C( 437.36), SIMDE_FLOAT64_C( 1033.65), SIMDE_FLOAT64_C( 483.77),
SIMDE_FLOAT64_C( 766.99), SIMDE_FLOAT64_C( 1155.88), SIMDE_FLOAT64_C( 888.24), SIMDE_FLOAT64_C( 767.11) } },
{ { SIMDE_FLOAT64_C( 741.27), SIMDE_FLOAT64_C( -275.37), SIMDE_FLOAT64_C( 271.35), SIMDE_FLOAT64_C( -590.01),
SIMDE_FLOAT64_C( 547.85), SIMDE_FLOAT64_C( 885.41), SIMDE_FLOAT64_C( -4.88), SIMDE_FLOAT64_C( 441.42) },
{ SIMDE_FLOAT64_C( -220.56), SIMDE_FLOAT64_C( -584.41), SIMDE_FLOAT64_C( -177.42), SIMDE_FLOAT64_C( 995.76),
SIMDE_FLOAT64_C( 970.44), SIMDE_FLOAT64_C( -33.47), SIMDE_FLOAT64_C( -99.38), SIMDE_FLOAT64_C( 625.78) },
{ SIMDE_FLOAT64_C( 773.39), SIMDE_FLOAT64_C( 646.04), SIMDE_FLOAT64_C( 324.20), SIMDE_FLOAT64_C( 1157.43),
SIMDE_FLOAT64_C( 1114.40), SIMDE_FLOAT64_C( 886.04), SIMDE_FLOAT64_C( 99.50), SIMDE_FLOAT64_C( 765.80) } },
{ { SIMDE_FLOAT64_C( 935.30), SIMDE_FLOAT64_C( 64.23), SIMDE_FLOAT64_C( -625.60), SIMDE_FLOAT64_C( 341.47),
SIMDE_FLOAT64_C( 301.89), SIMDE_FLOAT64_C( -287.29), SIMDE_FLOAT64_C( -558.13), SIMDE_FLOAT64_C( -305.40) },
{ SIMDE_FLOAT64_C( 276.47), SIMDE_FLOAT64_C( -165.48), SIMDE_FLOAT64_C( 281.27), SIMDE_FLOAT64_C( 625.86),
SIMDE_FLOAT64_C( -34.34), SIMDE_FLOAT64_C( 688.70), SIMDE_FLOAT64_C( 386.37), SIMDE_FLOAT64_C( -293.08) },
{ SIMDE_FLOAT64_C( 975.31), SIMDE_FLOAT64_C( 177.51), SIMDE_FLOAT64_C( 685.92), SIMDE_FLOAT64_C( 712.95),
SIMDE_FLOAT64_C( 303.84), SIMDE_FLOAT64_C( 746.22), SIMDE_FLOAT64_C( 678.82), SIMDE_FLOAT64_C( 423.28) } },
{ { SIMDE_FLOAT64_C( -586.67), SIMDE_FLOAT64_C( -342.28), SIMDE_FLOAT64_C( 116.91), SIMDE_FLOAT64_C( 961.18),
SIMDE_FLOAT64_C( -456.87), SIMDE_FLOAT64_C( -887.97), SIMDE_FLOAT64_C( 402.60), SIMDE_FLOAT64_C( 322.57) },
{ SIMDE_FLOAT64_C( -472.39), SIMDE_FLOAT64_C( -774.82), SIMDE_FLOAT64_C( 318.33), SIMDE_FLOAT64_C( -501.95),
SIMDE_FLOAT64_C( 191.71), SIMDE_FLOAT64_C( -781.04), SIMDE_FLOAT64_C( -876.17), SIMDE_FLOAT64_C( 127.01) },
{ SIMDE_FLOAT64_C( 753.22), SIMDE_FLOAT64_C( 847.05), SIMDE_FLOAT64_C( 339.12), SIMDE_FLOAT64_C( 1084.35),
SIMDE_FLOAT64_C( 495.46), SIMDE_FLOAT64_C( 1182.59), SIMDE_FLOAT64_C( 964.24), SIMDE_FLOAT64_C( 346.67) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d b = simde_mm512_loadu_pd(test_vec[i].b);
simde__m512d r = simde_mm512_hypot_pd(a, b);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_hypot_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 b[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -431.95), SIMDE_FLOAT64_C( -237.84), SIMDE_FLOAT64_C( 748.51), SIMDE_FLOAT64_C( 841.10),
SIMDE_FLOAT64_C( -673.54), SIMDE_FLOAT64_C( 668.62), SIMDE_FLOAT64_C( 514.70), SIMDE_FLOAT64_C( -656.78) },
UINT8_C(201),
{ SIMDE_FLOAT64_C( 160.07), SIMDE_FLOAT64_C( -729.81), SIMDE_FLOAT64_C( -33.18), SIMDE_FLOAT64_C( 130.28),
SIMDE_FLOAT64_C( 345.30), SIMDE_FLOAT64_C( -333.34), SIMDE_FLOAT64_C( -285.62), SIMDE_FLOAT64_C( -843.08) },
{ SIMDE_FLOAT64_C( -705.31), SIMDE_FLOAT64_C( -528.34), SIMDE_FLOAT64_C( 222.02), SIMDE_FLOAT64_C( -760.66),
SIMDE_FLOAT64_C( -344.72), SIMDE_FLOAT64_C( -209.64), SIMDE_FLOAT64_C( -687.68), SIMDE_FLOAT64_C( 52.34) },
{ SIMDE_FLOAT64_C( 723.25), SIMDE_FLOAT64_C( -237.84), SIMDE_FLOAT64_C( 748.51), SIMDE_FLOAT64_C( 771.74),
SIMDE_FLOAT64_C( -673.54), SIMDE_FLOAT64_C( 668.62), SIMDE_FLOAT64_C( 744.64), SIMDE_FLOAT64_C( 844.70) } },
{ { SIMDE_FLOAT64_C( 859.76), SIMDE_FLOAT64_C( 134.54), SIMDE_FLOAT64_C( -771.62), SIMDE_FLOAT64_C( -408.76),
SIMDE_FLOAT64_C( 106.34), SIMDE_FLOAT64_C( -575.90), SIMDE_FLOAT64_C( 159.29), SIMDE_FLOAT64_C( 868.50) },
UINT8_C(223),
{ SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( -805.04), SIMDE_FLOAT64_C( 841.23), SIMDE_FLOAT64_C( -484.91),
SIMDE_FLOAT64_C( -461.82), SIMDE_FLOAT64_C( 403.45), SIMDE_FLOAT64_C( 675.17), SIMDE_FLOAT64_C( -191.63) },
{ SIMDE_FLOAT64_C( -629.72), SIMDE_FLOAT64_C( -194.56), SIMDE_FLOAT64_C( -846.33), SIMDE_FLOAT64_C( 36.94),
SIMDE_FLOAT64_C( 519.83), SIMDE_FLOAT64_C( -689.41), SIMDE_FLOAT64_C( 331.63), SIMDE_FLOAT64_C( 991.49) },
{ SIMDE_FLOAT64_C( 629.72), SIMDE_FLOAT64_C( 828.22), SIMDE_FLOAT64_C( 1193.29), SIMDE_FLOAT64_C( 486.31),
SIMDE_FLOAT64_C( 695.34), SIMDE_FLOAT64_C( -575.90), SIMDE_FLOAT64_C( 752.22), SIMDE_FLOAT64_C( 1009.84) } },
{ { SIMDE_FLOAT64_C( 532.61), SIMDE_FLOAT64_C( 570.97), SIMDE_FLOAT64_C( -353.24), SIMDE_FLOAT64_C( -677.03),
SIMDE_FLOAT64_C( 883.29), SIMDE_FLOAT64_C( 699.10), SIMDE_FLOAT64_C( -817.27), SIMDE_FLOAT64_C( 17.83) },
UINT8_C(222),
{ SIMDE_FLOAT64_C( -226.03), SIMDE_FLOAT64_C( -875.83), SIMDE_FLOAT64_C( -648.42), SIMDE_FLOAT64_C( 933.26),
SIMDE_FLOAT64_C( 992.67), SIMDE_FLOAT64_C( -475.82), SIMDE_FLOAT64_C( -66.35), SIMDE_FLOAT64_C( -812.37) },
{ SIMDE_FLOAT64_C( -634.58), SIMDE_FLOAT64_C( 448.74), SIMDE_FLOAT64_C( -274.19), SIMDE_FLOAT64_C( 768.87),
SIMDE_FLOAT64_C( 123.91), SIMDE_FLOAT64_C( 534.18), SIMDE_FLOAT64_C( -860.86), SIMDE_FLOAT64_C( 929.35) },
{ SIMDE_FLOAT64_C( 532.61), SIMDE_FLOAT64_C( 984.10), SIMDE_FLOAT64_C( 704.01), SIMDE_FLOAT64_C( 1209.19),
SIMDE_FLOAT64_C( 1000.37), SIMDE_FLOAT64_C( 699.10), SIMDE_FLOAT64_C( 863.41), SIMDE_FLOAT64_C( 1234.36) } },
{ { SIMDE_FLOAT64_C( 687.85), SIMDE_FLOAT64_C( 176.08), SIMDE_FLOAT64_C( 449.18), SIMDE_FLOAT64_C( 998.45),
SIMDE_FLOAT64_C( -492.29), SIMDE_FLOAT64_C( 440.66), SIMDE_FLOAT64_C( 531.06), SIMDE_FLOAT64_C( -921.32) },
UINT8_C( 88),
{ SIMDE_FLOAT64_C( 854.03), SIMDE_FLOAT64_C( 961.97), SIMDE_FLOAT64_C( 786.53), SIMDE_FLOAT64_C( -963.25),
SIMDE_FLOAT64_C( -20.20), SIMDE_FLOAT64_C( 714.01), SIMDE_FLOAT64_C( -189.28), SIMDE_FLOAT64_C( 103.97) },
{ SIMDE_FLOAT64_C( -934.41), SIMDE_FLOAT64_C( -256.02), SIMDE_FLOAT64_C( 96.64), SIMDE_FLOAT64_C( -410.23),
SIMDE_FLOAT64_C( 677.63), SIMDE_FLOAT64_C( 284.27), SIMDE_FLOAT64_C( -44.81), SIMDE_FLOAT64_C( 126.37) },
{ SIMDE_FLOAT64_C( 687.85), SIMDE_FLOAT64_C( 176.08), SIMDE_FLOAT64_C( 449.18), SIMDE_FLOAT64_C( 1046.97),
SIMDE_FLOAT64_C( 677.93), SIMDE_FLOAT64_C( 440.66), SIMDE_FLOAT64_C( 194.51), SIMDE_FLOAT64_C( -921.32) } },
{ { SIMDE_FLOAT64_C( -989.92), SIMDE_FLOAT64_C( -275.94), SIMDE_FLOAT64_C( -749.72), SIMDE_FLOAT64_C( 544.27),
SIMDE_FLOAT64_C( -136.80), SIMDE_FLOAT64_C( -820.37), SIMDE_FLOAT64_C( 232.12), SIMDE_FLOAT64_C( -960.72) },
UINT8_C( 98),
{ SIMDE_FLOAT64_C( 230.57), SIMDE_FLOAT64_C( -453.01), SIMDE_FLOAT64_C( 69.47), SIMDE_FLOAT64_C( -238.38),
SIMDE_FLOAT64_C( -374.34), SIMDE_FLOAT64_C( 156.90), SIMDE_FLOAT64_C( -384.35), SIMDE_FLOAT64_C( -412.37) },
{ SIMDE_FLOAT64_C( -56.57), SIMDE_FLOAT64_C( -347.60), SIMDE_FLOAT64_C( 567.43), SIMDE_FLOAT64_C( -342.56),
SIMDE_FLOAT64_C( 463.12), SIMDE_FLOAT64_C( -328.60), SIMDE_FLOAT64_C( -276.97), SIMDE_FLOAT64_C( -792.90) },
{ SIMDE_FLOAT64_C( -989.92), SIMDE_FLOAT64_C( 571.00), SIMDE_FLOAT64_C( -749.72), SIMDE_FLOAT64_C( 544.27),
SIMDE_FLOAT64_C( -136.80), SIMDE_FLOAT64_C( 364.14), SIMDE_FLOAT64_C( 473.75), SIMDE_FLOAT64_C( -960.72) } },
{ { SIMDE_FLOAT64_C( 768.04), SIMDE_FLOAT64_C( 312.80), SIMDE_FLOAT64_C( 884.73), SIMDE_FLOAT64_C( 52.31),
SIMDE_FLOAT64_C( -732.01), SIMDE_FLOAT64_C( 11.11), SIMDE_FLOAT64_C( 62.39), SIMDE_FLOAT64_C( -7.95) },
UINT8_C(156),
{ SIMDE_FLOAT64_C( -393.34), SIMDE_FLOAT64_C( 855.25), SIMDE_FLOAT64_C( 441.02), SIMDE_FLOAT64_C( 838.78),
SIMDE_FLOAT64_C( 894.53), SIMDE_FLOAT64_C( 69.83), SIMDE_FLOAT64_C( 69.35), SIMDE_FLOAT64_C( -558.49) },
{ SIMDE_FLOAT64_C( -860.69), SIMDE_FLOAT64_C( 830.97), SIMDE_FLOAT64_C( 67.18), SIMDE_FLOAT64_C( 296.21),
SIMDE_FLOAT64_C( -553.38), SIMDE_FLOAT64_C( 654.81), SIMDE_FLOAT64_C( -760.36), SIMDE_FLOAT64_C( 99.02) },
{ SIMDE_FLOAT64_C( 768.04), SIMDE_FLOAT64_C( 312.80), SIMDE_FLOAT64_C( 446.11), SIMDE_FLOAT64_C( 889.55),
SIMDE_FLOAT64_C( 1051.86), SIMDE_FLOAT64_C( 11.11), SIMDE_FLOAT64_C( 62.39), SIMDE_FLOAT64_C( 567.20) } },
{ { SIMDE_FLOAT64_C( 222.24), SIMDE_FLOAT64_C( -102.92), SIMDE_FLOAT64_C( -437.85), SIMDE_FLOAT64_C( 893.64),
SIMDE_FLOAT64_C( 620.10), SIMDE_FLOAT64_C( -230.75), SIMDE_FLOAT64_C( 661.68), SIMDE_FLOAT64_C( -67.10) },
UINT8_C( 62),
{ SIMDE_FLOAT64_C( -286.01), SIMDE_FLOAT64_C( 200.89), SIMDE_FLOAT64_C( 665.09), SIMDE_FLOAT64_C( 776.38),
SIMDE_FLOAT64_C( -807.06), SIMDE_FLOAT64_C( -73.52), SIMDE_FLOAT64_C( -616.96), SIMDE_FLOAT64_C( -951.82) },
{ SIMDE_FLOAT64_C( -632.50), SIMDE_FLOAT64_C( -778.18), SIMDE_FLOAT64_C( 942.71), SIMDE_FLOAT64_C( 437.33),
SIMDE_FLOAT64_C( 291.17), SIMDE_FLOAT64_C( -615.78), SIMDE_FLOAT64_C( 576.64), SIMDE_FLOAT64_C( 122.14) },
{ SIMDE_FLOAT64_C( 222.24), SIMDE_FLOAT64_C( 803.69), SIMDE_FLOAT64_C( 1153.71), SIMDE_FLOAT64_C( 891.08),
SIMDE_FLOAT64_C( 857.98), SIMDE_FLOAT64_C( 620.15), SIMDE_FLOAT64_C( 661.68), SIMDE_FLOAT64_C( -67.10) } },
{ { SIMDE_FLOAT64_C( 451.40), SIMDE_FLOAT64_C( -127.16), SIMDE_FLOAT64_C( 568.75), SIMDE_FLOAT64_C( 106.22),
SIMDE_FLOAT64_C( 112.48), SIMDE_FLOAT64_C( -332.22), SIMDE_FLOAT64_C( -671.54), SIMDE_FLOAT64_C( -990.45) },
UINT8_C(133),
{ SIMDE_FLOAT64_C( -777.90), SIMDE_FLOAT64_C( 629.66), SIMDE_FLOAT64_C( 999.17), SIMDE_FLOAT64_C( 883.78),
SIMDE_FLOAT64_C( -437.44), SIMDE_FLOAT64_C( -346.84), SIMDE_FLOAT64_C( -402.24), SIMDE_FLOAT64_C( 763.45) },
{ SIMDE_FLOAT64_C( -681.75), SIMDE_FLOAT64_C( -625.86), SIMDE_FLOAT64_C( 956.39), SIMDE_FLOAT64_C( 244.73),
SIMDE_FLOAT64_C( -242.82), SIMDE_FLOAT64_C( -995.43), SIMDE_FLOAT64_C( 612.23), SIMDE_FLOAT64_C( -21.00) },
{ SIMDE_FLOAT64_C( 1034.37), SIMDE_FLOAT64_C( -127.16), SIMDE_FLOAT64_C( 1383.12), SIMDE_FLOAT64_C( 106.22),
SIMDE_FLOAT64_C( 112.48), SIMDE_FLOAT64_C( -332.22), SIMDE_FLOAT64_C( -671.54), SIMDE_FLOAT64_C( 763.74) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d b = simde_mm512_loadu_pd(test_vec[i].b);
simde__m512d r = simde_mm512_mask_hypot_pd(src, test_vec[i].k, a, b);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_invcbrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -830.78), SIMDE_FLOAT32_C( 407.78), SIMDE_FLOAT32_C( 34.12), SIMDE_FLOAT32_C( -431.04) },
{ SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -0.13) } },
{ { SIMDE_FLOAT32_C( -838.35), SIMDE_FLOAT32_C( -741.30), SIMDE_FLOAT32_C( 354.85), SIMDE_FLOAT32_C( -840.30) },
{ SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.11) } },
{ { SIMDE_FLOAT32_C( -332.67), SIMDE_FLOAT32_C( 463.71), SIMDE_FLOAT32_C( -606.20), SIMDE_FLOAT32_C( -312.79) },
{ SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( -0.15) } },
{ { SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( -815.81), SIMDE_FLOAT32_C( -819.10), SIMDE_FLOAT32_C( -853.90) },
{ SIMDE_FLOAT32_C( -1.55), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.11) } },
{ { SIMDE_FLOAT32_C( -112.18), SIMDE_FLOAT32_C( 14.21), SIMDE_FLOAT32_C( 387.92), SIMDE_FLOAT32_C( -952.65) },
{ SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.10) } },
{ { SIMDE_FLOAT32_C( -492.35), SIMDE_FLOAT32_C( 204.52), SIMDE_FLOAT32_C( -434.43), SIMDE_FLOAT32_C( 455.92) },
{ SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 0.13) } },
{ { SIMDE_FLOAT32_C( -372.57), SIMDE_FLOAT32_C( -697.63), SIMDE_FLOAT32_C( -993.40), SIMDE_FLOAT32_C( 96.43) },
{ SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.22) } },
{ { SIMDE_FLOAT32_C( -450.23), SIMDE_FLOAT32_C( 393.40), SIMDE_FLOAT32_C( 531.72), SIMDE_FLOAT32_C( -281.01) },
{ SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.15) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_invcbrt_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_invcbrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -362.46), SIMDE_FLOAT64_C( 897.33) },
{ SIMDE_FLOAT64_C( -0.14), SIMDE_FLOAT64_C( 0.10) } },
{ { SIMDE_FLOAT64_C( -324.66), SIMDE_FLOAT64_C( -116.25) },
{ SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( -0.20) } },
{ { SIMDE_FLOAT64_C( -229.39), SIMDE_FLOAT64_C( -924.64) },
{ SIMDE_FLOAT64_C( -0.16), SIMDE_FLOAT64_C( -0.10) } },
{ { SIMDE_FLOAT64_C( 619.01), SIMDE_FLOAT64_C( -919.66) },
{ SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( -0.10) } },
{ { SIMDE_FLOAT64_C( -996.99), SIMDE_FLOAT64_C( -352.60) },
{ SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( -0.14) } },
{ { SIMDE_FLOAT64_C( -639.25), SIMDE_FLOAT64_C( 29.93) },
{ SIMDE_FLOAT64_C( -0.12), SIMDE_FLOAT64_C( 0.32) } },
{ { SIMDE_FLOAT64_C( -468.42), SIMDE_FLOAT64_C( 775.98) },
{ SIMDE_FLOAT64_C( -0.13), SIMDE_FLOAT64_C( 0.11) } },
{ { SIMDE_FLOAT64_C( -721.32), SIMDE_FLOAT64_C( 122.22) },
{ SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( 0.20) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_invcbrt_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_invcbrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 91.84), SIMDE_FLOAT32_C( -751.70), SIMDE_FLOAT32_C( 15.02), SIMDE_FLOAT32_C( -388.95),
SIMDE_FLOAT32_C( 99.77), SIMDE_FLOAT32_C( 919.81), SIMDE_FLOAT32_C( 65.75), SIMDE_FLOAT32_C( -859.67) },
{ SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( -0.14),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.11) } },
{ { SIMDE_FLOAT32_C( -294.11), SIMDE_FLOAT32_C( 51.33), SIMDE_FLOAT32_C( -783.32), SIMDE_FLOAT32_C( -179.27),
SIMDE_FLOAT32_C( -759.73), SIMDE_FLOAT32_C( -346.33), SIMDE_FLOAT32_C( 701.43), SIMDE_FLOAT32_C( 29.88) },
{ SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.18),
SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.32) } },
{ { SIMDE_FLOAT32_C( -448.16), SIMDE_FLOAT32_C( -516.54), SIMDE_FLOAT32_C( -452.98), SIMDE_FLOAT32_C( 948.25),
SIMDE_FLOAT32_C( 387.51), SIMDE_FLOAT32_C( 585.82), SIMDE_FLOAT32_C( -920.12), SIMDE_FLOAT32_C( -81.56) },
{ SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 0.10),
SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( -0.23) } },
{ { SIMDE_FLOAT32_C( -341.26), SIMDE_FLOAT32_C( -436.41), SIMDE_FLOAT32_C( 422.76), SIMDE_FLOAT32_C( -782.86),
SIMDE_FLOAT32_C( -131.30), SIMDE_FLOAT32_C( -313.86), SIMDE_FLOAT32_C( 339.30), SIMDE_FLOAT32_C( 960.53) },
{ SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( -0.11),
SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.10) } },
{ { SIMDE_FLOAT32_C( -65.56), SIMDE_FLOAT32_C( -645.68), SIMDE_FLOAT32_C( -428.41), SIMDE_FLOAT32_C( -965.79),
SIMDE_FLOAT32_C( -725.86), SIMDE_FLOAT32_C( 637.33), SIMDE_FLOAT32_C( -825.46), SIMDE_FLOAT32_C( -19.97) },
{ SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( -0.10),
SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.37) } },
{ { SIMDE_FLOAT32_C( -311.34), SIMDE_FLOAT32_C( -608.78), SIMDE_FLOAT32_C( 800.75), SIMDE_FLOAT32_C( -71.07),
SIMDE_FLOAT32_C( 44.89), SIMDE_FLOAT32_C( 502.19), SIMDE_FLOAT32_C( 958.81), SIMDE_FLOAT32_C( 596.72) },
{ SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.24),
SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.12) } },
{ { SIMDE_FLOAT32_C( 985.65), SIMDE_FLOAT32_C( -494.17), SIMDE_FLOAT32_C( 544.98), SIMDE_FLOAT32_C( 373.15),
SIMDE_FLOAT32_C( -908.35), SIMDE_FLOAT32_C( 624.86), SIMDE_FLOAT32_C( -708.41), SIMDE_FLOAT32_C( -249.62) },
{ SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.14),
SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.16) } },
{ { SIMDE_FLOAT32_C( -811.55), SIMDE_FLOAT32_C( 714.36), SIMDE_FLOAT32_C( -32.48), SIMDE_FLOAT32_C( 57.15),
SIMDE_FLOAT32_C( -599.50), SIMDE_FLOAT32_C( -693.18), SIMDE_FLOAT32_C( 17.68), SIMDE_FLOAT32_C( 334.94) },
{ SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.26),
SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.14) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_invcbrt_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_invcbrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -253.42), SIMDE_FLOAT64_C( -775.86), SIMDE_FLOAT64_C( 7.55), SIMDE_FLOAT64_C( 246.09) },
{ SIMDE_FLOAT64_C( -0.16), SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 0.16) } },
{ { SIMDE_FLOAT64_C( -201.99), SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( -584.03), SIMDE_FLOAT64_C( -671.92) },
{ SIMDE_FLOAT64_C( -0.17), SIMDE_FLOAT64_C( 1.22), SIMDE_FLOAT64_C( -0.12), SIMDE_FLOAT64_C( -0.11) } },
{ { SIMDE_FLOAT64_C( 851.57), SIMDE_FLOAT64_C( 459.01), SIMDE_FLOAT64_C( 394.56), SIMDE_FLOAT64_C( 866.29) },
{ SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.10) } },
{ { SIMDE_FLOAT64_C( 645.75), SIMDE_FLOAT64_C( 575.99), SIMDE_FLOAT64_C( 41.51), SIMDE_FLOAT64_C( -177.11) },
{ SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( -0.18) } },
{ { SIMDE_FLOAT64_C( -632.82), SIMDE_FLOAT64_C( 815.53), SIMDE_FLOAT64_C( -21.43), SIMDE_FLOAT64_C( -406.93) },
{ SIMDE_FLOAT64_C( -0.12), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( -0.13) } },
{ { SIMDE_FLOAT64_C( 471.99), SIMDE_FLOAT64_C( -996.82), SIMDE_FLOAT64_C( -716.04), SIMDE_FLOAT64_C( -550.05) },
{ SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( -0.12) } },
{ { SIMDE_FLOAT64_C( 564.26), SIMDE_FLOAT64_C( -164.60), SIMDE_FLOAT64_C( -303.42), SIMDE_FLOAT64_C( -304.34) },
{ SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( -0.18), SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( -0.15) } },
{ { SIMDE_FLOAT64_C( 749.99), SIMDE_FLOAT64_C( 564.62), SIMDE_FLOAT64_C( -957.88), SIMDE_FLOAT64_C( -503.43) },
{ SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( -0.13) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_invcbrt_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_invsqrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 963.10), SIMDE_FLOAT32_C( 544.41), SIMDE_FLOAT32_C( 741.04), SIMDE_FLOAT32_C( 478.93) },
{ SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05) } },
{ { SIMDE_FLOAT32_C( 289.81), SIMDE_FLOAT32_C( 489.84), SIMDE_FLOAT32_C( 576.93), SIMDE_FLOAT32_C( 960.27) },
{ SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.03) } },
{ { SIMDE_FLOAT32_C( 308.08), SIMDE_FLOAT32_C( 66.08), SIMDE_FLOAT32_C( 486.27), SIMDE_FLOAT32_C( 318.16) },
{ SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.06) } },
{ { SIMDE_FLOAT32_C( 848.25), SIMDE_FLOAT32_C( 887.84), SIMDE_FLOAT32_C( 814.84), SIMDE_FLOAT32_C( 533.08) },
{ SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04) } },
{ { SIMDE_FLOAT32_C( 476.90), SIMDE_FLOAT32_C( 887.49), SIMDE_FLOAT32_C( 751.34), SIMDE_FLOAT32_C( 508.49) },
{ SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04) } },
{ { SIMDE_FLOAT32_C( 679.70), SIMDE_FLOAT32_C( 603.84), SIMDE_FLOAT32_C( 905.34), SIMDE_FLOAT32_C( 39.88) },
{ SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.16) } },
{ { SIMDE_FLOAT32_C( 629.17), SIMDE_FLOAT32_C( 401.81), SIMDE_FLOAT32_C( 823.42), SIMDE_FLOAT32_C( 435.02) },
{ SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.05) } },
{ { SIMDE_FLOAT32_C( 727.18), SIMDE_FLOAT32_C( 800.47), SIMDE_FLOAT32_C( 32.70), SIMDE_FLOAT32_C( 690.28) },
{ SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.04) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_invsqrt_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_invsqrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 387.27), SIMDE_FLOAT64_C( 266.58) },
{ SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.06) } },
{ { SIMDE_FLOAT64_C( 629.96), SIMDE_FLOAT64_C( 591.67) },
{ SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.04) } },
{ { SIMDE_FLOAT64_C( 185.36), SIMDE_FLOAT64_C( 529.90) },
{ SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.04) } },
{ { SIMDE_FLOAT64_C( 429.91), SIMDE_FLOAT64_C( 539.03) },
{ SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.04) } },
{ { SIMDE_FLOAT64_C( 626.90), SIMDE_FLOAT64_C( 833.69) },
{ SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.03) } },
{ { SIMDE_FLOAT64_C( 722.07), SIMDE_FLOAT64_C( 296.55) },
{ SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.06) } },
{ { SIMDE_FLOAT64_C( 474.49), SIMDE_FLOAT64_C( 271.22) },
{ SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.06) } },
{ { SIMDE_FLOAT64_C( 980.81), SIMDE_FLOAT64_C( 981.24) },
{ SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.03) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_invsqrt_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_invsqrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 523.53), SIMDE_FLOAT32_C( 456.96), SIMDE_FLOAT32_C( 204.64), SIMDE_FLOAT32_C( 395.38),
SIMDE_FLOAT32_C( 112.91), SIMDE_FLOAT32_C( 473.53), SIMDE_FLOAT32_C( 965.22), SIMDE_FLOAT32_C( 423.85) },
{ SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.05) } },
{ { SIMDE_FLOAT32_C( 834.19), SIMDE_FLOAT32_C( 352.97), SIMDE_FLOAT32_C( 156.12), SIMDE_FLOAT32_C( 635.31),
SIMDE_FLOAT32_C( 962.63), SIMDE_FLOAT32_C( 823.80), SIMDE_FLOAT32_C( 454.23), SIMDE_FLOAT32_C( 413.73) },
{ SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.05) } },
{ { SIMDE_FLOAT32_C( 443.70), SIMDE_FLOAT32_C( 770.20), SIMDE_FLOAT32_C( 506.36), SIMDE_FLOAT32_C( 13.18),
SIMDE_FLOAT32_C( 957.34), SIMDE_FLOAT32_C( 388.10), SIMDE_FLOAT32_C( 124.63), SIMDE_FLOAT32_C( 5.64) },
{ SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.28),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.42) } },
{ { SIMDE_FLOAT32_C( 141.65), SIMDE_FLOAT32_C( 772.61), SIMDE_FLOAT32_C( 451.36), SIMDE_FLOAT32_C( 350.31),
SIMDE_FLOAT32_C( 74.48), SIMDE_FLOAT32_C( 384.43), SIMDE_FLOAT32_C( 380.41), SIMDE_FLOAT32_C( 598.01) },
{ SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.04) } },
{ { SIMDE_FLOAT32_C( 841.39), SIMDE_FLOAT32_C( 585.05), SIMDE_FLOAT32_C( 993.40), SIMDE_FLOAT32_C( 954.30),
SIMDE_FLOAT32_C( 58.58), SIMDE_FLOAT32_C( 958.61), SIMDE_FLOAT32_C( 378.15), SIMDE_FLOAT32_C( 892.77) },
{ SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.03) } },
{ { SIMDE_FLOAT32_C( 311.58), SIMDE_FLOAT32_C( 534.27), SIMDE_FLOAT32_C( 528.07), SIMDE_FLOAT32_C( 274.21),
SIMDE_FLOAT32_C( 358.06), SIMDE_FLOAT32_C( 982.30), SIMDE_FLOAT32_C( 687.94), SIMDE_FLOAT32_C( 801.76) },
{ SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04) } },
{ { SIMDE_FLOAT32_C( 752.50), SIMDE_FLOAT32_C( 194.30), SIMDE_FLOAT32_C( 814.95), SIMDE_FLOAT32_C( 709.84),
SIMDE_FLOAT32_C( 582.40), SIMDE_FLOAT32_C( 939.58), SIMDE_FLOAT32_C( 715.48), SIMDE_FLOAT32_C( 724.05) },
{ SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04) } },
{ { SIMDE_FLOAT32_C( 712.19), SIMDE_FLOAT32_C( 166.84), SIMDE_FLOAT32_C( 74.36), SIMDE_FLOAT32_C( 786.67),
SIMDE_FLOAT32_C( 551.27), SIMDE_FLOAT32_C( 454.77), SIMDE_FLOAT32_C( 384.69), SIMDE_FLOAT32_C( 392.66) },
{ SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.05) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_invsqrt_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_invsqrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 35.16), SIMDE_FLOAT64_C( 340.96), SIMDE_FLOAT64_C( 60.32), SIMDE_FLOAT64_C( 560.44) },
{ SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( 0.04) } },
{ { SIMDE_FLOAT64_C( 259.52), SIMDE_FLOAT64_C( 415.50), SIMDE_FLOAT64_C( 716.63), SIMDE_FLOAT64_C( 444.07) },
{ SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.05) } },
{ { SIMDE_FLOAT64_C( 714.85), SIMDE_FLOAT64_C( 53.22), SIMDE_FLOAT64_C( 199.06), SIMDE_FLOAT64_C( 714.03) },
{ SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.04) } },
{ { SIMDE_FLOAT64_C( 807.60), SIMDE_FLOAT64_C( 19.21), SIMDE_FLOAT64_C( 401.27), SIMDE_FLOAT64_C( 275.62) },
{ SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.06) } },
{ { SIMDE_FLOAT64_C( 69.48), SIMDE_FLOAT64_C( 716.42), SIMDE_FLOAT64_C( 754.51), SIMDE_FLOAT64_C( 517.80) },
{ SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.04) } },
{ { SIMDE_FLOAT64_C( 294.75), SIMDE_FLOAT64_C( 671.92), SIMDE_FLOAT64_C( 712.33), SIMDE_FLOAT64_C( 826.45) },
{ SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.03) } },
{ { SIMDE_FLOAT64_C( 47.66), SIMDE_FLOAT64_C( 965.47), SIMDE_FLOAT64_C( 318.45), SIMDE_FLOAT64_C( 190.50) },
{ SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 0.07) } },
{ { SIMDE_FLOAT64_C( 58.25), SIMDE_FLOAT64_C( 429.76), SIMDE_FLOAT64_C( 771.19), SIMDE_FLOAT64_C( 93.42) },
{ SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.10) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_invsqrt_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_invsqrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 22.96), SIMDE_FLOAT32_C( 915.74), SIMDE_FLOAT32_C( 22.13), SIMDE_FLOAT32_C( 201.67),
SIMDE_FLOAT32_C( 223.81), SIMDE_FLOAT32_C( 949.13), SIMDE_FLOAT32_C( 18.28), SIMDE_FLOAT32_C( 237.29),
SIMDE_FLOAT32_C( 95.68), SIMDE_FLOAT32_C( 358.07), SIMDE_FLOAT32_C( 974.18), SIMDE_FLOAT32_C( 343.28),
SIMDE_FLOAT32_C( 900.66), SIMDE_FLOAT32_C( 905.83), SIMDE_FLOAT32_C( 810.45), SIMDE_FLOAT32_C( 409.74) },
{ SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.07),
SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05) } },
{ { SIMDE_FLOAT32_C( 332.59), SIMDE_FLOAT32_C( 299.68), SIMDE_FLOAT32_C( 414.08), SIMDE_FLOAT32_C( 229.81),
SIMDE_FLOAT32_C( 905.70), SIMDE_FLOAT32_C( 204.12), SIMDE_FLOAT32_C( 480.98), SIMDE_FLOAT32_C( 846.82),
SIMDE_FLOAT32_C( 367.27), SIMDE_FLOAT32_C( 670.54), SIMDE_FLOAT32_C( 936.86), SIMDE_FLOAT32_C( 972.95),
SIMDE_FLOAT32_C( 695.70), SIMDE_FLOAT32_C( 781.82), SIMDE_FLOAT32_C( 825.14), SIMDE_FLOAT32_C( 718.66) },
{ SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.07),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04) } },
{ { SIMDE_FLOAT32_C( 697.56), SIMDE_FLOAT32_C( 847.27), SIMDE_FLOAT32_C( 920.33), SIMDE_FLOAT32_C( 921.36),
SIMDE_FLOAT32_C( 796.40), SIMDE_FLOAT32_C( 938.61), SIMDE_FLOAT32_C( 158.65), SIMDE_FLOAT32_C( 892.08),
SIMDE_FLOAT32_C( 296.69), SIMDE_FLOAT32_C( 132.83), SIMDE_FLOAT32_C( 235.36), SIMDE_FLOAT32_C( 197.35),
SIMDE_FLOAT32_C( 38.67), SIMDE_FLOAT32_C( 45.81), SIMDE_FLOAT32_C( 607.10), SIMDE_FLOAT32_C( 371.26) },
{ SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.07),
SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05) } },
{ { SIMDE_FLOAT32_C( 345.49), SIMDE_FLOAT32_C( 21.18), SIMDE_FLOAT32_C( 601.07), SIMDE_FLOAT32_C( 251.19),
SIMDE_FLOAT32_C( 225.29), SIMDE_FLOAT32_C( 82.05), SIMDE_FLOAT32_C( 98.01), SIMDE_FLOAT32_C( 592.56),
SIMDE_FLOAT32_C( 752.59), SIMDE_FLOAT32_C( 34.87), SIMDE_FLOAT32_C( 565.51), SIMDE_FLOAT32_C( 448.29),
SIMDE_FLOAT32_C( 816.69), SIMDE_FLOAT32_C( 390.65), SIMDE_FLOAT32_C( 166.96), SIMDE_FLOAT32_C( 514.24) },
{ SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.04) } },
{ { SIMDE_FLOAT32_C( 237.92), SIMDE_FLOAT32_C( 87.29), SIMDE_FLOAT32_C( 435.61), SIMDE_FLOAT32_C( 34.32),
SIMDE_FLOAT32_C( 25.90), SIMDE_FLOAT32_C( 594.25), SIMDE_FLOAT32_C( 926.40), SIMDE_FLOAT32_C( 322.59),
SIMDE_FLOAT32_C( 727.09), SIMDE_FLOAT32_C( 161.76), SIMDE_FLOAT32_C( 519.95), SIMDE_FLOAT32_C( 765.75),
SIMDE_FLOAT32_C( 207.57), SIMDE_FLOAT32_C( 127.04), SIMDE_FLOAT32_C( 137.01), SIMDE_FLOAT32_C( 553.06) },
{ SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.17),
SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.04) } },
{ { SIMDE_FLOAT32_C( 148.22), SIMDE_FLOAT32_C( 738.08), SIMDE_FLOAT32_C( 804.24), SIMDE_FLOAT32_C( 373.51),
SIMDE_FLOAT32_C( 820.13), SIMDE_FLOAT32_C( 902.25), SIMDE_FLOAT32_C( 966.07), SIMDE_FLOAT32_C( 572.72),
SIMDE_FLOAT32_C( 937.12), SIMDE_FLOAT32_C( 531.58), SIMDE_FLOAT32_C( 21.01), SIMDE_FLOAT32_C( 753.81),
SIMDE_FLOAT32_C( 922.24), SIMDE_FLOAT32_C( 187.97), SIMDE_FLOAT32_C( 268.05), SIMDE_FLOAT32_C( 160.16) },
{ SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.08) } },
{ { SIMDE_FLOAT32_C( 275.26), SIMDE_FLOAT32_C( 703.65), SIMDE_FLOAT32_C( 194.48), SIMDE_FLOAT32_C( 301.16),
SIMDE_FLOAT32_C( 297.91), SIMDE_FLOAT32_C( 120.89), SIMDE_FLOAT32_C( 623.76), SIMDE_FLOAT32_C( 25.00),
SIMDE_FLOAT32_C( 282.65), SIMDE_FLOAT32_C( 143.70), SIMDE_FLOAT32_C( 790.75), SIMDE_FLOAT32_C( 490.22),
SIMDE_FLOAT32_C( 270.74), SIMDE_FLOAT32_C( 927.76), SIMDE_FLOAT32_C( 43.28), SIMDE_FLOAT32_C( 418.96) },
{ SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.20),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 0.05) } },
{ { SIMDE_FLOAT32_C( 665.84), SIMDE_FLOAT32_C( 847.52), SIMDE_FLOAT32_C( 792.47), SIMDE_FLOAT32_C( 485.97),
SIMDE_FLOAT32_C( 749.77), SIMDE_FLOAT32_C( 758.54), SIMDE_FLOAT32_C( 58.69), SIMDE_FLOAT32_C( 686.89),
SIMDE_FLOAT32_C( 290.13), SIMDE_FLOAT32_C( 79.70), SIMDE_FLOAT32_C( 440.70), SIMDE_FLOAT32_C( 212.36),
SIMDE_FLOAT32_C( 267.67), SIMDE_FLOAT32_C( 708.75), SIMDE_FLOAT32_C( 372.52), SIMDE_FLOAT32_C( 542.93) },
{ SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.07),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.04) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_invsqrt_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_invsqrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 624.14), SIMDE_FLOAT32_C( 819.60), SIMDE_FLOAT32_C( 672.51), SIMDE_FLOAT32_C( 550.11),
SIMDE_FLOAT32_C( 812.34), SIMDE_FLOAT32_C( 166.77), SIMDE_FLOAT32_C( 70.17), SIMDE_FLOAT32_C( 377.64),
SIMDE_FLOAT32_C( 183.00), SIMDE_FLOAT32_C( 818.17), SIMDE_FLOAT32_C( 404.48), SIMDE_FLOAT32_C( 187.86),
SIMDE_FLOAT32_C( 392.86), SIMDE_FLOAT32_C( 212.92), SIMDE_FLOAT32_C( 867.57), SIMDE_FLOAT32_C( 410.64) },
UINT8_C( 3),
{ SIMDE_FLOAT32_C( 33.63), SIMDE_FLOAT32_C( 77.51), SIMDE_FLOAT32_C( 932.62), SIMDE_FLOAT32_C( 356.45),
SIMDE_FLOAT32_C( 533.80), SIMDE_FLOAT32_C( 680.31), SIMDE_FLOAT32_C( 975.45), SIMDE_FLOAT32_C( 578.12),
SIMDE_FLOAT32_C( 558.84), SIMDE_FLOAT32_C( 281.04), SIMDE_FLOAT32_C( 747.18), SIMDE_FLOAT32_C( 909.72),
SIMDE_FLOAT32_C( 312.02), SIMDE_FLOAT32_C( 748.71), SIMDE_FLOAT32_C( 533.86), SIMDE_FLOAT32_C( 131.63) },
{ SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 672.51), SIMDE_FLOAT32_C( 550.11),
SIMDE_FLOAT32_C( 812.34), SIMDE_FLOAT32_C( 166.77), SIMDE_FLOAT32_C( 70.17), SIMDE_FLOAT32_C( 377.64),
SIMDE_FLOAT32_C( 183.00), SIMDE_FLOAT32_C( 818.17), SIMDE_FLOAT32_C( 404.48), SIMDE_FLOAT32_C( 187.86),
SIMDE_FLOAT32_C( 392.86), SIMDE_FLOAT32_C( 212.92), SIMDE_FLOAT32_C( 867.57), SIMDE_FLOAT32_C( 410.64) } },
{ { SIMDE_FLOAT32_C( 421.22), SIMDE_FLOAT32_C( 83.97), SIMDE_FLOAT32_C( 943.97), SIMDE_FLOAT32_C( 587.99),
SIMDE_FLOAT32_C( 154.14), SIMDE_FLOAT32_C( 321.61), SIMDE_FLOAT32_C( 770.98), SIMDE_FLOAT32_C( 972.32),
SIMDE_FLOAT32_C( 726.09), SIMDE_FLOAT32_C( 958.84), SIMDE_FLOAT32_C( 365.17), SIMDE_FLOAT32_C( 939.01),
SIMDE_FLOAT32_C( 826.41), SIMDE_FLOAT32_C( 775.81), SIMDE_FLOAT32_C( 236.82), SIMDE_FLOAT32_C( 860.05) },
UINT8_C( 38),
{ SIMDE_FLOAT32_C( 169.44), SIMDE_FLOAT32_C( 216.49), SIMDE_FLOAT32_C( 387.13), SIMDE_FLOAT32_C( 849.74),
SIMDE_FLOAT32_C( 191.94), SIMDE_FLOAT32_C( 965.24), SIMDE_FLOAT32_C( 408.58), SIMDE_FLOAT32_C( 472.98),
SIMDE_FLOAT32_C( 712.43), SIMDE_FLOAT32_C( 318.30), SIMDE_FLOAT32_C( 785.00), SIMDE_FLOAT32_C( 461.13),
SIMDE_FLOAT32_C( 852.16), SIMDE_FLOAT32_C( 916.63), SIMDE_FLOAT32_C( 882.35), SIMDE_FLOAT32_C( 936.13) },
{ SIMDE_FLOAT32_C( 421.22), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 587.99),
SIMDE_FLOAT32_C( 154.14), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 770.98), SIMDE_FLOAT32_C( 972.32),
SIMDE_FLOAT32_C( 726.09), SIMDE_FLOAT32_C( 958.84), SIMDE_FLOAT32_C( 365.17), SIMDE_FLOAT32_C( 939.01),
SIMDE_FLOAT32_C( 826.41), SIMDE_FLOAT32_C( 775.81), SIMDE_FLOAT32_C( 236.82), SIMDE_FLOAT32_C( 860.05) } },
{ { SIMDE_FLOAT32_C( 860.60), SIMDE_FLOAT32_C( 470.34), SIMDE_FLOAT32_C( 90.27), SIMDE_FLOAT32_C( 182.21),
SIMDE_FLOAT32_C( 241.32), SIMDE_FLOAT32_C( 62.59), SIMDE_FLOAT32_C( 908.29), SIMDE_FLOAT32_C( 200.16),
SIMDE_FLOAT32_C( 427.77), SIMDE_FLOAT32_C( 847.30), SIMDE_FLOAT32_C( 26.58), SIMDE_FLOAT32_C( 203.58),
SIMDE_FLOAT32_C( 84.12), SIMDE_FLOAT32_C( 886.63), SIMDE_FLOAT32_C( 56.91), SIMDE_FLOAT32_C( 253.56) },
UINT8_C( 27),
{ SIMDE_FLOAT32_C( 444.03), SIMDE_FLOAT32_C( 103.30), SIMDE_FLOAT32_C( 295.06), SIMDE_FLOAT32_C( 409.28),
SIMDE_FLOAT32_C( 511.88), SIMDE_FLOAT32_C( 768.04), SIMDE_FLOAT32_C( 121.70), SIMDE_FLOAT32_C( 830.18),
SIMDE_FLOAT32_C( 553.04), SIMDE_FLOAT32_C( 582.83), SIMDE_FLOAT32_C( 682.34), SIMDE_FLOAT32_C( 469.67),
SIMDE_FLOAT32_C( 465.19), SIMDE_FLOAT32_C( 618.47), SIMDE_FLOAT32_C( 330.27), SIMDE_FLOAT32_C( 935.53) },
{ SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 90.27), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 62.59), SIMDE_FLOAT32_C( 908.29), SIMDE_FLOAT32_C( 200.16),
SIMDE_FLOAT32_C( 427.77), SIMDE_FLOAT32_C( 847.30), SIMDE_FLOAT32_C( 26.58), SIMDE_FLOAT32_C( 203.58),
SIMDE_FLOAT32_C( 84.12), SIMDE_FLOAT32_C( 886.63), SIMDE_FLOAT32_C( 56.91), SIMDE_FLOAT32_C( 253.56) } },
{ { SIMDE_FLOAT32_C( 708.74), SIMDE_FLOAT32_C( 512.48), SIMDE_FLOAT32_C( 176.85), SIMDE_FLOAT32_C( 771.33),
SIMDE_FLOAT32_C( 420.77), SIMDE_FLOAT32_C( 377.02), SIMDE_FLOAT32_C( 199.10), SIMDE_FLOAT32_C( 268.07),
SIMDE_FLOAT32_C( 403.59), SIMDE_FLOAT32_C( 402.68), SIMDE_FLOAT32_C( 352.19), SIMDE_FLOAT32_C( 290.22),
SIMDE_FLOAT32_C( 459.59), SIMDE_FLOAT32_C( 605.74), SIMDE_FLOAT32_C( 393.34), SIMDE_FLOAT32_C( 903.62) },
UINT8_C( 7),
{ SIMDE_FLOAT32_C( 688.40), SIMDE_FLOAT32_C( 312.89), SIMDE_FLOAT32_C( 220.93), SIMDE_FLOAT32_C( 456.44),
SIMDE_FLOAT32_C( 434.59), SIMDE_FLOAT32_C( 51.11), SIMDE_FLOAT32_C( 9.48), SIMDE_FLOAT32_C( 17.43),
SIMDE_FLOAT32_C( 733.45), SIMDE_FLOAT32_C( 479.15), SIMDE_FLOAT32_C( 482.62), SIMDE_FLOAT32_C( 351.92),
SIMDE_FLOAT32_C( 809.42), SIMDE_FLOAT32_C( 418.14), SIMDE_FLOAT32_C( 60.66), SIMDE_FLOAT32_C( 321.90) },
{ SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 771.33),
SIMDE_FLOAT32_C( 420.77), SIMDE_FLOAT32_C( 377.02), SIMDE_FLOAT32_C( 199.10), SIMDE_FLOAT32_C( 268.07),
SIMDE_FLOAT32_C( 403.59), SIMDE_FLOAT32_C( 402.68), SIMDE_FLOAT32_C( 352.19), SIMDE_FLOAT32_C( 290.22),
SIMDE_FLOAT32_C( 459.59), SIMDE_FLOAT32_C( 605.74), SIMDE_FLOAT32_C( 393.34), SIMDE_FLOAT32_C( 903.62) } },
{ { SIMDE_FLOAT32_C( 594.99), SIMDE_FLOAT32_C( 832.00), SIMDE_FLOAT32_C( 742.67), SIMDE_FLOAT32_C( 972.01),
SIMDE_FLOAT32_C( 31.10), SIMDE_FLOAT32_C( 10.74), SIMDE_FLOAT32_C( 375.60), SIMDE_FLOAT32_C( 433.77),
SIMDE_FLOAT32_C( 362.92), SIMDE_FLOAT32_C( 665.82), SIMDE_FLOAT32_C( 893.36), SIMDE_FLOAT32_C( 968.67),
SIMDE_FLOAT32_C( 59.16), SIMDE_FLOAT32_C( 796.98), SIMDE_FLOAT32_C( 677.71), SIMDE_FLOAT32_C( 747.56) },
UINT8_C(104),
{ SIMDE_FLOAT32_C( 898.63), SIMDE_FLOAT32_C( 203.99), SIMDE_FLOAT32_C( 544.46), SIMDE_FLOAT32_C( 949.74),
SIMDE_FLOAT32_C( 213.47), SIMDE_FLOAT32_C( 561.89), SIMDE_FLOAT32_C( 683.19), SIMDE_FLOAT32_C( 692.63),
SIMDE_FLOAT32_C( 44.51), SIMDE_FLOAT32_C( 35.11), SIMDE_FLOAT32_C( 502.05), SIMDE_FLOAT32_C( 462.65),
SIMDE_FLOAT32_C( 95.77), SIMDE_FLOAT32_C( 823.95), SIMDE_FLOAT32_C( 57.64), SIMDE_FLOAT32_C( 927.76) },
{ SIMDE_FLOAT32_C( 594.99), SIMDE_FLOAT32_C( 832.00), SIMDE_FLOAT32_C( 742.67), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( 31.10), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 433.77),
SIMDE_FLOAT32_C( 362.92), SIMDE_FLOAT32_C( 665.82), SIMDE_FLOAT32_C( 893.36), SIMDE_FLOAT32_C( 968.67),
SIMDE_FLOAT32_C( 59.16), SIMDE_FLOAT32_C( 796.98), SIMDE_FLOAT32_C( 677.71), SIMDE_FLOAT32_C( 747.56) } },
{ { SIMDE_FLOAT32_C( 566.62), SIMDE_FLOAT32_C( 29.65), SIMDE_FLOAT32_C( 958.86), SIMDE_FLOAT32_C( 577.36),
SIMDE_FLOAT32_C( 405.26), SIMDE_FLOAT32_C( 392.63), SIMDE_FLOAT32_C( 940.29), SIMDE_FLOAT32_C( 71.08),
SIMDE_FLOAT32_C( 285.99), SIMDE_FLOAT32_C( 908.95), SIMDE_FLOAT32_C( 130.24), SIMDE_FLOAT32_C( 82.97),
SIMDE_FLOAT32_C( 586.66), SIMDE_FLOAT32_C( 877.80), SIMDE_FLOAT32_C( 192.84), SIMDE_FLOAT32_C( 485.30) },
UINT8_C( 59),
{ SIMDE_FLOAT32_C( 737.31), SIMDE_FLOAT32_C( 435.04), SIMDE_FLOAT32_C( 295.27), SIMDE_FLOAT32_C( 299.20),
SIMDE_FLOAT32_C( 118.23), SIMDE_FLOAT32_C( 987.89), SIMDE_FLOAT32_C( 343.70), SIMDE_FLOAT32_C( 153.34),
SIMDE_FLOAT32_C( 489.94), SIMDE_FLOAT32_C( 806.35), SIMDE_FLOAT32_C( 249.11), SIMDE_FLOAT32_C( 313.90),
SIMDE_FLOAT32_C( 864.00), SIMDE_FLOAT32_C( 176.87), SIMDE_FLOAT32_C( 880.52), SIMDE_FLOAT32_C( 893.65) },
{ SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 958.86), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 940.29), SIMDE_FLOAT32_C( 71.08),
SIMDE_FLOAT32_C( 285.99), SIMDE_FLOAT32_C( 908.95), SIMDE_FLOAT32_C( 130.24), SIMDE_FLOAT32_C( 82.97),
SIMDE_FLOAT32_C( 586.66), SIMDE_FLOAT32_C( 877.80), SIMDE_FLOAT32_C( 192.84), SIMDE_FLOAT32_C( 485.30) } },
{ { SIMDE_FLOAT32_C( 135.73), SIMDE_FLOAT32_C( 457.88), SIMDE_FLOAT32_C( 298.91), SIMDE_FLOAT32_C( 528.36),
SIMDE_FLOAT32_C( 398.17), SIMDE_FLOAT32_C( 369.99), SIMDE_FLOAT32_C( 814.36), SIMDE_FLOAT32_C( 307.12),
SIMDE_FLOAT32_C( 500.23), SIMDE_FLOAT32_C( 897.33), SIMDE_FLOAT32_C( 893.78), SIMDE_FLOAT32_C( 378.03),
SIMDE_FLOAT32_C( 90.17), SIMDE_FLOAT32_C( 379.08), SIMDE_FLOAT32_C( 459.82), SIMDE_FLOAT32_C( 827.48) },
UINT8_C(163),
{ SIMDE_FLOAT32_C( 755.09), SIMDE_FLOAT32_C( 126.67), SIMDE_FLOAT32_C( 932.35), SIMDE_FLOAT32_C( 742.98),
SIMDE_FLOAT32_C( 470.38), SIMDE_FLOAT32_C( 85.68), SIMDE_FLOAT32_C( 232.93), SIMDE_FLOAT32_C( 276.73),
SIMDE_FLOAT32_C( 334.79), SIMDE_FLOAT32_C( 546.82), SIMDE_FLOAT32_C( 140.73), SIMDE_FLOAT32_C( 511.66),
SIMDE_FLOAT32_C( 427.34), SIMDE_FLOAT32_C( 34.38), SIMDE_FLOAT32_C( 647.39), SIMDE_FLOAT32_C( 885.22) },
{ SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 298.91), SIMDE_FLOAT32_C( 528.36),
SIMDE_FLOAT32_C( 398.17), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 814.36), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 500.23), SIMDE_FLOAT32_C( 897.33), SIMDE_FLOAT32_C( 893.78), SIMDE_FLOAT32_C( 378.03),
SIMDE_FLOAT32_C( 90.17), SIMDE_FLOAT32_C( 379.08), SIMDE_FLOAT32_C( 459.82), SIMDE_FLOAT32_C( 827.48) } },
{ { SIMDE_FLOAT32_C( 333.29), SIMDE_FLOAT32_C( 175.75), SIMDE_FLOAT32_C( 283.39), SIMDE_FLOAT32_C( 703.28),
SIMDE_FLOAT32_C( 990.11), SIMDE_FLOAT32_C( 590.51), SIMDE_FLOAT32_C( 203.51), SIMDE_FLOAT32_C( 887.44),
SIMDE_FLOAT32_C( 484.30), SIMDE_FLOAT32_C( 581.54), SIMDE_FLOAT32_C( 977.62), SIMDE_FLOAT32_C( 863.38),
SIMDE_FLOAT32_C( 41.36), SIMDE_FLOAT32_C( 805.09), SIMDE_FLOAT32_C( 677.49), SIMDE_FLOAT32_C( 796.45) },
UINT8_C(166),
{ SIMDE_FLOAT32_C( 609.84), SIMDE_FLOAT32_C( 539.43), SIMDE_FLOAT32_C( 402.14), SIMDE_FLOAT32_C( 695.53),
SIMDE_FLOAT32_C( 772.36), SIMDE_FLOAT32_C( 678.87), SIMDE_FLOAT32_C( 30.32), SIMDE_FLOAT32_C( 319.18),
SIMDE_FLOAT32_C( 819.60), SIMDE_FLOAT32_C( 541.97), SIMDE_FLOAT32_C( 746.52), SIMDE_FLOAT32_C( 853.98),
SIMDE_FLOAT32_C( 189.36), SIMDE_FLOAT32_C( 631.74), SIMDE_FLOAT32_C( 187.26), SIMDE_FLOAT32_C( 365.12) },
{ SIMDE_FLOAT32_C( 333.29), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 703.28),
SIMDE_FLOAT32_C( 990.11), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 203.51), SIMDE_FLOAT32_C( 0.06),
SIMDE_FLOAT32_C( 484.30), SIMDE_FLOAT32_C( 581.54), SIMDE_FLOAT32_C( 977.62), SIMDE_FLOAT32_C( 863.38),
SIMDE_FLOAT32_C( 41.36), SIMDE_FLOAT32_C( 805.09), SIMDE_FLOAT32_C( 677.49), SIMDE_FLOAT32_C( 796.45) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_invsqrt_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_invsqrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 38.73), SIMDE_FLOAT64_C( 19.20), SIMDE_FLOAT64_C( 260.68), SIMDE_FLOAT64_C( 258.52),
SIMDE_FLOAT64_C( 136.00), SIMDE_FLOAT64_C( 121.97), SIMDE_FLOAT64_C( 936.95), SIMDE_FLOAT64_C( 333.67) },
{ SIMDE_FLOAT64_C( 0.16), SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 0.06),
SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.05) } },
{ { SIMDE_FLOAT64_C( 609.86), SIMDE_FLOAT64_C( 837.14), SIMDE_FLOAT64_C( 372.68), SIMDE_FLOAT64_C( 549.80),
SIMDE_FLOAT64_C( 402.57), SIMDE_FLOAT64_C( 960.80), SIMDE_FLOAT64_C( 489.90), SIMDE_FLOAT64_C( 885.65) },
{ SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.03) } },
{ { SIMDE_FLOAT64_C( 875.53), SIMDE_FLOAT64_C( 411.92), SIMDE_FLOAT64_C( 548.19), SIMDE_FLOAT64_C( 708.42),
SIMDE_FLOAT64_C( 455.90), SIMDE_FLOAT64_C( 110.13), SIMDE_FLOAT64_C( 88.56), SIMDE_FLOAT64_C( 499.24) },
{ SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 0.04) } },
{ { SIMDE_FLOAT64_C( 161.32), SIMDE_FLOAT64_C( 442.19), SIMDE_FLOAT64_C( 573.08), SIMDE_FLOAT64_C( 621.10),
SIMDE_FLOAT64_C( 338.32), SIMDE_FLOAT64_C( 172.08), SIMDE_FLOAT64_C( 822.98), SIMDE_FLOAT64_C( 377.05) },
{ SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.05) } },
{ { SIMDE_FLOAT64_C( 191.28), SIMDE_FLOAT64_C( 83.66), SIMDE_FLOAT64_C( 635.57), SIMDE_FLOAT64_C( 327.28),
SIMDE_FLOAT64_C( 205.63), SIMDE_FLOAT64_C( 572.53), SIMDE_FLOAT64_C( 660.94), SIMDE_FLOAT64_C( 815.49) },
{ SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.06),
SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.04) } },
{ { SIMDE_FLOAT64_C( 409.67), SIMDE_FLOAT64_C( 33.63), SIMDE_FLOAT64_C( 365.30), SIMDE_FLOAT64_C( 812.24),
SIMDE_FLOAT64_C( 994.43), SIMDE_FLOAT64_C( 855.19), SIMDE_FLOAT64_C( 697.89), SIMDE_FLOAT64_C( 869.96) },
{ SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.03) } },
{ { SIMDE_FLOAT64_C( 267.11), SIMDE_FLOAT64_C( 246.07), SIMDE_FLOAT64_C( 578.38), SIMDE_FLOAT64_C( 723.01),
SIMDE_FLOAT64_C( 356.21), SIMDE_FLOAT64_C( 666.94), SIMDE_FLOAT64_C( 222.25), SIMDE_FLOAT64_C( 517.53) },
{ SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.04) } },
{ { SIMDE_FLOAT64_C( 109.13), SIMDE_FLOAT64_C( 795.33), SIMDE_FLOAT64_C( 138.62), SIMDE_FLOAT64_C( 447.45),
SIMDE_FLOAT64_C( 967.41), SIMDE_FLOAT64_C( 961.61), SIMDE_FLOAT64_C( 824.50), SIMDE_FLOAT64_C( 158.69) },
{ SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.05),
SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.08) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_invsqrt_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_invsqrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 784.96), SIMDE_FLOAT64_C( 815.29), SIMDE_FLOAT64_C( 578.00), SIMDE_FLOAT64_C( 693.34),
SIMDE_FLOAT64_C( 899.84), SIMDE_FLOAT64_C( 476.45), SIMDE_FLOAT64_C( 558.50), SIMDE_FLOAT64_C( 745.07) },
UINT8_C( 77),
{ SIMDE_FLOAT64_C( 864.69), SIMDE_FLOAT64_C( 953.84), SIMDE_FLOAT64_C( 134.83), SIMDE_FLOAT64_C( 167.75),
SIMDE_FLOAT64_C( 474.65), SIMDE_FLOAT64_C( 536.52), SIMDE_FLOAT64_C( 563.54), SIMDE_FLOAT64_C( 963.69) },
{ SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 815.29), SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 0.08),
SIMDE_FLOAT64_C( 899.84), SIMDE_FLOAT64_C( 476.45), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 745.07) } },
{ { SIMDE_FLOAT64_C( 410.86), SIMDE_FLOAT64_C( 470.77), SIMDE_FLOAT64_C( 329.50), SIMDE_FLOAT64_C( 65.82),
SIMDE_FLOAT64_C( 510.47), SIMDE_FLOAT64_C( 748.64), SIMDE_FLOAT64_C( 130.13), SIMDE_FLOAT64_C( 819.32) },
UINT8_C(180),
{ SIMDE_FLOAT64_C( 969.69), SIMDE_FLOAT64_C( 176.66), SIMDE_FLOAT64_C( 270.39), SIMDE_FLOAT64_C( 73.35),
SIMDE_FLOAT64_C( 618.94), SIMDE_FLOAT64_C( 55.36), SIMDE_FLOAT64_C( 888.64), SIMDE_FLOAT64_C( 196.94) },
{ SIMDE_FLOAT64_C( 410.86), SIMDE_FLOAT64_C( 470.77), SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 65.82),
SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( 130.13), SIMDE_FLOAT64_C( 0.07) } },
{ { SIMDE_FLOAT64_C( 748.70), SIMDE_FLOAT64_C( 788.48), SIMDE_FLOAT64_C( 673.39), SIMDE_FLOAT64_C( 307.20),
SIMDE_FLOAT64_C( 533.54), SIMDE_FLOAT64_C( 118.92), SIMDE_FLOAT64_C( 171.90), SIMDE_FLOAT64_C( 487.39) },
UINT8_C( 67),
{ SIMDE_FLOAT64_C( 339.65), SIMDE_FLOAT64_C( 962.04), SIMDE_FLOAT64_C( 790.27), SIMDE_FLOAT64_C( 903.19),
SIMDE_FLOAT64_C( 925.73), SIMDE_FLOAT64_C( 201.14), SIMDE_FLOAT64_C( 373.95), SIMDE_FLOAT64_C( 255.23) },
{ SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 673.39), SIMDE_FLOAT64_C( 307.20),
SIMDE_FLOAT64_C( 533.54), SIMDE_FLOAT64_C( 118.92), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 487.39) } },
{ { SIMDE_FLOAT64_C( 266.96), SIMDE_FLOAT64_C( 884.43), SIMDE_FLOAT64_C( 3.88), SIMDE_FLOAT64_C( 397.10),
SIMDE_FLOAT64_C( 703.75), SIMDE_FLOAT64_C( 335.69), SIMDE_FLOAT64_C( 366.79), SIMDE_FLOAT64_C( 880.41) },
UINT8_C(138),
{ SIMDE_FLOAT64_C( 440.13), SIMDE_FLOAT64_C( 499.35), SIMDE_FLOAT64_C( 661.44), SIMDE_FLOAT64_C( 328.77),
SIMDE_FLOAT64_C( 696.29), SIMDE_FLOAT64_C( 410.14), SIMDE_FLOAT64_C( 117.25), SIMDE_FLOAT64_C( 369.69) },
{ SIMDE_FLOAT64_C( 266.96), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 3.88), SIMDE_FLOAT64_C( 0.06),
SIMDE_FLOAT64_C( 703.75), SIMDE_FLOAT64_C( 335.69), SIMDE_FLOAT64_C( 366.79), SIMDE_FLOAT64_C( 0.05) } },
{ { SIMDE_FLOAT64_C( 717.34), SIMDE_FLOAT64_C( 650.79), SIMDE_FLOAT64_C( 488.60), SIMDE_FLOAT64_C( 889.24),
SIMDE_FLOAT64_C( 138.18), SIMDE_FLOAT64_C( 742.35), SIMDE_FLOAT64_C( 228.88), SIMDE_FLOAT64_C( 100.22) },
UINT8_C( 3),
{ SIMDE_FLOAT64_C( 132.07), SIMDE_FLOAT64_C( 25.94), SIMDE_FLOAT64_C( 733.76), SIMDE_FLOAT64_C( 506.02),
SIMDE_FLOAT64_C( 281.17), SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( 390.45), SIMDE_FLOAT64_C( 285.05) },
{ SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( 488.60), SIMDE_FLOAT64_C( 889.24),
SIMDE_FLOAT64_C( 138.18), SIMDE_FLOAT64_C( 742.35), SIMDE_FLOAT64_C( 228.88), SIMDE_FLOAT64_C( 100.22) } },
{ { SIMDE_FLOAT64_C( 397.82), SIMDE_FLOAT64_C( 94.20), SIMDE_FLOAT64_C( 620.74), SIMDE_FLOAT64_C( 764.60),
SIMDE_FLOAT64_C( 974.61), SIMDE_FLOAT64_C( 226.82), SIMDE_FLOAT64_C( 204.74), SIMDE_FLOAT64_C( 473.96) },
UINT8_C(205),
{ SIMDE_FLOAT64_C( 533.51), SIMDE_FLOAT64_C( 170.26), SIMDE_FLOAT64_C( 298.40), SIMDE_FLOAT64_C( 650.76),
SIMDE_FLOAT64_C( 539.94), SIMDE_FLOAT64_C( 15.74), SIMDE_FLOAT64_C( 301.54), SIMDE_FLOAT64_C( 28.54) },
{ SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 94.20), SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( 974.61), SIMDE_FLOAT64_C( 226.82), SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 0.19) } },
{ { SIMDE_FLOAT64_C( 904.98), SIMDE_FLOAT64_C( 439.72), SIMDE_FLOAT64_C( 770.90), SIMDE_FLOAT64_C( 133.86),
SIMDE_FLOAT64_C( 539.94), SIMDE_FLOAT64_C( 303.52), SIMDE_FLOAT64_C( 265.93), SIMDE_FLOAT64_C( 565.88) },
UINT8_C( 41),
{ SIMDE_FLOAT64_C( 771.96), SIMDE_FLOAT64_C( 847.05), SIMDE_FLOAT64_C( 38.01), SIMDE_FLOAT64_C( 162.41),
SIMDE_FLOAT64_C( 132.10), SIMDE_FLOAT64_C( 435.83), SIMDE_FLOAT64_C( 256.61), SIMDE_FLOAT64_C( 752.84) },
{ SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 439.72), SIMDE_FLOAT64_C( 770.90), SIMDE_FLOAT64_C( 0.08),
SIMDE_FLOAT64_C( 539.94), SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 265.93), SIMDE_FLOAT64_C( 565.88) } },
{ { SIMDE_FLOAT64_C( 200.43), SIMDE_FLOAT64_C( 231.22), SIMDE_FLOAT64_C( 979.66), SIMDE_FLOAT64_C( 405.17),
SIMDE_FLOAT64_C( 705.18), SIMDE_FLOAT64_C( 867.92), SIMDE_FLOAT64_C( 938.68), SIMDE_FLOAT64_C( 875.43) },
UINT8_C( 32),
{ SIMDE_FLOAT64_C( 589.43), SIMDE_FLOAT64_C( 415.38), SIMDE_FLOAT64_C( 182.05), SIMDE_FLOAT64_C( 890.98),
SIMDE_FLOAT64_C( 443.92), SIMDE_FLOAT64_C( 87.03), SIMDE_FLOAT64_C( 330.70), SIMDE_FLOAT64_C( 214.82) },
{ SIMDE_FLOAT64_C( 200.43), SIMDE_FLOAT64_C( 231.22), SIMDE_FLOAT64_C( 979.66), SIMDE_FLOAT64_C( 405.17),
SIMDE_FLOAT64_C( 705.18), SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 938.68), SIMDE_FLOAT64_C( 875.43) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_invsqrt_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_log_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 4068.94), SIMDE_FLOAT32_C( 5195.06), SIMDE_FLOAT32_C( 1228.12), SIMDE_FLOAT32_C( 6733.16)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 8.31), SIMDE_FLOAT32_C( 8.56), SIMDE_FLOAT32_C( 7.11), SIMDE_FLOAT32_C( 8.81)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 7486.55), SIMDE_FLOAT32_C( 8351.20), SIMDE_FLOAT32_C( 3512.77), SIMDE_FLOAT32_C( 5170.29)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 8.92), SIMDE_FLOAT32_C( 9.03), SIMDE_FLOAT32_C( 8.16), SIMDE_FLOAT32_C( 8.55)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 9127.65), SIMDE_FLOAT32_C( 7111.03), SIMDE_FLOAT32_C( 3652.77), SIMDE_FLOAT32_C( 7338.80)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 9.12), SIMDE_FLOAT32_C( 8.87), SIMDE_FLOAT32_C( 8.20), SIMDE_FLOAT32_C( 8.90)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 1609.14), SIMDE_FLOAT32_C( 1569.36), SIMDE_FLOAT32_C( 5423.87), SIMDE_FLOAT32_C( 7857.29)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 7.38), SIMDE_FLOAT32_C( 7.36), SIMDE_FLOAT32_C( 8.60), SIMDE_FLOAT32_C( 8.97)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 3474.63), SIMDE_FLOAT32_C( 695.25), SIMDE_FLOAT32_C( 2912.29), SIMDE_FLOAT32_C( 8484.34)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 8.15), SIMDE_FLOAT32_C( 6.54), SIMDE_FLOAT32_C( 7.98), SIMDE_FLOAT32_C( 9.05)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 2775.95), SIMDE_FLOAT32_C( 5142.35), SIMDE_FLOAT32_C( 3079.83), SIMDE_FLOAT32_C( 381.82)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 7.93), SIMDE_FLOAT32_C( 8.55), SIMDE_FLOAT32_C( 8.03), SIMDE_FLOAT32_C( 5.94)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 6306.54), SIMDE_FLOAT32_C( 3937.29), SIMDE_FLOAT32_C( 117.23), SIMDE_FLOAT32_C( 1696.00)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 8.75), SIMDE_FLOAT32_C( 8.28), SIMDE_FLOAT32_C( 4.76), SIMDE_FLOAT32_C( 7.44)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 5890.98), SIMDE_FLOAT32_C( 2746.67), SIMDE_FLOAT32_C( 6166.85), SIMDE_FLOAT32_C( 8435.45)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 8.68), SIMDE_FLOAT32_C( 7.92), SIMDE_FLOAT32_C( 8.73), SIMDE_FLOAT32_C( 9.04)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_log_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_log_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 1228.12), SIMDE_FLOAT64_C( 6733.16)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 7.11), SIMDE_FLOAT64_C( 8.81)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 4068.94), SIMDE_FLOAT64_C( 5195.06)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 8.31), SIMDE_FLOAT64_C( 8.56)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 3512.77), SIMDE_FLOAT64_C( 5170.29)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 8.16), SIMDE_FLOAT64_C( 8.55)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 7486.55), SIMDE_FLOAT64_C( 8351.20)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 8.92), SIMDE_FLOAT64_C( 9.03)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 3652.77), SIMDE_FLOAT64_C( 7338.80)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 8.20), SIMDE_FLOAT64_C( 8.90)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 9127.65), SIMDE_FLOAT64_C( 7111.03)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 9.12), SIMDE_FLOAT64_C( 8.87)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 5423.87), SIMDE_FLOAT64_C( 7857.29)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 8.60), SIMDE_FLOAT64_C( 8.97)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 1609.14), SIMDE_FLOAT64_C( 1569.36)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 7.38), SIMDE_FLOAT64_C( 7.36)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_log_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_log_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 7486.55), SIMDE_FLOAT32_C( 8351.20),
SIMDE_FLOAT32_C( 3512.77), SIMDE_FLOAT32_C( 5170.29),
SIMDE_FLOAT32_C( 4068.94), SIMDE_FLOAT32_C( 5195.06),
SIMDE_FLOAT32_C( 1228.12), SIMDE_FLOAT32_C( 6733.16)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 8.92), SIMDE_FLOAT32_C( 9.03),
SIMDE_FLOAT32_C( 8.16), SIMDE_FLOAT32_C( 8.55),
SIMDE_FLOAT32_C( 8.31), SIMDE_FLOAT32_C( 8.56),
SIMDE_FLOAT32_C( 7.11), SIMDE_FLOAT32_C( 8.81)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 1609.14), SIMDE_FLOAT32_C( 1569.36),
SIMDE_FLOAT32_C( 5423.87), SIMDE_FLOAT32_C( 7857.29),
SIMDE_FLOAT32_C( 9127.65), SIMDE_FLOAT32_C( 7111.03),
SIMDE_FLOAT32_C( 3652.77), SIMDE_FLOAT32_C( 7338.80)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 7.38), SIMDE_FLOAT32_C( 7.36),
SIMDE_FLOAT32_C( 8.60), SIMDE_FLOAT32_C( 8.97),
SIMDE_FLOAT32_C( 9.12), SIMDE_FLOAT32_C( 8.87),
SIMDE_FLOAT32_C( 8.20), SIMDE_FLOAT32_C( 8.90)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 2775.95), SIMDE_FLOAT32_C( 5142.35),
SIMDE_FLOAT32_C( 3079.83), SIMDE_FLOAT32_C( 381.82),
SIMDE_FLOAT32_C( 3474.63), SIMDE_FLOAT32_C( 695.25),
SIMDE_FLOAT32_C( 2912.29), SIMDE_FLOAT32_C( 8484.34)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 7.93), SIMDE_FLOAT32_C( 8.55),
SIMDE_FLOAT32_C( 8.03), SIMDE_FLOAT32_C( 5.94),
SIMDE_FLOAT32_C( 8.15), SIMDE_FLOAT32_C( 6.54),
SIMDE_FLOAT32_C( 7.98), SIMDE_FLOAT32_C( 9.05)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 5890.98), SIMDE_FLOAT32_C( 2746.67),
SIMDE_FLOAT32_C( 6166.85), SIMDE_FLOAT32_C( 8435.45),
SIMDE_FLOAT32_C( 6306.54), SIMDE_FLOAT32_C( 3937.29),
SIMDE_FLOAT32_C( 117.23), SIMDE_FLOAT32_C( 1696.00)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 8.68), SIMDE_FLOAT32_C( 7.92),
SIMDE_FLOAT32_C( 8.73), SIMDE_FLOAT32_C( 9.04),
SIMDE_FLOAT32_C( 8.75), SIMDE_FLOAT32_C( 8.28),
SIMDE_FLOAT32_C( 4.76), SIMDE_FLOAT32_C( 7.44)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 1148.23), SIMDE_FLOAT32_C( 7217.40),
SIMDE_FLOAT32_C( 2082.02), SIMDE_FLOAT32_C( 6902.28),
SIMDE_FLOAT32_C( 1146.40), SIMDE_FLOAT32_C( 9969.51),
SIMDE_FLOAT32_C( 5140.40), SIMDE_FLOAT32_C( 9206.03)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 7.05), SIMDE_FLOAT32_C( 8.88),
SIMDE_FLOAT32_C( 7.64), SIMDE_FLOAT32_C( 8.84),
SIMDE_FLOAT32_C( 7.04), SIMDE_FLOAT32_C( 9.21),
SIMDE_FLOAT32_C( 8.54), SIMDE_FLOAT32_C( 9.13)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 3060.52), SIMDE_FLOAT32_C( 6979.60),
SIMDE_FLOAT32_C( 8279.36), SIMDE_FLOAT32_C( 6696.04),
SIMDE_FLOAT32_C( 7661.76), SIMDE_FLOAT32_C( 3680.04),
SIMDE_FLOAT32_C( 8903.22), SIMDE_FLOAT32_C( 4846.05)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 8.03), SIMDE_FLOAT32_C( 8.85),
SIMDE_FLOAT32_C( 9.02), SIMDE_FLOAT32_C( 8.81),
SIMDE_FLOAT32_C( 8.94), SIMDE_FLOAT32_C( 8.21),
SIMDE_FLOAT32_C( 9.09), SIMDE_FLOAT32_C( 8.49)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 3981.75), SIMDE_FLOAT32_C( 4596.36),
SIMDE_FLOAT32_C( 6683.64), SIMDE_FLOAT32_C( 276.11),
SIMDE_FLOAT32_C( 1262.07), SIMDE_FLOAT32_C( 1163.84),
SIMDE_FLOAT32_C( 2229.06), SIMDE_FLOAT32_C( 6994.08)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 8.29), SIMDE_FLOAT32_C( 8.43),
SIMDE_FLOAT32_C( 8.81), SIMDE_FLOAT32_C( 5.62),
SIMDE_FLOAT32_C( 7.14), SIMDE_FLOAT32_C( 7.06),
SIMDE_FLOAT32_C( 7.71), SIMDE_FLOAT32_C( 8.85)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 7348.31), SIMDE_FLOAT32_C( 8400.08),
SIMDE_FLOAT32_C( 4256.55), SIMDE_FLOAT32_C( 9093.31),
SIMDE_FLOAT32_C( 9550.14), SIMDE_FLOAT32_C( 8002.34),
SIMDE_FLOAT32_C( 8956.15), SIMDE_FLOAT32_C( 6271.53)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 8.90), SIMDE_FLOAT32_C( 9.04),
SIMDE_FLOAT32_C( 8.36), SIMDE_FLOAT32_C( 9.12),
SIMDE_FLOAT32_C( 9.16), SIMDE_FLOAT32_C( 8.99),
SIMDE_FLOAT32_C( 9.10), SIMDE_FLOAT32_C( 8.74)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_log_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_log_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 4068.94), SIMDE_FLOAT64_C( 5195.06),
SIMDE_FLOAT64_C( 1228.12), SIMDE_FLOAT64_C( 6733.16)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 8.31), SIMDE_FLOAT64_C( 8.56),
SIMDE_FLOAT64_C( 7.11), SIMDE_FLOAT64_C( 8.81)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 7486.55), SIMDE_FLOAT64_C( 8351.20),
SIMDE_FLOAT64_C( 3512.77), SIMDE_FLOAT64_C( 5170.29)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 8.92), SIMDE_FLOAT64_C( 9.03),
SIMDE_FLOAT64_C( 8.16), SIMDE_FLOAT64_C( 8.55)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 9127.65), SIMDE_FLOAT64_C( 7111.03),
SIMDE_FLOAT64_C( 3652.77), SIMDE_FLOAT64_C( 7338.80)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 9.12), SIMDE_FLOAT64_C( 8.87),
SIMDE_FLOAT64_C( 8.20), SIMDE_FLOAT64_C( 8.90)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 1609.14), SIMDE_FLOAT64_C( 1569.36),
SIMDE_FLOAT64_C( 5423.87), SIMDE_FLOAT64_C( 7857.29)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 7.38), SIMDE_FLOAT64_C( 7.36),
SIMDE_FLOAT64_C( 8.60), SIMDE_FLOAT64_C( 8.97)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 3474.63), SIMDE_FLOAT64_C( 695.25),
SIMDE_FLOAT64_C( 2912.29), SIMDE_FLOAT64_C( 8484.34)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 8.15), SIMDE_FLOAT64_C( 6.54),
SIMDE_FLOAT64_C( 7.98), SIMDE_FLOAT64_C( 9.05)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 2775.95), SIMDE_FLOAT64_C( 5142.35),
SIMDE_FLOAT64_C( 3079.83), SIMDE_FLOAT64_C( 381.82)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 7.93), SIMDE_FLOAT64_C( 8.55),
SIMDE_FLOAT64_C( 8.03), SIMDE_FLOAT64_C( 5.94)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 6306.54), SIMDE_FLOAT64_C( 3937.29),
SIMDE_FLOAT64_C( 117.23), SIMDE_FLOAT64_C( 1696.00)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 8.75), SIMDE_FLOAT64_C( 8.28),
SIMDE_FLOAT64_C( 4.76), SIMDE_FLOAT64_C( 7.44)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 5890.98), SIMDE_FLOAT64_C( 2746.67),
SIMDE_FLOAT64_C( 6166.85), SIMDE_FLOAT64_C( 8435.45)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 8.68), SIMDE_FLOAT64_C( 7.92),
SIMDE_FLOAT64_C( 8.73), SIMDE_FLOAT64_C( 9.04)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_log_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_log_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 1609.14), SIMDE_FLOAT32_C( 1569.36), SIMDE_FLOAT32_C( 5423.87), SIMDE_FLOAT32_C( 7857.29),
SIMDE_FLOAT32_C( 9127.65), SIMDE_FLOAT32_C( 7111.03), SIMDE_FLOAT32_C( 3652.77), SIMDE_FLOAT32_C( 7338.80),
SIMDE_FLOAT32_C( 7486.55), SIMDE_FLOAT32_C( 8351.20), SIMDE_FLOAT32_C( 3512.77), SIMDE_FLOAT32_C( 5170.29),
SIMDE_FLOAT32_C( 4068.94), SIMDE_FLOAT32_C( 5195.06), SIMDE_FLOAT32_C( 1228.12), SIMDE_FLOAT32_C( 6733.16)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.38), SIMDE_FLOAT32_C( 7.36), SIMDE_FLOAT32_C( 8.60), SIMDE_FLOAT32_C( 8.97),
SIMDE_FLOAT32_C( 9.12), SIMDE_FLOAT32_C( 8.87), SIMDE_FLOAT32_C( 8.20), SIMDE_FLOAT32_C( 8.90),
SIMDE_FLOAT32_C( 8.92), SIMDE_FLOAT32_C( 9.03), SIMDE_FLOAT32_C( 8.16), SIMDE_FLOAT32_C( 8.55),
SIMDE_FLOAT32_C( 8.31), SIMDE_FLOAT32_C( 8.56), SIMDE_FLOAT32_C( 7.11), SIMDE_FLOAT32_C( 8.81)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 5890.98), SIMDE_FLOAT32_C( 2746.67), SIMDE_FLOAT32_C( 6166.85), SIMDE_FLOAT32_C( 8435.45),
SIMDE_FLOAT32_C( 6306.54), SIMDE_FLOAT32_C( 3937.29), SIMDE_FLOAT32_C( 117.23), SIMDE_FLOAT32_C( 1696.00),
SIMDE_FLOAT32_C( 2775.95), SIMDE_FLOAT32_C( 5142.35), SIMDE_FLOAT32_C( 3079.83), SIMDE_FLOAT32_C( 381.82),
SIMDE_FLOAT32_C( 3474.63), SIMDE_FLOAT32_C( 695.25), SIMDE_FLOAT32_C( 2912.29), SIMDE_FLOAT32_C( 8484.34)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 8.68), SIMDE_FLOAT32_C( 7.92), SIMDE_FLOAT32_C( 8.73), SIMDE_FLOAT32_C( 9.04),
SIMDE_FLOAT32_C( 8.75), SIMDE_FLOAT32_C( 8.28), SIMDE_FLOAT32_C( 4.76), SIMDE_FLOAT32_C( 7.44),
SIMDE_FLOAT32_C( 7.93), SIMDE_FLOAT32_C( 8.55), SIMDE_FLOAT32_C( 8.03), SIMDE_FLOAT32_C( 5.94),
SIMDE_FLOAT32_C( 8.15), SIMDE_FLOAT32_C( 6.54), SIMDE_FLOAT32_C( 7.98), SIMDE_FLOAT32_C( 9.05)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 3060.52), SIMDE_FLOAT32_C( 6979.60), SIMDE_FLOAT32_C( 8279.36), SIMDE_FLOAT32_C( 6696.04),
SIMDE_FLOAT32_C( 7661.76), SIMDE_FLOAT32_C( 3680.04), SIMDE_FLOAT32_C( 8903.22), SIMDE_FLOAT32_C( 4846.05),
SIMDE_FLOAT32_C( 1148.23), SIMDE_FLOAT32_C( 7217.40), SIMDE_FLOAT32_C( 2082.02), SIMDE_FLOAT32_C( 6902.28),
SIMDE_FLOAT32_C( 1146.40), SIMDE_FLOAT32_C( 9969.51), SIMDE_FLOAT32_C( 5140.40), SIMDE_FLOAT32_C( 9206.03)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 8.03), SIMDE_FLOAT32_C( 8.85), SIMDE_FLOAT32_C( 9.02), SIMDE_FLOAT32_C( 8.81),
SIMDE_FLOAT32_C( 8.94), SIMDE_FLOAT32_C( 8.21), SIMDE_FLOAT32_C( 9.09), SIMDE_FLOAT32_C( 8.49),
SIMDE_FLOAT32_C( 7.05), SIMDE_FLOAT32_C( 8.88), SIMDE_FLOAT32_C( 7.64), SIMDE_FLOAT32_C( 8.84),
SIMDE_FLOAT32_C( 7.04), SIMDE_FLOAT32_C( 9.21), SIMDE_FLOAT32_C( 8.54), SIMDE_FLOAT32_C( 9.13)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 7348.31), SIMDE_FLOAT32_C( 8400.08), SIMDE_FLOAT32_C( 4256.55), SIMDE_FLOAT32_C( 9093.31),
SIMDE_FLOAT32_C( 9550.14), SIMDE_FLOAT32_C( 8002.34), SIMDE_FLOAT32_C( 8956.15), SIMDE_FLOAT32_C( 6271.53),
SIMDE_FLOAT32_C( 3981.75), SIMDE_FLOAT32_C( 4596.36), SIMDE_FLOAT32_C( 6683.64), SIMDE_FLOAT32_C( 276.11),
SIMDE_FLOAT32_C( 1262.07), SIMDE_FLOAT32_C( 1163.84), SIMDE_FLOAT32_C( 2229.06), SIMDE_FLOAT32_C( 6994.08)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 8.90), SIMDE_FLOAT32_C( 9.04), SIMDE_FLOAT32_C( 8.36), SIMDE_FLOAT32_C( 9.12),
SIMDE_FLOAT32_C( 9.16), SIMDE_FLOAT32_C( 8.99), SIMDE_FLOAT32_C( 9.10), SIMDE_FLOAT32_C( 8.74),
SIMDE_FLOAT32_C( 8.29), SIMDE_FLOAT32_C( 8.43), SIMDE_FLOAT32_C( 8.81), SIMDE_FLOAT32_C( 5.62),
SIMDE_FLOAT32_C( 7.14), SIMDE_FLOAT32_C( 7.06), SIMDE_FLOAT32_C( 7.71), SIMDE_FLOAT32_C( 8.85)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 4105.04), SIMDE_FLOAT32_C( 8793.93), SIMDE_FLOAT32_C( 6623.12), SIMDE_FLOAT32_C( 6717.40),
SIMDE_FLOAT32_C( 628.43), SIMDE_FLOAT32_C( 1010.42), SIMDE_FLOAT32_C( 3357.32), SIMDE_FLOAT32_C( 2370.85),
SIMDE_FLOAT32_C( 4038.44), SIMDE_FLOAT32_C( 886.73), SIMDE_FLOAT32_C( 7806.81), SIMDE_FLOAT32_C( 8278.35),
SIMDE_FLOAT32_C( 4645.43), SIMDE_FLOAT32_C( 7716.73), SIMDE_FLOAT32_C( 5603.27), SIMDE_FLOAT32_C( 4142.45)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 8.32), SIMDE_FLOAT32_C( 9.08), SIMDE_FLOAT32_C( 8.80), SIMDE_FLOAT32_C( 8.81),
SIMDE_FLOAT32_C( 6.44), SIMDE_FLOAT32_C( 6.92), SIMDE_FLOAT32_C( 8.12), SIMDE_FLOAT32_C( 7.77),
SIMDE_FLOAT32_C( 8.30), SIMDE_FLOAT32_C( 6.79), SIMDE_FLOAT32_C( 8.96), SIMDE_FLOAT32_C( 9.02),
SIMDE_FLOAT32_C( 8.44), SIMDE_FLOAT32_C( 8.95), SIMDE_FLOAT32_C( 8.63), SIMDE_FLOAT32_C( 8.33)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 8450.59), SIMDE_FLOAT32_C( 9203.26), SIMDE_FLOAT32_C( 4894.53), SIMDE_FLOAT32_C( 2042.18),
SIMDE_FLOAT32_C( 2755.53), SIMDE_FLOAT32_C( 8657.47), SIMDE_FLOAT32_C( 7528.93), SIMDE_FLOAT32_C( 8118.50),
SIMDE_FLOAT32_C( 9155.11), SIMDE_FLOAT32_C( 5703.37), SIMDE_FLOAT32_C( 9886.80), SIMDE_FLOAT32_C( 469.19),
SIMDE_FLOAT32_C( 6656.71), SIMDE_FLOAT32_C( 5499.67), SIMDE_FLOAT32_C( 7314.76), SIMDE_FLOAT32_C( 1309.05)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 9.04), SIMDE_FLOAT32_C( 9.13), SIMDE_FLOAT32_C( 8.50), SIMDE_FLOAT32_C( 7.62),
SIMDE_FLOAT32_C( 7.92), SIMDE_FLOAT32_C( 9.07), SIMDE_FLOAT32_C( 8.93), SIMDE_FLOAT32_C( 9.00),
SIMDE_FLOAT32_C( 9.12), SIMDE_FLOAT32_C( 8.65), SIMDE_FLOAT32_C( 9.20), SIMDE_FLOAT32_C( 6.15),
SIMDE_FLOAT32_C( 8.80), SIMDE_FLOAT32_C( 8.61), SIMDE_FLOAT32_C( 8.90), SIMDE_FLOAT32_C( 7.18)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 1154.54), SIMDE_FLOAT32_C( 9110.29), SIMDE_FLOAT32_C( 2130.97), SIMDE_FLOAT32_C( 11.83),
SIMDE_FLOAT32_C( 3312.02), SIMDE_FLOAT32_C( 9618.20), SIMDE_FLOAT32_C( 6468.19), SIMDE_FLOAT32_C( 1159.42),
SIMDE_FLOAT32_C( 2118.90), SIMDE_FLOAT32_C( 4661.80), SIMDE_FLOAT32_C( 8551.88), SIMDE_FLOAT32_C( 9887.44),
SIMDE_FLOAT32_C( 1217.92), SIMDE_FLOAT32_C( 7124.06), SIMDE_FLOAT32_C( 5136.26), SIMDE_FLOAT32_C( 4524.23)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.05), SIMDE_FLOAT32_C( 9.12), SIMDE_FLOAT32_C( 7.66), SIMDE_FLOAT32_C( 2.47),
SIMDE_FLOAT32_C( 8.11), SIMDE_FLOAT32_C( 9.17), SIMDE_FLOAT32_C( 8.77), SIMDE_FLOAT32_C( 7.06),
SIMDE_FLOAT32_C( 7.66), SIMDE_FLOAT32_C( 8.45), SIMDE_FLOAT32_C( 9.05), SIMDE_FLOAT32_C( 9.20),
SIMDE_FLOAT32_C( 7.10), SIMDE_FLOAT32_C( 8.87), SIMDE_FLOAT32_C( 8.54), SIMDE_FLOAT32_C( 8.42)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2809.03), SIMDE_FLOAT32_C( 3201.22), SIMDE_FLOAT32_C( 1237.85), SIMDE_FLOAT32_C( 4831.67),
SIMDE_FLOAT32_C( 9663.28), SIMDE_FLOAT32_C( 5036.36), SIMDE_FLOAT32_C( 3363.90), SIMDE_FLOAT32_C( 4374.02),
SIMDE_FLOAT32_C( 4087.77), SIMDE_FLOAT32_C( 5199.67), SIMDE_FLOAT32_C( 7554.25), SIMDE_FLOAT32_C( 6973.34),
SIMDE_FLOAT32_C( 5071.68), SIMDE_FLOAT32_C( 3476.37), SIMDE_FLOAT32_C( 9581.30), SIMDE_FLOAT32_C( 1516.57)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.94), SIMDE_FLOAT32_C( 8.07), SIMDE_FLOAT32_C( 7.12), SIMDE_FLOAT32_C( 8.48),
SIMDE_FLOAT32_C( 9.18), SIMDE_FLOAT32_C( 8.52), SIMDE_FLOAT32_C( 8.12), SIMDE_FLOAT32_C( 8.38),
SIMDE_FLOAT32_C( 8.32), SIMDE_FLOAT32_C( 8.56), SIMDE_FLOAT32_C( 8.93), SIMDE_FLOAT32_C( 8.85),
SIMDE_FLOAT32_C( 8.53), SIMDE_FLOAT32_C( 8.15), SIMDE_FLOAT32_C( 9.17), SIMDE_FLOAT32_C( 7.32)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_log_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_log_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2746.67), SIMDE_FLOAT32_C( 8435.45), SIMDE_FLOAT32_C( 3937.29), SIMDE_FLOAT32_C( 1696.00),
SIMDE_FLOAT32_C( 5142.35), SIMDE_FLOAT32_C( 381.82), SIMDE_FLOAT32_C( 695.25), SIMDE_FLOAT32_C( 8484.34),
SIMDE_FLOAT32_C( 1569.36), SIMDE_FLOAT32_C( 7857.29), SIMDE_FLOAT32_C( 7111.03), SIMDE_FLOAT32_C( 7338.80),
SIMDE_FLOAT32_C( 8351.20), SIMDE_FLOAT32_C( 5170.29), SIMDE_FLOAT32_C( 5195.06), SIMDE_FLOAT32_C( 6733.16)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5890.98), SIMDE_FLOAT32_C( 6166.85), SIMDE_FLOAT32_C( 6306.54), SIMDE_FLOAT32_C( 117.23),
SIMDE_FLOAT32_C( 2775.95), SIMDE_FLOAT32_C( 3079.83), SIMDE_FLOAT32_C( 3474.63), SIMDE_FLOAT32_C( 2912.29),
SIMDE_FLOAT32_C( 1609.14), SIMDE_FLOAT32_C( 5423.87), SIMDE_FLOAT32_C( 9127.65), SIMDE_FLOAT32_C( 3652.77),
SIMDE_FLOAT32_C( 7486.55), SIMDE_FLOAT32_C( 3512.77), SIMDE_FLOAT32_C( 4068.94), SIMDE_FLOAT32_C( 1228.12)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 8.68), SIMDE_FLOAT32_C( 8435.45), SIMDE_FLOAT32_C( 8.75), SIMDE_FLOAT32_C( 1696.00),
SIMDE_FLOAT32_C( 5142.35), SIMDE_FLOAT32_C( 381.82), SIMDE_FLOAT32_C( 695.25), SIMDE_FLOAT32_C( 7.98),
SIMDE_FLOAT32_C( 7.38), SIMDE_FLOAT32_C( 8.60), SIMDE_FLOAT32_C( 9.12), SIMDE_FLOAT32_C( 8.20),
SIMDE_FLOAT32_C( 8.92), SIMDE_FLOAT32_C( 5170.29), SIMDE_FLOAT32_C( 8.31), SIMDE_FLOAT32_C( 6733.16)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 7348.31), SIMDE_FLOAT32_C( 4256.55), SIMDE_FLOAT32_C( 9550.14), SIMDE_FLOAT32_C( 8956.15),
SIMDE_FLOAT32_C( 3981.75), SIMDE_FLOAT32_C( 6683.64), SIMDE_FLOAT32_C( 1262.07), SIMDE_FLOAT32_C( 2229.06),
SIMDE_FLOAT32_C( 3060.52), SIMDE_FLOAT32_C( 8279.36), SIMDE_FLOAT32_C( 7661.76), SIMDE_FLOAT32_C( 8903.22),
SIMDE_FLOAT32_C( 1148.23), SIMDE_FLOAT32_C( 2082.02), SIMDE_FLOAT32_C( 1146.40), SIMDE_FLOAT32_C( 5140.40)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4142.45), SIMDE_FLOAT32_C( 8400.08), SIMDE_FLOAT32_C( 9093.31), SIMDE_FLOAT32_C( 8002.34),
SIMDE_FLOAT32_C( 6271.53), SIMDE_FLOAT32_C( 4596.36), SIMDE_FLOAT32_C( 276.11), SIMDE_FLOAT32_C( 1163.84),
SIMDE_FLOAT32_C( 6994.08), SIMDE_FLOAT32_C( 6979.60), SIMDE_FLOAT32_C( 6696.04), SIMDE_FLOAT32_C( 3680.04),
SIMDE_FLOAT32_C( 4846.05), SIMDE_FLOAT32_C( 7217.40), SIMDE_FLOAT32_C( 6902.28), SIMDE_FLOAT32_C( 9969.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 8.33), SIMDE_FLOAT32_C( 4256.55), SIMDE_FLOAT32_C( 9550.14), SIMDE_FLOAT32_C( 8956.15),
SIMDE_FLOAT32_C( 8.74), SIMDE_FLOAT32_C( 8.43), SIMDE_FLOAT32_C( 5.62), SIMDE_FLOAT32_C( 7.06),
SIMDE_FLOAT32_C( 8.85), SIMDE_FLOAT32_C( 8279.36), SIMDE_FLOAT32_C( 8.81), SIMDE_FLOAT32_C( 8.21),
SIMDE_FLOAT32_C( 8.49), SIMDE_FLOAT32_C( 8.88), SIMDE_FLOAT32_C( 1146.40), SIMDE_FLOAT32_C( 9.21)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 4524.23), SIMDE_FLOAT32_C( 9203.26), SIMDE_FLOAT32_C( 2042.18), SIMDE_FLOAT32_C( 8657.47),
SIMDE_FLOAT32_C( 8118.50), SIMDE_FLOAT32_C( 5703.37), SIMDE_FLOAT32_C( 469.19), SIMDE_FLOAT32_C( 5499.67),
SIMDE_FLOAT32_C( 1309.05), SIMDE_FLOAT32_C( 8793.93), SIMDE_FLOAT32_C( 6717.40), SIMDE_FLOAT32_C( 1010.42),
SIMDE_FLOAT32_C( 2370.85), SIMDE_FLOAT32_C( 886.73), SIMDE_FLOAT32_C( 8278.35), SIMDE_FLOAT32_C( 7716.73)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5136.26), SIMDE_FLOAT32_C( 8450.59), SIMDE_FLOAT32_C( 4894.53), SIMDE_FLOAT32_C( 2755.53),
SIMDE_FLOAT32_C( 7528.93), SIMDE_FLOAT32_C( 9155.11), SIMDE_FLOAT32_C( 9886.80), SIMDE_FLOAT32_C( 6656.71),
SIMDE_FLOAT32_C( 7314.76), SIMDE_FLOAT32_C( 4105.04), SIMDE_FLOAT32_C( 6623.12), SIMDE_FLOAT32_C( 628.43),
SIMDE_FLOAT32_C( 3357.32), SIMDE_FLOAT32_C( 4038.44), SIMDE_FLOAT32_C( 7806.81), SIMDE_FLOAT32_C( 4645.43)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4524.23), SIMDE_FLOAT32_C( 9.04), SIMDE_FLOAT32_C( 2042.18), SIMDE_FLOAT32_C( 8657.47),
SIMDE_FLOAT32_C( 8118.50), SIMDE_FLOAT32_C( 5703.37), SIMDE_FLOAT32_C( 469.19), SIMDE_FLOAT32_C( 8.80),
SIMDE_FLOAT32_C( 8.90), SIMDE_FLOAT32_C( 8793.93), SIMDE_FLOAT32_C( 8.80), SIMDE_FLOAT32_C( 1010.42),
SIMDE_FLOAT32_C( 2370.85), SIMDE_FLOAT32_C( 8.30), SIMDE_FLOAT32_C( 8278.35), SIMDE_FLOAT32_C( 7716.73)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 3256.50), SIMDE_FLOAT32_C( 2809.03), SIMDE_FLOAT32_C( 1237.85), SIMDE_FLOAT32_C( 9663.28),
SIMDE_FLOAT32_C( 3363.90), SIMDE_FLOAT32_C( 4087.77), SIMDE_FLOAT32_C( 7554.25), SIMDE_FLOAT32_C( 5071.68),
SIMDE_FLOAT32_C( 9581.30), SIMDE_FLOAT32_C( 1154.54), SIMDE_FLOAT32_C( 2130.97), SIMDE_FLOAT32_C( 3312.02),
SIMDE_FLOAT32_C( 6468.19), SIMDE_FLOAT32_C( 2118.90), SIMDE_FLOAT32_C( 8551.88), SIMDE_FLOAT32_C( 1217.92)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 9486.33), SIMDE_FLOAT32_C( 4010.56), SIMDE_FLOAT32_C( 3201.22), SIMDE_FLOAT32_C( 4831.67),
SIMDE_FLOAT32_C( 5036.36), SIMDE_FLOAT32_C( 4374.02), SIMDE_FLOAT32_C( 5199.67), SIMDE_FLOAT32_C( 6973.34),
SIMDE_FLOAT32_C( 3476.37), SIMDE_FLOAT32_C( 1516.57), SIMDE_FLOAT32_C( 9110.29), SIMDE_FLOAT32_C( 11.83),
SIMDE_FLOAT32_C( 9618.20), SIMDE_FLOAT32_C( 1159.42), SIMDE_FLOAT32_C( 4661.80), SIMDE_FLOAT32_C( 9887.44)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3256.50), SIMDE_FLOAT32_C( 2809.03), SIMDE_FLOAT32_C( 1237.85), SIMDE_FLOAT32_C( 9663.28),
SIMDE_FLOAT32_C( 8.52), SIMDE_FLOAT32_C( 4087.77), SIMDE_FLOAT32_C( 7554.25), SIMDE_FLOAT32_C( 5071.68),
SIMDE_FLOAT32_C( 9581.30), SIMDE_FLOAT32_C( 1154.54), SIMDE_FLOAT32_C( 9.12), SIMDE_FLOAT32_C( 2.47),
SIMDE_FLOAT32_C( 9.17), SIMDE_FLOAT32_C( 2118.90), SIMDE_FLOAT32_C( 8.45), SIMDE_FLOAT32_C( 9.20)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 4921.97), SIMDE_FLOAT32_C( 1314.36), SIMDE_FLOAT32_C( 3425.34), SIMDE_FLOAT32_C( 5889.62),
SIMDE_FLOAT32_C( 6729.66), SIMDE_FLOAT32_C( 9443.57), SIMDE_FLOAT32_C( 9578.53), SIMDE_FLOAT32_C( 5667.58),
SIMDE_FLOAT32_C( 7424.68), SIMDE_FLOAT32_C( 2009.69), SIMDE_FLOAT32_C( 1044.67), SIMDE_FLOAT32_C( 1170.36),
SIMDE_FLOAT32_C( 6106.86), SIMDE_FLOAT32_C( 1058.19), SIMDE_FLOAT32_C( 1124.78), SIMDE_FLOAT32_C( 7203.19)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7482.85), SIMDE_FLOAT32_C( 9575.95), SIMDE_FLOAT32_C( 1407.98), SIMDE_FLOAT32_C( 5799.87),
SIMDE_FLOAT32_C( 694.94), SIMDE_FLOAT32_C( 7133.07), SIMDE_FLOAT32_C( 9660.54), SIMDE_FLOAT32_C( 5551.82),
SIMDE_FLOAT32_C( 9134.21), SIMDE_FLOAT32_C( 4616.24), SIMDE_FLOAT32_C( 6187.92), SIMDE_FLOAT32_C( 3107.51),
SIMDE_FLOAT32_C( 1991.62), SIMDE_FLOAT32_C( 1882.51), SIMDE_FLOAT32_C( 287.66), SIMDE_FLOAT32_C( 7377.56)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4921.97), SIMDE_FLOAT32_C( 9.17), SIMDE_FLOAT32_C( 3425.34), SIMDE_FLOAT32_C( 8.67),
SIMDE_FLOAT32_C( 6729.66), SIMDE_FLOAT32_C( 8.87), SIMDE_FLOAT32_C( 9.18), SIMDE_FLOAT32_C( 8.62),
SIMDE_FLOAT32_C( 7424.68), SIMDE_FLOAT32_C( 2009.69), SIMDE_FLOAT32_C( 1044.67), SIMDE_FLOAT32_C( 1170.36),
SIMDE_FLOAT32_C( 6106.86), SIMDE_FLOAT32_C( 1058.19), SIMDE_FLOAT32_C( 5.66), SIMDE_FLOAT32_C( 7203.19)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 9415.27), SIMDE_FLOAT32_C( 963.59), SIMDE_FLOAT32_C( 4649.74), SIMDE_FLOAT32_C( 1078.30),
SIMDE_FLOAT32_C( 5462.61), SIMDE_FLOAT32_C( 6033.01), SIMDE_FLOAT32_C( 9173.00), SIMDE_FLOAT32_C( 4672.02),
SIMDE_FLOAT32_C( 3569.65), SIMDE_FLOAT32_C( 3935.68), SIMDE_FLOAT32_C( 3408.08), SIMDE_FLOAT32_C( 8917.42),
SIMDE_FLOAT32_C( 1855.90), SIMDE_FLOAT32_C( 7781.74), SIMDE_FLOAT32_C( 7197.17), SIMDE_FLOAT32_C( 7170.16)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.74), SIMDE_FLOAT32_C( 2968.36), SIMDE_FLOAT32_C( 1281.72), SIMDE_FLOAT32_C( 1177.11),
SIMDE_FLOAT32_C( 8949.44), SIMDE_FLOAT32_C( 5024.17), SIMDE_FLOAT32_C( 907.29), SIMDE_FLOAT32_C( 5805.32),
SIMDE_FLOAT32_C( 7896.24), SIMDE_FLOAT32_C( 4941.12), SIMDE_FLOAT32_C( 3457.39), SIMDE_FLOAT32_C( 1402.13),
SIMDE_FLOAT32_C( 6670.00), SIMDE_FLOAT32_C( 6373.56), SIMDE_FLOAT32_C( 415.89), SIMDE_FLOAT32_C( 2550.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 9415.27), SIMDE_FLOAT32_C( 8.00), SIMDE_FLOAT32_C( 7.16), SIMDE_FLOAT32_C( 1078.30),
SIMDE_FLOAT32_C( 9.10), SIMDE_FLOAT32_C( 6033.01), SIMDE_FLOAT32_C( 6.81), SIMDE_FLOAT32_C( 8.67),
SIMDE_FLOAT32_C( 3569.65), SIMDE_FLOAT32_C( 3935.68), SIMDE_FLOAT32_C( 3408.08), SIMDE_FLOAT32_C( 8917.42),
SIMDE_FLOAT32_C( 1855.90), SIMDE_FLOAT32_C( 8.76), SIMDE_FLOAT32_C( 7197.17), SIMDE_FLOAT32_C( 7170.16)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 7648.13), SIMDE_FLOAT32_C( 4875.56), SIMDE_FLOAT32_C( 161.12), SIMDE_FLOAT32_C( 8194.68),
SIMDE_FLOAT32_C( 7254.51), SIMDE_FLOAT32_C( 1142.29), SIMDE_FLOAT32_C( 5528.96), SIMDE_FLOAT32_C( 7950.51),
SIMDE_FLOAT32_C( 5154.57), SIMDE_FLOAT32_C( 8176.75), SIMDE_FLOAT32_C( 4580.00), SIMDE_FLOAT32_C( 5400.22),
SIMDE_FLOAT32_C( 1452.71), SIMDE_FLOAT32_C( 8039.28), SIMDE_FLOAT32_C( 6972.90), SIMDE_FLOAT32_C( 554.46)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5093.74), SIMDE_FLOAT32_C( 9045.23), SIMDE_FLOAT32_C( 5720.26), SIMDE_FLOAT32_C( 2861.39),
SIMDE_FLOAT32_C( 6541.39), SIMDE_FLOAT32_C( 4114.75), SIMDE_FLOAT32_C( 2711.17), SIMDE_FLOAT32_C( 8391.22),
SIMDE_FLOAT32_C( 5330.27), SIMDE_FLOAT32_C( 3661.45), SIMDE_FLOAT32_C( 5586.41), SIMDE_FLOAT32_C( 2116.00),
SIMDE_FLOAT32_C( 4808.04), SIMDE_FLOAT32_C( 3749.32), SIMDE_FLOAT32_C( 4730.38), SIMDE_FLOAT32_C( 5459.69)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7648.13), SIMDE_FLOAT32_C( 4875.56), SIMDE_FLOAT32_C( 161.12), SIMDE_FLOAT32_C( 8194.68),
SIMDE_FLOAT32_C( 7254.51), SIMDE_FLOAT32_C( 1142.29), SIMDE_FLOAT32_C( 7.91), SIMDE_FLOAT32_C( 9.03),
SIMDE_FLOAT32_C( 8.58), SIMDE_FLOAT32_C( 8176.75), SIMDE_FLOAT32_C( 8.63), SIMDE_FLOAT32_C( 7.66),
SIMDE_FLOAT32_C( 8.48), SIMDE_FLOAT32_C( 8039.28), SIMDE_FLOAT32_C( 6972.90), SIMDE_FLOAT32_C( 8.61)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 1058.07), SIMDE_FLOAT32_C( 6652.15), SIMDE_FLOAT32_C( 2532.95), SIMDE_FLOAT32_C( 9113.62),
SIMDE_FLOAT32_C( 9783.41), SIMDE_FLOAT32_C( 9773.08), SIMDE_FLOAT32_C( 9127.47), SIMDE_FLOAT32_C( 918.64),
SIMDE_FLOAT32_C( 3953.30), SIMDE_FLOAT32_C( 333.95), SIMDE_FLOAT32_C( 1356.49), SIMDE_FLOAT32_C( 2899.69),
SIMDE_FLOAT32_C( 5501.59), SIMDE_FLOAT32_C( 5515.77), SIMDE_FLOAT32_C( 7198.84), SIMDE_FLOAT32_C( 3978.34)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 792.83), SIMDE_FLOAT32_C( 4929.19), SIMDE_FLOAT32_C( 9124.38), SIMDE_FLOAT32_C( 8968.13),
SIMDE_FLOAT32_C( 1316.26), SIMDE_FLOAT32_C( 3447.13), SIMDE_FLOAT32_C( 8644.35), SIMDE_FLOAT32_C( 3246.39),
SIMDE_FLOAT32_C( 5304.47), SIMDE_FLOAT32_C( 5549.07), SIMDE_FLOAT32_C( 8579.68), SIMDE_FLOAT32_C( 3747.01),
SIMDE_FLOAT32_C( 9720.69), SIMDE_FLOAT32_C( 6809.26), SIMDE_FLOAT32_C( 4934.63), SIMDE_FLOAT32_C( 9263.02)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1058.07), SIMDE_FLOAT32_C( 6652.15), SIMDE_FLOAT32_C( 9.12), SIMDE_FLOAT32_C( 9.10),
SIMDE_FLOAT32_C( 9783.41), SIMDE_FLOAT32_C( 9773.08), SIMDE_FLOAT32_C( 9127.47), SIMDE_FLOAT32_C( 8.09),
SIMDE_FLOAT32_C( 8.58), SIMDE_FLOAT32_C( 333.95), SIMDE_FLOAT32_C( 9.06), SIMDE_FLOAT32_C( 2899.69),
SIMDE_FLOAT32_C( 9.18), SIMDE_FLOAT32_C( 5515.77), SIMDE_FLOAT32_C( 7198.84), SIMDE_FLOAT32_C( 9.13)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_log_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_log_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 7486.55), SIMDE_FLOAT64_C( 8351.20),
SIMDE_FLOAT64_C( 3512.77), SIMDE_FLOAT64_C( 5170.29),
SIMDE_FLOAT64_C( 4068.94), SIMDE_FLOAT64_C( 5195.06),
SIMDE_FLOAT64_C( 1228.12), SIMDE_FLOAT64_C( 6733.16)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 8.92), SIMDE_FLOAT64_C( 9.03),
SIMDE_FLOAT64_C( 8.16), SIMDE_FLOAT64_C( 8.55),
SIMDE_FLOAT64_C( 8.31), SIMDE_FLOAT64_C( 8.56),
SIMDE_FLOAT64_C( 7.11), SIMDE_FLOAT64_C( 8.81)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1609.14), SIMDE_FLOAT64_C( 1569.36),
SIMDE_FLOAT64_C( 5423.87), SIMDE_FLOAT64_C( 7857.29),
SIMDE_FLOAT64_C( 9127.65), SIMDE_FLOAT64_C( 7111.03),
SIMDE_FLOAT64_C( 3652.77), SIMDE_FLOAT64_C( 7338.80)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7.38), SIMDE_FLOAT64_C( 7.36),
SIMDE_FLOAT64_C( 8.60), SIMDE_FLOAT64_C( 8.97),
SIMDE_FLOAT64_C( 9.12), SIMDE_FLOAT64_C( 8.87),
SIMDE_FLOAT64_C( 8.20), SIMDE_FLOAT64_C( 8.90)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 2775.95), SIMDE_FLOAT64_C( 5142.35),
SIMDE_FLOAT64_C( 3079.83), SIMDE_FLOAT64_C( 381.82),
SIMDE_FLOAT64_C( 3474.63), SIMDE_FLOAT64_C( 695.25),
SIMDE_FLOAT64_C( 2912.29), SIMDE_FLOAT64_C( 8484.34)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7.93), SIMDE_FLOAT64_C( 8.55),
SIMDE_FLOAT64_C( 8.03), SIMDE_FLOAT64_C( 5.94),
SIMDE_FLOAT64_C( 8.15), SIMDE_FLOAT64_C( 6.54),
SIMDE_FLOAT64_C( 7.98), SIMDE_FLOAT64_C( 9.05)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5890.98), SIMDE_FLOAT64_C( 2746.67),
SIMDE_FLOAT64_C( 6166.85), SIMDE_FLOAT64_C( 8435.45),
SIMDE_FLOAT64_C( 6306.54), SIMDE_FLOAT64_C( 3937.29),
SIMDE_FLOAT64_C( 117.23), SIMDE_FLOAT64_C( 1696.00)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 8.68), SIMDE_FLOAT64_C( 7.92),
SIMDE_FLOAT64_C( 8.73), SIMDE_FLOAT64_C( 9.04),
SIMDE_FLOAT64_C( 8.75), SIMDE_FLOAT64_C( 8.28),
SIMDE_FLOAT64_C( 4.76), SIMDE_FLOAT64_C( 7.44)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1148.23), SIMDE_FLOAT64_C( 7217.40),
SIMDE_FLOAT64_C( 2082.02), SIMDE_FLOAT64_C( 6902.28),
SIMDE_FLOAT64_C( 1146.40), SIMDE_FLOAT64_C( 9969.51),
SIMDE_FLOAT64_C( 5140.40), SIMDE_FLOAT64_C( 9206.03)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7.05), SIMDE_FLOAT64_C( 8.88),
SIMDE_FLOAT64_C( 7.64), SIMDE_FLOAT64_C( 8.84),
SIMDE_FLOAT64_C( 7.04), SIMDE_FLOAT64_C( 9.21),
SIMDE_FLOAT64_C( 8.54), SIMDE_FLOAT64_C( 9.13)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3060.52), SIMDE_FLOAT64_C( 6979.60),
SIMDE_FLOAT64_C( 8279.36), SIMDE_FLOAT64_C( 6696.04),
SIMDE_FLOAT64_C( 7661.76), SIMDE_FLOAT64_C( 3680.04),
SIMDE_FLOAT64_C( 8903.22), SIMDE_FLOAT64_C( 4846.05)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 8.03), SIMDE_FLOAT64_C( 8.85),
SIMDE_FLOAT64_C( 9.02), SIMDE_FLOAT64_C( 8.81),
SIMDE_FLOAT64_C( 8.94), SIMDE_FLOAT64_C( 8.21),
SIMDE_FLOAT64_C( 9.09), SIMDE_FLOAT64_C( 8.49)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3981.75), SIMDE_FLOAT64_C( 4596.36),
SIMDE_FLOAT64_C( 6683.64), SIMDE_FLOAT64_C( 276.11),
SIMDE_FLOAT64_C( 1262.07), SIMDE_FLOAT64_C( 1163.84),
SIMDE_FLOAT64_C( 2229.06), SIMDE_FLOAT64_C( 6994.08)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 8.29), SIMDE_FLOAT64_C( 8.43),
SIMDE_FLOAT64_C( 8.81), SIMDE_FLOAT64_C( 5.62),
SIMDE_FLOAT64_C( 7.14), SIMDE_FLOAT64_C( 7.06),
SIMDE_FLOAT64_C( 7.71), SIMDE_FLOAT64_C( 8.85)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 7348.31), SIMDE_FLOAT64_C( 8400.08),
SIMDE_FLOAT64_C( 4256.55), SIMDE_FLOAT64_C( 9093.31),
SIMDE_FLOAT64_C( 9550.14), SIMDE_FLOAT64_C( 8002.34),
SIMDE_FLOAT64_C( 8956.15), SIMDE_FLOAT64_C( 6271.53)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 8.90), SIMDE_FLOAT64_C( 9.04),
SIMDE_FLOAT64_C( 8.36), SIMDE_FLOAT64_C( 9.12),
SIMDE_FLOAT64_C( 9.16), SIMDE_FLOAT64_C( 8.99),
SIMDE_FLOAT64_C( 9.10), SIMDE_FLOAT64_C( 8.74)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_log_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_log_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1569.36), SIMDE_FLOAT64_C( 7857.29),
SIMDE_FLOAT64_C( 7111.03), SIMDE_FLOAT64_C( 7338.80),
SIMDE_FLOAT64_C( 8351.20), SIMDE_FLOAT64_C( 5170.29),
SIMDE_FLOAT64_C( 5195.06), SIMDE_FLOAT64_C( 6733.16)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1609.14), SIMDE_FLOAT64_C( 5423.87),
SIMDE_FLOAT64_C( 9127.65), SIMDE_FLOAT64_C( 3652.77),
SIMDE_FLOAT64_C( 7486.55), SIMDE_FLOAT64_C( 3512.77),
SIMDE_FLOAT64_C( 4068.94), SIMDE_FLOAT64_C( 1228.12)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7.38), SIMDE_FLOAT64_C( 7857.29),
SIMDE_FLOAT64_C( 7111.03), SIMDE_FLOAT64_C( 7338.80),
SIMDE_FLOAT64_C( 8.92), SIMDE_FLOAT64_C( 5170.29),
SIMDE_FLOAT64_C( 8.31), SIMDE_FLOAT64_C( 7.11)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5890.98), SIMDE_FLOAT64_C( 6166.85),
SIMDE_FLOAT64_C( 6306.54), SIMDE_FLOAT64_C( 117.23),
SIMDE_FLOAT64_C( 2775.95), SIMDE_FLOAT64_C( 3079.83),
SIMDE_FLOAT64_C( 3474.63), SIMDE_FLOAT64_C( 2912.29)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 9206.03), SIMDE_FLOAT64_C( 2746.67),
SIMDE_FLOAT64_C( 8435.45), SIMDE_FLOAT64_C( 3937.29),
SIMDE_FLOAT64_C( 1696.00), SIMDE_FLOAT64_C( 5142.35),
SIMDE_FLOAT64_C( 381.82), SIMDE_FLOAT64_C( 695.25)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 9.13), SIMDE_FLOAT64_C( 7.92),
SIMDE_FLOAT64_C( 9.04), SIMDE_FLOAT64_C( 117.23),
SIMDE_FLOAT64_C( 2775.95), SIMDE_FLOAT64_C( 8.55),
SIMDE_FLOAT64_C( 3474.63), SIMDE_FLOAT64_C( 6.54)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 6994.08), SIMDE_FLOAT64_C( 6979.60),
SIMDE_FLOAT64_C( 6696.04), SIMDE_FLOAT64_C( 3680.04),
SIMDE_FLOAT64_C( 4846.05), SIMDE_FLOAT64_C( 7217.40),
SIMDE_FLOAT64_C( 6902.28), SIMDE_FLOAT64_C( 9969.51)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2229.06), SIMDE_FLOAT64_C( 3060.52),
SIMDE_FLOAT64_C( 8279.36), SIMDE_FLOAT64_C( 7661.76),
SIMDE_FLOAT64_C( 8903.22), SIMDE_FLOAT64_C( 1148.23),
SIMDE_FLOAT64_C( 2082.02), SIMDE_FLOAT64_C( 1146.40)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7.71), SIMDE_FLOAT64_C( 8.03),
SIMDE_FLOAT64_C( 9.02), SIMDE_FLOAT64_C( 8.94),
SIMDE_FLOAT64_C( 9.09), SIMDE_FLOAT64_C( 7.05),
SIMDE_FLOAT64_C( 6902.28), SIMDE_FLOAT64_C( 7.04)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5603.27), SIMDE_FLOAT64_C( 7348.31),
SIMDE_FLOAT64_C( 4256.55), SIMDE_FLOAT64_C( 9550.14),
SIMDE_FLOAT64_C( 8956.15), SIMDE_FLOAT64_C( 3981.75),
SIMDE_FLOAT64_C( 6683.64), SIMDE_FLOAT64_C( 1262.07)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7716.73), SIMDE_FLOAT64_C( 4142.45),
SIMDE_FLOAT64_C( 8400.08), SIMDE_FLOAT64_C( 9093.31),
SIMDE_FLOAT64_C( 8002.34), SIMDE_FLOAT64_C( 6271.53),
SIMDE_FLOAT64_C( 4596.36), SIMDE_FLOAT64_C( 276.11)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 5603.27), SIMDE_FLOAT64_C( 8.33),
SIMDE_FLOAT64_C( 4256.55), SIMDE_FLOAT64_C( 9.12),
SIMDE_FLOAT64_C( 8.99), SIMDE_FLOAT64_C( 8.74),
SIMDE_FLOAT64_C( 6683.64), SIMDE_FLOAT64_C( 5.62)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5499.67), SIMDE_FLOAT64_C( 1309.05),
SIMDE_FLOAT64_C( 8793.93), SIMDE_FLOAT64_C( 6717.40),
SIMDE_FLOAT64_C( 1010.42), SIMDE_FLOAT64_C( 2370.85),
SIMDE_FLOAT64_C( 886.73), SIMDE_FLOAT64_C( 8278.35)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 6656.71), SIMDE_FLOAT64_C( 7314.76),
SIMDE_FLOAT64_C( 4105.04), SIMDE_FLOAT64_C( 6623.12),
SIMDE_FLOAT64_C( 628.43), SIMDE_FLOAT64_C( 3357.32),
SIMDE_FLOAT64_C( 4038.44), SIMDE_FLOAT64_C( 7806.81)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 8.80), SIMDE_FLOAT64_C( 1309.05),
SIMDE_FLOAT64_C( 8793.93), SIMDE_FLOAT64_C( 8.80),
SIMDE_FLOAT64_C( 1010.42), SIMDE_FLOAT64_C( 2370.85),
SIMDE_FLOAT64_C( 886.73), SIMDE_FLOAT64_C( 8.96)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1217.92), SIMDE_FLOAT64_C( 5136.26),
SIMDE_FLOAT64_C( 8450.59), SIMDE_FLOAT64_C( 4894.53),
SIMDE_FLOAT64_C( 2755.53), SIMDE_FLOAT64_C( 7528.93),
SIMDE_FLOAT64_C( 9155.11), SIMDE_FLOAT64_C( 9886.80)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 9887.44), SIMDE_FLOAT64_C( 7124.06),
SIMDE_FLOAT64_C( 4524.23), SIMDE_FLOAT64_C( 9203.26),
SIMDE_FLOAT64_C( 2042.18), SIMDE_FLOAT64_C( 8657.47),
SIMDE_FLOAT64_C( 8118.50), SIMDE_FLOAT64_C( 5703.37)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1217.92), SIMDE_FLOAT64_C( 8.87),
SIMDE_FLOAT64_C( 8450.59), SIMDE_FLOAT64_C( 4894.53),
SIMDE_FLOAT64_C( 7.62), SIMDE_FLOAT64_C( 7528.93),
SIMDE_FLOAT64_C( 9.00), SIMDE_FLOAT64_C( 8.65)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 6973.34), SIMDE_FLOAT64_C( 3476.37),
SIMDE_FLOAT64_C( 1516.57), SIMDE_FLOAT64_C( 9110.29),
SIMDE_FLOAT64_C( 11.83), SIMDE_FLOAT64_C( 9618.20),
SIMDE_FLOAT64_C( 1159.42), SIMDE_FLOAT64_C( 4661.80)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7554.25), SIMDE_FLOAT64_C( 5071.68),
SIMDE_FLOAT64_C( 9581.30), SIMDE_FLOAT64_C( 1154.54),
SIMDE_FLOAT64_C( 2130.97), SIMDE_FLOAT64_C( 3312.02),
SIMDE_FLOAT64_C( 6468.19), SIMDE_FLOAT64_C( 2118.90)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 6973.34), SIMDE_FLOAT64_C( 8.53),
SIMDE_FLOAT64_C( 1516.57), SIMDE_FLOAT64_C( 7.05),
SIMDE_FLOAT64_C( 7.66), SIMDE_FLOAT64_C( 8.11),
SIMDE_FLOAT64_C( 1159.42), SIMDE_FLOAT64_C( 7.66)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 7377.56), SIMDE_FLOAT64_C( 9683.23),
SIMDE_FLOAT64_C( 3256.50), SIMDE_FLOAT64_C( 2809.03),
SIMDE_FLOAT64_C( 1237.85), SIMDE_FLOAT64_C( 9663.28),
SIMDE_FLOAT64_C( 3363.90), SIMDE_FLOAT64_C( 4087.77)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1124.78), SIMDE_FLOAT64_C( 7203.19),
SIMDE_FLOAT64_C( 9486.33), SIMDE_FLOAT64_C( 4010.56),
SIMDE_FLOAT64_C( 3201.22), SIMDE_FLOAT64_C( 4831.67),
SIMDE_FLOAT64_C( 5036.36), SIMDE_FLOAT64_C( 4374.02)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7.03), SIMDE_FLOAT64_C( 8.88),
SIMDE_FLOAT64_C( 3256.50), SIMDE_FLOAT64_C( 8.30),
SIMDE_FLOAT64_C( 1237.85), SIMDE_FLOAT64_C( 8.48),
SIMDE_FLOAT64_C( 3363.90), SIMDE_FLOAT64_C( 8.38)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_log_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_log1p_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 75.94), SIMDE_FLOAT32_C( 8.83), SIMDE_FLOAT32_C( 79.72), SIMDE_FLOAT32_C( 43.97) },
{ SIMDE_FLOAT32_C( 4.34), SIMDE_FLOAT32_C( 2.29), SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 3.81) } },
{ { SIMDE_FLOAT32_C( 40.77), SIMDE_FLOAT32_C( 95.32), SIMDE_FLOAT32_C( 68.75), SIMDE_FLOAT32_C( 17.84) },
{ SIMDE_FLOAT32_C( 3.73), SIMDE_FLOAT32_C( 4.57), SIMDE_FLOAT32_C( 4.24), SIMDE_FLOAT32_C( 2.94) } },
{ { SIMDE_FLOAT32_C( 87.84), SIMDE_FLOAT32_C( 9.10), SIMDE_FLOAT32_C( 51.15), SIMDE_FLOAT32_C( 49.38) },
{ SIMDE_FLOAT32_C( 4.49), SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 3.92) } },
{ { SIMDE_FLOAT32_C( 72.43), SIMDE_FLOAT32_C( 10.89), SIMDE_FLOAT32_C( 17.62), SIMDE_FLOAT32_C( 49.42) },
{ SIMDE_FLOAT32_C( 4.30), SIMDE_FLOAT32_C( 2.48), SIMDE_FLOAT32_C( 2.92), SIMDE_FLOAT32_C( 3.92) } },
{ { SIMDE_FLOAT32_C( 61.53), SIMDE_FLOAT32_C( 6.26), SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 56.29) },
{ SIMDE_FLOAT32_C( 4.14), SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 1.53), SIMDE_FLOAT32_C( 4.05) } },
{ { SIMDE_FLOAT32_C( 33.37), SIMDE_FLOAT32_C( 28.79), SIMDE_FLOAT32_C( 10.52), SIMDE_FLOAT32_C( 86.16) },
{ SIMDE_FLOAT32_C( 3.54), SIMDE_FLOAT32_C( 3.39), SIMDE_FLOAT32_C( 2.44), SIMDE_FLOAT32_C( 4.47) } },
{ { SIMDE_FLOAT32_C( 75.88), SIMDE_FLOAT32_C( 38.85), SIMDE_FLOAT32_C( 41.92), SIMDE_FLOAT32_C( 15.06) },
{ SIMDE_FLOAT32_C( 4.34), SIMDE_FLOAT32_C( 3.69), SIMDE_FLOAT32_C( 3.76), SIMDE_FLOAT32_C( 2.78) } },
{ { SIMDE_FLOAT32_C( 49.93), SIMDE_FLOAT32_C( 45.63), SIMDE_FLOAT32_C( 11.83), SIMDE_FLOAT32_C( 25.87) },
{ SIMDE_FLOAT32_C( 3.93), SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( 2.55), SIMDE_FLOAT32_C( 3.29) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_log1p_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_log1p_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 71.66), SIMDE_FLOAT64_C( 23.63) },
{ SIMDE_FLOAT64_C( 4.29), SIMDE_FLOAT64_C( 3.20) } },
{ { SIMDE_FLOAT64_C( 39.38), SIMDE_FLOAT64_C( 45.82) },
{ SIMDE_FLOAT64_C( 3.70), SIMDE_FLOAT64_C( 3.85) } },
{ { SIMDE_FLOAT64_C( 26.23), SIMDE_FLOAT64_C( 40.67) },
{ SIMDE_FLOAT64_C( 3.30), SIMDE_FLOAT64_C( 3.73) } },
{ { SIMDE_FLOAT64_C( 88.01), SIMDE_FLOAT64_C( 4.27) },
{ SIMDE_FLOAT64_C( 4.49), SIMDE_FLOAT64_C( 1.66) } },
{ { SIMDE_FLOAT64_C( 8.61), SIMDE_FLOAT64_C( 48.32) },
{ SIMDE_FLOAT64_C( 2.26), SIMDE_FLOAT64_C( 3.90) } },
{ { SIMDE_FLOAT64_C( 83.85), SIMDE_FLOAT64_C( 77.45) },
{ SIMDE_FLOAT64_C( 4.44), SIMDE_FLOAT64_C( 4.36) } },
{ { SIMDE_FLOAT64_C( 28.87), SIMDE_FLOAT64_C( 9.70) },
{ SIMDE_FLOAT64_C( 3.40), SIMDE_FLOAT64_C( 2.37) } },
{ { SIMDE_FLOAT64_C( 59.45), SIMDE_FLOAT64_C( 89.65) },
{ SIMDE_FLOAT64_C( 4.10), SIMDE_FLOAT64_C( 4.51) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_log1p_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_log1p_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 7.73), SIMDE_FLOAT32_C( 44.58), SIMDE_FLOAT32_C( 1.09), SIMDE_FLOAT32_C( 37.39),
SIMDE_FLOAT32_C( 81.72), SIMDE_FLOAT32_C( 97.03), SIMDE_FLOAT32_C( 32.40), SIMDE_FLOAT32_C( 46.21) },
{ SIMDE_FLOAT32_C( 2.17), SIMDE_FLOAT32_C( 3.82), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 3.65),
SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 4.59), SIMDE_FLOAT32_C( 3.51), SIMDE_FLOAT32_C( 3.85) } },
{ { SIMDE_FLOAT32_C( 68.19), SIMDE_FLOAT32_C( 59.69), SIMDE_FLOAT32_C( 65.16), SIMDE_FLOAT32_C( 49.14),
SIMDE_FLOAT32_C( 16.80), SIMDE_FLOAT32_C( 22.15), SIMDE_FLOAT32_C( 15.49), SIMDE_FLOAT32_C( 40.38) },
{ SIMDE_FLOAT32_C( 4.24), SIMDE_FLOAT32_C( 4.11), SIMDE_FLOAT32_C( 4.19), SIMDE_FLOAT32_C( 3.91),
SIMDE_FLOAT32_C( 2.88), SIMDE_FLOAT32_C( 3.14), SIMDE_FLOAT32_C( 2.80), SIMDE_FLOAT32_C( 3.72) } },
{ { SIMDE_FLOAT32_C( 30.77), SIMDE_FLOAT32_C( 61.57), SIMDE_FLOAT32_C( 50.60), SIMDE_FLOAT32_C( 43.40),
SIMDE_FLOAT32_C( 79.43), SIMDE_FLOAT32_C( 23.65), SIMDE_FLOAT32_C( 55.47), SIMDE_FLOAT32_C( 29.32) },
{ SIMDE_FLOAT32_C( 3.46), SIMDE_FLOAT32_C( 4.14), SIMDE_FLOAT32_C( 3.94), SIMDE_FLOAT32_C( 3.79),
SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 3.20), SIMDE_FLOAT32_C( 4.03), SIMDE_FLOAT32_C( 3.41) } },
{ { SIMDE_FLOAT32_C( 54.13), SIMDE_FLOAT32_C( 82.81), SIMDE_FLOAT32_C( 78.99), SIMDE_FLOAT32_C( 50.88),
SIMDE_FLOAT32_C( 5.92), SIMDE_FLOAT32_C( 42.82), SIMDE_FLOAT32_C( 53.24), SIMDE_FLOAT32_C( 13.65) },
{ SIMDE_FLOAT32_C( 4.01), SIMDE_FLOAT32_C( 4.43), SIMDE_FLOAT32_C( 4.38), SIMDE_FLOAT32_C( 3.95),
SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 3.78), SIMDE_FLOAT32_C( 3.99), SIMDE_FLOAT32_C( 2.68) } },
{ { SIMDE_FLOAT32_C( 87.40), SIMDE_FLOAT32_C( 54.33), SIMDE_FLOAT32_C( 51.04), SIMDE_FLOAT32_C( 69.12),
SIMDE_FLOAT32_C( 51.36), SIMDE_FLOAT32_C( 83.44), SIMDE_FLOAT32_C( 15.34), SIMDE_FLOAT32_C( 19.54) },
{ SIMDE_FLOAT32_C( 4.48), SIMDE_FLOAT32_C( 4.01), SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 4.25),
SIMDE_FLOAT32_C( 3.96), SIMDE_FLOAT32_C( 4.44), SIMDE_FLOAT32_C( 2.79), SIMDE_FLOAT32_C( 3.02) } },
{ { SIMDE_FLOAT32_C( 43.13), SIMDE_FLOAT32_C( 80.50), SIMDE_FLOAT32_C( 68.69), SIMDE_FLOAT32_C( 59.93),
SIMDE_FLOAT32_C( 2.65), SIMDE_FLOAT32_C( 84.18), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 33.43) },
{ SIMDE_FLOAT32_C( 3.79), SIMDE_FLOAT32_C( 4.40), SIMDE_FLOAT32_C( 4.24), SIMDE_FLOAT32_C( 4.11),
SIMDE_FLOAT32_C( 1.29), SIMDE_FLOAT32_C( 4.44), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 3.54) } },
{ { SIMDE_FLOAT32_C( 45.75), SIMDE_FLOAT32_C( 50.91), SIMDE_FLOAT32_C( 76.83), SIMDE_FLOAT32_C( 25.17),
SIMDE_FLOAT32_C( 74.56), SIMDE_FLOAT32_C( 32.30), SIMDE_FLOAT32_C( 54.49), SIMDE_FLOAT32_C( 28.69) },
{ SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 4.35), SIMDE_FLOAT32_C( 3.26),
SIMDE_FLOAT32_C( 4.32), SIMDE_FLOAT32_C( 3.51), SIMDE_FLOAT32_C( 4.02), SIMDE_FLOAT32_C( 3.39) } },
{ { SIMDE_FLOAT32_C( 15.11), SIMDE_FLOAT32_C( 33.49), SIMDE_FLOAT32_C( 79.56), SIMDE_FLOAT32_C( 21.03),
SIMDE_FLOAT32_C( 76.31), SIMDE_FLOAT32_C( 32.80), SIMDE_FLOAT32_C( 34.68), SIMDE_FLOAT32_C( 63.71) },
{ SIMDE_FLOAT32_C( 2.78), SIMDE_FLOAT32_C( 3.54), SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 3.09),
SIMDE_FLOAT32_C( 4.35), SIMDE_FLOAT32_C( 3.52), SIMDE_FLOAT32_C( 3.57), SIMDE_FLOAT32_C( 4.17) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_log1p_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_log1p_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 82.81), SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 54.22), SIMDE_FLOAT64_C( 13.29) },
{ SIMDE_FLOAT64_C( 4.43), SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 4.01), SIMDE_FLOAT64_C( 2.66) } },
{ { SIMDE_FLOAT64_C( 34.27), SIMDE_FLOAT64_C( 86.02), SIMDE_FLOAT64_C( 66.74), SIMDE_FLOAT64_C( 46.61) },
{ SIMDE_FLOAT64_C( 3.56), SIMDE_FLOAT64_C( 4.47), SIMDE_FLOAT64_C( 4.22), SIMDE_FLOAT64_C( 3.86) } },
{ { SIMDE_FLOAT64_C( 95.48), SIMDE_FLOAT64_C( 40.65), SIMDE_FLOAT64_C( 39.71), SIMDE_FLOAT64_C( 33.88) },
{ SIMDE_FLOAT64_C( 4.57), SIMDE_FLOAT64_C( 3.73), SIMDE_FLOAT64_C( 3.71), SIMDE_FLOAT64_C( 3.55) } },
{ { SIMDE_FLOAT64_C( 25.60), SIMDE_FLOAT64_C( 96.16), SIMDE_FLOAT64_C( 45.65), SIMDE_FLOAT64_C( 11.33) },
{ SIMDE_FLOAT64_C( 3.28), SIMDE_FLOAT64_C( 4.58), SIMDE_FLOAT64_C( 3.84), SIMDE_FLOAT64_C( 2.51) } },
{ { SIMDE_FLOAT64_C( 12.09), SIMDE_FLOAT64_C( 86.42), SIMDE_FLOAT64_C( 87.72), SIMDE_FLOAT64_C( 82.93) },
{ SIMDE_FLOAT64_C( 2.57), SIMDE_FLOAT64_C( 4.47), SIMDE_FLOAT64_C( 4.49), SIMDE_FLOAT64_C( 4.43) } },
{ { SIMDE_FLOAT64_C( 74.51), SIMDE_FLOAT64_C( 10.22), SIMDE_FLOAT64_C( 42.74), SIMDE_FLOAT64_C( 42.04) },
{ SIMDE_FLOAT64_C( 4.32), SIMDE_FLOAT64_C( 2.42), SIMDE_FLOAT64_C( 3.78), SIMDE_FLOAT64_C( 3.76) } },
{ { SIMDE_FLOAT64_C( 56.03), SIMDE_FLOAT64_C( 46.45), SIMDE_FLOAT64_C( 79.57), SIMDE_FLOAT64_C( 53.99) },
{ SIMDE_FLOAT64_C( 4.04), SIMDE_FLOAT64_C( 3.86), SIMDE_FLOAT64_C( 4.39), SIMDE_FLOAT64_C( 4.01) } },
{ { SIMDE_FLOAT64_C( 65.41), SIMDE_FLOAT64_C( 86.99), SIMDE_FLOAT64_C( 98.63), SIMDE_FLOAT64_C( 48.22) },
{ SIMDE_FLOAT64_C( 4.20), SIMDE_FLOAT64_C( 4.48), SIMDE_FLOAT64_C( 4.60), SIMDE_FLOAT64_C( 3.90) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_log1p_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_log1p_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 56.49), SIMDE_FLOAT32_C( 45.26), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 8.51),
SIMDE_FLOAT32_C( 84.43), SIMDE_FLOAT32_C( 90.20), SIMDE_FLOAT32_C( 58.37), SIMDE_FLOAT32_C( 91.03),
SIMDE_FLOAT32_C( 16.56), SIMDE_FLOAT32_C( 42.47), SIMDE_FLOAT32_C( 30.02), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( 36.91), SIMDE_FLOAT32_C( 32.16), SIMDE_FLOAT32_C( 13.56), SIMDE_FLOAT32_C( 95.86) },
{ SIMDE_FLOAT32_C( 4.05), SIMDE_FLOAT32_C( 3.83), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 2.25),
SIMDE_FLOAT32_C( 4.45), SIMDE_FLOAT32_C( 4.51), SIMDE_FLOAT32_C( 4.08), SIMDE_FLOAT32_C( 4.52),
SIMDE_FLOAT32_C( 2.87), SIMDE_FLOAT32_C( 3.77), SIMDE_FLOAT32_C( 3.43), SIMDE_FLOAT32_C( 0.32),
SIMDE_FLOAT32_C( 3.64), SIMDE_FLOAT32_C( 3.50), SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( 4.57) } },
{ { SIMDE_FLOAT32_C( 15.15), SIMDE_FLOAT32_C( 66.91), SIMDE_FLOAT32_C( 89.77), SIMDE_FLOAT32_C( 66.71),
SIMDE_FLOAT32_C( 24.15), SIMDE_FLOAT32_C( 55.93), SIMDE_FLOAT32_C( 84.52), SIMDE_FLOAT32_C( 55.70),
SIMDE_FLOAT32_C( 44.08), SIMDE_FLOAT32_C( 33.97), SIMDE_FLOAT32_C( 77.87), SIMDE_FLOAT32_C( 36.54),
SIMDE_FLOAT32_C( 89.83), SIMDE_FLOAT32_C( 75.19), SIMDE_FLOAT32_C( 48.64), SIMDE_FLOAT32_C( 46.32) },
{ SIMDE_FLOAT32_C( 2.78), SIMDE_FLOAT32_C( 4.22), SIMDE_FLOAT32_C( 4.51), SIMDE_FLOAT32_C( 4.22),
SIMDE_FLOAT32_C( 3.22), SIMDE_FLOAT32_C( 4.04), SIMDE_FLOAT32_C( 4.45), SIMDE_FLOAT32_C( 4.04),
SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 3.55), SIMDE_FLOAT32_C( 4.37), SIMDE_FLOAT32_C( 3.63),
SIMDE_FLOAT32_C( 4.51), SIMDE_FLOAT32_C( 4.33), SIMDE_FLOAT32_C( 3.90), SIMDE_FLOAT32_C( 3.86) } },
{ { SIMDE_FLOAT32_C( 20.45), SIMDE_FLOAT32_C( 48.85), SIMDE_FLOAT32_C( 54.83), SIMDE_FLOAT32_C( 4.88),
SIMDE_FLOAT32_C( 39.05), SIMDE_FLOAT32_C( 13.20), SIMDE_FLOAT32_C( 95.91), SIMDE_FLOAT32_C( 55.62),
SIMDE_FLOAT32_C( 55.68), SIMDE_FLOAT32_C( 25.92), SIMDE_FLOAT32_C( 55.99), SIMDE_FLOAT32_C( 92.58),
SIMDE_FLOAT32_C( 58.09), SIMDE_FLOAT32_C( 69.55), SIMDE_FLOAT32_C( 88.44), SIMDE_FLOAT32_C( 73.24) },
{ SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 3.91), SIMDE_FLOAT32_C( 4.02), SIMDE_FLOAT32_C( 1.77),
SIMDE_FLOAT32_C( 3.69), SIMDE_FLOAT32_C( 2.65), SIMDE_FLOAT32_C( 4.57), SIMDE_FLOAT32_C( 4.04),
SIMDE_FLOAT32_C( 4.04), SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( 4.04), SIMDE_FLOAT32_C( 4.54),
SIMDE_FLOAT32_C( 4.08), SIMDE_FLOAT32_C( 4.26), SIMDE_FLOAT32_C( 4.49), SIMDE_FLOAT32_C( 4.31) } },
{ { SIMDE_FLOAT32_C( 36.47), SIMDE_FLOAT32_C( 78.21), SIMDE_FLOAT32_C( 39.95), SIMDE_FLOAT32_C( 60.62),
SIMDE_FLOAT32_C( 34.14), SIMDE_FLOAT32_C( 24.47), SIMDE_FLOAT32_C( 16.32), SIMDE_FLOAT32_C( 78.22),
SIMDE_FLOAT32_C( 58.44), SIMDE_FLOAT32_C( 94.19), SIMDE_FLOAT32_C( 14.75), SIMDE_FLOAT32_C( 48.27),
SIMDE_FLOAT32_C( 69.38), SIMDE_FLOAT32_C( 63.39), SIMDE_FLOAT32_C( 94.60), SIMDE_FLOAT32_C( 89.83) },
{ SIMDE_FLOAT32_C( 3.62), SIMDE_FLOAT32_C( 4.37), SIMDE_FLOAT32_C( 3.71), SIMDE_FLOAT32_C( 4.12),
SIMDE_FLOAT32_C( 3.56), SIMDE_FLOAT32_C( 3.24), SIMDE_FLOAT32_C( 2.85), SIMDE_FLOAT32_C( 4.37),
SIMDE_FLOAT32_C( 4.08), SIMDE_FLOAT32_C( 4.56), SIMDE_FLOAT32_C( 2.76), SIMDE_FLOAT32_C( 3.90),
SIMDE_FLOAT32_C( 4.25), SIMDE_FLOAT32_C( 4.16), SIMDE_FLOAT32_C( 4.56), SIMDE_FLOAT32_C( 4.51) } },
{ { SIMDE_FLOAT32_C( 12.25), SIMDE_FLOAT32_C( 49.43), SIMDE_FLOAT32_C( 94.71), SIMDE_FLOAT32_C( 51.30),
SIMDE_FLOAT32_C( 62.63), SIMDE_FLOAT32_C( 90.62), SIMDE_FLOAT32_C( 6.92), SIMDE_FLOAT32_C( 18.31),
SIMDE_FLOAT32_C( 16.54), SIMDE_FLOAT32_C( 62.91), SIMDE_FLOAT32_C( 10.89), SIMDE_FLOAT32_C( 74.63),
SIMDE_FLOAT32_C( 32.47), SIMDE_FLOAT32_C( 99.33), SIMDE_FLOAT32_C( 47.86), SIMDE_FLOAT32_C( 68.94) },
{ SIMDE_FLOAT32_C( 2.58), SIMDE_FLOAT32_C( 3.92), SIMDE_FLOAT32_C( 4.56), SIMDE_FLOAT32_C( 3.96),
SIMDE_FLOAT32_C( 4.15), SIMDE_FLOAT32_C( 4.52), SIMDE_FLOAT32_C( 2.07), SIMDE_FLOAT32_C( 2.96),
SIMDE_FLOAT32_C( 2.86), SIMDE_FLOAT32_C( 4.16), SIMDE_FLOAT32_C( 2.48), SIMDE_FLOAT32_C( 4.33),
SIMDE_FLOAT32_C( 3.51), SIMDE_FLOAT32_C( 4.61), SIMDE_FLOAT32_C( 3.89), SIMDE_FLOAT32_C( 4.25) } },
{ { SIMDE_FLOAT32_C( 77.54), SIMDE_FLOAT32_C( 87.82), SIMDE_FLOAT32_C( 29.55), SIMDE_FLOAT32_C( 11.68),
SIMDE_FLOAT32_C( 12.29), SIMDE_FLOAT32_C( 45.87), SIMDE_FLOAT32_C( 89.89), SIMDE_FLOAT32_C( 70.73),
SIMDE_FLOAT32_C( 40.05), SIMDE_FLOAT32_C( 4.64), SIMDE_FLOAT32_C( 19.00), SIMDE_FLOAT32_C( 9.43),
SIMDE_FLOAT32_C( 68.04), SIMDE_FLOAT32_C( 13.59), SIMDE_FLOAT32_C( 99.26), SIMDE_FLOAT32_C( 80.28) },
{ SIMDE_FLOAT32_C( 4.36), SIMDE_FLOAT32_C( 4.49), SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( 2.54),
SIMDE_FLOAT32_C( 2.59), SIMDE_FLOAT32_C( 3.85), SIMDE_FLOAT32_C( 4.51), SIMDE_FLOAT32_C( 4.27),
SIMDE_FLOAT32_C( 3.71), SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 2.34),
SIMDE_FLOAT32_C( 4.23), SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( 4.61), SIMDE_FLOAT32_C( 4.40) } },
{ { SIMDE_FLOAT32_C( 63.02), SIMDE_FLOAT32_C( 93.97), SIMDE_FLOAT32_C( 31.58), SIMDE_FLOAT32_C( 25.65),
SIMDE_FLOAT32_C( 84.59), SIMDE_FLOAT32_C( 38.50), SIMDE_FLOAT32_C( 43.96), SIMDE_FLOAT32_C( 1.13),
SIMDE_FLOAT32_C( 1.41), SIMDE_FLOAT32_C( 54.85), SIMDE_FLOAT32_C( 75.76), SIMDE_FLOAT32_C( 33.88),
SIMDE_FLOAT32_C( 54.18), SIMDE_FLOAT32_C( 23.62), SIMDE_FLOAT32_C( 2.81), SIMDE_FLOAT32_C( 31.72) },
{ SIMDE_FLOAT32_C( 4.16), SIMDE_FLOAT32_C( 4.55), SIMDE_FLOAT32_C( 3.48), SIMDE_FLOAT32_C( 3.28),
SIMDE_FLOAT32_C( 4.45), SIMDE_FLOAT32_C( 3.68), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 4.02), SIMDE_FLOAT32_C( 4.34), SIMDE_FLOAT32_C( 3.55),
SIMDE_FLOAT32_C( 4.01), SIMDE_FLOAT32_C( 3.20), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 3.49) } },
{ { SIMDE_FLOAT32_C( 11.44), SIMDE_FLOAT32_C( 32.37), SIMDE_FLOAT32_C( 43.39), SIMDE_FLOAT32_C( 23.72),
SIMDE_FLOAT32_C( 78.23), SIMDE_FLOAT32_C( 33.28), SIMDE_FLOAT32_C( 94.45), SIMDE_FLOAT32_C( 18.29),
SIMDE_FLOAT32_C( 37.93), SIMDE_FLOAT32_C( 13.45), SIMDE_FLOAT32_C( 27.72), SIMDE_FLOAT32_C( 5.96),
SIMDE_FLOAT32_C( 27.05), SIMDE_FLOAT32_C( 26.98), SIMDE_FLOAT32_C( 86.25), SIMDE_FLOAT32_C( 90.07) },
{ SIMDE_FLOAT32_C( 2.52), SIMDE_FLOAT32_C( 3.51), SIMDE_FLOAT32_C( 3.79), SIMDE_FLOAT32_C( 3.21),
SIMDE_FLOAT32_C( 4.37), SIMDE_FLOAT32_C( 3.53), SIMDE_FLOAT32_C( 4.56), SIMDE_FLOAT32_C( 2.96),
SIMDE_FLOAT32_C( 3.66), SIMDE_FLOAT32_C( 2.67), SIMDE_FLOAT32_C( 3.36), SIMDE_FLOAT32_C( 1.94),
SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 4.47), SIMDE_FLOAT32_C( 4.51) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_log1p_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_log1p_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 12.54), SIMDE_FLOAT32_C( 63.14), SIMDE_FLOAT32_C( 41.17), SIMDE_FLOAT32_C( 60.95),
SIMDE_FLOAT32_C( 4.09), SIMDE_FLOAT32_C( 68.78), SIMDE_FLOAT32_C( 40.84), SIMDE_FLOAT32_C( 68.42),
SIMDE_FLOAT32_C( 63.18), SIMDE_FLOAT32_C( 48.47), SIMDE_FLOAT32_C( 50.42), SIMDE_FLOAT32_C( 37.77),
SIMDE_FLOAT32_C( 19.29), SIMDE_FLOAT32_C( 67.41), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 31.94) },
UINT8_C(226),
{ SIMDE_FLOAT32_C( 39.12), SIMDE_FLOAT32_C( 35.10), SIMDE_FLOAT32_C( 9.96), SIMDE_FLOAT32_C( 0.10),
SIMDE_FLOAT32_C( 31.24), SIMDE_FLOAT32_C( 60.86), SIMDE_FLOAT32_C( 46.96), SIMDE_FLOAT32_C( 34.48),
SIMDE_FLOAT32_C( 76.57), SIMDE_FLOAT32_C( 78.00), SIMDE_FLOAT32_C( 14.95), SIMDE_FLOAT32_C( 17.36),
SIMDE_FLOAT32_C( 66.84), SIMDE_FLOAT32_C( 3.16), SIMDE_FLOAT32_C( 29.89), SIMDE_FLOAT32_C( 29.98) },
{ SIMDE_FLOAT32_C( 12.54), SIMDE_FLOAT32_C( 3.59), SIMDE_FLOAT32_C( 41.17), SIMDE_FLOAT32_C( 60.95),
SIMDE_FLOAT32_C( 4.09), SIMDE_FLOAT32_C( 4.12), SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 3.57),
SIMDE_FLOAT32_C( 63.18), SIMDE_FLOAT32_C( 48.47), SIMDE_FLOAT32_C( 50.42), SIMDE_FLOAT32_C( 37.77),
SIMDE_FLOAT32_C( 19.29), SIMDE_FLOAT32_C( 67.41), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 31.94) } },
{ { SIMDE_FLOAT32_C( 44.33), SIMDE_FLOAT32_C( 90.84), SIMDE_FLOAT32_C( 34.08), SIMDE_FLOAT32_C( 13.10),
SIMDE_FLOAT32_C( 31.68), SIMDE_FLOAT32_C( 2.49), SIMDE_FLOAT32_C( 76.28), SIMDE_FLOAT32_C( 80.15),
SIMDE_FLOAT32_C( 52.91), SIMDE_FLOAT32_C( 14.05), SIMDE_FLOAT32_C( 99.44), SIMDE_FLOAT32_C( 20.32),
SIMDE_FLOAT32_C( 14.75), SIMDE_FLOAT32_C( 31.39), SIMDE_FLOAT32_C( 83.76), SIMDE_FLOAT32_C( 53.87) },
UINT8_C(211),
{ SIMDE_FLOAT32_C( 93.72), SIMDE_FLOAT32_C( 53.97), SIMDE_FLOAT32_C( 97.73), SIMDE_FLOAT32_C( 54.58),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 32.21), SIMDE_FLOAT32_C( 31.15), SIMDE_FLOAT32_C( 78.93),
SIMDE_FLOAT32_C( 47.16), SIMDE_FLOAT32_C( 48.50), SIMDE_FLOAT32_C( 45.77), SIMDE_FLOAT32_C( 50.32),
SIMDE_FLOAT32_C( 78.40), SIMDE_FLOAT32_C( 75.75), SIMDE_FLOAT32_C( 94.65), SIMDE_FLOAT32_C( 69.24) },
{ SIMDE_FLOAT32_C( 4.55), SIMDE_FLOAT32_C( 4.01), SIMDE_FLOAT32_C( 34.08), SIMDE_FLOAT32_C( 13.10),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 2.49), SIMDE_FLOAT32_C( 3.47), SIMDE_FLOAT32_C( 4.38),
SIMDE_FLOAT32_C( 52.91), SIMDE_FLOAT32_C( 14.05), SIMDE_FLOAT32_C( 99.44), SIMDE_FLOAT32_C( 20.32),
SIMDE_FLOAT32_C( 14.75), SIMDE_FLOAT32_C( 31.39), SIMDE_FLOAT32_C( 83.76), SIMDE_FLOAT32_C( 53.87) } },
{ { SIMDE_FLOAT32_C( 9.83), SIMDE_FLOAT32_C( 7.75), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 12.32),
SIMDE_FLOAT32_C( 84.03), SIMDE_FLOAT32_C( 81.06), SIMDE_FLOAT32_C( 65.23), SIMDE_FLOAT32_C( 98.08),
SIMDE_FLOAT32_C( 80.51), SIMDE_FLOAT32_C( 85.55), SIMDE_FLOAT32_C( 12.83), SIMDE_FLOAT32_C( 11.90),
SIMDE_FLOAT32_C( 69.31), SIMDE_FLOAT32_C( 66.70), SIMDE_FLOAT32_C( 78.39), SIMDE_FLOAT32_C( 63.03) },
UINT8_MAX,
{ SIMDE_FLOAT32_C( 76.12), SIMDE_FLOAT32_C( 17.61), SIMDE_FLOAT32_C( 21.60), SIMDE_FLOAT32_C( 8.33),
SIMDE_FLOAT32_C( 48.76), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 55.49), SIMDE_FLOAT32_C( 97.26),
SIMDE_FLOAT32_C( 46.29), SIMDE_FLOAT32_C( 5.81), SIMDE_FLOAT32_C( 75.66), SIMDE_FLOAT32_C( 22.04),
SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 44.89), SIMDE_FLOAT32_C( 31.87), SIMDE_FLOAT32_C( 8.21) },
{ SIMDE_FLOAT32_C( 4.35), SIMDE_FLOAT32_C( 2.92), SIMDE_FLOAT32_C( 3.12), SIMDE_FLOAT32_C( 2.23),
SIMDE_FLOAT32_C( 3.91), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 4.03), SIMDE_FLOAT32_C( 4.59),
SIMDE_FLOAT32_C( 80.51), SIMDE_FLOAT32_C( 85.55), SIMDE_FLOAT32_C( 12.83), SIMDE_FLOAT32_C( 11.90),
SIMDE_FLOAT32_C( 69.31), SIMDE_FLOAT32_C( 66.70), SIMDE_FLOAT32_C( 78.39), SIMDE_FLOAT32_C( 63.03) } },
{ { SIMDE_FLOAT32_C( 45.81), SIMDE_FLOAT32_C( 44.19), SIMDE_FLOAT32_C( 92.24), SIMDE_FLOAT32_C( 26.87),
SIMDE_FLOAT32_C( 9.42), SIMDE_FLOAT32_C( 90.33), SIMDE_FLOAT32_C( 7.38), SIMDE_FLOAT32_C( 94.98),
SIMDE_FLOAT32_C( 3.16), SIMDE_FLOAT32_C( 19.28), SIMDE_FLOAT32_C( 64.29), SIMDE_FLOAT32_C( 69.86),
SIMDE_FLOAT32_C( 97.67), SIMDE_FLOAT32_C( 27.32), SIMDE_FLOAT32_C( 90.53), SIMDE_FLOAT32_C( 73.79) },
UINT8_C(154),
{ SIMDE_FLOAT32_C( 12.13), SIMDE_FLOAT32_C( 82.12), SIMDE_FLOAT32_C( 93.69), SIMDE_FLOAT32_C( 12.65),
SIMDE_FLOAT32_C( 37.62), SIMDE_FLOAT32_C( 90.95), SIMDE_FLOAT32_C( 58.94), SIMDE_FLOAT32_C( 43.43),
SIMDE_FLOAT32_C( 66.61), SIMDE_FLOAT32_C( 80.98), SIMDE_FLOAT32_C( 43.89), SIMDE_FLOAT32_C( 11.51),
SIMDE_FLOAT32_C( 12.84), SIMDE_FLOAT32_C( 52.10), SIMDE_FLOAT32_C( 57.32), SIMDE_FLOAT32_C( 57.03) },
{ SIMDE_FLOAT32_C( 45.81), SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 92.24), SIMDE_FLOAT32_C( 2.61),
SIMDE_FLOAT32_C( 3.65), SIMDE_FLOAT32_C( 90.33), SIMDE_FLOAT32_C( 7.38), SIMDE_FLOAT32_C( 3.79),
SIMDE_FLOAT32_C( 3.16), SIMDE_FLOAT32_C( 19.28), SIMDE_FLOAT32_C( 64.29), SIMDE_FLOAT32_C( 69.86),
SIMDE_FLOAT32_C( 97.67), SIMDE_FLOAT32_C( 27.32), SIMDE_FLOAT32_C( 90.53), SIMDE_FLOAT32_C( 73.79) } },
{ { SIMDE_FLOAT32_C( 44.35), SIMDE_FLOAT32_C( 84.19), SIMDE_FLOAT32_C( 66.46), SIMDE_FLOAT32_C( 34.67),
SIMDE_FLOAT32_C( 91.58), SIMDE_FLOAT32_C( 61.43), SIMDE_FLOAT32_C( 37.83), SIMDE_FLOAT32_C( 10.85),
SIMDE_FLOAT32_C( 25.72), SIMDE_FLOAT32_C( 7.69), SIMDE_FLOAT32_C( 8.52), SIMDE_FLOAT32_C( 53.04),
SIMDE_FLOAT32_C( 98.23), SIMDE_FLOAT32_C( 82.31), SIMDE_FLOAT32_C( 97.98), SIMDE_FLOAT32_C( 10.35) },
UINT8_C( 99),
{ SIMDE_FLOAT32_C( 91.67), SIMDE_FLOAT32_C( 23.00), SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( 82.62),
SIMDE_FLOAT32_C( 81.94), SIMDE_FLOAT32_C( 45.48), SIMDE_FLOAT32_C( 49.24), SIMDE_FLOAT32_C( 62.91),
SIMDE_FLOAT32_C( 89.37), SIMDE_FLOAT32_C( 60.75), SIMDE_FLOAT32_C( 75.76), SIMDE_FLOAT32_C( 41.47),
SIMDE_FLOAT32_C( 18.07), SIMDE_FLOAT32_C( 32.79), SIMDE_FLOAT32_C( 85.82), SIMDE_FLOAT32_C( 2.26) },
{ SIMDE_FLOAT32_C( 4.53), SIMDE_FLOAT32_C( 3.18), SIMDE_FLOAT32_C( 66.46), SIMDE_FLOAT32_C( 34.67),
SIMDE_FLOAT32_C( 91.58), SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( 3.92), SIMDE_FLOAT32_C( 10.85),
SIMDE_FLOAT32_C( 25.72), SIMDE_FLOAT32_C( 7.69), SIMDE_FLOAT32_C( 8.52), SIMDE_FLOAT32_C( 53.04),
SIMDE_FLOAT32_C( 98.23), SIMDE_FLOAT32_C( 82.31), SIMDE_FLOAT32_C( 97.98), SIMDE_FLOAT32_C( 10.35) } },
{ { SIMDE_FLOAT32_C( 99.25), SIMDE_FLOAT32_C( 20.49), SIMDE_FLOAT32_C( 93.84), SIMDE_FLOAT32_C( 60.68),
SIMDE_FLOAT32_C( 58.33), SIMDE_FLOAT32_C( 4.69), SIMDE_FLOAT32_C( 86.40), SIMDE_FLOAT32_C( 66.02),
SIMDE_FLOAT32_C( 13.21), SIMDE_FLOAT32_C( 39.45), SIMDE_FLOAT32_C( 64.25), SIMDE_FLOAT32_C( 95.52),
SIMDE_FLOAT32_C( 37.43), SIMDE_FLOAT32_C( 74.60), SIMDE_FLOAT32_C( 59.95), SIMDE_FLOAT32_C( 29.10) },
UINT8_C( 67),
{ SIMDE_FLOAT32_C( 62.00), SIMDE_FLOAT32_C( 11.72), SIMDE_FLOAT32_C( 79.53), SIMDE_FLOAT32_C( 7.47),
SIMDE_FLOAT32_C( 60.96), SIMDE_FLOAT32_C( 42.45), SIMDE_FLOAT32_C( 96.84), SIMDE_FLOAT32_C( 21.71),
SIMDE_FLOAT32_C( 18.20), SIMDE_FLOAT32_C( 38.31), SIMDE_FLOAT32_C( 39.77), SIMDE_FLOAT32_C( 50.99),
SIMDE_FLOAT32_C( 24.13), SIMDE_FLOAT32_C( 42.03), SIMDE_FLOAT32_C( 50.24), SIMDE_FLOAT32_C( 44.62) },
{ SIMDE_FLOAT32_C( 4.14), SIMDE_FLOAT32_C( 2.54), SIMDE_FLOAT32_C( 93.84), SIMDE_FLOAT32_C( 60.68),
SIMDE_FLOAT32_C( 58.33), SIMDE_FLOAT32_C( 4.69), SIMDE_FLOAT32_C( 4.58), SIMDE_FLOAT32_C( 66.02),
SIMDE_FLOAT32_C( 13.21), SIMDE_FLOAT32_C( 39.45), SIMDE_FLOAT32_C( 64.25), SIMDE_FLOAT32_C( 95.52),
SIMDE_FLOAT32_C( 37.43), SIMDE_FLOAT32_C( 74.60), SIMDE_FLOAT32_C( 59.95), SIMDE_FLOAT32_C( 29.10) } },
{ { SIMDE_FLOAT32_C( 35.87), SIMDE_FLOAT32_C( 10.92), SIMDE_FLOAT32_C( 2.95), SIMDE_FLOAT32_C( 40.56),
SIMDE_FLOAT32_C( 97.33), SIMDE_FLOAT32_C( 68.97), SIMDE_FLOAT32_C( 53.77), SIMDE_FLOAT32_C( 36.78),
SIMDE_FLOAT32_C( 33.22), SIMDE_FLOAT32_C( 49.29), SIMDE_FLOAT32_C( 74.20), SIMDE_FLOAT32_C( 7.81),
SIMDE_FLOAT32_C( 9.24), SIMDE_FLOAT32_C( 3.30), SIMDE_FLOAT32_C( 5.41), SIMDE_FLOAT32_C( 71.23) },
UINT8_C(113),
{ SIMDE_FLOAT32_C( 84.95), SIMDE_FLOAT32_C( 78.70), SIMDE_FLOAT32_C( 75.98), SIMDE_FLOAT32_C( 27.40),
SIMDE_FLOAT32_C( 75.54), SIMDE_FLOAT32_C( 97.69), SIMDE_FLOAT32_C( 45.60), SIMDE_FLOAT32_C( 13.85),
SIMDE_FLOAT32_C( 37.46), SIMDE_FLOAT32_C( 96.59), SIMDE_FLOAT32_C( 37.98), SIMDE_FLOAT32_C( 79.49),
SIMDE_FLOAT32_C( 46.83), SIMDE_FLOAT32_C( 82.60), SIMDE_FLOAT32_C( 15.36), SIMDE_FLOAT32_C( 57.76) },
{ SIMDE_FLOAT32_C( 4.45), SIMDE_FLOAT32_C( 10.92), SIMDE_FLOAT32_C( 2.95), SIMDE_FLOAT32_C( 40.56),
SIMDE_FLOAT32_C( 4.34), SIMDE_FLOAT32_C( 4.59), SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( 36.78),
SIMDE_FLOAT32_C( 33.22), SIMDE_FLOAT32_C( 49.29), SIMDE_FLOAT32_C( 74.20), SIMDE_FLOAT32_C( 7.81),
SIMDE_FLOAT32_C( 9.24), SIMDE_FLOAT32_C( 3.30), SIMDE_FLOAT32_C( 5.41), SIMDE_FLOAT32_C( 71.23) } },
{ { SIMDE_FLOAT32_C( 85.55), SIMDE_FLOAT32_C( 55.92), SIMDE_FLOAT32_C( 55.08), SIMDE_FLOAT32_C( 54.52),
SIMDE_FLOAT32_C( 9.69), SIMDE_FLOAT32_C( 91.86), SIMDE_FLOAT32_C( 87.73), SIMDE_FLOAT32_C( 58.97),
SIMDE_FLOAT32_C( 66.07), SIMDE_FLOAT32_C( 95.55), SIMDE_FLOAT32_C( 68.21), SIMDE_FLOAT32_C( 69.37),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 39.44), SIMDE_FLOAT32_C( 84.39), SIMDE_FLOAT32_C( 85.91) },
UINT8_C(146),
{ SIMDE_FLOAT32_C( 60.37), SIMDE_FLOAT32_C( 13.30), SIMDE_FLOAT32_C( 93.69), SIMDE_FLOAT32_C( 58.06),
SIMDE_FLOAT32_C( 58.90), SIMDE_FLOAT32_C( 7.54), SIMDE_FLOAT32_C( 95.52), SIMDE_FLOAT32_C( 55.49),
SIMDE_FLOAT32_C( 45.52), SIMDE_FLOAT32_C( 75.01), SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( 28.12),
SIMDE_FLOAT32_C( 90.38), SIMDE_FLOAT32_C( 60.08), SIMDE_FLOAT32_C( 13.67), SIMDE_FLOAT32_C( 46.30) },
{ SIMDE_FLOAT32_C( 85.55), SIMDE_FLOAT32_C( 2.66), SIMDE_FLOAT32_C( 55.08), SIMDE_FLOAT32_C( 54.52),
SIMDE_FLOAT32_C( 4.09), SIMDE_FLOAT32_C( 91.86), SIMDE_FLOAT32_C( 87.73), SIMDE_FLOAT32_C( 4.03),
SIMDE_FLOAT32_C( 66.07), SIMDE_FLOAT32_C( 95.55), SIMDE_FLOAT32_C( 68.21), SIMDE_FLOAT32_C( 69.37),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 39.44), SIMDE_FLOAT32_C( 84.39), SIMDE_FLOAT32_C( 85.91) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_log1p_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_log1p_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 80.88), SIMDE_FLOAT64_C( 97.21), SIMDE_FLOAT64_C( 22.72), SIMDE_FLOAT64_C( 88.57),
SIMDE_FLOAT64_C( 7.11), SIMDE_FLOAT64_C( 33.20), SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( 68.60) },
{ SIMDE_FLOAT64_C( 4.41), SIMDE_FLOAT64_C( 4.59), SIMDE_FLOAT64_C( 3.17), SIMDE_FLOAT64_C( 4.50),
SIMDE_FLOAT64_C( 2.09), SIMDE_FLOAT64_C( 3.53), SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( 4.24) } },
{ { SIMDE_FLOAT64_C( 18.60), SIMDE_FLOAT64_C( 97.22), SIMDE_FLOAT64_C( 36.01), SIMDE_FLOAT64_C( 5.77),
SIMDE_FLOAT64_C( 37.64), SIMDE_FLOAT64_C( 8.06), SIMDE_FLOAT64_C( 89.11), SIMDE_FLOAT64_C( 35.34) },
{ SIMDE_FLOAT64_C( 2.98), SIMDE_FLOAT64_C( 4.59), SIMDE_FLOAT64_C( 3.61), SIMDE_FLOAT64_C( 1.91),
SIMDE_FLOAT64_C( 3.65), SIMDE_FLOAT64_C( 2.20), SIMDE_FLOAT64_C( 4.50), SIMDE_FLOAT64_C( 3.59) } },
{ { SIMDE_FLOAT64_C( 29.67), SIMDE_FLOAT64_C( 90.68), SIMDE_FLOAT64_C( 39.64), SIMDE_FLOAT64_C( 62.60),
SIMDE_FLOAT64_C( 75.54), SIMDE_FLOAT64_C( 10.18), SIMDE_FLOAT64_C( 92.73), SIMDE_FLOAT64_C( 94.58) },
{ SIMDE_FLOAT64_C( 3.42), SIMDE_FLOAT64_C( 4.52), SIMDE_FLOAT64_C( 3.70), SIMDE_FLOAT64_C( 4.15),
SIMDE_FLOAT64_C( 4.34), SIMDE_FLOAT64_C( 2.41), SIMDE_FLOAT64_C( 4.54), SIMDE_FLOAT64_C( 4.56) } },
{ { SIMDE_FLOAT64_C( 76.16), SIMDE_FLOAT64_C( 5.81), SIMDE_FLOAT64_C( 62.23), SIMDE_FLOAT64_C( 5.12),
SIMDE_FLOAT64_C( 77.73), SIMDE_FLOAT64_C( 84.72), SIMDE_FLOAT64_C( 14.00), SIMDE_FLOAT64_C( 58.61) },
{ SIMDE_FLOAT64_C( 4.35), SIMDE_FLOAT64_C( 1.92), SIMDE_FLOAT64_C( 4.15), SIMDE_FLOAT64_C( 1.81),
SIMDE_FLOAT64_C( 4.37), SIMDE_FLOAT64_C( 4.45), SIMDE_FLOAT64_C( 2.71), SIMDE_FLOAT64_C( 4.09) } },
{ { SIMDE_FLOAT64_C( 81.93), SIMDE_FLOAT64_C( 36.72), SIMDE_FLOAT64_C( 47.19), SIMDE_FLOAT64_C( 89.04),
SIMDE_FLOAT64_C( 69.92), SIMDE_FLOAT64_C( 48.10), SIMDE_FLOAT64_C( 57.64), SIMDE_FLOAT64_C( 88.52) },
{ SIMDE_FLOAT64_C( 4.42), SIMDE_FLOAT64_C( 3.63), SIMDE_FLOAT64_C( 3.88), SIMDE_FLOAT64_C( 4.50),
SIMDE_FLOAT64_C( 4.26), SIMDE_FLOAT64_C( 3.89), SIMDE_FLOAT64_C( 4.07), SIMDE_FLOAT64_C( 4.49) } },
{ { SIMDE_FLOAT64_C( 45.32), SIMDE_FLOAT64_C( 93.65), SIMDE_FLOAT64_C( 94.30), SIMDE_FLOAT64_C( 82.96),
SIMDE_FLOAT64_C( 1.71), SIMDE_FLOAT64_C( 83.41), SIMDE_FLOAT64_C( 18.30), SIMDE_FLOAT64_C( 31.38) },
{ SIMDE_FLOAT64_C( 3.84), SIMDE_FLOAT64_C( 4.55), SIMDE_FLOAT64_C( 4.56), SIMDE_FLOAT64_C( 4.43),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 4.44), SIMDE_FLOAT64_C( 2.96), SIMDE_FLOAT64_C( 3.48) } },
{ { SIMDE_FLOAT64_C( 74.09), SIMDE_FLOAT64_C( 57.95), SIMDE_FLOAT64_C( 93.98), SIMDE_FLOAT64_C( 49.63),
SIMDE_FLOAT64_C( 68.12), SIMDE_FLOAT64_C( 86.71), SIMDE_FLOAT64_C( 44.21), SIMDE_FLOAT64_C( 44.28) },
{ SIMDE_FLOAT64_C( 4.32), SIMDE_FLOAT64_C( 4.08), SIMDE_FLOAT64_C( 4.55), SIMDE_FLOAT64_C( 3.92),
SIMDE_FLOAT64_C( 4.24), SIMDE_FLOAT64_C( 4.47), SIMDE_FLOAT64_C( 3.81), SIMDE_FLOAT64_C( 3.81) } },
{ { SIMDE_FLOAT64_C( 92.51), SIMDE_FLOAT64_C( 6.45), SIMDE_FLOAT64_C( 49.40), SIMDE_FLOAT64_C( 70.25),
SIMDE_FLOAT64_C( 91.16), SIMDE_FLOAT64_C( 63.40), SIMDE_FLOAT64_C( 28.86), SIMDE_FLOAT64_C( 73.09) },
{ SIMDE_FLOAT64_C( 4.54), SIMDE_FLOAT64_C( 2.01), SIMDE_FLOAT64_C( 3.92), SIMDE_FLOAT64_C( 4.27),
SIMDE_FLOAT64_C( 4.52), SIMDE_FLOAT64_C( 4.17), SIMDE_FLOAT64_C( 3.40), SIMDE_FLOAT64_C( 4.31) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_log1p_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_log1p_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 52.74), SIMDE_FLOAT64_C( 77.28), SIMDE_FLOAT64_C( 34.34), SIMDE_FLOAT64_C( 52.30),
SIMDE_FLOAT64_C( 78.12), SIMDE_FLOAT64_C( 51.61), SIMDE_FLOAT64_C( 6.35), SIMDE_FLOAT64_C( 45.83) },
UINT8_C( 39),
{ SIMDE_FLOAT64_C( 43.10), SIMDE_FLOAT64_C( 47.48), SIMDE_FLOAT64_C( 21.67), SIMDE_FLOAT64_C( 82.04),
SIMDE_FLOAT64_C( 40.45), SIMDE_FLOAT64_C( 94.76), SIMDE_FLOAT64_C( 61.37), SIMDE_FLOAT64_C( 11.74) },
{ SIMDE_FLOAT64_C( 3.79), SIMDE_FLOAT64_C( 3.88), SIMDE_FLOAT64_C( 3.12), SIMDE_FLOAT64_C( 52.30),
SIMDE_FLOAT64_C( 78.12), SIMDE_FLOAT64_C( 4.56), SIMDE_FLOAT64_C( 6.35), SIMDE_FLOAT64_C( 45.83) } },
{ { SIMDE_FLOAT64_C( 1.10), SIMDE_FLOAT64_C( 18.75), SIMDE_FLOAT64_C( 3.08), SIMDE_FLOAT64_C( 98.55),
SIMDE_FLOAT64_C( 92.65), SIMDE_FLOAT64_C( 11.89), SIMDE_FLOAT64_C( 24.76), SIMDE_FLOAT64_C( 36.96) },
UINT8_C(244),
{ SIMDE_FLOAT64_C( 46.12), SIMDE_FLOAT64_C( 85.44), SIMDE_FLOAT64_C( 4.83), SIMDE_FLOAT64_C( 24.72),
SIMDE_FLOAT64_C( 98.67), SIMDE_FLOAT64_C( 57.57), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 33.01) },
{ SIMDE_FLOAT64_C( 1.10), SIMDE_FLOAT64_C( 18.75), SIMDE_FLOAT64_C( 1.76), SIMDE_FLOAT64_C( 98.55),
SIMDE_FLOAT64_C( 4.60), SIMDE_FLOAT64_C( 4.07), SIMDE_FLOAT64_C( 1.10), SIMDE_FLOAT64_C( 3.53) } },
{ { SIMDE_FLOAT64_C( 9.87), SIMDE_FLOAT64_C( 80.12), SIMDE_FLOAT64_C( 84.62), SIMDE_FLOAT64_C( 16.22),
SIMDE_FLOAT64_C( 25.95), SIMDE_FLOAT64_C( 41.00), SIMDE_FLOAT64_C( 59.31), SIMDE_FLOAT64_C( 73.43) },
UINT8_C( 77),
{ SIMDE_FLOAT64_C( 41.35), SIMDE_FLOAT64_C( 13.88), SIMDE_FLOAT64_C( 57.44), SIMDE_FLOAT64_C( 2.72),
SIMDE_FLOAT64_C( 25.62), SIMDE_FLOAT64_C( 58.53), SIMDE_FLOAT64_C( 21.47), SIMDE_FLOAT64_C( 28.69) },
{ SIMDE_FLOAT64_C( 3.75), SIMDE_FLOAT64_C( 80.12), SIMDE_FLOAT64_C( 4.07), SIMDE_FLOAT64_C( 1.31),
SIMDE_FLOAT64_C( 25.95), SIMDE_FLOAT64_C( 41.00), SIMDE_FLOAT64_C( 3.11), SIMDE_FLOAT64_C( 73.43) } },
{ { SIMDE_FLOAT64_C( 57.09), SIMDE_FLOAT64_C( 14.11), SIMDE_FLOAT64_C( 40.58), SIMDE_FLOAT64_C( 81.85),
SIMDE_FLOAT64_C( 51.08), SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 27.97), SIMDE_FLOAT64_C( 36.52) },
UINT8_C(164),
{ SIMDE_FLOAT64_C( 52.69), SIMDE_FLOAT64_C( 35.19), SIMDE_FLOAT64_C( 62.99), SIMDE_FLOAT64_C( 54.69),
SIMDE_FLOAT64_C( 68.20), SIMDE_FLOAT64_C( 72.85), SIMDE_FLOAT64_C( 34.81), SIMDE_FLOAT64_C( 52.82) },
{ SIMDE_FLOAT64_C( 57.09), SIMDE_FLOAT64_C( 14.11), SIMDE_FLOAT64_C( 4.16), SIMDE_FLOAT64_C( 81.85),
SIMDE_FLOAT64_C( 51.08), SIMDE_FLOAT64_C( 4.30), SIMDE_FLOAT64_C( 27.97), SIMDE_FLOAT64_C( 3.99) } },
{ { SIMDE_FLOAT64_C( 89.07), SIMDE_FLOAT64_C( 60.76), SIMDE_FLOAT64_C( 93.82), SIMDE_FLOAT64_C( 48.38),
SIMDE_FLOAT64_C( 34.19), SIMDE_FLOAT64_C( 56.49), SIMDE_FLOAT64_C( 89.74), SIMDE_FLOAT64_C( 48.07) },
UINT8_C( 13),
{ SIMDE_FLOAT64_C( 92.46), SIMDE_FLOAT64_C( 73.68), SIMDE_FLOAT64_C( 72.46), SIMDE_FLOAT64_C( 13.92),
SIMDE_FLOAT64_C( 2.38), SIMDE_FLOAT64_C( 29.55), SIMDE_FLOAT64_C( 28.03), SIMDE_FLOAT64_C( 42.96) },
{ SIMDE_FLOAT64_C( 4.54), SIMDE_FLOAT64_C( 60.76), SIMDE_FLOAT64_C( 4.30), SIMDE_FLOAT64_C( 2.70),
SIMDE_FLOAT64_C( 34.19), SIMDE_FLOAT64_C( 56.49), SIMDE_FLOAT64_C( 89.74), SIMDE_FLOAT64_C( 48.07) } },
{ { SIMDE_FLOAT64_C( 11.40), SIMDE_FLOAT64_C( 79.11), SIMDE_FLOAT64_C( 43.54), SIMDE_FLOAT64_C( 39.37),
SIMDE_FLOAT64_C( 15.63), SIMDE_FLOAT64_C( 48.95), SIMDE_FLOAT64_C( 92.06), SIMDE_FLOAT64_C( 50.82) },
UINT8_C( 26),
{ SIMDE_FLOAT64_C( 46.75), SIMDE_FLOAT64_C( 19.02), SIMDE_FLOAT64_C( 84.79), SIMDE_FLOAT64_C( 81.56),
SIMDE_FLOAT64_C( 71.83), SIMDE_FLOAT64_C( 73.86), SIMDE_FLOAT64_C( 42.33), SIMDE_FLOAT64_C( 65.65) },
{ SIMDE_FLOAT64_C( 11.40), SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 43.54), SIMDE_FLOAT64_C( 4.41),
SIMDE_FLOAT64_C( 4.29), SIMDE_FLOAT64_C( 48.95), SIMDE_FLOAT64_C( 92.06), SIMDE_FLOAT64_C( 50.82) } },
{ { SIMDE_FLOAT64_C( 22.25), SIMDE_FLOAT64_C( 76.52), SIMDE_FLOAT64_C( 22.14), SIMDE_FLOAT64_C( 11.98),
SIMDE_FLOAT64_C( 24.58), SIMDE_FLOAT64_C( 36.07), SIMDE_FLOAT64_C( 4.44), SIMDE_FLOAT64_C( 98.27) },
UINT8_C(254),
{ SIMDE_FLOAT64_C( 18.36), SIMDE_FLOAT64_C( 0.64), SIMDE_FLOAT64_C( 38.07), SIMDE_FLOAT64_C( 46.40),
SIMDE_FLOAT64_C( 43.60), SIMDE_FLOAT64_C( 49.47), SIMDE_FLOAT64_C( 25.51), SIMDE_FLOAT64_C( 87.14) },
{ SIMDE_FLOAT64_C( 22.25), SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( 3.67), SIMDE_FLOAT64_C( 3.86),
SIMDE_FLOAT64_C( 3.80), SIMDE_FLOAT64_C( 3.92), SIMDE_FLOAT64_C( 3.28), SIMDE_FLOAT64_C( 4.48) } },
{ { SIMDE_FLOAT64_C( 88.84), SIMDE_FLOAT64_C( 41.14), SIMDE_FLOAT64_C( 36.09), SIMDE_FLOAT64_C( 80.90),
SIMDE_FLOAT64_C( 91.96), SIMDE_FLOAT64_C( 48.03), SIMDE_FLOAT64_C( 27.65), SIMDE_FLOAT64_C( 10.98) },
UINT8_C(171),
{ SIMDE_FLOAT64_C( 9.21), SIMDE_FLOAT64_C( 82.81), SIMDE_FLOAT64_C( 6.69), SIMDE_FLOAT64_C( 51.54),
SIMDE_FLOAT64_C( 48.46), SIMDE_FLOAT64_C( 28.94), SIMDE_FLOAT64_C( 28.06), SIMDE_FLOAT64_C( 70.60) },
{ SIMDE_FLOAT64_C( 2.32), SIMDE_FLOAT64_C( 4.43), SIMDE_FLOAT64_C( 36.09), SIMDE_FLOAT64_C( 3.96),
SIMDE_FLOAT64_C( 91.96), SIMDE_FLOAT64_C( 3.40), SIMDE_FLOAT64_C( 27.65), SIMDE_FLOAT64_C( 4.27) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_log1p_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_log2_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 631.47), SIMDE_FLOAT32_C( 844.23), SIMDE_FLOAT32_C( 439.63), SIMDE_FLOAT32_C( 13.01) },
{ SIMDE_FLOAT32_C( 9.30), SIMDE_FLOAT32_C( 9.72), SIMDE_FLOAT32_C( 8.78), SIMDE_FLOAT32_C( 3.70) } },
{ { SIMDE_FLOAT32_C( 66.81), SIMDE_FLOAT32_C( 88.82), SIMDE_FLOAT32_C( 350.44), SIMDE_FLOAT32_C( 636.52) },
{ SIMDE_FLOAT32_C( 6.06), SIMDE_FLOAT32_C( 6.47), SIMDE_FLOAT32_C( 8.45), SIMDE_FLOAT32_C( 9.31) } },
{ { SIMDE_FLOAT32_C( 636.53), SIMDE_FLOAT32_C( 411.53), SIMDE_FLOAT32_C( 396.60), SIMDE_FLOAT32_C( 131.18) },
{ SIMDE_FLOAT32_C( 9.31), SIMDE_FLOAT32_C( 8.68), SIMDE_FLOAT32_C( 8.63), SIMDE_FLOAT32_C( 7.04) } },
{ { SIMDE_FLOAT32_C( 749.84), SIMDE_FLOAT32_C( 385.14), SIMDE_FLOAT32_C( 384.93), SIMDE_FLOAT32_C( 165.27) },
{ SIMDE_FLOAT32_C( 9.55), SIMDE_FLOAT32_C( 8.59), SIMDE_FLOAT32_C( 8.59), SIMDE_FLOAT32_C( 7.37) } },
{ { SIMDE_FLOAT32_C( 246.49), SIMDE_FLOAT32_C( 520.56), SIMDE_FLOAT32_C( 778.62), SIMDE_FLOAT32_C( 71.34) },
{ SIMDE_FLOAT32_C( 7.95), SIMDE_FLOAT32_C( 9.02), SIMDE_FLOAT32_C( 9.60), SIMDE_FLOAT32_C( 6.16) } },
{ { SIMDE_FLOAT32_C( 946.80), SIMDE_FLOAT32_C( 380.92), SIMDE_FLOAT32_C( 894.84), SIMDE_FLOAT32_C( 902.24) },
{ SIMDE_FLOAT32_C( 9.89), SIMDE_FLOAT32_C( 8.57), SIMDE_FLOAT32_C( 9.81), SIMDE_FLOAT32_C( 9.82) } },
{ { SIMDE_FLOAT32_C( 574.27), SIMDE_FLOAT32_C( 214.93), SIMDE_FLOAT32_C( 953.03), SIMDE_FLOAT32_C( 638.26) },
{ SIMDE_FLOAT32_C( 9.17), SIMDE_FLOAT32_C( 7.75), SIMDE_FLOAT32_C( 9.90), SIMDE_FLOAT32_C( 9.32) } },
{ { SIMDE_FLOAT32_C( 991.13), SIMDE_FLOAT32_C( 188.32), SIMDE_FLOAT32_C( 949.37), SIMDE_FLOAT32_C( 622.60) },
{ SIMDE_FLOAT32_C( 9.95), SIMDE_FLOAT32_C( 7.56), SIMDE_FLOAT32_C( 9.89), SIMDE_FLOAT32_C( 9.28) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_log2_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_log2_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 982.90), SIMDE_FLOAT64_C( 619.50) },
{ SIMDE_FLOAT64_C( 9.94), SIMDE_FLOAT64_C( 9.27) } },
{ { SIMDE_FLOAT64_C( 102.39), SIMDE_FLOAT64_C( 923.09) },
{ SIMDE_FLOAT64_C( 6.68), SIMDE_FLOAT64_C( 9.85) } },
{ { SIMDE_FLOAT64_C( 243.48), SIMDE_FLOAT64_C( 494.45) },
{ SIMDE_FLOAT64_C( 7.93), SIMDE_FLOAT64_C( 8.95) } },
{ { SIMDE_FLOAT64_C( 45.35), SIMDE_FLOAT64_C( 416.91) },
{ SIMDE_FLOAT64_C( 5.50), SIMDE_FLOAT64_C( 8.70) } },
{ { SIMDE_FLOAT64_C( 259.45), SIMDE_FLOAT64_C( 290.22) },
{ SIMDE_FLOAT64_C( 8.02), SIMDE_FLOAT64_C( 8.18) } },
{ { SIMDE_FLOAT64_C( 923.80), SIMDE_FLOAT64_C( 970.52) },
{ SIMDE_FLOAT64_C( 9.85), SIMDE_FLOAT64_C( 9.92) } },
{ { SIMDE_FLOAT64_C( 646.50), SIMDE_FLOAT64_C( 264.22) },
{ SIMDE_FLOAT64_C( 9.34), SIMDE_FLOAT64_C( 8.05) } },
{ { SIMDE_FLOAT64_C( 634.41), SIMDE_FLOAT64_C( 510.63) },
{ SIMDE_FLOAT64_C( 9.31), SIMDE_FLOAT64_C( 9.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_log2_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_log2_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 889.40), SIMDE_FLOAT32_C( 779.21), SIMDE_FLOAT32_C( 198.92), SIMDE_FLOAT32_C( 945.28),
SIMDE_FLOAT32_C( 42.71), SIMDE_FLOAT32_C( 341.50), SIMDE_FLOAT32_C( 958.60), SIMDE_FLOAT32_C( 736.56) },
{ SIMDE_FLOAT32_C( 9.80), SIMDE_FLOAT32_C( 9.61), SIMDE_FLOAT32_C( 7.64), SIMDE_FLOAT32_C( 9.88),
SIMDE_FLOAT32_C( 5.42), SIMDE_FLOAT32_C( 8.42), SIMDE_FLOAT32_C( 9.90), SIMDE_FLOAT32_C( 9.52) } },
{ { SIMDE_FLOAT32_C( 74.89), SIMDE_FLOAT32_C( 979.36), SIMDE_FLOAT32_C( 587.94), SIMDE_FLOAT32_C( 960.37),
SIMDE_FLOAT32_C( 497.73), SIMDE_FLOAT32_C( 286.82), SIMDE_FLOAT32_C( 507.33), SIMDE_FLOAT32_C( 616.64) },
{ SIMDE_FLOAT32_C( 6.23), SIMDE_FLOAT32_C( 9.94), SIMDE_FLOAT32_C( 9.20), SIMDE_FLOAT32_C( 9.91),
SIMDE_FLOAT32_C( 8.96), SIMDE_FLOAT32_C( 8.16), SIMDE_FLOAT32_C( 8.99), SIMDE_FLOAT32_C( 9.27) } },
{ { SIMDE_FLOAT32_C( 307.44), SIMDE_FLOAT32_C( 437.70), SIMDE_FLOAT32_C( 685.73), SIMDE_FLOAT32_C( 291.17),
SIMDE_FLOAT32_C( 840.55), SIMDE_FLOAT32_C( 438.07), SIMDE_FLOAT32_C( 676.25), SIMDE_FLOAT32_C( 160.97) },
{ SIMDE_FLOAT32_C( 8.26), SIMDE_FLOAT32_C( 8.77), SIMDE_FLOAT32_C( 9.42), SIMDE_FLOAT32_C( 8.19),
SIMDE_FLOAT32_C( 9.72), SIMDE_FLOAT32_C( 8.78), SIMDE_FLOAT32_C( 9.40), SIMDE_FLOAT32_C( 7.33) } },
{ { SIMDE_FLOAT32_C( 788.67), SIMDE_FLOAT32_C( 843.13), SIMDE_FLOAT32_C( 381.11), SIMDE_FLOAT32_C( 499.16),
SIMDE_FLOAT32_C( 309.83), SIMDE_FLOAT32_C( 369.53), SIMDE_FLOAT32_C( 957.38), SIMDE_FLOAT32_C( 199.23) },
{ SIMDE_FLOAT32_C( 9.62), SIMDE_FLOAT32_C( 9.72), SIMDE_FLOAT32_C( 8.57), SIMDE_FLOAT32_C( 8.96),
SIMDE_FLOAT32_C( 8.28), SIMDE_FLOAT32_C( 8.53), SIMDE_FLOAT32_C( 9.90), SIMDE_FLOAT32_C( 7.64) } },
{ { SIMDE_FLOAT32_C( 148.75), SIMDE_FLOAT32_C( 156.30), SIMDE_FLOAT32_C( 144.51), SIMDE_FLOAT32_C( 191.45),
SIMDE_FLOAT32_C( 497.81), SIMDE_FLOAT32_C( 103.11), SIMDE_FLOAT32_C( 928.02), SIMDE_FLOAT32_C( 572.70) },
{ SIMDE_FLOAT32_C( 7.22), SIMDE_FLOAT32_C( 7.29), SIMDE_FLOAT32_C( 7.18), SIMDE_FLOAT32_C( 7.58),
SIMDE_FLOAT32_C( 8.96), SIMDE_FLOAT32_C( 6.69), SIMDE_FLOAT32_C( 9.86), SIMDE_FLOAT32_C( 9.16) } },
{ { SIMDE_FLOAT32_C( 82.46), SIMDE_FLOAT32_C( 515.95), SIMDE_FLOAT32_C( 533.07), SIMDE_FLOAT32_C( 580.19),
SIMDE_FLOAT32_C( 802.77), SIMDE_FLOAT32_C( 40.40), SIMDE_FLOAT32_C( 196.83), SIMDE_FLOAT32_C( 110.21) },
{ SIMDE_FLOAT32_C( 6.37), SIMDE_FLOAT32_C( 9.01), SIMDE_FLOAT32_C( 9.06), SIMDE_FLOAT32_C( 9.18),
SIMDE_FLOAT32_C( 9.65), SIMDE_FLOAT32_C( 5.34), SIMDE_FLOAT32_C( 7.62), SIMDE_FLOAT32_C( 6.78) } },
{ { SIMDE_FLOAT32_C( 478.10), SIMDE_FLOAT32_C( 882.57), SIMDE_FLOAT32_C( 401.38), SIMDE_FLOAT32_C( 318.65),
SIMDE_FLOAT32_C( 320.63), SIMDE_FLOAT32_C( 77.63), SIMDE_FLOAT32_C( 479.61), SIMDE_FLOAT32_C( 109.31) },
{ SIMDE_FLOAT32_C( 8.90), SIMDE_FLOAT32_C( 9.79), SIMDE_FLOAT32_C( 8.65), SIMDE_FLOAT32_C( 8.32),
SIMDE_FLOAT32_C( 8.32), SIMDE_FLOAT32_C( 6.28), SIMDE_FLOAT32_C( 8.91), SIMDE_FLOAT32_C( 6.77) } },
{ { SIMDE_FLOAT32_C( 920.76), SIMDE_FLOAT32_C( 860.72), SIMDE_FLOAT32_C( 608.46), SIMDE_FLOAT32_C( 230.59),
SIMDE_FLOAT32_C( 230.26), SIMDE_FLOAT32_C( 565.84), SIMDE_FLOAT32_C( 429.82), SIMDE_FLOAT32_C( 379.00) },
{ SIMDE_FLOAT32_C( 9.85), SIMDE_FLOAT32_C( 9.75), SIMDE_FLOAT32_C( 9.25), SIMDE_FLOAT32_C( 7.85),
SIMDE_FLOAT32_C( 7.85), SIMDE_FLOAT32_C( 9.14), SIMDE_FLOAT32_C( 8.75), SIMDE_FLOAT32_C( 8.57) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_log2_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_log2_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 760.38), SIMDE_FLOAT64_C( 341.71), SIMDE_FLOAT64_C( 933.93), SIMDE_FLOAT64_C( 964.91) },
{ SIMDE_FLOAT64_C( 9.57), SIMDE_FLOAT64_C( 8.42), SIMDE_FLOAT64_C( 9.87), SIMDE_FLOAT64_C( 9.91) } },
{ { SIMDE_FLOAT64_C( 115.25), SIMDE_FLOAT64_C( 77.12), SIMDE_FLOAT64_C( 667.61), SIMDE_FLOAT64_C( 365.22) },
{ SIMDE_FLOAT64_C( 6.85), SIMDE_FLOAT64_C( 6.27), SIMDE_FLOAT64_C( 9.38), SIMDE_FLOAT64_C( 8.51) } },
{ { SIMDE_FLOAT64_C( 679.91), SIMDE_FLOAT64_C( 892.57), SIMDE_FLOAT64_C( 787.62), SIMDE_FLOAT64_C( 588.83) },
{ SIMDE_FLOAT64_C( 9.41), SIMDE_FLOAT64_C( 9.80), SIMDE_FLOAT64_C( 9.62), SIMDE_FLOAT64_C( 9.20) } },
{ { SIMDE_FLOAT64_C( 30.55), SIMDE_FLOAT64_C( 713.90), SIMDE_FLOAT64_C( 332.19), SIMDE_FLOAT64_C( 616.75) },
{ SIMDE_FLOAT64_C( 4.93), SIMDE_FLOAT64_C( 9.48), SIMDE_FLOAT64_C( 8.38), SIMDE_FLOAT64_C( 9.27) } },
{ { SIMDE_FLOAT64_C( 183.75), SIMDE_FLOAT64_C( 550.51), SIMDE_FLOAT64_C( 693.58), SIMDE_FLOAT64_C( 893.18) },
{ SIMDE_FLOAT64_C( 7.52), SIMDE_FLOAT64_C( 9.10), SIMDE_FLOAT64_C( 9.44), SIMDE_FLOAT64_C( 9.80) } },
{ { SIMDE_FLOAT64_C( 430.95), SIMDE_FLOAT64_C( 320.69), SIMDE_FLOAT64_C( 576.89), SIMDE_FLOAT64_C( 863.61) },
{ SIMDE_FLOAT64_C( 8.75), SIMDE_FLOAT64_C( 8.33), SIMDE_FLOAT64_C( 9.17), SIMDE_FLOAT64_C( 9.75) } },
{ { SIMDE_FLOAT64_C( 830.18), SIMDE_FLOAT64_C( 881.23), SIMDE_FLOAT64_C( 596.73), SIMDE_FLOAT64_C( 514.46) },
{ SIMDE_FLOAT64_C( 9.70), SIMDE_FLOAT64_C( 9.78), SIMDE_FLOAT64_C( 9.22), SIMDE_FLOAT64_C( 9.01) } },
{ { SIMDE_FLOAT64_C( 253.95), SIMDE_FLOAT64_C( 753.04), SIMDE_FLOAT64_C( 535.98), SIMDE_FLOAT64_C( 14.32) },
{ SIMDE_FLOAT64_C( 7.99), SIMDE_FLOAT64_C( 9.56), SIMDE_FLOAT64_C( 9.07), SIMDE_FLOAT64_C( 3.84) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_log2_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_log2_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 483.98), SIMDE_FLOAT32_C( 550.49), SIMDE_FLOAT32_C( 612.79), SIMDE_FLOAT32_C( 652.36),
SIMDE_FLOAT32_C( 702.86), SIMDE_FLOAT32_C( 993.84), SIMDE_FLOAT32_C( 608.42), SIMDE_FLOAT32_C( 923.16),
SIMDE_FLOAT32_C( 531.91), SIMDE_FLOAT32_C( 675.78), SIMDE_FLOAT32_C( 571.39), SIMDE_FLOAT32_C( 422.11),
SIMDE_FLOAT32_C( 520.20), SIMDE_FLOAT32_C( 536.40), SIMDE_FLOAT32_C( 462.32), SIMDE_FLOAT32_C( 841.06) },
{ SIMDE_FLOAT32_C( 8.92), SIMDE_FLOAT32_C( 9.10), SIMDE_FLOAT32_C( 9.26), SIMDE_FLOAT32_C( 9.35),
SIMDE_FLOAT32_C( 9.46), SIMDE_FLOAT32_C( 9.96), SIMDE_FLOAT32_C( 9.25), SIMDE_FLOAT32_C( 9.85),
SIMDE_FLOAT32_C( 9.06), SIMDE_FLOAT32_C( 9.40), SIMDE_FLOAT32_C( 9.16), SIMDE_FLOAT32_C( 8.72),
SIMDE_FLOAT32_C( 9.02), SIMDE_FLOAT32_C( 9.07), SIMDE_FLOAT32_C( 8.85), SIMDE_FLOAT32_C( 9.72) } },
{ { SIMDE_FLOAT32_C( 513.13), SIMDE_FLOAT32_C( 741.74), SIMDE_FLOAT32_C( 931.43), SIMDE_FLOAT32_C( 670.23),
SIMDE_FLOAT32_C( 393.50), SIMDE_FLOAT32_C( 862.99), SIMDE_FLOAT32_C( 343.67), SIMDE_FLOAT32_C( 818.00),
SIMDE_FLOAT32_C( 637.20), SIMDE_FLOAT32_C( 123.18), SIMDE_FLOAT32_C( 888.07), SIMDE_FLOAT32_C( 327.64),
SIMDE_FLOAT32_C( 438.36), SIMDE_FLOAT32_C( 579.84), SIMDE_FLOAT32_C( 783.89), SIMDE_FLOAT32_C( 922.33) },
{ SIMDE_FLOAT32_C( 9.00), SIMDE_FLOAT32_C( 9.53), SIMDE_FLOAT32_C( 9.86), SIMDE_FLOAT32_C( 9.39),
SIMDE_FLOAT32_C( 8.62), SIMDE_FLOAT32_C( 9.75), SIMDE_FLOAT32_C( 8.42), SIMDE_FLOAT32_C( 9.68),
SIMDE_FLOAT32_C( 9.32), SIMDE_FLOAT32_C( 6.94), SIMDE_FLOAT32_C( 9.79), SIMDE_FLOAT32_C( 8.36),
SIMDE_FLOAT32_C( 8.78), SIMDE_FLOAT32_C( 9.18), SIMDE_FLOAT32_C( 9.61), SIMDE_FLOAT32_C( 9.85) } },
{ { SIMDE_FLOAT32_C( 130.33), SIMDE_FLOAT32_C( 396.68), SIMDE_FLOAT32_C( 574.70), SIMDE_FLOAT32_C( 833.19),
SIMDE_FLOAT32_C( 390.52), SIMDE_FLOAT32_C( 183.11), SIMDE_FLOAT32_C( 756.35), SIMDE_FLOAT32_C( 922.43),
SIMDE_FLOAT32_C( 858.89), SIMDE_FLOAT32_C( 327.75), SIMDE_FLOAT32_C( 344.53), SIMDE_FLOAT32_C( 379.09),
SIMDE_FLOAT32_C( 864.14), SIMDE_FLOAT32_C( 806.85), SIMDE_FLOAT32_C( 220.15), SIMDE_FLOAT32_C( 377.27) },
{ SIMDE_FLOAT32_C( 7.03), SIMDE_FLOAT32_C( 8.63), SIMDE_FLOAT32_C( 9.17), SIMDE_FLOAT32_C( 9.70),
SIMDE_FLOAT32_C( 8.61), SIMDE_FLOAT32_C( 7.52), SIMDE_FLOAT32_C( 9.56), SIMDE_FLOAT32_C( 9.85),
SIMDE_FLOAT32_C( 9.75), SIMDE_FLOAT32_C( 8.36), SIMDE_FLOAT32_C( 8.43), SIMDE_FLOAT32_C( 8.57),
SIMDE_FLOAT32_C( 9.76), SIMDE_FLOAT32_C( 9.66), SIMDE_FLOAT32_C( 7.78), SIMDE_FLOAT32_C( 8.56) } },
{ { SIMDE_FLOAT32_C( 548.60), SIMDE_FLOAT32_C( 151.58), SIMDE_FLOAT32_C( 47.50), SIMDE_FLOAT32_C( 942.10),
SIMDE_FLOAT32_C( 14.58), SIMDE_FLOAT32_C( 391.17), SIMDE_FLOAT32_C( 760.10), SIMDE_FLOAT32_C( 651.77),
SIMDE_FLOAT32_C( 514.35), SIMDE_FLOAT32_C( 648.17), SIMDE_FLOAT32_C( 979.41), SIMDE_FLOAT32_C( 952.70),
SIMDE_FLOAT32_C( 228.00), SIMDE_FLOAT32_C( 763.30), SIMDE_FLOAT32_C( 875.04), SIMDE_FLOAT32_C( 358.34) },
{ SIMDE_FLOAT32_C( 9.10), SIMDE_FLOAT32_C( 7.24), SIMDE_FLOAT32_C( 5.57), SIMDE_FLOAT32_C( 9.88),
SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 8.61), SIMDE_FLOAT32_C( 9.57), SIMDE_FLOAT32_C( 9.35),
SIMDE_FLOAT32_C( 9.01), SIMDE_FLOAT32_C( 9.34), SIMDE_FLOAT32_C( 9.94), SIMDE_FLOAT32_C( 9.90),
SIMDE_FLOAT32_C( 7.83), SIMDE_FLOAT32_C( 9.58), SIMDE_FLOAT32_C( 9.77), SIMDE_FLOAT32_C( 8.49) } },
{ { SIMDE_FLOAT32_C( 159.99), SIMDE_FLOAT32_C( 449.73), SIMDE_FLOAT32_C( 191.53), SIMDE_FLOAT32_C( 550.50),
SIMDE_FLOAT32_C( 632.84), SIMDE_FLOAT32_C( 947.88), SIMDE_FLOAT32_C( 472.93), SIMDE_FLOAT32_C( 491.73),
SIMDE_FLOAT32_C( 275.62), SIMDE_FLOAT32_C( 817.47), SIMDE_FLOAT32_C( 870.83), SIMDE_FLOAT32_C( 139.76),
SIMDE_FLOAT32_C( 624.32), SIMDE_FLOAT32_C( 90.98), SIMDE_FLOAT32_C( 517.04), SIMDE_FLOAT32_C( 172.92) },
{ SIMDE_FLOAT32_C( 7.32), SIMDE_FLOAT32_C( 8.81), SIMDE_FLOAT32_C( 7.58), SIMDE_FLOAT32_C( 9.10),
SIMDE_FLOAT32_C( 9.31), SIMDE_FLOAT32_C( 9.89), SIMDE_FLOAT32_C( 8.89), SIMDE_FLOAT32_C( 8.94),
SIMDE_FLOAT32_C( 8.11), SIMDE_FLOAT32_C( 9.68), SIMDE_FLOAT32_C( 9.77), SIMDE_FLOAT32_C( 7.13),
SIMDE_FLOAT32_C( 9.29), SIMDE_FLOAT32_C( 6.51), SIMDE_FLOAT32_C( 9.01), SIMDE_FLOAT32_C( 7.43) } },
{ { SIMDE_FLOAT32_C( 242.56), SIMDE_FLOAT32_C( 564.54), SIMDE_FLOAT32_C( 115.01), SIMDE_FLOAT32_C( 257.14),
SIMDE_FLOAT32_C( 955.71), SIMDE_FLOAT32_C( 875.12), SIMDE_FLOAT32_C( 908.91), SIMDE_FLOAT32_C( 470.05),
SIMDE_FLOAT32_C( 523.28), SIMDE_FLOAT32_C( 888.32), SIMDE_FLOAT32_C( 422.76), SIMDE_FLOAT32_C( 751.29),
SIMDE_FLOAT32_C( 651.63), SIMDE_FLOAT32_C( 297.79), SIMDE_FLOAT32_C( 109.62), SIMDE_FLOAT32_C( 811.61) },
{ SIMDE_FLOAT32_C( 7.92), SIMDE_FLOAT32_C( 9.14), SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 8.01),
SIMDE_FLOAT32_C( 9.90), SIMDE_FLOAT32_C( 9.77), SIMDE_FLOAT32_C( 9.83), SIMDE_FLOAT32_C( 8.88),
SIMDE_FLOAT32_C( 9.03), SIMDE_FLOAT32_C( 9.79), SIMDE_FLOAT32_C( 8.72), SIMDE_FLOAT32_C( 9.55),
SIMDE_FLOAT32_C( 9.35), SIMDE_FLOAT32_C( 8.22), SIMDE_FLOAT32_C( 6.78), SIMDE_FLOAT32_C( 9.66) } },
{ { SIMDE_FLOAT32_C( 747.52), SIMDE_FLOAT32_C( 301.15), SIMDE_FLOAT32_C( 362.12), SIMDE_FLOAT32_C( 380.36),
SIMDE_FLOAT32_C( 249.03), SIMDE_FLOAT32_C( 835.05), SIMDE_FLOAT32_C( 872.10), SIMDE_FLOAT32_C( 524.65),
SIMDE_FLOAT32_C( 652.52), SIMDE_FLOAT32_C( 742.92), SIMDE_FLOAT32_C( 664.41), SIMDE_FLOAT32_C( 276.84),
SIMDE_FLOAT32_C( 833.90), SIMDE_FLOAT32_C( 181.45), SIMDE_FLOAT32_C( 449.75), SIMDE_FLOAT32_C( 76.46) },
{ SIMDE_FLOAT32_C( 9.55), SIMDE_FLOAT32_C( 8.23), SIMDE_FLOAT32_C( 8.50), SIMDE_FLOAT32_C( 8.57),
SIMDE_FLOAT32_C( 7.96), SIMDE_FLOAT32_C( 9.71), SIMDE_FLOAT32_C( 9.77), SIMDE_FLOAT32_C( 9.04),
SIMDE_FLOAT32_C( 9.35), SIMDE_FLOAT32_C( 9.54), SIMDE_FLOAT32_C( 9.38), SIMDE_FLOAT32_C( 8.11),
SIMDE_FLOAT32_C( 9.70), SIMDE_FLOAT32_C( 7.50), SIMDE_FLOAT32_C( 8.81), SIMDE_FLOAT32_C( 6.26) } },
{ { SIMDE_FLOAT32_C( 745.98), SIMDE_FLOAT32_C( 564.77), SIMDE_FLOAT32_C( 333.60), SIMDE_FLOAT32_C( 701.69),
SIMDE_FLOAT32_C( 439.88), SIMDE_FLOAT32_C( 242.51), SIMDE_FLOAT32_C( 171.74), SIMDE_FLOAT32_C( 963.17),
SIMDE_FLOAT32_C( 130.83), SIMDE_FLOAT32_C( 594.50), SIMDE_FLOAT32_C( 714.46), SIMDE_FLOAT32_C( 782.46),
SIMDE_FLOAT32_C( 892.29), SIMDE_FLOAT32_C( 824.08), SIMDE_FLOAT32_C( 594.07), SIMDE_FLOAT32_C( 639.81) },
{ SIMDE_FLOAT32_C( 9.54), SIMDE_FLOAT32_C( 9.14), SIMDE_FLOAT32_C( 8.38), SIMDE_FLOAT32_C( 9.45),
SIMDE_FLOAT32_C( 8.78), SIMDE_FLOAT32_C( 7.92), SIMDE_FLOAT32_C( 7.42), SIMDE_FLOAT32_C( 9.91),
SIMDE_FLOAT32_C( 7.03), SIMDE_FLOAT32_C( 9.22), SIMDE_FLOAT32_C( 9.48), SIMDE_FLOAT32_C( 9.61),
SIMDE_FLOAT32_C( 9.80), SIMDE_FLOAT32_C( 9.69), SIMDE_FLOAT32_C( 9.21), SIMDE_FLOAT32_C( 9.32) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_log2_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_log2_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 951.54), SIMDE_FLOAT32_C( 999.54), SIMDE_FLOAT32_C( 334.61), SIMDE_FLOAT32_C( 345.31),
SIMDE_FLOAT32_C( 632.13), SIMDE_FLOAT32_C( 486.36), SIMDE_FLOAT32_C( 855.38), SIMDE_FLOAT32_C( 575.68),
SIMDE_FLOAT32_C( 586.36), SIMDE_FLOAT32_C( 821.37), SIMDE_FLOAT32_C( 638.17), SIMDE_FLOAT32_C( 965.64),
SIMDE_FLOAT32_C( 565.55), SIMDE_FLOAT32_C( 416.08), SIMDE_FLOAT32_C( 543.83), SIMDE_FLOAT32_C( 785.84) },
UINT8_C( 38),
{ SIMDE_FLOAT32_C( 694.42), SIMDE_FLOAT32_C( 92.26), SIMDE_FLOAT32_C( 723.42), SIMDE_FLOAT32_C( 203.15),
SIMDE_FLOAT32_C( 315.73), SIMDE_FLOAT32_C( 806.95), SIMDE_FLOAT32_C( 395.41), SIMDE_FLOAT32_C( 157.52),
SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 504.24), SIMDE_FLOAT32_C( 237.89), SIMDE_FLOAT32_C( 806.42),
SIMDE_FLOAT32_C( 668.52), SIMDE_FLOAT32_C( 921.63), SIMDE_FLOAT32_C( 757.96), SIMDE_FLOAT32_C( 668.06) },
{ SIMDE_FLOAT32_C( 951.54), SIMDE_FLOAT32_C( 6.53), SIMDE_FLOAT32_C( 9.50), SIMDE_FLOAT32_C( 345.31),
SIMDE_FLOAT32_C( 632.13), SIMDE_FLOAT32_C( 9.66), SIMDE_FLOAT32_C( 855.38), SIMDE_FLOAT32_C( 575.68),
SIMDE_FLOAT32_C( 586.36), SIMDE_FLOAT32_C( 821.37), SIMDE_FLOAT32_C( 638.17), SIMDE_FLOAT32_C( 965.64),
SIMDE_FLOAT32_C( 565.55), SIMDE_FLOAT32_C( 416.08), SIMDE_FLOAT32_C( 543.83), SIMDE_FLOAT32_C( 785.84) } },
{ { SIMDE_FLOAT32_C( 256.24), SIMDE_FLOAT32_C( 103.27), SIMDE_FLOAT32_C( 300.20), SIMDE_FLOAT32_C( 742.60),
SIMDE_FLOAT32_C( 958.65), SIMDE_FLOAT32_C( 875.88), SIMDE_FLOAT32_C( 328.96), SIMDE_FLOAT32_C( 780.02),
SIMDE_FLOAT32_C( 514.05), SIMDE_FLOAT32_C( 294.61), SIMDE_FLOAT32_C( 345.57), SIMDE_FLOAT32_C( 930.14),
SIMDE_FLOAT32_C( 838.44), SIMDE_FLOAT32_C( 131.42), SIMDE_FLOAT32_C( 65.69), SIMDE_FLOAT32_C( 532.86) },
UINT8_C(234),
{ SIMDE_FLOAT32_C( 789.11), SIMDE_FLOAT32_C( 736.01), SIMDE_FLOAT32_C( 539.40), SIMDE_FLOAT32_C( 596.06),
SIMDE_FLOAT32_C( 131.42), SIMDE_FLOAT32_C( 696.92), SIMDE_FLOAT32_C( 597.63), SIMDE_FLOAT32_C( 635.66),
SIMDE_FLOAT32_C( 934.80), SIMDE_FLOAT32_C( 404.05), SIMDE_FLOAT32_C( 304.18), SIMDE_FLOAT32_C( 856.43),
SIMDE_FLOAT32_C( 162.01), SIMDE_FLOAT32_C( 972.25), SIMDE_FLOAT32_C( 112.67), SIMDE_FLOAT32_C( 265.28) },
{ SIMDE_FLOAT32_C( 256.24), SIMDE_FLOAT32_C( 9.52), SIMDE_FLOAT32_C( 300.20), SIMDE_FLOAT32_C( 9.22),
SIMDE_FLOAT32_C( 958.65), SIMDE_FLOAT32_C( 9.44), SIMDE_FLOAT32_C( 9.22), SIMDE_FLOAT32_C( 9.31),
SIMDE_FLOAT32_C( 514.05), SIMDE_FLOAT32_C( 294.61), SIMDE_FLOAT32_C( 345.57), SIMDE_FLOAT32_C( 930.14),
SIMDE_FLOAT32_C( 838.44), SIMDE_FLOAT32_C( 131.42), SIMDE_FLOAT32_C( 65.69), SIMDE_FLOAT32_C( 532.86) } },
{ { SIMDE_FLOAT32_C( 272.44), SIMDE_FLOAT32_C( 855.27), SIMDE_FLOAT32_C( 223.93), SIMDE_FLOAT32_C( 148.32),
SIMDE_FLOAT32_C( 184.23), SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 662.37), SIMDE_FLOAT32_C( 478.84),
SIMDE_FLOAT32_C( 349.52), SIMDE_FLOAT32_C( 592.51), SIMDE_FLOAT32_C( 317.28), SIMDE_FLOAT32_C( 480.94),
SIMDE_FLOAT32_C( 658.20), SIMDE_FLOAT32_C( 850.14), SIMDE_FLOAT32_C( 704.61), SIMDE_FLOAT32_C( 447.31) },
UINT8_C(189),
{ SIMDE_FLOAT32_C( 244.01), SIMDE_FLOAT32_C( 43.37), SIMDE_FLOAT32_C( 717.57), SIMDE_FLOAT32_C( 940.93),
SIMDE_FLOAT32_C( 641.00), SIMDE_FLOAT32_C( 353.24), SIMDE_FLOAT32_C( 875.73), SIMDE_FLOAT32_C( 45.05),
SIMDE_FLOAT32_C( 657.42), SIMDE_FLOAT32_C( 732.16), SIMDE_FLOAT32_C( 207.05), SIMDE_FLOAT32_C( 629.67),
SIMDE_FLOAT32_C( 844.83), SIMDE_FLOAT32_C( 472.33), SIMDE_FLOAT32_C( 902.11), SIMDE_FLOAT32_C( 700.10) },
{ SIMDE_FLOAT32_C( 7.93), SIMDE_FLOAT32_C( 855.27), SIMDE_FLOAT32_C( 9.49), SIMDE_FLOAT32_C( 9.88),
SIMDE_FLOAT32_C( 9.32), SIMDE_FLOAT32_C( 8.46), SIMDE_FLOAT32_C( 662.37), SIMDE_FLOAT32_C( 5.49),
SIMDE_FLOAT32_C( 349.52), SIMDE_FLOAT32_C( 592.51), SIMDE_FLOAT32_C( 317.28), SIMDE_FLOAT32_C( 480.94),
SIMDE_FLOAT32_C( 658.20), SIMDE_FLOAT32_C( 850.14), SIMDE_FLOAT32_C( 704.61), SIMDE_FLOAT32_C( 447.31) } },
{ { SIMDE_FLOAT32_C( 696.26), SIMDE_FLOAT32_C( 50.44), SIMDE_FLOAT32_C( 884.33), SIMDE_FLOAT32_C( 700.20),
SIMDE_FLOAT32_C( 712.81), SIMDE_FLOAT32_C( 363.17), SIMDE_FLOAT32_C( 49.73), SIMDE_FLOAT32_C( 305.32),
SIMDE_FLOAT32_C( 680.45), SIMDE_FLOAT32_C( 530.67), SIMDE_FLOAT32_C( 963.52), SIMDE_FLOAT32_C( 530.59),
SIMDE_FLOAT32_C( 235.28), SIMDE_FLOAT32_C( 410.84), SIMDE_FLOAT32_C( 116.75), SIMDE_FLOAT32_C( 479.29) },
UINT8_C(235),
{ SIMDE_FLOAT32_C( 834.32), SIMDE_FLOAT32_C( 420.22), SIMDE_FLOAT32_C( 95.21), SIMDE_FLOAT32_C( 187.56),
SIMDE_FLOAT32_C( 295.95), SIMDE_FLOAT32_C( 140.25), SIMDE_FLOAT32_C( 844.98), SIMDE_FLOAT32_C( 28.11),
SIMDE_FLOAT32_C( 347.31), SIMDE_FLOAT32_C( 474.66), SIMDE_FLOAT32_C( 872.94), SIMDE_FLOAT32_C( 819.64),
SIMDE_FLOAT32_C( 376.77), SIMDE_FLOAT32_C( 573.04), SIMDE_FLOAT32_C( 515.89), SIMDE_FLOAT32_C( 427.21) },
{ SIMDE_FLOAT32_C( 9.70), SIMDE_FLOAT32_C( 8.72), SIMDE_FLOAT32_C( 884.33), SIMDE_FLOAT32_C( 7.55),
SIMDE_FLOAT32_C( 712.81), SIMDE_FLOAT32_C( 7.13), SIMDE_FLOAT32_C( 9.72), SIMDE_FLOAT32_C( 4.81),
SIMDE_FLOAT32_C( 680.45), SIMDE_FLOAT32_C( 530.67), SIMDE_FLOAT32_C( 963.52), SIMDE_FLOAT32_C( 530.59),
SIMDE_FLOAT32_C( 235.28), SIMDE_FLOAT32_C( 410.84), SIMDE_FLOAT32_C( 116.75), SIMDE_FLOAT32_C( 479.29) } },
{ { SIMDE_FLOAT32_C( 457.38), SIMDE_FLOAT32_C( 216.10), SIMDE_FLOAT32_C( 140.02), SIMDE_FLOAT32_C( 820.55),
SIMDE_FLOAT32_C( 265.82), SIMDE_FLOAT32_C( 445.34), SIMDE_FLOAT32_C( 501.00), SIMDE_FLOAT32_C( 796.49),
SIMDE_FLOAT32_C( 408.86), SIMDE_FLOAT32_C( 31.60), SIMDE_FLOAT32_C( 31.77), SIMDE_FLOAT32_C( 819.70),
SIMDE_FLOAT32_C( 148.34), SIMDE_FLOAT32_C( 511.06), SIMDE_FLOAT32_C( 273.91), SIMDE_FLOAT32_C( 982.67) },
UINT8_C(170),
{ SIMDE_FLOAT32_C( 369.11), SIMDE_FLOAT32_C( 170.23), SIMDE_FLOAT32_C( 227.24), SIMDE_FLOAT32_C( 509.37),
SIMDE_FLOAT32_C( 15.21), SIMDE_FLOAT32_C( 255.36), SIMDE_FLOAT32_C( 856.67), SIMDE_FLOAT32_C( 489.87),
SIMDE_FLOAT32_C( 128.30), SIMDE_FLOAT32_C( 676.31), SIMDE_FLOAT32_C( 866.64), SIMDE_FLOAT32_C( 701.34),
SIMDE_FLOAT32_C( 192.20), SIMDE_FLOAT32_C( 293.84), SIMDE_FLOAT32_C( 158.72), SIMDE_FLOAT32_C( 408.30) },
{ SIMDE_FLOAT32_C( 457.38), SIMDE_FLOAT32_C( 7.41), SIMDE_FLOAT32_C( 140.02), SIMDE_FLOAT32_C( 8.99),
SIMDE_FLOAT32_C( 265.82), SIMDE_FLOAT32_C( 8.00), SIMDE_FLOAT32_C( 501.00), SIMDE_FLOAT32_C( 8.94),
SIMDE_FLOAT32_C( 408.86), SIMDE_FLOAT32_C( 31.60), SIMDE_FLOAT32_C( 31.77), SIMDE_FLOAT32_C( 819.70),
SIMDE_FLOAT32_C( 148.34), SIMDE_FLOAT32_C( 511.06), SIMDE_FLOAT32_C( 273.91), SIMDE_FLOAT32_C( 982.67) } },
{ { SIMDE_FLOAT32_C( 433.86), SIMDE_FLOAT32_C( 979.27), SIMDE_FLOAT32_C( 674.13), SIMDE_FLOAT32_C( 879.20),
SIMDE_FLOAT32_C( 480.27), SIMDE_FLOAT32_C( 470.62), SIMDE_FLOAT32_C( 288.06), SIMDE_FLOAT32_C( 511.87),
SIMDE_FLOAT32_C( 502.39), SIMDE_FLOAT32_C( 107.76), SIMDE_FLOAT32_C( 660.21), SIMDE_FLOAT32_C( 13.45),
SIMDE_FLOAT32_C( 381.67), SIMDE_FLOAT32_C( 642.88), SIMDE_FLOAT32_C( 944.74), SIMDE_FLOAT32_C( 750.78) },
UINT8_C( 15),
{ SIMDE_FLOAT32_C( 171.98), SIMDE_FLOAT32_C( 260.15), SIMDE_FLOAT32_C( 828.32), SIMDE_FLOAT32_C( 427.33),
SIMDE_FLOAT32_C( 116.82), SIMDE_FLOAT32_C( 318.18), SIMDE_FLOAT32_C( 555.63), SIMDE_FLOAT32_C( 793.13),
SIMDE_FLOAT32_C( 184.82), SIMDE_FLOAT32_C( 256.97), SIMDE_FLOAT32_C( 985.33), SIMDE_FLOAT32_C( 478.66),
SIMDE_FLOAT32_C( 415.69), SIMDE_FLOAT32_C( 393.63), SIMDE_FLOAT32_C( 912.52), SIMDE_FLOAT32_C( 394.96) },
{ SIMDE_FLOAT32_C( 7.43), SIMDE_FLOAT32_C( 8.02), SIMDE_FLOAT32_C( 9.69), SIMDE_FLOAT32_C( 8.74),
SIMDE_FLOAT32_C( 480.27), SIMDE_FLOAT32_C( 470.62), SIMDE_FLOAT32_C( 288.06), SIMDE_FLOAT32_C( 511.87),
SIMDE_FLOAT32_C( 502.39), SIMDE_FLOAT32_C( 107.76), SIMDE_FLOAT32_C( 660.21), SIMDE_FLOAT32_C( 13.45),
SIMDE_FLOAT32_C( 381.67), SIMDE_FLOAT32_C( 642.88), SIMDE_FLOAT32_C( 944.74), SIMDE_FLOAT32_C( 750.78) } },
{ { SIMDE_FLOAT32_C( 67.76), SIMDE_FLOAT32_C( 791.72), SIMDE_FLOAT32_C( 875.23), SIMDE_FLOAT32_C( 538.38),
SIMDE_FLOAT32_C( 79.78), SIMDE_FLOAT32_C( 387.09), SIMDE_FLOAT32_C( 40.77), SIMDE_FLOAT32_C( 187.54),
SIMDE_FLOAT32_C( 47.31), SIMDE_FLOAT32_C( 54.22), SIMDE_FLOAT32_C( 569.20), SIMDE_FLOAT32_C( 690.18),
SIMDE_FLOAT32_C( 998.96), SIMDE_FLOAT32_C( 319.98), SIMDE_FLOAT32_C( 503.29), SIMDE_FLOAT32_C( 170.94) },
UINT8_C( 81),
{ SIMDE_FLOAT32_C( 331.60), SIMDE_FLOAT32_C( 598.27), SIMDE_FLOAT32_C( 696.95), SIMDE_FLOAT32_C( 649.79),
SIMDE_FLOAT32_C( 153.90), SIMDE_FLOAT32_C( 490.08), SIMDE_FLOAT32_C( 834.61), SIMDE_FLOAT32_C( 410.88),
SIMDE_FLOAT32_C( 475.41), SIMDE_FLOAT32_C( 313.27), SIMDE_FLOAT32_C( 826.57), SIMDE_FLOAT32_C( 869.04),
SIMDE_FLOAT32_C( 225.79), SIMDE_FLOAT32_C( 221.52), SIMDE_FLOAT32_C( 936.81), SIMDE_FLOAT32_C( 17.51) },
{ SIMDE_FLOAT32_C( 8.37), SIMDE_FLOAT32_C( 791.72), SIMDE_FLOAT32_C( 875.23), SIMDE_FLOAT32_C( 538.38),
SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( 387.09), SIMDE_FLOAT32_C( 9.70), SIMDE_FLOAT32_C( 187.54),
SIMDE_FLOAT32_C( 47.31), SIMDE_FLOAT32_C( 54.22), SIMDE_FLOAT32_C( 569.20), SIMDE_FLOAT32_C( 690.18),
SIMDE_FLOAT32_C( 998.96), SIMDE_FLOAT32_C( 319.98), SIMDE_FLOAT32_C( 503.29), SIMDE_FLOAT32_C( 170.94) } },
{ { SIMDE_FLOAT32_C( 96.75), SIMDE_FLOAT32_C( 475.18), SIMDE_FLOAT32_C( 97.29), SIMDE_FLOAT32_C( 483.84),
SIMDE_FLOAT32_C( 515.95), SIMDE_FLOAT32_C( 284.83), SIMDE_FLOAT32_C( 531.15), SIMDE_FLOAT32_C( 570.17),
SIMDE_FLOAT32_C( 854.03), SIMDE_FLOAT32_C( 221.33), SIMDE_FLOAT32_C( 569.13), SIMDE_FLOAT32_C( 174.01),
SIMDE_FLOAT32_C( 724.62), SIMDE_FLOAT32_C( 740.06), SIMDE_FLOAT32_C( 754.14), SIMDE_FLOAT32_C( 56.23) },
UINT8_C(124),
{ SIMDE_FLOAT32_C( 451.09), SIMDE_FLOAT32_C( 706.02), SIMDE_FLOAT32_C( 492.24), SIMDE_FLOAT32_C( 941.16),
SIMDE_FLOAT32_C( 540.62), SIMDE_FLOAT32_C( 903.11), SIMDE_FLOAT32_C( 416.57), SIMDE_FLOAT32_C( 853.89),
SIMDE_FLOAT32_C( 729.68), SIMDE_FLOAT32_C( 285.62), SIMDE_FLOAT32_C( 79.69), SIMDE_FLOAT32_C( 951.20),
SIMDE_FLOAT32_C( 222.42), SIMDE_FLOAT32_C( 97.20), SIMDE_FLOAT32_C( 47.95), SIMDE_FLOAT32_C( 697.61) },
{ SIMDE_FLOAT32_C( 96.75), SIMDE_FLOAT32_C( 475.18), SIMDE_FLOAT32_C( 8.94), SIMDE_FLOAT32_C( 9.88),
SIMDE_FLOAT32_C( 9.08), SIMDE_FLOAT32_C( 9.82), SIMDE_FLOAT32_C( 8.70), SIMDE_FLOAT32_C( 570.17),
SIMDE_FLOAT32_C( 854.03), SIMDE_FLOAT32_C( 221.33), SIMDE_FLOAT32_C( 569.13), SIMDE_FLOAT32_C( 174.01),
SIMDE_FLOAT32_C( 724.62), SIMDE_FLOAT32_C( 740.06), SIMDE_FLOAT32_C( 754.14), SIMDE_FLOAT32_C( 56.23) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_log2_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_log2_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 119.65), SIMDE_FLOAT64_C( 209.41), SIMDE_FLOAT64_C( 983.20), SIMDE_FLOAT64_C( 968.40),
SIMDE_FLOAT64_C( 158.45), SIMDE_FLOAT64_C( 611.79), SIMDE_FLOAT64_C( 202.67), SIMDE_FLOAT64_C( 73.75) },
{ SIMDE_FLOAT64_C( 6.90), SIMDE_FLOAT64_C( 7.71), SIMDE_FLOAT64_C( 9.94), SIMDE_FLOAT64_C( 9.92),
SIMDE_FLOAT64_C( 7.31), SIMDE_FLOAT64_C( 9.26), SIMDE_FLOAT64_C( 7.66), SIMDE_FLOAT64_C( 6.20) } },
{ { SIMDE_FLOAT64_C( 875.12), SIMDE_FLOAT64_C( 357.46), SIMDE_FLOAT64_C( 960.14), SIMDE_FLOAT64_C( 477.36),
SIMDE_FLOAT64_C( 185.60), SIMDE_FLOAT64_C( 437.48), SIMDE_FLOAT64_C( 656.75), SIMDE_FLOAT64_C( 468.11) },
{ SIMDE_FLOAT64_C( 9.77), SIMDE_FLOAT64_C( 8.48), SIMDE_FLOAT64_C( 9.91), SIMDE_FLOAT64_C( 8.90),
SIMDE_FLOAT64_C( 7.54), SIMDE_FLOAT64_C( 8.77), SIMDE_FLOAT64_C( 9.36), SIMDE_FLOAT64_C( 8.87) } },
{ { SIMDE_FLOAT64_C( 538.86), SIMDE_FLOAT64_C( 465.92), SIMDE_FLOAT64_C( 597.15), SIMDE_FLOAT64_C( 858.12),
SIMDE_FLOAT64_C( 110.06), SIMDE_FLOAT64_C( 149.17), SIMDE_FLOAT64_C( 41.30), SIMDE_FLOAT64_C( 954.56) },
{ SIMDE_FLOAT64_C( 9.07), SIMDE_FLOAT64_C( 8.86), SIMDE_FLOAT64_C( 9.22), SIMDE_FLOAT64_C( 9.75),
SIMDE_FLOAT64_C( 6.78), SIMDE_FLOAT64_C( 7.22), SIMDE_FLOAT64_C( 5.37), SIMDE_FLOAT64_C( 9.90) } },
{ { SIMDE_FLOAT64_C( 919.40), SIMDE_FLOAT64_C( 93.55), SIMDE_FLOAT64_C( 761.38), SIMDE_FLOAT64_C( 128.98),
SIMDE_FLOAT64_C( 873.27), SIMDE_FLOAT64_C( 719.89), SIMDE_FLOAT64_C( 554.57), SIMDE_FLOAT64_C( 992.93) },
{ SIMDE_FLOAT64_C( 9.84), SIMDE_FLOAT64_C( 6.55), SIMDE_FLOAT64_C( 9.57), SIMDE_FLOAT64_C( 7.01),
SIMDE_FLOAT64_C( 9.77), SIMDE_FLOAT64_C( 9.49), SIMDE_FLOAT64_C( 9.12), SIMDE_FLOAT64_C( 9.96) } },
{ { SIMDE_FLOAT64_C( 929.29), SIMDE_FLOAT64_C( 537.77), SIMDE_FLOAT64_C( 961.32), SIMDE_FLOAT64_C( 87.74),
SIMDE_FLOAT64_C( 149.55), SIMDE_FLOAT64_C( 164.00), SIMDE_FLOAT64_C( 161.49), SIMDE_FLOAT64_C( 24.67) },
{ SIMDE_FLOAT64_C( 9.86), SIMDE_FLOAT64_C( 9.07), SIMDE_FLOAT64_C( 9.91), SIMDE_FLOAT64_C( 6.46),
SIMDE_FLOAT64_C( 7.22), SIMDE_FLOAT64_C( 7.36), SIMDE_FLOAT64_C( 7.34), SIMDE_FLOAT64_C( 4.62) } },
{ { SIMDE_FLOAT64_C( 521.46), SIMDE_FLOAT64_C( 121.63), SIMDE_FLOAT64_C( 502.03), SIMDE_FLOAT64_C( 707.07),
SIMDE_FLOAT64_C( 559.11), SIMDE_FLOAT64_C( 158.78), SIMDE_FLOAT64_C( 175.18), SIMDE_FLOAT64_C( 97.96) },
{ SIMDE_FLOAT64_C( 9.03), SIMDE_FLOAT64_C( 6.93), SIMDE_FLOAT64_C( 8.97), SIMDE_FLOAT64_C( 9.47),
SIMDE_FLOAT64_C( 9.13), SIMDE_FLOAT64_C( 7.31), SIMDE_FLOAT64_C( 7.45), SIMDE_FLOAT64_C( 6.61) } },
{ { SIMDE_FLOAT64_C( 624.70), SIMDE_FLOAT64_C( 772.32), SIMDE_FLOAT64_C( 956.08), SIMDE_FLOAT64_C( 734.75),
SIMDE_FLOAT64_C( 921.49), SIMDE_FLOAT64_C( 997.38), SIMDE_FLOAT64_C( 689.31), SIMDE_FLOAT64_C( 840.89) },
{ SIMDE_FLOAT64_C( 9.29), SIMDE_FLOAT64_C( 9.59), SIMDE_FLOAT64_C( 9.90), SIMDE_FLOAT64_C( 9.52),
SIMDE_FLOAT64_C( 9.85), SIMDE_FLOAT64_C( 9.96), SIMDE_FLOAT64_C( 9.43), SIMDE_FLOAT64_C( 9.72) } },
{ { SIMDE_FLOAT64_C( 90.93), SIMDE_FLOAT64_C( 450.70), SIMDE_FLOAT64_C( 969.87), SIMDE_FLOAT64_C( 964.20),
SIMDE_FLOAT64_C( 170.58), SIMDE_FLOAT64_C( 524.44), SIMDE_FLOAT64_C( 957.13), SIMDE_FLOAT64_C( 99.88) },
{ SIMDE_FLOAT64_C( 6.51), SIMDE_FLOAT64_C( 8.82), SIMDE_FLOAT64_C( 9.92), SIMDE_FLOAT64_C( 9.91),
SIMDE_FLOAT64_C( 7.41), SIMDE_FLOAT64_C( 9.03), SIMDE_FLOAT64_C( 9.90), SIMDE_FLOAT64_C( 6.64) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_log2_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_log2_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 996.08), SIMDE_FLOAT64_C( 61.08), SIMDE_FLOAT64_C( 921.49), SIMDE_FLOAT64_C( 192.89),
SIMDE_FLOAT64_C( 553.14), SIMDE_FLOAT64_C( 14.27), SIMDE_FLOAT64_C( 408.18), SIMDE_FLOAT64_C( 275.52) },
UINT8_C( 23),
{ SIMDE_FLOAT64_C( 470.67), SIMDE_FLOAT64_C( 327.03), SIMDE_FLOAT64_C( 335.52), SIMDE_FLOAT64_C( 992.77),
SIMDE_FLOAT64_C( 465.65), SIMDE_FLOAT64_C( 524.14), SIMDE_FLOAT64_C( 178.22), SIMDE_FLOAT64_C( 860.48) },
{ SIMDE_FLOAT64_C( 8.88), SIMDE_FLOAT64_C( 8.35), SIMDE_FLOAT64_C( 8.39), SIMDE_FLOAT64_C( 192.89),
SIMDE_FLOAT64_C( 8.86), SIMDE_FLOAT64_C( 14.27), SIMDE_FLOAT64_C( 408.18), SIMDE_FLOAT64_C( 275.52) } },
{ { SIMDE_FLOAT64_C( 594.48), SIMDE_FLOAT64_C( 196.19), SIMDE_FLOAT64_C( 493.93), SIMDE_FLOAT64_C( 252.94),
SIMDE_FLOAT64_C( 940.21), SIMDE_FLOAT64_C( 104.98), SIMDE_FLOAT64_C( 946.96), SIMDE_FLOAT64_C( 783.58) },
UINT8_C(251),
{ SIMDE_FLOAT64_C( 815.52), SIMDE_FLOAT64_C( 353.82), SIMDE_FLOAT64_C( 583.31), SIMDE_FLOAT64_C( 335.41),
SIMDE_FLOAT64_C( 693.48), SIMDE_FLOAT64_C( 579.39), SIMDE_FLOAT64_C( 396.49), SIMDE_FLOAT64_C( 614.97) },
{ SIMDE_FLOAT64_C( 9.67), SIMDE_FLOAT64_C( 8.47), SIMDE_FLOAT64_C( 493.93), SIMDE_FLOAT64_C( 8.39),
SIMDE_FLOAT64_C( 9.44), SIMDE_FLOAT64_C( 9.18), SIMDE_FLOAT64_C( 8.63), SIMDE_FLOAT64_C( 9.26) } },
{ { SIMDE_FLOAT64_C( 772.28), SIMDE_FLOAT64_C( 949.63), SIMDE_FLOAT64_C( 629.24), SIMDE_FLOAT64_C( 180.46),
SIMDE_FLOAT64_C( 225.15), SIMDE_FLOAT64_C( 527.05), SIMDE_FLOAT64_C( 651.14), SIMDE_FLOAT64_C( 552.19) },
UINT8_C(241),
{ SIMDE_FLOAT64_C( 643.90), SIMDE_FLOAT64_C( 17.84), SIMDE_FLOAT64_C( 386.72), SIMDE_FLOAT64_C( 822.12),
SIMDE_FLOAT64_C( 878.32), SIMDE_FLOAT64_C( 981.20), SIMDE_FLOAT64_C( 18.32), SIMDE_FLOAT64_C( 372.25) },
{ SIMDE_FLOAT64_C( 9.33), SIMDE_FLOAT64_C( 949.63), SIMDE_FLOAT64_C( 629.24), SIMDE_FLOAT64_C( 180.46),
SIMDE_FLOAT64_C( 9.78), SIMDE_FLOAT64_C( 9.94), SIMDE_FLOAT64_C( 4.19), SIMDE_FLOAT64_C( 8.54) } },
{ { SIMDE_FLOAT64_C( 234.14), SIMDE_FLOAT64_C( 958.52), SIMDE_FLOAT64_C( 477.23), SIMDE_FLOAT64_C( 181.10),
SIMDE_FLOAT64_C( 742.10), SIMDE_FLOAT64_C( 235.40), SIMDE_FLOAT64_C( 996.62), SIMDE_FLOAT64_C( 95.92) },
UINT8_C( 71),
{ SIMDE_FLOAT64_C( 332.03), SIMDE_FLOAT64_C( 789.40), SIMDE_FLOAT64_C( 398.10), SIMDE_FLOAT64_C( 728.52),
SIMDE_FLOAT64_C( 404.38), SIMDE_FLOAT64_C( 170.38), SIMDE_FLOAT64_C( 678.16), SIMDE_FLOAT64_C( 33.62) },
{ SIMDE_FLOAT64_C( 8.38), SIMDE_FLOAT64_C( 9.62), SIMDE_FLOAT64_C( 8.64), SIMDE_FLOAT64_C( 181.10),
SIMDE_FLOAT64_C( 742.10), SIMDE_FLOAT64_C( 235.40), SIMDE_FLOAT64_C( 9.41), SIMDE_FLOAT64_C( 95.92) } },
{ { SIMDE_FLOAT64_C( 350.85), SIMDE_FLOAT64_C( 903.31), SIMDE_FLOAT64_C( 560.67), SIMDE_FLOAT64_C( 1.98),
SIMDE_FLOAT64_C( 455.50), SIMDE_FLOAT64_C( 423.25), SIMDE_FLOAT64_C( 645.89), SIMDE_FLOAT64_C( 473.34) },
UINT8_C(167),
{ SIMDE_FLOAT64_C( 468.01), SIMDE_FLOAT64_C( 351.66), SIMDE_FLOAT64_C( 791.16), SIMDE_FLOAT64_C( 486.32),
SIMDE_FLOAT64_C( 723.90), SIMDE_FLOAT64_C( 25.30), SIMDE_FLOAT64_C( 444.84), SIMDE_FLOAT64_C( 201.13) },
{ SIMDE_FLOAT64_C( 8.87), SIMDE_FLOAT64_C( 8.46), SIMDE_FLOAT64_C( 9.63), SIMDE_FLOAT64_C( 1.98),
SIMDE_FLOAT64_C( 455.50), SIMDE_FLOAT64_C( 4.66), SIMDE_FLOAT64_C( 645.89), SIMDE_FLOAT64_C( 7.65) } },
{ { SIMDE_FLOAT64_C( 206.40), SIMDE_FLOAT64_C( 186.94), SIMDE_FLOAT64_C( 436.54), SIMDE_FLOAT64_C( 203.02),
SIMDE_FLOAT64_C( 282.87), SIMDE_FLOAT64_C( 255.25), SIMDE_FLOAT64_C( 535.05), SIMDE_FLOAT64_C( 72.27) },
UINT8_C(195),
{ SIMDE_FLOAT64_C( 263.57), SIMDE_FLOAT64_C( 476.64), SIMDE_FLOAT64_C( 823.73), SIMDE_FLOAT64_C( 941.73),
SIMDE_FLOAT64_C( 510.26), SIMDE_FLOAT64_C( 174.57), SIMDE_FLOAT64_C( 845.04), SIMDE_FLOAT64_C( 70.93) },
{ SIMDE_FLOAT64_C( 8.04), SIMDE_FLOAT64_C( 8.90), SIMDE_FLOAT64_C( 436.54), SIMDE_FLOAT64_C( 203.02),
SIMDE_FLOAT64_C( 282.87), SIMDE_FLOAT64_C( 255.25), SIMDE_FLOAT64_C( 9.72), SIMDE_FLOAT64_C( 6.15) } },
{ { SIMDE_FLOAT64_C( 176.55), SIMDE_FLOAT64_C( 300.54), SIMDE_FLOAT64_C( 494.17), SIMDE_FLOAT64_C( 822.44),
SIMDE_FLOAT64_C( 773.88), SIMDE_FLOAT64_C( 304.14), SIMDE_FLOAT64_C( 290.45), SIMDE_FLOAT64_C( 125.54) },
UINT8_C( 79),
{ SIMDE_FLOAT64_C( 776.77), SIMDE_FLOAT64_C( 849.44), SIMDE_FLOAT64_C( 120.60), SIMDE_FLOAT64_C( 221.61),
SIMDE_FLOAT64_C( 50.57), SIMDE_FLOAT64_C( 326.99), SIMDE_FLOAT64_C( 408.55), SIMDE_FLOAT64_C( 487.11) },
{ SIMDE_FLOAT64_C( 9.60), SIMDE_FLOAT64_C( 9.73), SIMDE_FLOAT64_C( 6.91), SIMDE_FLOAT64_C( 7.79),
SIMDE_FLOAT64_C( 773.88), SIMDE_FLOAT64_C( 304.14), SIMDE_FLOAT64_C( 8.67), SIMDE_FLOAT64_C( 125.54) } },
{ { SIMDE_FLOAT64_C( 530.01), SIMDE_FLOAT64_C( 691.42), SIMDE_FLOAT64_C( 742.35), SIMDE_FLOAT64_C( 65.06),
SIMDE_FLOAT64_C( 763.69), SIMDE_FLOAT64_C( 395.70), SIMDE_FLOAT64_C( 328.63), SIMDE_FLOAT64_C( 240.33) },
UINT8_C( 12),
{ SIMDE_FLOAT64_C( 270.37), SIMDE_FLOAT64_C( 750.59), SIMDE_FLOAT64_C( 394.00), SIMDE_FLOAT64_C( 115.41),
SIMDE_FLOAT64_C( 821.52), SIMDE_FLOAT64_C( 570.56), SIMDE_FLOAT64_C( 415.95), SIMDE_FLOAT64_C( 315.69) },
{ SIMDE_FLOAT64_C( 530.01), SIMDE_FLOAT64_C( 691.42), SIMDE_FLOAT64_C( 8.62), SIMDE_FLOAT64_C( 6.85),
SIMDE_FLOAT64_C( 763.69), SIMDE_FLOAT64_C( 395.70), SIMDE_FLOAT64_C( 328.63), SIMDE_FLOAT64_C( 240.33) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_log2_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_log10_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 4068.94), SIMDE_FLOAT32_C( 5195.06), SIMDE_FLOAT32_C( 1228.12), SIMDE_FLOAT32_C( 6733.16)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( 3.09), SIMDE_FLOAT32_C( 3.83)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 7486.55), SIMDE_FLOAT32_C( 8351.20), SIMDE_FLOAT32_C( 3512.77), SIMDE_FLOAT32_C( 5170.29)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 3.92), SIMDE_FLOAT32_C( 3.55), SIMDE_FLOAT32_C( 3.71)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 9127.65), SIMDE_FLOAT32_C( 7111.03), SIMDE_FLOAT32_C( 3652.77), SIMDE_FLOAT32_C( 7338.80)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 3.96), SIMDE_FLOAT32_C( 3.85), SIMDE_FLOAT32_C( 3.56), SIMDE_FLOAT32_C( 3.87)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 1609.14), SIMDE_FLOAT32_C( 1569.36), SIMDE_FLOAT32_C( 5423.87), SIMDE_FLOAT32_C( 7857.29)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 3.21), SIMDE_FLOAT32_C( 3.20), SIMDE_FLOAT32_C( 3.73), SIMDE_FLOAT32_C( 3.90)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 3474.63), SIMDE_FLOAT32_C( 695.25), SIMDE_FLOAT32_C( 2912.29), SIMDE_FLOAT32_C( 8484.34)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 3.54), SIMDE_FLOAT32_C( 2.84), SIMDE_FLOAT32_C( 3.46), SIMDE_FLOAT32_C( 3.93)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 2775.95), SIMDE_FLOAT32_C( 5142.35), SIMDE_FLOAT32_C( 3079.83), SIMDE_FLOAT32_C( 381.82)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 3.44), SIMDE_FLOAT32_C( 3.71), SIMDE_FLOAT32_C( 3.49), SIMDE_FLOAT32_C( 2.58)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 6306.54), SIMDE_FLOAT32_C( 3937.29), SIMDE_FLOAT32_C( 117.23), SIMDE_FLOAT32_C( 1696.00)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 3.80), SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 2.07), SIMDE_FLOAT32_C( 3.23)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 5890.98), SIMDE_FLOAT32_C( 2746.67), SIMDE_FLOAT32_C( 6166.85), SIMDE_FLOAT32_C( 8435.45)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 3.77), SIMDE_FLOAT32_C( 3.44), SIMDE_FLOAT32_C( 3.79), SIMDE_FLOAT32_C( 3.93)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_log10_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_log10_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 1228.12), SIMDE_FLOAT64_C( 6733.16)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 3.09), SIMDE_FLOAT64_C( 3.83)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 4068.94), SIMDE_FLOAT64_C( 5195.06)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 3.61), SIMDE_FLOAT64_C( 3.72)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 3512.77), SIMDE_FLOAT64_C( 5170.29)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 3.55), SIMDE_FLOAT64_C( 3.71)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 7486.55), SIMDE_FLOAT64_C( 8351.20)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 3.87), SIMDE_FLOAT64_C( 3.92)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 3652.77), SIMDE_FLOAT64_C( 7338.80)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 3.56), SIMDE_FLOAT64_C( 3.87)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 9127.65), SIMDE_FLOAT64_C( 7111.03)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 3.96), SIMDE_FLOAT64_C( 3.85)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 5423.87), SIMDE_FLOAT64_C( 7857.29)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 3.73), SIMDE_FLOAT64_C( 3.90)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 1609.14), SIMDE_FLOAT64_C( 1569.36)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 3.21), SIMDE_FLOAT64_C( 3.20)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_log10_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_log10_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 7486.55), SIMDE_FLOAT32_C( 8351.20),
SIMDE_FLOAT32_C( 3512.77), SIMDE_FLOAT32_C( 5170.29),
SIMDE_FLOAT32_C( 4068.94), SIMDE_FLOAT32_C( 5195.06),
SIMDE_FLOAT32_C( 1228.12), SIMDE_FLOAT32_C( 6733.16)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 3.92),
SIMDE_FLOAT32_C( 3.55), SIMDE_FLOAT32_C( 3.71),
SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 3.72),
SIMDE_FLOAT32_C( 3.09), SIMDE_FLOAT32_C( 3.83)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 1609.14), SIMDE_FLOAT32_C( 1569.36),
SIMDE_FLOAT32_C( 5423.87), SIMDE_FLOAT32_C( 7857.29),
SIMDE_FLOAT32_C( 9127.65), SIMDE_FLOAT32_C( 7111.03),
SIMDE_FLOAT32_C( 3652.77), SIMDE_FLOAT32_C( 7338.80)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.21), SIMDE_FLOAT32_C( 3.20),
SIMDE_FLOAT32_C( 3.73), SIMDE_FLOAT32_C( 3.90),
SIMDE_FLOAT32_C( 3.96), SIMDE_FLOAT32_C( 3.85),
SIMDE_FLOAT32_C( 3.56), SIMDE_FLOAT32_C( 3.87)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 2775.95), SIMDE_FLOAT32_C( 5142.35),
SIMDE_FLOAT32_C( 3079.83), SIMDE_FLOAT32_C( 381.82),
SIMDE_FLOAT32_C( 3474.63), SIMDE_FLOAT32_C( 695.25),
SIMDE_FLOAT32_C( 2912.29), SIMDE_FLOAT32_C( 8484.34)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.44), SIMDE_FLOAT32_C( 3.71),
SIMDE_FLOAT32_C( 3.49), SIMDE_FLOAT32_C( 2.58),
SIMDE_FLOAT32_C( 3.54), SIMDE_FLOAT32_C( 2.84),
SIMDE_FLOAT32_C( 3.46), SIMDE_FLOAT32_C( 3.93)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 5890.98), SIMDE_FLOAT32_C( 2746.67),
SIMDE_FLOAT32_C( 6166.85), SIMDE_FLOAT32_C( 8435.45),
SIMDE_FLOAT32_C( 6306.54), SIMDE_FLOAT32_C( 3937.29),
SIMDE_FLOAT32_C( 117.23), SIMDE_FLOAT32_C( 1696.00)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.77), SIMDE_FLOAT32_C( 3.44),
SIMDE_FLOAT32_C( 3.79), SIMDE_FLOAT32_C( 3.93),
SIMDE_FLOAT32_C( 3.80), SIMDE_FLOAT32_C( 3.60),
SIMDE_FLOAT32_C( 2.07), SIMDE_FLOAT32_C( 3.23)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 1148.23), SIMDE_FLOAT32_C( 7217.40),
SIMDE_FLOAT32_C( 2082.02), SIMDE_FLOAT32_C( 6902.28),
SIMDE_FLOAT32_C( 1146.40), SIMDE_FLOAT32_C( 9969.51),
SIMDE_FLOAT32_C( 5140.40), SIMDE_FLOAT32_C( 9206.03)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.06), SIMDE_FLOAT32_C( 3.86),
SIMDE_FLOAT32_C( 3.32), SIMDE_FLOAT32_C( 3.84),
SIMDE_FLOAT32_C( 3.06), SIMDE_FLOAT32_C( 4.00),
SIMDE_FLOAT32_C( 3.71), SIMDE_FLOAT32_C( 3.96)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 3060.52), SIMDE_FLOAT32_C( 6979.60),
SIMDE_FLOAT32_C( 8279.36), SIMDE_FLOAT32_C( 6696.04),
SIMDE_FLOAT32_C( 7661.76), SIMDE_FLOAT32_C( 3680.04),
SIMDE_FLOAT32_C( 8903.22), SIMDE_FLOAT32_C( 4846.05)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.49), SIMDE_FLOAT32_C( 3.84),
SIMDE_FLOAT32_C( 3.92), SIMDE_FLOAT32_C( 3.83),
SIMDE_FLOAT32_C( 3.88), SIMDE_FLOAT32_C( 3.57),
SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 3.69)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 3981.75), SIMDE_FLOAT32_C( 4596.36),
SIMDE_FLOAT32_C( 6683.64), SIMDE_FLOAT32_C( 276.11),
SIMDE_FLOAT32_C( 1262.07), SIMDE_FLOAT32_C( 1163.84),
SIMDE_FLOAT32_C( 2229.06), SIMDE_FLOAT32_C( 6994.08)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 3.66),
SIMDE_FLOAT32_C( 3.83), SIMDE_FLOAT32_C( 2.44),
SIMDE_FLOAT32_C( 3.10), SIMDE_FLOAT32_C( 3.07),
SIMDE_FLOAT32_C( 3.35), SIMDE_FLOAT32_C( 3.84)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 7348.31), SIMDE_FLOAT32_C( 8400.08),
SIMDE_FLOAT32_C( 4256.55), SIMDE_FLOAT32_C( 9093.31),
SIMDE_FLOAT32_C( 9550.14), SIMDE_FLOAT32_C( 8002.34),
SIMDE_FLOAT32_C( 8956.15), SIMDE_FLOAT32_C( 6271.53)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 3.92),
SIMDE_FLOAT32_C( 3.63), SIMDE_FLOAT32_C( 3.96),
SIMDE_FLOAT32_C( 3.98), SIMDE_FLOAT32_C( 3.90),
SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 3.80)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_log10_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_log10_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 4068.94), SIMDE_FLOAT64_C( 5195.06),
SIMDE_FLOAT64_C( 1228.12), SIMDE_FLOAT64_C( 6733.16)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.61), SIMDE_FLOAT64_C( 3.72),
SIMDE_FLOAT64_C( 3.09), SIMDE_FLOAT64_C( 3.83)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 7486.55), SIMDE_FLOAT64_C( 8351.20),
SIMDE_FLOAT64_C( 3512.77), SIMDE_FLOAT64_C( 5170.29)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.87), SIMDE_FLOAT64_C( 3.92),
SIMDE_FLOAT64_C( 3.55), SIMDE_FLOAT64_C( 3.71)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 9127.65), SIMDE_FLOAT64_C( 7111.03),
SIMDE_FLOAT64_C( 3652.77), SIMDE_FLOAT64_C( 7338.80)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.96), SIMDE_FLOAT64_C( 3.85),
SIMDE_FLOAT64_C( 3.56), SIMDE_FLOAT64_C( 3.87)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 1609.14), SIMDE_FLOAT64_C( 1569.36),
SIMDE_FLOAT64_C( 5423.87), SIMDE_FLOAT64_C( 7857.29)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.21), SIMDE_FLOAT64_C( 3.20),
SIMDE_FLOAT64_C( 3.73), SIMDE_FLOAT64_C( 3.90)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 3474.63), SIMDE_FLOAT64_C( 695.25),
SIMDE_FLOAT64_C( 2912.29), SIMDE_FLOAT64_C( 8484.34)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.54), SIMDE_FLOAT64_C( 2.84),
SIMDE_FLOAT64_C( 3.46), SIMDE_FLOAT64_C( 3.93)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 2775.95), SIMDE_FLOAT64_C( 5142.35),
SIMDE_FLOAT64_C( 3079.83), SIMDE_FLOAT64_C( 381.82)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.44), SIMDE_FLOAT64_C( 3.71),
SIMDE_FLOAT64_C( 3.49), SIMDE_FLOAT64_C( 2.58)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 6306.54), SIMDE_FLOAT64_C( 3937.29),
SIMDE_FLOAT64_C( 117.23), SIMDE_FLOAT64_C( 1696.00)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.80), SIMDE_FLOAT64_C( 3.60),
SIMDE_FLOAT64_C( 2.07), SIMDE_FLOAT64_C( 3.23)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 5890.98), SIMDE_FLOAT64_C( 2746.67),
SIMDE_FLOAT64_C( 6166.85), SIMDE_FLOAT64_C( 8435.45)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.77), SIMDE_FLOAT64_C( 3.44),
SIMDE_FLOAT64_C( 3.79), SIMDE_FLOAT64_C( 3.93)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_log10_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_log10_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 1609.14), SIMDE_FLOAT32_C( 1569.36), SIMDE_FLOAT32_C( 5423.87), SIMDE_FLOAT32_C( 7857.29),
SIMDE_FLOAT32_C( 9127.65), SIMDE_FLOAT32_C( 7111.03), SIMDE_FLOAT32_C( 3652.77), SIMDE_FLOAT32_C( 7338.80),
SIMDE_FLOAT32_C( 7486.55), SIMDE_FLOAT32_C( 8351.20), SIMDE_FLOAT32_C( 3512.77), SIMDE_FLOAT32_C( 5170.29),
SIMDE_FLOAT32_C( 4068.94), SIMDE_FLOAT32_C( 5195.06), SIMDE_FLOAT32_C( 1228.12), SIMDE_FLOAT32_C( 6733.16)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.21), SIMDE_FLOAT32_C( 3.20), SIMDE_FLOAT32_C( 3.73), SIMDE_FLOAT32_C( 3.90),
SIMDE_FLOAT32_C( 3.96), SIMDE_FLOAT32_C( 3.85), SIMDE_FLOAT32_C( 3.56), SIMDE_FLOAT32_C( 3.87),
SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 3.92), SIMDE_FLOAT32_C( 3.55), SIMDE_FLOAT32_C( 3.71),
SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( 3.09), SIMDE_FLOAT32_C( 3.83)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 5890.98), SIMDE_FLOAT32_C( 2746.67), SIMDE_FLOAT32_C( 6166.85), SIMDE_FLOAT32_C( 8435.45),
SIMDE_FLOAT32_C( 6306.54), SIMDE_FLOAT32_C( 3937.29), SIMDE_FLOAT32_C( 117.23), SIMDE_FLOAT32_C( 1696.00),
SIMDE_FLOAT32_C( 2775.95), SIMDE_FLOAT32_C( 5142.35), SIMDE_FLOAT32_C( 3079.83), SIMDE_FLOAT32_C( 381.82),
SIMDE_FLOAT32_C( 3474.63), SIMDE_FLOAT32_C( 695.25), SIMDE_FLOAT32_C( 2912.29), SIMDE_FLOAT32_C( 8484.34)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.77), SIMDE_FLOAT32_C( 3.44), SIMDE_FLOAT32_C( 3.79), SIMDE_FLOAT32_C( 3.93),
SIMDE_FLOAT32_C( 3.80), SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 2.07), SIMDE_FLOAT32_C( 3.23),
SIMDE_FLOAT32_C( 3.44), SIMDE_FLOAT32_C( 3.71), SIMDE_FLOAT32_C( 3.49), SIMDE_FLOAT32_C( 2.58),
SIMDE_FLOAT32_C( 3.54), SIMDE_FLOAT32_C( 2.84), SIMDE_FLOAT32_C( 3.46), SIMDE_FLOAT32_C( 3.93)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 3060.52), SIMDE_FLOAT32_C( 6979.60), SIMDE_FLOAT32_C( 8279.36), SIMDE_FLOAT32_C( 6696.04),
SIMDE_FLOAT32_C( 7661.76), SIMDE_FLOAT32_C( 3680.04), SIMDE_FLOAT32_C( 8903.22), SIMDE_FLOAT32_C( 4846.05),
SIMDE_FLOAT32_C( 1148.23), SIMDE_FLOAT32_C( 7217.40), SIMDE_FLOAT32_C( 2082.02), SIMDE_FLOAT32_C( 6902.28),
SIMDE_FLOAT32_C( 1146.40), SIMDE_FLOAT32_C( 9969.51), SIMDE_FLOAT32_C( 5140.40), SIMDE_FLOAT32_C( 9206.03)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.49), SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( 3.92), SIMDE_FLOAT32_C( 3.83),
SIMDE_FLOAT32_C( 3.88), SIMDE_FLOAT32_C( 3.57), SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 3.69),
SIMDE_FLOAT32_C( 3.06), SIMDE_FLOAT32_C( 3.86), SIMDE_FLOAT32_C( 3.32), SIMDE_FLOAT32_C( 3.84),
SIMDE_FLOAT32_C( 3.06), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 3.71), SIMDE_FLOAT32_C( 3.96)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 7348.31), SIMDE_FLOAT32_C( 8400.08), SIMDE_FLOAT32_C( 4256.55), SIMDE_FLOAT32_C( 9093.31),
SIMDE_FLOAT32_C( 9550.14), SIMDE_FLOAT32_C( 8002.34), SIMDE_FLOAT32_C( 8956.15), SIMDE_FLOAT32_C( 6271.53),
SIMDE_FLOAT32_C( 3981.75), SIMDE_FLOAT32_C( 4596.36), SIMDE_FLOAT32_C( 6683.64), SIMDE_FLOAT32_C( 276.11),
SIMDE_FLOAT32_C( 1262.07), SIMDE_FLOAT32_C( 1163.84), SIMDE_FLOAT32_C( 2229.06), SIMDE_FLOAT32_C( 6994.08)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 3.92), SIMDE_FLOAT32_C( 3.63), SIMDE_FLOAT32_C( 3.96),
SIMDE_FLOAT32_C( 3.98), SIMDE_FLOAT32_C( 3.90), SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 3.80),
SIMDE_FLOAT32_C( 3.60), SIMDE_FLOAT32_C( 3.66), SIMDE_FLOAT32_C( 3.83), SIMDE_FLOAT32_C( 2.44),
SIMDE_FLOAT32_C( 3.10), SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 3.35), SIMDE_FLOAT32_C( 3.84)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 4105.04), SIMDE_FLOAT32_C( 8793.93), SIMDE_FLOAT32_C( 6623.12), SIMDE_FLOAT32_C( 6717.40),
SIMDE_FLOAT32_C( 628.43), SIMDE_FLOAT32_C( 1010.42), SIMDE_FLOAT32_C( 3357.32), SIMDE_FLOAT32_C( 2370.85),
SIMDE_FLOAT32_C( 4038.44), SIMDE_FLOAT32_C( 886.73), SIMDE_FLOAT32_C( 7806.81), SIMDE_FLOAT32_C( 8278.35),
SIMDE_FLOAT32_C( 4645.43), SIMDE_FLOAT32_C( 7716.73), SIMDE_FLOAT32_C( 5603.27), SIMDE_FLOAT32_C( 4142.45)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 3.94), SIMDE_FLOAT32_C( 3.82), SIMDE_FLOAT32_C( 3.83),
SIMDE_FLOAT32_C( 2.80), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 3.53), SIMDE_FLOAT32_C( 3.37),
SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 2.95), SIMDE_FLOAT32_C( 3.89), SIMDE_FLOAT32_C( 3.92),
SIMDE_FLOAT32_C( 3.67), SIMDE_FLOAT32_C( 3.89), SIMDE_FLOAT32_C( 3.75), SIMDE_FLOAT32_C( 3.62)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 8450.59), SIMDE_FLOAT32_C( 9203.26), SIMDE_FLOAT32_C( 4894.53), SIMDE_FLOAT32_C( 2042.18),
SIMDE_FLOAT32_C( 2755.53), SIMDE_FLOAT32_C( 8657.47), SIMDE_FLOAT32_C( 7528.93), SIMDE_FLOAT32_C( 8118.50),
SIMDE_FLOAT32_C( 9155.11), SIMDE_FLOAT32_C( 5703.37), SIMDE_FLOAT32_C( 9886.80), SIMDE_FLOAT32_C( 469.19),
SIMDE_FLOAT32_C( 6656.71), SIMDE_FLOAT32_C( 5499.67), SIMDE_FLOAT32_C( 7314.76), SIMDE_FLOAT32_C( 1309.05)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.93), SIMDE_FLOAT32_C( 3.96), SIMDE_FLOAT32_C( 3.69), SIMDE_FLOAT32_C( 3.31),
SIMDE_FLOAT32_C( 3.44), SIMDE_FLOAT32_C( 3.94), SIMDE_FLOAT32_C( 3.88), SIMDE_FLOAT32_C( 3.91),
SIMDE_FLOAT32_C( 3.96), SIMDE_FLOAT32_C( 3.76), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 2.67),
SIMDE_FLOAT32_C( 3.82), SIMDE_FLOAT32_C( 3.74), SIMDE_FLOAT32_C( 3.86), SIMDE_FLOAT32_C( 3.12)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 1154.54), SIMDE_FLOAT32_C( 9110.29), SIMDE_FLOAT32_C( 2130.97), SIMDE_FLOAT32_C( 11.83),
SIMDE_FLOAT32_C( 3312.02), SIMDE_FLOAT32_C( 9618.20), SIMDE_FLOAT32_C( 6468.19), SIMDE_FLOAT32_C( 1159.42),
SIMDE_FLOAT32_C( 2118.90), SIMDE_FLOAT32_C( 4661.80), SIMDE_FLOAT32_C( 8551.88), SIMDE_FLOAT32_C( 9887.44),
SIMDE_FLOAT32_C( 1217.92), SIMDE_FLOAT32_C( 7124.06), SIMDE_FLOAT32_C( 5136.26), SIMDE_FLOAT32_C( 4524.23)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.06), SIMDE_FLOAT32_C( 3.96), SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 1.07),
SIMDE_FLOAT32_C( 3.52), SIMDE_FLOAT32_C( 3.98), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 3.06),
SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 3.67), SIMDE_FLOAT32_C( 3.93), SIMDE_FLOAT32_C( 4.00),
SIMDE_FLOAT32_C( 3.09), SIMDE_FLOAT32_C( 3.85), SIMDE_FLOAT32_C( 3.71), SIMDE_FLOAT32_C( 3.66)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2809.03), SIMDE_FLOAT32_C( 3201.22), SIMDE_FLOAT32_C( 1237.85), SIMDE_FLOAT32_C( 4831.67),
SIMDE_FLOAT32_C( 9663.28), SIMDE_FLOAT32_C( 5036.36), SIMDE_FLOAT32_C( 3363.90), SIMDE_FLOAT32_C( 4374.02),
SIMDE_FLOAT32_C( 4087.77), SIMDE_FLOAT32_C( 5199.67), SIMDE_FLOAT32_C( 7554.25), SIMDE_FLOAT32_C( 6973.34),
SIMDE_FLOAT32_C( 5071.68), SIMDE_FLOAT32_C( 3476.37), SIMDE_FLOAT32_C( 9581.30), SIMDE_FLOAT32_C( 1516.57)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.45), SIMDE_FLOAT32_C( 3.51), SIMDE_FLOAT32_C( 3.09), SIMDE_FLOAT32_C( 3.68),
SIMDE_FLOAT32_C( 3.99), SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( 3.53), SIMDE_FLOAT32_C( 3.64),
SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( 3.88), SIMDE_FLOAT32_C( 3.84),
SIMDE_FLOAT32_C( 3.71), SIMDE_FLOAT32_C( 3.54), SIMDE_FLOAT32_C( 3.98), SIMDE_FLOAT32_C( 3.18)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_log10_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_log10_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2746.67), SIMDE_FLOAT32_C( 8435.45), SIMDE_FLOAT32_C( 3937.29), SIMDE_FLOAT32_C( 1696.00),
SIMDE_FLOAT32_C( 5142.35), SIMDE_FLOAT32_C( 381.82), SIMDE_FLOAT32_C( 695.25), SIMDE_FLOAT32_C( 8484.34),
SIMDE_FLOAT32_C( 1569.36), SIMDE_FLOAT32_C( 7857.29), SIMDE_FLOAT32_C( 7111.03), SIMDE_FLOAT32_C( 7338.80),
SIMDE_FLOAT32_C( 8351.20), SIMDE_FLOAT32_C( 5170.29), SIMDE_FLOAT32_C( 5195.06), SIMDE_FLOAT32_C( 6733.16)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5890.98), SIMDE_FLOAT32_C( 6166.85), SIMDE_FLOAT32_C( 6306.54), SIMDE_FLOAT32_C( 117.23),
SIMDE_FLOAT32_C( 2775.95), SIMDE_FLOAT32_C( 3079.83), SIMDE_FLOAT32_C( 3474.63), SIMDE_FLOAT32_C( 2912.29),
SIMDE_FLOAT32_C( 1609.14), SIMDE_FLOAT32_C( 5423.87), SIMDE_FLOAT32_C( 9127.65), SIMDE_FLOAT32_C( 3652.77),
SIMDE_FLOAT32_C( 7486.55), SIMDE_FLOAT32_C( 3512.77), SIMDE_FLOAT32_C( 4068.94), SIMDE_FLOAT32_C( 1228.12)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.77), SIMDE_FLOAT32_C( 8435.45), SIMDE_FLOAT32_C( 3.80), SIMDE_FLOAT32_C( 1696.00),
SIMDE_FLOAT32_C( 5142.35), SIMDE_FLOAT32_C( 381.82), SIMDE_FLOAT32_C( 695.25), SIMDE_FLOAT32_C( 3.46),
SIMDE_FLOAT32_C( 3.21), SIMDE_FLOAT32_C( 3.73), SIMDE_FLOAT32_C( 3.96), SIMDE_FLOAT32_C( 3.56),
SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 5170.29), SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 6733.16)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 7348.31), SIMDE_FLOAT32_C( 4256.55), SIMDE_FLOAT32_C( 9550.14), SIMDE_FLOAT32_C( 8956.15),
SIMDE_FLOAT32_C( 3981.75), SIMDE_FLOAT32_C( 6683.64), SIMDE_FLOAT32_C( 1262.07), SIMDE_FLOAT32_C( 2229.06),
SIMDE_FLOAT32_C( 3060.52), SIMDE_FLOAT32_C( 8279.36), SIMDE_FLOAT32_C( 7661.76), SIMDE_FLOAT32_C( 8903.22),
SIMDE_FLOAT32_C( 1148.23), SIMDE_FLOAT32_C( 2082.02), SIMDE_FLOAT32_C( 1146.40), SIMDE_FLOAT32_C( 5140.40)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4142.45), SIMDE_FLOAT32_C( 8400.08), SIMDE_FLOAT32_C( 9093.31), SIMDE_FLOAT32_C( 8002.34),
SIMDE_FLOAT32_C( 6271.53), SIMDE_FLOAT32_C( 4596.36), SIMDE_FLOAT32_C( 276.11), SIMDE_FLOAT32_C( 1163.84),
SIMDE_FLOAT32_C( 6994.08), SIMDE_FLOAT32_C( 6979.60), SIMDE_FLOAT32_C( 6696.04), SIMDE_FLOAT32_C( 3680.04),
SIMDE_FLOAT32_C( 4846.05), SIMDE_FLOAT32_C( 7217.40), SIMDE_FLOAT32_C( 6902.28), SIMDE_FLOAT32_C( 9969.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.62), SIMDE_FLOAT32_C( 4256.55), SIMDE_FLOAT32_C( 9550.14), SIMDE_FLOAT32_C( 8956.15),
SIMDE_FLOAT32_C( 3.80), SIMDE_FLOAT32_C( 3.66), SIMDE_FLOAT32_C( 2.44), SIMDE_FLOAT32_C( 3.07),
SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( 8279.36), SIMDE_FLOAT32_C( 3.83), SIMDE_FLOAT32_C( 3.57),
SIMDE_FLOAT32_C( 3.69), SIMDE_FLOAT32_C( 3.86), SIMDE_FLOAT32_C( 1146.40), SIMDE_FLOAT32_C( 4.00)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 4524.23), SIMDE_FLOAT32_C( 9203.26), SIMDE_FLOAT32_C( 2042.18), SIMDE_FLOAT32_C( 8657.47),
SIMDE_FLOAT32_C( 8118.50), SIMDE_FLOAT32_C( 5703.37), SIMDE_FLOAT32_C( 469.19), SIMDE_FLOAT32_C( 5499.67),
SIMDE_FLOAT32_C( 1309.05), SIMDE_FLOAT32_C( 8793.93), SIMDE_FLOAT32_C( 6717.40), SIMDE_FLOAT32_C( 1010.42),
SIMDE_FLOAT32_C( 2370.85), SIMDE_FLOAT32_C( 886.73), SIMDE_FLOAT32_C( 8278.35), SIMDE_FLOAT32_C( 7716.73)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5136.26), SIMDE_FLOAT32_C( 8450.59), SIMDE_FLOAT32_C( 4894.53), SIMDE_FLOAT32_C( 2755.53),
SIMDE_FLOAT32_C( 7528.93), SIMDE_FLOAT32_C( 9155.11), SIMDE_FLOAT32_C( 9886.80), SIMDE_FLOAT32_C( 6656.71),
SIMDE_FLOAT32_C( 7314.76), SIMDE_FLOAT32_C( 4105.04), SIMDE_FLOAT32_C( 6623.12), SIMDE_FLOAT32_C( 628.43),
SIMDE_FLOAT32_C( 3357.32), SIMDE_FLOAT32_C( 4038.44), SIMDE_FLOAT32_C( 7806.81), SIMDE_FLOAT32_C( 4645.43)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4524.23), SIMDE_FLOAT32_C( 3.93), SIMDE_FLOAT32_C( 2042.18), SIMDE_FLOAT32_C( 8657.47),
SIMDE_FLOAT32_C( 8118.50), SIMDE_FLOAT32_C( 5703.37), SIMDE_FLOAT32_C( 469.19), SIMDE_FLOAT32_C( 3.82),
SIMDE_FLOAT32_C( 3.86), SIMDE_FLOAT32_C( 8793.93), SIMDE_FLOAT32_C( 3.82), SIMDE_FLOAT32_C( 1010.42),
SIMDE_FLOAT32_C( 2370.85), SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 8278.35), SIMDE_FLOAT32_C( 7716.73)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 3256.50), SIMDE_FLOAT32_C( 2809.03), SIMDE_FLOAT32_C( 1237.85), SIMDE_FLOAT32_C( 9663.28),
SIMDE_FLOAT32_C( 3363.90), SIMDE_FLOAT32_C( 4087.77), SIMDE_FLOAT32_C( 7554.25), SIMDE_FLOAT32_C( 5071.68),
SIMDE_FLOAT32_C( 9581.30), SIMDE_FLOAT32_C( 1154.54), SIMDE_FLOAT32_C( 2130.97), SIMDE_FLOAT32_C( 3312.02),
SIMDE_FLOAT32_C( 6468.19), SIMDE_FLOAT32_C( 2118.90), SIMDE_FLOAT32_C( 8551.88), SIMDE_FLOAT32_C( 1217.92)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 9486.33), SIMDE_FLOAT32_C( 4010.56), SIMDE_FLOAT32_C( 3201.22), SIMDE_FLOAT32_C( 4831.67),
SIMDE_FLOAT32_C( 5036.36), SIMDE_FLOAT32_C( 4374.02), SIMDE_FLOAT32_C( 5199.67), SIMDE_FLOAT32_C( 6973.34),
SIMDE_FLOAT32_C( 3476.37), SIMDE_FLOAT32_C( 1516.57), SIMDE_FLOAT32_C( 9110.29), SIMDE_FLOAT32_C( 11.83),
SIMDE_FLOAT32_C( 9618.20), SIMDE_FLOAT32_C( 1159.42), SIMDE_FLOAT32_C( 4661.80), SIMDE_FLOAT32_C( 9887.44)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3256.50), SIMDE_FLOAT32_C( 2809.03), SIMDE_FLOAT32_C( 1237.85), SIMDE_FLOAT32_C( 9663.28),
SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( 4087.77), SIMDE_FLOAT32_C( 7554.25), SIMDE_FLOAT32_C( 5071.68),
SIMDE_FLOAT32_C( 9581.30), SIMDE_FLOAT32_C( 1154.54), SIMDE_FLOAT32_C( 3.96), SIMDE_FLOAT32_C( 1.07),
SIMDE_FLOAT32_C( 3.98), SIMDE_FLOAT32_C( 2118.90), SIMDE_FLOAT32_C( 3.67), SIMDE_FLOAT32_C( 4.00)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 4921.97), SIMDE_FLOAT32_C( 1314.36), SIMDE_FLOAT32_C( 3425.34), SIMDE_FLOAT32_C( 5889.62),
SIMDE_FLOAT32_C( 6729.66), SIMDE_FLOAT32_C( 9443.57), SIMDE_FLOAT32_C( 9578.53), SIMDE_FLOAT32_C( 5667.58),
SIMDE_FLOAT32_C( 7424.68), SIMDE_FLOAT32_C( 2009.69), SIMDE_FLOAT32_C( 1044.67), SIMDE_FLOAT32_C( 1170.36),
SIMDE_FLOAT32_C( 6106.86), SIMDE_FLOAT32_C( 1058.19), SIMDE_FLOAT32_C( 1124.78), SIMDE_FLOAT32_C( 7203.19)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7482.85), SIMDE_FLOAT32_C( 9575.95), SIMDE_FLOAT32_C( 1407.98), SIMDE_FLOAT32_C( 5799.87),
SIMDE_FLOAT32_C( 694.94), SIMDE_FLOAT32_C( 7133.07), SIMDE_FLOAT32_C( 9660.54), SIMDE_FLOAT32_C( 5551.82),
SIMDE_FLOAT32_C( 9134.21), SIMDE_FLOAT32_C( 4616.24), SIMDE_FLOAT32_C( 6187.92), SIMDE_FLOAT32_C( 3107.51),
SIMDE_FLOAT32_C( 1991.62), SIMDE_FLOAT32_C( 1882.51), SIMDE_FLOAT32_C( 287.66), SIMDE_FLOAT32_C( 7377.56)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4921.97), SIMDE_FLOAT32_C( 3.98), SIMDE_FLOAT32_C( 3425.34), SIMDE_FLOAT32_C( 3.76),
SIMDE_FLOAT32_C( 6729.66), SIMDE_FLOAT32_C( 3.85), SIMDE_FLOAT32_C( 3.99), SIMDE_FLOAT32_C( 3.74),
SIMDE_FLOAT32_C( 7424.68), SIMDE_FLOAT32_C( 2009.69), SIMDE_FLOAT32_C( 1044.67), SIMDE_FLOAT32_C( 1170.36),
SIMDE_FLOAT32_C( 6106.86), SIMDE_FLOAT32_C( 1058.19), SIMDE_FLOAT32_C( 2.46), SIMDE_FLOAT32_C( 7203.19)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 9415.27), SIMDE_FLOAT32_C( 963.59), SIMDE_FLOAT32_C( 4649.74), SIMDE_FLOAT32_C( 1078.30),
SIMDE_FLOAT32_C( 5462.61), SIMDE_FLOAT32_C( 6033.01), SIMDE_FLOAT32_C( 9173.00), SIMDE_FLOAT32_C( 4672.02),
SIMDE_FLOAT32_C( 3569.65), SIMDE_FLOAT32_C( 3935.68), SIMDE_FLOAT32_C( 3408.08), SIMDE_FLOAT32_C( 8917.42),
SIMDE_FLOAT32_C( 1855.90), SIMDE_FLOAT32_C( 7781.74), SIMDE_FLOAT32_C( 7197.17), SIMDE_FLOAT32_C( 7170.16)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.74), SIMDE_FLOAT32_C( 2968.36), SIMDE_FLOAT32_C( 1281.72), SIMDE_FLOAT32_C( 1177.11),
SIMDE_FLOAT32_C( 8949.44), SIMDE_FLOAT32_C( 5024.17), SIMDE_FLOAT32_C( 907.29), SIMDE_FLOAT32_C( 5805.32),
SIMDE_FLOAT32_C( 7896.24), SIMDE_FLOAT32_C( 4941.12), SIMDE_FLOAT32_C( 3457.39), SIMDE_FLOAT32_C( 1402.13),
SIMDE_FLOAT32_C( 6670.00), SIMDE_FLOAT32_C( 6373.56), SIMDE_FLOAT32_C( 415.89), SIMDE_FLOAT32_C( 2550.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 9415.27), SIMDE_FLOAT32_C( 3.47), SIMDE_FLOAT32_C( 3.11), SIMDE_FLOAT32_C( 1078.30),
SIMDE_FLOAT32_C( 3.95), SIMDE_FLOAT32_C( 6033.01), SIMDE_FLOAT32_C( 2.96), SIMDE_FLOAT32_C( 3.76),
SIMDE_FLOAT32_C( 3569.65), SIMDE_FLOAT32_C( 3935.68), SIMDE_FLOAT32_C( 3408.08), SIMDE_FLOAT32_C( 8917.42),
SIMDE_FLOAT32_C( 1855.90), SIMDE_FLOAT32_C( 3.80), SIMDE_FLOAT32_C( 7197.17), SIMDE_FLOAT32_C( 7170.16)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 7648.13), SIMDE_FLOAT32_C( 4875.56), SIMDE_FLOAT32_C( 161.12), SIMDE_FLOAT32_C( 8194.68),
SIMDE_FLOAT32_C( 7254.51), SIMDE_FLOAT32_C( 1142.29), SIMDE_FLOAT32_C( 5528.96), SIMDE_FLOAT32_C( 7950.51),
SIMDE_FLOAT32_C( 5154.57), SIMDE_FLOAT32_C( 8176.75), SIMDE_FLOAT32_C( 4580.00), SIMDE_FLOAT32_C( 5400.22),
SIMDE_FLOAT32_C( 1452.71), SIMDE_FLOAT32_C( 8039.28), SIMDE_FLOAT32_C( 6972.90), SIMDE_FLOAT32_C( 554.46)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5093.74), SIMDE_FLOAT32_C( 9045.23), SIMDE_FLOAT32_C( 5720.26), SIMDE_FLOAT32_C( 2861.39),
SIMDE_FLOAT32_C( 6541.39), SIMDE_FLOAT32_C( 4114.75), SIMDE_FLOAT32_C( 2711.17), SIMDE_FLOAT32_C( 8391.22),
SIMDE_FLOAT32_C( 5330.27), SIMDE_FLOAT32_C( 3661.45), SIMDE_FLOAT32_C( 5586.41), SIMDE_FLOAT32_C( 2116.00),
SIMDE_FLOAT32_C( 4808.04), SIMDE_FLOAT32_C( 3749.32), SIMDE_FLOAT32_C( 4730.38), SIMDE_FLOAT32_C( 5459.69)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7648.13), SIMDE_FLOAT32_C( 4875.56), SIMDE_FLOAT32_C( 161.12), SIMDE_FLOAT32_C( 8194.68),
SIMDE_FLOAT32_C( 7254.51), SIMDE_FLOAT32_C( 1142.29), SIMDE_FLOAT32_C( 3.43), SIMDE_FLOAT32_C( 3.92),
SIMDE_FLOAT32_C( 3.73), SIMDE_FLOAT32_C( 8176.75), SIMDE_FLOAT32_C( 3.75), SIMDE_FLOAT32_C( 3.33),
SIMDE_FLOAT32_C( 3.68), SIMDE_FLOAT32_C( 8039.28), SIMDE_FLOAT32_C( 6972.90), SIMDE_FLOAT32_C( 3.74)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 1058.07), SIMDE_FLOAT32_C( 6652.15), SIMDE_FLOAT32_C( 2532.95), SIMDE_FLOAT32_C( 9113.62),
SIMDE_FLOAT32_C( 9783.41), SIMDE_FLOAT32_C( 9773.08), SIMDE_FLOAT32_C( 9127.47), SIMDE_FLOAT32_C( 918.64),
SIMDE_FLOAT32_C( 3953.30), SIMDE_FLOAT32_C( 333.95), SIMDE_FLOAT32_C( 1356.49), SIMDE_FLOAT32_C( 2899.69),
SIMDE_FLOAT32_C( 5501.59), SIMDE_FLOAT32_C( 5515.77), SIMDE_FLOAT32_C( 7198.84), SIMDE_FLOAT32_C( 3978.34)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 792.83), SIMDE_FLOAT32_C( 4929.19), SIMDE_FLOAT32_C( 9124.38), SIMDE_FLOAT32_C( 8968.13),
SIMDE_FLOAT32_C( 1316.26), SIMDE_FLOAT32_C( 3447.13), SIMDE_FLOAT32_C( 8644.35), SIMDE_FLOAT32_C( 3246.39),
SIMDE_FLOAT32_C( 5304.47), SIMDE_FLOAT32_C( 5549.07), SIMDE_FLOAT32_C( 8579.68), SIMDE_FLOAT32_C( 3747.01),
SIMDE_FLOAT32_C( 9720.69), SIMDE_FLOAT32_C( 6809.26), SIMDE_FLOAT32_C( 4934.63), SIMDE_FLOAT32_C( 9263.02)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1058.07), SIMDE_FLOAT32_C( 6652.15), SIMDE_FLOAT32_C( 3.96), SIMDE_FLOAT32_C( 3.95),
SIMDE_FLOAT32_C( 9783.41), SIMDE_FLOAT32_C( 9773.08), SIMDE_FLOAT32_C( 9127.47), SIMDE_FLOAT32_C( 3.51),
SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( 333.95), SIMDE_FLOAT32_C( 3.93), SIMDE_FLOAT32_C( 2899.69),
SIMDE_FLOAT32_C( 3.99), SIMDE_FLOAT32_C( 5515.77), SIMDE_FLOAT32_C( 7198.84), SIMDE_FLOAT32_C( 3.97)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_log10_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_log10_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 7486.55), SIMDE_FLOAT64_C( 8351.20),
SIMDE_FLOAT64_C( 3512.77), SIMDE_FLOAT64_C( 5170.29),
SIMDE_FLOAT64_C( 4068.94), SIMDE_FLOAT64_C( 5195.06),
SIMDE_FLOAT64_C( 1228.12), SIMDE_FLOAT64_C( 6733.16)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.87), SIMDE_FLOAT64_C( 3.92),
SIMDE_FLOAT64_C( 3.55), SIMDE_FLOAT64_C( 3.71),
SIMDE_FLOAT64_C( 3.61), SIMDE_FLOAT64_C( 3.72),
SIMDE_FLOAT64_C( 3.09), SIMDE_FLOAT64_C( 3.83)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1609.14), SIMDE_FLOAT64_C( 1569.36),
SIMDE_FLOAT64_C( 5423.87), SIMDE_FLOAT64_C( 7857.29),
SIMDE_FLOAT64_C( 9127.65), SIMDE_FLOAT64_C( 7111.03),
SIMDE_FLOAT64_C( 3652.77), SIMDE_FLOAT64_C( 7338.80)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.21), SIMDE_FLOAT64_C( 3.20),
SIMDE_FLOAT64_C( 3.73), SIMDE_FLOAT64_C( 3.90),
SIMDE_FLOAT64_C( 3.96), SIMDE_FLOAT64_C( 3.85),
SIMDE_FLOAT64_C( 3.56), SIMDE_FLOAT64_C( 3.87)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 2775.95), SIMDE_FLOAT64_C( 5142.35),
SIMDE_FLOAT64_C( 3079.83), SIMDE_FLOAT64_C( 381.82),
SIMDE_FLOAT64_C( 3474.63), SIMDE_FLOAT64_C( 695.25),
SIMDE_FLOAT64_C( 2912.29), SIMDE_FLOAT64_C( 8484.34)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.44), SIMDE_FLOAT64_C( 3.71),
SIMDE_FLOAT64_C( 3.49), SIMDE_FLOAT64_C( 2.58),
SIMDE_FLOAT64_C( 3.54), SIMDE_FLOAT64_C( 2.84),
SIMDE_FLOAT64_C( 3.46), SIMDE_FLOAT64_C( 3.93)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5890.98), SIMDE_FLOAT64_C( 2746.67),
SIMDE_FLOAT64_C( 6166.85), SIMDE_FLOAT64_C( 8435.45),
SIMDE_FLOAT64_C( 6306.54), SIMDE_FLOAT64_C( 3937.29),
SIMDE_FLOAT64_C( 117.23), SIMDE_FLOAT64_C( 1696.00)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.77), SIMDE_FLOAT64_C( 3.44),
SIMDE_FLOAT64_C( 3.79), SIMDE_FLOAT64_C( 3.93),
SIMDE_FLOAT64_C( 3.80), SIMDE_FLOAT64_C( 3.60),
SIMDE_FLOAT64_C( 2.07), SIMDE_FLOAT64_C( 3.23)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1148.23), SIMDE_FLOAT64_C( 7217.40),
SIMDE_FLOAT64_C( 2082.02), SIMDE_FLOAT64_C( 6902.28),
SIMDE_FLOAT64_C( 1146.40), SIMDE_FLOAT64_C( 9969.51),
SIMDE_FLOAT64_C( 5140.40), SIMDE_FLOAT64_C( 9206.03)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.06), SIMDE_FLOAT64_C( 3.86),
SIMDE_FLOAT64_C( 3.32), SIMDE_FLOAT64_C( 3.84),
SIMDE_FLOAT64_C( 3.06), SIMDE_FLOAT64_C( 4.00),
SIMDE_FLOAT64_C( 3.71), SIMDE_FLOAT64_C( 3.96)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3060.52), SIMDE_FLOAT64_C( 6979.60),
SIMDE_FLOAT64_C( 8279.36), SIMDE_FLOAT64_C( 6696.04),
SIMDE_FLOAT64_C( 7661.76), SIMDE_FLOAT64_C( 3680.04),
SIMDE_FLOAT64_C( 8903.22), SIMDE_FLOAT64_C( 4846.05)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.49), SIMDE_FLOAT64_C( 3.84),
SIMDE_FLOAT64_C( 3.92), SIMDE_FLOAT64_C( 3.83),
SIMDE_FLOAT64_C( 3.88), SIMDE_FLOAT64_C( 3.57),
SIMDE_FLOAT64_C( 3.95), SIMDE_FLOAT64_C( 3.69)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3981.75), SIMDE_FLOAT64_C( 4596.36),
SIMDE_FLOAT64_C( 6683.64), SIMDE_FLOAT64_C( 276.11),
SIMDE_FLOAT64_C( 1262.07), SIMDE_FLOAT64_C( 1163.84),
SIMDE_FLOAT64_C( 2229.06), SIMDE_FLOAT64_C( 6994.08)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.60), SIMDE_FLOAT64_C( 3.66),
SIMDE_FLOAT64_C( 3.83), SIMDE_FLOAT64_C( 2.44),
SIMDE_FLOAT64_C( 3.10), SIMDE_FLOAT64_C( 3.07),
SIMDE_FLOAT64_C( 3.35), SIMDE_FLOAT64_C( 3.84)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 7348.31), SIMDE_FLOAT64_C( 8400.08),
SIMDE_FLOAT64_C( 4256.55), SIMDE_FLOAT64_C( 9093.31),
SIMDE_FLOAT64_C( 9550.14), SIMDE_FLOAT64_C( 8002.34),
SIMDE_FLOAT64_C( 8956.15), SIMDE_FLOAT64_C( 6271.53)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.87), SIMDE_FLOAT64_C( 3.92),
SIMDE_FLOAT64_C( 3.63), SIMDE_FLOAT64_C( 3.96),
SIMDE_FLOAT64_C( 3.98), SIMDE_FLOAT64_C( 3.90),
SIMDE_FLOAT64_C( 3.95), SIMDE_FLOAT64_C( 3.80)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_log10_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_log10_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1569.36), SIMDE_FLOAT64_C( 7857.29),
SIMDE_FLOAT64_C( 7111.03), SIMDE_FLOAT64_C( 7338.80),
SIMDE_FLOAT64_C( 8351.20), SIMDE_FLOAT64_C( 5170.29),
SIMDE_FLOAT64_C( 5195.06), SIMDE_FLOAT64_C( 6733.16)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1609.14), SIMDE_FLOAT64_C( 5423.87),
SIMDE_FLOAT64_C( 9127.65), SIMDE_FLOAT64_C( 3652.77),
SIMDE_FLOAT64_C( 7486.55), SIMDE_FLOAT64_C( 3512.77),
SIMDE_FLOAT64_C( 4068.94), SIMDE_FLOAT64_C( 1228.12)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.21), SIMDE_FLOAT64_C( 7857.29),
SIMDE_FLOAT64_C( 7111.03), SIMDE_FLOAT64_C( 7338.80),
SIMDE_FLOAT64_C( 3.87), SIMDE_FLOAT64_C( 5170.29),
SIMDE_FLOAT64_C( 3.61), SIMDE_FLOAT64_C( 3.09)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5890.98), SIMDE_FLOAT64_C( 6166.85),
SIMDE_FLOAT64_C( 6306.54), SIMDE_FLOAT64_C( 117.23),
SIMDE_FLOAT64_C( 2775.95), SIMDE_FLOAT64_C( 3079.83),
SIMDE_FLOAT64_C( 3474.63), SIMDE_FLOAT64_C( 2912.29)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 9206.03), SIMDE_FLOAT64_C( 2746.67),
SIMDE_FLOAT64_C( 8435.45), SIMDE_FLOAT64_C( 3937.29),
SIMDE_FLOAT64_C( 1696.00), SIMDE_FLOAT64_C( 5142.35),
SIMDE_FLOAT64_C( 381.82), SIMDE_FLOAT64_C( 695.25)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.96), SIMDE_FLOAT64_C( 3.44),
SIMDE_FLOAT64_C( 3.93), SIMDE_FLOAT64_C( 117.23),
SIMDE_FLOAT64_C( 2775.95), SIMDE_FLOAT64_C( 3.71),
SIMDE_FLOAT64_C( 3474.63), SIMDE_FLOAT64_C( 2.84)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 6994.08), SIMDE_FLOAT64_C( 6979.60),
SIMDE_FLOAT64_C( 6696.04), SIMDE_FLOAT64_C( 3680.04),
SIMDE_FLOAT64_C( 4846.05), SIMDE_FLOAT64_C( 7217.40),
SIMDE_FLOAT64_C( 6902.28), SIMDE_FLOAT64_C( 9969.51)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2229.06), SIMDE_FLOAT64_C( 3060.52),
SIMDE_FLOAT64_C( 8279.36), SIMDE_FLOAT64_C( 7661.76),
SIMDE_FLOAT64_C( 8903.22), SIMDE_FLOAT64_C( 1148.23),
SIMDE_FLOAT64_C( 2082.02), SIMDE_FLOAT64_C( 1146.40)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.35), SIMDE_FLOAT64_C( 3.49),
SIMDE_FLOAT64_C( 3.92), SIMDE_FLOAT64_C( 3.88),
SIMDE_FLOAT64_C( 3.95), SIMDE_FLOAT64_C( 3.06),
SIMDE_FLOAT64_C( 6902.28), SIMDE_FLOAT64_C( 3.06)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5603.27), SIMDE_FLOAT64_C( 7348.31),
SIMDE_FLOAT64_C( 4256.55), SIMDE_FLOAT64_C( 9550.14),
SIMDE_FLOAT64_C( 8956.15), SIMDE_FLOAT64_C( 3981.75),
SIMDE_FLOAT64_C( 6683.64), SIMDE_FLOAT64_C( 1262.07)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7716.73), SIMDE_FLOAT64_C( 4142.45),
SIMDE_FLOAT64_C( 8400.08), SIMDE_FLOAT64_C( 9093.31),
SIMDE_FLOAT64_C( 8002.34), SIMDE_FLOAT64_C( 6271.53),
SIMDE_FLOAT64_C( 4596.36), SIMDE_FLOAT64_C( 276.11)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 5603.27), SIMDE_FLOAT64_C( 3.62),
SIMDE_FLOAT64_C( 4256.55), SIMDE_FLOAT64_C( 3.96),
SIMDE_FLOAT64_C( 3.90), SIMDE_FLOAT64_C( 3.80),
SIMDE_FLOAT64_C( 6683.64), SIMDE_FLOAT64_C( 2.44)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5499.67), SIMDE_FLOAT64_C( 1309.05),
SIMDE_FLOAT64_C( 8793.93), SIMDE_FLOAT64_C( 6717.40),
SIMDE_FLOAT64_C( 1010.42), SIMDE_FLOAT64_C( 2370.85),
SIMDE_FLOAT64_C( 886.73), SIMDE_FLOAT64_C( 8278.35)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 6656.71), SIMDE_FLOAT64_C( 7314.76),
SIMDE_FLOAT64_C( 4105.04), SIMDE_FLOAT64_C( 6623.12),
SIMDE_FLOAT64_C( 628.43), SIMDE_FLOAT64_C( 3357.32),
SIMDE_FLOAT64_C( 4038.44), SIMDE_FLOAT64_C( 7806.81)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.82), SIMDE_FLOAT64_C( 1309.05),
SIMDE_FLOAT64_C( 8793.93), SIMDE_FLOAT64_C( 3.82),
SIMDE_FLOAT64_C( 1010.42), SIMDE_FLOAT64_C( 2370.85),
SIMDE_FLOAT64_C( 886.73), SIMDE_FLOAT64_C( 3.89)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1217.92), SIMDE_FLOAT64_C( 5136.26),
SIMDE_FLOAT64_C( 8450.59), SIMDE_FLOAT64_C( 4894.53),
SIMDE_FLOAT64_C( 2755.53), SIMDE_FLOAT64_C( 7528.93),
SIMDE_FLOAT64_C( 9155.11), SIMDE_FLOAT64_C( 9886.80)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 9887.44), SIMDE_FLOAT64_C( 7124.06),
SIMDE_FLOAT64_C( 4524.23), SIMDE_FLOAT64_C( 9203.26),
SIMDE_FLOAT64_C( 2042.18), SIMDE_FLOAT64_C( 8657.47),
SIMDE_FLOAT64_C( 8118.50), SIMDE_FLOAT64_C( 5703.37)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1217.92), SIMDE_FLOAT64_C( 3.85),
SIMDE_FLOAT64_C( 8450.59), SIMDE_FLOAT64_C( 4894.53),
SIMDE_FLOAT64_C( 3.31), SIMDE_FLOAT64_C( 7528.93),
SIMDE_FLOAT64_C( 3.91), SIMDE_FLOAT64_C( 3.76)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 6973.34), SIMDE_FLOAT64_C( 3476.37),
SIMDE_FLOAT64_C( 1516.57), SIMDE_FLOAT64_C( 9110.29),
SIMDE_FLOAT64_C( 11.83), SIMDE_FLOAT64_C( 9618.20),
SIMDE_FLOAT64_C( 1159.42), SIMDE_FLOAT64_C( 4661.80)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7554.25), SIMDE_FLOAT64_C( 5071.68),
SIMDE_FLOAT64_C( 9581.30), SIMDE_FLOAT64_C( 1154.54),
SIMDE_FLOAT64_C( 2130.97), SIMDE_FLOAT64_C( 3312.02),
SIMDE_FLOAT64_C( 6468.19), SIMDE_FLOAT64_C( 2118.90)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 6973.34), SIMDE_FLOAT64_C( 3.71),
SIMDE_FLOAT64_C( 1516.57), SIMDE_FLOAT64_C( 3.06),
SIMDE_FLOAT64_C( 3.33), SIMDE_FLOAT64_C( 3.52),
SIMDE_FLOAT64_C( 1159.42), SIMDE_FLOAT64_C( 3.33)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 7377.56), SIMDE_FLOAT64_C( 9683.23),
SIMDE_FLOAT64_C( 3256.50), SIMDE_FLOAT64_C( 2809.03),
SIMDE_FLOAT64_C( 1237.85), SIMDE_FLOAT64_C( 9663.28),
SIMDE_FLOAT64_C( 3363.90), SIMDE_FLOAT64_C( 4087.77)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1124.78), SIMDE_FLOAT64_C( 7203.19),
SIMDE_FLOAT64_C( 9486.33), SIMDE_FLOAT64_C( 4010.56),
SIMDE_FLOAT64_C( 3201.22), SIMDE_FLOAT64_C( 4831.67),
SIMDE_FLOAT64_C( 5036.36), SIMDE_FLOAT64_C( 4374.02)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.05), SIMDE_FLOAT64_C( 3.86),
SIMDE_FLOAT64_C( 3256.50), SIMDE_FLOAT64_C( 3.60),
SIMDE_FLOAT64_C( 1237.85), SIMDE_FLOAT64_C( 3.68),
SIMDE_FLOAT64_C( 3363.90), SIMDE_FLOAT64_C( 3.64)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_log10_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_logb_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 52.75), SIMDE_FLOAT32_C( 12.37), SIMDE_FLOAT32_C( 32.32), SIMDE_FLOAT32_C( 26.90) },
{ SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 4.00) } },
{ { SIMDE_FLOAT32_C( 28.49), SIMDE_FLOAT32_C( 18.47), SIMDE_FLOAT32_C( 63.22), SIMDE_FLOAT32_C( 55.89) },
{ SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00) } },
{ { SIMDE_FLOAT32_C( 55.03), SIMDE_FLOAT32_C( 53.88), SIMDE_FLOAT32_C( 60.21), SIMDE_FLOAT32_C( 98.39) },
{ SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00) } },
{ { SIMDE_FLOAT32_C( 48.09), SIMDE_FLOAT32_C( 71.36), SIMDE_FLOAT32_C( 70.54), SIMDE_FLOAT32_C( 16.55) },
{ SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 4.00) } },
{ { SIMDE_FLOAT32_C( 80.97), SIMDE_FLOAT32_C( 4.96), SIMDE_FLOAT32_C( 37.49), SIMDE_FLOAT32_C( 46.77) },
{ SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00) } },
{ { SIMDE_FLOAT32_C( 90.48), SIMDE_FLOAT32_C( 58.54), SIMDE_FLOAT32_C( 37.33), SIMDE_FLOAT32_C( 31.14) },
{ SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 4.00) } },
{ { SIMDE_FLOAT32_C( 72.20), SIMDE_FLOAT32_C( 35.18), SIMDE_FLOAT32_C( 41.35), SIMDE_FLOAT32_C( 41.72) },
{ SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00) } },
{ { SIMDE_FLOAT32_C( 30.55), SIMDE_FLOAT32_C( 90.31), SIMDE_FLOAT32_C( 81.30), SIMDE_FLOAT32_C( 83.30) },
{ SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_logb_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_logb_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 42.51), SIMDE_FLOAT64_C( 67.09) },
{ SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 6.00) } },
{ { SIMDE_FLOAT64_C( 79.25), SIMDE_FLOAT64_C( 26.02) },
{ SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 4.00) } },
{ { SIMDE_FLOAT64_C( 47.58), SIMDE_FLOAT64_C( 12.11) },
{ SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 3.00) } },
{ { SIMDE_FLOAT64_C( 67.84), SIMDE_FLOAT64_C( 75.08) },
{ SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 6.00) } },
{ { SIMDE_FLOAT64_C( 6.25), SIMDE_FLOAT64_C( 48.99) },
{ SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 5.00) } },
{ { SIMDE_FLOAT64_C( 74.95), SIMDE_FLOAT64_C( 97.10) },
{ SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 6.00) } },
{ { SIMDE_FLOAT64_C( 9.84), SIMDE_FLOAT64_C( 31.53) },
{ SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 4.00) } },
{ { SIMDE_FLOAT64_C( 85.29), SIMDE_FLOAT64_C( 31.26) },
{ SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 4.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_logb_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_logb_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 14.78), SIMDE_FLOAT32_C( 3.51), SIMDE_FLOAT32_C( 41.15),
SIMDE_FLOAT32_C( 36.54), SIMDE_FLOAT32_C( 70.74), SIMDE_FLOAT32_C( 85.77), SIMDE_FLOAT32_C( 73.18) },
{ SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00) } },
{ { SIMDE_FLOAT32_C( 8.54), SIMDE_FLOAT32_C( 76.06), SIMDE_FLOAT32_C( 3.47), SIMDE_FLOAT32_C( 1.66),
SIMDE_FLOAT32_C( 10.98), SIMDE_FLOAT32_C( 98.59), SIMDE_FLOAT32_C( 85.97), SIMDE_FLOAT32_C( 34.95) },
{ SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00) } },
{ { SIMDE_FLOAT32_C( 50.04), SIMDE_FLOAT32_C( 79.93), SIMDE_FLOAT32_C( 79.22), SIMDE_FLOAT32_C( 75.66),
SIMDE_FLOAT32_C( 78.73), SIMDE_FLOAT32_C( 98.52), SIMDE_FLOAT32_C( 71.74), SIMDE_FLOAT32_C( 29.91) },
{ SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 4.00) } },
{ { SIMDE_FLOAT32_C( 36.91), SIMDE_FLOAT32_C( 76.48), SIMDE_FLOAT32_C( 92.50), SIMDE_FLOAT32_C( 91.82),
SIMDE_FLOAT32_C( 48.28), SIMDE_FLOAT32_C( 85.39), SIMDE_FLOAT32_C( 15.78), SIMDE_FLOAT32_C( 51.62) },
{ SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 5.00) } },
{ { SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 19.29), SIMDE_FLOAT32_C( 92.76), SIMDE_FLOAT32_C( 36.71),
SIMDE_FLOAT32_C( 90.02), SIMDE_FLOAT32_C( 78.53), SIMDE_FLOAT32_C( 9.89), SIMDE_FLOAT32_C( 98.56) },
{ SIMDE_FLOAT32_C( -3.00), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00) } },
{ { SIMDE_FLOAT32_C( 54.59), SIMDE_FLOAT32_C( 13.36), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 65.57),
SIMDE_FLOAT32_C( 11.95), SIMDE_FLOAT32_C( 86.19), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 61.99) },
{ SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( -3.00), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 5.00) } },
{ { SIMDE_FLOAT32_C( 66.13), SIMDE_FLOAT32_C( 79.73), SIMDE_FLOAT32_C( 37.65), SIMDE_FLOAT32_C( 44.86),
SIMDE_FLOAT32_C( 78.25), SIMDE_FLOAT32_C( 9.39), SIMDE_FLOAT32_C( 74.76), SIMDE_FLOAT32_C( 15.16) },
{ SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 3.00) } },
{ { SIMDE_FLOAT32_C( 85.87), SIMDE_FLOAT32_C( 67.26), SIMDE_FLOAT32_C( 6.97), SIMDE_FLOAT32_C( 34.15),
SIMDE_FLOAT32_C( 52.65), SIMDE_FLOAT32_C( 22.75), SIMDE_FLOAT32_C( 85.77), SIMDE_FLOAT32_C( 52.82) },
{ SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_logb_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_logb_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 12.13), SIMDE_FLOAT64_C( 86.21), SIMDE_FLOAT64_C( 41.78), SIMDE_FLOAT64_C( 6.77) },
{ SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 2.00) } },
{ { SIMDE_FLOAT64_C( 9.71), SIMDE_FLOAT64_C( 21.14), SIMDE_FLOAT64_C( 79.78), SIMDE_FLOAT64_C( 24.32) },
{ SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 4.00) } },
{ { SIMDE_FLOAT64_C( 11.31), SIMDE_FLOAT64_C( 66.21), SIMDE_FLOAT64_C( 43.11), SIMDE_FLOAT64_C( 34.90) },
{ SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 5.00) } },
{ { SIMDE_FLOAT64_C( 20.79), SIMDE_FLOAT64_C( 71.26), SIMDE_FLOAT64_C( 78.76), SIMDE_FLOAT64_C( 61.13) },
{ SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 5.00) } },
{ { SIMDE_FLOAT64_C( 36.20), SIMDE_FLOAT64_C( 5.13), SIMDE_FLOAT64_C( 45.05), SIMDE_FLOAT64_C( 35.23) },
{ SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 5.00) } },
{ { SIMDE_FLOAT64_C( 73.81), SIMDE_FLOAT64_C( 52.97), SIMDE_FLOAT64_C( 18.59), SIMDE_FLOAT64_C( 15.62) },
{ SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 3.00) } },
{ { SIMDE_FLOAT64_C( 69.75), SIMDE_FLOAT64_C( 24.82), SIMDE_FLOAT64_C( 30.54), SIMDE_FLOAT64_C( 67.55) },
{ SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 6.00) } },
{ { SIMDE_FLOAT64_C( 11.30), SIMDE_FLOAT64_C( 38.09), SIMDE_FLOAT64_C( 44.42), SIMDE_FLOAT64_C( 23.43) },
{ SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 4.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_logb_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_logb_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 21.10), SIMDE_FLOAT32_C( 11.56), SIMDE_FLOAT32_C( 9.28), SIMDE_FLOAT32_C( 74.19),
SIMDE_FLOAT32_C( 63.11), SIMDE_FLOAT32_C( 46.70), SIMDE_FLOAT32_C( 5.76), SIMDE_FLOAT32_C( 81.08),
SIMDE_FLOAT32_C( 64.90), SIMDE_FLOAT32_C( 46.85), SIMDE_FLOAT32_C( 89.59), SIMDE_FLOAT32_C( 87.79),
SIMDE_FLOAT32_C( 91.37), SIMDE_FLOAT32_C( 41.43), SIMDE_FLOAT32_C( 25.79), SIMDE_FLOAT32_C( 88.74) },
{ SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 6.00) } },
{ { SIMDE_FLOAT32_C( 11.74), SIMDE_FLOAT32_C( 71.01), SIMDE_FLOAT32_C( 59.27), SIMDE_FLOAT32_C( 4.58),
SIMDE_FLOAT32_C( 8.70), SIMDE_FLOAT32_C( 79.13), SIMDE_FLOAT32_C( 97.09), SIMDE_FLOAT32_C( 48.86),
SIMDE_FLOAT32_C( 12.81), SIMDE_FLOAT32_C( 63.88), SIMDE_FLOAT32_C( 81.17), SIMDE_FLOAT32_C( 72.37),
SIMDE_FLOAT32_C( 6.60), SIMDE_FLOAT32_C( 41.15), SIMDE_FLOAT32_C( 9.63), SIMDE_FLOAT32_C( 27.69) },
{ SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 4.00) } },
{ { SIMDE_FLOAT32_C( 52.70), SIMDE_FLOAT32_C( 18.90), SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 15.81),
SIMDE_FLOAT32_C( 65.61), SIMDE_FLOAT32_C( 7.64), SIMDE_FLOAT32_C( 96.89), SIMDE_FLOAT32_C( 30.50),
SIMDE_FLOAT32_C( 54.49), SIMDE_FLOAT32_C( 86.48), SIMDE_FLOAT32_C( 18.30), SIMDE_FLOAT32_C( 45.86),
SIMDE_FLOAT32_C( 27.91), SIMDE_FLOAT32_C( 44.09), SIMDE_FLOAT32_C( 34.59), SIMDE_FLOAT32_C( 39.65) },
{ SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 3.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 4.00),
SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00) } },
{ { SIMDE_FLOAT32_C( 15.10), SIMDE_FLOAT32_C( 93.86), SIMDE_FLOAT32_C( 44.23), SIMDE_FLOAT32_C( 23.80),
SIMDE_FLOAT32_C( 72.99), SIMDE_FLOAT32_C( 41.32), SIMDE_FLOAT32_C( 72.65), SIMDE_FLOAT32_C( 85.79),
SIMDE_FLOAT32_C( 5.20), SIMDE_FLOAT32_C( 53.82), SIMDE_FLOAT32_C( 58.16), SIMDE_FLOAT32_C( 11.80),
SIMDE_FLOAT32_C( 94.97), SIMDE_FLOAT32_C( 67.79), SIMDE_FLOAT32_C( 39.49), SIMDE_FLOAT32_C( 47.67) },
{ SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 4.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 3.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00) } },
{ { SIMDE_FLOAT32_C( 86.69), SIMDE_FLOAT32_C( 41.37), SIMDE_FLOAT32_C( 63.48), SIMDE_FLOAT32_C( 52.30),
SIMDE_FLOAT32_C( 49.01), SIMDE_FLOAT32_C( 60.37), SIMDE_FLOAT32_C( 82.80), SIMDE_FLOAT32_C( 3.50),
SIMDE_FLOAT32_C( 46.85), SIMDE_FLOAT32_C( 1.10), SIMDE_FLOAT32_C( 49.36), SIMDE_FLOAT32_C( 74.76),
SIMDE_FLOAT32_C( 45.19), SIMDE_FLOAT32_C( 83.95), SIMDE_FLOAT32_C( 14.42), SIMDE_FLOAT32_C( 60.29) },
{ SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 5.00) } },
{ { SIMDE_FLOAT32_C( 77.81), SIMDE_FLOAT32_C( 58.65), SIMDE_FLOAT32_C( 84.09), SIMDE_FLOAT32_C( 50.80),
SIMDE_FLOAT32_C( 99.97), SIMDE_FLOAT32_C( 56.74), SIMDE_FLOAT32_C( 36.60), SIMDE_FLOAT32_C( 5.17),
SIMDE_FLOAT32_C( 10.56), SIMDE_FLOAT32_C( 94.76), SIMDE_FLOAT32_C( 16.97), SIMDE_FLOAT32_C( 5.53),
SIMDE_FLOAT32_C( 62.55), SIMDE_FLOAT32_C( 56.46), SIMDE_FLOAT32_C( 53.21), SIMDE_FLOAT32_C( 49.24) },
{ SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00) } },
{ { SIMDE_FLOAT32_C( 97.83), SIMDE_FLOAT32_C( 16.69), SIMDE_FLOAT32_C( 1.54), SIMDE_FLOAT32_C( 46.83),
SIMDE_FLOAT32_C( 77.05), SIMDE_FLOAT32_C( 84.34), SIMDE_FLOAT32_C( 50.33), SIMDE_FLOAT32_C( 23.90),
SIMDE_FLOAT32_C( 85.44), SIMDE_FLOAT32_C( 99.69), SIMDE_FLOAT32_C( 98.67), SIMDE_FLOAT32_C( 30.63),
SIMDE_FLOAT32_C( 83.65), SIMDE_FLOAT32_C( 13.08), SIMDE_FLOAT32_C( 90.93), SIMDE_FLOAT32_C( 61.46) },
{ SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 4.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 4.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00) } },
{ { SIMDE_FLOAT32_C( 71.73), SIMDE_FLOAT32_C( 75.01), SIMDE_FLOAT32_C( 12.26), SIMDE_FLOAT32_C( 71.69),
SIMDE_FLOAT32_C( 31.76), SIMDE_FLOAT32_C( 48.85), SIMDE_FLOAT32_C( 76.86), SIMDE_FLOAT32_C( 42.32),
SIMDE_FLOAT32_C( 43.61), SIMDE_FLOAT32_C( 93.83), SIMDE_FLOAT32_C( 47.85), SIMDE_FLOAT32_C( 6.16),
SIMDE_FLOAT32_C( 50.28), SIMDE_FLOAT32_C( 1.06), SIMDE_FLOAT32_C( 55.40), SIMDE_FLOAT32_C( 48.11) },
{ SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_logb_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_logb_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 59.92), SIMDE_FLOAT32_C( 53.26), SIMDE_FLOAT32_C( 9.59), SIMDE_FLOAT32_C( 7.55),
SIMDE_FLOAT32_C( 46.15), SIMDE_FLOAT32_C( 64.62), SIMDE_FLOAT32_C( 71.46), SIMDE_FLOAT32_C( 14.44),
SIMDE_FLOAT32_C( 20.71), SIMDE_FLOAT32_C( 37.36), SIMDE_FLOAT32_C( 74.54), SIMDE_FLOAT32_C( 71.98),
SIMDE_FLOAT32_C( 5.60), SIMDE_FLOAT32_C( 24.56), SIMDE_FLOAT32_C( 41.64), SIMDE_FLOAT32_C( 65.45) },
UINT8_C( 74),
{ SIMDE_FLOAT32_C( 94.52), SIMDE_FLOAT32_C( 66.49), SIMDE_FLOAT32_C( 56.15), SIMDE_FLOAT32_C( 82.67),
SIMDE_FLOAT32_C( 41.42), SIMDE_FLOAT32_C( 98.41), SIMDE_FLOAT32_C( 74.30), SIMDE_FLOAT32_C( 60.40),
SIMDE_FLOAT32_C( 20.04), SIMDE_FLOAT32_C( 51.01), SIMDE_FLOAT32_C( 8.26), SIMDE_FLOAT32_C( 26.15),
SIMDE_FLOAT32_C( 61.43), SIMDE_FLOAT32_C( 26.22), SIMDE_FLOAT32_C( 86.06), SIMDE_FLOAT32_C( 14.69) },
{ SIMDE_FLOAT32_C( 59.92), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 9.59), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 46.15), SIMDE_FLOAT32_C( 64.62), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 14.44),
SIMDE_FLOAT32_C( 20.71), SIMDE_FLOAT32_C( 37.36), SIMDE_FLOAT32_C( 74.54), SIMDE_FLOAT32_C( 71.98),
SIMDE_FLOAT32_C( 5.60), SIMDE_FLOAT32_C( 24.56), SIMDE_FLOAT32_C( 41.64), SIMDE_FLOAT32_C( 65.45) } },
{ { SIMDE_FLOAT32_C( 35.81), SIMDE_FLOAT32_C( 93.61), SIMDE_FLOAT32_C( 60.84), SIMDE_FLOAT32_C( 0.43),
SIMDE_FLOAT32_C( 65.08), SIMDE_FLOAT32_C( 75.28), SIMDE_FLOAT32_C( 21.13), SIMDE_FLOAT32_C( 2.43),
SIMDE_FLOAT32_C( 49.82), SIMDE_FLOAT32_C( 93.11), SIMDE_FLOAT32_C( 8.03), SIMDE_FLOAT32_C( 74.37),
SIMDE_FLOAT32_C( 34.75), SIMDE_FLOAT32_C( 73.48), SIMDE_FLOAT32_C( 66.83), SIMDE_FLOAT32_C( 29.26) },
UINT8_C(187),
{ SIMDE_FLOAT32_C( 22.98), SIMDE_FLOAT32_C( 11.94), SIMDE_FLOAT32_C( 81.39), SIMDE_FLOAT32_C( 21.39),
SIMDE_FLOAT32_C( 86.23), SIMDE_FLOAT32_C( 41.79), SIMDE_FLOAT32_C( 41.43), SIMDE_FLOAT32_C( 37.25),
SIMDE_FLOAT32_C( 50.05), SIMDE_FLOAT32_C( 67.58), SIMDE_FLOAT32_C( 98.68), SIMDE_FLOAT32_C( 76.27),
SIMDE_FLOAT32_C( 53.64), SIMDE_FLOAT32_C( 13.37), SIMDE_FLOAT32_C( 12.08), SIMDE_FLOAT32_C( 47.25) },
{ SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 60.84), SIMDE_FLOAT32_C( 4.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 21.13), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 49.82), SIMDE_FLOAT32_C( 93.11), SIMDE_FLOAT32_C( 8.03), SIMDE_FLOAT32_C( 74.37),
SIMDE_FLOAT32_C( 34.75), SIMDE_FLOAT32_C( 73.48), SIMDE_FLOAT32_C( 66.83), SIMDE_FLOAT32_C( 29.26) } },
{ { SIMDE_FLOAT32_C( 74.20), SIMDE_FLOAT32_C( 12.51), SIMDE_FLOAT32_C( 12.33), SIMDE_FLOAT32_C( 49.48),
SIMDE_FLOAT32_C( 33.65), SIMDE_FLOAT32_C( 14.76), SIMDE_FLOAT32_C( 99.30), SIMDE_FLOAT32_C( 26.76),
SIMDE_FLOAT32_C( 22.79), SIMDE_FLOAT32_C( 73.68), SIMDE_FLOAT32_C( 61.50), SIMDE_FLOAT32_C( 96.27),
SIMDE_FLOAT32_C( 40.51), SIMDE_FLOAT32_C( 90.77), SIMDE_FLOAT32_C( 36.25), SIMDE_FLOAT32_C( 63.49) },
UINT8_C(162),
{ SIMDE_FLOAT32_C( 17.64), SIMDE_FLOAT32_C( 84.88), SIMDE_FLOAT32_C( 88.94), SIMDE_FLOAT32_C( 59.43),
SIMDE_FLOAT32_C( 26.31), SIMDE_FLOAT32_C( 26.18), SIMDE_FLOAT32_C( 9.49), SIMDE_FLOAT32_C( 93.89),
SIMDE_FLOAT32_C( 24.86), SIMDE_FLOAT32_C( 85.76), SIMDE_FLOAT32_C( 47.53), SIMDE_FLOAT32_C( 38.23),
SIMDE_FLOAT32_C( 97.84), SIMDE_FLOAT32_C( 94.78), SIMDE_FLOAT32_C( 12.43), SIMDE_FLOAT32_C( 10.35) },
{ SIMDE_FLOAT32_C( 74.20), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 12.33), SIMDE_FLOAT32_C( 49.48),
SIMDE_FLOAT32_C( 33.65), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 99.30), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 22.79), SIMDE_FLOAT32_C( 73.68), SIMDE_FLOAT32_C( 61.50), SIMDE_FLOAT32_C( 96.27),
SIMDE_FLOAT32_C( 40.51), SIMDE_FLOAT32_C( 90.77), SIMDE_FLOAT32_C( 36.25), SIMDE_FLOAT32_C( 63.49) } },
{ { SIMDE_FLOAT32_C( 7.11), SIMDE_FLOAT32_C( 61.92), SIMDE_FLOAT32_C( 44.00), SIMDE_FLOAT32_C( 21.88),
SIMDE_FLOAT32_C( 61.22), SIMDE_FLOAT32_C( 70.75), SIMDE_FLOAT32_C( 44.67), SIMDE_FLOAT32_C( 34.90),
SIMDE_FLOAT32_C( 32.26), SIMDE_FLOAT32_C( 40.94), SIMDE_FLOAT32_C( 75.40), SIMDE_FLOAT32_C( 23.02),
SIMDE_FLOAT32_C( 77.19), SIMDE_FLOAT32_C( 38.89), SIMDE_FLOAT32_C( 25.73), SIMDE_FLOAT32_C( 94.83) },
UINT8_C(143),
{ SIMDE_FLOAT32_C( 14.67), SIMDE_FLOAT32_C( 54.26), SIMDE_FLOAT32_C( 50.08), SIMDE_FLOAT32_C( 40.85),
SIMDE_FLOAT32_C( 63.75), SIMDE_FLOAT32_C( 43.97), SIMDE_FLOAT32_C( 65.71), SIMDE_FLOAT32_C( 49.51),
SIMDE_FLOAT32_C( 91.50), SIMDE_FLOAT32_C( 3.94), SIMDE_FLOAT32_C( 47.35), SIMDE_FLOAT32_C( 86.29),
SIMDE_FLOAT32_C( 16.37), SIMDE_FLOAT32_C( 57.70), SIMDE_FLOAT32_C( 93.40), SIMDE_FLOAT32_C( 78.29) },
{ SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 61.22), SIMDE_FLOAT32_C( 70.75), SIMDE_FLOAT32_C( 44.67), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 32.26), SIMDE_FLOAT32_C( 40.94), SIMDE_FLOAT32_C( 75.40), SIMDE_FLOAT32_C( 23.02),
SIMDE_FLOAT32_C( 77.19), SIMDE_FLOAT32_C( 38.89), SIMDE_FLOAT32_C( 25.73), SIMDE_FLOAT32_C( 94.83) } },
{ { SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 15.27), SIMDE_FLOAT32_C( 39.51), SIMDE_FLOAT32_C( 72.45),
SIMDE_FLOAT32_C( 59.94), SIMDE_FLOAT32_C( 74.41), SIMDE_FLOAT32_C( 4.71), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 49.81), SIMDE_FLOAT32_C( 27.73), SIMDE_FLOAT32_C( 78.08), SIMDE_FLOAT32_C( 88.70),
SIMDE_FLOAT32_C( 53.46), SIMDE_FLOAT32_C( 72.91), SIMDE_FLOAT32_C( 12.47), SIMDE_FLOAT32_C( 68.13) },
UINT8_C(127),
{ SIMDE_FLOAT32_C( 62.56), SIMDE_FLOAT32_C( 8.97), SIMDE_FLOAT32_C( 90.92), SIMDE_FLOAT32_C( 6.53),
SIMDE_FLOAT32_C( 74.69), SIMDE_FLOAT32_C( 40.42), SIMDE_FLOAT32_C( 98.03), SIMDE_FLOAT32_C( 78.63),
SIMDE_FLOAT32_C( 87.77), SIMDE_FLOAT32_C( 84.32), SIMDE_FLOAT32_C( 95.00), SIMDE_FLOAT32_C( 45.47),
SIMDE_FLOAT32_C( 77.72), SIMDE_FLOAT32_C( 73.29), SIMDE_FLOAT32_C( 47.17), SIMDE_FLOAT32_C( 92.99) },
{ SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 49.81), SIMDE_FLOAT32_C( 27.73), SIMDE_FLOAT32_C( 78.08), SIMDE_FLOAT32_C( 88.70),
SIMDE_FLOAT32_C( 53.46), SIMDE_FLOAT32_C( 72.91), SIMDE_FLOAT32_C( 12.47), SIMDE_FLOAT32_C( 68.13) } },
{ { SIMDE_FLOAT32_C( 12.80), SIMDE_FLOAT32_C( 19.62), SIMDE_FLOAT32_C( 52.94), SIMDE_FLOAT32_C( 87.20),
SIMDE_FLOAT32_C( 24.32), SIMDE_FLOAT32_C( 53.82), SIMDE_FLOAT32_C( 37.01), SIMDE_FLOAT32_C( 52.06),
SIMDE_FLOAT32_C( 31.90), SIMDE_FLOAT32_C( 25.71), SIMDE_FLOAT32_C( 5.52), SIMDE_FLOAT32_C( 4.81),
SIMDE_FLOAT32_C( 38.19), SIMDE_FLOAT32_C( 73.64), SIMDE_FLOAT32_C( 31.98), SIMDE_FLOAT32_C( 0.74) },
UINT8_C( 81),
{ SIMDE_FLOAT32_C( 22.90), SIMDE_FLOAT32_C( 7.28), SIMDE_FLOAT32_C( 57.30), SIMDE_FLOAT32_C( 63.32),
SIMDE_FLOAT32_C( 5.31), SIMDE_FLOAT32_C( 35.93), SIMDE_FLOAT32_C( 51.08), SIMDE_FLOAT32_C( 89.63),
SIMDE_FLOAT32_C( 30.93), SIMDE_FLOAT32_C( 96.55), SIMDE_FLOAT32_C( 67.35), SIMDE_FLOAT32_C( 4.22),
SIMDE_FLOAT32_C( 43.72), SIMDE_FLOAT32_C( 60.34), SIMDE_FLOAT32_C( 17.01), SIMDE_FLOAT32_C( 63.33) },
{ SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 19.62), SIMDE_FLOAT32_C( 52.94), SIMDE_FLOAT32_C( 87.20),
SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 53.82), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 52.06),
SIMDE_FLOAT32_C( 31.90), SIMDE_FLOAT32_C( 25.71), SIMDE_FLOAT32_C( 5.52), SIMDE_FLOAT32_C( 4.81),
SIMDE_FLOAT32_C( 38.19), SIMDE_FLOAT32_C( 73.64), SIMDE_FLOAT32_C( 31.98), SIMDE_FLOAT32_C( 0.74) } },
{ { SIMDE_FLOAT32_C( 13.27), SIMDE_FLOAT32_C( 4.22), SIMDE_FLOAT32_C( 87.66), SIMDE_FLOAT32_C( 67.10),
SIMDE_FLOAT32_C( 41.23), SIMDE_FLOAT32_C( 39.71), SIMDE_FLOAT32_C( 99.00), SIMDE_FLOAT32_C( 66.95),
SIMDE_FLOAT32_C( 45.23), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 5.13), SIMDE_FLOAT32_C( 18.87),
SIMDE_FLOAT32_C( 35.79), SIMDE_FLOAT32_C( 5.88), SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( 58.69) },
UINT8_C(192),
{ SIMDE_FLOAT32_C( 58.78), SIMDE_FLOAT32_C( 22.00), SIMDE_FLOAT32_C( 18.46), SIMDE_FLOAT32_C( 94.71),
SIMDE_FLOAT32_C( 73.09), SIMDE_FLOAT32_C( 8.09), SIMDE_FLOAT32_C( 25.64), SIMDE_FLOAT32_C( 69.64),
SIMDE_FLOAT32_C( 75.44), SIMDE_FLOAT32_C( 29.86), SIMDE_FLOAT32_C( 13.36), SIMDE_FLOAT32_C( 35.77),
SIMDE_FLOAT32_C( 46.87), SIMDE_FLOAT32_C( 76.69), SIMDE_FLOAT32_C( 49.05), SIMDE_FLOAT32_C( 51.09) },
{ SIMDE_FLOAT32_C( 13.27), SIMDE_FLOAT32_C( 4.22), SIMDE_FLOAT32_C( 87.66), SIMDE_FLOAT32_C( 67.10),
SIMDE_FLOAT32_C( 41.23), SIMDE_FLOAT32_C( 39.71), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 6.00),
SIMDE_FLOAT32_C( 45.23), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 5.13), SIMDE_FLOAT32_C( 18.87),
SIMDE_FLOAT32_C( 35.79), SIMDE_FLOAT32_C( 5.88), SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( 58.69) } },
{ { SIMDE_FLOAT32_C( 64.34), SIMDE_FLOAT32_C( 16.14), SIMDE_FLOAT32_C( 92.32), SIMDE_FLOAT32_C( 4.06),
SIMDE_FLOAT32_C( 15.14), SIMDE_FLOAT32_C( 59.27), SIMDE_FLOAT32_C( 49.28), SIMDE_FLOAT32_C( 18.96),
SIMDE_FLOAT32_C( 64.40), SIMDE_FLOAT32_C( 68.15), SIMDE_FLOAT32_C( 54.75), SIMDE_FLOAT32_C( 70.28),
SIMDE_FLOAT32_C( 69.63), SIMDE_FLOAT32_C( 13.43), SIMDE_FLOAT32_C( 83.43), SIMDE_FLOAT32_C( 28.42) },
UINT8_C( 42),
{ SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( 23.13), SIMDE_FLOAT32_C( 8.52), SIMDE_FLOAT32_C( 9.98),
SIMDE_FLOAT32_C( 48.77), SIMDE_FLOAT32_C( 78.16), SIMDE_FLOAT32_C( 85.41), SIMDE_FLOAT32_C( 78.63),
SIMDE_FLOAT32_C( 91.52), SIMDE_FLOAT32_C( 21.19), SIMDE_FLOAT32_C( 25.50), SIMDE_FLOAT32_C( 68.21),
SIMDE_FLOAT32_C( 70.23), SIMDE_FLOAT32_C( 76.59), SIMDE_FLOAT32_C( 32.55), SIMDE_FLOAT32_C( 86.38) },
{ SIMDE_FLOAT32_C( 64.34), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 92.32), SIMDE_FLOAT32_C( 3.00),
SIMDE_FLOAT32_C( 15.14), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( 49.28), SIMDE_FLOAT32_C( 18.96),
SIMDE_FLOAT32_C( 64.40), SIMDE_FLOAT32_C( 68.15), SIMDE_FLOAT32_C( 54.75), SIMDE_FLOAT32_C( 70.28),
SIMDE_FLOAT32_C( 69.63), SIMDE_FLOAT32_C( 13.43), SIMDE_FLOAT32_C( 83.43), SIMDE_FLOAT32_C( 28.42) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_logb_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_logb_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 61.06), SIMDE_FLOAT64_C( 56.07), SIMDE_FLOAT64_C( 3.95), SIMDE_FLOAT64_C( 60.43),
SIMDE_FLOAT64_C( 57.40), SIMDE_FLOAT64_C( 69.53), SIMDE_FLOAT64_C( 29.03), SIMDE_FLOAT64_C( 89.93) },
{ SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 5.00),
SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 6.00) } },
{ { SIMDE_FLOAT64_C( 49.22), SIMDE_FLOAT64_C( 9.42), SIMDE_FLOAT64_C( 73.55), SIMDE_FLOAT64_C( 15.48),
SIMDE_FLOAT64_C( 60.82), SIMDE_FLOAT64_C( 84.59), SIMDE_FLOAT64_C( 3.74), SIMDE_FLOAT64_C( 54.66) },
{ SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 3.00),
SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 5.00) } },
{ { SIMDE_FLOAT64_C( 33.37), SIMDE_FLOAT64_C( 75.87), SIMDE_FLOAT64_C( 58.52), SIMDE_FLOAT64_C( 48.59),
SIMDE_FLOAT64_C( 90.24), SIMDE_FLOAT64_C( 63.58), SIMDE_FLOAT64_C( 62.75), SIMDE_FLOAT64_C( 73.90) },
{ SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 5.00),
SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 6.00) } },
{ { SIMDE_FLOAT64_C( 18.87), SIMDE_FLOAT64_C( 24.32), SIMDE_FLOAT64_C( 24.02), SIMDE_FLOAT64_C( 25.17),
SIMDE_FLOAT64_C( 77.02), SIMDE_FLOAT64_C( 14.07), SIMDE_FLOAT64_C( 2.94), SIMDE_FLOAT64_C( 38.08) },
{ SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 4.00),
SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 5.00) } },
{ { SIMDE_FLOAT64_C( 70.14), SIMDE_FLOAT64_C( 6.89), SIMDE_FLOAT64_C( 98.50), SIMDE_FLOAT64_C( 27.53),
SIMDE_FLOAT64_C( 76.42), SIMDE_FLOAT64_C( 27.53), SIMDE_FLOAT64_C( 17.47), SIMDE_FLOAT64_C( 25.65) },
{ SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 4.00),
SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 4.00) } },
{ { SIMDE_FLOAT64_C( 36.95), SIMDE_FLOAT64_C( 91.02), SIMDE_FLOAT64_C( 41.13), SIMDE_FLOAT64_C( 97.76),
SIMDE_FLOAT64_C( 75.61), SIMDE_FLOAT64_C( 44.87), SIMDE_FLOAT64_C( 52.42), SIMDE_FLOAT64_C( 8.99) },
{ SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 6.00),
SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 3.00) } },
{ { SIMDE_FLOAT64_C( 20.74), SIMDE_FLOAT64_C( 10.94), SIMDE_FLOAT64_C( 57.58), SIMDE_FLOAT64_C( 10.98),
SIMDE_FLOAT64_C( 74.52), SIMDE_FLOAT64_C( 20.32), SIMDE_FLOAT64_C( 84.88), SIMDE_FLOAT64_C( 93.39) },
{ SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 3.00),
SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 6.00) } },
{ { SIMDE_FLOAT64_C( 44.64), SIMDE_FLOAT64_C( 8.90), SIMDE_FLOAT64_C( 18.56), SIMDE_FLOAT64_C( 21.66),
SIMDE_FLOAT64_C( 22.97), SIMDE_FLOAT64_C( 21.51), SIMDE_FLOAT64_C( 59.73), SIMDE_FLOAT64_C( 93.10) },
{ SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 4.00),
SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 6.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_logb_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_logb_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 44.91), SIMDE_FLOAT64_C( 88.38), SIMDE_FLOAT64_C( 45.58), SIMDE_FLOAT64_C( 12.77),
SIMDE_FLOAT64_C( 31.32), SIMDE_FLOAT64_C( 50.43), SIMDE_FLOAT64_C( 60.04), SIMDE_FLOAT64_C( 3.47) },
UINT8_C(214),
{ SIMDE_FLOAT64_C( 86.80), SIMDE_FLOAT64_C( 42.80), SIMDE_FLOAT64_C( 69.48), SIMDE_FLOAT64_C( 71.71),
SIMDE_FLOAT64_C( 94.56), SIMDE_FLOAT64_C( 31.31), SIMDE_FLOAT64_C( 74.51), SIMDE_FLOAT64_C( 72.92) },
{ SIMDE_FLOAT64_C( 44.91), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 12.77),
SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 50.43), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 6.00) } },
{ { SIMDE_FLOAT64_C( 29.96), SIMDE_FLOAT64_C( 29.49), SIMDE_FLOAT64_C( 88.44), SIMDE_FLOAT64_C( 26.63),
SIMDE_FLOAT64_C( 15.97), SIMDE_FLOAT64_C( 77.55), SIMDE_FLOAT64_C( 47.96), SIMDE_FLOAT64_C( 96.03) },
UINT8_C( 76),
{ SIMDE_FLOAT64_C( 85.66), SIMDE_FLOAT64_C( 58.61), SIMDE_FLOAT64_C( 61.13), SIMDE_FLOAT64_C( 28.12),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 6.05), SIMDE_FLOAT64_C( 16.50), SIMDE_FLOAT64_C( 45.67) },
{ SIMDE_FLOAT64_C( 29.96), SIMDE_FLOAT64_C( 29.49), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 4.00),
SIMDE_FLOAT64_C( 15.97), SIMDE_FLOAT64_C( 77.55), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 96.03) } },
{ { SIMDE_FLOAT64_C( 18.81), SIMDE_FLOAT64_C( 47.82), SIMDE_FLOAT64_C( 96.10), SIMDE_FLOAT64_C( 78.86),
SIMDE_FLOAT64_C( 51.29), SIMDE_FLOAT64_C( 7.80), SIMDE_FLOAT64_C( 65.66), SIMDE_FLOAT64_C( 94.09) },
UINT8_C( 98),
{ SIMDE_FLOAT64_C( 37.37), SIMDE_FLOAT64_C( 88.65), SIMDE_FLOAT64_C( 8.59), SIMDE_FLOAT64_C( 11.88),
SIMDE_FLOAT64_C( 61.57), SIMDE_FLOAT64_C( 38.54), SIMDE_FLOAT64_C( 41.37), SIMDE_FLOAT64_C( 50.02) },
{ SIMDE_FLOAT64_C( 18.81), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 96.10), SIMDE_FLOAT64_C( 78.86),
SIMDE_FLOAT64_C( 51.29), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 94.09) } },
{ { SIMDE_FLOAT64_C( 65.18), SIMDE_FLOAT64_C( 57.34), SIMDE_FLOAT64_C( 27.56), SIMDE_FLOAT64_C( 13.13),
SIMDE_FLOAT64_C( 53.38), SIMDE_FLOAT64_C( 10.85), SIMDE_FLOAT64_C( 98.80), SIMDE_FLOAT64_C( 11.98) },
UINT8_C(227),
{ SIMDE_FLOAT64_C( 26.92), SIMDE_FLOAT64_C( 12.07), SIMDE_FLOAT64_C( 78.04), SIMDE_FLOAT64_C( 43.42),
SIMDE_FLOAT64_C( 57.74), SIMDE_FLOAT64_C( 96.85), SIMDE_FLOAT64_C( 91.25), SIMDE_FLOAT64_C( 53.84) },
{ SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 27.56), SIMDE_FLOAT64_C( 13.13),
SIMDE_FLOAT64_C( 53.38), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 5.00) } },
{ { SIMDE_FLOAT64_C( 75.71), SIMDE_FLOAT64_C( 42.54), SIMDE_FLOAT64_C( 61.63), SIMDE_FLOAT64_C( 41.37),
SIMDE_FLOAT64_C( 36.63), SIMDE_FLOAT64_C( 38.91), SIMDE_FLOAT64_C( 78.74), SIMDE_FLOAT64_C( 25.28) },
UINT8_C(133),
{ SIMDE_FLOAT64_C( 90.62), SIMDE_FLOAT64_C( 86.86), SIMDE_FLOAT64_C( 86.04), SIMDE_FLOAT64_C( 31.99),
SIMDE_FLOAT64_C( 36.87), SIMDE_FLOAT64_C( 51.22), SIMDE_FLOAT64_C( 89.34), SIMDE_FLOAT64_C( 64.43) },
{ SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 42.54), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 41.37),
SIMDE_FLOAT64_C( 36.63), SIMDE_FLOAT64_C( 38.91), SIMDE_FLOAT64_C( 78.74), SIMDE_FLOAT64_C( 6.00) } },
{ { SIMDE_FLOAT64_C( 64.36), SIMDE_FLOAT64_C( 42.71), SIMDE_FLOAT64_C( 75.29), SIMDE_FLOAT64_C( 63.15),
SIMDE_FLOAT64_C( 54.70), SIMDE_FLOAT64_C( 47.28), SIMDE_FLOAT64_C( 90.08), SIMDE_FLOAT64_C( 66.76) },
UINT8_C(185),
{ SIMDE_FLOAT64_C( 33.50), SIMDE_FLOAT64_C( 24.50), SIMDE_FLOAT64_C( 22.16), SIMDE_FLOAT64_C( 24.75),
SIMDE_FLOAT64_C( 78.34), SIMDE_FLOAT64_C( 97.87), SIMDE_FLOAT64_C( 67.29), SIMDE_FLOAT64_C( 39.97) },
{ SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 42.71), SIMDE_FLOAT64_C( 75.29), SIMDE_FLOAT64_C( 4.00),
SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 90.08), SIMDE_FLOAT64_C( 5.00) } },
{ { SIMDE_FLOAT64_C( 39.24), SIMDE_FLOAT64_C( 3.92), SIMDE_FLOAT64_C( 78.88), SIMDE_FLOAT64_C( 17.98),
SIMDE_FLOAT64_C( 29.20), SIMDE_FLOAT64_C( 26.38), SIMDE_FLOAT64_C( 8.60), SIMDE_FLOAT64_C( 16.06) },
UINT8_C(216),
{ SIMDE_FLOAT64_C( 40.59), SIMDE_FLOAT64_C( 52.93), SIMDE_FLOAT64_C( 63.64), SIMDE_FLOAT64_C( 29.93),
SIMDE_FLOAT64_C( 17.36), SIMDE_FLOAT64_C( 28.00), SIMDE_FLOAT64_C( 72.65), SIMDE_FLOAT64_C( 92.65) },
{ SIMDE_FLOAT64_C( 39.24), SIMDE_FLOAT64_C( 3.92), SIMDE_FLOAT64_C( 78.88), SIMDE_FLOAT64_C( 4.00),
SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 26.38), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 6.00) } },
{ { SIMDE_FLOAT64_C( 91.15), SIMDE_FLOAT64_C( 27.34), SIMDE_FLOAT64_C( 39.93), SIMDE_FLOAT64_C( 81.23),
SIMDE_FLOAT64_C( 94.10), SIMDE_FLOAT64_C( 65.24), SIMDE_FLOAT64_C( 14.73), SIMDE_FLOAT64_C( 18.60) },
UINT8_C(111),
{ SIMDE_FLOAT64_C( 39.48), SIMDE_FLOAT64_C( 96.94), SIMDE_FLOAT64_C( 85.27), SIMDE_FLOAT64_C( 6.77),
SIMDE_FLOAT64_C( 36.91), SIMDE_FLOAT64_C( 24.51), SIMDE_FLOAT64_C( 10.68), SIMDE_FLOAT64_C( 15.79) },
{ SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 6.00), SIMDE_FLOAT64_C( 2.00),
SIMDE_FLOAT64_C( 94.10), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 18.60) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_logb_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_nearbyint_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -914.49), SIMDE_FLOAT32_C( 460.45), SIMDE_FLOAT32_C( -816.31), SIMDE_FLOAT32_C( 969.94),
SIMDE_FLOAT32_C( -904.29), SIMDE_FLOAT32_C( -267.48), SIMDE_FLOAT32_C( -362.84), SIMDE_FLOAT32_C( -10.93),
SIMDE_FLOAT32_C( -124.62), SIMDE_FLOAT32_C( 667.93), SIMDE_FLOAT32_C( 512.15), SIMDE_FLOAT32_C( -37.80),
SIMDE_FLOAT32_C( 894.40), SIMDE_FLOAT32_C( 135.20), SIMDE_FLOAT32_C( -763.47), SIMDE_FLOAT32_C( -593.20) },
{ SIMDE_FLOAT32_C( -914.00), SIMDE_FLOAT32_C( 460.00), SIMDE_FLOAT32_C( -816.00), SIMDE_FLOAT32_C( 970.00),
SIMDE_FLOAT32_C( -904.00), SIMDE_FLOAT32_C( -267.00), SIMDE_FLOAT32_C( -363.00), SIMDE_FLOAT32_C( -11.00),
SIMDE_FLOAT32_C( -125.00), SIMDE_FLOAT32_C( 668.00), SIMDE_FLOAT32_C( 512.00), SIMDE_FLOAT32_C( -38.00),
SIMDE_FLOAT32_C( 894.00), SIMDE_FLOAT32_C( 135.00), SIMDE_FLOAT32_C( -763.00), SIMDE_FLOAT32_C( -593.00) } },
{ { SIMDE_FLOAT32_C( -849.14), SIMDE_FLOAT32_C( 852.22), SIMDE_FLOAT32_C( -400.69), SIMDE_FLOAT32_C( 171.29),
SIMDE_FLOAT32_C( 508.23), SIMDE_FLOAT32_C( -765.53), SIMDE_FLOAT32_C( -382.38), SIMDE_FLOAT32_C( -765.99),
SIMDE_FLOAT32_C( -92.44), SIMDE_FLOAT32_C( 141.65), SIMDE_FLOAT32_C( 748.46), SIMDE_FLOAT32_C( 28.81),
SIMDE_FLOAT32_C( -715.24), SIMDE_FLOAT32_C( -786.64), SIMDE_FLOAT32_C( -54.59), SIMDE_FLOAT32_C( -629.74) },
{ SIMDE_FLOAT32_C( -849.00), SIMDE_FLOAT32_C( 852.00), SIMDE_FLOAT32_C( -401.00), SIMDE_FLOAT32_C( 171.00),
SIMDE_FLOAT32_C( 508.00), SIMDE_FLOAT32_C( -766.00), SIMDE_FLOAT32_C( -382.00), SIMDE_FLOAT32_C( -766.00),
SIMDE_FLOAT32_C( -92.00), SIMDE_FLOAT32_C( 142.00), SIMDE_FLOAT32_C( 748.00), SIMDE_FLOAT32_C( 29.00),
SIMDE_FLOAT32_C( -715.00), SIMDE_FLOAT32_C( -787.00), SIMDE_FLOAT32_C( -55.00), SIMDE_FLOAT32_C( -630.00) } },
{ { SIMDE_FLOAT32_C( 673.81), SIMDE_FLOAT32_C( 129.11), SIMDE_FLOAT32_C( -659.80), SIMDE_FLOAT32_C( 769.52),
SIMDE_FLOAT32_C( 861.62), SIMDE_FLOAT32_C( -22.64), SIMDE_FLOAT32_C( -241.41), SIMDE_FLOAT32_C( -263.00),
SIMDE_FLOAT32_C( -354.71), SIMDE_FLOAT32_C( -729.27), SIMDE_FLOAT32_C( 699.19), SIMDE_FLOAT32_C( -460.31),
SIMDE_FLOAT32_C( 405.93), SIMDE_FLOAT32_C( 935.73), SIMDE_FLOAT32_C( -53.51), SIMDE_FLOAT32_C( 556.79) },
{ SIMDE_FLOAT32_C( 674.00), SIMDE_FLOAT32_C( 129.00), SIMDE_FLOAT32_C( -660.00), SIMDE_FLOAT32_C( 770.00),
SIMDE_FLOAT32_C( 862.00), SIMDE_FLOAT32_C( -23.00), SIMDE_FLOAT32_C( -241.00), SIMDE_FLOAT32_C( -263.00),
SIMDE_FLOAT32_C( -355.00), SIMDE_FLOAT32_C( -729.00), SIMDE_FLOAT32_C( 699.00), SIMDE_FLOAT32_C( -460.00),
SIMDE_FLOAT32_C( 406.00), SIMDE_FLOAT32_C( 936.00), SIMDE_FLOAT32_C( -54.00), SIMDE_FLOAT32_C( 557.00) } },
{ { SIMDE_FLOAT32_C( 787.95), SIMDE_FLOAT32_C( 545.80), SIMDE_FLOAT32_C( -271.92), SIMDE_FLOAT32_C( 296.18),
SIMDE_FLOAT32_C( 780.27), SIMDE_FLOAT32_C( 345.70), SIMDE_FLOAT32_C( 530.19), SIMDE_FLOAT32_C( -312.17),
SIMDE_FLOAT32_C( -512.65), SIMDE_FLOAT32_C( 278.65), SIMDE_FLOAT32_C( 716.64), SIMDE_FLOAT32_C( -227.89),
SIMDE_FLOAT32_C( 492.01), SIMDE_FLOAT32_C( -337.94), SIMDE_FLOAT32_C( 142.37), SIMDE_FLOAT32_C( 165.82) },
{ SIMDE_FLOAT32_C( 788.00), SIMDE_FLOAT32_C( 546.00), SIMDE_FLOAT32_C( -272.00), SIMDE_FLOAT32_C( 296.00),
SIMDE_FLOAT32_C( 780.00), SIMDE_FLOAT32_C( 346.00), SIMDE_FLOAT32_C( 530.00), SIMDE_FLOAT32_C( -312.00),
SIMDE_FLOAT32_C( -513.00), SIMDE_FLOAT32_C( 279.00), SIMDE_FLOAT32_C( 717.00), SIMDE_FLOAT32_C( -228.00),
SIMDE_FLOAT32_C( 492.00), SIMDE_FLOAT32_C( -338.00), SIMDE_FLOAT32_C( 142.00), SIMDE_FLOAT32_C( 166.00) } },
{ { SIMDE_FLOAT32_C( 791.16), SIMDE_FLOAT32_C( 482.57), SIMDE_FLOAT32_C( -64.66), SIMDE_FLOAT32_C( 652.78),
SIMDE_FLOAT32_C( -540.07), SIMDE_FLOAT32_C( 693.92), SIMDE_FLOAT32_C( -610.22), SIMDE_FLOAT32_C( 105.21),
SIMDE_FLOAT32_C( 964.66), SIMDE_FLOAT32_C( -911.03), SIMDE_FLOAT32_C( 644.90), SIMDE_FLOAT32_C( 370.59),
SIMDE_FLOAT32_C( -975.30), SIMDE_FLOAT32_C( -408.60), SIMDE_FLOAT32_C( -72.62), SIMDE_FLOAT32_C( 812.65) },
{ SIMDE_FLOAT32_C( 791.00), SIMDE_FLOAT32_C( 483.00), SIMDE_FLOAT32_C( -65.00), SIMDE_FLOAT32_C( 653.00),
SIMDE_FLOAT32_C( -540.00), SIMDE_FLOAT32_C( 694.00), SIMDE_FLOAT32_C( -610.00), SIMDE_FLOAT32_C( 105.00),
SIMDE_FLOAT32_C( 965.00), SIMDE_FLOAT32_C( -911.00), SIMDE_FLOAT32_C( 645.00), SIMDE_FLOAT32_C( 371.00),
SIMDE_FLOAT32_C( -975.00), SIMDE_FLOAT32_C( -409.00), SIMDE_FLOAT32_C( -73.00), SIMDE_FLOAT32_C( 813.00) } },
{ { SIMDE_FLOAT32_C( -862.80), SIMDE_FLOAT32_C( 655.47), SIMDE_FLOAT32_C( 108.83), SIMDE_FLOAT32_C( 917.47),
SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( -360.98), SIMDE_FLOAT32_C( -394.70), SIMDE_FLOAT32_C( 488.51),
SIMDE_FLOAT32_C( 917.67), SIMDE_FLOAT32_C( -678.06), SIMDE_FLOAT32_C( -739.38), SIMDE_FLOAT32_C( 409.68),
SIMDE_FLOAT32_C( -16.00), SIMDE_FLOAT32_C( 402.99), SIMDE_FLOAT32_C( -424.50), SIMDE_FLOAT32_C( -224.84) },
{ SIMDE_FLOAT32_C( -863.00), SIMDE_FLOAT32_C( 655.00), SIMDE_FLOAT32_C( 109.00), SIMDE_FLOAT32_C( 917.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -361.00), SIMDE_FLOAT32_C( -395.00), SIMDE_FLOAT32_C( 489.00),
SIMDE_FLOAT32_C( 918.00), SIMDE_FLOAT32_C( -678.00), SIMDE_FLOAT32_C( -739.00), SIMDE_FLOAT32_C( 410.00),
SIMDE_FLOAT32_C( -16.00), SIMDE_FLOAT32_C( 403.00), SIMDE_FLOAT32_C( -424.00), SIMDE_FLOAT32_C( -225.00) } },
{ { SIMDE_FLOAT32_C( -114.44), SIMDE_FLOAT32_C( 510.83), SIMDE_FLOAT32_C( -572.05), SIMDE_FLOAT32_C( 345.49),
SIMDE_FLOAT32_C( 204.76), SIMDE_FLOAT32_C( -182.27), SIMDE_FLOAT32_C( -549.30), SIMDE_FLOAT32_C( 169.42),
SIMDE_FLOAT32_C( -93.30), SIMDE_FLOAT32_C( -904.39), SIMDE_FLOAT32_C( -459.99), SIMDE_FLOAT32_C( -68.59),
SIMDE_FLOAT32_C( -313.00), SIMDE_FLOAT32_C( 467.39), SIMDE_FLOAT32_C( -255.94), SIMDE_FLOAT32_C( -175.80) },
{ SIMDE_FLOAT32_C( -114.00), SIMDE_FLOAT32_C( 511.00), SIMDE_FLOAT32_C( -572.00), SIMDE_FLOAT32_C( 345.00),
SIMDE_FLOAT32_C( 205.00), SIMDE_FLOAT32_C( -182.00), SIMDE_FLOAT32_C( -549.00), SIMDE_FLOAT32_C( 169.00),
SIMDE_FLOAT32_C( -93.00), SIMDE_FLOAT32_C( -904.00), SIMDE_FLOAT32_C( -460.00), SIMDE_FLOAT32_C( -69.00),
SIMDE_FLOAT32_C( -313.00), SIMDE_FLOAT32_C( 467.00), SIMDE_FLOAT32_C( -256.00), SIMDE_FLOAT32_C( -176.00) } },
{ { SIMDE_FLOAT32_C( 122.86), SIMDE_FLOAT32_C( 852.89), SIMDE_FLOAT32_C( -258.33), SIMDE_FLOAT32_C( -875.98),
SIMDE_FLOAT32_C( -508.09), SIMDE_FLOAT32_C( 346.97), SIMDE_FLOAT32_C( 612.54), SIMDE_FLOAT32_C( -590.42),
SIMDE_FLOAT32_C( 668.92), SIMDE_FLOAT32_C( 873.16), SIMDE_FLOAT32_C( 819.25), SIMDE_FLOAT32_C( -347.08),
SIMDE_FLOAT32_C( 276.15), SIMDE_FLOAT32_C( -605.25), SIMDE_FLOAT32_C( 428.08), SIMDE_FLOAT32_C( -838.29) },
{ SIMDE_FLOAT32_C( 123.00), SIMDE_FLOAT32_C( 853.00), SIMDE_FLOAT32_C( -258.00), SIMDE_FLOAT32_C( -876.00),
SIMDE_FLOAT32_C( -508.00), SIMDE_FLOAT32_C( 347.00), SIMDE_FLOAT32_C( 613.00), SIMDE_FLOAT32_C( -590.00),
SIMDE_FLOAT32_C( 669.00), SIMDE_FLOAT32_C( 873.00), SIMDE_FLOAT32_C( 819.00), SIMDE_FLOAT32_C( -347.00),
SIMDE_FLOAT32_C( 276.00), SIMDE_FLOAT32_C( -605.00), SIMDE_FLOAT32_C( 428.00), SIMDE_FLOAT32_C( -838.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_nearbyint_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_nearbyint_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 307.14), SIMDE_FLOAT32_C( 482.19), SIMDE_FLOAT32_C( 518.39), SIMDE_FLOAT32_C( 499.59),
SIMDE_FLOAT32_C( -143.12), SIMDE_FLOAT32_C( 912.60), SIMDE_FLOAT32_C( 782.99), SIMDE_FLOAT32_C( -267.99),
SIMDE_FLOAT32_C( -349.63), SIMDE_FLOAT32_C( -394.28), SIMDE_FLOAT32_C( -735.06), SIMDE_FLOAT32_C( 319.94),
SIMDE_FLOAT32_C( -352.44), SIMDE_FLOAT32_C( 639.50), SIMDE_FLOAT32_C( -238.81), SIMDE_FLOAT32_C( 516.17) },
UINT8_C(107),
{ SIMDE_FLOAT32_C( -887.79), SIMDE_FLOAT32_C( 742.81), SIMDE_FLOAT32_C( -913.42), SIMDE_FLOAT32_C( -611.43),
SIMDE_FLOAT32_C( 204.70), SIMDE_FLOAT32_C( 940.63), SIMDE_FLOAT32_C( -825.04), SIMDE_FLOAT32_C( 37.94),
SIMDE_FLOAT32_C( 967.28), SIMDE_FLOAT32_C( -950.31), SIMDE_FLOAT32_C( -916.12), SIMDE_FLOAT32_C( 338.61),
SIMDE_FLOAT32_C( -151.13), SIMDE_FLOAT32_C( -229.02), SIMDE_FLOAT32_C( -354.25), SIMDE_FLOAT32_C( -668.94) },
{ SIMDE_FLOAT32_C( -888.00), SIMDE_FLOAT32_C( 743.00), SIMDE_FLOAT32_C( 518.39), SIMDE_FLOAT32_C( -611.00),
SIMDE_FLOAT32_C( -143.12), SIMDE_FLOAT32_C( 941.00), SIMDE_FLOAT32_C( -825.00), SIMDE_FLOAT32_C( -267.99),
SIMDE_FLOAT32_C( -349.63), SIMDE_FLOAT32_C( -394.28), SIMDE_FLOAT32_C( -735.06), SIMDE_FLOAT32_C( 319.94),
SIMDE_FLOAT32_C( -352.44), SIMDE_FLOAT32_C( 639.50), SIMDE_FLOAT32_C( -238.81), SIMDE_FLOAT32_C( 516.17) } },
{ { SIMDE_FLOAT32_C( -710.63), SIMDE_FLOAT32_C( -854.67), SIMDE_FLOAT32_C( 187.94), SIMDE_FLOAT32_C( -798.03),
SIMDE_FLOAT32_C( 928.32), SIMDE_FLOAT32_C( 919.94), SIMDE_FLOAT32_C( -147.65), SIMDE_FLOAT32_C( -465.96),
SIMDE_FLOAT32_C( -815.12), SIMDE_FLOAT32_C( -827.71), SIMDE_FLOAT32_C( 181.60), SIMDE_FLOAT32_C( 824.38),
SIMDE_FLOAT32_C( -66.52), SIMDE_FLOAT32_C( -302.23), SIMDE_FLOAT32_C( -118.38), SIMDE_FLOAT32_C( 45.69) },
UINT8_C(170),
{ SIMDE_FLOAT32_C( -31.81), SIMDE_FLOAT32_C( 434.25), SIMDE_FLOAT32_C( 645.28), SIMDE_FLOAT32_C( -91.18),
SIMDE_FLOAT32_C( 609.22), SIMDE_FLOAT32_C( -316.78), SIMDE_FLOAT32_C( -123.90), SIMDE_FLOAT32_C( 658.90),
SIMDE_FLOAT32_C( -232.89), SIMDE_FLOAT32_C( -785.30), SIMDE_FLOAT32_C( -492.22), SIMDE_FLOAT32_C( 538.09),
SIMDE_FLOAT32_C( -139.55), SIMDE_FLOAT32_C( -161.16), SIMDE_FLOAT32_C( 827.46), SIMDE_FLOAT32_C( 5.78) },
{ SIMDE_FLOAT32_C( -710.63), SIMDE_FLOAT32_C( 434.00), SIMDE_FLOAT32_C( 187.94), SIMDE_FLOAT32_C( -91.00),
SIMDE_FLOAT32_C( 928.32), SIMDE_FLOAT32_C( -317.00), SIMDE_FLOAT32_C( -147.65), SIMDE_FLOAT32_C( 659.00),
SIMDE_FLOAT32_C( -815.12), SIMDE_FLOAT32_C( -827.71), SIMDE_FLOAT32_C( 181.60), SIMDE_FLOAT32_C( 824.38),
SIMDE_FLOAT32_C( -66.52), SIMDE_FLOAT32_C( -302.23), SIMDE_FLOAT32_C( -118.38), SIMDE_FLOAT32_C( 45.69) } },
{ { SIMDE_FLOAT32_C( -973.23), SIMDE_FLOAT32_C( -970.57), SIMDE_FLOAT32_C( -65.89), SIMDE_FLOAT32_C( 946.72),
SIMDE_FLOAT32_C( -118.22), SIMDE_FLOAT32_C( 468.15), SIMDE_FLOAT32_C( -868.40), SIMDE_FLOAT32_C( 54.07),
SIMDE_FLOAT32_C( -350.25), SIMDE_FLOAT32_C( 955.97), SIMDE_FLOAT32_C( 987.55), SIMDE_FLOAT32_C( 347.52),
SIMDE_FLOAT32_C( -162.41), SIMDE_FLOAT32_C( 33.24), SIMDE_FLOAT32_C( 788.11), SIMDE_FLOAT32_C( 805.78) },
UINT8_C(147),
{ SIMDE_FLOAT32_C( 433.39), SIMDE_FLOAT32_C( -285.40), SIMDE_FLOAT32_C( -923.29), SIMDE_FLOAT32_C( -883.39),
SIMDE_FLOAT32_C( 590.69), SIMDE_FLOAT32_C( 735.61), SIMDE_FLOAT32_C( -116.28), SIMDE_FLOAT32_C( 805.40),
SIMDE_FLOAT32_C( -756.61), SIMDE_FLOAT32_C( -578.19), SIMDE_FLOAT32_C( -334.15), SIMDE_FLOAT32_C( 82.23),
SIMDE_FLOAT32_C( -750.73), SIMDE_FLOAT32_C( 671.63), SIMDE_FLOAT32_C( 109.00), SIMDE_FLOAT32_C( -721.30) },
{ SIMDE_FLOAT32_C( 433.00), SIMDE_FLOAT32_C( -285.00), SIMDE_FLOAT32_C( -65.89), SIMDE_FLOAT32_C( 946.72),
SIMDE_FLOAT32_C( 591.00), SIMDE_FLOAT32_C( 468.15), SIMDE_FLOAT32_C( -868.40), SIMDE_FLOAT32_C( 805.00),
SIMDE_FLOAT32_C( -350.25), SIMDE_FLOAT32_C( 955.97), SIMDE_FLOAT32_C( 987.55), SIMDE_FLOAT32_C( 347.52),
SIMDE_FLOAT32_C( -162.41), SIMDE_FLOAT32_C( 33.24), SIMDE_FLOAT32_C( 788.11), SIMDE_FLOAT32_C( 805.78) } },
{ { SIMDE_FLOAT32_C( -394.26), SIMDE_FLOAT32_C( 55.71), SIMDE_FLOAT32_C( 160.48), SIMDE_FLOAT32_C( -926.11),
SIMDE_FLOAT32_C( 187.31), SIMDE_FLOAT32_C( -785.45), SIMDE_FLOAT32_C( -276.36), SIMDE_FLOAT32_C( 143.28),
SIMDE_FLOAT32_C( -797.89), SIMDE_FLOAT32_C( -928.84), SIMDE_FLOAT32_C( 980.87), SIMDE_FLOAT32_C( 235.35),
SIMDE_FLOAT32_C( 859.27), SIMDE_FLOAT32_C( 786.65), SIMDE_FLOAT32_C( 702.84), SIMDE_FLOAT32_C( 292.65) },
UINT8_C( 5),
{ SIMDE_FLOAT32_C( 779.55), SIMDE_FLOAT32_C( 409.26), SIMDE_FLOAT32_C( -908.05), SIMDE_FLOAT32_C( 515.17),
SIMDE_FLOAT32_C( -707.02), SIMDE_FLOAT32_C( 897.34), SIMDE_FLOAT32_C( 758.56), SIMDE_FLOAT32_C( -285.21),
SIMDE_FLOAT32_C( -436.81), SIMDE_FLOAT32_C( -159.22), SIMDE_FLOAT32_C( -35.94), SIMDE_FLOAT32_C( -765.18),
SIMDE_FLOAT32_C( 949.78), SIMDE_FLOAT32_C( 242.76), SIMDE_FLOAT32_C( -159.44), SIMDE_FLOAT32_C( 5.49) },
{ SIMDE_FLOAT32_C( 780.00), SIMDE_FLOAT32_C( 55.71), SIMDE_FLOAT32_C( -908.00), SIMDE_FLOAT32_C( -926.11),
SIMDE_FLOAT32_C( 187.31), SIMDE_FLOAT32_C( -785.45), SIMDE_FLOAT32_C( -276.36), SIMDE_FLOAT32_C( 143.28),
SIMDE_FLOAT32_C( -797.89), SIMDE_FLOAT32_C( -928.84), SIMDE_FLOAT32_C( 980.87), SIMDE_FLOAT32_C( 235.35),
SIMDE_FLOAT32_C( 859.27), SIMDE_FLOAT32_C( 786.65), SIMDE_FLOAT32_C( 702.84), SIMDE_FLOAT32_C( 292.65) } },
{ { SIMDE_FLOAT32_C( -596.76), SIMDE_FLOAT32_C( -85.56), SIMDE_FLOAT32_C( -807.20), SIMDE_FLOAT32_C( -382.21),
SIMDE_FLOAT32_C( 638.08), SIMDE_FLOAT32_C( 336.09), SIMDE_FLOAT32_C( -180.10), SIMDE_FLOAT32_C( 709.25),
SIMDE_FLOAT32_C( 316.96), SIMDE_FLOAT32_C( -944.76), SIMDE_FLOAT32_C( 568.51), SIMDE_FLOAT32_C( 103.62),
SIMDE_FLOAT32_C( 758.08), SIMDE_FLOAT32_C( -138.83), SIMDE_FLOAT32_C( 604.87), SIMDE_FLOAT32_C( 537.64) },
UINT8_C( 9),
{ SIMDE_FLOAT32_C( 696.82), SIMDE_FLOAT32_C( 52.80), SIMDE_FLOAT32_C( -436.59), SIMDE_FLOAT32_C( 594.16),
SIMDE_FLOAT32_C( -188.64), SIMDE_FLOAT32_C( 278.20), SIMDE_FLOAT32_C( -842.65), SIMDE_FLOAT32_C( 652.14),
SIMDE_FLOAT32_C( -757.74), SIMDE_FLOAT32_C( -607.83), SIMDE_FLOAT32_C( 601.92), SIMDE_FLOAT32_C( 485.02),
SIMDE_FLOAT32_C( 232.73), SIMDE_FLOAT32_C( -392.58), SIMDE_FLOAT32_C( 888.25), SIMDE_FLOAT32_C( -852.82) },
{ SIMDE_FLOAT32_C( 697.00), SIMDE_FLOAT32_C( -85.56), SIMDE_FLOAT32_C( -807.20), SIMDE_FLOAT32_C( 594.00),
SIMDE_FLOAT32_C( 638.08), SIMDE_FLOAT32_C( 336.09), SIMDE_FLOAT32_C( -180.10), SIMDE_FLOAT32_C( 709.25),
SIMDE_FLOAT32_C( 316.96), SIMDE_FLOAT32_C( -944.76), SIMDE_FLOAT32_C( 568.51), SIMDE_FLOAT32_C( 103.62),
SIMDE_FLOAT32_C( 758.08), SIMDE_FLOAT32_C( -138.83), SIMDE_FLOAT32_C( 604.87), SIMDE_FLOAT32_C( 537.64) } },
{ { SIMDE_FLOAT32_C( -199.78), SIMDE_FLOAT32_C( -493.96), SIMDE_FLOAT32_C( 785.26), SIMDE_FLOAT32_C( -863.69),
SIMDE_FLOAT32_C( 325.94), SIMDE_FLOAT32_C( 494.50), SIMDE_FLOAT32_C( 453.27), SIMDE_FLOAT32_C( 381.18),
SIMDE_FLOAT32_C( 63.02), SIMDE_FLOAT32_C( -443.12), SIMDE_FLOAT32_C( 139.26), SIMDE_FLOAT32_C( 924.18),
SIMDE_FLOAT32_C( -838.25), SIMDE_FLOAT32_C( -323.10), SIMDE_FLOAT32_C( -805.38), SIMDE_FLOAT32_C( 858.57) },
UINT8_C(245),
{ SIMDE_FLOAT32_C( -241.97), SIMDE_FLOAT32_C( 452.73), SIMDE_FLOAT32_C( -458.94), SIMDE_FLOAT32_C( -963.77),
SIMDE_FLOAT32_C( 610.08), SIMDE_FLOAT32_C( -806.80), SIMDE_FLOAT32_C( -721.51), SIMDE_FLOAT32_C( -997.75),
SIMDE_FLOAT32_C( 795.12), SIMDE_FLOAT32_C( 763.51), SIMDE_FLOAT32_C( 234.98), SIMDE_FLOAT32_C( -597.47),
SIMDE_FLOAT32_C( 651.76), SIMDE_FLOAT32_C( 382.16), SIMDE_FLOAT32_C( 202.75), SIMDE_FLOAT32_C( -842.20) },
{ SIMDE_FLOAT32_C( -242.00), SIMDE_FLOAT32_C( -493.96), SIMDE_FLOAT32_C( -459.00), SIMDE_FLOAT32_C( -863.69),
SIMDE_FLOAT32_C( 610.00), SIMDE_FLOAT32_C( -807.00), SIMDE_FLOAT32_C( -722.00), SIMDE_FLOAT32_C( -998.00),
SIMDE_FLOAT32_C( 63.02), SIMDE_FLOAT32_C( -443.12), SIMDE_FLOAT32_C( 139.26), SIMDE_FLOAT32_C( 924.18),
SIMDE_FLOAT32_C( -838.25), SIMDE_FLOAT32_C( -323.10), SIMDE_FLOAT32_C( -805.38), SIMDE_FLOAT32_C( 858.57) } },
{ { SIMDE_FLOAT32_C( 167.42), SIMDE_FLOAT32_C( 339.06), SIMDE_FLOAT32_C( 483.74), SIMDE_FLOAT32_C( -338.08),
SIMDE_FLOAT32_C( -207.67), SIMDE_FLOAT32_C( -135.08), SIMDE_FLOAT32_C( 724.94), SIMDE_FLOAT32_C( 349.21),
SIMDE_FLOAT32_C( -995.82), SIMDE_FLOAT32_C( 649.12), SIMDE_FLOAT32_C( 510.96), SIMDE_FLOAT32_C( -318.92),
SIMDE_FLOAT32_C( 843.74), SIMDE_FLOAT32_C( 369.53), SIMDE_FLOAT32_C( -589.22), SIMDE_FLOAT32_C( -398.24) },
UINT8_C( 64),
{ SIMDE_FLOAT32_C( -48.16), SIMDE_FLOAT32_C( -362.01), SIMDE_FLOAT32_C( -567.67), SIMDE_FLOAT32_C( 145.04),
SIMDE_FLOAT32_C( -83.52), SIMDE_FLOAT32_C( -565.41), SIMDE_FLOAT32_C( -59.84), SIMDE_FLOAT32_C( -320.01),
SIMDE_FLOAT32_C( 669.57), SIMDE_FLOAT32_C( 342.69), SIMDE_FLOAT32_C( -668.25), SIMDE_FLOAT32_C( 51.73),
SIMDE_FLOAT32_C( -454.56), SIMDE_FLOAT32_C( -510.45), SIMDE_FLOAT32_C( -780.86), SIMDE_FLOAT32_C( 884.50) },
{ SIMDE_FLOAT32_C( 167.42), SIMDE_FLOAT32_C( 339.06), SIMDE_FLOAT32_C( 483.74), SIMDE_FLOAT32_C( -338.08),
SIMDE_FLOAT32_C( -207.67), SIMDE_FLOAT32_C( -135.08), SIMDE_FLOAT32_C( -60.00), SIMDE_FLOAT32_C( 349.21),
SIMDE_FLOAT32_C( -995.82), SIMDE_FLOAT32_C( 649.12), SIMDE_FLOAT32_C( 510.96), SIMDE_FLOAT32_C( -318.92),
SIMDE_FLOAT32_C( 843.74), SIMDE_FLOAT32_C( 369.53), SIMDE_FLOAT32_C( -589.22), SIMDE_FLOAT32_C( -398.24) } },
{ { SIMDE_FLOAT32_C( 973.29), SIMDE_FLOAT32_C( -118.94), SIMDE_FLOAT32_C( -323.17), SIMDE_FLOAT32_C( -161.78),
SIMDE_FLOAT32_C( -394.00), SIMDE_FLOAT32_C( -973.95), SIMDE_FLOAT32_C( -157.60), SIMDE_FLOAT32_C( -744.88),
SIMDE_FLOAT32_C( 537.01), SIMDE_FLOAT32_C( 523.48), SIMDE_FLOAT32_C( -901.15), SIMDE_FLOAT32_C( -93.46),
SIMDE_FLOAT32_C( 934.26), SIMDE_FLOAT32_C( -299.38), SIMDE_FLOAT32_C( 728.79), SIMDE_FLOAT32_C( -113.90) },
UINT8_C( 86),
{ SIMDE_FLOAT32_C( -838.87), SIMDE_FLOAT32_C( -968.86), SIMDE_FLOAT32_C( -744.90), SIMDE_FLOAT32_C( -404.28),
SIMDE_FLOAT32_C( -28.71), SIMDE_FLOAT32_C( -64.91), SIMDE_FLOAT32_C( -734.71), SIMDE_FLOAT32_C( -686.02),
SIMDE_FLOAT32_C( 266.84), SIMDE_FLOAT32_C( 317.01), SIMDE_FLOAT32_C( -140.57), SIMDE_FLOAT32_C( 756.39),
SIMDE_FLOAT32_C( 536.16), SIMDE_FLOAT32_C( -256.07), SIMDE_FLOAT32_C( 729.69), SIMDE_FLOAT32_C( -582.78) },
{ SIMDE_FLOAT32_C( 973.29), SIMDE_FLOAT32_C( -969.00), SIMDE_FLOAT32_C( -745.00), SIMDE_FLOAT32_C( -161.78),
SIMDE_FLOAT32_C( -29.00), SIMDE_FLOAT32_C( -973.95), SIMDE_FLOAT32_C( -735.00), SIMDE_FLOAT32_C( -744.88),
SIMDE_FLOAT32_C( 537.01), SIMDE_FLOAT32_C( 523.48), SIMDE_FLOAT32_C( -901.15), SIMDE_FLOAT32_C( -93.46),
SIMDE_FLOAT32_C( 934.26), SIMDE_FLOAT32_C( -299.38), SIMDE_FLOAT32_C( 728.79), SIMDE_FLOAT32_C( -113.90) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_nearbyint_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_nearbyint_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -456.37), SIMDE_FLOAT64_C( 239.71), SIMDE_FLOAT64_C( -214.46), SIMDE_FLOAT64_C( -228.66),
SIMDE_FLOAT64_C( -452.56), SIMDE_FLOAT64_C( -734.09), SIMDE_FLOAT64_C( 235.92), SIMDE_FLOAT64_C( 143.86) },
{ SIMDE_FLOAT64_C( -456.00), SIMDE_FLOAT64_C( 240.00), SIMDE_FLOAT64_C( -214.00), SIMDE_FLOAT64_C( -229.00),
SIMDE_FLOAT64_C( -453.00), SIMDE_FLOAT64_C( -734.00), SIMDE_FLOAT64_C( 236.00), SIMDE_FLOAT64_C( 144.00) } },
{ { SIMDE_FLOAT64_C( -285.31), SIMDE_FLOAT64_C( -86.71), SIMDE_FLOAT64_C( 920.29), SIMDE_FLOAT64_C( -690.90),
SIMDE_FLOAT64_C( -912.99), SIMDE_FLOAT64_C( -452.36), SIMDE_FLOAT64_C( -958.90), SIMDE_FLOAT64_C( -103.11) },
{ SIMDE_FLOAT64_C( -285.00), SIMDE_FLOAT64_C( -87.00), SIMDE_FLOAT64_C( 920.00), SIMDE_FLOAT64_C( -691.00),
SIMDE_FLOAT64_C( -913.00), SIMDE_FLOAT64_C( -452.00), SIMDE_FLOAT64_C( -959.00), SIMDE_FLOAT64_C( -103.00) } },
{ { SIMDE_FLOAT64_C( -186.33), SIMDE_FLOAT64_C( -533.97), SIMDE_FLOAT64_C( 740.01), SIMDE_FLOAT64_C( -835.54),
SIMDE_FLOAT64_C( 905.55), SIMDE_FLOAT64_C( 918.31), SIMDE_FLOAT64_C( 254.16), SIMDE_FLOAT64_C( -207.74) },
{ SIMDE_FLOAT64_C( -186.00), SIMDE_FLOAT64_C( -534.00), SIMDE_FLOAT64_C( 740.00), SIMDE_FLOAT64_C( -836.00),
SIMDE_FLOAT64_C( 906.00), SIMDE_FLOAT64_C( 918.00), SIMDE_FLOAT64_C( 254.00), SIMDE_FLOAT64_C( -208.00) } },
{ { SIMDE_FLOAT64_C( -15.89), SIMDE_FLOAT64_C( 697.49), SIMDE_FLOAT64_C( -777.91), SIMDE_FLOAT64_C( -743.01),
SIMDE_FLOAT64_C( 145.93), SIMDE_FLOAT64_C( 408.99), SIMDE_FLOAT64_C( -288.89), SIMDE_FLOAT64_C( 689.55) },
{ SIMDE_FLOAT64_C( -16.00), SIMDE_FLOAT64_C( 697.00), SIMDE_FLOAT64_C( -778.00), SIMDE_FLOAT64_C( -743.00),
SIMDE_FLOAT64_C( 146.00), SIMDE_FLOAT64_C( 409.00), SIMDE_FLOAT64_C( -289.00), SIMDE_FLOAT64_C( 690.00) } },
{ { SIMDE_FLOAT64_C( -351.30), SIMDE_FLOAT64_C( 496.65), SIMDE_FLOAT64_C( -539.11), SIMDE_FLOAT64_C( 196.13),
SIMDE_FLOAT64_C( 762.55), SIMDE_FLOAT64_C( 696.81), SIMDE_FLOAT64_C( -660.01), SIMDE_FLOAT64_C( -522.75) },
{ SIMDE_FLOAT64_C( -351.00), SIMDE_FLOAT64_C( 497.00), SIMDE_FLOAT64_C( -539.00), SIMDE_FLOAT64_C( 196.00),
SIMDE_FLOAT64_C( 763.00), SIMDE_FLOAT64_C( 697.00), SIMDE_FLOAT64_C( -660.00), SIMDE_FLOAT64_C( -523.00) } },
{ { SIMDE_FLOAT64_C( -389.90), SIMDE_FLOAT64_C( -739.72), SIMDE_FLOAT64_C( -213.65), SIMDE_FLOAT64_C( -302.89),
SIMDE_FLOAT64_C( -192.08), SIMDE_FLOAT64_C( -172.55), SIMDE_FLOAT64_C( 594.00), SIMDE_FLOAT64_C( 621.59) },
{ SIMDE_FLOAT64_C( -390.00), SIMDE_FLOAT64_C( -740.00), SIMDE_FLOAT64_C( -214.00), SIMDE_FLOAT64_C( -303.00),
SIMDE_FLOAT64_C( -192.00), SIMDE_FLOAT64_C( -173.00), SIMDE_FLOAT64_C( 594.00), SIMDE_FLOAT64_C( 622.00) } },
{ { SIMDE_FLOAT64_C( 293.48), SIMDE_FLOAT64_C( 334.01), SIMDE_FLOAT64_C( 786.05), SIMDE_FLOAT64_C( 199.03),
SIMDE_FLOAT64_C( 252.33), SIMDE_FLOAT64_C( 40.22), SIMDE_FLOAT64_C( 991.29), SIMDE_FLOAT64_C( -763.57) },
{ SIMDE_FLOAT64_C( 293.00), SIMDE_FLOAT64_C( 334.00), SIMDE_FLOAT64_C( 786.00), SIMDE_FLOAT64_C( 199.00),
SIMDE_FLOAT64_C( 252.00), SIMDE_FLOAT64_C( 40.00), SIMDE_FLOAT64_C( 991.00), SIMDE_FLOAT64_C( -764.00) } },
{ { SIMDE_FLOAT64_C( -262.29), SIMDE_FLOAT64_C( -786.62), SIMDE_FLOAT64_C( -506.58), SIMDE_FLOAT64_C( 883.63),
SIMDE_FLOAT64_C( 622.37), SIMDE_FLOAT64_C( 204.53), SIMDE_FLOAT64_C( 573.19), SIMDE_FLOAT64_C( -728.93) },
{ SIMDE_FLOAT64_C( -262.00), SIMDE_FLOAT64_C( -787.00), SIMDE_FLOAT64_C( -507.00), SIMDE_FLOAT64_C( 884.00),
SIMDE_FLOAT64_C( 622.00), SIMDE_FLOAT64_C( 205.00), SIMDE_FLOAT64_C( 573.00), SIMDE_FLOAT64_C( -729.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_nearbyint_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_nearbyint_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 381.89), SIMDE_FLOAT64_C( -277.28), SIMDE_FLOAT64_C( -400.11), SIMDE_FLOAT64_C( -872.84),
SIMDE_FLOAT64_C( -66.17), SIMDE_FLOAT64_C( -250.32), SIMDE_FLOAT64_C( -214.38), SIMDE_FLOAT64_C( 965.87) },
UINT8_C(106),
{ SIMDE_FLOAT64_C( 141.85), SIMDE_FLOAT64_C( 88.38), SIMDE_FLOAT64_C( -374.04), SIMDE_FLOAT64_C( 906.38),
SIMDE_FLOAT64_C( 851.98), SIMDE_FLOAT64_C( -170.13), SIMDE_FLOAT64_C( -142.10), SIMDE_FLOAT64_C( -367.42) },
{ SIMDE_FLOAT64_C( 381.89), SIMDE_FLOAT64_C( 88.00), SIMDE_FLOAT64_C( -400.11), SIMDE_FLOAT64_C( 906.00),
SIMDE_FLOAT64_C( -66.17), SIMDE_FLOAT64_C( -170.00), SIMDE_FLOAT64_C( -142.00), SIMDE_FLOAT64_C( 965.87) } },
{ { SIMDE_FLOAT64_C( 49.27), SIMDE_FLOAT64_C( 950.21), SIMDE_FLOAT64_C( 214.00), SIMDE_FLOAT64_C( 575.74),
SIMDE_FLOAT64_C( -350.82), SIMDE_FLOAT64_C( 512.95), SIMDE_FLOAT64_C( -227.13), SIMDE_FLOAT64_C( -609.67) },
UINT8_C( 61),
{ SIMDE_FLOAT64_C( 586.44), SIMDE_FLOAT64_C( 381.99), SIMDE_FLOAT64_C( 608.18), SIMDE_FLOAT64_C( 184.92),
SIMDE_FLOAT64_C( -474.55), SIMDE_FLOAT64_C( -9.93), SIMDE_FLOAT64_C( 907.64), SIMDE_FLOAT64_C( 125.34) },
{ SIMDE_FLOAT64_C( 586.00), SIMDE_FLOAT64_C( 950.21), SIMDE_FLOAT64_C( 608.00), SIMDE_FLOAT64_C( 185.00),
SIMDE_FLOAT64_C( -475.00), SIMDE_FLOAT64_C( -10.00), SIMDE_FLOAT64_C( -227.13), SIMDE_FLOAT64_C( -609.67) } },
{ { SIMDE_FLOAT64_C( 117.23), SIMDE_FLOAT64_C( -158.52), SIMDE_FLOAT64_C( 875.02), SIMDE_FLOAT64_C( 902.85),
SIMDE_FLOAT64_C( -192.66), SIMDE_FLOAT64_C( -256.64), SIMDE_FLOAT64_C( 44.70), SIMDE_FLOAT64_C( 895.72) },
UINT8_C(180),
{ SIMDE_FLOAT64_C( -48.92), SIMDE_FLOAT64_C( 747.70), SIMDE_FLOAT64_C( -800.80), SIMDE_FLOAT64_C( 808.98),
SIMDE_FLOAT64_C( -619.73), SIMDE_FLOAT64_C( 248.47), SIMDE_FLOAT64_C( 759.18), SIMDE_FLOAT64_C( 594.28) },
{ SIMDE_FLOAT64_C( 117.23), SIMDE_FLOAT64_C( -158.52), SIMDE_FLOAT64_C( -801.00), SIMDE_FLOAT64_C( 902.85),
SIMDE_FLOAT64_C( -620.00), SIMDE_FLOAT64_C( 248.00), SIMDE_FLOAT64_C( 44.70), SIMDE_FLOAT64_C( 594.00) } },
{ { SIMDE_FLOAT64_C( -175.78), SIMDE_FLOAT64_C( -591.64), SIMDE_FLOAT64_C( 107.22), SIMDE_FLOAT64_C( 597.09),
SIMDE_FLOAT64_C( -201.31), SIMDE_FLOAT64_C( -742.21), SIMDE_FLOAT64_C( 183.53), SIMDE_FLOAT64_C( -819.31) },
UINT8_C(241),
{ SIMDE_FLOAT64_C( -631.55), SIMDE_FLOAT64_C( -293.87), SIMDE_FLOAT64_C( -143.96), SIMDE_FLOAT64_C( -723.91),
SIMDE_FLOAT64_C( 831.47), SIMDE_FLOAT64_C( 973.27), SIMDE_FLOAT64_C( 117.57), SIMDE_FLOAT64_C( 706.49) },
{ SIMDE_FLOAT64_C( -632.00), SIMDE_FLOAT64_C( -591.64), SIMDE_FLOAT64_C( 107.22), SIMDE_FLOAT64_C( 597.09),
SIMDE_FLOAT64_C( 831.00), SIMDE_FLOAT64_C( 973.00), SIMDE_FLOAT64_C( 118.00), SIMDE_FLOAT64_C( 706.00) } },
{ { SIMDE_FLOAT64_C( 876.13), SIMDE_FLOAT64_C( 924.91), SIMDE_FLOAT64_C( -550.14), SIMDE_FLOAT64_C( -79.17),
SIMDE_FLOAT64_C( 820.63), SIMDE_FLOAT64_C( 819.19), SIMDE_FLOAT64_C( 871.91), SIMDE_FLOAT64_C( 568.33) },
UINT8_C(250),
{ SIMDE_FLOAT64_C( 680.89), SIMDE_FLOAT64_C( 948.60), SIMDE_FLOAT64_C( 266.86), SIMDE_FLOAT64_C( 440.07),
SIMDE_FLOAT64_C( 542.88), SIMDE_FLOAT64_C( -908.92), SIMDE_FLOAT64_C( 848.43), SIMDE_FLOAT64_C( -349.90) },
{ SIMDE_FLOAT64_C( 876.13), SIMDE_FLOAT64_C( 949.00), SIMDE_FLOAT64_C( -550.14), SIMDE_FLOAT64_C( 440.00),
SIMDE_FLOAT64_C( 543.00), SIMDE_FLOAT64_C( -909.00), SIMDE_FLOAT64_C( 848.00), SIMDE_FLOAT64_C( -350.00) } },
{ { SIMDE_FLOAT64_C( 688.16), SIMDE_FLOAT64_C( -352.87), SIMDE_FLOAT64_C( -92.11), SIMDE_FLOAT64_C( -128.31),
SIMDE_FLOAT64_C( -172.19), SIMDE_FLOAT64_C( -226.14), SIMDE_FLOAT64_C( 240.14), SIMDE_FLOAT64_C( 533.94) },
UINT8_C( 61),
{ SIMDE_FLOAT64_C( 516.23), SIMDE_FLOAT64_C( 365.42), SIMDE_FLOAT64_C( 603.18), SIMDE_FLOAT64_C( -366.20),
SIMDE_FLOAT64_C( 71.91), SIMDE_FLOAT64_C( 479.30), SIMDE_FLOAT64_C( -441.29), SIMDE_FLOAT64_C( 521.77) },
{ SIMDE_FLOAT64_C( 516.00), SIMDE_FLOAT64_C( -352.87), SIMDE_FLOAT64_C( 603.00), SIMDE_FLOAT64_C( -366.00),
SIMDE_FLOAT64_C( 72.00), SIMDE_FLOAT64_C( 479.00), SIMDE_FLOAT64_C( 240.14), SIMDE_FLOAT64_C( 533.94) } },
{ { SIMDE_FLOAT64_C( -599.87), SIMDE_FLOAT64_C( -620.66), SIMDE_FLOAT64_C( 340.95), SIMDE_FLOAT64_C( -727.96),
SIMDE_FLOAT64_C( 947.67), SIMDE_FLOAT64_C( 359.34), SIMDE_FLOAT64_C( 952.92), SIMDE_FLOAT64_C( 896.27) },
UINT8_C( 22),
{ SIMDE_FLOAT64_C( 392.99), SIMDE_FLOAT64_C( 439.14), SIMDE_FLOAT64_C( -282.72), SIMDE_FLOAT64_C( 241.43),
SIMDE_FLOAT64_C( -910.76), SIMDE_FLOAT64_C( -594.56), SIMDE_FLOAT64_C( 888.55), SIMDE_FLOAT64_C( -2.87) },
{ SIMDE_FLOAT64_C( -599.87), SIMDE_FLOAT64_C( 439.00), SIMDE_FLOAT64_C( -283.00), SIMDE_FLOAT64_C( -727.96),
SIMDE_FLOAT64_C( -911.00), SIMDE_FLOAT64_C( 359.34), SIMDE_FLOAT64_C( 952.92), SIMDE_FLOAT64_C( 896.27) } },
{ { SIMDE_FLOAT64_C( 277.14), SIMDE_FLOAT64_C( -283.64), SIMDE_FLOAT64_C( 770.99), SIMDE_FLOAT64_C( -482.72),
SIMDE_FLOAT64_C( -749.69), SIMDE_FLOAT64_C( 400.90), SIMDE_FLOAT64_C( -966.49), SIMDE_FLOAT64_C( 615.72) },
UINT8_C(173),
{ SIMDE_FLOAT64_C( -332.68), SIMDE_FLOAT64_C( -312.37), SIMDE_FLOAT64_C( -516.63), SIMDE_FLOAT64_C( 226.03),
SIMDE_FLOAT64_C( -790.60), SIMDE_FLOAT64_C( -116.50), SIMDE_FLOAT64_C( 605.37), SIMDE_FLOAT64_C( 550.35) },
{ SIMDE_FLOAT64_C( -333.00), SIMDE_FLOAT64_C( -283.64), SIMDE_FLOAT64_C( -517.00), SIMDE_FLOAT64_C( 226.00),
SIMDE_FLOAT64_C( -749.69), SIMDE_FLOAT64_C( -116.00), SIMDE_FLOAT64_C( -966.49), SIMDE_FLOAT64_C( 550.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_nearbyint_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_pow_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 b[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 3.20), SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 1.48) },
{ SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 3.01), SIMDE_FLOAT32_C( 3.83) },
{ SIMDE_FLOAT32_C( 1.79), SIMDE_FLOAT32_C( 2.30), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 4.49) } },
{ { SIMDE_FLOAT32_C( 4.49), SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( 4.20), SIMDE_FLOAT32_C( 3.10) },
{ SIMDE_FLOAT32_C( 2.65), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 1.17) },
{ SIMDE_FLOAT32_C( 53.51), SIMDE_FLOAT32_C( 1.23), SIMDE_FLOAT32_C( 1.07), SIMDE_FLOAT32_C( 3.76) } },
{ { SIMDE_FLOAT32_C( 3.21), SIMDE_FLOAT32_C( 4.91), SIMDE_FLOAT32_C( 4.05), SIMDE_FLOAT32_C( 0.12) },
{ SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 2.91), SIMDE_FLOAT32_C( 4.46) },
{ SIMDE_FLOAT32_C( 4.61), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 58.57), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 3.77), SIMDE_FLOAT32_C( 3.76), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 2.56) },
{ SIMDE_FLOAT32_C( 1.47), SIMDE_FLOAT32_C( 3.39), SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( 4.67) },
{ SIMDE_FLOAT32_C( 7.03), SIMDE_FLOAT32_C( 89.10), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 80.63) } },
{ { SIMDE_FLOAT32_C( 2.81), SIMDE_FLOAT32_C( 4.23), SIMDE_FLOAT32_C( 1.15), SIMDE_FLOAT32_C( 3.31) },
{ SIMDE_FLOAT32_C( 4.79), SIMDE_FLOAT32_C( 4.15), SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 4.28) },
{ SIMDE_FLOAT32_C( 141.03), SIMDE_FLOAT32_C( 397.48), SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 167.83) } },
{ { SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 2.38), SIMDE_FLOAT32_C( 3.04) },
{ SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 2.43), SIMDE_FLOAT32_C( 4.21), SIMDE_FLOAT32_C( 0.52) },
{ SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 2.04), SIMDE_FLOAT32_C( 38.49), SIMDE_FLOAT32_C( 1.78) } },
{ { SIMDE_FLOAT32_C( 2.34), SIMDE_FLOAT32_C( 3.26), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 3.65) },
{ SIMDE_FLOAT32_C( 3.26), SIMDE_FLOAT32_C( 3.55), SIMDE_FLOAT32_C( 3.11), SIMDE_FLOAT32_C( 2.03) },
{ SIMDE_FLOAT32_C( 15.98), SIMDE_FLOAT32_C( 66.36), SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 13.85) } },
{ { SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 3.31), SIMDE_FLOAT32_C( 4.59), SIMDE_FLOAT32_C( 3.78) },
{ SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 2.88), SIMDE_FLOAT32_C( 3.45), SIMDE_FLOAT32_C( 4.50) },
{ SIMDE_FLOAT32_C( 4.12), SIMDE_FLOAT32_C( 31.41), SIMDE_FLOAT32_C( 191.98), SIMDE_FLOAT32_C( 396.93) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 b = simde_mm_loadu_ps(test_vec[i].b);
simde__m128 r = simde_mm_pow_ps(a, b);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_pow_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 b[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 6.86), SIMDE_FLOAT64_C( 4.28) },
{ SIMDE_FLOAT64_C( 2.99), SIMDE_FLOAT64_C( 7.45) },
{ SIMDE_FLOAT64_C( 316.67), SIMDE_FLOAT64_C( 50612.30) } },
{ { SIMDE_FLOAT64_C( 7.72), SIMDE_FLOAT64_C( 8.36) },
{ SIMDE_FLOAT64_C( 4.17), SIMDE_FLOAT64_C( 1.82) },
{ SIMDE_FLOAT64_C( 5027.64), SIMDE_FLOAT64_C( 47.69) } },
{ { SIMDE_FLOAT64_C( 9.11), SIMDE_FLOAT64_C( 6.23) },
{ SIMDE_FLOAT64_C( 1.26), SIMDE_FLOAT64_C( 4.65) },
{ SIMDE_FLOAT64_C( 16.18), SIMDE_FLOAT64_C( 4947.31) } },
{ { SIMDE_FLOAT64_C( 2.75), SIMDE_FLOAT64_C( 7.48) },
{ SIMDE_FLOAT64_C( 0.85), SIMDE_FLOAT64_C( 0.71) },
{ SIMDE_FLOAT64_C( 2.36), SIMDE_FLOAT64_C( 4.17) } },
{ { SIMDE_FLOAT64_C( 5.91), SIMDE_FLOAT64_C( 7.19) },
{ SIMDE_FLOAT64_C( 1.19), SIMDE_FLOAT64_C( 5.92) },
{ SIMDE_FLOAT64_C( 8.28), SIMDE_FLOAT64_C(117987.24) } },
{ { SIMDE_FLOAT64_C( 5.42), SIMDE_FLOAT64_C( 3.06) },
{ SIMDE_FLOAT64_C( 9.46), SIMDE_FLOAT64_C( 0.23) },
{ SIMDE_FLOAT64_C(8782805.21), SIMDE_FLOAT64_C( 1.29) } },
{ { SIMDE_FLOAT64_C( 6.88), SIMDE_FLOAT64_C( 9.69) },
{ SIMDE_FLOAT64_C( 2.44), SIMDE_FLOAT64_C( 7.03) },
{ SIMDE_FLOAT64_C( 110.59), SIMDE_FLOAT64_C(8587290.46) } },
{ { SIMDE_FLOAT64_C( 9.85), SIMDE_FLOAT64_C( 1.85) },
{ SIMDE_FLOAT64_C( 1.77), SIMDE_FLOAT64_C( 6.71) },
{ SIMDE_FLOAT64_C( 57.33), SIMDE_FLOAT64_C( 62.05) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d b = simde_mm_loadu_pd(test_vec[i].b);
simde__m128d r = simde_mm_pow_pd(a, b);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_pow_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 b[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 4.47), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 4.19), SIMDE_FLOAT32_C( 4.26) },
{ SIMDE_FLOAT32_C( 4.92), SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 2.15),
SIMDE_FLOAT32_C( 3.59), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 3.79), SIMDE_FLOAT32_C( 3.42) },
{ SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 328.62), SIMDE_FLOAT32_C( 0.55),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 228.13), SIMDE_FLOAT32_C( 142.10) } },
{ { SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( 4.06), SIMDE_FLOAT32_C( 2.24), SIMDE_FLOAT32_C( 3.04),
SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 1.49), SIMDE_FLOAT32_C( 2.02) },
{ SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 3.82), SIMDE_FLOAT32_C( 0.42),
SIMDE_FLOAT32_C( 3.50), SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 4.31) },
{ SIMDE_FLOAT32_C( 2.26), SIMDE_FLOAT32_C( 1.91), SIMDE_FLOAT32_C( 21.77), SIMDE_FLOAT32_C( 1.60),
SIMDE_FLOAT32_C( 1.15), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 20.70) } },
{ { SIMDE_FLOAT32_C( 2.29), SIMDE_FLOAT32_C( 4.91), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 2.94),
SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 4.26), SIMDE_FLOAT32_C( 2.20), SIMDE_FLOAT32_C( 0.66) },
{ SIMDE_FLOAT32_C( 1.59), SIMDE_FLOAT32_C( 1.07), SIMDE_FLOAT32_C( 2.81), SIMDE_FLOAT32_C( 0.18),
SIMDE_FLOAT32_C( 1.83), SIMDE_FLOAT32_C( 1.60), SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 0.55) },
{ SIMDE_FLOAT32_C( 3.73), SIMDE_FLOAT32_C( 5.49), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.21),
SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 10.16), SIMDE_FLOAT32_C( 17.22), SIMDE_FLOAT32_C( 0.80) } },
{ { SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 3.59), SIMDE_FLOAT32_C( 1.70),
SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( 1.70) },
{ SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 2.12), SIMDE_FLOAT32_C( 4.04),
SIMDE_FLOAT32_C( 3.92), SIMDE_FLOAT32_C( 2.56), SIMDE_FLOAT32_C( 3.35), SIMDE_FLOAT32_C( 1.21) },
{ SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 15.02), SIMDE_FLOAT32_C( 8.53),
SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 81.53), SIMDE_FLOAT32_C( 1.90) } },
{ { SIMDE_FLOAT32_C( 2.46), SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( 4.15), SIMDE_FLOAT32_C( 3.21),
SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 4.27) },
{ SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 1.68), SIMDE_FLOAT32_C( 4.45), SIMDE_FLOAT32_C( 4.25),
SIMDE_FLOAT32_C( 3.28), SIMDE_FLOAT32_C( 3.06), SIMDE_FLOAT32_C( 4.80), SIMDE_FLOAT32_C( 3.94) },
{ SIMDE_FLOAT32_C( 8.83), SIMDE_FLOAT32_C( 7.89), SIMDE_FLOAT32_C( 562.75), SIMDE_FLOAT32_C( 142.12),
SIMDE_FLOAT32_C( 25.37), SIMDE_FLOAT32_C( 2.51), SIMDE_FLOAT32_C( 662.24), SIMDE_FLOAT32_C( 304.71) } },
{ { SIMDE_FLOAT32_C( 3.90), SIMDE_FLOAT32_C( 3.39), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 4.98),
SIMDE_FLOAT32_C( 3.46), SIMDE_FLOAT32_C( 4.35), SIMDE_FLOAT32_C( 1.68), SIMDE_FLOAT32_C( 3.99) },
{ SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( 3.80), SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 1.37), SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( 3.82) },
{ SIMDE_FLOAT32_C( 13.10), SIMDE_FLOAT32_C( 103.46), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 3.67),
SIMDE_FLOAT32_C( 5.41), SIMDE_FLOAT32_C( 7.49), SIMDE_FLOAT32_C( 2.85), SIMDE_FLOAT32_C( 197.57) } },
{ { SIMDE_FLOAT32_C( 4.79), SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 2.03), SIMDE_FLOAT32_C( 2.47),
SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 4.95) },
{ SIMDE_FLOAT32_C( 2.58), SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 4.20), SIMDE_FLOAT32_C( 0.86),
SIMDE_FLOAT32_C( 4.24), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 4.80), SIMDE_FLOAT32_C( 3.14) },
{ SIMDE_FLOAT32_C( 56.92), SIMDE_FLOAT32_C( 1.22), SIMDE_FLOAT32_C( 19.57), SIMDE_FLOAT32_C( 2.18),
SIMDE_FLOAT32_C( 51.20), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 13.89), SIMDE_FLOAT32_C( 151.73) } },
{ { SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 3.12), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( 4.79), SIMDE_FLOAT32_C( 4.80), SIMDE_FLOAT32_C( 4.84), SIMDE_FLOAT32_C( 1.67) },
{ SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 2.87), SIMDE_FLOAT32_C( 2.48), SIMDE_FLOAT32_C( 4.96),
SIMDE_FLOAT32_C( 4.24), SIMDE_FLOAT32_C( 4.50), SIMDE_FLOAT32_C( 3.79), SIMDE_FLOAT32_C( 4.03) },
{ SIMDE_FLOAT32_C( 23.23), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 16.81), SIMDE_FLOAT32_C( 0.42),
SIMDE_FLOAT32_C( 766.69), SIMDE_FLOAT32_C( 1163.02), SIMDE_FLOAT32_C( 394.06), SIMDE_FLOAT32_C( 7.90) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 b = simde_mm256_loadu_ps(test_vec[i].b);
simde__m256 r = simde_mm256_pow_ps(a, b);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_pow_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 b[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 7.17), SIMDE_FLOAT64_C( 4.56), SIMDE_FLOAT64_C( 5.81), SIMDE_FLOAT64_C( 1.86) },
{ SIMDE_FLOAT64_C( 7.20), SIMDE_FLOAT64_C( 2.88), SIMDE_FLOAT64_C( 6.56), SIMDE_FLOAT64_C( 0.87) },
{ SIMDE_FLOAT64_C(1444567.77), SIMDE_FLOAT64_C( 79.03), SIMDE_FLOAT64_C(103037.53), SIMDE_FLOAT64_C( 1.72) } },
{ { SIMDE_FLOAT64_C( 6.39), SIMDE_FLOAT64_C( 1.20), SIMDE_FLOAT64_C( 4.73), SIMDE_FLOAT64_C( 0.14) },
{ SIMDE_FLOAT64_C( 9.00), SIMDE_FLOAT64_C( 7.96), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.99) },
{ SIMDE_FLOAT64_C(17762648.57), SIMDE_FLOAT64_C( 4.27), SIMDE_FLOAT64_C( 1.17), SIMDE_FLOAT64_C( 0.14) } },
{ { SIMDE_FLOAT64_C( 1.05), SIMDE_FLOAT64_C( 6.55), SIMDE_FLOAT64_C( 5.85), SIMDE_FLOAT64_C( 2.38) },
{ SIMDE_FLOAT64_C( 7.70), SIMDE_FLOAT64_C( 1.92), SIMDE_FLOAT64_C( 2.76), SIMDE_FLOAT64_C( 9.17) },
{ SIMDE_FLOAT64_C( 1.46), SIMDE_FLOAT64_C( 36.91), SIMDE_FLOAT64_C( 131.02), SIMDE_FLOAT64_C( 2839.30) } },
{ { SIMDE_FLOAT64_C( 1.22), SIMDE_FLOAT64_C( 3.47), SIMDE_FLOAT64_C( 2.69), SIMDE_FLOAT64_C( 4.53) },
{ SIMDE_FLOAT64_C( 8.94), SIMDE_FLOAT64_C( 7.35), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 6.10) },
{ SIMDE_FLOAT64_C( 5.92), SIMDE_FLOAT64_C( 9363.14), SIMDE_FLOAT64_C( 1.94), SIMDE_FLOAT64_C( 10050.76) } },
{ { SIMDE_FLOAT64_C( 1.91), SIMDE_FLOAT64_C( 6.48), SIMDE_FLOAT64_C( 7.96), SIMDE_FLOAT64_C( 9.11) },
{ SIMDE_FLOAT64_C( 9.36), SIMDE_FLOAT64_C( 4.52), SIMDE_FLOAT64_C( 9.98), SIMDE_FLOAT64_C( 5.75) },
{ SIMDE_FLOAT64_C( 427.04), SIMDE_FLOAT64_C( 4659.28), SIMDE_FLOAT64_C(979743556.72), SIMDE_FLOAT64_C(329026.34) } },
{ { SIMDE_FLOAT64_C( 5.73), SIMDE_FLOAT64_C( 4.71), SIMDE_FLOAT64_C( 5.89), SIMDE_FLOAT64_C( 4.73) },
{ SIMDE_FLOAT64_C( 2.67), SIMDE_FLOAT64_C( 5.99), SIMDE_FLOAT64_C( 5.71), SIMDE_FLOAT64_C( 3.72) },
{ SIMDE_FLOAT64_C( 105.75), SIMDE_FLOAT64_C( 10749.67), SIMDE_FLOAT64_C( 24966.54), SIMDE_FLOAT64_C( 323.95) } },
{ { SIMDE_FLOAT64_C( 2.54), SIMDE_FLOAT64_C( 1.56), SIMDE_FLOAT64_C( 6.10), SIMDE_FLOAT64_C( 0.24) },
{ SIMDE_FLOAT64_C( 3.48), SIMDE_FLOAT64_C( 8.87), SIMDE_FLOAT64_C( 9.41), SIMDE_FLOAT64_C( 4.71) },
{ SIMDE_FLOAT64_C( 25.63), SIMDE_FLOAT64_C( 51.64), SIMDE_FLOAT64_C(24544475.02), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 2.33), SIMDE_FLOAT64_C( 2.10), SIMDE_FLOAT64_C( 9.23), SIMDE_FLOAT64_C( 1.27) },
{ SIMDE_FLOAT64_C( 9.45), SIMDE_FLOAT64_C( 9.90), SIMDE_FLOAT64_C( 7.37), SIMDE_FLOAT64_C( 1.37) },
{ SIMDE_FLOAT64_C( 2961.51), SIMDE_FLOAT64_C( 1548.71), SIMDE_FLOAT64_C(12987828.24), SIMDE_FLOAT64_C( 1.39) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d b = simde_mm256_loadu_pd(test_vec[i].b);
simde__m256d r = simde_mm256_pow_pd(a, b);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_pow_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 b[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 2.61), SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( 3.22),
SIMDE_FLOAT32_C( 4.10), SIMDE_FLOAT32_C( 4.95), SIMDE_FLOAT32_C( 4.92), SIMDE_FLOAT32_C( 4.08),
SIMDE_FLOAT32_C( 4.52), SIMDE_FLOAT32_C( 2.15), SIMDE_FLOAT32_C( 4.72), SIMDE_FLOAT32_C( 1.20),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 4.45), SIMDE_FLOAT32_C( 4.31), SIMDE_FLOAT32_C( 1.66) },
{ SIMDE_FLOAT32_C( 4.65), SIMDE_FLOAT32_C( 2.28), SIMDE_FLOAT32_C( 1.70), SIMDE_FLOAT32_C( 3.82),
SIMDE_FLOAT32_C( 1.78), SIMDE_FLOAT32_C( 2.73), SIMDE_FLOAT32_C( 1.38), SIMDE_FLOAT32_C( 3.44),
SIMDE_FLOAT32_C( 1.09), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 3.09), SIMDE_FLOAT32_C( 3.82),
SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( 1.67), SIMDE_FLOAT32_C( 5.00) },
{ SIMDE_FLOAT32_C( 86.57), SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 9.25), SIMDE_FLOAT32_C( 87.10),
SIMDE_FLOAT32_C( 12.32), SIMDE_FLOAT32_C( 78.75), SIMDE_FLOAT32_C( 9.01), SIMDE_FLOAT32_C( 126.09),
SIMDE_FLOAT32_C( 5.18), SIMDE_FLOAT32_C( 3.82), SIMDE_FLOAT32_C( 120.92), SIMDE_FLOAT32_C( 2.01),
SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 35.45), SIMDE_FLOAT32_C( 11.47), SIMDE_FLOAT32_C( 12.60) } },
{ { SIMDE_FLOAT32_C( 3.47), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 3.22), SIMDE_FLOAT32_C( 2.57),
SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 3.14), SIMDE_FLOAT32_C( 1.64), SIMDE_FLOAT32_C( 4.84),
SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 1.08),
SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 2.74), SIMDE_FLOAT32_C( 0.46) },
{ SIMDE_FLOAT32_C( 2.63), SIMDE_FLOAT32_C( 4.43), SIMDE_FLOAT32_C( 4.28), SIMDE_FLOAT32_C( 4.41),
SIMDE_FLOAT32_C( 2.17), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 2.85), SIMDE_FLOAT32_C( 3.26),
SIMDE_FLOAT32_C( 2.41), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 2.08), SIMDE_FLOAT32_C( 4.80),
SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 3.75), SIMDE_FLOAT32_C( 4.79), SIMDE_FLOAT32_C( 1.80) },
{ SIMDE_FLOAT32_C( 26.37), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 149.15), SIMDE_FLOAT32_C( 64.24),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 2.13), SIMDE_FLOAT32_C( 4.10), SIMDE_FLOAT32_C( 170.84),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 1.34), SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 1.45),
SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 124.98), SIMDE_FLOAT32_C( 0.25) } },
{ { SIMDE_FLOAT32_C( 4.12), SIMDE_FLOAT32_C( 3.01), SIMDE_FLOAT32_C( 4.36), SIMDE_FLOAT32_C( 4.44),
SIMDE_FLOAT32_C( 1.15), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 4.28), SIMDE_FLOAT32_C( 1.43),
SIMDE_FLOAT32_C( 2.36), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 2.51), SIMDE_FLOAT32_C( 3.17),
SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 3.64), SIMDE_FLOAT32_C( 3.30) },
{ SIMDE_FLOAT32_C( 4.68), SIMDE_FLOAT32_C( 2.92), SIMDE_FLOAT32_C( 2.71), SIMDE_FLOAT32_C( 1.85),
SIMDE_FLOAT32_C( 3.57), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.98),
SIMDE_FLOAT32_C( 1.49), SIMDE_FLOAT32_C( 2.19), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 4.82),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 1.62), SIMDE_FLOAT32_C( 0.05) },
{ SIMDE_FLOAT32_C( 754.60), SIMDE_FLOAT32_C( 24.97), SIMDE_FLOAT32_C( 54.08), SIMDE_FLOAT32_C( 15.76),
SIMDE_FLOAT32_C( 1.65), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.17), SIMDE_FLOAT32_C( 1.42),
SIMDE_FLOAT32_C( 3.59), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( 260.08),
SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 8.11), SIMDE_FLOAT32_C( 1.06) } },
{ { SIMDE_FLOAT32_C( 3.58), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 4.49), SIMDE_FLOAT32_C( 4.73),
SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 3.77), SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 4.35),
SIMDE_FLOAT32_C( 4.09), SIMDE_FLOAT32_C( 3.67), SIMDE_FLOAT32_C( 2.52), SIMDE_FLOAT32_C( 4.75),
SIMDE_FLOAT32_C( 3.91), SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 3.05), SIMDE_FLOAT32_C( 3.59) },
{ SIMDE_FLOAT32_C( 4.07), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 2.64),
SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 3.62), SIMDE_FLOAT32_C( 2.81),
SIMDE_FLOAT32_C( 2.73), SIMDE_FLOAT32_C( 4.40), SIMDE_FLOAT32_C( 2.63), SIMDE_FLOAT32_C( 3.67),
SIMDE_FLOAT32_C( 4.97), SIMDE_FLOAT32_C( 4.25), SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( 3.55) },
{ SIMDE_FLOAT32_C( 179.60), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 1.94), SIMDE_FLOAT32_C( 60.48),
SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 2.07), SIMDE_FLOAT32_C( 1.71), SIMDE_FLOAT32_C( 62.25),
SIMDE_FLOAT32_C( 46.77), SIMDE_FLOAT32_C( 305.16), SIMDE_FLOAT32_C( 11.37), SIMDE_FLOAT32_C( 304.41),
SIMDE_FLOAT32_C( 877.24), SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 63.33), SIMDE_FLOAT32_C( 93.45) } },
{ { SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 3.21), SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( 2.21),
SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 4.45), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 1.07),
SIMDE_FLOAT32_C( 3.11), SIMDE_FLOAT32_C( 4.08), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 2.03),
SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 4.31) },
{ SIMDE_FLOAT32_C( 4.63), SIMDE_FLOAT32_C( 1.06), SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( 1.60), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 3.75), SIMDE_FLOAT32_C( 4.34),
SIMDE_FLOAT32_C( 4.98), SIMDE_FLOAT32_C( 1.38), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 4.95),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 3.50), SIMDE_FLOAT32_C( 0.85) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 3.44), SIMDE_FLOAT32_C( 10.32), SIMDE_FLOAT32_C( 2.11),
SIMDE_FLOAT32_C( 2.98), SIMDE_FLOAT32_C( 2.38), SIMDE_FLOAT32_C( 5.30), SIMDE_FLOAT32_C( 1.34),
SIMDE_FLOAT32_C( 284.41), SIMDE_FLOAT32_C( 6.96), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 33.27),
SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 10.39), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 3.46) } },
{ { SIMDE_FLOAT32_C( 4.94), SIMDE_FLOAT32_C( 1.79), SIMDE_FLOAT32_C( 3.06), SIMDE_FLOAT32_C( 1.92),
SIMDE_FLOAT32_C( 1.23), SIMDE_FLOAT32_C( 4.63), SIMDE_FLOAT32_C( 2.99), SIMDE_FLOAT32_C( 4.35),
SIMDE_FLOAT32_C( 3.71), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 1.38), SIMDE_FLOAT32_C( 3.95),
SIMDE_FLOAT32_C( 2.68), SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 3.26), SIMDE_FLOAT32_C( 2.31) },
{ SIMDE_FLOAT32_C( 3.05), SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 3.25), SIMDE_FLOAT32_C( 4.65),
SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( 3.99), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( 3.38), SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 4.00),
SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( 4.23), SIMDE_FLOAT32_C( 4.85), SIMDE_FLOAT32_C( 3.66) },
{ SIMDE_FLOAT32_C( 130.58), SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( 37.90), SIMDE_FLOAT32_C( 20.77),
SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 21.44), SIMDE_FLOAT32_C( 79.05), SIMDE_FLOAT32_C( 3.15),
SIMDE_FLOAT32_C( 84.04), SIMDE_FLOAT32_C( 14.32), SIMDE_FLOAT32_C( 1.27), SIMDE_FLOAT32_C( 243.44),
SIMDE_FLOAT32_C( 39.14), SIMDE_FLOAT32_C( 18.37), SIMDE_FLOAT32_C( 308.39), SIMDE_FLOAT32_C( 21.42) } },
{ { SIMDE_FLOAT32_C( 1.02), SIMDE_FLOAT32_C( 2.92), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 2.25),
SIMDE_FLOAT32_C( 2.54), SIMDE_FLOAT32_C( 3.57), SIMDE_FLOAT32_C( 1.60), SIMDE_FLOAT32_C( 1.25),
SIMDE_FLOAT32_C( 2.37), SIMDE_FLOAT32_C( 2.98), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 4.97), SIMDE_FLOAT32_C( 3.46), SIMDE_FLOAT32_C( 2.36), SIMDE_FLOAT32_C( 3.02) },
{ SIMDE_FLOAT32_C( 3.68), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 2.67), SIMDE_FLOAT32_C( 4.48),
SIMDE_FLOAT32_C( 2.62), SIMDE_FLOAT32_C( 1.66), SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 3.65), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 2.37),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 4.85), SIMDE_FLOAT32_C( 1.03), SIMDE_FLOAT32_C( 1.23) },
{ SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 1.94), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 37.82),
SIMDE_FLOAT32_C( 11.50), SIMDE_FLOAT32_C( 8.27), SIMDE_FLOAT32_C( 1.12), SIMDE_FLOAT32_C( 1.25),
SIMDE_FLOAT32_C( 23.33), SIMDE_FLOAT32_C( 2.92), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 1.40), SIMDE_FLOAT32_C( 411.64), SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 3.89) } },
{ { SIMDE_FLOAT32_C( 2.77), SIMDE_FLOAT32_C( 1.62), SIMDE_FLOAT32_C( 3.48), SIMDE_FLOAT32_C( 0.31),
SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( 2.56),
SIMDE_FLOAT32_C( 3.06), SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 2.61), SIMDE_FLOAT32_C( 3.03),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 4.97), SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( 3.90) },
{ SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 3.72), SIMDE_FLOAT32_C( 3.38), SIMDE_FLOAT32_C( 3.21),
SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 3.63), SIMDE_FLOAT32_C( 4.20), SIMDE_FLOAT32_C( 4.03),
SIMDE_FLOAT32_C( 4.61), SIMDE_FLOAT32_C( 4.20), SIMDE_FLOAT32_C( 1.41), SIMDE_FLOAT32_C( 4.82),
SIMDE_FLOAT32_C( 4.05), SIMDE_FLOAT32_C( 2.44), SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 1.82) },
{ SIMDE_FLOAT32_C( 1.82), SIMDE_FLOAT32_C( 6.02), SIMDE_FLOAT32_C( 67.69), SIMDE_FLOAT32_C( 0.02),
SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 6.47), SIMDE_FLOAT32_C( 44.18),
SIMDE_FLOAT32_C( 173.45), SIMDE_FLOAT32_C( 10.74), SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 209.20),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 50.02), SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( 11.91) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 b = simde_mm512_loadu_ps(test_vec[i].b);
simde__m512 r = simde_mm512_pow_ps(a, b);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_pow_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 b[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 3.50), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 1.87), SIMDE_FLOAT32_C( 2.52),
SIMDE_FLOAT32_C( 3.74), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 4.86),
SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( 3.89),
SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 3.79), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 3.85) },
UINT8_C( 81),
{ SIMDE_FLOAT32_C( 3.43), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 2.56), SIMDE_FLOAT32_C( 0.44),
SIMDE_FLOAT32_C( 3.38), SIMDE_FLOAT32_C( 3.30), SIMDE_FLOAT32_C( 3.55), SIMDE_FLOAT32_C( 3.99),
SIMDE_FLOAT32_C( 2.88), SIMDE_FLOAT32_C( 1.85), SIMDE_FLOAT32_C( 2.70), SIMDE_FLOAT32_C( 1.54),
SIMDE_FLOAT32_C( 2.63), SIMDE_FLOAT32_C( 4.93), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 3.48) },
{ SIMDE_FLOAT32_C( 1.79), SIMDE_FLOAT32_C( 2.57), SIMDE_FLOAT32_C( 2.22), SIMDE_FLOAT32_C( 1.87),
SIMDE_FLOAT32_C( 3.53), SIMDE_FLOAT32_C( 2.08), SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 4.18),
SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( 4.38), SIMDE_FLOAT32_C( 2.79),
SIMDE_FLOAT32_C( 2.71), SIMDE_FLOAT32_C( 3.23), SIMDE_FLOAT32_C( 4.72), SIMDE_FLOAT32_C( 1.13) },
{ SIMDE_FLOAT32_C( 9.08), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 1.87), SIMDE_FLOAT32_C( 2.52),
SIMDE_FLOAT32_C( 73.63), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 46.47), SIMDE_FLOAT32_C( 4.86),
SIMDE_FLOAT32_C( 1.16), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( 3.89),
SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 3.79), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 3.85) } },
{ { SIMDE_FLOAT32_C( 3.40), SIMDE_FLOAT32_C( 2.28), SIMDE_FLOAT32_C( 1.57), SIMDE_FLOAT32_C( 1.78),
SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 3.46),
SIMDE_FLOAT32_C( 1.97), SIMDE_FLOAT32_C( 3.46), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 4.59),
SIMDE_FLOAT32_C( 3.39), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 0.19) },
UINT8_C(140),
{ SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 2.05), SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( 2.37),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 1.37), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 4.70), SIMDE_FLOAT32_C( 4.16), SIMDE_FLOAT32_C( 4.71), SIMDE_FLOAT32_C( 2.93),
SIMDE_FLOAT32_C( 3.88), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( 1.15) },
{ SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 3.11), SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 2.54),
SIMDE_FLOAT32_C( 3.88), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 4.51), SIMDE_FLOAT32_C( 2.34),
SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 4.10), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.24),
SIMDE_FLOAT32_C( 2.17), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 2.85), SIMDE_FLOAT32_C( 2.46) },
{ SIMDE_FLOAT32_C( 3.40), SIMDE_FLOAT32_C( 2.28), SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( 8.95),
SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 5.06),
SIMDE_FLOAT32_C( 1.97), SIMDE_FLOAT32_C( 3.46), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 4.59),
SIMDE_FLOAT32_C( 3.39), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 0.19) } },
{ { SIMDE_FLOAT32_C( 2.97), SIMDE_FLOAT32_C( 3.99), SIMDE_FLOAT32_C( 4.84), SIMDE_FLOAT32_C( 3.05),
SIMDE_FLOAT32_C( 4.32), SIMDE_FLOAT32_C( 1.21), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 4.03),
SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 4.77), SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 4.25),
SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 3.03) },
UINT8_C(179),
{ SIMDE_FLOAT32_C( 2.13), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( 2.31),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 2.61), SIMDE_FLOAT32_C( 2.50), SIMDE_FLOAT32_C( 4.17),
SIMDE_FLOAT32_C( 3.34), SIMDE_FLOAT32_C( 2.74), SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( 4.26),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 2.23), SIMDE_FLOAT32_C( 4.58) },
{ SIMDE_FLOAT32_C( 3.65), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 3.90), SIMDE_FLOAT32_C( 4.86),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 2.93), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( 4.89), SIMDE_FLOAT32_C( 4.48), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 3.17),
SIMDE_FLOAT32_C( 4.88), SIMDE_FLOAT32_C( 3.76), SIMDE_FLOAT32_C( 4.57), SIMDE_FLOAT32_C( 2.00) },
{ SIMDE_FLOAT32_C( 15.80), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 4.84), SIMDE_FLOAT32_C( 3.05),
SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 16.62), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 1.17),
SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 4.77), SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 4.25),
SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 3.29), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 3.03) } },
{ { SIMDE_FLOAT32_C( 4.33), SIMDE_FLOAT32_C( 4.84), SIMDE_FLOAT32_C( 4.31), SIMDE_FLOAT32_C( 4.40),
SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 1.82), SIMDE_FLOAT32_C( 3.58), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( 4.55), SIMDE_FLOAT32_C( 4.92), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.14),
SIMDE_FLOAT32_C( 3.74), SIMDE_FLOAT32_C( 2.29), SIMDE_FLOAT32_C( 4.73), SIMDE_FLOAT32_C( 2.39) },
UINT8_C( 97),
{ SIMDE_FLOAT32_C( 3.63), SIMDE_FLOAT32_C( 2.25), SIMDE_FLOAT32_C( 2.92), SIMDE_FLOAT32_C( 1.56),
SIMDE_FLOAT32_C( 2.48), SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 1.45), SIMDE_FLOAT32_C( 1.96),
SIMDE_FLOAT32_C( 3.76), SIMDE_FLOAT32_C( 4.62), SIMDE_FLOAT32_C( 1.83), SIMDE_FLOAT32_C( 2.51),
SIMDE_FLOAT32_C( 4.19), SIMDE_FLOAT32_C( 3.83), SIMDE_FLOAT32_C( 1.84), SIMDE_FLOAT32_C( 4.03) },
{ SIMDE_FLOAT32_C( 3.15), SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( 4.97),
SIMDE_FLOAT32_C( 4.82), SIMDE_FLOAT32_C( 2.27), SIMDE_FLOAT32_C( 4.52), SIMDE_FLOAT32_C( 4.75),
SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( 4.66), SIMDE_FLOAT32_C( 3.48), SIMDE_FLOAT32_C( 4.61),
SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 2.19), SIMDE_FLOAT32_C( 3.02) },
{ SIMDE_FLOAT32_C( 58.04), SIMDE_FLOAT32_C( 4.84), SIMDE_FLOAT32_C( 4.31), SIMDE_FLOAT32_C( 4.40),
SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 12.38), SIMDE_FLOAT32_C( 5.36), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( 4.55), SIMDE_FLOAT32_C( 4.92), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.14),
SIMDE_FLOAT32_C( 3.74), SIMDE_FLOAT32_C( 2.29), SIMDE_FLOAT32_C( 4.73), SIMDE_FLOAT32_C( 2.39) } },
{ { SIMDE_FLOAT32_C( 3.11), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 4.58), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 3.13), SIMDE_FLOAT32_C( 1.02), SIMDE_FLOAT32_C( 2.55), SIMDE_FLOAT32_C( 1.89),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 4.38), SIMDE_FLOAT32_C( 4.40), SIMDE_FLOAT32_C( 4.83),
SIMDE_FLOAT32_C( 3.22), SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( 3.86), SIMDE_FLOAT32_C( 1.37) },
UINT8_C( 99),
{ SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 2.61),
SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 2.06), SIMDE_FLOAT32_C( 4.94), SIMDE_FLOAT32_C( 0.51),
SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 4.55), SIMDE_FLOAT32_C( 4.90), SIMDE_FLOAT32_C( 1.41),
SIMDE_FLOAT32_C( 1.73), SIMDE_FLOAT32_C( 2.92), SIMDE_FLOAT32_C( 4.52), SIMDE_FLOAT32_C( 1.84) },
{ SIMDE_FLOAT32_C( 2.50), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 4.97), SIMDE_FLOAT32_C( 3.52),
SIMDE_FLOAT32_C( 2.66), SIMDE_FLOAT32_C( 1.86), SIMDE_FLOAT32_C( 4.17), SIMDE_FLOAT32_C( 2.05),
SIMDE_FLOAT32_C( 1.27), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 2.51),
SIMDE_FLOAT32_C( 2.86), SIMDE_FLOAT32_C( 1.63), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 3.20) },
{ SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 1.03), SIMDE_FLOAT32_C( 4.58), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 3.13), SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( 781.34), SIMDE_FLOAT32_C( 1.89),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 4.38), SIMDE_FLOAT32_C( 4.40), SIMDE_FLOAT32_C( 4.83),
SIMDE_FLOAT32_C( 3.22), SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( 3.86), SIMDE_FLOAT32_C( 1.37) } },
{ { SIMDE_FLOAT32_C( 2.96), SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 3.81),
SIMDE_FLOAT32_C( 4.36), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 4.32), SIMDE_FLOAT32_C( 4.90),
SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 4.22), SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 2.03),
SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 4.64) },
UINT8_C(216),
{ SIMDE_FLOAT32_C( 3.84), SIMDE_FLOAT32_C( 3.16), SIMDE_FLOAT32_C( 3.61), SIMDE_FLOAT32_C( 0.70),
SIMDE_FLOAT32_C( 2.33), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 1.97), SIMDE_FLOAT32_C( 1.32),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 4.48), SIMDE_FLOAT32_C( 4.19), SIMDE_FLOAT32_C( 2.54),
SIMDE_FLOAT32_C( 4.47), SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 1.78) },
{ SIMDE_FLOAT32_C( 3.20), SIMDE_FLOAT32_C( 4.31), SIMDE_FLOAT32_C( 1.14), SIMDE_FLOAT32_C( 3.95),
SIMDE_FLOAT32_C( 3.63), SIMDE_FLOAT32_C( 1.05), SIMDE_FLOAT32_C( 4.25), SIMDE_FLOAT32_C( 2.86),
SIMDE_FLOAT32_C( 2.35), SIMDE_FLOAT32_C( 1.28), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 3.18),
SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 4.64), SIMDE_FLOAT32_C( 4.13), SIMDE_FLOAT32_C( 3.99) },
{ SIMDE_FLOAT32_C( 2.96), SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.24),
SIMDE_FLOAT32_C( 21.55), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 17.84), SIMDE_FLOAT32_C( 2.21),
SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 4.22), SIMDE_FLOAT32_C( 1.31), SIMDE_FLOAT32_C( 2.03),
SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 3.87), SIMDE_FLOAT32_C( 4.64) } },
{ { SIMDE_FLOAT32_C( 2.80), SIMDE_FLOAT32_C( 2.73), SIMDE_FLOAT32_C( 4.69), SIMDE_FLOAT32_C( 0.13),
SIMDE_FLOAT32_C( 3.38), SIMDE_FLOAT32_C( 1.66), SIMDE_FLOAT32_C( 1.45), SIMDE_FLOAT32_C( 4.29),
SIMDE_FLOAT32_C( 1.13), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 1.83), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 2.34), SIMDE_FLOAT32_C( 2.38), SIMDE_FLOAT32_C( 1.23) },
UINT8_C(247),
{ SIMDE_FLOAT32_C( 3.53), SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 4.57),
SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 3.14), SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 0.70),
SIMDE_FLOAT32_C( 3.14), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 2.78),
SIMDE_FLOAT32_C( 4.23), SIMDE_FLOAT32_C( 4.84), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 1.97) },
{ SIMDE_FLOAT32_C( 4.52), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 1.18),
SIMDE_FLOAT32_C( 2.17), SIMDE_FLOAT32_C( 4.64), SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 2.80),
SIMDE_FLOAT32_C( 1.48), SIMDE_FLOAT32_C( 2.92), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 3.81),
SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 2.06), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 3.83) },
{ SIMDE_FLOAT32_C( 299.19), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.13),
SIMDE_FLOAT32_C( 25.15), SIMDE_FLOAT32_C( 202.19), SIMDE_FLOAT32_C( 4.57), SIMDE_FLOAT32_C( 0.37),
SIMDE_FLOAT32_C( 1.13), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 1.83), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 3.03), SIMDE_FLOAT32_C( 2.34), SIMDE_FLOAT32_C( 2.38), SIMDE_FLOAT32_C( 1.23) } },
{ { SIMDE_FLOAT32_C( 2.23), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 3.40), SIMDE_FLOAT32_C( 1.66),
SIMDE_FLOAT32_C( 3.88), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 2.36), SIMDE_FLOAT32_C( 2.02),
SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 3.21), SIMDE_FLOAT32_C( 4.81), SIMDE_FLOAT32_C( 4.67),
SIMDE_FLOAT32_C( 3.04), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 1.63), SIMDE_FLOAT32_C( 2.57) },
UINT8_C(207),
{ SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 3.75), SIMDE_FLOAT32_C( 3.27), SIMDE_FLOAT32_C( 1.62),
SIMDE_FLOAT32_C( 1.06), SIMDE_FLOAT32_C( 1.08), SIMDE_FLOAT32_C( 3.10), SIMDE_FLOAT32_C( 3.97),
SIMDE_FLOAT32_C( 1.91), SIMDE_FLOAT32_C( 1.91), SIMDE_FLOAT32_C( 4.27), SIMDE_FLOAT32_C( 3.96),
SIMDE_FLOAT32_C( 2.36), SIMDE_FLOAT32_C( 3.10), SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( 3.10) },
{ SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 2.85), SIMDE_FLOAT32_C( 1.98), SIMDE_FLOAT32_C( 1.82),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 4.00), SIMDE_FLOAT32_C( 2.26), SIMDE_FLOAT32_C( 3.41),
SIMDE_FLOAT32_C( 3.81), SIMDE_FLOAT32_C( 1.92), SIMDE_FLOAT32_C( 1.46), SIMDE_FLOAT32_C( 4.20),
SIMDE_FLOAT32_C( 3.56), SIMDE_FLOAT32_C( 4.02), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 0.54) },
{ SIMDE_FLOAT32_C( 2.79), SIMDE_FLOAT32_C( 43.25), SIMDE_FLOAT32_C( 10.44), SIMDE_FLOAT32_C( 2.41),
SIMDE_FLOAT32_C( 3.88), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 12.90), SIMDE_FLOAT32_C( 110.12),
SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 3.21), SIMDE_FLOAT32_C( 4.81), SIMDE_FLOAT32_C( 4.67),
SIMDE_FLOAT32_C( 3.04), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 1.63), SIMDE_FLOAT32_C( 2.57) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 b = simde_mm512_loadu_ps(test_vec[i].b);
simde__m512 r = simde_mm512_mask_pow_ps(src, test_vec[i].k, a, b);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_pow_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 b[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 4.13), SIMDE_FLOAT64_C( 8.81), SIMDE_FLOAT64_C( 2.27), SIMDE_FLOAT64_C( 5.77),
SIMDE_FLOAT64_C( 3.43), SIMDE_FLOAT64_C( 9.71), SIMDE_FLOAT64_C( 1.86), SIMDE_FLOAT64_C( 0.10) },
{ SIMDE_FLOAT64_C( 8.72), SIMDE_FLOAT64_C( 9.17), SIMDE_FLOAT64_C( 7.13), SIMDE_FLOAT64_C( 1.02),
SIMDE_FLOAT64_C( 3.40), SIMDE_FLOAT64_C( 5.53), SIMDE_FLOAT64_C( 2.12), SIMDE_FLOAT64_C( 0.29) },
{ SIMDE_FLOAT64_C(235008.98), SIMDE_FLOAT64_C(462838076.60), SIMDE_FLOAT64_C( 345.51), SIMDE_FLOAT64_C( 5.98),
SIMDE_FLOAT64_C( 66.07), SIMDE_FLOAT64_C(287953.49), SIMDE_FLOAT64_C( 3.73), SIMDE_FLOAT64_C( 0.51) } },
{ { SIMDE_FLOAT64_C( 4.06), SIMDE_FLOAT64_C( 3.82), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 7.27),
SIMDE_FLOAT64_C( 4.30), SIMDE_FLOAT64_C( 3.31), SIMDE_FLOAT64_C( 6.31), SIMDE_FLOAT64_C( 8.11) },
{ SIMDE_FLOAT64_C( 1.51), SIMDE_FLOAT64_C( 1.05), SIMDE_FLOAT64_C( 6.76), SIMDE_FLOAT64_C( 9.20),
SIMDE_FLOAT64_C( 5.39), SIMDE_FLOAT64_C( 5.09), SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( 9.52) },
{ SIMDE_FLOAT64_C( 8.30), SIMDE_FLOAT64_C( 4.08), SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C(84356116.88),
SIMDE_FLOAT64_C( 2596.54), SIMDE_FLOAT64_C( 442.51), SIMDE_FLOAT64_C( 1.37), SIMDE_FLOAT64_C(450690633.16) } },
{ { SIMDE_FLOAT64_C( 3.90), SIMDE_FLOAT64_C( 2.44), SIMDE_FLOAT64_C( 5.29), SIMDE_FLOAT64_C( 7.33),
SIMDE_FLOAT64_C( 2.15), SIMDE_FLOAT64_C( 7.16), SIMDE_FLOAT64_C( 7.43), SIMDE_FLOAT64_C( 0.86) },
{ SIMDE_FLOAT64_C( 6.32), SIMDE_FLOAT64_C( 4.56), SIMDE_FLOAT64_C( 1.88), SIMDE_FLOAT64_C( 9.72),
SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 4.00), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 4.15) },
{ SIMDE_FLOAT64_C( 5439.12), SIMDE_FLOAT64_C( 58.41), SIMDE_FLOAT64_C( 22.91), SIMDE_FLOAT64_C(256336608.20),
SIMDE_FLOAT64_C( 1.07), SIMDE_FLOAT64_C( 2628.16), SIMDE_FLOAT64_C( 1.02), SIMDE_FLOAT64_C( 0.53) } },
{ { SIMDE_FLOAT64_C( 7.82), SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 1.42), SIMDE_FLOAT64_C( 2.12),
SIMDE_FLOAT64_C( 3.99), SIMDE_FLOAT64_C( 7.73), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 5.49) },
{ SIMDE_FLOAT64_C( 8.77), SIMDE_FLOAT64_C( 6.98), SIMDE_FLOAT64_C( 4.70), SIMDE_FLOAT64_C( 4.16),
SIMDE_FLOAT64_C( 2.08), SIMDE_FLOAT64_C( 4.87), SIMDE_FLOAT64_C( 3.68), SIMDE_FLOAT64_C( 5.98) },
{ SIMDE_FLOAT64_C(68143309.86), SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 5.20), SIMDE_FLOAT64_C( 22.78),
SIMDE_FLOAT64_C( 17.78), SIMDE_FLOAT64_C( 21156.03), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 26463.22) } },
{ { SIMDE_FLOAT64_C( 7.30), SIMDE_FLOAT64_C( 8.98), SIMDE_FLOAT64_C( 3.31), SIMDE_FLOAT64_C( 9.45),
SIMDE_FLOAT64_C( 6.13), SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( 0.31), SIMDE_FLOAT64_C( 2.46) },
{ SIMDE_FLOAT64_C( 5.30), SIMDE_FLOAT64_C( 2.18), SIMDE_FLOAT64_C( 2.18), SIMDE_FLOAT64_C( 5.38),
SIMDE_FLOAT64_C( 6.18), SIMDE_FLOAT64_C( 2.19), SIMDE_FLOAT64_C( 9.53), SIMDE_FLOAT64_C( 4.00) },
{ SIMDE_FLOAT64_C( 37636.67), SIMDE_FLOAT64_C( 119.71), SIMDE_FLOAT64_C( 13.59), SIMDE_FLOAT64_C(176938.82),
SIMDE_FLOAT64_C( 73536.97), SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 36.62) } },
{ { SIMDE_FLOAT64_C( 2.87), SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( 6.12), SIMDE_FLOAT64_C( 6.85),
SIMDE_FLOAT64_C( 8.67), SIMDE_FLOAT64_C( 6.34), SIMDE_FLOAT64_C( 2.35), SIMDE_FLOAT64_C( 7.45) },
{ SIMDE_FLOAT64_C( 3.33), SIMDE_FLOAT64_C( 7.04), SIMDE_FLOAT64_C( 1.61), SIMDE_FLOAT64_C( 5.40),
SIMDE_FLOAT64_C( 1.91), SIMDE_FLOAT64_C( 5.30), SIMDE_FLOAT64_C( 1.38), SIMDE_FLOAT64_C( 9.21) },
{ SIMDE_FLOAT64_C( 33.48), SIMDE_FLOAT64_C( 0.70), SIMDE_FLOAT64_C( 18.48), SIMDE_FLOAT64_C( 32563.35),
SIMDE_FLOAT64_C( 61.89), SIMDE_FLOAT64_C( 17826.79), SIMDE_FLOAT64_C( 3.25), SIMDE_FLOAT64_C(107785234.77) } },
{ { SIMDE_FLOAT64_C( 4.27), SIMDE_FLOAT64_C( 4.69), SIMDE_FLOAT64_C( 8.66), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( 5.42), SIMDE_FLOAT64_C( 8.96), SIMDE_FLOAT64_C( 2.86), SIMDE_FLOAT64_C( 0.72) },
{ SIMDE_FLOAT64_C( 1.15), SIMDE_FLOAT64_C( 5.04), SIMDE_FLOAT64_C( 6.10), SIMDE_FLOAT64_C( 7.33),
SIMDE_FLOAT64_C( 7.23), SIMDE_FLOAT64_C( 5.63), SIMDE_FLOAT64_C( 1.33), SIMDE_FLOAT64_C( 0.10) },
{ SIMDE_FLOAT64_C( 5.31), SIMDE_FLOAT64_C( 2413.85), SIMDE_FLOAT64_C(523430.64), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C(202681.84), SIMDE_FLOAT64_C(229876.25), SIMDE_FLOAT64_C( 4.05), SIMDE_FLOAT64_C( 0.97) } },
{ { SIMDE_FLOAT64_C( 6.58), SIMDE_FLOAT64_C( 7.45), SIMDE_FLOAT64_C( 6.95), SIMDE_FLOAT64_C( 5.25),
SIMDE_FLOAT64_C( 3.79), SIMDE_FLOAT64_C( 9.30), SIMDE_FLOAT64_C( 2.70), SIMDE_FLOAT64_C( 7.12) },
{ SIMDE_FLOAT64_C( 6.34), SIMDE_FLOAT64_C( 4.32), SIMDE_FLOAT64_C( 2.52), SIMDE_FLOAT64_C( 8.25),
SIMDE_FLOAT64_C( 9.61), SIMDE_FLOAT64_C( 3.90), SIMDE_FLOAT64_C( 7.46), SIMDE_FLOAT64_C( 3.88) },
{ SIMDE_FLOAT64_C(154011.15), SIMDE_FLOAT64_C( 5857.54), SIMDE_FLOAT64_C( 132.37), SIMDE_FLOAT64_C(873603.27),
SIMDE_FLOAT64_C(363682.84), SIMDE_FLOAT64_C( 5985.27), SIMDE_FLOAT64_C( 1651.86), SIMDE_FLOAT64_C( 2030.59) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d b = simde_mm512_loadu_pd(test_vec[i].b);
simde__m512d r = simde_mm512_pow_pd(a, b);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_pow_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 b[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 1.66), SIMDE_FLOAT64_C( 8.31), SIMDE_FLOAT64_C( 9.30),
SIMDE_FLOAT64_C( 8.14), SIMDE_FLOAT64_C( 3.76), SIMDE_FLOAT64_C( 2.75), SIMDE_FLOAT64_C( 2.84) },
UINT8_C(150),
{ SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 1.52), SIMDE_FLOAT64_C( 1.22), SIMDE_FLOAT64_C( 1.57),
SIMDE_FLOAT64_C( 9.30), SIMDE_FLOAT64_C( 5.53), SIMDE_FLOAT64_C( 8.36), SIMDE_FLOAT64_C( 5.48) },
{ SIMDE_FLOAT64_C( 1.36), SIMDE_FLOAT64_C( 7.20), SIMDE_FLOAT64_C( 8.45), SIMDE_FLOAT64_C( 1.77),
SIMDE_FLOAT64_C( 6.75), SIMDE_FLOAT64_C( 6.44), SIMDE_FLOAT64_C( 2.61), SIMDE_FLOAT64_C( 0.07) },
{ SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 20.38), SIMDE_FLOAT64_C( 5.37), SIMDE_FLOAT64_C( 9.30),
SIMDE_FLOAT64_C(3445560.68), SIMDE_FLOAT64_C( 3.76), SIMDE_FLOAT64_C( 2.75), SIMDE_FLOAT64_C( 1.13) } },
{ { SIMDE_FLOAT64_C( 0.80), SIMDE_FLOAT64_C( 8.62), SIMDE_FLOAT64_C( 9.49), SIMDE_FLOAT64_C( 2.94),
SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 3.49), SIMDE_FLOAT64_C( 2.24) },
UINT8_C(147),
{ SIMDE_FLOAT64_C( 2.79), SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 5.06), SIMDE_FLOAT64_C( 5.54),
SIMDE_FLOAT64_C( 3.22), SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( 6.09), SIMDE_FLOAT64_C( 4.74) },
{ SIMDE_FLOAT64_C( 1.96), SIMDE_FLOAT64_C( 7.66), SIMDE_FLOAT64_C( 4.04), SIMDE_FLOAT64_C( 7.49),
SIMDE_FLOAT64_C( 6.02), SIMDE_FLOAT64_C( 9.52), SIMDE_FLOAT64_C( 8.85), SIMDE_FLOAT64_C( 3.22) },
{ SIMDE_FLOAT64_C( 7.47), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 9.49), SIMDE_FLOAT64_C( 2.94),
SIMDE_FLOAT64_C( 1141.02), SIMDE_FLOAT64_C( 3.00), SIMDE_FLOAT64_C( 3.49), SIMDE_FLOAT64_C( 149.97) } },
{ { SIMDE_FLOAT64_C( 7.97), SIMDE_FLOAT64_C( 0.62), SIMDE_FLOAT64_C( 9.97), SIMDE_FLOAT64_C( 4.41),
SIMDE_FLOAT64_C( 3.23), SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 5.21), SIMDE_FLOAT64_C( 1.85) },
UINT8_C(167),
{ SIMDE_FLOAT64_C( 8.14), SIMDE_FLOAT64_C( 2.43), SIMDE_FLOAT64_C( 2.53), SIMDE_FLOAT64_C( 1.63),
SIMDE_FLOAT64_C( 4.67), SIMDE_FLOAT64_C( 3.83), SIMDE_FLOAT64_C( 4.42), SIMDE_FLOAT64_C( 5.05) },
{ SIMDE_FLOAT64_C( 8.89), SIMDE_FLOAT64_C( 9.96), SIMDE_FLOAT64_C( 8.27), SIMDE_FLOAT64_C( 9.63),
SIMDE_FLOAT64_C( 6.05), SIMDE_FLOAT64_C( 3.01), SIMDE_FLOAT64_C( 1.59), SIMDE_FLOAT64_C( 3.71) },
{ SIMDE_FLOAT64_C(124580755.27), SIMDE_FLOAT64_C( 6928.49), SIMDE_FLOAT64_C( 2156.78), SIMDE_FLOAT64_C( 4.41),
SIMDE_FLOAT64_C( 3.23), SIMDE_FLOAT64_C( 56.94), SIMDE_FLOAT64_C( 5.21), SIMDE_FLOAT64_C( 406.64) } },
{ { SIMDE_FLOAT64_C( 7.05), SIMDE_FLOAT64_C( 9.08), SIMDE_FLOAT64_C( 9.73), SIMDE_FLOAT64_C( 6.57),
SIMDE_FLOAT64_C( 7.92), SIMDE_FLOAT64_C( 2.94), SIMDE_FLOAT64_C( 4.54), SIMDE_FLOAT64_C( 8.54) },
UINT8_C(148),
{ SIMDE_FLOAT64_C( 8.95), SIMDE_FLOAT64_C( 1.77), SIMDE_FLOAT64_C( 2.95), SIMDE_FLOAT64_C( 4.15),
SIMDE_FLOAT64_C( 3.63), SIMDE_FLOAT64_C( 2.48), SIMDE_FLOAT64_C( 2.30), SIMDE_FLOAT64_C( 6.06) },
{ SIMDE_FLOAT64_C( 5.01), SIMDE_FLOAT64_C( 3.93), SIMDE_FLOAT64_C( 0.73), SIMDE_FLOAT64_C( 8.84),
SIMDE_FLOAT64_C( 8.35), SIMDE_FLOAT64_C( 5.77), SIMDE_FLOAT64_C( 7.74), SIMDE_FLOAT64_C( 8.32) },
{ SIMDE_FLOAT64_C( 7.05), SIMDE_FLOAT64_C( 9.08), SIMDE_FLOAT64_C( 2.20), SIMDE_FLOAT64_C( 6.57),
SIMDE_FLOAT64_C( 47339.14), SIMDE_FLOAT64_C( 2.94), SIMDE_FLOAT64_C( 4.54), SIMDE_FLOAT64_C(3237220.14) } },
{ { SIMDE_FLOAT64_C( 4.04), SIMDE_FLOAT64_C( 7.37), SIMDE_FLOAT64_C( 4.37), SIMDE_FLOAT64_C( 7.05),
SIMDE_FLOAT64_C( 8.95), SIMDE_FLOAT64_C( 8.08), SIMDE_FLOAT64_C( 4.10), SIMDE_FLOAT64_C( 8.03) },
UINT8_C(201),
{ SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 5.95), SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( 5.20),
SIMDE_FLOAT64_C( 4.50), SIMDE_FLOAT64_C( 3.66), SIMDE_FLOAT64_C( 4.15), SIMDE_FLOAT64_C( 6.27) },
{ SIMDE_FLOAT64_C( 6.61), SIMDE_FLOAT64_C( 8.31), SIMDE_FLOAT64_C( 9.90), SIMDE_FLOAT64_C( 9.09),
SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( 5.95), SIMDE_FLOAT64_C( 4.10), SIMDE_FLOAT64_C( 4.53) },
{ SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 7.37), SIMDE_FLOAT64_C( 4.37), SIMDE_FLOAT64_C(3224559.49),
SIMDE_FLOAT64_C( 8.95), SIMDE_FLOAT64_C( 8.08), SIMDE_FLOAT64_C( 341.98), SIMDE_FLOAT64_C( 4089.05) } },
{ { SIMDE_FLOAT64_C( 6.68), SIMDE_FLOAT64_C( 2.94), SIMDE_FLOAT64_C( 2.89), SIMDE_FLOAT64_C( 2.45),
SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 1.20), SIMDE_FLOAT64_C( 6.49), SIMDE_FLOAT64_C( 8.05) },
UINT8_C( 44),
{ SIMDE_FLOAT64_C( 3.53), SIMDE_FLOAT64_C( 7.00), SIMDE_FLOAT64_C( 3.65), SIMDE_FLOAT64_C( 7.63),
SIMDE_FLOAT64_C( 5.03), SIMDE_FLOAT64_C( 1.45), SIMDE_FLOAT64_C( 8.30), SIMDE_FLOAT64_C( 0.98) },
{ SIMDE_FLOAT64_C( 2.20), SIMDE_FLOAT64_C( 3.50), SIMDE_FLOAT64_C( 5.47), SIMDE_FLOAT64_C( 5.86),
SIMDE_FLOAT64_C( 7.66), SIMDE_FLOAT64_C( 1.74), SIMDE_FLOAT64_C( 2.46), SIMDE_FLOAT64_C( 5.96) },
{ SIMDE_FLOAT64_C( 6.68), SIMDE_FLOAT64_C( 2.94), SIMDE_FLOAT64_C( 1190.53), SIMDE_FLOAT64_C(148454.65),
SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 1.91), SIMDE_FLOAT64_C( 6.49), SIMDE_FLOAT64_C( 8.05) } },
{ { SIMDE_FLOAT64_C( 1.64), SIMDE_FLOAT64_C( 1.55), SIMDE_FLOAT64_C( 6.56), SIMDE_FLOAT64_C( 7.59),
SIMDE_FLOAT64_C( 5.66), SIMDE_FLOAT64_C( 1.10), SIMDE_FLOAT64_C( 4.27), SIMDE_FLOAT64_C( 8.60) },
UINT8_C(119),
{ SIMDE_FLOAT64_C( 6.72), SIMDE_FLOAT64_C( 9.28), SIMDE_FLOAT64_C( 5.18), SIMDE_FLOAT64_C( 3.21),
SIMDE_FLOAT64_C( 7.32), SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 6.75), SIMDE_FLOAT64_C( 4.32) },
{ SIMDE_FLOAT64_C( 4.41), SIMDE_FLOAT64_C( 4.38), SIMDE_FLOAT64_C( 9.35), SIMDE_FLOAT64_C( 5.86),
SIMDE_FLOAT64_C( 2.68), SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( 8.06), SIMDE_FLOAT64_C( 6.18) },
{ SIMDE_FLOAT64_C( 4453.47), SIMDE_FLOAT64_C( 17292.59), SIMDE_FLOAT64_C(4775108.60), SIMDE_FLOAT64_C( 7.59),
SIMDE_FLOAT64_C( 207.44), SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C(4832684.12), SIMDE_FLOAT64_C( 8.60) } },
{ { SIMDE_FLOAT64_C( 5.80), SIMDE_FLOAT64_C( 3.91), SIMDE_FLOAT64_C( 3.84), SIMDE_FLOAT64_C( 7.54),
SIMDE_FLOAT64_C( 6.38), SIMDE_FLOAT64_C( 9.80), SIMDE_FLOAT64_C( 9.18), SIMDE_FLOAT64_C( 7.93) },
UINT8_C(224),
{ SIMDE_FLOAT64_C( 6.78), SIMDE_FLOAT64_C( 3.59), SIMDE_FLOAT64_C( 7.46), SIMDE_FLOAT64_C( 1.05),
SIMDE_FLOAT64_C( 2.19), SIMDE_FLOAT64_C( 1.44), SIMDE_FLOAT64_C( 7.77), SIMDE_FLOAT64_C( 1.46) },
{ SIMDE_FLOAT64_C( 6.62), SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 8.79), SIMDE_FLOAT64_C( 7.38),
SIMDE_FLOAT64_C( 7.73), SIMDE_FLOAT64_C( 3.11), SIMDE_FLOAT64_C( 1.78), SIMDE_FLOAT64_C( 2.11) },
{ SIMDE_FLOAT64_C( 5.80), SIMDE_FLOAT64_C( 3.91), SIMDE_FLOAT64_C( 3.84), SIMDE_FLOAT64_C( 7.54),
SIMDE_FLOAT64_C( 6.38), SIMDE_FLOAT64_C( 3.11), SIMDE_FLOAT64_C( 38.45), SIMDE_FLOAT64_C( 2.22) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d b = simde_mm512_loadu_pd(test_vec[i].b);
simde__m512d r = simde_mm512_mask_pow_pd(src, test_vec[i].k, a, b);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_rem_epi8(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_mm_set_epi8(INT8_C( 104), INT8_C( 42), INT8_C( 53), INT8_C( -2),
INT8_C(-124), INT8_C( -2), INT8_C( 96), INT8_C( 75),
INT8_C( 31), INT8_C( 112), INT8_C(-105), INT8_C( -87),
INT8_C( -84), INT8_C( 94), INT8_C( 112), INT8_C( 90)),
simde_mm_set_epi8(INT8_C( -65), INT8_C( -89), INT8_C( -30), INT8_C( 64),
INT8_C( -43), INT8_C( -54), INT8_C( 110), INT8_C( 113),
INT8_C( 89), INT8_C( -19), INT8_C( 70), INT8_C( -30),
INT8_C(-124), INT8_C( 91), INT8_C( -1), INT8_C( 88)),
simde_mm_set_epi8(INT8_C( 39), INT8_C( 42), INT8_C( 23), INT8_C( -2),
INT8_C( -38), INT8_C( -2), INT8_C( 96), INT8_C( 75),
INT8_C( 31), INT8_C( 17), INT8_C( -35), INT8_C( -27),
INT8_C( -84), INT8_C( 3), INT8_C( 0), INT8_C( 2)) },
{ simde_mm_set_epi8(INT8_C( -23), INT8_C( -86), INT8_C( -15), INT8_C( 126),
INT8_C( -74), INT8_C( 10), INT8_C( -48), INT8_C( -58),
INT8_C( 93), INT8_C(-126), INT8_C( -61), INT8_C( -79),
INT8_C( -69), INT8_C( -33), INT8_C(-117), INT8_C( -3)),
simde_mm_set_epi8(INT8_C( 41), INT8_C( 49), INT8_C( -85), INT8_C( -58),
INT8_C( 40), INT8_C( 44), INT8_C( -14), INT8_C( 51),
INT8_C(-118), INT8_C( -39), INT8_C( -41), INT8_C( -7),
INT8_C( -55), INT8_C( 37), INT8_C(-119), INT8_C( 29)),
simde_mm_set_epi8(INT8_C( -23), INT8_C( -37), INT8_C( -15), INT8_C( 10),
INT8_C( -34), INT8_C( 10), INT8_C( -6), INT8_C( -7),
INT8_C( 93), INT8_C( -9), INT8_C( -20), INT8_C( -2),
INT8_C( -14), INT8_C( -33), INT8_C(-117), INT8_C( -3)) },
{ simde_mm_set_epi8(INT8_C( 88), INT8_C( -13), INT8_C( 83), INT8_C( -34),
INT8_C( 17), INT8_C( -52), INT8_C( 102), INT8_C( 26),
INT8_C( 74), INT8_C(-115), INT8_C( -4), INT8_C( 101),
INT8_C( -39), INT8_C( 50), INT8_C( -9), INT8_C(-117)),
simde_mm_set_epi8(INT8_C( 71), INT8_C( 16), INT8_C( 127), INT8_C( 20),
INT8_C(-125), INT8_C( -92), INT8_C( -21), INT8_C( -43),
INT8_C( 78), INT8_C( -41), INT8_C( -6), INT8_C( 42),
INT8_C( 9), INT8_C( -58), INT8_C( 72), INT8_C( 56)),
simde_mm_set_epi8(INT8_C( 17), INT8_C( -13), INT8_C( 83), INT8_C( -14),
INT8_C( 17), INT8_C( -52), INT8_C( 18), INT8_C( 26),
INT8_C( 74), INT8_C( -33), INT8_C( -4), INT8_C( 17),
INT8_C( -3), INT8_C( 50), INT8_C( -9), INT8_C( -5)) },
{ simde_mm_set_epi8(INT8_C( -95), INT8_C( 114), INT8_C(-111), INT8_C( 28),
INT8_C( 100), INT8_C( -53), INT8_C( 101), INT8_C( 21),
INT8_C( 3), INT8_C( 0), INT8_C( 63), INT8_C( 116),
INT8_C( 43), INT8_C( 106), INT8_C( -29), INT8_C( -44)),
simde_mm_set_epi8(INT8_C(-106), INT8_C( -49), INT8_C( 31), INT8_C(-118),
INT8_C( 70), INT8_C( 80), INT8_C(-117), INT8_C( 103),
INT8_C( -99), INT8_C( -33), INT8_C( 12), INT8_C( -74),
INT8_C( -41), INT8_C( -14), INT8_C(-105), INT8_C( -57)),
simde_mm_set_epi8(INT8_C( -95), INT8_C( 16), INT8_C( -18), INT8_C( 28),
INT8_C( 30), INT8_C( -53), INT8_C( 101), INT8_C( 21),
INT8_C( 3), INT8_C( 0), INT8_C( 3), INT8_C( 42),
INT8_C( 2), INT8_C( 8), INT8_C( -29), INT8_C( -44)) },
{ simde_mm_set_epi8(INT8_C( 29), INT8_C( 89), INT8_C( 4), INT8_C( 90),
INT8_C( -1), INT8_C( 56), INT8_C( 40), INT8_C(-107),
INT8_C(-125), INT8_C(-104), INT8_C( 36), INT8_C( -27),
INT8_C( -21), INT8_C( -84), INT8_C( -95), INT8_C( -6)),
simde_mm_set_epi8(INT8_C( 29), INT8_C( 101), INT8_C( 12), INT8_C( -7),
INT8_C( -72), INT8_C( -61), INT8_C( -6), INT8_C( -43),
INT8_C( 53), INT8_C( 76), INT8_C( -68), INT8_C( 25),
INT8_C( -80), INT8_C( -78), INT8_C( -55), INT8_C( -12)),
simde_mm_set_epi8(INT8_C( 0), INT8_C( 89), INT8_C( 4), INT8_C( 6),
INT8_C( -1), INT8_C( 56), INT8_C( 4), INT8_C( -21),
INT8_C( -19), INT8_C( -28), INT8_C( 36), INT8_C( -2),
INT8_C( -21), INT8_C( -6), INT8_C( -40), INT8_C( -6)) },
{ simde_mm_set_epi8(INT8_C( -60), INT8_C( 36), INT8_C( 35), INT8_C( 54),
INT8_C( 94), INT8_C( 53), INT8_C(-124), INT8_C( -9),
INT8_C( -29), INT8_C( -20), INT8_C( 32), INT8_C( 119),
INT8_C( 124), INT8_C( 15), INT8_C( 15), INT8_C( -94)),
simde_mm_set_epi8(INT8_C( 78), INT8_C( 89), INT8_C( 105), INT8_C( 98),
INT8_C( -78), INT8_C( -83), INT8_C(-122), INT8_C( -57),
INT8_C( -45), INT8_C( -13), INT8_C( -95), INT8_C( -36),
INT8_C( -85), INT8_C( 107), INT8_C( 43), INT8_C( 1)),
simde_mm_set_epi8(INT8_C( -60), INT8_C( 36), INT8_C( 35), INT8_C( 54),
INT8_C( 16), INT8_C( 53), INT8_C( -2), INT8_C( -9),
INT8_C( -29), INT8_C( -7), INT8_C( 32), INT8_C( 11),
INT8_C( 39), INT8_C( 15), INT8_C( 15), INT8_C( 0)) },
{ simde_mm_set_epi8(INT8_C( 32), INT8_C( 79), INT8_C( 19), INT8_C( 72),
INT8_C( 29), INT8_C( -53), INT8_C( 79), INT8_C( -3),
INT8_C( 57), INT8_C( 16), INT8_C( 99), INT8_C( 126),
INT8_C( -77), INT8_C( 12), INT8_C( 100), INT8_C( 11)),
simde_mm_set_epi8(INT8_C( 101), INT8_C( -18), INT8_C( -52), INT8_C(-126),
INT8_C( 117), INT8_C( -86), INT8_C( -70), INT8_C( 72),
INT8_C( -85), INT8_C( 25), INT8_C( -31), INT8_C( -92),
INT8_C( 7), INT8_C( 17), INT8_C(-125), INT8_C( 67)),
simde_mm_set_epi8(INT8_C( 32), INT8_C( 7), INT8_C( 19), INT8_C( 72),
INT8_C( 29), INT8_C( -53), INT8_C( 9), INT8_C( -3),
INT8_C( 57), INT8_C( 16), INT8_C( 6), INT8_C( 34),
INT8_C( 0), INT8_C( 12), INT8_C( 100), INT8_C( 11)) },
{ simde_mm_set_epi8(INT8_C( -12), INT8_C( 123), INT8_C( -45), INT8_C( -41),
INT8_C( -52), INT8_C( -36), INT8_C( 31), INT8_C( -52),
INT8_C( -27), INT8_C( 71), INT8_C( 9), INT8_C( -84),
INT8_C( -96), INT8_C(-115), INT8_C( 31), INT8_C( 12)),
simde_mm_set_epi8(INT8_C( -68), INT8_C( 29), INT8_C( -34), INT8_C( 81),
INT8_C( -41), INT8_C( 10), INT8_C( -66), INT8_C( -37),
INT8_C( 108), INT8_C( -9), INT8_C( -68), INT8_C( -41),
INT8_C( -24), INT8_C( -55), INT8_C( -20), INT8_C( 9)),
simde_mm_set_epi8(INT8_C( -12), INT8_C( 7), INT8_C( -11), INT8_C( -41),
INT8_C( -11), INT8_C( -6), INT8_C( 31), INT8_C( -15),
INT8_C( -27), INT8_C( 8), INT8_C( 9), INT8_C( -2),
INT8_C( 0), INT8_C( -5), INT8_C( 11), INT8_C( 3)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_rem_epi8(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_i8(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_rem_epi16(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_mm_set_epi16(INT16_C( 26666), INT16_C( 13822), INT16_C(-31490), INT16_C( 24651),
INT16_C( 8048), INT16_C(-26711), INT16_C(-21410), INT16_C( 28762)),
simde_mm_set_epi16(INT16_C(-16473), INT16_C( -7616), INT16_C(-10806), INT16_C( 28273),
INT16_C( 23021), INT16_C( 18146), INT16_C(-31653), INT16_C( -168)),
simde_mm_set_epi16(INT16_C( 10193), INT16_C( 6206), INT16_C( -9878), INT16_C( 24651),
INT16_C( 8048), INT16_C( -8565), INT16_C(-21410), INT16_C( 34)) },
{ simde_mm_set_epi16(INT16_C( -5718), INT16_C( -3714), INT16_C(-18934), INT16_C(-12090),
INT16_C( 23938), INT16_C(-15439), INT16_C(-17441), INT16_C(-29699)),
simde_mm_set_epi16(INT16_C( 10545), INT16_C(-21562), INT16_C( 10284), INT16_C( -3533),
INT16_C(-29991), INT16_C(-10247), INT16_C(-14043), INT16_C(-30435)),
simde_mm_set_epi16(INT16_C( -5718), INT16_C( -3714), INT16_C( -8650), INT16_C( -1491),
INT16_C( 23938), INT16_C( -5192), INT16_C( -3398), INT16_C(-29699)) },
{ simde_mm_set_epi16(INT16_C( 22771), INT16_C( 21470), INT16_C( 4556), INT16_C( 26138),
INT16_C( 19085), INT16_C( -923), INT16_C( -9934), INT16_C( -2165)),
simde_mm_set_epi16(INT16_C( 18192), INT16_C( 32532), INT16_C(-31836), INT16_C( -5163),
INT16_C( 20183), INT16_C( -1494), INT16_C( 2502), INT16_C( 18488)),
simde_mm_set_epi16(INT16_C( 4579), INT16_C( 21470), INT16_C( 4556), INT16_C( 323),
INT16_C( 19085), INT16_C( -923), INT16_C( -2428), INT16_C( -2165)) },
{ simde_mm_set_epi16(INT16_C(-24206), INT16_C(-28388), INT16_C( 25803), INT16_C( 25877),
INT16_C( 768), INT16_C( 16244), INT16_C( 11114), INT16_C( -7212)),
simde_mm_set_epi16(INT16_C(-26929), INT16_C( 8074), INT16_C( 18000), INT16_C(-29849),
INT16_C(-25121), INT16_C( 3254), INT16_C(-10254), INT16_C(-26681)),
simde_mm_set_epi16(INT16_C(-24206), INT16_C( -4166), INT16_C( 7803), INT16_C( 25877),
INT16_C( 768), INT16_C( 3228), INT16_C( 860), INT16_C( -7212)) },
{ simde_mm_set_epi16(INT16_C( 7513), INT16_C( 1114), INT16_C( -200), INT16_C( 10389),
INT16_C(-31848), INT16_C( 9445), INT16_C( -5204), INT16_C(-24070)),
simde_mm_set_epi16(INT16_C( 7525), INT16_C( 3321), INT16_C(-18237), INT16_C( -1323),
INT16_C( 13644), INT16_C(-17383), INT16_C(-20302), INT16_C(-13836)),
simde_mm_set_epi16(INT16_C( 7513), INT16_C( 1114), INT16_C( -200), INT16_C( 1128),
INT16_C( -4560), INT16_C( 9445), INT16_C( -5204), INT16_C(-10234)) },
{ simde_mm_set_epi16(INT16_C(-15324), INT16_C( 9014), INT16_C( 24117), INT16_C(-31497),
INT16_C( -7188), INT16_C( 8311), INT16_C( 31759), INT16_C( 4002)),
simde_mm_set_epi16(INT16_C( 20057), INT16_C( 26978), INT16_C(-19795), INT16_C(-31033),
INT16_C(-11277), INT16_C(-24100), INT16_C(-21653), INT16_C( 11009)),
simde_mm_set_epi16(INT16_C(-15324), INT16_C( 9014), INT16_C( 4322), INT16_C( -464),
INT16_C( -7188), INT16_C( 8311), INT16_C( 10106), INT16_C( 4002)) },
{ simde_mm_set_epi16(INT16_C( 8271), INT16_C( 4936), INT16_C( 7627), INT16_C( 20477),
INT16_C( 14608), INT16_C( 25470), INT16_C(-19700), INT16_C( 25611)),
simde_mm_set_epi16(INT16_C( 26094), INT16_C(-13182), INT16_C( 30122), INT16_C(-17848),
INT16_C(-21735), INT16_C( -7772), INT16_C( 1809), INT16_C(-31933)),
simde_mm_set_epi16(INT16_C( 8271), INT16_C( 4936), INT16_C( 7627), INT16_C( 2629),
INT16_C( 14608), INT16_C( 2154), INT16_C( -1610), INT16_C( 25611)) },
{ simde_mm_set_epi16(INT16_C( -2949), INT16_C(-11305), INT16_C(-13092), INT16_C( 8140),
INT16_C( -6841), INT16_C( 2476), INT16_C(-24435), INT16_C( 7948)),
simde_mm_set_epi16(INT16_C(-17379), INT16_C( -8623), INT16_C(-10486), INT16_C(-16677),
INT16_C( 27895), INT16_C(-17193), INT16_C( -5943), INT16_C( -5111)),
simde_mm_set_epi16(INT16_C( -2949), INT16_C( -2682), INT16_C( -2606), INT16_C( 8140),
INT16_C( -6841), INT16_C( 2476), INT16_C( -663), INT16_C( 2837)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_rem_epi16(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_i16(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_rem_epi32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_mm_set_epi32(INT32_C( 1747596798), INT32_C(-2063703989), INT32_C( 527472553), INT32_C(-1403096998)),
simde_mm_set_epi32(INT32_C(-1079516608), INT32_C( -708153743), INT32_C( 1508722402), INT32_C(-2074345640)),
simde_mm_set_epi32(INT32_C( 668080190), INT32_C( -647396503), INT32_C( 527472553), INT32_C(-1403096998)) },
{ simde_mm_set_epi32(INT32_C( -374673026), INT32_C(-1240805178), INT32_C( 1568850865), INT32_C(-1142977539)),
simde_mm_set_epi32(INT32_C( 691121094), INT32_C( 674034227), INT32_C(-1965434887), INT32_C( -920286947)),
simde_mm_set_epi32(INT32_C( -374673026), INT32_C( -566770951), INT32_C( 1568850865), INT32_C( -222690592)) },
{ simde_mm_set_epi32(INT32_C( 1492341726), INT32_C( 298608154), INT32_C( 1250819173), INT32_C( -650971253)),
simde_mm_set_epi32(INT32_C( 1192263444), INT32_C(-2086343723), INT32_C( 1322777130), INT32_C( 163989560)),
simde_mm_set_epi32(INT32_C( 300078282), INT32_C( 298608154), INT32_C( 1250819173), INT32_C( -159002573)) },
{ simde_mm_set_epi32(INT32_C(-1586327268), INT32_C( 1691051285), INT32_C( 50347892), INT32_C( 728425428)),
simde_mm_set_epi32(INT32_C(-1764810870), INT32_C( 1179683687), INT32_C(-1646326602), INT32_C( -671967289)),
simde_mm_set_epi32(INT32_C(-1586327268), INT32_C( 511367598), INT32_C( 50347892), INT32_C( 56458139)) },
{ simde_mm_set_epi32(INT32_C( 492373082), INT32_C( -13096811), INT32_C(-2087181083), INT32_C( -341007878)),
simde_mm_set_epi32(INT32_C( 493161721), INT32_C(-1195115819), INT32_C( 894221337), INT32_C(-1330460172)),
simde_mm_set_epi32(INT32_C( 492373082), INT32_C( -13096811), INT32_C( -298738409), INT32_C( -341007878)) },
{ simde_mm_set_epi32(INT32_C(-1004264650), INT32_C( 1580565751), INT32_C( -471064457), INT32_C( 2081361826)),
simde_mm_set_epi32(INT32_C( 1314482530), INT32_C(-1297250617), INT32_C( -739008036), INT32_C(-1419039999)),
simde_mm_set_epi32(INT32_C(-1004264650), INT32_C( 283315134), INT32_C( -471064457), INT32_C( 662321827)) },
{ simde_mm_set_epi32(INT32_C( 542053192), INT32_C( 499863549), INT32_C( 957375358), INT32_C(-1291033589)),
simde_mm_set_epi32(INT32_C( 1710148738), INT32_C( 1974123080), INT32_C(-1424367196), INT32_C( 118588227)),
simde_mm_set_epi32(INT32_C( 542053192), INT32_C( 499863549), INT32_C( 957375358), INT32_C( -105151319)) },
{ simde_mm_set_epi32(INT32_C( -193211433), INT32_C( -857989172), INT32_C( -448329300), INT32_C(-1601364212)),
simde_mm_set_epi32(INT32_C(-1138893231), INT32_C( -687161637), INT32_C( 1828175063), INT32_C( -389420023)),
simde_mm_set_epi32(INT32_C( -193211433), INT32_C( -170827535), INT32_C( -448329300), INT32_C( -43684120)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_rem_epi32(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_i32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_rem_epi64(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_mm_set_epi64x(INT64_C( 7505871096235581515), INT64_C( 2265477367564496986)),
simde_mm_set_epi64x(INT64_C(-4636488523262038415), INT64_C( 6479913377553186648)),
simde_mm_set_epi64x(INT64_C( 2869382572973543100), INT64_C( 2265477367564496986)) },
{ simde_mm_set_epi64x(INT64_C(-1609208390309195578), INT64_C( 6738163160628300797)),
simde_mm_set_epi64x(INT64_C( 2968342496979776051), INT64_C(-8441478558707775203)),
simde_mm_set_epi64x(INT64_C(-1609208390309195578), INT64_C( 6738163160628300797)) },
{ simde_mm_set_epi64x(INT64_C( 6409558907924801050), INT64_C( 5372227444888762251)),
simde_mm_set_epi64x(INT64_C( 5120732502404950997), INT64_C( 5681284513410730040)),
simde_mm_set_epi64x(INT64_C( 1288826405519850053), INT64_C( 5372227444888762251)) },
{ simde_mm_set_epi64x(INT64_C(-6813223735121976043), INT64_C( 216242550290965460)),
simde_mm_set_epi64x(INT64_C(-7579804969095623833), INT64_C(-7070918910501808185)),
simde_mm_set_epi64x(INT64_C(-6813223735121976043), INT64_C( 216242550290965460)) },
{ simde_mm_set_epi64x(INT64_C( 2114726288902596757), INT64_C(-8964374488360902150)),
simde_mm_set_epi64x(INT64_C( 2118113466433927893), INT64_C( 3840651400764901876)),
simde_mm_set_epi64x(INT64_C( 2114726288902596757), INT64_C(-1283071686831098398)) },
{ simde_mm_set_epi64x(INT64_C(-4313283826698320649), INT64_C(-2023206435041636446)),
simde_mm_set_epi64x(INT64_C( 5645659480511055559), INT64_C(-3174015343225263359)),
simde_mm_set_epi64x(INT64_C(-4313283826698320649), INT64_C(-2023206435041636446)) },
{ simde_mm_set_epi64x(INT64_C( 2328100732832272381), INT64_C( 4111895855610225675)),
simde_mm_set_epi64x(INT64_C( 7345032902979795528), INT64_C(-6117610524196633789)),
simde_mm_set_epi64x(INT64_C( 2328100732832272381), INT64_C( 4111895855610225675)) },
{ simde_mm_set_epi64x(INT64_C( -829836782511317044), INT64_C(-1925559678644969716)),
simde_mm_set_epi64x(INT64_C(-4891509177172967717), INT64_C( 7851952110853286921)),
simde_mm_set_epi64x(INT64_C( -829836782511317044), INT64_C(-1925559678644969716)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_rem_epi64(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_i64(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_rem_epu8(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_x_mm_set_epu8(UINT8_C(104), UINT8_C( 42), UINT8_C( 53), UINT8_C(254),
UINT8_C(132), UINT8_C(254), UINT8_C( 96), UINT8_C( 75),
UINT8_C( 31), UINT8_C(112), UINT8_C(151), UINT8_C(169),
UINT8_C(172), UINT8_C( 94), UINT8_C(112), UINT8_C( 90)),
simde_x_mm_set_epu8(UINT8_C(191), UINT8_C(167), UINT8_C(226), UINT8_C( 64),
UINT8_C(213), UINT8_C(202), UINT8_C(110), UINT8_C(113),
UINT8_C( 89), UINT8_C(237), UINT8_C( 70), UINT8_C(226),
UINT8_C(132), UINT8_C( 91), UINT8_C(255), UINT8_C( 88)),
simde_x_mm_set_epu8(UINT8_C(104), UINT8_C( 42), UINT8_C( 53), UINT8_C( 62),
UINT8_C(132), UINT8_C( 52), UINT8_C( 96), UINT8_C( 75),
UINT8_C( 31), UINT8_C(112), UINT8_C( 11), UINT8_C(169),
UINT8_C( 40), UINT8_C( 3), UINT8_C(112), UINT8_C( 2)) },
{ simde_x_mm_set_epu8(UINT8_C(233), UINT8_C(170), UINT8_C(241), UINT8_C(126),
UINT8_C(182), UINT8_C( 10), UINT8_C(208), UINT8_C(198),
UINT8_C( 93), UINT8_C(130), UINT8_C(195), UINT8_C(177),
UINT8_C(187), UINT8_C(223), UINT8_C(139), UINT8_C(253)),
simde_x_mm_set_epu8(UINT8_C( 41), UINT8_C( 49), UINT8_C(171), UINT8_C(198),
UINT8_C( 40), UINT8_C( 44), UINT8_C(242), UINT8_C( 51),
UINT8_C(138), UINT8_C(217), UINT8_C(215), UINT8_C(249),
UINT8_C(201), UINT8_C( 37), UINT8_C(137), UINT8_C( 29)),
simde_x_mm_set_epu8(UINT8_C( 28), UINT8_C( 23), UINT8_C( 70), UINT8_C(126),
UINT8_C( 22), UINT8_C( 10), UINT8_C(208), UINT8_C( 45),
UINT8_C( 93), UINT8_C(130), UINT8_C(195), UINT8_C(177),
UINT8_C(187), UINT8_C( 1), UINT8_C( 2), UINT8_C( 21)) },
{ simde_x_mm_set_epu8(UINT8_C( 88), UINT8_C(243), UINT8_C( 83), UINT8_C(222),
UINT8_C( 17), UINT8_C(204), UINT8_C(102), UINT8_C( 26),
UINT8_C( 74), UINT8_C(141), UINT8_C(252), UINT8_C(101),
UINT8_C(217), UINT8_C( 50), UINT8_C(247), UINT8_C(139)),
simde_x_mm_set_epu8(UINT8_C( 71), UINT8_C( 16), UINT8_C(127), UINT8_C( 20),
UINT8_C(131), UINT8_C(164), UINT8_C(235), UINT8_C(213),
UINT8_C( 78), UINT8_C(215), UINT8_C(250), UINT8_C( 42),
UINT8_C( 9), UINT8_C(198), UINT8_C( 72), UINT8_C( 56)),
simde_x_mm_set_epu8(UINT8_C( 17), UINT8_C( 3), UINT8_C( 83), UINT8_C( 2),
UINT8_C( 17), UINT8_C( 40), UINT8_C(102), UINT8_C( 26),
UINT8_C( 74), UINT8_C(141), UINT8_C( 2), UINT8_C( 17),
UINT8_C( 1), UINT8_C( 50), UINT8_C( 31), UINT8_C( 27)) },
{ simde_x_mm_set_epu8(UINT8_C(161), UINT8_C(114), UINT8_C(145), UINT8_C( 28),
UINT8_C(100), UINT8_C(203), UINT8_C(101), UINT8_C( 21),
UINT8_C( 3), UINT8_C( 0), UINT8_C( 63), UINT8_C(116),
UINT8_C( 43), UINT8_C(106), UINT8_C(227), UINT8_C(212)),
simde_x_mm_set_epu8(UINT8_C(150), UINT8_C(207), UINT8_C( 31), UINT8_C(138),
UINT8_C( 70), UINT8_C( 80), UINT8_C(139), UINT8_C(103),
UINT8_C(157), UINT8_C(223), UINT8_C( 12), UINT8_C(182),
UINT8_C(215), UINT8_C(242), UINT8_C(151), UINT8_C(199)),
simde_x_mm_set_epu8(UINT8_C( 11), UINT8_C(114), UINT8_C( 21), UINT8_C( 28),
UINT8_C( 30), UINT8_C( 43), UINT8_C(101), UINT8_C( 21),
UINT8_C( 3), UINT8_C( 0), UINT8_C( 3), UINT8_C(116),
UINT8_C( 43), UINT8_C(106), UINT8_C( 76), UINT8_C( 13)) },
{ simde_x_mm_set_epu8(UINT8_C( 29), UINT8_C( 89), UINT8_C( 4), UINT8_C( 90),
UINT8_C(255), UINT8_C( 56), UINT8_C( 40), UINT8_C(149),
UINT8_C(131), UINT8_C(152), UINT8_C( 36), UINT8_C(229),
UINT8_C(235), UINT8_C(172), UINT8_C(161), UINT8_C(250)),
simde_x_mm_set_epu8(UINT8_C( 29), UINT8_C(101), UINT8_C( 12), UINT8_C(249),
UINT8_C(184), UINT8_C(195), UINT8_C(250), UINT8_C(213),
UINT8_C( 53), UINT8_C( 76), UINT8_C(188), UINT8_C( 25),
UINT8_C(176), UINT8_C(178), UINT8_C(201), UINT8_C(244)),
simde_x_mm_set_epu8(UINT8_C( 0), UINT8_C( 89), UINT8_C( 4), UINT8_C( 90),
UINT8_C( 71), UINT8_C( 56), UINT8_C( 40), UINT8_C(149),
UINT8_C( 25), UINT8_C( 0), UINT8_C( 36), UINT8_C( 4),
UINT8_C( 59), UINT8_C(172), UINT8_C(161), UINT8_C( 6)) },
{ simde_x_mm_set_epu8(UINT8_C(196), UINT8_C( 36), UINT8_C( 35), UINT8_C( 54),
UINT8_C( 94), UINT8_C( 53), UINT8_C(132), UINT8_C(247),
UINT8_C(227), UINT8_C(236), UINT8_C( 32), UINT8_C(119),
UINT8_C(124), UINT8_C( 15), UINT8_C( 15), UINT8_C(162)),
simde_x_mm_set_epu8(UINT8_C( 78), UINT8_C( 89), UINT8_C(105), UINT8_C( 98),
UINT8_C(178), UINT8_C(173), UINT8_C(134), UINT8_C(199),
UINT8_C(211), UINT8_C(243), UINT8_C(161), UINT8_C(220),
UINT8_C(171), UINT8_C(107), UINT8_C( 43), UINT8_C( 1)),
simde_x_mm_set_epu8(UINT8_C( 40), UINT8_C( 36), UINT8_C( 35), UINT8_C( 54),
UINT8_C( 94), UINT8_C( 53), UINT8_C(132), UINT8_C( 48),
UINT8_C( 16), UINT8_C(236), UINT8_C( 32), UINT8_C(119),
UINT8_C(124), UINT8_C( 15), UINT8_C( 15), UINT8_C( 0)) },
{ simde_x_mm_set_epu8(UINT8_C( 32), UINT8_C( 79), UINT8_C( 19), UINT8_C( 72),
UINT8_C( 29), UINT8_C(203), UINT8_C( 79), UINT8_C(253),
UINT8_C( 57), UINT8_C( 16), UINT8_C( 99), UINT8_C(126),
UINT8_C(179), UINT8_C( 12), UINT8_C(100), UINT8_C( 11)),
simde_x_mm_set_epu8(UINT8_C(101), UINT8_C(238), UINT8_C(204), UINT8_C(130),
UINT8_C(117), UINT8_C(170), UINT8_C(186), UINT8_C( 72),
UINT8_C(171), UINT8_C( 25), UINT8_C(225), UINT8_C(164),
UINT8_C( 7), UINT8_C( 17), UINT8_C(131), UINT8_C( 67)),
simde_x_mm_set_epu8(UINT8_C( 32), UINT8_C( 79), UINT8_C( 19), UINT8_C( 72),
UINT8_C( 29), UINT8_C( 33), UINT8_C( 79), UINT8_C( 37),
UINT8_C( 57), UINT8_C( 16), UINT8_C( 99), UINT8_C(126),
UINT8_C( 4), UINT8_C( 12), UINT8_C(100), UINT8_C( 11)) },
{ simde_x_mm_set_epu8(UINT8_C(244), UINT8_C(123), UINT8_C(211), UINT8_C(215),
UINT8_C(204), UINT8_C(220), UINT8_C( 31), UINT8_C(204),
UINT8_C(229), UINT8_C( 71), UINT8_C( 9), UINT8_C(172),
UINT8_C(160), UINT8_C(141), UINT8_C( 31), UINT8_C( 12)),
simde_x_mm_set_epu8(UINT8_C(188), UINT8_C( 29), UINT8_C(222), UINT8_C( 81),
UINT8_C(215), UINT8_C( 10), UINT8_C(190), UINT8_C(219),
UINT8_C(108), UINT8_C(247), UINT8_C(188), UINT8_C(215),
UINT8_C(232), UINT8_C(201), UINT8_C(236), UINT8_C( 9)),
simde_x_mm_set_epu8(UINT8_C( 56), UINT8_C( 7), UINT8_C(211), UINT8_C( 53),
UINT8_C(204), UINT8_C( 0), UINT8_C( 31), UINT8_C(204),
UINT8_C( 13), UINT8_C( 71), UINT8_C( 9), UINT8_C(172),
UINT8_C(160), UINT8_C(141), UINT8_C( 31), UINT8_C( 3)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_rem_epu8(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_u8(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_rem_epu16(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_x_mm_set_epu16(UINT16_C(26666), UINT16_C(13822), UINT16_C(34046), UINT16_C(24651),
UINT16_C( 8048), UINT16_C(38825), UINT16_C(44126), UINT16_C(28762)),
simde_x_mm_set_epu16(UINT16_C(49063), UINT16_C(57920), UINT16_C(54730), UINT16_C(28273),
UINT16_C(23021), UINT16_C(18146), UINT16_C(33883), UINT16_C(65368)),
simde_x_mm_set_epu16(UINT16_C(26666), UINT16_C(13822), UINT16_C(34046), UINT16_C(24651),
UINT16_C( 8048), UINT16_C( 2533), UINT16_C(10243), UINT16_C(28762)) },
{ simde_x_mm_set_epu16(UINT16_C(59818), UINT16_C(61822), UINT16_C(46602), UINT16_C(53446),
UINT16_C(23938), UINT16_C(50097), UINT16_C(48095), UINT16_C(35837)),
simde_x_mm_set_epu16(UINT16_C(10545), UINT16_C(43974), UINT16_C(10284), UINT16_C(62003),
UINT16_C(35545), UINT16_C(55289), UINT16_C(51493), UINT16_C(35101)),
simde_x_mm_set_epu16(UINT16_C( 7093), UINT16_C(17848), UINT16_C( 5466), UINT16_C(53446),
UINT16_C(23938), UINT16_C(50097), UINT16_C(48095), UINT16_C( 736)) },
{ simde_x_mm_set_epu16(UINT16_C(22771), UINT16_C(21470), UINT16_C( 4556), UINT16_C(26138),
UINT16_C(19085), UINT16_C(64613), UINT16_C(55602), UINT16_C(63371)),
simde_x_mm_set_epu16(UINT16_C(18192), UINT16_C(32532), UINT16_C(33700), UINT16_C(60373),
UINT16_C(20183), UINT16_C(64042), UINT16_C( 2502), UINT16_C(18488)),
simde_x_mm_set_epu16(UINT16_C( 4579), UINT16_C(21470), UINT16_C( 4556), UINT16_C(26138),
UINT16_C(19085), UINT16_C( 571), UINT16_C( 558), UINT16_C( 7907)) },
{ simde_x_mm_set_epu16(UINT16_C(41330), UINT16_C(37148), UINT16_C(25803), UINT16_C(25877),
UINT16_C( 768), UINT16_C(16244), UINT16_C(11114), UINT16_C(58324)),
simde_x_mm_set_epu16(UINT16_C(38607), UINT16_C( 8074), UINT16_C(18000), UINT16_C(35687),
UINT16_C(40415), UINT16_C( 3254), UINT16_C(55282), UINT16_C(38855)),
simde_x_mm_set_epu16(UINT16_C( 2723), UINT16_C( 4852), UINT16_C( 7803), UINT16_C(25877),
UINT16_C( 768), UINT16_C( 3228), UINT16_C(11114), UINT16_C(19469)) },
{ simde_x_mm_set_epu16(UINT16_C( 7513), UINT16_C( 1114), UINT16_C(65336), UINT16_C(10389),
UINT16_C(33688), UINT16_C( 9445), UINT16_C(60332), UINT16_C(41466)),
simde_x_mm_set_epu16(UINT16_C( 7525), UINT16_C( 3321), UINT16_C(47299), UINT16_C(64213),
UINT16_C(13644), UINT16_C(48153), UINT16_C(45234), UINT16_C(51700)),
simde_x_mm_set_epu16(UINT16_C( 7513), UINT16_C( 1114), UINT16_C(18037), UINT16_C(10389),
UINT16_C( 6400), UINT16_C( 9445), UINT16_C(15098), UINT16_C(41466)) },
{ simde_x_mm_set_epu16(UINT16_C(50212), UINT16_C( 9014), UINT16_C(24117), UINT16_C(34039),
UINT16_C(58348), UINT16_C( 8311), UINT16_C(31759), UINT16_C( 4002)),
simde_x_mm_set_epu16(UINT16_C(20057), UINT16_C(26978), UINT16_C(45741), UINT16_C(34503),
UINT16_C(54259), UINT16_C(41436), UINT16_C(43883), UINT16_C(11009)),
simde_x_mm_set_epu16(UINT16_C(10098), UINT16_C( 9014), UINT16_C(24117), UINT16_C(34039),
UINT16_C( 4089), UINT16_C( 8311), UINT16_C(31759), UINT16_C( 4002)) },
{ simde_x_mm_set_epu16(UINT16_C( 8271), UINT16_C( 4936), UINT16_C( 7627), UINT16_C(20477),
UINT16_C(14608), UINT16_C(25470), UINT16_C(45836), UINT16_C(25611)),
simde_x_mm_set_epu16(UINT16_C(26094), UINT16_C(52354), UINT16_C(30122), UINT16_C(47688),
UINT16_C(43801), UINT16_C(57764), UINT16_C( 1809), UINT16_C(33603)),
simde_x_mm_set_epu16(UINT16_C( 8271), UINT16_C( 4936), UINT16_C( 7627), UINT16_C(20477),
UINT16_C(14608), UINT16_C(25470), UINT16_C( 611), UINT16_C(25611)) },
{ simde_x_mm_set_epu16(UINT16_C(62587), UINT16_C(54231), UINT16_C(52444), UINT16_C( 8140),
UINT16_C(58695), UINT16_C( 2476), UINT16_C(41101), UINT16_C( 7948)),
simde_x_mm_set_epu16(UINT16_C(48157), UINT16_C(56913), UINT16_C(55050), UINT16_C(48859),
UINT16_C(27895), UINT16_C(48343), UINT16_C(59593), UINT16_C(60425)),
simde_x_mm_set_epu16(UINT16_C(14430), UINT16_C(54231), UINT16_C(52444), UINT16_C( 8140),
UINT16_C( 2905), UINT16_C( 2476), UINT16_C(41101), UINT16_C( 7948)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_rem_epu16(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_u16(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_rem_epu32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_x_mm_set_epu32(UINT32_C(1747596798), UINT32_C(2231263307), UINT32_C( 527472553), UINT32_C(2891870298)),
simde_x_mm_set_epu32(UINT32_C(3215450688), UINT32_C(3586813553), UINT32_C(1508722402), UINT32_C(2220621656)),
simde_x_mm_set_epu32(UINT32_C(1747596798), UINT32_C(2231263307), UINT32_C( 527472553), UINT32_C( 671248642)) },
{ simde_x_mm_set_epu32(UINT32_C(3920294270), UINT32_C(3054162118), UINT32_C(1568850865), UINT32_C(3151989757)),
simde_x_mm_set_epu32(UINT32_C( 691121094), UINT32_C( 674034227), UINT32_C(2329532409), UINT32_C(3374680349)),
simde_x_mm_set_epu32(UINT32_C( 464688800), UINT32_C( 358025210), UINT32_C(1568850865), UINT32_C(3151989757)) },
{ simde_x_mm_set_epu32(UINT32_C(1492341726), UINT32_C( 298608154), UINT32_C(1250819173), UINT32_C(3643996043)),
simde_x_mm_set_epu32(UINT32_C(1192263444), UINT32_C(2208623573), UINT32_C(1322777130), UINT32_C( 163989560)),
simde_x_mm_set_epu32(UINT32_C( 300078282), UINT32_C( 298608154), UINT32_C(1250819173), UINT32_C( 36225723)) },
{ simde_x_mm_set_epu32(UINT32_C(2708640028), UINT32_C(1691051285), UINT32_C( 50347892), UINT32_C( 728425428)),
simde_x_mm_set_epu32(UINT32_C(2530156426), UINT32_C(1179683687), UINT32_C(2648640694), UINT32_C(3623000007)),
simde_x_mm_set_epu32(UINT32_C( 178483602), UINT32_C( 511367598), UINT32_C( 50347892), UINT32_C( 728425428)) },
{ simde_x_mm_set_epu32(UINT32_C( 492373082), UINT32_C(4281870485), UINT32_C(2207786213), UINT32_C(3953959418)),
simde_x_mm_set_epu32(UINT32_C( 493161721), UINT32_C(3099851477), UINT32_C( 894221337), UINT32_C(2964507124)),
simde_x_mm_set_epu32(UINT32_C( 492373082), UINT32_C(1182019008), UINT32_C( 419343539), UINT32_C( 989452294)) },
{ simde_x_mm_set_epu32(UINT32_C(3290702646), UINT32_C(1580565751), UINT32_C(3823902839), UINT32_C(2081361826)),
simde_x_mm_set_epu32(UINT32_C(1314482530), UINT32_C(2997716679), UINT32_C(3555959260), UINT32_C(2875927297)),
simde_x_mm_set_epu32(UINT32_C( 661737586), UINT32_C(1580565751), UINT32_C( 267943579), UINT32_C(2081361826)) },
{ simde_x_mm_set_epu32(UINT32_C( 542053192), UINT32_C( 499863549), UINT32_C( 957375358), UINT32_C(3003933707)),
simde_x_mm_set_epu32(UINT32_C(1710148738), UINT32_C(1974123080), UINT32_C(2870600100), UINT32_C( 118588227)),
simde_x_mm_set_epu32(UINT32_C( 542053192), UINT32_C( 499863549), UINT32_C( 957375358), UINT32_C( 39228032)) },
{ simde_x_mm_set_epu32(UINT32_C(4101755863), UINT32_C(3436978124), UINT32_C(3846637996), UINT32_C(2693603084)),
simde_x_mm_set_epu32(UINT32_C(3156074065), UINT32_C(3607805659), UINT32_C(1828175063), UINT32_C(3905547273)),
simde_x_mm_set_epu32(UINT32_C( 945681798), UINT32_C(3436978124), UINT32_C( 190287870), UINT32_C(2693603084)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_rem_epu32(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_u32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm_rem_epu64(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i r;
} test_vec[8] = {
{ simde_x_mm_set_epu64x(UINT64_C( 7505871096235581515), UINT64_C( 2265477367564496986)),
simde_x_mm_set_epu64x(UINT64_C(13810255550447513201), UINT64_C( 6479913377553186648)),
simde_x_mm_set_epu64x(UINT64_C( 7505871096235581515), UINT64_C( 2265477367564496986)) },
{ simde_x_mm_set_epu64x(UINT64_C(16837535683400356038), UINT64_C( 6738163160628300797)),
simde_x_mm_set_epu64x(UINT64_C( 2968342496979776051), UINT64_C(10005265515001776413)),
simde_x_mm_set_epu64x(UINT64_C( 1995823198501475783), UINT64_C( 6738163160628300797)) },
{ simde_x_mm_set_epu64x(UINT64_C( 6409558907924801050), UINT64_C( 5372227444888762251)),
simde_x_mm_set_epu64x(UINT64_C( 5120732502404950997), UINT64_C( 5681284513410730040)),
simde_x_mm_set_epu64x(UINT64_C( 1288826405519850053), UINT64_C( 5372227444888762251)) },
{ simde_x_mm_set_epu64x(UINT64_C(11633520338587575573), UINT64_C( 216242550290965460)),
simde_x_mm_set_epu64x(UINT64_C(10866939104613927783), UINT64_C(11375825163207743431)),
simde_x_mm_set_epu64x(UINT64_C( 766581233973647790), UINT64_C( 216242550290965460)) },
{ simde_x_mm_set_epu64x(UINT64_C( 2114726288902596757), UINT64_C( 9482369585348649466)),
simde_x_mm_set_epu64x(UINT64_C( 2118113466433927893), UINT64_C( 3840651400764901876)),
simde_x_mm_set_epu64x(UINT64_C( 2114726288902596757), UINT64_C( 1801066783818845714)) },
{ simde_x_mm_set_epu64x(UINT64_C(14133460247011230967), UINT64_C(16423537638667915170)),
simde_x_mm_set_epu64x(UINT64_C( 5645659480511055559), UINT64_C(15272728730484288257)),
simde_x_mm_set_epu64x(UINT64_C( 2842141285989119849), UINT64_C( 1150808908183626913)) },
{ simde_x_mm_set_epu64x(UINT64_C( 2328100732832272381), UINT64_C( 4111895855610225675)),
simde_x_mm_set_epu64x(UINT64_C( 7345032902979795528), UINT64_C(12329133549512917827)),
simde_x_mm_set_epu64x(UINT64_C( 2328100732832272381), UINT64_C( 4111895855610225675)) },
{ simde_x_mm_set_epu64x(UINT64_C(17616907291198234572), UINT64_C(16521184395064581900)),
simde_x_mm_set_epu64x(UINT64_C(13555234896536583899), UINT64_C( 7851952110853286921)),
simde_x_mm_set_epu64x(UINT64_C( 4061672394661650673), UINT64_C( 817280173358008058)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i r = simde_mm_rem_epu64(test_vec[i].a, test_vec[i].b);
simde_assert_m128i_u64(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_rem_epi8(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_mm256_set_epi8(INT8_C( -65), INT8_C( -89), INT8_C( -30), INT8_C( 64),
INT8_C( -43), INT8_C( -54), INT8_C( 110), INT8_C( 113),
INT8_C( 89), INT8_C( -19), INT8_C( 70), INT8_C( -30),
INT8_C(-124), INT8_C( 91), INT8_C( -1), INT8_C( 88),
INT8_C( 104), INT8_C( 42), INT8_C( 53), INT8_C( -2),
INT8_C(-124), INT8_C( -2), INT8_C( 96), INT8_C( 75),
INT8_C( 31), INT8_C( 112), INT8_C(-105), INT8_C( -87),
INT8_C( -84), INT8_C( 94), INT8_C( 112), INT8_C( 90)),
simde_mm256_set_epi8(INT8_C( 121), INT8_C( 85), INT8_C(-103), INT8_C( 116),
INT8_C( -38), INT8_C( 21), INT8_C( 101), INT8_C( 122),
INT8_C( 10), INT8_C( -25), INT8_C( 54), INT8_C( 71),
INT8_C(-100), INT8_C(-107), INT8_C( -12), INT8_C( 84),
INT8_C(-108), INT8_C( 85), INT8_C( -86), INT8_C( -72),
INT8_C( 94), INT8_C(-102), INT8_C( -27), INT8_C( 11),
INT8_C( 70), INT8_C( -77), INT8_C( 121), INT8_C( -99),
INT8_C( -2), INT8_C( 70), INT8_C( 49), INT8_C( 125)),
simde_mm256_set_epi8(INT8_C( -65), INT8_C( -4), INT8_C( -30), INT8_C( 64),
INT8_C( -5), INT8_C( -12), INT8_C( 9), INT8_C( 113),
INT8_C( 9), INT8_C( -19), INT8_C( 16), INT8_C( -30),
INT8_C( -24), INT8_C( 91), INT8_C( -1), INT8_C( 4),
INT8_C( 104), INT8_C( 42), INT8_C( 53), INT8_C( -2),
INT8_C( -30), INT8_C( -2), INT8_C( 15), INT8_C( 9),
INT8_C( 31), INT8_C( 35), INT8_C(-105), INT8_C( -87),
INT8_C( 0), INT8_C( 24), INT8_C( 14), INT8_C( 90)) },
{ simde_mm256_set_epi8(INT8_C( 78), INT8_C( 89), INT8_C( 105), INT8_C( 98),
INT8_C( -78), INT8_C( -83), INT8_C(-122), INT8_C( -57),
INT8_C( -45), INT8_C( -13), INT8_C( -95), INT8_C( -36),
INT8_C( -85), INT8_C( 107), INT8_C( 43), INT8_C( 1),
INT8_C( -60), INT8_C( 36), INT8_C( 35), INT8_C( 54),
INT8_C( 94), INT8_C( 53), INT8_C(-124), INT8_C( -9),
INT8_C( -29), INT8_C( -20), INT8_C( 32), INT8_C( 119),
INT8_C( 124), INT8_C( 15), INT8_C( 15), INT8_C( -94)),
simde_mm256_set_epi8(INT8_C( -61), INT8_C( 49), INT8_C( 14), INT8_C( -86),
INT8_C( -53), INT8_C( -89), INT8_C( 3), INT8_C( -41),
INT8_C( 63), INT8_C( -8), INT8_C( 55), INT8_C( -37),
INT8_C( -35), INT8_C(-121), INT8_C( 61), INT8_C( -65),
INT8_C( -47), INT8_C( 91), INT8_C( 87), INT8_C(-119),
INT8_C( 87), INT8_C( 76), INT8_C( 44), INT8_C(-116),
INT8_C( 2), INT8_C( -56), INT8_C( 36), INT8_C( -61),
INT8_C( -56), INT8_C( 125), INT8_C( -2), INT8_C(-117)),
simde_mm256_set_epi8(INT8_C( 17), INT8_C( 40), INT8_C( 7), INT8_C( 12),
INT8_C( -25), INT8_C( -83), INT8_C( -2), INT8_C( -16),
INT8_C( -45), INT8_C( -5), INT8_C( -40), INT8_C( -36),
INT8_C( -15), INT8_C( 107), INT8_C( 43), INT8_C( 1),
INT8_C( -13), INT8_C( 36), INT8_C( 35), INT8_C( 54),
INT8_C( 7), INT8_C( 53), INT8_C( -36), INT8_C( -9),
INT8_C( -1), INT8_C( -20), INT8_C( 32), INT8_C( 58),
INT8_C( 12), INT8_C( 15), INT8_C( 1), INT8_C( -94)) },
{ simde_mm256_set_epi8(INT8_C( -22), INT8_C( 94), INT8_C( -16), INT8_C( 12),
INT8_C(-110), INT8_C( 1), INT8_C(-109), INT8_C( 59),
INT8_C( -3), INT8_C( 26), INT8_C( 26), INT8_C( 40),
INT8_C( 12), INT8_C( 2), INT8_C( -26), INT8_C(-111),
INT8_C( -86), INT8_C( 105), INT8_C( 111), INT8_C( -96),
INT8_C(-116), INT8_C( -54), INT8_C( -90), INT8_C( -36),
INT8_C( -69), INT8_C( 65), INT8_C( -6), INT8_C( -61),
INT8_C( 33), INT8_C(-125), INT8_C( 2), INT8_C( -92)),
simde_mm256_set_epi8(INT8_C( -79), INT8_C( -35), INT8_C( -5), INT8_C( -75),
INT8_C( -97), INT8_C( -74), INT8_C( 11), INT8_C( 11),
INT8_C( 39), INT8_C( 37), INT8_C( 39), INT8_C( -48),
INT8_C(-120), INT8_C( -76), INT8_C( -41), INT8_C(-117),
INT8_C(-112), INT8_C(-128), INT8_C( -53), INT8_C( -50),
INT8_C( -83), INT8_C( 36), INT8_C(-123), INT8_C( -81),
INT8_C( -25), INT8_C( 7), INT8_C( -20), INT8_C( 68),
INT8_C( -63), INT8_C( -35), INT8_C( 27), INT8_C( 8)),
simde_mm256_set_epi8(INT8_C( -22), INT8_C( 24), INT8_C( -1), INT8_C( 12),
INT8_C( -13), INT8_C( 1), INT8_C( -10), INT8_C( 4),
INT8_C( -3), INT8_C( 26), INT8_C( 26), INT8_C( 40),
INT8_C( 12), INT8_C( 2), INT8_C( -26), INT8_C(-111),
INT8_C( -86), INT8_C( 105), INT8_C( 5), INT8_C( -46),
INT8_C( -33), INT8_C( -18), INT8_C( -90), INT8_C( -36),
INT8_C( -19), INT8_C( 2), INT8_C( -6), INT8_C( -61),
INT8_C( 33), INT8_C( -20), INT8_C( 2), INT8_C( -4)) },
{ simde_mm256_set_epi8(INT8_C( 71), INT8_C( -23), INT8_C( 74), INT8_C( 125),
INT8_C( 81), INT8_C( -13), INT8_C(-117), INT8_C( -66),
INT8_C( 31), INT8_C( -80), INT8_C( 97), INT8_C( -3),
INT8_C( 123), INT8_C( -80), INT8_C( -40), INT8_C( 108),
INT8_C( -9), INT8_C( 97), INT8_C( 75), INT8_C( -53),
INT8_C(-128), INT8_C( -18), INT8_C( 79), INT8_C(-115),
INT8_C( 86), INT8_C( 29), INT8_C( -93), INT8_C( -49),
INT8_C( 111), INT8_C( -7), INT8_C(-117), INT8_C( -47)),
simde_mm256_set_epi8(INT8_C( 120), INT8_C( 127), INT8_C( 28), INT8_C( 95),
INT8_C( -81), INT8_C( -33), INT8_C( 119), INT8_C( -42),
INT8_C( -36), INT8_C( 102), INT8_C( 86), INT8_C( 22),
INT8_C( 119), INT8_C( -49), INT8_C( 12), INT8_C( -73),
INT8_C( -84), INT8_C( -14), INT8_C( -83), INT8_C( -7),
INT8_C( 52), INT8_C( 108), INT8_C(-128), INT8_C( -53),
INT8_C( 85), INT8_C(-121), INT8_C( -29), INT8_C( 35),
INT8_C( -69), INT8_C( 24), INT8_C( -6), INT8_C( -37)),
simde_mm256_set_epi8(INT8_C( 71), INT8_C( -23), INT8_C( 18), INT8_C( 30),
INT8_C( 0), INT8_C( -13), INT8_C(-117), INT8_C( -24),
INT8_C( 31), INT8_C( -80), INT8_C( 11), INT8_C( -3),
INT8_C( 4), INT8_C( -31), INT8_C( -4), INT8_C( 35),
INT8_C( -9), INT8_C( 13), INT8_C( 75), INT8_C( -4),
INT8_C( -24), INT8_C( -18), INT8_C( 79), INT8_C( -9),
INT8_C( 1), INT8_C( 29), INT8_C( -6), INT8_C( -14),
INT8_C( 42), INT8_C( -7), INT8_C( -3), INT8_C( -10)) },
{ simde_mm256_set_epi8(INT8_C( -72), INT8_C( 63), INT8_C( 95), INT8_C( -92),
INT8_C( 65), INT8_C( 71), INT8_C( -82), INT8_C( 88),
INT8_C( -73), INT8_C(-114), INT8_C( 98), INT8_C( 14),
INT8_C( 25), INT8_C( -83), INT8_C( 87), INT8_C( 2),
INT8_C( -65), INT8_C(-113), INT8_C(-104), INT8_C( 2),
INT8_C( 126), INT8_C( 0), INT8_C( -94), INT8_C( 57),
INT8_C( -11), INT8_C( 36), INT8_C( -17), INT8_C( 54),
INT8_C( 33), INT8_C( -91), INT8_C( -57), INT8_C( 84)),
simde_mm256_set_epi8(INT8_C( -82), INT8_C( 60), INT8_C(-124), INT8_C( -48),
INT8_C( 58), INT8_C( -78), INT8_C( 116), INT8_C( -16),
INT8_C( 37), INT8_C(-125), INT8_C( 100), INT8_C( -79),
INT8_C( 19), INT8_C( 102), INT8_C( 81), INT8_C( 86),
INT8_C( 25), INT8_C( 43), INT8_C( 51), INT8_C(-116),
INT8_C( 9), INT8_C( 40), INT8_C( -29), INT8_C( 75),
INT8_C( -48), INT8_C( -97), INT8_C( -81), INT8_C( 109),
INT8_C( -26), INT8_C( 87), INT8_C( -2), INT8_C( -40)),
simde_mm256_set_epi8(INT8_C( -72), INT8_C( 3), INT8_C( 95), INT8_C( -44),
INT8_C( 7), INT8_C( 71), INT8_C( -82), INT8_C( 8),
INT8_C( -36), INT8_C(-114), INT8_C( 98), INT8_C( 14),
INT8_C( 6), INT8_C( -83), INT8_C( 6), INT8_C( 2),
INT8_C( -15), INT8_C( -27), INT8_C( -2), INT8_C( 2),
INT8_C( 0), INT8_C( 0), INT8_C( -7), INT8_C( 57),
INT8_C( -11), INT8_C( 36), INT8_C( -17), INT8_C( 54),
INT8_C( 7), INT8_C( -4), INT8_C( -1), INT8_C( 4)) },
{ simde_mm256_set_epi8(INT8_C( 54), INT8_C( 43), INT8_C( 109), INT8_C( -69),
INT8_C(-118), INT8_C( 62), INT8_C( -34), INT8_C(-102),
INT8_C( 123), INT8_C( 21), INT8_C( -9), INT8_C( 99),
INT8_C( 37), INT8_C( 48), INT8_C( 116), INT8_C( -23),
INT8_C( 95), INT8_C( -5), INT8_C(-109), INT8_C( 109),
INT8_C( -51), INT8_C( -50), INT8_C( 57), INT8_C( 17),
INT8_C( 121), INT8_C( 25), INT8_C( 3), INT8_C( 55),
INT8_C( -78), INT8_C(-127), INT8_C(-107), INT8_C( -49)),
simde_mm256_set_epi8(INT8_C(-125), INT8_C( 42), INT8_C(-105), INT8_C( -46),
INT8_C( 12), INT8_C( -93), INT8_C(-118), INT8_C( -49),
INT8_C( 43), INT8_C( 57), INT8_C( 61), INT8_C( 62),
INT8_C( 81), INT8_C( -72), INT8_C( 6), INT8_C( 93),
INT8_C( -89), INT8_C( 1), INT8_C(-111), INT8_C( 9),
INT8_C( 4), INT8_C( 17), INT8_C( 10), INT8_C( 101),
INT8_C( -70), INT8_C( -75), INT8_C(-101), INT8_C( -13),
INT8_C( -67), INT8_C( -65), INT8_C( -34), INT8_C( -51)),
simde_mm256_set_epi8(INT8_C( 54), INT8_C( 1), INT8_C( 4), INT8_C( -23),
INT8_C( -10), INT8_C( 62), INT8_C( -34), INT8_C( -4),
INT8_C( 37), INT8_C( 21), INT8_C( -9), INT8_C( 37),
INT8_C( 37), INT8_C( 48), INT8_C( 2), INT8_C( -23),
INT8_C( 6), INT8_C( 0), INT8_C(-109), INT8_C( 1),
INT8_C( -3), INT8_C( -16), INT8_C( 7), INT8_C( 17),
INT8_C( 51), INT8_C( 25), INT8_C( 3), INT8_C( 3),
INT8_C( -11), INT8_C( -62), INT8_C( -5), INT8_C( -49)) },
{ simde_mm256_set_epi8(INT8_C( 23), INT8_C(-124), INT8_C( 106), INT8_C( 109),
INT8_C(-121), INT8_C( -53), INT8_C( 98), INT8_C( 120),
INT8_C( 101), INT8_C( 52), INT8_C( 82), INT8_C( 44),
INT8_C(-114), INT8_C( 14), INT8_C( 99), INT8_C( -11),
INT8_C( 8), INT8_C(-116), INT8_C(-115), INT8_C( 123),
INT8_C( -37), INT8_C( -93), INT8_C( -60), INT8_C( -23),
INT8_C( 34), INT8_C( -71), INT8_C( -28), INT8_C( 108),
INT8_C( 95), INT8_C( -20), INT8_C( 97), INT8_C( 41)),
simde_mm256_set_epi8(INT8_C( 125), INT8_C( -27), INT8_C( -53), INT8_C( 45),
INT8_C( 24), INT8_C( 5), INT8_C( 90), INT8_C( 83),
INT8_C(-111), INT8_C( 85), INT8_C(-100), INT8_C( -92),
INT8_C(-107), INT8_C( -55), INT8_C( 48), INT8_C( -1),
INT8_C( 41), INT8_C( 42), INT8_C( 94), INT8_C(-127),
INT8_C(-121), INT8_C( 8), INT8_C( 12), INT8_C( -53),
INT8_C(-128), INT8_C( -54), INT8_C(-108), INT8_C( -4),
INT8_C( 104), INT8_C( -48), INT8_C( 98), INT8_C( -94)),
simde_mm256_set_epi8(INT8_C( 23), INT8_C( -16), INT8_C( 0), INT8_C( 19),
INT8_C( -1), INT8_C( -3), INT8_C( 8), INT8_C( 37),
INT8_C( 101), INT8_C( 52), INT8_C( 82), INT8_C( 44),
INT8_C( -7), INT8_C( 14), INT8_C( 3), INT8_C( 0),
INT8_C( 8), INT8_C( -32), INT8_C( -21), INT8_C( 123),
INT8_C( -37), INT8_C( -5), INT8_C( 0), INT8_C( -23),
INT8_C( 34), INT8_C( -17), INT8_C( -28), INT8_C( 0),
INT8_C( 95), INT8_C( -20), INT8_C( 97), INT8_C( 41)) },
{ simde_mm256_set_epi8(INT8_C( -94), INT8_C( 31), INT8_C( -88), INT8_C( 17),
INT8_C( 50), INT8_C( 110), INT8_C( -25), INT8_C( -40),
INT8_C( 94), INT8_C( 20), INT8_C( -93), INT8_C( -73),
INT8_C( -99), INT8_C( 16), INT8_C( 91), INT8_C( 54),
INT8_C( 62), INT8_C( 81), INT8_C( -97), INT8_C(-105),
INT8_C( 57), INT8_C( 12), INT8_C( 118), INT8_C( 33),
INT8_C( -76), INT8_C(-117), INT8_C( 1), INT8_C( 5),
INT8_C( 78), INT8_C( 13), INT8_C( 93), INT8_C(-101)),
simde_mm256_set_epi8(INT8_C( -63), INT8_C( -26), INT8_C( 93), INT8_C( 23),
INT8_C( -63), INT8_C( 52), INT8_C( -33), INT8_C( -81),
INT8_C( -51), INT8_C( 45), INT8_C( -90), INT8_C( 24),
INT8_C( 71), INT8_C( -22), INT8_C( -95), INT8_C(-114),
INT8_C( -72), INT8_C( -38), INT8_C( -66), INT8_C( -44),
INT8_C( 116), INT8_C( -97), INT8_C( 44), INT8_C( 55),
INT8_C( -43), INT8_C(-123), INT8_C( 60), INT8_C( 3),
INT8_C( 58), INT8_C( -1), INT8_C( 125), INT8_C( -67)),
simde_mm256_set_epi8(INT8_C( -31), INT8_C( 5), INT8_C( -88), INT8_C( 17),
INT8_C( 50), INT8_C( 6), INT8_C( -25), INT8_C( -40),
INT8_C( 43), INT8_C( 20), INT8_C( -3), INT8_C( -1),
INT8_C( -28), INT8_C( 16), INT8_C( 91), INT8_C( 54),
INT8_C( 62), INT8_C( 5), INT8_C( -31), INT8_C( -17),
INT8_C( 57), INT8_C( 12), INT8_C( 30), INT8_C( 33),
INT8_C( -33), INT8_C(-117), INT8_C( 1), INT8_C( 2),
INT8_C( 20), INT8_C( 0), INT8_C( 93), INT8_C( -34)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_rem_epi8(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_i8(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_rem_epi16(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_mm256_set_epi16(INT16_C(-16473), INT16_C( -7616), INT16_C(-10806), INT16_C( 28273),
INT16_C( 23021), INT16_C( 18146), INT16_C(-31653), INT16_C( -168),
INT16_C( 26666), INT16_C( 13822), INT16_C(-31490), INT16_C( 24651),
INT16_C( 8048), INT16_C(-26711), INT16_C(-21410), INT16_C( 28762)),
simde_mm256_set_epi16(INT16_C( 10545), INT16_C(-21562), INT16_C( 10284), INT16_C( -3533),
INT16_C(-29991), INT16_C(-10247), INT16_C(-14043), INT16_C(-30435),
INT16_C( -5718), INT16_C( -3714), INT16_C(-18934), INT16_C(-12090),
INT16_C( 23938), INT16_C(-15439), INT16_C(-17441), INT16_C(-29699)),
simde_mm256_set_epi16(INT16_C( -5928), INT16_C( -7616), INT16_C( -522), INT16_C( 9),
INT16_C( 23021), INT16_C( 7899), INT16_C( -3567), INT16_C( -168),
INT16_C( 3794), INT16_C( 2680), INT16_C(-12556), INT16_C( 471),
INT16_C( 8048), INT16_C(-11272), INT16_C( -3969), INT16_C( 28762)) },
{ simde_mm256_set_epi16(INT16_C( 18192), INT16_C( 32532), INT16_C(-31836), INT16_C( -5163),
INT16_C( 20183), INT16_C( -1494), INT16_C( 2502), INT16_C( 18488),
INT16_C( 22771), INT16_C( 21470), INT16_C( 4556), INT16_C( 26138),
INT16_C( 19085), INT16_C( -923), INT16_C( -9934), INT16_C( -2165)),
simde_mm256_set_epi16(INT16_C(-26929), INT16_C( 8074), INT16_C( 18000), INT16_C(-29849),
INT16_C(-25121), INT16_C( 3254), INT16_C(-10254), INT16_C(-26681),
INT16_C(-24206), INT16_C(-28388), INT16_C( 25803), INT16_C( 25877),
INT16_C( 768), INT16_C( 16244), INT16_C( 11114), INT16_C( -7212)),
simde_mm256_set_epi16(INT16_C( 18192), INT16_C( 236), INT16_C(-13836), INT16_C( -5163),
INT16_C( 20183), INT16_C( -1494), INT16_C( 2502), INT16_C( 18488),
INT16_C( 22771), INT16_C( 21470), INT16_C( 4556), INT16_C( 261),
INT16_C( 653), INT16_C( -923), INT16_C( -9934), INT16_C( -2165)) },
{ simde_mm256_set_epi16(INT16_C( 7525), INT16_C( 3321), INT16_C(-18237), INT16_C( -1323),
INT16_C( 13644), INT16_C(-17383), INT16_C(-20302), INT16_C(-13836),
INT16_C( 7513), INT16_C( 1114), INT16_C( -200), INT16_C( 10389),
INT16_C(-31848), INT16_C( 9445), INT16_C( -5204), INT16_C(-24070)),
simde_mm256_set_epi16(INT16_C( 20057), INT16_C( 26978), INT16_C(-19795), INT16_C(-31033),
INT16_C(-11277), INT16_C(-24100), INT16_C(-21653), INT16_C( 11009),
INT16_C(-15324), INT16_C( 9014), INT16_C( 24117), INT16_C(-31497),
INT16_C( -7188), INT16_C( 8311), INT16_C( 31759), INT16_C( 4002)),
simde_mm256_set_epi16(INT16_C( 7525), INT16_C( 3321), INT16_C(-18237), INT16_C( -1323),
INT16_C( 2367), INT16_C(-17383), INT16_C(-20302), INT16_C( -2827),
INT16_C( 7513), INT16_C( 1114), INT16_C( -200), INT16_C( 10389),
INT16_C( -3096), INT16_C( 1134), INT16_C( -5204), INT16_C( -58)) },
{ simde_mm256_set_epi16(INT16_C( 26094), INT16_C(-13182), INT16_C( 30122), INT16_C(-17848),
INT16_C(-21735), INT16_C( -7772), INT16_C( 1809), INT16_C(-31933),
INT16_C( 8271), INT16_C( 4936), INT16_C( 7627), INT16_C( 20477),
INT16_C( 14608), INT16_C( 25470), INT16_C(-19700), INT16_C( 25611)),
simde_mm256_set_epi16(INT16_C(-17379), INT16_C( -8623), INT16_C(-10486), INT16_C(-16677),
INT16_C( 27895), INT16_C(-17193), INT16_C( -5943), INT16_C( -5111),
INT16_C( -2949), INT16_C(-11305), INT16_C(-13092), INT16_C( 8140),
INT16_C( -6841), INT16_C( 2476), INT16_C(-24435), INT16_C( 7948)),
simde_mm256_set_epi16(INT16_C( 8715), INT16_C( -4559), INT16_C( 9150), INT16_C( -1171),
INT16_C(-21735), INT16_C( -7772), INT16_C( 1809), INT16_C( -1267),
INT16_C( 2373), INT16_C( 4936), INT16_C( 7627), INT16_C( 4197),
INT16_C( 926), INT16_C( 710), INT16_C(-19700), INT16_C( 1767)) },
{ simde_mm256_set_epi16(INT16_C( 26466), INT16_C( 21183), INT16_C( 5811), INT16_C( 17016),
INT16_C(-14374), INT16_C(-18761), INT16_C(-11284), INT16_C( -933),
INT16_C( 30444), INT16_C( 20573), INT16_C(-14964), INT16_C( 25607),
INT16_C(-28815), INT16_C(-28739), INT16_C( 27147), INT16_C( -3265)),
simde_mm256_set_epi16(INT16_C( 26902), INT16_C(-14525), INT16_C( -7905), INT16_C( -8015),
INT16_C(-22131), INT16_C( 18318), INT16_C(-21513), INT16_C( 9770),
INT16_C( 4118), INT16_C(-32437), INT16_C( 6621), INT16_C( -7897),
INT16_C( 22002), INT16_C(-32381), INT16_C( 15537), INT16_C(-26793)),
simde_mm256_set_epi16(INT16_C( 26466), INT16_C( 6658), INT16_C( 5811), INT16_C( 986),
INT16_C(-14374), INT16_C( -443), INT16_C(-11284), INT16_C( -933),
INT16_C( 1618), INT16_C( 20573), INT16_C( -1722), INT16_C( 1916),
INT16_C( -6813), INT16_C(-28739), INT16_C( 11610), INT16_C( -3265)) },
{ simde_mm256_set_epi16(INT16_C( -5538), INT16_C( -4084), INT16_C(-28159), INT16_C(-27845),
INT16_C( -742), INT16_C( 6696), INT16_C( 3074), INT16_C( -6511),
INT16_C(-21911), INT16_C( 28576), INT16_C(-29494), INT16_C(-22820),
INT16_C(-17599), INT16_C( -1341), INT16_C( 8579), INT16_C( 676)),
simde_mm256_set_epi16(INT16_C(-10155), INT16_C(-12697), INT16_C( -5222), INT16_C(-32377),
INT16_C( 32076), INT16_C(-13716), INT16_C( 13383), INT16_C(-22332),
INT16_C( 18058), INT16_C(-22719), INT16_C( -8799), INT16_C(-25251),
INT16_C(-16195), INT16_C(-26213), INT16_C(-12331), INT16_C( 27016)),
simde_mm256_set_epi16(INT16_C( -5538), INT16_C( -4084), INT16_C( -2049), INT16_C(-27845),
INT16_C( -742), INT16_C( 6696), INT16_C( 3074), INT16_C( -6511),
INT16_C( -3853), INT16_C( 5857), INT16_C( -3097), INT16_C(-22820),
INT16_C( -1404), INT16_C( -1341), INT16_C( 8579), INT16_C( 676)) },
{ simde_mm256_set_epi16(INT16_C( 13886), INT16_C( 28688), INT16_C( 30551), INT16_C(-28928),
INT16_C( -9491), INT16_C(-26549), INT16_C( -738), INT16_C( 22350),
INT16_C( 7981), INT16_C(-15059), INT16_C(-18848), INT16_C( 16804),
INT16_C(-31876), INT16_C( -1787), INT16_C( 29649), INT16_C( -721)),
simde_mm256_set_epi16(INT16_C( 7566), INT16_C( 25511), INT16_C( -5831), INT16_C( 13989),
INT16_C( 13965), INT16_C(-31065), INT16_C( 77), INT16_C(-30384),
INT16_C( 21705), INT16_C(-23032), INT16_C( -2503), INT16_C( -8652),
INT16_C(-23147), INT16_C( -4009), INT16_C( 7598), INT16_C( 23051)),
simde_mm256_set_epi16(INT16_C( 6320), INT16_C( 3177), INT16_C( 1396), INT16_C( -950),
INT16_C( -9491), INT16_C(-26549), INT16_C( -45), INT16_C( 22350),
INT16_C( 7981), INT16_C(-15059), INT16_C( -1327), INT16_C( 8152),
INT16_C( -8729), INT16_C( -1787), INT16_C( 6855), INT16_C( -721)) },
{ simde_mm256_set_epi16(INT16_C( 26789), INT16_C(-25295), INT16_C(-31460), INT16_C(-29347),
INT16_C(-16029), INT16_C(-32645), INT16_C(-19836), INT16_C( 31541),
INT16_C(-32299), INT16_C(-14817), INT16_C( 22782), INT16_C(-18634),
INT16_C( -2744), INT16_C( 907), INT16_C( 9939), INT16_C( 395)),
simde_mm256_set_epi16(INT16_C( 18409), INT16_C( 19069), INT16_C( 20979), INT16_C(-29762),
INT16_C( 8112), INT16_C( 25085), INT16_C( 31664), INT16_C(-10132),
INT16_C( -2207), INT16_C( 19403), INT16_C(-32530), INT16_C( 20365),
INT16_C( 22045), INT16_C(-23601), INT16_C( 28665), INT16_C(-29743)),
simde_mm256_set_epi16(INT16_C( 8380), INT16_C( -6226), INT16_C(-10481), INT16_C(-29347),
INT16_C( -7917), INT16_C( -7560), INT16_C(-19836), INT16_C( 1145),
INT16_C( -1401), INT16_C(-14817), INT16_C( 22782), INT16_C(-18634),
INT16_C( -2744), INT16_C( 907), INT16_C( 9939), INT16_C( 395)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_rem_epi16(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_i16(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_rem_epi32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_mm256_set_epi32(INT32_C(-1079516608), INT32_C( -708153743), INT32_C( 1508722402), INT32_C(-2074345640),
INT32_C( 1747596798), INT32_C(-2063703989), INT32_C( 527472553), INT32_C(-1403096998)),
simde_mm256_set_epi32(INT32_C( 691121094), INT32_C( 674034227), INT32_C(-1965434887), INT32_C( -920286947),
INT32_C( -374673026), INT32_C(-1240805178), INT32_C( 1568850865), INT32_C(-1142977539)),
simde_mm256_set_epi32(INT32_C( -388395514), INT32_C( -34119516), INT32_C( 1508722402), INT32_C( -233771746),
INT32_C( 248904694), INT32_C( -822898811), INT32_C( 527472553), INT32_C( -260119459)) },
{ simde_mm256_set_epi32(INT32_C( 1192263444), INT32_C(-2086343723), INT32_C( 1322777130), INT32_C( 163989560),
INT32_C( 1492341726), INT32_C( 298608154), INT32_C( 1250819173), INT32_C( -650971253)),
simde_mm256_set_epi32(INT32_C(-1764810870), INT32_C( 1179683687), INT32_C(-1646326602), INT32_C( -671967289),
INT32_C(-1586327268), INT32_C( 1691051285), INT32_C( 50347892), INT32_C( 728425428)),
simde_mm256_set_epi32(INT32_C( 1192263444), INT32_C( -906660036), INT32_C( 1322777130), INT32_C( 163989560),
INT32_C( 1492341726), INT32_C( 298608154), INT32_C( 42469765), INT32_C( -650971253)) },
{ simde_mm256_set_epi32(INT32_C( 493161721), INT32_C(-1195115819), INT32_C( 894221337), INT32_C(-1330460172),
INT32_C( 492373082), INT32_C( -13096811), INT32_C(-2087181083), INT32_C( -341007878)),
simde_mm256_set_epi32(INT32_C( 1314482530), INT32_C(-1297250617), INT32_C( -739008036), INT32_C(-1419039999),
INT32_C(-1004264650), INT32_C( 1580565751), INT32_C( -471064457), INT32_C( 2081361826)),
simde_mm256_set_epi32(INT32_C( 493161721), INT32_C(-1195115819), INT32_C( 155213301), INT32_C(-1330460172),
INT32_C( 492373082), INT32_C( -13096811), INT32_C( -202923255), INT32_C( -341007878)) },
{ simde_mm256_set_epi32(INT32_C( 1710148738), INT32_C( 1974123080), INT32_C(-1424367196), INT32_C( 118588227),
INT32_C( 542053192), INT32_C( 499863549), INT32_C( 957375358), INT32_C(-1291033589)),
simde_mm256_set_epi32(INT32_C(-1138893231), INT32_C( -687161637), INT32_C( 1828175063), INT32_C( -389420023),
INT32_C( -193211433), INT32_C( -857989172), INT32_C( -448329300), INT32_C(-1601364212)),
simde_mm256_set_epi32(INT32_C( 571255507), INT32_C( 599799806), INT32_C(-1424367196), INT32_C( 118588227),
INT32_C( 155630326), INT32_C( 499863549), INT32_C( 60716758), INT32_C(-1291033589)) },
{ simde_mm256_set_epi32(INT32_C( 1734496959), INT32_C( 380846712), INT32_C( -941967689), INT32_C( -739443621),
INT32_C( 1995198557), INT32_C( -980655097), INT32_C(-1888383043), INT32_C( 1779168063)),
simde_mm256_set_epi32(INT32_C( 1763100483), INT32_C( -518004559), INT32_C(-1450358898), INT32_C(-1409866198),
INT32_C( 269910347), INT32_C( 433971495), INT32_C( 1441956227), INT32_C( 1018271575)),
simde_mm256_set_epi32(INT32_C( 1734496959), INT32_C( 380846712), INT32_C( -941967689), INT32_C( -739443621),
INT32_C( 105826128), INT32_C( -112712107), INT32_C( -446426816), INT32_C( 760896488)) },
{ simde_mm256_set_epi32(INT32_C( -362876916), INT32_C(-1845390533), INT32_C( -48621016), INT32_C( 201516689),
INT32_C(-1435930720), INT32_C(-1932876068), INT32_C(-1153303869), INT32_C( 562234020)),
simde_mm256_set_epi32(INT32_C( -665465241), INT32_C( -342195833), INT32_C( 2102184556), INT32_C( 877111492),
INT32_C( 1183491905), INT32_C( -576610979), INT32_C(-1061316197), INT32_C( -808097400)),
simde_mm256_set_epi32(INT32_C( -362876916), INT32_C( -134411368), INT32_C( -48621016), INT32_C( 201516689),
INT32_C( -252438815), INT32_C( -203043131), INT32_C( -91987672), INT32_C( 562234020)) },
{ simde_mm256_set_epi32(INT32_C( 910061584), INT32_C( 2002226944), INT32_C( -621963189), INT32_C( -48343218),
INT32_C( 523093293), INT32_C(-1235205724), INT32_C(-2088961787), INT32_C( 1943141679)),
simde_mm256_set_epi32(INT32_C( 495870887), INT32_C( -382126427), INT32_C( 915244711), INT32_C( 5081424),
INT32_C( 1422501384), INT32_C( -163979724), INT32_C(-1516900265), INT32_C( 497965579)),
simde_mm256_set_epi32(INT32_C( 414190697), INT32_C( 91594809), INT32_C( -621963189), INT32_C( -2610402),
INT32_C( 523093293), INT32_C( -87347656), INT32_C( -572061522), INT32_C( 449244942)) },
{ simde_mm256_set_epi32(INT32_C( 1755684145), INT32_C(-2061726371), INT32_C(-1050443653), INT32_C(-1299940555),
INT32_C(-2116696545), INT32_C( 1493088054), INT32_C( -179829877), INT32_C( 651362699)),
simde_mm256_set_epi32(INT32_C( 1206471293), INT32_C( 1374915518), INT32_C( 531653117), INT32_C( 2075187308),
INT32_C( -144618549), INT32_C(-2131865715), INT32_C( 1444783055), INT32_C( 1878625233)),
simde_mm256_set_epi32(INT32_C( 549212852), INT32_C( -686810853), INT32_C( -518790536), INT32_C(-1299940555),
INT32_C( -92036859), INT32_C( 1493088054), INT32_C( -179829877), INT32_C( 651362699)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_rem_epi32(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_i32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_rem_epi64(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_mm256_set_epi64x(INT64_C(-4636488523262038415), INT64_C( 6479913377553186648),
INT64_C( 7505871096235581515), INT64_C( 2265477367564496986)),
simde_mm256_set_epi64x(INT64_C( 2968342496979776051), INT64_C(-8441478558707775203),
INT64_C(-1609208390309195578), INT64_C( 6738163160628300797)),
simde_mm256_set_epi64x(INT64_C(-1668146026282262364), INT64_C( 6479913377553186648),
INT64_C( 1069037534998799203), INT64_C( 2265477367564496986)) },
{ simde_mm256_set_epi64x(INT64_C( 5120732502404950997), INT64_C( 5681284513410730040),
INT64_C( 6409558907924801050), INT64_C( 5372227444888762251)),
simde_mm256_set_epi64x(INT64_C(-7579804969095623833), INT64_C(-7070918910501808185),
INT64_C(-6813223735121976043), INT64_C( 216242550290965460)),
simde_mm256_set_epi64x(INT64_C( 5120732502404950997), INT64_C( 5681284513410730040),
INT64_C( 6409558907924801050), INT64_C( 182406237905591211)) },
{ simde_mm256_set_epi64x(INT64_C( 2118113466433927893), INT64_C( 3840651400764901876),
INT64_C( 2114726288902596757), INT64_C(-8964374488360902150)),
simde_mm256_set_epi64x(INT64_C( 5645659480511055559), INT64_C(-3174015343225263359),
INT64_C(-4313283826698320649), INT64_C(-2023206435041636446)),
simde_mm256_set_epi64x(INT64_C( 2118113466433927893), INT64_C( 666636057539638517),
INT64_C( 2114726288902596757), INT64_C( -871548748194356366)) },
{ simde_mm256_set_epi64x(INT64_C( 7345032902979795528), INT64_C(-6117610524196633789),
INT64_C( 2328100732832272381), INT64_C( 4111895855610225675)),
simde_mm256_set_epi64x(INT64_C(-4891509177172967717), INT64_C( 7851952110853286921),
INT64_C( -829836782511317044), INT64_C(-1925559678644969716)),
simde_mm256_set_epi64x(INT64_C( 2453523725806827811), INT64_C(-6117610524196633789),
INT64_C( 668427167809638293), INT64_C( 260776498320286243)) },
{ simde_mm256_set_epi64x(INT64_C( 7449607714297299576), INT64_C(-4045720414588175269),
INT64_C( 8569312554655704071), INT64_C(-8110543410226793665)),
simde_mm256_set_epi64x(INT64_C( 7572458917823766705), INT64_C(-6229244031487498710),
INT64_C( 1159256113650983207), INT64_C( 6193154838246823767)),
simde_mm256_set_epi64x(INT64_C( 7449607714297299576), INT64_C(-4045720414588175269),
INT64_C( 454519759098821622), INT64_C(-1917388571979969898)) },
{ simde_mm256_set_epi64x(INT64_C(-1558544484243762373), INT64_C( -208825673416776047),
INT64_C(-6167275479359641892), INT64_C(-4953402399143034204)),
simde_mm256_set_epi64x(INT64_C(-2858151442766986873), INT64_C( 9028813919053392068),
INT64_C( 5083059030774095197), INT64_C(-4558318353343223416)),
simde_mm256_set_epi64x(INT64_C(-1558544484243762373), INT64_C( -208825673416776047),
INT64_C(-1084216448585546695), INT64_C( -395084045799810788)) },
{ simde_mm256_set_epi64x(INT64_C( 3908684742628183808), INT64_C(-2671311551824242866),
INT64_C( 2246668589251707300), INT64_C(-8972022555815576273)),
simde_mm256_set_epi64x(INT64_C( 2129749246616352421), INT64_C( 3930946101587052880),
INT64_C( 6109596926925725236), INT64_C(-6515037028970767861)),
simde_mm256_set_epi64x(INT64_C( 1778935496011831387), INT64_C(-2671311551824242866),
INT64_C( 2246668589251707300), INT64_C(-2456985526844808412)) },
{ simde_mm256_set_epi64x(INT64_C( 7540605987113962845), INT64_C(-4511621132930745547),
INT64_C(-9091142434838104266), INT64_C( -772363439907339893)),
simde_mm256_set_epi64x(INT64_C( 5181754748372749246), INT64_C( 2283432752406648940),
INT64_C( -621131936186871923), INT64_C( 6205295972918594513)),
simde_mm256_set_epi64x(INT64_C( 2358851238741213599), INT64_C(-2228188380524096607),
INT64_C( -395295328221897344), INT64_C( -772363439907339893)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_rem_epi64(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_i64(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_rem_epu8(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_x_mm256_set_epu8(UINT8_C(191), UINT8_C(167), UINT8_C(226), UINT8_C( 64),
UINT8_C(213), UINT8_C(202), UINT8_C(110), UINT8_C(113),
UINT8_C( 89), UINT8_C(237), UINT8_C( 70), UINT8_C(226),
UINT8_C(132), UINT8_C( 91), UINT8_C(255), UINT8_C( 88),
UINT8_C(104), UINT8_C( 42), UINT8_C( 53), UINT8_C(254),
UINT8_C(132), UINT8_C(254), UINT8_C( 96), UINT8_C( 75),
UINT8_C( 31), UINT8_C(112), UINT8_C(151), UINT8_C(169),
UINT8_C(172), UINT8_C( 94), UINT8_C(112), UINT8_C( 90)),
simde_x_mm256_set_epu8(UINT8_C(121), UINT8_C( 85), UINT8_C(153), UINT8_C(116),
UINT8_C(218), UINT8_C( 21), UINT8_C(101), UINT8_C(122),
UINT8_C( 10), UINT8_C(231), UINT8_C( 54), UINT8_C( 71),
UINT8_C(156), UINT8_C(149), UINT8_C(244), UINT8_C( 84),
UINT8_C(148), UINT8_C( 85), UINT8_C(170), UINT8_C(184),
UINT8_C( 94), UINT8_C(154), UINT8_C(229), UINT8_C( 11),
UINT8_C( 70), UINT8_C(179), UINT8_C(121), UINT8_C(157),
UINT8_C(254), UINT8_C( 70), UINT8_C( 49), UINT8_C(125)),
simde_x_mm256_set_epu8(UINT8_C( 70), UINT8_C( 82), UINT8_C( 73), UINT8_C( 64),
UINT8_C(213), UINT8_C( 13), UINT8_C( 9), UINT8_C(113),
UINT8_C( 9), UINT8_C( 6), UINT8_C( 16), UINT8_C( 13),
UINT8_C(132), UINT8_C( 91), UINT8_C( 11), UINT8_C( 4),
UINT8_C(104), UINT8_C( 42), UINT8_C( 53), UINT8_C( 70),
UINT8_C( 38), UINT8_C(100), UINT8_C( 96), UINT8_C( 9),
UINT8_C( 31), UINT8_C(112), UINT8_C( 30), UINT8_C( 12),
UINT8_C(172), UINT8_C( 24), UINT8_C( 14), UINT8_C( 90)) },
{ simde_x_mm256_set_epu8(UINT8_C( 78), UINT8_C( 89), UINT8_C(105), UINT8_C( 98),
UINT8_C(178), UINT8_C(173), UINT8_C(134), UINT8_C(199),
UINT8_C(211), UINT8_C(243), UINT8_C(161), UINT8_C(220),
UINT8_C(171), UINT8_C(107), UINT8_C( 43), UINT8_C( 1),
UINT8_C(196), UINT8_C( 36), UINT8_C( 35), UINT8_C( 54),
UINT8_C( 94), UINT8_C( 53), UINT8_C(132), UINT8_C(247),
UINT8_C(227), UINT8_C(236), UINT8_C( 32), UINT8_C(119),
UINT8_C(124), UINT8_C( 15), UINT8_C( 15), UINT8_C(162)),
simde_x_mm256_set_epu8(UINT8_C(195), UINT8_C( 49), UINT8_C( 14), UINT8_C(170),
UINT8_C(203), UINT8_C(167), UINT8_C( 3), UINT8_C(215),
UINT8_C( 63), UINT8_C(248), UINT8_C( 55), UINT8_C(219),
UINT8_C(221), UINT8_C(135), UINT8_C( 61), UINT8_C(191),
UINT8_C(209), UINT8_C( 91), UINT8_C( 87), UINT8_C(137),
UINT8_C( 87), UINT8_C( 76), UINT8_C( 44), UINT8_C(140),
UINT8_C( 2), UINT8_C(200), UINT8_C( 36), UINT8_C(195),
UINT8_C(200), UINT8_C(125), UINT8_C(254), UINT8_C(139)),
simde_x_mm256_set_epu8(UINT8_C( 78), UINT8_C( 40), UINT8_C( 7), UINT8_C( 98),
UINT8_C(178), UINT8_C( 6), UINT8_C( 2), UINT8_C(199),
UINT8_C( 22), UINT8_C(243), UINT8_C( 51), UINT8_C( 1),
UINT8_C(171), UINT8_C(107), UINT8_C( 43), UINT8_C( 1),
UINT8_C(196), UINT8_C( 36), UINT8_C( 35), UINT8_C( 54),
UINT8_C( 7), UINT8_C( 53), UINT8_C( 0), UINT8_C(107),
UINT8_C( 1), UINT8_C( 36), UINT8_C( 32), UINT8_C(119),
UINT8_C(124), UINT8_C( 15), UINT8_C( 15), UINT8_C( 23)) },
{ simde_x_mm256_set_epu8(UINT8_C(234), UINT8_C( 94), UINT8_C(240), UINT8_C( 12),
UINT8_C(146), UINT8_C( 1), UINT8_C(147), UINT8_C( 59),
UINT8_C(253), UINT8_C( 26), UINT8_C( 26), UINT8_C( 40),
UINT8_C( 12), UINT8_C( 2), UINT8_C(230), UINT8_C(145),
UINT8_C(170), UINT8_C(105), UINT8_C(111), UINT8_C(160),
UINT8_C(140), UINT8_C(202), UINT8_C(166), UINT8_C(220),
UINT8_C(187), UINT8_C( 65), UINT8_C(250), UINT8_C(195),
UINT8_C( 33), UINT8_C(131), UINT8_C( 2), UINT8_C(164)),
simde_x_mm256_set_epu8(UINT8_C(177), UINT8_C(221), UINT8_C(251), UINT8_C(181),
UINT8_C(159), UINT8_C(182), UINT8_C( 11), UINT8_C( 11),
UINT8_C( 39), UINT8_C( 37), UINT8_C( 39), UINT8_C(208),
UINT8_C(136), UINT8_C(180), UINT8_C(215), UINT8_C(139),
UINT8_C(144), UINT8_C(128), UINT8_C(203), UINT8_C(206),
UINT8_C(173), UINT8_C( 36), UINT8_C(133), UINT8_C(175),
UINT8_C(231), UINT8_C( 7), UINT8_C(236), UINT8_C( 68),
UINT8_C(193), UINT8_C(221), UINT8_C( 27), UINT8_C( 8)),
simde_x_mm256_set_epu8(UINT8_C( 57), UINT8_C( 94), UINT8_C(240), UINT8_C( 12),
UINT8_C(146), UINT8_C( 1), UINT8_C( 4), UINT8_C( 4),
UINT8_C( 19), UINT8_C( 26), UINT8_C( 26), UINT8_C( 40),
UINT8_C( 12), UINT8_C( 2), UINT8_C( 15), UINT8_C( 6),
UINT8_C( 26), UINT8_C(105), UINT8_C(111), UINT8_C(160),
UINT8_C(140), UINT8_C( 22), UINT8_C( 33), UINT8_C( 45),
UINT8_C(187), UINT8_C( 2), UINT8_C( 14), UINT8_C( 59),
UINT8_C( 33), UINT8_C(131), UINT8_C( 2), UINT8_C( 4)) },
{ simde_x_mm256_set_epu8(UINT8_C( 71), UINT8_C(233), UINT8_C( 74), UINT8_C(125),
UINT8_C( 81), UINT8_C(243), UINT8_C(139), UINT8_C(190),
UINT8_C( 31), UINT8_C(176), UINT8_C( 97), UINT8_C(253),
UINT8_C(123), UINT8_C(176), UINT8_C(216), UINT8_C(108),
UINT8_C(247), UINT8_C( 97), UINT8_C( 75), UINT8_C(203),
UINT8_C(128), UINT8_C(238), UINT8_C( 79), UINT8_C(141),
UINT8_C( 86), UINT8_C( 29), UINT8_C(163), UINT8_C(207),
UINT8_C(111), UINT8_C(249), UINT8_C(139), UINT8_C(209)),
simde_x_mm256_set_epu8(UINT8_C(120), UINT8_C(127), UINT8_C( 28), UINT8_C( 95),
UINT8_C(175), UINT8_C(223), UINT8_C(119), UINT8_C(214),
UINT8_C(220), UINT8_C(102), UINT8_C( 86), UINT8_C( 22),
UINT8_C(119), UINT8_C(207), UINT8_C( 12), UINT8_C(183),
UINT8_C(172), UINT8_C(242), UINT8_C(173), UINT8_C(249),
UINT8_C( 52), UINT8_C(108), UINT8_C(128), UINT8_C(203),
UINT8_C( 85), UINT8_C(135), UINT8_C(227), UINT8_C( 35),
UINT8_C(187), UINT8_C( 24), UINT8_C(250), UINT8_C(219)),
simde_x_mm256_set_epu8(UINT8_C( 71), UINT8_C(106), UINT8_C( 18), UINT8_C( 30),
UINT8_C( 81), UINT8_C( 20), UINT8_C( 20), UINT8_C(190),
UINT8_C( 31), UINT8_C( 74), UINT8_C( 11), UINT8_C( 11),
UINT8_C( 4), UINT8_C(176), UINT8_C( 0), UINT8_C(108),
UINT8_C( 75), UINT8_C( 97), UINT8_C( 75), UINT8_C(203),
UINT8_C( 24), UINT8_C( 22), UINT8_C( 79), UINT8_C(141),
UINT8_C( 1), UINT8_C( 29), UINT8_C(163), UINT8_C( 32),
UINT8_C(111), UINT8_C( 9), UINT8_C(139), UINT8_C(209)) },
{ simde_x_mm256_set_epu8(UINT8_C(184), UINT8_C( 63), UINT8_C( 95), UINT8_C(164),
UINT8_C( 65), UINT8_C( 71), UINT8_C(174), UINT8_C( 88),
UINT8_C(183), UINT8_C(142), UINT8_C( 98), UINT8_C( 14),
UINT8_C( 25), UINT8_C(173), UINT8_C( 87), UINT8_C( 2),
UINT8_C(191), UINT8_C(143), UINT8_C(152), UINT8_C( 2),
UINT8_C(126), UINT8_C( 0), UINT8_C(162), UINT8_C( 57),
UINT8_C(245), UINT8_C( 36), UINT8_C(239), UINT8_C( 54),
UINT8_C( 33), UINT8_C(165), UINT8_C(199), UINT8_C( 84)),
simde_x_mm256_set_epu8(UINT8_C(174), UINT8_C( 60), UINT8_C(132), UINT8_C(208),
UINT8_C( 58), UINT8_C(178), UINT8_C(116), UINT8_C(240),
UINT8_C( 37), UINT8_C(131), UINT8_C(100), UINT8_C(177),
UINT8_C( 19), UINT8_C(102), UINT8_C( 81), UINT8_C( 86),
UINT8_C( 25), UINT8_C( 43), UINT8_C( 51), UINT8_C(140),
UINT8_C( 9), UINT8_C( 40), UINT8_C(227), UINT8_C( 75),
UINT8_C(208), UINT8_C(159), UINT8_C(175), UINT8_C(109),
UINT8_C(230), UINT8_C( 87), UINT8_C(254), UINT8_C(216)),
simde_x_mm256_set_epu8(UINT8_C( 10), UINT8_C( 3), UINT8_C( 95), UINT8_C(164),
UINT8_C( 7), UINT8_C( 71), UINT8_C( 58), UINT8_C( 88),
UINT8_C( 35), UINT8_C( 11), UINT8_C( 98), UINT8_C( 14),
UINT8_C( 6), UINT8_C( 71), UINT8_C( 6), UINT8_C( 2),
UINT8_C( 16), UINT8_C( 14), UINT8_C( 50), UINT8_C( 2),
UINT8_C( 0), UINT8_C( 0), UINT8_C(162), UINT8_C( 57),
UINT8_C( 37), UINT8_C( 36), UINT8_C( 64), UINT8_C( 54),
UINT8_C( 33), UINT8_C( 78), UINT8_C(199), UINT8_C( 84)) },
{ simde_x_mm256_set_epu8(UINT8_C( 54), UINT8_C( 43), UINT8_C(109), UINT8_C(187),
UINT8_C(138), UINT8_C( 62), UINT8_C(222), UINT8_C(154),
UINT8_C(123), UINT8_C( 21), UINT8_C(247), UINT8_C( 99),
UINT8_C( 37), UINT8_C( 48), UINT8_C(116), UINT8_C(233),
UINT8_C( 95), UINT8_C(251), UINT8_C(147), UINT8_C(109),
UINT8_C(205), UINT8_C(206), UINT8_C( 57), UINT8_C( 17),
UINT8_C(121), UINT8_C( 25), UINT8_C( 3), UINT8_C( 55),
UINT8_C(178), UINT8_C(129), UINT8_C(149), UINT8_C(207)),
simde_x_mm256_set_epu8(UINT8_C(131), UINT8_C( 42), UINT8_C(151), UINT8_C(210),
UINT8_C( 12), UINT8_C(163), UINT8_C(138), UINT8_C(207),
UINT8_C( 43), UINT8_C( 57), UINT8_C( 61), UINT8_C( 62),
UINT8_C( 81), UINT8_C(184), UINT8_C( 6), UINT8_C( 93),
UINT8_C(167), UINT8_C( 1), UINT8_C(145), UINT8_C( 9),
UINT8_C( 4), UINT8_C( 17), UINT8_C( 10), UINT8_C(101),
UINT8_C(186), UINT8_C(181), UINT8_C(155), UINT8_C(243),
UINT8_C(189), UINT8_C(191), UINT8_C(222), UINT8_C(205)),
simde_x_mm256_set_epu8(UINT8_C( 54), UINT8_C( 1), UINT8_C(109), UINT8_C(187),
UINT8_C( 6), UINT8_C( 62), UINT8_C( 84), UINT8_C(154),
UINT8_C( 37), UINT8_C( 21), UINT8_C( 3), UINT8_C( 37),
UINT8_C( 37), UINT8_C( 48), UINT8_C( 2), UINT8_C( 47),
UINT8_C( 95), UINT8_C( 0), UINT8_C( 2), UINT8_C( 1),
UINT8_C( 1), UINT8_C( 2), UINT8_C( 7), UINT8_C( 17),
UINT8_C(121), UINT8_C( 25), UINT8_C( 3), UINT8_C( 55),
UINT8_C(178), UINT8_C(129), UINT8_C(149), UINT8_C( 2)) },
{ simde_x_mm256_set_epu8(UINT8_C( 23), UINT8_C(132), UINT8_C(106), UINT8_C(109),
UINT8_C(135), UINT8_C(203), UINT8_C( 98), UINT8_C(120),
UINT8_C(101), UINT8_C( 52), UINT8_C( 82), UINT8_C( 44),
UINT8_C(142), UINT8_C( 14), UINT8_C( 99), UINT8_C(245),
UINT8_C( 8), UINT8_C(140), UINT8_C(141), UINT8_C(123),
UINT8_C(219), UINT8_C(163), UINT8_C(196), UINT8_C(233),
UINT8_C( 34), UINT8_C(185), UINT8_C(228), UINT8_C(108),
UINT8_C( 95), UINT8_C(236), UINT8_C( 97), UINT8_C( 41)),
simde_x_mm256_set_epu8(UINT8_C(125), UINT8_C(229), UINT8_C(203), UINT8_C( 45),
UINT8_C( 24), UINT8_C( 5), UINT8_C( 90), UINT8_C( 83),
UINT8_C(145), UINT8_C( 85), UINT8_C(156), UINT8_C(164),
UINT8_C(149), UINT8_C(201), UINT8_C( 48), UINT8_C(255),
UINT8_C( 41), UINT8_C( 42), UINT8_C( 94), UINT8_C(129),
UINT8_C(135), UINT8_C( 8), UINT8_C( 12), UINT8_C(203),
UINT8_C(128), UINT8_C(202), UINT8_C(148), UINT8_C(252),
UINT8_C(104), UINT8_C(208), UINT8_C( 98), UINT8_C(162)),
simde_x_mm256_set_epu8(UINT8_C( 23), UINT8_C(132), UINT8_C(106), UINT8_C( 19),
UINT8_C( 15), UINT8_C( 3), UINT8_C( 8), UINT8_C( 37),
UINT8_C(101), UINT8_C( 52), UINT8_C( 82), UINT8_C( 44),
UINT8_C(142), UINT8_C( 14), UINT8_C( 3), UINT8_C(245),
UINT8_C( 8), UINT8_C( 14), UINT8_C( 47), UINT8_C(123),
UINT8_C( 84), UINT8_C( 3), UINT8_C( 4), UINT8_C( 30),
UINT8_C( 34), UINT8_C(185), UINT8_C( 80), UINT8_C(108),
UINT8_C( 95), UINT8_C( 28), UINT8_C( 97), UINT8_C( 41)) },
{ simde_x_mm256_set_epu8(UINT8_C(162), UINT8_C( 31), UINT8_C(168), UINT8_C( 17),
UINT8_C( 50), UINT8_C(110), UINT8_C(231), UINT8_C(216),
UINT8_C( 94), UINT8_C( 20), UINT8_C(163), UINT8_C(183),
UINT8_C(157), UINT8_C( 16), UINT8_C( 91), UINT8_C( 54),
UINT8_C( 62), UINT8_C( 81), UINT8_C(159), UINT8_C(151),
UINT8_C( 57), UINT8_C( 12), UINT8_C(118), UINT8_C( 33),
UINT8_C(180), UINT8_C(139), UINT8_C( 1), UINT8_C( 5),
UINT8_C( 78), UINT8_C( 13), UINT8_C( 93), UINT8_C(155)),
simde_x_mm256_set_epu8(UINT8_C(193), UINT8_C(230), UINT8_C( 93), UINT8_C( 23),
UINT8_C(193), UINT8_C( 52), UINT8_C(223), UINT8_C(175),
UINT8_C(205), UINT8_C( 45), UINT8_C(166), UINT8_C( 24),
UINT8_C( 71), UINT8_C(234), UINT8_C(161), UINT8_C(142),
UINT8_C(184), UINT8_C(218), UINT8_C(190), UINT8_C(212),
UINT8_C(116), UINT8_C(159), UINT8_C( 44), UINT8_C( 55),
UINT8_C(213), UINT8_C(133), UINT8_C( 60), UINT8_C( 3),
UINT8_C( 58), UINT8_C(255), UINT8_C(125), UINT8_C(189)),
simde_x_mm256_set_epu8(UINT8_C(162), UINT8_C( 31), UINT8_C( 75), UINT8_C( 17),
UINT8_C( 50), UINT8_C( 6), UINT8_C( 8), UINT8_C( 41),
UINT8_C( 94), UINT8_C( 20), UINT8_C(163), UINT8_C( 15),
UINT8_C( 15), UINT8_C( 16), UINT8_C( 91), UINT8_C( 54),
UINT8_C( 62), UINT8_C( 81), UINT8_C(159), UINT8_C(151),
UINT8_C( 57), UINT8_C( 12), UINT8_C( 30), UINT8_C( 33),
UINT8_C(180), UINT8_C( 6), UINT8_C( 1), UINT8_C( 2),
UINT8_C( 20), UINT8_C( 13), UINT8_C( 93), UINT8_C(155)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_rem_epu8(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_u8(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_rem_epu16(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_x_mm256_set_epu16(UINT16_C( 49063), UINT16_C( 57920), UINT16_C( 54730), UINT16_C( 28273),
UINT16_C( 23021), UINT16_C( 18146), UINT16_C( 33883), UINT16_C( 65368),
UINT16_C( 26666), UINT16_C( 13822), UINT16_C( 34046), UINT16_C( 24651),
UINT16_C( 8048), UINT16_C( 38825), UINT16_C( 44126), UINT16_C( 28762)),
simde_x_mm256_set_epu16(UINT16_C( 10545), UINT16_C( 43974), UINT16_C( 10284), UINT16_C( 62003),
UINT16_C( 35545), UINT16_C( 55289), UINT16_C( 51493), UINT16_C( 35101),
UINT16_C( 59818), UINT16_C( 61822), UINT16_C( 46602), UINT16_C( 53446),
UINT16_C( 23938), UINT16_C( 50097), UINT16_C( 48095), UINT16_C( 35837)),
simde_x_mm256_set_epu16(UINT16_C( 6883), UINT16_C( 13946), UINT16_C( 3310), UINT16_C( 28273),
UINT16_C( 23021), UINT16_C( 18146), UINT16_C( 33883), UINT16_C( 30267),
UINT16_C( 26666), UINT16_C( 13822), UINT16_C( 34046), UINT16_C( 24651),
UINT16_C( 8048), UINT16_C( 38825), UINT16_C( 44126), UINT16_C( 28762)) },
{ simde_x_mm256_set_epu16(UINT16_C( 18192), UINT16_C( 32532), UINT16_C( 33700), UINT16_C( 60373),
UINT16_C( 20183), UINT16_C( 64042), UINT16_C( 2502), UINT16_C( 18488),
UINT16_C( 22771), UINT16_C( 21470), UINT16_C( 4556), UINT16_C( 26138),
UINT16_C( 19085), UINT16_C( 64613), UINT16_C( 55602), UINT16_C( 63371)),
simde_x_mm256_set_epu16(UINT16_C( 38607), UINT16_C( 8074), UINT16_C( 18000), UINT16_C( 35687),
UINT16_C( 40415), UINT16_C( 3254), UINT16_C( 55282), UINT16_C( 38855),
UINT16_C( 41330), UINT16_C( 37148), UINT16_C( 25803), UINT16_C( 25877),
UINT16_C( 768), UINT16_C( 16244), UINT16_C( 11114), UINT16_C( 58324)),
simde_x_mm256_set_epu16(UINT16_C( 18192), UINT16_C( 236), UINT16_C( 15700), UINT16_C( 24686),
UINT16_C( 20183), UINT16_C( 2216), UINT16_C( 2502), UINT16_C( 18488),
UINT16_C( 22771), UINT16_C( 21470), UINT16_C( 4556), UINT16_C( 261),
UINT16_C( 653), UINT16_C( 15881), UINT16_C( 32), UINT16_C( 5047)) },
{ simde_x_mm256_set_epu16(UINT16_C( 7525), UINT16_C( 3321), UINT16_C( 47299), UINT16_C( 64213),
UINT16_C( 13644), UINT16_C( 48153), UINT16_C( 45234), UINT16_C( 51700),
UINT16_C( 7513), UINT16_C( 1114), UINT16_C( 65336), UINT16_C( 10389),
UINT16_C( 33688), UINT16_C( 9445), UINT16_C( 60332), UINT16_C( 41466)),
simde_x_mm256_set_epu16(UINT16_C( 20057), UINT16_C( 26978), UINT16_C( 45741), UINT16_C( 34503),
UINT16_C( 54259), UINT16_C( 41436), UINT16_C( 43883), UINT16_C( 11009),
UINT16_C( 50212), UINT16_C( 9014), UINT16_C( 24117), UINT16_C( 34039),
UINT16_C( 58348), UINT16_C( 8311), UINT16_C( 31759), UINT16_C( 4002)),
simde_x_mm256_set_epu16(UINT16_C( 7525), UINT16_C( 3321), UINT16_C( 1558), UINT16_C( 29710),
UINT16_C( 13644), UINT16_C( 6717), UINT16_C( 1351), UINT16_C( 7664),
UINT16_C( 7513), UINT16_C( 1114), UINT16_C( 17102), UINT16_C( 10389),
UINT16_C( 33688), UINT16_C( 1134), UINT16_C( 28573), UINT16_C( 1446)) },
{ simde_x_mm256_set_epu16(UINT16_C( 26094), UINT16_C( 52354), UINT16_C( 30122), UINT16_C( 47688),
UINT16_C( 43801), UINT16_C( 57764), UINT16_C( 1809), UINT16_C( 33603),
UINT16_C( 8271), UINT16_C( 4936), UINT16_C( 7627), UINT16_C( 20477),
UINT16_C( 14608), UINT16_C( 25470), UINT16_C( 45836), UINT16_C( 25611)),
simde_x_mm256_set_epu16(UINT16_C( 48157), UINT16_C( 56913), UINT16_C( 55050), UINT16_C( 48859),
UINT16_C( 27895), UINT16_C( 48343), UINT16_C( 59593), UINT16_C( 60425),
UINT16_C( 62587), UINT16_C( 54231), UINT16_C( 52444), UINT16_C( 8140),
UINT16_C( 58695), UINT16_C( 2476), UINT16_C( 41101), UINT16_C( 7948)),
simde_x_mm256_set_epu16(UINT16_C( 26094), UINT16_C( 52354), UINT16_C( 30122), UINT16_C( 47688),
UINT16_C( 15906), UINT16_C( 9421), UINT16_C( 1809), UINT16_C( 33603),
UINT16_C( 8271), UINT16_C( 4936), UINT16_C( 7627), UINT16_C( 4197),
UINT16_C( 14608), UINT16_C( 710), UINT16_C( 4735), UINT16_C( 1767)) },
{ simde_x_mm256_set_epu16(UINT16_C( 26466), UINT16_C( 21183), UINT16_C( 5811), UINT16_C( 17016),
UINT16_C( 51162), UINT16_C( 46775), UINT16_C( 54252), UINT16_C( 64603),
UINT16_C( 30444), UINT16_C( 20573), UINT16_C( 50572), UINT16_C( 25607),
UINT16_C( 36721), UINT16_C( 36797), UINT16_C( 27147), UINT16_C( 62271)),
simde_x_mm256_set_epu16(UINT16_C( 26902), UINT16_C( 51011), UINT16_C( 57631), UINT16_C( 57521),
UINT16_C( 43405), UINT16_C( 18318), UINT16_C( 44023), UINT16_C( 9770),
UINT16_C( 4118), UINT16_C( 33099), UINT16_C( 6621), UINT16_C( 57639),
UINT16_C( 22002), UINT16_C( 33155), UINT16_C( 15537), UINT16_C( 38743)),
simde_x_mm256_set_epu16(UINT16_C( 26466), UINT16_C( 21183), UINT16_C( 5811), UINT16_C( 17016),
UINT16_C( 7757), UINT16_C( 10139), UINT16_C( 10229), UINT16_C( 5983),
UINT16_C( 1618), UINT16_C( 20573), UINT16_C( 4225), UINT16_C( 25607),
UINT16_C( 14719), UINT16_C( 3642), UINT16_C( 11610), UINT16_C( 23528)) },
{ simde_x_mm256_set_epu16(UINT16_C( 59998), UINT16_C( 61452), UINT16_C( 37377), UINT16_C( 37691),
UINT16_C( 64794), UINT16_C( 6696), UINT16_C( 3074), UINT16_C( 59025),
UINT16_C( 43625), UINT16_C( 28576), UINT16_C( 36042), UINT16_C( 42716),
UINT16_C( 47937), UINT16_C( 64195), UINT16_C( 8579), UINT16_C( 676)),
simde_x_mm256_set_epu16(UINT16_C( 55381), UINT16_C( 52839), UINT16_C( 60314), UINT16_C( 33159),
UINT16_C( 32076), UINT16_C( 51820), UINT16_C( 13383), UINT16_C( 43204),
UINT16_C( 18058), UINT16_C( 42817), UINT16_C( 56737), UINT16_C( 40285),
UINT16_C( 49341), UINT16_C( 39323), UINT16_C( 53205), UINT16_C( 27016)),
simde_x_mm256_set_epu16(UINT16_C( 4617), UINT16_C( 8613), UINT16_C( 37377), UINT16_C( 4532),
UINT16_C( 642), UINT16_C( 6696), UINT16_C( 3074), UINT16_C( 15821),
UINT16_C( 7509), UINT16_C( 28576), UINT16_C( 36042), UINT16_C( 2431),
UINT16_C( 47937), UINT16_C( 24872), UINT16_C( 8579), UINT16_C( 676)) },
{ simde_x_mm256_set_epu16(UINT16_C( 13886), UINT16_C( 28688), UINT16_C( 30551), UINT16_C( 36608),
UINT16_C( 56045), UINT16_C( 38987), UINT16_C( 64798), UINT16_C( 22350),
UINT16_C( 7981), UINT16_C( 50477), UINT16_C( 46688), UINT16_C( 16804),
UINT16_C( 33660), UINT16_C( 63749), UINT16_C( 29649), UINT16_C( 64815)),
simde_x_mm256_set_epu16(UINT16_C( 7566), UINT16_C( 25511), UINT16_C( 59705), UINT16_C( 13989),
UINT16_C( 13965), UINT16_C( 34471), UINT16_C( 77), UINT16_C( 35152),
UINT16_C( 21705), UINT16_C( 42504), UINT16_C( 63033), UINT16_C( 56884),
UINT16_C( 42389), UINT16_C( 61527), UINT16_C( 7598), UINT16_C( 23051)),
simde_x_mm256_set_epu16(UINT16_C( 6320), UINT16_C( 3177), UINT16_C( 30551), UINT16_C( 8630),
UINT16_C( 185), UINT16_C( 4516), UINT16_C( 41), UINT16_C( 22350),
UINT16_C( 7981), UINT16_C( 7973), UINT16_C( 46688), UINT16_C( 16804),
UINT16_C( 33660), UINT16_C( 2222), UINT16_C( 6855), UINT16_C( 18713)) },
{ simde_x_mm256_set_epu16(UINT16_C( 26789), UINT16_C( 40241), UINT16_C( 34076), UINT16_C( 36189),
UINT16_C( 49507), UINT16_C( 32891), UINT16_C( 45700), UINT16_C( 31541),
UINT16_C( 33237), UINT16_C( 50719), UINT16_C( 22782), UINT16_C( 46902),
UINT16_C( 62792), UINT16_C( 907), UINT16_C( 9939), UINT16_C( 395)),
simde_x_mm256_set_epu16(UINT16_C( 18409), UINT16_C( 19069), UINT16_C( 20979), UINT16_C( 35774),
UINT16_C( 8112), UINT16_C( 25085), UINT16_C( 31664), UINT16_C( 55404),
UINT16_C( 63329), UINT16_C( 19403), UINT16_C( 33006), UINT16_C( 20365),
UINT16_C( 22045), UINT16_C( 41935), UINT16_C( 28665), UINT16_C( 35793)),
simde_x_mm256_set_epu16(UINT16_C( 8380), UINT16_C( 2103), UINT16_C( 13097), UINT16_C( 415),
UINT16_C( 835), UINT16_C( 7806), UINT16_C( 14036), UINT16_C( 31541),
UINT16_C( 33237), UINT16_C( 11913), UINT16_C( 22782), UINT16_C( 6172),
UINT16_C( 18702), UINT16_C( 907), UINT16_C( 9939), UINT16_C( 395)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_rem_epu16(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_u16(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_rem_epu32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_x_mm256_set_epu32(UINT32_C(3215450688), UINT32_C(3586813553), UINT32_C(1508722402), UINT32_C(2220621656),
UINT32_C(1747596798), UINT32_C(2231263307), UINT32_C( 527472553), UINT32_C(2891870298)),
simde_x_mm256_set_epu32(UINT32_C( 691121094), UINT32_C( 674034227), UINT32_C(2329532409), UINT32_C(3374680349),
UINT32_C(3920294270), UINT32_C(3054162118), UINT32_C(1568850865), UINT32_C(3151989757)),
simde_x_mm256_set_epu32(UINT32_C( 450966312), UINT32_C( 216642418), UINT32_C(1508722402), UINT32_C(2220621656),
UINT32_C(1747596798), UINT32_C(2231263307), UINT32_C( 527472553), UINT32_C(2891870298)) },
{ simde_x_mm256_set_epu32(UINT32_C(1192263444), UINT32_C(2208623573), UINT32_C(1322777130), UINT32_C( 163989560),
UINT32_C(1492341726), UINT32_C( 298608154), UINT32_C(1250819173), UINT32_C(3643996043)),
simde_x_mm256_set_epu32(UINT32_C(2530156426), UINT32_C(1179683687), UINT32_C(2648640694), UINT32_C(3623000007),
UINT32_C(2708640028), UINT32_C(1691051285), UINT32_C( 50347892), UINT32_C( 728425428)),
simde_x_mm256_set_epu32(UINT32_C(1192263444), UINT32_C(1028939886), UINT32_C(1322777130), UINT32_C( 163989560),
UINT32_C(1492341726), UINT32_C( 298608154), UINT32_C( 42469765), UINT32_C( 1868903)) },
{ simde_x_mm256_set_epu32(UINT32_C( 493161721), UINT32_C(3099851477), UINT32_C( 894221337), UINT32_C(2964507124),
UINT32_C( 492373082), UINT32_C(4281870485), UINT32_C(2207786213), UINT32_C(3953959418)),
simde_x_mm256_set_epu32(UINT32_C(1314482530), UINT32_C(2997716679), UINT32_C(3555959260), UINT32_C(2875927297),
UINT32_C(3290702646), UINT32_C(1580565751), UINT32_C(3823902839), UINT32_C(2081361826)),
simde_x_mm256_set_epu32(UINT32_C( 493161721), UINT32_C( 102134798), UINT32_C( 894221337), UINT32_C( 88579827),
UINT32_C( 492373082), UINT32_C(1120738983), UINT32_C(2207786213), UINT32_C(1872597592)) },
{ simde_x_mm256_set_epu32(UINT32_C(1710148738), UINT32_C(1974123080), UINT32_C(2870600100), UINT32_C( 118588227),
UINT32_C( 542053192), UINT32_C( 499863549), UINT32_C( 957375358), UINT32_C(3003933707)),
simde_x_mm256_set_epu32(UINT32_C(3156074065), UINT32_C(3607805659), UINT32_C(1828175063), UINT32_C(3905547273),
UINT32_C(4101755863), UINT32_C(3436978124), UINT32_C(3846637996), UINT32_C(2693603084)),
simde_x_mm256_set_epu32(UINT32_C(1710148738), UINT32_C(1974123080), UINT32_C(1042425037), UINT32_C( 118588227),
UINT32_C( 542053192), UINT32_C( 499863549), UINT32_C( 957375358), UINT32_C( 310330623)) },
{ simde_x_mm256_set_epu32(UINT32_C(1734496959), UINT32_C( 380846712), UINT32_C(3352999607), UINT32_C(3555523675),
UINT32_C(1995198557), UINT32_C(3314312199), UINT32_C(2406584253), UINT32_C(1779168063)),
simde_x_mm256_set_epu32(UINT32_C(1763100483), UINT32_C(3776962737), UINT32_C(2844608398), UINT32_C(2885101098),
UINT32_C( 269910347), UINT32_C( 433971495), UINT32_C(1441956227), UINT32_C(1018271575)),
simde_x_mm256_set_epu32(UINT32_C(1734496959), UINT32_C( 380846712), UINT32_C( 508391209), UINT32_C( 670422577),
UINT32_C( 105826128), UINT32_C( 276511734), UINT32_C( 964628026), UINT32_C( 760896488)) },
{ simde_x_mm256_set_epu32(UINT32_C(3932090380), UINT32_C(2449576763), UINT32_C(4246346280), UINT32_C( 201516689),
UINT32_C(2859036576), UINT32_C(2362091228), UINT32_C(3141663427), UINT32_C( 562234020)),
simde_x_mm256_set_epu32(UINT32_C(3629502055), UINT32_C(3952771463), UINT32_C(2102184556), UINT32_C( 877111492),
UINT32_C(1183491905), UINT32_C(3718356317), UINT32_C(3233651099), UINT32_C(3486869896)),
simde_x_mm256_set_epu32(UINT32_C( 302588325), UINT32_C(2449576763), UINT32_C( 41977168), UINT32_C( 201516689),
UINT32_C( 492052766), UINT32_C(2362091228), UINT32_C(3141663427), UINT32_C( 562234020)) },
{ simde_x_mm256_set_epu32(UINT32_C( 910061584), UINT32_C(2002226944), UINT32_C(3673004107), UINT32_C(4246624078),
UINT32_C( 523093293), UINT32_C(3059761572), UINT32_C(2206005509), UINT32_C(1943141679)),
simde_x_mm256_set_epu32(UINT32_C( 495870887), UINT32_C(3912840869), UINT32_C( 915244711), UINT32_C( 5081424),
UINT32_C(1422501384), UINT32_C(4130987572), UINT32_C(2778067031), UINT32_C( 497965579)),
simde_x_mm256_set_epu32(UINT32_C( 414190697), UINT32_C(2002226944), UINT32_C( 12025263), UINT32_C( 3635038),
UINT32_C( 523093293), UINT32_C(3059761572), UINT32_C(2206005509), UINT32_C( 449244942)) },
{ simde_x_mm256_set_epu32(UINT32_C(1755684145), UINT32_C(2233240925), UINT32_C(3244523643), UINT32_C(2995026741),
UINT32_C(2178270751), UINT32_C(1493088054), UINT32_C(4115137419), UINT32_C( 651362699)),
simde_x_mm256_set_epu32(UINT32_C(1206471293), UINT32_C(1374915518), UINT32_C( 531653117), UINT32_C(2075187308),
UINT32_C(4150348747), UINT32_C(2163101581), UINT32_C(1444783055), UINT32_C(1878625233)),
simde_x_mm256_set_epu32(UINT32_C( 549212852), UINT32_C( 858325407), UINT32_C( 54604941), UINT32_C( 919839433),
UINT32_C(2178270751), UINT32_C(1493088054), UINT32_C(1225571309), UINT32_C( 651362699)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_rem_epu32(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_u32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_mask_rem_epu32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i src;
simde__mmask16 k;
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_x_mm512_set_epu32(UINT32_C( 691121094), UINT32_C( 674034227), UINT32_C(2329532409), UINT32_C(3374680349),
UINT32_C(3920294270), UINT32_C(3054162118), UINT32_C(1568850865), UINT32_C(3151989757),
UINT32_C(3215450688), UINT32_C(3586813553), UINT32_C(1508722402), UINT32_C(2220621656),
UINT32_C(1747596798), UINT32_C(2231263307), UINT32_C( 527472553), UINT32_C(2891870298)),
UINT16_C(63371),
simde_x_mm512_set_epu32(UINT32_C(3953959418), UINT32_C(2530156426), UINT32_C(1179683687), UINT32_C(2648640694),
UINT32_C(3623000007), UINT32_C(2708640028), UINT32_C(1691051285), UINT32_C( 50347892),
UINT32_C( 728425428), UINT32_C(1192263444), UINT32_C(2208623573), UINT32_C(1322777130),
UINT32_C( 163989560), UINT32_C(1492341726), UINT32_C( 298608154), UINT32_C(1250819173)),
simde_x_mm512_set_epu32(UINT32_C(3003933707), UINT32_C(1314482530), UINT32_C(2997716679), UINT32_C(3555959260),
UINT32_C(2875927297), UINT32_C(3290702646), UINT32_C(1580565751), UINT32_C(3823902839),
UINT32_C(2081361826), UINT32_C( 493161721), UINT32_C(3099851477), UINT32_C( 894221337),
UINT32_C(2964507124), UINT32_C( 492373082), UINT32_C(4281870485), UINT32_C(2207786213)),
simde_x_mm512_set_epu32(UINT32_C( 950025711), UINT32_C(1215673896), UINT32_C(1179683687), UINT32_C(2648640694),
UINT32_C(3920294270), UINT32_C(2708640028), UINT32_C( 110485534), UINT32_C( 50347892),
UINT32_C( 728425428), UINT32_C(3586813553), UINT32_C(1508722402), UINT32_C(2220621656),
UINT32_C( 163989560), UINT32_C(2231263307), UINT32_C( 298608154), UINT32_C(1250819173)) },
{ simde_x_mm512_set_epu32(UINT32_C(1779168063), UINT32_C(3156074065), UINT32_C(3607805659), UINT32_C(1828175063),
UINT32_C(3905547273), UINT32_C(4101755863), UINT32_C(3436978124), UINT32_C(3846637996),
UINT32_C(2693603084), UINT32_C(1710148738), UINT32_C(1974123080), UINT32_C(2870600100),
UINT32_C( 118588227), UINT32_C( 542053192), UINT32_C( 499863549), UINT32_C( 957375358)),
UINT16_C(36797),
simde_x_mm512_set_epu32(UINT32_C(3141663427), UINT32_C( 562234020), UINT32_C(1763100483), UINT32_C(3776962737),
UINT32_C(2844608398), UINT32_C(2885101098), UINT32_C( 269910347), UINT32_C( 433971495),
UINT32_C(1441956227), UINT32_C(1018271575), UINT32_C(1734496959), UINT32_C( 380846712),
UINT32_C(3352999607), UINT32_C(3555523675), UINT32_C(1995198557), UINT32_C(3314312199)),
simde_x_mm512_set_epu32(UINT32_C(2206005509), UINT32_C(1943141679), UINT32_C(3629502055), UINT32_C(3952771463),
UINT32_C(2102184556), UINT32_C( 877111492), UINT32_C(1183491905), UINT32_C(3718356317),
UINT32_C(3233651099), UINT32_C(3486869896), UINT32_C(3932090380), UINT32_C(2449576763),
UINT32_C(4246346280), UINT32_C( 201516689), UINT32_C(2859036576), UINT32_C(2362091228)),
simde_x_mm512_set_epu32(UINT32_C( 935657918), UINT32_C(3156074065), UINT32_C(3607805659), UINT32_C(1828175063),
UINT32_C( 742423842), UINT32_C( 253766622), UINT32_C( 269910347), UINT32_C( 433971495),
UINT32_C(1441956227), UINT32_C(1710148738), UINT32_C(1734496959), UINT32_C( 380846712),
UINT32_C(3352999607), UINT32_C( 129739962), UINT32_C( 499863549), UINT32_C( 952220971)) },
{ simde_x_mm512_set_epu32(UINT32_C(4115137419), UINT32_C( 651362699), UINT32_C( 495870887), UINT32_C(3912840869),
UINT32_C( 915244711), UINT32_C( 5081424), UINT32_C(1422501384), UINT32_C(4130987572),
UINT32_C(2778067031), UINT32_C( 497965579), UINT32_C( 910061584), UINT32_C(2002226944),
UINT32_C(3673004107), UINT32_C(4246624078), UINT32_C( 523093293), UINT32_C(3059761572)),
UINT16_C(46902),
simde_x_mm512_set_epu32(UINT32_C(4074346392), UINT32_C(1398655610), UINT32_C(1722520923), UINT32_C(1206471293),
UINT32_C(1374915518), UINT32_C( 531653117), UINT32_C(2075187308), UINT32_C(4150348747),
UINT32_C(2163101581), UINT32_C(1444783055), UINT32_C(1878625233), UINT32_C(1755684145),
UINT32_C(2233240925), UINT32_C(3244523643), UINT32_C(2995026741), UINT32_C(2178270751)),
simde_x_mm512_set_epu32(UINT32_C(3188873807), UINT32_C(1982658188), UINT32_C( 863153207), UINT32_C(2657690668),
UINT32_C( 448681074), UINT32_C(1334667053), UINT32_C( 502667641), UINT32_C( 855395764),
UINT32_C(2622874348), UINT32_C( 808531712), UINT32_C( 454488139), UINT32_C( 123547093),
UINT32_C( 483090439), UINT32_C(3168637539), UINT32_C(3093747107), UINT32_C(4158916667)),
simde_x_mm512_set_epu32(UINT32_C( 885472585), UINT32_C( 651362699), UINT32_C( 859367716), UINT32_C(1206471293),
UINT32_C( 915244711), UINT32_C( 531653117), UINT32_C( 64516744), UINT32_C( 728765691),
UINT32_C(2778067031), UINT32_C( 497965579), UINT32_C( 60672677), UINT32_C( 26024843),
UINT32_C(3673004107), UINT32_C( 75886104), UINT32_C(2995026741), UINT32_C(3059761572)) },
{ simde_x_mm512_set_epu32(UINT32_C(2113970745), UINT32_C(4112838454), UINT32_C( 564512596), UINT32_C( 604721400),
UINT32_C(1471174399), UINT32_C(2491026588), UINT32_C(2529574367), UINT32_C( 298473775),
UINT32_C(2890366559), UINT32_C(3063632375), UINT32_C(4055983958), UINT32_C(4149169500),
UINT32_C(4113948134), UINT32_C(2384487126), UINT32_C(2434207126), UINT32_C(3923111671)),
UINT16_C(38914),
simde_x_mm512_set_epu32(UINT32_C(1533151625), UINT32_C(2122196136), UINT32_C(1690360675), UINT32_C(1484935627),
UINT32_C(1463758672), UINT32_C( 602211615), UINT32_C(3830002991), UINT32_C(2864741101),
UINT32_C( 797104998), UINT32_C(2737423319), UINT32_C(3342229886), UINT32_C( 178625368),
UINT32_C(3091160996), UINT32_C(1095216728), UINT32_C(3079561742), UINT32_C( 430790402)),
simde_x_mm512_set_epu32(UINT32_C(4043825594), UINT32_C(1274901810), UINT32_C( 413860084), UINT32_C( 550494320),
UINT32_C(1997049765), UINT32_C( 505563651), UINT32_C( 463125220), UINT32_C(3843753777),
UINT32_C(2346173843), UINT32_C(2157864934), UINT32_C(2591157969), UINT32_C( 389679318),
UINT32_C(3939775129), UINT32_C(2493364907), UINT32_C(2006619059), UINT32_C(3391409164)),
simde_x_mm512_set_epu32(UINT32_C(1533151625), UINT32_C(4112838454), UINT32_C( 564512596), UINT32_C( 383946987),
UINT32_C(1463758672), UINT32_C(2491026588), UINT32_C(2529574367), UINT32_C( 298473775),
UINT32_C(2890366559), UINT32_C(3063632375), UINT32_C(4055983958), UINT32_C(4149169500),
UINT32_C(4113948134), UINT32_C(2384487126), UINT32_C(1072942683), UINT32_C(3923111671)) },
{ simde_x_mm512_set_epu32(UINT32_C(1572579389), UINT32_C(3511888959), UINT32_C(2399346014), UINT32_C(1967093325),
UINT32_C( 908815803), UINT32_C(2319376026), UINT32_C(2065037155), UINT32_C( 623932649),
UINT32_C(1610322797), UINT32_C(3452844305), UINT32_C(2031682359), UINT32_C(2994836943),
UINT32_C(2344919086), UINT32_C( 238137788), UINT32_C(1978166020), UINT32_C( 76768592)),
UINT16_C( 883),
simde_x_mm512_set_epu32(UINT32_C(3284847806), UINT32_C(3884897233), UINT32_C(2094036024), UINT32_C(2456834182),
UINT32_C( 69201629), UINT32_C(1228958503), UINT32_C(3519587969), UINT32_C(2809504529),
UINT32_C(3115789449), UINT32_C(1767270276), UINT32_C( 490610321), UINT32_C(1164436618),
UINT32_C(2374669797), UINT32_C(3604002618), UINT32_C(3414719029), UINT32_C(2289333019)),
simde_x_mm512_set_epu32(UINT32_C(2383307765), UINT32_C( 143428987), UINT32_C(3684943081), UINT32_C( 582607980),
UINT32_C(1609326889), UINT32_C(1245407235), UINT32_C(4175005098), UINT32_C(2362914327),
UINT32_C(2924553042), UINT32_C(2369006988), UINT32_C(2119408419), UINT32_C(3091878410),
UINT32_C(3978436943), UINT32_C(1708684203), UINT32_C(1202455481), UINT32_C(2187745469)),
simde_x_mm512_set_epu32(UINT32_C(1572579389), UINT32_C(3511888959), UINT32_C(2399346014), UINT32_C(1967093325),
UINT32_C( 908815803), UINT32_C(2319376026), UINT32_C(3519587969), UINT32_C( 446590202),
UINT32_C(1610322797), UINT32_C(1767270276), UINT32_C( 490610321), UINT32_C(1164436618),
UINT32_C(2344919086), UINT32_C( 238137788), UINT32_C(1009808067), UINT32_C( 101587550)) },
{ simde_x_mm512_set_epu32(UINT32_C(2117071873), UINT32_C(2857077767), UINT32_C(3918893192), UINT32_C(1087893388),
UINT32_C(3851784011), UINT32_C(3914271744), UINT32_C( 565328458), UINT32_C(4201942548),
UINT32_C(1480532604), UINT32_C(4197506536), UINT32_C(3712719696), UINT32_C(3920217826),
UINT32_C(1394313506), UINT32_C( 394553965), UINT32_C(2278253176), UINT32_C(1697927724)),
UINT16_C(12254),
simde_x_mm512_set_epu32(UINT32_C( 56443211), UINT32_C(2258452653), UINT32_C(3784696472), UINT32_C(1139427205),
UINT32_C(1090384090), UINT32_C(2389735891), UINT32_C(2215607313), UINT32_C(3817672405),
UINT32_C(3621770268), UINT32_C(2071747620), UINT32_C(3852178197), UINT32_C(3693632585),
UINT32_C( 319530416), UINT32_C(2179954815), UINT32_C(3793236393), UINT32_C( 340519338)),
simde_x_mm512_set_epu32(UINT32_C(1219537084), UINT32_C(1349635715), UINT32_C( 732887738), UINT32_C(2566325375),
UINT32_C(2906533885), UINT32_C(1765754685), UINT32_C(2719983633), UINT32_C( 846129112),
UINT32_C(1578410935), UINT32_C(2635094838), UINT32_C(1045536663), UINT32_C( 957117985),
UINT32_C(3029008645), UINT32_C(1309498779), UINT32_C(3293951997), UINT32_C(1022360677)),
simde_x_mm512_set_epu32(UINT32_C(2117071873), UINT32_C(2857077767), UINT32_C( 120257782), UINT32_C(1087893388),
UINT32_C(1090384090), UINT32_C( 623981206), UINT32_C(2215607313), UINT32_C( 433155957),
UINT32_C( 464948398), UINT32_C(2071747620), UINT32_C(3712719696), UINT32_C( 822278630),
UINT32_C( 319530416), UINT32_C( 870456036), UINT32_C( 499284396), UINT32_C(1697927724)) },
{ simde_x_mm512_set_epu32(UINT32_C(3990081318), UINT32_C( 991545752), UINT32_C(4151932359), UINT32_C( 843112042),
UINT32_C(4067412513), UINT32_C(2124182542), UINT32_C(2768721208), UINT32_C(2302989914),
UINT32_C(1224533822), UINT32_C(3475606100), UINT32_C(3610957044), UINT32_C(2556046111),
UINT32_C(3035396524), UINT32_C(3603101367), UINT32_C(3321443925), UINT32_C( 45581573)),
UINT16_C(42669),
simde_x_mm512_set_epu32(UINT32_C(4138167693), UINT32_C(3221954957), UINT32_C(2164435171), UINT32_C( 397240391),
UINT32_C( 200936922), UINT32_C(3263986987), UINT32_C(2536604122), UINT32_C(3629380929),
UINT32_C( 453331046), UINT32_C(1704580573), UINT32_C(1606190487), UINT32_C(3209309249),
UINT32_C(2959497652), UINT32_C(3926896735), UINT32_C(2875407663), UINT32_C(2069966669)),
simde_x_mm512_set_epu32(UINT32_C(1379668640), UINT32_C( 66581512), UINT32_C(3737665499), UINT32_C( 304428974),
UINT32_C(2686704508), UINT32_C( 532978979), UINT32_C( 946958552), UINT32_C(2383642627),
UINT32_C(2176874140), UINT32_C( 283691898), UINT32_C(3848894665), UINT32_C(3836186002),
UINT32_C(1951055651), UINT32_C( 765387914), UINT32_C( 822559116), UINT32_C( 7445617)),
simde_x_mm512_set_epu32(UINT32_C(1378830413), UINT32_C( 991545752), UINT32_C(2164435171), UINT32_C( 843112042),
UINT32_C(4067412513), UINT32_C( 66113113), UINT32_C( 642687018), UINT32_C(2302989914),
UINT32_C( 453331046), UINT32_C(3475606100), UINT32_C(1606190487), UINT32_C(2556046111),
UINT32_C(1008442001), UINT32_C( 99957165), UINT32_C(3321443925), UINT32_C( 85143)) },
{ simde_x_mm512_set_epu32(UINT32_C(2313028370), UINT32_C( 869237081), UINT32_C(4104913762), UINT32_C(2825691966),
UINT32_C(3577866502), UINT32_C(2991894408), UINT32_C(2172048625), UINT32_C(1617119933),
UINT32_C(1521363431), UINT32_C( 553638116), UINT32_C(1036201367), UINT32_C(3107033445),
UINT32_C(3882811410), UINT32_C(3534384353), UINT32_C(3871215839), UINT32_C(1273589632)),
UINT16_C(35103),
simde_x_mm512_set_epu32(UINT32_C(2458371652), UINT32_C( 260676470), UINT32_C(1724614860), UINT32_C(4150452663),
UINT32_C(3816336716), UINT32_C(2208212235), UINT32_C( 932145867), UINT32_C(2432594561),
UINT32_C(1756892633), UINT32_C( 382632965), UINT32_C(1295078740), UINT32_C(3299165262),
UINT32_C( 152308919), UINT32_C(3943411788), UINT32_C( 31813624), UINT32_C( 807463845)),
simde_x_mm512_set_epu32(UINT32_C( 615301803), UINT32_C( 382786341), UINT32_C(1852603705), UINT32_C(1998007730),
UINT32_C( 231325888), UINT32_C(1842039329), UINT32_C( 968682756), UINT32_C( 316335394),
UINT32_C(2223585202), UINT32_C(3491781959), UINT32_C(2167971796), UINT32_C(1587647099),
UINT32_C(2966608712), UINT32_C( 320339033), UINT32_C( 282380179), UINT32_C(4186865204)),
simde_x_mm512_set_epu32(UINT32_C( 612466243), UINT32_C( 869237081), UINT32_C(4104913762), UINT32_C(2825691966),
UINT32_C( 115122508), UINT32_C(2991894408), UINT32_C(2172048625), UINT32_C( 218246803),
UINT32_C(1521363431), UINT32_C( 553638116), UINT32_C(1036201367), UINT32_C( 123871064),
UINT32_C( 152308919), UINT32_C( 99343392), UINT32_C( 31813624), UINT32_C( 807463845)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_mask_rem_epu32(test_vec[i].src, test_vec[i].k, test_vec[i].a, test_vec[i].b);
simde_assert_m512i_u32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm256_rem_epu64(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i r;
} test_vec[8] = {
{ simde_x_mm256_set_epu64x(UINT64_C(13810255550447513201), UINT64_C( 6479913377553186648),
UINT64_C( 7505871096235581515), UINT64_C( 2265477367564496986)),
simde_x_mm256_set_epu64x(UINT64_C( 2968342496979776051), UINT64_C(10005265515001776413),
UINT64_C(16837535683400356038), UINT64_C( 6738163160628300797)),
simde_x_mm256_set_epu64x(UINT64_C( 1936885562528408997), UINT64_C( 6479913377553186648),
UINT64_C( 7505871096235581515), UINT64_C( 2265477367564496986)) },
{ simde_x_mm256_set_epu64x(UINT64_C( 5120732502404950997), UINT64_C( 5681284513410730040),
UINT64_C( 6409558907924801050), UINT64_C( 5372227444888762251)),
simde_x_mm256_set_epu64x(UINT64_C(10866939104613927783), UINT64_C(11375825163207743431),
UINT64_C(11633520338587575573), UINT64_C( 216242550290965460)),
simde_x_mm256_set_epu64x(UINT64_C( 5120732502404950997), UINT64_C( 5681284513410730040),
UINT64_C( 6409558907924801050), UINT64_C( 182406237905591211)) },
{ simde_x_mm256_set_epu64x(UINT64_C( 2118113466433927893), UINT64_C( 3840651400764901876),
UINT64_C( 2114726288902596757), UINT64_C( 9482369585348649466)),
simde_x_mm256_set_epu64x(UINT64_C( 5645659480511055559), UINT64_C(15272728730484288257),
UINT64_C(14133460247011230967), UINT64_C(16423537638667915170)),
simde_x_mm256_set_epu64x(UINT64_C( 2118113466433927893), UINT64_C( 3840651400764901876),
UINT64_C( 2114726288902596757), UINT64_C( 9482369585348649466)) },
{ simde_x_mm256_set_epu64x(UINT64_C( 7345032902979795528), UINT64_C(12329133549512917827),
UINT64_C( 2328100732832272381), UINT64_C( 4111895855610225675)),
simde_x_mm256_set_epu64x(UINT64_C(13555234896536583899), UINT64_C( 7851952110853286921),
UINT64_C(17616907291198234572), UINT64_C(16521184395064581900)),
simde_x_mm256_set_epu64x(UINT64_C( 7345032902979795528), UINT64_C( 4477181438659630906),
UINT64_C( 2328100732832272381), UINT64_C( 4111895855610225675)) },
{ simde_x_mm256_set_epu64x(UINT64_C( 7449607714297299576), UINT64_C(14401023659121376347),
UINT64_C( 8569312554655704071), UINT64_C(10336200663482757951)),
simde_x_mm256_set_epu64x(UINT64_C( 7572458917823766705), UINT64_C(12217500042222052906),
UINT64_C( 1159256113650983207), UINT64_C( 6193154838246823767)),
simde_x_mm256_set_epu64x(UINT64_C( 7449607714297299576), UINT64_C( 2183523616899323441),
UINT64_C( 454519759098821622), UINT64_C( 4143045825235934184)) },
{ simde_x_mm256_set_epu64x(UINT64_C(16888199589465789243), UINT64_C(18237918400292775569),
UINT64_C(12279468594349909724), UINT64_C(13493341674566517412)),
simde_x_mm256_set_epu64x(UINT64_C(15588592630942564743), UINT64_C( 9028813919053392068),
UINT64_C( 5083059030774095197), UINT64_C(13888425720366328200)),
simde_x_mm256_set_epu64x(UINT64_C( 1299606958523224500), UINT64_C( 180290562185991433),
UINT64_C( 2113350532801719330), UINT64_C(13493341674566517412)) },
{ simde_x_mm256_set_epu64x(UINT64_C( 3908684742628183808), UINT64_C(15775432521885308750),
UINT64_C( 2246668589251707300), UINT64_C( 9474721517893975343)),
simde_x_mm256_set_epu64x(UINT64_C( 2129749246616352421), UINT64_C( 3930946101587052880),
UINT64_C( 6109596926925725236), UINT64_C(11931707044738783755)),
simde_x_mm256_set_epu64x(UINT64_C( 1778935496011831387), UINT64_C( 51648115537097230),
UINT64_C( 2246668589251707300), UINT64_C( 9474721517893975343)) },
{ simde_x_mm256_set_epu64x(UINT64_C( 7540605987113962845), UINT64_C(13935122940778806069),
UINT64_C( 9355601638871447350), UINT64_C(17674380633802211723)),
simde_x_mm256_set_epu64x(UINT64_C( 5181754748372749246), UINT64_C( 2283432752406648940),
UINT64_C(17825612137522679693), UINT64_C( 6205295972918594513)),
simde_x_mm256_set_epu64x(UINT64_C( 2358851238741213599), UINT64_C( 234526426338912429),
UINT64_C( 9355601638871447350), UINT64_C( 5263788687965022697)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i r = simde_mm256_rem_epu64(test_vec[i].a, test_vec[i].b);
simde_assert_m256i_u64(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_rem_epi8(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_mm512_set_epi8(INT8_C( 41), INT8_C( 49), INT8_C( -85), INT8_C( -58),
INT8_C( 40), INT8_C( 44), INT8_C( -14), INT8_C( 51),
INT8_C(-118), INT8_C( -39), INT8_C( -41), INT8_C( -7),
INT8_C( -55), INT8_C( 37), INT8_C(-119), INT8_C( 29),
INT8_C( -23), INT8_C( -86), INT8_C( -15), INT8_C( 126),
INT8_C( -74), INT8_C( 10), INT8_C( -48), INT8_C( -58),
INT8_C( 93), INT8_C(-126), INT8_C( -61), INT8_C( -79),
INT8_C( -69), INT8_C( -33), INT8_C(-117), INT8_C( -3),
INT8_C( -65), INT8_C( -89), INT8_C( -30), INT8_C( 64),
INT8_C( -43), INT8_C( -54), INT8_C( 110), INT8_C( 113),
INT8_C( 89), INT8_C( -19), INT8_C( 70), INT8_C( -30),
INT8_C(-124), INT8_C( 91), INT8_C( -1), INT8_C( 88),
INT8_C( 104), INT8_C( 42), INT8_C( 53), INT8_C( -2),
INT8_C(-124), INT8_C( -2), INT8_C( 96), INT8_C( 75),
INT8_C( 31), INT8_C( 112), INT8_C(-105), INT8_C( -87),
INT8_C( -84), INT8_C( 94), INT8_C( 112), INT8_C( 90)),
simde_mm512_set_epi8(INT8_C( -61), INT8_C( 49), INT8_C( 14), INT8_C( -86),
INT8_C( -53), INT8_C( -89), INT8_C( 3), INT8_C( -41),
INT8_C( 63), INT8_C( -8), INT8_C( 55), INT8_C( -37),
INT8_C( -35), INT8_C(-121), INT8_C( 61), INT8_C( -65),
INT8_C( -47), INT8_C( 91), INT8_C( 87), INT8_C(-119),
INT8_C( 87), INT8_C( 76), INT8_C( 44), INT8_C(-116),
INT8_C( 2), INT8_C( -56), INT8_C( 36), INT8_C( -61),
INT8_C( -56), INT8_C( 125), INT8_C( -2), INT8_C(-117),
INT8_C( -30), INT8_C( 71), INT8_C( 92), INT8_C(-127),
INT8_C( -74), INT8_C( 119), INT8_C( -9), INT8_C( 34),
INT8_C( 121), INT8_C( 85), INT8_C(-103), INT8_C( 116),
INT8_C( -38), INT8_C( 21), INT8_C( 101), INT8_C( 122),
INT8_C( 10), INT8_C( -25), INT8_C( 54), INT8_C( 71),
INT8_C(-100), INT8_C(-107), INT8_C( -12), INT8_C( 84),
INT8_C(-108), INT8_C( 85), INT8_C( -86), INT8_C( -72),
INT8_C( 94), INT8_C(-102), INT8_C( -27), INT8_C( 11)),
simde_mm512_set_epi8(INT8_C( 41), INT8_C( 0), INT8_C( -1), INT8_C( -58),
INT8_C( 40), INT8_C( 44), INT8_C( -2), INT8_C( 10),
INT8_C( -55), INT8_C( -7), INT8_C( -41), INT8_C( -7),
INT8_C( -20), INT8_C( 37), INT8_C( -58), INT8_C( 29),
INT8_C( -23), INT8_C( -86), INT8_C( -15), INT8_C( 7),
INT8_C( -74), INT8_C( 10), INT8_C( -4), INT8_C( -58),
INT8_C( 1), INT8_C( -14), INT8_C( -25), INT8_C( -18),
INT8_C( -13), INT8_C( -33), INT8_C( -1), INT8_C( -3),
INT8_C( -5), INT8_C( -18), INT8_C( -30), INT8_C( 64),
INT8_C( -43), INT8_C( -54), INT8_C( 2), INT8_C( 11),
INT8_C( 89), INT8_C( -19), INT8_C( 70), INT8_C( -30),
INT8_C( -10), INT8_C( 7), INT8_C( -1), INT8_C( 88),
INT8_C( 4), INT8_C( 17), INT8_C( 53), INT8_C( -2),
INT8_C( -24), INT8_C( -2), INT8_C( 0), INT8_C( 75),
INT8_C( 31), INT8_C( 27), INT8_C( -19), INT8_C( -15),
INT8_C( -84), INT8_C( 94), INT8_C( 4), INT8_C( 2)) },
{ simde_mm512_set_epi8(INT8_C( -40), INT8_C( 85), INT8_C( -50), INT8_C( 103),
INT8_C( -21), INT8_C(-102), INT8_C(-127), INT8_C(-121),
INT8_C( 125), INT8_C( 76), INT8_C( -54), INT8_C( 108),
INT8_C( 52), INT8_C( 71), INT8_C( -88), INT8_C( -60),
INT8_C( 70), INT8_C(-118), INT8_C( -89), INT8_C( 65),
INT8_C( -35), INT8_C( -95), INT8_C( -99), INT8_C( 93),
INT8_C( -64), INT8_C( -67), INT8_C(-103), INT8_C(-101),
INT8_C( -49), INT8_C( -43), INT8_C( 105), INT8_C(-120),
INT8_C( -22), INT8_C( 94), INT8_C( -16), INT8_C( 12),
INT8_C(-110), INT8_C( 1), INT8_C(-109), INT8_C( 59),
INT8_C( -3), INT8_C( 26), INT8_C( 26), INT8_C( 40),
INT8_C( 12), INT8_C( 2), INT8_C( -26), INT8_C(-111),
INT8_C( -86), INT8_C( 105), INT8_C( 111), INT8_C( -96),
INT8_C(-116), INT8_C( -54), INT8_C( -90), INT8_C( -36),
INT8_C( -69), INT8_C( 65), INT8_C( -6), INT8_C( -61),
INT8_C( 33), INT8_C(-125), INT8_C( 2), INT8_C( -92)),
simde_mm512_set_epi8(INT8_C( 120), INT8_C( 127), INT8_C( 28), INT8_C( 95),
INT8_C( -81), INT8_C( -33), INT8_C( 119), INT8_C( -42),
INT8_C( -36), INT8_C( 102), INT8_C( 86), INT8_C( 22),
INT8_C( 119), INT8_C( -49), INT8_C( 12), INT8_C( -73),
INT8_C( -84), INT8_C( -14), INT8_C( -83), INT8_C( -7),
INT8_C( 52), INT8_C( 108), INT8_C(-128), INT8_C( -53),
INT8_C( 85), INT8_C(-121), INT8_C( -29), INT8_C( 35),
INT8_C( -69), INT8_C( 24), INT8_C( -6), INT8_C( -37),
INT8_C( -3), INT8_C( 62), INT8_C( 125), INT8_C( -20),
INT8_C( 75), INT8_C( 13), INT8_C( 79), INT8_C( 81),
INT8_C( -79), INT8_C( -35), INT8_C( -5), INT8_C( -75),
INT8_C( -97), INT8_C( -74), INT8_C( 11), INT8_C( 11),
INT8_C( 39), INT8_C( 37), INT8_C( 39), INT8_C( -48),
INT8_C(-120), INT8_C( -76), INT8_C( -41), INT8_C(-117),
INT8_C(-112), INT8_C(-128), INT8_C( -53), INT8_C( -50),
INT8_C( -83), INT8_C( 36), INT8_C(-123), INT8_C( -81)),
simde_mm512_set_epi8(INT8_C( -40), INT8_C( 85), INT8_C( -22), INT8_C( 8),
INT8_C( -21), INT8_C( -3), INT8_C( -8), INT8_C( -37),
INT8_C( 17), INT8_C( 76), INT8_C( -54), INT8_C( 20),
INT8_C( 52), INT8_C( 22), INT8_C( -4), INT8_C( -60),
INT8_C( 70), INT8_C( -6), INT8_C( -6), INT8_C( 2),
INT8_C( -35), INT8_C( -95), INT8_C( -99), INT8_C( 40),
INT8_C( -64), INT8_C( -67), INT8_C( -16), INT8_C( -31),
INT8_C( -49), INT8_C( -19), INT8_C( 3), INT8_C( -9),
INT8_C( -1), INT8_C( 32), INT8_C( -16), INT8_C( 12),
INT8_C( -35), INT8_C( 1), INT8_C( -30), INT8_C( 59),
INT8_C( -3), INT8_C( 26), INT8_C( 1), INT8_C( 40),
INT8_C( 12), INT8_C( 2), INT8_C( -4), INT8_C( -1),
INT8_C( -8), INT8_C( 31), INT8_C( 33), INT8_C( 0),
INT8_C(-116), INT8_C( -54), INT8_C( -8), INT8_C( -36),
INT8_C( -69), INT8_C( 65), INT8_C( -6), INT8_C( -11),
INT8_C( 33), INT8_C( -17), INT8_C( 2), INT8_C( -11)) },
{ simde_mm512_set_epi8(INT8_C( 87), INT8_C( 63), INT8_C( 47), INT8_C( 80),
INT8_C( 35), INT8_C( -27), INT8_C( 5), INT8_C( 31),
INT8_C( -28), INT8_C( 73), INT8_C( 53), INT8_C( 47),
INT8_C( -86), INT8_C( -64), INT8_C( 122), INT8_C( -19),
INT8_C( 47), INT8_C(-126), INT8_C( -37), INT8_C( 102),
INT8_C( -93), INT8_C( 41), INT8_C( -61), INT8_C( -41),
INT8_C( -57), INT8_C( 54), INT8_C( 97), INT8_C( 126),
INT8_C( 10), INT8_C( -91), INT8_C(-101), INT8_C( 88),
INT8_C( -72), INT8_C( 63), INT8_C( 95), INT8_C( -92),
INT8_C( 65), INT8_C( 71), INT8_C( -82), INT8_C( 88),
INT8_C( -73), INT8_C(-114), INT8_C( 98), INT8_C( 14),
INT8_C( 25), INT8_C( -83), INT8_C( 87), INT8_C( 2),
INT8_C( -65), INT8_C(-113), INT8_C(-104), INT8_C( 2),
INT8_C( 126), INT8_C( 0), INT8_C( -94), INT8_C( 57),
INT8_C( -11), INT8_C( 36), INT8_C( -17), INT8_C( 54),
INT8_C( 33), INT8_C( -91), INT8_C( -57), INT8_C( 84)),
simde_mm512_set_epi8(INT8_C(-125), INT8_C( 42), INT8_C(-105), INT8_C( -46),
INT8_C( 12), INT8_C( -93), INT8_C(-118), INT8_C( -49),
INT8_C( 43), INT8_C( 57), INT8_C( 61), INT8_C( 62),
INT8_C( 81), INT8_C( -72), INT8_C( 6), INT8_C( 93),
INT8_C( -89), INT8_C( 1), INT8_C(-111), INT8_C( 9),
INT8_C( 4), INT8_C( 17), INT8_C( 10), INT8_C( 101),
INT8_C( -70), INT8_C( -75), INT8_C(-101), INT8_C( -13),
INT8_C( -67), INT8_C( -65), INT8_C( -34), INT8_C( -51),
INT8_C( 59), INT8_C( 26), INT8_C( -29), INT8_C( 105),
INT8_C( -19), INT8_C(-111), INT8_C( -73), INT8_C( 79),
INT8_C( -82), INT8_C( 60), INT8_C(-124), INT8_C( -48),
INT8_C( 58), INT8_C( -78), INT8_C( 116), INT8_C( -16),
INT8_C( 37), INT8_C(-125), INT8_C( 100), INT8_C( -79),
INT8_C( 19), INT8_C( 102), INT8_C( 81), INT8_C( 86),
INT8_C( 25), INT8_C( 43), INT8_C( 51), INT8_C(-116),
INT8_C( 9), INT8_C( 40), INT8_C( -29), INT8_C( 75)),
simde_mm512_set_epi8(INT8_C( 87), INT8_C( 21), INT8_C( 47), INT8_C( 34),
INT8_C( 11), INT8_C( -27), INT8_C( 5), INT8_C( 31),
INT8_C( -28), INT8_C( 16), INT8_C( 53), INT8_C( 47),
INT8_C( -5), INT8_C( -64), INT8_C( 2), INT8_C( -19),
INT8_C( 47), INT8_C( 0), INT8_C( -37), INT8_C( 3),
INT8_C( -1), INT8_C( 7), INT8_C( -1), INT8_C( -41),
INT8_C( -57), INT8_C( 54), INT8_C( 97), INT8_C( 9),
INT8_C( 10), INT8_C( -26), INT8_C( -33), INT8_C( 37),
INT8_C( -13), INT8_C( 11), INT8_C( 8), INT8_C( -92),
INT8_C( 8), INT8_C( 71), INT8_C( -9), INT8_C( 9),
INT8_C( -73), INT8_C( -54), INT8_C( 98), INT8_C( 14),
INT8_C( 25), INT8_C( -5), INT8_C( 87), INT8_C( 2),
INT8_C( -28), INT8_C(-113), INT8_C( -4), INT8_C( 2),
INT8_C( 12), INT8_C( 0), INT8_C( -13), INT8_C( 57),
INT8_C( -11), INT8_C( 36), INT8_C( -17), INT8_C( 54),
INT8_C( 6), INT8_C( -11), INT8_C( -28), INT8_C( 9)) },
{ simde_mm512_set_epi8(INT8_C( -23), INT8_C( 79), INT8_C( 12), INT8_C( 0),
INT8_C( 33), INT8_C( -78), INT8_C( 58), INT8_C( 74),
INT8_C( -6), INT8_C( 116), INT8_C(-114), INT8_C( 20),
INT8_C( 88), INT8_C( 63), INT8_C( 34), INT8_C( 124),
INT8_C( -6), INT8_C( 48), INT8_C( -35), INT8_C( -24),
INT8_C( -35), INT8_C( 75), INT8_C(-101), INT8_C( 80),
INT8_C( -23), INT8_C( -87), INT8_C( -58), INT8_C( -30),
INT8_C( 83), INT8_C( 27), INT8_C(-119), INT8_C( 34),
INT8_C( 23), INT8_C(-124), INT8_C( 106), INT8_C( 109),
INT8_C(-121), INT8_C( -53), INT8_C( 98), INT8_C( 120),
INT8_C( 101), INT8_C( 52), INT8_C( 82), INT8_C( 44),
INT8_C(-114), INT8_C( 14), INT8_C( 99), INT8_C( -11),
INT8_C( 8), INT8_C(-116), INT8_C(-115), INT8_C( 123),
INT8_C( -37), INT8_C( -93), INT8_C( -60), INT8_C( -23),
INT8_C( 34), INT8_C( -71), INT8_C( -28), INT8_C( 108),
INT8_C( 95), INT8_C( -20), INT8_C( 97), INT8_C( 41)),
simde_mm512_set_epi8(INT8_C( -63), INT8_C( -26), INT8_C( 93), INT8_C( 23),
INT8_C( -63), INT8_C( 52), INT8_C( -33), INT8_C( -81),
INT8_C( -51), INT8_C( 45), INT8_C( -90), INT8_C( 24),
INT8_C( 71), INT8_C( -22), INT8_C( -95), INT8_C(-114),
INT8_C( -72), INT8_C( -38), INT8_C( -66), INT8_C( -44),
INT8_C( 116), INT8_C( -97), INT8_C( 44), INT8_C( 55),
INT8_C( -43), INT8_C(-123), INT8_C( 60), INT8_C( 3),
INT8_C( 58), INT8_C( -1), INT8_C( 125), INT8_C( -67),
INT8_C(-111), INT8_C( 88), INT8_C( 55), INT8_C( -74),
INT8_C( 23), INT8_C( -95), INT8_C(-123), INT8_C( 27),
INT8_C( 125), INT8_C( -27), INT8_C( -53), INT8_C( 45),
INT8_C( 24), INT8_C( 5), INT8_C( 90), INT8_C( 83),
INT8_C(-111), INT8_C( 85), INT8_C(-100), INT8_C( -92),
INT8_C(-107), INT8_C( -55), INT8_C( 48), INT8_C( -1),
INT8_C( 41), INT8_C( 42), INT8_C( 94), INT8_C(-127),
INT8_C(-121), INT8_C( 8), INT8_C( 12), INT8_C( -53)),
simde_mm512_set_epi8(INT8_C( -23), INT8_C( 1), INT8_C( 12), INT8_C( 0),
INT8_C( 33), INT8_C( -26), INT8_C( 25), INT8_C( 74),
INT8_C( -6), INT8_C( 26), INT8_C( -24), INT8_C( 20),
INT8_C( 17), INT8_C( 19), INT8_C( 34), INT8_C( 10),
INT8_C( -6), INT8_C( 10), INT8_C( -35), INT8_C( -24),
INT8_C( -35), INT8_C( 75), INT8_C( -13), INT8_C( 25),
INT8_C( -23), INT8_C( -87), INT8_C( -58), INT8_C( 0),
INT8_C( 25), INT8_C( 0), INT8_C(-119), INT8_C( 34),
INT8_C( 23), INT8_C( -36), INT8_C( 51), INT8_C( 35),
INT8_C( -6), INT8_C( -53), INT8_C( 98), INT8_C( 12),
INT8_C( 101), INT8_C( 25), INT8_C( 29), INT8_C( 44),
INT8_C( -18), INT8_C( 4), INT8_C( 9), INT8_C( -11),
INT8_C( 8), INT8_C( -31), INT8_C( -15), INT8_C( 31),
INT8_C( -37), INT8_C( -38), INT8_C( -12), INT8_C( 0),
INT8_C( 34), INT8_C( -29), INT8_C( -28), INT8_C( 108),
INT8_C( 95), INT8_C( -4), INT8_C( 1), INT8_C( 41)) },
{ simde_mm512_set_epi8(INT8_C(-114), INT8_C( 19), INT8_C(-128), INT8_C( 3),
INT8_C(-127), INT8_C( -64), INT8_C( 118), INT8_C(-100),
INT8_C( 16), INT8_C( -24), INT8_C( -53), INT8_C( 122),
INT8_C( -27), INT8_C( 105), INT8_C( 120), INT8_C( -55),
INT8_C( -28), INT8_C( -89), INT8_C(-115), INT8_C(-110),
INT8_C( 116), INT8_C( 74), INT8_C( -65), INT8_C( 35),
INT8_C( 45), INT8_C( -98), INT8_C( -28), INT8_C(-118),
INT8_C( 49), INT8_C( 7), INT8_C( 65), INT8_C(-116),
INT8_C( 0), INT8_C( 113), INT8_C(-100), INT8_C( 113),
INT8_C( -10), INT8_C( -89), INT8_C( 109), INT8_C(-115),
INT8_C( -64), INT8_C( 11), INT8_C( 33), INT8_C(-115),
INT8_C(-127), INT8_C( 2), INT8_C( -88), INT8_C( -29),
INT8_C( 23), INT8_C( -83), INT8_C( 104), INT8_C( 71),
INT8_C( 11), INT8_C( -6), INT8_C( 13), INT8_C( -38),
INT8_C( -62), INT8_C(-116), INT8_C( 125), INT8_C( 43),
INT8_C(-105), INT8_C( 49), INT8_C(-127), INT8_C( -38)),
simde_mm512_set_epi8(INT8_C( 8), INT8_C( 25), INT8_C(-109), INT8_C( -36),
INT8_C( -83), INT8_C(-118), INT8_C( 38), INT8_C(-106),
INT8_C( 35), INT8_C( 43), INT8_C( -91), INT8_C( -71),
INT8_C( 50), INT8_C( 64), INT8_C( -95), INT8_C(-124),
INT8_C( -94), INT8_C( 50), INT8_C( -57), INT8_C( 84),
INT8_C( -5), INT8_C( -56), INT8_C( -39), INT8_C( 19),
INT8_C( -76), INT8_C( -60), INT8_C( -10), INT8_C( 76),
INT8_C( 55), INT8_C( -52), INT8_C(-117), INT8_C( 75),
INT8_C( 1), INT8_C( 89), INT8_C(-123), INT8_C( -44),
INT8_C( -50), INT8_C( 55), INT8_C( -52), INT8_C( 120),
INT8_C( 37), INT8_C( -97), INT8_C(-110), INT8_C( -39),
INT8_C( -30), INT8_C( -66), INT8_C(-122), INT8_C( 8),
INT8_C( 113), INT8_C( 61), INT8_C( 103), INT8_C( 100),
INT8_C( 23), INT8_C( -27), INT8_C(-110), INT8_C( 97),
INT8_C( 95), INT8_C( 32), INT8_C(-120), INT8_C( 91),
INT8_C( 46), INT8_C( -4), INT8_C( -93), INT8_C( 88)),
simde_mm512_set_epi8(INT8_C( -2), INT8_C( 19), INT8_C( -19), INT8_C( 3),
INT8_C( -44), INT8_C( -64), INT8_C( 4), INT8_C(-100),
INT8_C( 16), INT8_C( -24), INT8_C( -53), INT8_C( 51),
INT8_C( -27), INT8_C( 41), INT8_C( 25), INT8_C( -55),
INT8_C( -28), INT8_C( -39), INT8_C( -1), INT8_C( -26),
INT8_C( 1), INT8_C( 18), INT8_C( -26), INT8_C( 16),
INT8_C( 45), INT8_C( -38), INT8_C( -8), INT8_C( -42),
INT8_C( 49), INT8_C( 7), INT8_C( 65), INT8_C( -41),
INT8_C( 0), INT8_C( 24), INT8_C(-100), INT8_C( 25),
INT8_C( -10), INT8_C( -34), INT8_C( 5), INT8_C(-115),
INT8_C( -27), INT8_C( 11), INT8_C( 33), INT8_C( -37),
INT8_C( -7), INT8_C( 2), INT8_C( -88), INT8_C( -5),
INT8_C( 23), INT8_C( -22), INT8_C( 1), INT8_C( 71),
INT8_C( 11), INT8_C( -6), INT8_C( 13), INT8_C( -38),
INT8_C( -62), INT8_C( -20), INT8_C( 5), INT8_C( 43),
INT8_C( -13), INT8_C( 1), INT8_C( -34), INT8_C( -38)) },
{ simde_mm512_set_epi8(INT8_C( 46), INT8_C( 43), INT8_C( -10), INT8_C( -99),
INT8_C( 80), INT8_C(-102), INT8_C( 27), INT8_C( 118),
INT8_C( -80), INT8_C( -40), INT8_C( 46), INT8_C(-114),
INT8_C( -58), INT8_C( -8), INT8_C( 88), INT8_C( 29),
INT8_C( -80), INT8_C( 25), INT8_C( 101), INT8_C( 54),
INT8_C( 103), INT8_C( 120), INT8_C( 94), INT8_C( 16),
INT8_C( -59), INT8_C( -51), INT8_C( 71), INT8_C( -10),
INT8_C( -98), INT8_C( -80), INT8_C( -38), INT8_C( 43),
INT8_C( -21), INT8_C( -7), INT8_C( 116), INT8_C(-119),
INT8_C( 89), INT8_C( -44), INT8_C(-124), INT8_C( 56),
INT8_C( -26), INT8_C(-119), INT8_C( 66), INT8_C( 41),
INT8_C( 44), INT8_C( 35), INT8_C( -67), INT8_C(-101),
INT8_C( 125), INT8_C(-126), INT8_C( 123), INT8_C( 117),
INT8_C( 123), INT8_C( 127), INT8_C(-105), INT8_C( 60),
INT8_C(-103), INT8_C( -71), INT8_C( -6), INT8_C( 100),
INT8_C( 83), INT8_C( 112), INT8_C( 33), INT8_C(-116)),
simde_mm512_set_epi8(INT8_C( 36), INT8_C( 33), INT8_C( 42), INT8_C( 75),
INT8_C( -77), INT8_C( -84), INT8_C( 126), INT8_C( -85),
INT8_C( 110), INT8_C(-106), INT8_C( 107), INT8_C( -76),
INT8_C(-122), INT8_C( 73), INT8_C( -49), INT8_C( 15),
INT8_C( -15), INT8_C( 103), INT8_C( 103), INT8_C(-106),
INT8_C( 103), INT8_C( 58), INT8_C( 104), INT8_C( 35),
INT8_C( -7), INT8_C( 79), INT8_C( 113), INT8_C( 97),
INT8_C( -67), INT8_C( -59), INT8_C( -82), INT8_C( -34),
INT8_C( -32), INT8_C( 104), INT8_C( 123), INT8_C( 124),
INT8_C( 49), INT8_C( -30), INT8_C( 37), INT8_C( 22),
INT8_C( 105), INT8_C( -99), INT8_C( 110), INT8_C( 52),
INT8_C( -2), INT8_C( 103), INT8_C( -94), INT8_C( -46),
INT8_C( -54), INT8_C( 39), INT8_C( -63), INT8_C(-105),
INT8_C( -73), INT8_C( 73), INT8_C( 97), INT8_C( -69),
INT8_C( 102), INT8_C( -61), INT8_C( 68), INT8_C( -66),
INT8_C( 65), INT8_C( 60), INT8_C( -91), INT8_C( 126)),
simde_mm512_set_epi8(INT8_C( 10), INT8_C( 10), INT8_C( -10), INT8_C( -24),
INT8_C( 3), INT8_C( -18), INT8_C( 27), INT8_C( 33),
INT8_C( -80), INT8_C( -40), INT8_C( 46), INT8_C( -38),
INT8_C( -58), INT8_C( -8), INT8_C( 39), INT8_C( 14),
INT8_C( -5), INT8_C( 25), INT8_C( 101), INT8_C( 54),
INT8_C( 0), INT8_C( 4), INT8_C( 94), INT8_C( 16),
INT8_C( -3), INT8_C( -51), INT8_C( 71), INT8_C( -10),
INT8_C( -31), INT8_C( -21), INT8_C( -38), INT8_C( 9),
INT8_C( -21), INT8_C( -7), INT8_C( 116), INT8_C(-119),
INT8_C( 40), INT8_C( -14), INT8_C( -13), INT8_C( 12),
INT8_C( -26), INT8_C( -20), INT8_C( 66), INT8_C( 41),
INT8_C( 0), INT8_C( 35), INT8_C( -67), INT8_C( -9),
INT8_C( 17), INT8_C( -9), INT8_C( 60), INT8_C( 12),
INT8_C( 50), INT8_C( 54), INT8_C( -8), INT8_C( 60),
INT8_C( -1), INT8_C( -10), INT8_C( -6), INT8_C( 34),
INT8_C( 18), INT8_C( 52), INT8_C( 33), INT8_C(-116)) },
{ simde_mm512_set_epi8(INT8_C( -16), INT8_C( -87), INT8_C( 8), INT8_C( 54),
INT8_C( 66), INT8_C( 99), INT8_C( 14), INT8_C( 32),
INT8_C(-108), INT8_C( 92), INT8_C( 122), INT8_C( -56),
INT8_C( -64), INT8_C( -70), INT8_C( -31), INT8_C( 52),
INT8_C( -74), INT8_C( -12), INT8_C( -3), INT8_C( -28),
INT8_C(-115), INT8_C( -28), INT8_C(-108), INT8_C( -88),
INT8_C( -25), INT8_C( 107), INT8_C( 47), INT8_C( -51),
INT8_C( 126), INT8_C( 7), INT8_C( -74), INT8_C( -11),
INT8_C( -91), INT8_C( -70), INT8_C( -43), INT8_C( 84),
INT8_C( 19), INT8_C(-125), INT8_C( 54), INT8_C( 13),
INT8_C( -71), INT8_C( -74), INT8_C( 72), INT8_C( 61),
INT8_C( 125), INT8_C( 104), INT8_C(-109), INT8_C( 11),
INT8_C( 89), INT8_C( -52), INT8_C( 62), INT8_C( -93),
INT8_C( -58), INT8_C( -94), INT8_C( -51), INT8_C( 9),
INT8_C( -74), INT8_C( 123), INT8_C( 65), INT8_C( -48),
INT8_C(-111), INT8_C( -77), INT8_C( 34), INT8_C( -61)),
simde_mm512_set_epi8(INT8_C(-115), INT8_C( 103), INT8_C( 116), INT8_C( 12),
INT8_C( -82), INT8_C( -30), INT8_C( -63), INT8_C( -81),
INT8_C(-101), INT8_C( -82), INT8_C( 73), INT8_C( 6),
INT8_C(-115), INT8_C(-116), INT8_C( -2), INT8_C( -63),
INT8_C( 100), INT8_C(-105), INT8_C( 14), INT8_C( 19),
INT8_C( 38), INT8_C( 115), INT8_C( -55), INT8_C( 118),
INT8_C( 74), INT8_C( -70), INT8_C( 89), INT8_C( -73),
INT8_C( 65), INT8_C(-118), INT8_C( 64), INT8_C( 90),
INT8_C(-104), INT8_C( -15), INT8_C( -27), INT8_C( -38),
INT8_C( 126), INT8_C( 38), INT8_C( -97), INT8_C( 27),
INT8_C( -92), INT8_C( -57), INT8_C( 25), INT8_C( -3),
INT8_C( -75), INT8_C( 104), INT8_C( 6), INT8_C( -73),
INT8_C( 36), INT8_C( -53), INT8_C(-118), INT8_C(-111),
INT8_C( 116), INT8_C(-101), INT8_C( -38), INT8_C( 24),
INT8_C( -51), INT8_C( -18), INT8_C( -14), INT8_C( 26),
INT8_C( -30), INT8_C( 76), INT8_C( -30), INT8_C( -42)),
simde_mm512_set_epi8(INT8_C( -16), INT8_C( -87), INT8_C( 8), INT8_C( 6),
INT8_C( 66), INT8_C( 9), INT8_C( 14), INT8_C( 32),
INT8_C( -7), INT8_C( 10), INT8_C( 49), INT8_C( -2),
INT8_C( -64), INT8_C( -70), INT8_C( -1), INT8_C( 52),
INT8_C( -74), INT8_C( -12), INT8_C( -3), INT8_C( -9),
INT8_C( -1), INT8_C( -28), INT8_C( -53), INT8_C( -88),
INT8_C( -25), INT8_C( 37), INT8_C( 47), INT8_C( -51),
INT8_C( 61), INT8_C( 7), INT8_C( -10), INT8_C( -11),
INT8_C( -91), INT8_C( -10), INT8_C( -16), INT8_C( 8),
INT8_C( 19), INT8_C( -11), INT8_C( 54), INT8_C( 13),
INT8_C( -71), INT8_C( -17), INT8_C( 22), INT8_C( 1),
INT8_C( 50), INT8_C( 0), INT8_C( -1), INT8_C( 11),
INT8_C( 17), INT8_C( -52), INT8_C( 62), INT8_C( -93),
INT8_C( -58), INT8_C( -94), INT8_C( -13), INT8_C( 9),
INT8_C( -23), INT8_C( 15), INT8_C( 9), INT8_C( -22),
INT8_C( -21), INT8_C( -1), INT8_C( 4), INT8_C( -19)) },
{ simde_mm512_set_epi8(INT8_C( -59), INT8_C( 52), INT8_C(-111), INT8_C( 20),
INT8_C( 26), INT8_C( -78), INT8_C( 121), INT8_C( 16),
INT8_C( 45), INT8_C( -27), INT8_C( 11), INT8_C( -26),
INT8_C( 53), INT8_C( 2), INT8_C( -22), INT8_C( 7),
INT8_C( -49), INT8_C(-110), INT8_C( -87), INT8_C( -23),
INT8_C( -50), INT8_C( 116), INT8_C( 55), INT8_C(-100),
INT8_C( -76), INT8_C( 91), INT8_C( 56), INT8_C(-110),
INT8_C( 55), INT8_C(-119), INT8_C( -56), INT8_C( 76),
INT8_C( 43), INT8_C( -11), INT8_C(-118), INT8_C( 3),
INT8_C( -43), INT8_C(-100), INT8_C( -90), INT8_C( -22),
INT8_C( -57), INT8_C( 2), INT8_C( 86), INT8_C( 72),
INT8_C( 93), INT8_C( -2), INT8_C( -66), INT8_C( 121),
INT8_C( 119), INT8_C( 75), INT8_C( -97), INT8_C( 76),
INT8_C( 70), INT8_C( -38), INT8_C( 17), INT8_C( -17),
INT8_C( 43), INT8_C(-104), INT8_C( -34), INT8_C( 80),
INT8_C( -59), INT8_C( 113), INT8_C( 112), INT8_C( 81)),
simde_mm512_set_epi8(INT8_C( -63), INT8_C( -94), INT8_C( -78), INT8_C( 36),
INT8_C( -78), INT8_C( 86), INT8_C( 79), INT8_C( -89),
INT8_C( -77), INT8_C( 45), INT8_C( 18), INT8_C( -25),
INT8_C( 113), INT8_C( 127), INT8_C( -45), INT8_C( -75),
INT8_C( 121), INT8_C( -85), INT8_C( 76), INT8_C(-121),
INT8_C( 15), INT8_C(-123), INT8_C( -9), INT8_C( 32),
INT8_C( -75), INT8_C( -88), INT8_C( -20), INT8_C( 99),
INT8_C( 85), INT8_C(-105), INT8_C( 36), INT8_C( 99),
INT8_C( 101), INT8_C( 42), INT8_C( 63), INT8_C( 96),
INT8_C( -46), INT8_C( -58), INT8_C( -54), INT8_C( 105),
INT8_C( -42), INT8_C( 74), INT8_C( -57), INT8_C( 17),
INT8_C( -22), INT8_C( 22), INT8_C(-122), INT8_C( 112),
INT8_C( 62), INT8_C(-115), INT8_C(-100), INT8_C( 91),
INT8_C( 99), INT8_C( 24), INT8_C( -58), INT8_C(-125),
INT8_C( 88), INT8_C(-120), INT8_C( 61), INT8_C( 94),
INT8_C( -67), INT8_C( -43), INT8_C( -7), INT8_C(-125)),
simde_mm512_set_epi8(INT8_C( -59), INT8_C( 52), INT8_C( -33), INT8_C( 20),
INT8_C( 26), INT8_C( -78), INT8_C( 42), INT8_C( 16),
INT8_C( 45), INT8_C( -27), INT8_C( 11), INT8_C( -1),
INT8_C( 53), INT8_C( 2), INT8_C( -22), INT8_C( 7),
INT8_C( -49), INT8_C( -25), INT8_C( -11), INT8_C( -23),
INT8_C( -5), INT8_C( 116), INT8_C( 1), INT8_C( -4),
INT8_C( -1), INT8_C( 3), INT8_C( 16), INT8_C( -11),
INT8_C( 55), INT8_C( -14), INT8_C( -20), INT8_C( 76),
INT8_C( 43), INT8_C( -11), INT8_C( -55), INT8_C( 3),
INT8_C( -43), INT8_C( -42), INT8_C( -36), INT8_C( -22),
INT8_C( -15), INT8_C( 2), INT8_C( 29), INT8_C( 4),
INT8_C( 5), INT8_C( -2), INT8_C( -66), INT8_C( 9),
INT8_C( 57), INT8_C( 75), INT8_C( -97), INT8_C( 76),
INT8_C( 70), INT8_C( -14), INT8_C( 17), INT8_C( -17),
INT8_C( 43), INT8_C(-104), INT8_C( -34), INT8_C( 80),
INT8_C( -59), INT8_C( 27), INT8_C( 0), INT8_C( 81)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_rem_epi8(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_i8(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_rem_epi16(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_mm512_set_epi16(INT16_C( 10545), INT16_C(-21562), INT16_C( 10284), INT16_C( -3533),
INT16_C(-29991), INT16_C(-10247), INT16_C(-14043), INT16_C(-30435),
INT16_C( -5718), INT16_C( -3714), INT16_C(-18934), INT16_C(-12090),
INT16_C( 23938), INT16_C(-15439), INT16_C(-17441), INT16_C(-29699),
INT16_C(-16473), INT16_C( -7616), INT16_C(-10806), INT16_C( 28273),
INT16_C( 23021), INT16_C( 18146), INT16_C(-31653), INT16_C( -168),
INT16_C( 26666), INT16_C( 13822), INT16_C(-31490), INT16_C( 24651),
INT16_C( 8048), INT16_C(-26711), INT16_C(-21410), INT16_C( 28762)),
simde_mm512_set_epi16(INT16_C(-26929), INT16_C( 8074), INT16_C( 18000), INT16_C(-29849),
INT16_C(-25121), INT16_C( 3254), INT16_C(-10254), INT16_C(-26681),
INT16_C(-24206), INT16_C(-28388), INT16_C( 25803), INT16_C( 25877),
INT16_C( 768), INT16_C( 16244), INT16_C( 11114), INT16_C( -7212),
INT16_C( 18192), INT16_C( 32532), INT16_C(-31836), INT16_C( -5163),
INT16_C( 20183), INT16_C( -1494), INT16_C( 2502), INT16_C( 18488),
INT16_C( 22771), INT16_C( 21470), INT16_C( 4556), INT16_C( 26138),
INT16_C( 19085), INT16_C( -923), INT16_C( -9934), INT16_C( -2165)),
simde_mm512_set_epi16(INT16_C( 10545), INT16_C( -5414), INT16_C( 10284), INT16_C( -3533),
INT16_C( -4870), INT16_C( -485), INT16_C( -3789), INT16_C( -3754),
INT16_C( -5718), INT16_C( -3714), INT16_C(-18934), INT16_C(-12090),
INT16_C( 130), INT16_C(-15439), INT16_C( -6327), INT16_C( -851),
INT16_C(-16473), INT16_C( -7616), INT16_C(-10806), INT16_C( 2458),
INT16_C( 2838), INT16_C( 218), INT16_C( -1629), INT16_C( -168),
INT16_C( 3895), INT16_C( 13822), INT16_C( -4154), INT16_C( 24651),
INT16_C( 8048), INT16_C( -867), INT16_C( -1542), INT16_C( 617)) },
{ simde_mm512_set_epi16(INT16_C( 20057), INT16_C( 26978), INT16_C(-19795), INT16_C(-31033),
INT16_C(-11277), INT16_C(-24100), INT16_C(-21653), INT16_C( 11009),
INT16_C(-15324), INT16_C( 9014), INT16_C( 24117), INT16_C(-31497),
INT16_C( -7188), INT16_C( 8311), INT16_C( 31759), INT16_C( 4002),
INT16_C( 7525), INT16_C( 3321), INT16_C(-18237), INT16_C( -1323),
INT16_C( 13644), INT16_C(-17383), INT16_C(-20302), INT16_C(-13836),
INT16_C( 7513), INT16_C( 1114), INT16_C( -200), INT16_C( 10389),
INT16_C(-31848), INT16_C( 9445), INT16_C( -5204), INT16_C(-24070)),
simde_mm512_set_epi16(INT16_C(-17379), INT16_C( -8623), INT16_C(-10486), INT16_C(-16677),
INT16_C( 27895), INT16_C(-17193), INT16_C( -5943), INT16_C( -5111),
INT16_C( -2949), INT16_C(-11305), INT16_C(-13092), INT16_C( 8140),
INT16_C( -6841), INT16_C( 2476), INT16_C(-24435), INT16_C( 7948),
INT16_C( 26094), INT16_C(-13182), INT16_C( 30122), INT16_C(-17848),
INT16_C(-21735), INT16_C( -7772), INT16_C( 1809), INT16_C(-31933),
INT16_C( 8271), INT16_C( 4936), INT16_C( 7627), INT16_C( 20477),
INT16_C( 14608), INT16_C( 25470), INT16_C(-19700), INT16_C( 25611)),
simde_mm512_set_epi16(INT16_C( 2678), INT16_C( 1109), INT16_C( -9309), INT16_C(-14356),
INT16_C(-11277), INT16_C( -6907), INT16_C( -3824), INT16_C( 787),
INT16_C( -579), INT16_C( 9014), INT16_C( 11025), INT16_C( -7077),
INT16_C( -347), INT16_C( 883), INT16_C( 7324), INT16_C( 4002),
INT16_C( 7525), INT16_C( 3321), INT16_C(-18237), INT16_C( -1323),
INT16_C( 13644), INT16_C( -1839), INT16_C( -403), INT16_C(-13836),
INT16_C( 7513), INT16_C( 1114), INT16_C( -200), INT16_C( 10389),
INT16_C( -2632), INT16_C( 9445), INT16_C( -5204), INT16_C(-24070)) },
{ simde_mm512_set_epi16(INT16_C( 26902), INT16_C(-14525), INT16_C( -7905), INT16_C( -8015),
INT16_C(-22131), INT16_C( 18318), INT16_C(-21513), INT16_C( 9770),
INT16_C( 4118), INT16_C(-32437), INT16_C( 6621), INT16_C( -7897),
INT16_C( 22002), INT16_C(-32381), INT16_C( 15537), INT16_C(-26793),
INT16_C( 26466), INT16_C( 21183), INT16_C( 5811), INT16_C( 17016),
INT16_C(-14374), INT16_C(-18761), INT16_C(-11284), INT16_C( -933),
INT16_C( 30444), INT16_C( 20573), INT16_C(-14964), INT16_C( 25607),
INT16_C(-28815), INT16_C(-28739), INT16_C( 27147), INT16_C( -3265)),
simde_mm512_set_epi16(INT16_C(-10155), INT16_C(-12697), INT16_C( -5222), INT16_C(-32377),
INT16_C( 32076), INT16_C(-13716), INT16_C( 13383), INT16_C(-22332),
INT16_C( 18058), INT16_C(-22719), INT16_C( -8799), INT16_C(-25251),
INT16_C(-16195), INT16_C(-26213), INT16_C(-12331), INT16_C( 27016),
INT16_C( -5538), INT16_C( -4084), INT16_C(-28159), INT16_C(-27845),
INT16_C( -742), INT16_C( 6696), INT16_C( 3074), INT16_C( -6511),
INT16_C(-21911), INT16_C( 28576), INT16_C(-29494), INT16_C(-22820),
INT16_C(-17599), INT16_C( -1341), INT16_C( 8579), INT16_C( 676)),
simde_mm512_set_epi16(INT16_C( 6592), INT16_C( -1828), INT16_C( -2683), INT16_C( -8015),
INT16_C(-22131), INT16_C( 4602), INT16_C( -8130), INT16_C( 9770),
INT16_C( 4118), INT16_C( -9718), INT16_C( 6621), INT16_C( -7897),
INT16_C( 5807), INT16_C( -6168), INT16_C( 3206), INT16_C(-26793),
INT16_C( 4314), INT16_C( 763), INT16_C( 5811), INT16_C( 17016),
INT16_C( -276), INT16_C( -5369), INT16_C( -2062), INT16_C( -933),
INT16_C( 8533), INT16_C( 20573), INT16_C(-14964), INT16_C( 2787),
INT16_C(-11216), INT16_C( -578), INT16_C( 1410), INT16_C( -561)) },
{ simde_mm512_set_epi16(INT16_C( 7566), INT16_C( 25511), INT16_C( -5831), INT16_C( 13989),
INT16_C( 13965), INT16_C(-31065), INT16_C( 77), INT16_C(-30384),
INT16_C( 21705), INT16_C(-23032), INT16_C( -2503), INT16_C( -8652),
INT16_C(-23147), INT16_C( -4009), INT16_C( 7598), INT16_C( 23051),
INT16_C( 13886), INT16_C( 28688), INT16_C( 30551), INT16_C(-28928),
INT16_C( -9491), INT16_C(-26549), INT16_C( -738), INT16_C( 22350),
INT16_C( 7981), INT16_C(-15059), INT16_C(-18848), INT16_C( 16804),
INT16_C(-31876), INT16_C( -1787), INT16_C( 29649), INT16_C( -721)),
simde_mm512_set_epi16(INT16_C( 18409), INT16_C( 19069), INT16_C( 20979), INT16_C(-29762),
INT16_C( 8112), INT16_C( 25085), INT16_C( 31664), INT16_C(-10132),
INT16_C( -2207), INT16_C( 19403), INT16_C(-32530), INT16_C( 20365),
INT16_C( 22045), INT16_C(-23601), INT16_C( 28665), INT16_C(-29743),
INT16_C( 26789), INT16_C(-25295), INT16_C(-31460), INT16_C(-29347),
INT16_C(-16029), INT16_C(-32645), INT16_C(-19836), INT16_C( 31541),
INT16_C(-32299), INT16_C(-14817), INT16_C( 22782), INT16_C(-18634),
INT16_C( -2744), INT16_C( 907), INT16_C( 9939), INT16_C( 395)),
simde_mm512_set_epi16(INT16_C( 7566), INT16_C( 6442), INT16_C( -5831), INT16_C( 13989),
INT16_C( 5853), INT16_C( -5980), INT16_C( 77), INT16_C(-10120),
INT16_C( 1842), INT16_C( -3629), INT16_C( -2503), INT16_C( -8652),
INT16_C( -1102), INT16_C( -4009), INT16_C( 7598), INT16_C( 23051),
INT16_C( 13886), INT16_C( 3393), INT16_C( 30551), INT16_C(-28928),
INT16_C( -9491), INT16_C(-26549), INT16_C( -738), INT16_C( 22350),
INT16_C( 7981), INT16_C( -242), INT16_C(-18848), INT16_C( 16804),
INT16_C( -1692), INT16_C( -880), INT16_C( 9771), INT16_C( -326)) },
{ simde_mm512_set_epi16(INT16_C(-24983), INT16_C( 9260), INT16_C( 6846), INT16_C( 21618),
INT16_C( 20365), INT16_C( 26413), INT16_C( 7670), INT16_C( 6521),
INT16_C( 13052), INT16_C( 19892), INT16_C(-25515), INT16_C( -7444),
INT16_C( 12337), INT16_C( 14080), INT16_C( 6934), INT16_C( -4021),
INT16_C( 1885), INT16_C( 11733), INT16_C( 7371), INT16_C( 24583),
INT16_C(-17187), INT16_C(-28061), INT16_C(-18330), INT16_C(-10845),
INT16_C( -2076), INT16_C( 2107), INT16_C( -3367), INT16_C(-26728),
INT16_C( 21341), INT16_C(-13702), INT16_C( 26283), INT16_C(-27301)),
simde_mm512_set_epi16(INT16_C( 9227), INT16_C( 20728), INT16_C( 22448), INT16_C( 22271),
INT16_C(-27526), INT16_C( 3228), INT16_C(-26938), INT16_C( 15839),
INT16_C( 4554), INT16_C( 22831), INT16_C(-21433), INT16_C( 32351),
INT16_C(-18789), INT16_C( 20983), INT16_C( -3647), INT16_C( 26454),
INT16_C( -2225), INT16_C( 19804), INT16_C( -2763), INT16_C( -8730),
INT16_C(-29152), INT16_C( 25302), INT16_C(-28393), INT16_C( 3478),
INT16_C( -5675), INT16_C( -4361), INT16_C(-16878), INT16_C( 23119),
INT16_C( 30252), INT16_C( -2420), INT16_C( 13170), INT16_C(-21449)),
simde_mm512_set_epi16(INT16_C( -6529), INT16_C( 9260), INT16_C( 6846), INT16_C( 21618),
INT16_C( 20365), INT16_C( 589), INT16_C( 7670), INT16_C( 6521),
INT16_C( 3944), INT16_C( 19892), INT16_C( -4082), INT16_C( -7444),
INT16_C( 12337), INT16_C( 14080), INT16_C( 3287), INT16_C( -4021),
INT16_C( 1885), INT16_C( 11733), INT16_C( 1845), INT16_C( 7123),
INT16_C(-17187), INT16_C( -2759), INT16_C(-18330), INT16_C( -411),
INT16_C( -2076), INT16_C( 2107), INT16_C( -3367), INT16_C( -3609),
INT16_C( 21341), INT16_C( -1602), INT16_C( 13113), INT16_C( -5852)) },
{ simde_mm512_set_epi16(INT16_C( 22335), INT16_C( 12112), INT16_C( 9189), INT16_C( 1311),
INT16_C( -7095), INT16_C( 13615), INT16_C(-21824), INT16_C( 31469),
INT16_C( 12162), INT16_C( -9370), INT16_C(-23767), INT16_C(-15401),
INT16_C(-14538), INT16_C( 24958), INT16_C( 2725), INT16_C(-25768),
INT16_C(-18369), INT16_C( 24484), INT16_C( 16711), INT16_C(-20904),
INT16_C(-18546), INT16_C( 25102), INT16_C( 6573), INT16_C( 22274),
INT16_C(-16497), INT16_C(-26622), INT16_C( 32256), INT16_C(-24007),
INT16_C( -2780), INT16_C( -4298), INT16_C( 8613), INT16_C(-14508)),
simde_mm512_set_epi16(INT16_C( 30472), INT16_C(-28763), INT16_C( 7714), INT16_C( 18947),
INT16_C( 7066), INT16_C(-17692), INT16_C( -6885), INT16_C( 1841),
INT16_C(-29737), INT16_C(-14957), INT16_C(-32610), INT16_C( 26598),
INT16_C(-25999), INT16_C( -4399), INT16_C( 5946), INT16_C( 2262),
INT16_C( -5420), INT16_C( 12953), INT16_C(-27491), INT16_C(-17749),
INT16_C( 30618), INT16_C(-27725), INT16_C(-13788), INT16_C(-13300),
INT16_C( 23394), INT16_C( 2441), INT16_C( 32382), INT16_C( 9384),
INT16_C( 25792), INT16_C( -9373), INT16_C( 22658), INT16_C( 20939)),
simde_mm512_set_epi16(INT16_C( 22335), INT16_C( 12112), INT16_C( 1475), INT16_C( 1311),
INT16_C( -29), INT16_C( 13615), INT16_C( -1169), INT16_C( 172),
INT16_C( 12162), INT16_C( -9370), INT16_C(-23767), INT16_C(-15401),
INT16_C(-14538), INT16_C( 2963), INT16_C( 2725), INT16_C( -886),
INT16_C( -2109), INT16_C( 11531), INT16_C( 16711), INT16_C( -3155),
INT16_C(-18546), INT16_C( 25102), INT16_C( 6573), INT16_C( 8974),
INT16_C(-16497), INT16_C( -2212), INT16_C( 32256), INT16_C( -5239),
INT16_C( -2780), INT16_C( -4298), INT16_C( 8613), INT16_C(-14508)) },
{ simde_mm512_set_epi16(INT16_C( 13867), INT16_C( 28091), INT16_C(-30146), INT16_C( -8550),
INT16_C( 31509), INT16_C( -2205), INT16_C( 9520), INT16_C( 29929),
INT16_C( 24571), INT16_C(-27795), INT16_C(-12850), INT16_C( 14609),
INT16_C( 31001), INT16_C( 823), INT16_C(-19839), INT16_C(-27185),
INT16_C(-29756), INT16_C(-24530), INT16_C( 3633), INT16_C(-20036),
INT16_C( 30184), INT16_C( 27396), INT16_C( 1171), INT16_C( 25936),
INT16_C( -3833), INT16_C( -7750), INT16_C( 19453), INT16_C( 30002),
INT16_C( 6315), INT16_C( 244), INT16_C( 8399), INT16_C( -8080)),
simde_mm512_set_epi16(INT16_C( 18752), INT16_C( 27431), INT16_C(-11832), INT16_C(-22911),
INT16_C(-22667), INT16_C(-23791), INT16_C(-17993), INT16_C( 11401),
INT16_C( 26966), INT16_C( 26500), INT16_C( 7486), INT16_C( 7825),
INT16_C( 17767), INT16_C( -7030), INT16_C(-29302), INT16_C(-27163),
INT16_C(-10544), INT16_C(-18630), INT16_C(-13432), INT16_C( 31285),
INT16_C(-30604), INT16_C( 29467), INT16_C(-31755), INT16_C( 883),
INT16_C( 23995), INT16_C(-22467), INT16_C(-11949), INT16_C( 11327),
INT16_C(-28925), INT16_C( 7518), INT16_C( 30015), INT16_C( 30285)),
simde_mm512_set_epi16(INT16_C( 13867), INT16_C( 660), INT16_C( -6482), INT16_C( -8550),
INT16_C( 8842), INT16_C( -2205), INT16_C( 9520), INT16_C( 7127),
INT16_C( 24571), INT16_C( -1295), INT16_C( -5364), INT16_C( 6784),
INT16_C( 13234), INT16_C( 823), INT16_C(-19839), INT16_C( -22),
INT16_C( -8668), INT16_C( -5900), INT16_C( 3633), INT16_C(-20036),
INT16_C( 30184), INT16_C( 27396), INT16_C( 1171), INT16_C( 329),
INT16_C( -3833), INT16_C( -7750), INT16_C( 7504), INT16_C( 7348),
INT16_C( 6315), INT16_C( 244), INT16_C( 8399), INT16_C( -8080)) },
{ simde_mm512_set_epi16(INT16_C( 19003), INT16_C( 26627), INT16_C( -1831), INT16_C(-31318),
INT16_C(-29481), INT16_C( 13847), INT16_C(-20911), INT16_C( 9042),
INT16_C(-29388), INT16_C( 11660), INT16_C( 32339), INT16_C(-25821),
INT16_C(-18358), INT16_C( 21002), INT16_C( -4830), INT16_C( 8527),
INT16_C( 26072), INT16_C( 29611), INT16_C( 18348), INT16_C( 953),
INT16_C(-32154), INT16_C( 22717), INT16_C(-15414), INT16_C(-13122),
INT16_C( -6258), INT16_C(-11311), INT16_C( 31952), INT16_C( 29752),
INT16_C(-28048), INT16_C( 20614), INT16_C( 1055), INT16_C( -4387)),
simde_mm512_set_epi16(INT16_C( -5809), INT16_C( 3072), INT16_C( 8626), INT16_C( 14922),
INT16_C( -1420), INT16_C(-29164), INT16_C( 22591), INT16_C( 8828),
INT16_C( -1488), INT16_C( -8728), INT16_C( -8885), INT16_C(-25776),
INT16_C( -5719), INT16_C(-14622), INT16_C( 21275), INT16_C(-30430),
INT16_C( 6020), INT16_C( 27245), INT16_C(-30773), INT16_C( 25208),
INT16_C( 25908), INT16_C( 21036), INT16_C(-29170), INT16_C( 25589),
INT16_C( 2188), INT16_C(-29317), INT16_C( -9309), INT16_C(-15127),
INT16_C( 8889), INT16_C( -7060), INT16_C( 24556), INT16_C( 24873)),
simde_mm512_set_epi16(INT16_C( 1576), INT16_C( 2051), INT16_C( -1831), INT16_C( -1474),
INT16_C( -1081), INT16_C( 13847), INT16_C(-20911), INT16_C( 214),
INT16_C( -1116), INT16_C( 2932), INT16_C( 5684), INT16_C( -45),
INT16_C( -1201), INT16_C( 6380), INT16_C( -4830), INT16_C( 8527),
INT16_C( 1992), INT16_C( 2366), INT16_C( 18348), INT16_C( 953),
INT16_C( -6246), INT16_C( 1681), INT16_C(-15414), INT16_C(-13122),
INT16_C( -1882), INT16_C(-11311), INT16_C( 4025), INT16_C( 14625),
INT16_C( -1381), INT16_C( 6494), INT16_C( 1055), INT16_C( -4387)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_rem_epi16(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_i16(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_rem_epi32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_mm512_set_epi32(INT32_C( 691121094), INT32_C( 674034227), INT32_C(-1965434887), INT32_C( -920286947),
INT32_C( -374673026), INT32_C(-1240805178), INT32_C( 1568850865), INT32_C(-1142977539),
INT32_C(-1079516608), INT32_C( -708153743), INT32_C( 1508722402), INT32_C(-2074345640),
INT32_C( 1747596798), INT32_C(-2063703989), INT32_C( 527472553), INT32_C(-1403096998)),
simde_mm512_set_epi32(INT32_C(-1764810870), INT32_C( 1179683687), INT32_C(-1646326602), INT32_C( -671967289),
INT32_C(-1586327268), INT32_C( 1691051285), INT32_C( 50347892), INT32_C( 728425428),
INT32_C( 1192263444), INT32_C(-2086343723), INT32_C( 1322777130), INT32_C( 163989560),
INT32_C( 1492341726), INT32_C( 298608154), INT32_C( 1250819173), INT32_C( -650971253)),
simde_mm512_set_epi32(INT32_C( 691121094), INT32_C( 674034227), INT32_C( -319108285), INT32_C( -248319658),
INT32_C( -374673026), INT32_C(-1240805178), INT32_C( 8066213), INT32_C( -414552111),
INT32_C(-1079516608), INT32_C( -708153743), INT32_C( 185945272), INT32_C( -106470920),
INT32_C( 255255072), INT32_C( -272055065), INT32_C( 527472553), INT32_C( -101154492)) },
{ simde_mm512_set_epi32(INT32_C( 1314482530), INT32_C(-1297250617), INT32_C( -739008036), INT32_C(-1419039999),
INT32_C(-1004264650), INT32_C( 1580565751), INT32_C( -471064457), INT32_C( 2081361826),
INT32_C( 493161721), INT32_C(-1195115819), INT32_C( 894221337), INT32_C(-1330460172),
INT32_C( 492373082), INT32_C( -13096811), INT32_C(-2087181083), INT32_C( -341007878)),
simde_mm512_set_epi32(INT32_C(-1138893231), INT32_C( -687161637), INT32_C( 1828175063), INT32_C( -389420023),
INT32_C( -193211433), INT32_C( -857989172), INT32_C( -448329300), INT32_C(-1601364212),
INT32_C( 1710148738), INT32_C( 1974123080), INT32_C(-1424367196), INT32_C( 118588227),
INT32_C( 542053192), INT32_C( 499863549), INT32_C( 957375358), INT32_C(-1291033589)),
simde_mm512_set_epi32(INT32_C( 175589299), INT32_C( -610088980), INT32_C( -739008036), INT32_C( -250779930),
INT32_C( -38207485), INT32_C( 722576579), INT32_C( -22735157), INT32_C( 479997614),
INT32_C( 493161721), INT32_C(-1195115819), INT32_C( 894221337), INT32_C( -25989675),
INT32_C( 492373082), INT32_C( -13096811), INT32_C( -172430367), INT32_C( -341007878)) },
{ simde_mm512_set_epi32(INT32_C( 1763100483), INT32_C( -518004559), INT32_C(-1450358898), INT32_C(-1409866198),
INT32_C( 269910347), INT32_C( 433971495), INT32_C( 1441956227), INT32_C( 1018271575),
INT32_C( 1734496959), INT32_C( 380846712), INT32_C( -941967689), INT32_C( -739443621),
INT32_C( 1995198557), INT32_C( -980655097), INT32_C(-1888383043), INT32_C( 1779168063)),
simde_mm512_set_epi32(INT32_C( -665465241), INT32_C( -342195833), INT32_C( 2102184556), INT32_C( 877111492),
INT32_C( 1183491905), INT32_C( -576610979), INT32_C(-1061316197), INT32_C( -808097400),
INT32_C( -362876916), INT32_C(-1845390533), INT32_C( -48621016), INT32_C( 201516689),
INT32_C(-1435930720), INT32_C(-1932876068), INT32_C(-1153303869), INT32_C( 562234020)),
simde_mm512_set_epi32(INT32_C( 432170001), INT32_C( -175808726), INT32_C(-1450358898), INT32_C( -532754706),
INT32_C( 269910347), INT32_C( 433971495), INT32_C( 380640030), INT32_C( 210174175),
INT32_C( 282989295), INT32_C( 380846712), INT32_C( -18168385), INT32_C( -134893554),
INT32_C( 559267837), INT32_C( -980655097), INT32_C( -735079174), INT32_C( 92466003)) },
{ simde_mm512_set_epi32(INT32_C( 495870887), INT32_C( -382126427), INT32_C( 915244711), INT32_C( 5081424),
INT32_C( 1422501384), INT32_C( -163979724), INT32_C(-1516900265), INT32_C( 497965579),
INT32_C( 910061584), INT32_C( 2002226944), INT32_C( -621963189), INT32_C( -48343218),
INT32_C( 523093293), INT32_C(-1235205724), INT32_C(-2088961787), INT32_C( 1943141679)),
simde_mm512_set_epi32(INT32_C( 1206471293), INT32_C( 1374915518), INT32_C( 531653117), INT32_C( 2075187308),
INT32_C( -144618549), INT32_C(-2131865715), INT32_C( 1444783055), INT32_C( 1878625233),
INT32_C( 1755684145), INT32_C(-2061726371), INT32_C(-1050443653), INT32_C(-1299940555),
INT32_C(-2116696545), INT32_C( 1493088054), INT32_C( -179829877), INT32_C( 651362699)),
simde_mm512_set_epi32(INT32_C( 495870887), INT32_C( -382126427), INT32_C( 383591594), INT32_C( 5081424),
INT32_C( 120934443), INT32_C( -163979724), INT32_C( -72117210), INT32_C( 497965579),
INT32_C( 910061584), INT32_C( 2002226944), INT32_C( -621963189), INT32_C( -48343218),
INT32_C( 523093293), INT32_C(-1235205724), INT32_C( -110833140), INT32_C( 640416281)) },
{ simde_mm512_set_epi32(INT32_C(-1637276628), INT32_C( 448681074), INT32_C( 1334667053), INT32_C( 502667641),
INT32_C( 855395764), INT32_C(-1672092948), INT32_C( 808531712), INT32_C( 454488139),
INT32_C( 123547093), INT32_C( 483090439), INT32_C(-1126329757), INT32_C(-1201220189),
INT32_C( -136050629), INT32_C( -220620904), INT32_C( 1398655610), INT32_C( 1722520923)),
simde_mm512_set_epi32(INT32_C( 604721400), INT32_C( 1471174399), INT32_C(-1803940708), INT32_C(-1765392929),
INT32_C( 298473775), INT32_C(-1404600737), INT32_C(-1231334921), INT32_C( -238983338),
INT32_C( -145797796), INT32_C( -181019162), INT32_C(-1910480170), INT32_C(-1860760170),
INT32_C( -371855625), INT32_C(-1106093489), INT32_C( 1982658188), INT32_C( 863153207)),
simde_mm512_set_epi32(INT32_C( -427833828), INT32_C( 448681074), INT32_C( 1334667053), INT32_C( 502667641),
INT32_C( 258448214), INT32_C( -267492211), INT32_C( 808531712), INT32_C( 215504801),
INT32_C( 123547093), INT32_C( 121052115), INT32_C(-1126329757), INT32_C(-1201220189),
INT32_C( -136050629), INT32_C( -220620904), INT32_C( 1398655610), INT32_C( 859367716)) },
{ simde_mm512_set_epi32(INT32_C( 1463758672), INT32_C( 602211615), INT32_C( -464964305), INT32_C(-1430226195),
INT32_C( 797104998), INT32_C(-1557543977), INT32_C( -952737410), INT32_C( 178625368),
INT32_C(-1203806300), INT32_C( 1095216728), INT32_C(-1215405554), INT32_C( 430790402),
INT32_C(-1081108478), INT32_C( 2113970745), INT32_C( -182128842), INT32_C( 564512596)),
simde_mm512_set_epi32(INT32_C( 1997049765), INT32_C( 505563651), INT32_C( 463125220), INT32_C( -451213519),
INT32_C(-1948793453), INT32_C(-2137102362), INT32_C(-1703809327), INT32_C( 389679318),
INT32_C( -355192167), INT32_C(-1801602389), INT32_C( 2006619059), INT32_C( -903558132),
INT32_C( 1533151625), INT32_C( 2122196136), INT32_C( 1690360675), INT32_C( 1484935627)),
simde_mm512_set_epi32(INT32_C( 1463758672), INT32_C( 96647964), INT32_C( -1839085), INT32_C( -76585638),
INT32_C( 797104998), INT32_C(-1557543977), INT32_C( -952737410), INT32_C( 178625368),
INT32_C( -138229799), INT32_C( 1095216728), INT32_C(-1215405554), INT32_C( 430790402),
INT32_C(-1081108478), INT32_C( 2113970745), INT32_C( -182128842), INT32_C( 564512596)) },
{ simde_mm512_set_epi32(INT32_C( 908815803), INT32_C(-1975591270), INT32_C( 2065037155), INT32_C( 623932649),
INT32_C( 1610322797), INT32_C( -842122991), INT32_C( 2031682359), INT32_C(-1300130353),
INT32_C(-1950048210), INT32_C( 238137788), INT32_C( 1978166020), INT32_C( 76768592),
INT32_C( -251141702), INT32_C( 1274901810), INT32_C( 413860084), INT32_C( 550494320)),
simde_mm512_set_epi32(INT32_C( 1228958503), INT32_C( -775379327), INT32_C(-1485462767), INT32_C(-1179177847),
INT32_C( 1767270276), INT32_C( 490610321), INT32_C( 1164436618), INT32_C(-1920297499),
INT32_C( -690964678), INT32_C( -880248267), INT32_C(-2005634277), INT32_C(-2081094797),
INT32_C( 1572579389), INT32_C( -783078337), INT32_C(-1895621282), INT32_C( 1967093325)),
simde_mm512_set_epi32(INT32_C( 908815803), INT32_C( -424832616), INT32_C( 579574388), INT32_C( 623932649),
INT32_C( 1610322797), INT32_C( -351512670), INT32_C( 867245741), INT32_C(-1300130353),
INT32_C( -568118854), INT32_C( 238137788), INT32_C( 1978166020), INT32_C( 76768592),
INT32_C( -251141702), INT32_C( 491823473), INT32_C( 413860084), INT32_C( 550494320)) },
{ simde_mm512_set_epi32(INT32_C( 1245407235), INT32_C( -119962198), INT32_C(-1932052969), INT32_C(-1370414254),
INT32_C(-1925960308), INT32_C( 2119408419), INT32_C(-1203088886), INT32_C( -316530353),
INT32_C( 1708684203), INT32_C( 1202455481), INT32_C(-2107221827), INT32_C(-1010119490),
INT32_C( -410070063), INT32_C( 2094036024), INT32_C(-1838133114), INT32_C( 69201629)),
simde_mm512_set_epi32(INT32_C( -380695552), INT32_C( 565328458), INT32_C( -93024748), INT32_C( 1480532604),
INT32_C( -97460760), INT32_C( -582247600), INT32_C( -374749470), INT32_C( 1394313506),
INT32_C( 394553965), INT32_C(-2016714120), INT32_C( 1697927724), INT32_C(-1911659531),
INT32_C( 143428987), INT32_C( -610024215), INT32_C( 582607980), INT32_C( 1609326889)),
simde_mm512_set_epi32(INT32_C( 103320579), INT32_C( -119962198), INT32_C( -71558009), INT32_C(-1370414254),
INT32_C( -74205868), INT32_C( 372665619), INT32_C( -78840476), INT32_C( -316530353),
INT32_C( 130468343), INT32_C( 1202455481), INT32_C( -409294103), INT32_C(-1010119490),
INT32_C( -123212089), INT32_C( 263963379), INT32_C( -90309174), INT32_C( 69201629)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_rem_epi32(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_i32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_mask_rem_epi32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i src;
simde__mmask16 k;
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_mm512_set_epi32(INT32_C( 691121094), INT32_C( 674034227), INT32_C(-1965434887), INT32_C( -920286947),
INT32_C( -374673026), INT32_C(-1240805178), INT32_C( 1568850865), INT32_C(-1142977539),
INT32_C(-1079516608), INT32_C( -708153743), INT32_C( 1508722402), INT32_C(-2074345640),
INT32_C( 1747596798), INT32_C(-2063703989), INT32_C( 527472553), INT32_C(-1403096998)),
UINT16_C(63371),
simde_mm512_set_epi32(INT32_C( -341007878), INT32_C(-1764810870), INT32_C( 1179683687), INT32_C(-1646326602),
INT32_C( -671967289), INT32_C(-1586327268), INT32_C( 1691051285), INT32_C( 50347892),
INT32_C( 728425428), INT32_C( 1192263444), INT32_C(-2086343723), INT32_C( 1322777130),
INT32_C( 163989560), INT32_C( 1492341726), INT32_C( 298608154), INT32_C( 1250819173)),
simde_mm512_set_epi32(INT32_C(-1291033589), INT32_C( 1314482530), INT32_C(-1297250617), INT32_C( -739008036),
INT32_C(-1419039999), INT32_C(-1004264650), INT32_C( 1580565751), INT32_C( -471064457),
INT32_C( 2081361826), INT32_C( 493161721), INT32_C(-1195115819), INT32_C( 894221337),
INT32_C(-1330460172), INT32_C( 492373082), INT32_C( -13096811), INT32_C(-2087181083)),
simde_mm512_set_epi32(INT32_C( -341007878), INT32_C( -450328340), INT32_C( 1179683687), INT32_C( -168310530),
INT32_C( -374673026), INT32_C( -582062618), INT32_C( 110485534), INT32_C( 50347892),
INT32_C( 728425428), INT32_C( -708153743), INT32_C( 1508722402), INT32_C(-2074345640),
INT32_C( 163989560), INT32_C(-2063703989), INT32_C( 10478312), INT32_C( 1250819173)) },
{ simde_mm512_set_epi32(INT32_C( 1779168063), INT32_C(-1138893231), INT32_C( -687161637), INT32_C( 1828175063),
INT32_C( -389420023), INT32_C( -193211433), INT32_C( -857989172), INT32_C( -448329300),
INT32_C(-1601364212), INT32_C( 1710148738), INT32_C( 1974123080), INT32_C(-1424367196),
INT32_C( 118588227), INT32_C( 542053192), INT32_C( 499863549), INT32_C( 957375358)),
UINT16_C(36797),
simde_mm512_set_epi32(INT32_C(-1153303869), INT32_C( 562234020), INT32_C( 1763100483), INT32_C( -518004559),
INT32_C(-1450358898), INT32_C(-1409866198), INT32_C( 269910347), INT32_C( 433971495),
INT32_C( 1441956227), INT32_C( 1018271575), INT32_C( 1734496959), INT32_C( 380846712),
INT32_C( -941967689), INT32_C( -739443621), INT32_C( 1995198557), INT32_C( -980655097)),
simde_mm512_set_epi32(INT32_C(-2088961787), INT32_C( 1943141679), INT32_C( -665465241), INT32_C( -342195833),
INT32_C( 2102184556), INT32_C( 877111492), INT32_C( 1183491905), INT32_C( -576610979),
INT32_C(-1061316197), INT32_C( -808097400), INT32_C( -362876916), INT32_C(-1845390533),
INT32_C( -48621016), INT32_C( 201516689), INT32_C(-1435930720), INT32_C(-1932876068)),
simde_mm512_set_epi32(INT32_C(-1153303869), INT32_C(-1138893231), INT32_C( -687161637), INT32_C( 1828175063),
INT32_C(-1450358898), INT32_C( -532754706), INT32_C( 269910347), INT32_C( 433971495),
INT32_C( 380640030), INT32_C( 1710148738), INT32_C( 282989295), INT32_C( 380846712),
INT32_C( -18168385), INT32_C( -134893554), INT32_C( 499863549), INT32_C( -980655097)) },
{ simde_mm512_set_epi32(INT32_C( -179829877), INT32_C( 651362699), INT32_C( 495870887), INT32_C( -382126427),
INT32_C( 915244711), INT32_C( 5081424), INT32_C( 1422501384), INT32_C( -163979724),
INT32_C(-1516900265), INT32_C( 497965579), INT32_C( 910061584), INT32_C( 2002226944),
INT32_C( -621963189), INT32_C( -48343218), INT32_C( 523093293), INT32_C(-1235205724)),
UINT16_C(46902),
simde_mm512_set_epi32(INT32_C( -220620904), INT32_C( 1398655610), INT32_C( 1722520923), INT32_C( 1206471293),
INT32_C( 1374915518), INT32_C( 531653117), INT32_C( 2075187308), INT32_C( -144618549),
INT32_C(-2131865715), INT32_C( 1444783055), INT32_C( 1878625233), INT32_C( 1755684145),
INT32_C(-2061726371), INT32_C(-1050443653), INT32_C(-1299940555), INT32_C(-2116696545)),
simde_mm512_set_epi32(INT32_C(-1106093489), INT32_C( 1982658188), INT32_C( 863153207), INT32_C(-1637276628),
INT32_C( 448681074), INT32_C( 1334667053), INT32_C( 502667641), INT32_C( 855395764),
INT32_C(-1672092948), INT32_C( 808531712), INT32_C( 454488139), INT32_C( 123547093),
INT32_C( 483090439), INT32_C(-1126329757), INT32_C(-1201220189), INT32_C( -136050629)),
simde_mm512_set_epi32(INT32_C( -220620904), INT32_C( 651362699), INT32_C( 859367716), INT32_C( 1206471293),
INT32_C( 915244711), INT32_C( 531653117), INT32_C( 64516744), INT32_C( -144618549),
INT32_C(-1516900265), INT32_C( 497965579), INT32_C( 60672677), INT32_C( 26024843),
INT32_C( -621963189), INT32_C(-1050443653), INT32_C( -98720366), INT32_C(-1235205724)) },
{ simde_mm512_set_epi32(INT32_C( 2113970745), INT32_C( -182128842), INT32_C( 564512596), INT32_C( 604721400),
INT32_C( 1471174399), INT32_C(-1803940708), INT32_C(-1765392929), INT32_C( 298473775),
INT32_C(-1404600737), INT32_C(-1231334921), INT32_C( -238983338), INT32_C( -145797796),
INT32_C( -181019162), INT32_C(-1910480170), INT32_C(-1860760170), INT32_C( -371855625)),
UINT16_C(38914),
simde_mm512_set_epi32(INT32_C( 1533151625), INT32_C( 2122196136), INT32_C( 1690360675), INT32_C( 1484935627),
INT32_C( 1463758672), INT32_C( 602211615), INT32_C( -464964305), INT32_C(-1430226195),
INT32_C( 797104998), INT32_C(-1557543977), INT32_C( -952737410), INT32_C( 178625368),
INT32_C(-1203806300), INT32_C( 1095216728), INT32_C(-1215405554), INT32_C( 430790402)),
simde_mm512_set_epi32(INT32_C( -251141702), INT32_C( 1274901810), INT32_C( 413860084), INT32_C( 550494320),
INT32_C( 1997049765), INT32_C( 505563651), INT32_C( 463125220), INT32_C( -451213519),
INT32_C(-1948793453), INT32_C(-2137102362), INT32_C(-1703809327), INT32_C( 389679318),
INT32_C( -355192167), INT32_C(-1801602389), INT32_C( 2006619059), INT32_C( -903558132)),
simde_mm512_set_epi32(INT32_C( 26301413), INT32_C( -182128842), INT32_C( 564512596), INT32_C( 383946987),
INT32_C( 1463758672), INT32_C(-1803940708), INT32_C(-1765392929), INT32_C( 298473775),
INT32_C(-1404600737), INT32_C(-1231334921), INT32_C( -238983338), INT32_C( -145797796),
INT32_C( -181019162), INT32_C(-1910480170), INT32_C(-1215405554), INT32_C( -371855625)) },
{ simde_mm512_set_epi32(INT32_C( 1572579389), INT32_C( -783078337), INT32_C(-1895621282), INT32_C( 1967093325),
INT32_C( 908815803), INT32_C(-1975591270), INT32_C( 2065037155), INT32_C( 623932649),
INT32_C( 1610322797), INT32_C( -842122991), INT32_C( 2031682359), INT32_C(-1300130353),
INT32_C(-1950048210), INT32_C( 238137788), INT32_C( 1978166020), INT32_C( 76768592)),
UINT16_C( 883),
simde_mm512_set_epi32(INT32_C(-1010119490), INT32_C( -410070063), INT32_C( 2094036024), INT32_C(-1838133114),
INT32_C( 69201629), INT32_C( 1228958503), INT32_C( -775379327), INT32_C(-1485462767),
INT32_C(-1179177847), INT32_C( 1767270276), INT32_C( 490610321), INT32_C( 1164436618),
INT32_C(-1920297499), INT32_C( -690964678), INT32_C( -880248267), INT32_C(-2005634277)),
simde_mm512_set_epi32(INT32_C(-1911659531), INT32_C( 143428987), INT32_C( -610024215), INT32_C( 582607980),
INT32_C( 1609326889), INT32_C( 1245407235), INT32_C( -119962198), INT32_C(-1932052969),
INT32_C(-1370414254), INT32_C(-1925960308), INT32_C( 2119408419), INT32_C(-1203088886),
INT32_C( -316530353), INT32_C( 1708684203), INT32_C( 1202455481), INT32_C(-2107221827)),
simde_mm512_set_epi32(INT32_C( 1572579389), INT32_C( -783078337), INT32_C(-1895621282), INT32_C( 1967093325),
INT32_C( 908815803), INT32_C(-1975591270), INT32_C( -55606139), INT32_C(-1485462767),
INT32_C( 1610322797), INT32_C( 1767270276), INT32_C( 490610321), INT32_C( 1164436618),
INT32_C(-1950048210), INT32_C( 238137788), INT32_C( -880248267), INT32_C(-2005634277)) },
{ simde_mm512_set_epi32(INT32_C( 2117071873), INT32_C(-1437889529), INT32_C( -376074104), INT32_C( 1087893388),
INT32_C( -443183285), INT32_C( -380695552), INT32_C( 565328458), INT32_C( -93024748),
INT32_C( 1480532604), INT32_C( -97460760), INT32_C( -582247600), INT32_C( -374749470),
INT32_C( 1394313506), INT32_C( 394553965), INT32_C(-2016714120), INT32_C( 1697927724)),
UINT16_C(12254),
simde_mm512_set_epi32(INT32_C( 56443211), INT32_C(-2036514643), INT32_C( -510270824), INT32_C( 1139427205),
INT32_C( 1090384090), INT32_C(-1905231405), INT32_C(-2079359983), INT32_C( -477294891),
INT32_C( -673197028), INT32_C( 2071747620), INT32_C( -442789099), INT32_C( -601334711),
INT32_C( 319530416), INT32_C(-2115012481), INT32_C( -501730903), INT32_C( 340519338)),
simde_mm512_set_epi32(INT32_C( 1219537084), INT32_C( 1349635715), INT32_C( 732887738), INT32_C(-1728641921),
INT32_C(-1388433411), INT32_C( 1765754685), INT32_C(-1574983663), INT32_C( 846129112),
INT32_C( 1578410935), INT32_C(-1659872458), INT32_C( 1045536663), INT32_C( 957117985),
INT32_C(-1265958651), INT32_C( 1309498779), INT32_C(-1001015299), INT32_C( 1022360677)),
simde_mm512_set_epi32(INT32_C( 2117071873), INT32_C(-1437889529), INT32_C( -510270824), INT32_C( 1087893388),
INT32_C( 1090384090), INT32_C( -139476720), INT32_C( -504376320), INT32_C( -477294891),
INT32_C( -673197028), INT32_C( 411875162), INT32_C( -582247600), INT32_C( -601334711),
INT32_C( 319530416), INT32_C( -805513702), INT32_C( -501730903), INT32_C( 1697927724)) },
{ simde_mm512_set_epi32(INT32_C( -304885978), INT32_C( 991545752), INT32_C( -143034937), INT32_C( 843112042),
INT32_C( -227554783), INT32_C( 2124182542), INT32_C(-1526246088), INT32_C(-1991977382),
INT32_C( 1224533822), INT32_C( -819361196), INT32_C( -684010252), INT32_C(-1738921185),
INT32_C(-1259570772), INT32_C( -691865929), INT32_C( -973523371), INT32_C( 45581573)),
UINT16_C(42669),
simde_mm512_set_epi32(INT32_C( -156799603), INT32_C(-1073012339), INT32_C(-2130532125), INT32_C( 397240391),
INT32_C( 200936922), INT32_C(-1030980309), INT32_C(-1758363174), INT32_C( -665586367),
INT32_C( 453331046), INT32_C( 1704580573), INT32_C( 1606190487), INT32_C(-1085658047),
INT32_C(-1335469644), INT32_C( -368070561), INT32_C(-1419559633), INT32_C( 2069966669)),
simde_mm512_set_epi32(INT32_C( 1379668640), INT32_C( 66581512), INT32_C( -557301797), INT32_C( 304428974),
INT32_C(-1608262788), INT32_C( 532978979), INT32_C( 946958552), INT32_C(-1911324669),
INT32_C(-2118093156), INT32_C( 283691898), INT32_C( -446072631), INT32_C( -458781294),
INT32_C( 1951055651), INT32_C( 765387914), INT32_C( 822559116), INT32_C( 7445617)),
simde_mm512_set_epi32(INT32_C( -156799603), INT32_C( 991545752), INT32_C( -458626734), INT32_C( 843112042),
INT32_C( -227554783), INT32_C( -498001330), INT32_C( -811404622), INT32_C(-1991977382),
INT32_C( 453331046), INT32_C( -819361196), INT32_C( 267972594), INT32_C(-1738921185),
INT32_C(-1335469644), INT32_C( -368070561), INT32_C( -973523371), INT32_C( 85143)) },
{ simde_mm512_set_epi32(INT32_C(-1981938926), INT32_C( 869237081), INT32_C( -190053534), INT32_C(-1469275330),
INT32_C( -717100794), INT32_C(-1303072888), INT32_C(-2122918671), INT32_C( 1617119933),
INT32_C( 1521363431), INT32_C( 553638116), INT32_C( 1036201367), INT32_C(-1187933851),
INT32_C( -412155886), INT32_C( -760582943), INT32_C( -423751457), INT32_C( 1273589632)),
UINT16_C(35103),
simde_mm512_set_epi32(INT32_C(-1836595644), INT32_C( 260676470), INT32_C( 1724614860), INT32_C( -144514633),
INT32_C( -478630580), INT32_C(-2086755061), INT32_C( 932145867), INT32_C(-1862372735),
INT32_C( 1756892633), INT32_C( 382632965), INT32_C( 1295078740), INT32_C( -995802034),
INT32_C( 152308919), INT32_C( -351555508), INT32_C( 31813624), INT32_C( 807463845)),
simde_mm512_set_epi32(INT32_C( 615301803), INT32_C( 382786341), INT32_C( 1852603705), INT32_C( 1998007730),
INT32_C( 231325888), INT32_C( 1842039329), INT32_C( 968682756), INT32_C( 316335394),
INT32_C(-2071382094), INT32_C( -803185337), INT32_C(-2126995500), INT32_C( 1587647099),
INT32_C(-1328358584), INT32_C( 320339033), INT32_C( 282380179), INT32_C( -108102092)),
simde_mm512_set_epi32(INT32_C( -605992038), INT32_C( 869237081), INT32_C( -190053534), INT32_C(-1469275330),
INT32_C( -15978804), INT32_C(-1303072888), INT32_C(-2122918671), INT32_C( -280695765),
INT32_C( 1521363431), INT32_C( 553638116), INT32_C( 1036201367), INT32_C( -995802034),
INT32_C( 152308919), INT32_C( -31216475), INT32_C( 31813624), INT32_C( 50749201)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_mask_rem_epi32(test_vec[i].src, test_vec[i].k, test_vec[i].a, test_vec[i].b);
simde_assert_m512i_i32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_rem_epi64(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_mm512_set_epi64(INT64_C( 2968342496979776051), INT64_C(-8441478558707775203),
INT64_C(-1609208390309195578), INT64_C( 6738163160628300797),
INT64_C(-4636488523262038415), INT64_C( 6479913377553186648),
INT64_C( 7505871096235581515), INT64_C( 2265477367564496986)),
simde_mm512_set_epi64(INT64_C(-7579804969095623833), INT64_C(-7070918910501808185),
INT64_C(-6813223735121976043), INT64_C( 216242550290965460),
INT64_C( 5120732502404950997), INT64_C( 5681284513410730040),
INT64_C( 6409558907924801050), INT64_C( 5372227444888762251)),
simde_mm512_set_epi64(INT64_C( 2968342496979776051), INT64_C(-1370559648205967018),
INT64_C(-1609208390309195578), INT64_C( 34644101608371537),
INT64_C(-4636488523262038415), INT64_C( 798628864142456608),
INT64_C( 1096312188310780465), INT64_C( 2265477367564496986)) },
{ simde_mm512_set_epi64(INT64_C( 5645659480511055559), INT64_C(-3174015343225263359),
INT64_C(-4313283826698320649), INT64_C(-2023206435041636446),
INT64_C( 2118113466433927893), INT64_C( 3840651400764901876),
INT64_C( 2114726288902596757), INT64_C(-8964374488360902150)),
simde_mm512_set_epi64(INT64_C(-4891509177172967717), INT64_C( 7851952110853286921),
INT64_C( -829836782511317044), INT64_C(-1925559678644969716),
INT64_C( 7345032902979795528), INT64_C(-6117610524196633789),
INT64_C( 2328100732832272381), INT64_C( 4111895855610225675)),
simde_mm512_set_epi64(INT64_C( 754150303338087842), INT64_C(-3174015343225263359),
INT64_C( -164099914141735429), INT64_C( -97646756396666730),
INT64_C( 2118113466433927893), INT64_C( 3840651400764901876),
INT64_C( 2114726288902596757), INT64_C( -740582777140450800)) },
{ simde_mm512_set_epi64(INT64_C( 7572458917823766705), INT64_C(-6229244031487498710),
INT64_C( 1159256113650983207), INT64_C( 6193154838246823767),
INT64_C( 7449607714297299576), INT64_C(-4045720414588175269),
INT64_C( 8569312554655704071), INT64_C(-8110543410226793665)),
simde_mm512_set_epi64(INT64_C(-2858151442766986873), INT64_C( 9028813919053392068),
INT64_C( 5083059030774095197), INT64_C(-4558318353343223416),
INT64_C(-1558544484243762373), INT64_C( -208825673416776047),
INT64_C(-6167275479359641892), INT64_C(-4953402399143034204)),
simde_mm512_set_epi64(INT64_C( 1856156032289792959), INT64_C(-6229244031487498710),
INT64_C( 1159256113650983207), INT64_C( 1634836484903600351),
INT64_C( 1215429777322250084), INT64_C( -78032619669430376),
INT64_C( 2402037075296062179), INT64_C(-3157141011083759461)) },
{ simde_mm512_set_epi64(INT64_C( 2129749246616352421), INT64_C( 3930946101587052880),
INT64_C( 6109596926925725236), INT64_C(-6515037028970767861),
INT64_C( 3908684742628183808), INT64_C(-2671311551824242866),
INT64_C( 2246668589251707300), INT64_C(-8972022555815576273)),
simde_mm512_set_epi64(INT64_C( 5181754748372749246), INT64_C( 2283432752406648940),
INT64_C( -621131936186871923), INT64_C( 6205295972918594513),
INT64_C( 7540605987113962845), INT64_C(-4511621132930745547),
INT64_C(-9091142434838104266), INT64_C( -772363439907339893)),
simde_mm512_set_epi64(INT64_C( 2129749246616352421), INT64_C( 1647513349180403940),
INT64_C( 519409501243877929), INT64_C( -309741056052173348),
INT64_C( 3908684742628183808), INT64_C(-2671311551824242866),
INT64_C( 2246668589251707300), INT64_C( -476024716834837450)) },
{ simde_mm512_set_epi64(INT64_C(-7032049571316476814), INT64_C( 5732351344186366329),
INT64_C( 3673896834139808492), INT64_C( 3472617261273378891),
INT64_C( 530630724433960967), INT64_C(-4837549467732879965),
INT64_C( -584332998080882792), INT64_C( 6007180105039451483)),
simde_mm512_set_epi64(INT64_C( 2597258637662508799), INT64_C(-7747866342253511201),
INT64_C( 1281935105229028959), INT64_C(-5288543212061759658),
INT64_C( -626196761534931482), INT64_C(-8205449847372313194),
INT64_C(-1597107745019766193), INT64_C( 8515452077469772855)),
simde_mm512_set_epi64(INT64_C(-1837532295991459216), INT64_C( 5732351344186366329),
INT64_C( 1110026623681750574), INT64_C( 3472617261273378891),
INT64_C( 530630724433960967), INT64_C(-4837549467732879965),
INT64_C( -584332998080882792), INT64_C( 6007180105039451483)) },
{ simde_mm512_set_epi64(INT64_C( 6286795626078602527), INT64_C(-1997006480917628179),
INT64_C( 3423539900625568727), INT64_C(-4091976017447117992),
INT64_C(-5170308688123548072), INT64_C(-5220127105375971582),
INT64_C(-4643325554324364743), INT64_C( -782237419483838636)),
simde_mm512_set_epi64(INT64_C( 8577263429665049091), INT64_C( 1989107677696558897),
INT64_C(-8370004145136048154), INT64_C(-7317805337695090474),
INT64_C(-1525538738567005525), INT64_C( 8618363237326703628),
INT64_C( 6584836091306452136), INT64_C( 7260043819054420427)),
simde_mm512_set_epi64(INT64_C( 6286795626078602527), INT64_C( -7898803221069282),
INT64_C( 3423539900625568727), INT64_C(-4091976017447117992),
INT64_C( -593692472422531497), INT64_C(-5220127105375971582),
INT64_C(-4643325554324364743), INT64_C( -782237419483838636)) },
{ simde_mm512_set_epi64(INT64_C( 3903334154292354714), INT64_C( 8869267046373815529),
INT64_C( 6916283752571091217), INT64_C( 8726009290759968207),
INT64_C(-8375393287335202372), INT64_C( 8496158362035250512),
INT64_C(-1078645395476875982), INT64_C( 1777515526450307184)),
simde_mm512_set_epi64(INT64_C( 5278336582045705857), INT64_C(-6380014000574878583),
INT64_C( 7590368039103504017), INT64_C( 5001217194949514725),
INT64_C(-2967670691286451659), INT64_C(-8614133625237732493),
INT64_C( 6754177049630551103), INT64_C(-8141631409824500147)),
simde_mm512_set_epi64(INT64_C( 3903334154292354714), INT64_C( 2489253045798936946),
INT64_C( 6916283752571091217), INT64_C( 3724792095810453482),
INT64_C(-2440051904762299054), INT64_C( 8496158362035250512),
INT64_C(-1078645395476875982), INT64_C( 1777515526450307184)) },
{ simde_mm512_set_epi64(INT64_C( 5348983348701791658), INT64_C(-8298104313070148782),
INT64_C(-8271936534134678749), INT64_C(-5167227415572635313),
INT64_C( 7338742772279280569), INT64_C(-9050448829097521986),
INT64_C(-1761237507559623624), INT64_C(-7894721610255438115)),
simde_mm512_set_epi64(INT64_C(-1635074945007338934), INT64_C( -399538248898108804),
INT64_C( -418590773130585264), INT64_C(-1609536716449019614),
INT64_C( 1694596378460381816), INT64_C( 7292544047935022069),
INT64_C( 616022812148352233), INT64_C( 2502282222097948969)),
simde_mm512_set_epi64(INT64_C( 443758513679774856), INT64_C( -307339335107972702),
INT64_C( -318711844653558733), INT64_C( -338617266225576471),
INT64_C( 560357258437753305), INT64_C(-1757904781162499917),
INT64_C( -529191883262919158), INT64_C( -387874943961591208)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_rem_epi64(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_i64(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_rem_epu8(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_x_mm512_set_epu8(UINT8_C( 41), UINT8_C( 49), UINT8_C(171), UINT8_C(198),
UINT8_C( 40), UINT8_C( 44), UINT8_C(242), UINT8_C( 51),
UINT8_C(138), UINT8_C(217), UINT8_C(215), UINT8_C(249),
UINT8_C(201), UINT8_C( 37), UINT8_C(137), UINT8_C( 29),
UINT8_C(233), UINT8_C(170), UINT8_C(241), UINT8_C(126),
UINT8_C(182), UINT8_C( 10), UINT8_C(208), UINT8_C(198),
UINT8_C( 93), UINT8_C(130), UINT8_C(195), UINT8_C(177),
UINT8_C(187), UINT8_C(223), UINT8_C(139), UINT8_C(253),
UINT8_C(191), UINT8_C(167), UINT8_C(226), UINT8_C( 64),
UINT8_C(213), UINT8_C(202), UINT8_C(110), UINT8_C(113),
UINT8_C( 89), UINT8_C(237), UINT8_C( 70), UINT8_C(226),
UINT8_C(132), UINT8_C( 91), UINT8_C(255), UINT8_C( 88),
UINT8_C(104), UINT8_C( 42), UINT8_C( 53), UINT8_C(254),
UINT8_C(132), UINT8_C(254), UINT8_C( 96), UINT8_C( 75),
UINT8_C( 31), UINT8_C(112), UINT8_C(151), UINT8_C(169),
UINT8_C(172), UINT8_C( 94), UINT8_C(112), UINT8_C( 90)),
simde_x_mm512_set_epu8(UINT8_C(195), UINT8_C( 49), UINT8_C( 14), UINT8_C(170),
UINT8_C(203), UINT8_C(167), UINT8_C( 3), UINT8_C(215),
UINT8_C( 63), UINT8_C(248), UINT8_C( 55), UINT8_C(219),
UINT8_C(221), UINT8_C(135), UINT8_C( 61), UINT8_C(191),
UINT8_C(209), UINT8_C( 91), UINT8_C( 87), UINT8_C(137),
UINT8_C( 87), UINT8_C( 76), UINT8_C( 44), UINT8_C(140),
UINT8_C( 2), UINT8_C(200), UINT8_C( 36), UINT8_C(195),
UINT8_C(200), UINT8_C(125), UINT8_C(254), UINT8_C(139),
UINT8_C(226), UINT8_C( 71), UINT8_C( 92), UINT8_C(129),
UINT8_C(182), UINT8_C(119), UINT8_C(247), UINT8_C( 34),
UINT8_C(121), UINT8_C( 85), UINT8_C(153), UINT8_C(116),
UINT8_C(218), UINT8_C( 21), UINT8_C(101), UINT8_C(122),
UINT8_C( 10), UINT8_C(231), UINT8_C( 54), UINT8_C( 71),
UINT8_C(156), UINT8_C(149), UINT8_C(244), UINT8_C( 84),
UINT8_C(148), UINT8_C( 85), UINT8_C(170), UINT8_C(184),
UINT8_C( 94), UINT8_C(154), UINT8_C(229), UINT8_C( 11)),
simde_x_mm512_set_epu8(UINT8_C( 41), UINT8_C( 0), UINT8_C( 3), UINT8_C( 28),
UINT8_C( 40), UINT8_C( 44), UINT8_C( 2), UINT8_C( 51),
UINT8_C( 12), UINT8_C(217), UINT8_C( 50), UINT8_C( 30),
UINT8_C(201), UINT8_C( 37), UINT8_C( 15), UINT8_C( 29),
UINT8_C( 24), UINT8_C( 79), UINT8_C( 67), UINT8_C(126),
UINT8_C( 8), UINT8_C( 10), UINT8_C( 32), UINT8_C( 58),
UINT8_C( 1), UINT8_C(130), UINT8_C( 15), UINT8_C(177),
UINT8_C(187), UINT8_C( 98), UINT8_C(139), UINT8_C(114),
UINT8_C(191), UINT8_C( 25), UINT8_C( 42), UINT8_C( 64),
UINT8_C( 31), UINT8_C( 83), UINT8_C(110), UINT8_C( 11),
UINT8_C( 89), UINT8_C( 67), UINT8_C( 70), UINT8_C(110),
UINT8_C(132), UINT8_C( 7), UINT8_C( 53), UINT8_C( 88),
UINT8_C( 4), UINT8_C( 42), UINT8_C( 53), UINT8_C( 41),
UINT8_C(132), UINT8_C(105), UINT8_C( 96), UINT8_C( 75),
UINT8_C( 31), UINT8_C( 27), UINT8_C(151), UINT8_C(169),
UINT8_C( 78), UINT8_C( 94), UINT8_C(112), UINT8_C( 2)) },
{ simde_x_mm512_set_epu8(UINT8_C(216), UINT8_C( 85), UINT8_C(206), UINT8_C(103),
UINT8_C(235), UINT8_C(154), UINT8_C(129), UINT8_C(135),
UINT8_C(125), UINT8_C( 76), UINT8_C(202), UINT8_C(108),
UINT8_C( 52), UINT8_C( 71), UINT8_C(168), UINT8_C(196),
UINT8_C( 70), UINT8_C(138), UINT8_C(167), UINT8_C( 65),
UINT8_C(221), UINT8_C(161), UINT8_C(157), UINT8_C( 93),
UINT8_C(192), UINT8_C(189), UINT8_C(153), UINT8_C(155),
UINT8_C(207), UINT8_C(213), UINT8_C(105), UINT8_C(136),
UINT8_C(234), UINT8_C( 94), UINT8_C(240), UINT8_C( 12),
UINT8_C(146), UINT8_C( 1), UINT8_C(147), UINT8_C( 59),
UINT8_C(253), UINT8_C( 26), UINT8_C( 26), UINT8_C( 40),
UINT8_C( 12), UINT8_C( 2), UINT8_C(230), UINT8_C(145),
UINT8_C(170), UINT8_C(105), UINT8_C(111), UINT8_C(160),
UINT8_C(140), UINT8_C(202), UINT8_C(166), UINT8_C(220),
UINT8_C(187), UINT8_C( 65), UINT8_C(250), UINT8_C(195),
UINT8_C( 33), UINT8_C(131), UINT8_C( 2), UINT8_C(164)),
simde_x_mm512_set_epu8(UINT8_C(120), UINT8_C(127), UINT8_C( 28), UINT8_C( 95),
UINT8_C(175), UINT8_C(223), UINT8_C(119), UINT8_C(214),
UINT8_C(220), UINT8_C(102), UINT8_C( 86), UINT8_C( 22),
UINT8_C(119), UINT8_C(207), UINT8_C( 12), UINT8_C(183),
UINT8_C(172), UINT8_C(242), UINT8_C(173), UINT8_C(249),
UINT8_C( 52), UINT8_C(108), UINT8_C(128), UINT8_C(203),
UINT8_C( 85), UINT8_C(135), UINT8_C(227), UINT8_C( 35),
UINT8_C(187), UINT8_C( 24), UINT8_C(250), UINT8_C(219),
UINT8_C(253), UINT8_C( 62), UINT8_C(125), UINT8_C(236),
UINT8_C( 75), UINT8_C( 13), UINT8_C( 79), UINT8_C( 81),
UINT8_C(177), UINT8_C(221), UINT8_C(251), UINT8_C(181),
UINT8_C(159), UINT8_C(182), UINT8_C( 11), UINT8_C( 11),
UINT8_C( 39), UINT8_C( 37), UINT8_C( 39), UINT8_C(208),
UINT8_C(136), UINT8_C(180), UINT8_C(215), UINT8_C(139),
UINT8_C(144), UINT8_C(128), UINT8_C(203), UINT8_C(206),
UINT8_C(173), UINT8_C( 36), UINT8_C(133), UINT8_C(175)),
simde_x_mm512_set_epu8(UINT8_C( 96), UINT8_C( 85), UINT8_C( 10), UINT8_C( 8),
UINT8_C( 60), UINT8_C(154), UINT8_C( 10), UINT8_C(135),
UINT8_C(125), UINT8_C( 76), UINT8_C( 30), UINT8_C( 20),
UINT8_C( 52), UINT8_C( 71), UINT8_C( 0), UINT8_C( 13),
UINT8_C( 70), UINT8_C(138), UINT8_C(167), UINT8_C( 65),
UINT8_C( 13), UINT8_C( 53), UINT8_C( 29), UINT8_C( 93),
UINT8_C( 22), UINT8_C( 54), UINT8_C(153), UINT8_C( 15),
UINT8_C( 20), UINT8_C( 21), UINT8_C(105), UINT8_C(136),
UINT8_C(234), UINT8_C( 32), UINT8_C(115), UINT8_C( 12),
UINT8_C( 71), UINT8_C( 1), UINT8_C( 68), UINT8_C( 59),
UINT8_C( 76), UINT8_C( 26), UINT8_C( 26), UINT8_C( 40),
UINT8_C( 12), UINT8_C( 2), UINT8_C( 10), UINT8_C( 2),
UINT8_C( 14), UINT8_C( 31), UINT8_C( 33), UINT8_C(160),
UINT8_C( 4), UINT8_C( 22), UINT8_C(166), UINT8_C( 81),
UINT8_C( 43), UINT8_C( 65), UINT8_C( 47), UINT8_C(195),
UINT8_C( 33), UINT8_C( 23), UINT8_C( 2), UINT8_C(164)) },
{ simde_x_mm512_set_epu8(UINT8_C( 87), UINT8_C( 63), UINT8_C( 47), UINT8_C( 80),
UINT8_C( 35), UINT8_C(229), UINT8_C( 5), UINT8_C( 31),
UINT8_C(228), UINT8_C( 73), UINT8_C( 53), UINT8_C( 47),
UINT8_C(170), UINT8_C(192), UINT8_C(122), UINT8_C(237),
UINT8_C( 47), UINT8_C(130), UINT8_C(219), UINT8_C(102),
UINT8_C(163), UINT8_C( 41), UINT8_C(195), UINT8_C(215),
UINT8_C(199), UINT8_C( 54), UINT8_C( 97), UINT8_C(126),
UINT8_C( 10), UINT8_C(165), UINT8_C(155), UINT8_C( 88),
UINT8_C(184), UINT8_C( 63), UINT8_C( 95), UINT8_C(164),
UINT8_C( 65), UINT8_C( 71), UINT8_C(174), UINT8_C( 88),
UINT8_C(183), UINT8_C(142), UINT8_C( 98), UINT8_C( 14),
UINT8_C( 25), UINT8_C(173), UINT8_C( 87), UINT8_C( 2),
UINT8_C(191), UINT8_C(143), UINT8_C(152), UINT8_C( 2),
UINT8_C(126), UINT8_C( 0), UINT8_C(162), UINT8_C( 57),
UINT8_C(245), UINT8_C( 36), UINT8_C(239), UINT8_C( 54),
UINT8_C( 33), UINT8_C(165), UINT8_C(199), UINT8_C( 84)),
simde_x_mm512_set_epu8(UINT8_C(131), UINT8_C( 42), UINT8_C(151), UINT8_C(210),
UINT8_C( 12), UINT8_C(163), UINT8_C(138), UINT8_C(207),
UINT8_C( 43), UINT8_C( 57), UINT8_C( 61), UINT8_C( 62),
UINT8_C( 81), UINT8_C(184), UINT8_C( 6), UINT8_C( 93),
UINT8_C(167), UINT8_C( 1), UINT8_C(145), UINT8_C( 9),
UINT8_C( 4), UINT8_C( 17), UINT8_C( 10), UINT8_C(101),
UINT8_C(186), UINT8_C(181), UINT8_C(155), UINT8_C(243),
UINT8_C(189), UINT8_C(191), UINT8_C(222), UINT8_C(205),
UINT8_C( 59), UINT8_C( 26), UINT8_C(227), UINT8_C(105),
UINT8_C(237), UINT8_C(145), UINT8_C(183), UINT8_C( 79),
UINT8_C(174), UINT8_C( 60), UINT8_C(132), UINT8_C(208),
UINT8_C( 58), UINT8_C(178), UINT8_C(116), UINT8_C(240),
UINT8_C( 37), UINT8_C(131), UINT8_C(100), UINT8_C(177),
UINT8_C( 19), UINT8_C(102), UINT8_C( 81), UINT8_C( 86),
UINT8_C( 25), UINT8_C( 43), UINT8_C( 51), UINT8_C(140),
UINT8_C( 9), UINT8_C( 40), UINT8_C(227), UINT8_C( 75)),
simde_x_mm512_set_epu8(UINT8_C( 87), UINT8_C( 21), UINT8_C( 47), UINT8_C( 80),
UINT8_C( 11), UINT8_C( 66), UINT8_C( 5), UINT8_C( 31),
UINT8_C( 13), UINT8_C( 16), UINT8_C( 53), UINT8_C( 47),
UINT8_C( 8), UINT8_C( 8), UINT8_C( 2), UINT8_C( 51),
UINT8_C( 47), UINT8_C( 0), UINT8_C( 74), UINT8_C( 3),
UINT8_C( 3), UINT8_C( 7), UINT8_C( 5), UINT8_C( 13),
UINT8_C( 13), UINT8_C( 54), UINT8_C( 97), UINT8_C(126),
UINT8_C( 10), UINT8_C(165), UINT8_C(155), UINT8_C( 88),
UINT8_C( 7), UINT8_C( 11), UINT8_C( 95), UINT8_C( 59),
UINT8_C( 65), UINT8_C( 71), UINT8_C(174), UINT8_C( 9),
UINT8_C( 9), UINT8_C( 22), UINT8_C( 98), UINT8_C( 14),
UINT8_C( 25), UINT8_C(173), UINT8_C( 87), UINT8_C( 2),
UINT8_C( 6), UINT8_C( 12), UINT8_C( 52), UINT8_C( 2),
UINT8_C( 12), UINT8_C( 0), UINT8_C( 0), UINT8_C( 57),
UINT8_C( 20), UINT8_C( 36), UINT8_C( 35), UINT8_C( 54),
UINT8_C( 6), UINT8_C( 5), UINT8_C(199), UINT8_C( 9)) },
{ simde_x_mm512_set_epu8(UINT8_C(233), UINT8_C( 79), UINT8_C( 12), UINT8_C( 0),
UINT8_C( 33), UINT8_C(178), UINT8_C( 58), UINT8_C( 74),
UINT8_C(250), UINT8_C(116), UINT8_C(142), UINT8_C( 20),
UINT8_C( 88), UINT8_C( 63), UINT8_C( 34), UINT8_C(124),
UINT8_C(250), UINT8_C( 48), UINT8_C(221), UINT8_C(232),
UINT8_C(221), UINT8_C( 75), UINT8_C(155), UINT8_C( 80),
UINT8_C(233), UINT8_C(169), UINT8_C(198), UINT8_C(226),
UINT8_C( 83), UINT8_C( 27), UINT8_C(137), UINT8_C( 34),
UINT8_C( 23), UINT8_C(132), UINT8_C(106), UINT8_C(109),
UINT8_C(135), UINT8_C(203), UINT8_C( 98), UINT8_C(120),
UINT8_C(101), UINT8_C( 52), UINT8_C( 82), UINT8_C( 44),
UINT8_C(142), UINT8_C( 14), UINT8_C( 99), UINT8_C(245),
UINT8_C( 8), UINT8_C(140), UINT8_C(141), UINT8_C(123),
UINT8_C(219), UINT8_C(163), UINT8_C(196), UINT8_C(233),
UINT8_C( 34), UINT8_C(185), UINT8_C(228), UINT8_C(108),
UINT8_C( 95), UINT8_C(236), UINT8_C( 97), UINT8_C( 41)),
simde_x_mm512_set_epu8(UINT8_C(193), UINT8_C(230), UINT8_C( 93), UINT8_C( 23),
UINT8_C(193), UINT8_C( 52), UINT8_C(223), UINT8_C(175),
UINT8_C(205), UINT8_C( 45), UINT8_C(166), UINT8_C( 24),
UINT8_C( 71), UINT8_C(234), UINT8_C(161), UINT8_C(142),
UINT8_C(184), UINT8_C(218), UINT8_C(190), UINT8_C(212),
UINT8_C(116), UINT8_C(159), UINT8_C( 44), UINT8_C( 55),
UINT8_C(213), UINT8_C(133), UINT8_C( 60), UINT8_C( 3),
UINT8_C( 58), UINT8_C(255), UINT8_C(125), UINT8_C(189),
UINT8_C(145), UINT8_C( 88), UINT8_C( 55), UINT8_C(182),
UINT8_C( 23), UINT8_C(161), UINT8_C(133), UINT8_C( 27),
UINT8_C(125), UINT8_C(229), UINT8_C(203), UINT8_C( 45),
UINT8_C( 24), UINT8_C( 5), UINT8_C( 90), UINT8_C( 83),
UINT8_C(145), UINT8_C( 85), UINT8_C(156), UINT8_C(164),
UINT8_C(149), UINT8_C(201), UINT8_C( 48), UINT8_C(255),
UINT8_C( 41), UINT8_C( 42), UINT8_C( 94), UINT8_C(129),
UINT8_C(135), UINT8_C( 8), UINT8_C( 12), UINT8_C(203)),
simde_x_mm512_set_epu8(UINT8_C( 40), UINT8_C( 79), UINT8_C( 12), UINT8_C( 0),
UINT8_C( 33), UINT8_C( 22), UINT8_C( 58), UINT8_C( 74),
UINT8_C( 45), UINT8_C( 26), UINT8_C(142), UINT8_C( 20),
UINT8_C( 17), UINT8_C( 63), UINT8_C( 34), UINT8_C(124),
UINT8_C( 66), UINT8_C( 48), UINT8_C( 31), UINT8_C( 20),
UINT8_C(105), UINT8_C( 75), UINT8_C( 23), UINT8_C( 25),
UINT8_C( 20), UINT8_C( 36), UINT8_C( 18), UINT8_C( 1),
UINT8_C( 25), UINT8_C( 27), UINT8_C( 12), UINT8_C( 34),
UINT8_C( 23), UINT8_C( 44), UINT8_C( 51), UINT8_C(109),
UINT8_C( 20), UINT8_C( 42), UINT8_C( 98), UINT8_C( 12),
UINT8_C(101), UINT8_C( 52), UINT8_C( 82), UINT8_C( 44),
UINT8_C( 22), UINT8_C( 4), UINT8_C( 9), UINT8_C( 79),
UINT8_C( 8), UINT8_C( 55), UINT8_C(141), UINT8_C(123),
UINT8_C( 70), UINT8_C(163), UINT8_C( 4), UINT8_C(233),
UINT8_C( 34), UINT8_C( 17), UINT8_C( 40), UINT8_C(108),
UINT8_C( 95), UINT8_C( 4), UINT8_C( 1), UINT8_C( 41)) },
{ simde_x_mm512_set_epu8(UINT8_C(142), UINT8_C( 19), UINT8_C(128), UINT8_C( 3),
UINT8_C(129), UINT8_C(192), UINT8_C(118), UINT8_C(156),
UINT8_C( 16), UINT8_C(232), UINT8_C(203), UINT8_C(122),
UINT8_C(229), UINT8_C(105), UINT8_C(120), UINT8_C(201),
UINT8_C(228), UINT8_C(167), UINT8_C(141), UINT8_C(146),
UINT8_C(116), UINT8_C( 74), UINT8_C(191), UINT8_C( 35),
UINT8_C( 45), UINT8_C(158), UINT8_C(228), UINT8_C(138),
UINT8_C( 49), UINT8_C( 7), UINT8_C( 65), UINT8_C(140),
UINT8_C( 0), UINT8_C(113), UINT8_C(156), UINT8_C(113),
UINT8_C(246), UINT8_C(167), UINT8_C(109), UINT8_C(141),
UINT8_C(192), UINT8_C( 11), UINT8_C( 33), UINT8_C(141),
UINT8_C(129), UINT8_C( 2), UINT8_C(168), UINT8_C(227),
UINT8_C( 23), UINT8_C(173), UINT8_C(104), UINT8_C( 71),
UINT8_C( 11), UINT8_C(250), UINT8_C( 13), UINT8_C(218),
UINT8_C(194), UINT8_C(140), UINT8_C(125), UINT8_C( 43),
UINT8_C(151), UINT8_C( 49), UINT8_C(129), UINT8_C(218)),
simde_x_mm512_set_epu8(UINT8_C( 8), UINT8_C( 25), UINT8_C(147), UINT8_C(220),
UINT8_C(173), UINT8_C(138), UINT8_C( 38), UINT8_C(150),
UINT8_C( 35), UINT8_C( 43), UINT8_C(165), UINT8_C(185),
UINT8_C( 50), UINT8_C( 64), UINT8_C(161), UINT8_C(132),
UINT8_C(162), UINT8_C( 50), UINT8_C(199), UINT8_C( 84),
UINT8_C(251), UINT8_C(200), UINT8_C(217), UINT8_C( 19),
UINT8_C(180), UINT8_C(196), UINT8_C(246), UINT8_C( 76),
UINT8_C( 55), UINT8_C(204), UINT8_C(139), UINT8_C( 75),
UINT8_C( 1), UINT8_C( 89), UINT8_C(133), UINT8_C(212),
UINT8_C(206), UINT8_C( 55), UINT8_C(204), UINT8_C(120),
UINT8_C( 37), UINT8_C(159), UINT8_C(146), UINT8_C(217),
UINT8_C(226), UINT8_C(190), UINT8_C(134), UINT8_C( 8),
UINT8_C(113), UINT8_C( 61), UINT8_C(103), UINT8_C(100),
UINT8_C( 23), UINT8_C(229), UINT8_C(146), UINT8_C( 97),
UINT8_C( 95), UINT8_C( 32), UINT8_C(136), UINT8_C( 91),
UINT8_C( 46), UINT8_C(252), UINT8_C(163), UINT8_C( 88)),
simde_x_mm512_set_epu8(UINT8_C( 6), UINT8_C( 19), UINT8_C(128), UINT8_C( 3),
UINT8_C(129), UINT8_C( 54), UINT8_C( 4), UINT8_C( 6),
UINT8_C( 16), UINT8_C( 17), UINT8_C( 38), UINT8_C(122),
UINT8_C( 29), UINT8_C( 41), UINT8_C(120), UINT8_C( 69),
UINT8_C( 66), UINT8_C( 17), UINT8_C(141), UINT8_C( 62),
UINT8_C(116), UINT8_C( 74), UINT8_C(191), UINT8_C( 16),
UINT8_C( 45), UINT8_C(158), UINT8_C(228), UINT8_C( 62),
UINT8_C( 49), UINT8_C( 7), UINT8_C( 65), UINT8_C( 65),
UINT8_C( 0), UINT8_C( 24), UINT8_C( 23), UINT8_C(113),
UINT8_C( 40), UINT8_C( 2), UINT8_C(109), UINT8_C( 21),
UINT8_C( 7), UINT8_C( 11), UINT8_C( 33), UINT8_C(141),
UINT8_C(129), UINT8_C( 2), UINT8_C( 34), UINT8_C( 3),
UINT8_C( 23), UINT8_C( 51), UINT8_C( 1), UINT8_C( 71),
UINT8_C( 11), UINT8_C( 21), UINT8_C( 13), UINT8_C( 24),
UINT8_C( 4), UINT8_C( 12), UINT8_C(125), UINT8_C( 43),
UINT8_C( 13), UINT8_C( 49), UINT8_C(129), UINT8_C( 42)) },
{ simde_x_mm512_set_epu8(UINT8_C( 46), UINT8_C( 43), UINT8_C(246), UINT8_C(157),
UINT8_C( 80), UINT8_C(154), UINT8_C( 27), UINT8_C(118),
UINT8_C(176), UINT8_C(216), UINT8_C( 46), UINT8_C(142),
UINT8_C(198), UINT8_C(248), UINT8_C( 88), UINT8_C( 29),
UINT8_C(176), UINT8_C( 25), UINT8_C(101), UINT8_C( 54),
UINT8_C(103), UINT8_C(120), UINT8_C( 94), UINT8_C( 16),
UINT8_C(197), UINT8_C(205), UINT8_C( 71), UINT8_C(246),
UINT8_C(158), UINT8_C(176), UINT8_C(218), UINT8_C( 43),
UINT8_C(235), UINT8_C(249), UINT8_C(116), UINT8_C(137),
UINT8_C( 89), UINT8_C(212), UINT8_C(132), UINT8_C( 56),
UINT8_C(230), UINT8_C(137), UINT8_C( 66), UINT8_C( 41),
UINT8_C( 44), UINT8_C( 35), UINT8_C(189), UINT8_C(155),
UINT8_C(125), UINT8_C(130), UINT8_C(123), UINT8_C(117),
UINT8_C(123), UINT8_C(127), UINT8_C(151), UINT8_C( 60),
UINT8_C(153), UINT8_C(185), UINT8_C(250), UINT8_C(100),
UINT8_C( 83), UINT8_C(112), UINT8_C( 33), UINT8_C(140)),
simde_x_mm512_set_epu8(UINT8_C( 36), UINT8_C( 33), UINT8_C( 42), UINT8_C( 75),
UINT8_C(179), UINT8_C(172), UINT8_C(126), UINT8_C(171),
UINT8_C(110), UINT8_C(150), UINT8_C(107), UINT8_C(180),
UINT8_C(134), UINT8_C( 73), UINT8_C(207), UINT8_C( 15),
UINT8_C(241), UINT8_C(103), UINT8_C(103), UINT8_C(150),
UINT8_C(103), UINT8_C( 58), UINT8_C(104), UINT8_C( 35),
UINT8_C(249), UINT8_C( 79), UINT8_C(113), UINT8_C( 97),
UINT8_C(189), UINT8_C(197), UINT8_C(174), UINT8_C(222),
UINT8_C(224), UINT8_C(104), UINT8_C(123), UINT8_C(124),
UINT8_C( 49), UINT8_C(226), UINT8_C( 37), UINT8_C( 22),
UINT8_C(105), UINT8_C(157), UINT8_C(110), UINT8_C( 52),
UINT8_C(254), UINT8_C(103), UINT8_C(162), UINT8_C(210),
UINT8_C(202), UINT8_C( 39), UINT8_C(193), UINT8_C(151),
UINT8_C(183), UINT8_C( 73), UINT8_C( 97), UINT8_C(187),
UINT8_C(102), UINT8_C(195), UINT8_C( 68), UINT8_C(190),
UINT8_C( 65), UINT8_C( 60), UINT8_C(165), UINT8_C(126)),
simde_x_mm512_set_epu8(UINT8_C( 10), UINT8_C( 10), UINT8_C( 36), UINT8_C( 7),
UINT8_C( 80), UINT8_C(154), UINT8_C( 27), UINT8_C(118),
UINT8_C( 66), UINT8_C( 66), UINT8_C( 46), UINT8_C(142),
UINT8_C( 64), UINT8_C( 29), UINT8_C( 88), UINT8_C( 14),
UINT8_C(176), UINT8_C( 25), UINT8_C(101), UINT8_C( 54),
UINT8_C( 0), UINT8_C( 4), UINT8_C( 94), UINT8_C( 16),
UINT8_C(197), UINT8_C( 47), UINT8_C( 71), UINT8_C( 52),
UINT8_C(158), UINT8_C(176), UINT8_C( 44), UINT8_C( 43),
UINT8_C( 11), UINT8_C( 41), UINT8_C(116), UINT8_C( 13),
UINT8_C( 40), UINT8_C(212), UINT8_C( 21), UINT8_C( 12),
UINT8_C( 20), UINT8_C(137), UINT8_C( 66), UINT8_C( 41),
UINT8_C( 44), UINT8_C( 35), UINT8_C( 27), UINT8_C(155),
UINT8_C(125), UINT8_C( 13), UINT8_C(123), UINT8_C(117),
UINT8_C(123), UINT8_C( 54), UINT8_C( 54), UINT8_C( 60),
UINT8_C( 51), UINT8_C(185), UINT8_C( 46), UINT8_C(100),
UINT8_C( 18), UINT8_C( 52), UINT8_C( 33), UINT8_C( 14)) },
{ simde_x_mm512_set_epu8(UINT8_C(240), UINT8_C(169), UINT8_C( 8), UINT8_C( 54),
UINT8_C( 66), UINT8_C( 99), UINT8_C( 14), UINT8_C( 32),
UINT8_C(148), UINT8_C( 92), UINT8_C(122), UINT8_C(200),
UINT8_C(192), UINT8_C(186), UINT8_C(225), UINT8_C( 52),
UINT8_C(182), UINT8_C(244), UINT8_C(253), UINT8_C(228),
UINT8_C(141), UINT8_C(228), UINT8_C(148), UINT8_C(168),
UINT8_C(231), UINT8_C(107), UINT8_C( 47), UINT8_C(205),
UINT8_C(126), UINT8_C( 7), UINT8_C(182), UINT8_C(245),
UINT8_C(165), UINT8_C(186), UINT8_C(213), UINT8_C( 84),
UINT8_C( 19), UINT8_C(131), UINT8_C( 54), UINT8_C( 13),
UINT8_C(185), UINT8_C(182), UINT8_C( 72), UINT8_C( 61),
UINT8_C(125), UINT8_C(104), UINT8_C(147), UINT8_C( 11),
UINT8_C( 89), UINT8_C(204), UINT8_C( 62), UINT8_C(163),
UINT8_C(198), UINT8_C(162), UINT8_C(205), UINT8_C( 9),
UINT8_C(182), UINT8_C(123), UINT8_C( 65), UINT8_C(208),
UINT8_C(145), UINT8_C(179), UINT8_C( 34), UINT8_C(195)),
simde_x_mm512_set_epu8(UINT8_C(141), UINT8_C(103), UINT8_C(116), UINT8_C( 12),
UINT8_C(174), UINT8_C(226), UINT8_C(193), UINT8_C(175),
UINT8_C(155), UINT8_C(174), UINT8_C( 73), UINT8_C( 6),
UINT8_C(141), UINT8_C(140), UINT8_C(254), UINT8_C(193),
UINT8_C(100), UINT8_C(151), UINT8_C( 14), UINT8_C( 19),
UINT8_C( 38), UINT8_C(115), UINT8_C(201), UINT8_C(118),
UINT8_C( 74), UINT8_C(186), UINT8_C( 89), UINT8_C(183),
UINT8_C( 65), UINT8_C(138), UINT8_C( 64), UINT8_C( 90),
UINT8_C(152), UINT8_C(241), UINT8_C(229), UINT8_C(218),
UINT8_C(126), UINT8_C( 38), UINT8_C(159), UINT8_C( 27),
UINT8_C(164), UINT8_C(199), UINT8_C( 25), UINT8_C(253),
UINT8_C(181), UINT8_C(104), UINT8_C( 6), UINT8_C(183),
UINT8_C( 36), UINT8_C(203), UINT8_C(138), UINT8_C(145),
UINT8_C(116), UINT8_C(155), UINT8_C(218), UINT8_C( 24),
UINT8_C(205), UINT8_C(238), UINT8_C(242), UINT8_C( 26),
UINT8_C(226), UINT8_C( 76), UINT8_C(226), UINT8_C(214)),
simde_x_mm512_set_epu8(UINT8_C( 99), UINT8_C( 66), UINT8_C( 8), UINT8_C( 6),
UINT8_C( 66), UINT8_C( 99), UINT8_C( 14), UINT8_C( 32),
UINT8_C(148), UINT8_C( 92), UINT8_C( 49), UINT8_C( 2),
UINT8_C( 51), UINT8_C( 46), UINT8_C(225), UINT8_C( 52),
UINT8_C( 82), UINT8_C( 93), UINT8_C( 1), UINT8_C( 0),
UINT8_C( 27), UINT8_C(113), UINT8_C(148), UINT8_C( 50),
UINT8_C( 9), UINT8_C(107), UINT8_C( 47), UINT8_C( 22),
UINT8_C( 61), UINT8_C( 7), UINT8_C( 54), UINT8_C( 65),
UINT8_C( 13), UINT8_C(186), UINT8_C(213), UINT8_C( 84),
UINT8_C( 19), UINT8_C( 17), UINT8_C( 54), UINT8_C( 13),
UINT8_C( 21), UINT8_C(182), UINT8_C( 22), UINT8_C( 61),
UINT8_C(125), UINT8_C( 0), UINT8_C( 3), UINT8_C( 11),
UINT8_C( 17), UINT8_C( 1), UINT8_C( 62), UINT8_C( 18),
UINT8_C( 82), UINT8_C( 7), UINT8_C(205), UINT8_C( 9),
UINT8_C(182), UINT8_C(123), UINT8_C( 65), UINT8_C( 0),
UINT8_C(145), UINT8_C( 27), UINT8_C( 34), UINT8_C(195)) },
{ simde_x_mm512_set_epu8(UINT8_C(197), UINT8_C( 52), UINT8_C(145), UINT8_C( 20),
UINT8_C( 26), UINT8_C(178), UINT8_C(121), UINT8_C( 16),
UINT8_C( 45), UINT8_C(229), UINT8_C( 11), UINT8_C(230),
UINT8_C( 53), UINT8_C( 2), UINT8_C(234), UINT8_C( 7),
UINT8_C(207), UINT8_C(146), UINT8_C(169), UINT8_C(233),
UINT8_C(206), UINT8_C(116), UINT8_C( 55), UINT8_C(156),
UINT8_C(180), UINT8_C( 91), UINT8_C( 56), UINT8_C(146),
UINT8_C( 55), UINT8_C(137), UINT8_C(200), UINT8_C( 76),
UINT8_C( 43), UINT8_C(245), UINT8_C(138), UINT8_C( 3),
UINT8_C(213), UINT8_C(156), UINT8_C(166), UINT8_C(234),
UINT8_C(199), UINT8_C( 2), UINT8_C( 86), UINT8_C( 72),
UINT8_C( 93), UINT8_C(254), UINT8_C(190), UINT8_C(121),
UINT8_C(119), UINT8_C( 75), UINT8_C(159), UINT8_C( 76),
UINT8_C( 70), UINT8_C(218), UINT8_C( 17), UINT8_C(239),
UINT8_C( 43), UINT8_C(152), UINT8_C(222), UINT8_C( 80),
UINT8_C(197), UINT8_C(113), UINT8_C(112), UINT8_C( 81)),
simde_x_mm512_set_epu8(UINT8_C(193), UINT8_C(162), UINT8_C(178), UINT8_C( 36),
UINT8_C(178), UINT8_C( 86), UINT8_C( 79), UINT8_C(167),
UINT8_C(179), UINT8_C( 45), UINT8_C( 18), UINT8_C(231),
UINT8_C(113), UINT8_C(127), UINT8_C(211), UINT8_C(181),
UINT8_C(121), UINT8_C(171), UINT8_C( 76), UINT8_C(135),
UINT8_C( 15), UINT8_C(133), UINT8_C(247), UINT8_C( 32),
UINT8_C(181), UINT8_C(168), UINT8_C(236), UINT8_C( 99),
UINT8_C( 85), UINT8_C(151), UINT8_C( 36), UINT8_C( 99),
UINT8_C(101), UINT8_C( 42), UINT8_C( 63), UINT8_C( 96),
UINT8_C(210), UINT8_C(198), UINT8_C(202), UINT8_C(105),
UINT8_C(214), UINT8_C( 74), UINT8_C(199), UINT8_C( 17),
UINT8_C(234), UINT8_C( 22), UINT8_C(134), UINT8_C(112),
UINT8_C( 62), UINT8_C(141), UINT8_C(156), UINT8_C( 91),
UINT8_C( 99), UINT8_C( 24), UINT8_C(198), UINT8_C(131),
UINT8_C( 88), UINT8_C(136), UINT8_C( 61), UINT8_C( 94),
UINT8_C(189), UINT8_C(213), UINT8_C(249), UINT8_C(131)),
simde_x_mm512_set_epu8(UINT8_C( 4), UINT8_C( 52), UINT8_C(145), UINT8_C( 20),
UINT8_C( 26), UINT8_C( 6), UINT8_C( 42), UINT8_C( 16),
UINT8_C( 45), UINT8_C( 4), UINT8_C( 11), UINT8_C(230),
UINT8_C( 53), UINT8_C( 2), UINT8_C( 23), UINT8_C( 7),
UINT8_C( 86), UINT8_C(146), UINT8_C( 17), UINT8_C( 98),
UINT8_C( 11), UINT8_C(116), UINT8_C( 55), UINT8_C( 28),
UINT8_C(180), UINT8_C( 91), UINT8_C( 56), UINT8_C( 47),
UINT8_C( 55), UINT8_C(137), UINT8_C( 20), UINT8_C( 76),
UINT8_C( 43), UINT8_C( 35), UINT8_C( 12), UINT8_C( 3),
UINT8_C( 3), UINT8_C(156), UINT8_C(166), UINT8_C( 24),
UINT8_C(199), UINT8_C( 2), UINT8_C( 86), UINT8_C( 4),
UINT8_C( 93), UINT8_C( 12), UINT8_C( 56), UINT8_C( 9),
UINT8_C( 57), UINT8_C( 75), UINT8_C( 3), UINT8_C( 76),
UINT8_C( 70), UINT8_C( 2), UINT8_C( 17), UINT8_C(108),
UINT8_C( 43), UINT8_C( 16), UINT8_C( 39), UINT8_C( 80),
UINT8_C( 8), UINT8_C(113), UINT8_C(112), UINT8_C( 81)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_rem_epu8(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_u8(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_rem_epu16(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_x_mm512_set_epu16(UINT16_C( 10545), UINT16_C( 43974), UINT16_C( 10284), UINT16_C( 62003),
UINT16_C( 35545), UINT16_C( 55289), UINT16_C( 51493), UINT16_C( 35101),
UINT16_C( 59818), UINT16_C( 61822), UINT16_C( 46602), UINT16_C( 53446),
UINT16_C( 23938), UINT16_C( 50097), UINT16_C( 48095), UINT16_C( 35837),
UINT16_C( 49063), UINT16_C( 57920), UINT16_C( 54730), UINT16_C( 28273),
UINT16_C( 23021), UINT16_C( 18146), UINT16_C( 33883), UINT16_C( 65368),
UINT16_C( 26666), UINT16_C( 13822), UINT16_C( 34046), UINT16_C( 24651),
UINT16_C( 8048), UINT16_C( 38825), UINT16_C( 44126), UINT16_C( 28762)),
simde_x_mm512_set_epu16(UINT16_C( 38607), UINT16_C( 8074), UINT16_C( 18000), UINT16_C( 35687),
UINT16_C( 40415), UINT16_C( 3254), UINT16_C( 55282), UINT16_C( 38855),
UINT16_C( 41330), UINT16_C( 37148), UINT16_C( 25803), UINT16_C( 25877),
UINT16_C( 768), UINT16_C( 16244), UINT16_C( 11114), UINT16_C( 58324),
UINT16_C( 18192), UINT16_C( 32532), UINT16_C( 33700), UINT16_C( 60373),
UINT16_C( 20183), UINT16_C( 64042), UINT16_C( 2502), UINT16_C( 18488),
UINT16_C( 22771), UINT16_C( 21470), UINT16_C( 4556), UINT16_C( 26138),
UINT16_C( 19085), UINT16_C( 64613), UINT16_C( 55602), UINT16_C( 63371)),
simde_x_mm512_set_epu16(UINT16_C( 10545), UINT16_C( 3604), UINT16_C( 10284), UINT16_C( 26316),
UINT16_C( 35545), UINT16_C( 3225), UINT16_C( 51493), UINT16_C( 35101),
UINT16_C( 18488), UINT16_C( 24674), UINT16_C( 20799), UINT16_C( 1692),
UINT16_C( 130), UINT16_C( 1365), UINT16_C( 3639), UINT16_C( 35837),
UINT16_C( 12679), UINT16_C( 25388), UINT16_C( 21030), UINT16_C( 28273),
UINT16_C( 2838), UINT16_C( 18146), UINT16_C( 1357), UINT16_C( 9904),
UINT16_C( 3895), UINT16_C( 13822), UINT16_C( 2154), UINT16_C( 24651),
UINT16_C( 8048), UINT16_C( 38825), UINT16_C( 44126), UINT16_C( 28762)) },
{ simde_x_mm512_set_epu16(UINT16_C( 20057), UINT16_C( 26978), UINT16_C( 45741), UINT16_C( 34503),
UINT16_C( 54259), UINT16_C( 41436), UINT16_C( 43883), UINT16_C( 11009),
UINT16_C( 50212), UINT16_C( 9014), UINT16_C( 24117), UINT16_C( 34039),
UINT16_C( 58348), UINT16_C( 8311), UINT16_C( 31759), UINT16_C( 4002),
UINT16_C( 7525), UINT16_C( 3321), UINT16_C( 47299), UINT16_C( 64213),
UINT16_C( 13644), UINT16_C( 48153), UINT16_C( 45234), UINT16_C( 51700),
UINT16_C( 7513), UINT16_C( 1114), UINT16_C( 65336), UINT16_C( 10389),
UINT16_C( 33688), UINT16_C( 9445), UINT16_C( 60332), UINT16_C( 41466)),
simde_x_mm512_set_epu16(UINT16_C( 48157), UINT16_C( 56913), UINT16_C( 55050), UINT16_C( 48859),
UINT16_C( 27895), UINT16_C( 48343), UINT16_C( 59593), UINT16_C( 60425),
UINT16_C( 62587), UINT16_C( 54231), UINT16_C( 52444), UINT16_C( 8140),
UINT16_C( 58695), UINT16_C( 2476), UINT16_C( 41101), UINT16_C( 7948),
UINT16_C( 26094), UINT16_C( 52354), UINT16_C( 30122), UINT16_C( 47688),
UINT16_C( 43801), UINT16_C( 57764), UINT16_C( 1809), UINT16_C( 33603),
UINT16_C( 8271), UINT16_C( 4936), UINT16_C( 7627), UINT16_C( 20477),
UINT16_C( 14608), UINT16_C( 25470), UINT16_C( 45836), UINT16_C( 25611)),
simde_x_mm512_set_epu16(UINT16_C( 20057), UINT16_C( 26978), UINT16_C( 45741), UINT16_C( 34503),
UINT16_C( 26364), UINT16_C( 41436), UINT16_C( 43883), UINT16_C( 11009),
UINT16_C( 50212), UINT16_C( 9014), UINT16_C( 24117), UINT16_C( 1479),
UINT16_C( 58348), UINT16_C( 883), UINT16_C( 31759), UINT16_C( 4002),
UINT16_C( 7525), UINT16_C( 3321), UINT16_C( 17177), UINT16_C( 16525),
UINT16_C( 13644), UINT16_C( 48153), UINT16_C( 9), UINT16_C( 18097),
UINT16_C( 7513), UINT16_C( 1114), UINT16_C( 4320), UINT16_C( 10389),
UINT16_C( 4472), UINT16_C( 9445), UINT16_C( 14496), UINT16_C( 15855)) },
{ simde_x_mm512_set_epu16(UINT16_C( 26902), UINT16_C( 51011), UINT16_C( 57631), UINT16_C( 57521),
UINT16_C( 43405), UINT16_C( 18318), UINT16_C( 44023), UINT16_C( 9770),
UINT16_C( 4118), UINT16_C( 33099), UINT16_C( 6621), UINT16_C( 57639),
UINT16_C( 22002), UINT16_C( 33155), UINT16_C( 15537), UINT16_C( 38743),
UINT16_C( 26466), UINT16_C( 21183), UINT16_C( 5811), UINT16_C( 17016),
UINT16_C( 51162), UINT16_C( 46775), UINT16_C( 54252), UINT16_C( 64603),
UINT16_C( 30444), UINT16_C( 20573), UINT16_C( 50572), UINT16_C( 25607),
UINT16_C( 36721), UINT16_C( 36797), UINT16_C( 27147), UINT16_C( 62271)),
simde_x_mm512_set_epu16(UINT16_C( 55381), UINT16_C( 52839), UINT16_C( 60314), UINT16_C( 33159),
UINT16_C( 32076), UINT16_C( 51820), UINT16_C( 13383), UINT16_C( 43204),
UINT16_C( 18058), UINT16_C( 42817), UINT16_C( 56737), UINT16_C( 40285),
UINT16_C( 49341), UINT16_C( 39323), UINT16_C( 53205), UINT16_C( 27016),
UINT16_C( 59998), UINT16_C( 61452), UINT16_C( 37377), UINT16_C( 37691),
UINT16_C( 64794), UINT16_C( 6696), UINT16_C( 3074), UINT16_C( 59025),
UINT16_C( 43625), UINT16_C( 28576), UINT16_C( 36042), UINT16_C( 42716),
UINT16_C( 47937), UINT16_C( 64195), UINT16_C( 8579), UINT16_C( 676)),
simde_x_mm512_set_epu16(UINT16_C( 26902), UINT16_C( 51011), UINT16_C( 57631), UINT16_C( 24362),
UINT16_C( 11329), UINT16_C( 18318), UINT16_C( 3874), UINT16_C( 9770),
UINT16_C( 4118), UINT16_C( 33099), UINT16_C( 6621), UINT16_C( 17354),
UINT16_C( 22002), UINT16_C( 33155), UINT16_C( 15537), UINT16_C( 11727),
UINT16_C( 26466), UINT16_C( 21183), UINT16_C( 5811), UINT16_C( 17016),
UINT16_C( 51162), UINT16_C( 6599), UINT16_C( 1994), UINT16_C( 5578),
UINT16_C( 30444), UINT16_C( 20573), UINT16_C( 14530), UINT16_C( 25607),
UINT16_C( 36721), UINT16_C( 36797), UINT16_C( 1410), UINT16_C( 79)) },
{ simde_x_mm512_set_epu16(UINT16_C( 7566), UINT16_C( 25511), UINT16_C( 59705), UINT16_C( 13989),
UINT16_C( 13965), UINT16_C( 34471), UINT16_C( 77), UINT16_C( 35152),
UINT16_C( 21705), UINT16_C( 42504), UINT16_C( 63033), UINT16_C( 56884),
UINT16_C( 42389), UINT16_C( 61527), UINT16_C( 7598), UINT16_C( 23051),
UINT16_C( 13886), UINT16_C( 28688), UINT16_C( 30551), UINT16_C( 36608),
UINT16_C( 56045), UINT16_C( 38987), UINT16_C( 64798), UINT16_C( 22350),
UINT16_C( 7981), UINT16_C( 50477), UINT16_C( 46688), UINT16_C( 16804),
UINT16_C( 33660), UINT16_C( 63749), UINT16_C( 29649), UINT16_C( 64815)),
simde_x_mm512_set_epu16(UINT16_C( 18409), UINT16_C( 19069), UINT16_C( 20979), UINT16_C( 35774),
UINT16_C( 8112), UINT16_C( 25085), UINT16_C( 31664), UINT16_C( 55404),
UINT16_C( 63329), UINT16_C( 19403), UINT16_C( 33006), UINT16_C( 20365),
UINT16_C( 22045), UINT16_C( 41935), UINT16_C( 28665), UINT16_C( 35793),
UINT16_C( 26789), UINT16_C( 40241), UINT16_C( 34076), UINT16_C( 36189),
UINT16_C( 49507), UINT16_C( 32891), UINT16_C( 45700), UINT16_C( 31541),
UINT16_C( 33237), UINT16_C( 50719), UINT16_C( 22782), UINT16_C( 46902),
UINT16_C( 62792), UINT16_C( 907), UINT16_C( 9939), UINT16_C( 395)),
simde_x_mm512_set_epu16(UINT16_C( 7566), UINT16_C( 6442), UINT16_C( 17747), UINT16_C( 13989),
UINT16_C( 5853), UINT16_C( 9386), UINT16_C( 77), UINT16_C( 35152),
UINT16_C( 21705), UINT16_C( 3698), UINT16_C( 30027), UINT16_C( 16154),
UINT16_C( 20344), UINT16_C( 19592), UINT16_C( 7598), UINT16_C( 23051),
UINT16_C( 13886), UINT16_C( 28688), UINT16_C( 30551), UINT16_C( 419),
UINT16_C( 6538), UINT16_C( 6096), UINT16_C( 19098), UINT16_C( 22350),
UINT16_C( 7981), UINT16_C( 50477), UINT16_C( 1124), UINT16_C( 16804),
UINT16_C( 33660), UINT16_C( 259), UINT16_C( 9771), UINT16_C( 35)) },
{ simde_x_mm512_set_epu16(UINT16_C( 40553), UINT16_C( 9260), UINT16_C( 6846), UINT16_C( 21618),
UINT16_C( 20365), UINT16_C( 26413), UINT16_C( 7670), UINT16_C( 6521),
UINT16_C( 13052), UINT16_C( 19892), UINT16_C( 40021), UINT16_C( 58092),
UINT16_C( 12337), UINT16_C( 14080), UINT16_C( 6934), UINT16_C( 61515),
UINT16_C( 1885), UINT16_C( 11733), UINT16_C( 7371), UINT16_C( 24583),
UINT16_C( 48349), UINT16_C( 37475), UINT16_C( 47206), UINT16_C( 54691),
UINT16_C( 63460), UINT16_C( 2107), UINT16_C( 62169), UINT16_C( 38808),
UINT16_C( 21341), UINT16_C( 51834), UINT16_C( 26283), UINT16_C( 38235)),
simde_x_mm512_set_epu16(UINT16_C( 9227), UINT16_C( 20728), UINT16_C( 22448), UINT16_C( 22271),
UINT16_C( 38010), UINT16_C( 3228), UINT16_C( 38598), UINT16_C( 15839),
UINT16_C( 4554), UINT16_C( 22831), UINT16_C( 44103), UINT16_C( 32351),
UINT16_C( 46747), UINT16_C( 20983), UINT16_C( 61889), UINT16_C( 26454),
UINT16_C( 63311), UINT16_C( 19804), UINT16_C( 62773), UINT16_C( 56806),
UINT16_C( 36384), UINT16_C( 25302), UINT16_C( 37143), UINT16_C( 3478),
UINT16_C( 59861), UINT16_C( 61175), UINT16_C( 48658), UINT16_C( 23119),
UINT16_C( 30252), UINT16_C( 63116), UINT16_C( 13170), UINT16_C( 44087)),
simde_x_mm512_set_epu16(UINT16_C( 3645), UINT16_C( 9260), UINT16_C( 6846), UINT16_C( 21618),
UINT16_C( 20365), UINT16_C( 589), UINT16_C( 7670), UINT16_C( 6521),
UINT16_C( 3944), UINT16_C( 19892), UINT16_C( 40021), UINT16_C( 25741),
UINT16_C( 12337), UINT16_C( 14080), UINT16_C( 6934), UINT16_C( 8607),
UINT16_C( 1885), UINT16_C( 11733), UINT16_C( 7371), UINT16_C( 24583),
UINT16_C( 11965), UINT16_C( 12173), UINT16_C( 10063), UINT16_C( 2521),
UINT16_C( 3599), UINT16_C( 2107), UINT16_C( 13511), UINT16_C( 15689),
UINT16_C( 21341), UINT16_C( 51834), UINT16_C( 13113), UINT16_C( 38235)) },
{ simde_x_mm512_set_epu16(UINT16_C( 22335), UINT16_C( 12112), UINT16_C( 9189), UINT16_C( 1311),
UINT16_C( 58441), UINT16_C( 13615), UINT16_C( 43712), UINT16_C( 31469),
UINT16_C( 12162), UINT16_C( 56166), UINT16_C( 41769), UINT16_C( 50135),
UINT16_C( 50998), UINT16_C( 24958), UINT16_C( 2725), UINT16_C( 39768),
UINT16_C( 47167), UINT16_C( 24484), UINT16_C( 16711), UINT16_C( 44632),
UINT16_C( 46990), UINT16_C( 25102), UINT16_C( 6573), UINT16_C( 22274),
UINT16_C( 49039), UINT16_C( 38914), UINT16_C( 32256), UINT16_C( 41529),
UINT16_C( 62756), UINT16_C( 61238), UINT16_C( 8613), UINT16_C( 51028)),
simde_x_mm512_set_epu16(UINT16_C( 30472), UINT16_C( 36773), UINT16_C( 7714), UINT16_C( 18947),
UINT16_C( 7066), UINT16_C( 47844), UINT16_C( 58651), UINT16_C( 1841),
UINT16_C( 35799), UINT16_C( 50579), UINT16_C( 32926), UINT16_C( 26598),
UINT16_C( 39537), UINT16_C( 61137), UINT16_C( 5946), UINT16_C( 2262),
UINT16_C( 60116), UINT16_C( 12953), UINT16_C( 38045), UINT16_C( 47787),
UINT16_C( 30618), UINT16_C( 37811), UINT16_C( 51748), UINT16_C( 52236),
UINT16_C( 23394), UINT16_C( 2441), UINT16_C( 32382), UINT16_C( 9384),
UINT16_C( 25792), UINT16_C( 56163), UINT16_C( 22658), UINT16_C( 20939)),
simde_x_mm512_set_epu16(UINT16_C( 22335), UINT16_C( 12112), UINT16_C( 1475), UINT16_C( 1311),
UINT16_C( 1913), UINT16_C( 13615), UINT16_C( 43712), UINT16_C( 172),
UINT16_C( 12162), UINT16_C( 5587), UINT16_C( 8843), UINT16_C( 23537),
UINT16_C( 11461), UINT16_C( 24958), UINT16_C( 2725), UINT16_C( 1314),
UINT16_C( 47167), UINT16_C( 11531), UINT16_C( 16711), UINT16_C( 44632),
UINT16_C( 16372), UINT16_C( 25102), UINT16_C( 6573), UINT16_C( 22274),
UINT16_C( 2251), UINT16_C( 2299), UINT16_C( 32256), UINT16_C( 3993),
UINT16_C( 11172), UINT16_C( 5075), UINT16_C( 8613), UINT16_C( 9150)) },
{ simde_x_mm512_set_epu16(UINT16_C( 13867), UINT16_C( 28091), UINT16_C( 35390), UINT16_C( 56986),
UINT16_C( 31509), UINT16_C( 63331), UINT16_C( 9520), UINT16_C( 29929),
UINT16_C( 24571), UINT16_C( 37741), UINT16_C( 52686), UINT16_C( 14609),
UINT16_C( 31001), UINT16_C( 823), UINT16_C( 45697), UINT16_C( 38351),
UINT16_C( 35780), UINT16_C( 41006), UINT16_C( 3633), UINT16_C( 45500),
UINT16_C( 30184), UINT16_C( 27396), UINT16_C( 1171), UINT16_C( 25936),
UINT16_C( 61703), UINT16_C( 57786), UINT16_C( 19453), UINT16_C( 30002),
UINT16_C( 6315), UINT16_C( 244), UINT16_C( 8399), UINT16_C( 57456)),
simde_x_mm512_set_epu16(UINT16_C( 18752), UINT16_C( 27431), UINT16_C( 53704), UINT16_C( 42625),
UINT16_C( 42869), UINT16_C( 41745), UINT16_C( 47543), UINT16_C( 11401),
UINT16_C( 26966), UINT16_C( 26500), UINT16_C( 7486), UINT16_C( 7825),
UINT16_C( 17767), UINT16_C( 58506), UINT16_C( 36234), UINT16_C( 38373),
UINT16_C( 54992), UINT16_C( 46906), UINT16_C( 52104), UINT16_C( 31285),
UINT16_C( 34932), UINT16_C( 29467), UINT16_C( 33781), UINT16_C( 883),
UINT16_C( 23995), UINT16_C( 43069), UINT16_C( 53587), UINT16_C( 11327),
UINT16_C( 36611), UINT16_C( 7518), UINT16_C( 30015), UINT16_C( 30285)),
simde_x_mm512_set_epu16(UINT16_C( 13867), UINT16_C( 660), UINT16_C( 35390), UINT16_C( 14361),
UINT16_C( 31509), UINT16_C( 21586), UINT16_C( 9520), UINT16_C( 7127),
UINT16_C( 24571), UINT16_C( 11241), UINT16_C( 284), UINT16_C( 6784),
UINT16_C( 13234), UINT16_C( 823), UINT16_C( 9463), UINT16_C( 38351),
UINT16_C( 35780), UINT16_C( 41006), UINT16_C( 3633), UINT16_C( 14215),
UINT16_C( 30184), UINT16_C( 27396), UINT16_C( 1171), UINT16_C( 329),
UINT16_C( 13713), UINT16_C( 14717), UINT16_C( 19453), UINT16_C( 7348),
UINT16_C( 6315), UINT16_C( 244), UINT16_C( 8399), UINT16_C( 27171)) },
{ simde_x_mm512_set_epu16(UINT16_C( 19003), UINT16_C( 26627), UINT16_C( 63705), UINT16_C( 34218),
UINT16_C( 36055), UINT16_C( 13847), UINT16_C( 44625), UINT16_C( 9042),
UINT16_C( 36148), UINT16_C( 11660), UINT16_C( 32339), UINT16_C( 39715),
UINT16_C( 47178), UINT16_C( 21002), UINT16_C( 60706), UINT16_C( 8527),
UINT16_C( 26072), UINT16_C( 29611), UINT16_C( 18348), UINT16_C( 953),
UINT16_C( 33382), UINT16_C( 22717), UINT16_C( 50122), UINT16_C( 52414),
UINT16_C( 59278), UINT16_C( 54225), UINT16_C( 31952), UINT16_C( 29752),
UINT16_C( 37488), UINT16_C( 20614), UINT16_C( 1055), UINT16_C( 61149)),
simde_x_mm512_set_epu16(UINT16_C( 59727), UINT16_C( 3072), UINT16_C( 8626), UINT16_C( 14922),
UINT16_C( 64116), UINT16_C( 36372), UINT16_C( 22591), UINT16_C( 8828),
UINT16_C( 64048), UINT16_C( 56808), UINT16_C( 56651), UINT16_C( 39760),
UINT16_C( 59817), UINT16_C( 50914), UINT16_C( 21275), UINT16_C( 35106),
UINT16_C( 6020), UINT16_C( 27245), UINT16_C( 34763), UINT16_C( 25208),
UINT16_C( 25908), UINT16_C( 21036), UINT16_C( 36366), UINT16_C( 25589),
UINT16_C( 2188), UINT16_C( 36219), UINT16_C( 56227), UINT16_C( 50409),
UINT16_C( 8889), UINT16_C( 58476), UINT16_C( 24556), UINT16_C( 24873)),
simde_x_mm512_set_epu16(UINT16_C( 19003), UINT16_C( 2051), UINT16_C( 3323), UINT16_C( 4374),
UINT16_C( 36055), UINT16_C( 13847), UINT16_C( 22034), UINT16_C( 214),
UINT16_C( 36148), UINT16_C( 11660), UINT16_C( 32339), UINT16_C( 39715),
UINT16_C( 47178), UINT16_C( 21002), UINT16_C( 18156), UINT16_C( 8527),
UINT16_C( 1992), UINT16_C( 2366), UINT16_C( 18348), UINT16_C( 953),
UINT16_C( 7474), UINT16_C( 1681), UINT16_C( 13756), UINT16_C( 1236),
UINT16_C( 202), UINT16_C( 18006), UINT16_C( 31952), UINT16_C( 29752),
UINT16_C( 1932), UINT16_C( 20614), UINT16_C( 1055), UINT16_C( 11403)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_rem_epu16(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_u16(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_rem_epu32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_x_mm512_set_epu32(UINT32_C( 691121094), UINT32_C( 674034227), UINT32_C(2329532409), UINT32_C(3374680349),
UINT32_C(3920294270), UINT32_C(3054162118), UINT32_C(1568850865), UINT32_C(3151989757),
UINT32_C(3215450688), UINT32_C(3586813553), UINT32_C(1508722402), UINT32_C(2220621656),
UINT32_C(1747596798), UINT32_C(2231263307), UINT32_C( 527472553), UINT32_C(2891870298)),
simde_x_mm512_set_epu32(UINT32_C(2530156426), UINT32_C(1179683687), UINT32_C(2648640694), UINT32_C(3623000007),
UINT32_C(2708640028), UINT32_C(1691051285), UINT32_C( 50347892), UINT32_C( 728425428),
UINT32_C(1192263444), UINT32_C(2208623573), UINT32_C(1322777130), UINT32_C( 163989560),
UINT32_C(1492341726), UINT32_C( 298608154), UINT32_C(1250819173), UINT32_C(3643996043)),
simde_x_mm512_set_epu32(UINT32_C( 691121094), UINT32_C( 674034227), UINT32_C(2329532409), UINT32_C(3374680349),
UINT32_C(1211654242), UINT32_C(1363110833), UINT32_C( 8066213), UINT32_C( 238288045),
UINT32_C( 830923800), UINT32_C(1378189980), UINT32_C( 185945272), UINT32_C( 88757376),
UINT32_C( 255255072), UINT32_C( 141006229), UINT32_C( 527472553), UINT32_C(2891870298)) },
{ simde_x_mm512_set_epu32(UINT32_C(1314482530), UINT32_C(2997716679), UINT32_C(3555959260), UINT32_C(2875927297),
UINT32_C(3290702646), UINT32_C(1580565751), UINT32_C(3823902839), UINT32_C(2081361826),
UINT32_C( 493161721), UINT32_C(3099851477), UINT32_C( 894221337), UINT32_C(2964507124),
UINT32_C( 492373082), UINT32_C(4281870485), UINT32_C(2207786213), UINT32_C(3953959418)),
simde_x_mm512_set_epu32(UINT32_C(3156074065), UINT32_C(3607805659), UINT32_C(1828175063), UINT32_C(3905547273),
UINT32_C(4101755863), UINT32_C(3436978124), UINT32_C(3846637996), UINT32_C(2693603084),
UINT32_C(1710148738), UINT32_C(1974123080), UINT32_C(2870600100), UINT32_C( 118588227),
UINT32_C( 542053192), UINT32_C( 499863549), UINT32_C( 957375358), UINT32_C(3003933707)),
simde_x_mm512_set_epu32(UINT32_C(1314482530), UINT32_C(2997716679), UINT32_C(1727784197), UINT32_C(2875927297),
UINT32_C(3290702646), UINT32_C(1580565751), UINT32_C(3823902839), UINT32_C(2081361826),
UINT32_C( 493161721), UINT32_C(1125728397), UINT32_C( 894221337), UINT32_C( 118389676),
UINT32_C( 492373082), UINT32_C( 282962093), UINT32_C( 293035497), UINT32_C( 950025711)) },
{ simde_x_mm512_set_epu32(UINT32_C(1763100483), UINT32_C(3776962737), UINT32_C(2844608398), UINT32_C(2885101098),
UINT32_C( 269910347), UINT32_C( 433971495), UINT32_C(1441956227), UINT32_C(1018271575),
UINT32_C(1734496959), UINT32_C( 380846712), UINT32_C(3352999607), UINT32_C(3555523675),
UINT32_C(1995198557), UINT32_C(3314312199), UINT32_C(2406584253), UINT32_C(1779168063)),
simde_x_mm512_set_epu32(UINT32_C(3629502055), UINT32_C(3952771463), UINT32_C(2102184556), UINT32_C( 877111492),
UINT32_C(1183491905), UINT32_C(3718356317), UINT32_C(3233651099), UINT32_C(3486869896),
UINT32_C(3932090380), UINT32_C(2449576763), UINT32_C(4246346280), UINT32_C( 201516689),
UINT32_C(2859036576), UINT32_C(2362091228), UINT32_C(3141663427), UINT32_C( 562234020)),
simde_x_mm512_set_epu32(UINT32_C(1763100483), UINT32_C(3776962737), UINT32_C( 742423842), UINT32_C( 253766622),
UINT32_C( 269910347), UINT32_C( 433971495), UINT32_C(1441956227), UINT32_C(1018271575),
UINT32_C(1734496959), UINT32_C( 380846712), UINT32_C(3352999607), UINT32_C( 129739962),
UINT32_C(1995198557), UINT32_C( 952220971), UINT32_C(2406584253), UINT32_C( 92466003)) },
{ simde_x_mm512_set_epu32(UINT32_C( 495870887), UINT32_C(3912840869), UINT32_C( 915244711), UINT32_C( 5081424),
UINT32_C(1422501384), UINT32_C(4130987572), UINT32_C(2778067031), UINT32_C( 497965579),
UINT32_C( 910061584), UINT32_C(2002226944), UINT32_C(3673004107), UINT32_C(4246624078),
UINT32_C( 523093293), UINT32_C(3059761572), UINT32_C(2206005509), UINT32_C(1943141679)),
simde_x_mm512_set_epu32(UINT32_C(1206471293), UINT32_C(1374915518), UINT32_C( 531653117), UINT32_C(2075187308),
UINT32_C(4150348747), UINT32_C(2163101581), UINT32_C(1444783055), UINT32_C(1878625233),
UINT32_C(1755684145), UINT32_C(2233240925), UINT32_C(3244523643), UINT32_C(2995026741),
UINT32_C(2178270751), UINT32_C(1493088054), UINT32_C(4115137419), UINT32_C( 651362699)),
simde_x_mm512_set_epu32(UINT32_C( 495870887), UINT32_C(1163009833), UINT32_C( 383591594), UINT32_C( 5081424),
UINT32_C(1422501384), UINT32_C(1967885991), UINT32_C(1333283976), UINT32_C( 497965579),
UINT32_C( 910061584), UINT32_C(2002226944), UINT32_C( 428480464), UINT32_C(1251597337),
UINT32_C( 523093293), UINT32_C( 73585464), UINT32_C(2206005509), UINT32_C( 640416281)) },
{ simde_x_mm512_set_epu32(UINT32_C(2657690668), UINT32_C( 448681074), UINT32_C(1334667053), UINT32_C( 502667641),
UINT32_C( 855395764), UINT32_C(2622874348), UINT32_C( 808531712), UINT32_C( 454488139),
UINT32_C( 123547093), UINT32_C( 483090439), UINT32_C(3168637539), UINT32_C(3093747107),
UINT32_C(4158916667), UINT32_C(4074346392), UINT32_C(1398655610), UINT32_C(1722520923)),
simde_x_mm512_set_epu32(UINT32_C( 604721400), UINT32_C(1471174399), UINT32_C(2491026588), UINT32_C(2529574367),
UINT32_C( 298473775), UINT32_C(2890366559), UINT32_C(3063632375), UINT32_C(4055983958),
UINT32_C(4149169500), UINT32_C(4113948134), UINT32_C(2384487126), UINT32_C(2434207126),
UINT32_C(3923111671), UINT32_C(3188873807), UINT32_C(1982658188), UINT32_C( 863153207)),
simde_x_mm512_set_epu32(UINT32_C( 238805068), UINT32_C( 448681074), UINT32_C(1334667053), UINT32_C( 502667641),
UINT32_C( 258448214), UINT32_C(2622874348), UINT32_C( 808531712), UINT32_C( 454488139),
UINT32_C( 123547093), UINT32_C( 483090439), UINT32_C( 784150413), UINT32_C( 659539981),
UINT32_C( 235804996), UINT32_C( 885472585), UINT32_C(1398655610), UINT32_C( 859367716)) },
{ simde_x_mm512_set_epu32(UINT32_C(1463758672), UINT32_C( 602211615), UINT32_C(3830002991), UINT32_C(2864741101),
UINT32_C( 797104998), UINT32_C(2737423319), UINT32_C(3342229886), UINT32_C( 178625368),
UINT32_C(3091160996), UINT32_C(1095216728), UINT32_C(3079561742), UINT32_C( 430790402),
UINT32_C(3213858818), UINT32_C(2113970745), UINT32_C(4112838454), UINT32_C( 564512596)),
simde_x_mm512_set_epu32(UINT32_C(1997049765), UINT32_C( 505563651), UINT32_C( 463125220), UINT32_C(3843753777),
UINT32_C(2346173843), UINT32_C(2157864934), UINT32_C(2591157969), UINT32_C( 389679318),
UINT32_C(3939775129), UINT32_C(2493364907), UINT32_C(2006619059), UINT32_C(3391409164),
UINT32_C(1533151625), UINT32_C(2122196136), UINT32_C(1690360675), UINT32_C(1484935627)),
simde_x_mm512_set_epu32(UINT32_C(1463758672), UINT32_C( 96647964), UINT32_C( 125001231), UINT32_C(2864741101),
UINT32_C( 797104998), UINT32_C( 579558385), UINT32_C( 751071917), UINT32_C( 178625368),
UINT32_C(3091160996), UINT32_C(1095216728), UINT32_C(1072942683), UINT32_C( 430790402),
UINT32_C( 147555568), UINT32_C(2113970745), UINT32_C( 732117104), UINT32_C( 564512596)) },
{ simde_x_mm512_set_epu32(UINT32_C( 908815803), UINT32_C(2319376026), UINT32_C(2065037155), UINT32_C( 623932649),
UINT32_C(1610322797), UINT32_C(3452844305), UINT32_C(2031682359), UINT32_C(2994836943),
UINT32_C(2344919086), UINT32_C( 238137788), UINT32_C(1978166020), UINT32_C( 76768592),
UINT32_C(4043825594), UINT32_C(1274901810), UINT32_C( 413860084), UINT32_C( 550494320)),
simde_x_mm512_set_epu32(UINT32_C(1228958503), UINT32_C(3519587969), UINT32_C(2809504529), UINT32_C(3115789449),
UINT32_C(1767270276), UINT32_C( 490610321), UINT32_C(1164436618), UINT32_C(2374669797),
UINT32_C(3604002618), UINT32_C(3414719029), UINT32_C(2289333019), UINT32_C(2213872499),
UINT32_C(1572579389), UINT32_C(3511888959), UINT32_C(2399346014), UINT32_C(1967093325)),
simde_x_mm512_set_epu32(UINT32_C( 908815803), UINT32_C(2319376026), UINT32_C(2065037155), UINT32_C( 623932649),
UINT32_C(1610322797), UINT32_C( 18572058), UINT32_C( 867245741), UINT32_C( 620167146),
UINT32_C(2344919086), UINT32_C( 238137788), UINT32_C(1978166020), UINT32_C( 76768592),
UINT32_C( 898666816), UINT32_C(1274901810), UINT32_C( 413860084), UINT32_C( 550494320)) },
{ simde_x_mm512_set_epu32(UINT32_C(1245407235), UINT32_C(4175005098), UINT32_C(2362914327), UINT32_C(2924553042),
UINT32_C(2369006988), UINT32_C(2119408419), UINT32_C(3091878410), UINT32_C(3978436943),
UINT32_C(1708684203), UINT32_C(1202455481), UINT32_C(2187745469), UINT32_C(3284847806),
UINT32_C(3884897233), UINT32_C(2094036024), UINT32_C(2456834182), UINT32_C( 69201629)),
simde_x_mm512_set_epu32(UINT32_C(3914271744), UINT32_C( 565328458), UINT32_C(4201942548), UINT32_C(1480532604),
UINT32_C(4197506536), UINT32_C(3712719696), UINT32_C(3920217826), UINT32_C(1394313506),
UINT32_C( 394553965), UINT32_C(2278253176), UINT32_C(1697927724), UINT32_C(2383307765),
UINT32_C( 143428987), UINT32_C(3684943081), UINT32_C( 582607980), UINT32_C(1609326889)),
simde_x_mm512_set_epu32(UINT32_C(1245407235), UINT32_C( 217705892), UINT32_C(2362914327), UINT32_C(1444020438),
UINT32_C(2369006988), UINT32_C(2119408419), UINT32_C(3091878410), UINT32_C(1189809931),
UINT32_C( 130468343), UINT32_C(1202455481), UINT32_C( 489817745), UINT32_C( 901540041),
UINT32_C( 12314584), UINT32_C(2094036024), UINT32_C( 126402262), UINT32_C( 69201629)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_rem_epu32(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_u32(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_rem_epu64(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512i a;
simde__m512i b;
simde__m512i r;
} test_vec[8] = {
{ simde_x_mm512_set_epu64(UINT64_C( 2968342496979776051), UINT64_C(10005265515001776413),
UINT64_C(16837535683400356038), UINT64_C( 6738163160628300797),
UINT64_C(13810255550447513201), UINT64_C( 6479913377553186648),
UINT64_C( 7505871096235581515), UINT64_C( 2265477367564496986)),
simde_x_mm512_set_epu64(UINT64_C(10866939104613927783), UINT64_C(11375825163207743431),
UINT64_C(11633520338587575573), UINT64_C( 216242550290965460),
UINT64_C( 5120732502404950997), UINT64_C( 5681284513410730040),
UINT64_C( 6409558907924801050), UINT64_C( 5372227444888762251)),
simde_x_mm512_set_epu64(UINT64_C( 2968342496979776051), UINT64_C(10005265515001776413),
UINT64_C( 5204015344812780465), UINT64_C( 34644101608371537),
UINT64_C( 3568790545637611207), UINT64_C( 798628864142456608),
UINT64_C( 1096312188310780465), UINT64_C( 2265477367564496986)) },
{ simde_x_mm512_set_epu64(UINT64_C( 5645659480511055559), UINT64_C(15272728730484288257),
UINT64_C(14133460247011230967), UINT64_C(16423537638667915170),
UINT64_C( 2118113466433927893), UINT64_C( 3840651400764901876),
UINT64_C( 2114726288902596757), UINT64_C( 9482369585348649466)),
simde_x_mm512_set_epu64(UINT64_C(13555234896536583899), UINT64_C( 7851952110853286921),
UINT64_C(17616907291198234572), UINT64_C(16521184395064581900),
UINT64_C( 7345032902979795528), UINT64_C(12329133549512917827),
UINT64_C( 2328100732832272381), UINT64_C( 4111895855610225675)),
simde_x_mm512_set_epu64(UINT64_C( 5645659480511055559), UINT64_C( 7420776619631001336),
UINT64_C(14133460247011230967), UINT64_C(16423537638667915170),
UINT64_C( 2118113466433927893), UINT64_C( 3840651400764901876),
UINT64_C( 2114726288902596757), UINT64_C( 1258577874128198116)) },
{ simde_x_mm512_set_epu64(UINT64_C( 7572458917823766705), UINT64_C(12217500042222052906),
UINT64_C( 1159256113650983207), UINT64_C( 6193154838246823767),
UINT64_C( 7449607714297299576), UINT64_C(14401023659121376347),
UINT64_C( 8569312554655704071), UINT64_C(10336200663482757951)),
simde_x_mm512_set_epu64(UINT64_C(15588592630942564743), UINT64_C( 9028813919053392068),
UINT64_C( 5083059030774095197), UINT64_C(13888425720366328200),
UINT64_C(16888199589465789243), UINT64_C(18237918400292775569),
UINT64_C(12279468594349909724), UINT64_C(13493341674566517412)),
simde_x_mm512_set_epu64(UINT64_C( 7572458917823766705), UINT64_C( 3188686123168660838),
UINT64_C( 1159256113650983207), UINT64_C( 6193154838246823767),
UINT64_C( 7449607714297299576), UINT64_C(14401023659121376347),
UINT64_C( 8569312554655704071), UINT64_C(10336200663482757951)) },
{ simde_x_mm512_set_epu64(UINT64_C( 2129749246616352421), UINT64_C( 3930946101587052880),
UINT64_C( 6109596926925725236), UINT64_C(11931707044738783755),
UINT64_C( 3908684742628183808), UINT64_C(15775432521885308750),
UINT64_C( 2246668589251707300), UINT64_C( 9474721517893975343)),
simde_x_mm512_set_epu64(UINT64_C( 5181754748372749246), UINT64_C( 2283432752406648940),
UINT64_C(17825612137522679693), UINT64_C( 6205295972918594513),
UINT64_C( 7540605987113962845), UINT64_C(13935122940778806069),
UINT64_C( 9355601638871447350), UINT64_C(17674380633802211723)),
simde_x_mm512_set_epu64(UINT64_C( 2129749246616352421), UINT64_C( 1647513349180403940),
UINT64_C( 6109596926925725236), UINT64_C( 5726411071820189242),
UINT64_C( 3908684742628183808), UINT64_C( 1840309581106502681),
UINT64_C( 2246668589251707300), UINT64_C( 9474721517893975343)) },
{ simde_x_mm512_set_epu64(UINT64_C(11414694502393074802), UINT64_C( 5732351344186366329),
UINT64_C( 3673896834139808492), UINT64_C( 3472617261273378891),
UINT64_C( 530630724433960967), UINT64_C(13609194605976671651),
UINT64_C(17862411075628668824), UINT64_C( 6007180105039451483)),
simde_x_mm512_set_epu64(UINT64_C( 2597258637662508799), UINT64_C(10698877731456040415),
UINT64_C( 1281935105229028959), UINT64_C(13158200861647791958),
UINT64_C(17820547312174620134), UINT64_C(10241294226337238422),
UINT64_C(16849636328689785423), UINT64_C( 8515452077469772855)),
simde_x_mm512_set_epu64(UINT64_C( 1025659951743039606), UINT64_C( 5732351344186366329),
UINT64_C( 1110026623681750574), UINT64_C( 3472617261273378891),
UINT64_C( 530630724433960967), UINT64_C( 3367900379639433229),
UINT64_C( 1012774746938883401), UINT64_C( 6007180105039451483)) },
{ simde_x_mm512_set_epu64(UINT64_C( 6286795626078602527), UINT64_C(16449737592791923437),
UINT64_C( 3423539900625568727), UINT64_C(14354768056262433624),
UINT64_C(13276435385586003544), UINT64_C(13226616968333580034),
UINT64_C(13803418519385186873), UINT64_C(17664506654225712980)),
simde_x_mm512_set_epu64(UINT64_C( 8577263429665049091), UINT64_C( 1989107677696558897),
UINT64_C(10076739928573503462), UINT64_C(11128938736014461142),
UINT64_C(16921205335142546091), UINT64_C( 8618363237326703628),
UINT64_C( 6584836091306452136), UINT64_C( 7260043819054420427)),
simde_x_mm512_set_epu64(UINT64_C( 6286795626078602527), UINT64_C( 536876171219452261),
UINT64_C( 3423539900625568727), UINT64_C( 3225829320247972482),
UINT64_C(13276435385586003544), UINT64_C( 4608253731006876406),
UINT64_C( 633746336772282601), UINT64_C( 3144419016116872126)) },
{ simde_x_mm512_set_epu64(UINT64_C( 3903334154292354714), UINT64_C( 8869267046373815529),
UINT64_C( 6916283752571091217), UINT64_C( 8726009290759968207),
UINT64_C(10071350786374349244), UINT64_C( 8496158362035250512),
UINT64_C(17368098678232675634), UINT64_C( 1777515526450307184)),
simde_x_mm512_set_epu64(UINT64_C( 5278336582045705857), UINT64_C(12066730073134673033),
UINT64_C( 7590368039103504017), UINT64_C( 5001217194949514725),
UINT64_C(15479073382423099957), UINT64_C( 9832610448471819123),
UINT64_C( 6754177049630551103), UINT64_C(10305112663885051469)),
simde_x_mm512_set_epu64(UINT64_C( 3903334154292354714), UINT64_C( 8869267046373815529),
UINT64_C( 6916283752571091217), UINT64_C( 3724792095810453482),
UINT64_C(10071350786374349244), UINT64_C( 8496158362035250512),
UINT64_C( 3859744578971573428), UINT64_C( 1777515526450307184)) },
{ simde_x_mm512_set_epu64(UINT64_C( 5348983348701791658), UINT64_C(10148639760639402834),
UINT64_C(10174807539574872867), UINT64_C(13279516658136916303),
UINT64_C( 7338742772279280569), UINT64_C( 9396295244612029630),
UINT64_C(16685506566149927992), UINT64_C(10552022463454113501)),
simde_x_mm512_set_epu64(UINT64_C(16811669128702212682), UINT64_C(18047205824811442812),
UINT64_C(18028153300578966352), UINT64_C(16837207357260532002),
UINT64_C( 1694596378460381816), UINT64_C( 7292544047935022069),
UINT64_C( 616022812148352233), UINT64_C( 2502282222097948969)),
simde_x_mm512_set_epu64(UINT64_C( 5348983348701791658), UINT64_C(10148639760639402834),
UINT64_C(10174807539574872867), UINT64_C(13279516658136916303),
UINT64_C( 560357258437753305), UINT64_C( 2103751196677007561),
UINT64_C( 52890638144417701), UINT64_C( 542893575062317625)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512i r = simde_mm512_rem_epu64(test_vec[i].a, test_vec[i].b);
simde_assert_m512i_u64(r, ==, test_vec[i].r);
}
return 0;
}
static int
test_simde_mm512_recip_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -838.19), SIMDE_FLOAT32_C( -143.82), SIMDE_FLOAT32_C( -921.01), SIMDE_FLOAT32_C( 206.87),
SIMDE_FLOAT32_C( -588.92), SIMDE_FLOAT32_C( -497.03), SIMDE_FLOAT32_C( -701.44), SIMDE_FLOAT32_C( -106.77),
SIMDE_FLOAT32_C( 464.17), SIMDE_FLOAT32_C( 464.85), SIMDE_FLOAT32_C( 819.12), SIMDE_FLOAT32_C( 908.79),
SIMDE_FLOAT32_C( -61.04), SIMDE_FLOAT32_C( -36.34), SIMDE_FLOAT32_C( -38.98), SIMDE_FLOAT32_C( -132.37) },
{ SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( -0.01) } },
{ { SIMDE_FLOAT32_C( -324.68), SIMDE_FLOAT32_C( 773.13), SIMDE_FLOAT32_C( -941.14), SIMDE_FLOAT32_C( 753.16),
SIMDE_FLOAT32_C( -838.44), SIMDE_FLOAT32_C( -965.63), SIMDE_FLOAT32_C( 698.21), SIMDE_FLOAT32_C( -608.98),
SIMDE_FLOAT32_C( -35.12), SIMDE_FLOAT32_C( 227.88), SIMDE_FLOAT32_C( -531.46), SIMDE_FLOAT32_C( 933.01),
SIMDE_FLOAT32_C( 160.30), SIMDE_FLOAT32_C( 700.78), SIMDE_FLOAT32_C( -193.29), SIMDE_FLOAT32_C( 322.12) },
{ SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00),
SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( -443.04), SIMDE_FLOAT32_C( -114.30), SIMDE_FLOAT32_C( -471.01), SIMDE_FLOAT32_C( -31.96),
SIMDE_FLOAT32_C( 388.67), SIMDE_FLOAT32_C( -172.45), SIMDE_FLOAT32_C( 861.27), SIMDE_FLOAT32_C( -147.16),
SIMDE_FLOAT32_C( -707.59), SIMDE_FLOAT32_C( 680.39), SIMDE_FLOAT32_C( -238.37), SIMDE_FLOAT32_C( 231.37),
SIMDE_FLOAT32_C( -355.96), SIMDE_FLOAT32_C( 722.66), SIMDE_FLOAT32_C( -901.00), SIMDE_FLOAT32_C( 319.36) },
{ SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00) } },
{ { SIMDE_FLOAT32_C( 495.79), SIMDE_FLOAT32_C( -842.14), SIMDE_FLOAT32_C( 72.53), SIMDE_FLOAT32_C( 657.34),
SIMDE_FLOAT32_C( -807.78), SIMDE_FLOAT32_C( -229.27), SIMDE_FLOAT32_C( -951.64), SIMDE_FLOAT32_C( 157.10),
SIMDE_FLOAT32_C( 998.62), SIMDE_FLOAT32_C( -483.10), SIMDE_FLOAT32_C( 90.12), SIMDE_FLOAT32_C( 158.92),
SIMDE_FLOAT32_C( -782.32), SIMDE_FLOAT32_C( 896.82), SIMDE_FLOAT32_C( -518.96), SIMDE_FLOAT32_C( -225.36) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00) } },
{ { SIMDE_FLOAT32_C( -217.48), SIMDE_FLOAT32_C( 10.04), SIMDE_FLOAT32_C( 742.68), SIMDE_FLOAT32_C( -828.81),
SIMDE_FLOAT32_C( 837.59), SIMDE_FLOAT32_C( 603.95), SIMDE_FLOAT32_C( 24.04), SIMDE_FLOAT32_C( -870.01),
SIMDE_FLOAT32_C( 284.34), SIMDE_FLOAT32_C( 785.67), SIMDE_FLOAT32_C( 361.36), SIMDE_FLOAT32_C( 928.38),
SIMDE_FLOAT32_C( 508.33), SIMDE_FLOAT32_C( 460.36), SIMDE_FLOAT32_C( 247.75), SIMDE_FLOAT32_C( 4.11) },
{ SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.24) } },
{ { SIMDE_FLOAT32_C( 618.21), SIMDE_FLOAT32_C( -679.72), SIMDE_FLOAT32_C( -338.54), SIMDE_FLOAT32_C( 810.43),
SIMDE_FLOAT32_C( 91.01), SIMDE_FLOAT32_C( -290.18), SIMDE_FLOAT32_C( -32.46), SIMDE_FLOAT32_C( 89.63),
SIMDE_FLOAT32_C( 226.71), SIMDE_FLOAT32_C( -942.35), SIMDE_FLOAT32_C( -751.45), SIMDE_FLOAT32_C( 444.40),
SIMDE_FLOAT32_C( 954.48), SIMDE_FLOAT32_C( -270.41), SIMDE_FLOAT32_C( -780.96), SIMDE_FLOAT32_C( -263.00) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00) } },
{ { SIMDE_FLOAT32_C( 739.63), SIMDE_FLOAT32_C( 961.72), SIMDE_FLOAT32_C( -91.80), SIMDE_FLOAT32_C( 577.21),
SIMDE_FLOAT32_C( 565.67), SIMDE_FLOAT32_C( 932.23), SIMDE_FLOAT32_C( 707.21), SIMDE_FLOAT32_C( -149.99),
SIMDE_FLOAT32_C( 717.90), SIMDE_FLOAT32_C( 68.56), SIMDE_FLOAT32_C( -221.60), SIMDE_FLOAT32_C( 226.23),
SIMDE_FLOAT32_C( -471.08), SIMDE_FLOAT32_C( -973.85), SIMDE_FLOAT32_C( -769.66), SIMDE_FLOAT32_C( -852.87) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00) } },
{ { SIMDE_FLOAT32_C( -653.58), SIMDE_FLOAT32_C( -108.21), SIMDE_FLOAT32_C( 957.57), SIMDE_FLOAT32_C( 437.43),
SIMDE_FLOAT32_C( 601.61), SIMDE_FLOAT32_C( -74.89), SIMDE_FLOAT32_C( -472.94), SIMDE_FLOAT32_C( -171.67),
SIMDE_FLOAT32_C( -17.24), SIMDE_FLOAT32_C( -224.39), SIMDE_FLOAT32_C( -727.28), SIMDE_FLOAT32_C( -62.76),
SIMDE_FLOAT32_C( 505.21), SIMDE_FLOAT32_C( -508.24), SIMDE_FLOAT32_C( 674.24), SIMDE_FLOAT32_C( 244.83) },
{ SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.02),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_recip_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_recip_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -559.02), SIMDE_FLOAT32_C( -653.98), SIMDE_FLOAT32_C( -629.51), SIMDE_FLOAT32_C( 712.50),
SIMDE_FLOAT32_C( 485.85), SIMDE_FLOAT32_C( 827.80), SIMDE_FLOAT32_C( 553.84), SIMDE_FLOAT32_C( -702.08),
SIMDE_FLOAT32_C( 943.96), SIMDE_FLOAT32_C( -619.45), SIMDE_FLOAT32_C( -617.57), SIMDE_FLOAT32_C( 132.09),
SIMDE_FLOAT32_C( 914.75), SIMDE_FLOAT32_C( -571.13), SIMDE_FLOAT32_C( 684.78), SIMDE_FLOAT32_C( 888.84) },
UINT8_C( 30),
{ SIMDE_FLOAT32_C( 989.94), SIMDE_FLOAT32_C( 139.65), SIMDE_FLOAT32_C( 430.34), SIMDE_FLOAT32_C( 509.85),
SIMDE_FLOAT32_C( -762.94), SIMDE_FLOAT32_C( -610.66), SIMDE_FLOAT32_C( -278.26), SIMDE_FLOAT32_C( 571.59),
SIMDE_FLOAT32_C( -698.60), SIMDE_FLOAT32_C( 66.97), SIMDE_FLOAT32_C( 404.01), SIMDE_FLOAT32_C( -382.91),
SIMDE_FLOAT32_C( -808.74), SIMDE_FLOAT32_C( 383.72), SIMDE_FLOAT32_C( 58.06), SIMDE_FLOAT32_C( -462.73) },
{ SIMDE_FLOAT32_C( -559.02), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 827.80), SIMDE_FLOAT32_C( 553.84), SIMDE_FLOAT32_C( -702.08),
SIMDE_FLOAT32_C( 943.96), SIMDE_FLOAT32_C( -619.45), SIMDE_FLOAT32_C( -617.57), SIMDE_FLOAT32_C( 132.09),
SIMDE_FLOAT32_C( 914.75), SIMDE_FLOAT32_C( -571.13), SIMDE_FLOAT32_C( 684.78), SIMDE_FLOAT32_C( 888.84) } },
{ { SIMDE_FLOAT32_C( 754.21), SIMDE_FLOAT32_C( -229.44), SIMDE_FLOAT32_C( -976.87), SIMDE_FLOAT32_C( 582.01),
SIMDE_FLOAT32_C( -675.60), SIMDE_FLOAT32_C( -678.95), SIMDE_FLOAT32_C( 525.97), SIMDE_FLOAT32_C( -295.05),
SIMDE_FLOAT32_C( -296.52), SIMDE_FLOAT32_C( -341.94), SIMDE_FLOAT32_C( -380.30), SIMDE_FLOAT32_C( 132.35),
SIMDE_FLOAT32_C( -657.15), SIMDE_FLOAT32_C( -491.46), SIMDE_FLOAT32_C( 10.23), SIMDE_FLOAT32_C( -667.22) },
UINT8_C(254),
{ SIMDE_FLOAT32_C( -559.43), SIMDE_FLOAT32_C( 842.63), SIMDE_FLOAT32_C( 885.25), SIMDE_FLOAT32_C( -170.09),
SIMDE_FLOAT32_C( -435.64), SIMDE_FLOAT32_C( 456.84), SIMDE_FLOAT32_C( 131.32), SIMDE_FLOAT32_C( 631.33),
SIMDE_FLOAT32_C( -139.15), SIMDE_FLOAT32_C( 748.40), SIMDE_FLOAT32_C( 822.59), SIMDE_FLOAT32_C( -755.43),
SIMDE_FLOAT32_C( -193.54), SIMDE_FLOAT32_C( -640.14), SIMDE_FLOAT32_C( 998.78), SIMDE_FLOAT32_C( 577.02) },
{ SIMDE_FLOAT32_C( 754.21), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -296.52), SIMDE_FLOAT32_C( -341.94), SIMDE_FLOAT32_C( -380.30), SIMDE_FLOAT32_C( 132.35),
SIMDE_FLOAT32_C( -657.15), SIMDE_FLOAT32_C( -491.46), SIMDE_FLOAT32_C( 10.23), SIMDE_FLOAT32_C( -667.22) } },
{ { SIMDE_FLOAT32_C( -617.01), SIMDE_FLOAT32_C( 580.79), SIMDE_FLOAT32_C( 901.43), SIMDE_FLOAT32_C( -295.96),
SIMDE_FLOAT32_C( 106.76), SIMDE_FLOAT32_C( -393.62), SIMDE_FLOAT32_C( 407.52), SIMDE_FLOAT32_C( 764.82),
SIMDE_FLOAT32_C( 226.07), SIMDE_FLOAT32_C( -460.13), SIMDE_FLOAT32_C( -892.33), SIMDE_FLOAT32_C( 734.61),
SIMDE_FLOAT32_C( 550.10), SIMDE_FLOAT32_C( -559.55), SIMDE_FLOAT32_C( 382.81), SIMDE_FLOAT32_C( 990.67) },
UINT8_C( 97),
{ SIMDE_FLOAT32_C( 268.05), SIMDE_FLOAT32_C( -179.42), SIMDE_FLOAT32_C( -152.56), SIMDE_FLOAT32_C( -275.11),
SIMDE_FLOAT32_C( 951.90), SIMDE_FLOAT32_C( -521.22), SIMDE_FLOAT32_C( 585.74), SIMDE_FLOAT32_C( 700.30),
SIMDE_FLOAT32_C( -698.63), SIMDE_FLOAT32_C( 830.31), SIMDE_FLOAT32_C( -493.24), SIMDE_FLOAT32_C( -338.77),
SIMDE_FLOAT32_C( 829.08), SIMDE_FLOAT32_C( -916.21), SIMDE_FLOAT32_C( 44.23), SIMDE_FLOAT32_C( 409.87) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 580.79), SIMDE_FLOAT32_C( 901.43), SIMDE_FLOAT32_C( -295.96),
SIMDE_FLOAT32_C( 106.76), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 764.82),
SIMDE_FLOAT32_C( 226.07), SIMDE_FLOAT32_C( -460.13), SIMDE_FLOAT32_C( -892.33), SIMDE_FLOAT32_C( 734.61),
SIMDE_FLOAT32_C( 550.10), SIMDE_FLOAT32_C( -559.55), SIMDE_FLOAT32_C( 382.81), SIMDE_FLOAT32_C( 990.67) } },
{ { SIMDE_FLOAT32_C( 985.22), SIMDE_FLOAT32_C( 748.27), SIMDE_FLOAT32_C( -483.37), SIMDE_FLOAT32_C( -408.41),
SIMDE_FLOAT32_C( 155.79), SIMDE_FLOAT32_C( -718.54), SIMDE_FLOAT32_C( 817.67), SIMDE_FLOAT32_C( 695.66),
SIMDE_FLOAT32_C( -610.87), SIMDE_FLOAT32_C( 552.28), SIMDE_FLOAT32_C( 245.77), SIMDE_FLOAT32_C( -170.42),
SIMDE_FLOAT32_C( -64.91), SIMDE_FLOAT32_C( 236.44), SIMDE_FLOAT32_C( 112.66), SIMDE_FLOAT32_C( -796.86) },
UINT8_C(153),
{ SIMDE_FLOAT32_C( 960.10), SIMDE_FLOAT32_C( -71.97), SIMDE_FLOAT32_C( -991.08), SIMDE_FLOAT32_C( -561.12),
SIMDE_FLOAT32_C( -486.23), SIMDE_FLOAT32_C( 709.22), SIMDE_FLOAT32_C( -259.75), SIMDE_FLOAT32_C( -655.92),
SIMDE_FLOAT32_C( -784.01), SIMDE_FLOAT32_C( 401.48), SIMDE_FLOAT32_C( -826.84), SIMDE_FLOAT32_C( -700.22),
SIMDE_FLOAT32_C( -554.30), SIMDE_FLOAT32_C( 583.03), SIMDE_FLOAT32_C( -715.01), SIMDE_FLOAT32_C( -806.03) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 748.27), SIMDE_FLOAT32_C( -483.37), SIMDE_FLOAT32_C( -0.00),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -718.54), SIMDE_FLOAT32_C( 817.67), SIMDE_FLOAT32_C( -0.00),
SIMDE_FLOAT32_C( -610.87), SIMDE_FLOAT32_C( 552.28), SIMDE_FLOAT32_C( 245.77), SIMDE_FLOAT32_C( -170.42),
SIMDE_FLOAT32_C( -64.91), SIMDE_FLOAT32_C( 236.44), SIMDE_FLOAT32_C( 112.66), SIMDE_FLOAT32_C( -796.86) } },
{ { SIMDE_FLOAT32_C( -900.34), SIMDE_FLOAT32_C( -123.41), SIMDE_FLOAT32_C( 349.77), SIMDE_FLOAT32_C( -618.88),
SIMDE_FLOAT32_C( -305.75), SIMDE_FLOAT32_C( 45.43), SIMDE_FLOAT32_C( -229.75), SIMDE_FLOAT32_C( -753.47),
SIMDE_FLOAT32_C( -708.80), SIMDE_FLOAT32_C( 599.82), SIMDE_FLOAT32_C( 181.62), SIMDE_FLOAT32_C( 527.63),
SIMDE_FLOAT32_C( -287.52), SIMDE_FLOAT32_C( 384.76), SIMDE_FLOAT32_C( 584.65), SIMDE_FLOAT32_C( -327.41) },
UINT8_C( 60),
{ SIMDE_FLOAT32_C( 593.57), SIMDE_FLOAT32_C( 111.46), SIMDE_FLOAT32_C( -173.43), SIMDE_FLOAT32_C( 302.80),
SIMDE_FLOAT32_C( 851.71), SIMDE_FLOAT32_C( 170.65), SIMDE_FLOAT32_C( 518.78), SIMDE_FLOAT32_C( 253.19),
SIMDE_FLOAT32_C( 343.82), SIMDE_FLOAT32_C( 818.56), SIMDE_FLOAT32_C( 698.89), SIMDE_FLOAT32_C( -73.15),
SIMDE_FLOAT32_C( -896.45), SIMDE_FLOAT32_C( 892.87), SIMDE_FLOAT32_C( 26.51), SIMDE_FLOAT32_C( -19.86) },
{ SIMDE_FLOAT32_C( -900.34), SIMDE_FLOAT32_C( -123.41), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -229.75), SIMDE_FLOAT32_C( -753.47),
SIMDE_FLOAT32_C( -708.80), SIMDE_FLOAT32_C( 599.82), SIMDE_FLOAT32_C( 181.62), SIMDE_FLOAT32_C( 527.63),
SIMDE_FLOAT32_C( -287.52), SIMDE_FLOAT32_C( 384.76), SIMDE_FLOAT32_C( 584.65), SIMDE_FLOAT32_C( -327.41) } },
{ { SIMDE_FLOAT32_C( 242.63), SIMDE_FLOAT32_C( 407.63), SIMDE_FLOAT32_C( 674.39), SIMDE_FLOAT32_C( -711.94),
SIMDE_FLOAT32_C( -822.12), SIMDE_FLOAT32_C( 920.93), SIMDE_FLOAT32_C( -420.74), SIMDE_FLOAT32_C( 777.70),
SIMDE_FLOAT32_C( 102.55), SIMDE_FLOAT32_C( -893.11), SIMDE_FLOAT32_C( -509.82), SIMDE_FLOAT32_C( -512.69),
SIMDE_FLOAT32_C( 691.54), SIMDE_FLOAT32_C( 162.77), SIMDE_FLOAT32_C( -199.89), SIMDE_FLOAT32_C( 285.12) },
UINT8_C( 58),
{ SIMDE_FLOAT32_C( 626.68), SIMDE_FLOAT32_C( -412.08), SIMDE_FLOAT32_C( -874.05), SIMDE_FLOAT32_C( -202.66),
SIMDE_FLOAT32_C( -893.30), SIMDE_FLOAT32_C( 379.14), SIMDE_FLOAT32_C( -858.85), SIMDE_FLOAT32_C( 925.26),
SIMDE_FLOAT32_C( 78.03), SIMDE_FLOAT32_C( 68.00), SIMDE_FLOAT32_C( -971.19), SIMDE_FLOAT32_C( -29.10),
SIMDE_FLOAT32_C( -905.49), SIMDE_FLOAT32_C( 8.95), SIMDE_FLOAT32_C( -786.47), SIMDE_FLOAT32_C( 502.14) },
{ SIMDE_FLOAT32_C( 242.63), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 674.39), SIMDE_FLOAT32_C( -0.00),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -420.74), SIMDE_FLOAT32_C( 777.70),
SIMDE_FLOAT32_C( 102.55), SIMDE_FLOAT32_C( -893.11), SIMDE_FLOAT32_C( -509.82), SIMDE_FLOAT32_C( -512.69),
SIMDE_FLOAT32_C( 691.54), SIMDE_FLOAT32_C( 162.77), SIMDE_FLOAT32_C( -199.89), SIMDE_FLOAT32_C( 285.12) } },
{ { SIMDE_FLOAT32_C( -316.66), SIMDE_FLOAT32_C( -498.40), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( -395.73),
SIMDE_FLOAT32_C( 80.86), SIMDE_FLOAT32_C( 457.72), SIMDE_FLOAT32_C( 706.82), SIMDE_FLOAT32_C( 187.75),
SIMDE_FLOAT32_C( 947.90), SIMDE_FLOAT32_C( -805.87), SIMDE_FLOAT32_C( -120.71), SIMDE_FLOAT32_C( 110.67),
SIMDE_FLOAT32_C( -5.76), SIMDE_FLOAT32_C( -835.59), SIMDE_FLOAT32_C( 384.91), SIMDE_FLOAT32_C( -379.07) },
UINT8_C(169),
{ SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 418.26), SIMDE_FLOAT32_C( -140.98), SIMDE_FLOAT32_C( -110.01),
SIMDE_FLOAT32_C( 559.41), SIMDE_FLOAT32_C( -215.72), SIMDE_FLOAT32_C( 968.02), SIMDE_FLOAT32_C( -372.59),
SIMDE_FLOAT32_C( -186.90), SIMDE_FLOAT32_C( -61.08), SIMDE_FLOAT32_C( -278.08), SIMDE_FLOAT32_C( 822.05),
SIMDE_FLOAT32_C( 152.45), SIMDE_FLOAT32_C( -775.94), SIMDE_FLOAT32_C( -494.61), SIMDE_FLOAT32_C( 654.05) },
{ SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -498.40), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( 80.86), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 706.82), SIMDE_FLOAT32_C( -0.00),
SIMDE_FLOAT32_C( 947.90), SIMDE_FLOAT32_C( -805.87), SIMDE_FLOAT32_C( -120.71), SIMDE_FLOAT32_C( 110.67),
SIMDE_FLOAT32_C( -5.76), SIMDE_FLOAT32_C( -835.59), SIMDE_FLOAT32_C( 384.91), SIMDE_FLOAT32_C( -379.07) } },
{ { SIMDE_FLOAT32_C( 904.08), SIMDE_FLOAT32_C( 109.66), SIMDE_FLOAT32_C( -265.09), SIMDE_FLOAT32_C( 361.80),
SIMDE_FLOAT32_C( -183.52), SIMDE_FLOAT32_C( 922.65), SIMDE_FLOAT32_C( 309.70), SIMDE_FLOAT32_C( 10.61),
SIMDE_FLOAT32_C( -198.06), SIMDE_FLOAT32_C( -579.63), SIMDE_FLOAT32_C( -995.15), SIMDE_FLOAT32_C( -33.65),
SIMDE_FLOAT32_C( 805.28), SIMDE_FLOAT32_C( -374.23), SIMDE_FLOAT32_C( 718.68), SIMDE_FLOAT32_C( 316.13) },
UINT8_C(232),
{ SIMDE_FLOAT32_C( -422.30), SIMDE_FLOAT32_C( -793.87), SIMDE_FLOAT32_C( 603.45), SIMDE_FLOAT32_C( 361.98),
SIMDE_FLOAT32_C( -825.85), SIMDE_FLOAT32_C( -769.14), SIMDE_FLOAT32_C( -824.92), SIMDE_FLOAT32_C( 113.07),
SIMDE_FLOAT32_C( -47.22), SIMDE_FLOAT32_C( 997.13), SIMDE_FLOAT32_C( -734.48), SIMDE_FLOAT32_C( 176.84),
SIMDE_FLOAT32_C( -497.48), SIMDE_FLOAT32_C( 919.57), SIMDE_FLOAT32_C( 80.93), SIMDE_FLOAT32_C( 612.18) },
{ SIMDE_FLOAT32_C( 904.08), SIMDE_FLOAT32_C( 109.66), SIMDE_FLOAT32_C( -265.09), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -183.52), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( -198.06), SIMDE_FLOAT32_C( -579.63), SIMDE_FLOAT32_C( -995.15), SIMDE_FLOAT32_C( -33.65),
SIMDE_FLOAT32_C( 805.28), SIMDE_FLOAT32_C( -374.23), SIMDE_FLOAT32_C( 718.68), SIMDE_FLOAT32_C( 316.13) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_recip_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_recip_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 428.72), SIMDE_FLOAT64_C( -458.86), SIMDE_FLOAT64_C( 806.54), SIMDE_FLOAT64_C( 539.23),
SIMDE_FLOAT64_C( -146.88), SIMDE_FLOAT64_C( 637.59), SIMDE_FLOAT64_C( 196.11), SIMDE_FLOAT64_C( -116.19) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( -0.01) } },
{ { SIMDE_FLOAT64_C( 736.77), SIMDE_FLOAT64_C( -342.16), SIMDE_FLOAT64_C( -904.30), SIMDE_FLOAT64_C( 476.08),
SIMDE_FLOAT64_C( 944.13), SIMDE_FLOAT64_C( 149.78), SIMDE_FLOAT64_C( -235.14), SIMDE_FLOAT64_C( 736.57) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -510.10), SIMDE_FLOAT64_C( 107.44), SIMDE_FLOAT64_C( -102.43), SIMDE_FLOAT64_C( 808.81),
SIMDE_FLOAT64_C( 777.98), SIMDE_FLOAT64_C( -457.12), SIMDE_FLOAT64_C( -403.55), SIMDE_FLOAT64_C( -682.37) },
{ SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -0.00) } },
{ { SIMDE_FLOAT64_C( -420.25), SIMDE_FLOAT64_C( 346.45), SIMDE_FLOAT64_C( 923.73), SIMDE_FLOAT64_C( -651.25),
SIMDE_FLOAT64_C( 204.13), SIMDE_FLOAT64_C( 115.66), SIMDE_FLOAT64_C( -627.27), SIMDE_FLOAT64_C( -367.15) },
{ SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -0.00) } },
{ { SIMDE_FLOAT64_C( 656.80), SIMDE_FLOAT64_C( -820.73), SIMDE_FLOAT64_C( -827.92), SIMDE_FLOAT64_C( -490.07),
SIMDE_FLOAT64_C( 816.86), SIMDE_FLOAT64_C( 368.19), SIMDE_FLOAT64_C( 393.74), SIMDE_FLOAT64_C( 553.62) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -973.97), SIMDE_FLOAT64_C( 489.44), SIMDE_FLOAT64_C( 29.71), SIMDE_FLOAT64_C( 970.16),
SIMDE_FLOAT64_C( -360.78), SIMDE_FLOAT64_C( 794.57), SIMDE_FLOAT64_C( 706.74), SIMDE_FLOAT64_C( 129.11) },
{ SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.01) } },
{ { SIMDE_FLOAT64_C( -97.99), SIMDE_FLOAT64_C( -395.69), SIMDE_FLOAT64_C( -62.07), SIMDE_FLOAT64_C( -320.01),
SIMDE_FLOAT64_C( 147.19), SIMDE_FLOAT64_C( 534.38), SIMDE_FLOAT64_C( -2.39), SIMDE_FLOAT64_C( 726.95) },
{ SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -0.02), SIMDE_FLOAT64_C( -0.00),
SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( -119.17), SIMDE_FLOAT64_C( -78.65), SIMDE_FLOAT64_C( -924.30), SIMDE_FLOAT64_C( -915.04),
SIMDE_FLOAT64_C( -962.99), SIMDE_FLOAT64_C( -551.57), SIMDE_FLOAT64_C( -282.19), SIMDE_FLOAT64_C( 693.81) },
{ SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -0.00),
SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_recip_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_recip_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 209.25), SIMDE_FLOAT64_C( -726.84), SIMDE_FLOAT64_C( -123.44), SIMDE_FLOAT64_C( 592.78),
SIMDE_FLOAT64_C( -139.26), SIMDE_FLOAT64_C( -313.25), SIMDE_FLOAT64_C( 562.79), SIMDE_FLOAT64_C( -134.44) },
UINT8_C(203),
{ SIMDE_FLOAT64_C( 624.55), SIMDE_FLOAT64_C( -863.70), SIMDE_FLOAT64_C( 788.13), SIMDE_FLOAT64_C( 415.51),
SIMDE_FLOAT64_C( -772.51), SIMDE_FLOAT64_C( -934.49), SIMDE_FLOAT64_C( -140.87), SIMDE_FLOAT64_C( -265.50) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -123.44), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( -139.26), SIMDE_FLOAT64_C( -313.25), SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( -0.00) } },
{ { SIMDE_FLOAT64_C( 420.64), SIMDE_FLOAT64_C( -690.14), SIMDE_FLOAT64_C( -96.93), SIMDE_FLOAT64_C( -275.78),
SIMDE_FLOAT64_C( -453.21), SIMDE_FLOAT64_C( 875.20), SIMDE_FLOAT64_C( 895.34), SIMDE_FLOAT64_C( -766.82) },
UINT8_C(181),
{ SIMDE_FLOAT64_C( 503.15), SIMDE_FLOAT64_C( 966.97), SIMDE_FLOAT64_C( 164.84), SIMDE_FLOAT64_C( -672.96),
SIMDE_FLOAT64_C( 332.40), SIMDE_FLOAT64_C( -625.91), SIMDE_FLOAT64_C( -399.81), SIMDE_FLOAT64_C( -791.04) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -690.14), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( -275.78),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 895.34), SIMDE_FLOAT64_C( -0.00) } },
{ { SIMDE_FLOAT64_C( 966.87), SIMDE_FLOAT64_C( 460.94), SIMDE_FLOAT64_C( -104.29), SIMDE_FLOAT64_C( 529.67),
SIMDE_FLOAT64_C( -673.50), SIMDE_FLOAT64_C( 637.76), SIMDE_FLOAT64_C( 154.22), SIMDE_FLOAT64_C( -537.20) },
UINT8_C( 88),
{ SIMDE_FLOAT64_C( -430.27), SIMDE_FLOAT64_C( -309.71), SIMDE_FLOAT64_C( 491.40), SIMDE_FLOAT64_C( 428.86),
SIMDE_FLOAT64_C( 424.79), SIMDE_FLOAT64_C( -87.96), SIMDE_FLOAT64_C( 738.72), SIMDE_FLOAT64_C( -672.13) },
{ SIMDE_FLOAT64_C( 966.87), SIMDE_FLOAT64_C( 460.94), SIMDE_FLOAT64_C( -104.29), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 637.76), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -537.20) } },
{ { SIMDE_FLOAT64_C( 636.26), SIMDE_FLOAT64_C( -714.50), SIMDE_FLOAT64_C( -796.93), SIMDE_FLOAT64_C( 531.61),
SIMDE_FLOAT64_C( -481.32), SIMDE_FLOAT64_C( -374.02), SIMDE_FLOAT64_C( 34.75), SIMDE_FLOAT64_C( -514.35) },
UINT8_C(120),
{ SIMDE_FLOAT64_C( 361.79), SIMDE_FLOAT64_C( 818.05), SIMDE_FLOAT64_C( -835.08), SIMDE_FLOAT64_C( 961.98),
SIMDE_FLOAT64_C( -973.00), SIMDE_FLOAT64_C( -868.21), SIMDE_FLOAT64_C( 422.92), SIMDE_FLOAT64_C( -77.29) },
{ SIMDE_FLOAT64_C( 636.26), SIMDE_FLOAT64_C( -714.50), SIMDE_FLOAT64_C( -796.93), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -514.35) } },
{ { SIMDE_FLOAT64_C( 661.46), SIMDE_FLOAT64_C( 749.42), SIMDE_FLOAT64_C( -439.53), SIMDE_FLOAT64_C( -184.33),
SIMDE_FLOAT64_C( -787.78), SIMDE_FLOAT64_C( 986.36), SIMDE_FLOAT64_C( 385.40), SIMDE_FLOAT64_C( -97.48) },
UINT8_C(166),
{ SIMDE_FLOAT64_C( -185.74), SIMDE_FLOAT64_C( -672.69), SIMDE_FLOAT64_C( -610.20), SIMDE_FLOAT64_C( -447.03),
SIMDE_FLOAT64_C( -344.82), SIMDE_FLOAT64_C( -973.94), SIMDE_FLOAT64_C( -161.52), SIMDE_FLOAT64_C( -141.75) },
{ SIMDE_FLOAT64_C( 661.46), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -184.33),
SIMDE_FLOAT64_C( -787.78), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 385.40), SIMDE_FLOAT64_C( -0.01) } },
{ { SIMDE_FLOAT64_C( 557.67), SIMDE_FLOAT64_C( 357.15), SIMDE_FLOAT64_C( 484.23), SIMDE_FLOAT64_C( -407.58),
SIMDE_FLOAT64_C( 842.80), SIMDE_FLOAT64_C( 275.05), SIMDE_FLOAT64_C( 954.21), SIMDE_FLOAT64_C( 660.85) },
UINT8_C( 53),
{ SIMDE_FLOAT64_C( 916.20), SIMDE_FLOAT64_C( 687.85), SIMDE_FLOAT64_C( 571.76), SIMDE_FLOAT64_C( 339.11),
SIMDE_FLOAT64_C( -389.44), SIMDE_FLOAT64_C( 233.22), SIMDE_FLOAT64_C( 88.53), SIMDE_FLOAT64_C( 171.03) },
{ SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 357.15), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( -407.58),
SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 954.21), SIMDE_FLOAT64_C( 660.85) } },
{ { SIMDE_FLOAT64_C( -951.11), SIMDE_FLOAT64_C( 300.76), SIMDE_FLOAT64_C( 157.39), SIMDE_FLOAT64_C( 434.29),
SIMDE_FLOAT64_C( -796.73), SIMDE_FLOAT64_C( -364.85), SIMDE_FLOAT64_C( -751.45), SIMDE_FLOAT64_C( -469.41) },
UINT8_C(211),
{ SIMDE_FLOAT64_C( -198.47), SIMDE_FLOAT64_C( 185.77), SIMDE_FLOAT64_C( 51.02), SIMDE_FLOAT64_C( 640.00),
SIMDE_FLOAT64_C( -955.99), SIMDE_FLOAT64_C( -391.31), SIMDE_FLOAT64_C( -2.84), SIMDE_FLOAT64_C( 528.24) },
{ SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 157.39), SIMDE_FLOAT64_C( 434.29),
SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -364.85), SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( 0.00) } },
{ { SIMDE_FLOAT64_C( 201.11), SIMDE_FLOAT64_C( -160.04), SIMDE_FLOAT64_C( -196.70), SIMDE_FLOAT64_C( 155.32),
SIMDE_FLOAT64_C( -499.19), SIMDE_FLOAT64_C( -756.73), SIMDE_FLOAT64_C( 71.52), SIMDE_FLOAT64_C( -811.33) },
UINT8_C(173),
{ SIMDE_FLOAT64_C( -589.37), SIMDE_FLOAT64_C( -200.77), SIMDE_FLOAT64_C( 48.24), SIMDE_FLOAT64_C( 499.16),
SIMDE_FLOAT64_C( 970.26), SIMDE_FLOAT64_C( 97.13), SIMDE_FLOAT64_C( -200.08), SIMDE_FLOAT64_C( 127.65) },
{ SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( -160.04), SIMDE_FLOAT64_C( 0.02), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( -499.19), SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( 71.52), SIMDE_FLOAT64_C( 0.01) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_recip_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_rint_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -665.69), SIMDE_FLOAT32_C( -529.73), SIMDE_FLOAT32_C( -462.47), SIMDE_FLOAT32_C( 909.14),
SIMDE_FLOAT32_C( 211.54), SIMDE_FLOAT32_C( 67.95), SIMDE_FLOAT32_C( -26.51), SIMDE_FLOAT32_C( -276.52),
SIMDE_FLOAT32_C( 812.99), SIMDE_FLOAT32_C( 513.31), SIMDE_FLOAT32_C( -214.67), SIMDE_FLOAT32_C( 502.05),
SIMDE_FLOAT32_C( 96.51), SIMDE_FLOAT32_C( -399.31), SIMDE_FLOAT32_C( 783.78), SIMDE_FLOAT32_C( -69.17) },
{ SIMDE_FLOAT32_C( -666.00), SIMDE_FLOAT32_C( -530.00), SIMDE_FLOAT32_C( -462.00), SIMDE_FLOAT32_C( 909.00),
SIMDE_FLOAT32_C( 212.00), SIMDE_FLOAT32_C( 68.00), SIMDE_FLOAT32_C( -27.00), SIMDE_FLOAT32_C( -277.00),
SIMDE_FLOAT32_C( 813.00), SIMDE_FLOAT32_C( 513.00), SIMDE_FLOAT32_C( -215.00), SIMDE_FLOAT32_C( 502.00),
SIMDE_FLOAT32_C( 97.00), SIMDE_FLOAT32_C( -399.00), SIMDE_FLOAT32_C( 784.00), SIMDE_FLOAT32_C( -69.00) } },
{ { SIMDE_FLOAT32_C( -445.96), SIMDE_FLOAT32_C( 637.70), SIMDE_FLOAT32_C( 890.97), SIMDE_FLOAT32_C( -578.19),
SIMDE_FLOAT32_C( 730.74), SIMDE_FLOAT32_C( -499.66), SIMDE_FLOAT32_C( -463.47), SIMDE_FLOAT32_C( -93.74),
SIMDE_FLOAT32_C( -617.08), SIMDE_FLOAT32_C( -340.40), SIMDE_FLOAT32_C( -933.85), SIMDE_FLOAT32_C( 901.57),
SIMDE_FLOAT32_C( 629.93), SIMDE_FLOAT32_C( 901.12), SIMDE_FLOAT32_C( 755.15), SIMDE_FLOAT32_C( 964.24) },
{ SIMDE_FLOAT32_C( -446.00), SIMDE_FLOAT32_C( 638.00), SIMDE_FLOAT32_C( 891.00), SIMDE_FLOAT32_C( -578.00),
SIMDE_FLOAT32_C( 731.00), SIMDE_FLOAT32_C( -500.00), SIMDE_FLOAT32_C( -463.00), SIMDE_FLOAT32_C( -94.00),
SIMDE_FLOAT32_C( -617.00), SIMDE_FLOAT32_C( -340.00), SIMDE_FLOAT32_C( -934.00), SIMDE_FLOAT32_C( 902.00),
SIMDE_FLOAT32_C( 630.00), SIMDE_FLOAT32_C( 901.00), SIMDE_FLOAT32_C( 755.00), SIMDE_FLOAT32_C( 964.00) } },
{ { SIMDE_FLOAT32_C( -628.61), SIMDE_FLOAT32_C( -707.33), SIMDE_FLOAT32_C( 873.38), SIMDE_FLOAT32_C( 582.93),
SIMDE_FLOAT32_C( 360.62), SIMDE_FLOAT32_C( -153.12), SIMDE_FLOAT32_C( -693.59), SIMDE_FLOAT32_C( 173.61),
SIMDE_FLOAT32_C( -639.82), SIMDE_FLOAT32_C( 91.74), SIMDE_FLOAT32_C( -324.34), SIMDE_FLOAT32_C( 456.69),
SIMDE_FLOAT32_C( 692.43), SIMDE_FLOAT32_C( -540.56), SIMDE_FLOAT32_C( -612.48), SIMDE_FLOAT32_C( -753.53) },
{ SIMDE_FLOAT32_C( -629.00), SIMDE_FLOAT32_C( -707.00), SIMDE_FLOAT32_C( 873.00), SIMDE_FLOAT32_C( 583.00),
SIMDE_FLOAT32_C( 361.00), SIMDE_FLOAT32_C( -153.00), SIMDE_FLOAT32_C( -694.00), SIMDE_FLOAT32_C( 174.00),
SIMDE_FLOAT32_C( -640.00), SIMDE_FLOAT32_C( 92.00), SIMDE_FLOAT32_C( -324.00), SIMDE_FLOAT32_C( 457.00),
SIMDE_FLOAT32_C( 692.00), SIMDE_FLOAT32_C( -541.00), SIMDE_FLOAT32_C( -612.00), SIMDE_FLOAT32_C( -754.00) } },
{ { SIMDE_FLOAT32_C( -902.86), SIMDE_FLOAT32_C( -721.51), SIMDE_FLOAT32_C( -331.72), SIMDE_FLOAT32_C( 827.88),
SIMDE_FLOAT32_C( -221.17), SIMDE_FLOAT32_C( 204.81), SIMDE_FLOAT32_C( -265.86), SIMDE_FLOAT32_C( 161.75),
SIMDE_FLOAT32_C( 864.41), SIMDE_FLOAT32_C( -199.71), SIMDE_FLOAT32_C( 63.32), SIMDE_FLOAT32_C( 494.34),
SIMDE_FLOAT32_C( -298.59), SIMDE_FLOAT32_C( -181.53), SIMDE_FLOAT32_C( 458.58), SIMDE_FLOAT32_C( 72.80) },
{ SIMDE_FLOAT32_C( -903.00), SIMDE_FLOAT32_C( -722.00), SIMDE_FLOAT32_C( -332.00), SIMDE_FLOAT32_C( 828.00),
SIMDE_FLOAT32_C( -221.00), SIMDE_FLOAT32_C( 205.00), SIMDE_FLOAT32_C( -266.00), SIMDE_FLOAT32_C( 162.00),
SIMDE_FLOAT32_C( 864.00), SIMDE_FLOAT32_C( -200.00), SIMDE_FLOAT32_C( 63.00), SIMDE_FLOAT32_C( 494.00),
SIMDE_FLOAT32_C( -299.00), SIMDE_FLOAT32_C( -182.00), SIMDE_FLOAT32_C( 459.00), SIMDE_FLOAT32_C( 73.00) } },
{ { SIMDE_FLOAT32_C( 111.14), SIMDE_FLOAT32_C( 331.96), SIMDE_FLOAT32_C( -344.27), SIMDE_FLOAT32_C( -528.24),
SIMDE_FLOAT32_C( -821.17), SIMDE_FLOAT32_C( -37.86), SIMDE_FLOAT32_C( 645.37), SIMDE_FLOAT32_C( -460.98),
SIMDE_FLOAT32_C( -946.11), SIMDE_FLOAT32_C( -678.97), SIMDE_FLOAT32_C( 995.71), SIMDE_FLOAT32_C( 746.32),
SIMDE_FLOAT32_C( -219.53), SIMDE_FLOAT32_C( -616.77), SIMDE_FLOAT32_C( 992.79), SIMDE_FLOAT32_C( -122.39) },
{ SIMDE_FLOAT32_C( 111.00), SIMDE_FLOAT32_C( 332.00), SIMDE_FLOAT32_C( -344.00), SIMDE_FLOAT32_C( -528.00),
SIMDE_FLOAT32_C( -821.00), SIMDE_FLOAT32_C( -38.00), SIMDE_FLOAT32_C( 645.00), SIMDE_FLOAT32_C( -461.00),
SIMDE_FLOAT32_C( -946.00), SIMDE_FLOAT32_C( -679.00), SIMDE_FLOAT32_C( 996.00), SIMDE_FLOAT32_C( 746.00),
SIMDE_FLOAT32_C( -220.00), SIMDE_FLOAT32_C( -617.00), SIMDE_FLOAT32_C( 993.00), SIMDE_FLOAT32_C( -122.00) } },
{ { SIMDE_FLOAT32_C( -338.27), SIMDE_FLOAT32_C( -338.93), SIMDE_FLOAT32_C( -294.51), SIMDE_FLOAT32_C( 440.55),
SIMDE_FLOAT32_C( 865.88), SIMDE_FLOAT32_C( 439.63), SIMDE_FLOAT32_C( -397.70), SIMDE_FLOAT32_C( 730.29),
SIMDE_FLOAT32_C( -760.09), SIMDE_FLOAT32_C( 665.63), SIMDE_FLOAT32_C( 224.63), SIMDE_FLOAT32_C( -58.68),
SIMDE_FLOAT32_C( -515.91), SIMDE_FLOAT32_C( -316.80), SIMDE_FLOAT32_C( -985.88), SIMDE_FLOAT32_C( 595.23) },
{ SIMDE_FLOAT32_C( -338.00), SIMDE_FLOAT32_C( -339.00), SIMDE_FLOAT32_C( -295.00), SIMDE_FLOAT32_C( 441.00),
SIMDE_FLOAT32_C( 866.00), SIMDE_FLOAT32_C( 440.00), SIMDE_FLOAT32_C( -398.00), SIMDE_FLOAT32_C( 730.00),
SIMDE_FLOAT32_C( -760.00), SIMDE_FLOAT32_C( 666.00), SIMDE_FLOAT32_C( 225.00), SIMDE_FLOAT32_C( -59.00),
SIMDE_FLOAT32_C( -516.00), SIMDE_FLOAT32_C( -317.00), SIMDE_FLOAT32_C( -986.00), SIMDE_FLOAT32_C( 595.00) } },
{ { SIMDE_FLOAT32_C( -984.84), SIMDE_FLOAT32_C( -330.15), SIMDE_FLOAT32_C( -933.01), SIMDE_FLOAT32_C( -806.00),
SIMDE_FLOAT32_C( 632.00), SIMDE_FLOAT32_C( 712.36), SIMDE_FLOAT32_C( -266.98), SIMDE_FLOAT32_C( 685.88),
SIMDE_FLOAT32_C( -966.61), SIMDE_FLOAT32_C( -271.27), SIMDE_FLOAT32_C( 432.20), SIMDE_FLOAT32_C( -186.14),
SIMDE_FLOAT32_C( 111.96), SIMDE_FLOAT32_C( 424.99), SIMDE_FLOAT32_C( 691.48), SIMDE_FLOAT32_C( 773.69) },
{ SIMDE_FLOAT32_C( -985.00), SIMDE_FLOAT32_C( -330.00), SIMDE_FLOAT32_C( -933.00), SIMDE_FLOAT32_C( -806.00),
SIMDE_FLOAT32_C( 632.00), SIMDE_FLOAT32_C( 712.00), SIMDE_FLOAT32_C( -267.00), SIMDE_FLOAT32_C( 686.00),
SIMDE_FLOAT32_C( -967.00), SIMDE_FLOAT32_C( -271.00), SIMDE_FLOAT32_C( 432.00), SIMDE_FLOAT32_C( -186.00),
SIMDE_FLOAT32_C( 112.00), SIMDE_FLOAT32_C( 425.00), SIMDE_FLOAT32_C( 691.00), SIMDE_FLOAT32_C( 774.00) } },
{ { SIMDE_FLOAT32_C( -913.94), SIMDE_FLOAT32_C( -603.03), SIMDE_FLOAT32_C( 214.24), SIMDE_FLOAT32_C( 951.94),
SIMDE_FLOAT32_C( 836.60), SIMDE_FLOAT32_C( 816.55), SIMDE_FLOAT32_C( 682.23), SIMDE_FLOAT32_C( -923.49),
SIMDE_FLOAT32_C( 482.17), SIMDE_FLOAT32_C( -93.14), SIMDE_FLOAT32_C( 17.84), SIMDE_FLOAT32_C( 966.27),
SIMDE_FLOAT32_C( 590.07), SIMDE_FLOAT32_C( 31.96), SIMDE_FLOAT32_C( 561.50), SIMDE_FLOAT32_C( 605.23) },
{ SIMDE_FLOAT32_C( -914.00), SIMDE_FLOAT32_C( -603.00), SIMDE_FLOAT32_C( 214.00), SIMDE_FLOAT32_C( 952.00),
SIMDE_FLOAT32_C( 837.00), SIMDE_FLOAT32_C( 817.00), SIMDE_FLOAT32_C( 682.00), SIMDE_FLOAT32_C( -923.00),
SIMDE_FLOAT32_C( 482.00), SIMDE_FLOAT32_C( -93.00), SIMDE_FLOAT32_C( 18.00), SIMDE_FLOAT32_C( 966.00),
SIMDE_FLOAT32_C( 590.00), SIMDE_FLOAT32_C( 32.00), SIMDE_FLOAT32_C( 562.00), SIMDE_FLOAT32_C( 605.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_rint_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_rint_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -528.78), SIMDE_FLOAT32_C( 785.86), SIMDE_FLOAT32_C( -381.92), SIMDE_FLOAT32_C( -860.14),
SIMDE_FLOAT32_C( 577.18), SIMDE_FLOAT32_C( -21.79), SIMDE_FLOAT32_C( -56.29), SIMDE_FLOAT32_C( -835.30),
SIMDE_FLOAT32_C( 126.46), SIMDE_FLOAT32_C( -806.06), SIMDE_FLOAT32_C( -450.59), SIMDE_FLOAT32_C( -478.17),
SIMDE_FLOAT32_C( -707.43), SIMDE_FLOAT32_C( -543.19), SIMDE_FLOAT32_C( -401.16), SIMDE_FLOAT32_C( -180.42) },
UINT8_C( 91),
{ SIMDE_FLOAT32_C( 923.27), SIMDE_FLOAT32_C( 86.29), SIMDE_FLOAT32_C( 691.94), SIMDE_FLOAT32_C( 293.32),
SIMDE_FLOAT32_C( -23.72), SIMDE_FLOAT32_C( -199.60), SIMDE_FLOAT32_C( 909.94), SIMDE_FLOAT32_C( 715.72),
SIMDE_FLOAT32_C( -312.75), SIMDE_FLOAT32_C( 291.35), SIMDE_FLOAT32_C( -637.29), SIMDE_FLOAT32_C( -832.86),
SIMDE_FLOAT32_C( -939.64), SIMDE_FLOAT32_C( -775.32), SIMDE_FLOAT32_C( -361.64), SIMDE_FLOAT32_C( 846.22) },
{ SIMDE_FLOAT32_C( 923.00), SIMDE_FLOAT32_C( 86.00), SIMDE_FLOAT32_C( -381.92), SIMDE_FLOAT32_C( 293.00),
SIMDE_FLOAT32_C( -24.00), SIMDE_FLOAT32_C( -21.79), SIMDE_FLOAT32_C( 910.00), SIMDE_FLOAT32_C( -835.30),
SIMDE_FLOAT32_C( 126.46), SIMDE_FLOAT32_C( -806.06), SIMDE_FLOAT32_C( -450.59), SIMDE_FLOAT32_C( -478.17),
SIMDE_FLOAT32_C( -707.43), SIMDE_FLOAT32_C( -543.19), SIMDE_FLOAT32_C( -401.16), SIMDE_FLOAT32_C( -180.42) } },
{ { SIMDE_FLOAT32_C( -157.24), SIMDE_FLOAT32_C( -221.78), SIMDE_FLOAT32_C( 423.40), SIMDE_FLOAT32_C( 820.97),
SIMDE_FLOAT32_C( 721.93), SIMDE_FLOAT32_C( 588.10), SIMDE_FLOAT32_C( -52.57), SIMDE_FLOAT32_C( 915.87),
SIMDE_FLOAT32_C( -862.49), SIMDE_FLOAT32_C( 469.26), SIMDE_FLOAT32_C( -791.57), SIMDE_FLOAT32_C( -405.68),
SIMDE_FLOAT32_C( -931.90), SIMDE_FLOAT32_C( 28.01), SIMDE_FLOAT32_C( 16.04), SIMDE_FLOAT32_C( 991.37) },
UINT8_C( 35),
{ SIMDE_FLOAT32_C( -292.02), SIMDE_FLOAT32_C( 284.69), SIMDE_FLOAT32_C( 90.57), SIMDE_FLOAT32_C( 508.38),
SIMDE_FLOAT32_C( 194.63), SIMDE_FLOAT32_C( -193.71), SIMDE_FLOAT32_C( -804.38), SIMDE_FLOAT32_C( -514.01),
SIMDE_FLOAT32_C( 169.00), SIMDE_FLOAT32_C( -637.23), SIMDE_FLOAT32_C( -453.66), SIMDE_FLOAT32_C( 393.68),
SIMDE_FLOAT32_C( 1.13), SIMDE_FLOAT32_C( -607.44), SIMDE_FLOAT32_C( -763.56), SIMDE_FLOAT32_C( 779.35) },
{ SIMDE_FLOAT32_C( -292.00), SIMDE_FLOAT32_C( 285.00), SIMDE_FLOAT32_C( 423.40), SIMDE_FLOAT32_C( 820.97),
SIMDE_FLOAT32_C( 721.93), SIMDE_FLOAT32_C( -194.00), SIMDE_FLOAT32_C( -52.57), SIMDE_FLOAT32_C( 915.87),
SIMDE_FLOAT32_C( -862.49), SIMDE_FLOAT32_C( 469.26), SIMDE_FLOAT32_C( -791.57), SIMDE_FLOAT32_C( -405.68),
SIMDE_FLOAT32_C( -931.90), SIMDE_FLOAT32_C( 28.01), SIMDE_FLOAT32_C( 16.04), SIMDE_FLOAT32_C( 991.37) } },
{ { SIMDE_FLOAT32_C( 815.97), SIMDE_FLOAT32_C( -942.60), SIMDE_FLOAT32_C( 501.28), SIMDE_FLOAT32_C( 404.07),
SIMDE_FLOAT32_C( 4.83), SIMDE_FLOAT32_C( 417.15), SIMDE_FLOAT32_C( 541.58), SIMDE_FLOAT32_C( -525.90),
SIMDE_FLOAT32_C( 625.58), SIMDE_FLOAT32_C( -864.10), SIMDE_FLOAT32_C( -457.80), SIMDE_FLOAT32_C( -346.41),
SIMDE_FLOAT32_C( 151.94), SIMDE_FLOAT32_C( -466.43), SIMDE_FLOAT32_C( -232.11), SIMDE_FLOAT32_C( 859.92) },
UINT8_C(181),
{ SIMDE_FLOAT32_C( 858.46), SIMDE_FLOAT32_C( 368.30), SIMDE_FLOAT32_C( 12.90), SIMDE_FLOAT32_C( -335.24),
SIMDE_FLOAT32_C( 563.92), SIMDE_FLOAT32_C( 498.88), SIMDE_FLOAT32_C( 833.76), SIMDE_FLOAT32_C( 926.69),
SIMDE_FLOAT32_C( -954.77), SIMDE_FLOAT32_C( 227.44), SIMDE_FLOAT32_C( -72.18), SIMDE_FLOAT32_C( -562.21),
SIMDE_FLOAT32_C( 463.87), SIMDE_FLOAT32_C( -292.83), SIMDE_FLOAT32_C( -746.24), SIMDE_FLOAT32_C( 521.28) },
{ SIMDE_FLOAT32_C( 858.00), SIMDE_FLOAT32_C( -942.60), SIMDE_FLOAT32_C( 13.00), SIMDE_FLOAT32_C( 404.07),
SIMDE_FLOAT32_C( 564.00), SIMDE_FLOAT32_C( 499.00), SIMDE_FLOAT32_C( 541.58), SIMDE_FLOAT32_C( 927.00),
SIMDE_FLOAT32_C( 625.58), SIMDE_FLOAT32_C( -864.10), SIMDE_FLOAT32_C( -457.80), SIMDE_FLOAT32_C( -346.41),
SIMDE_FLOAT32_C( 151.94), SIMDE_FLOAT32_C( -466.43), SIMDE_FLOAT32_C( -232.11), SIMDE_FLOAT32_C( 859.92) } },
{ { SIMDE_FLOAT32_C( -791.54), SIMDE_FLOAT32_C( 657.83), SIMDE_FLOAT32_C( -473.89), SIMDE_FLOAT32_C( 625.60),
SIMDE_FLOAT32_C( 199.41), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 251.18), SIMDE_FLOAT32_C( 335.31),
SIMDE_FLOAT32_C( 542.40), SIMDE_FLOAT32_C( 904.77), SIMDE_FLOAT32_C( -512.75), SIMDE_FLOAT32_C( -924.03),
SIMDE_FLOAT32_C( -327.34), SIMDE_FLOAT32_C( -652.83), SIMDE_FLOAT32_C( 894.23), SIMDE_FLOAT32_C( -468.87) },
UINT8_C(106),
{ SIMDE_FLOAT32_C( -92.87), SIMDE_FLOAT32_C( 195.88), SIMDE_FLOAT32_C( 279.39), SIMDE_FLOAT32_C( -593.99),
SIMDE_FLOAT32_C( 29.64), SIMDE_FLOAT32_C( 206.08), SIMDE_FLOAT32_C( -548.77), SIMDE_FLOAT32_C( -742.92),
SIMDE_FLOAT32_C( -866.10), SIMDE_FLOAT32_C( -110.98), SIMDE_FLOAT32_C( 720.95), SIMDE_FLOAT32_C( -158.93),
SIMDE_FLOAT32_C( 142.78), SIMDE_FLOAT32_C( 242.22), SIMDE_FLOAT32_C( 49.53), SIMDE_FLOAT32_C( -199.39) },
{ SIMDE_FLOAT32_C( -791.54), SIMDE_FLOAT32_C( 196.00), SIMDE_FLOAT32_C( -473.89), SIMDE_FLOAT32_C( -594.00),
SIMDE_FLOAT32_C( 199.41), SIMDE_FLOAT32_C( 206.00), SIMDE_FLOAT32_C( -549.00), SIMDE_FLOAT32_C( 335.31),
SIMDE_FLOAT32_C( 542.40), SIMDE_FLOAT32_C( 904.77), SIMDE_FLOAT32_C( -512.75), SIMDE_FLOAT32_C( -924.03),
SIMDE_FLOAT32_C( -327.34), SIMDE_FLOAT32_C( -652.83), SIMDE_FLOAT32_C( 894.23), SIMDE_FLOAT32_C( -468.87) } },
{ { SIMDE_FLOAT32_C( 768.33), SIMDE_FLOAT32_C( -324.87), SIMDE_FLOAT32_C( -999.98), SIMDE_FLOAT32_C( -231.46),
SIMDE_FLOAT32_C( 926.31), SIMDE_FLOAT32_C( 335.33), SIMDE_FLOAT32_C( -689.06), SIMDE_FLOAT32_C( 831.09),
SIMDE_FLOAT32_C( 822.57), SIMDE_FLOAT32_C( -613.09), SIMDE_FLOAT32_C( -496.25), SIMDE_FLOAT32_C( -830.26),
SIMDE_FLOAT32_C( -718.86), SIMDE_FLOAT32_C( 34.88), SIMDE_FLOAT32_C( 885.21), SIMDE_FLOAT32_C( 188.27) },
UINT8_C(197),
{ SIMDE_FLOAT32_C( 164.59), SIMDE_FLOAT32_C( 594.28), SIMDE_FLOAT32_C( 260.41), SIMDE_FLOAT32_C( -629.33),
SIMDE_FLOAT32_C( -954.49), SIMDE_FLOAT32_C( 517.49), SIMDE_FLOAT32_C( -495.43), SIMDE_FLOAT32_C( -65.47),
SIMDE_FLOAT32_C( 238.43), SIMDE_FLOAT32_C( 345.64), SIMDE_FLOAT32_C( -922.68), SIMDE_FLOAT32_C( -519.34),
SIMDE_FLOAT32_C( -604.83), SIMDE_FLOAT32_C( -122.08), SIMDE_FLOAT32_C( -751.01), SIMDE_FLOAT32_C( 70.30) },
{ SIMDE_FLOAT32_C( 165.00), SIMDE_FLOAT32_C( -324.87), SIMDE_FLOAT32_C( 260.00), SIMDE_FLOAT32_C( -231.46),
SIMDE_FLOAT32_C( 926.31), SIMDE_FLOAT32_C( 335.33), SIMDE_FLOAT32_C( -495.00), SIMDE_FLOAT32_C( -65.00),
SIMDE_FLOAT32_C( 822.57), SIMDE_FLOAT32_C( -613.09), SIMDE_FLOAT32_C( -496.25), SIMDE_FLOAT32_C( -830.26),
SIMDE_FLOAT32_C( -718.86), SIMDE_FLOAT32_C( 34.88), SIMDE_FLOAT32_C( 885.21), SIMDE_FLOAT32_C( 188.27) } },
{ { SIMDE_FLOAT32_C( -122.06), SIMDE_FLOAT32_C( 17.53), SIMDE_FLOAT32_C( -3.38), SIMDE_FLOAT32_C( -786.73),
SIMDE_FLOAT32_C( 328.46), SIMDE_FLOAT32_C( -172.29), SIMDE_FLOAT32_C( -964.16), SIMDE_FLOAT32_C( 715.37),
SIMDE_FLOAT32_C( 331.46), SIMDE_FLOAT32_C( -794.41), SIMDE_FLOAT32_C( 996.51), SIMDE_FLOAT32_C( -633.66),
SIMDE_FLOAT32_C( -909.21), SIMDE_FLOAT32_C( 184.77), SIMDE_FLOAT32_C( -402.90), SIMDE_FLOAT32_C( 255.39) },
UINT8_C( 2),
{ SIMDE_FLOAT32_C( 857.51), SIMDE_FLOAT32_C( 626.06), SIMDE_FLOAT32_C( -175.44), SIMDE_FLOAT32_C( 375.00),
SIMDE_FLOAT32_C( -869.37), SIMDE_FLOAT32_C( 759.09), SIMDE_FLOAT32_C( -386.57), SIMDE_FLOAT32_C( 476.27),
SIMDE_FLOAT32_C( 836.41), SIMDE_FLOAT32_C( 94.09), SIMDE_FLOAT32_C( 871.44), SIMDE_FLOAT32_C( -285.67),
SIMDE_FLOAT32_C( 343.08), SIMDE_FLOAT32_C( -58.26), SIMDE_FLOAT32_C( 592.27), SIMDE_FLOAT32_C( -639.39) },
{ SIMDE_FLOAT32_C( -122.06), SIMDE_FLOAT32_C( 626.00), SIMDE_FLOAT32_C( -3.38), SIMDE_FLOAT32_C( -786.73),
SIMDE_FLOAT32_C( 328.46), SIMDE_FLOAT32_C( -172.29), SIMDE_FLOAT32_C( -964.16), SIMDE_FLOAT32_C( 715.37),
SIMDE_FLOAT32_C( 331.46), SIMDE_FLOAT32_C( -794.41), SIMDE_FLOAT32_C( 996.51), SIMDE_FLOAT32_C( -633.66),
SIMDE_FLOAT32_C( -909.21), SIMDE_FLOAT32_C( 184.77), SIMDE_FLOAT32_C( -402.90), SIMDE_FLOAT32_C( 255.39) } },
{ { SIMDE_FLOAT32_C( 938.35), SIMDE_FLOAT32_C( 805.54), SIMDE_FLOAT32_C( 689.07), SIMDE_FLOAT32_C( -233.94),
SIMDE_FLOAT32_C( 841.38), SIMDE_FLOAT32_C( 404.44), SIMDE_FLOAT32_C( -902.48), SIMDE_FLOAT32_C( -953.03),
SIMDE_FLOAT32_C( 400.95), SIMDE_FLOAT32_C( -536.14), SIMDE_FLOAT32_C( -862.24), SIMDE_FLOAT32_C( -414.28),
SIMDE_FLOAT32_C( 60.96), SIMDE_FLOAT32_C( 393.15), SIMDE_FLOAT32_C( 364.77), SIMDE_FLOAT32_C( -81.52) },
UINT8_C( 26),
{ SIMDE_FLOAT32_C( -810.67), SIMDE_FLOAT32_C( -706.52), SIMDE_FLOAT32_C( 149.83), SIMDE_FLOAT32_C( 948.42),
SIMDE_FLOAT32_C( -93.09), SIMDE_FLOAT32_C( -373.90), SIMDE_FLOAT32_C( 784.83), SIMDE_FLOAT32_C( -999.00),
SIMDE_FLOAT32_C( -502.46), SIMDE_FLOAT32_C( -500.84), SIMDE_FLOAT32_C( 344.08), SIMDE_FLOAT32_C( 439.27),
SIMDE_FLOAT32_C( -908.56), SIMDE_FLOAT32_C( 704.69), SIMDE_FLOAT32_C( 377.63), SIMDE_FLOAT32_C( 896.98) },
{ SIMDE_FLOAT32_C( 938.35), SIMDE_FLOAT32_C( -707.00), SIMDE_FLOAT32_C( 689.07), SIMDE_FLOAT32_C( 948.00),
SIMDE_FLOAT32_C( -93.00), SIMDE_FLOAT32_C( 404.44), SIMDE_FLOAT32_C( -902.48), SIMDE_FLOAT32_C( -953.03),
SIMDE_FLOAT32_C( 400.95), SIMDE_FLOAT32_C( -536.14), SIMDE_FLOAT32_C( -862.24), SIMDE_FLOAT32_C( -414.28),
SIMDE_FLOAT32_C( 60.96), SIMDE_FLOAT32_C( 393.15), SIMDE_FLOAT32_C( 364.77), SIMDE_FLOAT32_C( -81.52) } },
{ { SIMDE_FLOAT32_C( 393.76), SIMDE_FLOAT32_C( -856.31), SIMDE_FLOAT32_C( 738.36), SIMDE_FLOAT32_C( -201.81),
SIMDE_FLOAT32_C( -758.79), SIMDE_FLOAT32_C( 785.33), SIMDE_FLOAT32_C( -800.86), SIMDE_FLOAT32_C( -294.93),
SIMDE_FLOAT32_C( 923.10), SIMDE_FLOAT32_C( -215.14), SIMDE_FLOAT32_C( 766.03), SIMDE_FLOAT32_C( 316.25),
SIMDE_FLOAT32_C( -850.37), SIMDE_FLOAT32_C( -315.49), SIMDE_FLOAT32_C( -664.55), SIMDE_FLOAT32_C( -661.04) },
UINT8_C(104),
{ SIMDE_FLOAT32_C( 485.29), SIMDE_FLOAT32_C( -712.62), SIMDE_FLOAT32_C( 884.89), SIMDE_FLOAT32_C( -888.61),
SIMDE_FLOAT32_C( -927.79), SIMDE_FLOAT32_C( 885.89), SIMDE_FLOAT32_C( -391.08), SIMDE_FLOAT32_C( -428.63),
SIMDE_FLOAT32_C( 229.97), SIMDE_FLOAT32_C( -951.80), SIMDE_FLOAT32_C( -337.19), SIMDE_FLOAT32_C( -65.34),
SIMDE_FLOAT32_C( 425.83), SIMDE_FLOAT32_C( -440.21), SIMDE_FLOAT32_C( -671.58), SIMDE_FLOAT32_C( 569.52) },
{ SIMDE_FLOAT32_C( 393.76), SIMDE_FLOAT32_C( -856.31), SIMDE_FLOAT32_C( 738.36), SIMDE_FLOAT32_C( -889.00),
SIMDE_FLOAT32_C( -758.79), SIMDE_FLOAT32_C( 886.00), SIMDE_FLOAT32_C( -391.00), SIMDE_FLOAT32_C( -294.93),
SIMDE_FLOAT32_C( 923.10), SIMDE_FLOAT32_C( -215.14), SIMDE_FLOAT32_C( 766.03), SIMDE_FLOAT32_C( 316.25),
SIMDE_FLOAT32_C( -850.37), SIMDE_FLOAT32_C( -315.49), SIMDE_FLOAT32_C( -664.55), SIMDE_FLOAT32_C( -661.04) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_rint_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_rint_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -246.76), SIMDE_FLOAT64_C( 995.20), SIMDE_FLOAT64_C( 968.30), SIMDE_FLOAT64_C( 593.75),
SIMDE_FLOAT64_C( 235.19), SIMDE_FLOAT64_C( 73.30), SIMDE_FLOAT64_C( -552.80), SIMDE_FLOAT64_C( -271.48) },
{ SIMDE_FLOAT64_C( -247.00), SIMDE_FLOAT64_C( 995.00), SIMDE_FLOAT64_C( 968.00), SIMDE_FLOAT64_C( 594.00),
SIMDE_FLOAT64_C( 235.00), SIMDE_FLOAT64_C( 73.00), SIMDE_FLOAT64_C( -553.00), SIMDE_FLOAT64_C( -271.00) } },
{ { SIMDE_FLOAT64_C( -135.03), SIMDE_FLOAT64_C( -911.80), SIMDE_FLOAT64_C( -344.75), SIMDE_FLOAT64_C( -200.72),
SIMDE_FLOAT64_C( 333.22), SIMDE_FLOAT64_C( 889.93), SIMDE_FLOAT64_C( -90.00), SIMDE_FLOAT64_C( 700.69) },
{ SIMDE_FLOAT64_C( -135.00), SIMDE_FLOAT64_C( -912.00), SIMDE_FLOAT64_C( -345.00), SIMDE_FLOAT64_C( -201.00),
SIMDE_FLOAT64_C( 333.00), SIMDE_FLOAT64_C( 890.00), SIMDE_FLOAT64_C( -90.00), SIMDE_FLOAT64_C( 701.00) } },
{ { SIMDE_FLOAT64_C( -507.88), SIMDE_FLOAT64_C( 21.18), SIMDE_FLOAT64_C( -600.24), SIMDE_FLOAT64_C( -90.19),
SIMDE_FLOAT64_C( -792.15), SIMDE_FLOAT64_C( 778.81), SIMDE_FLOAT64_C( 116.68), SIMDE_FLOAT64_C( 97.12) },
{ SIMDE_FLOAT64_C( -508.00), SIMDE_FLOAT64_C( 21.00), SIMDE_FLOAT64_C( -600.00), SIMDE_FLOAT64_C( -90.00),
SIMDE_FLOAT64_C( -792.00), SIMDE_FLOAT64_C( 779.00), SIMDE_FLOAT64_C( 117.00), SIMDE_FLOAT64_C( 97.00) } },
{ { SIMDE_FLOAT64_C( 426.71), SIMDE_FLOAT64_C( 210.55), SIMDE_FLOAT64_C( -406.04), SIMDE_FLOAT64_C( 169.01),
SIMDE_FLOAT64_C( 164.78), SIMDE_FLOAT64_C( -734.90), SIMDE_FLOAT64_C( -482.68), SIMDE_FLOAT64_C( 918.02) },
{ SIMDE_FLOAT64_C( 427.00), SIMDE_FLOAT64_C( 211.00), SIMDE_FLOAT64_C( -406.00), SIMDE_FLOAT64_C( 169.00),
SIMDE_FLOAT64_C( 165.00), SIMDE_FLOAT64_C( -735.00), SIMDE_FLOAT64_C( -483.00), SIMDE_FLOAT64_C( 918.00) } },
{ { SIMDE_FLOAT64_C( -739.70), SIMDE_FLOAT64_C( -514.38), SIMDE_FLOAT64_C( 511.78), SIMDE_FLOAT64_C( 495.49),
SIMDE_FLOAT64_C( 558.92), SIMDE_FLOAT64_C( 958.98), SIMDE_FLOAT64_C( -775.99), SIMDE_FLOAT64_C( -576.12) },
{ SIMDE_FLOAT64_C( -740.00), SIMDE_FLOAT64_C( -514.00), SIMDE_FLOAT64_C( 512.00), SIMDE_FLOAT64_C( 495.00),
SIMDE_FLOAT64_C( 559.00), SIMDE_FLOAT64_C( 959.00), SIMDE_FLOAT64_C( -776.00), SIMDE_FLOAT64_C( -576.00) } },
{ { SIMDE_FLOAT64_C( -952.82), SIMDE_FLOAT64_C( -120.74), SIMDE_FLOAT64_C( 223.17), SIMDE_FLOAT64_C( 380.40),
SIMDE_FLOAT64_C( -230.81), SIMDE_FLOAT64_C( -866.83), SIMDE_FLOAT64_C( 81.08), SIMDE_FLOAT64_C( 261.31) },
{ SIMDE_FLOAT64_C( -953.00), SIMDE_FLOAT64_C( -121.00), SIMDE_FLOAT64_C( 223.00), SIMDE_FLOAT64_C( 380.00),
SIMDE_FLOAT64_C( -231.00), SIMDE_FLOAT64_C( -867.00), SIMDE_FLOAT64_C( 81.00), SIMDE_FLOAT64_C( 261.00) } },
{ { SIMDE_FLOAT64_C( 154.35), SIMDE_FLOAT64_C( 480.85), SIMDE_FLOAT64_C( -828.88), SIMDE_FLOAT64_C( 362.20),
SIMDE_FLOAT64_C( 259.66), SIMDE_FLOAT64_C( 287.79), SIMDE_FLOAT64_C( -540.68), SIMDE_FLOAT64_C( -313.64) },
{ SIMDE_FLOAT64_C( 154.00), SIMDE_FLOAT64_C( 481.00), SIMDE_FLOAT64_C( -829.00), SIMDE_FLOAT64_C( 362.00),
SIMDE_FLOAT64_C( 260.00), SIMDE_FLOAT64_C( 288.00), SIMDE_FLOAT64_C( -541.00), SIMDE_FLOAT64_C( -314.00) } },
{ { SIMDE_FLOAT64_C( -501.66), SIMDE_FLOAT64_C( 53.28), SIMDE_FLOAT64_C( 855.37), SIMDE_FLOAT64_C( 663.12),
SIMDE_FLOAT64_C( 318.39), SIMDE_FLOAT64_C( -627.30), SIMDE_FLOAT64_C( 581.15), SIMDE_FLOAT64_C( 578.68) },
{ SIMDE_FLOAT64_C( -502.00), SIMDE_FLOAT64_C( 53.00), SIMDE_FLOAT64_C( 855.00), SIMDE_FLOAT64_C( 663.00),
SIMDE_FLOAT64_C( 318.00), SIMDE_FLOAT64_C( -627.00), SIMDE_FLOAT64_C( 581.00), SIMDE_FLOAT64_C( 579.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_rint_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_rint_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -208.54), SIMDE_FLOAT64_C( -850.79), SIMDE_FLOAT64_C( -979.95), SIMDE_FLOAT64_C( -415.72),
SIMDE_FLOAT64_C( 722.54), SIMDE_FLOAT64_C( -386.30), SIMDE_FLOAT64_C( 827.55), SIMDE_FLOAT64_C( -329.72) },
UINT8_C( 33),
{ SIMDE_FLOAT64_C( -547.16), SIMDE_FLOAT64_C( 343.76), SIMDE_FLOAT64_C( -161.57), SIMDE_FLOAT64_C( 958.51),
SIMDE_FLOAT64_C( 185.76), SIMDE_FLOAT64_C( 479.23), SIMDE_FLOAT64_C( 948.46), SIMDE_FLOAT64_C( 354.63) },
{ SIMDE_FLOAT64_C( -547.00), SIMDE_FLOAT64_C( -850.79), SIMDE_FLOAT64_C( -979.95), SIMDE_FLOAT64_C( -415.72),
SIMDE_FLOAT64_C( 722.54), SIMDE_FLOAT64_C( 479.00), SIMDE_FLOAT64_C( 827.55), SIMDE_FLOAT64_C( -329.72) } },
{ { SIMDE_FLOAT64_C( 164.70), SIMDE_FLOAT64_C( 580.02), SIMDE_FLOAT64_C( 369.11), SIMDE_FLOAT64_C( -928.66),
SIMDE_FLOAT64_C( 607.84), SIMDE_FLOAT64_C( 793.55), SIMDE_FLOAT64_C( -417.32), SIMDE_FLOAT64_C( -33.65) },
UINT8_C(142),
{ SIMDE_FLOAT64_C( 85.45), SIMDE_FLOAT64_C( 426.84), SIMDE_FLOAT64_C( -691.54), SIMDE_FLOAT64_C( 519.42),
SIMDE_FLOAT64_C( 413.73), SIMDE_FLOAT64_C( 99.92), SIMDE_FLOAT64_C( 668.63), SIMDE_FLOAT64_C( 433.78) },
{ SIMDE_FLOAT64_C( 164.70), SIMDE_FLOAT64_C( 427.00), SIMDE_FLOAT64_C( -692.00), SIMDE_FLOAT64_C( 519.00),
SIMDE_FLOAT64_C( 607.84), SIMDE_FLOAT64_C( 793.55), SIMDE_FLOAT64_C( -417.32), SIMDE_FLOAT64_C( 434.00) } },
{ { SIMDE_FLOAT64_C( 684.20), SIMDE_FLOAT64_C( 391.17), SIMDE_FLOAT64_C( -952.53), SIMDE_FLOAT64_C( 511.75),
SIMDE_FLOAT64_C( -938.55), SIMDE_FLOAT64_C( -562.45), SIMDE_FLOAT64_C( 964.59), SIMDE_FLOAT64_C( 405.21) },
UINT8_C(209),
{ SIMDE_FLOAT64_C( 923.10), SIMDE_FLOAT64_C( -409.02), SIMDE_FLOAT64_C( -244.78), SIMDE_FLOAT64_C( 871.57),
SIMDE_FLOAT64_C( 945.61), SIMDE_FLOAT64_C( 919.91), SIMDE_FLOAT64_C( 451.58), SIMDE_FLOAT64_C( 314.71) },
{ SIMDE_FLOAT64_C( 923.00), SIMDE_FLOAT64_C( 391.17), SIMDE_FLOAT64_C( -952.53), SIMDE_FLOAT64_C( 511.75),
SIMDE_FLOAT64_C( 946.00), SIMDE_FLOAT64_C( -562.45), SIMDE_FLOAT64_C( 452.00), SIMDE_FLOAT64_C( 315.00) } },
{ { SIMDE_FLOAT64_C( 991.25), SIMDE_FLOAT64_C( 59.43), SIMDE_FLOAT64_C( 108.26), SIMDE_FLOAT64_C( -426.07),
SIMDE_FLOAT64_C( -974.22), SIMDE_FLOAT64_C( 827.67), SIMDE_FLOAT64_C( 659.39), SIMDE_FLOAT64_C( 452.62) },
UINT8_C( 74),
{ SIMDE_FLOAT64_C( 178.81), SIMDE_FLOAT64_C( -133.64), SIMDE_FLOAT64_C( 236.06), SIMDE_FLOAT64_C( -152.57),
SIMDE_FLOAT64_C( -699.87), SIMDE_FLOAT64_C( -79.74), SIMDE_FLOAT64_C( -761.39), SIMDE_FLOAT64_C( -652.39) },
{ SIMDE_FLOAT64_C( 991.25), SIMDE_FLOAT64_C( -134.00), SIMDE_FLOAT64_C( 108.26), SIMDE_FLOAT64_C( -153.00),
SIMDE_FLOAT64_C( -974.22), SIMDE_FLOAT64_C( 827.67), SIMDE_FLOAT64_C( -761.00), SIMDE_FLOAT64_C( 452.62) } },
{ { SIMDE_FLOAT64_C( -567.98), SIMDE_FLOAT64_C( -699.94), SIMDE_FLOAT64_C( -214.84), SIMDE_FLOAT64_C( -603.39),
SIMDE_FLOAT64_C( 705.27), SIMDE_FLOAT64_C( -938.85), SIMDE_FLOAT64_C( -680.29), SIMDE_FLOAT64_C( -703.75) },
UINT8_C(254),
{ SIMDE_FLOAT64_C( -808.72), SIMDE_FLOAT64_C( -758.15), SIMDE_FLOAT64_C( -263.72), SIMDE_FLOAT64_C( 642.86),
SIMDE_FLOAT64_C( 556.57), SIMDE_FLOAT64_C( -272.47), SIMDE_FLOAT64_C( -297.71), SIMDE_FLOAT64_C( -335.17) },
{ SIMDE_FLOAT64_C( -567.98), SIMDE_FLOAT64_C( -758.00), SIMDE_FLOAT64_C( -264.00), SIMDE_FLOAT64_C( 643.00),
SIMDE_FLOAT64_C( 557.00), SIMDE_FLOAT64_C( -272.00), SIMDE_FLOAT64_C( -298.00), SIMDE_FLOAT64_C( -335.00) } },
{ { SIMDE_FLOAT64_C( 301.46), SIMDE_FLOAT64_C( -271.93), SIMDE_FLOAT64_C( -507.50), SIMDE_FLOAT64_C( -39.16),
SIMDE_FLOAT64_C( -819.31), SIMDE_FLOAT64_C( -371.36), SIMDE_FLOAT64_C( -860.35), SIMDE_FLOAT64_C( 47.05) },
UINT8_C( 9),
{ SIMDE_FLOAT64_C( -12.91), SIMDE_FLOAT64_C( 347.18), SIMDE_FLOAT64_C( -215.03), SIMDE_FLOAT64_C( 225.69),
SIMDE_FLOAT64_C( 694.79), SIMDE_FLOAT64_C( 216.99), SIMDE_FLOAT64_C( 525.75), SIMDE_FLOAT64_C( -520.05) },
{ SIMDE_FLOAT64_C( -13.00), SIMDE_FLOAT64_C( -271.93), SIMDE_FLOAT64_C( -507.50), SIMDE_FLOAT64_C( 226.00),
SIMDE_FLOAT64_C( -819.31), SIMDE_FLOAT64_C( -371.36), SIMDE_FLOAT64_C( -860.35), SIMDE_FLOAT64_C( 47.05) } },
{ { SIMDE_FLOAT64_C( 613.60), SIMDE_FLOAT64_C( 231.02), SIMDE_FLOAT64_C( -458.90), SIMDE_FLOAT64_C( 933.31),
SIMDE_FLOAT64_C( 527.27), SIMDE_FLOAT64_C( 357.46), SIMDE_FLOAT64_C( -875.42), SIMDE_FLOAT64_C( 769.12) },
UINT8_C(129),
{ SIMDE_FLOAT64_C( 767.45), SIMDE_FLOAT64_C( 325.69), SIMDE_FLOAT64_C( -178.73), SIMDE_FLOAT64_C( -530.26),
SIMDE_FLOAT64_C( 990.52), SIMDE_FLOAT64_C( -877.27), SIMDE_FLOAT64_C( 197.81), SIMDE_FLOAT64_C( -516.98) },
{ SIMDE_FLOAT64_C( 767.00), SIMDE_FLOAT64_C( 231.02), SIMDE_FLOAT64_C( -458.90), SIMDE_FLOAT64_C( 933.31),
SIMDE_FLOAT64_C( 527.27), SIMDE_FLOAT64_C( 357.46), SIMDE_FLOAT64_C( -875.42), SIMDE_FLOAT64_C( -517.00) } },
{ { SIMDE_FLOAT64_C( 83.57), SIMDE_FLOAT64_C( 378.50), SIMDE_FLOAT64_C( 111.66), SIMDE_FLOAT64_C( 223.22),
SIMDE_FLOAT64_C( -574.45), SIMDE_FLOAT64_C( -23.63), SIMDE_FLOAT64_C( -789.69), SIMDE_FLOAT64_C( 772.73) },
UINT8_C(203),
{ SIMDE_FLOAT64_C( 436.00), SIMDE_FLOAT64_C( 467.52), SIMDE_FLOAT64_C( -21.68), SIMDE_FLOAT64_C( -38.25),
SIMDE_FLOAT64_C( 947.47), SIMDE_FLOAT64_C( -408.08), SIMDE_FLOAT64_C( -807.23), SIMDE_FLOAT64_C( -511.43) },
{ SIMDE_FLOAT64_C( 436.00), SIMDE_FLOAT64_C( 468.00), SIMDE_FLOAT64_C( 111.66), SIMDE_FLOAT64_C( -38.00),
SIMDE_FLOAT64_C( -574.45), SIMDE_FLOAT64_C( -23.63), SIMDE_FLOAT64_C( -807.00), SIMDE_FLOAT64_C( -511.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_rint_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_sin_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.87)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( 0.48)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.33)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.95), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( -0.30)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -0.53)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( -0.01)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( -0.88)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.79)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_sin_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_sin_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( 0.87)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.97)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( 0.48)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( -0.88)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.33)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( 0.94)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( -0.30)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( -0.95)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_sin_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_sin_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24),
SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01),
SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.88),
SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( 0.48),
SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.97),
SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.87)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13),
SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21),
SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.95),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( -0.30),
SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.33)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47),
SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95),
SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( -0.19),
SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -0.15),
SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -0.53)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67),
SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54),
SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( -0.88)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48),
SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90),
SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( -0.49),
SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.32),
SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.92),
SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( -0.67)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92),
SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.08),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.10),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.59)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73),
SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( -0.74),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.63),
SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( 0.16)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02),
SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.41),
SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.16)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_sin_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_sin_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.97),
SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( 0.87)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( -0.88),
SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( 0.48)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( 0.94),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.33)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( -0.95),
SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( -0.30)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( -0.15),
SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( -0.53)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( -0.19),
SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( -0.01)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.53), SIMDE_FLOAT64_C( 0.89),
SIMDE_FLOAT64_C( -0.47), SIMDE_FLOAT64_C( -0.88)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 0.79)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_sin_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_sin_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.95), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( -0.30),
SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( 0.48),
SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.87)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( -0.88),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -0.53)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99), SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90), SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.32),
SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( -0.67)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47), SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23), SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( -0.74),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( 0.16)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( 343.48),
SIMDE_FLOAT32_C( -874.31), SIMDE_FLOAT32_C( -797.92), SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -525.83),
SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( 655.67),
SIMDE_FLOAT32_C( -70.91), SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( 120.65), SIMDE_FLOAT32_C( -171.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.87),
SIMDE_FLOAT32_C( -0.81), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.80),
SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( -0.96)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -591.56),
SIMDE_FLOAT32_C( -448.89), SIMDE_FLOAT32_C( 731.49), SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 623.70),
SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( -906.16),
SIMDE_FLOAT32_C( 331.34), SIMDE_FLOAT32_C( 99.93), SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -738.19)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( -0.81),
SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( -0.98),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( -0.08)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( -337.60), SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -768.12),
SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( 977.49),
SIMDE_FLOAT32_C( -756.42), SIMDE_FLOAT32_C( 424.81), SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( -95.15)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( 0.98),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( -0.44),
SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( -0.78)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 932.66), SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -125.20),
SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( 14.34), SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -696.69)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -0.78),
SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( 0.45),
SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( -0.92),
SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 0.68)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_sin_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_sin_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 696.87),
SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( 346.63)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -976.55),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -417.54),
SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( -754.38)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -0.29),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 346.63)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -554.19),
SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( 780.64),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 28.08)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -171.51), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( 818.66), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 254.31), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( -944.78), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( 398.82), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 339.21), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( -30.79), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( 380.46), SIMDE_FLOAT32_C( 993.90)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.63),
SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.10),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 0.92)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 99.93),
SIMDE_FLOAT32_C( -738.19), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 343.48), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -448.89),
SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( 331.34),
SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( -874.31),
SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( -70.91)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -337.60),
SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( -756.42)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 897.27), SIMDE_FLOAT32_C( -197.89), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -125.20), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( -696.69), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( -768.12), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 977.49)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 0.98),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -0.44)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -737.13), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 177.92),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 888.71), SIMDE_FLOAT32_C( 915.71), SIMDE_FLOAT32_C( 133.52),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( -775.04), SIMDE_FLOAT32_C( 440.64)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 496.57), SIMDE_FLOAT32_C( 915.19), SIMDE_FLOAT32_C( -718.40), SIMDE_FLOAT32_C( 159.97),
SIMDE_FLOAT32_C( -861.01), SIMDE_FLOAT32_C( 426.61), SIMDE_FLOAT32_C( 932.11), SIMDE_FLOAT32_C( 110.36),
SIMDE_FLOAT32_C( 826.84), SIMDE_FLOAT32_C( -76.75), SIMDE_FLOAT32_C( 237.58), SIMDE_FLOAT32_C( -378.50),
SIMDE_FLOAT32_C( -601.68), SIMDE_FLOAT32_C( -623.50), SIMDE_FLOAT32_C( -942.47), SIMDE_FLOAT32_C( 475.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 0.25),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.39),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 440.64)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -807.28), SIMDE_FLOAT32_C( -70.05), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 92.52), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( 834.60), SIMDE_FLOAT32_C( -65.60),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 556.35), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -964.25), SIMDE_FLOAT32_C( -406.33), SIMDE_FLOAT32_C( -743.66), SIMDE_FLOAT32_C( -764.58),
SIMDE_FLOAT32_C( 789.89), SIMDE_FLOAT32_C( 4.83), SIMDE_FLOAT32_C( -818.54), SIMDE_FLOAT32_C( 161.06),
SIMDE_FLOAT32_C( 579.25), SIMDE_FLOAT32_C( -11.78), SIMDE_FLOAT32_C( -308.52), SIMDE_FLOAT32_C( -719.57),
SIMDE_FLOAT32_C( 334.00), SIMDE_FLOAT32_C( 274.71), SIMDE_FLOAT32_C( -916.82), SIMDE_FLOAT32_C( -490.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.74),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 105.79), SIMDE_FLOAT32_C( 590.10),
SIMDE_FLOAT32_C( 30.91), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -84.00), SIMDE_FLOAT32_C( 80.04),
SIMDE_FLOAT32_C( -709.46), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -889.11)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 18.75), SIMDE_FLOAT32_C( 809.05), SIMDE_FLOAT32_C( 144.05), SIMDE_FLOAT32_C( -427.72),
SIMDE_FLOAT32_C( 308.28), SIMDE_FLOAT32_C( -177.05), SIMDE_FLOAT32_C( -457.77), SIMDE_FLOAT32_C( 678.24),
SIMDE_FLOAT32_C( 66.05), SIMDE_FLOAT32_C( -267.71), SIMDE_FLOAT32_C( 117.28), SIMDE_FLOAT32_C( -576.80),
SIMDE_FLOAT32_C( -38.39), SIMDE_FLOAT32_C( -250.14), SIMDE_FLOAT32_C( -53.92), SIMDE_FLOAT32_C( 91.94)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.34),
SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 0.95),
SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -0.74)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( -493.41), SIMDE_FLOAT32_C( 822.72),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( -816.27),
SIMDE_FLOAT32_C( -209.34), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -728.70), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( 100.32), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( -204.33)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -841.43), SIMDE_FLOAT32_C( -14.16), SIMDE_FLOAT32_C( 824.88), SIMDE_FLOAT32_C( 793.63),
SIMDE_FLOAT32_C( -736.75), SIMDE_FLOAT32_C( -310.57), SIMDE_FLOAT32_C( 728.87), SIMDE_FLOAT32_C( -350.72),
SIMDE_FLOAT32_C( 60.89), SIMDE_FLOAT32_C( 109.81), SIMDE_FLOAT32_C( 715.94), SIMDE_FLOAT32_C( -250.60),
SIMDE_FLOAT32_C( 944.14), SIMDE_FLOAT32_C( 361.85), SIMDE_FLOAT32_C( -13.07), SIMDE_FLOAT32_C( 852.60)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( 0.91),
SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( -0.94)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_sin_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_sin_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( -0.88),
SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( 0.48),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.97),
SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( 0.87)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( -0.95),
SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( -0.30),
SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( 0.94),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.33)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( -0.19),
SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( -0.01),
SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( -0.15),
SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( -0.53)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 0.79),
SIMDE_FLOAT64_C( -0.53), SIMDE_FLOAT64_C( 0.89),
SIMDE_FLOAT64_C( -0.47), SIMDE_FLOAT64_C( -0.88)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -770.35), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( 380.46),
SIMDE_FLOAT64_C( -770.72), SIMDE_FLOAT64_C( 993.90),
SIMDE_FLOAT64_C( 28.08), SIMDE_FLOAT64_C( 841.21)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( -0.49),
SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( -0.32),
SIMDE_FLOAT64_C( 0.86), SIMDE_FLOAT64_C( 0.92),
SIMDE_FLOAT64_C( 0.19), SIMDE_FLOAT64_C( -0.67)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -387.90), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 339.21),
SIMDE_FLOAT64_C( 532.35), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -30.79)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.08),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( -0.08),
SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( -0.10),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.59)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -203.65), SIMDE_FLOAT64_C( -80.73),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -944.78),
SIMDE_FLOAT64_C( -747.59), SIMDE_FLOAT64_C( -767.23),
SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( 398.82)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.53), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( -0.74),
SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -0.63),
SIMDE_FLOAT64_C( -0.96), SIMDE_FLOAT64_C( 0.16)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 469.66), SIMDE_FLOAT64_C( 680.02),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 600.47),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( 254.31)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 0.86), SIMDE_FLOAT64_C( 0.96),
SIMDE_FLOAT64_C( -0.86), SIMDE_FLOAT64_C( -0.41),
SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.16)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_sin_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_sin_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -686.13), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 670.24), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( 39.01), SIMDE_FLOAT64_C( 346.63)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 84.77),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( -269.45),
SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( -297.45),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( -754.38)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( -0.39)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( 233.37),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -384.03),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -417.54)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 841.21), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 687.09), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -660.80), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -923.64), SIMDE_FLOAT64_C( -860.95)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.67), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -0.19),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -0.15)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 398.82), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 339.21), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( -30.79), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( 993.90)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( -387.90),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 532.35),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -770.35),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( -770.72)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.96), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( -0.99),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.61),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( 0.86)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( 469.66),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 910.03),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( -203.65),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -747.59)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 543.35), SIMDE_FLOAT64_C( -171.51),
SIMDE_FLOAT64_C( 680.02), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 600.47), SIMDE_FLOAT64_C( 254.31),
SIMDE_FLOAT64_C( -80.73), SIMDE_FLOAT64_C( -944.78)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( -0.96),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 0.96),
SIMDE_FLOAT64_C( -0.41), SIMDE_FLOAT64_C( 0.16),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -0.74)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 99.93), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 343.48),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 655.67)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 331.34), SIMDE_FLOAT64_C( 462.95),
SIMDE_FLOAT64_C( -178.99), SIMDE_FLOAT64_C( 324.62),
SIMDE_FLOAT64_C( -874.31), SIMDE_FLOAT64_C( -328.54),
SIMDE_FLOAT64_C( -192.31), SIMDE_FLOAT64_C( 561.36)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( -0.86),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 0.83)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 27.25),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -448.89), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 831.02), SIMDE_FLOAT64_C( 977.36)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 977.49), SIMDE_FLOAT64_C( 424.81),
SIMDE_FLOAT64_C( -95.15), SIMDE_FLOAT64_C( 840.65),
SIMDE_FLOAT64_C( -591.56), SIMDE_FLOAT64_C( 731.49),
SIMDE_FLOAT64_C( 623.70), SIMDE_FLOAT64_C( 140.67)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( -0.64),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -0.81), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.65)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( -304.73),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( 822.06),
SIMDE_FLOAT64_C( -997.63), SIMDE_FLOAT64_C( 923.64),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -67.64)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 510.85), SIMDE_FLOAT64_C( 14.34),
SIMDE_FLOAT64_C( 916.26), SIMDE_FLOAT64_C( -769.09),
SIMDE_FLOAT64_C( -573.81), SIMDE_FLOAT64_C( -337.60),
SIMDE_FLOAT64_C( 293.64), SIMDE_FLOAT64_C( -576.22)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( 0.98),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( -0.56),
SIMDE_FLOAT64_C( -0.89), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( 0.97)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 475.51), SIMDE_FLOAT64_C( 936.65),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -438.19),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( 932.66),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -182.45)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -775.04), SIMDE_FLOAT64_C( 440.64),
SIMDE_FLOAT64_C( 897.27), SIMDE_FLOAT64_C( -197.89),
SIMDE_FLOAT64_C( -359.76), SIMDE_FLOAT64_C( -33.67),
SIMDE_FLOAT64_C( 7.27), SIMDE_FLOAT64_C( -125.20)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( 0.73),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -0.03),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( -0.78),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( 0.45)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_sin_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_sind_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( -0.23)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.56)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.95)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -0.52)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.39)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 0.40)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.86)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( -0.54)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_sind_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_sincos_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 mem[4];
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.66) },
{ SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.85) },
{ SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.75) } },
{ { SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.86) },
{ SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.53) },
{ SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.51) } },
{ { SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.93) },
{ SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 0.37) },
{ SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.36) } },
{ { SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.64) },
{ SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.88) },
{ SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.77) } },
{ { SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.77) },
{ SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.69) },
{ SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( -0.64) } },
{ { SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.91) },
{ SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( -0.42) },
{ SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( -0.41) } },
{ { SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.92) },
{ SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.40) },
{ SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.39) } },
{ { SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.98) },
{ SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 0.19) },
{ SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 0.19) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 mem;
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_sincos_ps(&mem, a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
simde_test_x86_assert_equal_f32x4(mem, simde_mm_loadu_ps(test_vec[i].mem), 1);
}
return 0;
}
static int
test_simde_mm_sincos_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 mem[2];
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( 0.90) },
{ SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( 0.45) },
{ SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( 0.43) } },
{ { SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.73) },
{ SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.75) },
{ SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( 0.68) } },
{ { SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( 1.00) },
{ SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( 0.01) },
{ SIMDE_FLOAT64_C( 0.82), SIMDE_FLOAT64_C( 0.01) } },
{ { SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.79) },
{ SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( -0.66) },
{ SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( -0.61) } },
{ { SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.76) },
{ SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( 0.71) },
{ SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( 0.65) } },
{ { SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 0.55) },
{ SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( -0.99) },
{ SIMDE_FLOAT64_C( 0.36), SIMDE_FLOAT64_C( -0.84) } },
{ { SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.93) },
{ SIMDE_FLOAT64_C( -0.02), SIMDE_FLOAT64_C( -0.37) },
{ SIMDE_FLOAT64_C( -0.02), SIMDE_FLOAT64_C( -0.36) } },
{ { SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( 0.90) },
{ SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.44) },
{ SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.43) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d mem;
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_sincos_pd(&mem, a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
simde_test_x86_assert_equal_f64x2(mem, simde_mm_loadu_pd(test_vec[i].mem), 1);
}
return 0;
}
static int
test_simde_mm256_sincos_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 mem[8];
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.58),
SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.86) },
{ SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( 0.95),
SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.53) },
{ SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.51) } },
{ { SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.90),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.76) },
{ SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.71) },
{ SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.43),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.65) } },
{ { SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.90),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.83) },
{ SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( 0.60) },
{ SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -0.43),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( 0.56) } },
{ { SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.97),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.99) },
{ SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( 0.26),
SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.11) },
{ SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( 0.26),
SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( -0.11) } },
{ { SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.72) },
{ SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.36),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.77) },
{ SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.70) } },
{ { SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.71),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.83) },
{ SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( -0.78),
SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.59) },
{ SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( -0.70),
SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( -0.56) } },
{ { SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.87),
SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.93) },
{ SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( -0.51),
SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( -0.37) },
{ SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( -0.49),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.36) } },
{ { SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.97),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.67) },
{ SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.26),
SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.83) },
{ SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.26),
SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.74) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 mem;
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_sincos_ps(&mem, a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
simde_test_x86_assert_equal_f32x8(mem, simde_mm256_loadu_ps(test_vec[i].mem), 1);
}
return 0;
}
static int
test_simde_mm256_sincos_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 mem[4];
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( 0.79) },
{ SIMDE_FLOAT64_C( 0.19), SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -0.66) },
{ SIMDE_FLOAT64_C( 0.19), SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.61) } },
{ { SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 0.97) },
{ SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.25) },
{ SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.63), SIMDE_FLOAT64_C( 0.25) } },
{ { SIMDE_FLOAT64_C( 0.80), SIMDE_FLOAT64_C( 0.86), SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( 0.72) },
{ SIMDE_FLOAT64_C( -0.65), SIMDE_FLOAT64_C( -0.53), SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( -0.77) },
{ SIMDE_FLOAT64_C( -0.61), SIMDE_FLOAT64_C( -0.51), SIMDE_FLOAT64_C( -0.43), SIMDE_FLOAT64_C( -0.70) } },
{ { SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( 0.76) },
{ SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( 0.63), SIMDE_FLOAT64_C( 0.70) },
{ SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( 0.64) } },
{ { SIMDE_FLOAT64_C( 0.86), SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 0.76) },
{ SIMDE_FLOAT64_C( -0.53), SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( -0.16), SIMDE_FLOAT64_C( 0.71) },
{ SIMDE_FLOAT64_C( -0.51), SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -0.16), SIMDE_FLOAT64_C( 0.65) } },
{ { SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.87), SIMDE_FLOAT64_C( 0.64), SIMDE_FLOAT64_C( 0.90) },
{ SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( -0.52), SIMDE_FLOAT64_C( -0.87), SIMDE_FLOAT64_C( -0.44) },
{ SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( -0.50), SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( -0.43) } },
{ { SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.80), SIMDE_FLOAT64_C( 1.00) },
{ SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( 0.41), SIMDE_FLOAT64_C( -0.65), SIMDE_FLOAT64_C( 0.05) },
{ SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( -0.61), SIMDE_FLOAT64_C( 0.05) } },
{ { SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.80), SIMDE_FLOAT64_C( 0.97), SIMDE_FLOAT64_C( 0.98) },
{ SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( -0.26), SIMDE_FLOAT64_C( 0.20) },
{ SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( -0.26), SIMDE_FLOAT64_C( 0.20) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d mem;
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_sincos_pd(&mem, a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
simde_test_x86_assert_equal_f64x4(mem, simde_mm256_loadu_pd(test_vec[i].mem), 1);
}
return 0;
}
static int
test_simde_mm512_sincos_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 mem[16];
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.97),
SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.69) },
{ SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( -0.25),
SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 0.28),
SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.81) },
{ SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( -0.25),
SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 0.28),
SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( -0.72) } },
{ { SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.72),
SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.84) },
{ SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( -0.35),
SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.57) },
{ SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( -0.34),
SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.70),
SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( 0.54) } },
{ { SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.88),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 0.81) },
{ SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.50),
SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.36),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.75),
SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.63) },
{ SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( -0.48),
SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -0.35),
SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.59) } },
{ { SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.87),
SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.86),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.58) },
{ SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( -0.52),
SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.53),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -0.58),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.95) },
{ SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( -0.50),
SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.51),
SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -0.55),
SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.81) } },
{ { SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.92),
SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.56) },
{ SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.71),
SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.41),
SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.98) },
{ SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.65),
SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.40),
SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( 0.61),
SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.83) } },
{ { SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.70),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.86),
SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.80) },
{ SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.29),
SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( -0.80),
SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.54),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( 0.64) },
{ SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -0.29),
SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( -0.72),
SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( -0.51),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( 0.60) } },
{ { SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.63),
SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 0.78) },
{ SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( -0.89),
SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.63),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.87),
SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( 0.67) },
{ SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( -0.78),
SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.59),
SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.62) } },
{ { SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.92),
SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.95),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( 0.72) },
{ SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( -0.61), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( -0.32),
SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( 0.77) },
{ SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( -0.31),
SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( 0.70) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 mem;
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_sincos_ps(&mem, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
simde_test_x86_assert_equal_f32x16(mem, simde_mm512_loadu_ps(test_vec[i].mem), 1);
}
return 0;
}
static int
test_simde_mm512_mask_sincos_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 mem[16];
const simde_float32 sin_src[16];
const simde_float32 cos_src[16];
const simde__mmask16 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.72),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( -0.79),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.64) },
{ SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.49),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.21) },
{ SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( -0.79),
SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.64) },
UINT16_C( 4890),
{ SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( 0.28),
SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.06) },
{ SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( -0.69),
SIMDE_FLOAT32_C( -0.12), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.21) } },
{ { SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.34),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.09) },
{ SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 0.02),
SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( -0.37),
SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.84) },
{ SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.34),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.49),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 0.09) },
UINT16_C(18720),
{ SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 0.27),
SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.05),
SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( -0.01) },
{ SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( 0.02),
SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( 0.16), SIMDE_FLOAT32_C( -0.37),
SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( -0.05),
SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.84) } },
{ { SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.52),
SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.54),
SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.36),
SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( 0.78) },
{ SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.62),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 0.24),
SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.77) },
{ SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.52),
SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.54),
SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.36),
SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( 0.78) },
UINT16_C( 4387),
{ SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( -0.95),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( 0.42),
SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.73),
SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.11) },
{ SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.62),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 0.24),
SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.77) } },
{ { SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.98),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.96) },
{ SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( -0.34),
SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( -0.41) },
{ SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( -0.49),
SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.74),
SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( 0.61),
SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.39) },
UINT16_C(36556),
{ SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( -0.09),
SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( -0.18),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -0.84),
SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( -0.29) },
{ SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.09),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.18),
SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -0.74),
SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( -0.29) } },
{ { SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.40),
SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.06),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.86) },
{ SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.08),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.85),
SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( -0.86) },
{ SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( -0.40),
SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( -0.15),
SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.06),
SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( -0.86) },
UINT16_C(25479),
{ SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.37),
SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( 0.17),
SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.78) },
{ SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.08),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.74),
SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.85),
SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( -0.86) } },
{ { SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.77),
SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.62) },
{ SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.53),
SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 0.97),
SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.05),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( 0.20) },
{ SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.77),
SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( -0.82),
SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( -0.89), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.29),
SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.62) },
UINT16_C( 2690),
{ SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( -0.48),
SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.34),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( -0.52) },
{ SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.53),
SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( -0.46),
SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( -0.33),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( 0.20) } },
{ { SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.85),
SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( 0.90) },
{ SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.92),
SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.52),
SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.54) },
{ SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( -0.94),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.85),
SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( 0.19), SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( 0.75) },
UINT16_C(41670),
{ SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -0.84),
SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -0.63),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.41),
SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.44) },
{ SIMDE_FLOAT32_C( 0.55), SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.59),
SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.52),
SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.43) } },
{ { SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.51),
SIMDE_FLOAT32_C( 0.77), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.54) },
{ SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.43),
SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( -0.24),
SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 0.11) },
{ SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.51),
SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.14), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.54) },
UINT16_C( 7185),
{ SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( -0.23), SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( -0.93),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -0.82),
SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.31) },
{ SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.43),
SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( -0.73),
SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 0.11) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 mem;
simde__m512 sin_src = simde_mm512_loadu_ps(test_vec[i].sin_src);
simde__m512 cos_src = simde_mm512_loadu_ps(test_vec[i].cos_src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_sincos_ps(&mem, sin_src, cos_src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
simde_test_x86_assert_equal_f32x16(mem, simde_mm512_loadu_ps(test_vec[i].mem), 1);
}
return 0;
}
static int
test_simde_mm512_sincos_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 mem[8];
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( 0.82), SIMDE_FLOAT64_C( 1.00) },
{ SIMDE_FLOAT64_C( -0.37), SIMDE_FLOAT64_C( 0.71), SIMDE_FLOAT64_C( 0.37), SIMDE_FLOAT64_C( -0.02),
SIMDE_FLOAT64_C( -0.16), SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( -0.61), SIMDE_FLOAT64_C( -0.06) },
{ SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( 0.36), SIMDE_FLOAT64_C( -0.02),
SIMDE_FLOAT64_C( -0.16), SIMDE_FLOAT64_C( -0.43), SIMDE_FLOAT64_C( -0.57), SIMDE_FLOAT64_C( -0.06) } },
{ { SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.88), SIMDE_FLOAT64_C( 0.86),
SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.86) },
{ SIMDE_FLOAT64_C( 0.70), SIMDE_FLOAT64_C( 0.21), SIMDE_FLOAT64_C( -0.50), SIMDE_FLOAT64_C( 0.54),
SIMDE_FLOAT64_C( -0.62), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( 0.53) },
{ SIMDE_FLOAT64_C( 0.64), SIMDE_FLOAT64_C( 0.21), SIMDE_FLOAT64_C( -0.48), SIMDE_FLOAT64_C( 0.51),
SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( 0.62), SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( 0.51) } },
{ { SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( 0.73), SIMDE_FLOAT64_C( 0.86),
SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( 0.93) },
{ SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( 0.75), SIMDE_FLOAT64_C( -0.53),
SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( -0.37) },
{ SIMDE_FLOAT64_C( -0.09), SIMDE_FLOAT64_C( -0.43), SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( -0.51),
SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( -0.36) } },
{ { SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.86), SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( 0.76),
SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( 0.93) },
{ SIMDE_FLOAT64_C( -0.95), SIMDE_FLOAT64_C( -0.54), SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( -0.71),
SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( -0.71), SIMDE_FLOAT64_C( 0.63), SIMDE_FLOAT64_C( -0.38) },
{ SIMDE_FLOAT64_C( -0.81), SIMDE_FLOAT64_C( -0.51), SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( -0.65),
SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( -0.65), SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( -0.37) } },
{ { SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 0.85), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.71), SIMDE_FLOAT64_C( 0.97) },
{ SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( -0.16),
SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 0.25) },
{ SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( 0.00), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( -0.16),
SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.70), SIMDE_FLOAT64_C( 0.25) } },
{ { SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.84),
SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.99) },
{ SIMDE_FLOAT64_C( 0.19), SIMDE_FLOAT64_C( -0.71), SIMDE_FLOAT64_C( -0.21), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( 0.70), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( -0.13) },
{ SIMDE_FLOAT64_C( 0.19), SIMDE_FLOAT64_C( -0.65), SIMDE_FLOAT64_C( -0.21), SIMDE_FLOAT64_C( 0.54),
SIMDE_FLOAT64_C( 0.81), SIMDE_FLOAT64_C( 0.64), SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( -0.13) } },
{ { SIMDE_FLOAT64_C( 0.73), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 0.94), SIMDE_FLOAT64_C( 0.61),
SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( 1.00) },
{ SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.91),
SIMDE_FLOAT64_C( -0.14), SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( -0.09) },
{ SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( -0.79),
SIMDE_FLOAT64_C( -0.14), SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( -0.09) } },
{ { SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 0.94), SIMDE_FLOAT64_C( 0.76) },
{ SIMDE_FLOAT64_C( -0.96), SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( -0.97),
SIMDE_FLOAT64_C( -0.70), SIMDE_FLOAT64_C( -0.17), SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( -0.70) },
{ SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( 0.20), SIMDE_FLOAT64_C( -0.82),
SIMDE_FLOAT64_C( -0.64), SIMDE_FLOAT64_C( -0.17), SIMDE_FLOAT64_C( -0.34), SIMDE_FLOAT64_C( -0.64) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d mem;
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_sincos_pd(&mem, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
simde_test_x86_assert_equal_f64x8(mem, simde_mm512_loadu_pd(test_vec[i].mem), 1);
}
return 0;
}
static int
test_simde_mm512_mask_sincos_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 mem[8];
const simde_float64 sin_src[8];
const simde_float64 cos_src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( 0.80), SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( 0.94), SIMDE_FLOAT64_C( 0.05) },
{ SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.35), SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( 0.72),
SIMDE_FLOAT64_C( -0.08), SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( -0.71), SIMDE_FLOAT64_C( -0.51) },
{ SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( -1.00),
SIMDE_FLOAT64_C( 0.80), SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( 0.07), SIMDE_FLOAT64_C( 0.05) },
UINT8_C( 74),
{ SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( -0.14), SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.05),
SIMDE_FLOAT64_C( 0.80), SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( 0.49) },
{ SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( -0.14), SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( 0.05),
SIMDE_FLOAT64_C( -0.08), SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( -0.34), SIMDE_FLOAT64_C( -0.51) } },
{ { SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( -0.44),
SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.87) },
{ SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( -0.24), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( -0.66),
SIMDE_FLOAT64_C( -0.17), SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( -0.83), SIMDE_FLOAT64_C( -0.82) },
{ SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( -0.44),
SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( 0.87) },
UINT8_C( 82),
{ SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.19), SIMDE_FLOAT64_C( 0.51),
SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( 0.17), SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( 0.59) },
{ SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( 0.62), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( -0.66),
SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( -0.38), SIMDE_FLOAT64_C( -0.82) } },
{ { SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( -0.10),
SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.48) },
{ SIMDE_FLOAT64_C( -0.28), SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( -0.63),
SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.12), SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.08) },
{ SIMDE_FLOAT64_C( -0.56), SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( -0.10),
SIMDE_FLOAT64_C( 0.59), SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.48) },
UINT8_C( 33),
{ SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( -0.25), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( 0.20) },
{ SIMDE_FLOAT64_C( -0.43), SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( -0.63),
SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.08) } },
{ { SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.78),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.70), SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 0.63) },
{ SIMDE_FLOAT64_C( -0.59), SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.58),
SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( 0.24) },
{ SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.78),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( -0.37), SIMDE_FLOAT64_C( -0.78), SIMDE_FLOAT64_C( 0.17) },
UINT8_C(225),
{ SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -0.24), SIMDE_FLOAT64_C( -0.38), SIMDE_FLOAT64_C( 0.11),
SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.89) },
{ SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 0.58),
SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.71), SIMDE_FLOAT64_C( 0.62), SIMDE_FLOAT64_C( 0.78) } },
{ { SIMDE_FLOAT64_C( -0.47), SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( 1.00),
SIMDE_FLOAT64_C( -0.93), SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( -0.07), SIMDE_FLOAT64_C( 0.84) },
{ SIMDE_FLOAT64_C( -0.46), SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.76),
SIMDE_FLOAT64_C( -0.88), SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.09) },
{ SIMDE_FLOAT64_C( -0.47), SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( -0.96),
SIMDE_FLOAT64_C( -0.93), SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( -0.07), SIMDE_FLOAT64_C( -0.26) },
UINT8_C(136),
{ SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( -0.50), SIMDE_FLOAT64_C( -0.91), SIMDE_FLOAT64_C( -0.08),
SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( -0.37), SIMDE_FLOAT64_C( -0.96), SIMDE_FLOAT64_C( -0.57) },
{ SIMDE_FLOAT64_C( -0.46), SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( -0.08),
SIMDE_FLOAT64_C( -0.88), SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( -0.54) } },
{ { SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.94), SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( -0.40),
SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.28) },
{ SIMDE_FLOAT64_C( 0.24), SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( -0.43), SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( 0.49) },
{ SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( -0.40),
SIMDE_FLOAT64_C( 0.52), SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.14), SIMDE_FLOAT64_C( 0.28) },
UINT8_C( 3),
{ SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( -0.89),
SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( -0.18), SIMDE_FLOAT64_C( -0.42) },
{ SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( -0.43), SIMDE_FLOAT64_C( 0.57), SIMDE_FLOAT64_C( 0.49) } },
{ { SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( 0.93),
SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( 0.97) },
{ SIMDE_FLOAT64_C( -0.18), SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.45),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.08) },
{ SIMDE_FLOAT64_C( 0.36), SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( 0.74), SIMDE_FLOAT64_C( 0.93),
SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( 0.22), SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( -0.36) },
UINT8_C(195),
{ SIMDE_FLOAT64_C( -0.48), SIMDE_FLOAT64_C( -0.85), SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( 0.66),
SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( -0.17), SIMDE_FLOAT64_C( -0.23) },
{ SIMDE_FLOAT64_C( -0.46), SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.45),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -0.32), SIMDE_FLOAT64_C( -0.17), SIMDE_FLOAT64_C( -0.23) } },
{ { SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( -0.64),
SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( 0.56) },
{ SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( -0.06), SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( -0.11),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.70), SIMDE_FLOAT64_C( 0.50) },
{ SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.15), SIMDE_FLOAT64_C( 0.27), SIMDE_FLOAT64_C( -0.64),
SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( 0.56) },
UINT8_C( 0),
{ SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( -0.14), SIMDE_FLOAT64_C( 0.16), SIMDE_FLOAT64_C( 0.56),
SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( 0.99), SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( 0.64) },
{ SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( -0.06), SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( -0.11),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.43), SIMDE_FLOAT64_C( 0.70), SIMDE_FLOAT64_C( 0.50) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d mem;
simde__m512d sin_src = simde_mm512_loadu_pd(test_vec[i].sin_src);
simde__m512d cos_src = simde_mm512_loadu_pd(test_vec[i].cos_src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_sincos_pd(&mem, sin_src, cos_src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
simde_test_x86_assert_equal_f64x8(mem, simde_mm512_loadu_pd(test_vec[i].mem), 1);
}
return 0;
}
static int
test_simde_mm_sind_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.56), SIMDE_FLOAT64_C( -0.23)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 0.63)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( 0.56)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( -0.76)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.95)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( 0.88)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -0.52)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.56)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_sind_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_sind_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24),
SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01),
SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.56),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.63),
SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( -0.23)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13),
SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21),
SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.56),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -0.52),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.88),
SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.95)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47),
SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95),
SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.48),
SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -0.63),
SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.39)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67),
SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54),
SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( -0.54),
SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.54),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.86)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48),
SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90),
SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.86)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92),
SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( -0.35),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( -0.51)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73),
SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.99),
SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 0.70),
SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -0.73),
SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 0.63)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02),
SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( -0.64),
SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( -0.87),
SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( -0.96)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_sind_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_sind_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 0.63),
SIMDE_FLOAT64_C( -0.56), SIMDE_FLOAT64_C( -0.23)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( -0.76),
SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( 0.56)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( 0.88),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.95)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.56),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -0.52)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.82), SIMDE_FLOAT64_C( -0.63),
SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( -0.39)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 0.48),
SIMDE_FLOAT64_C( -0.41), SIMDE_FLOAT64_C( 0.40)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( 0.54),
SIMDE_FLOAT64_C( 0.97), SIMDE_FLOAT64_C( 0.86)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( -1.00),
SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( -0.54)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_sind_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_sind_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( -0.52),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.95),
SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.56),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( -0.23)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( -0.54),
SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.86),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.39)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99), SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90), SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.47), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( -0.35),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( -0.51),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.86)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47), SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23), SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( -0.96),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 0.70),
SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 0.63)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( 343.48),
SIMDE_FLOAT32_C( -874.31), SIMDE_FLOAT32_C( -797.92), SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -525.83),
SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( 655.67),
SIMDE_FLOAT32_C( -70.91), SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( 120.65), SIMDE_FLOAT32_C( -171.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.28),
SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( -0.24),
SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.90),
SIMDE_FLOAT32_C( -0.95), SIMDE_FLOAT32_C( -0.06), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( -0.15)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -591.56),
SIMDE_FLOAT32_C( -448.89), SIMDE_FLOAT32_C( 731.49), SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 623.70),
SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( -906.16),
SIMDE_FLOAT32_C( 331.34), SIMDE_FLOAT32_C( 99.93), SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -738.19)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( 0.86), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( -0.99),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( -0.31)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( -337.60), SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -768.12),
SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( 977.49),
SIMDE_FLOAT32_C( -756.42), SIMDE_FLOAT32_C( 424.81), SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( -95.15)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.74),
SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( -0.98),
SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -1.00)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 932.66), SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -125.20),
SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( 14.34), SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -696.69)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( -0.55),
SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( -0.82),
SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -0.28), SIMDE_FLOAT32_C( 0.40)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_sind_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_sind_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 696.87),
SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( 346.63)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -976.55),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -417.54),
SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( -754.38)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -0.84),
SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 1.00), SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 1.00),
SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 346.63)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -554.19),
SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( 780.64),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 28.08)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -171.51), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( 818.66), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 254.31), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( -944.78), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( 398.82), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 339.21), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( -30.79), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( 380.46), SIMDE_FLOAT32_C( 993.90)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.70), SIMDE_FLOAT32_C( -0.73),
SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( -0.51), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( -1.00)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 99.93),
SIMDE_FLOAT32_C( -738.19), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 343.48), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -448.89),
SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( 331.34),
SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( -874.31),
SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( -70.91)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( -0.48),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -337.60),
SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( -756.42)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 897.27), SIMDE_FLOAT32_C( -197.89), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -125.20), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( -696.69), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( -768.12), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 977.49)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.98)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -737.13), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 177.92),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 888.71), SIMDE_FLOAT32_C( 915.71), SIMDE_FLOAT32_C( 133.52),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( -775.04), SIMDE_FLOAT32_C( 440.64)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 496.57), SIMDE_FLOAT32_C( 915.19), SIMDE_FLOAT32_C( -718.40), SIMDE_FLOAT32_C( 159.97),
SIMDE_FLOAT32_C( -861.01), SIMDE_FLOAT32_C( 426.61), SIMDE_FLOAT32_C( 932.11), SIMDE_FLOAT32_C( 110.36),
SIMDE_FLOAT32_C( 826.84), SIMDE_FLOAT32_C( -76.75), SIMDE_FLOAT32_C( 237.58), SIMDE_FLOAT32_C( -378.50),
SIMDE_FLOAT32_C( -601.68), SIMDE_FLOAT32_C( -623.50), SIMDE_FLOAT32_C( -942.47), SIMDE_FLOAT32_C( 475.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.94),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 440.64)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -807.28), SIMDE_FLOAT32_C( -70.05), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 92.52), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( 834.60), SIMDE_FLOAT32_C( -65.60),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 556.35), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -964.25), SIMDE_FLOAT32_C( -406.33), SIMDE_FLOAT32_C( -743.66), SIMDE_FLOAT32_C( -764.58),
SIMDE_FLOAT32_C( 789.89), SIMDE_FLOAT32_C( 4.83), SIMDE_FLOAT32_C( -818.54), SIMDE_FLOAT32_C( 161.06),
SIMDE_FLOAT32_C( 579.25), SIMDE_FLOAT32_C( -11.78), SIMDE_FLOAT32_C( -308.52), SIMDE_FLOAT32_C( -719.57),
SIMDE_FLOAT32_C( 334.00), SIMDE_FLOAT32_C( 274.71), SIMDE_FLOAT32_C( -916.82), SIMDE_FLOAT32_C( -490.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( 0.32),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( -1.00), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 105.79), SIMDE_FLOAT32_C( 590.10),
SIMDE_FLOAT32_C( 30.91), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -84.00), SIMDE_FLOAT32_C( 80.04),
SIMDE_FLOAT32_C( -709.46), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -889.11)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 18.75), SIMDE_FLOAT32_C( 809.05), SIMDE_FLOAT32_C( 144.05), SIMDE_FLOAT32_C( -427.72),
SIMDE_FLOAT32_C( 308.28), SIMDE_FLOAT32_C( -177.05), SIMDE_FLOAT32_C( -457.77), SIMDE_FLOAT32_C( 678.24),
SIMDE_FLOAT32_C( 66.05), SIMDE_FLOAT32_C( -267.71), SIMDE_FLOAT32_C( 117.28), SIMDE_FLOAT32_C( -576.80),
SIMDE_FLOAT32_C( -38.39), SIMDE_FLOAT32_C( -250.14), SIMDE_FLOAT32_C( -53.92), SIMDE_FLOAT32_C( 91.94)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -0.67),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( 1.00)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( -493.41), SIMDE_FLOAT32_C( 822.72),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( -816.27),
SIMDE_FLOAT32_C( -209.34), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -728.70), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( 100.32), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( -204.33)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -841.43), SIMDE_FLOAT32_C( -14.16), SIMDE_FLOAT32_C( 824.88), SIMDE_FLOAT32_C( 793.63),
SIMDE_FLOAT32_C( -736.75), SIMDE_FLOAT32_C( -310.57), SIMDE_FLOAT32_C( 728.87), SIMDE_FLOAT32_C( -350.72),
SIMDE_FLOAT32_C( 60.89), SIMDE_FLOAT32_C( 109.81), SIMDE_FLOAT32_C( 715.94), SIMDE_FLOAT32_C( -250.60),
SIMDE_FLOAT32_C( 944.14), SIMDE_FLOAT32_C( 361.85), SIMDE_FLOAT32_C( -13.07), SIMDE_FLOAT32_C( 852.60)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 0.96),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( 0.74)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_sind_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_sind_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( -0.76),
SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( 0.56),
SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( 0.63),
SIMDE_FLOAT64_C( -0.56), SIMDE_FLOAT64_C( -0.23)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.56),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -0.52),
SIMDE_FLOAT64_C( 0.96), SIMDE_FLOAT64_C( 0.88),
SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( 0.95)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 0.48),
SIMDE_FLOAT64_C( -0.41), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( 0.82), SIMDE_FLOAT64_C( -0.63),
SIMDE_FLOAT64_C( -0.84), SIMDE_FLOAT64_C( -0.39)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( -1.00),
SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( -0.54),
SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( 0.54),
SIMDE_FLOAT64_C( 0.97), SIMDE_FLOAT64_C( 0.86)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -770.35), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( 380.46),
SIMDE_FLOAT64_C( -770.72), SIMDE_FLOAT64_C( 993.90),
SIMDE_FLOAT64_C( 28.08), SIMDE_FLOAT64_C( 841.21)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( 0.35),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( -1.00),
SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 0.86)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -387.90), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 339.21),
SIMDE_FLOAT64_C( 532.35), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -30.79)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.47), SIMDE_FLOAT64_C( 0.59),
SIMDE_FLOAT64_C( -0.90), SIMDE_FLOAT64_C( -0.35),
SIMDE_FLOAT64_C( 0.13), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 0.87), SIMDE_FLOAT64_C( -0.51)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -203.65), SIMDE_FLOAT64_C( -80.73),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -944.78),
SIMDE_FLOAT64_C( -747.59), SIMDE_FLOAT64_C( -767.23),
SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( 398.82)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( -0.99),
SIMDE_FLOAT64_C( -0.40), SIMDE_FLOAT64_C( 0.70),
SIMDE_FLOAT64_C( -0.46), SIMDE_FLOAT64_C( -0.73),
SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( 0.63)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 469.66), SIMDE_FLOAT64_C( 680.02),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 600.47),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( 254.31)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.94), SIMDE_FLOAT64_C( -0.64),
SIMDE_FLOAT64_C( -0.52), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( -0.17), SIMDE_FLOAT64_C( -0.87),
SIMDE_FLOAT64_C( 0.95), SIMDE_FLOAT64_C( -0.96)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_sind_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_sind_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -686.13), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 670.24), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( 39.01), SIMDE_FLOAT64_C( 346.63)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 84.77),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( -269.45),
SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( -297.45),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( -754.38)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -0.56)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( 233.37),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -384.03),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -417.54)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 841.21), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 687.09), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -660.80), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -923.64), SIMDE_FLOAT64_C( -860.95)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.86), SIMDE_FLOAT64_C( -1.00),
SIMDE_FLOAT64_C( -0.54), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 0.48),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -0.63)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 398.82), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 339.21), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( -30.79), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( 993.90)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( -387.90),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 532.35),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -770.35),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( -770.72)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( -0.47),
SIMDE_FLOAT64_C( -0.90), SIMDE_FLOAT64_C( 0.13),
SIMDE_FLOAT64_C( 0.87), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( -0.77)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( 469.66),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 910.03),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( -203.65),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -747.59)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 543.35), SIMDE_FLOAT64_C( -171.51),
SIMDE_FLOAT64_C( 680.02), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 600.47), SIMDE_FLOAT64_C( 254.31),
SIMDE_FLOAT64_C( -80.73), SIMDE_FLOAT64_C( -944.78)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( -0.15),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( -0.87), SIMDE_FLOAT64_C( -0.96),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( 0.70)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 99.93), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 343.48),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 655.67)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 331.34), SIMDE_FLOAT64_C( 462.95),
SIMDE_FLOAT64_C( -178.99), SIMDE_FLOAT64_C( 324.62),
SIMDE_FLOAT64_C( -874.31), SIMDE_FLOAT64_C( -328.54),
SIMDE_FLOAT64_C( -192.31), SIMDE_FLOAT64_C( 561.36)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.48), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( -0.58),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( -0.36)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 27.25),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -448.89), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 831.02), SIMDE_FLOAT64_C( 977.36)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 977.49), SIMDE_FLOAT64_C( 424.81),
SIMDE_FLOAT64_C( -95.15), SIMDE_FLOAT64_C( 840.65),
SIMDE_FLOAT64_C( -591.56), SIMDE_FLOAT64_C( 731.49),
SIMDE_FLOAT64_C( 623.70), SIMDE_FLOAT64_C( 140.67)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 0.90),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( 0.63)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( -304.73),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( 822.06),
SIMDE_FLOAT64_C( -997.63), SIMDE_FLOAT64_C( 923.64),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -67.64)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 510.85), SIMDE_FLOAT64_C( 14.34),
SIMDE_FLOAT64_C( 916.26), SIMDE_FLOAT64_C( -769.09),
SIMDE_FLOAT64_C( -573.81), SIMDE_FLOAT64_C( -337.60),
SIMDE_FLOAT64_C( 293.64), SIMDE_FLOAT64_C( -576.22)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( 0.25),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( -0.76),
SIMDE_FLOAT64_C( 0.56), SIMDE_FLOAT64_C( 0.38),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( 0.59)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 475.51), SIMDE_FLOAT64_C( 936.65),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -438.19),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( 932.66),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -182.45)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -775.04), SIMDE_FLOAT64_C( 440.64),
SIMDE_FLOAT64_C( 897.27), SIMDE_FLOAT64_C( -197.89),
SIMDE_FLOAT64_C( -359.76), SIMDE_FLOAT64_C( -33.67),
SIMDE_FLOAT64_C( 7.27), SIMDE_FLOAT64_C( -125.20)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( 0.31),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( -0.55),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -0.82)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_sind_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_sinh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 2.50), SIMDE_FLOAT32_C( 3.47), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 4.79)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 6.05), SIMDE_FLOAT32_C( 16.05), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 60.15)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 5.44), SIMDE_FLOAT32_C( 6.18), SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( 3.45)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 115.22), SIMDE_FLOAT32_C( 241.49), SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( 15.73)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 5.12), SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 5.31)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 471.94), SIMDE_FLOAT32_C( 83.66), SIMDE_FLOAT32_C( 4.19), SIMDE_FLOAT32_C( 101.17)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 3.66), SIMDE_FLOAT32_C( 5.76)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 19.42), SIMDE_FLOAT32_C( 158.67)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 6.30)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 3.59), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 2.13), SIMDE_FLOAT32_C( 272.29)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( 1.65), SIMDE_FLOAT32_C( -0.67)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 15.27), SIMDE_FLOAT32_C( 2.51), SIMDE_FLOAT32_C( -0.72)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( 0.46)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 41.54), SIMDE_FLOAT32_C( 5.41), SIMDE_FLOAT32_C( -1.03), SIMDE_FLOAT32_C( 0.48)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 4.07), SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 4.30), SIMDE_FLOAT32_C( 6.25)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 29.27), SIMDE_FLOAT32_C( 1.82), SIMDE_FLOAT32_C( 36.84), SIMDE_FLOAT32_C( 259.01)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_sinh_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_sinh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 4.79)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 60.15)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 2.50), SIMDE_FLOAT64_C( 3.47)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 6.05), SIMDE_FLOAT64_C( 16.05)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 2.02), SIMDE_FLOAT64_C( 3.45)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 3.70), SIMDE_FLOAT64_C( 15.73)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 5.44), SIMDE_FLOAT64_C( 6.18)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 115.22), SIMDE_FLOAT64_C( 241.49)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 2.14), SIMDE_FLOAT64_C( 5.31)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 4.19), SIMDE_FLOAT64_C( 101.17)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 6.85), SIMDE_FLOAT64_C( 5.12)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 471.94), SIMDE_FLOAT64_C( 83.66)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 3.66), SIMDE_FLOAT64_C( 5.76)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 19.42), SIMDE_FLOAT64_C( 158.67)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.35)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.36)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_sinh_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_sinh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 5.44), SIMDE_FLOAT32_C( 6.18),
SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( 3.45),
SIMDE_FLOAT32_C( 2.50), SIMDE_FLOAT32_C( 3.47),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 4.79)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 115.22), SIMDE_FLOAT32_C( 241.49),
SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( 15.73),
SIMDE_FLOAT32_C( 6.05), SIMDE_FLOAT32_C( 16.05),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 60.15)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.35),
SIMDE_FLOAT32_C( 3.66), SIMDE_FLOAT32_C( 5.76),
SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 5.12),
SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 5.31)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.36),
SIMDE_FLOAT32_C( 19.42), SIMDE_FLOAT32_C( 158.67),
SIMDE_FLOAT32_C( 471.94), SIMDE_FLOAT32_C( 83.66),
SIMDE_FLOAT32_C( 4.19), SIMDE_FLOAT32_C( 101.17)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( 3.42),
SIMDE_FLOAT32_C( 1.65), SIMDE_FLOAT32_C( -0.67),
SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( -0.40),
SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 6.30)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 15.27),
SIMDE_FLOAT32_C( 2.51), SIMDE_FLOAT32_C( -0.72),
SIMDE_FLOAT32_C( 3.59), SIMDE_FLOAT32_C( -0.41),
SIMDE_FLOAT32_C( 2.13), SIMDE_FLOAT32_C( 272.29)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 4.07), SIMDE_FLOAT32_C( 1.36),
SIMDE_FLOAT32_C( 4.30), SIMDE_FLOAT32_C( 6.25),
SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 2.39),
SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( 0.46)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 29.27), SIMDE_FLOAT32_C( 1.82),
SIMDE_FLOAT32_C( 36.84), SIMDE_FLOAT32_C( 259.01),
SIMDE_FLOAT32_C( 41.54), SIMDE_FLOAT32_C( 5.41),
SIMDE_FLOAT32_C( -1.03), SIMDE_FLOAT32_C( 0.48)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 5.21),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 4.94),
SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 7.57),
SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( 6.92)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 91.54),
SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 69.88),
SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 969.57),
SIMDE_FLOAT32_C( 15.27), SIMDE_FLOAT32_C( 506.16)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.63), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 6.12), SIMDE_FLOAT32_C( 4.76),
SIMDE_FLOAT32_C( 5.59), SIMDE_FLOAT32_C( 2.16),
SIMDE_FLOAT32_C( 6.66), SIMDE_FLOAT32_C( 3.17)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 74.20),
SIMDE_FLOAT32_C( 227.43), SIMDE_FLOAT32_C( 58.37),
SIMDE_FLOAT32_C( 133.87), SIMDE_FLOAT32_C( 4.28),
SIMDE_FLOAT32_C( 390.27), SIMDE_FLOAT32_C( 11.88)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 2.95),
SIMDE_FLOAT32_C( 4.75), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 5.01)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 5.58), SIMDE_FLOAT32_C( 9.53),
SIMDE_FLOAT32_C( 57.79), SIMDE_FLOAT32_C( -0.84),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 1.06), SIMDE_FLOAT32_C( 74.95)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 5.32), SIMDE_FLOAT32_C( 6.22),
SIMDE_FLOAT32_C( 2.66), SIMDE_FLOAT32_C( 6.82),
SIMDE_FLOAT32_C( 7.21), SIMDE_FLOAT32_C( 5.88),
SIMDE_FLOAT32_C( 6.70), SIMDE_FLOAT32_C( 4.39)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 102.19), SIMDE_FLOAT32_C( 251.35),
SIMDE_FLOAT32_C( 7.11), SIMDE_FLOAT32_C( 457.99),
SIMDE_FLOAT32_C( 676.45), SIMDE_FLOAT32_C( 178.90),
SIMDE_FLOAT32_C( 406.20), SIMDE_FLOAT32_C( 40.31)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_sinh_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_sinh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 2.50), SIMDE_FLOAT64_C( 3.47),
SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 4.79)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 6.05), SIMDE_FLOAT64_C( 16.05),
SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 60.15)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 5.44), SIMDE_FLOAT64_C( 6.18),
SIMDE_FLOAT64_C( 2.02), SIMDE_FLOAT64_C( 3.45)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 115.22), SIMDE_FLOAT64_C( 241.49),
SIMDE_FLOAT64_C( 3.70), SIMDE_FLOAT64_C( 15.73)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 6.85), SIMDE_FLOAT64_C( 5.12),
SIMDE_FLOAT64_C( 2.14), SIMDE_FLOAT64_C( 5.31)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 471.94), SIMDE_FLOAT64_C( 83.66),
SIMDE_FLOAT64_C( 4.19), SIMDE_FLOAT64_C( 101.17)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.35),
SIMDE_FLOAT64_C( 3.66), SIMDE_FLOAT64_C( 5.76)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.36),
SIMDE_FLOAT64_C( 19.42), SIMDE_FLOAT64_C( 158.67)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.99), SIMDE_FLOAT64_C( -0.40),
SIMDE_FLOAT64_C( 1.50), SIMDE_FLOAT64_C( 6.30)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.59), SIMDE_FLOAT64_C( -0.41),
SIMDE_FLOAT64_C( 2.13), SIMDE_FLOAT64_C( 272.29)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.39), SIMDE_FLOAT64_C( 3.42),
SIMDE_FLOAT64_C( 1.65), SIMDE_FLOAT64_C( -0.67)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.88), SIMDE_FLOAT64_C( 15.27),
SIMDE_FLOAT64_C( 2.51), SIMDE_FLOAT64_C( -0.72)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 4.42), SIMDE_FLOAT64_C( 2.39),
SIMDE_FLOAT64_C( -0.90), SIMDE_FLOAT64_C( 0.46)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 41.54), SIMDE_FLOAT64_C( 5.41),
SIMDE_FLOAT64_C( -1.03), SIMDE_FLOAT64_C( 0.48)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 4.07), SIMDE_FLOAT64_C( 1.36),
SIMDE_FLOAT64_C( 4.30), SIMDE_FLOAT64_C( 6.25)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 29.27), SIMDE_FLOAT64_C( 1.82),
SIMDE_FLOAT64_C( 36.84), SIMDE_FLOAT64_C( 259.01)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_sinh_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_sinh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 3.66), SIMDE_FLOAT32_C( 5.76),
SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 5.12), SIMDE_FLOAT32_C( 2.14), SIMDE_FLOAT32_C( 5.31),
SIMDE_FLOAT32_C( 5.44), SIMDE_FLOAT32_C( 6.18), SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( 3.45),
SIMDE_FLOAT32_C( 2.50), SIMDE_FLOAT32_C( 3.47), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 4.79)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( 19.42), SIMDE_FLOAT32_C( 158.67),
SIMDE_FLOAT32_C( 471.94), SIMDE_FLOAT32_C( 83.66), SIMDE_FLOAT32_C( 4.19), SIMDE_FLOAT32_C( 101.17),
SIMDE_FLOAT32_C( 115.22), SIMDE_FLOAT32_C( 241.49), SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( 15.73),
SIMDE_FLOAT32_C( 6.05), SIMDE_FLOAT32_C( 16.05), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 60.15)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.07), SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 4.30), SIMDE_FLOAT32_C( 6.25),
SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( -0.90), SIMDE_FLOAT32_C( 0.46),
SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( 1.65), SIMDE_FLOAT32_C( -0.67),
SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 1.50), SIMDE_FLOAT32_C( 6.30)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 29.27), SIMDE_FLOAT32_C( 1.82), SIMDE_FLOAT32_C( 36.84), SIMDE_FLOAT32_C( 259.01),
SIMDE_FLOAT32_C( 41.54), SIMDE_FLOAT32_C( 5.41), SIMDE_FLOAT32_C( -1.03), SIMDE_FLOAT32_C( 0.48),
SIMDE_FLOAT32_C( 1.88), SIMDE_FLOAT32_C( 15.27), SIMDE_FLOAT32_C( 2.51), SIMDE_FLOAT32_C( -0.72),
SIMDE_FLOAT32_C( 3.59), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 2.13), SIMDE_FLOAT32_C( 272.29)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.63), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 6.12), SIMDE_FLOAT32_C( 4.76),
SIMDE_FLOAT32_C( 5.59), SIMDE_FLOAT32_C( 2.16), SIMDE_FLOAT32_C( 6.66), SIMDE_FLOAT32_C( 3.17),
SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 5.21), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 4.94),
SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 7.57), SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( 6.92)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 74.20), SIMDE_FLOAT32_C( 227.43), SIMDE_FLOAT32_C( 58.37),
SIMDE_FLOAT32_C( 133.87), SIMDE_FLOAT32_C( 4.28), SIMDE_FLOAT32_C( 390.27), SIMDE_FLOAT32_C( 11.88),
SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 91.54), SIMDE_FLOAT32_C( 0.87), SIMDE_FLOAT32_C( 69.88),
SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 969.57), SIMDE_FLOAT32_C( 15.27), SIMDE_FLOAT32_C( 506.16)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.32), SIMDE_FLOAT32_C( 6.22), SIMDE_FLOAT32_C( 2.66), SIMDE_FLOAT32_C( 6.82),
SIMDE_FLOAT32_C( 7.21), SIMDE_FLOAT32_C( 5.88), SIMDE_FLOAT32_C( 6.70), SIMDE_FLOAT32_C( 4.39),
SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 2.95), SIMDE_FLOAT32_C( 4.75), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 5.01)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 102.19), SIMDE_FLOAT32_C( 251.35), SIMDE_FLOAT32_C( 7.11), SIMDE_FLOAT32_C( 457.99),
SIMDE_FLOAT32_C( 676.45), SIMDE_FLOAT32_C( 178.90), SIMDE_FLOAT32_C( 406.20), SIMDE_FLOAT32_C( 40.31),
SIMDE_FLOAT32_C( 5.58), SIMDE_FLOAT32_C( 9.53), SIMDE_FLOAT32_C( 57.79), SIMDE_FLOAT32_C( -0.84),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( 1.06), SIMDE_FLOAT32_C( 74.95)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 6.56), SIMDE_FLOAT32_C( 4.70), SIMDE_FLOAT32_C( 4.78),
SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( 1.04),
SIMDE_FLOAT32_C( 2.47), SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( 5.71), SIMDE_FLOAT32_C( 6.12),
SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( 5.64), SIMDE_FLOAT32_C( 3.82), SIMDE_FLOAT32_C( 2.56)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 6.24), SIMDE_FLOAT32_C( 353.14), SIMDE_FLOAT32_C( 54.97), SIMDE_FLOAT32_C( 59.55),
SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 3.23), SIMDE_FLOAT32_C( 1.24),
SIMDE_FLOAT32_C( 5.87), SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( 150.93), SIMDE_FLOAT32_C( 227.43),
SIMDE_FLOAT32_C( 10.02), SIMDE_FLOAT32_C( 140.73), SIMDE_FLOAT32_C( 22.79), SIMDE_FLOAT32_C( 6.43)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 6.27), SIMDE_FLOAT32_C( 6.91), SIMDE_FLOAT32_C( 3.21), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 1.37), SIMDE_FLOAT32_C( 6.45), SIMDE_FLOAT32_C( 5.47), SIMDE_FLOAT32_C( 5.98),
SIMDE_FLOAT32_C( 6.87), SIMDE_FLOAT32_C( 3.90), SIMDE_FLOAT32_C( 7.50), SIMDE_FLOAT32_C( -0.60),
SIMDE_FLOAT32_C( 4.72), SIMDE_FLOAT32_C( 3.73), SIMDE_FLOAT32_C( 5.29), SIMDE_FLOAT32_C( 0.13)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 264.24), SIMDE_FLOAT32_C( 501.12), SIMDE_FLOAT32_C( 12.37), SIMDE_FLOAT32_C( 0.84),
SIMDE_FLOAT32_C( 1.84), SIMDE_FLOAT32_C( 316.35), SIMDE_FLOAT32_C( 118.73), SIMDE_FLOAT32_C( 197.72),
SIMDE_FLOAT32_C( 481.47), SIMDE_FLOAT32_C( 24.69), SIMDE_FLOAT32_C( 904.02), SIMDE_FLOAT32_C( -0.64),
SIMDE_FLOAT32_C( 56.08), SIMDE_FLOAT32_C( 20.83), SIMDE_FLOAT32_C( 99.17), SIMDE_FLOAT32_C( 0.13)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 6.83), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.99),
SIMDE_FLOAT32_C( 1.85), SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( 4.56), SIMDE_FLOAT32_C( -0.00),
SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 3.01), SIMDE_FLOAT32_C( 6.35), SIMDE_FLOAT32_C( 7.50),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 5.13), SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( 2.89)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 462.59), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( -1.16),
SIMDE_FLOAT32_C( 3.10), SIMDE_FLOAT32_C( 718.27), SIMDE_FLOAT32_C( 47.79), SIMDE_FLOAT32_C( -0.00),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 10.12), SIMDE_FLOAT32_C( 286.25), SIMDE_FLOAT32_C( 904.02),
SIMDE_FLOAT32_C( 0.05), SIMDE_FLOAT32_C( 84.51), SIMDE_FLOAT32_C( 15.27), SIMDE_FLOAT32_C( 8.97)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 3.16),
SIMDE_FLOAT32_C( 7.31), SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( 2.76),
SIMDE_FLOAT32_C( 2.52), SIMDE_FLOAT32_C( 3.47), SIMDE_FLOAT32_C( 5.50), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 3.36), SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 7.24), SIMDE_FLOAT32_C( 0.30)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 2.79), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 11.76),
SIMDE_FLOAT32_C( 747.59), SIMDE_FLOAT32_C( 13.95), SIMDE_FLOAT32_C( 3.23), SIMDE_FLOAT32_C( 7.87),
SIMDE_FLOAT32_C( 6.17), SIMDE_FLOAT32_C( 16.05), SIMDE_FLOAT32_C( 122.34), SIMDE_FLOAT32_C( 74.20),
SIMDE_FLOAT32_C( 14.38), SIMDE_FLOAT32_C( 3.59), SIMDE_FLOAT32_C( 697.05), SIMDE_FLOAT32_C( 0.30)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_sinh_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_sinh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 6.25), SIMDE_FLOAT32_C( 2.39), SIMDE_FLOAT32_C( 0.46),
SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 6.30),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 5.76), SIMDE_FLOAT32_C( 5.12), SIMDE_FLOAT32_C( 5.31),
SIMDE_FLOAT32_C( 6.18), SIMDE_FLOAT32_C( 3.45), SIMDE_FLOAT32_C( 3.47), SIMDE_FLOAT32_C( 4.79)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 4.07), SIMDE_FLOAT32_C( 4.30), SIMDE_FLOAT32_C( 4.42), SIMDE_FLOAT32_C( -0.90),
SIMDE_FLOAT32_C( 1.39), SIMDE_FLOAT32_C( 1.65), SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 1.50),
SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 3.66), SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 2.14),
SIMDE_FLOAT32_C( 5.44), SIMDE_FLOAT32_C( 2.02), SIMDE_FLOAT32_C( 2.50), SIMDE_FLOAT32_C( 0.06)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 29.27), SIMDE_FLOAT32_C( 6.25), SIMDE_FLOAT32_C( 41.54), SIMDE_FLOAT32_C( 0.46),
SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 2.13),
SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 19.42), SIMDE_FLOAT32_C( 471.94), SIMDE_FLOAT32_C( 4.19),
SIMDE_FLOAT32_C( 115.22), SIMDE_FLOAT32_C( 3.45), SIMDE_FLOAT32_C( 6.05), SIMDE_FLOAT32_C( 4.79)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.32), SIMDE_FLOAT32_C( 2.66), SIMDE_FLOAT32_C( 7.21), SIMDE_FLOAT32_C( 6.70),
SIMDE_FLOAT32_C( 2.42), SIMDE_FLOAT32_C( 4.75), SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.92),
SIMDE_FLOAT32_C( 1.63), SIMDE_FLOAT32_C( 6.12), SIMDE_FLOAT32_C( 5.59), SIMDE_FLOAT32_C( 6.66),
SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 3.42)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.56), SIMDE_FLOAT32_C( 6.22), SIMDE_FLOAT32_C( 6.82), SIMDE_FLOAT32_C( 5.88),
SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( 2.95), SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 5.01), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 4.76), SIMDE_FLOAT32_C( 2.16),
SIMDE_FLOAT32_C( 3.17), SIMDE_FLOAT32_C( 5.21), SIMDE_FLOAT32_C( 4.94), SIMDE_FLOAT32_C( 7.57)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 6.43), SIMDE_FLOAT32_C( 2.66), SIMDE_FLOAT32_C( 7.21), SIMDE_FLOAT32_C( 6.70),
SIMDE_FLOAT32_C( 40.31), SIMDE_FLOAT32_C( 9.53), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( 0.00),
SIMDE_FLOAT32_C( 74.95), SIMDE_FLOAT32_C( 6.12), SIMDE_FLOAT32_C( 58.37), SIMDE_FLOAT32_C( 4.28),
SIMDE_FLOAT32_C( 11.88), SIMDE_FLOAT32_C( 91.54), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 969.57)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.89), SIMDE_FLOAT32_C( 6.91), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 6.45),
SIMDE_FLOAT32_C( 5.98), SIMDE_FLOAT32_C( 3.90), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 3.73),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 6.56), SIMDE_FLOAT32_C( 4.78), SIMDE_FLOAT32_C( -0.13),
SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( 6.12), SIMDE_FLOAT32_C( 5.64)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( 6.27), SIMDE_FLOAT32_C( 3.21), SIMDE_FLOAT32_C( 1.37),
SIMDE_FLOAT32_C( 5.47), SIMDE_FLOAT32_C( 6.87), SIMDE_FLOAT32_C( 7.50), SIMDE_FLOAT32_C( 4.72),
SIMDE_FLOAT32_C( 5.29), SIMDE_FLOAT32_C( 2.53), SIMDE_FLOAT32_C( 4.70), SIMDE_FLOAT32_C( -0.46),
SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( 2.47), SIMDE_FLOAT32_C( 5.71), SIMDE_FLOAT32_C( 3.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 2.89), SIMDE_FLOAT32_C( 264.24), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 6.45),
SIMDE_FLOAT32_C( 5.98), SIMDE_FLOAT32_C( 3.90), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 56.08),
SIMDE_FLOAT32_C( 99.17), SIMDE_FLOAT32_C( 6.56), SIMDE_FLOAT32_C( 54.97), SIMDE_FLOAT32_C( -0.13),
SIMDE_FLOAT32_C( 1.04), SIMDE_FLOAT32_C( 5.87), SIMDE_FLOAT32_C( 6.12), SIMDE_FLOAT32_C( 5.64)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.80), SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 7.31),
SIMDE_FLOAT32_C( 1.89), SIMDE_FLOAT32_C( 2.52), SIMDE_FLOAT32_C( 5.50), SIMDE_FLOAT32_C( 3.36),
SIMDE_FLOAT32_C( 7.24), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 1.85),
SIMDE_FLOAT32_C( 4.56), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 6.35), SIMDE_FLOAT32_C( 0.05)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.16), SIMDE_FLOAT32_C( 2.45), SIMDE_FLOAT32_C( 1.75), SIMDE_FLOAT32_C( 3.16),
SIMDE_FLOAT32_C( 3.33), SIMDE_FLOAT32_C( 2.76), SIMDE_FLOAT32_C( 3.47), SIMDE_FLOAT32_C( 5.00),
SIMDE_FLOAT32_C( 1.99), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( 6.83), SIMDE_FLOAT32_C( -0.99),
SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -0.00), SIMDE_FLOAT32_C( 3.01), SIMDE_FLOAT32_C( 7.50)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 1.80), SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 7.31),
SIMDE_FLOAT32_C( 13.95), SIMDE_FLOAT32_C( 2.52), SIMDE_FLOAT32_C( 5.50), SIMDE_FLOAT32_C( 3.36),
SIMDE_FLOAT32_C( 7.24), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 462.59), SIMDE_FLOAT32_C( -1.16),
SIMDE_FLOAT32_C( 718.27), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 10.12), SIMDE_FLOAT32_C( 904.02)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.23), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 4.07),
SIMDE_FLOAT32_C( 4.79), SIMDE_FLOAT32_C( 7.12), SIMDE_FLOAT32_C( 7.24), SIMDE_FLOAT32_C( 3.87),
SIMDE_FLOAT32_C( 5.39), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 4.25), SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 5.19)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.44), SIMDE_FLOAT32_C( 7.24), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 3.99),
SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 5.13), SIMDE_FLOAT32_C( 7.31), SIMDE_FLOAT32_C( 3.77),
SIMDE_FLOAT32_C( 6.86), SIMDE_FLOAT32_C( 2.97), SIMDE_FLOAT32_C( 4.32), SIMDE_FLOAT32_C( 1.67),
SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 5.34)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.23), SIMDE_FLOAT32_C( 697.05), SIMDE_FLOAT32_C( 1.95), SIMDE_FLOAT32_C( 27.02),
SIMDE_FLOAT32_C( 4.79), SIMDE_FLOAT32_C( 84.51), SIMDE_FLOAT32_C( 747.59), SIMDE_FLOAT32_C( 21.68),
SIMDE_FLOAT32_C( 5.39), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 4.25), SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 5.19)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.10), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 3.00), SIMDE_FLOAT32_C( -0.07),
SIMDE_FLOAT32_C( 3.70), SIMDE_FLOAT32_C( 4.19), SIMDE_FLOAT32_C( 6.89), SIMDE_FLOAT32_C( 3.02),
SIMDE_FLOAT32_C( 2.07), SIMDE_FLOAT32_C( 2.38), SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 6.67),
SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 5.69), SIMDE_FLOAT32_C( 5.19), SIMDE_FLOAT32_C( 5.17)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 1.55), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 6.70), SIMDE_FLOAT32_C( 3.32), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( 3.99),
SIMDE_FLOAT32_C( 5.79), SIMDE_FLOAT32_C( 3.25), SIMDE_FLOAT32_C( 1.97), SIMDE_FLOAT32_C( 0.21),
SIMDE_FLOAT32_C( 4.74), SIMDE_FLOAT32_C( 4.48), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( 1.19)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 7.10), SIMDE_FLOAT32_C( 2.25), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( -0.07),
SIMDE_FLOAT32_C( 406.20), SIMDE_FLOAT32_C( 4.19), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( 27.02),
SIMDE_FLOAT32_C( 2.07), SIMDE_FLOAT32_C( 2.38), SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 6.67),
SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 44.11), SIMDE_FLOAT32_C( 5.19), SIMDE_FLOAT32_C( 5.17)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.58), SIMDE_FLOAT32_C( 3.19), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 6.05),
SIMDE_FLOAT32_C( 5.24), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( 3.75), SIMDE_FLOAT32_C( 5.84),
SIMDE_FLOAT32_C( 3.43), SIMDE_FLOAT32_C( 6.03), SIMDE_FLOAT32_C( 2.94), SIMDE_FLOAT32_C( 3.64),
SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( 5.91), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( -0.52)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.38), SIMDE_FLOAT32_C( 6.78), SIMDE_FLOAT32_C( 3.92), SIMDE_FLOAT32_C( 1.46),
SIMDE_FLOAT32_C( 4.63), SIMDE_FLOAT32_C( 2.54), SIMDE_FLOAT32_C( 1.33), SIMDE_FLOAT32_C( 6.22),
SIMDE_FLOAT32_C( 3.58), SIMDE_FLOAT32_C( 2.15), SIMDE_FLOAT32_C( 3.80), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 3.13), SIMDE_FLOAT32_C( 2.22), SIMDE_FLOAT32_C( 3.07), SIMDE_FLOAT32_C( 3.70)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 5.58), SIMDE_FLOAT32_C( 3.19), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 6.05),
SIMDE_FLOAT32_C( 5.24), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( 1.76), SIMDE_FLOAT32_C( 251.35),
SIMDE_FLOAT32_C( 17.92), SIMDE_FLOAT32_C( 6.03), SIMDE_FLOAT32_C( 22.34), SIMDE_FLOAT32_C( 0.92),
SIMDE_FLOAT32_C( 11.42), SIMDE_FLOAT32_C( 5.91), SIMDE_FLOAT32_C( 5.00), SIMDE_FLOAT32_C( 20.21)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( 4.72), SIMDE_FLOAT32_C( 1.18), SIMDE_FLOAT32_C( 6.84),
SIMDE_FLOAT32_C( 7.41), SIMDE_FLOAT32_C( 7.40), SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( -0.21),
SIMDE_FLOAT32_C( 2.40), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( 0.17), SIMDE_FLOAT32_C( 1.49),
SIMDE_FLOAT32_C( 3.73), SIMDE_FLOAT32_C( 3.74), SIMDE_FLOAT32_C( 5.19), SIMDE_FLOAT32_C( 2.42)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( 3.24), SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 6.71),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 1.96), SIMDE_FLOAT32_C( 6.43), SIMDE_FLOAT32_C( 1.79),
SIMDE_FLOAT32_C( 3.56), SIMDE_FLOAT32_C( 3.77), SIMDE_FLOAT32_C( 6.38), SIMDE_FLOAT32_C( 2.22),
SIMDE_FLOAT32_C( 7.36), SIMDE_FLOAT32_C( 4.86), SIMDE_FLOAT32_C( 3.24), SIMDE_FLOAT32_C( 6.97)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.09), SIMDE_FLOAT32_C( 4.72), SIMDE_FLOAT32_C( 471.94), SIMDE_FLOAT32_C( 410.28),
SIMDE_FLOAT32_C( 7.41), SIMDE_FLOAT32_C( 7.40), SIMDE_FLOAT32_C( 6.85), SIMDE_FLOAT32_C( 2.91),
SIMDE_FLOAT32_C( 17.57), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( 294.96), SIMDE_FLOAT32_C( 1.49),
SIMDE_FLOAT32_C( 785.92), SIMDE_FLOAT32_C( 3.74), SIMDE_FLOAT32_C( 5.19), SIMDE_FLOAT32_C( 532.11)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_sinh_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_sinh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.44), SIMDE_FLOAT64_C( 6.18),
SIMDE_FLOAT64_C( 2.02), SIMDE_FLOAT64_C( 3.45),
SIMDE_FLOAT64_C( 2.50), SIMDE_FLOAT64_C( 3.47),
SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 4.79)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 115.22), SIMDE_FLOAT64_C( 241.49),
SIMDE_FLOAT64_C( 3.70), SIMDE_FLOAT64_C( 15.73),
SIMDE_FLOAT64_C( 6.05), SIMDE_FLOAT64_C( 16.05),
SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 60.15)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.35),
SIMDE_FLOAT64_C( 3.66), SIMDE_FLOAT64_C( 5.76),
SIMDE_FLOAT64_C( 6.85), SIMDE_FLOAT64_C( 5.12),
SIMDE_FLOAT64_C( 2.14), SIMDE_FLOAT64_C( 5.31)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.36),
SIMDE_FLOAT64_C( 19.42), SIMDE_FLOAT64_C( 158.67),
SIMDE_FLOAT64_C( 471.94), SIMDE_FLOAT64_C( 83.66),
SIMDE_FLOAT64_C( 4.19), SIMDE_FLOAT64_C( 101.17)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.39), SIMDE_FLOAT64_C( 3.42),
SIMDE_FLOAT64_C( 1.65), SIMDE_FLOAT64_C( -0.67),
SIMDE_FLOAT64_C( 1.99), SIMDE_FLOAT64_C( -0.40),
SIMDE_FLOAT64_C( 1.50), SIMDE_FLOAT64_C( 6.30)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.88), SIMDE_FLOAT64_C( 15.27),
SIMDE_FLOAT64_C( 2.51), SIMDE_FLOAT64_C( -0.72),
SIMDE_FLOAT64_C( 3.59), SIMDE_FLOAT64_C( -0.41),
SIMDE_FLOAT64_C( 2.13), SIMDE_FLOAT64_C( 272.29)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 4.07), SIMDE_FLOAT64_C( 1.36),
SIMDE_FLOAT64_C( 4.30), SIMDE_FLOAT64_C( 6.25),
SIMDE_FLOAT64_C( 4.42), SIMDE_FLOAT64_C( 2.39),
SIMDE_FLOAT64_C( -0.90), SIMDE_FLOAT64_C( 0.46)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 29.27), SIMDE_FLOAT64_C( 1.82),
SIMDE_FLOAT64_C( 36.84), SIMDE_FLOAT64_C( 259.01),
SIMDE_FLOAT64_C( 41.54), SIMDE_FLOAT64_C( 5.41),
SIMDE_FLOAT64_C( -1.03), SIMDE_FLOAT64_C( 0.48)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( 5.21),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 4.94),
SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( 7.57),
SIMDE_FLOAT64_C( 3.42), SIMDE_FLOAT64_C( 6.92)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( 91.54),
SIMDE_FLOAT64_C( 0.87), SIMDE_FLOAT64_C( 69.88),
SIMDE_FLOAT64_C( -0.01), SIMDE_FLOAT64_C( 969.57),
SIMDE_FLOAT64_C( 15.27), SIMDE_FLOAT64_C( 506.16)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.63), SIMDE_FLOAT64_C( 5.00),
SIMDE_FLOAT64_C( 6.12), SIMDE_FLOAT64_C( 4.76),
SIMDE_FLOAT64_C( 5.59), SIMDE_FLOAT64_C( 2.16),
SIMDE_FLOAT64_C( 6.66), SIMDE_FLOAT64_C( 3.17)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.45), SIMDE_FLOAT64_C( 74.20),
SIMDE_FLOAT64_C( 227.43), SIMDE_FLOAT64_C( 58.37),
SIMDE_FLOAT64_C( 133.87), SIMDE_FLOAT64_C( 4.28),
SIMDE_FLOAT64_C( 390.27), SIMDE_FLOAT64_C( 11.88)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 2.42), SIMDE_FLOAT64_C( 2.95),
SIMDE_FLOAT64_C( 4.75), SIMDE_FLOAT64_C( -0.76),
SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 5.01)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.58), SIMDE_FLOAT64_C( 9.53),
SIMDE_FLOAT64_C( 57.79), SIMDE_FLOAT64_C( -0.84),
SIMDE_FLOAT64_C( 0.09), SIMDE_FLOAT64_C( 0.00),
SIMDE_FLOAT64_C( 1.06), SIMDE_FLOAT64_C( 74.95)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.32), SIMDE_FLOAT64_C( 6.22),
SIMDE_FLOAT64_C( 2.66), SIMDE_FLOAT64_C( 6.82),
SIMDE_FLOAT64_C( 7.21), SIMDE_FLOAT64_C( 5.88),
SIMDE_FLOAT64_C( 6.70), SIMDE_FLOAT64_C( 4.39)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 102.19), SIMDE_FLOAT64_C( 251.35),
SIMDE_FLOAT64_C( 7.11), SIMDE_FLOAT64_C( 457.99),
SIMDE_FLOAT64_C( 676.45), SIMDE_FLOAT64_C( 178.90),
SIMDE_FLOAT64_C( 406.20), SIMDE_FLOAT64_C( 40.31)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_sinh_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_sinh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.35), SIMDE_FLOAT64_C( 5.76),
SIMDE_FLOAT64_C( 5.12), SIMDE_FLOAT64_C( 5.31),
SIMDE_FLOAT64_C( 6.18), SIMDE_FLOAT64_C( 3.45),
SIMDE_FLOAT64_C( 3.47), SIMDE_FLOAT64_C( 4.79)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 3.66),
SIMDE_FLOAT64_C( 6.85), SIMDE_FLOAT64_C( 2.14),
SIMDE_FLOAT64_C( 5.44), SIMDE_FLOAT64_C( 2.02),
SIMDE_FLOAT64_C( 2.50), SIMDE_FLOAT64_C( 0.06)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 5.76),
SIMDE_FLOAT64_C( 5.12), SIMDE_FLOAT64_C( 5.31),
SIMDE_FLOAT64_C( 115.22), SIMDE_FLOAT64_C( 3.45),
SIMDE_FLOAT64_C( 6.05), SIMDE_FLOAT64_C( 0.06)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 4.07), SIMDE_FLOAT64_C( 4.30),
SIMDE_FLOAT64_C( 4.42), SIMDE_FLOAT64_C( -0.90),
SIMDE_FLOAT64_C( 1.39), SIMDE_FLOAT64_C( 1.65),
SIMDE_FLOAT64_C( 1.99), SIMDE_FLOAT64_C( 1.50)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 6.92), SIMDE_FLOAT64_C( 1.36),
SIMDE_FLOAT64_C( 6.25), SIMDE_FLOAT64_C( 2.39),
SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 3.42),
SIMDE_FLOAT64_C( -0.67), SIMDE_FLOAT64_C( -0.40)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 506.16), SIMDE_FLOAT64_C( 1.82),
SIMDE_FLOAT64_C( 259.01), SIMDE_FLOAT64_C( -0.90),
SIMDE_FLOAT64_C( 1.39), SIMDE_FLOAT64_C( 15.27),
SIMDE_FLOAT64_C( 1.99), SIMDE_FLOAT64_C( -0.41)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.01), SIMDE_FLOAT64_C( 5.00),
SIMDE_FLOAT64_C( 4.76), SIMDE_FLOAT64_C( 2.16),
SIMDE_FLOAT64_C( 3.17), SIMDE_FLOAT64_C( 5.21),
SIMDE_FLOAT64_C( 4.94), SIMDE_FLOAT64_C( 7.57)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( 1.63),
SIMDE_FLOAT64_C( 6.12), SIMDE_FLOAT64_C( 5.59),
SIMDE_FLOAT64_C( 6.66), SIMDE_FLOAT64_C( -0.01),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( -0.01)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.06), SIMDE_FLOAT64_C( 2.45),
SIMDE_FLOAT64_C( 227.43), SIMDE_FLOAT64_C( 133.87),
SIMDE_FLOAT64_C( 390.27), SIMDE_FLOAT64_C( -0.01),
SIMDE_FLOAT64_C( 4.94), SIMDE_FLOAT64_C( -0.01)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.82), SIMDE_FLOAT64_C( 5.32),
SIMDE_FLOAT64_C( 2.66), SIMDE_FLOAT64_C( 7.21),
SIMDE_FLOAT64_C( 6.70), SIMDE_FLOAT64_C( 2.42),
SIMDE_FLOAT64_C( 4.75), SIMDE_FLOAT64_C( 0.09)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.64), SIMDE_FLOAT64_C( 2.56),
SIMDE_FLOAT64_C( 6.22), SIMDE_FLOAT64_C( 6.82),
SIMDE_FLOAT64_C( 5.88), SIMDE_FLOAT64_C( 4.39),
SIMDE_FLOAT64_C( 2.95), SIMDE_FLOAT64_C( -0.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.82), SIMDE_FLOAT64_C( 6.43),
SIMDE_FLOAT64_C( 2.66), SIMDE_FLOAT64_C( 457.99),
SIMDE_FLOAT64_C( 178.90), SIMDE_FLOAT64_C( 40.31),
SIMDE_FLOAT64_C( 4.75), SIMDE_FLOAT64_C( -0.84)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.73), SIMDE_FLOAT64_C( 0.13),
SIMDE_FLOAT64_C( 6.56), SIMDE_FLOAT64_C( 4.78),
SIMDE_FLOAT64_C( -0.13), SIMDE_FLOAT64_C( 1.04),
SIMDE_FLOAT64_C( -0.24), SIMDE_FLOAT64_C( 6.12)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 4.72), SIMDE_FLOAT64_C( 5.29),
SIMDE_FLOAT64_C( 2.53), SIMDE_FLOAT64_C( 4.70),
SIMDE_FLOAT64_C( -0.46), SIMDE_FLOAT64_C( 1.89),
SIMDE_FLOAT64_C( 2.47), SIMDE_FLOAT64_C( 5.71)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 56.08), SIMDE_FLOAT64_C( 0.13),
SIMDE_FLOAT64_C( 6.56), SIMDE_FLOAT64_C( 54.97),
SIMDE_FLOAT64_C( -0.13), SIMDE_FLOAT64_C( 1.04),
SIMDE_FLOAT64_C( -0.24), SIMDE_FLOAT64_C( 150.93)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 3.42),
SIMDE_FLOAT64_C( 6.27), SIMDE_FLOAT64_C( 3.21),
SIMDE_FLOAT64_C( 1.37), SIMDE_FLOAT64_C( 5.47),
SIMDE_FLOAT64_C( 6.87), SIMDE_FLOAT64_C( 7.50)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 7.50), SIMDE_FLOAT64_C( 5.13),
SIMDE_FLOAT64_C( 2.89), SIMDE_FLOAT64_C( 6.91),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 6.45),
SIMDE_FLOAT64_C( 5.98), SIMDE_FLOAT64_C( 3.90)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.05), SIMDE_FLOAT64_C( 84.51),
SIMDE_FLOAT64_C( 6.27), SIMDE_FLOAT64_C( 3.21),
SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( 5.47),
SIMDE_FLOAT64_C( 197.72), SIMDE_FLOAT64_C( 24.69)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 1.99),
SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( 6.83),
SIMDE_FLOAT64_C( -0.99), SIMDE_FLOAT64_C( 7.27),
SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 3.01)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.50), SIMDE_FLOAT64_C( 3.36),
SIMDE_FLOAT64_C( 7.24), SIMDE_FLOAT64_C( -0.01),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 1.85),
SIMDE_FLOAT64_C( 4.56), SIMDE_FLOAT64_C( 0.82)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.00), SIMDE_FLOAT64_C( 14.38),
SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( -0.01),
SIMDE_FLOAT64_C( 0.93), SIMDE_FLOAT64_C( 3.10),
SIMDE_FLOAT64_C( -0.00), SIMDE_FLOAT64_C( 0.92)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 5.34), SIMDE_FLOAT64_C( 7.33),
SIMDE_FLOAT64_C( 1.80), SIMDE_FLOAT64_C( 1.42),
SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 7.31),
SIMDE_FLOAT64_C( 1.89), SIMDE_FLOAT64_C( 2.52)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( 5.19),
SIMDE_FLOAT64_C( 7.16), SIMDE_FLOAT64_C( 2.45),
SIMDE_FLOAT64_C( 1.75), SIMDE_FLOAT64_C( 3.16),
SIMDE_FLOAT64_C( 3.33), SIMDE_FLOAT64_C( 2.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( 89.73),
SIMDE_FLOAT64_C( 1.80), SIMDE_FLOAT64_C( 5.75),
SIMDE_FLOAT64_C( 0.06), SIMDE_FLOAT64_C( 11.76),
SIMDE_FLOAT64_C( 1.89), SIMDE_FLOAT64_C( 7.87)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_sinh_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_svml_ceil_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -169.65), SIMDE_FLOAT32_C( 267.82), SIMDE_FLOAT32_C( 302.20), SIMDE_FLOAT32_C( -31.93) },
{ SIMDE_FLOAT32_C( -169.00), SIMDE_FLOAT32_C( 268.00), SIMDE_FLOAT32_C( 303.00), SIMDE_FLOAT32_C( -31.00) } },
{ { SIMDE_FLOAT32_C( -142.32), SIMDE_FLOAT32_C( -661.66), SIMDE_FLOAT32_C( 156.37), SIMDE_FLOAT32_C( 396.69) },
{ SIMDE_FLOAT32_C( -142.00), SIMDE_FLOAT32_C( -661.00), SIMDE_FLOAT32_C( 157.00), SIMDE_FLOAT32_C( 397.00) } },
{ { SIMDE_FLOAT32_C( 382.01), SIMDE_FLOAT32_C( 656.47), SIMDE_FLOAT32_C( -361.06), SIMDE_FLOAT32_C( -343.68) },
{ SIMDE_FLOAT32_C( 383.00), SIMDE_FLOAT32_C( 657.00), SIMDE_FLOAT32_C( -361.00), SIMDE_FLOAT32_C( -343.00) } },
{ { SIMDE_FLOAT32_C( -331.36), SIMDE_FLOAT32_C( 68.89), SIMDE_FLOAT32_C( 476.92), SIMDE_FLOAT32_C( -40.59) },
{ SIMDE_FLOAT32_C( -331.00), SIMDE_FLOAT32_C( 69.00), SIMDE_FLOAT32_C( 477.00), SIMDE_FLOAT32_C( -40.00) } },
{ { SIMDE_FLOAT32_C( 390.65), SIMDE_FLOAT32_C( -570.02), SIMDE_FLOAT32_C( -935.28), SIMDE_FLOAT32_C( 672.43) },
{ SIMDE_FLOAT32_C( 391.00), SIMDE_FLOAT32_C( -570.00), SIMDE_FLOAT32_C( -935.00), SIMDE_FLOAT32_C( 673.00) } },
{ { SIMDE_FLOAT32_C( 681.18), SIMDE_FLOAT32_C( -100.50), SIMDE_FLOAT32_C( 206.11), SIMDE_FLOAT32_C( 943.93) },
{ SIMDE_FLOAT32_C( 682.00), SIMDE_FLOAT32_C( -100.00), SIMDE_FLOAT32_C( 207.00), SIMDE_FLOAT32_C( 944.00) } },
{ { SIMDE_FLOAT32_C( 786.98), SIMDE_FLOAT32_C( -51.78), SIMDE_FLOAT32_C( -481.30), SIMDE_FLOAT32_C( 955.46) },
{ SIMDE_FLOAT32_C( 787.00), SIMDE_FLOAT32_C( -51.00), SIMDE_FLOAT32_C( -481.00), SIMDE_FLOAT32_C( 956.00) } },
{ { SIMDE_FLOAT32_C( -832.82), SIMDE_FLOAT32_C( 115.81), SIMDE_FLOAT32_C( -954.30), SIMDE_FLOAT32_C( -2.48) },
{ SIMDE_FLOAT32_C( -832.00), SIMDE_FLOAT32_C( 116.00), SIMDE_FLOAT32_C( -954.00), SIMDE_FLOAT32_C( -2.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_svml_ceil_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_svml_ceil_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 674.99), SIMDE_FLOAT64_C( 114.55) },
{ SIMDE_FLOAT64_C( 675.00), SIMDE_FLOAT64_C( 115.00) } },
{ { SIMDE_FLOAT64_C( 69.63), SIMDE_FLOAT64_C( -469.97) },
{ SIMDE_FLOAT64_C( 70.00), SIMDE_FLOAT64_C( -469.00) } },
{ { SIMDE_FLOAT64_C( 28.21), SIMDE_FLOAT64_C( 212.97) },
{ SIMDE_FLOAT64_C( 29.00), SIMDE_FLOAT64_C( 213.00) } },
{ { SIMDE_FLOAT64_C( 763.99), SIMDE_FLOAT64_C( -272.25) },
{ SIMDE_FLOAT64_C( 764.00), SIMDE_FLOAT64_C( -272.00) } },
{ { SIMDE_FLOAT64_C( -938.61), SIMDE_FLOAT64_C( 282.65) },
{ SIMDE_FLOAT64_C( -938.00), SIMDE_FLOAT64_C( 283.00) } },
{ { SIMDE_FLOAT64_C( -881.63), SIMDE_FLOAT64_C( 347.00) },
{ SIMDE_FLOAT64_C( -881.00), SIMDE_FLOAT64_C( 347.00) } },
{ { SIMDE_FLOAT64_C( 95.36), SIMDE_FLOAT64_C( -9.46) },
{ SIMDE_FLOAT64_C( 96.00), SIMDE_FLOAT64_C( -9.00) } },
{ { SIMDE_FLOAT64_C( -56.68), SIMDE_FLOAT64_C( 444.40) },
{ SIMDE_FLOAT64_C( -56.00), SIMDE_FLOAT64_C( 445.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_svml_ceil_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_svml_ceil_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -76.72), SIMDE_FLOAT32_C( -639.26), SIMDE_FLOAT32_C( 440.96), SIMDE_FLOAT32_C( -729.70),
SIMDE_FLOAT32_C( 846.93), SIMDE_FLOAT32_C( 5.62), SIMDE_FLOAT32_C( -834.54), SIMDE_FLOAT32_C( -216.99) },
{ SIMDE_FLOAT32_C( -76.00), SIMDE_FLOAT32_C( -639.00), SIMDE_FLOAT32_C( 441.00), SIMDE_FLOAT32_C( -729.00),
SIMDE_FLOAT32_C( 847.00), SIMDE_FLOAT32_C( 6.00), SIMDE_FLOAT32_C( -834.00), SIMDE_FLOAT32_C( -216.00) } },
{ { SIMDE_FLOAT32_C( -602.71), SIMDE_FLOAT32_C( -551.43), SIMDE_FLOAT32_C( 949.68), SIMDE_FLOAT32_C( -637.56),
SIMDE_FLOAT32_C( -279.53), SIMDE_FLOAT32_C( 553.99), SIMDE_FLOAT32_C( -582.80), SIMDE_FLOAT32_C( 265.64) },
{ SIMDE_FLOAT32_C( -602.00), SIMDE_FLOAT32_C( -551.00), SIMDE_FLOAT32_C( 950.00), SIMDE_FLOAT32_C( -637.00),
SIMDE_FLOAT32_C( -279.00), SIMDE_FLOAT32_C( 554.00), SIMDE_FLOAT32_C( -582.00), SIMDE_FLOAT32_C( 266.00) } },
{ { SIMDE_FLOAT32_C( 457.99), SIMDE_FLOAT32_C( 385.92), SIMDE_FLOAT32_C( 814.23), SIMDE_FLOAT32_C( -511.82),
SIMDE_FLOAT32_C( -834.29), SIMDE_FLOAT32_C( 45.52), SIMDE_FLOAT32_C( 999.48), SIMDE_FLOAT32_C( -489.95) },
{ SIMDE_FLOAT32_C( 458.00), SIMDE_FLOAT32_C( 386.00), SIMDE_FLOAT32_C( 815.00), SIMDE_FLOAT32_C( -511.00),
SIMDE_FLOAT32_C( -834.00), SIMDE_FLOAT32_C( 46.00), SIMDE_FLOAT32_C( 1000.00), SIMDE_FLOAT32_C( -489.00) } },
{ { SIMDE_FLOAT32_C( 499.94), SIMDE_FLOAT32_C( 847.57), SIMDE_FLOAT32_C( 656.49), SIMDE_FLOAT32_C( 169.03),
SIMDE_FLOAT32_C( -361.51), SIMDE_FLOAT32_C( 697.36), SIMDE_FLOAT32_C( -537.79), SIMDE_FLOAT32_C( 561.78) },
{ SIMDE_FLOAT32_C( 500.00), SIMDE_FLOAT32_C( 848.00), SIMDE_FLOAT32_C( 657.00), SIMDE_FLOAT32_C( 170.00),
SIMDE_FLOAT32_C( -361.00), SIMDE_FLOAT32_C( 698.00), SIMDE_FLOAT32_C( -537.00), SIMDE_FLOAT32_C( 562.00) } },
{ { SIMDE_FLOAT32_C( -941.90), SIMDE_FLOAT32_C( 903.17), SIMDE_FLOAT32_C( 832.08), SIMDE_FLOAT32_C( 905.03),
SIMDE_FLOAT32_C( -91.21), SIMDE_FLOAT32_C( 997.54), SIMDE_FLOAT32_C( -311.96), SIMDE_FLOAT32_C( 306.08) },
{ SIMDE_FLOAT32_C( -941.00), SIMDE_FLOAT32_C( 904.00), SIMDE_FLOAT32_C( 833.00), SIMDE_FLOAT32_C( 906.00),
SIMDE_FLOAT32_C( -91.00), SIMDE_FLOAT32_C( 998.00), SIMDE_FLOAT32_C( -311.00), SIMDE_FLOAT32_C( 307.00) } },
{ { SIMDE_FLOAT32_C( -553.88), SIMDE_FLOAT32_C( -362.28), SIMDE_FLOAT32_C( 668.53), SIMDE_FLOAT32_C( 166.59),
SIMDE_FLOAT32_C( -808.29), SIMDE_FLOAT32_C( -914.27), SIMDE_FLOAT32_C( -567.77), SIMDE_FLOAT32_C( 649.70) },
{ SIMDE_FLOAT32_C( -553.00), SIMDE_FLOAT32_C( -362.00), SIMDE_FLOAT32_C( 669.00), SIMDE_FLOAT32_C( 167.00),
SIMDE_FLOAT32_C( -808.00), SIMDE_FLOAT32_C( -914.00), SIMDE_FLOAT32_C( -567.00), SIMDE_FLOAT32_C( 650.00) } },
{ { SIMDE_FLOAT32_C( 471.65), SIMDE_FLOAT32_C( -753.54), SIMDE_FLOAT32_C( -862.12), SIMDE_FLOAT32_C( 637.36),
SIMDE_FLOAT32_C( 291.98), SIMDE_FLOAT32_C( -862.64), SIMDE_FLOAT32_C( -852.59), SIMDE_FLOAT32_C( -208.07) },
{ SIMDE_FLOAT32_C( 472.00), SIMDE_FLOAT32_C( -753.00), SIMDE_FLOAT32_C( -862.00), SIMDE_FLOAT32_C( 638.00),
SIMDE_FLOAT32_C( 292.00), SIMDE_FLOAT32_C( -862.00), SIMDE_FLOAT32_C( -852.00), SIMDE_FLOAT32_C( -208.00) } },
{ { SIMDE_FLOAT32_C( 984.93), SIMDE_FLOAT32_C( 803.90), SIMDE_FLOAT32_C( 960.96), SIMDE_FLOAT32_C( -376.58),
SIMDE_FLOAT32_C( 501.26), SIMDE_FLOAT32_C( -576.83), SIMDE_FLOAT32_C( -814.80), SIMDE_FLOAT32_C( 559.36) },
{ SIMDE_FLOAT32_C( 985.00), SIMDE_FLOAT32_C( 804.00), SIMDE_FLOAT32_C( 961.00), SIMDE_FLOAT32_C( -376.00),
SIMDE_FLOAT32_C( 502.00), SIMDE_FLOAT32_C( -576.00), SIMDE_FLOAT32_C( -814.00), SIMDE_FLOAT32_C( 560.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_svml_ceil_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_svml_ceil_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -362.72), SIMDE_FLOAT64_C( -517.27), SIMDE_FLOAT64_C( -680.39), SIMDE_FLOAT64_C( -370.55) },
{ SIMDE_FLOAT64_C( -362.00), SIMDE_FLOAT64_C( -517.00), SIMDE_FLOAT64_C( -680.00), SIMDE_FLOAT64_C( -370.00) } },
{ { SIMDE_FLOAT64_C( -614.98), SIMDE_FLOAT64_C( 499.96), SIMDE_FLOAT64_C( -673.46), SIMDE_FLOAT64_C( 813.10) },
{ SIMDE_FLOAT64_C( -614.00), SIMDE_FLOAT64_C( 500.00), SIMDE_FLOAT64_C( -673.00), SIMDE_FLOAT64_C( 814.00) } },
{ { SIMDE_FLOAT64_C( -134.44), SIMDE_FLOAT64_C( 719.80), SIMDE_FLOAT64_C( -164.15), SIMDE_FLOAT64_C( -617.21) },
{ SIMDE_FLOAT64_C( -134.00), SIMDE_FLOAT64_C( 720.00), SIMDE_FLOAT64_C( -164.00), SIMDE_FLOAT64_C( -617.00) } },
{ { SIMDE_FLOAT64_C( -500.24), SIMDE_FLOAT64_C( 381.09), SIMDE_FLOAT64_C( 264.50), SIMDE_FLOAT64_C( 668.11) },
{ SIMDE_FLOAT64_C( -500.00), SIMDE_FLOAT64_C( 382.00), SIMDE_FLOAT64_C( 265.00), SIMDE_FLOAT64_C( 669.00) } },
{ { SIMDE_FLOAT64_C( 934.75), SIMDE_FLOAT64_C( -779.04), SIMDE_FLOAT64_C( 549.14), SIMDE_FLOAT64_C( -476.20) },
{ SIMDE_FLOAT64_C( 935.00), SIMDE_FLOAT64_C( -779.00), SIMDE_FLOAT64_C( 550.00), SIMDE_FLOAT64_C( -476.00) } },
{ { SIMDE_FLOAT64_C( -15.07), SIMDE_FLOAT64_C( 858.66), SIMDE_FLOAT64_C( -174.63), SIMDE_FLOAT64_C( -609.29) },
{ SIMDE_FLOAT64_C( -15.00), SIMDE_FLOAT64_C( 859.00), SIMDE_FLOAT64_C( -174.00), SIMDE_FLOAT64_C( -609.00) } },
{ { SIMDE_FLOAT64_C( -71.58), SIMDE_FLOAT64_C( 432.38), SIMDE_FLOAT64_C( -26.35), SIMDE_FLOAT64_C( -67.29) },
{ SIMDE_FLOAT64_C( -71.00), SIMDE_FLOAT64_C( 433.00), SIMDE_FLOAT64_C( -26.00), SIMDE_FLOAT64_C( -67.00) } },
{ { SIMDE_FLOAT64_C( 708.92), SIMDE_FLOAT64_C( 346.09), SIMDE_FLOAT64_C( -697.36), SIMDE_FLOAT64_C( -653.80) },
{ SIMDE_FLOAT64_C( 709.00), SIMDE_FLOAT64_C( 347.00), SIMDE_FLOAT64_C( -697.00), SIMDE_FLOAT64_C( -653.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_svml_ceil_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_ceil_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -212.12), SIMDE_FLOAT32_C( -438.18), SIMDE_FLOAT32_C( 403.70), SIMDE_FLOAT32_C( 369.30),
SIMDE_FLOAT32_C( 75.33), SIMDE_FLOAT32_C( 898.48), SIMDE_FLOAT32_C( 1.19), SIMDE_FLOAT32_C( -480.16),
SIMDE_FLOAT32_C( -450.03), SIMDE_FLOAT32_C( -382.53), SIMDE_FLOAT32_C( 364.23), SIMDE_FLOAT32_C( 496.15),
SIMDE_FLOAT32_C( 778.39), SIMDE_FLOAT32_C( -311.07), SIMDE_FLOAT32_C( 656.92), SIMDE_FLOAT32_C( -16.90) },
{ SIMDE_FLOAT32_C( -212.00), SIMDE_FLOAT32_C( -438.00), SIMDE_FLOAT32_C( 404.00), SIMDE_FLOAT32_C( 370.00),
SIMDE_FLOAT32_C( 76.00), SIMDE_FLOAT32_C( 899.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( -480.00),
SIMDE_FLOAT32_C( -450.00), SIMDE_FLOAT32_C( -382.00), SIMDE_FLOAT32_C( 365.00), SIMDE_FLOAT32_C( 497.00),
SIMDE_FLOAT32_C( 779.00), SIMDE_FLOAT32_C( -311.00), SIMDE_FLOAT32_C( 657.00), SIMDE_FLOAT32_C( -16.00) } },
{ { SIMDE_FLOAT32_C( -112.72), SIMDE_FLOAT32_C( -813.31), SIMDE_FLOAT32_C( 470.40), SIMDE_FLOAT32_C( -748.73),
SIMDE_FLOAT32_C( -795.37), SIMDE_FLOAT32_C( -65.01), SIMDE_FLOAT32_C( 904.80), SIMDE_FLOAT32_C( -706.59),
SIMDE_FLOAT32_C( 54.57), SIMDE_FLOAT32_C( -248.19), SIMDE_FLOAT32_C( -352.77), SIMDE_FLOAT32_C( 334.66),
SIMDE_FLOAT32_C( 568.34), SIMDE_FLOAT32_C( 976.72), SIMDE_FLOAT32_C( 104.61), SIMDE_FLOAT32_C( -643.78) },
{ SIMDE_FLOAT32_C( -112.00), SIMDE_FLOAT32_C( -813.00), SIMDE_FLOAT32_C( 471.00), SIMDE_FLOAT32_C( -748.00),
SIMDE_FLOAT32_C( -795.00), SIMDE_FLOAT32_C( -65.00), SIMDE_FLOAT32_C( 905.00), SIMDE_FLOAT32_C( -706.00),
SIMDE_FLOAT32_C( 55.00), SIMDE_FLOAT32_C( -248.00), SIMDE_FLOAT32_C( -352.00), SIMDE_FLOAT32_C( 335.00),
SIMDE_FLOAT32_C( 569.00), SIMDE_FLOAT32_C( 977.00), SIMDE_FLOAT32_C( 105.00), SIMDE_FLOAT32_C( -643.00) } },
{ { SIMDE_FLOAT32_C( -461.46), SIMDE_FLOAT32_C( -491.69), SIMDE_FLOAT32_C( 725.52), SIMDE_FLOAT32_C( 613.87),
SIMDE_FLOAT32_C( -593.21), SIMDE_FLOAT32_C( -273.28), SIMDE_FLOAT32_C( -866.30), SIMDE_FLOAT32_C( -43.24),
SIMDE_FLOAT32_C( 344.18), SIMDE_FLOAT32_C( 497.93), SIMDE_FLOAT32_C( -547.09), SIMDE_FLOAT32_C( 122.57),
SIMDE_FLOAT32_C( -813.14), SIMDE_FLOAT32_C( -890.17), SIMDE_FLOAT32_C( -894.33), SIMDE_FLOAT32_C( 74.15) },
{ SIMDE_FLOAT32_C( -461.00), SIMDE_FLOAT32_C( -491.00), SIMDE_FLOAT32_C( 726.00), SIMDE_FLOAT32_C( 614.00),
SIMDE_FLOAT32_C( -593.00), SIMDE_FLOAT32_C( -273.00), SIMDE_FLOAT32_C( -866.00), SIMDE_FLOAT32_C( -43.00),
SIMDE_FLOAT32_C( 345.00), SIMDE_FLOAT32_C( 498.00), SIMDE_FLOAT32_C( -547.00), SIMDE_FLOAT32_C( 123.00),
SIMDE_FLOAT32_C( -813.00), SIMDE_FLOAT32_C( -890.00), SIMDE_FLOAT32_C( -894.00), SIMDE_FLOAT32_C( 75.00) } },
{ { SIMDE_FLOAT32_C( -703.48), SIMDE_FLOAT32_C( 576.07), SIMDE_FLOAT32_C( 325.42), SIMDE_FLOAT32_C( -498.84),
SIMDE_FLOAT32_C( -488.94), SIMDE_FLOAT32_C( 230.22), SIMDE_FLOAT32_C( -205.43), SIMDE_FLOAT32_C( 565.63),
SIMDE_FLOAT32_C( 982.03), SIMDE_FLOAT32_C( 441.80), SIMDE_FLOAT32_C( -99.71), SIMDE_FLOAT32_C( 550.37),
SIMDE_FLOAT32_C( 418.51), SIMDE_FLOAT32_C( -995.10), SIMDE_FLOAT32_C( 906.59), SIMDE_FLOAT32_C( 957.05) },
{ SIMDE_FLOAT32_C( -703.00), SIMDE_FLOAT32_C( 577.00), SIMDE_FLOAT32_C( 326.00), SIMDE_FLOAT32_C( -498.00),
SIMDE_FLOAT32_C( -488.00), SIMDE_FLOAT32_C( 231.00), SIMDE_FLOAT32_C( -205.00), SIMDE_FLOAT32_C( 566.00),
SIMDE_FLOAT32_C( 983.00), SIMDE_FLOAT32_C( 442.00), SIMDE_FLOAT32_C( -99.00), SIMDE_FLOAT32_C( 551.00),
SIMDE_FLOAT32_C( 419.00), SIMDE_FLOAT32_C( -995.00), SIMDE_FLOAT32_C( 907.00), SIMDE_FLOAT32_C( 958.00) } },
{ { SIMDE_FLOAT32_C( -486.79), SIMDE_FLOAT32_C( 632.11), SIMDE_FLOAT32_C( 570.92), SIMDE_FLOAT32_C( -80.00),
SIMDE_FLOAT32_C( -641.18), SIMDE_FLOAT32_C( 704.62), SIMDE_FLOAT32_C( 876.76), SIMDE_FLOAT32_C( 703.01),
SIMDE_FLOAT32_C( 202.55), SIMDE_FLOAT32_C( -670.32), SIMDE_FLOAT32_C( -174.43), SIMDE_FLOAT32_C( 389.41),
SIMDE_FLOAT32_C( -560.49), SIMDE_FLOAT32_C( -68.76), SIMDE_FLOAT32_C( -536.44), SIMDE_FLOAT32_C( -263.97) },
{ SIMDE_FLOAT32_C( -486.00), SIMDE_FLOAT32_C( 633.00), SIMDE_FLOAT32_C( 571.00), SIMDE_FLOAT32_C( -80.00),
SIMDE_FLOAT32_C( -641.00), SIMDE_FLOAT32_C( 705.00), SIMDE_FLOAT32_C( 877.00), SIMDE_FLOAT32_C( 704.00),
SIMDE_FLOAT32_C( 203.00), SIMDE_FLOAT32_C( -670.00), SIMDE_FLOAT32_C( -174.00), SIMDE_FLOAT32_C( 390.00),
SIMDE_FLOAT32_C( -560.00), SIMDE_FLOAT32_C( -68.00), SIMDE_FLOAT32_C( -536.00), SIMDE_FLOAT32_C( -263.00) } },
{ { SIMDE_FLOAT32_C( -492.69), SIMDE_FLOAT32_C( 788.98), SIMDE_FLOAT32_C( 237.19), SIMDE_FLOAT32_C( 18.37),
SIMDE_FLOAT32_C( 19.20), SIMDE_FLOAT32_C( -968.24), SIMDE_FLOAT32_C( -416.00), SIMDE_FLOAT32_C( 1.23),
SIMDE_FLOAT32_C( 473.56), SIMDE_FLOAT32_C( 484.29), SIMDE_FLOAT32_C( -448.40), SIMDE_FLOAT32_C( -107.93),
SIMDE_FLOAT32_C( 489.18), SIMDE_FLOAT32_C( -541.82), SIMDE_FLOAT32_C( -150.87), SIMDE_FLOAT32_C( -997.61) },
{ SIMDE_FLOAT32_C( -492.00), SIMDE_FLOAT32_C( 789.00), SIMDE_FLOAT32_C( 238.00), SIMDE_FLOAT32_C( 19.00),
SIMDE_FLOAT32_C( 20.00), SIMDE_FLOAT32_C( -968.00), SIMDE_FLOAT32_C( -416.00), SIMDE_FLOAT32_C( 2.00),
SIMDE_FLOAT32_C( 474.00), SIMDE_FLOAT32_C( 485.00), SIMDE_FLOAT32_C( -448.00), SIMDE_FLOAT32_C( -107.00),
SIMDE_FLOAT32_C( 490.00), SIMDE_FLOAT32_C( -541.00), SIMDE_FLOAT32_C( -150.00), SIMDE_FLOAT32_C( -997.00) } },
{ { SIMDE_FLOAT32_C( -909.71), SIMDE_FLOAT32_C( -579.96), SIMDE_FLOAT32_C( -77.61), SIMDE_FLOAT32_C( -550.89),
SIMDE_FLOAT32_C( -875.34), SIMDE_FLOAT32_C( -200.84), SIMDE_FLOAT32_C( -847.88), SIMDE_FLOAT32_C( 327.21),
SIMDE_FLOAT32_C( 128.83), SIMDE_FLOAT32_C( -22.31), SIMDE_FLOAT32_C( -283.37), SIMDE_FLOAT32_C( 568.34),
SIMDE_FLOAT32_C( 908.94), SIMDE_FLOAT32_C( 180.19), SIMDE_FLOAT32_C( -695.63), SIMDE_FLOAT32_C( -583.75) },
{ SIMDE_FLOAT32_C( -909.00), SIMDE_FLOAT32_C( -579.00), SIMDE_FLOAT32_C( -77.00), SIMDE_FLOAT32_C( -550.00),
SIMDE_FLOAT32_C( -875.00), SIMDE_FLOAT32_C( -200.00), SIMDE_FLOAT32_C( -847.00), SIMDE_FLOAT32_C( 328.00),
SIMDE_FLOAT32_C( 129.00), SIMDE_FLOAT32_C( -22.00), SIMDE_FLOAT32_C( -283.00), SIMDE_FLOAT32_C( 569.00),
SIMDE_FLOAT32_C( 909.00), SIMDE_FLOAT32_C( 181.00), SIMDE_FLOAT32_C( -695.00), SIMDE_FLOAT32_C( -583.00) } },
{ { SIMDE_FLOAT32_C( -30.83), SIMDE_FLOAT32_C( 541.56), SIMDE_FLOAT32_C( 434.62), SIMDE_FLOAT32_C( 988.37),
SIMDE_FLOAT32_C( 573.33), SIMDE_FLOAT32_C( -981.38), SIMDE_FLOAT32_C( -10.40), SIMDE_FLOAT32_C( 46.89),
SIMDE_FLOAT32_C( 502.90), SIMDE_FLOAT32_C( 541.19), SIMDE_FLOAT32_C( 938.96), SIMDE_FLOAT32_C( -7.91),
SIMDE_FLOAT32_C( 999.37), SIMDE_FLOAT32_C( -211.91), SIMDE_FLOAT32_C( -5.52), SIMDE_FLOAT32_C( -910.34) },
{ SIMDE_FLOAT32_C( -30.00), SIMDE_FLOAT32_C( 542.00), SIMDE_FLOAT32_C( 435.00), SIMDE_FLOAT32_C( 989.00),
SIMDE_FLOAT32_C( 574.00), SIMDE_FLOAT32_C( -981.00), SIMDE_FLOAT32_C( -10.00), SIMDE_FLOAT32_C( 47.00),
SIMDE_FLOAT32_C( 503.00), SIMDE_FLOAT32_C( 542.00), SIMDE_FLOAT32_C( 939.00), SIMDE_FLOAT32_C( -7.00),
SIMDE_FLOAT32_C( 1000.00), SIMDE_FLOAT32_C( -211.00), SIMDE_FLOAT32_C( -5.00), SIMDE_FLOAT32_C( -910.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_ceil_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_ceil_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 264.66), SIMDE_FLOAT32_C( 621.32), SIMDE_FLOAT32_C( -827.32), SIMDE_FLOAT32_C( -154.51),
SIMDE_FLOAT32_C( 337.38), SIMDE_FLOAT32_C( 187.19), SIMDE_FLOAT32_C( 659.53), SIMDE_FLOAT32_C( 559.33),
SIMDE_FLOAT32_C( 209.98), SIMDE_FLOAT32_C( 625.49), SIMDE_FLOAT32_C( 656.87), SIMDE_FLOAT32_C( -793.87),
SIMDE_FLOAT32_C( 746.37), SIMDE_FLOAT32_C( -721.16), SIMDE_FLOAT32_C( 184.21), SIMDE_FLOAT32_C( 251.36) },
UINT8_C(157),
{ SIMDE_FLOAT32_C( 769.49), SIMDE_FLOAT32_C( -152.19), SIMDE_FLOAT32_C( 746.20), SIMDE_FLOAT32_C( -444.46),
SIMDE_FLOAT32_C( -336.10), SIMDE_FLOAT32_C( -772.83), SIMDE_FLOAT32_C( 887.52), SIMDE_FLOAT32_C( 966.03),
SIMDE_FLOAT32_C( 490.22), SIMDE_FLOAT32_C( -510.29), SIMDE_FLOAT32_C( -30.50), SIMDE_FLOAT32_C( 1.38),
SIMDE_FLOAT32_C( -217.82), SIMDE_FLOAT32_C( 12.97), SIMDE_FLOAT32_C( -733.96), SIMDE_FLOAT32_C( -596.50) },
{ SIMDE_FLOAT32_C( 770.00), SIMDE_FLOAT32_C( 621.32), SIMDE_FLOAT32_C( 747.00), SIMDE_FLOAT32_C( -444.00),
SIMDE_FLOAT32_C( -336.00), SIMDE_FLOAT32_C( 187.19), SIMDE_FLOAT32_C( 659.53), SIMDE_FLOAT32_C( 967.00),
SIMDE_FLOAT32_C( 209.98), SIMDE_FLOAT32_C( 625.49), SIMDE_FLOAT32_C( 656.87), SIMDE_FLOAT32_C( -793.87),
SIMDE_FLOAT32_C( 746.37), SIMDE_FLOAT32_C( -721.16), SIMDE_FLOAT32_C( 184.21), SIMDE_FLOAT32_C( 251.36) } },
{ { SIMDE_FLOAT32_C( 185.65), SIMDE_FLOAT32_C( 111.53), SIMDE_FLOAT32_C( 740.88), SIMDE_FLOAT32_C( -627.16),
SIMDE_FLOAT32_C( -228.94), SIMDE_FLOAT32_C( 300.20), SIMDE_FLOAT32_C( 582.82), SIMDE_FLOAT32_C( -603.45),
SIMDE_FLOAT32_C( -42.93), SIMDE_FLOAT32_C( 788.96), SIMDE_FLOAT32_C( -857.08), SIMDE_FLOAT32_C( 235.91),
SIMDE_FLOAT32_C( -26.83), SIMDE_FLOAT32_C( 394.28), SIMDE_FLOAT32_C( 795.93), SIMDE_FLOAT32_C( -257.35) },
UINT8_C(195),
{ SIMDE_FLOAT32_C( 542.13), SIMDE_FLOAT32_C( 298.19), SIMDE_FLOAT32_C( -94.01), SIMDE_FLOAT32_C( 769.30),
SIMDE_FLOAT32_C( 185.71), SIMDE_FLOAT32_C( -127.98), SIMDE_FLOAT32_C( 259.52), SIMDE_FLOAT32_C( 675.42),
SIMDE_FLOAT32_C( 841.52), SIMDE_FLOAT32_C( -739.10), SIMDE_FLOAT32_C( -542.40), SIMDE_FLOAT32_C( -145.50),
SIMDE_FLOAT32_C( -473.06), SIMDE_FLOAT32_C( -138.90), SIMDE_FLOAT32_C( -959.85), SIMDE_FLOAT32_C( 638.47) },
{ SIMDE_FLOAT32_C( 543.00), SIMDE_FLOAT32_C( 299.00), SIMDE_FLOAT32_C( 740.88), SIMDE_FLOAT32_C( -627.16),
SIMDE_FLOAT32_C( -228.94), SIMDE_FLOAT32_C( 300.20), SIMDE_FLOAT32_C( 260.00), SIMDE_FLOAT32_C( 676.00),
SIMDE_FLOAT32_C( -42.93), SIMDE_FLOAT32_C( 788.96), SIMDE_FLOAT32_C( -857.08), SIMDE_FLOAT32_C( 235.91),
SIMDE_FLOAT32_C( -26.83), SIMDE_FLOAT32_C( 394.28), SIMDE_FLOAT32_C( 795.93), SIMDE_FLOAT32_C( -257.35) } },
{ { SIMDE_FLOAT32_C( -398.03), SIMDE_FLOAT32_C( -587.00), SIMDE_FLOAT32_C( -590.48), SIMDE_FLOAT32_C( 902.17),
SIMDE_FLOAT32_C( 995.82), SIMDE_FLOAT32_C( -193.93), SIMDE_FLOAT32_C( -140.76), SIMDE_FLOAT32_C( 784.78),
SIMDE_FLOAT32_C( -51.01), SIMDE_FLOAT32_C( -904.84), SIMDE_FLOAT32_C( -242.06), SIMDE_FLOAT32_C( -656.73),
SIMDE_FLOAT32_C( 891.09), SIMDE_FLOAT32_C( 500.60), SIMDE_FLOAT32_C( -414.64), SIMDE_FLOAT32_C( 433.21) },
UINT8_C(211),
{ SIMDE_FLOAT32_C( 491.34), SIMDE_FLOAT32_C( 202.51), SIMDE_FLOAT32_C( 984.50), SIMDE_FLOAT32_C( -636.64),
SIMDE_FLOAT32_C( -537.96), SIMDE_FLOAT32_C( 659.92), SIMDE_FLOAT32_C( -795.12), SIMDE_FLOAT32_C( -277.06),
SIMDE_FLOAT32_C( -882.48), SIMDE_FLOAT32_C( 59.38), SIMDE_FLOAT32_C( 249.88), SIMDE_FLOAT32_C( -21.39),
SIMDE_FLOAT32_C( 99.53), SIMDE_FLOAT32_C( -111.65), SIMDE_FLOAT32_C( 580.58), SIMDE_FLOAT32_C( 512.52) },
{ SIMDE_FLOAT32_C( 492.00), SIMDE_FLOAT32_C( 203.00), SIMDE_FLOAT32_C( -590.48), SIMDE_FLOAT32_C( 902.17),
SIMDE_FLOAT32_C( -537.00), SIMDE_FLOAT32_C( -193.93), SIMDE_FLOAT32_C( -795.00), SIMDE_FLOAT32_C( -277.00),
SIMDE_FLOAT32_C( -51.01), SIMDE_FLOAT32_C( -904.84), SIMDE_FLOAT32_C( -242.06), SIMDE_FLOAT32_C( -656.73),
SIMDE_FLOAT32_C( 891.09), SIMDE_FLOAT32_C( 500.60), SIMDE_FLOAT32_C( -414.64), SIMDE_FLOAT32_C( 433.21) } },
{ { SIMDE_FLOAT32_C( 297.87), SIMDE_FLOAT32_C( 482.76), SIMDE_FLOAT32_C( 508.34), SIMDE_FLOAT32_C( -896.06),
SIMDE_FLOAT32_C( -658.00), SIMDE_FLOAT32_C( 293.12), SIMDE_FLOAT32_C( 52.94), SIMDE_FLOAT32_C( -562.84),
SIMDE_FLOAT32_C( -948.94), SIMDE_FLOAT32_C( 396.21), SIMDE_FLOAT32_C( -671.75), SIMDE_FLOAT32_C( 551.66),
SIMDE_FLOAT32_C( 981.56), SIMDE_FLOAT32_C( 761.46), SIMDE_FLOAT32_C( -649.56), SIMDE_FLOAT32_C( 472.90) },
UINT8_C(186),
{ SIMDE_FLOAT32_C( -665.06), SIMDE_FLOAT32_C( 836.26), SIMDE_FLOAT32_C( 426.01), SIMDE_FLOAT32_C( 994.86),
SIMDE_FLOAT32_C( -958.85), SIMDE_FLOAT32_C( -851.05), SIMDE_FLOAT32_C( -887.63), SIMDE_FLOAT32_C( 100.52),
SIMDE_FLOAT32_C( 398.83), SIMDE_FLOAT32_C( 90.99), SIMDE_FLOAT32_C( -799.95), SIMDE_FLOAT32_C( -712.82),
SIMDE_FLOAT32_C( -328.43), SIMDE_FLOAT32_C( 712.57), SIMDE_FLOAT32_C( 585.05), SIMDE_FLOAT32_C( -845.67) },
{ SIMDE_FLOAT32_C( 297.87), SIMDE_FLOAT32_C( 837.00), SIMDE_FLOAT32_C( 508.34), SIMDE_FLOAT32_C( 995.00),
SIMDE_FLOAT32_C( -958.00), SIMDE_FLOAT32_C( -851.00), SIMDE_FLOAT32_C( 52.94), SIMDE_FLOAT32_C( 101.00),
SIMDE_FLOAT32_C( -948.94), SIMDE_FLOAT32_C( 396.21), SIMDE_FLOAT32_C( -671.75), SIMDE_FLOAT32_C( 551.66),
SIMDE_FLOAT32_C( 981.56), SIMDE_FLOAT32_C( 761.46), SIMDE_FLOAT32_C( -649.56), SIMDE_FLOAT32_C( 472.90) } },
{ { SIMDE_FLOAT32_C( 220.91), SIMDE_FLOAT32_C( 688.99), SIMDE_FLOAT32_C( -503.67), SIMDE_FLOAT32_C( -485.97),
SIMDE_FLOAT32_C( -258.07), SIMDE_FLOAT32_C( -66.51), SIMDE_FLOAT32_C( -434.91), SIMDE_FLOAT32_C( -861.87),
SIMDE_FLOAT32_C( 261.74), SIMDE_FLOAT32_C( -883.26), SIMDE_FLOAT32_C( -880.31), SIMDE_FLOAT32_C( 23.19),
SIMDE_FLOAT32_C( -532.81), SIMDE_FLOAT32_C( 592.60), SIMDE_FLOAT32_C( 987.17), SIMDE_FLOAT32_C( -197.87) },
UINT8_C(171),
{ SIMDE_FLOAT32_C( 413.18), SIMDE_FLOAT32_C( -203.02), SIMDE_FLOAT32_C( 470.01), SIMDE_FLOAT32_C( 562.13),
SIMDE_FLOAT32_C( -90.64), SIMDE_FLOAT32_C( -429.47), SIMDE_FLOAT32_C( -39.04), SIMDE_FLOAT32_C( -999.66),
SIMDE_FLOAT32_C( -229.42), SIMDE_FLOAT32_C( 248.13), SIMDE_FLOAT32_C( -328.09), SIMDE_FLOAT32_C( -516.85),
SIMDE_FLOAT32_C( -166.82), SIMDE_FLOAT32_C( -173.76), SIMDE_FLOAT32_C( 704.07), SIMDE_FLOAT32_C( -477.83) },
{ SIMDE_FLOAT32_C( 414.00), SIMDE_FLOAT32_C( -203.00), SIMDE_FLOAT32_C( -503.67), SIMDE_FLOAT32_C( 563.00),
SIMDE_FLOAT32_C( -258.07), SIMDE_FLOAT32_C( -429.00), SIMDE_FLOAT32_C( -434.91), SIMDE_FLOAT32_C( -999.00),
SIMDE_FLOAT32_C( 261.74), SIMDE_FLOAT32_C( -883.26), SIMDE_FLOAT32_C( -880.31), SIMDE_FLOAT32_C( 23.19),
SIMDE_FLOAT32_C( -532.81), SIMDE_FLOAT32_C( 592.60), SIMDE_FLOAT32_C( 987.17), SIMDE_FLOAT32_C( -197.87) } },
{ { SIMDE_FLOAT32_C( 322.58), SIMDE_FLOAT32_C( -781.90), SIMDE_FLOAT32_C( 264.10), SIMDE_FLOAT32_C( -743.93),
SIMDE_FLOAT32_C( -216.81), SIMDE_FLOAT32_C( 402.23), SIMDE_FLOAT32_C( 517.80), SIMDE_FLOAT32_C( -100.07),
SIMDE_FLOAT32_C( 521.92), SIMDE_FLOAT32_C( -459.00), SIMDE_FLOAT32_C( 367.12), SIMDE_FLOAT32_C( 114.52),
SIMDE_FLOAT32_C( -471.84), SIMDE_FLOAT32_C( -830.76), SIMDE_FLOAT32_C( -456.62), SIMDE_FLOAT32_C( 941.34) },
UINT8_C( 19),
{ SIMDE_FLOAT32_C( -986.61), SIMDE_FLOAT32_C( 503.47), SIMDE_FLOAT32_C( 875.58), SIMDE_FLOAT32_C( -416.08),
SIMDE_FLOAT32_C( -535.57), SIMDE_FLOAT32_C( 875.92), SIMDE_FLOAT32_C( 354.51), SIMDE_FLOAT32_C( 712.56),
SIMDE_FLOAT32_C( -452.16), SIMDE_FLOAT32_C( 837.66), SIMDE_FLOAT32_C( -454.26), SIMDE_FLOAT32_C( 374.08),
SIMDE_FLOAT32_C( 541.73), SIMDE_FLOAT32_C( 67.91), SIMDE_FLOAT32_C( -303.34), SIMDE_FLOAT32_C( 759.83) },
{ SIMDE_FLOAT32_C( -986.00), SIMDE_FLOAT32_C( 504.00), SIMDE_FLOAT32_C( 264.10), SIMDE_FLOAT32_C( -743.93),
SIMDE_FLOAT32_C( -535.00), SIMDE_FLOAT32_C( 402.23), SIMDE_FLOAT32_C( 517.80), SIMDE_FLOAT32_C( -100.07),
SIMDE_FLOAT32_C( 521.92), SIMDE_FLOAT32_C( -459.00), SIMDE_FLOAT32_C( 367.12), SIMDE_FLOAT32_C( 114.52),
SIMDE_FLOAT32_C( -471.84), SIMDE_FLOAT32_C( -830.76), SIMDE_FLOAT32_C( -456.62), SIMDE_FLOAT32_C( 941.34) } },
{ { SIMDE_FLOAT32_C( -668.00), SIMDE_FLOAT32_C( -47.28), SIMDE_FLOAT32_C( -456.99), SIMDE_FLOAT32_C( 734.23),
SIMDE_FLOAT32_C( -529.48), SIMDE_FLOAT32_C( 442.94), SIMDE_FLOAT32_C( 256.15), SIMDE_FLOAT32_C( 11.52),
SIMDE_FLOAT32_C( -189.94), SIMDE_FLOAT32_C( -629.33), SIMDE_FLOAT32_C( 539.68), SIMDE_FLOAT32_C( -20.70),
SIMDE_FLOAT32_C( -85.95), SIMDE_FLOAT32_C( 481.02), SIMDE_FLOAT32_C( 945.52), SIMDE_FLOAT32_C( -72.56) },
UINT8_C(158),
{ SIMDE_FLOAT32_C( 821.10), SIMDE_FLOAT32_C( 511.37), SIMDE_FLOAT32_C( 448.92), SIMDE_FLOAT32_C( 697.03),
SIMDE_FLOAT32_C( -134.12), SIMDE_FLOAT32_C( 161.48), SIMDE_FLOAT32_C( -755.14), SIMDE_FLOAT32_C( -296.46),
SIMDE_FLOAT32_C( 707.22), SIMDE_FLOAT32_C( 618.95), SIMDE_FLOAT32_C( -754.73), SIMDE_FLOAT32_C( -224.87),
SIMDE_FLOAT32_C( -684.40), SIMDE_FLOAT32_C( -994.91), SIMDE_FLOAT32_C( 107.14), SIMDE_FLOAT32_C( 268.32) },
{ SIMDE_FLOAT32_C( -668.00), SIMDE_FLOAT32_C( 512.00), SIMDE_FLOAT32_C( 449.00), SIMDE_FLOAT32_C( 698.00),
SIMDE_FLOAT32_C( -134.00), SIMDE_FLOAT32_C( 442.94), SIMDE_FLOAT32_C( 256.15), SIMDE_FLOAT32_C( -296.00),
SIMDE_FLOAT32_C( -189.94), SIMDE_FLOAT32_C( -629.33), SIMDE_FLOAT32_C( 539.68), SIMDE_FLOAT32_C( -20.70),
SIMDE_FLOAT32_C( -85.95), SIMDE_FLOAT32_C( 481.02), SIMDE_FLOAT32_C( 945.52), SIMDE_FLOAT32_C( -72.56) } },
{ { SIMDE_FLOAT32_C( -451.89), SIMDE_FLOAT32_C( -158.63), SIMDE_FLOAT32_C( 738.85), SIMDE_FLOAT32_C( 991.05),
SIMDE_FLOAT32_C( -902.48), SIMDE_FLOAT32_C( -249.63), SIMDE_FLOAT32_C( -198.89), SIMDE_FLOAT32_C( -531.81),
SIMDE_FLOAT32_C( -709.95), SIMDE_FLOAT32_C( 780.40), SIMDE_FLOAT32_C( 382.24), SIMDE_FLOAT32_C( 771.07),
SIMDE_FLOAT32_C( 725.93), SIMDE_FLOAT32_C( -690.31), SIMDE_FLOAT32_C( -244.43), SIMDE_FLOAT32_C( 547.03) },
UINT8_C(207),
{ SIMDE_FLOAT32_C( -795.51), SIMDE_FLOAT32_C( 244.06), SIMDE_FLOAT32_C( -313.07), SIMDE_FLOAT32_C( 365.97),
SIMDE_FLOAT32_C( 488.92), SIMDE_FLOAT32_C( 390.47), SIMDE_FLOAT32_C( 73.20), SIMDE_FLOAT32_C( 107.87),
SIMDE_FLOAT32_C( 635.73), SIMDE_FLOAT32_C( 848.33), SIMDE_FLOAT32_C( 423.47), SIMDE_FLOAT32_C( 640.83),
SIMDE_FLOAT32_C( -44.53), SIMDE_FLOAT32_C( -308.21), SIMDE_FLOAT32_C( -811.07), SIMDE_FLOAT32_C( 796.84) },
{ SIMDE_FLOAT32_C( -795.00), SIMDE_FLOAT32_C( 245.00), SIMDE_FLOAT32_C( -313.00), SIMDE_FLOAT32_C( 366.00),
SIMDE_FLOAT32_C( -902.48), SIMDE_FLOAT32_C( -249.63), SIMDE_FLOAT32_C( 74.00), SIMDE_FLOAT32_C( 108.00),
SIMDE_FLOAT32_C( -709.95), SIMDE_FLOAT32_C( 780.40), SIMDE_FLOAT32_C( 382.24), SIMDE_FLOAT32_C( 771.07),
SIMDE_FLOAT32_C( 725.93), SIMDE_FLOAT32_C( -690.31), SIMDE_FLOAT32_C( -244.43), SIMDE_FLOAT32_C( 547.03) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_ceil_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_ceil_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 568.62), SIMDE_FLOAT64_C( 832.90), SIMDE_FLOAT64_C( 451.04), SIMDE_FLOAT64_C( 205.98),
SIMDE_FLOAT64_C( 456.63), SIMDE_FLOAT64_C( 924.23), SIMDE_FLOAT64_C( -658.88), SIMDE_FLOAT64_C( -472.23) },
{ SIMDE_FLOAT64_C( 569.00), SIMDE_FLOAT64_C( 833.00), SIMDE_FLOAT64_C( 452.00), SIMDE_FLOAT64_C( 206.00),
SIMDE_FLOAT64_C( 457.00), SIMDE_FLOAT64_C( 925.00), SIMDE_FLOAT64_C( -658.00), SIMDE_FLOAT64_C( -472.00) } },
{ { SIMDE_FLOAT64_C( -579.06), SIMDE_FLOAT64_C( 724.10), SIMDE_FLOAT64_C( -922.32), SIMDE_FLOAT64_C( 603.12),
SIMDE_FLOAT64_C( -550.68), SIMDE_FLOAT64_C( -479.10), SIMDE_FLOAT64_C( -837.50), SIMDE_FLOAT64_C( 925.16) },
{ SIMDE_FLOAT64_C( -579.00), SIMDE_FLOAT64_C( 725.00), SIMDE_FLOAT64_C( -922.00), SIMDE_FLOAT64_C( 604.00),
SIMDE_FLOAT64_C( -550.00), SIMDE_FLOAT64_C( -479.00), SIMDE_FLOAT64_C( -837.00), SIMDE_FLOAT64_C( 926.00) } },
{ { SIMDE_FLOAT64_C( -415.08), SIMDE_FLOAT64_C( 718.97), SIMDE_FLOAT64_C( -850.54), SIMDE_FLOAT64_C( 464.10),
SIMDE_FLOAT64_C( 558.79), SIMDE_FLOAT64_C( 424.83), SIMDE_FLOAT64_C( -281.91), SIMDE_FLOAT64_C( 440.87) },
{ SIMDE_FLOAT64_C( -415.00), SIMDE_FLOAT64_C( 719.00), SIMDE_FLOAT64_C( -850.00), SIMDE_FLOAT64_C( 465.00),
SIMDE_FLOAT64_C( 559.00), SIMDE_FLOAT64_C( 425.00), SIMDE_FLOAT64_C( -281.00), SIMDE_FLOAT64_C( 441.00) } },
{ { SIMDE_FLOAT64_C( 834.86), SIMDE_FLOAT64_C( -787.94), SIMDE_FLOAT64_C( 560.68), SIMDE_FLOAT64_C( -896.06),
SIMDE_FLOAT64_C( -74.24), SIMDE_FLOAT64_C( 400.53), SIMDE_FLOAT64_C( -101.01), SIMDE_FLOAT64_C( -505.62) },
{ SIMDE_FLOAT64_C( 835.00), SIMDE_FLOAT64_C( -787.00), SIMDE_FLOAT64_C( 561.00), SIMDE_FLOAT64_C( -896.00),
SIMDE_FLOAT64_C( -74.00), SIMDE_FLOAT64_C( 401.00), SIMDE_FLOAT64_C( -101.00), SIMDE_FLOAT64_C( -505.00) } },
{ { SIMDE_FLOAT64_C( 233.43), SIMDE_FLOAT64_C( -649.98), SIMDE_FLOAT64_C( 700.36), SIMDE_FLOAT64_C( -309.94),
SIMDE_FLOAT64_C( -725.75), SIMDE_FLOAT64_C( -958.52), SIMDE_FLOAT64_C( 217.83), SIMDE_FLOAT64_C( -304.81) },
{ SIMDE_FLOAT64_C( 234.00), SIMDE_FLOAT64_C( -649.00), SIMDE_FLOAT64_C( 701.00), SIMDE_FLOAT64_C( -309.00),
SIMDE_FLOAT64_C( -725.00), SIMDE_FLOAT64_C( -958.00), SIMDE_FLOAT64_C( 218.00), SIMDE_FLOAT64_C( -304.00) } },
{ { SIMDE_FLOAT64_C( 765.58), SIMDE_FLOAT64_C( 295.51), SIMDE_FLOAT64_C( -701.69), SIMDE_FLOAT64_C( -785.11),
SIMDE_FLOAT64_C( 816.41), SIMDE_FLOAT64_C( -539.19), SIMDE_FLOAT64_C( -859.95), SIMDE_FLOAT64_C( -598.68) },
{ SIMDE_FLOAT64_C( 766.00), SIMDE_FLOAT64_C( 296.00), SIMDE_FLOAT64_C( -701.00), SIMDE_FLOAT64_C( -785.00),
SIMDE_FLOAT64_C( 817.00), SIMDE_FLOAT64_C( -539.00), SIMDE_FLOAT64_C( -859.00), SIMDE_FLOAT64_C( -598.00) } },
{ { SIMDE_FLOAT64_C( -820.22), SIMDE_FLOAT64_C( -710.49), SIMDE_FLOAT64_C( 865.42), SIMDE_FLOAT64_C( 738.57),
SIMDE_FLOAT64_C( 714.34), SIMDE_FLOAT64_C( -416.48), SIMDE_FLOAT64_C( 179.44), SIMDE_FLOAT64_C( 549.20) },
{ SIMDE_FLOAT64_C( -820.00), SIMDE_FLOAT64_C( -710.00), SIMDE_FLOAT64_C( 866.00), SIMDE_FLOAT64_C( 739.00),
SIMDE_FLOAT64_C( 715.00), SIMDE_FLOAT64_C( -416.00), SIMDE_FLOAT64_C( 180.00), SIMDE_FLOAT64_C( 550.00) } },
{ { SIMDE_FLOAT64_C( -204.42), SIMDE_FLOAT64_C( -259.88), SIMDE_FLOAT64_C( 653.14), SIMDE_FLOAT64_C( 721.34),
SIMDE_FLOAT64_C( -859.35), SIMDE_FLOAT64_C( -447.87), SIMDE_FLOAT64_C( -784.28), SIMDE_FLOAT64_C( 374.08) },
{ SIMDE_FLOAT64_C( -204.00), SIMDE_FLOAT64_C( -259.00), SIMDE_FLOAT64_C( 654.00), SIMDE_FLOAT64_C( 722.00),
SIMDE_FLOAT64_C( -859.00), SIMDE_FLOAT64_C( -447.00), SIMDE_FLOAT64_C( -784.00), SIMDE_FLOAT64_C( 375.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_ceil_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_ceil_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 229.81), SIMDE_FLOAT64_C( 525.51), SIMDE_FLOAT64_C( -6.46), SIMDE_FLOAT64_C( -218.89),
SIMDE_FLOAT64_C( -607.98), SIMDE_FLOAT64_C( -552.09), SIMDE_FLOAT64_C( 531.98), SIMDE_FLOAT64_C( 900.69) },
UINT8_C(198),
{ SIMDE_FLOAT64_C( -545.02), SIMDE_FLOAT64_C( 596.71), SIMDE_FLOAT64_C( 311.19), SIMDE_FLOAT64_C( -696.35),
SIMDE_FLOAT64_C( -125.03), SIMDE_FLOAT64_C( -375.13), SIMDE_FLOAT64_C( 455.71), SIMDE_FLOAT64_C( 769.17) },
{ SIMDE_FLOAT64_C( 229.81), SIMDE_FLOAT64_C( 597.00), SIMDE_FLOAT64_C( 312.00), SIMDE_FLOAT64_C( -218.89),
SIMDE_FLOAT64_C( -607.98), SIMDE_FLOAT64_C( -552.09), SIMDE_FLOAT64_C( 456.00), SIMDE_FLOAT64_C( 770.00) } },
{ { SIMDE_FLOAT64_C( 871.47), SIMDE_FLOAT64_C( -774.90), SIMDE_FLOAT64_C( 592.74), SIMDE_FLOAT64_C( -416.66),
SIMDE_FLOAT64_C( -243.97), SIMDE_FLOAT64_C( 106.58), SIMDE_FLOAT64_C( -923.77), SIMDE_FLOAT64_C( -472.30) },
UINT8_C(119),
{ SIMDE_FLOAT64_C( -407.44), SIMDE_FLOAT64_C( -264.38), SIMDE_FLOAT64_C( 828.67), SIMDE_FLOAT64_C( -804.49),
SIMDE_FLOAT64_C( 95.85), SIMDE_FLOAT64_C( 58.48), SIMDE_FLOAT64_C( 721.02), SIMDE_FLOAT64_C( -910.62) },
{ SIMDE_FLOAT64_C( -407.00), SIMDE_FLOAT64_C( -264.00), SIMDE_FLOAT64_C( 829.00), SIMDE_FLOAT64_C( -416.66),
SIMDE_FLOAT64_C( 96.00), SIMDE_FLOAT64_C( 59.00), SIMDE_FLOAT64_C( 722.00), SIMDE_FLOAT64_C( -472.30) } },
{ { SIMDE_FLOAT64_C( 839.59), SIMDE_FLOAT64_C( -886.96), SIMDE_FLOAT64_C( -462.70), SIMDE_FLOAT64_C( 371.56),
SIMDE_FLOAT64_C( -986.28), SIMDE_FLOAT64_C( 467.93), SIMDE_FLOAT64_C( 826.54), SIMDE_FLOAT64_C( 610.43) },
UINT8_C( 11),
{ SIMDE_FLOAT64_C( -869.81), SIMDE_FLOAT64_C( -514.60), SIMDE_FLOAT64_C( 404.00), SIMDE_FLOAT64_C( 585.90),
SIMDE_FLOAT64_C( -745.43), SIMDE_FLOAT64_C( 275.47), SIMDE_FLOAT64_C( 811.00), SIMDE_FLOAT64_C( 847.30) },
{ SIMDE_FLOAT64_C( -869.00), SIMDE_FLOAT64_C( -514.00), SIMDE_FLOAT64_C( -462.70), SIMDE_FLOAT64_C( 586.00),
SIMDE_FLOAT64_C( -986.28), SIMDE_FLOAT64_C( 467.93), SIMDE_FLOAT64_C( 826.54), SIMDE_FLOAT64_C( 610.43) } },
{ { SIMDE_FLOAT64_C( 858.82), SIMDE_FLOAT64_C( -432.97), SIMDE_FLOAT64_C( -46.12), SIMDE_FLOAT64_C( 935.05),
SIMDE_FLOAT64_C( 94.73), SIMDE_FLOAT64_C( -233.07), SIMDE_FLOAT64_C( -472.39), SIMDE_FLOAT64_C( 830.35) },
UINT8_C( 12),
{ SIMDE_FLOAT64_C( -276.88), SIMDE_FLOAT64_C( -73.80), SIMDE_FLOAT64_C( 654.07), SIMDE_FLOAT64_C( -555.86),
SIMDE_FLOAT64_C( 15.59), SIMDE_FLOAT64_C( 493.66), SIMDE_FLOAT64_C( -442.83), SIMDE_FLOAT64_C( 552.88) },
{ SIMDE_FLOAT64_C( 858.82), SIMDE_FLOAT64_C( -432.97), SIMDE_FLOAT64_C( 655.00), SIMDE_FLOAT64_C( -555.00),
SIMDE_FLOAT64_C( 94.73), SIMDE_FLOAT64_C( -233.07), SIMDE_FLOAT64_C( -472.39), SIMDE_FLOAT64_C( 830.35) } },
{ { SIMDE_FLOAT64_C( -134.77), SIMDE_FLOAT64_C( -429.10), SIMDE_FLOAT64_C( 20.82), SIMDE_FLOAT64_C( -308.24),
SIMDE_FLOAT64_C( -818.67), SIMDE_FLOAT64_C( 799.94), SIMDE_FLOAT64_C( -178.05), SIMDE_FLOAT64_C( -333.27) },
UINT8_C(157),
{ SIMDE_FLOAT64_C( -592.15), SIMDE_FLOAT64_C( -78.71), SIMDE_FLOAT64_C( -520.59), SIMDE_FLOAT64_C( -781.15),
SIMDE_FLOAT64_C( -231.40), SIMDE_FLOAT64_C( -661.77), SIMDE_FLOAT64_C( -214.12), SIMDE_FLOAT64_C( 722.48) },
{ SIMDE_FLOAT64_C( -592.00), SIMDE_FLOAT64_C( -429.10), SIMDE_FLOAT64_C( -520.00), SIMDE_FLOAT64_C( -781.00),
SIMDE_FLOAT64_C( -231.00), SIMDE_FLOAT64_C( 799.94), SIMDE_FLOAT64_C( -178.05), SIMDE_FLOAT64_C( 723.00) } },
{ { SIMDE_FLOAT64_C( -726.72), SIMDE_FLOAT64_C( 880.61), SIMDE_FLOAT64_C( -510.59), SIMDE_FLOAT64_C( -199.11),
SIMDE_FLOAT64_C( 710.96), SIMDE_FLOAT64_C( 85.00), SIMDE_FLOAT64_C( 524.01), SIMDE_FLOAT64_C( -362.83) },
UINT8_C(189),
{ SIMDE_FLOAT64_C( 968.14), SIMDE_FLOAT64_C( 652.75), SIMDE_FLOAT64_C( -767.26), SIMDE_FLOAT64_C( -474.68),
SIMDE_FLOAT64_C( 205.64), SIMDE_FLOAT64_C( 97.96), SIMDE_FLOAT64_C( 96.22), SIMDE_FLOAT64_C( -773.55) },
{ SIMDE_FLOAT64_C( 969.00), SIMDE_FLOAT64_C( 880.61), SIMDE_FLOAT64_C( -767.00), SIMDE_FLOAT64_C( -474.00),
SIMDE_FLOAT64_C( 206.00), SIMDE_FLOAT64_C( 98.00), SIMDE_FLOAT64_C( 524.01), SIMDE_FLOAT64_C( -773.00) } },
{ { SIMDE_FLOAT64_C( 789.73), SIMDE_FLOAT64_C( 277.54), SIMDE_FLOAT64_C( -973.60), SIMDE_FLOAT64_C( -388.32),
SIMDE_FLOAT64_C( 944.27), SIMDE_FLOAT64_C( 230.34), SIMDE_FLOAT64_C( 19.53), SIMDE_FLOAT64_C( -134.44) },
UINT8_C( 15),
{ SIMDE_FLOAT64_C( 238.38), SIMDE_FLOAT64_C( 634.16), SIMDE_FLOAT64_C( -952.02), SIMDE_FLOAT64_C( -975.74),
SIMDE_FLOAT64_C( 356.64), SIMDE_FLOAT64_C( -678.74), SIMDE_FLOAT64_C( 904.87), SIMDE_FLOAT64_C( 846.05) },
{ SIMDE_FLOAT64_C( 239.00), SIMDE_FLOAT64_C( 635.00), SIMDE_FLOAT64_C( -952.00), SIMDE_FLOAT64_C( -975.00),
SIMDE_FLOAT64_C( 944.27), SIMDE_FLOAT64_C( 230.34), SIMDE_FLOAT64_C( 19.53), SIMDE_FLOAT64_C( -134.44) } },
{ { SIMDE_FLOAT64_C( 122.14), SIMDE_FLOAT64_C( 615.84), SIMDE_FLOAT64_C( -68.95), SIMDE_FLOAT64_C( -353.85),
SIMDE_FLOAT64_C( -747.00), SIMDE_FLOAT64_C( 670.13), SIMDE_FLOAT64_C( -385.71), SIMDE_FLOAT64_C( 905.76) },
UINT8_C( 69),
{ SIMDE_FLOAT64_C( 139.61), SIMDE_FLOAT64_C( 111.39), SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( -764.17),
SIMDE_FLOAT64_C( 337.85), SIMDE_FLOAT64_C( -209.44), SIMDE_FLOAT64_C( 513.37), SIMDE_FLOAT64_C( 364.24) },
{ SIMDE_FLOAT64_C( 140.00), SIMDE_FLOAT64_C( 615.84), SIMDE_FLOAT64_C( 1.00), SIMDE_FLOAT64_C( -353.85),
SIMDE_FLOAT64_C( -747.00), SIMDE_FLOAT64_C( 670.13), SIMDE_FLOAT64_C( 514.00), SIMDE_FLOAT64_C( 905.76) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_ceil_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_svml_sqrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 528.60), SIMDE_FLOAT32_C( 322.33), SIMDE_FLOAT32_C( 385.23), SIMDE_FLOAT32_C( 814.87) },
{ SIMDE_FLOAT32_C( 22.99), SIMDE_FLOAT32_C( 17.95), SIMDE_FLOAT32_C( 19.63), SIMDE_FLOAT32_C( 28.55) } },
{ { SIMDE_FLOAT32_C( 587.72), SIMDE_FLOAT32_C( 685.82), SIMDE_FLOAT32_C( 593.20), SIMDE_FLOAT32_C( 733.30) },
{ SIMDE_FLOAT32_C( 24.24), SIMDE_FLOAT32_C( 26.19), SIMDE_FLOAT32_C( 24.36), SIMDE_FLOAT32_C( 27.08) } },
{ { SIMDE_FLOAT32_C( 325.19), SIMDE_FLOAT32_C( 348.73), SIMDE_FLOAT32_C( 342.79), SIMDE_FLOAT32_C( 565.69) },
{ SIMDE_FLOAT32_C( 18.03), SIMDE_FLOAT32_C( 18.67), SIMDE_FLOAT32_C( 18.51), SIMDE_FLOAT32_C( 23.78) } },
{ { SIMDE_FLOAT32_C( 148.43), SIMDE_FLOAT32_C( 85.30), SIMDE_FLOAT32_C( 679.23), SIMDE_FLOAT32_C( 235.95) },
{ SIMDE_FLOAT32_C( 12.18), SIMDE_FLOAT32_C( 9.24), SIMDE_FLOAT32_C( 26.06), SIMDE_FLOAT32_C( 15.36) } },
{ { SIMDE_FLOAT32_C( 741.81), SIMDE_FLOAT32_C( 327.17), SIMDE_FLOAT32_C( 932.33), SIMDE_FLOAT32_C( 431.37) },
{ SIMDE_FLOAT32_C( 27.24), SIMDE_FLOAT32_C( 18.09), SIMDE_FLOAT32_C( 30.53), SIMDE_FLOAT32_C( 20.77) } },
{ { SIMDE_FLOAT32_C( 630.74), SIMDE_FLOAT32_C( 622.98), SIMDE_FLOAT32_C( 345.17), SIMDE_FLOAT32_C( 666.65) },
{ SIMDE_FLOAT32_C( 25.11), SIMDE_FLOAT32_C( 24.96), SIMDE_FLOAT32_C( 18.58), SIMDE_FLOAT32_C( 25.82) } },
{ { SIMDE_FLOAT32_C( 95.65), SIMDE_FLOAT32_C( 585.30), SIMDE_FLOAT32_C( 996.40), SIMDE_FLOAT32_C( 212.96) },
{ SIMDE_FLOAT32_C( 9.78), SIMDE_FLOAT32_C( 24.19), SIMDE_FLOAT32_C( 31.57), SIMDE_FLOAT32_C( 14.59) } },
{ { SIMDE_FLOAT32_C( 691.00), SIMDE_FLOAT32_C( 383.56), SIMDE_FLOAT32_C( 356.19), SIMDE_FLOAT32_C( 219.60) },
{ SIMDE_FLOAT32_C( 26.29), SIMDE_FLOAT32_C( 19.58), SIMDE_FLOAT32_C( 18.87), SIMDE_FLOAT32_C( 14.82) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_svml_sqrt_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_svml_floor_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -368.97), SIMDE_FLOAT32_C( -986.85), SIMDE_FLOAT32_C( 853.49), SIMDE_FLOAT32_C( 45.17) },
{ SIMDE_FLOAT32_C( -369.00), SIMDE_FLOAT32_C( -987.00), SIMDE_FLOAT32_C( 853.00), SIMDE_FLOAT32_C( 45.00) } },
{ { SIMDE_FLOAT32_C( 562.02), SIMDE_FLOAT32_C( -924.44), SIMDE_FLOAT32_C( -802.09), SIMDE_FLOAT32_C( 17.88) },
{ SIMDE_FLOAT32_C( 562.00), SIMDE_FLOAT32_C( -925.00), SIMDE_FLOAT32_C( -803.00), SIMDE_FLOAT32_C( 17.00) } },
{ { SIMDE_FLOAT32_C( -773.69), SIMDE_FLOAT32_C( -929.41), SIMDE_FLOAT32_C( -376.84), SIMDE_FLOAT32_C( -575.41) },
{ SIMDE_FLOAT32_C( -774.00), SIMDE_FLOAT32_C( -930.00), SIMDE_FLOAT32_C( -377.00), SIMDE_FLOAT32_C( -576.00) } },
{ { SIMDE_FLOAT32_C( 694.60), SIMDE_FLOAT32_C( 556.86), SIMDE_FLOAT32_C( 755.76), SIMDE_FLOAT32_C( -3.15) },
{ SIMDE_FLOAT32_C( 694.00), SIMDE_FLOAT32_C( 556.00), SIMDE_FLOAT32_C( 755.00), SIMDE_FLOAT32_C( -4.00) } },
{ { SIMDE_FLOAT32_C( -225.40), SIMDE_FLOAT32_C( 440.47), SIMDE_FLOAT32_C( -328.64), SIMDE_FLOAT32_C( -113.66) },
{ SIMDE_FLOAT32_C( -226.00), SIMDE_FLOAT32_C( 440.00), SIMDE_FLOAT32_C( -329.00), SIMDE_FLOAT32_C( -114.00) } },
{ { SIMDE_FLOAT32_C( -752.27), SIMDE_FLOAT32_C( -305.67), SIMDE_FLOAT32_C( -135.72), SIMDE_FLOAT32_C( -501.04) },
{ SIMDE_FLOAT32_C( -753.00), SIMDE_FLOAT32_C( -306.00), SIMDE_FLOAT32_C( -136.00), SIMDE_FLOAT32_C( -502.00) } },
{ { SIMDE_FLOAT32_C( 156.35), SIMDE_FLOAT32_C( 898.85), SIMDE_FLOAT32_C( -988.19), SIMDE_FLOAT32_C( 407.13) },
{ SIMDE_FLOAT32_C( 156.00), SIMDE_FLOAT32_C( 898.00), SIMDE_FLOAT32_C( -989.00), SIMDE_FLOAT32_C( 407.00) } },
{ { SIMDE_FLOAT32_C( 973.98), SIMDE_FLOAT32_C( 721.39), SIMDE_FLOAT32_C( -631.24), SIMDE_FLOAT32_C( -394.99) },
{ SIMDE_FLOAT32_C( 973.00), SIMDE_FLOAT32_C( 721.00), SIMDE_FLOAT32_C( -632.00), SIMDE_FLOAT32_C( -395.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_svml_floor_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_svml_floor_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -495.36), SIMDE_FLOAT64_C( 574.97) },
{ SIMDE_FLOAT64_C( -496.00), SIMDE_FLOAT64_C( 574.00) } },
{ { SIMDE_FLOAT64_C( -571.90), SIMDE_FLOAT64_C( -4.02) },
{ SIMDE_FLOAT64_C( -572.00), SIMDE_FLOAT64_C( -5.00) } },
{ { SIMDE_FLOAT64_C( -111.97), SIMDE_FLOAT64_C( -326.91) },
{ SIMDE_FLOAT64_C( -112.00), SIMDE_FLOAT64_C( -327.00) } },
{ { SIMDE_FLOAT64_C( -366.90), SIMDE_FLOAT64_C( 909.28) },
{ SIMDE_FLOAT64_C( -367.00), SIMDE_FLOAT64_C( 909.00) } },
{ { SIMDE_FLOAT64_C( -637.61), SIMDE_FLOAT64_C( 377.44) },
{ SIMDE_FLOAT64_C( -638.00), SIMDE_FLOAT64_C( 377.00) } },
{ { SIMDE_FLOAT64_C( 358.88), SIMDE_FLOAT64_C( 783.39) },
{ SIMDE_FLOAT64_C( 358.00), SIMDE_FLOAT64_C( 783.00) } },
{ { SIMDE_FLOAT64_C( 137.00), SIMDE_FLOAT64_C( -315.38) },
{ SIMDE_FLOAT64_C( 137.00), SIMDE_FLOAT64_C( -316.00) } },
{ { SIMDE_FLOAT64_C( 20.73), SIMDE_FLOAT64_C( -927.12) },
{ SIMDE_FLOAT64_C( 20.00), SIMDE_FLOAT64_C( -928.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_svml_floor_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_svml_floor_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -100.83), SIMDE_FLOAT32_C( -16.23), SIMDE_FLOAT32_C( 689.00), SIMDE_FLOAT32_C( 627.12),
SIMDE_FLOAT32_C( -725.64), SIMDE_FLOAT32_C( -272.67), SIMDE_FLOAT32_C( 477.57), SIMDE_FLOAT32_C( 968.62) },
{ SIMDE_FLOAT32_C( -101.00), SIMDE_FLOAT32_C( -17.00), SIMDE_FLOAT32_C( 689.00), SIMDE_FLOAT32_C( 627.00),
SIMDE_FLOAT32_C( -726.00), SIMDE_FLOAT32_C( -273.00), SIMDE_FLOAT32_C( 477.00), SIMDE_FLOAT32_C( 968.00) } },
{ { SIMDE_FLOAT32_C( 259.55), SIMDE_FLOAT32_C( -892.87), SIMDE_FLOAT32_C( 37.54), SIMDE_FLOAT32_C( -594.84),
SIMDE_FLOAT32_C( 992.66), SIMDE_FLOAT32_C( 528.53), SIMDE_FLOAT32_C( -44.54), SIMDE_FLOAT32_C( 305.85) },
{ SIMDE_FLOAT32_C( 259.00), SIMDE_FLOAT32_C( -893.00), SIMDE_FLOAT32_C( 37.00), SIMDE_FLOAT32_C( -595.00),
SIMDE_FLOAT32_C( 992.00), SIMDE_FLOAT32_C( 528.00), SIMDE_FLOAT32_C( -45.00), SIMDE_FLOAT32_C( 305.00) } },
{ { SIMDE_FLOAT32_C( 785.51), SIMDE_FLOAT32_C( -262.72), SIMDE_FLOAT32_C( 566.52), SIMDE_FLOAT32_C( -760.14),
SIMDE_FLOAT32_C( 801.95), SIMDE_FLOAT32_C( 597.73), SIMDE_FLOAT32_C( -180.14), SIMDE_FLOAT32_C( 556.25) },
{ SIMDE_FLOAT32_C( 785.00), SIMDE_FLOAT32_C( -263.00), SIMDE_FLOAT32_C( 566.00), SIMDE_FLOAT32_C( -761.00),
SIMDE_FLOAT32_C( 801.00), SIMDE_FLOAT32_C( 597.00), SIMDE_FLOAT32_C( -181.00), SIMDE_FLOAT32_C( 556.00) } },
{ { SIMDE_FLOAT32_C( -337.69), SIMDE_FLOAT32_C( -509.08), SIMDE_FLOAT32_C( 665.71), SIMDE_FLOAT32_C( 342.73),
SIMDE_FLOAT32_C( 672.76), SIMDE_FLOAT32_C( -625.02), SIMDE_FLOAT32_C( -13.36), SIMDE_FLOAT32_C( -428.07) },
{ SIMDE_FLOAT32_C( -338.00), SIMDE_FLOAT32_C( -510.00), SIMDE_FLOAT32_C( 665.00), SIMDE_FLOAT32_C( 342.00),
SIMDE_FLOAT32_C( 672.00), SIMDE_FLOAT32_C( -626.00), SIMDE_FLOAT32_C( -14.00), SIMDE_FLOAT32_C( -429.00) } },
{ { SIMDE_FLOAT32_C( 358.75), SIMDE_FLOAT32_C( -324.36), SIMDE_FLOAT32_C( -800.95), SIMDE_FLOAT32_C( 633.11),
SIMDE_FLOAT32_C( 402.96), SIMDE_FLOAT32_C( 676.62), SIMDE_FLOAT32_C( 601.73), SIMDE_FLOAT32_C( -337.48) },
{ SIMDE_FLOAT32_C( 358.00), SIMDE_FLOAT32_C( -325.00), SIMDE_FLOAT32_C( -801.00), SIMDE_FLOAT32_C( 633.00),
SIMDE_FLOAT32_C( 402.00), SIMDE_FLOAT32_C( 676.00), SIMDE_FLOAT32_C( 601.00), SIMDE_FLOAT32_C( -338.00) } },
{ { SIMDE_FLOAT32_C( 783.75), SIMDE_FLOAT32_C( -360.73), SIMDE_FLOAT32_C( 67.67), SIMDE_FLOAT32_C( 776.41),
SIMDE_FLOAT32_C( -832.20), SIMDE_FLOAT32_C( -976.87), SIMDE_FLOAT32_C( 82.26), SIMDE_FLOAT32_C( 953.31) },
{ SIMDE_FLOAT32_C( 783.00), SIMDE_FLOAT32_C( -361.00), SIMDE_FLOAT32_C( 67.00), SIMDE_FLOAT32_C( 776.00),
SIMDE_FLOAT32_C( -833.00), SIMDE_FLOAT32_C( -977.00), SIMDE_FLOAT32_C( 82.00), SIMDE_FLOAT32_C( 953.00) } },
{ { SIMDE_FLOAT32_C( -239.59), SIMDE_FLOAT32_C( -351.22), SIMDE_FLOAT32_C( -806.83), SIMDE_FLOAT32_C( -437.64),
SIMDE_FLOAT32_C( -753.50), SIMDE_FLOAT32_C( 13.03), SIMDE_FLOAT32_C( -881.39), SIMDE_FLOAT32_C( -91.19) },
{ SIMDE_FLOAT32_C( -240.00), SIMDE_FLOAT32_C( -352.00), SIMDE_FLOAT32_C( -807.00), SIMDE_FLOAT32_C( -438.00),
SIMDE_FLOAT32_C( -754.00), SIMDE_FLOAT32_C( 13.00), SIMDE_FLOAT32_C( -882.00), SIMDE_FLOAT32_C( -92.00) } },
{ { SIMDE_FLOAT32_C( 503.95), SIMDE_FLOAT32_C( 784.32), SIMDE_FLOAT32_C( -748.46), SIMDE_FLOAT32_C( 176.71),
SIMDE_FLOAT32_C( -840.70), SIMDE_FLOAT32_C( 238.18), SIMDE_FLOAT32_C( 748.64), SIMDE_FLOAT32_C( 518.06) },
{ SIMDE_FLOAT32_C( 503.00), SIMDE_FLOAT32_C( 784.00), SIMDE_FLOAT32_C( -749.00), SIMDE_FLOAT32_C( 176.00),
SIMDE_FLOAT32_C( -841.00), SIMDE_FLOAT32_C( 238.00), SIMDE_FLOAT32_C( 748.00), SIMDE_FLOAT32_C( 518.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_svml_floor_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_svml_floor_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -780.47), SIMDE_FLOAT64_C( -616.82), SIMDE_FLOAT64_C( -962.48), SIMDE_FLOAT64_C( -74.66) },
{ SIMDE_FLOAT64_C( -781.00), SIMDE_FLOAT64_C( -617.00), SIMDE_FLOAT64_C( -963.00), SIMDE_FLOAT64_C( -75.00) } },
{ { SIMDE_FLOAT64_C( -359.82), SIMDE_FLOAT64_C( -704.98), SIMDE_FLOAT64_C( 11.20), SIMDE_FLOAT64_C( 223.91) },
{ SIMDE_FLOAT64_C( -360.00), SIMDE_FLOAT64_C( -705.00), SIMDE_FLOAT64_C( 11.00), SIMDE_FLOAT64_C( 223.00) } },
{ { SIMDE_FLOAT64_C( 173.90), SIMDE_FLOAT64_C( 506.89), SIMDE_FLOAT64_C( 153.15), SIMDE_FLOAT64_C( -180.08) },
{ SIMDE_FLOAT64_C( 173.00), SIMDE_FLOAT64_C( 506.00), SIMDE_FLOAT64_C( 153.00), SIMDE_FLOAT64_C( -181.00) } },
{ { SIMDE_FLOAT64_C( -673.54), SIMDE_FLOAT64_C( 252.79), SIMDE_FLOAT64_C( 95.13), SIMDE_FLOAT64_C( -639.41) },
{ SIMDE_FLOAT64_C( -674.00), SIMDE_FLOAT64_C( 252.00), SIMDE_FLOAT64_C( 95.00), SIMDE_FLOAT64_C( -640.00) } },
{ { SIMDE_FLOAT64_C( -419.46), SIMDE_FLOAT64_C( 418.21), SIMDE_FLOAT64_C( -778.55), SIMDE_FLOAT64_C( -706.38) },
{ SIMDE_FLOAT64_C( -420.00), SIMDE_FLOAT64_C( 418.00), SIMDE_FLOAT64_C( -779.00), SIMDE_FLOAT64_C( -707.00) } },
{ { SIMDE_FLOAT64_C( -178.87), SIMDE_FLOAT64_C( -923.30), SIMDE_FLOAT64_C( -302.46), SIMDE_FLOAT64_C( -406.02) },
{ SIMDE_FLOAT64_C( -179.00), SIMDE_FLOAT64_C( -924.00), SIMDE_FLOAT64_C( -303.00), SIMDE_FLOAT64_C( -407.00) } },
{ { SIMDE_FLOAT64_C( 447.97), SIMDE_FLOAT64_C( 431.46), SIMDE_FLOAT64_C( -217.97), SIMDE_FLOAT64_C( -97.70) },
{ SIMDE_FLOAT64_C( 447.00), SIMDE_FLOAT64_C( 431.00), SIMDE_FLOAT64_C( -218.00), SIMDE_FLOAT64_C( -98.00) } },
{ { SIMDE_FLOAT64_C( 148.46), SIMDE_FLOAT64_C( 945.32), SIMDE_FLOAT64_C( -663.02), SIMDE_FLOAT64_C( 367.98) },
{ SIMDE_FLOAT64_C( 148.00), SIMDE_FLOAT64_C( 945.00), SIMDE_FLOAT64_C( -664.00), SIMDE_FLOAT64_C( 367.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_svml_floor_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_floor_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 739.06), SIMDE_FLOAT32_C( 515.75), SIMDE_FLOAT32_C( -321.43), SIMDE_FLOAT32_C( -796.82),
SIMDE_FLOAT32_C( -649.68), SIMDE_FLOAT32_C( -774.53), SIMDE_FLOAT32_C( 961.31), SIMDE_FLOAT32_C( 113.28),
SIMDE_FLOAT32_C( -29.07), SIMDE_FLOAT32_C( -213.74), SIMDE_FLOAT32_C( -272.53), SIMDE_FLOAT32_C( -357.78),
SIMDE_FLOAT32_C( 211.62), SIMDE_FLOAT32_C( 164.32), SIMDE_FLOAT32_C( -909.49), SIMDE_FLOAT32_C( 809.56) },
{ SIMDE_FLOAT32_C( 739.00), SIMDE_FLOAT32_C( 515.00), SIMDE_FLOAT32_C( -322.00), SIMDE_FLOAT32_C( -797.00),
SIMDE_FLOAT32_C( -650.00), SIMDE_FLOAT32_C( -775.00), SIMDE_FLOAT32_C( 961.00), SIMDE_FLOAT32_C( 113.00),
SIMDE_FLOAT32_C( -30.00), SIMDE_FLOAT32_C( -214.00), SIMDE_FLOAT32_C( -273.00), SIMDE_FLOAT32_C( -358.00),
SIMDE_FLOAT32_C( 211.00), SIMDE_FLOAT32_C( 164.00), SIMDE_FLOAT32_C( -910.00), SIMDE_FLOAT32_C( 809.00) } },
{ { SIMDE_FLOAT32_C( 405.65), SIMDE_FLOAT32_C( -257.98), SIMDE_FLOAT32_C( -364.12), SIMDE_FLOAT32_C( -228.18),
SIMDE_FLOAT32_C( 200.69), SIMDE_FLOAT32_C( 614.44), SIMDE_FLOAT32_C( -198.53), SIMDE_FLOAT32_C( -756.05),
SIMDE_FLOAT32_C( -833.98), SIMDE_FLOAT32_C( 480.36), SIMDE_FLOAT32_C( 574.27), SIMDE_FLOAT32_C( -408.80),
SIMDE_FLOAT32_C( 768.69), SIMDE_FLOAT32_C( 342.19), SIMDE_FLOAT32_C( -17.03), SIMDE_FLOAT32_C( 507.75) },
{ SIMDE_FLOAT32_C( 405.00), SIMDE_FLOAT32_C( -258.00), SIMDE_FLOAT32_C( -365.00), SIMDE_FLOAT32_C( -229.00),
SIMDE_FLOAT32_C( 200.00), SIMDE_FLOAT32_C( 614.00), SIMDE_FLOAT32_C( -199.00), SIMDE_FLOAT32_C( -757.00),
SIMDE_FLOAT32_C( -834.00), SIMDE_FLOAT32_C( 480.00), SIMDE_FLOAT32_C( 574.00), SIMDE_FLOAT32_C( -409.00),
SIMDE_FLOAT32_C( 768.00), SIMDE_FLOAT32_C( 342.00), SIMDE_FLOAT32_C( -18.00), SIMDE_FLOAT32_C( 507.00) } },
{ { SIMDE_FLOAT32_C( -142.06), SIMDE_FLOAT32_C( 661.53), SIMDE_FLOAT32_C( 710.93), SIMDE_FLOAT32_C( 208.26),
SIMDE_FLOAT32_C( 887.01), SIMDE_FLOAT32_C( 672.24), SIMDE_FLOAT32_C( -678.46), SIMDE_FLOAT32_C( -142.06),
SIMDE_FLOAT32_C( -541.50), SIMDE_FLOAT32_C( 49.01), SIMDE_FLOAT32_C( 500.16), SIMDE_FLOAT32_C( 670.12),
SIMDE_FLOAT32_C( -786.67), SIMDE_FLOAT32_C( 590.66), SIMDE_FLOAT32_C( 479.68), SIMDE_FLOAT32_C( 618.98) },
{ SIMDE_FLOAT32_C( -143.00), SIMDE_FLOAT32_C( 661.00), SIMDE_FLOAT32_C( 710.00), SIMDE_FLOAT32_C( 208.00),
SIMDE_FLOAT32_C( 887.00), SIMDE_FLOAT32_C( 672.00), SIMDE_FLOAT32_C( -679.00), SIMDE_FLOAT32_C( -143.00),
SIMDE_FLOAT32_C( -542.00), SIMDE_FLOAT32_C( 49.00), SIMDE_FLOAT32_C( 500.00), SIMDE_FLOAT32_C( 670.00),
SIMDE_FLOAT32_C( -787.00), SIMDE_FLOAT32_C( 590.00), SIMDE_FLOAT32_C( 479.00), SIMDE_FLOAT32_C( 618.00) } },
{ { SIMDE_FLOAT32_C( -667.32), SIMDE_FLOAT32_C( -884.44), SIMDE_FLOAT32_C( -609.20), SIMDE_FLOAT32_C( 533.37),
SIMDE_FLOAT32_C( 730.00), SIMDE_FLOAT32_C( 192.28), SIMDE_FLOAT32_C( 777.32), SIMDE_FLOAT32_C( 896.02),
SIMDE_FLOAT32_C( -327.36), SIMDE_FLOAT32_C( 351.59), SIMDE_FLOAT32_C( -512.78), SIMDE_FLOAT32_C( -558.68),
SIMDE_FLOAT32_C( -306.22), SIMDE_FLOAT32_C( 470.19), SIMDE_FLOAT32_C( 949.07), SIMDE_FLOAT32_C( 551.72) },
{ SIMDE_FLOAT32_C( -668.00), SIMDE_FLOAT32_C( -885.00), SIMDE_FLOAT32_C( -610.00), SIMDE_FLOAT32_C( 533.00),
SIMDE_FLOAT32_C( 730.00), SIMDE_FLOAT32_C( 192.00), SIMDE_FLOAT32_C( 777.00), SIMDE_FLOAT32_C( 896.00),
SIMDE_FLOAT32_C( -328.00), SIMDE_FLOAT32_C( 351.00), SIMDE_FLOAT32_C( -513.00), SIMDE_FLOAT32_C( -559.00),
SIMDE_FLOAT32_C( -307.00), SIMDE_FLOAT32_C( 470.00), SIMDE_FLOAT32_C( 949.00), SIMDE_FLOAT32_C( 551.00) } },
{ { SIMDE_FLOAT32_C( 131.72), SIMDE_FLOAT32_C( 660.01), SIMDE_FLOAT32_C( -240.02), SIMDE_FLOAT32_C( 18.73),
SIMDE_FLOAT32_C( 332.25), SIMDE_FLOAT32_C( 81.52), SIMDE_FLOAT32_C( 876.67), SIMDE_FLOAT32_C( 790.75),
SIMDE_FLOAT32_C( -869.47), SIMDE_FLOAT32_C( 376.83), SIMDE_FLOAT32_C( 460.87), SIMDE_FLOAT32_C( -656.14),
SIMDE_FLOAT32_C( -32.51), SIMDE_FLOAT32_C( -59.45), SIMDE_FLOAT32_C( 962.84), SIMDE_FLOAT32_C( 300.17) },
{ SIMDE_FLOAT32_C( 131.00), SIMDE_FLOAT32_C( 660.00), SIMDE_FLOAT32_C( -241.00), SIMDE_FLOAT32_C( 18.00),
SIMDE_FLOAT32_C( 332.00), SIMDE_FLOAT32_C( 81.00), SIMDE_FLOAT32_C( 876.00), SIMDE_FLOAT32_C( 790.00),
SIMDE_FLOAT32_C( -870.00), SIMDE_FLOAT32_C( 376.00), SIMDE_FLOAT32_C( 460.00), SIMDE_FLOAT32_C( -657.00),
SIMDE_FLOAT32_C( -33.00), SIMDE_FLOAT32_C( -60.00), SIMDE_FLOAT32_C( 962.00), SIMDE_FLOAT32_C( 300.00) } },
{ { SIMDE_FLOAT32_C( 56.12), SIMDE_FLOAT32_C( -646.35), SIMDE_FLOAT32_C( -166.46), SIMDE_FLOAT32_C( -213.88),
SIMDE_FLOAT32_C( 545.92), SIMDE_FLOAT32_C( -389.14), SIMDE_FLOAT32_C( -317.86), SIMDE_FLOAT32_C( -781.44),
SIMDE_FLOAT32_C( 962.45), SIMDE_FLOAT32_C( 169.37), SIMDE_FLOAT32_C( -340.12), SIMDE_FLOAT32_C( -343.77),
SIMDE_FLOAT32_C( -360.44), SIMDE_FLOAT32_C( -391.05), SIMDE_FLOAT32_C( -792.05), SIMDE_FLOAT32_C( 771.28) },
{ SIMDE_FLOAT32_C( 56.00), SIMDE_FLOAT32_C( -647.00), SIMDE_FLOAT32_C( -167.00), SIMDE_FLOAT32_C( -214.00),
SIMDE_FLOAT32_C( 545.00), SIMDE_FLOAT32_C( -390.00), SIMDE_FLOAT32_C( -318.00), SIMDE_FLOAT32_C( -782.00),
SIMDE_FLOAT32_C( 962.00), SIMDE_FLOAT32_C( 169.00), SIMDE_FLOAT32_C( -341.00), SIMDE_FLOAT32_C( -344.00),
SIMDE_FLOAT32_C( -361.00), SIMDE_FLOAT32_C( -392.00), SIMDE_FLOAT32_C( -793.00), SIMDE_FLOAT32_C( 771.00) } },
{ { SIMDE_FLOAT32_C( -731.04), SIMDE_FLOAT32_C( -32.07), SIMDE_FLOAT32_C( -209.99), SIMDE_FLOAT32_C( 601.21),
SIMDE_FLOAT32_C( -950.55), SIMDE_FLOAT32_C( -333.32), SIMDE_FLOAT32_C( 391.96), SIMDE_FLOAT32_C( -820.02),
SIMDE_FLOAT32_C( -956.49), SIMDE_FLOAT32_C( -147.17), SIMDE_FLOAT32_C( -476.16), SIMDE_FLOAT32_C( 11.00),
SIMDE_FLOAT32_C( 793.38), SIMDE_FLOAT32_C( -513.32), SIMDE_FLOAT32_C( -688.82), SIMDE_FLOAT32_C( -150.50) },
{ SIMDE_FLOAT32_C( -732.00), SIMDE_FLOAT32_C( -33.00), SIMDE_FLOAT32_C( -210.00), SIMDE_FLOAT32_C( 601.00),
SIMDE_FLOAT32_C( -951.00), SIMDE_FLOAT32_C( -334.00), SIMDE_FLOAT32_C( 391.00), SIMDE_FLOAT32_C( -821.00),
SIMDE_FLOAT32_C( -957.00), SIMDE_FLOAT32_C( -148.00), SIMDE_FLOAT32_C( -477.00), SIMDE_FLOAT32_C( 11.00),
SIMDE_FLOAT32_C( 793.00), SIMDE_FLOAT32_C( -514.00), SIMDE_FLOAT32_C( -689.00), SIMDE_FLOAT32_C( -151.00) } },
{ { SIMDE_FLOAT32_C( -159.67), SIMDE_FLOAT32_C( 144.72), SIMDE_FLOAT32_C( 635.62), SIMDE_FLOAT32_C( -613.75),
SIMDE_FLOAT32_C( 755.58), SIMDE_FLOAT32_C( -682.24), SIMDE_FLOAT32_C( -395.19), SIMDE_FLOAT32_C( 718.03),
SIMDE_FLOAT32_C( 487.12), SIMDE_FLOAT32_C( 264.69), SIMDE_FLOAT32_C( -625.74), SIMDE_FLOAT32_C( -873.32),
SIMDE_FLOAT32_C( 873.65), SIMDE_FLOAT32_C( -417.79), SIMDE_FLOAT32_C( 897.96), SIMDE_FLOAT32_C( -857.39) },
{ SIMDE_FLOAT32_C( -160.00), SIMDE_FLOAT32_C( 144.00), SIMDE_FLOAT32_C( 635.00), SIMDE_FLOAT32_C( -614.00),
SIMDE_FLOAT32_C( 755.00), SIMDE_FLOAT32_C( -683.00), SIMDE_FLOAT32_C( -396.00), SIMDE_FLOAT32_C( 718.00),
SIMDE_FLOAT32_C( 487.00), SIMDE_FLOAT32_C( 264.00), SIMDE_FLOAT32_C( -626.00), SIMDE_FLOAT32_C( -874.00),
SIMDE_FLOAT32_C( 873.00), SIMDE_FLOAT32_C( -418.00), SIMDE_FLOAT32_C( 897.00), SIMDE_FLOAT32_C( -858.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_floor_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_floor_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 838.31), SIMDE_FLOAT32_C( 390.56), SIMDE_FLOAT32_C( -564.73), SIMDE_FLOAT32_C( 308.39),
SIMDE_FLOAT32_C( 260.44), SIMDE_FLOAT32_C( -533.14), SIMDE_FLOAT32_C( -978.85), SIMDE_FLOAT32_C( -130.38),
SIMDE_FLOAT32_C( 176.76), SIMDE_FLOAT32_C( -227.10), SIMDE_FLOAT32_C( -128.58), SIMDE_FLOAT32_C( 463.85),
SIMDE_FLOAT32_C( -349.81), SIMDE_FLOAT32_C( 938.22), SIMDE_FLOAT32_C( -414.35), SIMDE_FLOAT32_C( 715.39) },
UINT8_C( 56),
{ SIMDE_FLOAT32_C( 324.66), SIMDE_FLOAT32_C( -904.07), SIMDE_FLOAT32_C( 834.59), SIMDE_FLOAT32_C( -638.12),
SIMDE_FLOAT32_C( -994.43), SIMDE_FLOAT32_C( -322.02), SIMDE_FLOAT32_C( 105.22), SIMDE_FLOAT32_C( -770.91),
SIMDE_FLOAT32_C( 604.26), SIMDE_FLOAT32_C( -988.26), SIMDE_FLOAT32_C( -580.41), SIMDE_FLOAT32_C( 673.34),
SIMDE_FLOAT32_C( 425.23), SIMDE_FLOAT32_C( 713.78), SIMDE_FLOAT32_C( 511.64), SIMDE_FLOAT32_C( -184.21) },
{ SIMDE_FLOAT32_C( 838.31), SIMDE_FLOAT32_C( 390.56), SIMDE_FLOAT32_C( -564.73), SIMDE_FLOAT32_C( -639.00),
SIMDE_FLOAT32_C( -995.00), SIMDE_FLOAT32_C( -323.00), SIMDE_FLOAT32_C( -978.85), SIMDE_FLOAT32_C( -130.38),
SIMDE_FLOAT32_C( 176.76), SIMDE_FLOAT32_C( -227.10), SIMDE_FLOAT32_C( -128.58), SIMDE_FLOAT32_C( 463.85),
SIMDE_FLOAT32_C( -349.81), SIMDE_FLOAT32_C( 938.22), SIMDE_FLOAT32_C( -414.35), SIMDE_FLOAT32_C( 715.39) } },
{ { SIMDE_FLOAT32_C( -850.94), SIMDE_FLOAT32_C( -179.97), SIMDE_FLOAT32_C( -923.77), SIMDE_FLOAT32_C( -384.08),
SIMDE_FLOAT32_C( -158.82), SIMDE_FLOAT32_C( -54.15), SIMDE_FLOAT32_C( 792.68), SIMDE_FLOAT32_C( 614.08),
SIMDE_FLOAT32_C( 817.27), SIMDE_FLOAT32_C( 256.54), SIMDE_FLOAT32_C( -735.74), SIMDE_FLOAT32_C( 755.49),
SIMDE_FLOAT32_C( 842.19), SIMDE_FLOAT32_C( 979.66), SIMDE_FLOAT32_C( 610.86), SIMDE_FLOAT32_C( 166.85) },
UINT8_C(134),
{ SIMDE_FLOAT32_C( 445.45), SIMDE_FLOAT32_C( 528.73), SIMDE_FLOAT32_C( -918.84), SIMDE_FLOAT32_C( -876.56),
SIMDE_FLOAT32_C( -366.04), SIMDE_FLOAT32_C( -689.75), SIMDE_FLOAT32_C( 727.70), SIMDE_FLOAT32_C( -354.31),
SIMDE_FLOAT32_C( -270.16), SIMDE_FLOAT32_C( 401.04), SIMDE_FLOAT32_C( -929.08), SIMDE_FLOAT32_C( -556.38),
SIMDE_FLOAT32_C( -87.32), SIMDE_FLOAT32_C( -113.29), SIMDE_FLOAT32_C( -407.33), SIMDE_FLOAT32_C( 732.72) },
{ SIMDE_FLOAT32_C( -850.94), SIMDE_FLOAT32_C( 528.00), SIMDE_FLOAT32_C( -919.00), SIMDE_FLOAT32_C( -384.08),
SIMDE_FLOAT32_C( -158.82), SIMDE_FLOAT32_C( -54.15), SIMDE_FLOAT32_C( 792.68), SIMDE_FLOAT32_C( -355.00),
SIMDE_FLOAT32_C( 817.27), SIMDE_FLOAT32_C( 256.54), SIMDE_FLOAT32_C( -735.74), SIMDE_FLOAT32_C( 755.49),
SIMDE_FLOAT32_C( 842.19), SIMDE_FLOAT32_C( 979.66), SIMDE_FLOAT32_C( 610.86), SIMDE_FLOAT32_C( 166.85) } },
{ { SIMDE_FLOAT32_C( -37.05), SIMDE_FLOAT32_C( 208.59), SIMDE_FLOAT32_C( -426.10), SIMDE_FLOAT32_C( 908.80),
SIMDE_FLOAT32_C( 1.27), SIMDE_FLOAT32_C( -812.02), SIMDE_FLOAT32_C( 726.06), SIMDE_FLOAT32_C( -742.19),
SIMDE_FLOAT32_C( -547.76), SIMDE_FLOAT32_C( 481.55), SIMDE_FLOAT32_C( -900.00), SIMDE_FLOAT32_C( -568.10),
SIMDE_FLOAT32_C( 92.41), SIMDE_FLOAT32_C( 266.85), SIMDE_FLOAT32_C( -492.51), SIMDE_FLOAT32_C( -462.13) },
UINT8_C(130),
{ SIMDE_FLOAT32_C( -411.36), SIMDE_FLOAT32_C( -338.69), SIMDE_FLOAT32_C( 429.54), SIMDE_FLOAT32_C( -101.11),
SIMDE_FLOAT32_C( -610.99), SIMDE_FLOAT32_C( -924.77), SIMDE_FLOAT32_C( 628.73), SIMDE_FLOAT32_C( 790.05),
SIMDE_FLOAT32_C( -853.85), SIMDE_FLOAT32_C( -927.65), SIMDE_FLOAT32_C( -297.26), SIMDE_FLOAT32_C( 32.86),
SIMDE_FLOAT32_C( -334.98), SIMDE_FLOAT32_C( -564.55), SIMDE_FLOAT32_C( 995.81), SIMDE_FLOAT32_C( 873.62) },
{ SIMDE_FLOAT32_C( -37.05), SIMDE_FLOAT32_C( -339.00), SIMDE_FLOAT32_C( -426.10), SIMDE_FLOAT32_C( 908.80),
SIMDE_FLOAT32_C( 1.27), SIMDE_FLOAT32_C( -812.02), SIMDE_FLOAT32_C( 726.06), SIMDE_FLOAT32_C( 790.00),
SIMDE_FLOAT32_C( -547.76), SIMDE_FLOAT32_C( 481.55), SIMDE_FLOAT32_C( -900.00), SIMDE_FLOAT32_C( -568.10),
SIMDE_FLOAT32_C( 92.41), SIMDE_FLOAT32_C( 266.85), SIMDE_FLOAT32_C( -492.51), SIMDE_FLOAT32_C( -462.13) } },
{ { SIMDE_FLOAT32_C( 9.35), SIMDE_FLOAT32_C( 904.61), SIMDE_FLOAT32_C( -125.11), SIMDE_FLOAT32_C( 197.33),
SIMDE_FLOAT32_C( 630.67), SIMDE_FLOAT32_C( 132.70), SIMDE_FLOAT32_C( 649.56), SIMDE_FLOAT32_C( 112.22),
SIMDE_FLOAT32_C( 232.70), SIMDE_FLOAT32_C( -918.54), SIMDE_FLOAT32_C( -795.36), SIMDE_FLOAT32_C( -500.45),
SIMDE_FLOAT32_C( -411.05), SIMDE_FLOAT32_C( -257.49), SIMDE_FLOAT32_C( 295.13), SIMDE_FLOAT32_C( 177.59) },
UINT8_C(202),
{ SIMDE_FLOAT32_C( -275.34), SIMDE_FLOAT32_C( -923.51), SIMDE_FLOAT32_C( 792.83), SIMDE_FLOAT32_C( -200.11),
SIMDE_FLOAT32_C( 705.22), SIMDE_FLOAT32_C( 582.88), SIMDE_FLOAT32_C( -53.96), SIMDE_FLOAT32_C( 777.57),
SIMDE_FLOAT32_C( -714.38), SIMDE_FLOAT32_C( 978.91), SIMDE_FLOAT32_C( -557.41), SIMDE_FLOAT32_C( -278.93),
SIMDE_FLOAT32_C( 974.71), SIMDE_FLOAT32_C( -683.79), SIMDE_FLOAT32_C( 730.42), SIMDE_FLOAT32_C( 879.32) },
{ SIMDE_FLOAT32_C( 9.35), SIMDE_FLOAT32_C( -924.00), SIMDE_FLOAT32_C( -125.11), SIMDE_FLOAT32_C( -201.00),
SIMDE_FLOAT32_C( 630.67), SIMDE_FLOAT32_C( 132.70), SIMDE_FLOAT32_C( -54.00), SIMDE_FLOAT32_C( 777.00),
SIMDE_FLOAT32_C( 232.70), SIMDE_FLOAT32_C( -918.54), SIMDE_FLOAT32_C( -795.36), SIMDE_FLOAT32_C( -500.45),
SIMDE_FLOAT32_C( -411.05), SIMDE_FLOAT32_C( -257.49), SIMDE_FLOAT32_C( 295.13), SIMDE_FLOAT32_C( 177.59) } },
{ { SIMDE_FLOAT32_C( 191.09), SIMDE_FLOAT32_C( -72.26), SIMDE_FLOAT32_C( 509.99), SIMDE_FLOAT32_C( -676.21),
SIMDE_FLOAT32_C( -422.69), SIMDE_FLOAT32_C( -377.79), SIMDE_FLOAT32_C( 556.49), SIMDE_FLOAT32_C( -341.23),
SIMDE_FLOAT32_C( -173.15), SIMDE_FLOAT32_C( -943.96), SIMDE_FLOAT32_C( 247.72), SIMDE_FLOAT32_C( 569.36),
SIMDE_FLOAT32_C( 351.17), SIMDE_FLOAT32_C( -574.69), SIMDE_FLOAT32_C( -26.83), SIMDE_FLOAT32_C( -924.17) },
UINT8_C(134),
{ SIMDE_FLOAT32_C( -234.00), SIMDE_FLOAT32_C( -124.28), SIMDE_FLOAT32_C( -792.99), SIMDE_FLOAT32_C( -651.12),
SIMDE_FLOAT32_C( 821.76), SIMDE_FLOAT32_C( 984.58), SIMDE_FLOAT32_C( -365.50), SIMDE_FLOAT32_C( 800.67),
SIMDE_FLOAT32_C( -572.83), SIMDE_FLOAT32_C( 355.57), SIMDE_FLOAT32_C( 775.38), SIMDE_FLOAT32_C( -256.62),
SIMDE_FLOAT32_C( 85.98), SIMDE_FLOAT32_C( 654.71), SIMDE_FLOAT32_C( 934.47), SIMDE_FLOAT32_C( -986.27) },
{ SIMDE_FLOAT32_C( 191.09), SIMDE_FLOAT32_C( -125.00), SIMDE_FLOAT32_C( -793.00), SIMDE_FLOAT32_C( -676.21),
SIMDE_FLOAT32_C( -422.69), SIMDE_FLOAT32_C( -377.79), SIMDE_FLOAT32_C( 556.49), SIMDE_FLOAT32_C( 800.00),
SIMDE_FLOAT32_C( -173.15), SIMDE_FLOAT32_C( -943.96), SIMDE_FLOAT32_C( 247.72), SIMDE_FLOAT32_C( 569.36),
SIMDE_FLOAT32_C( 351.17), SIMDE_FLOAT32_C( -574.69), SIMDE_FLOAT32_C( -26.83), SIMDE_FLOAT32_C( -924.17) } },
{ { SIMDE_FLOAT32_C( 164.70), SIMDE_FLOAT32_C( -741.74), SIMDE_FLOAT32_C( -408.96), SIMDE_FLOAT32_C( 786.91),
SIMDE_FLOAT32_C( 814.76), SIMDE_FLOAT32_C( 249.81), SIMDE_FLOAT32_C( -386.24), SIMDE_FLOAT32_C( 870.80),
SIMDE_FLOAT32_C( -502.47), SIMDE_FLOAT32_C( -816.88), SIMDE_FLOAT32_C( 221.97), SIMDE_FLOAT32_C( -77.16),
SIMDE_FLOAT32_C( 156.29), SIMDE_FLOAT32_C( 297.80), SIMDE_FLOAT32_C( 424.63), SIMDE_FLOAT32_C( 922.29) },
UINT8_C(198),
{ SIMDE_FLOAT32_C( 631.65), SIMDE_FLOAT32_C( -728.83), SIMDE_FLOAT32_C( 995.29), SIMDE_FLOAT32_C( 616.23),
SIMDE_FLOAT32_C( -94.34), SIMDE_FLOAT32_C( 795.96), SIMDE_FLOAT32_C( -956.60), SIMDE_FLOAT32_C( -738.77),
SIMDE_FLOAT32_C( 571.34), SIMDE_FLOAT32_C( -213.23), SIMDE_FLOAT32_C( 347.21), SIMDE_FLOAT32_C( 226.05),
SIMDE_FLOAT32_C( -278.76), SIMDE_FLOAT32_C( 360.94), SIMDE_FLOAT32_C( -609.25), SIMDE_FLOAT32_C( -20.49) },
{ SIMDE_FLOAT32_C( 164.70), SIMDE_FLOAT32_C( -729.00), SIMDE_FLOAT32_C( 995.00), SIMDE_FLOAT32_C( 786.91),
SIMDE_FLOAT32_C( 814.76), SIMDE_FLOAT32_C( 249.81), SIMDE_FLOAT32_C( -957.00), SIMDE_FLOAT32_C( -739.00),
SIMDE_FLOAT32_C( -502.47), SIMDE_FLOAT32_C( -816.88), SIMDE_FLOAT32_C( 221.97), SIMDE_FLOAT32_C( -77.16),
SIMDE_FLOAT32_C( 156.29), SIMDE_FLOAT32_C( 297.80), SIMDE_FLOAT32_C( 424.63), SIMDE_FLOAT32_C( 922.29) } },
{ { SIMDE_FLOAT32_C( 951.98), SIMDE_FLOAT32_C( -822.34), SIMDE_FLOAT32_C( -205.73), SIMDE_FLOAT32_C( 201.79),
SIMDE_FLOAT32_C( -208.58), SIMDE_FLOAT32_C( -334.93), SIMDE_FLOAT32_C( 699.32), SIMDE_FLOAT32_C( -25.46),
SIMDE_FLOAT32_C( 887.04), SIMDE_FLOAT32_C( -377.85), SIMDE_FLOAT32_C( -869.17), SIMDE_FLOAT32_C( 184.84),
SIMDE_FLOAT32_C( -953.21), SIMDE_FLOAT32_C( -946.88), SIMDE_FLOAT32_C( 358.36), SIMDE_FLOAT32_C( 678.43) },
UINT8_C(118),
{ SIMDE_FLOAT32_C( 353.65), SIMDE_FLOAT32_C( 294.66), SIMDE_FLOAT32_C( 229.95), SIMDE_FLOAT32_C( 149.61),
SIMDE_FLOAT32_C( 338.06), SIMDE_FLOAT32_C( 491.18), SIMDE_FLOAT32_C( -279.05), SIMDE_FLOAT32_C( -875.17),
SIMDE_FLOAT32_C( -161.61), SIMDE_FLOAT32_C( 947.00), SIMDE_FLOAT32_C( -153.92), SIMDE_FLOAT32_C( -800.67),
SIMDE_FLOAT32_C( -662.25), SIMDE_FLOAT32_C( 825.58), SIMDE_FLOAT32_C( -848.68), SIMDE_FLOAT32_C( -484.59) },
{ SIMDE_FLOAT32_C( 951.98), SIMDE_FLOAT32_C( 294.00), SIMDE_FLOAT32_C( 229.00), SIMDE_FLOAT32_C( 201.79),
SIMDE_FLOAT32_C( 338.00), SIMDE_FLOAT32_C( 491.00), SIMDE_FLOAT32_C( -280.00), SIMDE_FLOAT32_C( -25.46),
SIMDE_FLOAT32_C( 887.04), SIMDE_FLOAT32_C( -377.85), SIMDE_FLOAT32_C( -869.17), SIMDE_FLOAT32_C( 184.84),
SIMDE_FLOAT32_C( -953.21), SIMDE_FLOAT32_C( -946.88), SIMDE_FLOAT32_C( 358.36), SIMDE_FLOAT32_C( 678.43) } },
{ { SIMDE_FLOAT32_C( -380.15), SIMDE_FLOAT32_C( 353.11), SIMDE_FLOAT32_C( 306.83), SIMDE_FLOAT32_C( 284.92),
SIMDE_FLOAT32_C( 52.42), SIMDE_FLOAT32_C( -718.63), SIMDE_FLOAT32_C( 171.96), SIMDE_FLOAT32_C( 674.58),
SIMDE_FLOAT32_C( -587.81), SIMDE_FLOAT32_C( -643.20), SIMDE_FLOAT32_C( 721.36), SIMDE_FLOAT32_C( -534.69),
SIMDE_FLOAT32_C( 715.16), SIMDE_FLOAT32_C( 399.80), SIMDE_FLOAT32_C( -210.40), SIMDE_FLOAT32_C( 68.81) },
UINT8_MAX,
{ SIMDE_FLOAT32_C( -980.45), SIMDE_FLOAT32_C( -781.58), SIMDE_FLOAT32_C( -967.49), SIMDE_FLOAT32_C( 510.73),
SIMDE_FLOAT32_C( -60.62), SIMDE_FLOAT32_C( -842.65), SIMDE_FLOAT32_C( -650.88), SIMDE_FLOAT32_C( -113.62),
SIMDE_FLOAT32_C( 3.42), SIMDE_FLOAT32_C( -451.55), SIMDE_FLOAT32_C( 224.13), SIMDE_FLOAT32_C( -170.99),
SIMDE_FLOAT32_C( -300.23), SIMDE_FLOAT32_C( 739.54), SIMDE_FLOAT32_C( 448.86), SIMDE_FLOAT32_C( -947.12) },
{ SIMDE_FLOAT32_C( -981.00), SIMDE_FLOAT32_C( -782.00), SIMDE_FLOAT32_C( -968.00), SIMDE_FLOAT32_C( 510.00),
SIMDE_FLOAT32_C( -61.00), SIMDE_FLOAT32_C( -843.00), SIMDE_FLOAT32_C( -651.00), SIMDE_FLOAT32_C( -114.00),
SIMDE_FLOAT32_C( -587.81), SIMDE_FLOAT32_C( -643.20), SIMDE_FLOAT32_C( 721.36), SIMDE_FLOAT32_C( -534.69),
SIMDE_FLOAT32_C( 715.16), SIMDE_FLOAT32_C( 399.80), SIMDE_FLOAT32_C( -210.40), SIMDE_FLOAT32_C( 68.81) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_floor_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_floor_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -794.32), SIMDE_FLOAT64_C( -48.70), SIMDE_FLOAT64_C( 277.79), SIMDE_FLOAT64_C( -475.80),
SIMDE_FLOAT64_C( -876.95), SIMDE_FLOAT64_C( -924.41), SIMDE_FLOAT64_C( 255.35), SIMDE_FLOAT64_C( -243.50) },
{ SIMDE_FLOAT64_C( -795.00), SIMDE_FLOAT64_C( -49.00), SIMDE_FLOAT64_C( 277.00), SIMDE_FLOAT64_C( -476.00),
SIMDE_FLOAT64_C( -877.00), SIMDE_FLOAT64_C( -925.00), SIMDE_FLOAT64_C( 255.00), SIMDE_FLOAT64_C( -244.00) } },
{ { SIMDE_FLOAT64_C( -620.91), SIMDE_FLOAT64_C( -173.96), SIMDE_FLOAT64_C( 275.90), SIMDE_FLOAT64_C( -717.33),
SIMDE_FLOAT64_C( -402.37), SIMDE_FLOAT64_C( -882.40), SIMDE_FLOAT64_C( 45.04), SIMDE_FLOAT64_C( -141.04) },
{ SIMDE_FLOAT64_C( -621.00), SIMDE_FLOAT64_C( -174.00), SIMDE_FLOAT64_C( 275.00), SIMDE_FLOAT64_C( -718.00),
SIMDE_FLOAT64_C( -403.00), SIMDE_FLOAT64_C( -883.00), SIMDE_FLOAT64_C( 45.00), SIMDE_FLOAT64_C( -142.00) } },
{ { SIMDE_FLOAT64_C( -548.52), SIMDE_FLOAT64_C( -215.27), SIMDE_FLOAT64_C( 977.63), SIMDE_FLOAT64_C( 913.41),
SIMDE_FLOAT64_C( -371.07), SIMDE_FLOAT64_C( 460.81), SIMDE_FLOAT64_C( 547.36), SIMDE_FLOAT64_C( -452.52) },
{ SIMDE_FLOAT64_C( -549.00), SIMDE_FLOAT64_C( -216.00), SIMDE_FLOAT64_C( 977.00), SIMDE_FLOAT64_C( 913.00),
SIMDE_FLOAT64_C( -372.00), SIMDE_FLOAT64_C( 460.00), SIMDE_FLOAT64_C( 547.00), SIMDE_FLOAT64_C( -453.00) } },
{ { SIMDE_FLOAT64_C( -61.27), SIMDE_FLOAT64_C( -606.40), SIMDE_FLOAT64_C( 310.76), SIMDE_FLOAT64_C( 420.51),
SIMDE_FLOAT64_C( -353.71), SIMDE_FLOAT64_C( -327.75), SIMDE_FLOAT64_C( 663.33), SIMDE_FLOAT64_C( -148.03) },
{ SIMDE_FLOAT64_C( -62.00), SIMDE_FLOAT64_C( -607.00), SIMDE_FLOAT64_C( 310.00), SIMDE_FLOAT64_C( 420.00),
SIMDE_FLOAT64_C( -354.00), SIMDE_FLOAT64_C( -328.00), SIMDE_FLOAT64_C( 663.00), SIMDE_FLOAT64_C( -149.00) } },
{ { SIMDE_FLOAT64_C( 623.55), SIMDE_FLOAT64_C( -58.88), SIMDE_FLOAT64_C( 376.17), SIMDE_FLOAT64_C( 746.60),
SIMDE_FLOAT64_C( 16.71), SIMDE_FLOAT64_C( -368.49), SIMDE_FLOAT64_C( -496.90), SIMDE_FLOAT64_C( 395.80) },
{ SIMDE_FLOAT64_C( 623.00), SIMDE_FLOAT64_C( -59.00), SIMDE_FLOAT64_C( 376.00), SIMDE_FLOAT64_C( 746.00),
SIMDE_FLOAT64_C( 16.00), SIMDE_FLOAT64_C( -369.00), SIMDE_FLOAT64_C( -497.00), SIMDE_FLOAT64_C( 395.00) } },
{ { SIMDE_FLOAT64_C( 457.55), SIMDE_FLOAT64_C( 779.00), SIMDE_FLOAT64_C( 678.47), SIMDE_FLOAT64_C( -944.81),
SIMDE_FLOAT64_C( 896.60), SIMDE_FLOAT64_C( -276.49), SIMDE_FLOAT64_C( -85.86), SIMDE_FLOAT64_C( -651.92) },
{ SIMDE_FLOAT64_C( 457.00), SIMDE_FLOAT64_C( 779.00), SIMDE_FLOAT64_C( 678.00), SIMDE_FLOAT64_C( -945.00),
SIMDE_FLOAT64_C( 896.00), SIMDE_FLOAT64_C( -277.00), SIMDE_FLOAT64_C( -86.00), SIMDE_FLOAT64_C( -652.00) } },
{ { SIMDE_FLOAT64_C( 508.25), SIMDE_FLOAT64_C( -108.22), SIMDE_FLOAT64_C( -738.51), SIMDE_FLOAT64_C( -862.82),
SIMDE_FLOAT64_C( -647.41), SIMDE_FLOAT64_C( 808.85), SIMDE_FLOAT64_C( -315.34), SIMDE_FLOAT64_C( 291.32) },
{ SIMDE_FLOAT64_C( 508.00), SIMDE_FLOAT64_C( -109.00), SIMDE_FLOAT64_C( -739.00), SIMDE_FLOAT64_C( -863.00),
SIMDE_FLOAT64_C( -648.00), SIMDE_FLOAT64_C( 808.00), SIMDE_FLOAT64_C( -316.00), SIMDE_FLOAT64_C( 291.00) } },
{ { SIMDE_FLOAT64_C( -797.54), SIMDE_FLOAT64_C( 995.42), SIMDE_FLOAT64_C( -288.16), SIMDE_FLOAT64_C( -151.25),
SIMDE_FLOAT64_C( -332.32), SIMDE_FLOAT64_C( -624.84), SIMDE_FLOAT64_C( 700.72), SIMDE_FLOAT64_C( -708.77) },
{ SIMDE_FLOAT64_C( -798.00), SIMDE_FLOAT64_C( 995.00), SIMDE_FLOAT64_C( -289.00), SIMDE_FLOAT64_C( -152.00),
SIMDE_FLOAT64_C( -333.00), SIMDE_FLOAT64_C( -625.00), SIMDE_FLOAT64_C( 700.00), SIMDE_FLOAT64_C( -709.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_floor_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_floor_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -886.28), SIMDE_FLOAT64_C( -614.03), SIMDE_FLOAT64_C( -883.55), SIMDE_FLOAT64_C( 865.29),
SIMDE_FLOAT64_C( -206.48), SIMDE_FLOAT64_C( -34.33), SIMDE_FLOAT64_C( -987.38), SIMDE_FLOAT64_C( 127.49) },
UINT8_C(157),
{ SIMDE_FLOAT64_C( 163.94), SIMDE_FLOAT64_C( 134.54), SIMDE_FLOAT64_C( 245.58), SIMDE_FLOAT64_C( -615.09),
SIMDE_FLOAT64_C( 80.28), SIMDE_FLOAT64_C( -93.17), SIMDE_FLOAT64_C( 181.16), SIMDE_FLOAT64_C( 303.02) },
{ SIMDE_FLOAT64_C( 163.00), SIMDE_FLOAT64_C( -614.03), SIMDE_FLOAT64_C( 245.00), SIMDE_FLOAT64_C( -616.00),
SIMDE_FLOAT64_C( 80.00), SIMDE_FLOAT64_C( -34.33), SIMDE_FLOAT64_C( -987.38), SIMDE_FLOAT64_C( 303.00) } },
{ { SIMDE_FLOAT64_C( 377.85), SIMDE_FLOAT64_C( 999.13), SIMDE_FLOAT64_C( -474.80), SIMDE_FLOAT64_C( -29.53),
SIMDE_FLOAT64_C( 777.92), SIMDE_FLOAT64_C( 307.60), SIMDE_FLOAT64_C( 178.13), SIMDE_FLOAT64_C( 680.84) },
UINT8_C(246),
{ SIMDE_FLOAT64_C( 47.73), SIMDE_FLOAT64_C( 681.42), SIMDE_FLOAT64_C( -141.66), SIMDE_FLOAT64_C( 574.99),
SIMDE_FLOAT64_C( -969.81), SIMDE_FLOAT64_C( -27.94), SIMDE_FLOAT64_C( 960.96), SIMDE_FLOAT64_C( -853.36) },
{ SIMDE_FLOAT64_C( 377.85), SIMDE_FLOAT64_C( 681.00), SIMDE_FLOAT64_C( -142.00), SIMDE_FLOAT64_C( -29.53),
SIMDE_FLOAT64_C( -970.00), SIMDE_FLOAT64_C( -28.00), SIMDE_FLOAT64_C( 960.00), SIMDE_FLOAT64_C( -854.00) } },
{ { SIMDE_FLOAT64_C( -162.66), SIMDE_FLOAT64_C( -245.52), SIMDE_FLOAT64_C( 112.31), SIMDE_FLOAT64_C( -150.03),
SIMDE_FLOAT64_C( 881.98), SIMDE_FLOAT64_C( 426.57), SIMDE_FLOAT64_C( -986.09), SIMDE_FLOAT64_C( 16.51) },
UINT8_C( 53),
{ SIMDE_FLOAT64_C( -601.18), SIMDE_FLOAT64_C( -903.21), SIMDE_FLOAT64_C( 578.99), SIMDE_FLOAT64_C( 579.98),
SIMDE_FLOAT64_C( 399.82), SIMDE_FLOAT64_C( -43.16), SIMDE_FLOAT64_C( 579.10), SIMDE_FLOAT64_C( 925.02) },
{ SIMDE_FLOAT64_C( -602.00), SIMDE_FLOAT64_C( -245.52), SIMDE_FLOAT64_C( 578.00), SIMDE_FLOAT64_C( -150.03),
SIMDE_FLOAT64_C( 399.00), SIMDE_FLOAT64_C( -44.00), SIMDE_FLOAT64_C( -986.09), SIMDE_FLOAT64_C( 16.51) } },
{ { SIMDE_FLOAT64_C( 927.31), SIMDE_FLOAT64_C( 357.02), SIMDE_FLOAT64_C( 232.62), SIMDE_FLOAT64_C( 105.44),
SIMDE_FLOAT64_C( 37.87), SIMDE_FLOAT64_C( 434.25), SIMDE_FLOAT64_C( -846.83), SIMDE_FLOAT64_C( -280.72) },
UINT8_C(253),
{ SIMDE_FLOAT64_C( 728.16), SIMDE_FLOAT64_C( -250.53), SIMDE_FLOAT64_C( 264.65), SIMDE_FLOAT64_C( 689.12),
SIMDE_FLOAT64_C( -103.89), SIMDE_FLOAT64_C( -898.01), SIMDE_FLOAT64_C( -556.40), SIMDE_FLOAT64_C( -991.58) },
{ SIMDE_FLOAT64_C( 728.00), SIMDE_FLOAT64_C( 357.02), SIMDE_FLOAT64_C( 264.00), SIMDE_FLOAT64_C( 689.00),
SIMDE_FLOAT64_C( -104.00), SIMDE_FLOAT64_C( -899.00), SIMDE_FLOAT64_C( -557.00), SIMDE_FLOAT64_C( -992.00) } },
{ { SIMDE_FLOAT64_C( -48.04), SIMDE_FLOAT64_C( -674.42), SIMDE_FLOAT64_C( 434.99), SIMDE_FLOAT64_C( -34.14),
SIMDE_FLOAT64_C( 342.09), SIMDE_FLOAT64_C( -892.85), SIMDE_FLOAT64_C( 364.68), SIMDE_FLOAT64_C( 438.89) },
UINT8_C( 35),
{ SIMDE_FLOAT64_C( -55.34), SIMDE_FLOAT64_C( -161.30), SIMDE_FLOAT64_C( -357.03), SIMDE_FLOAT64_C( -476.24),
SIMDE_FLOAT64_C( -236.28), SIMDE_FLOAT64_C( -429.72), SIMDE_FLOAT64_C( 880.78), SIMDE_FLOAT64_C( 996.35) },
{ SIMDE_FLOAT64_C( -56.00), SIMDE_FLOAT64_C( -162.00), SIMDE_FLOAT64_C( 434.99), SIMDE_FLOAT64_C( -34.14),
SIMDE_FLOAT64_C( 342.09), SIMDE_FLOAT64_C( -430.00), SIMDE_FLOAT64_C( 364.68), SIMDE_FLOAT64_C( 438.89) } },
{ { SIMDE_FLOAT64_C( 675.71), SIMDE_FLOAT64_C( -81.35), SIMDE_FLOAT64_C( 430.60), SIMDE_FLOAT64_C( 828.89),
SIMDE_FLOAT64_C( 637.93), SIMDE_FLOAT64_C( 723.19), SIMDE_FLOAT64_C( 557.05), SIMDE_FLOAT64_C( -612.60) },
UINT8_C(162),
{ SIMDE_FLOAT64_C( 246.17), SIMDE_FLOAT64_C( 283.52), SIMDE_FLOAT64_C( 89.83), SIMDE_FLOAT64_C( 689.78),
SIMDE_FLOAT64_C( 291.94), SIMDE_FLOAT64_C( -958.21), SIMDE_FLOAT64_C( -984.64), SIMDE_FLOAT64_C( -273.07) },
{ SIMDE_FLOAT64_C( 675.71), SIMDE_FLOAT64_C( 283.00), SIMDE_FLOAT64_C( 430.60), SIMDE_FLOAT64_C( 828.89),
SIMDE_FLOAT64_C( 637.93), SIMDE_FLOAT64_C( -959.00), SIMDE_FLOAT64_C( 557.05), SIMDE_FLOAT64_C( -274.00) } },
{ { SIMDE_FLOAT64_C( 7.65), SIMDE_FLOAT64_C( 357.45), SIMDE_FLOAT64_C( -165.92), SIMDE_FLOAT64_C( -627.67),
SIMDE_FLOAT64_C( -203.66), SIMDE_FLOAT64_C( -479.79), SIMDE_FLOAT64_C( 316.99), SIMDE_FLOAT64_C( 635.04) },
UINT8_C(211),
{ SIMDE_FLOAT64_C( 840.75), SIMDE_FLOAT64_C( -601.24), SIMDE_FLOAT64_C( 733.46), SIMDE_FLOAT64_C( 721.53),
SIMDE_FLOAT64_C( -604.89), SIMDE_FLOAT64_C( 409.18), SIMDE_FLOAT64_C( -359.82), SIMDE_FLOAT64_C( 825.71) },
{ SIMDE_FLOAT64_C( 840.00), SIMDE_FLOAT64_C( -602.00), SIMDE_FLOAT64_C( -165.92), SIMDE_FLOAT64_C( -627.67),
SIMDE_FLOAT64_C( -605.00), SIMDE_FLOAT64_C( -479.79), SIMDE_FLOAT64_C( -360.00), SIMDE_FLOAT64_C( 825.00) } },
{ { SIMDE_FLOAT64_C( 238.07), SIMDE_FLOAT64_C( -721.89), SIMDE_FLOAT64_C( 548.91), SIMDE_FLOAT64_C( -204.89),
SIMDE_FLOAT64_C( -334.48), SIMDE_FLOAT64_C( -463.26), SIMDE_FLOAT64_C( -958.71), SIMDE_FLOAT64_C( 949.03) },
UINT8_C(120),
{ SIMDE_FLOAT64_C( 731.06), SIMDE_FLOAT64_C( 240.97), SIMDE_FLOAT64_C( 668.36), SIMDE_FLOAT64_C( 746.42),
SIMDE_FLOAT64_C( 967.90), SIMDE_FLOAT64_C( -323.99), SIMDE_FLOAT64_C( 103.87), SIMDE_FLOAT64_C( -198.02) },
{ SIMDE_FLOAT64_C( 238.07), SIMDE_FLOAT64_C( -721.89), SIMDE_FLOAT64_C( 548.91), SIMDE_FLOAT64_C( 746.00),
SIMDE_FLOAT64_C( 967.00), SIMDE_FLOAT64_C( -324.00), SIMDE_FLOAT64_C( 103.00), SIMDE_FLOAT64_C( 949.03) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_floor_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_svml_round_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -381.64), SIMDE_FLOAT32_C( -952.18), SIMDE_FLOAT32_C( 936.50), SIMDE_FLOAT32_C( -269.57) },
{ SIMDE_FLOAT32_C( -382.00), SIMDE_FLOAT32_C( -952.00), SIMDE_FLOAT32_C( 937.00), SIMDE_FLOAT32_C( -270.00) } },
{ { SIMDE_FLOAT32_C( 524.01), SIMDE_FLOAT32_C( 820.80), SIMDE_FLOAT32_C( -576.54), SIMDE_FLOAT32_C( 493.48) },
{ SIMDE_FLOAT32_C( 524.00), SIMDE_FLOAT32_C( 821.00), SIMDE_FLOAT32_C( -577.00), SIMDE_FLOAT32_C( 493.00) } },
{ { SIMDE_FLOAT32_C( -183.12), SIMDE_FLOAT32_C( -410.38), SIMDE_FLOAT32_C( 918.43), SIMDE_FLOAT32_C( 555.31) },
{ SIMDE_FLOAT32_C( -183.00), SIMDE_FLOAT32_C( -410.00), SIMDE_FLOAT32_C( 918.00), SIMDE_FLOAT32_C( 555.00) } },
{ { SIMDE_FLOAT32_C( -777.47), SIMDE_FLOAT32_C( 961.82), SIMDE_FLOAT32_C( -15.88), SIMDE_FLOAT32_C( -545.38) },
{ SIMDE_FLOAT32_C( -777.00), SIMDE_FLOAT32_C( 962.00), SIMDE_FLOAT32_C( -16.00), SIMDE_FLOAT32_C( -545.00) } },
{ { SIMDE_FLOAT32_C( 827.92), SIMDE_FLOAT32_C( -576.14), SIMDE_FLOAT32_C( 188.86), SIMDE_FLOAT32_C( -194.33) },
{ SIMDE_FLOAT32_C( 828.00), SIMDE_FLOAT32_C( -576.00), SIMDE_FLOAT32_C( 189.00), SIMDE_FLOAT32_C( -194.00) } },
{ { SIMDE_FLOAT32_C( -357.49), SIMDE_FLOAT32_C( 544.93), SIMDE_FLOAT32_C( -548.96), SIMDE_FLOAT32_C( 982.95) },
{ SIMDE_FLOAT32_C( -357.00), SIMDE_FLOAT32_C( 545.00), SIMDE_FLOAT32_C( -549.00), SIMDE_FLOAT32_C( 983.00) } },
{ { SIMDE_FLOAT32_C( -811.59), SIMDE_FLOAT32_C( 502.24), SIMDE_FLOAT32_C( 18.44), SIMDE_FLOAT32_C( -985.11) },
{ SIMDE_FLOAT32_C( -812.00), SIMDE_FLOAT32_C( 502.00), SIMDE_FLOAT32_C( 18.00), SIMDE_FLOAT32_C( -985.00) } },
{ { SIMDE_FLOAT32_C( -901.60), SIMDE_FLOAT32_C( 1.79), SIMDE_FLOAT32_C( -119.54), SIMDE_FLOAT32_C( -283.24) },
{ SIMDE_FLOAT32_C( -902.00), SIMDE_FLOAT32_C( 2.00), SIMDE_FLOAT32_C( -120.00), SIMDE_FLOAT32_C( -283.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_svml_round_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_svml_round_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -775.87), SIMDE_FLOAT64_C( 258.36) },
{ SIMDE_FLOAT64_C( -776.00), SIMDE_FLOAT64_C( 258.00) } },
{ { SIMDE_FLOAT64_C( 698.30), SIMDE_FLOAT64_C( -24.21) },
{ SIMDE_FLOAT64_C( 698.00), SIMDE_FLOAT64_C( -24.00) } },
{ { SIMDE_FLOAT64_C( -755.31), SIMDE_FLOAT64_C( -751.07) },
{ SIMDE_FLOAT64_C( -755.00), SIMDE_FLOAT64_C( -751.00) } },
{ { SIMDE_FLOAT64_C( 607.87), SIMDE_FLOAT64_C( -999.16) },
{ SIMDE_FLOAT64_C( 608.00), SIMDE_FLOAT64_C( -999.00) } },
{ { SIMDE_FLOAT64_C( -558.18), SIMDE_FLOAT64_C( -447.90) },
{ SIMDE_FLOAT64_C( -558.00), SIMDE_FLOAT64_C( -448.00) } },
{ { SIMDE_FLOAT64_C( -159.19), SIMDE_FLOAT64_C( 675.96) },
{ SIMDE_FLOAT64_C( -159.00), SIMDE_FLOAT64_C( 676.00) } },
{ { SIMDE_FLOAT64_C( -682.16), SIMDE_FLOAT64_C( 502.15) },
{ SIMDE_FLOAT64_C( -682.00), SIMDE_FLOAT64_C( 502.00) } },
{ { SIMDE_FLOAT64_C( -591.87), SIMDE_FLOAT64_C( 775.61) },
{ SIMDE_FLOAT64_C( -592.00), SIMDE_FLOAT64_C( 776.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_svml_round_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_svml_round_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 380.84), SIMDE_FLOAT32_C( -788.70), SIMDE_FLOAT32_C( 937.85), SIMDE_FLOAT32_C( 321.73),
SIMDE_FLOAT32_C( 250.52), SIMDE_FLOAT32_C( -410.85), SIMDE_FLOAT32_C( -411.50), SIMDE_FLOAT32_C( -342.15) },
{ SIMDE_FLOAT32_C( 381.00), SIMDE_FLOAT32_C( -789.00), SIMDE_FLOAT32_C( 938.00), SIMDE_FLOAT32_C( 322.00),
SIMDE_FLOAT32_C( 251.00), SIMDE_FLOAT32_C( -411.00), SIMDE_FLOAT32_C( -412.00), SIMDE_FLOAT32_C( -342.00) } },
{ { SIMDE_FLOAT32_C( -410.55), SIMDE_FLOAT32_C( 648.37), SIMDE_FLOAT32_C( 294.06), SIMDE_FLOAT32_C( 315.36),
SIMDE_FLOAT32_C( -375.65), SIMDE_FLOAT32_C( 783.04), SIMDE_FLOAT32_C( -600.22), SIMDE_FLOAT32_C( -208.94) },
{ SIMDE_FLOAT32_C( -411.00), SIMDE_FLOAT32_C( 648.00), SIMDE_FLOAT32_C( 294.00), SIMDE_FLOAT32_C( 315.00),
SIMDE_FLOAT32_C( -376.00), SIMDE_FLOAT32_C( 783.00), SIMDE_FLOAT32_C( -600.00), SIMDE_FLOAT32_C( -209.00) } },
{ { SIMDE_FLOAT32_C( 628.12), SIMDE_FLOAT32_C( 178.11), SIMDE_FLOAT32_C( -902.32), SIMDE_FLOAT32_C( -420.94),
SIMDE_FLOAT32_C( -113.02), SIMDE_FLOAT32_C( 352.97), SIMDE_FLOAT32_C( -796.40), SIMDE_FLOAT32_C( -795.50) },
{ SIMDE_FLOAT32_C( 628.00), SIMDE_FLOAT32_C( 178.00), SIMDE_FLOAT32_C( -902.00), SIMDE_FLOAT32_C( -421.00),
SIMDE_FLOAT32_C( -113.00), SIMDE_FLOAT32_C( 353.00), SIMDE_FLOAT32_C( -796.00), SIMDE_FLOAT32_C( -796.00) } },
{ { SIMDE_FLOAT32_C( -712.04), SIMDE_FLOAT32_C( 880.10), SIMDE_FLOAT32_C( 698.48), SIMDE_FLOAT32_C( -638.58),
SIMDE_FLOAT32_C( 349.16), SIMDE_FLOAT32_C( 163.60), SIMDE_FLOAT32_C( -690.90), SIMDE_FLOAT32_C( -270.00) },
{ SIMDE_FLOAT32_C( -712.00), SIMDE_FLOAT32_C( 880.00), SIMDE_FLOAT32_C( 698.00), SIMDE_FLOAT32_C( -639.00),
SIMDE_FLOAT32_C( 349.00), SIMDE_FLOAT32_C( 164.00), SIMDE_FLOAT32_C( -691.00), SIMDE_FLOAT32_C( -270.00) } },
{ { SIMDE_FLOAT32_C( 374.90), SIMDE_FLOAT32_C( -753.05), SIMDE_FLOAT32_C( -948.26), SIMDE_FLOAT32_C( -374.58),
SIMDE_FLOAT32_C( -163.90), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( 283.27), SIMDE_FLOAT32_C( 425.55) },
{ SIMDE_FLOAT32_C( 375.00), SIMDE_FLOAT32_C( -753.00), SIMDE_FLOAT32_C( -948.00), SIMDE_FLOAT32_C( -375.00),
SIMDE_FLOAT32_C( -164.00), SIMDE_FLOAT32_C( -360.00), SIMDE_FLOAT32_C( 283.00), SIMDE_FLOAT32_C( 426.00) } },
{ { SIMDE_FLOAT32_C( -711.40), SIMDE_FLOAT32_C( -422.67), SIMDE_FLOAT32_C( -259.09), SIMDE_FLOAT32_C( -87.05),
SIMDE_FLOAT32_C( -639.63), SIMDE_FLOAT32_C( 140.69), SIMDE_FLOAT32_C( 704.01), SIMDE_FLOAT32_C( 988.49) },
{ SIMDE_FLOAT32_C( -711.00), SIMDE_FLOAT32_C( -423.00), SIMDE_FLOAT32_C( -259.00), SIMDE_FLOAT32_C( -87.00),
SIMDE_FLOAT32_C( -640.00), SIMDE_FLOAT32_C( 141.00), SIMDE_FLOAT32_C( 704.00), SIMDE_FLOAT32_C( 988.00) } },
{ { SIMDE_FLOAT32_C( -681.20), SIMDE_FLOAT32_C( 801.69), SIMDE_FLOAT32_C( -432.45), SIMDE_FLOAT32_C( 205.78),
SIMDE_FLOAT32_C( 154.66), SIMDE_FLOAT32_C( -228.84), SIMDE_FLOAT32_C( 410.28), SIMDE_FLOAT32_C( 442.62) },
{ SIMDE_FLOAT32_C( -681.00), SIMDE_FLOAT32_C( 802.00), SIMDE_FLOAT32_C( -432.00), SIMDE_FLOAT32_C( 206.00),
SIMDE_FLOAT32_C( 155.00), SIMDE_FLOAT32_C( -229.00), SIMDE_FLOAT32_C( 410.00), SIMDE_FLOAT32_C( 443.00) } },
{ { SIMDE_FLOAT32_C( -348.74), SIMDE_FLOAT32_C( 108.77), SIMDE_FLOAT32_C( 804.05), SIMDE_FLOAT32_C( -999.58),
SIMDE_FLOAT32_C( -727.63), SIMDE_FLOAT32_C( -886.85), SIMDE_FLOAT32_C( -269.57), SIMDE_FLOAT32_C( 647.26) },
{ SIMDE_FLOAT32_C( -349.00), SIMDE_FLOAT32_C( 109.00), SIMDE_FLOAT32_C( 804.00), SIMDE_FLOAT32_C( -1000.00),
SIMDE_FLOAT32_C( -728.00), SIMDE_FLOAT32_C( -887.00), SIMDE_FLOAT32_C( -270.00), SIMDE_FLOAT32_C( 647.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_svml_round_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_svml_round_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -844.84), SIMDE_FLOAT64_C( -247.28), SIMDE_FLOAT64_C( 192.26), SIMDE_FLOAT64_C( 426.25) },
{ SIMDE_FLOAT64_C( -845.00), SIMDE_FLOAT64_C( -247.00), SIMDE_FLOAT64_C( 192.00), SIMDE_FLOAT64_C( 426.00) } },
{ { SIMDE_FLOAT64_C( -53.32), SIMDE_FLOAT64_C( -778.93), SIMDE_FLOAT64_C( -167.10), SIMDE_FLOAT64_C( -593.25) },
{ SIMDE_FLOAT64_C( -53.00), SIMDE_FLOAT64_C( -779.00), SIMDE_FLOAT64_C( -167.00), SIMDE_FLOAT64_C( -593.00) } },
{ { SIMDE_FLOAT64_C( -450.17), SIMDE_FLOAT64_C( -606.32), SIMDE_FLOAT64_C( 101.38), SIMDE_FLOAT64_C( -341.77) },
{ SIMDE_FLOAT64_C( -450.00), SIMDE_FLOAT64_C( -606.00), SIMDE_FLOAT64_C( 101.00), SIMDE_FLOAT64_C( -342.00) } },
{ { SIMDE_FLOAT64_C( -461.44), SIMDE_FLOAT64_C( 674.51), SIMDE_FLOAT64_C( 145.37), SIMDE_FLOAT64_C( 148.63) },
{ SIMDE_FLOAT64_C( -461.00), SIMDE_FLOAT64_C( 675.00), SIMDE_FLOAT64_C( 145.00), SIMDE_FLOAT64_C( 149.00) } },
{ { SIMDE_FLOAT64_C( -693.71), SIMDE_FLOAT64_C( -933.34), SIMDE_FLOAT64_C( 117.11), SIMDE_FLOAT64_C( 52.36) },
{ SIMDE_FLOAT64_C( -694.00), SIMDE_FLOAT64_C( -933.00), SIMDE_FLOAT64_C( 117.00), SIMDE_FLOAT64_C( 52.00) } },
{ { SIMDE_FLOAT64_C( 574.82), SIMDE_FLOAT64_C( -929.55), SIMDE_FLOAT64_C( 113.17), SIMDE_FLOAT64_C( -272.97) },
{ SIMDE_FLOAT64_C( 575.00), SIMDE_FLOAT64_C( -930.00), SIMDE_FLOAT64_C( 113.00), SIMDE_FLOAT64_C( -273.00) } },
{ { SIMDE_FLOAT64_C( 102.14), SIMDE_FLOAT64_C( -880.36), SIMDE_FLOAT64_C( 222.01), SIMDE_FLOAT64_C( -844.37) },
{ SIMDE_FLOAT64_C( 102.00), SIMDE_FLOAT64_C( -880.00), SIMDE_FLOAT64_C( 222.00), SIMDE_FLOAT64_C( -844.00) } },
{ { SIMDE_FLOAT64_C( 363.52), SIMDE_FLOAT64_C( -723.41), SIMDE_FLOAT64_C( -68.69), SIMDE_FLOAT64_C( 518.69) },
{ SIMDE_FLOAT64_C( 364.00), SIMDE_FLOAT64_C( -723.00), SIMDE_FLOAT64_C( -69.00), SIMDE_FLOAT64_C( 519.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_svml_round_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_svml_round_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 838.26), SIMDE_FLOAT64_C( 713.09), SIMDE_FLOAT64_C( 41.96), SIMDE_FLOAT64_C( -274.12),
SIMDE_FLOAT64_C( 674.75), SIMDE_FLOAT64_C( 434.35), SIMDE_FLOAT64_C( -404.30), SIMDE_FLOAT64_C( -706.45) },
{ SIMDE_FLOAT64_C( 838.00), SIMDE_FLOAT64_C( 713.00), SIMDE_FLOAT64_C( 42.00), SIMDE_FLOAT64_C( -274.00),
SIMDE_FLOAT64_C( 675.00), SIMDE_FLOAT64_C( 434.00), SIMDE_FLOAT64_C( -404.00), SIMDE_FLOAT64_C( -706.00) } },
{ { SIMDE_FLOAT64_C( 764.33), SIMDE_FLOAT64_C( 549.73), SIMDE_FLOAT64_C( 946.10), SIMDE_FLOAT64_C( 543.69),
SIMDE_FLOAT64_C( 399.24), SIMDE_FLOAT64_C( 840.23), SIMDE_FLOAT64_C( -804.12), SIMDE_FLOAT64_C( 92.87) },
{ SIMDE_FLOAT64_C( 764.00), SIMDE_FLOAT64_C( 550.00), SIMDE_FLOAT64_C( 946.00), SIMDE_FLOAT64_C( 544.00),
SIMDE_FLOAT64_C( 399.00), SIMDE_FLOAT64_C( 840.00), SIMDE_FLOAT64_C( -804.00), SIMDE_FLOAT64_C( 93.00) } },
{ { SIMDE_FLOAT64_C( -719.75), SIMDE_FLOAT64_C( -288.44), SIMDE_FLOAT64_C( -7.73), SIMDE_FLOAT64_C( -17.69),
SIMDE_FLOAT64_C( -135.39), SIMDE_FLOAT64_C( -783.16), SIMDE_FLOAT64_C( -89.69), SIMDE_FLOAT64_C( -576.47) },
{ SIMDE_FLOAT64_C( -720.00), SIMDE_FLOAT64_C( -288.00), SIMDE_FLOAT64_C( -8.00), SIMDE_FLOAT64_C( -18.00),
SIMDE_FLOAT64_C( -135.00), SIMDE_FLOAT64_C( -783.00), SIMDE_FLOAT64_C( -90.00), SIMDE_FLOAT64_C( -576.00) } },
{ { SIMDE_FLOAT64_C( 729.17), SIMDE_FLOAT64_C( 679.53), SIMDE_FLOAT64_C( -484.77), SIMDE_FLOAT64_C( 898.47),
SIMDE_FLOAT64_C( -408.70), SIMDE_FLOAT64_C( -621.23), SIMDE_FLOAT64_C( -109.48), SIMDE_FLOAT64_C( -570.45) },
{ SIMDE_FLOAT64_C( 729.00), SIMDE_FLOAT64_C( 680.00), SIMDE_FLOAT64_C( -485.00), SIMDE_FLOAT64_C( 898.00),
SIMDE_FLOAT64_C( -409.00), SIMDE_FLOAT64_C( -621.00), SIMDE_FLOAT64_C( -109.00), SIMDE_FLOAT64_C( -570.00) } },
{ { SIMDE_FLOAT64_C( -908.13), SIMDE_FLOAT64_C( 932.48), SIMDE_FLOAT64_C( 155.44), SIMDE_FLOAT64_C( 766.61),
SIMDE_FLOAT64_C( 366.83), SIMDE_FLOAT64_C( 751.14), SIMDE_FLOAT64_C( -939.84), SIMDE_FLOAT64_C( 131.16) },
{ SIMDE_FLOAT64_C( -908.00), SIMDE_FLOAT64_C( 932.00), SIMDE_FLOAT64_C( 155.00), SIMDE_FLOAT64_C( 767.00),
SIMDE_FLOAT64_C( 367.00), SIMDE_FLOAT64_C( 751.00), SIMDE_FLOAT64_C( -940.00), SIMDE_FLOAT64_C( 131.00) } },
{ { SIMDE_FLOAT64_C( 300.87), SIMDE_FLOAT64_C( -993.74), SIMDE_FLOAT64_C( -325.15), SIMDE_FLOAT64_C( -299.89),
SIMDE_FLOAT64_C( 846.49), SIMDE_FLOAT64_C( -129.27), SIMDE_FLOAT64_C( 792.98), SIMDE_FLOAT64_C( -873.26) },
{ SIMDE_FLOAT64_C( 301.00), SIMDE_FLOAT64_C( -994.00), SIMDE_FLOAT64_C( -325.00), SIMDE_FLOAT64_C( -300.00),
SIMDE_FLOAT64_C( 846.00), SIMDE_FLOAT64_C( -129.00), SIMDE_FLOAT64_C( 793.00), SIMDE_FLOAT64_C( -873.00) } },
{ { SIMDE_FLOAT64_C( 582.29), SIMDE_FLOAT64_C( -214.75), SIMDE_FLOAT64_C( 109.05), SIMDE_FLOAT64_C( -553.10),
SIMDE_FLOAT64_C( 2.09), SIMDE_FLOAT64_C( -980.64), SIMDE_FLOAT64_C( -129.57), SIMDE_FLOAT64_C( -268.74) },
{ SIMDE_FLOAT64_C( 582.00), SIMDE_FLOAT64_C( -215.00), SIMDE_FLOAT64_C( 109.00), SIMDE_FLOAT64_C( -553.00),
SIMDE_FLOAT64_C( 2.00), SIMDE_FLOAT64_C( -981.00), SIMDE_FLOAT64_C( -130.00), SIMDE_FLOAT64_C( -269.00) } },
{ { SIMDE_FLOAT64_C( 698.88), SIMDE_FLOAT64_C( 385.66), SIMDE_FLOAT64_C( -370.28), SIMDE_FLOAT64_C( -709.82),
SIMDE_FLOAT64_C( 764.44), SIMDE_FLOAT64_C( 520.25), SIMDE_FLOAT64_C( -280.27), SIMDE_FLOAT64_C( 856.30) },
{ SIMDE_FLOAT64_C( 699.00), SIMDE_FLOAT64_C( 386.00), SIMDE_FLOAT64_C( -370.00), SIMDE_FLOAT64_C( -710.00),
SIMDE_FLOAT64_C( 764.00), SIMDE_FLOAT64_C( 520.00), SIMDE_FLOAT64_C( -280.00), SIMDE_FLOAT64_C( 856.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_svml_round_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_svml_round_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 686.15), SIMDE_FLOAT64_C( 113.85), SIMDE_FLOAT64_C( 91.19), SIMDE_FLOAT64_C( 346.08),
SIMDE_FLOAT64_C( -785.05), SIMDE_FLOAT64_C( 656.94), SIMDE_FLOAT64_C( 111.39), SIMDE_FLOAT64_C( -488.16) },
UINT8_C(184),
{ SIMDE_FLOAT64_C( -283.66), SIMDE_FLOAT64_C( 587.43), SIMDE_FLOAT64_C( -235.49), SIMDE_FLOAT64_C( 163.12),
SIMDE_FLOAT64_C( 571.24), SIMDE_FLOAT64_C( 582.37), SIMDE_FLOAT64_C( -370.22), SIMDE_FLOAT64_C( 474.92) },
{ SIMDE_FLOAT64_C( 686.15), SIMDE_FLOAT64_C( 113.85), SIMDE_FLOAT64_C( 91.19), SIMDE_FLOAT64_C( 163.00),
SIMDE_FLOAT64_C( 571.00), SIMDE_FLOAT64_C( 582.00), SIMDE_FLOAT64_C( 111.39), SIMDE_FLOAT64_C( 475.00) } },
{ { SIMDE_FLOAT64_C( -66.51), SIMDE_FLOAT64_C( -591.67), SIMDE_FLOAT64_C( -91.31), SIMDE_FLOAT64_C( 225.56),
SIMDE_FLOAT64_C( 12.37), SIMDE_FLOAT64_C( -659.70), SIMDE_FLOAT64_C( -760.80), SIMDE_FLOAT64_C( 231.33) },
UINT8_C( 69),
{ SIMDE_FLOAT64_C( 115.84), SIMDE_FLOAT64_C( -400.68), SIMDE_FLOAT64_C( -849.91), SIMDE_FLOAT64_C( -49.83),
SIMDE_FLOAT64_C( 85.28), SIMDE_FLOAT64_C( 836.24), SIMDE_FLOAT64_C( -935.98), SIMDE_FLOAT64_C( -823.53) },
{ SIMDE_FLOAT64_C( 116.00), SIMDE_FLOAT64_C( -591.67), SIMDE_FLOAT64_C( -850.00), SIMDE_FLOAT64_C( 225.56),
SIMDE_FLOAT64_C( 12.37), SIMDE_FLOAT64_C( -659.70), SIMDE_FLOAT64_C( -936.00), SIMDE_FLOAT64_C( 231.33) } },
{ { SIMDE_FLOAT64_C( 182.32), SIMDE_FLOAT64_C( -721.03), SIMDE_FLOAT64_C( 833.41), SIMDE_FLOAT64_C( -706.29),
SIMDE_FLOAT64_C( -209.20), SIMDE_FLOAT64_C( -511.45), SIMDE_FLOAT64_C( 10.05), SIMDE_FLOAT64_C( -621.76) },
UINT8_C(223),
{ SIMDE_FLOAT64_C( -826.83), SIMDE_FLOAT64_C( 949.47), SIMDE_FLOAT64_C( -164.57), SIMDE_FLOAT64_C( -197.05),
SIMDE_FLOAT64_C( 424.40), SIMDE_FLOAT64_C( 768.92), SIMDE_FLOAT64_C( 211.28), SIMDE_FLOAT64_C( -666.92) },
{ SIMDE_FLOAT64_C( -827.00), SIMDE_FLOAT64_C( 949.00), SIMDE_FLOAT64_C( -165.00), SIMDE_FLOAT64_C( -197.00),
SIMDE_FLOAT64_C( 424.00), SIMDE_FLOAT64_C( -511.45), SIMDE_FLOAT64_C( 211.00), SIMDE_FLOAT64_C( -667.00) } },
{ { SIMDE_FLOAT64_C( -5.52), SIMDE_FLOAT64_C( -776.35), SIMDE_FLOAT64_C( -326.62), SIMDE_FLOAT64_C( 233.68),
SIMDE_FLOAT64_C( 454.98), SIMDE_FLOAT64_C( 714.97), SIMDE_FLOAT64_C( -650.48), SIMDE_FLOAT64_C( -945.69) },
UINT8_C(115),
{ SIMDE_FLOAT64_C( 299.69), SIMDE_FLOAT64_C( 139.59), SIMDE_FLOAT64_C( 701.29), SIMDE_FLOAT64_C( 363.71),
SIMDE_FLOAT64_C( 316.05), SIMDE_FLOAT64_C( -116.39), SIMDE_FLOAT64_C( 642.67), SIMDE_FLOAT64_C( 149.46) },
{ SIMDE_FLOAT64_C( 300.00), SIMDE_FLOAT64_C( 140.00), SIMDE_FLOAT64_C( -326.62), SIMDE_FLOAT64_C( 233.68),
SIMDE_FLOAT64_C( 316.00), SIMDE_FLOAT64_C( -116.00), SIMDE_FLOAT64_C( 643.00), SIMDE_FLOAT64_C( -945.69) } },
{ { SIMDE_FLOAT64_C( 177.32), SIMDE_FLOAT64_C( -566.52), SIMDE_FLOAT64_C( 638.01), SIMDE_FLOAT64_C( -812.62),
SIMDE_FLOAT64_C( -188.29), SIMDE_FLOAT64_C( -108.94), SIMDE_FLOAT64_C( -639.45), SIMDE_FLOAT64_C( -238.81) },
UINT8_C( 57),
{ SIMDE_FLOAT64_C( 163.50), SIMDE_FLOAT64_C( -814.42), SIMDE_FLOAT64_C( 495.41), SIMDE_FLOAT64_C( -625.21),
SIMDE_FLOAT64_C( -481.34), SIMDE_FLOAT64_C( -510.10), SIMDE_FLOAT64_C( -401.56), SIMDE_FLOAT64_C( 192.04) },
{ SIMDE_FLOAT64_C( 164.00), SIMDE_FLOAT64_C( -566.52), SIMDE_FLOAT64_C( 638.01), SIMDE_FLOAT64_C( -625.00),
SIMDE_FLOAT64_C( -481.00), SIMDE_FLOAT64_C( -510.00), SIMDE_FLOAT64_C( -639.45), SIMDE_FLOAT64_C( -238.81) } },
{ { SIMDE_FLOAT64_C( 723.58), SIMDE_FLOAT64_C( -946.57), SIMDE_FLOAT64_C( -92.99), SIMDE_FLOAT64_C( -926.90),
SIMDE_FLOAT64_C( -892.27), SIMDE_FLOAT64_C( -227.94), SIMDE_FLOAT64_C( 372.79), SIMDE_FLOAT64_C( 247.32) },
UINT8_C(253),
{ SIMDE_FLOAT64_C( -263.51), SIMDE_FLOAT64_C( -436.63), SIMDE_FLOAT64_C( 356.97), SIMDE_FLOAT64_C( -620.84),
SIMDE_FLOAT64_C( 712.84), SIMDE_FLOAT64_C( -465.71), SIMDE_FLOAT64_C( -187.36), SIMDE_FLOAT64_C( 350.85) },
{ SIMDE_FLOAT64_C( -264.00), SIMDE_FLOAT64_C( -946.57), SIMDE_FLOAT64_C( 357.00), SIMDE_FLOAT64_C( -621.00),
SIMDE_FLOAT64_C( 713.00), SIMDE_FLOAT64_C( -466.00), SIMDE_FLOAT64_C( -187.00), SIMDE_FLOAT64_C( 351.00) } },
{ { SIMDE_FLOAT64_C( -278.33), SIMDE_FLOAT64_C( 624.35), SIMDE_FLOAT64_C( -758.09), SIMDE_FLOAT64_C( 82.22),
SIMDE_FLOAT64_C( -614.46), SIMDE_FLOAT64_C( 968.40), SIMDE_FLOAT64_C( -754.27), SIMDE_FLOAT64_C( -428.88) },
UINT8_C( 24),
{ SIMDE_FLOAT64_C( -379.49), SIMDE_FLOAT64_C( 89.78), SIMDE_FLOAT64_C( 953.71), SIMDE_FLOAT64_C( 218.96),
SIMDE_FLOAT64_C( -718.17), SIMDE_FLOAT64_C( 677.29), SIMDE_FLOAT64_C( 272.38), SIMDE_FLOAT64_C( 188.83) },
{ SIMDE_FLOAT64_C( -278.33), SIMDE_FLOAT64_C( 624.35), SIMDE_FLOAT64_C( -758.09), SIMDE_FLOAT64_C( 219.00),
SIMDE_FLOAT64_C( -718.00), SIMDE_FLOAT64_C( 968.40), SIMDE_FLOAT64_C( -754.27), SIMDE_FLOAT64_C( -428.88) } },
{ { SIMDE_FLOAT64_C( 750.39), SIMDE_FLOAT64_C( 380.12), SIMDE_FLOAT64_C( 960.90), SIMDE_FLOAT64_C( 123.18),
SIMDE_FLOAT64_C( -372.56), SIMDE_FLOAT64_C( -565.75), SIMDE_FLOAT64_C( 859.67), SIMDE_FLOAT64_C( 190.81) },
UINT8_C(196),
{ SIMDE_FLOAT64_C( -761.17), SIMDE_FLOAT64_C( -96.36), SIMDE_FLOAT64_C( -674.48), SIMDE_FLOAT64_C( 51.47),
SIMDE_FLOAT64_C( -745.51), SIMDE_FLOAT64_C( 47.19), SIMDE_FLOAT64_C( -324.18), SIMDE_FLOAT64_C( -503.60) },
{ SIMDE_FLOAT64_C( 750.39), SIMDE_FLOAT64_C( 380.12), SIMDE_FLOAT64_C( -674.00), SIMDE_FLOAT64_C( 123.18),
SIMDE_FLOAT64_C( -372.56), SIMDE_FLOAT64_C( -565.75), SIMDE_FLOAT64_C( -324.00), SIMDE_FLOAT64_C( -504.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_svml_round_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_svml_sqrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 770.44), SIMDE_FLOAT64_C( 798.21) },
{ SIMDE_FLOAT64_C( 27.76), SIMDE_FLOAT64_C( 28.25) } },
{ { SIMDE_FLOAT64_C( 609.46), SIMDE_FLOAT64_C( 219.02) },
{ SIMDE_FLOAT64_C( 24.69), SIMDE_FLOAT64_C( 14.80) } },
{ { SIMDE_FLOAT64_C( 514.28), SIMDE_FLOAT64_C( 301.39) },
{ SIMDE_FLOAT64_C( 22.68), SIMDE_FLOAT64_C( 17.36) } },
{ { SIMDE_FLOAT64_C( 520.55), SIMDE_FLOAT64_C( 108.95) },
{ SIMDE_FLOAT64_C( 22.82), SIMDE_FLOAT64_C( 10.44) } },
{ { SIMDE_FLOAT64_C( 417.19), SIMDE_FLOAT64_C( 212.16) },
{ SIMDE_FLOAT64_C( 20.43), SIMDE_FLOAT64_C( 14.57) } },
{ { SIMDE_FLOAT64_C( 40.41), SIMDE_FLOAT64_C( 807.43) },
{ SIMDE_FLOAT64_C( 6.36), SIMDE_FLOAT64_C( 28.42) } },
{ { SIMDE_FLOAT64_C( 746.18), SIMDE_FLOAT64_C( 239.87) },
{ SIMDE_FLOAT64_C( 27.32), SIMDE_FLOAT64_C( 15.49) } },
{ { SIMDE_FLOAT64_C( 461.80), SIMDE_FLOAT64_C( 420.17) },
{ SIMDE_FLOAT64_C( 21.49), SIMDE_FLOAT64_C( 20.50) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_svml_sqrt_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_svml_sqrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 935.36), SIMDE_FLOAT32_C( 463.00), SIMDE_FLOAT32_C( 356.54), SIMDE_FLOAT32_C( 614.58),
SIMDE_FLOAT32_C( 720.00), SIMDE_FLOAT32_C( 747.09), SIMDE_FLOAT32_C( 873.09), SIMDE_FLOAT32_C( 461.84) },
{ SIMDE_FLOAT32_C( 30.58), SIMDE_FLOAT32_C( 21.52), SIMDE_FLOAT32_C( 18.88), SIMDE_FLOAT32_C( 24.79),
SIMDE_FLOAT32_C( 26.83), SIMDE_FLOAT32_C( 27.33), SIMDE_FLOAT32_C( 29.55), SIMDE_FLOAT32_C( 21.49) } },
{ { SIMDE_FLOAT32_C( 718.30), SIMDE_FLOAT32_C( 297.75), SIMDE_FLOAT32_C( 46.73), SIMDE_FLOAT32_C( -42.51),
SIMDE_FLOAT32_C( 207.50), SIMDE_FLOAT32_C( 492.51), SIMDE_FLOAT32_C( 15.08), SIMDE_FLOAT32_C( 719.29) },
{ SIMDE_FLOAT32_C( 26.80), SIMDE_FLOAT32_C( 17.26), SIMDE_FLOAT32_C( 6.84), SIMDE_MATH_NANF,
SIMDE_FLOAT32_C( 14.40), SIMDE_FLOAT32_C( 22.19), SIMDE_FLOAT32_C( 3.88), SIMDE_FLOAT32_C( 26.82) } },
{ { SIMDE_FLOAT32_C( 347.10), SIMDE_FLOAT32_C( 575.60), SIMDE_FLOAT32_C( 719.84), SIMDE_FLOAT32_C( 241.71),
SIMDE_FLOAT32_C( 139.48), SIMDE_FLOAT32_C( 757.17), SIMDE_FLOAT32_C( 132.17), SIMDE_FLOAT32_C( 152.46) },
{ SIMDE_FLOAT32_C( 18.63), SIMDE_FLOAT32_C( 23.99), SIMDE_FLOAT32_C( 26.83), SIMDE_FLOAT32_C( 15.55),
SIMDE_FLOAT32_C( 11.81), SIMDE_FLOAT32_C( 27.52), SIMDE_FLOAT32_C( 11.50), SIMDE_FLOAT32_C( 12.35) } },
{ { SIMDE_FLOAT32_C( 780.23), SIMDE_FLOAT32_C( 823.65), SIMDE_FLOAT32_C( 290.06), SIMDE_FLOAT32_C( 492.64),
SIMDE_FLOAT32_C( 944.24), SIMDE_FLOAT32_C( 836.21), SIMDE_FLOAT32_C( 785.55), SIMDE_FLOAT32_C( 879.60) },
{ SIMDE_FLOAT32_C( 27.93), SIMDE_FLOAT32_C( 28.70), SIMDE_FLOAT32_C( 17.03), SIMDE_FLOAT32_C( 22.20),
SIMDE_FLOAT32_C( 30.73), SIMDE_FLOAT32_C( 28.92), SIMDE_FLOAT32_C( 28.03), SIMDE_FLOAT32_C( 29.66) } },
{ { SIMDE_FLOAT32_C( 299.21), SIMDE_FLOAT32_C( 142.09), SIMDE_FLOAT32_C( 494.18), SIMDE_FLOAT32_C( 19.21),
SIMDE_FLOAT32_C( 989.19), SIMDE_FLOAT32_C( 367.28), SIMDE_FLOAT32_C( 581.05), SIMDE_FLOAT32_C( 707.48) },
{ SIMDE_FLOAT32_C( 17.30), SIMDE_FLOAT32_C( 11.92), SIMDE_FLOAT32_C( 22.23), SIMDE_FLOAT32_C( 4.38),
SIMDE_FLOAT32_C( 31.45), SIMDE_FLOAT32_C( 19.16), SIMDE_FLOAT32_C( 24.10), SIMDE_FLOAT32_C( 26.60) } },
{ { SIMDE_FLOAT32_C( 765.03), SIMDE_FLOAT32_C( 727.79), SIMDE_FLOAT32_C( 764.97), SIMDE_FLOAT32_C( -27.47),
SIMDE_FLOAT32_C( 220.30), SIMDE_FLOAT32_C( 880.05), SIMDE_FLOAT32_C( 791.82), SIMDE_FLOAT32_C( 667.40) },
{ SIMDE_FLOAT32_C( 27.66), SIMDE_FLOAT32_C( 26.98), SIMDE_FLOAT32_C( 27.66), SIMDE_MATH_NANF,
SIMDE_FLOAT32_C( 14.84), SIMDE_FLOAT32_C( 29.67), SIMDE_FLOAT32_C( 28.14), SIMDE_FLOAT32_C( 25.83) } },
{ { SIMDE_FLOAT32_C( 455.65), SIMDE_FLOAT32_C( 511.66), SIMDE_FLOAT32_C( -90.90), SIMDE_FLOAT32_C( 695.13),
SIMDE_FLOAT32_C( 268.83), SIMDE_FLOAT32_C( 141.28), SIMDE_FLOAT32_C( 947.59), SIMDE_FLOAT32_C( 49.06) },
{ SIMDE_FLOAT32_C( 21.35), SIMDE_FLOAT32_C( 22.62), SIMDE_MATH_NANF, SIMDE_FLOAT32_C( 26.37),
SIMDE_FLOAT32_C( 16.40), SIMDE_FLOAT32_C( 11.89), SIMDE_FLOAT32_C( 30.78), SIMDE_FLOAT32_C( 7.00) } },
{ { SIMDE_FLOAT32_C( -35.07), SIMDE_FLOAT32_C( 237.65), SIMDE_FLOAT32_C( 641.70), SIMDE_FLOAT32_C( -90.83),
SIMDE_FLOAT32_C( 73.86), SIMDE_FLOAT32_C( 427.26), SIMDE_FLOAT32_C( 888.77), SIMDE_FLOAT32_C( 473.07) },
{ SIMDE_MATH_NANF, SIMDE_FLOAT32_C( 15.42), SIMDE_FLOAT32_C( 25.33), SIMDE_MATH_NANF,
SIMDE_FLOAT32_C( 8.59), SIMDE_FLOAT32_C( 20.67), SIMDE_FLOAT32_C( 29.81), SIMDE_FLOAT32_C( 21.75) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_svml_sqrt_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_svml_sqrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 898.02), SIMDE_FLOAT64_C( 77.23), SIMDE_FLOAT64_C( 690.30), SIMDE_FLOAT64_C( 742.27) },
{ SIMDE_FLOAT64_C( 29.97), SIMDE_FLOAT64_C( 8.79), SIMDE_FLOAT64_C( 26.27), SIMDE_FLOAT64_C( 27.24) } },
{ { SIMDE_FLOAT64_C( 301.75), SIMDE_FLOAT64_C( 377.86), SIMDE_FLOAT64_C( 38.07), SIMDE_FLOAT64_C( 270.72) },
{ SIMDE_FLOAT64_C( 17.37), SIMDE_FLOAT64_C( 19.44), SIMDE_FLOAT64_C( 6.17), SIMDE_FLOAT64_C( 16.45) } },
{ { SIMDE_FLOAT64_C( 661.06), SIMDE_FLOAT64_C( 955.80), SIMDE_FLOAT64_C( 540.55), SIMDE_FLOAT64_C( 699.66) },
{ SIMDE_FLOAT64_C( 25.71), SIMDE_FLOAT64_C( 30.92), SIMDE_FLOAT64_C( 23.25), SIMDE_FLOAT64_C( 26.45) } },
{ { SIMDE_FLOAT64_C( 41.79), SIMDE_FLOAT64_C( 429.36), SIMDE_FLOAT64_C( 830.75), SIMDE_FLOAT64_C( 836.32) },
{ SIMDE_FLOAT64_C( 6.46), SIMDE_FLOAT64_C( 20.72), SIMDE_FLOAT64_C( 28.82), SIMDE_FLOAT64_C( 28.92) } },
{ { SIMDE_FLOAT64_C( 153.46), SIMDE_FLOAT64_C( 994.23), SIMDE_FLOAT64_C( 913.53), SIMDE_FLOAT64_C( 889.00) },
{ SIMDE_FLOAT64_C( 12.39), SIMDE_FLOAT64_C( 31.53), SIMDE_FLOAT64_C( 30.22), SIMDE_FLOAT64_C( 29.82) } },
{ { SIMDE_FLOAT64_C( 140.95), SIMDE_FLOAT64_C( 65.36), SIMDE_FLOAT64_C( 968.68), SIMDE_FLOAT64_C( 947.21) },
{ SIMDE_FLOAT64_C( 11.87), SIMDE_FLOAT64_C( 8.08), SIMDE_FLOAT64_C( 31.12), SIMDE_FLOAT64_C( 30.78) } },
{ { SIMDE_FLOAT64_C( -31.19), SIMDE_FLOAT64_C( 466.94), SIMDE_FLOAT64_C( 225.29), SIMDE_FLOAT64_C( 967.56) },
{ SIMDE_MATH_NAN, SIMDE_FLOAT64_C( 21.61), SIMDE_FLOAT64_C( 15.01), SIMDE_FLOAT64_C( 31.11) } },
{ { SIMDE_FLOAT64_C( 710.29), SIMDE_FLOAT64_C( 718.44), SIMDE_FLOAT64_C( 305.66), SIMDE_FLOAT64_C( 608.32) },
{ SIMDE_FLOAT64_C( 26.65), SIMDE_FLOAT64_C( 26.80), SIMDE_FLOAT64_C( 17.48), SIMDE_FLOAT64_C( 24.66) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_svml_sqrt_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_svml_sqrt_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 914.68), SIMDE_FLOAT32_C( 142.36), SIMDE_FLOAT32_C( 751.77), SIMDE_FLOAT32_C( 42.61),
SIMDE_FLOAT32_C( 433.18), SIMDE_FLOAT32_C( -95.01), SIMDE_FLOAT32_C( 535.55), SIMDE_FLOAT32_C( 168.98),
SIMDE_FLOAT32_C( 508.03), SIMDE_FLOAT32_C( 713.68), SIMDE_FLOAT32_C( 502.38), SIMDE_FLOAT32_C( 504.11),
SIMDE_FLOAT32_C( 643.10), SIMDE_FLOAT32_C( 546.21), SIMDE_FLOAT32_C( 975.24), SIMDE_FLOAT32_C( 770.62) },
{ SIMDE_FLOAT32_C( 30.24), SIMDE_FLOAT32_C( 11.93), SIMDE_FLOAT32_C( 27.42), SIMDE_FLOAT32_C( 6.53),
SIMDE_FLOAT32_C( 20.81), SIMDE_MATH_NANF, SIMDE_FLOAT32_C( 23.14), SIMDE_FLOAT32_C( 13.00),
SIMDE_FLOAT32_C( 22.54), SIMDE_FLOAT32_C( 26.71), SIMDE_FLOAT32_C( 22.41), SIMDE_FLOAT32_C( 22.45),
SIMDE_FLOAT32_C( 25.36), SIMDE_FLOAT32_C( 23.37), SIMDE_FLOAT32_C( 31.23), SIMDE_FLOAT32_C( 27.76) } },
{ { SIMDE_FLOAT32_C( 799.15), SIMDE_FLOAT32_C( 249.41), SIMDE_FLOAT32_C( 246.93), SIMDE_FLOAT32_C( -33.60),
SIMDE_FLOAT32_C( 336.37), SIMDE_FLOAT32_C( 867.92), SIMDE_FLOAT32_C( 50.92), SIMDE_FLOAT32_C( 348.52),
SIMDE_FLOAT32_C( 870.30), SIMDE_FLOAT32_C( 193.09), SIMDE_FLOAT32_C( 153.59), SIMDE_FLOAT32_C( 803.32),
SIMDE_FLOAT32_C( 802.44), SIMDE_FLOAT32_C( 360.38), SIMDE_FLOAT32_C( 481.46), SIMDE_FLOAT32_C( 717.12) },
{ SIMDE_FLOAT32_C( 28.27), SIMDE_FLOAT32_C( 15.79), SIMDE_FLOAT32_C( 15.71), SIMDE_MATH_NANF,
SIMDE_FLOAT32_C( 18.34), SIMDE_FLOAT32_C( 29.46), SIMDE_FLOAT32_C( 7.14), SIMDE_FLOAT32_C( 18.67),
SIMDE_FLOAT32_C( 29.50), SIMDE_FLOAT32_C( 13.90), SIMDE_FLOAT32_C( 12.39), SIMDE_FLOAT32_C( 28.34),
SIMDE_FLOAT32_C( 28.33), SIMDE_FLOAT32_C( 18.98), SIMDE_FLOAT32_C( 21.94), SIMDE_FLOAT32_C( 26.78) } },
{ { SIMDE_FLOAT32_C( 602.74), SIMDE_FLOAT32_C( 233.23), SIMDE_FLOAT32_C( 859.73), SIMDE_FLOAT32_C( 35.92),
SIMDE_FLOAT32_C( 238.22), SIMDE_FLOAT32_C( 395.29), SIMDE_FLOAT32_C( 304.89), SIMDE_FLOAT32_C( 846.24),
SIMDE_FLOAT32_C( 108.97), SIMDE_FLOAT32_C( 907.27), SIMDE_FLOAT32_C( 350.35), SIMDE_FLOAT32_C( 852.07),
SIMDE_FLOAT32_C( 453.48), SIMDE_FLOAT32_C( 325.59), SIMDE_FLOAT32_C( 622.69), SIMDE_FLOAT32_C( 252.63) },
{ SIMDE_FLOAT32_C( 24.55), SIMDE_FLOAT32_C( 15.27), SIMDE_FLOAT32_C( 29.32), SIMDE_FLOAT32_C( 5.99),
SIMDE_FLOAT32_C( 15.43), SIMDE_FLOAT32_C( 19.88), SIMDE_FLOAT32_C( 17.46), SIMDE_FLOAT32_C( 29.09),
SIMDE_FLOAT32_C( 10.44), SIMDE_FLOAT32_C( 30.12), SIMDE_FLOAT32_C( 18.72), SIMDE_FLOAT32_C( 29.19),
SIMDE_FLOAT32_C( 21.30), SIMDE_FLOAT32_C( 18.04), SIMDE_FLOAT32_C( 24.95), SIMDE_FLOAT32_C( 15.89) } },
{ { SIMDE_FLOAT32_C( 675.00), SIMDE_FLOAT32_C( 969.62), SIMDE_FLOAT32_C( 319.04), SIMDE_FLOAT32_C( 11.37),
SIMDE_FLOAT32_C( 837.54), SIMDE_FLOAT32_C( 469.95), SIMDE_FLOAT32_C( 459.89), SIMDE_FLOAT32_C( 707.84),
SIMDE_FLOAT32_C( 763.05), SIMDE_FLOAT32_C( 713.48), SIMDE_FLOAT32_C( 511.15), SIMDE_FLOAT32_C( 565.49),
SIMDE_FLOAT32_C( 73.86), SIMDE_FLOAT32_C( -7.39), SIMDE_FLOAT32_C( 282.61), SIMDE_FLOAT32_C( 776.60) },
{ SIMDE_FLOAT32_C( 25.98), SIMDE_FLOAT32_C( 31.14), SIMDE_FLOAT32_C( 17.86), SIMDE_FLOAT32_C( 3.37),
SIMDE_FLOAT32_C( 28.94), SIMDE_FLOAT32_C( 21.68), SIMDE_FLOAT32_C( 21.44), SIMDE_FLOAT32_C( 26.61),
SIMDE_FLOAT32_C( 27.62), SIMDE_FLOAT32_C( 26.71), SIMDE_FLOAT32_C( 22.61), SIMDE_FLOAT32_C( 23.78),
SIMDE_FLOAT32_C( 8.59), SIMDE_MATH_NANF, SIMDE_FLOAT32_C( 16.81), SIMDE_FLOAT32_C( 27.87) } },
{ { SIMDE_FLOAT32_C( 325.84), SIMDE_FLOAT32_C( 142.35), SIMDE_FLOAT32_C( 912.52), SIMDE_FLOAT32_C( 664.06),
SIMDE_FLOAT32_C( 637.63), SIMDE_FLOAT32_C( 217.41), SIMDE_FLOAT32_C( 510.30), SIMDE_FLOAT32_C( 846.60),
SIMDE_FLOAT32_C( 124.68), SIMDE_FLOAT32_C( 960.65), SIMDE_FLOAT32_C( 698.67), SIMDE_FLOAT32_C( 678.16),
SIMDE_FLOAT32_C( 286.24), SIMDE_FLOAT32_C( 321.36), SIMDE_FLOAT32_C( -69.20), SIMDE_FLOAT32_C( -38.77) },
{ SIMDE_FLOAT32_C( 18.05), SIMDE_FLOAT32_C( 11.93), SIMDE_FLOAT32_C( 30.21), SIMDE_FLOAT32_C( 25.77),
SIMDE_FLOAT32_C( 25.25), SIMDE_FLOAT32_C( 14.74), SIMDE_FLOAT32_C( 22.59), SIMDE_FLOAT32_C( 29.10),
SIMDE_FLOAT32_C( 11.17), SIMDE_FLOAT32_C( 30.99), SIMDE_FLOAT32_C( 26.43), SIMDE_FLOAT32_C( 26.04),
SIMDE_FLOAT32_C( 16.92), SIMDE_FLOAT32_C( 17.93), SIMDE_MATH_NANF, SIMDE_MATH_NANF } },
{ { SIMDE_FLOAT32_C( 290.98), SIMDE_FLOAT32_C( 349.84), SIMDE_FLOAT32_C( 72.60), SIMDE_FLOAT32_C( 128.51),
SIMDE_FLOAT32_C( 919.79), SIMDE_FLOAT32_C( 632.49), SIMDE_FLOAT32_C( 936.35), SIMDE_FLOAT32_C( 682.84),
SIMDE_FLOAT32_C( 345.97), SIMDE_FLOAT32_C( 447.51), SIMDE_FLOAT32_C( 248.33), SIMDE_FLOAT32_C( 519.83),
SIMDE_FLOAT32_C( 540.12), SIMDE_FLOAT32_C( 630.94), SIMDE_FLOAT32_C( 296.43), SIMDE_FLOAT32_C( 965.96) },
{ SIMDE_FLOAT32_C( 17.06), SIMDE_FLOAT32_C( 18.70), SIMDE_FLOAT32_C( 8.52), SIMDE_FLOAT32_C( 11.34),
SIMDE_FLOAT32_C( 30.33), SIMDE_FLOAT32_C( 25.15), SIMDE_FLOAT32_C( 30.60), SIMDE_FLOAT32_C( 26.13),
SIMDE_FLOAT32_C( 18.60), SIMDE_FLOAT32_C( 21.15), SIMDE_FLOAT32_C( 15.76), SIMDE_FLOAT32_C( 22.80),
SIMDE_FLOAT32_C( 23.24), SIMDE_FLOAT32_C( 25.12), SIMDE_FLOAT32_C( 17.22), SIMDE_FLOAT32_C( 31.08) } },
{ { SIMDE_FLOAT32_C( 873.29), SIMDE_FLOAT32_C( 208.95), SIMDE_FLOAT32_C( 630.01), SIMDE_FLOAT32_C( 510.92),
SIMDE_FLOAT32_C( 526.36), SIMDE_FLOAT32_C( 140.32), SIMDE_FLOAT32_C( 357.53), SIMDE_FLOAT32_C( 751.05),
SIMDE_FLOAT32_C( 100.97), SIMDE_FLOAT32_C( 56.20), SIMDE_FLOAT32_C( 429.21), SIMDE_FLOAT32_C( 487.20),
SIMDE_FLOAT32_C( 477.55), SIMDE_FLOAT32_C( 460.01), SIMDE_FLOAT32_C( 548.44), SIMDE_FLOAT32_C( 868.53) },
{ SIMDE_FLOAT32_C( 29.55), SIMDE_FLOAT32_C( 14.46), SIMDE_FLOAT32_C( 25.10), SIMDE_FLOAT32_C( 22.60),
SIMDE_FLOAT32_C( 22.94), SIMDE_FLOAT32_C( 11.85), SIMDE_FLOAT32_C( 18.91), SIMDE_FLOAT32_C( 27.41),
SIMDE_FLOAT32_C( 10.05), SIMDE_FLOAT32_C( 7.50), SIMDE_FLOAT32_C( 20.72), SIMDE_FLOAT32_C( 22.07),
SIMDE_FLOAT32_C( 21.85), SIMDE_FLOAT32_C( 21.45), SIMDE_FLOAT32_C( 23.42), SIMDE_FLOAT32_C( 29.47) } },
{ { SIMDE_FLOAT32_C( 909.84), SIMDE_FLOAT32_C( 721.04), SIMDE_FLOAT32_C( -2.95), SIMDE_FLOAT32_C( 829.64),
SIMDE_FLOAT32_C( 353.53), SIMDE_FLOAT32_C( -66.60), SIMDE_FLOAT32_C( 512.48), SIMDE_FLOAT32_C( 799.49),
SIMDE_FLOAT32_C( 480.91), SIMDE_FLOAT32_C( 860.80), SIMDE_FLOAT32_C( 319.32), SIMDE_FLOAT32_C( 21.02),
SIMDE_FLOAT32_C( 491.75), SIMDE_FLOAT32_C( 715.75), SIMDE_FLOAT32_C( -13.02), SIMDE_FLOAT32_C( 365.04) },
{ SIMDE_FLOAT32_C( 30.16), SIMDE_FLOAT32_C( 26.85), SIMDE_MATH_NANF, SIMDE_FLOAT32_C( 28.80),
SIMDE_FLOAT32_C( 18.80), SIMDE_MATH_NANF, SIMDE_FLOAT32_C( 22.64), SIMDE_FLOAT32_C( 28.28),
SIMDE_FLOAT32_C( 21.93), SIMDE_FLOAT32_C( 29.34), SIMDE_FLOAT32_C( 17.87), SIMDE_FLOAT32_C( 4.59),
SIMDE_FLOAT32_C( 22.18), SIMDE_FLOAT32_C( 26.75), SIMDE_MATH_NANF, SIMDE_FLOAT32_C( 19.11) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_svml_sqrt_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_svml_sqrt_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 375.58), SIMDE_FLOAT64_C( 46.46), SIMDE_FLOAT64_C( 312.83), SIMDE_FLOAT64_C( 445.14),
SIMDE_FLOAT64_C( 187.32), SIMDE_FLOAT64_C( 952.90), SIMDE_FLOAT64_C( 987.69), SIMDE_FLOAT64_C( 406.24) },
{ SIMDE_FLOAT64_C( 19.38), SIMDE_FLOAT64_C( 6.82), SIMDE_FLOAT64_C( 17.69), SIMDE_FLOAT64_C( 21.10),
SIMDE_FLOAT64_C( 13.69), SIMDE_FLOAT64_C( 30.87), SIMDE_FLOAT64_C( 31.43), SIMDE_FLOAT64_C( 20.16) } },
{ { SIMDE_FLOAT64_C( 293.47), SIMDE_FLOAT64_C( 304.52), SIMDE_FLOAT64_C( 836.60), SIMDE_FLOAT64_C( 342.20),
SIMDE_FLOAT64_C( 740.40), SIMDE_FLOAT64_C( 328.94), SIMDE_FLOAT64_C( 360.36), SIMDE_FLOAT64_C( 97.23) },
{ SIMDE_FLOAT64_C( 17.13), SIMDE_FLOAT64_C( 17.45), SIMDE_FLOAT64_C( 28.92), SIMDE_FLOAT64_C( 18.50),
SIMDE_FLOAT64_C( 27.21), SIMDE_FLOAT64_C( 18.14), SIMDE_FLOAT64_C( 18.98), SIMDE_FLOAT64_C( 9.86) } },
{ { SIMDE_FLOAT64_C( 931.22), SIMDE_FLOAT64_C( 239.31), SIMDE_FLOAT64_C( 533.01), SIMDE_FLOAT64_C( 413.09),
SIMDE_FLOAT64_C( -30.52), SIMDE_FLOAT64_C( 220.33), SIMDE_FLOAT64_C( 224.40), SIMDE_FLOAT64_C( 591.21) },
{ SIMDE_FLOAT64_C( 30.52), SIMDE_FLOAT64_C( 15.47), SIMDE_FLOAT64_C( 23.09), SIMDE_FLOAT64_C( 20.32),
SIMDE_MATH_NAN, SIMDE_FLOAT64_C( 14.84), SIMDE_FLOAT64_C( 14.98), SIMDE_FLOAT64_C( 24.31) } },
{ { SIMDE_FLOAT64_C( 737.21), SIMDE_FLOAT64_C( 927.12), SIMDE_FLOAT64_C( 685.90), SIMDE_FLOAT64_C( 452.75),
SIMDE_FLOAT64_C( 896.77), SIMDE_FLOAT64_C( 752.44), SIMDE_FLOAT64_C( 780.06), SIMDE_FLOAT64_C( 272.35) },
{ SIMDE_FLOAT64_C( 27.15), SIMDE_FLOAT64_C( 30.45), SIMDE_FLOAT64_C( 26.19), SIMDE_FLOAT64_C( 21.28),
SIMDE_FLOAT64_C( 29.95), SIMDE_FLOAT64_C( 27.43), SIMDE_FLOAT64_C( 27.93), SIMDE_FLOAT64_C( 16.50) } },
{ { SIMDE_FLOAT64_C( 898.90), SIMDE_FLOAT64_C( 92.89), SIMDE_FLOAT64_C( 817.49), SIMDE_FLOAT64_C( 86.22),
SIMDE_FLOAT64_C( 45.79), SIMDE_FLOAT64_C( 805.18), SIMDE_FLOAT64_C( 592.46), SIMDE_FLOAT64_C( 439.26) },
{ SIMDE_FLOAT64_C( 29.98), SIMDE_FLOAT64_C( 9.64), SIMDE_FLOAT64_C( 28.59), SIMDE_FLOAT64_C( 9.29),
SIMDE_FLOAT64_C( 6.77), SIMDE_FLOAT64_C( 28.38), SIMDE_FLOAT64_C( 24.34), SIMDE_FLOAT64_C( 20.96) } },
{ { SIMDE_FLOAT64_C( 109.70), SIMDE_FLOAT64_C( 429.07), SIMDE_FLOAT64_C( 881.46), SIMDE_FLOAT64_C( 950.09),
SIMDE_FLOAT64_C( 858.01), SIMDE_FLOAT64_C( 241.82), SIMDE_FLOAT64_C( 47.32), SIMDE_FLOAT64_C( 789.23) },
{ SIMDE_FLOAT64_C( 10.47), SIMDE_FLOAT64_C( 20.71), SIMDE_FLOAT64_C( 29.69), SIMDE_FLOAT64_C( 30.82),
SIMDE_FLOAT64_C( 29.29), SIMDE_FLOAT64_C( 15.55), SIMDE_FLOAT64_C( 6.88), SIMDE_FLOAT64_C( 28.09) } },
{ { SIMDE_FLOAT64_C( 581.13), SIMDE_FLOAT64_C( 680.33), SIMDE_FLOAT64_C( 202.32), SIMDE_FLOAT64_C( 650.61),
SIMDE_FLOAT64_C( -99.34), SIMDE_FLOAT64_C( 526.72), SIMDE_FLOAT64_C( 241.82), SIMDE_FLOAT64_C( 737.87) },
{ SIMDE_FLOAT64_C( 24.11), SIMDE_FLOAT64_C( 26.08), SIMDE_FLOAT64_C( 14.22), SIMDE_FLOAT64_C( 25.51),
SIMDE_MATH_NAN, SIMDE_FLOAT64_C( 22.95), SIMDE_FLOAT64_C( 15.55), SIMDE_FLOAT64_C( 27.16) } },
{ { SIMDE_FLOAT64_C( 453.84), SIMDE_FLOAT64_C( -72.28), SIMDE_FLOAT64_C( 190.62), SIMDE_FLOAT64_C( 350.61),
SIMDE_FLOAT64_C( 780.16), SIMDE_FLOAT64_C( -29.31), SIMDE_FLOAT64_C( 722.96), SIMDE_FLOAT64_C( 679.07) },
{ SIMDE_FLOAT64_C( 21.30), SIMDE_MATH_NAN, SIMDE_FLOAT64_C( 13.81), SIMDE_FLOAT64_C( 18.72),
SIMDE_FLOAT64_C( 27.93), SIMDE_MATH_NAN, SIMDE_FLOAT64_C( 26.89), SIMDE_FLOAT64_C( 26.06) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_svml_sqrt_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_tan_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -1.15), SIMDE_FLOAT32_C( 3.76), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 1.76)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 1.87), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( -0.54)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 2.88), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( -0.35)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -3.14), SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( -0.32)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( -0.63)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 3.55), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( -0.01)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 1.91), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -1.81)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -1.19), SIMDE_FLOAT32_C( -6.68), SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( -1.31)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_tan_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_tan_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 1.76)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -1.15), SIMDE_FLOAT64_C( 3.76)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.56), SIMDE_FLOAT64_C( -0.54)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.36), SIMDE_FLOAT64_C( 1.87)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( -0.35)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.86), SIMDE_FLOAT64_C( 2.88)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.05), SIMDE_FLOAT64_C( -0.32)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( -3.14)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_tan_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_tan_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24),
SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01),
SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 1.87),
SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( -0.54),
SIMDE_FLOAT32_C( -1.15), SIMDE_FLOAT32_C( 3.76),
SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 1.76)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13),
SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21),
SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -3.14),
SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( -0.32),
SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 2.88),
SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( -0.35)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47),
SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95),
SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 3.55), SIMDE_FLOAT32_C( 0.20),
SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.15),
SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( -0.63)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67),
SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54),
SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -1.19), SIMDE_FLOAT32_C( -6.68),
SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( -1.31),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 1.91),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -1.81)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48),
SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90),
SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( -1.66), SIMDE_FLOAT32_C( 2.28),
SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.91)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92),
SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -11.51), SIMDE_FLOAT32_C( 0.08),
SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( 6.62), SIMDE_FLOAT32_C( -0.10),
SIMDE_FLOAT32_C( 21.84), SIMDE_FLOAT32_C( 0.72)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73),
SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 1.40),
SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 1.11),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.81),
SIMDE_FLOAT32_C( -3.22), SIMDE_FLOAT32_C( -0.16)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02),
SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 123.48), SIMDE_FLOAT32_C( 7.37),
SIMDE_FLOAT32_C( -1.68), SIMDE_FLOAT32_C( -3.54),
SIMDE_FLOAT32_C( -1.67), SIMDE_FLOAT32_C( 0.45),
SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( -0.16)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_tan_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_tan_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -1.15), SIMDE_FLOAT64_C( 3.76),
SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 1.76)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.36), SIMDE_FLOAT64_C( 1.87),
SIMDE_FLOAT64_C( 1.56), SIMDE_FLOAT64_C( -0.54)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.86), SIMDE_FLOAT64_C( 2.88),
SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( -0.35)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( -3.14),
SIMDE_FLOAT64_C( -0.05), SIMDE_FLOAT64_C( -0.32)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( -0.15),
SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( -0.63)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 3.55), SIMDE_FLOAT64_C( 0.20),
SIMDE_FLOAT64_C( -0.94), SIMDE_FLOAT64_C( -0.01)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.62), SIMDE_FLOAT64_C( 1.91),
SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( -1.81)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -1.19), SIMDE_FLOAT64_C( -6.68),
SIMDE_FLOAT64_C( 1.24), SIMDE_FLOAT64_C( -1.31)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_tan_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_tan_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -3.14), SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( -0.32),
SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 2.88), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( -0.35),
SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 1.87), SIMDE_FLOAT32_C( 1.56), SIMDE_FLOAT32_C( -0.54),
SIMDE_FLOAT32_C( -1.15), SIMDE_FLOAT32_C( 3.76), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 1.76)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.19), SIMDE_FLOAT32_C( -6.68), SIMDE_FLOAT32_C( 1.24), SIMDE_FLOAT32_C( -1.31),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 1.91), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -1.81),
SIMDE_FLOAT32_C( 3.55), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( -0.01),
SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.30), SIMDE_FLOAT32_C( -0.63)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99), SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90), SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -11.51), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( -0.88), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( 6.62), SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 21.84), SIMDE_FLOAT32_C( 0.72),
SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( -1.66), SIMDE_FLOAT32_C( 2.28), SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.91)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47), SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23), SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 123.48), SIMDE_FLOAT32_C( 7.37), SIMDE_FLOAT32_C( -1.68), SIMDE_FLOAT32_C( -3.54),
SIMDE_FLOAT32_C( -1.67), SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( -0.48), SIMDE_FLOAT32_C( -0.16),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 1.40), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( 1.11),
SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( -0.81), SIMDE_FLOAT32_C( -3.22), SIMDE_FLOAT32_C( -0.16)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( 343.48),
SIMDE_FLOAT32_C( -874.31), SIMDE_FLOAT32_C( -797.92), SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -525.83),
SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( 655.67),
SIMDE_FLOAT32_C( -70.91), SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( 120.65), SIMDE_FLOAT32_C( -171.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( -10.46), SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( 1.73),
SIMDE_FLOAT32_C( -1.39), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 4.02), SIMDE_FLOAT32_C( -2.46),
SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -1.51), SIMDE_FLOAT32_C( -1.32),
SIMDE_FLOAT32_C( 4.39), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 3.22), SIMDE_FLOAT32_C( 3.31)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -591.56),
SIMDE_FLOAT32_C( -448.89), SIMDE_FLOAT32_C( 731.49), SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 623.70),
SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( -906.16),
SIMDE_FLOAT32_C( 331.34), SIMDE_FLOAT32_C( 99.93), SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -738.19)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.67), SIMDE_FLOAT32_C( -3.56), SIMDE_FLOAT32_C( 1.26), SIMDE_FLOAT32_C( -1.37),
SIMDE_FLOAT32_C( 0.37), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -10.62),
SIMDE_FLOAT32_C( -14.52), SIMDE_FLOAT32_C( -0.85), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -5.21),
SIMDE_FLOAT32_C( 10.17), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 2.15), SIMDE_FLOAT32_C( 0.08)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( -337.60), SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -768.12),
SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( 977.49),
SIMDE_FLOAT32_C( -756.42), SIMDE_FLOAT32_C( 424.81), SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( -95.15)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -1.69), SIMDE_FLOAT32_C( 1.97), SIMDE_FLOAT32_C( 5.68),
SIMDE_FLOAT32_C( -8.21), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 10.08), SIMDE_FLOAT32_C( 1691.15),
SIMDE_FLOAT32_C( -3.72), SIMDE_FLOAT32_C( 10.41), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.49),
SIMDE_FLOAT32_C( 0.85), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -1.64), SIMDE_FLOAT32_C( -1.27)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 932.66), SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -125.20),
SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( 14.34), SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -696.69)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -16.06), SIMDE_FLOAT32_C( 20.97), SIMDE_FLOAT32_C( 53.90), SIMDE_FLOAT32_C( 1.23),
SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 1.51), SIMDE_FLOAT32_C( -0.54), SIMDE_FLOAT32_C( 0.50),
SIMDE_FLOAT32_C( -0.24), SIMDE_FLOAT32_C( -1.29), SIMDE_FLOAT32_C( -2.82), SIMDE_FLOAT32_C( -2.36),
SIMDE_FLOAT32_C( -4.86), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -1.89), SIMDE_FLOAT32_C( 0.92)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_tan_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_tan_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 696.87),
SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( 346.63)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -976.55),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -417.54),
SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( -754.38)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.19), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 0.30),
SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 0.89),
SIMDE_FLOAT32_C( 1.36), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( -1.15), SIMDE_FLOAT32_C( 346.63)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -554.19),
SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( 780.64),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 28.08)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -171.51), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( 818.66), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 254.31), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( -944.78), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( 398.82), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 339.21), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( -30.79), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( 380.46), SIMDE_FLOAT32_C( 993.90)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 3.31), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( 1.40), SIMDE_FLOAT32_C( 1.11), SIMDE_FLOAT32_C( -0.81),
SIMDE_FLOAT32_C( -0.16), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.10),
SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 2.28)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 99.93),
SIMDE_FLOAT32_C( -738.19), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 343.48), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -448.89),
SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( 331.34),
SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( -874.31),
SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( -70.91)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( -1.67), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 10.17),
SIMDE_FLOAT32_C( 2.15), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 1.69), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -337.60),
SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( -756.42)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 897.27), SIMDE_FLOAT32_C( -197.89), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -125.20), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( -696.69), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( -768.12), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 977.49)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( 1.51), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -1.69), SIMDE_FLOAT32_C( 5.68),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 10.41), SIMDE_FLOAT32_C( 0.49)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -737.13), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 177.92),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 888.71), SIMDE_FLOAT32_C( 915.71), SIMDE_FLOAT32_C( 133.52),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( -775.04), SIMDE_FLOAT32_C( 440.64)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 496.57), SIMDE_FLOAT32_C( 915.19), SIMDE_FLOAT32_C( -718.40), SIMDE_FLOAT32_C( 159.97),
SIMDE_FLOAT32_C( -861.01), SIMDE_FLOAT32_C( 426.61), SIMDE_FLOAT32_C( 932.11), SIMDE_FLOAT32_C( 110.36),
SIMDE_FLOAT32_C( 826.84), SIMDE_FLOAT32_C( -76.75), SIMDE_FLOAT32_C( 237.58), SIMDE_FLOAT32_C( -378.50),
SIMDE_FLOAT32_C( -601.68), SIMDE_FLOAT32_C( -623.50), SIMDE_FLOAT32_C( -942.47), SIMDE_FLOAT32_C( 475.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( 1.51), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( -0.26),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -1.38), SIMDE_FLOAT32_C( 0.43),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( 440.64)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -807.28), SIMDE_FLOAT32_C( -70.05), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 92.52), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( 834.60), SIMDE_FLOAT32_C( -65.60),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 556.35), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -964.25), SIMDE_FLOAT32_C( -406.33), SIMDE_FLOAT32_C( -743.66), SIMDE_FLOAT32_C( -764.58),
SIMDE_FLOAT32_C( 789.89), SIMDE_FLOAT32_C( 4.83), SIMDE_FLOAT32_C( -818.54), SIMDE_FLOAT32_C( 161.06),
SIMDE_FLOAT32_C( 579.25), SIMDE_FLOAT32_C( -11.78), SIMDE_FLOAT32_C( -308.52), SIMDE_FLOAT32_C( -719.57),
SIMDE_FLOAT32_C( 334.00), SIMDE_FLOAT32_C( 274.71), SIMDE_FLOAT32_C( -916.82), SIMDE_FLOAT32_C( -490.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -1.80), SIMDE_FLOAT32_C( 1.25), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 4.46), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( 6.40), SIMDE_FLOAT32_C( 1.11),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 5.52), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 105.79), SIMDE_FLOAT32_C( 590.10),
SIMDE_FLOAT32_C( 30.91), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -84.00), SIMDE_FLOAT32_C( 80.04),
SIMDE_FLOAT32_C( -709.46), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -889.11)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 18.75), SIMDE_FLOAT32_C( 809.05), SIMDE_FLOAT32_C( 144.05), SIMDE_FLOAT32_C( -427.72),
SIMDE_FLOAT32_C( 308.28), SIMDE_FLOAT32_C( -177.05), SIMDE_FLOAT32_C( -457.77), SIMDE_FLOAT32_C( 678.24),
SIMDE_FLOAT32_C( 66.05), SIMDE_FLOAT32_C( -267.71), SIMDE_FLOAT32_C( 117.28), SIMDE_FLOAT32_C( -576.80),
SIMDE_FLOAT32_C( -38.39), SIMDE_FLOAT32_C( -250.14), SIMDE_FLOAT32_C( -53.92), SIMDE_FLOAT32_C( 91.94)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 1.27), SIMDE_FLOAT32_C( -0.36),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( 1.71), SIMDE_FLOAT32_C( 3.04),
SIMDE_FLOAT32_C( -0.83), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( 1.10)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( -493.41), SIMDE_FLOAT32_C( 822.72),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( -816.27),
SIMDE_FLOAT32_C( -209.34), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -728.70), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( 100.32), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( -204.33)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -841.43), SIMDE_FLOAT32_C( -14.16), SIMDE_FLOAT32_C( 824.88), SIMDE_FLOAT32_C( 793.63),
SIMDE_FLOAT32_C( -736.75), SIMDE_FLOAT32_C( -310.57), SIMDE_FLOAT32_C( 728.87), SIMDE_FLOAT32_C( -350.72),
SIMDE_FLOAT32_C( 60.89), SIMDE_FLOAT32_C( 109.81), SIMDE_FLOAT32_C( 715.94), SIMDE_FLOAT32_C( -250.60),
SIMDE_FLOAT32_C( 944.14), SIMDE_FLOAT32_C( 361.85), SIMDE_FLOAT32_C( -13.07), SIMDE_FLOAT32_C( 852.60)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( -4.65), SIMDE_FLOAT32_C( -2.52),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( 2.17),
SIMDE_FLOAT32_C( 2.57), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( -10.91), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( 2.81)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_tan_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_tan_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.36), SIMDE_FLOAT64_C( 1.87),
SIMDE_FLOAT64_C( 1.56), SIMDE_FLOAT64_C( -0.54),
SIMDE_FLOAT64_C( -1.15), SIMDE_FLOAT64_C( 3.76),
SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 1.76)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( -3.14),
SIMDE_FLOAT64_C( -0.05), SIMDE_FLOAT64_C( -0.32),
SIMDE_FLOAT64_C( -0.86), SIMDE_FLOAT64_C( 2.88),
SIMDE_FLOAT64_C( 0.89), SIMDE_FLOAT64_C( -0.35)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 3.55), SIMDE_FLOAT64_C( 0.20),
SIMDE_FLOAT64_C( -0.94), SIMDE_FLOAT64_C( -0.01),
SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( -0.15),
SIMDE_FLOAT64_C( 0.30), SIMDE_FLOAT64_C( -0.63)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.19), SIMDE_FLOAT64_C( -6.68),
SIMDE_FLOAT64_C( 1.24), SIMDE_FLOAT64_C( -1.31),
SIMDE_FLOAT64_C( 0.62), SIMDE_FLOAT64_C( 1.91),
SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( -1.81)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -770.35), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( 380.46),
SIMDE_FLOAT64_C( -770.72), SIMDE_FLOAT64_C( 993.90),
SIMDE_FLOAT64_C( 28.08), SIMDE_FLOAT64_C( 841.21)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.78), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( 0.34),
SIMDE_FLOAT64_C( -1.66), SIMDE_FLOAT64_C( 2.28),
SIMDE_FLOAT64_C( -0.20), SIMDE_FLOAT64_C( -0.91)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -387.90), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 339.21),
SIMDE_FLOAT64_C( 532.35), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -30.79)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -11.51), SIMDE_FLOAT64_C( 0.08),
SIMDE_FLOAT64_C( -0.88), SIMDE_FLOAT64_C( -0.08),
SIMDE_FLOAT64_C( 6.62), SIMDE_FLOAT64_C( -0.10),
SIMDE_FLOAT64_C( 21.83), SIMDE_FLOAT64_C( 0.72)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -203.65), SIMDE_FLOAT64_C( -80.73),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -944.78),
SIMDE_FLOAT64_C( -747.59), SIMDE_FLOAT64_C( -767.23),
SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( 398.82)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.62), SIMDE_FLOAT64_C( 1.40),
SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( 1.11),
SIMDE_FLOAT64_C( 0.11), SIMDE_FLOAT64_C( -0.81),
SIMDE_FLOAT64_C( -3.22), SIMDE_FLOAT64_C( -0.16)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 469.66), SIMDE_FLOAT64_C( 680.02),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 600.47),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( 254.31)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 123.43), SIMDE_FLOAT64_C( 7.37),
SIMDE_FLOAT64_C( -1.68), SIMDE_FLOAT64_C( -3.54),
SIMDE_FLOAT64_C( -1.67), SIMDE_FLOAT64_C( 0.45),
SIMDE_FLOAT64_C( -0.48), SIMDE_FLOAT64_C( -0.16)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_tan_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_tan_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -686.13), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 670.24), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( 39.01), SIMDE_FLOAT64_C( 346.63)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 84.77),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( -269.45),
SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( -297.45),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( -754.38)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 1.36), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -1.15), SIMDE_FLOAT64_C( -0.42)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( 233.37),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -384.03),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -417.54)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 841.21), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 687.09), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -660.80), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -923.64), SIMDE_FLOAT64_C( -860.95)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.91), SIMDE_FLOAT64_C( -6.68),
SIMDE_FLOAT64_C( -1.31), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 0.20),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -0.15)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 398.82), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 339.21), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( -30.79), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( 993.90)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( -387.90),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 532.35),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -770.35),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( -770.72)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -3.22), SIMDE_FLOAT64_C( -11.51),
SIMDE_FLOAT64_C( -0.88), SIMDE_FLOAT64_C( 6.62),
SIMDE_FLOAT64_C( 21.83), SIMDE_FLOAT64_C( -0.78),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( -1.66)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( 469.66),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 910.03),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( -203.65),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -747.59)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 543.35), SIMDE_FLOAT64_C( -171.51),
SIMDE_FLOAT64_C( 680.02), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 600.47), SIMDE_FLOAT64_C( 254.31),
SIMDE_FLOAT64_C( -80.73), SIMDE_FLOAT64_C( -944.78)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( 3.31),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( -3.54),
SIMDE_FLOAT64_C( 0.45), SIMDE_FLOAT64_C( -0.16),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( 1.11)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 99.93), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 343.48),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 655.67)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 331.34), SIMDE_FLOAT64_C( 462.95),
SIMDE_FLOAT64_C( -178.99), SIMDE_FLOAT64_C( 324.62),
SIMDE_FLOAT64_C( -874.31), SIMDE_FLOAT64_C( -328.54),
SIMDE_FLOAT64_C( -192.31), SIMDE_FLOAT64_C( 561.36)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 10.17), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 1.69),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( -1.51)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 27.25),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -448.89), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 831.02), SIMDE_FLOAT64_C( 977.36)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 977.49), SIMDE_FLOAT64_C( 424.81),
SIMDE_FLOAT64_C( -95.15), SIMDE_FLOAT64_C( 840.65),
SIMDE_FLOAT64_C( -591.56), SIMDE_FLOAT64_C( 731.49),
SIMDE_FLOAT64_C( 623.70), SIMDE_FLOAT64_C( 140.67)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 0.83),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -1.37), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( -10.62), SIMDE_FLOAT64_C( -0.85)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( -304.73),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( 822.06),
SIMDE_FLOAT64_C( -997.63), SIMDE_FLOAT64_C( 923.64),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -67.64)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 510.85), SIMDE_FLOAT64_C( 14.34),
SIMDE_FLOAT64_C( 916.26), SIMDE_FLOAT64_C( -769.09),
SIMDE_FLOAT64_C( -573.81), SIMDE_FLOAT64_C( -337.60),
SIMDE_FLOAT64_C( 293.64), SIMDE_FLOAT64_C( -576.22)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( -4.86),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( 0.68),
SIMDE_FLOAT64_C( 1.97), SIMDE_FLOAT64_C( -8.21),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -3.73)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 475.51), SIMDE_FLOAT64_C( 936.65),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -438.19),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( 932.66),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -182.45)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -775.04), SIMDE_FLOAT64_C( 440.64),
SIMDE_FLOAT64_C( 897.27), SIMDE_FLOAT64_C( -197.89),
SIMDE_FLOAT64_C( -359.76), SIMDE_FLOAT64_C( -33.67),
SIMDE_FLOAT64_C( 7.27), SIMDE_FLOAT64_C( -125.20)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 1.35), SIMDE_FLOAT64_C( 1.07),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( 1.23),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( 0.50)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_tan_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_tand_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.24)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -1.18), SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 0.68)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -3.60), SIMDE_FLOAT32_C( 1.90), SIMDE_FLOAT32_C( -104.17), SIMDE_FLOAT32_C( -3.12)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 10.92), SIMDE_FLOAT32_C( 0.61)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -0.43)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -11.01), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( -0.44)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 6.54), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -4.18), SIMDE_FLOAT32_C( 1.68)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 85.51), SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( -0.65)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_tand_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_tand_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.24)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( 0.81)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 1.93), SIMDE_FLOAT64_C( 0.68)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -1.18)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -104.17), SIMDE_FLOAT64_C( -3.12)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -3.60), SIMDE_FLOAT64_C( 1.90)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 10.92), SIMDE_FLOAT64_C( 0.61)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( 0.67)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_tand_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_tand_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24),
SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01),
SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -1.18),
SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.24)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13),
SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21),
SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( 10.92), SIMDE_FLOAT32_C( 0.61),
SIMDE_FLOAT32_C( -3.60), SIMDE_FLOAT32_C( 1.90),
SIMDE_FLOAT32_C( -104.17), SIMDE_FLOAT32_C( -3.12)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47),
SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95),
SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -11.01), SIMDE_FLOAT32_C( 0.54),
SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( -0.44),
SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 0.81),
SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -0.43)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67),
SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54),
SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 85.51),
SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( -0.65),
SIMDE_FLOAT32_C( 6.54), SIMDE_FLOAT32_C( -0.64),
SIMDE_FLOAT32_C( -4.18), SIMDE_FLOAT32_C( 1.68)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48),
SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90),
SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -1.21), SIMDE_FLOAT32_C( 8.75),
SIMDE_FLOAT32_C( -0.95), SIMDE_FLOAT32_C( 0.37),
SIMDE_FLOAT32_C( -1.22), SIMDE_FLOAT32_C( -14.67),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -1.65)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92),
SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.72),
SIMDE_FLOAT32_C( -2.06), SIMDE_FLOAT32_C( -0.38),
SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( -9.50),
SIMDE_FLOAT32_C( 1.78), SIMDE_FLOAT32_C( -0.60)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73),
SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -6.13),
SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( -0.99),
SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( -1.08),
SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( 0.80)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02),
SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -2.80), SIMDE_FLOAT32_C( -0.84),
SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( -6.57),
SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 1.77),
SIMDE_FLOAT32_C( 2.94), SIMDE_FLOAT32_C( 3.56)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_tand_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_tand_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.24)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -1.18),
SIMDE_FLOAT64_C( 1.93), SIMDE_FLOAT64_C( 0.68)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -3.60), SIMDE_FLOAT64_C( 1.90),
SIMDE_FLOAT64_C( -104.17), SIMDE_FLOAT64_C( -3.12)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( 0.67),
SIMDE_FLOAT64_C( 10.92), SIMDE_FLOAT64_C( 0.61)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 1.42), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -0.43)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -11.01), SIMDE_FLOAT64_C( 0.54),
SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( -0.44)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 6.54), SIMDE_FLOAT64_C( -0.64),
SIMDE_FLOAT64_C( -4.18), SIMDE_FLOAT64_C( 1.68)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( 85.51),
SIMDE_FLOAT64_C( 1.35), SIMDE_FLOAT64_C( -0.65)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_tand_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_tand_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 571.46),
SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( -269.45), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( 34.06),
SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( -754.38), SIMDE_FLOAT32_C( 346.63)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 10.92), SIMDE_FLOAT32_C( 0.61),
SIMDE_FLOAT32_C( -3.60), SIMDE_FLOAT32_C( 1.90), SIMDE_FLOAT32_C( -104.17), SIMDE_FLOAT32_C( -3.12),
SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -1.18), SIMDE_FLOAT32_C( 1.93), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.24)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 687.09),
SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -976.55), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -923.64),
SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -417.54), SIMDE_FLOAT32_C( 696.87)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 85.51), SIMDE_FLOAT32_C( 1.35), SIMDE_FLOAT32_C( -0.65),
SIMDE_FLOAT32_C( 6.54), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -4.18), SIMDE_FLOAT32_C( 1.68),
SIMDE_FLOAT32_C( -11.01), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( -0.44),
SIMDE_FLOAT32_C( 1.42), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( -1.57), SIMDE_FLOAT32_C( -0.43)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 339.21),
SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( -263.99), SIMDE_FLOAT32_C( 780.64), SIMDE_FLOAT32_C( -30.79),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( 380.46),
SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 993.90), SIMDE_FLOAT32_C( 28.08), SIMDE_FLOAT32_C( 841.21)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( -2.06), SIMDE_FLOAT32_C( -0.38),
SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( -9.50), SIMDE_FLOAT32_C( 1.78), SIMDE_FLOAT32_C( -0.60),
SIMDE_FLOAT32_C( -1.21), SIMDE_FLOAT32_C( 8.75), SIMDE_FLOAT32_C( -0.95), SIMDE_FLOAT32_C( 0.37),
SIMDE_FLOAT32_C( -1.22), SIMDE_FLOAT32_C( -14.67), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -1.65)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 818.66),
SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 600.47), SIMDE_FLOAT32_C( 791.23), SIMDE_FLOAT32_C( 254.31),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -944.78),
SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -767.23), SIMDE_FLOAT32_C( -554.19), SIMDE_FLOAT32_C( 398.82)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -2.80), SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( -6.57),
SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 1.77), SIMDE_FLOAT32_C( 2.94), SIMDE_FLOAT32_C( 3.56),
SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -6.13), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( -0.99),
SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( -1.08), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( 0.80)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( 343.48),
SIMDE_FLOAT32_C( -874.31), SIMDE_FLOAT32_C( -797.92), SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -525.83),
SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( 655.67),
SIMDE_FLOAT32_C( -70.91), SIMDE_FLOAT32_C( 543.35), SIMDE_FLOAT32_C( 120.65), SIMDE_FLOAT32_C( -171.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( -0.30),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -4.67), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.25),
SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( 4.46), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( -2.08),
SIMDE_FLOAT32_C( -2.89), SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( -1.69), SIMDE_FLOAT32_C( 0.15)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -591.56),
SIMDE_FLOAT32_C( -448.89), SIMDE_FLOAT32_C( 731.49), SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 623.70),
SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( -906.16),
SIMDE_FLOAT32_C( 331.34), SIMDE_FLOAT32_C( 99.93), SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -738.19)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -1.69), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( -1.26),
SIMDE_FLOAT32_C( -51.61), SIMDE_FLOAT32_C( 0.20), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( 9.06),
SIMDE_FLOAT32_C( -2.60), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 4.46), SIMDE_FLOAT32_C( -0.11),
SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( -5.71), SIMDE_FLOAT32_C( -4.35), SIMDE_FLOAT32_C( -0.33)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( -337.60), SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -768.12),
SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( 977.49),
SIMDE_FLOAT32_C( -756.42), SIMDE_FLOAT32_C( 424.81), SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( -95.15)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -1.15), SIMDE_FLOAT32_C( -4.68), SIMDE_FLOAT32_C( -0.67), SIMDE_FLOAT32_C( 7.46),
SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -2.28), SIMDE_FLOAT32_C( -1.12),
SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( -2.43), SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 4.51),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 2.13), SIMDE_FLOAT32_C( 0.52), SIMDE_FLOAT32_C( 11.10)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 932.66), SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -125.20),
SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( 14.34), SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -696.69)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -4.78), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.67),
SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 1.42),
SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.56), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 1.44), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( 0.43)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_tand_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_tand_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -450.67), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( -212.54), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( 696.87),
SIMDE_FLOAT32_C( -686.13), SIMDE_FLOAT32_C( 571.46), SIMDE_FLOAT32_C( 422.21), SIMDE_FLOAT32_C( 467.76),
SIMDE_FLOAT32_C( 670.24), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( 39.01), SIMDE_FLOAT32_C( 346.63)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 178.20), SIMDE_FLOAT32_C( 233.37), SIMDE_FLOAT32_C( 261.31), SIMDE_FLOAT32_C( -976.55),
SIMDE_FLOAT32_C( -444.81), SIMDE_FLOAT32_C( -384.03), SIMDE_FLOAT32_C( -305.07), SIMDE_FLOAT32_C( -417.54),
SIMDE_FLOAT32_C( -678.17), SIMDE_FLOAT32_C( 84.77), SIMDE_FLOAT32_C( 825.53), SIMDE_FLOAT32_C( -269.45),
SIMDE_FLOAT32_C( 497.31), SIMDE_FLOAT32_C( -297.45), SIMDE_FLOAT32_C( -186.21), SIMDE_FLOAT32_C( -754.38)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 687.09), SIMDE_FLOAT32_C( 6.54), SIMDE_FLOAT32_C( -660.80),
SIMDE_FLOAT32_C( 28.47), SIMDE_FLOAT32_C( -923.64), SIMDE_FLOAT32_C( -860.95), SIMDE_FLOAT32_C( -1.57),
SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( 10.92), SIMDE_FLOAT32_C( -3.60), SIMDE_FLOAT32_C( -104.17),
SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( 34.06), SIMDE_FLOAT32_C( -0.11), SIMDE_FLOAT32_C( 346.63)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 469.66), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( -203.65), SIMDE_FLOAT32_C( 336.73), SIMDE_FLOAT32_C( -747.59), SIMDE_FLOAT32_C( -554.19),
SIMDE_FLOAT32_C( -387.90), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( 532.35), SIMDE_FLOAT32_C( 780.64),
SIMDE_FLOAT32_C( -770.35), SIMDE_FLOAT32_C( -583.60), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( 28.08)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -171.51), SIMDE_FLOAT32_C( 680.02), SIMDE_FLOAT32_C( 818.66), SIMDE_FLOAT32_C( 600.47),
SIMDE_FLOAT32_C( 254.31), SIMDE_FLOAT32_C( -80.73), SIMDE_FLOAT32_C( -944.78), SIMDE_FLOAT32_C( -767.23),
SIMDE_FLOAT32_C( 398.82), SIMDE_FLOAT32_C( 395.92), SIMDE_FLOAT32_C( 339.21), SIMDE_FLOAT32_C( -263.99),
SIMDE_FLOAT32_C( -30.79), SIMDE_FLOAT32_C( 443.48), SIMDE_FLOAT32_C( 380.46), SIMDE_FLOAT32_C( 993.90)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.15), SIMDE_FLOAT32_C( -148.69), SIMDE_FLOAT32_C( 910.03), SIMDE_FLOAT32_C( 791.23),
SIMDE_FLOAT32_C( 3.56), SIMDE_FLOAT32_C( -6.13), SIMDE_FLOAT32_C( -0.99), SIMDE_FLOAT32_C( -1.08),
SIMDE_FLOAT32_C( 0.80), SIMDE_FLOAT32_C( 655.87), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -9.50),
SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 8.75), SIMDE_FLOAT32_C( -770.72), SIMDE_FLOAT32_C( -14.67)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( 840.65), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( 99.93),
SIMDE_FLOAT32_C( -738.19), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( 343.48), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -822.65), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 27.25), SIMDE_FLOAT32_C( 690.12), SIMDE_FLOAT32_C( -21.09), SIMDE_FLOAT32_C( -448.89),
SIMDE_FLOAT32_C( 505.79), SIMDE_FLOAT32_C( 831.02), SIMDE_FLOAT32_C( 977.36), SIMDE_FLOAT32_C( 331.34),
SIMDE_FLOAT32_C( 462.95), SIMDE_FLOAT32_C( -178.99), SIMDE_FLOAT32_C( 324.62), SIMDE_FLOAT32_C( -874.31),
SIMDE_FLOAT32_C( -328.54), SIMDE_FLOAT32_C( -192.31), SIMDE_FLOAT32_C( 561.36), SIMDE_FLOAT32_C( -70.91)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -95.15), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -591.56), SIMDE_FLOAT32_C( 731.49),
SIMDE_FLOAT32_C( 623.70), SIMDE_FLOAT32_C( 140.67), SIMDE_FLOAT32_C( -906.16), SIMDE_FLOAT32_C( -0.55),
SIMDE_FLOAT32_C( -4.35), SIMDE_FLOAT32_C( 758.79), SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( -797.92),
SIMDE_FLOAT32_C( -525.83), SIMDE_FLOAT32_C( -0.22), SIMDE_FLOAT32_C( 655.67), SIMDE_FLOAT32_C( 543.35)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( -327.22), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -573.81), SIMDE_FLOAT32_C( -337.60),
SIMDE_FLOAT32_C( 293.64), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( 710.38), SIMDE_FLOAT32_C( -756.42)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 897.27), SIMDE_FLOAT32_C( -197.89), SIMDE_FLOAT32_C( -359.76), SIMDE_FLOAT32_C( -33.67),
SIMDE_FLOAT32_C( 7.27), SIMDE_FLOAT32_C( -125.20), SIMDE_FLOAT32_C( 39.93), SIMDE_FLOAT32_C( 394.67),
SIMDE_FLOAT32_C( -304.73), SIMDE_FLOAT32_C( -696.69), SIMDE_FLOAT32_C( 822.06), SIMDE_FLOAT32_C( -997.63),
SIMDE_FLOAT32_C( 923.64), SIMDE_FLOAT32_C( -768.12), SIMDE_FLOAT32_C( -67.64), SIMDE_FLOAT32_C( 977.49)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -348.70), SIMDE_FLOAT32_C( -438.19), SIMDE_FLOAT32_C( -752.43), SIMDE_FLOAT32_C( 932.66),
SIMDE_FLOAT32_C( 0.13), SIMDE_FLOAT32_C( -182.45), SIMDE_FLOAT32_C( 510.85), SIMDE_FLOAT32_C( 14.34),
SIMDE_FLOAT32_C( 916.26), SIMDE_FLOAT32_C( -769.09), SIMDE_FLOAT32_C( -4.68), SIMDE_FLOAT32_C( 7.46),
SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -576.22), SIMDE_FLOAT32_C( -2.43), SIMDE_FLOAT32_C( 4.51)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( -737.13), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( 177.92),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 888.71), SIMDE_FLOAT32_C( 915.71), SIMDE_FLOAT32_C( 133.52),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( -775.04), SIMDE_FLOAT32_C( 440.64)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 496.57), SIMDE_FLOAT32_C( 915.19), SIMDE_FLOAT32_C( -718.40), SIMDE_FLOAT32_C( 159.97),
SIMDE_FLOAT32_C( -861.01), SIMDE_FLOAT32_C( 426.61), SIMDE_FLOAT32_C( 932.11), SIMDE_FLOAT32_C( 110.36),
SIMDE_FLOAT32_C( 826.84), SIMDE_FLOAT32_C( -76.75), SIMDE_FLOAT32_C( 237.58), SIMDE_FLOAT32_C( -378.50),
SIMDE_FLOAT32_C( -601.68), SIMDE_FLOAT32_C( -623.50), SIMDE_FLOAT32_C( -942.47), SIMDE_FLOAT32_C( 475.51)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -15.61), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -314.93), SIMDE_FLOAT32_C( -0.36),
SIMDE_FLOAT32_C( 345.93), SIMDE_FLOAT32_C( 2.31), SIMDE_FLOAT32_C( 0.63), SIMDE_FLOAT32_C( -2.69),
SIMDE_FLOAT32_C( 484.94), SIMDE_FLOAT32_C( -598.06), SIMDE_FLOAT32_C( -791.07), SIMDE_FLOAT32_C( -765.93),
SIMDE_FLOAT32_C( 221.37), SIMDE_FLOAT32_C( -788.36), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( 440.64)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -807.28), SIMDE_FLOAT32_C( -70.05), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 92.52), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( 834.60), SIMDE_FLOAT32_C( -65.60),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( 556.35), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -964.25), SIMDE_FLOAT32_C( -406.33), SIMDE_FLOAT32_C( -743.66), SIMDE_FLOAT32_C( -764.58),
SIMDE_FLOAT32_C( 789.89), SIMDE_FLOAT32_C( 4.83), SIMDE_FLOAT32_C( -818.54), SIMDE_FLOAT32_C( 161.06),
SIMDE_FLOAT32_C( 579.25), SIMDE_FLOAT32_C( -11.78), SIMDE_FLOAT32_C( -308.52), SIMDE_FLOAT32_C( -719.57),
SIMDE_FLOAT32_C( 334.00), SIMDE_FLOAT32_C( 274.71), SIMDE_FLOAT32_C( -916.82), SIMDE_FLOAT32_C( -490.00)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 883.05), SIMDE_FLOAT32_C( -1.05), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -784.34),
SIMDE_FLOAT32_C( 2.73), SIMDE_FLOAT32_C( 206.60), SIMDE_FLOAT32_C( 6.66), SIMDE_FLOAT32_C( -0.34),
SIMDE_FLOAT32_C( -286.07), SIMDE_FLOAT32_C( -212.86), SIMDE_FLOAT32_C( -318.38), SIMDE_FLOAT32_C( 783.48),
SIMDE_FLOAT32_C( -628.82), SIMDE_FLOAT32_C( -12.14), SIMDE_FLOAT32_C( 439.43), SIMDE_FLOAT32_C( 434.03)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 105.79), SIMDE_FLOAT32_C( 590.10),
SIMDE_FLOAT32_C( 30.91), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -84.00), SIMDE_FLOAT32_C( 80.04),
SIMDE_FLOAT32_C( -709.46), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -889.11)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 18.75), SIMDE_FLOAT32_C( 809.05), SIMDE_FLOAT32_C( 144.05), SIMDE_FLOAT32_C( -427.72),
SIMDE_FLOAT32_C( 308.28), SIMDE_FLOAT32_C( -177.05), SIMDE_FLOAT32_C( -457.77), SIMDE_FLOAT32_C( 678.24),
SIMDE_FLOAT32_C( 66.05), SIMDE_FLOAT32_C( -267.71), SIMDE_FLOAT32_C( 117.28), SIMDE_FLOAT32_C( -576.80),
SIMDE_FLOAT32_C( -38.39), SIMDE_FLOAT32_C( -250.14), SIMDE_FLOAT32_C( -53.92), SIMDE_FLOAT32_C( 91.94)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 529.63), SIMDE_FLOAT32_C( -24.89), SIMDE_FLOAT32_C( -967.78), SIMDE_FLOAT32_C( 638.94),
SIMDE_FLOAT32_C( 450.90), SIMDE_FLOAT32_C( -771.54), SIMDE_FLOAT32_C( 7.33), SIMDE_FLOAT32_C( -0.89),
SIMDE_FLOAT32_C( 2.25), SIMDE_FLOAT32_C( 635.35), SIMDE_FLOAT32_C( -1.94), SIMDE_FLOAT32_C( -0.75),
SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 607.86), SIMDE_FLOAT32_C( 394.58), SIMDE_FLOAT32_C( -29.52)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( -493.41), SIMDE_FLOAT32_C( 822.72),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( -816.27),
SIMDE_FLOAT32_C( -209.34), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -728.70), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( 100.32), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( -204.33)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -841.43), SIMDE_FLOAT32_C( -14.16), SIMDE_FLOAT32_C( 824.88), SIMDE_FLOAT32_C( 793.63),
SIMDE_FLOAT32_C( -736.75), SIMDE_FLOAT32_C( -310.57), SIMDE_FLOAT32_C( 728.87), SIMDE_FLOAT32_C( -350.72),
SIMDE_FLOAT32_C( 60.89), SIMDE_FLOAT32_C( 109.81), SIMDE_FLOAT32_C( 715.94), SIMDE_FLOAT32_C( -250.60),
SIMDE_FLOAT32_C( 944.14), SIMDE_FLOAT32_C( 361.85), SIMDE_FLOAT32_C( -13.07), SIMDE_FLOAT32_C( 852.60)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -788.39), SIMDE_FLOAT32_C( 330.43), SIMDE_FLOAT32_C( -3.76), SIMDE_FLOAT32_C( 3.40),
SIMDE_FLOAT32_C( 956.68), SIMDE_FLOAT32_C( 954.62), SIMDE_FLOAT32_C( 825.49), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 1.80), SIMDE_FLOAT32_C( -933.21), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( -420.06),
SIMDE_FLOAT32_C( 0.97), SIMDE_FLOAT32_C( 103.15), SIMDE_FLOAT32_C( 439.77), SIMDE_FLOAT32_C( -1.09)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_tand_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_tand_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( 670.24),
SIMDE_FLOAT64_C( -297.45), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( 39.01),
SIMDE_FLOAT64_C( -754.38), SIMDE_FLOAT64_C( 346.63)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -1.18),
SIMDE_FLOAT64_C( 1.93), SIMDE_FLOAT64_C( 0.68),
SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.24)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( -686.13),
SIMDE_FLOAT64_C( 84.77), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( 422.21),
SIMDE_FLOAT64_C( -269.45), SIMDE_FLOAT64_C( 467.76)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( 0.67),
SIMDE_FLOAT64_C( 10.92), SIMDE_FLOAT64_C( 0.61),
SIMDE_FLOAT64_C( -3.60), SIMDE_FLOAT64_C( 1.90),
SIMDE_FLOAT64_C( -104.17), SIMDE_FLOAT64_C( -3.12)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -384.03), SIMDE_FLOAT64_C( -923.64),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -860.95),
SIMDE_FLOAT64_C( -417.54), SIMDE_FLOAT64_C( 696.87)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -11.01), SIMDE_FLOAT64_C( 0.54),
SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( -0.44),
SIMDE_FLOAT64_C( 1.42), SIMDE_FLOAT64_C( 0.81),
SIMDE_FLOAT64_C( -1.57), SIMDE_FLOAT64_C( -0.43)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 233.37), SIMDE_FLOAT64_C( 687.09),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -976.55), SIMDE_FLOAT64_C( -660.80)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( 85.51),
SIMDE_FLOAT64_C( 1.35), SIMDE_FLOAT64_C( -0.65),
SIMDE_FLOAT64_C( 6.54), SIMDE_FLOAT64_C( -0.64),
SIMDE_FLOAT64_C( -4.18), SIMDE_FLOAT64_C( 1.68)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -770.35), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( 380.46),
SIMDE_FLOAT64_C( -770.72), SIMDE_FLOAT64_C( 993.90),
SIMDE_FLOAT64_C( 28.08), SIMDE_FLOAT64_C( 841.21)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.21), SIMDE_FLOAT64_C( 8.75),
SIMDE_FLOAT64_C( -0.95), SIMDE_FLOAT64_C( 0.37),
SIMDE_FLOAT64_C( -1.22), SIMDE_FLOAT64_C( -14.67),
SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( -1.65)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -387.90), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 339.21),
SIMDE_FLOAT64_C( 532.35), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -30.79)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.53), SIMDE_FLOAT64_C( 0.72),
SIMDE_FLOAT64_C( -2.06), SIMDE_FLOAT64_C( -0.38),
SIMDE_FLOAT64_C( -0.13), SIMDE_FLOAT64_C( -9.50),
SIMDE_FLOAT64_C( 1.78), SIMDE_FLOAT64_C( -0.60)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -203.65), SIMDE_FLOAT64_C( -80.73),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -944.78),
SIMDE_FLOAT64_C( -747.59), SIMDE_FLOAT64_C( -767.23),
SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( 398.82)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( -6.13),
SIMDE_FLOAT64_C( -0.43), SIMDE_FLOAT64_C( -0.99),
SIMDE_FLOAT64_C( -0.52), SIMDE_FLOAT64_C( -1.08),
SIMDE_FLOAT64_C( -0.25), SIMDE_FLOAT64_C( 0.80)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 469.66), SIMDE_FLOAT64_C( 680.02),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 910.03), SIMDE_FLOAT64_C( 600.47),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( 254.31)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -2.80), SIMDE_FLOAT64_C( -0.84),
SIMDE_FLOAT64_C( 0.61), SIMDE_FLOAT64_C( -6.57),
SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 1.77),
SIMDE_FLOAT64_C( 2.94), SIMDE_FLOAT64_C( 3.56)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_tand_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_tand_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -686.13), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( 670.24), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( 39.01), SIMDE_FLOAT64_C( 346.63)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -678.17), SIMDE_FLOAT64_C( 84.77),
SIMDE_FLOAT64_C( 825.53), SIMDE_FLOAT64_C( -269.45),
SIMDE_FLOAT64_C( 497.31), SIMDE_FLOAT64_C( -297.45),
SIMDE_FLOAT64_C( -186.21), SIMDE_FLOAT64_C( -754.38)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( 571.46),
SIMDE_FLOAT64_C( 422.21), SIMDE_FLOAT64_C( 467.76),
SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( 34.06),
SIMDE_FLOAT64_C( -0.11), SIMDE_FLOAT64_C( -0.68)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 178.20), SIMDE_FLOAT64_C( 233.37),
SIMDE_FLOAT64_C( 261.31), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( -384.03),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( -417.54)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 841.21), SIMDE_FLOAT64_C( -450.67),
SIMDE_FLOAT64_C( 687.09), SIMDE_FLOAT64_C( -212.54),
SIMDE_FLOAT64_C( -660.80), SIMDE_FLOAT64_C( 28.47),
SIMDE_FLOAT64_C( -923.64), SIMDE_FLOAT64_C( -860.95)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.65), SIMDE_FLOAT64_C( 85.51),
SIMDE_FLOAT64_C( -0.65), SIMDE_FLOAT64_C( -976.55),
SIMDE_FLOAT64_C( -444.81), SIMDE_FLOAT64_C( 0.54),
SIMDE_FLOAT64_C( -305.07), SIMDE_FLOAT64_C( 0.81)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 398.82), SIMDE_FLOAT64_C( 395.92),
SIMDE_FLOAT64_C( 339.21), SIMDE_FLOAT64_C( -263.99),
SIMDE_FLOAT64_C( -30.79), SIMDE_FLOAT64_C( 443.48),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( 993.90)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -554.19), SIMDE_FLOAT64_C( -387.90),
SIMDE_FLOAT64_C( 655.87), SIMDE_FLOAT64_C( 532.35),
SIMDE_FLOAT64_C( 780.64), SIMDE_FLOAT64_C( -770.35),
SIMDE_FLOAT64_C( -583.60), SIMDE_FLOAT64_C( -770.72)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.25), SIMDE_FLOAT64_C( -0.53),
SIMDE_FLOAT64_C( -2.06), SIMDE_FLOAT64_C( -0.13),
SIMDE_FLOAT64_C( 1.78), SIMDE_FLOAT64_C( -1.21),
SIMDE_FLOAT64_C( 380.46), SIMDE_FLOAT64_C( -1.22)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( 469.66),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( 910.03),
SIMDE_FLOAT64_C( 791.23), SIMDE_FLOAT64_C( -203.65),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -747.59)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 543.35), SIMDE_FLOAT64_C( -171.51),
SIMDE_FLOAT64_C( 680.02), SIMDE_FLOAT64_C( 818.66),
SIMDE_FLOAT64_C( 600.47), SIMDE_FLOAT64_C( 254.31),
SIMDE_FLOAT64_C( -80.73), SIMDE_FLOAT64_C( -944.78)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 120.65), SIMDE_FLOAT64_C( 0.15),
SIMDE_FLOAT64_C( -148.69), SIMDE_FLOAT64_C( -6.57),
SIMDE_FLOAT64_C( 1.77), SIMDE_FLOAT64_C( 3.56),
SIMDE_FLOAT64_C( 336.73), SIMDE_FLOAT64_C( -0.99)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 99.93), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( 343.48),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 655.67)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 331.34), SIMDE_FLOAT64_C( 462.95),
SIMDE_FLOAT64_C( -178.99), SIMDE_FLOAT64_C( 324.62),
SIMDE_FLOAT64_C( -874.31), SIMDE_FLOAT64_C( -328.54),
SIMDE_FLOAT64_C( -192.31), SIMDE_FLOAT64_C( 561.36)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( -738.19),
SIMDE_FLOAT64_C( 758.79), SIMDE_FLOAT64_C( -0.71),
SIMDE_FLOAT64_C( -797.92), SIMDE_FLOAT64_C( -525.83),
SIMDE_FLOAT64_C( -822.65), SIMDE_FLOAT64_C( 0.39)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 27.25),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -448.89), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 831.02), SIMDE_FLOAT64_C( 977.36)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 977.49), SIMDE_FLOAT64_C( 424.81),
SIMDE_FLOAT64_C( -95.15), SIMDE_FLOAT64_C( 840.65),
SIMDE_FLOAT64_C( -591.56), SIMDE_FLOAT64_C( 731.49),
SIMDE_FLOAT64_C( 623.70), SIMDE_FLOAT64_C( 140.67)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -756.42), SIMDE_FLOAT64_C( 2.13),
SIMDE_FLOAT64_C( 690.12), SIMDE_FLOAT64_C( -21.09),
SIMDE_FLOAT64_C( -1.26), SIMDE_FLOAT64_C( 505.79),
SIMDE_FLOAT64_C( 9.06), SIMDE_FLOAT64_C( -0.82)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( -304.73),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( 822.06),
SIMDE_FLOAT64_C( -997.63), SIMDE_FLOAT64_C( 923.64),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -67.64)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 510.85), SIMDE_FLOAT64_C( 14.34),
SIMDE_FLOAT64_C( 916.26), SIMDE_FLOAT64_C( -769.09),
SIMDE_FLOAT64_C( -573.81), SIMDE_FLOAT64_C( -337.60),
SIMDE_FLOAT64_C( 293.64), SIMDE_FLOAT64_C( -576.22)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 394.67), SIMDE_FLOAT64_C( 0.26),
SIMDE_FLOAT64_C( -696.69), SIMDE_FLOAT64_C( -1.15),
SIMDE_FLOAT64_C( -0.67), SIMDE_FLOAT64_C( 0.41),
SIMDE_FLOAT64_C( -768.12), SIMDE_FLOAT64_C( -0.73)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 475.51), SIMDE_FLOAT64_C( 936.65),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -438.19),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( 932.66),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( -182.45)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -775.04), SIMDE_FLOAT64_C( 440.64),
SIMDE_FLOAT64_C( 897.27), SIMDE_FLOAT64_C( -197.89),
SIMDE_FLOAT64_C( -359.76), SIMDE_FLOAT64_C( -33.67),
SIMDE_FLOAT64_C( 7.27), SIMDE_FLOAT64_C( -125.20)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -1.43), SIMDE_FLOAT64_C( 6.07),
SIMDE_FLOAT64_C( -348.70), SIMDE_FLOAT64_C( -0.32),
SIMDE_FLOAT64_C( -752.43), SIMDE_FLOAT64_C( -0.67),
SIMDE_FLOAT64_C( -327.22), SIMDE_FLOAT64_C( 1.42)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_tand_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_trunc_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[4];
const simde_float32 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -12.21), SIMDE_FLOAT32_C( -120.46), SIMDE_FLOAT32_C( 116.11), SIMDE_FLOAT32_C( -957.73) },
{ SIMDE_FLOAT32_C( -12.00), SIMDE_FLOAT32_C( -120.00), SIMDE_FLOAT32_C( 116.00), SIMDE_FLOAT32_C( -957.00) } },
{ { SIMDE_FLOAT32_C( -970.43), SIMDE_FLOAT32_C( 73.72), SIMDE_FLOAT32_C( 741.23), SIMDE_FLOAT32_C( -161.72) },
{ SIMDE_FLOAT32_C( -970.00), SIMDE_FLOAT32_C( 73.00), SIMDE_FLOAT32_C( 741.00), SIMDE_FLOAT32_C( -161.00) } },
{ { SIMDE_FLOAT32_C( -669.85), SIMDE_FLOAT32_C( 861.65), SIMDE_FLOAT32_C( 481.06), SIMDE_FLOAT32_C( -607.16) },
{ SIMDE_FLOAT32_C( -669.00), SIMDE_FLOAT32_C( 861.00), SIMDE_FLOAT32_C( 481.00), SIMDE_FLOAT32_C( -607.00) } },
{ { SIMDE_FLOAT32_C( 227.64), SIMDE_FLOAT32_C( -106.69), SIMDE_FLOAT32_C( -76.28), SIMDE_FLOAT32_C( 195.74) },
{ SIMDE_FLOAT32_C( 227.00), SIMDE_FLOAT32_C( -106.00), SIMDE_FLOAT32_C( -76.00), SIMDE_FLOAT32_C( 195.00) } },
{ { SIMDE_FLOAT32_C( -755.50), SIMDE_FLOAT32_C( -618.75), SIMDE_FLOAT32_C( -293.56), SIMDE_FLOAT32_C( -686.30) },
{ SIMDE_FLOAT32_C( -755.00), SIMDE_FLOAT32_C( -618.00), SIMDE_FLOAT32_C( -293.00), SIMDE_FLOAT32_C( -686.00) } },
{ { SIMDE_FLOAT32_C( -454.44), SIMDE_FLOAT32_C( -493.17), SIMDE_FLOAT32_C( 45.88), SIMDE_FLOAT32_C( -307.36) },
{ SIMDE_FLOAT32_C( -454.00), SIMDE_FLOAT32_C( -493.00), SIMDE_FLOAT32_C( 45.00), SIMDE_FLOAT32_C( -307.00) } },
{ { SIMDE_FLOAT32_C( -593.72), SIMDE_FLOAT32_C( -346.10), SIMDE_FLOAT32_C( -356.52), SIMDE_FLOAT32_C( -727.29) },
{ SIMDE_FLOAT32_C( -593.00), SIMDE_FLOAT32_C( -346.00), SIMDE_FLOAT32_C( -356.00), SIMDE_FLOAT32_C( -727.00) } },
{ { SIMDE_FLOAT32_C( 304.91), SIMDE_FLOAT32_C( 961.56), SIMDE_FLOAT32_C( 582.51), SIMDE_FLOAT32_C( -707.29) },
{ SIMDE_FLOAT32_C( 304.00), SIMDE_FLOAT32_C( 961.00), SIMDE_FLOAT32_C( 582.00), SIMDE_FLOAT32_C( -707.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128 a = simde_mm_loadu_ps(test_vec[i].a);
simde__m128 r = simde_mm_trunc_ps(a);
simde_test_x86_assert_equal_f32x4(r, simde_mm_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_trunc_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[2];
const simde_float64 r[2];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( -536.40), SIMDE_FLOAT64_C( -763.02) },
{ SIMDE_FLOAT64_C( -536.00), SIMDE_FLOAT64_C( -763.00) } },
{ { SIMDE_FLOAT64_C( -999.42), SIMDE_FLOAT64_C( -310.98) },
{ SIMDE_FLOAT64_C( -999.00), SIMDE_FLOAT64_C( -310.00) } },
{ { SIMDE_FLOAT64_C( -951.25), SIMDE_FLOAT64_C( 277.33) },
{ SIMDE_FLOAT64_C( -951.00), SIMDE_FLOAT64_C( 277.00) } },
{ { SIMDE_FLOAT64_C( -98.58), SIMDE_FLOAT64_C( -936.47) },
{ SIMDE_FLOAT64_C( -98.00), SIMDE_FLOAT64_C( -936.00) } },
{ { SIMDE_FLOAT64_C( -124.20), SIMDE_FLOAT64_C( -990.68) },
{ SIMDE_FLOAT64_C( -124.00), SIMDE_FLOAT64_C( -990.00) } },
{ { SIMDE_FLOAT64_C( -319.44), SIMDE_FLOAT64_C( 434.58) },
{ SIMDE_FLOAT64_C( -319.00), SIMDE_FLOAT64_C( 434.00) } },
{ { SIMDE_FLOAT64_C( 209.02), SIMDE_FLOAT64_C( 196.07) },
{ SIMDE_FLOAT64_C( 209.00), SIMDE_FLOAT64_C( 196.00) } },
{ { SIMDE_FLOAT64_C( -740.77), SIMDE_FLOAT64_C( 179.41) },
{ SIMDE_FLOAT64_C( -740.00), SIMDE_FLOAT64_C( 179.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m128d a = simde_mm_loadu_pd(test_vec[i].a);
simde__m128d r = simde_mm_trunc_pd(a);
simde_test_x86_assert_equal_f64x2(r, simde_mm_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_trunc_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[8];
const simde_float32 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -239.01), SIMDE_FLOAT32_C( -492.80), SIMDE_FLOAT32_C( -937.05), SIMDE_FLOAT32_C( -286.30),
SIMDE_FLOAT32_C( 826.89), SIMDE_FLOAT32_C( 311.87), SIMDE_FLOAT32_C( -290.83), SIMDE_FLOAT32_C( 155.81) },
{ SIMDE_FLOAT32_C( -239.00), SIMDE_FLOAT32_C( -492.00), SIMDE_FLOAT32_C( -937.00), SIMDE_FLOAT32_C( -286.00),
SIMDE_FLOAT32_C( 826.00), SIMDE_FLOAT32_C( 311.00), SIMDE_FLOAT32_C( -290.00), SIMDE_FLOAT32_C( 155.00) } },
{ { SIMDE_FLOAT32_C( 497.98), SIMDE_FLOAT32_C( 770.36), SIMDE_FLOAT32_C( -368.92), SIMDE_FLOAT32_C( -362.61),
SIMDE_FLOAT32_C( -693.36), SIMDE_FLOAT32_C( -206.15), SIMDE_FLOAT32_C( -571.56), SIMDE_FLOAT32_C( -305.34) },
{ SIMDE_FLOAT32_C( 497.00), SIMDE_FLOAT32_C( 770.00), SIMDE_FLOAT32_C( -368.00), SIMDE_FLOAT32_C( -362.00),
SIMDE_FLOAT32_C( -693.00), SIMDE_FLOAT32_C( -206.00), SIMDE_FLOAT32_C( -571.00), SIMDE_FLOAT32_C( -305.00) } },
{ { SIMDE_FLOAT32_C( -237.16), SIMDE_FLOAT32_C( 968.44), SIMDE_FLOAT32_C( -77.70), SIMDE_FLOAT32_C( 170.55),
SIMDE_FLOAT32_C( -930.56), SIMDE_FLOAT32_C( 755.06), SIMDE_FLOAT32_C( 78.43), SIMDE_FLOAT32_C( -634.89) },
{ SIMDE_FLOAT32_C( -237.00), SIMDE_FLOAT32_C( 968.00), SIMDE_FLOAT32_C( -77.00), SIMDE_FLOAT32_C( 170.00),
SIMDE_FLOAT32_C( -930.00), SIMDE_FLOAT32_C( 755.00), SIMDE_FLOAT32_C( 78.00), SIMDE_FLOAT32_C( -634.00) } },
{ { SIMDE_FLOAT32_C( 107.17), SIMDE_FLOAT32_C( 191.02), SIMDE_FLOAT32_C( -424.61), SIMDE_FLOAT32_C( -603.58),
SIMDE_FLOAT32_C( -501.82), SIMDE_FLOAT32_C( -855.61), SIMDE_FLOAT32_C( 927.91), SIMDE_FLOAT32_C( 259.17) },
{ SIMDE_FLOAT32_C( 107.00), SIMDE_FLOAT32_C( 191.00), SIMDE_FLOAT32_C( -424.00), SIMDE_FLOAT32_C( -603.00),
SIMDE_FLOAT32_C( -501.00), SIMDE_FLOAT32_C( -855.00), SIMDE_FLOAT32_C( 927.00), SIMDE_FLOAT32_C( 259.00) } },
{ { SIMDE_FLOAT32_C( -348.41), SIMDE_FLOAT32_C( 990.86), SIMDE_FLOAT32_C( 972.87), SIMDE_FLOAT32_C( -521.52),
SIMDE_FLOAT32_C( 302.73), SIMDE_FLOAT32_C( -317.96), SIMDE_FLOAT32_C( 634.29), SIMDE_FLOAT32_C( -199.28) },
{ SIMDE_FLOAT32_C( -348.00), SIMDE_FLOAT32_C( 990.00), SIMDE_FLOAT32_C( 972.00), SIMDE_FLOAT32_C( -521.00),
SIMDE_FLOAT32_C( 302.00), SIMDE_FLOAT32_C( -317.00), SIMDE_FLOAT32_C( 634.00), SIMDE_FLOAT32_C( -199.00) } },
{ { SIMDE_FLOAT32_C( -547.60), SIMDE_FLOAT32_C( -734.63), SIMDE_FLOAT32_C( 438.11), SIMDE_FLOAT32_C( -240.96),
SIMDE_FLOAT32_C( 59.22), SIMDE_FLOAT32_C( 866.55), SIMDE_FLOAT32_C( 453.70), SIMDE_FLOAT32_C( 822.06) },
{ SIMDE_FLOAT32_C( -547.00), SIMDE_FLOAT32_C( -734.00), SIMDE_FLOAT32_C( 438.00), SIMDE_FLOAT32_C( -240.00),
SIMDE_FLOAT32_C( 59.00), SIMDE_FLOAT32_C( 866.00), SIMDE_FLOAT32_C( 453.00), SIMDE_FLOAT32_C( 822.00) } },
{ { SIMDE_FLOAT32_C( 834.99), SIMDE_FLOAT32_C( -624.00), SIMDE_FLOAT32_C( -7.39), SIMDE_FLOAT32_C( 904.43),
SIMDE_FLOAT32_C( -868.94), SIMDE_FLOAT32_C( -928.96), SIMDE_FLOAT32_C( -730.46), SIMDE_FLOAT32_C( 238.23) },
{ SIMDE_FLOAT32_C( 834.00), SIMDE_FLOAT32_C( -624.00), SIMDE_FLOAT32_C( -7.00), SIMDE_FLOAT32_C( 904.00),
SIMDE_FLOAT32_C( -868.00), SIMDE_FLOAT32_C( -928.00), SIMDE_FLOAT32_C( -730.00), SIMDE_FLOAT32_C( 238.00) } },
{ { SIMDE_FLOAT32_C( 262.05), SIMDE_FLOAT32_C( -155.07), SIMDE_FLOAT32_C( 634.65), SIMDE_FLOAT32_C( 760.24),
SIMDE_FLOAT32_C( -10.68), SIMDE_FLOAT32_C( 562.56), SIMDE_FLOAT32_C( 19.41), SIMDE_FLOAT32_C( 640.92) },
{ SIMDE_FLOAT32_C( 262.00), SIMDE_FLOAT32_C( -155.00), SIMDE_FLOAT32_C( 634.00), SIMDE_FLOAT32_C( 760.00),
SIMDE_FLOAT32_C( -10.00), SIMDE_FLOAT32_C( 562.00), SIMDE_FLOAT32_C( 19.00), SIMDE_FLOAT32_C( 640.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256 a = simde_mm256_loadu_ps(test_vec[i].a);
simde__m256 r = simde_mm256_trunc_ps(a);
simde_test_x86_assert_equal_f32x8(r, simde_mm256_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm256_trunc_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[4];
const simde_float64 r[4];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 693.29), SIMDE_FLOAT64_C( 980.27), SIMDE_FLOAT64_C( -292.17), SIMDE_FLOAT64_C( -318.62) },
{ SIMDE_FLOAT64_C( 693.00), SIMDE_FLOAT64_C( 980.00), SIMDE_FLOAT64_C( -292.00), SIMDE_FLOAT64_C( -318.00) } },
{ { SIMDE_FLOAT64_C( -733.59), SIMDE_FLOAT64_C( -256.43), SIMDE_FLOAT64_C( 726.81), SIMDE_FLOAT64_C( 443.36) },
{ SIMDE_FLOAT64_C( -733.00), SIMDE_FLOAT64_C( -256.00), SIMDE_FLOAT64_C( 726.00), SIMDE_FLOAT64_C( 443.00) } },
{ { SIMDE_FLOAT64_C( -589.23), SIMDE_FLOAT64_C( -428.07), SIMDE_FLOAT64_C( -734.42), SIMDE_FLOAT64_C( 315.59) },
{ SIMDE_FLOAT64_C( -589.00), SIMDE_FLOAT64_C( -428.00), SIMDE_FLOAT64_C( -734.00), SIMDE_FLOAT64_C( 315.00) } },
{ { SIMDE_FLOAT64_C( 286.91), SIMDE_FLOAT64_C( -276.33), SIMDE_FLOAT64_C( -306.67), SIMDE_FLOAT64_C( -257.37) },
{ SIMDE_FLOAT64_C( 286.00), SIMDE_FLOAT64_C( -276.00), SIMDE_FLOAT64_C( -306.00), SIMDE_FLOAT64_C( -257.00) } },
{ { SIMDE_FLOAT64_C( -92.17), SIMDE_FLOAT64_C( -253.48), SIMDE_FLOAT64_C( 663.58), SIMDE_FLOAT64_C( -246.72) },
{ SIMDE_FLOAT64_C( -92.00), SIMDE_FLOAT64_C( -253.00), SIMDE_FLOAT64_C( 663.00), SIMDE_FLOAT64_C( -246.00) } },
{ { SIMDE_FLOAT64_C( -825.67), SIMDE_FLOAT64_C( -678.59), SIMDE_FLOAT64_C( 803.95), SIMDE_FLOAT64_C( 565.59) },
{ SIMDE_FLOAT64_C( -825.00), SIMDE_FLOAT64_C( -678.00), SIMDE_FLOAT64_C( 803.00), SIMDE_FLOAT64_C( 565.00) } },
{ { SIMDE_FLOAT64_C( -428.00), SIMDE_FLOAT64_C( -167.27), SIMDE_FLOAT64_C( 718.24), SIMDE_FLOAT64_C( -22.78) },
{ SIMDE_FLOAT64_C( -428.00), SIMDE_FLOAT64_C( -167.00), SIMDE_FLOAT64_C( 718.00), SIMDE_FLOAT64_C( -22.00) } },
{ { SIMDE_FLOAT64_C( -376.65), SIMDE_FLOAT64_C( -190.00), SIMDE_FLOAT64_C( -12.78), SIMDE_FLOAT64_C( -683.35) },
{ SIMDE_FLOAT64_C( -376.00), SIMDE_FLOAT64_C( -190.00), SIMDE_FLOAT64_C( -12.00), SIMDE_FLOAT64_C( -683.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m256d a = simde_mm256_loadu_pd(test_vec[i].a);
simde__m256d r = simde_mm256_trunc_pd(a);
simde_test_x86_assert_equal_f64x4(r, simde_mm256_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_trunc_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( 606.57), SIMDE_FLOAT32_C( 426.10), SIMDE_FLOAT32_C( -271.64), SIMDE_FLOAT32_C( -693.93),
SIMDE_FLOAT32_C( 123.39), SIMDE_FLOAT32_C( -323.73), SIMDE_FLOAT32_C( -823.48), SIMDE_FLOAT32_C( 154.72),
SIMDE_FLOAT32_C( 215.73), SIMDE_FLOAT32_C( 870.22), SIMDE_FLOAT32_C( -205.21), SIMDE_FLOAT32_C( 262.07),
SIMDE_FLOAT32_C( 173.72), SIMDE_FLOAT32_C( 310.35), SIMDE_FLOAT32_C( -516.54), SIMDE_FLOAT32_C( -500.11) },
{ SIMDE_FLOAT32_C( 606.00), SIMDE_FLOAT32_C( 426.00), SIMDE_FLOAT32_C( -271.00), SIMDE_FLOAT32_C( -693.00),
SIMDE_FLOAT32_C( 123.00), SIMDE_FLOAT32_C( -323.00), SIMDE_FLOAT32_C( -823.00), SIMDE_FLOAT32_C( 154.00),
SIMDE_FLOAT32_C( 215.00), SIMDE_FLOAT32_C( 870.00), SIMDE_FLOAT32_C( -205.00), SIMDE_FLOAT32_C( 262.00),
SIMDE_FLOAT32_C( 173.00), SIMDE_FLOAT32_C( 310.00), SIMDE_FLOAT32_C( -516.00), SIMDE_FLOAT32_C( -500.00) } },
{ { SIMDE_FLOAT32_C( -175.79), SIMDE_FLOAT32_C( -258.58), SIMDE_FLOAT32_C( -46.96), SIMDE_FLOAT32_C( 515.02),
SIMDE_FLOAT32_C( 317.58), SIMDE_FLOAT32_C( 852.75), SIMDE_FLOAT32_C( 404.36), SIMDE_FLOAT32_C( 87.35),
SIMDE_FLOAT32_C( -977.95), SIMDE_FLOAT32_C( -929.41), SIMDE_FLOAT32_C( 560.67), SIMDE_FLOAT32_C( 89.12),
SIMDE_FLOAT32_C( 773.32), SIMDE_FLOAT32_C( 918.64), SIMDE_FLOAT32_C( 751.41), SIMDE_FLOAT32_C( 379.89) },
{ SIMDE_FLOAT32_C( -175.00), SIMDE_FLOAT32_C( -258.00), SIMDE_FLOAT32_C( -46.00), SIMDE_FLOAT32_C( 515.00),
SIMDE_FLOAT32_C( 317.00), SIMDE_FLOAT32_C( 852.00), SIMDE_FLOAT32_C( 404.00), SIMDE_FLOAT32_C( 87.00),
SIMDE_FLOAT32_C( -977.00), SIMDE_FLOAT32_C( -929.00), SIMDE_FLOAT32_C( 560.00), SIMDE_FLOAT32_C( 89.00),
SIMDE_FLOAT32_C( 773.00), SIMDE_FLOAT32_C( 918.00), SIMDE_FLOAT32_C( 751.00), SIMDE_FLOAT32_C( 379.00) } },
{ { SIMDE_FLOAT32_C( 344.74), SIMDE_FLOAT32_C( -520.24), SIMDE_FLOAT32_C( 685.96), SIMDE_FLOAT32_C( -531.87),
SIMDE_FLOAT32_C( 156.03), SIMDE_FLOAT32_C( 862.48), SIMDE_FLOAT32_C( 622.85), SIMDE_FLOAT32_C( -628.23),
SIMDE_FLOAT32_C( 732.70), SIMDE_FLOAT32_C( -582.36), SIMDE_FLOAT32_C( 633.84), SIMDE_FLOAT32_C( -93.59),
SIMDE_FLOAT32_C( 728.00), SIMDE_FLOAT32_C( -882.70), SIMDE_FLOAT32_C( 406.31), SIMDE_FLOAT32_C( -447.79) },
{ SIMDE_FLOAT32_C( 344.00), SIMDE_FLOAT32_C( -520.00), SIMDE_FLOAT32_C( 685.00), SIMDE_FLOAT32_C( -531.00),
SIMDE_FLOAT32_C( 156.00), SIMDE_FLOAT32_C( 862.00), SIMDE_FLOAT32_C( 622.00), SIMDE_FLOAT32_C( -628.00),
SIMDE_FLOAT32_C( 732.00), SIMDE_FLOAT32_C( -582.00), SIMDE_FLOAT32_C( 633.00), SIMDE_FLOAT32_C( -93.00),
SIMDE_FLOAT32_C( 728.00), SIMDE_FLOAT32_C( -882.00), SIMDE_FLOAT32_C( 406.00), SIMDE_FLOAT32_C( -447.00) } },
{ { SIMDE_FLOAT32_C( -141.28), SIMDE_FLOAT32_C( -640.65), SIMDE_FLOAT32_C( -932.78), SIMDE_FLOAT32_C( -823.70),
SIMDE_FLOAT32_C( -787.91), SIMDE_FLOAT32_C( 471.59), SIMDE_FLOAT32_C( 263.65), SIMDE_FLOAT32_C( -765.85),
SIMDE_FLOAT32_C( 542.17), SIMDE_FLOAT32_C( -175.67), SIMDE_FLOAT32_C( 323.27), SIMDE_FLOAT32_C( 315.49),
SIMDE_FLOAT32_C( -257.03), SIMDE_FLOAT32_C( 74.67), SIMDE_FLOAT32_C( -304.62), SIMDE_FLOAT32_C( -912.29) },
{ SIMDE_FLOAT32_C( -141.00), SIMDE_FLOAT32_C( -640.00), SIMDE_FLOAT32_C( -932.00), SIMDE_FLOAT32_C( -823.00),
SIMDE_FLOAT32_C( -787.00), SIMDE_FLOAT32_C( 471.00), SIMDE_FLOAT32_C( 263.00), SIMDE_FLOAT32_C( -765.00),
SIMDE_FLOAT32_C( 542.00), SIMDE_FLOAT32_C( -175.00), SIMDE_FLOAT32_C( 323.00), SIMDE_FLOAT32_C( 315.00),
SIMDE_FLOAT32_C( -257.00), SIMDE_FLOAT32_C( 74.00), SIMDE_FLOAT32_C( -304.00), SIMDE_FLOAT32_C( -912.00) } },
{ { SIMDE_FLOAT32_C( 554.43), SIMDE_FLOAT32_C( -618.67), SIMDE_FLOAT32_C( -444.16), SIMDE_FLOAT32_C( -289.53),
SIMDE_FLOAT32_C( -756.19), SIMDE_FLOAT32_C( -821.31), SIMDE_FLOAT32_C( 82.23), SIMDE_FLOAT32_C( 976.51),
SIMDE_FLOAT32_C( -403.66), SIMDE_FLOAT32_C( -283.93), SIMDE_FLOAT32_C( -117.08), SIMDE_FLOAT32_C( -675.67),
SIMDE_FLOAT32_C( -166.63), SIMDE_FLOAT32_C( -710.77), SIMDE_FLOAT32_C( -123.46), SIMDE_FLOAT32_C( 692.09) },
{ SIMDE_FLOAT32_C( 554.00), SIMDE_FLOAT32_C( -618.00), SIMDE_FLOAT32_C( -444.00), SIMDE_FLOAT32_C( -289.00),
SIMDE_FLOAT32_C( -756.00), SIMDE_FLOAT32_C( -821.00), SIMDE_FLOAT32_C( 82.00), SIMDE_FLOAT32_C( 976.00),
SIMDE_FLOAT32_C( -403.00), SIMDE_FLOAT32_C( -283.00), SIMDE_FLOAT32_C( -117.00), SIMDE_FLOAT32_C( -675.00),
SIMDE_FLOAT32_C( -166.00), SIMDE_FLOAT32_C( -710.00), SIMDE_FLOAT32_C( -123.00), SIMDE_FLOAT32_C( 692.00) } },
{ { SIMDE_FLOAT32_C( -351.43), SIMDE_FLOAT32_C( -56.24), SIMDE_FLOAT32_C( 868.39), SIMDE_FLOAT32_C( -139.33),
SIMDE_FLOAT32_C( -584.65), SIMDE_FLOAT32_C( 132.04), SIMDE_FLOAT32_C( 94.81), SIMDE_FLOAT32_C( 957.53),
SIMDE_FLOAT32_C( 956.37), SIMDE_FLOAT32_C( -581.92), SIMDE_FLOAT32_C( 273.02), SIMDE_FLOAT32_C( -300.66),
SIMDE_FLOAT32_C( 492.75), SIMDE_FLOAT32_C( 968.40), SIMDE_FLOAT32_C( -212.96), SIMDE_FLOAT32_C( 47.18) },
{ SIMDE_FLOAT32_C( -351.00), SIMDE_FLOAT32_C( -56.00), SIMDE_FLOAT32_C( 868.00), SIMDE_FLOAT32_C( -139.00),
SIMDE_FLOAT32_C( -584.00), SIMDE_FLOAT32_C( 132.00), SIMDE_FLOAT32_C( 94.00), SIMDE_FLOAT32_C( 957.00),
SIMDE_FLOAT32_C( 956.00), SIMDE_FLOAT32_C( -581.00), SIMDE_FLOAT32_C( 273.00), SIMDE_FLOAT32_C( -300.00),
SIMDE_FLOAT32_C( 492.00), SIMDE_FLOAT32_C( 968.00), SIMDE_FLOAT32_C( -212.00), SIMDE_FLOAT32_C( 47.00) } },
{ { SIMDE_FLOAT32_C( -650.27), SIMDE_FLOAT32_C( 342.89), SIMDE_FLOAT32_C( 757.65), SIMDE_FLOAT32_C( -406.46),
SIMDE_FLOAT32_C( 521.58), SIMDE_FLOAT32_C( -160.12), SIMDE_FLOAT32_C( -429.95), SIMDE_FLOAT32_C( -882.09),
SIMDE_FLOAT32_C( 555.95), SIMDE_FLOAT32_C( 452.97), SIMDE_FLOAT32_C( -557.75), SIMDE_FLOAT32_C( -610.67),
SIMDE_FLOAT32_C( 742.20), SIMDE_FLOAT32_C( 318.79), SIMDE_FLOAT32_C( -918.58), SIMDE_FLOAT32_C( -609.23) },
{ SIMDE_FLOAT32_C( -650.00), SIMDE_FLOAT32_C( 342.00), SIMDE_FLOAT32_C( 757.00), SIMDE_FLOAT32_C( -406.00),
SIMDE_FLOAT32_C( 521.00), SIMDE_FLOAT32_C( -160.00), SIMDE_FLOAT32_C( -429.00), SIMDE_FLOAT32_C( -882.00),
SIMDE_FLOAT32_C( 555.00), SIMDE_FLOAT32_C( 452.00), SIMDE_FLOAT32_C( -557.00), SIMDE_FLOAT32_C( -610.00),
SIMDE_FLOAT32_C( 742.00), SIMDE_FLOAT32_C( 318.00), SIMDE_FLOAT32_C( -918.00), SIMDE_FLOAT32_C( -609.00) } },
{ { SIMDE_FLOAT32_C( -737.45), SIMDE_FLOAT32_C( 949.82), SIMDE_FLOAT32_C( 251.44), SIMDE_FLOAT32_C( -322.10),
SIMDE_FLOAT32_C( 81.86), SIMDE_FLOAT32_C( -653.75), SIMDE_FLOAT32_C( -364.57), SIMDE_FLOAT32_C( 38.23),
SIMDE_FLOAT32_C( -235.67), SIMDE_FLOAT32_C( 908.45), SIMDE_FLOAT32_C( 737.57), SIMDE_FLOAT32_C( -742.92),
SIMDE_FLOAT32_C( 876.84), SIMDE_FLOAT32_C( -475.39), SIMDE_FLOAT32_C( 304.27), SIMDE_FLOAT32_C( -773.43) },
{ SIMDE_FLOAT32_C( -737.00), SIMDE_FLOAT32_C( 949.00), SIMDE_FLOAT32_C( 251.00), SIMDE_FLOAT32_C( -322.00),
SIMDE_FLOAT32_C( 81.00), SIMDE_FLOAT32_C( -653.00), SIMDE_FLOAT32_C( -364.00), SIMDE_FLOAT32_C( 38.00),
SIMDE_FLOAT32_C( -235.00), SIMDE_FLOAT32_C( 908.00), SIMDE_FLOAT32_C( 737.00), SIMDE_FLOAT32_C( -742.00),
SIMDE_FLOAT32_C( 876.00), SIMDE_FLOAT32_C( -475.00), SIMDE_FLOAT32_C( 304.00), SIMDE_FLOAT32_C( -773.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_trunc_ps(a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_trunc_ps (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float32 src[16];
const simde__mmask8 k;
const simde_float32 a[16];
const simde_float32 r[16];
} test_vec[] = {
{ { SIMDE_FLOAT32_C( -49.77), SIMDE_FLOAT32_C( -686.71), SIMDE_FLOAT32_C( -757.78), SIMDE_FLOAT32_C( 110.09),
SIMDE_FLOAT32_C( 324.87), SIMDE_FLOAT32_C( -371.31), SIMDE_FLOAT32_C( 784.70), SIMDE_FLOAT32_C( 832.26),
SIMDE_FLOAT32_C( 569.37), SIMDE_FLOAT32_C( 756.19), SIMDE_FLOAT32_C( 204.94), SIMDE_FLOAT32_C( 274.85),
SIMDE_FLOAT32_C( -873.98), SIMDE_FLOAT32_C( -346.20), SIMDE_FLOAT32_C( -78.53), SIMDE_FLOAT32_C( -191.48) },
UINT8_C( 44),
{ SIMDE_FLOAT32_C( -81.77), SIMDE_FLOAT32_C( -137.21), SIMDE_FLOAT32_C( 797.93), SIMDE_FLOAT32_C( -424.41),
SIMDE_FLOAT32_C( -278.83), SIMDE_FLOAT32_C( -767.08), SIMDE_FLOAT32_C( -764.79), SIMDE_FLOAT32_C( 76.32),
SIMDE_FLOAT32_C( 979.09), SIMDE_FLOAT32_C( -188.68), SIMDE_FLOAT32_C( -648.91), SIMDE_FLOAT32_C( 84.00),
SIMDE_FLOAT32_C( -272.96), SIMDE_FLOAT32_C( 552.79), SIMDE_FLOAT32_C( -965.78), SIMDE_FLOAT32_C( 40.34) },
{ SIMDE_FLOAT32_C( -49.77), SIMDE_FLOAT32_C( -686.71), SIMDE_FLOAT32_C( 797.00), SIMDE_FLOAT32_C( -424.00),
SIMDE_FLOAT32_C( 324.87), SIMDE_FLOAT32_C( -767.00), SIMDE_FLOAT32_C( 784.70), SIMDE_FLOAT32_C( 832.26),
SIMDE_FLOAT32_C( 569.37), SIMDE_FLOAT32_C( 756.19), SIMDE_FLOAT32_C( 204.94), SIMDE_FLOAT32_C( 274.85),
SIMDE_FLOAT32_C( -873.98), SIMDE_FLOAT32_C( -346.20), SIMDE_FLOAT32_C( -78.53), SIMDE_FLOAT32_C( -191.48) } },
{ { SIMDE_FLOAT32_C( 795.01), SIMDE_FLOAT32_C( 144.31), SIMDE_FLOAT32_C( -634.80), SIMDE_FLOAT32_C( -576.30),
SIMDE_FLOAT32_C( -71.00), SIMDE_FLOAT32_C( -802.54), SIMDE_FLOAT32_C( 993.08), SIMDE_FLOAT32_C( -314.81),
SIMDE_FLOAT32_C( 402.40), SIMDE_FLOAT32_C( 267.93), SIMDE_FLOAT32_C( -188.79), SIMDE_FLOAT32_C( -943.80),
SIMDE_FLOAT32_C( -810.60), SIMDE_FLOAT32_C( 619.74), SIMDE_FLOAT32_C( 857.90), SIMDE_FLOAT32_C( 107.62) },
UINT8_C(232),
{ SIMDE_FLOAT32_C( 655.83), SIMDE_FLOAT32_C( 683.21), SIMDE_FLOAT32_C( 203.69), SIMDE_FLOAT32_C( 888.75),
SIMDE_FLOAT32_C( 918.42), SIMDE_FLOAT32_C( -720.00), SIMDE_FLOAT32_C( 867.84), SIMDE_FLOAT32_C( -270.26),
SIMDE_FLOAT32_C( -368.90), SIMDE_FLOAT32_C( -48.16), SIMDE_FLOAT32_C( 456.78), SIMDE_FLOAT32_C( -816.11),
SIMDE_FLOAT32_C( -13.93), SIMDE_FLOAT32_C( -502.88), SIMDE_FLOAT32_C( 978.90), SIMDE_FLOAT32_C( -869.63) },
{ SIMDE_FLOAT32_C( 795.01), SIMDE_FLOAT32_C( 144.31), SIMDE_FLOAT32_C( -634.80), SIMDE_FLOAT32_C( 888.00),
SIMDE_FLOAT32_C( -71.00), SIMDE_FLOAT32_C( -720.00), SIMDE_FLOAT32_C( 867.00), SIMDE_FLOAT32_C( -270.00),
SIMDE_FLOAT32_C( 402.40), SIMDE_FLOAT32_C( 267.93), SIMDE_FLOAT32_C( -188.79), SIMDE_FLOAT32_C( -943.80),
SIMDE_FLOAT32_C( -810.60), SIMDE_FLOAT32_C( 619.74), SIMDE_FLOAT32_C( 857.90), SIMDE_FLOAT32_C( 107.62) } },
{ { SIMDE_FLOAT32_C( -137.68), SIMDE_FLOAT32_C( -597.40), SIMDE_FLOAT32_C( 59.38), SIMDE_FLOAT32_C( 59.79),
SIMDE_FLOAT32_C( -604.32), SIMDE_FLOAT32_C( 744.57), SIMDE_FLOAT32_C( -537.81), SIMDE_FLOAT32_C( 663.60),
SIMDE_FLOAT32_C( -444.21), SIMDE_FLOAT32_C( -481.61), SIMDE_FLOAT32_C( 853.00), SIMDE_FLOAT32_C( -824.48),
SIMDE_FLOAT32_C( -623.71), SIMDE_FLOAT32_C( -39.38), SIMDE_FLOAT32_C( -341.96), SIMDE_FLOAT32_C( -967.88) },
UINT8_C( 37),
{ SIMDE_FLOAT32_C( 861.73), SIMDE_FLOAT32_C( 920.87), SIMDE_FLOAT32_C( -437.74), SIMDE_FLOAT32_C( -858.26),
SIMDE_FLOAT32_C( 788.71), SIMDE_FLOAT32_C( 291.99), SIMDE_FLOAT32_C( -227.16), SIMDE_FLOAT32_C( -259.44),
SIMDE_FLOAT32_C( -251.22), SIMDE_FLOAT32_C( -43.28), SIMDE_FLOAT32_C( 726.62), SIMDE_FLOAT32_C( 245.90),
SIMDE_FLOAT32_C( -64.38), SIMDE_FLOAT32_C( 857.00), SIMDE_FLOAT32_C( -891.78), SIMDE_FLOAT32_C( 338.22) },
{ SIMDE_FLOAT32_C( 861.00), SIMDE_FLOAT32_C( -597.40), SIMDE_FLOAT32_C( -437.00), SIMDE_FLOAT32_C( 59.79),
SIMDE_FLOAT32_C( -604.32), SIMDE_FLOAT32_C( 291.00), SIMDE_FLOAT32_C( -537.81), SIMDE_FLOAT32_C( 663.60),
SIMDE_FLOAT32_C( -444.21), SIMDE_FLOAT32_C( -481.61), SIMDE_FLOAT32_C( 853.00), SIMDE_FLOAT32_C( -824.48),
SIMDE_FLOAT32_C( -623.71), SIMDE_FLOAT32_C( -39.38), SIMDE_FLOAT32_C( -341.96), SIMDE_FLOAT32_C( -967.88) } },
{ { SIMDE_FLOAT32_C( -83.63), SIMDE_FLOAT32_C( 168.01), SIMDE_FLOAT32_C( 733.90), SIMDE_FLOAT32_C( -339.05),
SIMDE_FLOAT32_C( 630.19), SIMDE_FLOAT32_C( 397.50), SIMDE_FLOAT32_C( 216.73), SIMDE_FLOAT32_C( -851.42),
SIMDE_FLOAT32_C( 250.50), SIMDE_FLOAT32_C( 392.25), SIMDE_FLOAT32_C( -475.13), SIMDE_FLOAT32_C( -788.88),
SIMDE_FLOAT32_C( -949.70), SIMDE_FLOAT32_C( -443.01), SIMDE_FLOAT32_C( -145.04), SIMDE_FLOAT32_C( 912.03) },
UINT8_C(240),
{ SIMDE_FLOAT32_C( 417.21), SIMDE_FLOAT32_C( -946.23), SIMDE_FLOAT32_C( -733.43), SIMDE_FLOAT32_C( -290.79),
SIMDE_FLOAT32_C( -173.40), SIMDE_FLOAT32_C( 7.13), SIMDE_FLOAT32_C( 457.98), SIMDE_FLOAT32_C( 783.33),
SIMDE_FLOAT32_C( -266.25), SIMDE_FLOAT32_C( -296.12), SIMDE_FLOAT32_C( -281.05), SIMDE_FLOAT32_C( -409.26),
SIMDE_FLOAT32_C( -187.90), SIMDE_FLOAT32_C( -942.83), SIMDE_FLOAT32_C( 507.12), SIMDE_FLOAT32_C( 980.11) },
{ SIMDE_FLOAT32_C( -83.63), SIMDE_FLOAT32_C( 168.01), SIMDE_FLOAT32_C( 733.90), SIMDE_FLOAT32_C( -339.05),
SIMDE_FLOAT32_C( -173.00), SIMDE_FLOAT32_C( 7.00), SIMDE_FLOAT32_C( 457.00), SIMDE_FLOAT32_C( 783.00),
SIMDE_FLOAT32_C( 250.50), SIMDE_FLOAT32_C( 392.25), SIMDE_FLOAT32_C( -475.13), SIMDE_FLOAT32_C( -788.88),
SIMDE_FLOAT32_C( -949.70), SIMDE_FLOAT32_C( -443.01), SIMDE_FLOAT32_C( -145.04), SIMDE_FLOAT32_C( 912.03) } },
{ { SIMDE_FLOAT32_C( 791.07), SIMDE_FLOAT32_C( -831.94), SIMDE_FLOAT32_C( 610.30), SIMDE_FLOAT32_C( 188.58),
SIMDE_FLOAT32_C( 384.80), SIMDE_FLOAT32_C( 758.88), SIMDE_FLOAT32_C( -560.92), SIMDE_FLOAT32_C( -222.95),
SIMDE_FLOAT32_C( -716.25), SIMDE_FLOAT32_C( -349.80), SIMDE_FLOAT32_C( -172.65), SIMDE_FLOAT32_C( -159.27),
SIMDE_FLOAT32_C( 505.16), SIMDE_FLOAT32_C( -260.62), SIMDE_FLOAT32_C( 318.59), SIMDE_FLOAT32_C( -77.63) },
UINT8_C( 96),
{ SIMDE_FLOAT32_C( 585.16), SIMDE_FLOAT32_C( 631.57), SIMDE_FLOAT32_C( 619.75), SIMDE_FLOAT32_C( -407.71),
SIMDE_FLOAT32_C( 89.55), SIMDE_FLOAT32_C( 403.08), SIMDE_FLOAT32_C( 326.04), SIMDE_FLOAT32_C( 793.43),
SIMDE_FLOAT32_C( -877.97), SIMDE_FLOAT32_C( 916.78), SIMDE_FLOAT32_C( -394.47), SIMDE_FLOAT32_C( -820.80),
SIMDE_FLOAT32_C( 423.90), SIMDE_FLOAT32_C( -414.36), SIMDE_FLOAT32_C( 970.28), SIMDE_FLOAT32_C( 591.96) },
{ SIMDE_FLOAT32_C( 791.07), SIMDE_FLOAT32_C( -831.94), SIMDE_FLOAT32_C( 610.30), SIMDE_FLOAT32_C( 188.58),
SIMDE_FLOAT32_C( 384.80), SIMDE_FLOAT32_C( 403.00), SIMDE_FLOAT32_C( 326.00), SIMDE_FLOAT32_C( -222.95),
SIMDE_FLOAT32_C( -716.25), SIMDE_FLOAT32_C( -349.80), SIMDE_FLOAT32_C( -172.65), SIMDE_FLOAT32_C( -159.27),
SIMDE_FLOAT32_C( 505.16), SIMDE_FLOAT32_C( -260.62), SIMDE_FLOAT32_C( 318.59), SIMDE_FLOAT32_C( -77.63) } },
{ { SIMDE_FLOAT32_C( -804.06), SIMDE_FLOAT32_C( 158.86), SIMDE_FLOAT32_C( -23.24), SIMDE_FLOAT32_C( 954.82),
SIMDE_FLOAT32_C( 597.93), SIMDE_FLOAT32_C( 753.81), SIMDE_FLOAT32_C( -761.43), SIMDE_FLOAT32_C( -751.86),
SIMDE_FLOAT32_C( -418.84), SIMDE_FLOAT32_C( 79.30), SIMDE_FLOAT32_C( 753.29), SIMDE_FLOAT32_C( 320.53),
SIMDE_FLOAT32_C( -602.11), SIMDE_FLOAT32_C( -324.34), SIMDE_FLOAT32_C( -886.32), SIMDE_FLOAT32_C( 983.05) },
UINT8_C(109),
{ SIMDE_FLOAT32_C( 733.43), SIMDE_FLOAT32_C( -424.66), SIMDE_FLOAT32_C( 396.78), SIMDE_FLOAT32_C( 136.51),
SIMDE_FLOAT32_C( 901.37), SIMDE_FLOAT32_C( 190.22), SIMDE_FLOAT32_C( 258.54), SIMDE_FLOAT32_C( 818.15),
SIMDE_FLOAT32_C( 795.75), SIMDE_FLOAT32_C( 437.74), SIMDE_FLOAT32_C( 242.05), SIMDE_FLOAT32_C( -618.61),
SIMDE_FLOAT32_C( 408.02), SIMDE_FLOAT32_C( -165.99), SIMDE_FLOAT32_C( -422.67), SIMDE_FLOAT32_C( -433.12) },
{ SIMDE_FLOAT32_C( 733.00), SIMDE_FLOAT32_C( 158.86), SIMDE_FLOAT32_C( 396.00), SIMDE_FLOAT32_C( 136.00),
SIMDE_FLOAT32_C( 597.93), SIMDE_FLOAT32_C( 190.00), SIMDE_FLOAT32_C( 258.00), SIMDE_FLOAT32_C( -751.86),
SIMDE_FLOAT32_C( -418.84), SIMDE_FLOAT32_C( 79.30), SIMDE_FLOAT32_C( 753.29), SIMDE_FLOAT32_C( 320.53),
SIMDE_FLOAT32_C( -602.11), SIMDE_FLOAT32_C( -324.34), SIMDE_FLOAT32_C( -886.32), SIMDE_FLOAT32_C( 983.05) } },
{ { SIMDE_FLOAT32_C( 810.77), SIMDE_FLOAT32_C( -467.85), SIMDE_FLOAT32_C( -835.19), SIMDE_FLOAT32_C( 564.58),
SIMDE_FLOAT32_C( -229.28), SIMDE_FLOAT32_C( -587.05), SIMDE_FLOAT32_C( -854.26), SIMDE_FLOAT32_C( 850.02),
SIMDE_FLOAT32_C( -833.76), SIMDE_FLOAT32_C( 466.27), SIMDE_FLOAT32_C( -752.09), SIMDE_FLOAT32_C( -158.10),
SIMDE_FLOAT32_C( 579.95), SIMDE_FLOAT32_C( -769.04), SIMDE_FLOAT32_C( 149.13), SIMDE_FLOAT32_C( 313.38) },
UINT8_C(125),
{ SIMDE_FLOAT32_C( -454.09), SIMDE_FLOAT32_C( -550.11), SIMDE_FLOAT32_C( -292.33), SIMDE_FLOAT32_C( 736.13),
SIMDE_FLOAT32_C( 708.43), SIMDE_FLOAT32_C( -474.18), SIMDE_FLOAT32_C( 531.88), SIMDE_FLOAT32_C( 146.17),
SIMDE_FLOAT32_C( 767.87), SIMDE_FLOAT32_C( 913.26), SIMDE_FLOAT32_C( -445.81), SIMDE_FLOAT32_C( -398.12),
SIMDE_FLOAT32_C( -509.41), SIMDE_FLOAT32_C( 121.07), SIMDE_FLOAT32_C( -587.35), SIMDE_FLOAT32_C( 22.74) },
{ SIMDE_FLOAT32_C( -454.00), SIMDE_FLOAT32_C( -467.85), SIMDE_FLOAT32_C( -292.00), SIMDE_FLOAT32_C( 736.00),
SIMDE_FLOAT32_C( 708.00), SIMDE_FLOAT32_C( -474.00), SIMDE_FLOAT32_C( 531.00), SIMDE_FLOAT32_C( 850.02),
SIMDE_FLOAT32_C( -833.76), SIMDE_FLOAT32_C( 466.27), SIMDE_FLOAT32_C( -752.09), SIMDE_FLOAT32_C( -158.10),
SIMDE_FLOAT32_C( 579.95), SIMDE_FLOAT32_C( -769.04), SIMDE_FLOAT32_C( 149.13), SIMDE_FLOAT32_C( 313.38) } },
{ { SIMDE_FLOAT32_C( 285.88), SIMDE_FLOAT32_C( 977.23), SIMDE_FLOAT32_C( 793.45), SIMDE_FLOAT32_C( 698.82),
SIMDE_FLOAT32_C( -877.03), SIMDE_FLOAT32_C( 643.47), SIMDE_FLOAT32_C( 865.06), SIMDE_FLOAT32_C( 589.25),
SIMDE_FLOAT32_C( 891.38), SIMDE_FLOAT32_C( -293.04), SIMDE_FLOAT32_C( 169.20), SIMDE_FLOAT32_C( -877.66),
SIMDE_FLOAT32_C( 856.08), SIMDE_FLOAT32_C( -517.41), SIMDE_FLOAT32_C( -71.37), SIMDE_FLOAT32_C( -598.01) },
UINT8_C(105),
{ SIMDE_FLOAT32_C( 636.30), SIMDE_FLOAT32_C( -861.88), SIMDE_FLOAT32_C( -359.09), SIMDE_FLOAT32_C( -837.88),
SIMDE_FLOAT32_C( 670.00), SIMDE_FLOAT32_C( 787.08), SIMDE_FLOAT32_C( 929.98), SIMDE_FLOAT32_C( 583.26),
SIMDE_FLOAT32_C( -658.72), SIMDE_FLOAT32_C( -468.14), SIMDE_FLOAT32_C( -926.15), SIMDE_FLOAT32_C( 462.35),
SIMDE_FLOAT32_C( -55.49), SIMDE_FLOAT32_C( 96.59), SIMDE_FLOAT32_C( -251.77), SIMDE_FLOAT32_C( -78.25) },
{ SIMDE_FLOAT32_C( 636.00), SIMDE_FLOAT32_C( 977.23), SIMDE_FLOAT32_C( 793.45), SIMDE_FLOAT32_C( -837.00),
SIMDE_FLOAT32_C( -877.03), SIMDE_FLOAT32_C( 787.00), SIMDE_FLOAT32_C( 929.00), SIMDE_FLOAT32_C( 589.25),
SIMDE_FLOAT32_C( 891.38), SIMDE_FLOAT32_C( -293.04), SIMDE_FLOAT32_C( 169.20), SIMDE_FLOAT32_C( -877.66),
SIMDE_FLOAT32_C( 856.08), SIMDE_FLOAT32_C( -517.41), SIMDE_FLOAT32_C( -71.37), SIMDE_FLOAT32_C( -598.01) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512 src = simde_mm512_loadu_ps(test_vec[i].src);
simde__m512 a = simde_mm512_loadu_ps(test_vec[i].a);
simde__m512 r = simde_mm512_mask_trunc_ps(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f32x16(r, simde_mm512_loadu_ps(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_trunc_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 90.45), SIMDE_FLOAT64_C( 195.98), SIMDE_FLOAT64_C( -83.38), SIMDE_FLOAT64_C( -236.26),
SIMDE_FLOAT64_C( -941.16), SIMDE_FLOAT64_C( 125.78), SIMDE_FLOAT64_C( -753.74), SIMDE_FLOAT64_C( -729.24) },
{ SIMDE_FLOAT64_C( 90.00), SIMDE_FLOAT64_C( 195.00), SIMDE_FLOAT64_C( -83.00), SIMDE_FLOAT64_C( -236.00),
SIMDE_FLOAT64_C( -941.00), SIMDE_FLOAT64_C( 125.00), SIMDE_FLOAT64_C( -753.00), SIMDE_FLOAT64_C( -729.00) } },
{ { SIMDE_FLOAT64_C( 663.53), SIMDE_FLOAT64_C( 196.60), SIMDE_FLOAT64_C( -90.58), SIMDE_FLOAT64_C( 229.06),
SIMDE_FLOAT64_C( -925.87), SIMDE_FLOAT64_C( -621.28), SIMDE_FLOAT64_C( 631.54), SIMDE_FLOAT64_C( -475.70) },
{ SIMDE_FLOAT64_C( 663.00), SIMDE_FLOAT64_C( 196.00), SIMDE_FLOAT64_C( -90.00), SIMDE_FLOAT64_C( 229.00),
SIMDE_FLOAT64_C( -925.00), SIMDE_FLOAT64_C( -621.00), SIMDE_FLOAT64_C( 631.00), SIMDE_FLOAT64_C( -475.00) } },
{ { SIMDE_FLOAT64_C( 499.40), SIMDE_FLOAT64_C( -577.93), SIMDE_FLOAT64_C( -603.42), SIMDE_FLOAT64_C( -226.68),
SIMDE_FLOAT64_C( 674.64), SIMDE_FLOAT64_C( -116.71), SIMDE_FLOAT64_C( 605.38), SIMDE_FLOAT64_C( -749.41) },
{ SIMDE_FLOAT64_C( 499.00), SIMDE_FLOAT64_C( -577.00), SIMDE_FLOAT64_C( -603.00), SIMDE_FLOAT64_C( -226.00),
SIMDE_FLOAT64_C( 674.00), SIMDE_FLOAT64_C( -116.00), SIMDE_FLOAT64_C( 605.00), SIMDE_FLOAT64_C( -749.00) } },
{ { SIMDE_FLOAT64_C( -866.90), SIMDE_FLOAT64_C( 273.08), SIMDE_FLOAT64_C( 910.37), SIMDE_FLOAT64_C( -223.08),
SIMDE_FLOAT64_C( 229.45), SIMDE_FLOAT64_C( -919.92), SIMDE_FLOAT64_C( 179.63), SIMDE_FLOAT64_C( -680.10) },
{ SIMDE_FLOAT64_C( -866.00), SIMDE_FLOAT64_C( 273.00), SIMDE_FLOAT64_C( 910.00), SIMDE_FLOAT64_C( -223.00),
SIMDE_FLOAT64_C( 229.00), SIMDE_FLOAT64_C( -919.00), SIMDE_FLOAT64_C( 179.00), SIMDE_FLOAT64_C( -680.00) } },
{ { SIMDE_FLOAT64_C( 276.06), SIMDE_FLOAT64_C( -903.75), SIMDE_FLOAT64_C( 83.64), SIMDE_FLOAT64_C( 334.90),
SIMDE_FLOAT64_C( 222.03), SIMDE_FLOAT64_C( 329.90), SIMDE_FLOAT64_C( 605.67), SIMDE_FLOAT64_C( -114.44) },
{ SIMDE_FLOAT64_C( 276.00), SIMDE_FLOAT64_C( -903.00), SIMDE_FLOAT64_C( 83.00), SIMDE_FLOAT64_C( 334.00),
SIMDE_FLOAT64_C( 222.00), SIMDE_FLOAT64_C( 329.00), SIMDE_FLOAT64_C( 605.00), SIMDE_FLOAT64_C( -114.00) } },
{ { SIMDE_FLOAT64_C( -473.49), SIMDE_FLOAT64_C( -484.91), SIMDE_FLOAT64_C( -885.38), SIMDE_FLOAT64_C( -399.36),
SIMDE_FLOAT64_C( -106.19), SIMDE_FLOAT64_C( 746.15), SIMDE_FLOAT64_C( 124.93), SIMDE_FLOAT64_C( -606.79) },
{ SIMDE_FLOAT64_C( -473.00), SIMDE_FLOAT64_C( -484.00), SIMDE_FLOAT64_C( -885.00), SIMDE_FLOAT64_C( -399.00),
SIMDE_FLOAT64_C( -106.00), SIMDE_FLOAT64_C( 746.00), SIMDE_FLOAT64_C( 124.00), SIMDE_FLOAT64_C( -606.00) } },
{ { SIMDE_FLOAT64_C( -831.78), SIMDE_FLOAT64_C( 521.52), SIMDE_FLOAT64_C( 166.54), SIMDE_FLOAT64_C( 842.86),
SIMDE_FLOAT64_C( -595.19), SIMDE_FLOAT64_C( -228.09), SIMDE_FLOAT64_C( -906.55), SIMDE_FLOAT64_C( -462.09) },
{ SIMDE_FLOAT64_C( -831.00), SIMDE_FLOAT64_C( 521.00), SIMDE_FLOAT64_C( 166.00), SIMDE_FLOAT64_C( 842.00),
SIMDE_FLOAT64_C( -595.00), SIMDE_FLOAT64_C( -228.00), SIMDE_FLOAT64_C( -906.00), SIMDE_FLOAT64_C( -462.00) } },
{ { SIMDE_FLOAT64_C( -955.00), SIMDE_FLOAT64_C( -996.18), SIMDE_FLOAT64_C( 314.83), SIMDE_FLOAT64_C( 274.44),
SIMDE_FLOAT64_C( -916.10), SIMDE_FLOAT64_C( -505.54), SIMDE_FLOAT64_C( 594.34), SIMDE_FLOAT64_C( 359.96) },
{ SIMDE_FLOAT64_C( -955.00), SIMDE_FLOAT64_C( -996.00), SIMDE_FLOAT64_C( 314.00), SIMDE_FLOAT64_C( 274.00),
SIMDE_FLOAT64_C( -916.00), SIMDE_FLOAT64_C( -505.00), SIMDE_FLOAT64_C( 594.00), SIMDE_FLOAT64_C( 359.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_trunc_pd(a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm512_mask_trunc_pd (SIMDE_MUNIT_TEST_ARGS) {
static const struct {
const simde_float64 src[8];
const simde__mmask8 k;
const simde_float64 a[8];
const simde_float64 r[8];
} test_vec[] = {
{ { SIMDE_FLOAT64_C( 818.03), SIMDE_FLOAT64_C( 444.72), SIMDE_FLOAT64_C( 916.04), SIMDE_FLOAT64_C( -825.66),
SIMDE_FLOAT64_C( 941.31), SIMDE_FLOAT64_C( -37.20), SIMDE_FLOAT64_C( -948.28), SIMDE_FLOAT64_C( -408.19) },
UINT8_C( 90),
{ SIMDE_FLOAT64_C( -903.02), SIMDE_FLOAT64_C( 326.13), SIMDE_FLOAT64_C( -77.85), SIMDE_FLOAT64_C( 808.82),
SIMDE_FLOAT64_C( -385.32), SIMDE_FLOAT64_C( -921.95), SIMDE_FLOAT64_C( -879.51), SIMDE_FLOAT64_C( 447.28) },
{ SIMDE_FLOAT64_C( 818.03), SIMDE_FLOAT64_C( 326.00), SIMDE_FLOAT64_C( 916.04), SIMDE_FLOAT64_C( 808.00),
SIMDE_FLOAT64_C( -385.00), SIMDE_FLOAT64_C( -37.20), SIMDE_FLOAT64_C( -879.00), SIMDE_FLOAT64_C( -408.19) } },
{ { SIMDE_FLOAT64_C( -281.72), SIMDE_FLOAT64_C( 142.99), SIMDE_FLOAT64_C( -182.68), SIMDE_FLOAT64_C( -63.76),
SIMDE_FLOAT64_C( 164.70), SIMDE_FLOAT64_C( -994.58), SIMDE_FLOAT64_C( -84.09), SIMDE_FLOAT64_C( 455.69) },
UINT8_C(145),
{ SIMDE_FLOAT64_C( 892.02), SIMDE_FLOAT64_C( 632.35), SIMDE_FLOAT64_C( 571.19), SIMDE_FLOAT64_C( -642.67),
SIMDE_FLOAT64_C( -756.86), SIMDE_FLOAT64_C( 389.22), SIMDE_FLOAT64_C( 802.05), SIMDE_FLOAT64_C( -840.82) },
{ SIMDE_FLOAT64_C( 892.00), SIMDE_FLOAT64_C( 142.99), SIMDE_FLOAT64_C( -182.68), SIMDE_FLOAT64_C( -63.76),
SIMDE_FLOAT64_C( -756.00), SIMDE_FLOAT64_C( -994.58), SIMDE_FLOAT64_C( -84.09), SIMDE_FLOAT64_C( -840.00) } },
{ { SIMDE_FLOAT64_C( 563.57), SIMDE_FLOAT64_C( 743.36), SIMDE_FLOAT64_C( 121.98), SIMDE_FLOAT64_C( 615.28),
SIMDE_FLOAT64_C( -664.83), SIMDE_FLOAT64_C( 388.96), SIMDE_FLOAT64_C( 712.26), SIMDE_FLOAT64_C( 661.30) },
UINT8_C(219),
{ SIMDE_FLOAT64_C( 521.09), SIMDE_FLOAT64_C( -724.02), SIMDE_FLOAT64_C( -610.84), SIMDE_FLOAT64_C( 641.58),
SIMDE_FLOAT64_C( 723.26), SIMDE_FLOAT64_C( 107.43), SIMDE_FLOAT64_C( -215.43), SIMDE_FLOAT64_C( -459.42) },
{ SIMDE_FLOAT64_C( 521.00), SIMDE_FLOAT64_C( -724.00), SIMDE_FLOAT64_C( 121.98), SIMDE_FLOAT64_C( 641.00),
SIMDE_FLOAT64_C( 723.00), SIMDE_FLOAT64_C( 388.96), SIMDE_FLOAT64_C( -215.00), SIMDE_FLOAT64_C( -459.00) } },
{ { SIMDE_FLOAT64_C( -956.33), SIMDE_FLOAT64_C( 949.27), SIMDE_FLOAT64_C( -454.00), SIMDE_FLOAT64_C( -40.42),
SIMDE_FLOAT64_C( 404.97), SIMDE_FLOAT64_C( -418.67), SIMDE_FLOAT64_C( -148.40), SIMDE_FLOAT64_C( 37.32) },
UINT8_C( 67),
{ SIMDE_FLOAT64_C( 208.93), SIMDE_FLOAT64_C( 280.46), SIMDE_FLOAT64_C( 541.75), SIMDE_FLOAT64_C( 10.98),
SIMDE_FLOAT64_C( 439.64), SIMDE_FLOAT64_C( 105.31), SIMDE_FLOAT64_C( -245.66), SIMDE_FLOAT64_C( -438.38) },
{ SIMDE_FLOAT64_C( 208.00), SIMDE_FLOAT64_C( 280.00), SIMDE_FLOAT64_C( -454.00), SIMDE_FLOAT64_C( -40.42),
SIMDE_FLOAT64_C( 404.97), SIMDE_FLOAT64_C( -418.67), SIMDE_FLOAT64_C( -245.00), SIMDE_FLOAT64_C( 37.32) } },
{ { SIMDE_FLOAT64_C( -279.41), SIMDE_FLOAT64_C( 89.51), SIMDE_FLOAT64_C( 950.57), SIMDE_FLOAT64_C( -567.14),
SIMDE_FLOAT64_C( -249.19), SIMDE_FLOAT64_C( -738.32), SIMDE_FLOAT64_C( 953.94), SIMDE_FLOAT64_C( 26.79) },
UINT8_C(166),
{ SIMDE_FLOAT64_C( 595.52), SIMDE_FLOAT64_C( -249.94), SIMDE_FLOAT64_C( 758.28), SIMDE_FLOAT64_C( -619.90),
SIMDE_FLOAT64_C( 290.64), SIMDE_FLOAT64_C( 801.95), SIMDE_FLOAT64_C( -670.63), SIMDE_FLOAT64_C( 836.64) },
{ SIMDE_FLOAT64_C( -279.41), SIMDE_FLOAT64_C( -249.00), SIMDE_FLOAT64_C( 758.00), SIMDE_FLOAT64_C( -567.14),
SIMDE_FLOAT64_C( -249.19), SIMDE_FLOAT64_C( 801.00), SIMDE_FLOAT64_C( 953.94), SIMDE_FLOAT64_C( 836.00) } },
{ { SIMDE_FLOAT64_C( -238.47), SIMDE_FLOAT64_C( 734.34), SIMDE_FLOAT64_C( -582.03), SIMDE_FLOAT64_C( 613.13),
SIMDE_FLOAT64_C( -228.35), SIMDE_FLOAT64_C( -429.51), SIMDE_FLOAT64_C( -177.94), SIMDE_FLOAT64_C( -947.89) },
UINT8_C(123),
{ SIMDE_FLOAT64_C( 833.04), SIMDE_FLOAT64_C( 491.75), SIMDE_FLOAT64_C( 217.55), SIMDE_FLOAT64_C( -412.62),
SIMDE_FLOAT64_C( -946.63), SIMDE_FLOAT64_C( 938.15), SIMDE_FLOAT64_C( 676.89), SIMDE_FLOAT64_C( -996.06) },
{ SIMDE_FLOAT64_C( 833.00), SIMDE_FLOAT64_C( 491.00), SIMDE_FLOAT64_C( -582.03), SIMDE_FLOAT64_C( -412.00),
SIMDE_FLOAT64_C( -946.00), SIMDE_FLOAT64_C( 938.00), SIMDE_FLOAT64_C( 676.00), SIMDE_FLOAT64_C( -947.89) } },
{ { SIMDE_FLOAT64_C( -629.00), SIMDE_FLOAT64_C( -572.30), SIMDE_FLOAT64_C( -734.38), SIMDE_FLOAT64_C( -675.05),
SIMDE_FLOAT64_C( 454.50), SIMDE_FLOAT64_C( -83.54), SIMDE_FLOAT64_C( 920.47), SIMDE_FLOAT64_C( -795.45) },
UINT8_C( 5),
{ SIMDE_FLOAT64_C( -699.43), SIMDE_FLOAT64_C( 495.19), SIMDE_FLOAT64_C( -523.31), SIMDE_FLOAT64_C( -370.06),
SIMDE_FLOAT64_C( 331.83), SIMDE_FLOAT64_C( 238.22), SIMDE_FLOAT64_C( -635.72), SIMDE_FLOAT64_C( 749.81) },
{ SIMDE_FLOAT64_C( -699.00), SIMDE_FLOAT64_C( -572.30), SIMDE_FLOAT64_C( -523.00), SIMDE_FLOAT64_C( -675.05),
SIMDE_FLOAT64_C( 454.50), SIMDE_FLOAT64_C( -83.54), SIMDE_FLOAT64_C( 920.47), SIMDE_FLOAT64_C( -795.45) } },
{ { SIMDE_FLOAT64_C( -148.65), SIMDE_FLOAT64_C( 135.93), SIMDE_FLOAT64_C( -679.70), SIMDE_FLOAT64_C( 673.41),
SIMDE_FLOAT64_C( 188.04), SIMDE_FLOAT64_C( -567.46), SIMDE_FLOAT64_C( 506.46), SIMDE_FLOAT64_C( -320.21) },
UINT8_C(186),
{ SIMDE_FLOAT64_C( -906.16), SIMDE_FLOAT64_C( -266.84), SIMDE_FLOAT64_C( 588.24), SIMDE_FLOAT64_C( 770.73),
SIMDE_FLOAT64_C( -262.91), SIMDE_FLOAT64_C( 959.24), SIMDE_FLOAT64_C( -801.57), SIMDE_FLOAT64_C( 2.71) },
{ SIMDE_FLOAT64_C( -148.65), SIMDE_FLOAT64_C( -266.00), SIMDE_FLOAT64_C( -679.70), SIMDE_FLOAT64_C( 770.00),
SIMDE_FLOAT64_C( -262.00), SIMDE_FLOAT64_C( 959.00), SIMDE_FLOAT64_C( 506.46), SIMDE_FLOAT64_C( 2.00) } }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])) ; i++) {
simde__m512d src = simde_mm512_loadu_pd(test_vec[i].src);
simde__m512d a = simde_mm512_loadu_pd(test_vec[i].a);
simde__m512d r = simde_mm512_mask_trunc_pd(src, test_vec[i].k, a);
simde_test_x86_assert_equal_f64x8(r, simde_mm512_loadu_pd(test_vec[i].r), 1);
}
return 0;
}
static int
test_simde_mm_udivrem_epi32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128i a;
simde__m128i b;
simde__m128i rem;
simde__m128i r;
} test_vec[8] = {
{ simde_x_mm_set_epu32(UINT32_C(1747596798), UINT32_C(2231263307), UINT32_C( 527472553), UINT32_C(2891870298)),
simde_x_mm_set_epu32(UINT32_C(4025088144), UINT32_C(4117928860), UINT32_C( 377180600), UINT32_C(3776380886)),
simde_x_mm_set_epu32(UINT32_C(1747596798), UINT32_C(2231263307), UINT32_C( 150291953), UINT32_C(2891870298)),
simde_x_mm_set_epu32(UINT32_C( 0), UINT32_C( 0), UINT32_C( 1), UINT32_C( 0)) },
{ simde_x_mm_set_epu32(UINT32_C(3920294270), UINT32_C(3054162118), UINT32_C(1568850865), UINT32_C(3151989757)),
simde_x_mm_set_epu32(UINT32_C( 172780273), UINT32_C( 168508556), UINT32_C(3803608574), UINT32_C(4064895559)),
simde_x_mm_set_epu32(UINT32_C( 119128264), UINT32_C( 21008110), UINT32_C(1568850865), UINT32_C(3151989757)),
simde_x_mm_set_epu32(UINT32_C( 22), UINT32_C( 18), UINT32_C( 0), UINT32_C( 0)) },
{ simde_x_mm_set_epu32(UINT32_C(1492341726), UINT32_C( 298608154), UINT32_C(1250819173), UINT32_C(3643996043)),
simde_x_mm_set_epu32(UINT32_C( 298065861), UINT32_C(3773381365), UINT32_C( 330694282), UINT32_C( 40997390)),
simde_x_mm_set_epu32(UINT32_C( 2012421), UINT32_C( 298608154), UINT32_C( 258736327), UINT32_C( 36225723)),
simde_x_mm_set_epu32(UINT32_C( 5), UINT32_C( 0), UINT32_C( 3), UINT32_C( 88)) },
{ simde_x_mm_set_epu32(UINT32_C(2708640028), UINT32_C(1691051285), UINT32_C( 50347892), UINT32_C( 728425428)),
simde_x_mm_set_epu32(UINT32_C(3853764578), UINT32_C( 294920921), UINT32_C(3883385645), UINT32_C(4126975473)),
simde_x_mm_set_epu32(UINT32_C(2708640028), UINT32_C( 216446680), UINT32_C( 50347892), UINT32_C( 728425428)),
simde_x_mm_set_epu32(UINT32_C( 0), UINT32_C( 5), UINT32_C( 0), UINT32_C( 0)) },
{ simde_x_mm_set_epu32(UINT32_C( 492373082), UINT32_C(4281870485), UINT32_C(2207786213), UINT32_C(3953959418)),
simde_x_mm_set_epu32(UINT32_C( 123290430), UINT32_C(3996188341), UINT32_C( 223555334), UINT32_C(3962352253)),
simde_x_mm_set_epu32(UINT32_C( 122501792), UINT32_C( 285682144), UINT32_C( 195788207), UINT32_C(3953959418)),
simde_x_mm_set_epu32(UINT32_C( 3), UINT32_C( 1), UINT32_C( 9), UINT32_C( 0)) },
{ simde_x_mm_set_epu32(UINT32_C(3290702646), UINT32_C(1580565751), UINT32_C(3823902839), UINT32_C(2081361826)),
simde_x_mm_set_epu32(UINT32_C( 328620632), UINT32_C(3970654641), UINT32_C(4110215287), UINT32_C(3940207296)),
simde_x_mm_set_epu32(UINT32_C( 4496326), UINT32_C(1580565751), UINT32_C(3823902839), UINT32_C(2081361826)),
simde_x_mm_set_epu32(UINT32_C( 10), UINT32_C( 0), UINT32_C( 0), UINT32_C( 0)) },
{ simde_x_mm_set_epu32(UINT32_C( 542053192), UINT32_C( 499863549), UINT32_C( 957375358), UINT32_C(3003933707)),
simde_x_mm_set_epu32(UINT32_C( 427537184), UINT32_C( 493530770), UINT32_C(3938875497), UINT32_C( 29647056)),
simde_x_mm_set_epu32(UINT32_C( 114516008), UINT32_C( 6332779), UINT32_C( 957375358), UINT32_C( 9581051)),
simde_x_mm_set_epu32(UINT32_C( 1), UINT32_C( 1), UINT32_C( 0), UINT32_C( 101)) },
{ simde_x_mm_set_epu32(UINT32_C(4101755863), UINT32_C(3436978124), UINT32_C(3846637996), UINT32_C(2693603084)),
simde_x_mm_set_epu32(UINT32_C(4010243988), UINT32_C(4123176886), UINT32_C( 457043765), UINT32_C(4197612290)),
simde_x_mm_set_epu32(UINT32_C( 91511875), UINT32_C(3436978124), UINT32_C( 190287876), UINT32_C(2693603084)),
simde_x_mm_set_epu32(UINT32_C( 1), UINT32_C( 0), UINT32_C( 8), UINT32_C( 0)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128i rem;
simde__m128i r = simde_mm_udivrem_epi32(&rem, test_vec[i].a, test_vec[i].b);
simde_assert_m128i_u32(r, ==, test_vec[i].r);
simde_assert_m128i_u32(rem, ==, test_vec[i].rem);
}
return 0;
}
static int
test_simde_mm_tanh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128 a;
simde__m128 r;
} test_vec[8] = {
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.35)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( 0.34)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( 0.03)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 0.44)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.57)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.52)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.70)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 0.60)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.92)),
simde_mm_set_ps(SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.73)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.66)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.58)) },
{ simde_mm_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.69)),
simde_mm_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.60)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128 r = simde_mm_tanh_ps(test_vec[i].a);
simde_assert_m128_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm_tanh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m128d a;
simde__m128d r;
} test_vec[8] = {
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.35)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.64), SIMDE_FLOAT64_C( 0.34)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( 0.03)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.67)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.58)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.26), SIMDE_FLOAT64_C( 0.44)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.42)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.40)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.57)),
simde_mm_set_pd(SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.52)) },
{ simde_mm_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.69)),
simde_mm_set_pd(SIMDE_FLOAT64_C( -0.59), SIMDE_FLOAT64_C( -0.60)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m128d r = simde_mm_tanh_pd(test_vec[i].a);
simde_assert_m128d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_tanh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256 a;
simde__m256 r;
} test_vec[8] = {
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.67),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.35)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.58),
SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04),
SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( 0.34)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.69),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.42),
SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( -0.60),
SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.52),
SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 0.44)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.92),
SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.86),
SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.70)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.73),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.70),
SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 0.60)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21),
SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.66)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.42),
SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.21),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.58)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.44),
SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.99),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.84)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( 0.41),
SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.36),
SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( 0.76),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.69)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.40),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.26),
SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.03)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( -0.25),
SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( -0.03)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.94),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.40)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.08),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -0.74),
SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.65),
SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( 0.38)) },
{ simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.25)),
simde_mm256_set_ps(SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.54),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.24)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256 r = simde_mm256_tanh_ps(test_vec[i].a);
simde_assert_m256_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_tanh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256d a;
simde__m256d r;
} test_vec[8] = {
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.35)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.64), SIMDE_FLOAT64_C( 0.34)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.67),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.58),
SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( 0.03)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( -0.26), SIMDE_FLOAT64_C( 0.44)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.69),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.57)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.59), SIMDE_FLOAT64_C( -0.60),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.52)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.86),
SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 0.70)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( -0.70),
SIMDE_FLOAT64_C( -0.40), SIMDE_FLOAT64_C( 0.60)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.38), SIMDE_FLOAT64_C( -0.92)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( -0.41), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( -0.73)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.98), SIMDE_FLOAT64_C( -0.66)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.58)) },
{ simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.45),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.69)),
simde_mm256_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.42),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.60)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256d r = simde_mm256_tanh_pd(test_vec[i].a);
simde_assert_m256d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_tanh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.57),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.35)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.52),
SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 0.44),
SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( 0.03),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( 0.34)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.69),
SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.98), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.92),
SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.70)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.58),
SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.73),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( -0.40), SIMDE_FLOAT32_C( 0.60)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.26), SIMDE_FLOAT32_C( 0.78), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.38),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.99), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.84)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.37), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( 0.65), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( 0.36),
SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.69)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.25),
SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.94),
SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.55), SIMDE_FLOAT32_C( 0.40)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.59), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.24),
SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -0.74),
SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( -0.50), SIMDE_FLOAT32_C( 0.38)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.34),
SIMDE_FLOAT32_C( -0.87), SIMDE_FLOAT32_C( -0.80), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.53),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.54), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.17)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( -0.66), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( -0.49),
SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.58),
SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.49), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.17)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.59),
SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.62),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( -0.91),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.74)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.53),
SIMDE_FLOAT32_C( -0.42), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.55),
SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( 0.75), SIMDE_FLOAT32_C( -0.72),
SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( -0.63)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( -0.34), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( 0.98),
SIMDE_FLOAT32_C( -0.76), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.10)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.65), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.28), SIMDE_FLOAT32_C( -0.65),
SIMDE_FLOAT32_C( -0.52), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.75),
SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.10)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.13),
SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.70)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.64), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( -0.13),
SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( 0.37),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -0.60)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_tanh_ps(test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_tanh_ps(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512 src;
simde__mmask16 k;
simde__m512 a;
simde__m512 r;
} test_vec[8] = {
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.45), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 0.70),
SIMDE_FLOAT32_C( -0.69), SIMDE_FLOAT32_C( 0.57), SIMDE_FLOAT32_C( 0.42), SIMDE_FLOAT32_C( 0.47),
SIMDE_FLOAT32_C( 0.67), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.35)),
UINT16_C(41466),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.23), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( -0.98),
SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.38), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.42),
SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.27),
SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( -0.75)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.18), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.66),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( -0.40),
SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 0.08), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -0.26),
SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.35)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.47), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( -0.55),
SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.03)),
UINT16_C(36797),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.60),
SIMDE_FLOAT32_C( 0.25), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.40), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.26),
SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.99)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.17), SIMDE_FLOAT32_C( -0.15), SIMDE_FLOAT32_C( 0.91), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.65),
SIMDE_FLOAT32_C( 0.38), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -0.25),
SIMDE_FLOAT32_C( -0.03), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.76)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.84), SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( 0.10),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.34), SIMDE_FLOAT32_C( -0.80),
SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.54)),
UINT16_C(16804),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.69), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.45),
SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( 0.98), SIMDE_FLOAT32_C( 0.33),
SIMDE_FLOAT32_C( 0.46), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.32), SIMDE_FLOAT32_C( -0.87),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( -0.07)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.10), SIMDE_FLOAT32_C( 0.60), SIMDE_FLOAT32_C( -0.59), SIMDE_FLOAT32_C( 0.73),
SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.91), SIMDE_FLOAT32_C( 0.32),
SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.76), SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -0.80),
SIMDE_FLOAT32_C( -0.53), SIMDE_FLOAT32_C( -0.19), SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.54)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( -0.33), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.57), SIMDE_FLOAT32_C( -0.34),
SIMDE_FLOAT32_C( 0.29), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( 0.71), SIMDE_FLOAT32_C( -0.76)),
UINT16_C( 2107),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.90), SIMDE_FLOAT32_C( -0.20), SIMDE_FLOAT32_C( -0.36), SIMDE_FLOAT32_C( -0.03),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.13), SIMDE_FLOAT32_C( 0.04), SIMDE_FLOAT32_C( 0.39),
SIMDE_FLOAT32_C( -0.30), SIMDE_FLOAT32_C( -0.70), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( -1.00),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.98)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.35), SIMDE_FLOAT32_C( -0.44), SIMDE_FLOAT32_C( -0.75), SIMDE_FLOAT32_C( 0.93),
SIMDE_FLOAT32_C( 0.01), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( 0.51), SIMDE_FLOAT32_C( 0.01),
SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -0.58), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( 0.75)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.18),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.89), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( 0.13),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.78), SIMDE_FLOAT32_C( 0.44)),
UINT16_C(22274),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.50), SIMDE_FLOAT32_C( 0.92), SIMDE_FLOAT32_C( -0.72), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( -0.86), SIMDE_FLOAT32_C( 0.43), SIMDE_FLOAT32_C( 0.93), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.24), SIMDE_FLOAT32_C( -0.38),
SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.62), SIMDE_FLOAT32_C( -0.94), SIMDE_FLOAT32_C( 0.48)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 0.35), SIMDE_FLOAT32_C( 0.41), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( 0.11),
SIMDE_FLOAT32_C( 0.48), SIMDE_FLOAT32_C( -0.60), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.77),
SIMDE_FLOAT32_C( 0.22), SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( 0.44)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( -0.81), SIMDE_FLOAT32_C( -0.07), SIMDE_FLOAT32_C( -0.78),
SIMDE_FLOAT32_C( 0.09), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.07),
SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( 0.56), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.43)),
UINT16_C(27396),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.96), SIMDE_FLOAT32_C( -0.41), SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.76),
SIMDE_FLOAT32_C( 0.79), SIMDE_FLOAT32_C( 0.00), SIMDE_FLOAT32_C( -0.82), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( 0.58), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( -0.72),
SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.27), SIMDE_FLOAT32_C( -0.92), SIMDE_FLOAT32_C( -0.49)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.88), SIMDE_FLOAT32_C( -0.39), SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( -0.78),
SIMDE_FLOAT32_C( 0.66), SIMDE_FLOAT32_C( 0.21), SIMDE_FLOAT32_C( -0.68), SIMDE_FLOAT32_C( 0.16),
SIMDE_FLOAT32_C( -0.29), SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.32), SIMDE_FLOAT32_C( 0.78),
SIMDE_FLOAT32_C( -0.63), SIMDE_FLOAT32_C( 0.26), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.43)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 0.03), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( -0.08), SIMDE_FLOAT32_C( 0.08),
SIMDE_FLOAT32_C( -0.71), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( -0.89)),
UINT16_C( 953),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.02), SIMDE_FLOAT32_C( 0.81), SIMDE_FLOAT32_C( 0.14), SIMDE_FLOAT32_C( -0.43),
SIMDE_FLOAT32_C( 0.31), SIMDE_FLOAT32_C( -0.18), SIMDE_FLOAT32_C( -0.46), SIMDE_FLOAT32_C( 0.68),
SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( -0.27), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.58),
SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( -0.25), SIMDE_FLOAT32_C( -0.05), SIMDE_FLOAT32_C( 0.09)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( 0.53), SIMDE_FLOAT32_C( -0.02), SIMDE_FLOAT32_C( -0.97), SIMDE_FLOAT32_C( 0.64),
SIMDE_FLOAT32_C( 0.45), SIMDE_FLOAT32_C( -0.77), SIMDE_FLOAT32_C( -0.43), SIMDE_FLOAT32_C( 0.59),
SIMDE_FLOAT32_C( 0.07), SIMDE_FLOAT32_C( 0.64), SIMDE_FLOAT32_C( 0.12), SIMDE_FLOAT32_C( -0.52),
SIMDE_FLOAT32_C( -0.04), SIMDE_FLOAT32_C( 0.61), SIMDE_FLOAT32_C( 0.39), SIMDE_FLOAT32_C( 0.09)) },
{ simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( -0.49), SIMDE_FLOAT32_C( 0.82),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.82),
SIMDE_FLOAT32_C( -0.21), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( -0.73), SIMDE_FLOAT32_C( -0.42),
SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( -0.20)),
UINT16_C(12713),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.84), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.82), SIMDE_FLOAT32_C( 0.79),
SIMDE_FLOAT32_C( -0.74), SIMDE_FLOAT32_C( -0.31), SIMDE_FLOAT32_C( 0.73), SIMDE_FLOAT32_C( -0.35),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( 0.11), SIMDE_FLOAT32_C( 0.72), SIMDE_FLOAT32_C( -0.25),
SIMDE_FLOAT32_C( 0.94), SIMDE_FLOAT32_C( 0.36), SIMDE_FLOAT32_C( -0.01), SIMDE_FLOAT32_C( 0.85)),
simde_mm512_set_ps(SIMDE_FLOAT32_C( -0.79), SIMDE_FLOAT32_C( 0.33), SIMDE_FLOAT32_C( 0.68), SIMDE_FLOAT32_C( 0.66),
SIMDE_FLOAT32_C( 0.96), SIMDE_FLOAT32_C( 0.95), SIMDE_FLOAT32_C( 0.83), SIMDE_FLOAT32_C( -0.34),
SIMDE_FLOAT32_C( 0.06), SIMDE_FLOAT32_C( -0.93), SIMDE_FLOAT32_C( 0.62), SIMDE_FLOAT32_C( -0.42),
SIMDE_FLOAT32_C( 0.74), SIMDE_FLOAT32_C( 0.10), SIMDE_FLOAT32_C( 0.44), SIMDE_FLOAT32_C( 0.69)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512 r = simde_mm512_mask_tanh_ps(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_tanh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( 0.67),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.35)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.58),
SIMDE_FLOAT64_C( -0.29), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.04),
SIMDE_FLOAT64_C( -0.64), SIMDE_FLOAT64_C( 0.34)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( -0.69),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.27), SIMDE_FLOAT64_C( 0.47)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.59), SIMDE_FLOAT64_C( -0.60),
SIMDE_FLOAT64_C( 0.08), SIMDE_FLOAT64_C( 0.52),
SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( -0.26), SIMDE_FLOAT64_C( 0.44)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.38), SIMDE_FLOAT64_C( -0.92),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.86),
SIMDE_FLOAT64_C( -0.42), SIMDE_FLOAT64_C( 0.70)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.41), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( -0.73),
SIMDE_FLOAT64_C( -0.30), SIMDE_FLOAT64_C( -0.70),
SIMDE_FLOAT64_C( -0.40), SIMDE_FLOAT64_C( 0.60)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.45),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.69),
SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.98), SIMDE_FLOAT64_C( -0.66)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( -0.42),
SIMDE_FLOAT64_C( 0.23), SIMDE_FLOAT64_C( 0.60),
SIMDE_FLOAT64_C( 0.25), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.58)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( 0.38),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( 0.99),
SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.84)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.65), SIMDE_FLOAT64_C( 0.41),
SIMDE_FLOAT64_C( -0.52), SIMDE_FLOAT64_C( 0.36),
SIMDE_FLOAT64_C( -0.65), SIMDE_FLOAT64_C( 0.76),
SIMDE_FLOAT64_C( 0.03), SIMDE_FLOAT64_C( 0.69)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.39), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.34),
SIMDE_FLOAT64_C( 0.53), SIMDE_FLOAT64_C( -0.26),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -0.03)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.37), SIMDE_FLOAT64_C( 0.38),
SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.33),
SIMDE_FLOAT64_C( 0.49), SIMDE_FLOAT64_C( -0.25),
SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( -0.03)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.20), SIMDE_FLOAT64_C( -0.08),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.94),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( 0.40)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.20), SIMDE_FLOAT64_C( -0.08),
SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( -0.74),
SIMDE_FLOAT64_C( -0.64), SIMDE_FLOAT64_C( -0.65),
SIMDE_FLOAT64_C( -0.50), SIMDE_FLOAT64_C( 0.38)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.47), SIMDE_FLOAT64_C( 0.68),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( 0.91), SIMDE_FLOAT64_C( 0.60),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( 0.25)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.44), SIMDE_FLOAT64_C( 0.59),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.68),
SIMDE_FLOAT64_C( 0.72), SIMDE_FLOAT64_C( 0.54),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.24)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_tanh_pd(test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm512_mask_tanh_pd(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m512d src;
simde__mmask8 k;
simde__m512d a;
simde__m512d r;
} test_vec[8] = {
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.69), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( 0.67), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( 0.04), SIMDE_FLOAT64_C( 0.35)),
UINT8_C(139),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.68), SIMDE_FLOAT64_C( 0.08),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( -0.27),
SIMDE_FLOAT64_C( 0.50), SIMDE_FLOAT64_C( -0.30),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( -0.75)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.59), SIMDE_FLOAT64_C( 0.57),
SIMDE_FLOAT64_C( 0.42), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( 0.46), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( -0.64)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.18), SIMDE_FLOAT64_C( 0.23),
SIMDE_FLOAT64_C( 0.26), SIMDE_FLOAT64_C( -0.98),
SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( -0.38),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.42)),
UINT8_C(229),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.84), SIMDE_FLOAT64_C( -0.45),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.21),
SIMDE_FLOAT64_C( -0.66), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.92), SIMDE_FLOAT64_C( -0.86)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.42),
SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( -0.98),
SIMDE_FLOAT64_C( -0.44), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( -0.31), SIMDE_FLOAT64_C( -0.70)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.40), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.26),
SIMDE_FLOAT64_C( -0.03), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( 0.99)),
UINT8_C(253),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.55), SIMDE_FLOAT64_C( -0.39),
SIMDE_FLOAT64_C( 0.66), SIMDE_FLOAT64_C( 0.53),
SIMDE_FLOAT64_C( 0.78), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( -0.58), SIMDE_FLOAT64_C( -0.77)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.50), SIMDE_FLOAT64_C( -0.37),
SIMDE_FLOAT64_C( 0.58), SIMDE_FLOAT64_C( 0.49),
SIMDE_FLOAT64_C( 0.65), SIMDE_FLOAT64_C( -0.65),
SIMDE_FLOAT64_C( 0.38), SIMDE_FLOAT64_C( -0.65)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( 0.47),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.91),
SIMDE_FLOAT64_C( 0.79), SIMDE_FLOAT64_C( -0.20),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.75)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( -0.17),
SIMDE_FLOAT64_C( 0.68), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( 0.60), SIMDE_FLOAT64_C( 0.25),
SIMDE_FLOAT64_C( -0.08), SIMDE_FLOAT64_C( -0.94)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.12), SIMDE_FLOAT64_C( -0.17),
SIMDE_FLOAT64_C( -0.15), SIMDE_FLOAT64_C( 0.68),
SIMDE_FLOAT64_C( 0.54), SIMDE_FLOAT64_C( 0.24),
SIMDE_FLOAT64_C( 0.34), SIMDE_FLOAT64_C( -0.74)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.10), SIMDE_FLOAT64_C( -0.74),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.34),
SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( -0.53),
SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.66)),
UINT8_C(145),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.33), SIMDE_FLOAT64_C( 0.46),
SIMDE_FLOAT64_C( -0.18), SIMDE_FLOAT64_C( 0.32),
SIMDE_FLOAT64_C( -0.87), SIMDE_FLOAT64_C( -0.33),
SIMDE_FLOAT64_C( -0.19), SIMDE_FLOAT64_C( 0.56)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.32), SIMDE_FLOAT64_C( -0.74),
SIMDE_FLOAT64_C( 0.76), SIMDE_FLOAT64_C( 0.31),
SIMDE_FLOAT64_C( -0.80), SIMDE_FLOAT64_C( -0.53),
SIMDE_FLOAT64_C( -0.82), SIMDE_FLOAT64_C( 0.51)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.03),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.02),
SIMDE_FLOAT64_C( -0.45), SIMDE_FLOAT64_C( 0.51),
SIMDE_FLOAT64_C( 0.83), SIMDE_FLOAT64_C( 0.98)),
UINT8_C( 75),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.98), SIMDE_FLOAT64_C( 0.42),
SIMDE_FLOAT64_C( -0.10), SIMDE_FLOAT64_C( 0.84),
SIMDE_FLOAT64_C( -0.59), SIMDE_FLOAT64_C( 0.73),
SIMDE_FLOAT64_C( 0.62), SIMDE_FLOAT64_C( 0.14)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.76), SIMDE_FLOAT64_C( 0.40),
SIMDE_FLOAT64_C( 0.69), SIMDE_FLOAT64_C( -0.02),
SIMDE_FLOAT64_C( -0.53), SIMDE_FLOAT64_C( 0.51),
SIMDE_FLOAT64_C( 0.55), SIMDE_FLOAT64_C( 0.14)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( -0.30),
SIMDE_FLOAT64_C( -0.70), SIMDE_FLOAT64_C( 0.82),
SIMDE_FLOAT64_C( -1.00), SIMDE_FLOAT64_C( 0.92),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( -0.07)),
UINT8_C( 93),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.51), SIMDE_FLOAT64_C( 0.01),
SIMDE_FLOAT64_C( 0.92), SIMDE_FLOAT64_C( -0.77),
SIMDE_FLOAT64_C( -0.57), SIMDE_FLOAT64_C( -0.34),
SIMDE_FLOAT64_C( 0.29), SIMDE_FLOAT64_C( -0.58)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.39), SIMDE_FLOAT64_C( 0.01),
SIMDE_FLOAT64_C( -0.70), SIMDE_FLOAT64_C( -0.65),
SIMDE_FLOAT64_C( -0.52), SIMDE_FLOAT64_C( -0.33),
SIMDE_FLOAT64_C( -0.77), SIMDE_FLOAT64_C( -0.52)) },
{ simde_mm512_set_pd(SIMDE_FLOAT64_C( 0.48), SIMDE_FLOAT64_C( 0.94),
SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( -0.44),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( 0.93),
SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( -0.18)),
UINT8_C(213),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.78), SIMDE_FLOAT64_C( 0.44),
SIMDE_FLOAT64_C( 0.90), SIMDE_FLOAT64_C( -0.20),
SIMDE_FLOAT64_C( -0.36), SIMDE_FLOAT64_C( -0.03),
SIMDE_FLOAT64_C( 0.01), SIMDE_FLOAT64_C( -0.13)),
simde_mm512_set_pd(SIMDE_FLOAT64_C( -0.65), SIMDE_FLOAT64_C( 0.41),
SIMDE_FLOAT64_C( -0.35), SIMDE_FLOAT64_C( -0.20),
SIMDE_FLOAT64_C( -0.75), SIMDE_FLOAT64_C( -0.03),
SIMDE_FLOAT64_C( -0.33), SIMDE_FLOAT64_C( -0.13)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m512d r = simde_mm512_mask_tanh_pd(test_vec[i].src, test_vec[i].k, test_vec[i].a);
simde_assert_m512d_close(r, test_vec[i].r, 1);
}
return 0;
}
static int
test_simde_mm256_udivrem_epi32(SIMDE_MUNIT_TEST_ARGS) {
const struct {
simde__m256i a;
simde__m256i b;
simde__m256i rem;
simde__m256i r;
} test_vec[8] = {
{ simde_x_mm256_set_epu32(UINT32_C(3215450688), UINT32_C(3586813553), UINT32_C(1508722402), UINT32_C(2220621656),
UINT32_C(1747596798), UINT32_C(2231263307), UINT32_C( 527472553), UINT32_C(2891870298)),
simde_x_mm256_set_epu32(UINT32_C( 172780273), UINT32_C( 168508556), UINT32_C(3803608574), UINT32_C(4064895559),
UINT32_C(4201299039), UINT32_C(3984766001), UINT32_C( 392212716), UINT32_C(4009222911)),
simde_x_mm256_set_epu32(UINT32_C( 105405774), UINT32_C( 48133877), UINT32_C(1508722402), UINT32_C(2220621656),
UINT32_C(1747596798), UINT32_C(2231263307), UINT32_C( 135259837), UINT32_C(2891870298)),
simde_mm256_set_epi32(INT32_C( 18), INT32_C( 21), INT32_C( 0), INT32_C( 0),
INT32_C( 0), INT32_C( 0), INT32_C( 1), INT32_C( 0)) },
{ simde_x_mm256_set_epu32(UINT32_C(1192263444), UINT32_C(2208623573), UINT32_C(1322777130), UINT32_C( 163989560),
UINT32_C(1492341726), UINT32_C( 298608154), UINT32_C(1250819173), UINT32_C(3643996043)),
simde_x_mm256_set_epu32(UINT32_C(3853764578), UINT32_C( 294920921), UINT32_C(3883385645), UINT32_C(4126975473),
UINT32_C(3898385479), UINT32_C( 422762821), UINT32_C( 12586973), UINT32_C( 182106357)),
simde_x_mm256_set_epu32(UINT32_C(1192263444), UINT32_C( 144177126), UINT32_C(1322777130), UINT32_C( 163989560),
UINT32_C(1492341726), UINT32_C( 298608154), UINT32_C( 4708846), UINT32_C( 1868903)),
simde_mm256_set_epi32(INT32_C( 0), INT32_C( 7), INT32_C( 0), INT32_C( 0),
INT32_C( 0), INT32_C( 0), INT32_C( 99), INT32_C( 20)) },
{ simde_x_mm256_set_epu32(UINT32_C( 493161721), UINT32_C(3099851477), UINT32_C( 894221337), UINT32_C(2964507124),
UINT32_C( 492373082), UINT32_C(4281870485), UINT32_C(2207786213), UINT32_C(3953959418)),
simde_x_mm256_set_epu32(UINT32_C( 328620632), UINT32_C(3970654641), UINT32_C(4110215287), UINT32_C(3940207296),
UINT32_C(4043901133), UINT32_C( 395141437), UINT32_C(4177201181), UINT32_C( 520340456)),
simde_x_mm256_set_epu32(UINT32_C( 164541089), UINT32_C(3099851477), UINT32_C( 894221337), UINT32_C(2964507124),
UINT32_C( 492373082), UINT32_C( 330456115), UINT32_C(2207786213), UINT32_C( 311576226)),
simde_mm256_set_epi32(INT32_C( 1), INT32_C( 0), INT32_C( 0), INT32_C( 0),
INT32_C( 0), INT32_C( 10), INT32_C( 0), INT32_C( 7)) },
{ simde_x_mm256_set_epu32(UINT32_C(1710148738), UINT32_C(1974123080), UINT32_C(2870600100), UINT32_C( 118588227),
UINT32_C( 542053192), UINT32_C( 499863549), UINT32_C( 957375358), UINT32_C(3003933707)),
simde_x_mm256_set_epu32(UINT32_C(4010243988), UINT32_C(4123176886), UINT32_C( 457043765), UINT32_C(4197612290),
UINT32_C(4246664437), UINT32_C(4080470003), UINT32_C(4182884971), UINT32_C(3894626243)),
simde_x_mm256_set_epu32(UINT32_C(1710148738), UINT32_C(1974123080), UINT32_C( 128337510), UINT32_C( 118588227),
UINT32_C( 542053192), UINT32_C( 499863549), UINT32_C( 957375358), UINT32_C(3003933707)),
simde_mm256_set_epi32(INT32_C( 0), INT32_C( 0), INT32_C( 6), INT32_C( 0),
INT32_C( 0), INT32_C( 0), INT32_C( 0), INT32_C( 0)) },
{ simde_x_mm256_set_epu32(UINT32_C(1734496959), UINT32_C( 380846712), UINT32_C(3352999607), UINT32_C(3555523675),
UINT32_C(1995198557), UINT32_C(3314312199), UINT32_C(2406584253), UINT32_C(1779168063)),
simde_x_mm256_set_epu32(UINT32_C( 440775120), UINT32_C(4165466156), UINT32_C(3932377571), UINT32_C(3942500746),
UINT32_C( 67477586), UINT32_C( 108492873), UINT32_C( 360489056), UINT32_C( 254567893)),
simde_x_mm256_set_epu32(UINT32_C( 412171599), UINT32_C( 380846712), UINT32_C(3352999607), UINT32_C(3555523675),
UINT32_C( 38348563), UINT32_C( 59526009), UINT32_C( 243649917), UINT32_C( 251760705)),
simde_mm256_set_epi32(INT32_C( 3), INT32_C( 0), INT32_C( 0), INT32_C( 0),
INT32_C( 29), INT32_C( 30), INT32_C( 6), INT32_C( 6)) },
{ simde_x_mm256_set_epu32(UINT32_C(3932090380), UINT32_C(2449576763), UINT32_C(4246346280), UINT32_C( 201516689),
UINT32_C(2859036576), UINT32_C(2362091228), UINT32_C(3141663427), UINT32_C( 562234020)),
simde_x_mm256_set_epu32(UINT32_C(4128600985), UINT32_C(4209418337), UINT32_C( 525546139), UINT32_C( 219277873),
UINT32_C( 295872976), UINT32_C(4150814551), UINT32_C(4029638246), UINT32_C(4092942946)),
simde_x_mm256_set_epu32(UINT32_C(3932090380), UINT32_C(2449576763), UINT32_C( 41977168), UINT32_C( 201516689),
UINT32_C( 196179792), UINT32_C(2362091228), UINT32_C(3141663427), UINT32_C( 562234020)),
simde_mm256_set_epi32(INT32_C( 0), INT32_C( 0), INT32_C( 8), INT32_C( 0),
INT32_C( 9), INT32_C( 0), INT32_C( 0), INT32_C( 0)) },
{ simde_x_mm256_set_epu32(UINT32_C( 910061584), UINT32_C(2002226944), UINT32_C(3673004107), UINT32_C(4246624078),
UINT32_C( 523093293), UINT32_C(3059761572), UINT32_C(2206005509), UINT32_C(1943141679)),
simde_x_mm256_set_epu32(UINT32_C( 123967721), UINT32_C(4199435689), UINT32_C( 228811177), UINT32_C( 1270356),
UINT32_C( 355625346), UINT32_C(4253972365), UINT32_C(3915742229), UINT32_C( 124491394)),
simde_x_mm256_set_epu32(UINT32_C( 42287537), UINT32_C(2002226944), UINT32_C( 12025275), UINT32_C( 1094326),
UINT32_C( 167467947), UINT32_C(3059761572), UINT32_C(2206005509), UINT32_C( 75770769)),
simde_mm256_set_epi32(INT32_C( 7), INT32_C( 0), INT32_C( 16), INT32_C( 3342),
INT32_C( 1), INT32_C( 0), INT32_C( 0), INT32_C( 15)) },
{ simde_x_mm256_set_epu32(UINT32_C(1755684145), UINT32_C(2233240925), UINT32_C(3244523643), UINT32_C(2995026741),
UINT32_C(2178270751), UINT32_C(1493088054), UINT32_C(4115137419), UINT32_C( 651362699)),
simde_x_mm256_set_epu32(UINT32_C( 301617823), UINT32_C( 343728879), UINT32_C( 132913279), UINT32_C( 518796827),
UINT32_C(4258812658), UINT32_C(3762000867), UINT32_C( 361195763), UINT32_C( 469656308)),
simde_x_mm256_set_epu32(UINT32_C( 247595030), UINT32_C( 170867651), UINT32_C( 54604947), UINT32_C( 401042606),
UINT32_C(2178270751), UINT32_C(1493088054), UINT32_C( 141984026), UINT32_C( 181706391)),
simde_mm256_set_epi32(INT32_C( 5), INT32_C( 6), INT32_C( 24), INT32_C( 5),
INT32_C( 0), INT32_C( 0), INT32_C( 11), INT32_C( 1)) }
};
for (size_t i = 0 ; i < (sizeof(test_vec) / sizeof(test_vec[0])); i++) {
simde__m256i rem;
simde__m256i r = simde_mm256_udivrem_epi32(&rem, test_vec[i].a, test_vec[i].b);
simde_assert_m256i_u32(r, ==, test_vec[i].r);
simde_assert_m256i_u32(rem, ==, test_vec[i].rem);
}
return 0;
}
HEDLEY_DIAGNOSTIC_PUSH
HEDLEY_DIAGNOSTIC_DISABLE_CAST_QUAL
#if HEDLEY_HAS_WARNING("-Wold-style-cast")
#pragma clang diagnostic ignored "-Wold-style-cast"
#endif
#if HEDLEY_HAS_WARNING("-Wzero-as-null-pointer-constant")
#pragma clang diagnostic ignored "-Wzero-as-null-pointer-constant"
#endif
SIMDE_TEST_FUNC_LIST_BEGIN
SIMDE_TEST_FUNC_LIST_ENTRY(mm_acos_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_acos_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_acos_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_acos_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_acos_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_acos_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_acos_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_acos_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_acosh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_acosh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_acosh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_acosh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_acosh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_acosh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_acosh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_acosh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_asin_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_asin_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_asin_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_asin_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_asin_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_asin_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_asin_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_asin_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_asinh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_asinh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_asinh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_asinh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_asinh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_asinh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_asinh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_asinh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_atan_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_atan_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_atan_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_atan_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_atan_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_atan_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_atan_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_atan_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_atan2_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_atan2_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_atan2_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_atan2_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_atan2_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_atan2_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_atan2_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_atan2_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_atanh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_atanh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_atanh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_atanh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_atanh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_atanh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_atanh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_atanh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cbrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cbrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cbrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cbrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_cbrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_cbrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_cbrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_cbrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cdfnorm_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cdfnorm_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cdfnorm_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cdfnorm_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_cdfnorm_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_cdfnorm_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_cdfnorm_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_cdfnorm_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cdfnorminv_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cdfnorminv_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cdfnorminv_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cdfnorminv_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_cdfnorminv_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_cdfnorminv_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_cdfnorminv_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_cdfnorminv_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cos_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cos_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cos_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cos_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_cos_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_cos_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_cos_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_cos_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cosd_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cosd_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cosd_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cosd_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_cosd_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_cosd_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_cosd_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_cosd_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cosh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cosh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cosh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cosh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_cosh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_cosh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_cosh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_cosh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_cexp_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_cexp_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_clog_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_clog_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_csqrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_csqrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(x_mm_deg2rad_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(x_mm_deg2rad_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(x_mm256_deg2rad_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(x_mm256_deg2rad_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(x_mm512_deg2rad_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(x_mm512_deg2rad_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_div_epi8)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_div_epi16)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_div_epi32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_div_epi32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_div_epi64)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_div_epu8)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_div_epu16)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_div_epu32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_div_epu64)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_div_epi8)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_div_epi16)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_div_epi32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_div_epi64)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_div_epu8)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_div_epu16)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_div_epu32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_div_epu32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_div_epu64)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_div_epi8)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_div_epi16)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_div_epi32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_div_epi64)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_div_epu8)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_div_epu16)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_div_epu32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_div_epu64)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_erf_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_erf_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_erf_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_erf_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_erf_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_erf_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_erf_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_erf_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_erfinv_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_erfinv_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_erfinv_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_erfinv_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_erfinv_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_erfinv_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_erfinv_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_erfinv_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_erfc_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_erfc_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_erfc_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_erfc_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_erfc_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_erfc_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_erfc_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_erfc_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_erfcinv_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_erfcinv_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_erfcinv_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_erfcinv_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_erfcinv_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_erfcinv_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_erfcinv_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_erfcinv_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_exp_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_exp_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_exp_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_exp_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_exp_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_exp_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_exp_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_exp_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_expm1_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_expm1_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_expm1_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_expm1_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_expm1_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_expm1_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_expm1_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_expm1_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_exp2_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_exp2_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_exp2_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_exp2_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_exp2_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_exp2_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_exp2_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_exp2_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_exp10_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_exp10_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_exp10_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_exp10_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_exp10_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_exp10_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_exp10_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_exp10_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_idivrem_epi32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_idivrem_epi32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_hypot_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_hypot_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_hypot_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_hypot_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_hypot_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_hypot_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_hypot_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_hypot_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_invcbrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_invcbrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_invcbrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_invcbrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_invsqrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_invsqrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_invsqrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_invsqrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_invsqrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_invsqrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_invsqrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_invsqrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_log_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_log_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_log_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_log_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_log_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_log_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_log_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_log_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_log1p_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_log1p_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_log1p_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_log1p_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_log1p_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_log1p_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_log1p_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_log1p_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_log2_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_log2_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_log2_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_log2_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_log2_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_log2_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_log2_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_log2_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_log10_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_log10_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_log10_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_log10_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_log10_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_log10_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_log10_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_log10_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_logb_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_logb_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_logb_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_logb_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_logb_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_logb_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_logb_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_logb_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_nearbyint_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_nearbyint_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_nearbyint_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_nearbyint_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_pow_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_pow_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_pow_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_pow_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_pow_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_pow_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_pow_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_pow_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_rem_epi8)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_rem_epi16)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_rem_epi32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_rem_epi64)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_rem_epu8)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_rem_epu16)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_rem_epu32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_rem_epu64)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_rem_epi8)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_rem_epi16)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_rem_epi32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_rem_epi32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_rem_epi64)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_rem_epu8)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_rem_epu16)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_rem_epu32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_rem_epu32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_rem_epu64)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_rem_epi8)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_rem_epi16)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_rem_epi32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_rem_epi64)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_rem_epu8)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_rem_epu16)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_rem_epu32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_rem_epu64)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_recip_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_recip_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_recip_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_recip_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_rint_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_rint_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_rint_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_rint_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_sin_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_sin_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_sin_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_sin_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_sin_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_sin_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_sin_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_sin_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_sincos_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_sincos_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_sincos_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_sincos_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_sincos_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_sincos_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_sincos_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_sincos_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_sind_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_sind_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_sind_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_sind_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_sind_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_sind_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_sind_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_sind_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_sinh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_sinh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_sinh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_sinh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_sinh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_sinh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_sinh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_sinh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_svml_ceil_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_svml_ceil_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_svml_ceil_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_svml_ceil_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_ceil_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_ceil_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_ceil_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_ceil_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_svml_floor_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_svml_floor_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_svml_floor_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_svml_floor_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_floor_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_floor_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_floor_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_floor_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_svml_round_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_svml_round_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_svml_round_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_svml_round_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_svml_round_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_svml_round_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_svml_sqrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_svml_sqrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_svml_sqrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_svml_sqrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_svml_sqrt_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_svml_sqrt_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_tan_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_tan_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_tan_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_tan_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_tan_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_tan_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_tan_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_tan_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_tand_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_tand_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_tand_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_tand_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_tand_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_tand_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_tand_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_tand_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_tanh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_tanh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_tanh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_tanh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_tanh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_tanh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_tanh_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_tanh_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_trunc_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_trunc_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_trunc_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_trunc_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_trunc_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_trunc_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_trunc_ps)
SIMDE_TEST_FUNC_LIST_ENTRY(mm512_mask_trunc_pd)
SIMDE_TEST_FUNC_LIST_ENTRY(mm_udivrem_epi32)
SIMDE_TEST_FUNC_LIST_ENTRY(mm256_udivrem_epi32)
SIMDE_TEST_FUNC_LIST_END
#include <test/x86/test-x86-footer.h>
|