1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 1032 1033 1034 1035 1036 1037 1038 1039 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057 1058 1059 1060 1061 1062 1063 1064 1065 1066 1067 1068 1069 1070 1071 1072 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085 1086 1087 1088 1089 1090 1091 1092 1093 1094 1095 1096 1097 1098 1099 1100 1101 1102 1103 1104 1105 1106 1107 1108 1109 1110 1111 1112 1113 1114 1115 1116 1117 1118 1119 1120 1121 1122 1123 1124 1125 1126 1127 1128 1129 1130 1131 1132 1133 1134 1135 1136 1137 1138 1139 1140 1141 1142 1143 1144 1145 1146 1147 1148 1149 1150 1151 1152 1153 1154 1155 1156 1157 1158 1159 1160 1161 1162 1163 1164 1165 1166 1167 1168 1169 1170 1171 1172 1173 1174 1175 1176 1177 1178 1179 1180 1181 1182 1183 1184 1185 1186 1187 1188 1189 1190 1191 1192 1193 1194 1195 1196 1197 1198 1199 1200 1201 1202 1203 1204 1205 1206 1207 1208 1209 1210 1211 1212 1213 1214 1215 1216 1217 1218 1219 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239 1240 1241 1242 1243 1244 1245 1246 1247 1248 1249 1250 1251 1252 1253 1254 1255 1256 1257 1258 1259 1260 1261 1262 1263 1264 1265 1266 1267 1268 1269 1270 1271 1272 1273 1274 1275 1276 1277 1278 1279 1280 1281 1282 1283 1284 1285 1286 1287 1288 1289 1290 1291 1292 1293 1294 1295 1296 1297 1298 1299 1300 1301 1302 1303 1304 1305 1306 1307 1308 1309 1310 1311 1312 1313 1314 1315 1316 1317 1318 1319 1320 1321 1322 1323 1324 1325 1326 1327 1328 1329 1330 1331 1332 1333 1334 1335 1336 1337 1338 1339 1340 1341 1342 1343 1344 1345 1346 1347 1348 1349 1350 1351 1352 1353 1354 1355 1356 1357 1358 1359 1360 1361 1362 1363 1364 1365 1366 1367 1368 1369 1370 1371 1372 1373 1374 1375 1376 1377 1378 1379 1380 1381 1382 1383 1384 1385 1386 1387 1388 1389 1390 1391 1392 1393 1394 1395 1396 1397 1398 1399 1400 1401 1402 1403 1404 1405 1406 1407 1408 1409 1410 1411 1412 1413 1414 1415 1416 1417 1418 1419 1420 1421 1422 1423 1424 1425 1426 1427 1428 1429 1430 1431 1432 1433 1434 1435 1436 1437 1438 1439 1440 1441 1442 1443 1444 1445 1446 1447 1448 1449 1450 1451 1452 1453 1454 1455 1456 1457 1458 1459 1460 1461 1462 1463 1464 1465 1466 1467 1468 1469 1470 1471 1472 1473 1474 1475 1476 1477 1478 1479 1480 1481 1482 1483 1484 1485 1486 1487 1488 1489 1490 1491 1492 1493 1494 1495 1496 1497 1498 1499 1500 1501 1502 1503 1504 1505 1506 1507 1508 1509 1510 1511 1512 1513 1514 1515 1516 1517 1518 1519 1520 1521 1522 1523 1524 1525 1526 1527 1528 1529 1530 1531 1532 1533 1534 1535 1536 1537 1538 1539 1540 1541 1542 1543 1544 1545 1546 1547 1548 1549 1550 1551 1552 1553 1554 1555 1556 1557 1558 1559 1560 1561 1562 1563 1564 1565 1566 1567 1568 1569 1570 1571 1572 1573 1574 1575 1576 1577 1578 1579 1580 1581 1582 1583 1584 1585 1586 1587 1588 1589 1590 1591 1592 1593 1594 1595 1596 1597 1598 1599 1600 1601 1602 1603 1604 1605 1606 1607 1608 1609 1610 1611 1612 1613 1614 1615 1616 1617 1618 1619 1620 1621 1622 1623 1624 1625 1626 1627 1628 1629 1630 1631 1632 1633 1634 1635 1636 1637 1638 1639 1640 1641 1642 1643 1644 1645 1646 1647 1648 1649 1650 1651 1652 1653 1654 1655 1656 1657 1658 1659 1660 1661 1662 1663 1664 1665 1666 1667 1668 1669 1670 1671 1672 1673 1674 1675 1676 1677 1678 1679 1680 1681 1682 1683 1684 1685 1686 1687 1688 1689 1690 1691 1692 1693 1694 1695 1696 1697 1698 1699 1700 1701 1702 1703 1704 1705 1706 1707 1708 1709 1710 1711 1712 1713 1714 1715 1716 1717 1718 1719 1720 1721 1722 1723 1724 1725 1726 1727 1728 1729 1730 1731 1732 1733 1734 1735 1736 1737 1738 1739 1740 1741 1742 1743 1744 1745 1746 1747 1748 1749 1750 1751 1752 1753 1754 1755 1756 1757 1758 1759 1760 1761 1762 1763 1764 1765 1766 1767 1768 1769 1770 1771 1772 1773 1774 1775 1776 1777 1778 1779 1780 1781 1782 1783 1784 1785 1786 1787 1788 1789 1790 1791 1792 1793 1794 1795 1796 1797 1798 1799 1800 1801 1802 1803 1804 1805 1806 1807 1808 1809 1810 1811 1812 1813 1814 1815 1816 1817 1818 1819 1820 1821 1822 1823 1824 1825 1826 1827 1828 1829 1830 1831 1832 1833 1834 1835 1836 1837 1838 1839 1840 1841 1842 1843 1844 1845 1846 1847 1848 1849 1850 1851 1852 1853 1854 1855 1856 1857 1858 1859 1860 1861 1862 1863 1864 1865 1866 1867 1868 1869 1870 1871 1872 1873 1874 1875 1876 1877 1878 1879 1880 1881 1882 1883 1884 1885 1886 1887 1888 1889 1890 1891 1892 1893 1894 1895 1896 1897 1898 1899 1900 1901 1902 1903 1904 1905 1906 1907 1908 1909 1910 1911 1912 1913 1914 1915 1916 1917 1918 1919 1920 1921 1922 1923 1924 1925 1926 1927 1928 1929 1930 1931 1932 1933 1934 1935 1936 1937 1938 1939 1940 1941 1942 1943 1944 1945 1946 1947 1948 1949 1950 1951 1952 1953 1954 1955 1956 1957 1958 1959 1960 1961 1962 1963 1964 1965 1966 1967 1968 1969 1970 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 2025 2026 2027 2028 2029 2030 2031 2032 2033 2034 2035 2036 2037 2038 2039 2040 2041 2042 2043 2044 2045 2046 2047 2048 2049 2050 2051 2052 2053 2054 2055 2056 2057 2058 2059 2060 2061 2062 2063 2064 2065 2066 2067 2068 2069 2070 2071 2072 2073 2074 2075 2076 2077 2078 2079 2080 2081 2082 2083 2084 2085 2086 2087 2088 2089 2090 2091 2092 2093 2094 2095 2096 2097 2098 2099 2100 2101 2102 2103 2104 2105 2106 2107 2108 2109 2110 2111 2112 2113 2114 2115 2116 2117 2118 2119 2120 2121 2122 2123 2124 2125 2126 2127 2128 2129 2130 2131 2132 2133 2134 2135 2136 2137 2138 2139 2140 2141 2142 2143 2144 2145 2146 2147 2148 2149 2150 2151 2152 2153 2154 2155 2156 2157 2158 2159 2160 2161 2162 2163 2164 2165 2166 2167 2168 2169 2170 2171 2172 2173 2174 2175 2176 2177 2178 2179 2180 2181 2182 2183 2184 2185 2186 2187 2188 2189 2190 2191 2192 2193 2194 2195 2196 2197 2198 2199 2200 2201 2202 2203 2204 2205 2206 2207 2208 2209 2210 2211 2212 2213 2214 2215 2216 2217 2218 2219 2220 2221 2222 2223 2224 2225 2226 2227 2228 2229 2230 2231 2232 2233 2234 2235 2236 2237 2238 2239 2240 2241 2242 2243 2244 2245 2246 2247 2248 2249 2250 2251 2252 2253 2254 2255 2256 2257 2258 2259 2260 2261 2262 2263 2264 2265 2266 2267 2268 2269 2270 2271 2272 2273 2274 2275 2276 2277 2278 2279 2280 2281 2282 2283 2284 2285 2286 2287 2288 2289 2290 2291 2292 2293 2294 2295 2296 2297 2298 2299 2300 2301 2302 2303 2304 2305 2306 2307 2308 2309 2310 2311 2312 2313 2314 2315 2316 2317 2318 2319 2320 2321 2322 2323 2324 2325 2326 2327 2328 2329 2330 2331 2332 2333 2334 2335 2336 2337 2338 2339 2340 2341 2342 2343 2344 2345 2346 2347 2348 2349 2350 2351 2352 2353 2354 2355 2356 2357 2358 2359 2360 2361 2362 2363 2364 2365 2366 2367 2368 2369 2370 2371 2372 2373 2374 2375 2376 2377 2378 2379 2380 2381 2382 2383 2384 2385 2386 2387 2388 2389 2390 2391 2392 2393 2394 2395 2396 2397 2398 2399 2400 2401 2402 2403 2404 2405 2406 2407 2408 2409 2410 2411 2412 2413 2414 2415 2416 2417 2418 2419 2420 2421 2422 2423 2424 2425 2426 2427 2428 2429 2430 2431 2432 2433 2434 2435 2436 2437 2438 2439 2440 2441 2442 2443 2444 2445 2446 2447 2448 2449 2450 2451 2452 2453 2454 2455 2456 2457 2458 2459 2460 2461 2462 2463 2464 2465 2466 2467 2468 2469 2470 2471 2472 2473 2474 2475 2476 2477 2478 2479 2480 2481 2482 2483 2484 2485 2486 2487 2488 2489 2490 2491 2492 2493 2494 2495 2496 2497 2498 2499 2500 2501 2502 2503 2504 2505 2506 2507 2508 2509 2510 2511 2512 2513 2514 2515 2516 2517 2518 2519 2520 2521 2522 2523 2524 2525 2526 2527 2528 2529 2530 2531 2532 2533 2534 2535 2536 2537 2538 2539 2540 2541 2542 2543 2544 2545 2546 2547 2548 2549 2550 2551 2552 2553 2554 2555 2556 2557 2558 2559 2560 2561 2562 2563 2564 2565 2566 2567 2568 2569 2570 2571 2572 2573 2574 2575 2576 2577 2578 2579 2580 2581 2582 2583 2584 2585 2586 2587 2588 2589 2590 2591 2592 2593 2594 2595 2596 2597 2598 2599 2600 2601 2602 2603 2604 2605 2606 2607 2608 2609 2610 2611 2612 2613 2614 2615 2616 2617 2618 2619 2620 2621 2622 2623 2624 2625 2626 2627 2628 2629 2630 2631 2632 2633 2634 2635 2636 2637 2638 2639 2640 2641 2642 2643 2644 2645 2646 2647 2648 2649 2650 2651 2652 2653 2654 2655 2656 2657 2658 2659 2660 2661 2662 2663 2664 2665 2666 2667 2668 2669 2670 2671 2672 2673 2674 2675 2676 2677 2678 2679 2680 2681 2682 2683 2684 2685 2686 2687 2688 2689 2690 2691 2692 2693 2694 2695 2696 2697 2698 2699 2700 2701 2702 2703 2704 2705 2706 2707 2708 2709 2710 2711 2712 2713 2714 2715 2716 2717 2718 2719 2720 2721 2722 2723 2724 2725 2726 2727 2728 2729 2730 2731 2732 2733 2734 2735 2736 2737 2738 2739 2740 2741 2742 2743 2744 2745 2746 2747 2748 2749 2750 2751 2752 2753 2754 2755 2756 2757 2758 2759 2760 2761 2762 2763 2764 2765 2766 2767 2768 2769 2770 2771 2772 2773 2774 2775 2776 2777 2778 2779 2780 2781 2782 2783 2784 2785 2786 2787 2788 2789 2790 2791 2792 2793 2794 2795 2796 2797 2798 2799 2800 2801 2802 2803 2804 2805 2806 2807 2808 2809 2810 2811 2812 2813 2814 2815 2816 2817 2818 2819 2820 2821 2822 2823 2824 2825 2826 2827 2828 2829 2830 2831 2832 2833 2834 2835 2836 2837 2838 2839 2840 2841 2842 2843 2844 2845 2846 2847 2848 2849 2850 2851 2852 2853 2854 2855 2856 2857 2858 2859 2860 2861 2862 2863 2864 2865 2866 2867 2868 2869 2870 2871 2872 2873 2874
|
// Code generated by smithy-go-codegen DO NOT EDIT.
package types
import (
smithydocument "github.com/aws/smithy-go/document"
"time"
)
// An augmented manifest file that provides training data for your custom model.
// An augmented manifest file is a labeled dataset that is produced by Amazon
// SageMaker Ground Truth.
type AugmentedManifestsListItem struct {
// The JSON attribute that contains the annotations for your training documents.
// The number of attribute names that you specify depends on whether your augmented
// manifest file is the output of a single labeling job or a chained labeling job.
// If your file is the output of a single labeling job, specify the
// LabelAttributeName key that was used when the job was created in Ground Truth.
// If your file is the output of a chained labeling job, specify the
// LabelAttributeName key for one or more jobs in the chain. Each
// LabelAttributeName key provides the annotations from an individual job.
//
// This member is required.
AttributeNames []string
// The Amazon S3 location of the augmented manifest file.
//
// This member is required.
S3Uri *string
// The S3 prefix to the annotation files that are referred in the augmented
// manifest file.
AnnotationDataS3Uri *string
// The type of augmented manifest. PlainTextDocument or SemiStructuredDocument. If
// you don't specify, the default is PlainTextDocument.
// - PLAIN_TEXT_DOCUMENT A document type that represents any unicode text that is
// encoded in UTF-8.
// - SEMI_STRUCTURED_DOCUMENT A document type with positional and structural
// context, like a PDF. For training with Amazon Comprehend, only PDFs are
// supported. For inference, Amazon Comprehend support PDFs, DOCX and TXT.
DocumentType AugmentedManifestsDocumentTypeFormat
// The S3 prefix to the source files (PDFs) that are referred to in the augmented
// manifest file.
SourceDocumentsS3Uri *string
// The purpose of the data you've provided in the augmented manifest. You can
// either train or test this data. If you don't specify, the default is train.
// TRAIN - all of the documents in the manifest will be used for training. If no
// test documents are provided, Amazon Comprehend will automatically reserve a
// portion of the training documents for testing. TEST - all of the documents in
// the manifest will be used for testing.
Split Split
noSmithyDocumentSerde
}
// The result of calling the operation. The operation returns one object for each
// document that is successfully processed by the operation.
type BatchDetectDominantLanguageItemResult struct {
// The zero-based index of the document in the input list.
Index *int32
// One or more DominantLanguage objects describing the dominant languages in the
// document.
Languages []DominantLanguage
noSmithyDocumentSerde
}
// The result of calling the operation. The operation returns one object for each
// document that is successfully processed by the operation.
type BatchDetectEntitiesItemResult struct {
// One or more Entity objects, one for each entity detected in the document.
Entities []Entity
// The zero-based index of the document in the input list.
Index *int32
noSmithyDocumentSerde
}
// The result of calling the operation. The operation returns one object for each
// document that is successfully processed by the operation.
type BatchDetectKeyPhrasesItemResult struct {
// The zero-based index of the document in the input list.
Index *int32
// One or more KeyPhrase objects, one for each key phrase detected in the document.
KeyPhrases []KeyPhrase
noSmithyDocumentSerde
}
// The result of calling the operation. The operation returns one object for each
// document that is successfully processed by the operation.
type BatchDetectSentimentItemResult struct {
// The zero-based index of the document in the input list.
Index *int32
// The sentiment detected in the document.
Sentiment SentimentType
// The level of confidence that Amazon Comprehend has in the accuracy of its
// sentiment detection.
SentimentScore *SentimentScore
noSmithyDocumentSerde
}
// The result of calling the operation. The operation returns one object that is
// successfully processed by the operation.
type BatchDetectSyntaxItemResult struct {
// The zero-based index of the document in the input list.
Index *int32
// The syntax tokens for the words in the document, one token for each word.
SyntaxTokens []SyntaxToken
noSmithyDocumentSerde
}
// Analysis results for one of the documents in the batch.
type BatchDetectTargetedSentimentItemResult struct {
// An array of targeted sentiment entities.
Entities []TargetedSentimentEntity
// The zero-based index of this result in the input list.
Index *int32
noSmithyDocumentSerde
}
// Describes an error that occurred while processing a document in a batch. The
// operation returns on BatchItemError object for each document that contained an
// error.
type BatchItemError struct {
// The numeric error code of the error.
ErrorCode *string
// A text description of the error.
ErrorMessage *string
// The zero-based index of the document in the input list.
Index *int32
noSmithyDocumentSerde
}
// Information about each word or line of text in the input document. For
// additional information, see Block (https://docs.aws.amazon.com/textract/latest/dg/API_Block.html)
// in the Amazon Textract API reference.
type Block struct {
// The block represents a line of text or one word of text.
// - WORD - A word that's detected on a document page. A word is one or more ISO
// basic Latin script characters that aren't separated by spaces.
// - LINE - A string of tab-delimited, contiguous words that are detected on a
// document page
BlockType BlockType
// Co-ordinates of the rectangle or polygon that contains the text.
Geometry *Geometry
// Unique identifier for the block.
Id *string
// Page number where the block appears.
Page *int32
// A list of child blocks of the current block. For example, a LINE object has
// child blocks for each WORD block that's part of the line of text.
Relationships []RelationshipsListItem
// The word or line of text extracted from the block.
Text *string
noSmithyDocumentSerde
}
// A reference to a block.
type BlockReference struct {
// Offset of the start of the block within its parent block.
BeginOffset *int32
// Unique identifier for the block.
BlockId *string
// List of child blocks within this block.
ChildBlocks []ChildBlock
// Offset of the end of the block within its parent block.
EndOffset *int32
noSmithyDocumentSerde
}
// The bounding box around the detected page or around an element on a document
// page. The left (x-coordinate) and top (y-coordinate) are coordinates that
// represent the top and left sides of the bounding box. Note that the upper-left
// corner of the image is the origin (0,0). For additional information, see
// BoundingBox (https://docs.aws.amazon.com/textract/latest/dg/API_BoundingBox.html)
// in the Amazon Textract API reference.
type BoundingBox struct {
// The height of the bounding box as a ratio of the overall document page height.
Height *float32
// The left coordinate of the bounding box as a ratio of overall document page
// width.
Left *float32
// The top coordinate of the bounding box as a ratio of overall document page
// height.
Top *float32
// The width of the bounding box as a ratio of the overall document page width.
Width *float32
noSmithyDocumentSerde
}
// Nested block contained within a block.
type ChildBlock struct {
// Offset of the start of the child block within its parent block.
BeginOffset *int32
// Unique identifier for the child block.
ChildBlockId *string
// Offset of the end of the child block within its parent block.
EndOffset *int32
noSmithyDocumentSerde
}
// Describes the result metrics for the test data associated with an documentation
// classifier.
type ClassifierEvaluationMetrics struct {
// The fraction of the labels that were correct recognized. It is computed by
// dividing the number of labels in the test documents that were correctly
// recognized by the total number of labels in the test documents.
Accuracy *float64
// A measure of how accurate the classifier results are for the test data. It is
// derived from the Precision and Recall values. The F1Score is the harmonic
// average of the two scores. The highest score is 1, and the worst score is 0.
F1Score *float64
// Indicates the fraction of labels that are incorrectly predicted. Also seen as
// the fraction of wrong labels compared to the total number of labels. Scores
// closer to zero are better.
HammingLoss *float64
// A measure of how accurate the classifier results are for the test data. It is a
// combination of the Micro Precision and Micro Recall values. The Micro F1Score
// is the harmonic mean of the two scores. The highest score is 1, and the worst
// score is 0.
MicroF1Score *float64
// A measure of the usefulness of the recognizer results in the test data. High
// precision means that the recognizer returned substantially more relevant results
// than irrelevant ones. Unlike the Precision metric which comes from averaging the
// precision of all available labels, this is based on the overall score of all
// precision scores added together.
MicroPrecision *float64
// A measure of how complete the classifier results are for the test data. High
// recall means that the classifier returned most of the relevant results.
// Specifically, this indicates how many of the correct categories in the text that
// the model can predict. It is a percentage of correct categories in the text that
// can found. Instead of averaging the recall scores of all labels (as with
// Recall), micro Recall is based on the overall score of all recall scores added
// together.
MicroRecall *float64
// A measure of the usefulness of the classifier results in the test data. High
// precision means that the classifier returned substantially more relevant results
// than irrelevant ones.
Precision *float64
// A measure of how complete the classifier results are for the test data. High
// recall means that the classifier returned most of the relevant results.
Recall *float64
noSmithyDocumentSerde
}
// Provides information about a document classifier.
type ClassifierMetadata struct {
// Describes the result metrics for the test data associated with an documentation
// classifier.
EvaluationMetrics *ClassifierEvaluationMetrics
// The number of labels in the input data.
NumberOfLabels *int32
// The number of documents in the input data that were used to test the
// classifier. Typically this is 10 to 20 percent of the input documents, up to
// 10,000 documents.
NumberOfTestDocuments *int32
// The number of documents in the input data that were used to train the
// classifier. Typically this is 80 to 90 percent of the input documents.
NumberOfTrainedDocuments *int32
noSmithyDocumentSerde
}
// Data security configuration.
type DataSecurityConfig struct {
// ID for the KMS key that Amazon Comprehend uses to encrypt the data in the data
// lake.
DataLakeKmsKeyId *string
// ID for the KMS key that Amazon Comprehend uses to encrypt trained custom
// models. The ModelKmsKeyId can be either of the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
ModelKmsKeyId *string
// ID for the KMS key that Amazon Comprehend uses to encrypt the volume.
VolumeKmsKeyId *string
// Configuration parameters for an optional private Virtual Private Cloud (VPC)
// containing the resources you are using for the job. For more information, see
// Amazon VPC (https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html)
// .
VpcConfig *VpcConfig
noSmithyDocumentSerde
}
// An augmented manifest file that provides training data for your custom model.
// An augmented manifest file is a labeled dataset that is produced by Amazon
// SageMaker Ground Truth.
type DatasetAugmentedManifestsListItem struct {
// The JSON attribute that contains the annotations for your training documents.
// The number of attribute names that you specify depends on whether your augmented
// manifest file is the output of a single labeling job or a chained labeling job.
// If your file is the output of a single labeling job, specify the
// LabelAttributeName key that was used when the job was created in Ground Truth.
// If your file is the output of a chained labeling job, specify the
// LabelAttributeName key for one or more jobs in the chain. Each
// LabelAttributeName key provides the annotations from an individual job.
//
// This member is required.
AttributeNames []string
// The Amazon S3 location of the augmented manifest file.
//
// This member is required.
S3Uri *string
// The S3 prefix to the annotation files that are referred in the augmented
// manifest file.
AnnotationDataS3Uri *string
// The type of augmented manifest. If you don't specify, the default is
// PlainTextDocument. PLAIN_TEXT_DOCUMENT A document type that represents any
// unicode text that is encoded in UTF-8.
DocumentType AugmentedManifestsDocumentTypeFormat
// The S3 prefix to the source files (PDFs) that are referred to in the augmented
// manifest file.
SourceDocumentsS3Uri *string
noSmithyDocumentSerde
}
// Describes the dataset input data configuration for a document classifier model.
// For more information on how the input file is formatted, see Preparing training
// data (https://docs.aws.amazon.com/comprehend/latest/dg/prep-classifier-data.html)
// in the Comprehend Developer Guide.
type DatasetDocumentClassifierInputDataConfig struct {
// The Amazon S3 URI for the input data. The S3 bucket must be in the same Region
// as the API endpoint that you are calling. The URI can point to a single input
// file or it can provide the prefix for a collection of input files. For example,
// if you use the URI S3://bucketName/prefix , if the prefix is a single file,
// Amazon Comprehend uses that file as input. If more than one file begins with the
// prefix, Amazon Comprehend uses all of them as input. This parameter is required
// if you set DataFormat to COMPREHEND_CSV .
//
// This member is required.
S3Uri *string
// Indicates the delimiter used to separate each label for training a multi-label
// classifier. The default delimiter between labels is a pipe (|). You can use a
// different character as a delimiter (if it's an allowed character) by specifying
// it under Delimiter for labels. If the training documents use a delimiter other
// than the default or the delimiter you specify, the labels on that line will be
// combined to make a single unique label, such as LABELLABELLABEL.
LabelDelimiter *string
noSmithyDocumentSerde
}
// Describes the annotations associated with a entity recognizer.
type DatasetEntityRecognizerAnnotations struct {
// Specifies the Amazon S3 location where the training documents for an entity
// recognizer are located. The URI must be in the same Region as the API endpoint
// that you are calling.
//
// This member is required.
S3Uri *string
noSmithyDocumentSerde
}
// Describes the documents submitted with a dataset for an entity recognizer model.
type DatasetEntityRecognizerDocuments struct {
// Specifies the Amazon S3 location where the documents for the dataset are
// located.
//
// This member is required.
S3Uri *string
// Specifies how the text in an input file should be processed. This is optional,
// and the default is ONE_DOC_PER_LINE. ONE_DOC_PER_FILE - Each file is considered
// a separate document. Use this option when you are processing large documents,
// such as newspaper articles or scientific papers. ONE_DOC_PER_LINE - Each line in
// a file is considered a separate document. Use this option when you are
// processing many short documents, such as text messages.
InputFormat InputFormat
noSmithyDocumentSerde
}
// Describes the dataset entity list for an entity recognizer model. For more
// information on how the input file is formatted, see Preparing training data (https://docs.aws.amazon.com/comprehend/latest/dg/prep-training-data-cer.html)
// in the Comprehend Developer Guide.
type DatasetEntityRecognizerEntityList struct {
// Specifies the Amazon S3 location where the entity list is located.
//
// This member is required.
S3Uri *string
noSmithyDocumentSerde
}
// Specifies the format and location of the input data. You must provide either
// the Annotations parameter or the EntityList parameter.
type DatasetEntityRecognizerInputDataConfig struct {
// The format and location of the training documents for your custom entity
// recognizer.
//
// This member is required.
Documents *DatasetEntityRecognizerDocuments
// The S3 location of the annotation documents for your custom entity recognizer.
Annotations *DatasetEntityRecognizerAnnotations
// The S3 location of the entity list for your custom entity recognizer.
EntityList *DatasetEntityRecognizerEntityList
noSmithyDocumentSerde
}
// Filter the datasets based on creation time or dataset status.
type DatasetFilter struct {
// Filter the datasets to include datasets created after the specified time.
CreationTimeAfter *time.Time
// Filter the datasets to include datasets created before the specified time.
CreationTimeBefore *time.Time
// Filter the datasets based on the dataset type.
DatasetType DatasetType
// Filter the datasets based on the dataset status.
Status DatasetStatus
noSmithyDocumentSerde
}
// Specifies the format and location of the input data for the dataset.
type DatasetInputDataConfig struct {
// A list of augmented manifest files that provide training data for your custom
// model. An augmented manifest file is a labeled dataset that is produced by
// Amazon SageMaker Ground Truth.
AugmentedManifests []DatasetAugmentedManifestsListItem
// COMPREHEND_CSV : The data format is a two-column CSV file, where the first
// column contains labels and the second column contains documents.
// AUGMENTED_MANIFEST : The data format
DataFormat DatasetDataFormat
// The input properties for training a document classifier model. For more
// information on how the input file is formatted, see Preparing training data (https://docs.aws.amazon.com/comprehend/latest/dg/prep-classifier-data.html)
// in the Comprehend Developer Guide.
DocumentClassifierInputDataConfig *DatasetDocumentClassifierInputDataConfig
// The input properties for training an entity recognizer model.
EntityRecognizerInputDataConfig *DatasetEntityRecognizerInputDataConfig
noSmithyDocumentSerde
}
// Properties associated with the dataset.
type DatasetProperties struct {
// Creation time of the dataset.
CreationTime *time.Time
// The ARN of the dataset.
DatasetArn *string
// The name of the dataset.
DatasetName *string
// The S3 URI where the dataset is stored.
DatasetS3Uri *string
// The dataset type (training data or test data).
DatasetType DatasetType
// Description of the dataset.
Description *string
// Time when the data from the dataset becomes available in the data lake.
EndTime *time.Time
// A description of the status of the dataset.
Message *string
// The number of documents in the dataset.
NumberOfDocuments *int64
// The dataset status. While the system creates the dataset, the status is CREATING
// . When the dataset is ready to use, the status changes to COMPLETED .
Status DatasetStatus
noSmithyDocumentSerde
}
// Specifies the class that categorizes the document being analyzed
type DocumentClass struct {
// The name of the class.
Name *string
// Page number in the input document. This field is present in the response only
// if your request includes the Byte parameter.
Page *int32
// The confidence score that Amazon Comprehend has this class correctly attributed.
Score *float32
noSmithyDocumentSerde
}
// Configuration required for a document classification model.
type DocumentClassificationConfig struct {
// Classification mode indicates whether the documents are MULTI_CLASS or
// MULTI_LABEL .
//
// This member is required.
Mode DocumentClassifierMode
// One or more labels to associate with the custom classifier.
Labels []string
noSmithyDocumentSerde
}
// Provides information for filtering a list of document classification jobs. For
// more information, see the operation. You can provide only one filter parameter
// in each request.
type DocumentClassificationJobFilter struct {
// Filters on the name of the job.
JobName *string
// Filters the list based on job status. Returns only jobs with the specified
// status.
JobStatus JobStatus
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted after the specified time. Jobs are
// returned in descending order, newest to oldest.
SubmitTimeAfter *time.Time
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted before the specified time. Jobs are
// returned in ascending order, oldest to newest.
SubmitTimeBefore *time.Time
noSmithyDocumentSerde
}
// Provides information about a document classification job.
type DocumentClassificationJobProperties struct {
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// read access to your input data.
DataAccessRoleArn *string
// The Amazon Resource Name (ARN) that identifies the document classifier.
DocumentClassifierArn *string
// The time that the document classification job completed.
EndTime *time.Time
// The Amazon Resource Number (ARN) of the flywheel
FlywheelArn *string
// The input data configuration that you supplied when you created the document
// classification job.
InputDataConfig *InputDataConfig
// The Amazon Resource Name (ARN) of the document classification job. It is a
// unique, fully qualified identifier for the job. It includes the Amazon Web
// Services account, Amazon Web Services Region, and the job ID. The format of the
// ARN is as follows: arn::comprehend:::document-classification-job/ The following
// is an example job ARN:
// arn:aws:comprehend:us-west-2:111122223333:document-classification-job/1234abcd12ab34cd56ef1234567890ab
JobArn *string
// The identifier assigned to the document classification job.
JobId *string
// The name that you assigned to the document classification job.
JobName *string
// The current status of the document classification job. If the status is FAILED ,
// the Message field shows the reason for the failure.
JobStatus JobStatus
// A description of the status of the job.
Message *string
// The output data configuration that you supplied when you created the document
// classification job.
OutputDataConfig *OutputDataConfig
// The time that the document classification job was submitted for processing.
SubmitTime *time.Time
// ID for the Amazon Web Services Key Management Service (KMS) key that Amazon
// Comprehend uses to encrypt data on the storage volume attached to the ML compute
// instance(s) that process the analysis job. The VolumeKmsKeyId can be either of
// the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
VolumeKmsKeyId *string
// Configuration parameters for a private Virtual Private Cloud (VPC) containing
// the resources you are using for your document classification job. For more
// information, see Amazon VPC (https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html)
// .
VpcConfig *VpcConfig
noSmithyDocumentSerde
}
// The location of the training documents. This parameter is required in a request
// to create a semi-structured document classification model.
type DocumentClassifierDocuments struct {
// The S3 URI location of the training documents specified in the S3Uri CSV file.
//
// This member is required.
S3Uri *string
// The S3 URI location of the test documents included in the TestS3Uri CSV file.
// This field is not required if you do not specify a test CSV file.
TestS3Uri *string
noSmithyDocumentSerde
}
// Provides information for filtering a list of document classifiers. You can only
// specify one filtering parameter in a request. For more information, see the
// ListDocumentClassifiers operation.
type DocumentClassifierFilter struct {
// The name that you assigned to the document classifier
DocumentClassifierName *string
// Filters the list of classifiers based on status.
Status ModelStatus
// Filters the list of classifiers based on the time that the classifier was
// submitted for processing. Returns only classifiers submitted after the specified
// time. Classifiers are returned in descending order, newest to oldest.
SubmitTimeAfter *time.Time
// Filters the list of classifiers based on the time that the classifier was
// submitted for processing. Returns only classifiers submitted before the
// specified time. Classifiers are returned in ascending order, oldest to newest.
SubmitTimeBefore *time.Time
noSmithyDocumentSerde
}
// The input properties for training a document classifier. For more information
// on how the input file is formatted, see Preparing training data (https://docs.aws.amazon.com/comprehend/latest/dg/prep-classifier-data.html)
// in the Comprehend Developer Guide.
type DocumentClassifierInputDataConfig struct {
// A list of augmented manifest files that provide training data for your custom
// model. An augmented manifest file is a labeled dataset that is produced by
// Amazon SageMaker Ground Truth. This parameter is required if you set DataFormat
// to AUGMENTED_MANIFEST .
AugmentedManifests []AugmentedManifestsListItem
// The format of your training data:
// - COMPREHEND_CSV : A two-column CSV file, where labels are provided in the
// first column, and documents are provided in the second. If you use this value,
// you must provide the S3Uri parameter in your request.
// - AUGMENTED_MANIFEST : A labeled dataset that is produced by Amazon SageMaker
// Ground Truth. This file is in JSON lines format. Each line is a complete JSON
// object that contains a training document and its associated labels. If you use
// this value, you must provide the AugmentedManifests parameter in your request.
// If you don't specify a value, Amazon Comprehend uses COMPREHEND_CSV as the
// default.
DataFormat DocumentClassifierDataFormat
// Provides configuration parameters to override the default actions for
// extracting text from PDF documents and image files. By default, Amazon
// Comprehend performs the following actions to extract text from files, based on
// the input file type:
// - Word files - Amazon Comprehend parser extracts the text.
// - Digital PDF files - Amazon Comprehend parser extracts the text.
// - Image files and scanned PDF files - Amazon Comprehend uses the Amazon
// Textract DetectDocumentText API to extract the text.
// DocumentReaderConfig does not apply to plain text files or Word files. For
// image files and PDF documents, you can override these default actions using the
// fields listed below. For more information, see Setting text extraction options (https://docs.aws.amazon.com/comprehend/latest/dg/idp-set-textract-options.html)
// in the Comprehend Developer Guide.
DocumentReaderConfig *DocumentReaderConfig
// The type of input documents for training the model. Provide plain-text
// documents to create a plain-text model, and provide semi-structured documents to
// create a native document model.
DocumentType DocumentClassifierDocumentTypeFormat
// The S3 location of the training documents. This parameter is required in a
// request to create a native document model.
Documents *DocumentClassifierDocuments
// Indicates the delimiter used to separate each label for training a multi-label
// classifier. The default delimiter between labels is a pipe (|). You can use a
// different character as a delimiter (if it's an allowed character) by specifying
// it under Delimiter for labels. If the training documents use a delimiter other
// than the default or the delimiter you specify, the labels on that line will be
// combined to make a single unique label, such as LABELLABELLABEL.
LabelDelimiter *string
// The Amazon S3 URI for the input data. The S3 bucket must be in the same Region
// as the API endpoint that you are calling. The URI can point to a single input
// file or it can provide the prefix for a collection of input files. For example,
// if you use the URI S3://bucketName/prefix , if the prefix is a single file,
// Amazon Comprehend uses that file as input. If more than one file begins with the
// prefix, Amazon Comprehend uses all of them as input. This parameter is required
// if you set DataFormat to COMPREHEND_CSV .
S3Uri *string
// This specifies the Amazon S3 location that contains the test annotations for
// the document classifier. The URI must be in the same Amazon Web Services Region
// as the API endpoint that you are calling.
TestS3Uri *string
noSmithyDocumentSerde
}
// Provide the location for output data from a custom classifier job. This field
// is mandatory if you are training a native document model.
type DocumentClassifierOutputDataConfig struct {
// The Amazon S3 prefix for the data lake location of the flywheel statistics.
FlywheelStatsS3Prefix *string
// ID for the Amazon Web Services Key Management Service (KMS) key that Amazon
// Comprehend uses to encrypt the output results from an analysis job. The KmsKeyId
// can be one of the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
// - KMS Key Alias: "alias/ExampleAlias"
// - ARN of a KMS Key Alias:
// "arn:aws:kms:us-west-2:111122223333:alias/ExampleAlias"
KmsKeyId *string
// When you use the OutputDataConfig object while creating a custom classifier,
// you specify the Amazon S3 location where you want to write the confusion matrix
// and other output files. The URI must be in the same Region as the API endpoint
// that you are calling. The location is used as the prefix for the actual location
// of this output file. When the custom classifier job is finished, the service
// creates the output file in a directory specific to the job. The S3Uri field
// contains the location of the output file, called output.tar.gz . It is a
// compressed archive that contains the confusion matrix.
S3Uri *string
noSmithyDocumentSerde
}
// Provides information about a document classifier.
type DocumentClassifierProperties struct {
// Information about the document classifier, including the number of documents
// used for training the classifier, the number of documents used for test the
// classifier, and an accuracy rating.
ClassifierMetadata *ClassifierMetadata
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// read access to your input data.
DataAccessRoleArn *string
// The Amazon Resource Name (ARN) that identifies the document classifier.
DocumentClassifierArn *string
// The time that training the document classifier completed.
EndTime *time.Time
// The Amazon Resource Number (ARN) of the flywheel
FlywheelArn *string
// The input data configuration that you supplied when you created the document
// classifier for training.
InputDataConfig *DocumentClassifierInputDataConfig
// The language code for the language of the documents that the classifier was
// trained on.
LanguageCode LanguageCode
// Additional information about the status of the classifier.
Message *string
// Indicates the mode in which the specific classifier was trained. This also
// indicates the format of input documents and the format of the confusion matrix.
// Each classifier can only be trained in one mode and this cannot be changed once
// the classifier is trained.
Mode DocumentClassifierMode
// ID for the KMS key that Amazon Comprehend uses to encrypt trained custom
// models. The ModelKmsKeyId can be either of the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
ModelKmsKeyId *string
// Provides output results configuration parameters for custom classifier jobs.
OutputDataConfig *DocumentClassifierOutputDataConfig
// The Amazon Resource Name (ARN) of the source model. This model was imported
// from a different Amazon Web Services account to create the document classifier
// model in your Amazon Web Services account.
SourceModelArn *string
// The status of the document classifier. If the status is TRAINED the classifier
// is ready to use. If the status is TRAINED_WITH_WARNINGS the classifier training
// succeeded, but you should review the warnings returned in the
// CreateDocumentClassifier response. If the status is FAILED you can see
// additional information about why the classifier wasn't trained in the Message
// field.
Status ModelStatus
// The time that the document classifier was submitted for training.
SubmitTime *time.Time
// The time that training of the document classifier was completed. Indicates the
// time when the training completes on documentation classifiers. You are billed
// for the time interval between this time and the value of TrainingStartTime.
TrainingEndTime *time.Time
// Indicates the time when the training starts on documentation classifiers. You
// are billed for the time interval between this time and the value of
// TrainingEndTime.
TrainingStartTime *time.Time
// The version name that you assigned to the document classifier.
VersionName *string
// ID for the Amazon Web Services Key Management Service (KMS) key that Amazon
// Comprehend uses to encrypt data on the storage volume attached to the ML compute
// instance(s) that process the analysis job. The VolumeKmsKeyId can be either of
// the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
VolumeKmsKeyId *string
// Configuration parameters for a private Virtual Private Cloud (VPC) containing
// the resources you are using for your custom classifier. For more information,
// see Amazon VPC (https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html)
// .
VpcConfig *VpcConfig
noSmithyDocumentSerde
}
// Describes information about a document classifier and its versions.
type DocumentClassifierSummary struct {
// The name that you assigned the document classifier.
DocumentClassifierName *string
// The time that the latest document classifier version was submitted for
// processing.
LatestVersionCreatedAt *time.Time
// The version name you assigned to the latest document classifier version.
LatestVersionName *string
// Provides the status of the latest document classifier version.
LatestVersionStatus ModelStatus
// The number of versions you created.
NumberOfVersions *int32
noSmithyDocumentSerde
}
// Specifies one of the label or labels that categorize the document being
// analyzed.
type DocumentLabel struct {
// The name of the label.
Name *string
// Page number where the label occurs. This field is present in the response only
// if your request includes the Byte parameter.
Page *int32
// The confidence score that Amazon Comprehend has this label correctly attributed.
Score *float32
noSmithyDocumentSerde
}
// Information about the document, discovered during text extraction.
type DocumentMetadata struct {
// List of pages in the document, with the number of characters extracted from
// each page.
ExtractedCharacters []ExtractedCharactersListItem
// Number of pages in the document.
Pages *int32
noSmithyDocumentSerde
}
// Provides configuration parameters to override the default actions for
// extracting text from PDF documents and image files. By default, Amazon
// Comprehend performs the following actions to extract text from files, based on
// the input file type:
// - Word files - Amazon Comprehend parser extracts the text.
// - Digital PDF files - Amazon Comprehend parser extracts the text.
// - Image files and scanned PDF files - Amazon Comprehend uses the Amazon
// Textract DetectDocumentText API to extract the text.
//
// DocumentReaderConfig does not apply to plain text files or Word files. For
// image files and PDF documents, you can override these default actions using the
// fields listed below. For more information, see Setting text extraction options (https://docs.aws.amazon.com/comprehend/latest/dg/idp-set-textract-options.html)
// in the Comprehend Developer Guide.
type DocumentReaderConfig struct {
// This field defines the Amazon Textract API operation that Amazon Comprehend
// uses to extract text from PDF files and image files. Enter one of the following
// values:
// - TEXTRACT_DETECT_DOCUMENT_TEXT - The Amazon Comprehend service uses the
// DetectDocumentText API operation.
// - TEXTRACT_ANALYZE_DOCUMENT - The Amazon Comprehend service uses the
// AnalyzeDocument API operation.
//
// This member is required.
DocumentReadAction DocumentReadAction
// Determines the text extraction actions for PDF files. Enter one of the
// following values:
// - SERVICE_DEFAULT - use the Amazon Comprehend service defaults for PDF files.
// - FORCE_DOCUMENT_READ_ACTION - Amazon Comprehend uses the Textract API
// specified by DocumentReadAction for all PDF files, including digital PDF files.
DocumentReadMode DocumentReadMode
// Specifies the type of Amazon Textract features to apply. If you chose
// TEXTRACT_ANALYZE_DOCUMENT as the read action, you must specify one or both of
// the following values:
// - TABLES - Returns additional information about any tables that are detected
// in the input document.
// - FORMS - Returns additional information about any forms that are detected in
// the input document.
FeatureTypes []DocumentReadFeatureTypes
noSmithyDocumentSerde
}
// Document type for each page in the document.
type DocumentTypeListItem struct {
// Page number.
Page *int32
// Document type.
Type DocumentType
noSmithyDocumentSerde
}
// Returns the code for the dominant language in the input text and the level of
// confidence that Amazon Comprehend has in the accuracy of the detection.
type DominantLanguage struct {
// The RFC 5646 language code for the dominant language. For more information
// about RFC 5646, see Tags for Identifying Languages (https://tools.ietf.org/html/rfc5646)
// on the IETF Tools web site.
LanguageCode *string
// The level of confidence that Amazon Comprehend has in the accuracy of the
// detection.
Score *float32
noSmithyDocumentSerde
}
// Provides information for filtering a list of dominant language detection jobs.
// For more information, see the operation.
type DominantLanguageDetectionJobFilter struct {
// Filters on the name of the job.
JobName *string
// Filters the list of jobs based on job status. Returns only jobs with the
// specified status.
JobStatus JobStatus
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted after the specified time. Jobs are
// returned in descending order, newest to oldest.
SubmitTimeAfter *time.Time
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted before the specified time. Jobs are
// returned in ascending order, oldest to newest.
SubmitTimeBefore *time.Time
noSmithyDocumentSerde
}
// Provides information about a dominant language detection job.
type DominantLanguageDetectionJobProperties struct {
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// read access to your input data.
DataAccessRoleArn *string
// The time that the dominant language detection job completed.
EndTime *time.Time
// The input data configuration that you supplied when you created the dominant
// language detection job.
InputDataConfig *InputDataConfig
// The Amazon Resource Name (ARN) of the dominant language detection job. It is a
// unique, fully qualified identifier for the job. It includes the Amazon Web
// Services account, Amazon Web Services Region, and the job ID. The format of the
// ARN is as follows: arn::comprehend:::dominant-language-detection-job/ The
// following is an example job ARN:
// arn:aws:comprehend:us-west-2:111122223333:dominant-language-detection-job/1234abcd12ab34cd56ef1234567890ab
JobArn *string
// The identifier assigned to the dominant language detection job.
JobId *string
// The name that you assigned to the dominant language detection job.
JobName *string
// The current status of the dominant language detection job. If the status is
// FAILED , the Message field shows the reason for the failure.
JobStatus JobStatus
// A description for the status of a job.
Message *string
// The output data configuration that you supplied when you created the dominant
// language detection job.
OutputDataConfig *OutputDataConfig
// The time that the dominant language detection job was submitted for processing.
SubmitTime *time.Time
// ID for the Amazon Web Services Key Management Service (KMS) key that Amazon
// Comprehend uses to encrypt data on the storage volume attached to the ML compute
// instance(s) that process the analysis job. The VolumeKmsKeyId can be either of
// the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
VolumeKmsKeyId *string
// Configuration parameters for a private Virtual Private Cloud (VPC) containing
// the resources you are using for your dominant language detection job. For more
// information, see Amazon VPC (https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html)
// .
VpcConfig *VpcConfig
noSmithyDocumentSerde
}
// The filter used to determine which endpoints are returned. You can filter jobs
// on their name, model, status, or the date and time that they were created. You
// can only set one filter at a time.
type EndpointFilter struct {
// Specifies a date after which the returned endpoint or endpoints were created.
CreationTimeAfter *time.Time
// Specifies a date before which the returned endpoint or endpoints were created.
CreationTimeBefore *time.Time
// The Amazon Resource Number (ARN) of the model to which the endpoint is attached.
ModelArn *string
// Specifies the status of the endpoint being returned. Possible values are:
// Creating, Ready, Updating, Deleting, Failed.
Status EndpointStatus
noSmithyDocumentSerde
}
// Specifies information about the specified endpoint. For information about
// endpoints, see Managing endpoints (https://docs.aws.amazon.com/comprehend/latest/dg/manage-endpoints.html)
// .
type EndpointProperties struct {
// The creation date and time of the endpoint.
CreationTime *time.Time
// The number of inference units currently used by the model using this endpoint.
CurrentInferenceUnits *int32
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// read access to trained custom models encrypted with a customer managed key
// (ModelKmsKeyId).
DataAccessRoleArn *string
// Data access role ARN to use in case the new model is encrypted with a customer
// KMS key.
DesiredDataAccessRoleArn *string
// The desired number of inference units to be used by the model using this
// endpoint. Each inference unit represents of a throughput of 100 characters per
// second.
DesiredInferenceUnits *int32
// ARN of the new model to use for updating an existing endpoint. This ARN is
// going to be different from the model ARN when the update is in progress
DesiredModelArn *string
// The Amazon Resource Number (ARN) of the endpoint.
EndpointArn *string
// The Amazon Resource Number (ARN) of the flywheel
FlywheelArn *string
// The date and time that the endpoint was last modified.
LastModifiedTime *time.Time
// Specifies a reason for failure in cases of Failed status.
Message *string
// The Amazon Resource Number (ARN) of the model to which the endpoint is attached.
ModelArn *string
// Specifies the status of the endpoint. Because the endpoint updates and creation
// are asynchronous, so customers will need to wait for the endpoint to be Ready
// status before making inference requests.
Status EndpointStatus
noSmithyDocumentSerde
}
// Provides information for filtering a list of dominant language detection jobs.
// For more information, see the operation.
type EntitiesDetectionJobFilter struct {
// Filters on the name of the job.
JobName *string
// Filters the list of jobs based on job status. Returns only jobs with the
// specified status.
JobStatus JobStatus
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted after the specified time. Jobs are
// returned in descending order, newest to oldest.
SubmitTimeAfter *time.Time
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted before the specified time. Jobs are
// returned in ascending order, oldest to newest.
SubmitTimeBefore *time.Time
noSmithyDocumentSerde
}
// Provides information about an entities detection job.
type EntitiesDetectionJobProperties struct {
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// read access to your input data.
DataAccessRoleArn *string
// The time that the entities detection job completed
EndTime *time.Time
// The Amazon Resource Name (ARN) that identifies the entity recognizer.
EntityRecognizerArn *string
// The Amazon Resource Name (ARN) of the flywheel associated with this job.
FlywheelArn *string
// The input data configuration that you supplied when you created the entities
// detection job.
InputDataConfig *InputDataConfig
// The Amazon Resource Name (ARN) of the entities detection job. It is a unique,
// fully qualified identifier for the job. It includes the Amazon Web Services
// account, Amazon Web Services Region, and the job ID. The format of the ARN is as
// follows: arn::comprehend:::entities-detection-job/ The following is an example
// job ARN:
// arn:aws:comprehend:us-west-2:111122223333:entities-detection-job/1234abcd12ab34cd56ef1234567890ab
JobArn *string
// The identifier assigned to the entities detection job.
JobId *string
// The name that you assigned the entities detection job.
JobName *string
// The current status of the entities detection job. If the status is FAILED , the
// Message field shows the reason for the failure.
JobStatus JobStatus
// The language code of the input documents.
LanguageCode LanguageCode
// A description of the status of a job.
Message *string
// The output data configuration that you supplied when you created the entities
// detection job.
OutputDataConfig *OutputDataConfig
// The time that the entities detection job was submitted for processing.
SubmitTime *time.Time
// ID for the Amazon Web Services Key Management Service (KMS) key that Amazon
// Comprehend uses to encrypt data on the storage volume attached to the ML compute
// instance(s) that process the analysis job. The VolumeKmsKeyId can be either of
// the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
VolumeKmsKeyId *string
// Configuration parameters for a private Virtual Private Cloud (VPC) containing
// the resources you are using for your entity detection job. For more information,
// see Amazon VPC (https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html)
// .
VpcConfig *VpcConfig
noSmithyDocumentSerde
}
// Provides information about an entity.
type Entity struct {
// The zero-based offset from the beginning of the source text to the first
// character in the entity. This field is empty for non-text input.
BeginOffset *int32
// A reference to each block for this entity. This field is empty for plain-text
// input.
BlockReferences []BlockReference
// The zero-based offset from the beginning of the source text to the last
// character in the entity. This field is empty for non-text input.
EndOffset *int32
// The level of confidence that Amazon Comprehend has in the accuracy of the
// detection.
Score *float32
// The text of the entity.
Text *string
// The entity type. For entity detection using the built-in model, this field
// contains one of the standard entity types listed below. For custom entity
// detection, this field contains one of the entity types that you specified when
// you trained your custom model.
Type EntityType
noSmithyDocumentSerde
}
// Specifies one of the label or labels that categorize the personally
// identifiable information (PII) entity being analyzed.
type EntityLabel struct {
// The name of the label.
Name PiiEntityType
// The level of confidence that Amazon Comprehend has in the accuracy of the
// detection.
Score *float32
noSmithyDocumentSerde
}
// Configuration required for an entity recognition model.
type EntityRecognitionConfig struct {
// Up to 25 entity types that the model is trained to recognize.
//
// This member is required.
EntityTypes []EntityTypesListItem
noSmithyDocumentSerde
}
// Describes the annotations associated with a entity recognizer.
type EntityRecognizerAnnotations struct {
// Specifies the Amazon S3 location where the annotations for an entity recognizer
// are located. The URI must be in the same Region as the API endpoint that you are
// calling.
//
// This member is required.
S3Uri *string
// Specifies the Amazon S3 location where the test annotations for an entity
// recognizer are located. The URI must be in the same Region as the API endpoint
// that you are calling.
TestS3Uri *string
noSmithyDocumentSerde
}
// Describes the training documents submitted with an entity recognizer.
type EntityRecognizerDocuments struct {
// Specifies the Amazon S3 location where the training documents for an entity
// recognizer are located. The URI must be in the same Region as the API endpoint
// that you are calling.
//
// This member is required.
S3Uri *string
// Specifies how the text in an input file should be processed. This is optional,
// and the default is ONE_DOC_PER_LINE. ONE_DOC_PER_FILE - Each file is considered
// a separate document. Use this option when you are processing large documents,
// such as newspaper articles or scientific papers. ONE_DOC_PER_LINE - Each line in
// a file is considered a separate document. Use this option when you are
// processing many short documents, such as text messages.
InputFormat InputFormat
// Specifies the Amazon S3 location where the test documents for an entity
// recognizer are located. The URI must be in the same Amazon Web Services Region
// as the API endpoint that you are calling.
TestS3Uri *string
noSmithyDocumentSerde
}
// Describes the entity list submitted with an entity recognizer.
type EntityRecognizerEntityList struct {
// Specifies the Amazon S3 location where the entity list is located. The URI must
// be in the same Region as the API endpoint that you are calling.
//
// This member is required.
S3Uri *string
noSmithyDocumentSerde
}
// Detailed information about the accuracy of an entity recognizer.
type EntityRecognizerEvaluationMetrics struct {
// A measure of how accurate the recognizer results are for the test data. It is
// derived from the Precision and Recall values. The F1Score is the harmonic
// average of the two scores. For plain text entity recognizer models, the range is
// 0 to 100, where 100 is the best score. For PDF/Word entity recognizer models,
// the range is 0 to 1, where 1 is the best score.
F1Score *float64
// A measure of the usefulness of the recognizer results in the test data. High
// precision means that the recognizer returned substantially more relevant results
// than irrelevant ones.
Precision *float64
// A measure of how complete the recognizer results are for the test data. High
// recall means that the recognizer returned most of the relevant results.
Recall *float64
noSmithyDocumentSerde
}
// Provides information for filtering a list of entity recognizers. You can only
// specify one filtering parameter in a request. For more information, see the
// ListEntityRecognizers operation./>
type EntityRecognizerFilter struct {
// The name that you assigned the entity recognizer.
RecognizerName *string
// The status of an entity recognizer.
Status ModelStatus
// Filters the list of entities based on the time that the list was submitted for
// processing. Returns only jobs submitted after the specified time. Jobs are
// returned in ascending order, oldest to newest.
SubmitTimeAfter *time.Time
// Filters the list of entities based on the time that the list was submitted for
// processing. Returns only jobs submitted before the specified time. Jobs are
// returned in descending order, newest to oldest.
SubmitTimeBefore *time.Time
noSmithyDocumentSerde
}
// Specifies the format and location of the input data.
type EntityRecognizerInputDataConfig struct {
// The entity types in the labeled training data that Amazon Comprehend uses to
// train the custom entity recognizer. Any entity types that you don't specify are
// ignored. A maximum of 25 entity types can be used at one time to train an entity
// recognizer. Entity types must not contain the following invalid characters: \n
// (line break), \\n (escaped line break), \r (carriage return), \\r (escaped
// carriage return), \t (tab), \\t (escaped tab), space, and , (comma).
//
// This member is required.
EntityTypes []EntityTypesListItem
// The S3 location of the CSV file that annotates your training documents.
Annotations *EntityRecognizerAnnotations
// A list of augmented manifest files that provide training data for your custom
// model. An augmented manifest file is a labeled dataset that is produced by
// Amazon SageMaker Ground Truth. This parameter is required if you set DataFormat
// to AUGMENTED_MANIFEST .
AugmentedManifests []AugmentedManifestsListItem
// The format of your training data:
// - COMPREHEND_CSV : A CSV file that supplements your training documents. The
// CSV file contains information about the custom entities that your trained model
// will detect. The required format of the file depends on whether you are
// providing annotations or an entity list. If you use this value, you must provide
// your CSV file by using either the Annotations or EntityList parameters. You
// must provide your training documents by using the Documents parameter.
// - AUGMENTED_MANIFEST : A labeled dataset that is produced by Amazon SageMaker
// Ground Truth. This file is in JSON lines format. Each line is a complete JSON
// object that contains a training document and its labels. Each label annotates a
// named entity in the training document. If you use this value, you must provide
// the AugmentedManifests parameter in your request.
// If you don't specify a value, Amazon Comprehend uses COMPREHEND_CSV as the
// default.
DataFormat EntityRecognizerDataFormat
// The S3 location of the folder that contains the training documents for your
// custom entity recognizer. This parameter is required if you set DataFormat to
// COMPREHEND_CSV .
Documents *EntityRecognizerDocuments
// The S3 location of the CSV file that has the entity list for your custom entity
// recognizer.
EntityList *EntityRecognizerEntityList
noSmithyDocumentSerde
}
// Detailed information about an entity recognizer.
type EntityRecognizerMetadata struct {
// Entity types from the metadata of an entity recognizer.
EntityTypes []EntityRecognizerMetadataEntityTypesListItem
// Detailed information about the accuracy of an entity recognizer.
EvaluationMetrics *EntityRecognizerEvaluationMetrics
// The number of documents in the input data that were used to test the entity
// recognizer. Typically this is 10 to 20 percent of the input documents.
NumberOfTestDocuments *int32
// The number of documents in the input data that were used to train the entity
// recognizer. Typically this is 80 to 90 percent of the input documents.
NumberOfTrainedDocuments *int32
noSmithyDocumentSerde
}
// Individual item from the list of entity types in the metadata of an entity
// recognizer.
type EntityRecognizerMetadataEntityTypesListItem struct {
// Detailed information about the accuracy of the entity recognizer for a specific
// item on the list of entity types.
EvaluationMetrics *EntityTypesEvaluationMetrics
// Indicates the number of times the given entity type was seen in the training
// data.
NumberOfTrainMentions *int32
// Type of entity from the list of entity types in the metadata of an entity
// recognizer.
Type *string
noSmithyDocumentSerde
}
// Output data configuration.
type EntityRecognizerOutputDataConfig struct {
// The Amazon S3 prefix for the data lake location of the flywheel statistics.
FlywheelStatsS3Prefix *string
noSmithyDocumentSerde
}
// Describes information about an entity recognizer.
type EntityRecognizerProperties struct {
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// read access to your input data.
DataAccessRoleArn *string
// The time that the recognizer creation completed.
EndTime *time.Time
// The Amazon Resource Name (ARN) that identifies the entity recognizer.
EntityRecognizerArn *string
// The Amazon Resource Number (ARN) of the flywheel
FlywheelArn *string
// The input data properties of an entity recognizer.
InputDataConfig *EntityRecognizerInputDataConfig
// The language of the input documents. All documents must be in the same
// language. Only English ("en") is currently supported.
LanguageCode LanguageCode
// A description of the status of the recognizer.
Message *string
// ID for the KMS key that Amazon Comprehend uses to encrypt trained custom
// models. The ModelKmsKeyId can be either of the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
ModelKmsKeyId *string
// Output data configuration.
OutputDataConfig *EntityRecognizerOutputDataConfig
// Provides information about an entity recognizer.
RecognizerMetadata *EntityRecognizerMetadata
// The Amazon Resource Name (ARN) of the source model. This model was imported
// from a different Amazon Web Services account to create the entity recognizer
// model in your Amazon Web Services account.
SourceModelArn *string
// Provides the status of the entity recognizer.
Status ModelStatus
// The time that the recognizer was submitted for processing.
SubmitTime *time.Time
// The time that training of the entity recognizer was completed.
TrainingEndTime *time.Time
// The time that training of the entity recognizer started.
TrainingStartTime *time.Time
// The version name you assigned to the entity recognizer.
VersionName *string
// ID for the Amazon Web Services Key Management Service (KMS) key that Amazon
// Comprehend uses to encrypt data on the storage volume attached to the ML compute
// instance(s) that process the analysis job. The VolumeKmsKeyId can be either of
// the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
VolumeKmsKeyId *string
// Configuration parameters for a private Virtual Private Cloud (VPC) containing
// the resources you are using for your custom entity recognizer. For more
// information, see Amazon VPC (https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html)
// .
VpcConfig *VpcConfig
noSmithyDocumentSerde
}
// Describes the information about an entity recognizer and its versions.
type EntityRecognizerSummary struct {
// The time that the latest entity recognizer version was submitted for processing.
LatestVersionCreatedAt *time.Time
// The version name you assigned to the latest entity recognizer version.
LatestVersionName *string
// Provides the status of the latest entity recognizer version.
LatestVersionStatus ModelStatus
// The number of versions you created.
NumberOfVersions *int32
// The name that you assigned the entity recognizer.
RecognizerName *string
noSmithyDocumentSerde
}
// Detailed information about the accuracy of an entity recognizer for a specific
// entity type.
type EntityTypesEvaluationMetrics struct {
// A measure of how accurate the recognizer results are for a specific entity type
// in the test data. It is derived from the Precision and Recall values. The
// F1Score is the harmonic average of the two scores. The highest score is 1, and
// the worst score is 0.
F1Score *float64
// A measure of the usefulness of the recognizer results for a specific entity
// type in the test data. High precision means that the recognizer returned
// substantially more relevant results than irrelevant ones.
Precision *float64
// A measure of how complete the recognizer results are for a specific entity type
// in the test data. High recall means that the recognizer returned most of the
// relevant results.
Recall *float64
noSmithyDocumentSerde
}
// An entity type within a labeled training dataset that Amazon Comprehend uses to
// train a custom entity recognizer.
type EntityTypesListItem struct {
// An entity type within a labeled training dataset that Amazon Comprehend uses to
// train a custom entity recognizer. Entity types must not contain the following
// invalid characters: \n (line break), \\n (escaped line break, \r (carriage
// return), \\r (escaped carriage return), \t (tab), \\t (escaped tab), and ,
// (comma).
//
// This member is required.
Type *string
noSmithyDocumentSerde
}
// Text extraction encountered one or more page-level errors in the input
// document. The ErrorCode contains one of the following values:
// - TEXTRACT_BAD_PAGE - Amazon Textract cannot read the page. For more
// information about page limits in Amazon Textract, see Page Quotas in Amazon
// Textract (https://docs.aws.amazon.com/textract/latest/dg/limits-document.html)
// .
// - TEXTRACT_PROVISIONED_THROUGHPUT_EXCEEDED - The number of requests exceeded
// your throughput limit. For more information about throughput quotas in Amazon
// Textract, see Default quotas in Amazon Textract (https://docs.aws.amazon.com/textract/latest/dg/limits-quotas-explained.html)
// .
// - PAGE_CHARACTERS_EXCEEDED - Too many text characters on the page (10,000
// characters maximum).
// - PAGE_SIZE_EXCEEDED - The maximum page size is 10 MB.
// - INTERNAL_SERVER_ERROR - The request encountered a service issue. Try the
// API request again.
type ErrorsListItem struct {
// Error code for the cause of the error.
ErrorCode PageBasedErrorCode
// Text message explaining the reason for the error.
ErrorMessage *string
// Page number where the error occurred.
Page *int32
noSmithyDocumentSerde
}
// Provides information for filtering a list of event detection jobs.
type EventsDetectionJobFilter struct {
// Filters on the name of the events detection job.
JobName *string
// Filters the list of jobs based on job status. Returns only jobs with the
// specified status.
JobStatus JobStatus
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted after the specified time. Jobs are
// returned in descending order, newest to oldest.
SubmitTimeAfter *time.Time
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted before the specified time. Jobs are
// returned in ascending order, oldest to newest.
SubmitTimeBefore *time.Time
noSmithyDocumentSerde
}
// Provides information about an events detection job.
type EventsDetectionJobProperties struct {
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// read access to your input data.
DataAccessRoleArn *string
// The time that the events detection job completed.
EndTime *time.Time
// The input data configuration that you supplied when you created the events
// detection job.
InputDataConfig *InputDataConfig
// The Amazon Resource Name (ARN) of the events detection job. It is a unique,
// fully qualified identifier for the job. It includes the Amazon Web Services
// account, Amazon Web Services Region, and the job ID. The format of the ARN is as
// follows: arn::comprehend:::events-detection-job/ The following is an example
// job ARN:
// arn:aws:comprehend:us-west-2:111122223333:events-detection-job/1234abcd12ab34cd56ef1234567890ab
JobArn *string
// The identifier assigned to the events detection job.
JobId *string
// The name you assigned the events detection job.
JobName *string
// The current status of the events detection job.
JobStatus JobStatus
// The language code of the input documents.
LanguageCode LanguageCode
// A description of the status of the events detection job.
Message *string
// The output data configuration that you supplied when you created the events
// detection job.
OutputDataConfig *OutputDataConfig
// The time that the events detection job was submitted for processing.
SubmitTime *time.Time
// The types of events that are detected by the job.
TargetEventTypes []string
noSmithyDocumentSerde
}
// Array of the number of characters extracted from each page.
type ExtractedCharactersListItem struct {
// Number of characters extracted from each page.
Count *int32
// Page number.
Page *int32
noSmithyDocumentSerde
}
// Filter the flywheels based on creation time or flywheel status.
type FlywheelFilter struct {
// Filter the flywheels to include flywheels created after the specified time.
CreationTimeAfter *time.Time
// Filter the flywheels to include flywheels created before the specified time.
CreationTimeBefore *time.Time
// Filter the flywheels based on the flywheel status.
Status FlywheelStatus
noSmithyDocumentSerde
}
// Filter the flywheel iterations based on creation time.
type FlywheelIterationFilter struct {
// Filter the flywheel iterations to include iterations created after the
// specified time.
CreationTimeAfter *time.Time
// Filter the flywheel iterations to include iterations created before the
// specified time.
CreationTimeBefore *time.Time
noSmithyDocumentSerde
}
// The configuration properties of a flywheel iteration.
type FlywheelIterationProperties struct {
// The creation start time of the flywheel iteration.
CreationTime *time.Time
// The completion time of this flywheel iteration.
EndTime *time.Time
// The ARN of the evaluated model associated with this flywheel iteration.
EvaluatedModelArn *string
// The evaluation metrics associated with the evaluated model.
EvaluatedModelMetrics *FlywheelModelEvaluationMetrics
//
EvaluationManifestS3Prefix *string
//
FlywheelArn *string
//
FlywheelIterationId *string
// A description of the status of the flywheel iteration.
Message *string
// The status of the flywheel iteration.
Status FlywheelIterationStatus
// The ARN of the trained model associated with this flywheel iteration.
TrainedModelArn *string
// The metrics associated with the trained model.
TrainedModelMetrics *FlywheelModelEvaluationMetrics
noSmithyDocumentSerde
}
// The evaluation metrics associated with the evaluated model.
type FlywheelModelEvaluationMetrics struct {
// Average accuracy metric for the model.
AverageAccuracy *float64
// The average F1 score from the evaluation metrics.
AverageF1Score *float64
// Average precision metric for the model.
AveragePrecision *float64
// Average recall metric for the model.
AverageRecall *float64
noSmithyDocumentSerde
}
// The flywheel properties.
type FlywheelProperties struct {
// The Amazon Resource Number (ARN) of the active model version.
ActiveModelArn *string
// Creation time of the flywheel.
CreationTime *time.Time
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// permission to access the flywheel data.
DataAccessRoleArn *string
// Amazon S3 URI of the data lake location.
DataLakeS3Uri *string
// Data security configuration.
DataSecurityConfig *DataSecurityConfig
// The Amazon Resource Number (ARN) of the flywheel.
FlywheelArn *string
// Last modified time for the flywheel.
LastModifiedTime *time.Time
// The most recent flywheel iteration.
LatestFlywheelIteration *string
// A description of the status of the flywheel.
Message *string
// Model type of the flywheel's model.
ModelType ModelType
// The status of the flywheel.
Status FlywheelStatus
// Configuration about the model associated with a flywheel.
TaskConfig *TaskConfig
noSmithyDocumentSerde
}
// Flywheel summary information.
type FlywheelSummary struct {
// ARN of the active model version for the flywheel.
ActiveModelArn *string
// Creation time of the flywheel.
CreationTime *time.Time
// Amazon S3 URI of the data lake location.
DataLakeS3Uri *string
// The Amazon Resource Number (ARN) of the flywheel
FlywheelArn *string
// Last modified time for the flywheel.
LastModifiedTime *time.Time
// The most recent flywheel iteration.
LatestFlywheelIteration *string
// A description of the status of the flywheel.
Message *string
// Model type of the flywheel's model.
ModelType ModelType
// The status of the flywheel.
Status FlywheelStatus
noSmithyDocumentSerde
}
// Information about the location of items on a document page. For additional
// information, see Geometry (https://docs.aws.amazon.com/textract/latest/dg/API_Geometry.html)
// in the Amazon Textract API reference.
type Geometry struct {
// An axis-aligned coarse representation of the location of the recognized item on
// the document page.
BoundingBox *BoundingBox
// Within the bounding box, a fine-grained polygon around the recognized item.
Polygon []Point
noSmithyDocumentSerde
}
// The input properties for an inference job. The document reader config field
// applies only to non-text inputs for custom analysis.
type InputDataConfig struct {
// The Amazon S3 URI for the input data. The URI must be in same Region as the API
// endpoint that you are calling. The URI can point to a single input file or it
// can provide the prefix for a collection of data files. For example, if you use
// the URI S3://bucketName/prefix , if the prefix is a single file, Amazon
// Comprehend uses that file as input. If more than one file begins with the
// prefix, Amazon Comprehend uses all of them as input.
//
// This member is required.
S3Uri *string
// Provides configuration parameters to override the default actions for
// extracting text from PDF documents and image files.
DocumentReaderConfig *DocumentReaderConfig
// Specifies how the text in an input file should be processed:
// - ONE_DOC_PER_FILE - Each file is considered a separate document. Use this
// option when you are processing large documents, such as newspaper articles or
// scientific papers.
// - ONE_DOC_PER_LINE - Each line in a file is considered a separate document.
// Use this option when you are processing many short documents, such as text
// messages.
InputFormat InputFormat
noSmithyDocumentSerde
}
// Provides additional detail about why the request failed.
type InvalidRequestDetail struct {
// Reason codes include the following values:
// - DOCUMENT_SIZE_EXCEEDED - Document size is too large. Check the size of your
// file and resubmit the request.
// - UNSUPPORTED_DOC_TYPE - Document type is not supported. Check the file type
// and resubmit the request.
// - PAGE_LIMIT_EXCEEDED - Too many pages in the document. Check the number of
// pages in your file and resubmit the request.
// - TEXTRACT_ACCESS_DENIED - Access denied to Amazon Textract. Verify that your
// account has permission to use Amazon Textract API operations and resubmit the
// request.
// - NOT_TEXTRACT_JSON - Document is not Amazon Textract JSON format. Verify the
// format and resubmit the request.
// - MISMATCHED_TOTAL_PAGE_COUNT - Check the number of pages in your file and
// resubmit the request.
// - INVALID_DOCUMENT - Invalid document. Check the file and resubmit the
// request.
Reason InvalidRequestDetailReason
noSmithyDocumentSerde
}
// Describes a key noun phrase.
type KeyPhrase struct {
// The zero-based offset from the beginning of the source text to the first
// character in the key phrase.
BeginOffset *int32
// The zero-based offset from the beginning of the source text to the last
// character in the key phrase.
EndOffset *int32
// The level of confidence that Amazon Comprehend has in the accuracy of the
// detection.
Score *float32
// The text of a key noun phrase.
Text *string
noSmithyDocumentSerde
}
// Provides information for filtering a list of dominant language detection jobs.
// For more information, see the operation.
type KeyPhrasesDetectionJobFilter struct {
// Filters on the name of the job.
JobName *string
// Filters the list of jobs based on job status. Returns only jobs with the
// specified status.
JobStatus JobStatus
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted after the specified time. Jobs are
// returned in descending order, newest to oldest.
SubmitTimeAfter *time.Time
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted before the specified time. Jobs are
// returned in ascending order, oldest to newest.
SubmitTimeBefore *time.Time
noSmithyDocumentSerde
}
// Provides information about a key phrases detection job.
type KeyPhrasesDetectionJobProperties struct {
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// read access to your input data.
DataAccessRoleArn *string
// The time that the key phrases detection job completed.
EndTime *time.Time
// The input data configuration that you supplied when you created the key phrases
// detection job.
InputDataConfig *InputDataConfig
// The Amazon Resource Name (ARN) of the key phrases detection job. It is a
// unique, fully qualified identifier for the job. It includes the Amazon Web
// Services account, Amazon Web Services Region, and the job ID. The format of the
// ARN is as follows: arn::comprehend:::key-phrases-detection-job/ The following
// is an example job ARN:
// arn:aws:comprehend:us-west-2:111122223333:key-phrases-detection-job/1234abcd12ab34cd56ef1234567890ab
JobArn *string
// The identifier assigned to the key phrases detection job.
JobId *string
// The name that you assigned the key phrases detection job.
JobName *string
// The current status of the key phrases detection job. If the status is FAILED ,
// the Message field shows the reason for the failure.
JobStatus JobStatus
// The language code of the input documents.
LanguageCode LanguageCode
// A description of the status of a job.
Message *string
// The output data configuration that you supplied when you created the key
// phrases detection job.
OutputDataConfig *OutputDataConfig
// The time that the key phrases detection job was submitted for processing.
SubmitTime *time.Time
// ID for the KMS key that Amazon Comprehend uses to encrypt data on the storage
// volume attached to the ML compute instance(s) that process the analysis job. The
// VolumeKmsKeyId can be either of the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
VolumeKmsKeyId *string
// Configuration parameters for a private Virtual Private Cloud (VPC) containing
// the resources you are using for your key phrases detection job. For more
// information, see Amazon VPC (https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html)
// .
VpcConfig *VpcConfig
noSmithyDocumentSerde
}
// Contains the sentiment and sentiment score for one mention of an entity. For
// more information about targeted sentiment, see Targeted sentiment (https://docs.aws.amazon.com/comprehend/latest/dg/how-targeted-sentiment.html)
// in the Amazon Comprehend Developer Guide.
type MentionSentiment struct {
// The sentiment of the mention.
Sentiment SentimentType
// Describes the level of confidence that Amazon Comprehend has in the accuracy of
// its detection of sentiments.
SentimentScore *SentimentScore
noSmithyDocumentSerde
}
// Provides configuration parameters for the output of inference jobs.
type OutputDataConfig struct {
// When you use the OutputDataConfig object with asynchronous operations, you
// specify the Amazon S3 location where you want to write the output data. The URI
// must be in the same Region as the API endpoint that you are calling. The
// location is used as the prefix for the actual location of the output file. When
// the topic detection job is finished, the service creates an output file in a
// directory specific to the job. The S3Uri field contains the location of the
// output file, called output.tar.gz . It is a compressed archive that contains the
// ouput of the operation. For a PII entity detection job, the output file is plain
// text, not a compressed archive. The output file name is the same as the input
// file, with .out appended at the end.
//
// This member is required.
S3Uri *string
// ID for the Amazon Web Services Key Management Service (KMS) key that Amazon
// Comprehend uses to encrypt the output results from an analysis job. Specify the
// Key Id of a symmetric key, because you cannot use an asymmetric key for
// uploading data to S3. The KmsKeyId can be one of the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
// - KMS Key Alias: "alias/ExampleAlias"
// - ARN of a KMS Key Alias:
// "arn:aws:kms:us-west-2:111122223333:alias/ExampleAlias"
KmsKeyId *string
noSmithyDocumentSerde
}
// Identifies the part of speech represented by the token and gives the confidence
// that Amazon Comprehend has that the part of speech was correctly identified. For
// more information about the parts of speech that Amazon Comprehend can identify,
// see Syntax (https://docs.aws.amazon.com/comprehend/latest/dg/how-syntax.html)
// in the Comprehend Developer Guide.
type PartOfSpeechTag struct {
// The confidence that Amazon Comprehend has that the part of speech was correctly
// identified.
Score *float32
// Identifies the part of speech that the token represents.
Tag PartOfSpeechTagType
noSmithyDocumentSerde
}
// Provides information for filtering a list of PII entity detection jobs.
type PiiEntitiesDetectionJobFilter struct {
// Filters on the name of the job.
JobName *string
// Filters the list of jobs based on job status. Returns only jobs with the
// specified status.
JobStatus JobStatus
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted after the specified time. Jobs are
// returned in descending order, newest to oldest.
SubmitTimeAfter *time.Time
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted before the specified time. Jobs are
// returned in ascending order, oldest to newest.
SubmitTimeBefore *time.Time
noSmithyDocumentSerde
}
// Provides information about a PII entities detection job.
type PiiEntitiesDetectionJobProperties struct {
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// read access to your input data.
DataAccessRoleArn *string
// The time that the PII entities detection job completed.
EndTime *time.Time
// The input properties for a PII entities detection job.
InputDataConfig *InputDataConfig
// The Amazon Resource Name (ARN) of the PII entities detection job. It is a
// unique, fully qualified identifier for the job. It includes the Amazon Web
// Services account, Amazon Web Services Region, and the job ID. The format of the
// ARN is as follows: arn::comprehend:::pii-entities-detection-job/ The following
// is an example job ARN:
// arn:aws:comprehend:us-west-2:111122223333:pii-entities-detection-job/1234abcd12ab34cd56ef1234567890ab
JobArn *string
// The identifier assigned to the PII entities detection job.
JobId *string
// The name that you assigned the PII entities detection job.
JobName *string
// The current status of the PII entities detection job. If the status is FAILED ,
// the Message field shows the reason for the failure.
JobStatus JobStatus
// The language code of the input documents
LanguageCode LanguageCode
// A description of the status of a job.
Message *string
// Specifies whether the output provides the locations (offsets) of PII entities
// or a file in which PII entities are redacted.
Mode PiiEntitiesDetectionMode
// The output data configuration that you supplied when you created the PII
// entities detection job.
OutputDataConfig *PiiOutputDataConfig
// Provides configuration parameters for PII entity redaction. This parameter is
// required if you set the Mode parameter to ONLY_REDACTION . In that case, you
// must provide a RedactionConfig definition that includes the PiiEntityTypes
// parameter.
RedactionConfig *RedactionConfig
// The time that the PII entities detection job was submitted for processing.
SubmitTime *time.Time
noSmithyDocumentSerde
}
// Provides information about a PII entity.
type PiiEntity struct {
// The zero-based offset from the beginning of the source text to the first
// character in the entity.
BeginOffset *int32
// The zero-based offset from the beginning of the source text to the last
// character in the entity.
EndOffset *int32
// The level of confidence that Amazon Comprehend has in the accuracy of the
// detection.
Score *float32
// The entity's type.
Type PiiEntityType
noSmithyDocumentSerde
}
// Provides configuration parameters for the output of PII entity detection jobs.
type PiiOutputDataConfig struct {
// When you use the PiiOutputDataConfig object with asynchronous operations, you
// specify the Amazon S3 location where you want to write the output data. For a
// PII entity detection job, the output file is plain text, not a compressed
// archive. The output file name is the same as the input file, with .out appended
// at the end.
//
// This member is required.
S3Uri *string
// ID for the Amazon Web Services Key Management Service (KMS) key that Amazon
// Comprehend uses to encrypt the output results from an analysis job.
KmsKeyId *string
noSmithyDocumentSerde
}
// The X and Y coordinates of a point on a document page. For additional
// information, see Point (https://docs.aws.amazon.com/textract/latest/dg/API_Point.html)
// in the Amazon Textract API reference.
type Point struct {
// The value of the X coordinate for a point on a polygon
X *float32
// The value of the Y coordinate for a point on a polygon
Y *float32
noSmithyDocumentSerde
}
// Provides configuration parameters for PII entity redaction.
type RedactionConfig struct {
// A character that replaces each character in the redacted PII entity.
MaskCharacter *string
// Specifies whether the PII entity is redacted with the mask character or the
// entity type.
MaskMode PiiEntitiesDetectionMaskMode
// An array of the types of PII entities that Amazon Comprehend detects in the
// input text for your request.
PiiEntityTypes []PiiEntityType
noSmithyDocumentSerde
}
// List of child blocks for the current block.
type RelationshipsListItem struct {
// Identifers of the child blocks.
Ids []string
// Only supported relationship is a child relationship.
Type RelationshipType
noSmithyDocumentSerde
}
// Provides information for filtering a list of dominant language detection jobs.
// For more information, see the operation.
type SentimentDetectionJobFilter struct {
// Filters on the name of the job.
JobName *string
// Filters the list of jobs based on job status. Returns only jobs with the
// specified status.
JobStatus JobStatus
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted after the specified time. Jobs are
// returned in descending order, newest to oldest.
SubmitTimeAfter *time.Time
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted before the specified time. Jobs are
// returned in ascending order, oldest to newest.
SubmitTimeBefore *time.Time
noSmithyDocumentSerde
}
// Provides information about a sentiment detection job.
type SentimentDetectionJobProperties struct {
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// read access to your input data.
DataAccessRoleArn *string
// The time that the sentiment detection job ended.
EndTime *time.Time
// The input data configuration that you supplied when you created the sentiment
// detection job.
InputDataConfig *InputDataConfig
// The Amazon Resource Name (ARN) of the sentiment detection job. It is a unique,
// fully qualified identifier for the job. It includes the Amazon Web Services
// account, Amazon Web Services Region, and the job ID. The format of the ARN is as
// follows: arn::comprehend:::sentiment-detection-job/ The following is an example
// job ARN:
// arn:aws:comprehend:us-west-2:111122223333:sentiment-detection-job/1234abcd12ab34cd56ef1234567890ab
JobArn *string
// The identifier assigned to the sentiment detection job.
JobId *string
// The name that you assigned to the sentiment detection job
JobName *string
// The current status of the sentiment detection job. If the status is FAILED , the
// Messages field shows the reason for the failure.
JobStatus JobStatus
// The language code of the input documents.
LanguageCode LanguageCode
// A description of the status of a job.
Message *string
// The output data configuration that you supplied when you created the sentiment
// detection job.
OutputDataConfig *OutputDataConfig
// The time that the sentiment detection job was submitted for processing.
SubmitTime *time.Time
// ID for the Amazon Web Services Key Management Service (KMS) key that Amazon
// Comprehend uses to encrypt data on the storage volume attached to the ML compute
// instance(s) that process the analysis job. The VolumeKmsKeyId can be either of
// the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
VolumeKmsKeyId *string
// Configuration parameters for a private Virtual Private Cloud (VPC) containing
// the resources you are using for your sentiment detection job. For more
// information, see Amazon VPC (https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html)
// .
VpcConfig *VpcConfig
noSmithyDocumentSerde
}
// Describes the level of confidence that Amazon Comprehend has in the accuracy of
// its detection of sentiments.
type SentimentScore struct {
// The level of confidence that Amazon Comprehend has in the accuracy of its
// detection of the MIXED sentiment.
Mixed *float32
// The level of confidence that Amazon Comprehend has in the accuracy of its
// detection of the NEGATIVE sentiment.
Negative *float32
// The level of confidence that Amazon Comprehend has in the accuracy of its
// detection of the NEUTRAL sentiment.
Neutral *float32
// The level of confidence that Amazon Comprehend has in the accuracy of its
// detection of the POSITIVE sentiment.
Positive *float32
noSmithyDocumentSerde
}
// Represents a work in the input text that was recognized and assigned a part of
// speech. There is one syntax token record for each word in the source text.
type SyntaxToken struct {
// The zero-based offset from the beginning of the source text to the first
// character in the word.
BeginOffset *int32
// The zero-based offset from the beginning of the source text to the last
// character in the word.
EndOffset *int32
// Provides the part of speech label and the confidence level that Amazon
// Comprehend has that the part of speech was correctly identified. For more
// information, see Syntax (https://docs.aws.amazon.com/comprehend/latest/dg/how-syntax.html)
// in the Comprehend Developer Guide.
PartOfSpeech *PartOfSpeechTag
// The word that was recognized in the source text.
Text *string
// A unique identifier for a token.
TokenId *int32
noSmithyDocumentSerde
}
// A key-value pair that adds as a metadata to a resource used by Amazon
// Comprehend. For example, a tag with the key-value pair ‘Department’:’Sales’
// might be added to a resource to indicate its use by a particular department.
type Tag struct {
// The initial part of a key-value pair that forms a tag associated with a given
// resource. For instance, if you want to show which resources are used by which
// departments, you might use “Department” as the key portion of the pair, with
// multiple possible values such as “sales,” “legal,” and “administration.”
//
// This member is required.
Key *string
// The second part of a key-value pair that forms a tag associated with a given
// resource. For instance, if you want to show which resources are used by which
// departments, you might use “Department” as the initial (key) portion of the
// pair, with a value of “sales” to indicate the sales department.
Value *string
noSmithyDocumentSerde
}
// Provides information for filtering a list of dominant language detection jobs.
// For more information, see the ListTargetedSentimentDetectionJobs operation.
type TargetedSentimentDetectionJobFilter struct {
// Filters on the name of the job.
JobName *string
// Filters the list of jobs based on job status. Returns only jobs with the
// specified status.
JobStatus JobStatus
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted after the specified time. Jobs are
// returned in descending order, newest to oldest.
SubmitTimeAfter *time.Time
// Filters the list of jobs based on the time that the job was submitted for
// processing. Returns only jobs submitted before the specified time. Jobs are
// returned in ascending order, oldest to newest.
SubmitTimeBefore *time.Time
noSmithyDocumentSerde
}
// Provides information about a targeted sentiment detection job.
type TargetedSentimentDetectionJobProperties struct {
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// read access to your input data.
DataAccessRoleArn *string
// The time that the targeted sentiment detection job ended.
EndTime *time.Time
// The input properties for an inference job. The document reader config field
// applies only to non-text inputs for custom analysis.
InputDataConfig *InputDataConfig
// The Amazon Resource Name (ARN) of the targeted sentiment detection job. It is a
// unique, fully qualified identifier for the job. It includes the Amazon Web
// Services account, Amazon Web Services Region, and the job ID. The format of the
// ARN is as follows: arn::comprehend:::targeted-sentiment-detection-job/ The
// following is an example job ARN:
// arn:aws:comprehend:us-west-2:111122223333:targeted-sentiment-detection-job/1234abcd12ab34cd56ef1234567890ab
JobArn *string
// The identifier assigned to the targeted sentiment detection job.
JobId *string
// The name that you assigned to the targeted sentiment detection job.
JobName *string
// The current status of the targeted sentiment detection job. If the status is
// FAILED , the Messages field shows the reason for the failure.
JobStatus JobStatus
// The language code of the input documents.
LanguageCode LanguageCode
// A description of the status of a job.
Message *string
// Provides configuration parameters for the output of inference jobs.
OutputDataConfig *OutputDataConfig
// The time that the targeted sentiment detection job was submitted for processing.
SubmitTime *time.Time
// ID for the KMS key that Amazon Comprehend uses to encrypt the data on the
// storage volume attached to the ML compute instance(s) that process the targeted
// sentiment detection job. The VolumeKmsKeyId can be either of the following
// formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
VolumeKmsKeyId *string
// Configuration parameters for an optional private Virtual Private Cloud (VPC)
// containing the resources you are using for the job. For more information, see
// Amazon VPC (https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html)
// .
VpcConfig *VpcConfig
noSmithyDocumentSerde
}
// Information about one of the entities found by targeted sentiment analysis. For
// more information about targeted sentiment, see Targeted sentiment (https://docs.aws.amazon.com/comprehend/latest/dg/how-targeted-sentiment.html)
// in the Amazon Comprehend Developer Guide.
type TargetedSentimentEntity struct {
// One or more index into the Mentions array that provides the best name for the
// entity group.
DescriptiveMentionIndex []int32
// An array of mentions of the entity in the document. The array represents a
// co-reference group. See Co-reference group (https://docs.aws.amazon.com/comprehend/latest/dg/how-targeted-sentiment.html#how-targeted-sentiment-values)
// for an example.
Mentions []TargetedSentimentMention
noSmithyDocumentSerde
}
// Information about one mention of an entity. The mention information includes
// the location of the mention in the text and the sentiment of the mention. For
// more information about targeted sentiment, see Targeted sentiment (https://docs.aws.amazon.com/comprehend/latest/dg/how-targeted-sentiment.html)
// in the Amazon Comprehend Developer Guide.
type TargetedSentimentMention struct {
// The offset into the document text where the mention begins.
BeginOffset *int32
// The offset into the document text where the mention ends.
EndOffset *int32
// The confidence that all the entities mentioned in the group relate to the same
// entity.
GroupScore *float32
// Contains the sentiment and sentiment score for the mention.
MentionSentiment *MentionSentiment
// Model confidence that the entity is relevant. Value range is zero to one, where
// one is highest confidence.
Score *float32
// The text in the document that identifies the entity.
Text *string
// The type of the entity. Amazon Comprehend supports a variety of entity types (https://docs.aws.amazon.com/comprehend/latest/dg/how-targeted-sentiment.html#how-targeted-sentiment-entities)
// .
Type TargetedSentimentEntityType
noSmithyDocumentSerde
}
// Configuration about the model associated with a flywheel.
type TaskConfig struct {
// Language code for the language that the model supports.
//
// This member is required.
LanguageCode LanguageCode
// Configuration required for a document classification model.
DocumentClassificationConfig *DocumentClassificationConfig
// Configuration required for an entity recognition model.
EntityRecognitionConfig *EntityRecognitionConfig
noSmithyDocumentSerde
}
// One of the of text strings. Each string has a size limit of 1KB.
type TextSegment struct {
// The text content.
//
// This member is required.
Text *string
noSmithyDocumentSerde
}
// Provides information for filtering topic detection jobs. For more information,
// see .
type TopicsDetectionJobFilter struct {
//
JobName *string
// Filters the list of topic detection jobs based on job status. Returns only jobs
// with the specified status.
JobStatus JobStatus
// Filters the list of jobs based on the time that the job was submitted for
// processing. Only returns jobs submitted after the specified time. Jobs are
// returned in ascending order, oldest to newest.
SubmitTimeAfter *time.Time
// Filters the list of jobs based on the time that the job was submitted for
// processing. Only returns jobs submitted before the specified time. Jobs are
// returned in descending order, newest to oldest.
SubmitTimeBefore *time.Time
noSmithyDocumentSerde
}
// Provides information about a topic detection job.
type TopicsDetectionJobProperties struct {
// The Amazon Resource Name (ARN) of the IAM role that grants Amazon Comprehend
// read access to your job data.
DataAccessRoleArn *string
// The time that the topic detection job was completed.
EndTime *time.Time
// The input data configuration supplied when you created the topic detection job.
InputDataConfig *InputDataConfig
// The Amazon Resource Name (ARN) of the topics detection job. It is a unique,
// fully qualified identifier for the job. It includes the Amazon Web Services
// account, Amazon Web Services Region, and the job ID. The format of the ARN is as
// follows: arn::comprehend:::topics-detection-job/ The following is an example
// job ARN:
// arn:aws:comprehend:us-west-2:111122223333:topics-detection-job/1234abcd12ab34cd56ef1234567890ab
JobArn *string
// The identifier assigned to the topic detection job.
JobId *string
// The name of the topic detection job.
JobName *string
// The current status of the topic detection job. If the status is Failed , the
// reason for the failure is shown in the Message field.
JobStatus JobStatus
// A description for the status of a job.
Message *string
// The number of topics to detect supplied when you created the topic detection
// job. The default is 10.
NumberOfTopics *int32
// The output data configuration supplied when you created the topic detection job.
OutputDataConfig *OutputDataConfig
// The time that the topic detection job was submitted for processing.
SubmitTime *time.Time
// ID for the Amazon Web Services Key Management Service (KMS) key that Amazon
// Comprehend uses to encrypt data on the storage volume attached to the ML compute
// instance(s) that process the analysis job. The VolumeKmsKeyId can be either of
// the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
VolumeKmsKeyId *string
// Configuration parameters for a private Virtual Private Cloud (VPC) containing
// the resources you are using for your topic detection job. For more information,
// see Amazon VPC (https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html)
// .
VpcConfig *VpcConfig
noSmithyDocumentSerde
}
// Toxic content analysis result for one string. For more information about
// toxicity detection, see Toxicity detection (https://docs.aws.amazon.com/comprehend/latest/dg/toxicity-detection.html)
// in the Amazon Comprehend Developer Guide
type ToxicContent struct {
// The name of the toxic content type.
Name ToxicContentType
// Model confidence in the detected content type. Value range is zero to one,
// where one is highest confidence.
Score *float32
noSmithyDocumentSerde
}
// Toxicity analysis result for one string. For more information about toxicity
// detection, see Toxicity detection (https://docs.aws.amazon.com/comprehend/latest/dg/toxicity-detection.html)
// in the Amazon Comprehend Developer Guide.
type ToxicLabels struct {
// Array of toxic content types identified in the string.
Labels []ToxicContent
// Overall toxicity score for the string. Value range is zero to one, where one is
// the highest confidence.
Toxicity *float32
noSmithyDocumentSerde
}
// Data security configuration.
type UpdateDataSecurityConfig struct {
// ID for the KMS key that Amazon Comprehend uses to encrypt trained custom
// models. The ModelKmsKeyId can be either of the following formats:
// - KMS Key ID: "1234abcd-12ab-34cd-56ef-1234567890ab"
// - Amazon Resource Name (ARN) of a KMS Key:
// "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
ModelKmsKeyId *string
// ID for the KMS key that Amazon Comprehend uses to encrypt the volume.
VolumeKmsKeyId *string
// Configuration parameters for an optional private Virtual Private Cloud (VPC)
// containing the resources you are using for the job. For more information, see
// Amazon VPC (https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html)
// .
VpcConfig *VpcConfig
noSmithyDocumentSerde
}
// Configuration parameters for an optional private Virtual Private Cloud (VPC)
// containing the resources you are using for the job. For more information, see
// Amazon VPC (https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html)
// .
type VpcConfig struct {
// The ID number for a security group on an instance of your private VPC. Security
// groups on your VPC function serve as a virtual firewall to control inbound and
// outbound traffic and provides security for the resources that you’ll be
// accessing on the VPC. This ID number is preceded by "sg-", for instance:
// "sg-03b388029b0a285ea". For more information, see Security Groups for your VPC (https://docs.aws.amazon.com/vpc/latest/userguide/VPC_SecurityGroups.html)
// .
//
// This member is required.
SecurityGroupIds []string
// The ID for each subnet being used in your private VPC. This subnet is a subset
// of the a range of IPv4 addresses used by the VPC and is specific to a given
// availability zone in the VPC’s Region. This ID number is preceded by "subnet-",
// for instance: "subnet-04ccf456919e69055". For more information, see VPCs and
// Subnets (https://docs.aws.amazon.com/vpc/latest/userguide/VPC_Subnets.html) .
//
// This member is required.
Subnets []string
noSmithyDocumentSerde
}
// The system identified one of the following warnings while processing the input
// document:
// - The document to classify is plain text, but the classifier is a native
// document model.
// - The document to classify is semi-structured, but the classifier is a
// plain-text model.
type WarningsListItem struct {
// Page number in the input document.
Page *int32
// The type of warning.
WarnCode PageBasedWarningCode
// Text message associated with the warning.
WarnMessage *string
noSmithyDocumentSerde
}
type noSmithyDocumentSerde = smithydocument.NoSerde
|