1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 1032 1033 1034 1035 1036 1037 1038 1039 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057 1058 1059 1060 1061 1062 1063 1064 1065 1066 1067 1068 1069 1070 1071 1072 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085 1086 1087 1088 1089 1090 1091 1092 1093 1094 1095 1096 1097 1098 1099 1100 1101 1102 1103 1104 1105 1106 1107 1108 1109 1110 1111 1112 1113 1114 1115 1116 1117 1118 1119 1120 1121 1122 1123 1124 1125 1126 1127 1128 1129 1130 1131 1132 1133 1134 1135 1136 1137 1138 1139 1140 1141 1142 1143 1144 1145 1146 1147 1148 1149 1150 1151 1152 1153 1154 1155 1156 1157 1158 1159 1160 1161 1162 1163 1164 1165 1166 1167 1168 1169 1170 1171 1172 1173 1174 1175 1176 1177 1178 1179 1180 1181 1182 1183 1184 1185 1186 1187 1188 1189 1190 1191 1192 1193 1194 1195 1196 1197 1198 1199 1200 1201 1202 1203 1204 1205 1206 1207 1208 1209 1210 1211 1212 1213 1214 1215 1216 1217 1218 1219 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239 1240 1241 1242 1243 1244 1245 1246 1247 1248 1249 1250 1251 1252 1253 1254 1255 1256 1257 1258 1259 1260 1261 1262 1263 1264 1265 1266 1267 1268 1269 1270 1271 1272 1273 1274 1275 1276 1277 1278 1279 1280 1281 1282 1283 1284 1285 1286 1287 1288 1289 1290 1291 1292 1293 1294 1295 1296 1297 1298 1299 1300 1301 1302 1303 1304 1305 1306 1307 1308 1309 1310 1311 1312 1313 1314 1315 1316 1317 1318 1319 1320 1321 1322 1323 1324 1325 1326 1327 1328 1329 1330 1331 1332 1333 1334 1335 1336 1337 1338 1339 1340 1341 1342 1343 1344 1345 1346 1347 1348 1349 1350 1351 1352 1353 1354 1355 1356 1357 1358 1359 1360 1361 1362 1363 1364 1365 1366 1367 1368 1369 1370 1371 1372 1373 1374 1375 1376 1377 1378 1379 1380 1381 1382 1383 1384 1385 1386 1387 1388 1389 1390 1391 1392 1393 1394 1395 1396 1397 1398 1399 1400 1401 1402 1403 1404 1405 1406 1407 1408 1409 1410 1411 1412 1413 1414 1415 1416 1417 1418 1419 1420 1421 1422 1423 1424 1425 1426 1427 1428 1429 1430 1431 1432 1433 1434 1435 1436 1437 1438 1439 1440 1441 1442 1443 1444 1445 1446 1447 1448 1449 1450 1451 1452 1453 1454 1455 1456 1457 1458 1459 1460 1461 1462 1463 1464 1465 1466 1467 1468 1469 1470 1471 1472 1473 1474 1475 1476 1477 1478 1479 1480 1481 1482 1483 1484 1485 1486 1487 1488 1489 1490 1491 1492 1493 1494 1495 1496 1497 1498 1499 1500 1501 1502 1503 1504 1505 1506 1507 1508 1509 1510 1511 1512 1513 1514 1515 1516 1517 1518 1519 1520 1521 1522 1523 1524 1525 1526 1527 1528 1529 1530 1531 1532 1533 1534 1535 1536 1537 1538 1539 1540 1541 1542 1543 1544 1545 1546 1547 1548 1549 1550 1551 1552 1553 1554 1555 1556 1557 1558 1559 1560 1561 1562 1563 1564 1565 1566 1567 1568 1569 1570 1571 1572 1573 1574 1575 1576 1577 1578 1579 1580 1581 1582 1583 1584 1585 1586 1587 1588 1589 1590 1591 1592 1593 1594 1595 1596 1597 1598 1599 1600 1601 1602 1603 1604 1605 1606 1607 1608 1609 1610 1611 1612 1613 1614 1615 1616 1617 1618 1619 1620 1621 1622 1623 1624 1625 1626 1627 1628 1629 1630 1631 1632 1633 1634 1635 1636 1637 1638 1639 1640 1641 1642 1643 1644 1645 1646 1647 1648 1649 1650 1651 1652 1653 1654 1655 1656 1657 1658 1659 1660 1661 1662 1663 1664 1665 1666 1667 1668 1669 1670 1671 1672 1673 1674 1675 1676 1677 1678 1679 1680 1681 1682 1683 1684 1685 1686 1687 1688 1689 1690 1691 1692 1693 1694 1695 1696 1697 1698 1699 1700 1701 1702 1703 1704 1705 1706 1707 1708 1709 1710 1711 1712 1713 1714 1715 1716 1717 1718 1719 1720 1721 1722 1723 1724 1725 1726 1727 1728 1729 1730 1731 1732 1733 1734 1735 1736 1737 1738 1739 1740 1741 1742 1743 1744 1745 1746 1747 1748 1749 1750 1751 1752 1753 1754 1755 1756 1757 1758 1759 1760 1761 1762 1763 1764 1765 1766 1767 1768 1769 1770 1771 1772 1773 1774 1775 1776 1777 1778 1779 1780 1781 1782 1783 1784 1785 1786 1787 1788 1789 1790 1791 1792 1793 1794 1795 1796 1797 1798 1799 1800 1801 1802 1803 1804 1805 1806 1807 1808 1809 1810 1811 1812 1813 1814 1815 1816 1817 1818 1819 1820 1821 1822 1823 1824 1825 1826 1827 1828 1829 1830 1831 1832 1833 1834 1835 1836 1837 1838 1839 1840 1841 1842 1843 1844 1845 1846 1847 1848 1849 1850 1851 1852 1853 1854 1855 1856 1857 1858 1859 1860 1861 1862 1863 1864 1865 1866 1867 1868 1869 1870 1871 1872 1873 1874 1875 1876 1877 1878 1879 1880 1881 1882 1883 1884 1885 1886 1887 1888 1889 1890 1891 1892 1893 1894 1895 1896 1897 1898 1899 1900 1901 1902 1903 1904 1905 1906 1907 1908 1909 1910 1911 1912 1913 1914 1915 1916 1917 1918 1919 1920 1921 1922 1923 1924 1925 1926 1927 1928 1929 1930 1931 1932 1933 1934 1935 1936 1937 1938 1939 1940 1941 1942 1943 1944 1945 1946 1947 1948 1949 1950 1951 1952 1953 1954 1955 1956 1957 1958 1959 1960 1961 1962 1963 1964 1965 1966 1967 1968 1969 1970 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 2025 2026 2027 2028 2029 2030 2031 2032 2033 2034 2035 2036 2037 2038 2039 2040 2041 2042 2043 2044 2045 2046 2047 2048 2049 2050 2051 2052 2053 2054 2055 2056 2057 2058 2059 2060 2061 2062 2063 2064 2065 2066 2067 2068 2069 2070 2071 2072 2073 2074 2075 2076 2077 2078 2079 2080 2081 2082 2083 2084 2085 2086 2087 2088 2089 2090 2091 2092 2093 2094 2095 2096 2097 2098 2099 2100 2101 2102 2103 2104 2105 2106 2107 2108 2109 2110 2111 2112 2113 2114 2115 2116 2117 2118 2119 2120 2121 2122 2123 2124 2125 2126 2127 2128 2129 2130 2131 2132 2133 2134 2135 2136 2137 2138 2139 2140 2141 2142 2143 2144 2145 2146 2147 2148 2149 2150 2151 2152 2153 2154 2155 2156 2157 2158 2159 2160 2161 2162 2163 2164 2165 2166 2167 2168 2169 2170 2171 2172 2173 2174 2175 2176 2177 2178 2179 2180 2181 2182 2183 2184 2185 2186 2187 2188 2189 2190 2191 2192 2193 2194 2195 2196 2197 2198 2199 2200 2201 2202 2203 2204 2205 2206 2207 2208 2209 2210 2211 2212 2213 2214 2215 2216 2217 2218 2219 2220 2221 2222 2223 2224 2225 2226 2227 2228 2229 2230 2231 2232 2233 2234 2235 2236 2237 2238 2239 2240 2241 2242 2243 2244 2245 2246 2247 2248 2249 2250 2251 2252 2253 2254 2255 2256 2257 2258 2259 2260 2261 2262 2263 2264 2265 2266 2267 2268 2269 2270 2271 2272 2273 2274 2275 2276 2277 2278 2279 2280 2281 2282 2283 2284 2285 2286 2287 2288 2289 2290 2291 2292 2293 2294 2295 2296 2297 2298 2299 2300 2301 2302 2303 2304 2305 2306 2307 2308 2309 2310 2311 2312 2313 2314 2315 2316 2317 2318 2319 2320 2321 2322 2323 2324 2325 2326 2327 2328 2329 2330 2331 2332 2333 2334 2335 2336 2337 2338 2339 2340 2341 2342 2343 2344 2345 2346 2347 2348 2349 2350 2351 2352 2353 2354 2355 2356 2357 2358 2359 2360 2361 2362 2363 2364 2365 2366 2367 2368 2369 2370 2371 2372 2373 2374 2375 2376 2377 2378 2379 2380 2381 2382 2383 2384 2385 2386 2387 2388 2389 2390 2391 2392 2393 2394 2395 2396 2397 2398 2399 2400 2401 2402 2403 2404 2405 2406 2407 2408 2409 2410 2411 2412 2413 2414 2415 2416 2417 2418 2419 2420 2421 2422 2423 2424 2425 2426 2427 2428 2429 2430 2431 2432 2433 2434 2435 2436 2437 2438 2439 2440 2441 2442 2443 2444 2445 2446 2447 2448 2449 2450 2451 2452 2453 2454 2455 2456 2457 2458 2459 2460 2461 2462 2463 2464 2465 2466 2467 2468 2469 2470 2471 2472 2473 2474 2475 2476 2477 2478 2479 2480 2481 2482 2483 2484 2485 2486 2487 2488 2489 2490 2491 2492 2493 2494 2495 2496 2497 2498 2499 2500 2501 2502 2503 2504 2505 2506 2507 2508 2509 2510 2511 2512 2513 2514 2515 2516 2517 2518 2519 2520 2521 2522 2523 2524 2525 2526 2527 2528 2529 2530 2531 2532 2533 2534 2535 2536 2537 2538 2539 2540 2541 2542 2543 2544 2545 2546 2547 2548 2549 2550 2551 2552 2553 2554 2555 2556 2557 2558 2559 2560 2561 2562 2563 2564 2565 2566 2567 2568 2569 2570 2571 2572 2573 2574 2575 2576 2577 2578 2579 2580 2581 2582 2583 2584 2585 2586 2587 2588 2589 2590 2591 2592 2593 2594 2595 2596 2597 2598 2599 2600 2601 2602 2603 2604 2605 2606 2607 2608 2609 2610 2611 2612 2613 2614 2615 2616 2617 2618 2619 2620 2621 2622 2623 2624 2625 2626 2627 2628 2629 2630 2631 2632 2633 2634 2635 2636 2637 2638 2639 2640 2641 2642 2643 2644 2645 2646 2647 2648 2649 2650 2651 2652 2653 2654 2655 2656 2657 2658 2659 2660 2661 2662 2663 2664 2665 2666 2667 2668 2669 2670 2671 2672 2673 2674 2675 2676 2677 2678 2679 2680 2681 2682 2683 2684 2685 2686 2687 2688 2689 2690 2691 2692 2693 2694 2695 2696 2697 2698 2699 2700 2701 2702 2703 2704 2705 2706 2707 2708 2709 2710 2711 2712 2713 2714 2715 2716 2717 2718 2719 2720 2721 2722 2723 2724 2725 2726 2727 2728 2729 2730 2731 2732 2733 2734 2735 2736 2737 2738 2739 2740 2741 2742 2743 2744 2745 2746 2747 2748 2749 2750 2751 2752 2753 2754 2755 2756 2757 2758 2759 2760 2761 2762 2763 2764 2765 2766 2767 2768 2769 2770 2771 2772 2773 2774 2775 2776 2777 2778 2779 2780 2781 2782 2783 2784 2785 2786 2787 2788 2789 2790 2791 2792 2793 2794 2795 2796 2797 2798 2799 2800 2801 2802 2803 2804 2805 2806 2807 2808 2809 2810 2811 2812 2813 2814 2815 2816 2817 2818 2819 2820 2821 2822 2823 2824 2825 2826 2827 2828 2829 2830 2831 2832 2833 2834 2835 2836 2837 2838 2839 2840 2841 2842 2843 2844 2845 2846 2847 2848 2849 2850 2851 2852 2853 2854 2855 2856 2857 2858 2859 2860 2861 2862 2863 2864 2865 2866 2867 2868 2869 2870 2871 2872 2873 2874 2875 2876 2877 2878 2879 2880 2881 2882 2883 2884 2885 2886 2887 2888 2889 2890 2891 2892 2893 2894 2895 2896 2897 2898 2899 2900 2901 2902 2903 2904 2905 2906 2907 2908 2909 2910 2911 2912 2913 2914 2915 2916 2917 2918 2919 2920 2921 2922 2923 2924 2925 2926 2927 2928 2929 2930 2931 2932 2933 2934 2935 2936 2937 2938 2939 2940 2941 2942 2943 2944 2945 2946 2947 2948 2949 2950 2951 2952 2953 2954 2955 2956 2957 2958 2959 2960 2961 2962 2963 2964 2965 2966 2967 2968 2969 2970 2971 2972 2973 2974 2975 2976 2977 2978 2979 2980 2981 2982 2983 2984 2985 2986 2987 2988 2989 2990 2991 2992 2993 2994 2995 2996 2997 2998 2999 3000 3001 3002 3003 3004 3005 3006 3007 3008 3009 3010 3011 3012 3013 3014 3015 3016 3017 3018 3019 3020 3021 3022 3023 3024 3025 3026 3027 3028 3029 3030 3031 3032 3033 3034 3035 3036 3037 3038 3039 3040 3041 3042 3043 3044 3045 3046 3047 3048 3049 3050 3051 3052 3053 3054 3055 3056 3057 3058 3059 3060 3061 3062 3063 3064 3065 3066 3067 3068 3069 3070 3071 3072 3073 3074 3075 3076 3077 3078 3079 3080 3081 3082 3083 3084 3085 3086 3087 3088 3089 3090 3091 3092 3093 3094 3095 3096 3097 3098 3099 3100 3101 3102 3103 3104 3105 3106 3107 3108 3109 3110 3111 3112 3113 3114 3115 3116 3117 3118 3119 3120 3121 3122 3123 3124 3125 3126 3127 3128 3129 3130 3131 3132 3133 3134 3135 3136 3137 3138 3139 3140 3141 3142 3143 3144 3145 3146 3147 3148 3149 3150 3151 3152 3153 3154 3155 3156 3157 3158 3159 3160 3161 3162 3163 3164 3165 3166 3167 3168 3169 3170 3171 3172 3173 3174 3175 3176 3177 3178 3179 3180 3181 3182 3183 3184 3185 3186 3187 3188 3189 3190 3191 3192 3193 3194 3195 3196 3197 3198 3199 3200 3201 3202 3203 3204 3205 3206 3207 3208 3209 3210 3211 3212 3213 3214 3215 3216 3217 3218 3219 3220 3221 3222 3223 3224 3225 3226 3227 3228 3229 3230 3231 3232 3233 3234 3235 3236 3237 3238 3239 3240 3241 3242 3243 3244 3245 3246 3247 3248 3249 3250 3251 3252 3253 3254 3255 3256 3257 3258 3259 3260 3261 3262 3263 3264 3265 3266 3267 3268 3269 3270 3271 3272 3273 3274 3275 3276 3277 3278 3279 3280 3281 3282 3283 3284 3285 3286 3287 3288 3289 3290 3291 3292 3293 3294 3295 3296 3297 3298 3299 3300 3301 3302 3303 3304 3305 3306 3307 3308 3309 3310 3311 3312 3313 3314 3315 3316 3317 3318 3319 3320 3321 3322 3323 3324 3325 3326 3327 3328 3329 3330 3331 3332 3333 3334 3335 3336 3337 3338 3339 3340 3341 3342 3343 3344 3345 3346 3347 3348 3349 3350 3351 3352 3353 3354 3355 3356 3357 3358 3359 3360 3361 3362 3363 3364 3365 3366 3367 3368 3369 3370 3371 3372 3373 3374 3375 3376 3377 3378 3379 3380 3381 3382 3383 3384 3385 3386 3387 3388 3389 3390 3391 3392 3393 3394 3395 3396 3397 3398 3399 3400 3401 3402 3403 3404 3405 3406 3407 3408 3409 3410 3411 3412 3413 3414 3415 3416 3417 3418 3419 3420 3421 3422 3423 3424 3425 3426 3427 3428 3429 3430 3431 3432 3433 3434 3435 3436 3437 3438 3439 3440 3441 3442 3443 3444 3445 3446 3447 3448 3449 3450 3451 3452 3453 3454 3455 3456 3457 3458 3459 3460 3461 3462 3463 3464 3465 3466 3467 3468 3469 3470 3471 3472 3473 3474 3475 3476 3477 3478 3479 3480 3481 3482 3483 3484 3485 3486 3487 3488 3489 3490 3491 3492 3493 3494 3495 3496 3497 3498 3499 3500 3501 3502 3503 3504 3505 3506 3507 3508 3509 3510 3511 3512 3513 3514 3515 3516 3517 3518 3519 3520 3521 3522 3523 3524 3525 3526 3527 3528 3529 3530 3531 3532 3533 3534 3535 3536 3537 3538 3539 3540 3541 3542 3543 3544 3545 3546 3547 3548 3549 3550 3551 3552 3553 3554 3555 3556 3557 3558 3559 3560 3561 3562 3563 3564 3565 3566 3567 3568 3569 3570 3571 3572 3573 3574 3575 3576 3577 3578 3579 3580 3581 3582 3583 3584 3585 3586 3587 3588 3589 3590 3591 3592 3593 3594 3595 3596 3597 3598 3599 3600 3601 3602 3603 3604 3605 3606 3607 3608 3609 3610 3611 3612 3613 3614 3615 3616 3617 3618 3619 3620 3621 3622 3623 3624 3625 3626 3627 3628 3629 3630 3631 3632 3633 3634 3635 3636 3637 3638 3639 3640 3641 3642 3643 3644 3645 3646 3647 3648 3649 3650 3651 3652 3653 3654 3655 3656 3657 3658 3659 3660 3661 3662 3663 3664 3665 3666 3667 3668 3669 3670 3671 3672 3673 3674 3675 3676 3677 3678 3679 3680 3681 3682 3683 3684 3685 3686 3687 3688 3689 3690 3691 3692 3693
|
<pre>Network Working Group R. Recio
Request for Comments: 5040 B. Metzler
Category: Standards Track IBM Corporation
P. Culley
J. Hilland
Hewlett-Packard Company
D. Garcia
October 2007
<span class="h1">A Remote Direct Memory Access Protocol Specification</span>
Status of This Memo
This document specifies an Internet standards track protocol for the
Internet community, and requests discussion and suggestions for
improvements. Please refer to the current edition of the "Internet
Official Protocol Standards" (STD 1) for the standardization state
and status of this protocol. Distribution of this memo is unlimited.
Abstract
This document defines a Remote Direct Memory Access Protocol (RDMAP)
that operates over the Direct Data Placement Protocol (DDP protocol).
RDMAP provides read and write services directly to applications and
enables data to be transferred directly into Upper Layer Protocol
(ULP) Buffers without intermediate data copies. It also enables a
kernel bypass implementation.
<span class="grey">Recio, et al. Standards Track [Page 1]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-2" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Table of Contents
<a href="#section-1">1</a>. Introduction ....................................................<a href="#page-4">4</a>
<a href="#section-1.1">1.1</a>. Architectural Goals ........................................<a href="#page-4">4</a>
<a href="#section-1.2">1.2</a>. Protocol Overview ..........................................<a href="#page-5">5</a>
<a href="#section-1.3">1.3</a>. RDMAP Layering .............................................<a href="#page-7">7</a>
<a href="#section-2">2</a>. Glossary ........................................................<a href="#page-8">8</a>
<a href="#section-2.1">2.1</a>. General ....................................................<a href="#page-8">8</a>
<a href="#section-2.2">2.2</a>. LLP .......................................................<a href="#page-10">10</a>
<a href="#section-2.3">2.3</a>. Direct Data Placement (DDP) ...............................<a href="#page-11">11</a>
<a href="#section-2.4">2.4</a>. Remote Direct Memory Access (RDMA) ........................<a href="#page-13">13</a>
<a href="#section-3">3</a>. ULP and Transport Attributes ...................................<a href="#page-15">15</a>
<a href="#section-3.1">3.1</a>. Transport Requirements and Assumptions ....................<a href="#page-15">15</a>
<a href="#section-3.2">3.2</a>. RDMAP Interactions with the ULP ...........................<a href="#page-16">16</a>
<a href="#section-4">4</a>. Header Format ..................................................<a href="#page-19">19</a>
<a href="#section-4.1">4.1</a>. RDMAP Control and Invalidate STag Field ...................<a href="#page-20">20</a>
<a href="#section-4.2">4.2</a>. RDMA Message Definitions ..................................<a href="#page-23">23</a>
<a href="#section-4.3">4.3</a>. RDMA Write Header .........................................<a href="#page-24">24</a>
<a href="#section-4.4">4.4</a>. RDMA Read Request Header ..................................<a href="#page-24">24</a>
<a href="#section-4.5">4.5</a>. RDMA Read Response Header .................................<a href="#page-26">26</a>
<a href="#section-4.6">4.6</a>. Send Header and Send with Solicited Event Header ..........<a href="#page-26">26</a>
4.7. Send with Invalidate Header and Send with SE and
Invalidate Header .........................................<a href="#page-26">26</a>
<a href="#section-4.8">4.8</a>. Terminate Header ..........................................<a href="#page-26">26</a>
<a href="#section-5">5</a>. Data Transfer ..................................................<a href="#page-32">32</a>
<a href="#section-5.1">5.1</a>. RDMA Write Message ........................................<a href="#page-32">32</a>
<a href="#section-5.2">5.2</a>. RDMA Read Operation .......................................<a href="#page-33">33</a>
<a href="#section-5.2.1">5.2.1</a>. RDMA Read Request Message ..........................<a href="#page-33">33</a>
<a href="#section-5.2.2">5.2.2</a>. RDMA Read Response Message .........................<a href="#page-35">35</a>
<a href="#section-5.3">5.3</a>. Send Message Type .........................................<a href="#page-36">36</a>
<a href="#section-5.4">5.4</a>. Terminate Message .........................................<a href="#page-37">37</a>
<a href="#section-5.5">5.5</a>. Ordering and Completions ..................................<a href="#page-38">38</a>
<a href="#section-6">6</a>. RDMAP Stream Management ........................................<a href="#page-41">41</a>
<a href="#section-6.1">6.1</a>. Stream Initialization .....................................<a href="#page-41">41</a>
<a href="#section-6.2">6.2</a>. Stream Teardown ...........................................<a href="#page-42">42</a>
<a href="#section-6.2.1">6.2.1</a>. RDMAP Abortive Termination .........................<a href="#page-43">43</a>
<a href="#section-7">7</a>. RDMAP Error Management .........................................<a href="#page-43">43</a>
<a href="#section-7.1">7.1</a>. RDMAP Error Surfacing .....................................<a href="#page-44">44</a>
7.2. Errors Detected at the Remote Peer on Incoming
RDMA Messages .............................................<a href="#page-45">45</a>
<a href="#section-8">8</a>. Security Considerations ........................................<a href="#page-46">46</a>
<a href="#section-8.1">8.1</a>. Summary of RDMAP-Specific Security Requirements ...........<a href="#page-46">46</a>
<a href="#section-8.1.1">8.1.1</a>. RDMAP (RNIC) Requirements ..........................<a href="#page-47">47</a>
<a href="#section-8.1.2">8.1.2</a>. Privileged Resource Manager Requirements ...........<a href="#page-48">48</a>
<a href="#section-8.2">8.2</a>. Security Services for RDMAP ...............................<a href="#page-49">49</a>
<a href="#section-8.2.1">8.2.1</a>. Available Security Services ........................<a href="#page-49">49</a>
<a href="#section-8.2.2">8.2.2</a>. Requirements for IPsec Services for RDMAP ..........<a href="#page-50">50</a>
<a href="#section-9">9</a>. IANA Considerations ............................................<a href="#page-51">51</a>
<span class="grey">Recio, et al. Standards Track [Page 2]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-3" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<a href="#section-10">10</a>. References ....................................................<a href="#page-52">52</a>
<a href="#section-10.1">10.1</a>. Normative References .....................................<a href="#page-52">52</a>
<a href="#section-10.2">10.2</a>. Informative References ...................................<a href="#page-53">53</a>
<a href="#appendix-A">Appendix A</a>. DDP Segment Formats for RDMA Messages .................<a href="#page-54">54</a>
<a href="#appendix-A.1">A.1</a>. DDP Segment for RDMA Write ................................<a href="#page-54">54</a>
<a href="#appendix-A.2">A.2</a>. DDP Segment for RDMA Read Request .........................<a href="#page-55">55</a>
<a href="#appendix-A.3">A.3</a>. DDP Segment for RDMA Read Response ........................<a href="#page-56">56</a>
<a href="#appendix-A.4">A.4</a>. DDP Segment for Send and Send with Solicited Event ........<a href="#page-56">56</a>
A.5. DDP Segment for Send with Invalidate and Send with SE and
Invalidate ................................................<a href="#page-57">57</a>
<a href="#appendix-A.6">A.6</a>. DDP Segment for Terminate .................................<a href="#page-58">58</a>
<a href="#appendix-B">Appendix B</a>. Ordering and Completion Table .........................<a href="#page-59">59</a>
<a href="#appendix-C">Appendix C</a>. Contributors ..........................................<a href="#page-61">61</a>
Table of Figures
Figure 1: RDMAP Layering ...........................................<a href="#page-7">7</a>
Figure 2: Example of MPA, DDP, and RDMAP Header Alignment over TCP .8
Figure 3: DDP Control, RDMAP Control, and Invalidate STag Fields ..20
Figure 4: RDMA Usage of DDP Fields ................................<a href="#page-22">22</a>
Figure 5: RDMA Message Definitions ................................<a href="#page-23">23</a>
Figure 6: RDMA Read Request Header Format .........................<a href="#page-24">24</a>
Figure 7: Terminate Header Format .................................<a href="#page-27">27</a>
Figure 8: Terminate Control Field .................................<a href="#page-27">27</a>
Figure 9: Terminate Control Field Values ..........................<a href="#page-29">29</a>
Figure 10: Error Type to RDMA Message Mapping .....................<a href="#page-32">32</a>
Figure 11: RDMA Write, DDP Segment Format .........................<a href="#page-54">54</a>
Figure 12: RDMA Read Request, DDP Segment Format ..................<a href="#page-55">55</a>
Figure 13: RDMA Read Response, DDP Segment Format .................<a href="#page-56">56</a>
Figure 14: Send and Send with Solicited Event, DDP Segment Format .56
Figure 15: Send with Invalidate and Send with SE and Invalidate,
DDP Segment Format .....................................<a href="#page-57">57</a>
Figure 16: Terminate, DDP Segment Format ..........................<a href="#page-58">58</a>
Figure 17: Operation Ordering .....................................<a href="#page-59">59</a>
<span class="grey">Recio, et al. Standards Track [Page 3]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-4" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h2"><a class="selflink" id="section-1" href="#section-1">1</a>. Introduction</span>
Today, communications over TCP/IP typically require copy operations,
which add latency and consume significant CPU and memory resources.
The Remote Direct Memory Access Protocol (RDMAP) enables removal of
data copy operations and enables reduction in latencies by allowing a
local application to read or write data on a remote computer's memory
with minimal demands on memory bus bandwidth and CPU processing
overhead, while preserving memory protection semantics.
RDMAP is layered on top of Direct Data Placement (DDP) and uses the
two buffer models available from DDP. DDP-related terminology is
discussed in <a href="#section-2.3">Section 2.3</a>. As RDMAP builds on DDP, the reader is
advised to become familiar with [<a href="#ref-DDP" title=""Direct Data Placement over Reliable Transports"">DDP</a>].
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in <a href="./rfc2119">RFC 2119</a> [<a href="./rfc2119" title=""Key words for use in RFCs to Indicate Requirement Levels"">RFC2119</a>].
<span class="h3"><a class="selflink" id="section-1.1" href="#section-1.1">1.1</a>. Architectural Goals</span>
RDMAP has been designed with the following high-level architectural
goals:
* Provide a data transfer operation that allows a Local Peer to
transfer up to 2^32 - 1 octets directly into a previously
Advertised Buffer (i.e., Tagged Buffer) located at a Remote Peer
without requiring a copy operation. This is referred to as the
RDMA Write data transfer operation.
* Provide a data transfer operation that allows a Local Peer to
retrieve up to 2^32 - 1 octets directly from a previously
Advertised Buffer (i.e., Tagged Buffer) located at a Remote Peer
without requiring a copy operation. This is referred to as the
RDMA Read data transfer operation.
* Provide a data transfer operation that allows a Local Peer to send
up to 2^32 - 1 octets directly into a buffer located at a Remote
Peer that has not been explicitly Advertised. This is referred to
as the Send (Send with Invalidate, Send with Solicited Event, and
Send with Solicited Event and Invalidate) data transfer operation.
* Enable the local ULP to use the Send Operation Type (includes
Send, Send with Invalidate, Send with Solicited Event, and Send
with Solicited Event and Invalidate) to signal to the remote ULP
the Completion of all previous Messages initiated by the local
ULP.
<span class="grey">Recio, et al. Standards Track [Page 4]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-5" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
* Provide for all operations on a single RDMAP Stream to be reliably
transmitted in the order that they were submitted.
* Provide RDMAP capabilities independently for each Stream when the
LLP supports multiple data Streams within an LLP connection.
<span class="h3"><a class="selflink" id="section-1.2" href="#section-1.2">1.2</a>. Protocol Overview</span>
RDMAP provides seven data transfer operations. Except for the RDMA
Read operation, each operation generates exactly one RDMA Message.
Following is a brief overview of the RDMA Operations and RDMA
Messages:
1. Send - A Send operation uses a Send Message to transfer data from
the Data Source into a buffer that has not been explicitly
Advertised by the Data Sink. The Send Message uses the DDP
Untagged Buffer Model to transfer the ULP Message into the Data
Sink's Untagged Buffer.
2. Send with Invalidate - A Send with Invalidate operation uses a
Send with Invalidate Message to transfer data from the Data
Source into a buffer that has not been explicitly Advertised by
the Data Sink. The Send with Invalidate Message includes all
functionality of the Send Message, with one addition: an STag
field is included in the Send with Invalidate Message. After the
message has been Placed and Delivered at the Data Sink, the
Remote Peer's buffer identified by the STag can no longer be
accessed remotely until the Remote Peer's ULP re-enables access
and Advertises the buffer.
3. Send with Solicited Event (Send with SE) - A Send with Solicited
Event operation uses a Send with Solicited Event Message to
transfer data from the Data Source into an Untagged Buffer at the
Data Sink. The Send with Solicited Event Message is similar to
the Send Message, with one addition: when the Send with Solicited
Event Message has been Placed and Delivered, an Event may be
generated at the recipient, if the recipient is configured to
generate such an Event.
4. Send with Solicited Event and Invalidate (Send with SE and
Invalidate) - A Send with Solicited Event and Invalidate
operation uses a Send with Solicited Event and Invalidate Message
to transfer data from the Data Source into a buffer that has not
been explicitly Advertised by the Data Sink. The Send with
Solicited Event and Invalidate Message is similar to the Send
with Invalidate Message, with one addition: when the Send with
<span class="grey">Recio, et al. Standards Track [Page 5]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-6" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Solicited Event and Invalidate Message has been Placed and
Delivered, an Event may be generated at the recipient, if the
recipient is configured to generate such an Event.
5. Remote Direct Memory Access Write - An RDMA Write operation uses
an RDMA Write Message to transfer data from the Data Source to a
previously Advertised Buffer at the Data Sink.
The ULP at the Remote Peer, which in this case is the Data Sink,
enables the Data Sink Tagged Buffer for access and Advertises the
buffer's size (length), location (Tagged Offset), and Steering
Tag (STag) to the Data Source through a ULP-specific mechanism.
The ULP at the Local Peer, which in this case is the Data Source,
initiates the RDMA Write operation. The RDMA Write Message uses
the DDP Tagged Buffer Model to transfer the ULP Message into the
Data Sink's Tagged Buffer. Note: the STag associated with the
Tagged Buffer remains valid until the ULP at the Remote Peer
invalidates it or the ULP at the Local Peer invalidates it
through a Send with Invalidate or Send with Solicited Event and
Invalidate.
6. Remote Direct Memory Access Read - The RDMA Read operation
transfers data to a Tagged Buffer at the Local Peer, which in
this case is the Data Sink, from a Tagged Buffer at the Remote
Peer, which in this case is the Data Source. The ULP at the Data
Source enables the Data Source Tagged Buffer for access and
Advertises the buffer's size (length), location (Tagged Offset),
and Steering Tag (STag) to the Data Sink through a ULP-specific
mechanism. The ULP at the Data Sink enables the Data Sink Tagged
Buffer for access and initiates the RDMA Read operation. The
RDMA Read operation consists of a single RDMA Read Request
Message and a single RDMA Read Response Message, and the latter
may be segmented into multiple DDP Segments.
The RDMA Read Request Message uses the DDP Untagged Buffer Model
to Deliver the STag, starting Tagged Offset, and length for both
the Data Source and Data Sink Tagged Buffers to the Remote Peer's
RDMA Read Request Queue.
The RDMA Read Response Message uses the DDP Tagged Buffer Model
to Deliver the Data Source's Tagged Buffer to the Data Sink,
without any involvement from the ULP at the Data Source.
Note: the Data Source STag associated with the Tagged Buffer
remains valid until the ULP at the Data Source invalidates it or
the ULP at the Data Sink invalidates it through a Send with
<span class="grey">Recio, et al. Standards Track [Page 6]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-7" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Invalidate or Send with Solicited Event and Invalidate. The Data
Sink STag associated with the Tagged Buffer remains valid until
the ULP at the Data Sink invalidates it.
7. Terminate - A Terminate operation uses a Terminate Message to
transfer to the Remote Peer information associated with an error
that occurred at the Local Peer. The Terminate Message uses the
DDP Untagged Buffer Model to transfer the Message into the Data
Sink's Untagged Buffer.
<span class="h3"><a class="selflink" id="section-1.3" href="#section-1.3">1.3</a>. RDMAP Layering</span>
RDMAP is dependent on DDP, subject to the requirements defined in
<a href="#section-3.1">Section 3.1</a>, "Transport Requirements and Assumptions". Figure 1,
"RDMAP Layering", depicts the relationship between Upper Layer
Protocols (ULPs), RDMAP, DDP protocol, the framing layer, and the
transport. For LLP protocol definitions of each LLP, see [<a href="#ref-MPA" title=""Marker PDU Aligned Framing for TCP Specification"">MPA</a>],
[<a href="#ref-TCP" title=""Transmission Control Protocol"">TCP</a>], and [<a href="#ref-SCTP" title=""Stream Control Transmission Protocol"">SCTP</a>].
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| Upper Layer Protocol (ULP) |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| RDMAP |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| DDP protocol |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| | |
| MPA | |
| | |
+-+-+-+-+-+-+-+-+-+ SCTP |
| | |
| TCP | |
| | |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 1: RDMAP Layering
If RDMAP is layered over DDP/MPA/TCP, then the respective headers and
ULP Payload are arranged as follows (Note: For clarity, MPA header
and CRC fields are included but MPA markers are not shown):
<span class="grey">Recio, et al. Standards Track [Page 7]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-8" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
// TCP Header //
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| MPA Header | |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ +
| |
// DDP Header //
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
// RDMA Header //
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
// ULP Payload //
// (shown with no pad bytes) //
// //
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| MPA CRC |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 2: Example of MPA, DDP, and RDMAP Header Alignment over TCP
<span class="h2"><a class="selflink" id="section-2" href="#section-2">2</a>. Glossary</span>
<span class="h3"><a class="selflink" id="section-2.1" href="#section-2.1">2.1</a>. General</span>
Advertisement (Advertised, Advertise, Advertisements, Advertises) -
the act of informing a Remote Peer that a local RDMA Buffer is
available to it. A Node makes available an RDMA Buffer for
incoming RDMA Read or RDMA Write access by informing its RDMA/DDP
peer of the Tagged Buffer identifiers (STag, base address, and
buffer length). This Advertisement of Tagged Buffer information
is not defined by RDMA/DDP and is left to the ULP. A typical
method would be for the Local Peer to embed the Tagged Buffer's
Steering Tag, base address, and length in a Send Message destined
for the Remote Peer.
Completion - Refer to "RDMA Completion" in <a href="#section-2.4">Section 2.4</a>.
Completed - See "RDMA Completion" in <a href="#section-2.4">Section 2.4</a>.
Complete - See "RDMA Completion" in <a href="#section-2.4">Section 2.4</a>.
<span class="grey">Recio, et al. Standards Track [Page 8]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-9" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Completes - See "RDMA Completion" in <a href="#section-2.4">Section 2.4</a>.
Data Sink - The peer receiving a data payload. Note that the Data
Sink can be required to both send and receive RDMA/DDP Messages
to transfer a data payload.
Data Source - The peer sending a data payload. Note that the Data
Source can be required to both send and receive RDMA/DDP Messages
to transfer a data payload.
Data Delivery (Delivery, Delivered, Delivers) - Delivery is defined
as the process of informing the ULP or consumer that a particular
Message is available for use. This is specifically different
from "Placement", which may generally occur in any order, while
the order of "Delivery" is strictly defined. See "Data
Placement" in <a href="#section-2.3">Section 2.3</a>.
Delivery - See Data Delivery in <a href="#section-2.1">Section 2.1</a>.
Delivered - See Data Delivery in <a href="#section-2.1">Section 2.1</a>.
Delivers - See Data Delivery in <a href="#section-2.1">Section 2.1</a>.
Fabric - The collection of links, switches, and routers that connect
a set of Nodes with RDMA/DDP protocol implementations.
Fence (Fenced, Fences) - To block the current RDMA Operation from
executing until prior RDMA Operations have Completed.
iWARP - A suite of wire protocols comprised of RDMAP, DDP, and MPA.
The iWARP protocol suite may be layered above TCP, SCTP, or other
transport protocols.
Local Peer - The RDMA/DDP protocol implementation on the local end of
the connection. Used to refer to the local entity when
describing a protocol exchange or other interaction between two
Nodes.
Node - A computing device attached to one or more links of a Fabric
(network). A Node in this context does not refer to a specific
application or protocol instantiation running on the computer. A
Node may consist of one or more RNICs installed in a host
computer.
Placement - See "Data Placement" in <a href="#section-2.3">Section 2.3</a>.
Placed - See "Data Placement" in <a href="#section-2.3">Section 2.3</a>.
<span class="grey">Recio, et al. Standards Track [Page 9]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-10" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Places - See "Data Placement" in <a href="#section-2.3">Section 2.3</a>.
Remote Peer - The RDMA/DDP protocol implementation on the opposite
end of the connection. Used to refer to the remote entity when
describing protocol exchanges or other interactions between two
Nodes.
RNIC - RDMA Network Interface Controller. In this context, this
would be a network I/O adapter or embedded controller with iWARP
and Verbs functionality.
RNIC Interface (RI) - The presentation of the RNIC to the Verbs
Consumer as implemented through the combination of the RNIC and
the RNIC driver.
Termination - See "RDMAP Abortive Termination" in <a href="#section-2.4">Section 2.4</a>.
Terminated - See "RDMAP Abortive Termination" in <a href="#section-2.4">Section 2.4</a>.
Terminate - See "RDMAP Abortive Termination" in <a href="#section-2.4">Section 2.4</a>.
Terminates - See "RDMAP Abortive Termination" in <a href="#section-2.4">Section 2.4</a>.
ULP - Upper Layer Protocol. The protocol layer above the one
currently being referenced. The ULP for RDMA/DDP is expected to
be an OS, Application, adaptation layer, or proprietary device.
The RDMA/DDP documents do not specify a ULP -- they provide a set
of semantics that allow a ULP to be designed to utilize RDMA/DDP.
ULP Payload - The ULP data that is contained within a single protocol
segment or packet (e.g., a DDP Segment).
Verbs - An abstract description of the functionality of an RNIC
Interface. The OS may expose some or all of this functionality
via one or more APIs to applications. The OS will also use some
of the functionality to manage the RNIC Interface.
<span class="h3"><a class="selflink" id="section-2.2" href="#section-2.2">2.2</a>. LLP</span>
LLP - Lower Layer Protocol. The protocol layer beneath the protocol
layer currently being referenced. For example, for DDP, the LLP
is SCTP, MPA, or other transport protocols. For RDMA, the LLP is
DDP.
LLP Connection - Corresponds to an LLP transport-level connection
between the peer LLP layers on two Nodes.
<span class="grey">Recio, et al. Standards Track [Page 10]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-11" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
LLP Stream - Corresponds to a single LLP transport-level Stream
between the peer LLP layers on two Nodes. One or more LLP
Streams may map to a single transport-level LLP connection. For
transport protocols that support multiple Streams per connection
(e.g., SCTP), an LLP Stream corresponds to one transport-level
Stream.
MULPDU - Maximum ULPDU. The current maximum size of the record that
is acceptable for DDP to pass to the LLP for transmission.
ULPDU - Upper Layer Protocol Data Unit. The data record defined by
the layer above MPA.
<span class="h3"><a class="selflink" id="section-2.3" href="#section-2.3">2.3</a>. Direct Data Placement (DDP)</span>
Data Placement (Placement, Placed, Places) - For DDP, this term is
specifically used to indicate the process of writing to a data
buffer by a DDP implementation. DDP Segments carry Placement
information, which may be used by the receiving DDP
implementation to perform Data Placement of the DDP Segment ULP
Payload. See "Data Delivery".
DDP Abortive Teardown - The act of closing a DDP Stream without
attempting to Complete in-progress and pending DDP Messages.
DDP Graceful Teardown - The act of closing a DDP Stream such that all
in-progress and pending DDP Messages are allowed to Complete
successfully.
DDP Control Field - A fixed 16-bit field in the DDP Header. The DDP
Control Field contains an 8-bit field whose contents are reserved
for use by the ULP.
DDP Header - The header present in all DDP segments. The DDP Header
contains control and Placement fields that are used to define the
final Placement location for the ULP Payload carried in a DDP
Segment.
DDP Message - A ULP-defined unit of data interchange, which is
subdivided into one or more DDP segments. This segmentation may
occur for a variety of reasons, including segmentation to respect
the maximum segment size of the underlying transport protocol.
DDP Segment - The smallest unit of data transfer for the DDP
protocol. It includes a DDP Header and ULP Payload (if present).
A DDP Segment should be sized to fit within the underlying
transport protocol MULPDU.
<span class="grey">Recio, et al. Standards Track [Page 11]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-12" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
DDP Stream - A sequence of DDP Messages whose ordering is defined by
the LLP. For SCTP, a DDP Stream maps directly to an SCTP Stream.
For MPA, a DDP Stream maps directly to a TCP connection, and a
single DDP Stream is supported. Note that DDP has no ordering
guarantees between DDP Streams.
Direct Data Placement - A mechanism whereby ULP data contained within
DDP Segments may be Placed directly into its final destination in
memory without processing of the ULP. This may occur even when
the DDP Segments arrive out of order. Out-of-order Placement
support may require the Data Sink to implement the LLP and DDP as
one functional block.
Direct Data Placement Protocol (DDP) - Also, a wire protocol that
supports Direct Data Placement by associating explicit memory
buffer placement information with the LLP payload units.
Message Offset (MO) - For the DDP Untagged Buffer Model, specifies
the offset, in bytes, from the start of a DDP Message.
Message Sequence Number (MSN) - For the DDP Untagged Buffer Model,
specifies a sequence number that is increasing with each DDP
Message.
Queue Number (QN) - For the DDP Untagged Buffer Model, identifies a
destination Data Sink queue for a DDP Segment.
Steering Tag - An identifier of a Tagged Buffer on a Node, valid as
defined within a protocol specification.
STag - Steering Tag
Tagged Buffer - A buffer that is explicitly Advertised to the Remote
Peer through exchange of an STag, Tagged Offset, and length.
Tagged Buffer Model - A DDP data transfer model used to transfer
Tagged Buffers from the Local Peer to the Remote Peer.
Tagged DDP Message - A DDP Message that targets a Tagged Buffer.
Tagged Offset (TO) - The offset within a Tagged Buffer on a Node.
Untagged Buffer - A buffer that is not explicitly Advertised to the
Remote Peer. Untagged Buffers support one of the two available
data transfer mechanisms called the Untagged Buffer Model. An
Untagged Buffer is used to send asynchronous control messages to
the Remote Peer for RDMA Read, Send, and Terminate requests.
Untagged Buffers handle Untagged DDP Messages.
<span class="grey">Recio, et al. Standards Track [Page 12]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-13" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Untagged Buffer Model - A DDP data transfer model used to transfer
Untagged Buffers from the Local Peer to the Remote Peer.
Untagged DDP Message - A DDP Message that targets an Untagged Buffer.
<span class="h3"><a class="selflink" id="section-2.4" href="#section-2.4">2.4</a>. Remote Direct Memory Access (RDMA)</span>
Completion Queues (CQs) - Logical components of the RNIC Interface
that conceptually represent how an RNIC notifies the ULP about
the completion of the transmission of data, or the completion of
the reception of data; see [<a href="#ref-RDMASEC" title=""Direct Data Placement Protocol (DDP) / Remote Direct Memory Access Protocol (RDMAP) Security"">RDMASEC</a>].
Event - An indication provided by the RDMAP layer to the ULP to
indicate a Completion or other condition requiring immediate
attention.
Invalidate STag - A mechanism used to prevent the Remote Peer from
reusing a previous explicitly Advertised STag, until the Local
Peer makes it available through a subsequent explicit
Advertisement. The STag cannot be accessed remotely until it is
explicitly Advertised again.
RDMA Completion (Completion, Completed, Complete, Completes) - For
RDMA, Completion is defined as the process of informing the ULP
that a particular RDMA Operation has performed all functions
specified for the RDMA Operations, including Placement and
Delivery. The Completion semantic of each RDMA Operation is
distinctly defined.
RDMA Message - A data transfer mechanism used to fulfill an RDMA
Operation.
RDMA Operation - A sequence of RDMA Messages, including control
Messages, to transfer data from a Data Source to a Data Sink.
The following RDMA Operations are defined: RDMA Writes, RDMA
Read, Send, Send with Invalidate, Send with Solicited Event, Send
with Solicited Event and Invalidate, and Terminate.
RDMA Protocol (RDMAP) - A wire protocol that supports RDMA Operations
to transfer ULP data between a Local Peer and the Remote Peer.
RDMAP Abortive Termination (Termination, Terminated, Terminate,
Terminates) - The act of closing an RDMAP Stream without
attempting to Complete in-progress and pending RDMA Operations.
RDMAP Graceful Termination - The act of closing an RDMAP Stream such
that all in-progress and pending RDMA Operations are allowed to
Complete successfully.
<span class="grey">Recio, et al. Standards Track [Page 13]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-14" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
RDMA Read - An RDMA Operation used by the Data Sink to transfer the
contents of a source RDMA buffer from the Remote Peer to the
Local Peer. An RDMA Read operation consists of a single RDMA
Read Request Message and a single RDMA Read Response Message.
RDMA Read Request - An RDMA Message used by the Data Sink to request
the Data Source to transfer the contents of an RDMA buffer. The
RDMA Read Request Message describes both the Data Source and Data
Sink RDMA buffers.
RDMA Read Request Queue - The queue used for processing RDMA Read
Requests. The RDMA Read Request Queue has a DDP Queue Number of
1.
RDMA Read Response - An RDMA Message used by the Data Source to
transfer the contents of an RDMA buffer to the Data Sink, in
response to an RDMA Read Request. The RDMA Read Response Message
only describes the data sink RDMA buffer.
RDMAP Stream - An association between a pair of RDMAP
implementations, possibly on different Nodes, which transfer ULP
data using RDMA Operations. There may be multiple RDMAP Streams
on a single Node. An RDMAP Stream maps directly to a single DDP
Stream.
RDMA Write - An RDMA Operation that transfers the contents of a
source RDMA Buffer from the Local Peer to a destination RDMA
Buffer at the Remote Peer using RDMA. The RDMA Write Message
only describes the Data Sink RDMA buffer.
Remote Direct Memory Access (RDMA) - A method of accessing memory on
a remote system in which the local system specifies the remote
location of the data to be transferred. Employing an RNIC in the
remote system allows the access to take place without
interrupting the processing of the CPU(s) on the system.
Send - An RDMA Operation that transfers the contents of a ULP Buffer
from the Local Peer to an Untagged Buffer at the Remote Peer.
Send Message Type - A Send Message, Send with Invalidate Message,
Send with Solicited Event Message, or Send with Solicited Event
and Invalidate Message.
Send Operation Type - A Send Operation, Send with Invalidate
Operation, Send with Solicited Event Operation, or Send with
Solicited Event and Invalidate Operation.
<span class="grey">Recio, et al. Standards Track [Page 14]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-15" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Solicited Event (SE) - A facility by which an RDMA Operation sender
may cause an Event to be generated at the recipient, if the
recipient is configured to generate such an Event, when a Send
with Solicited Event Message or Send with Solicited Event and
Invalidate Message is received. Note: The Local Peer's ULP can
use the Solicited Event mechanism to ensure that Messages
designated as important to the ULP are handled in an expeditious
manner by the Remote Peer's ULP. The ULP at the Local Peer can
indicate a given Send Message Type is important by using the Send
with Solicited Event Message or Send with Solicited Event and
Invalidate Message. The ULP at the Remote Peer can choose to
only be notified when valid Send with Solicited Event Messages
and/or Send with Solicited Event and Invalidate Messages arrive
and handle other valid incoming Send Messages or Send with
Invalidate Messages at its leisure.
Terminate - An RDMA Message used by a Node to pass an error
indication to the peer Node on an RDMAP Stream. This operation
is for RDMAP use only.
ULP Buffer - A buffer owned above the RDMAP layer and Advertised to
the RDMAP layer either as a Tagged Buffer or an Untagged ULP
Buffer.
ULP Message - The ULP data that is handed to a specific protocol
layer for transmission. Data boundaries are preserved as they
are transmitted through iWARP.
<span class="h2"><a class="selflink" id="section-3" href="#section-3">3</a>. ULP and Transport Attributes</span>
<span class="h3"><a class="selflink" id="section-3.1" href="#section-3.1">3.1</a>. Transport Requirements and Assumptions</span>
RDMAP MUST be layered on top of the Direct Data Placement Protocol
[<a href="#ref-DDP" title=""Direct Data Placement over Reliable Transports"">DDP</a>].
RDMAP requires the following DDP support:
* RDMAP uses three queues for Untagged Buffers:
* Queue Number 0 (used by RDMAP for Send, Send with Invalidate,
Send with Solicited Event, and Send with Solicited Event and
Invalidate operations).
* Queue Number 1 (used by RDMAP for RDMA Read operations).
* Queue Number 2 (used by RDMAP for Terminate operations).
* DDP maps a single RDMA Message to a single DDP Message.
<span class="grey">Recio, et al. Standards Track [Page 15]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-16" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
* DDP uses the STag and Tagged Offset provided by the RDMAP for
Tagged Buffer Messages (i.e., RDMA Write and RDMA Read Response).
* When the DDP layer Delivers an Untagged DDP Message to the RDMAP
layer, DDP provides the length of the DDP Message. This ensures
that RDMAP does not have to carry a length field in its header.
* When the RDMAP layer provides an RDMA Message to the DDP layer,
DDP must insert the RsvdULP field value provided by the RDMAP
layer into the associated DDP Message.
* When the DDP layer Delivers a DDP Message to the RDMAP layer, DDP
provides the RsvdULP field.
* The RsvdULP field must be 1 octet for DDP Tagged Messages and 5
octets for DDP Untagged Messages.
* DDP propagates to RDMAP all operation or protection errors (used
by RDMAP Terminate) and, when appropriate, the DDP Header fields
of the DDP Segment that encountered the error.
* If an RDMA Operation is aborted by DDP or a lower layer, the
contents of the Data Sink buffers associated with the operation
are considered indeterminate.
* DDP, in conjunction with the lower layers, provides reliable, in-
order Delivery.
<span class="h3"><a class="selflink" id="section-3.2" href="#section-3.2">3.2</a>. RDMAP Interactions with the ULP</span>
RDMAP provides the ULP with access to the following RDMA Operations
as defined in this specification:
* Send
* Send with Solicited Event
* Send with Invalidate
* Send with Solicited Event and Invalidate
* RDMA Write
* RDMA Read
<span class="grey">Recio, et al. Standards Track [Page 16]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-17" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
For Send Operation Types, the following are the interactions between
the RDMAP layer and the ULP:
* At the Data Source:
* The ULP passes to the RDMAP layer the following:
* ULP Message Length
* ULP Message
* An indication of the Send Operation Type, where the valid
types are: Send, Send with Solicited Event, Send with
Invalidate, or Send with Solicited Event and Invalidate.
* An Invalidate STag, if the Send Operation Type was Send with
Invalidate or Send with Solicited Event and Invalidate.
* When the Send Operation Type Completes, an indication of the
Completion results.
* At the Data Sink:
* If the Send Operation Type Completed successfully, the RDMAP
layer passes the following information to the ULP Layer:
* ULP Message Length
* ULP Message
* An Event, if the Data Sink is configured to generate an
Event.
* An Invalidated STag, if the Send Operation Type was Send
with Invalidate or Send with Solicited Event and Invalidate.
* If the Send Operation Type Completed in error, the Data Sink
RDMAP layer will pass up the corresponding error information to
the Data Sink ULP and send a Terminate Message to the Data
Source RDMAP layer. The Data Source RDMAP layer will then pass
up the Terminate Message to the ULP.
For RDMA Write operations, the following are the interactions between
the RDMAP layer and the ULP:
* At the Data Source:
* The ULP passes to the RDMAP layer the following:
<span class="grey">Recio, et al. Standards Track [Page 17]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-18" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
* ULP Message Length
* ULP Message
* Data Sink STag
* Data Sink Tagged Offset
* When the RDMA Write operation Completes, an indication of
the Completion results.
* At the Data Sink:
* If the RDMA Write completed successfully, the RDMAP layer does
not Deliver the RDMA Write to the ULP. It does Place the ULP
Message transferred through the RDMA Write Message into the ULP
Buffer.
* If the RDMA Write completed in error, the Data Sink RDMAP layer
will pass up the corresponding error information to the Data
Sink ULP and send a Terminate Message to the Data Source RDMAP
layer. The Data Source RDMAP layer will then pass up the
Terminate Message to the ULP.
For RDMA Read operations, the following are the interactions between
the RDMAP layer and the ULP:
* At the Data Sink:
* The ULP passes to the RDMAP layer the following:
* ULP Message Length
* Data Source STag
* Data Sink STag
* Data Source Tagged Offset
* Data Sink Tagged Offset
* When the RDMA Read operation Completes, an indication of the
Completion results.
* At the Data Source:
* If no error occurred while processing the RDMA Read Request,
the Data Source will not pass up any information to the ULP.
<span class="grey">Recio, et al. Standards Track [Page 18]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-19" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
* If an error occurred while processing the RDMA Read Request,
the Data Source RDMAP layer will pass up the corresponding
error information to the Data Source ULP and send a Terminate
Message to the Data Sink RDMAP layer. The Data Sink RDMAP
layer will then pass up the Terminate Message to the ULP.
For STags made available to the RDMAP layer, following are the
interactions between the RDMAP layer and the ULP:
* If the ULP enables an STag, the ULP passes the following to the
RDMAP layer:
* STag;
* range of Tagged Offsets that are associated with a given STag;
* remote access rights (read, write, or read and write)
associated with a given, valid STag; and
* association between a given STag and a given RDMAP Stream.
* If the ULP disables an STag, the ULP passes to the RDMAP layer the
STag.
If an error occurs at the RDMAP layer, the RDMAP layer may pass back
error information (e.g., the content of a Terminate Message) to the
ULP.
<span class="h2"><a class="selflink" id="section-4" href="#section-4">4</a>. Header Format</span>
The control information of RDMA Messages is included in DDP
protocol-defined header fields, with the following exceptions:
* The first octet reserved for ULP usage on all DDP Messages in the
DDP Protocol (i.e., the RsvdULP Field) is used by RDMAP to carry
the RDMA Message Opcode and the RDMAP version. This octet is
known as the RDMAP Control Field in this specification. For Send
with Invalidate and Send with Solicited Event and Invalidate,
RDMAP uses the second through fifth octets, provided by DDP on
Untagged DDP Messages, to carry the STag that will be Invalidated.
* The RDMA Message length is passed by the RDMAP layer to the DDP
layer on all outbound transfers.
* For RDMA Read Request Messages, the RDMA Read Message Size is
included in the RDMA Read Request Header.
<span class="grey">Recio, et al. Standards Track [Page 19]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-20" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
* The RDMA Message length is passed to the RDMAP layer by the DDP
layer on inbound Untagged Buffer transfers.
* Two RDMA Messages carry additional RDMAP headers. The RDMA Read
Request carries the Data Sink and Data Source buffer descriptions,
including buffer length. The Terminate carries additional
information associated with the error that caused the Terminate.
<span class="h3"><a class="selflink" id="section-4.1" href="#section-4.1">4.1</a>. RDMAP Control and Invalidate STag Field</span>
The version of RDMAP defined by this specification uses all 8 bits of
the RDMAP Control Field. The first octet reserved for ULP use in the
DDP Protocol MUST be used by the RDMAP to carry the RDMAP Control
Field. The ordering of the bits in the first octet MUST be as
defined in Figure 3, "DDP Control, RDMAP Control, and Invalidate STag
Fields". For Send with Invalidate and Send with Solicited Event and
Invalidate, the second through fifth octets of the DDP RsvdULP field
MUST be used by RDMAP to carry the Invalidate STag. Figure 3 depicts
the format of the DDP Control and RDMAP Control fields. (Note: In
Figure 3, the DDP Header is offset by 16 bits to accommodate the MPA
header defined in [<a href="#ref-MPA" title=""Marker PDU Aligned Framing for TCP Specification"">MPA</a>]. The MPA header is only present if DDP is
layered on top of MPA.)
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|T|L| Resrv | DV| RV|Rsv| Opcode|
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Invalidate STag |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 3: DDP Control, RDMAP Control, and Invalidate STag Fields
All RDMA Messages handed by the RDMAP layer to the DDP layer MUST
define the value of the Tagged flag in the DDP Header. Figure 4,
"RDMA Usage of DDP Fields", MUST be used to define the value of the
Tagged flag that is handed to the DDP layer for each RDMA Message.
Figure 4 defines the value of the RDMA Opcode field that MUST be used
for each RDMA Message.
Figure 4 defines when the STag, Queue Number, and Tagged Offset
fields MUST be provided for each RDMA Message.
<span class="grey">Recio, et al. Standards Track [Page 20]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-21" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
For this version of the RDMAP, all RDMA Messages MUST have:
* Bits 24-25; RDMA Version field: 01b for an RNIC that complies with
this RDMA protocol specification. 00b for an RNIC that complies
with the RDMA Consortium's RDMA protocol specification. Both
version numbers are valid. Interoperability is dependent on MPA
protocol version negotiation (e.g., MPA marker and MPA CRC).
* Bits 26-27; Reserved. MUST be set to zero by sender, ignored by
the receiver.
* Bits 28-31; OpCode field: see Figure 4.
* Bits 32-63; Invalidate STag. However, this field is only valid
for Send with Invalidate and Send with Solicited Event and
Invalidate Messages (see Figure 4).
For Send, Send with Solicited Event, RDMA Read Request, and
Terminate, the Invalidate STag field MUST be set to zero on
transmit and ignored by the receiver.
<span class="grey">Recio, et al. Standards Track [Page 21]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-22" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
-------+-----------+-------+------+-------+-----------+--------------
RDMA | Message | Tagged| STag | Queue | Invalidate| Message
Message| Type | Flag | and | Number| STag | Length
OpCode | | | TO | | | Communicated
| | | | | | between DDP
| | | | | | and RDMAP
-------+-----------+-------+------+-------+-----------+--------------
0000b | RDMA Write| 1 | Valid| N/A | N/A | Yes
| | | | | |
-------+-----------+-------+------+-------+-----------+--------------
0001b | RDMA Read | 0 | N/A | 1 | N/A | Yes
| Request | | | | |
-------+-----------+-------+------+-------+-----------+--------------
0010b | RDMA Read | 1 | Valid| N/A | N/A | Yes
| Response | | | | |
-------+-----------+-------+------+-------+-----------+--------------
0011b | Send | 0 | N/A | 0 | N/A | Yes
| | | | | |
-------+-----------+-------+------+-------+-----------+--------------
0100b | Send with | 0 | N/A | 0 | Valid | Yes
| Invalidate| | | | |
-------+-----------+-------+------+-------+-----------+--------------
0101b | Send with | 0 | N/A | 0 | N/A | Yes
| SE | | | | |
-------+-----------+-------+------+-------+-----------+--------------
0110b | Send with | 0 | N/A | 0 | Valid | Yes
| SE and | | | | |
| Invalidate| | | | |
-------+-----------+-------+------+-------+-----------+--------------
0111b | Terminate | 0 | N/A | 2 | N/A | Yes
| | | | | |
-------+-----------+-------+------+-------+-----------+--------------
1000b | |
to | Reserved | Not Specified
1111b | |
-------+-----------+-------------------------------------------------
Figure 4: RDMA Usage of DDP Fields
Note: N/A means Not Applicable.
<span class="grey">Recio, et al. Standards Track [Page 22]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-23" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h3"><a class="selflink" id="section-4.2" href="#section-4.2">4.2</a>. RDMA Message Definitions</span>
The following figure defines which RDMA Headers MUST be used on each
RDMA Message and which RDMA Messages are allowed to carry ULP
Payload:
-------+-----------+-------------------+-------------------------
RDMA | Message | RDMA Header Used | ULP Message allowed in
Message| Type | | the RDMA Message
OpCode | | |
| | |
-------+-----------+-------------------+-------------------------
0000b | RDMA Write| None | Yes
| | |
-------+-----------+-------------------+-------------------------
0001b | RDMA Read | RDMA Read Request | No
| Request | Header |
-------+-----------+-------------------+-------------------------
0010b | RDMA Read | None | Yes
| Response | |
-------+-----------+-------------------+-------------------------
0011b | Send | None | Yes
| | |
-------+-----------+-------------------+-------------------------
0100b | Send with | None | Yes
| Invalidate| |
-------+-----------+-------------------+-------------------------
0101b | Send with | None | Yes
| SE | |
-------+-----------+-------------------+-------------------------
0110b | Send with | None | Yes
| SE and | |
| Invalidate| |
-------+-----------+-------------------+-------------------------
0111b | Terminate | Terminate Header | No
| | |
-------+-----------+-------------------+-------------------------
1000b | |
to | Reserved | Not Specified
1111b | |
-------+-----------+-------------------+-------------------------
Figure 5: RDMA Message Definitions
<span class="grey">Recio, et al. Standards Track [Page 23]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-24" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h3"><a class="selflink" id="section-4.3" href="#section-4.3">4.3</a>. RDMA Write Header</span>
The RDMA Write Message does not include an RDMAP header. The RDMAP
layer passes to the DDP layer an RDMAP Control Field. The RDMA Write
Message is fully described by the DDP Headers of the DDP Segments
associated with the Message.
See <a href="#appendix-A">Appendix A</a> for a description of the DDP Segment format associated
with RDMA Write Messages.
<span class="h3"><a class="selflink" id="section-4.4" href="#section-4.4">4.4</a>. RDMA Read Request Header</span>
The RDMA Read Request Message carries an RDMA Read Request Header
that describes the Data Sink and Data Source Buffers used by the RDMA
Read operation. The RDMA Read Request Header immediately follows the
DDP header. The RDMAP layer passes to the DDP layer an RDMAP Control
Field. The following figure depicts the RDMA Read Request Header
that MUST be used for all RDMA Read Request Messages:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Data Sink STag (SinkSTag) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
+ Data Sink Tagged Offset (SinkTO) +
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| RDMA Read Message Size (RDMARDSZ) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Data Source STag (SrcSTag) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
+ Data Source Tagged Offset (SrcTO) +
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 6: RDMA Read Request Header Format
Data Sink Steering Tag: 32 bits.
The Data Sink Steering Tag identifies the Data Sink's Tagged
Buffer. This field MUST be copied, without interpretation,
from the RDMA Read Request into the corresponding RDMA Read
Response; this field allows the Data Sink to place the
returning data. The STag is associated with the RDMAP Stream
through a mechanism that is outside the scope of the RDMAP
specification.
<span class="grey">Recio, et al. Standards Track [Page 24]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-25" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Data Sink Tagged Offset: 64 bits.
The Data Sink Tagged Offset specifies the starting offset, in
octets, from the base of the Data Sink's Tagged Buffer, where
the data is to be written by the Data Source. This field is
copied from the RDMA Read Request into the corresponding RDMA
Read Response and allows the Data Sink to place the returning
data. The Data Sink Tagged Offset MAY start at an arbitrary
offset.
The Data Sink STag and Data Sink Tagged Offset fields
describe the buffer to which the RDMA Read data is written.
Note: the DDP layer protects against a wrap of the Data Sink
Tagged Offset.
RDMA Read Message Size: 32 bits.
The RDMA Read Message Size is the amount of data, in octets,
read from the Data Source. A single RDMA Read Request
Message can retrieve from 0 to 2^32-1 data octets from the
Data Source.
Data Source Steering Tag: 32 bits.
The Data Source Steering Tag identifies the Data Source's
Tagged Buffer. The STag is associated with the RDMAP Stream
through a mechanism that is outside the scope of the RDMAP
specification.
Data Source Tagged Offset: 64 bits.
The Tagged Offset specifies the starting offset, in octets,
that is to be read from the Data Source's Tagged Buffer. The
Data Source Tagged Offset MAY start at an arbitrary offset.
The Data Source STag and Data Source Tagged Offset fields
describe the buffer from which the RDMA Read data is read.
See <a href="#section-7.2">Section 7.2</a>, "Errors Detected at the Remote Peer on Incoming RDMA
Messages", for a description of error checking required upon
processing of an RDMA Read Request at the Data Source.
<span class="grey">Recio, et al. Standards Track [Page 25]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-26" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h3"><a class="selflink" id="section-4.5" href="#section-4.5">4.5</a>. RDMA Read Response Header</span>
The RDMA Read Response Message does not include an RDMAP header. The
RDMAP layer passes to the DDP layer an RDMAP Control Field. The RDMA
Read Response Message is fully described by the DDP Headers of the
DDP Segments associated with the Message.
See <a href="#appendix-A">Appendix A</a> for a description of the DDP Segment format associated
with RDMA Read Response Messages.
<span class="h3"><a class="selflink" id="section-4.6" href="#section-4.6">4.6</a>. Send Header and Send with Solicited Event Header</span>
The Send and Send with Solicited Event Messages do not include an
RDMAP header. The RDMAP layer passes to the DDP layer an RDMAP
Control Field. The Send and Send with Solicited Event Messages are
fully described by the DDP Headers of the DDP Segments associated
with the Messages.
See <a href="#appendix-A">Appendix A</a> for a description of the DDP Segment format associated
with Send and Send with Solicited Event Messages.
<span class="h3"><a class="selflink" id="section-4.7" href="#section-4.7">4.7</a>. Send with Invalidate Header and Send with SE and Invalidate Header</span>
The Send with Invalidate and Send with Solicited Event and Invalidate
Messages do not include an RDMAP header. The RDMAP layer passes to
the DDP layer an RDMAP Control Field and the Invalidate STag field
(see <a href="#section-4.1">section 4.1</a> RDMAP Control and Invalidate STag Field). The Send
with Invalidate and Send with Solicited Event and Invalidate Messages
are fully described by the DDP Headers of the DDP Segments associated
with the Messages.
See <a href="#appendix-A">Appendix A</a> for a description of the DDP Segment format associated
with Send and Send with Solicited Event Messages.
<span class="h3"><a class="selflink" id="section-4.8" href="#section-4.8">4.8</a>. Terminate Header</span>
The Terminate Message carries a Terminate Header that contains
additional information associated with the cause of the Terminate.
The Terminate Header immediately follows the DDP header. The RDMAP
layer passes to the DDP layer an RDMAP Control Field. The following
figure depicts a Terminate Header that MUST be used for the Terminate
Message:
<span class="grey">Recio, et al. Standards Track [Page 26]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-27" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Terminate Control | Reserved |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP Segment Length (if any) | |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ +
| |
// //
| Terminated DDP Header (if any) |
+ +
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
// //
| Terminated RDMA Header (if any) |
+ +
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 7: Terminate Header Format
Terminate Control: 19 bits.
The Terminate Control field MUST have the format defined in
Figure 8 below.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Layer | EType | Error Code |HdrCt|
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 8: Terminate Control Field
* Figure 9, "Terminate Control Field Values", defines the valid
values that MUST be used for this field.
* Layer: 4 bits.
Identifies the layer that encountered the error.
* EType (RDMA Error Type): 4 bits.
Identifies the type of error that caused the Terminate. When
the error is detected at the RDMAP layer, the RDMAP layer
inserts the Error Type into this field. When the error is
detected at an LLP layer, an LLP layer creates the Error Type
<span class="grey">Recio, et al. Standards Track [Page 27]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-28" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
and the DDP layer passes it up to the RDMAP layer, and the
RDMAP layer inserts it into this field.
* Error Code: 8 bits.
This field identifies the specific error that caused the
Terminate. When the error is detected at the RDMAP layer, the
RDMAP layer creates the Error Code. When the error is detected
at an LLP layer, the LLP layer creates the Error Code, the DDP
layer passes it up to the RDMAP layer, and the RDMAP layer
inserts it into this field.
* HdrCt: 3 bits.
Header control bits:
* M: bit 16. DDP Segment Length valid. See Figure 10 for
when this bit SHOULD be set.
* D: bit 17. DDP Header Included. See Figure 10 for when
this bit SHOULD be set.
* R: bit 18. RDMAP Header Included. See Figure 10 for when
this bit SHOULD be set.
<span class="grey">Recio, et al. Standards Track [Page 28]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-29" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
-------+-----------+-------+-------------+------+--------------------
Layer | Layer | Error | Error Type | Error| Error Code Name
| Name | Type | Name | Code |
-------+-----------+-------+-------------+------+--------------------
| | 0000b | Local | None | None - This error
| | | Catastrophic| | type does not have
| | | Error | | an error code. Any
| | | | | value in this field
| | | | | is acceptable.
| +-------+-------------+------+--------------------
| | | | 00X | Invalid STag
| | | +------+--------------------
| | | | 01X | Base or bounds
| | | | | violation
| | | Remote +------+--------------------
| | 0001b | Protection | 02X | Access rights
| | | Error | | violation
| | | +------+--------------------
0000b | RDMA | | | 03X | STag not associated
| | | | | with RDMAP Stream
| | | +------+--------------------
| | | | 04X | TO wrap
| | | +------+--------------------
| | | | 09X | STag cannot be
| | | | | Invalidated
| | | +------+--------------------
| | | | FFX | Unspecified Error
| +-------+-------------+------+--------------------
| | | | 05X | Invalid RDMAP
| | | | | version
| | | +------+--------------------
| | | | 06X | Unexpected OpCode
| | | Remote +------+--------------------
| | 0010b | Operation | 07X | Catastrophic error,
| | | Error | | localized to RDMAP
| | | | | Stream
| | | +------+--------------------
| | | | 08X | Catastrophic error,
| | | | | global
| | | +------+--------------------
| | | | 09X | STag cannot be
| | | | | Invalidated
| | | +------+--------------------
| | | | FFX | Unspecified Error
<span class="grey">Recio, et al. Standards Track [Page 29]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-30" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
-------+-----------+-------+-------------+------+--------------------
0001b | DDP | See DDP Specification [<a href="#ref-DDP" title=""Direct Data Placement over Reliable Transports"">DDP</a>] for a description of
| | the values and names.
-------+-----------+-------+-----------------------------------------
0010b | LLP | For MPA, see MPA Specification [<a href="#ref-MPA" title=""Marker PDU Aligned Framing for TCP Specification"">MPA</a>] for a
|(e.g., MPA)| description of the values and names.
-------+-----------+-------+-----------------------------------------
Figure 9: Terminate Control Field Values
Reserved: 13 bits. This field MUST be set to zero on transmit,
ignored on receive.
DDP Segment Length: 16 bits
The length handed up by the DDP layer when the error was
detected. It MUST be valid if the M bit is set. It MUST be
present when the D bit is set.
Terminated DDP Header: 112 bits for Tagged Messages and 144 bits
for Untagged Messages.
The DDP Header of the incoming Message that is associated
with the Terminate. The DDP Header is not present if the
Terminate Error Type is a Local Catastrophic Error. It MUST
be present if the D bit is set.
Terminated RDMA Header: 224 bits.
The Terminated RDMA Header is only sent back if the terminate
is associated with an RDMA Read Request Message. It MUST be
present if the R bit is set.
If the terminate occurs before the first RDMA Read Request
byte is processed, the original RDMA Read Request Header is
sent back.
If the terminate occurs after the first RDMA Read Request
byte is processed, the RDMA Read Request Header is updated to
reflect the current location of the RDMA Read operation that
is in process:
* Data Sink STag = Data Sink STag originally sent in the
RDMA Read Request.
<span class="grey">Recio, et al. Standards Track [Page 30]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-31" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
* Data Sink Tagged Offset = Current offset into the Data
Sink Tagged Buffer. For example, if the RDMA Read
Request was terminated after 2048 octets were sent,
then the Data Sink Tagged Offset = the original Data
Sink Tagged Offset + 2048.
* Data Message size = Number of bytes left to transfer.
* Data Source STag = Data Source STag in the RDMA Read
Request.
* Data Source Tagged Offset = Current offset into the
Data Source Tagged Buffer. For example, if the RDMA
Read Request was terminated after 2048 octets were
sent, then the Data Source Tagged Offset = the
original Data Source Tagged Offset + 2048.
Note: if a given LLP does not define any termination codes for the
RDMAP Termination message to use, then none would be used for that
LLP.
Figure 10, "Error Type to RDMA Message Mapping", maps layer name and
error types to each RDMA Message type:
<span class="grey">Recio, et al. Standards Track [Page 31]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-32" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
---------+-------------+------------+------------+-----------------
Layer | Error Type | Terminate | Terminate | What type of
Name | Name | Includes | Includes | RDMA Message can
| | DDP Header | RDMA Header| cause the error
| | and DDP | |
| | Segment | |
| | Length | |
---------+-------------+------------+------------+-----------------
| Local | No | No | Any
| Catastrophic| | |
| Error | | |
+-------------+------------+------------+-----------------
| Remote | Yes, if | Yes | Only RDMA Read
RDMA | Protection | possible | | Request, Send
| Error | | | with Invalidate,
| | | | and Send with SE
| | | | and Invalidate
+-------------+------------+------------+-----------------
| Remote | Yes, if | No | Any
| Operation | possible | |
| Error | | |
---------+-------------+------------+------------+-----------------
DDP | See DDP Spec| Yes | No | Any
| [<a href="#ref-DDP" title=""Direct Data Placement over Reliable Transports"">DDP</a>] | | |
---------+-------------+------------+------------+-----------------
LLP | See LLP Spec| No | No | Any
| (e.g., MPA) | | |
Figure 10: Error Type to RDMA Message Mapping
<span class="h2"><a class="selflink" id="section-5" href="#section-5">5</a>. Data Transfer</span>
<span class="h3"><a class="selflink" id="section-5.1" href="#section-5.1">5.1</a>. RDMA Write Message</span>
An RDMA Write is used by the Data Source to transfer data to a
previously Advertised Tagged Buffer at the Data Sink. The RDMA Write
Message has the following semantics:
* An RDMA Write Message MUST reference a Tagged Buffer. That is,
the Data Source RDMAP layer MUST request that the DDP layer mark
the Message as Tagged.
* A valid RDMA Write Message MUST NOT be delivered to the Data
Sink's ULP (i.e., it is placed by the DDP layer).
* At the Remote Peer, when an invalid RDMA Write Message is
delivered to the Remote Peer's RDMAP layer, an error is surfaced
(see <a href="#section-7.1">Section 7.1</a>, "RDMAP Error Surfacing").
<span class="grey">Recio, et al. Standards Track [Page 32]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-33" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
* The Tagged Offset of a Tagged Buffer MAY start at a non-zero
value.
* An RDMA Write Message MAY target all or part of a previously
Advertised Buffer.
* The RDMAP does not define how the buffer(s) are used by an
outbound RDMA Write or how they are addressed. For example, an
implementation of RDMA may choose to allow a gather-list of non-
contiguous data blocks to be the source of an RDMA Write. In this
case, the data blocks would be combined by the Data Source and
sent as a single RDMA Write Message to the Data Sink.
* The Data Source RDMAP layer MUST issue RDMA Write Messages to the
DDP layer in the order they were submitted by the ULP.
* At the Data Source, a subsequent Send (Send with Invalidate, Send
with Solicited Event, or Send with Solicited Event and Invalidate)
Message MAY be used to signal Delivery of previous RDMA Write
Messages to the Data Sink, if the ULP chooses to signal Delivery
in this fashion.
* If the Local Peer wishes to write to multiple Tagged Buffers on
the Remote Peer, the Local Peer MUST use multiple RDMA Write
Messages. That is, a single RDMA Write Message can only write to
one remote Tagged Buffer.
* The Data Source MAY issue a zero-length RDMA Write Message.
<span class="h3"><a class="selflink" id="section-5.2" href="#section-5.2">5.2</a>. RDMA Read Operation</span>
The RDMA Read operation MUST consist of a single RDMA Read Request
Message and a single RDMA Read Response Message.
<span class="h4"><a class="selflink" id="section-5.2.1" href="#section-5.2.1">5.2.1</a>. RDMA Read Request Message</span>
An RDMA Read Request is used by the Data Sink to transfer data from a
previously Advertised Tagged Buffer at the Data Source to a Tagged
Buffer at the Data Sink. The RDMA Read Request Message has the
following semantics:
* An RDMA Read Request Message MUST reference an Untagged Buffer.
That is, the Local Peer's RDMAP layer MUST request that the DDP
mark the Message as Untagged.
* One RDMA Read Request Message MUST consume one Untagged Buffer.
<span class="grey">Recio, et al. Standards Track [Page 33]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-34" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
* The Remote Peer's RDMAP layer MUST process an RDMA Read Request
Message. A valid RDMA Read Request Message MUST NOT be delivered
to the Data Sink's ULP (i.e., it is processed by the RDMAP layer).
* At the Remote Peer, when an invalid RDMA Read Request Message is
delivered to the Remote Peer's RDMAP layer, an error is surfaced
(see <a href="#section-7.1">Section 7.1</a>, "RDMAP Error Surfacing").
* An RDMA Read Request Message MUST reference the RDMA Read Request
Queue. That is, the Local Peer's RDMAP layer MUST request that
the DDP layer set the Queue Number field to one.
* The Local Peer MUST pass to the DDP layer RDMA Read Request
Messages in the order they were submitted by the ULP.
* The Remote Peer MUST process the RDMA Read Request Messages in the
order they were sent.
* If the Local Peer wishes to read from multiple Tagged Buffers on
the Remote Peer, the Local Peer MUST use multiple RDMA Read
Request Messages. That is, a single RDMA Read Request Message
MUST only read from one remote Tagged Buffer.
* AN RDMA Read Request Message MAY target all or part of a
previously Advertised Buffer.
* If the Data Source receives a valid RDMA Read Request Message, it
MUST respond with a valid RDMA Read Response Message.
* The Data Sink MAY issue a zero-length RDMA Read Request Message by
setting the RDMA Read Message Size field to zero in the RDMA Read
Request Header.
* If the Data Source receives a non-zero-length RDMA Read Message
Size, the Data Source RDMAP MUST validate the Data Source STag and
Data Source Tagged Offset contained in the RDMA Read Request
Header.
* If the Data Source receives an RDMA Read Request Header with the
RDMA Read Message Size set to zero, the Data Source RDMAP:
* MUST NOT validate the Data Source STag and Data Source Tagged
Offset contained in the RDMA Read Request Header, and
* MUST respond with a zero-length RDMA Read Response Message.
<span class="grey">Recio, et al. Standards Track [Page 34]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-35" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h4"><a class="selflink" id="section-5.2.2" href="#section-5.2.2">5.2.2</a>. RDMA Read Response Message</span>
The RDMA Read Response Message uses the DDP Tagged Buffer Model to
Deliver the contents of a previously requested Data Source Tagged
Buffer to the Data Sink, without any involvement from the ULP at the
Remote Peer. The RDMA Read Response Message has the following
semantics:
* The RDMA Read Response Message for the associated RDMA Read
Request Message travels in the opposite direction.
* An RDMA Read Response Message MUST reference a Tagged Buffer.
That is, the Data Source RDMAP layer MUST request that the DDP
mark the Message as Tagged.
* The Data Source MUST ensure that a sufficient number of Untagged
Buffers are available on the RDMA Read Request Queue (Queue with
DDP Queue Number 1) to support the maximum number of RDMA Read
Requests negotiated by the ULP.
* The RDMAP layer MUST Deliver the RDMA Read Response Message to the
ULP.
* At the Remote Peer, when an invalid RDMA Read Response Message is
delivered to the Remote Peer's RDMAP layer, an error is surfaced
(see <a href="#section-7.1">Section 7.1</a>, "RDMAP Error Surfacing").
* The Tagged Offset of a Tagged Buffer MAY start at a non-zero
value.
* The Data Source RDMAP layer MUST pass RDMA Read Response Messages
to the DDP layer, in the order that the RDMA Read Request Messages
were received by the RDMAP layer, at the Data Source.
* The Data Sink MAY validate that the STag, Tagged Offset, and
length of the RDMA Read Response Message are the same as the STag,
Tagged Offset, and length included in the corresponding RDMA Read
Request Message.
* A single RDMA Read Response Message MUST write to one remote
Tagged Buffer. If the Data Sink wishes to read multiple Tagged
Buffers, the Data Sink can use multiple RDMA Read Request
Messages.
<span class="grey">Recio, et al. Standards Track [Page 35]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-36" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h3"><a class="selflink" id="section-5.3" href="#section-5.3">5.3</a>. Send Message Type</span>
The Send Message Type uses the DDP Untagged Buffer Model to transfer
data from the Data Source into an Untagged Buffer at the Data Sink.
* A Send Message Type MUST reference an Untagged Buffer. That is,
the Local Peer's RDMAP layer MUST request that the DDP layer mark
the Message as Untagged.
* One Send Message Type MUST consume one Untagged Buffer.
* The ULP Message sent using a Send Message Type MAY be less than
or equal to the size of the consumed Untagged Buffer. The
RDMAP layer communicates to the ULP the size of the data
written into the Untagged Buffer.
* If the ULP Message sent via Send Message Type is larger than
the Data Sink's Untagged Buffer, it is an error (see <a href="#section-9.1">Section</a>
<a href="#section-9.1">9.1</a>, "RDMAP Error Surfacing").
* At the Remote Peer, the Send Message Type MUST be Delivered to the
Remote Peer's ULP in the order they were sent.
* After the Send with Solicited Event or Send with Solicited Event
and Invalidate Message is Delivered to the ULP, the RDMAP MAY
generate an Event, if the Data Sink is configured to generate such
an Event.
* At the Remote Peer, when an invalid Send Message Type is Delivered
to the Remote Peer's RDMAP layer, an error is surfaced (see
<a href="#section-7.1">Section 7.1</a>, "RDMAP Error Surfacing").
* The RDMAP does not specify the structure of the buffer(s) used by
an outbound RDMA Write nor does it specify how the buffer(s) are
addressed. For example, an implementation of RDMA may choose to
allow a gather-list of non-contiguous data blocks to be the source
of a Send Message Type. In this case, the data blocks would be
combined by the Data Source and sent as a single Send Message Type
to the Data Sink.
* For a Send Message Type, the Local Peer's RDMAP layer MUST request
that the DDP layer set the Queue Number field to zero.
* The Local Peer MUST issue Send Message Type Messages in the order
they were submitted by the ULP.
<span class="grey">Recio, et al. Standards Track [Page 36]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-37" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
* The Data Source MAY pass a zero-length Send Message Type. A
zero-length Send Message Type MUST consume an Untagged Buffer at
the Data Sink. A Send with Invalidate or Send with Solicited
Event and Invalidate Message MUST reference an STag. That is, the
Local Peer's RDMAP layer MUST pass the RDMA control field and the
STag that will be Invalidated to the DDP layer.
* When the Send with Invalidate and Send with Solicited Event and
Invalidate Message are Delivered to the Remote Peer's RDMAP layer,
the RDMAP layer MUST:
* Verify the STag that is associated with the RDMAP Stream; and
* Invalidate the STag if it is associated with the RDMAP Stream;
or issue a Terminate Message with the STag Cannot be
Invalidated Terminate Error Code, if the STag is not associated
with the RDMAP Stream.
<span class="h3"><a class="selflink" id="section-5.4" href="#section-5.4">5.4</a>. Terminate Message</span>
The Terminate Message uses the DDP Untagged Buffer Model to
transfer-error-related information from the Data Source into an
Untagged Buffer at the Data Sink and then ceases all further
communications on the underlying DDP Stream. The Terminate Message
has the following semantics:
* A Terminate Message MUST reference an Untagged Buffer. That is,
the Local Peer's RDMAP layer MUST request that the DDP layer mark
the Message as Untagged.
* A Terminate Message references the Terminate Queue. That is, the
Local Peer's RDMAP layer MUST request that the DDP layer set the
Queue Number field to two.
* One Terminate Message MUST consume one Untagged Buffer.
* On a single RDMAP Stream, the RDMAP layer MUST guarantee placement
of a single Terminate Message.
* A Terminate Message MUST be Delivered to the Remote Peer's RDMAP
layer. The RDMAP layer MUST Deliver the Terminate Message to the
ULP.
* At the Remote Peer, when an invalid Terminate Message is delivered
to the Remote Peer's RDMAP layer, an error is surfaced (see
<a href="#section-7.1">Section 7.1</a> "RDMAP Error Surfacing").
<span class="grey">Recio, et al. Standards Track [Page 37]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-38" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
* The RDMAP layer Completes in error all ULP operations that have
not been provided to the DDP layer.
* After sending a Terminate Message on an RDMAP Stream, the Local
Peer MUST NOT send any more Messages on that specific RDMAP
Stream.
* After receiving a Terminate Message on an RDMAP Stream, the Remote
Peer MAY stop sending Messages on that specific RDMAP Stream.
<span class="h3"><a class="selflink" id="section-5.5" href="#section-5.5">5.5</a>. Ordering and Completions</span>
It is important to understand the difference between Placement and
Delivery ordering since RDMAP provides quite different semantics for
the two.
Note that many current protocols, both as used in the Internet and
elsewhere, assume that data is both Placed and Delivered in order.
Taking advantage of this fact allowed applications to take a variety
of shortcuts. For RDMAP, many of these shortcuts are no longer safe
to use, and could cause application failure.
The following rules apply to implementations of the RDMAP protocol.
Note that in these rules, Send includes Send, Send with Invalidate,
Send with Solicited Event, and Send with Solicited Event and
Invalidate:
1. RDMAP does not provide ordering among Messages on different RDMAP
Streams.
2. RDMAP does not provide ordering between operations that are
generated from the two ends of an RDMAP Stream.
3. RDMA Messages that use Tagged and Untagged Buffers MAY be Placed
in any order. If an application uses overlapping buffers (points
different Messages or portions of a single Message at the same
buffer), then it is possible that the last incoming write to the
Data Sink buffer will not be the last outgoing data sent from the
Data Source.
4. For a Send operation, the contents of an Untagged Buffer at the
Data Sink MAY be indeterminate until the Send is Delivered to the
ULP at the Data Sink.
5. For an RDMA Write operation, the contents of the Tagged Buffer at
the Data Sink MAY be indeterminate until a subsequent Send is
Delivered to the ULP at the Data Sink.
<span class="grey">Recio, et al. Standards Track [Page 38]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-39" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
6. For an RDMA Read operation, the contents of the Tagged Buffer at
the Data Sink MAY be indeterminate until the RDMA Read Response
Message has been Delivered at the Local Peer.
Statements 4, 5, and 6 imply "no peeking" at the data to see if it is
done. It is possible for some data to arrive before logically
earlier data does, and peeking may cause unpredictable application
failure.
7. If the ULP or Application modifies the contents of Tagged or
Untagged Buffers, which are being modified by an RDMA Operation
while the RDMAP is processing the RDMA Operation, the state of
the Buffers is indeterminate.
8. If the ULP or Application modifies the contents of Tagged or
Untagged Buffers, which are read by an RDMA Operation while the
RDMAP is processing the RDMA Operation, the results of the read
are indeterminate.
9. The Completion of an RDMA Write or Send Operation at the Local
Peer does not guarantee that the ULP Message has yet reached the
Remote Peer ULP Buffer or been examined by the Remote ULP.
10. Send Messages MUST be Delivered to the ULP at the Remote Peer
after they are Delivered to RDMAP by DDP and in the order that
they were Delivered to RDMAP.
Note that DDP ordering rules ensure that this will be the same
order that they were submitted at the Local Peer and that any
prior RDMA Writes have been submitted for ordered Placement at
the Remote Peer. This means that when the ULP sees the Delivery
of the Send, the memory buffers targeted by any preceding RDMA
Writes and Sends are available to be accessed locally or remotely
as authorized. If the ULP overlaps its buffers for different
operations, the data from the RDMA Write or Send may be
overwritten by subsequent RDMA Operations before the ULP receives
and processes the Delivery.
11. RDMA Read Response Messages MUST be Delivered to the ULP at the
Remote Peer after they are Delivered to RDMAP by DDP and in the
order that the they were Delivered to RDMAP.
DDP ordering rules ensure that this will be the same order that
they were submitted at the Local Peer. This means that when the
ULP sees the Delivery of the RDMA Read Response, the memory
buffers targeted by the RDMA Read Response are available to be
accessed locally or remotely as authorized. If the ULP overlaps
<span class="grey">Recio, et al. Standards Track [Page 39]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-40" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
its buffers for different operations, the data from the RDMA Read
Response may be overwritten by subsequent RDMA Operations before
the ULP receives and processes the Delivery.
12. RDMA Read Request Messages, including zero-length RDMA Read
Requests, MUST NOT start processing at the Remote Peer until they
have been Delivered to RDMAP by DDP.
Note: the ULP is assured that data written can be read back. For
example, if
a) an RDMA Read Request is issued by the local peer,
b) the Request targets the same ULP Buffer as a preceding Send
or RDMA Write (in the same direction as the RDMA Read
Request), and
c) there are no other sources of update for the ULP Buffer,
then the Remote Peer will send back the data written by the Send
or RDMA Write. That is, for this example, the ULP Buffer is
Advertised for use on a series of RDMA Messages, is only valid on
the RDMAP Stream for which it is Advertised, and is not locally
updated while the series of RDMAP Messages are performed. For
this example, order rule (12) assures that subsequent local or
remote accesses to the ULP Buffer contain the data written by the
Send or RDMA Write.
RDMA Read Response Messages MAY be generated at the Remote Peer
after subsequent RDMA Write Messages or Send Messages have been
Placed or Delivered. Therefore, when an application does an RDMA
Read Request followed by an RDMA Write (or Send) to the same
buffer, it may get the data from the later RDMA Write (or Send)
in the RDMA Read Response Message, even though the operations
completed in order at the Local Peer. If this behavior is not
desired, the Local Peer ULP must Fence the later RDMA write (or
Send) by withholding the RDMA Write Message until all outstanding
RDMA Read Responses have been Delivered.
13. The RDMAP layer MUST submit RDMA Messages to the DDP layer in the
order the RDMA Operations are submitted to the RDMAP layer by the
ULP.
14. A Send or RDMA Write Message MUST NOT be considered Complete at
the Local Peer (Data Source) until it has been successfully
completed at the DDP layer.
15. RDMA Operations MUST be Completed at the Local Peer in the order
that they were submitted by the ULP.
<span class="grey">Recio, et al. Standards Track [Page 40]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-41" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
16. At the Data Sink, an incoming Send Message MUST be Delivered to
the ULP only after the DDP Message has been Delivered to the
RDMAP layer by the DDP layer.
17. RDMA Read Response Message processing at the Remote Peer (reading
the specified Tagged Buffer) MUST be started only after the RDMA
Read Request Message has been Delivered by the DDP layer (thus,
all previous RDMA Messages have been properly submitted for
ordered Placement).
18. Send Messages MAY be Completed at the Remote Peer (Data Sink)
before prior incoming RDMA Read Request Messages have completed
their response processing.
19. An RDMA Read operation MUST NOT be Completed at the Local Peer
until the DDP layer Delivers the associated incoming RDMA Read
Response Message.
20. If more than one outstanding RDMA Read Request Messages are
supported by both peers, the RDMA Read Response Messages MUST be
submitted to the DDP layer on the Remote Peer in the order the
RDMA Read Request Messages were Delivered by DDP, but the actual
read of the buffer contents MAY take place in any order at the
Remote Peer.
This simplifies Local Peer Completion processing for RDMA Reads
in that a Delivered RDMA Read Response MUST be sufficient to
Complete the RDMA Read operation.
<span class="h2"><a class="selflink" id="section-6" href="#section-6">6</a>. RDMAP Stream Management</span>
RDMAP Stream management consists of RDMAP Stream Initialization and
RDMAP Stream Termination.
<span class="h3"><a class="selflink" id="section-6.1" href="#section-6.1">6.1</a>. Stream Initialization</span>
RDMAP Stream initialization occurs after the LLP Stream has been
created (e.g., for DDP/MPA over TCP, the first TCP Segment after the
SYN, SYN/ACK exchange). The ULP is responsible for transitioning the
LLP Stream into RDMA-enabled mode. The switch to RDMA mode typically
occurs sometime after LLP Stream setup. Once in RDMA enabled mode,
an implementation MUST send only RDMA Messages across the transport
Stream until the RDMAP Stream is torn down.
For each direction of an RDMAP Stream:
* For a given RDMAP Stream, the number of outstanding RDMA Read
Requests is limited per RDMAP Stream direction.
<span class="grey">Recio, et al. Standards Track [Page 41]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-42" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
* It is the ULP's responsibility to set the maximum number of
outstanding, inbound RDMA Read Requests per RDMAP Stream
direction.
* The RDMAP layer MUST provide the maximum number of outstanding,
inbound RDMA Read Requests per RDMAP Stream direction that were
negotiated between the ULP and the Local Peer's RDMAP layer. The
negotiation mechanism is outside the scope of this specification.
* It is the ULP's responsibility to set the maximum number of
outstanding, outbound RDMA Read Requests per RDMAP Stream
direction.
* The RDMAP layer MUST provide the maximum number of outstanding,
outbound RDMA Read Requests for the RDMAP Stream direction that
were negotiated between the ULP and the Local Peer's RDMAP layer.
The negotiation mechanism is outside the scope of this
specification.
* The Local Peer's ULP is responsible for negotiating with the
Remote Peer's ULP the maximum number of outstanding RDMA Read
Requests for the RDMAP Stream direction. It is recommended that
the ULP set the maximum number of outstanding, inbound RDMA Read
Requests equal to the maximum number of outstanding, outbound RDMA
Read Requests for a given RDMAP Stream direction.
* For outbound RDMA Read Requests, the RDMAP layer MUST NOT exceed
the maximum number of outstanding, outbound RDMA Read Requests
that were negotiated between the ULP and the Local Peer's RDMAP
layer.
* For inbound RDMA Read Requests, the RDMAP layer MUST NOT exceed
the maximum number of outstanding, inbound RDMA Read Requests that
were negotiated between the ULP and the Local Peer's RDMAP layer.
<span class="h3"><a class="selflink" id="section-6.2" href="#section-6.2">6.2</a>. Stream Teardown</span>
There are three methods for terminating an RDMAP Stream: ULP Graceful
Termination, RDMAP Abortive Termination, and LLP Abortive
Termination.
The ULP is responsible for performing ULP Graceful Termination.
After a ULP Graceful Termination, either side of the Stream can
initiate LLP Graceful Termination, using the graceful termination
mechanism provided by the LLP.
<span class="grey">Recio, et al. Standards Track [Page 42]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-43" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
RDMAP Abortive Termination allows the RDMAP to issue a Terminate
Message describing the reason the RDMAP Stream was terminated. The
next section (6.2.1, "RDMAP Abortive Termination") describes the
RDMAP Abortive Termination in detail.
LLP Abortive Termination results due to an LLP error and causes the
RDMAP Stream to be torn down midstream, without an RDMAP Terminate
Message. While this last method is highly undesirable, it is
possible, and the ULP should take this into consideration.
<span class="h4"><a class="selflink" id="section-6.2.1" href="#section-6.2.1">6.2.1</a>. RDMAP Abortive Termination</span>
RDMAP defines a Terminate operation that SHOULD be invoked when
either an RDMAP error is encountered or an LLP error is surfaced to
the RDMAP layer by the LLP.
It is not always possible to send the Terminate Message. For
example, certain LLP errors may occur that cause the LLP Stream to be
torn down a) before RDMAP is aware of the error, b) before RDMAP is
able to send the Terminate Message, or c) after RDMAP has posted the
Terminate Message to the LLP, but it has not yet been transmitted by
the LLP.
Note that an RDMAP Abortive Termination may entail loss of data. In
general, when a Terminate Message is received, it is impossible to
tell for sure what unacknowledged RDMA Messages were Completed
successfully at the Remote Peer. Thus, the state of all outstanding
RDMA Messages is indeterminate, and the Messages SHOULD be considered
Completed in error.
When a peer sends or receives a Terminate Message, it MAY immediately
tear down the LLP Stream. The peer SHOULD perform a graceful LLP
teardown to ensure the Terminate Message is successfully Delivered.
See <a href="#section-4.8">Section 4.8</a>, "Terminate Header", for a description of the
Terminate Message and its contents. See <a href="#section-5.4">Section 5.4</a>, "Terminate
Message", for a description of the Terminate Message semantics.
<span class="h2"><a class="selflink" id="section-7" href="#section-7">7</a>. RDMAP Error Management</span>
The RDMAP protocol does not have RDMAP- or DDP-layer error recovery
operations built in. If everything is working, the LLP guarantees
will ensure that the Messages are arriving at the destination.
If errors are detected at the RDMAP or DDP layer, then the RDMAP,
DDP, and LLP Streams are Abortively Terminated (see <a href="#section-4.8">Section 4.8</a>,
"Terminate Header").
<span class="grey">Recio, et al. Standards Track [Page 43]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-44" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
In general, poor implementations or improper ULP programming cause
the errors detected at the RDMAP and DDP layers. In these cases,
returning a diagnostic termination error Message and closing the
RDMAP Stream is far simpler than attempting to maintain the RDMAP
Stream, particularly when the cause of the error is not known.
If an LLP does not support teardown of a Stream independent of other
Streams, and an RDMAP error results in the Termination of a specific
Stream, then the LLP MUST label the Stream as an erroneous Stream and
MUST NOT allow any further data transfer on that Stream after RDMAP
requests the Stream to be torn down.
For a specific LLP connection, when all Streams are either gracefully
torn down or are labeled as erroneous Streams, the LLP connection
MUST be torn down.
Since errors are detected at the Remote Peer (possibly long) after
RDMA Messages are passed to the DDP and the LLP at the Local Peer and
after the RDMA Operations conveyed by the Messages are Completed, the
sender cannot easily determine which of its Messages have been
received. (RDMA Reads are an exception to this rule.)
For a list of errors returned to the Remote Peer as a result of an
Abortive Termination, see <a href="#section-4.8">Section 4.8</a>, "Terminate Header".
<span class="h3"><a class="selflink" id="section-7.1" href="#section-7.1">7.1</a>. RDMAP Error Surfacing</span>
If an error occurs at the Local Peer, the RDMAP layer MUST attempt to
inform the local ULP that the error has occurred.
The Local Peer MUST send a Terminate Message for each of the
following cases:
1. For errors detected while creating RDMA Write, Send, Send with
Invalidate, Send with Solicited Event, Send with Solicited Event
and Invalidate, or RDMA Read Requests, or other reasons not
directly associated with an incoming Message, the Terminate
Message and Error code are sent instead of the request. In this
case, the Error Type and Error Code fields are included in the
Terminate Message, but the Terminated DDP Header and Terminated
RDMA Header fields are set to zero.
2. For errors detected on an incoming RDMA Write, Send, Send with
Invalidate, Send with Solicited Event, Send with Solicited Event
and Invalidate, or Read Response Message (after the Message has
been Delivered by DDP), the Terminate Message is sent at the
earliest possible opportunity, preferably in the next outgoing
RDMA Message. In this case, the Error Type, Error Code, ULP PDU
<span class="grey">Recio, et al. Standards Track [Page 44]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-45" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Length, and Terminated DDP Header fields are included in the
Terminate Message, but the Terminated RDMA Header field is set to
zero.
3. For errors detected on an incoming RDMA Read Request Message
(after the Message has been Delivered by DDP), the Terminate
Message is sent at the earliest possible opportunity, preferably
in the next outgoing RDMA Message. In this case, the Error Type,
Error Code, ULP PDU Length, Terminated DDP Header, and Terminated
RDMA Header fields are included in the Terminate Message.
4. If more than one error is detected on incoming RDMA Messages,
before the Terminate Message can be sent, then the first RDMA
Message (and its associated DDP Segment) that experienced an
error MUST be captured by the Terminate Message, in accordance
with rules 2 and 3 above.
<span class="h3"><a class="selflink" id="section-7.2" href="#section-7.2">7.2</a>. Errors Detected at the Remote Peer on Incoming RDMA Messages</span>
On incoming RDMA Writes, RDMA Read Response, Sends, Send with
Invalidate, Send with Solicited Event, Send with Solicited Event and
Invalidate, and Terminate Messages, the following must be validated:
1. The DDP layer MUST validate all DDP Segment fields.
2. The RDMA OpCode MUST be valid.
3. The RDMA Version MUST be valid.
Additionally, on incoming Send with Invalidate and Send with
Solicited Event and Invalidate Messages, the following must also
be validated:
4. The Invalidate STag MUST be valid.
5. The STag MUST be associated to this RDMAP Stream.
On incoming RDMA Request Messages, the following must be validated:
1. The DDP layer MUST validate all Untagged DDP Segment fields.
2. The RDMA OpCode MUST be valid.
3. The RDMA Version MUST be valid.
4. For non-zero length RDMA Read Request Messages:
a. The Data Source STag MUST be valid.
<span class="grey">Recio, et al. Standards Track [Page 45]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-46" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
b. The Data Source STag MUST be associated to this RDMAP Stream.
c. The Data Source Tagged Offset MUST fall in the range of legal
offsets associated with the Data Source STag.
d. The sum of the Data Source Tagged Offset and the RDMA Read
Message Size MUST fall in the range of legal offsets
associated with the Data Source STag.
e. The sum of the Data Source Tagged Offset and the RDMA Read
Message Size MUST NOT cause the Data Source Tagged Offset to
wrap.
<span class="h2"><a class="selflink" id="section-8" href="#section-8">8</a>. Security Considerations</span>
This section references the resources that discuss protocol- specific
security considerations and implications of using RDMAP with existing
security services. A detailed analysis of the security issues around
implementation and use of the RDMAP can be found in [<a href="#ref-RDMASEC" title=""Direct Data Placement Protocol (DDP) / Remote Direct Memory Access Protocol (RDMAP) Security"">RDMASEC</a>].
[<a id="ref-RDMASEC">RDMASEC</a>] introduces the RDMA reference model and discusses how the
resources of this model are vulnerable to attacks and the types of
attack these vulnerabilities are subject to. It also details the
levels of Trust available in this peer-to-peer model and how this
defines the nature of resource sharing.
The IPsec requirements for RDDP are based on the version of IPsec
specified in <a href="./rfc2401">RFC 2401</a> [<a href="./rfc2401" title=""Security Architecture for the Internet Protocol"">RFC2401</a>] and related RFCs, as profiled by <a href="./rfc3723">RFC</a>
<a href="./rfc3723">3723</a> [<a href="./rfc3723" title=""Securing Block Storage Protocols over IP"">RFC3723</a>], despite the existence of a newer version of IPsec
specified in <a href="./rfc4301">RFC 4301</a> [<a href="./rfc4301" title=""Security Architecture for the Internet Protocol"">RFC4301</a>] and related RFCs [<a href="./rfc4303" title=""IP Encapsulating Security Payload (ESP)"">RFC4303</a>],
[<a href="./rfc4306" title=""Internet Key Exchange (IKEv2) Protocol"">RFC4306</a>], [<a href="./rfc4835" title=""Cryptographic Algorithm Implementation Requirements for Encapsulating Security Payload (ESP) and Authentication Header (AH)"">RFC4835</a>]. One of the important early applications of the
RDDP protocols is their use with iSCSI [<a href="#ref-iSER" title=""Internet Small Computer System Interface (iSCSI) Extensions for Remote Direct Memory Access (RDMA)"">iSER</a>]; RDDP's IPsec
requirements follow those of IPsec in order to facilitate that usage
by allowing a common profile of IPsec to be used with iSCSI and the
RDDP protocols. In the future, <a href="./rfc3723">RFC 3723</a> may be updated to the newer
version of IPsec, and the IPsec security requirements of any such
update should apply uniformly to iSCSI and the RDDP protocols.
<span class="h3"><a class="selflink" id="section-8.1" href="#section-8.1">8.1</a>. Summary of RDMAP-Specific Security Requirements</span>
[<a id="ref-RDMASEC">RDMASEC</a>] defines the security requirements for the implementation of
the components of the RDMA reference model, namely the RDMA enabled
NIC (RNIC) and the Privileged Resource Manager. An RDMAP
implementation conforming to this specification MUST conform to these
requirements.
<span class="grey">Recio, et al. Standards Track [Page 46]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-47" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h4"><a class="selflink" id="section-8.1.1" href="#section-8.1.1">8.1.1</a>. RDMAP (RNIC) Requirements</span>
RDMAP provides several countermeasures for all types of attacks as
introduced in [<a href="#ref-RDMASEC" title=""Direct Data Placement Protocol (DDP) / Remote Direct Memory Access Protocol (RDMAP) Security"">RDMASEC</a>]. In the following, this specification lists
all security requirements that MUST be implemented by the RNIC. A
more detailed discussion of RNIC security requirements can be found
in Section 5 of [<a href="#ref-RDMASEC" title=""Direct Data Placement Protocol (DDP) / Remote Direct Memory Access Protocol (RDMAP) Security"">RDMASEC</a>].
1. An RNIC MUST ensure that a specific Stream in a specific
Protection Domain cannot access an STag in a different Protection
Domain.
2. An RNIC MUST ensure that if an STag is limited in scope to a
single Stream, no other Stream can use the STag.
3. An RNIC MUST ensure that a Remote Peer is not able to access
memory outside of the buffer specified when the STag was enabled
for remote access.
4. An RNIC MUST provide a mechanism for the ULP to establish and
revoke the association of a ULP Buffer to an STag and TO range.
5. An RNIC MUST provide a mechanism for the ULP to establish and
revoke read, write, or read and write access to the ULP Buffer
referenced by an STag.
6. An RNIC MUST ensure that the network interface can no longer
modify an Advertised Buffer after the ULP revokes remote access
rights for an STag.
7. An RNIC MUST ensure that a Remote Peer is not able to invalidate
an STag enabled for remote access, if the STag is shared on
multiple streams.
8. An RNIC MUST choose the value of STags in a way difficult to
predict. It is RECOMMENDED to sparsely populate them over the
full available range.
9. An RNIC MUST NOT enable sharing a Completion Queue (CQ) across
ULPs that do not share partial mutual trust.
10. An RNIC MUST ensure that if a CQ overflows, any Streams that do
not use the CQ MUST remain unaffected.
11. An RNIC implementation SHOULD provide a mechanism to cap the
number of outstanding RDMA Read Requests.
<span class="grey">Recio, et al. Standards Track [Page 47]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-48" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
12. An RNIC MUST NOT enable firmware to be loaded on the RNIC
directly from an untrusted Local Peer or Remote Peer, unless the
Peer is properly authenticated*, and the update is done via a
secure protocol, such as IPsec.
* by a mechanism outside the scope of this specification. The
mechanism presumably entails authenticating that the remote ULP
has the right to perform the update.
<span class="h4"><a class="selflink" id="section-8.1.2" href="#section-8.1.2">8.1.2</a>. Privileged Resource Manager Requirements</span>
With RDMAP, all reservations of local resources are initiated from
local ULPs. To protect from local attacks including unfair resource
distribution and gaining unauthorized access to RNIC resources, a
Privileged Resource Manager (PRM) must be implemented, which manages
all local resource allocation. Note that the PRM must not be
provided as an independent component, and its functionality can also
be implemented as part of the privileged ULP or as part of the RNIC
itself.
A PRM implementation must meet the following security requirements (a
more detailed discussion of PRM security requirements can be found in
Section 5 of [<a href="#ref-RDMASEC" title=""Direct Data Placement Protocol (DDP) / Remote Direct Memory Access Protocol (RDMAP) Security"">RDMASEC</a>]):
1. All Non-Privileged ULP interactions with the RNIC Engine that
could affect other ULPs MUST be done using the Resource Manager
as a proxy.
2. All ULP resource allocation requests for scarce resources MUST
also be done using a Privileged Resource Manager.
3. The Privileged Resource Manager MUST NOT assume that different
ULPs share Partial Mutual Trust unless there is a mechanism to
ensure that the ULPs do indeed share partial mutual trust.
4. If Non-Privileged ULPs are supported, the Privileged Resource
Manager MUST verify that the Non-Privileged ULP has the right to
access a specific Data Buffer before allowing an STag for which
the ULP has access rights to be associated with a specific Data
Buffer.
5. The Privileged Resource Manager MUST control the allocation of CQ
entries.
6. The Privileged Resource Manager SHOULD prevent a Local Peer from
allocating more than its fair share of resources.
<span class="grey">Recio, et al. Standards Track [Page 48]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-49" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
7. RDMA Read Request Queue resource consumption MUST be controlled
by the Privileged Resource Manager such that RDMAP/DDP Streams
that do not share Partial Mutual Trust do not share RDMA Read
Request Queue resources.
8. If an RNIC provides the ability to share receive buffers across
multiple Streams, the combination of the RNIC and the Privileged
Resource Manager MUST be able to detect if the Remote Peer is
attempting to consume more than its fair share of resources so
that the Local Peer can apply countermeasures to detect and
prevent the attack.
<span class="h3"><a class="selflink" id="section-8.2" href="#section-8.2">8.2</a>. Security Services for RDMAP</span>
RDMAP is using IP-based network services to control, read, and write
data buffers over the network. Therefore, all exchanged control and
data packets are vulnerable to spoofing, tampering, and information
disclosure attacks.
RDMAP Streams that are subject to impersonation attacks or Stream
hijacking attacks can be authenticated, have their integrity
protected, and be protected from replay attacks. Furthermore,
confidentiality protection can be used to protect from eavesdropping.
<span class="h4"><a class="selflink" id="section-8.2.1" href="#section-8.2.1">8.2.1</a>. Available Security Services</span>
The IPsec protocol suite [<a href="./rfc2401" title=""Security Architecture for the Internet Protocol"">RFC2401</a>] defines strong countermeasures to
protect an IP stream from those attacks. Several levels of
protection can guarantee session confidentiality, per-packet source
authentication, per-packet integrity, and correct packet sequencing.
RDMAP security may also profit from SSL or TLS security services
provided for TCP-based ULPs [<a href="./rfc4346" title=""The TLS Protocol Version 1.1"">RFC4346</a>]. Used underneath RDMAP, these
security services also provide for stream authentication, data
integrity, and confidentiality. As discussed in [<a href="#ref-RDMASEC" title=""Direct Data Placement Protocol (DDP) / Remote Direct Memory Access Protocol (RDMAP) Security"">RDMASEC</a>],
limitations on the maximum packet length to be carried over the
network and potentially inefficient out-of-order packet processing at
the data sink make SSL and TLS less appropriate for RDMAP than IPsec.
If SSL is layered on top of RDMAP, SSL does not protect the RDMAP
headers. Thus, a man-in-the-middle attack can still occur by
modifying the RDMAP header to incorrectly place the data into the
wrong buffer, thus effectively corrupting the data stream.
By remaining independent of ULP and LLP security protocols, RDMAP
will benefit from continuing improvements at those layers. Users are
provided flexibility to adapt to their specific security requirements
and the ability to adapt to future security challenges. Given this,
<span class="grey">Recio, et al. Standards Track [Page 49]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-50" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
the vulnerabilities of RDMAP to active third-party interference are
no greater than any other protocol running over an LLP such as TCP or
SCTP.
<span class="h4"><a class="selflink" id="section-8.2.2" href="#section-8.2.2">8.2.2</a>. Requirements for IPsec Services for RDMAP</span>
Because IPsec is designed to secure arbitrary IP packet streams,
including streams where packets are lost, RDMAP can run on top of
IPsec without any change. IPsec packets are processed (e.g.,
integrity checked and possibly decrypted) in the order they are
received, and an RDMAP Data Sink will process the decrypted RDMA
Messages contained in these packets in the same manner as RDMA
Messages contained in unsecured IP packets.
The IP Storage working group has defined the normative IPsec
requirements for IP Storage [<a href="./rfc3723" title=""Securing Block Storage Protocols over IP"">RFC3723</a>]. Portions of this
specification are applicable to the RDMAP. In particular, a
compliant implementation of IPsec services for RDMAP MUST meet the
requirements as outlined in <a href="./rfc3723#section-2.3">Section 2.3 of [RFC3723]</a>. Without
replicating the detailed discussion in [<a href="./rfc3723" title=""Securing Block Storage Protocols over IP"">RFC3723</a>], this includes the
following requirements:
1. The implementation MUST support IPsec ESP [<a href="./rfc2406" title=""IP Encapsulating Security Payload (ESP)"">RFC2406</a>], as well as
the replay protection mechanisms of IPsec. When ESP is utilized,
per-packet data origin authentication, integrity, and replay
protection MUST be used.
2. It MUST support ESP in tunnel mode and MAY implement ESP in
transport mode.
3. It MUST support IKE [<a href="./rfc2409" title=""The Internet Key Exchange (IKE)"">RFC2409</a>] for peer authentication,
negotiation of security associations, and key management, using
the IPsec DOI [<a href="./rfc2407" title=""The Internet IP Security Domain of Interpretation of ISAKMP"">RFC2407</a>].
4. It MUST NOT interpret the receipt of a IKE Phase 2 delete message
as a reason for tearing down the RDMAP stream. Since IPsec
acceleration hardware may only be able to handle a limited number
of active IKE Phase 2 SAs, idle SAs may be dynamically brought
down, and a new SA be brought up again, if activity resumes.
5. It MUST support peer authentication using a pre-shared key, and
MAY support certificate-based peer authentication using digital
signatures. Peer authentication using the public key encryption
methods [<a href="./rfc2409" title=""The Internet Key Exchange (IKE)"">RFC2409</a>] SHOULD NOT be used.
<span class="grey">Recio, et al. Standards Track [Page 50]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-51" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
6. It MUST support IKE Main Mode and SHOULD support Aggressive Mode.
IKE Main Mode with pre-shared key authentication SHOULD NOT be
used when either of the peers uses a dynamically assigned IP
address.
7. When digital signatures are used to achieve authentication,
either IKE Main Mode or IKE Aggressive Mode MAY be used. In
these cases, an IKE negotiator SHOULD use IKE Certificate Request
Payload(s) to specify the certificate authority (or authorities)
that are trusted in accordance with its local policy. IKE
negotiators SHOULD check the pertinent Certificate Revocation
List (CRL) before accepting a PKI certificate for use in IKE's
authentication procedures.
8. Access to locally stored secret information (pre-shared or
private key for digital signing) must be suitably restricted,
since compromise of the secret information nullifies the security
properties of the IKE/IPsec protocols.
9. It MUST follow the guidelines of <a href="./rfc3723#section-2.3.4">Section 2.3.4 of [RFC3723]</a> on
the setting of IKE parameters to achieve a high level of
interoperability without requiring extensive configuration.
Furthermore, implementation and deployment of the IPsec services for
RDDP should follow the Security Considerations outlined in <a href="./rfc3723#section-5">Section 5
of [RFC3723]</a>.
<span class="h2"><a class="selflink" id="section-9" href="#section-9">9</a>. IANA Considerations</span>
This document requests no direct action from IANA. The following
consideration is listed here as commentary.
If RDMAP was enabled a priori for a ULP by connecting to a well-known
port, this well-known port would be registered for the RDMAP with
IANA. The registration of the well-known port will be the
responsibility of the ULP specification.
<span class="grey">Recio, et al. Standards Track [Page 51]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-52" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h2"><a class="selflink" id="section-10" href="#section-10">10</a>. References</span>
<span class="h3"><a class="selflink" id="section-10.1" href="#section-10.1">10.1</a>. Normative References</span>
[<a id="ref-DDP">DDP</a>] Shah, H., Pinkerton, J., Recio, R., and P. Culley, "Direct
Data Placement over Reliable Transports", <a href="./rfc5041">RFC 5041</a>, October
2007.
[<a id="ref-iSER">iSER</a>] Ko, M., Chadalapaka, M., Hufferd, J., Elzur, U., Shah, H.,
and P. Thaler, "Internet Small Computer System Interface
(iSCSI) Extensions for Remote Direct Memory Access (RDMA)"
<a href="./rfc5046">RFC 5046</a>, October 2007.
[<a id="ref-MPA">MPA</a>] Culley, P., Elzur, U., Recio, R., Bailey, S., and J.
Carrier, "Marker PDU Aligned Framing for TCP
Specification", <a href="./rfc5044">RFC 5044</a>, October 2007.
[<a id="ref-RDMASEC">RDMASEC</a>] Pinkerton, J. and E. Deleganes, "Direct Data Placement
Protocol (DDP) / Remote Direct Memory Access Protocol
(RDMAP) Security", <a href="./rfc5042">RFC 5042</a>, October 2007.
[<a id="ref-RFC2119">RFC2119</a>] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", <a href="https://www.rfc-editor.org/bcp/bcp14">BCP 14</a>, <a href="./rfc2119">RFC 2119</a>, March 1997.
[<a id="ref-RFC2406">RFC2406</a>] Kent, S. and R. Atkinson, "IP Encapsulating Security
Payload (ESP)", <a href="./rfc2406">RFC 2406</a>, November 1998.
[<a id="ref-RFC2407">RFC2407</a>] Piper, D., "The Internet IP Security Domain of
Interpretation of ISAKMP", <a href="./rfc2407">RFC 2407</a>, November 1998.
[<a id="ref-RFC2409">RFC2409</a>] Harkins, D. and D. Carrel, "The Internet Key Exchange
(IKE)", <a href="./rfc2409">RFC 2409</a>, November 1998.
[<a id="ref-RFC3723">RFC3723</a>] Aboba, B., Tseng, J., Walker, J., Rangan, V., and F.
Travostino, "Securing Block Storage Protocols over IP", <a href="./rfc3723">RFC</a>
<a href="./rfc3723">3723</a>, April 2004.
[<a id="ref-RFC2401">RFC2401</a>] Kent, S. and R. Atkinson, "Security Architecture for the
Internet Protocol", <a href="./rfc2401">RFC 2401</a>, November 1998.
[<a id="ref-SCTP">SCTP</a>] Stewart, R., Ed., "Stream Control Transmission Protocol",
<a href="./rfc4960">RFC 4960</a>, September 2007.
[<a id="ref-TCP">TCP</a>] Postel, J., "Transmission Control Protocol", STD 7, <a href="./rfc793">RFC</a>
<a href="./rfc793">793</a>, September 1981.
<span class="grey">Recio, et al. Standards Track [Page 52]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-53" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h3"><a class="selflink" id="section-10.2" href="#section-10.2">10.2</a>. Informative References</span>
[<a id="ref-RFC4301">RFC4301</a>] Kent, S. and K. Seo, "Security Architecture for the
Internet Protocol", <a href="./rfc4301">RFC 4301</a>, December 2005.
[<a id="ref-RFC4303">RFC4303</a>] Kent, S., "IP Encapsulating Security Payload (ESP)", <a href="./rfc4303">RFC</a>
<a href="./rfc4303">4303</a>, December 2005.
[<a id="ref-RFC4306">RFC4306</a>] Kaufman, C., "Internet Key Exchange (IKEv2) Protocol", <a href="./rfc4306">RFC</a>
<a href="./rfc4306">4306</a>, December 2005.
[<a id="ref-RFC4346">RFC4346</a>] Dierks, T. and E. Rescorla, "The TLS Protocol Version 1.1",
<a href="./rfc4346">RFC 4346</a>, April 2006.
[<a id="ref-RFC4835">RFC4835</a>] Manral, V., "Cryptographic Algorithm Implementation
Requirements for Encapsulating Security Payload (ESP) and
Authentication Header (AH)", <a href="./rfc4835">RFC 4835</a>, April 2007.
<span class="grey">Recio, et al. Standards Track [Page 53]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-54" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h2"><a class="selflink" id="appendix-A" href="#appendix-A">Appendix A</a>. DDP Segment Formats for RDMA Messages</span>
This appendix is for information only and is NOT part of the
standard. It simply depicts the DDP Segment format for the various
RDMA Messages.
<span class="h3"><a class="selflink" id="appendix-A.1" href="#appendix-A.1">A.1</a>. DDP Segment for RDMA Write</span>
The following figure depicts an RDMA Write, DDP Segment:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP Control | RDMA Control |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Data Sink STag |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Data Sink Tagged Offset |
+ +
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| RDMA Write ULP Payload |
// //
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 11: RDMA Write, DDP Segment Format
<span class="grey">Recio, et al. Standards Track [Page 54]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-55" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h3"><a class="selflink" id="appendix-A.2" href="#appendix-A.2">A.2</a>. DDP Segment for RDMA Read Request</span>
The following figure depicts an RDMA Read Request, DDP Segment:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP Control | RDMA Control |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Reserved (Not Used) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP (RDMA Read Request) Queue Number |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP (RDMA Read Request) Message Sequence Number |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP (RDMA Read Request) Message Offset |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Data Sink STag (SinkSTag) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
+ Data Sink Tagged Offset (SinkTO) +
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| RDMA Read Message Size (RDMARDSZ) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Data Source STag (SrcSTag) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
+ Data Source Tagged Offset (SrcTO) +
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 12: RDMA Read Request, DDP Segment format
<span class="grey">Recio, et al. Standards Track [Page 55]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-56" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h3"><a class="selflink" id="appendix-A.3" href="#appendix-A.3">A.3</a>. DDP Segment for RDMA Read Response</span>
The following figure depicts an RDMA Read Response, DDP Segment:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP Control | RDMA Control |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Data Sink STag |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Data Sink Tagged Offset |
+ +
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| RDMA Read Response ULP Payload |
// //
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 13: RDMA Read Response, DDP Segment Format
<span class="h3"><a class="selflink" id="appendix-A.4" href="#appendix-A.4">A.4</a>. DDP Segment for Send and Send with Solicited Event</span>
The following figure depicts a Send and Send with Solicited
Request, DDP Segment:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP Control | RDMA Control |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Reserved (Not Used) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| (Send) Queue Number |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| (Send) Message Sequence Number |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| (Send) Message Offset |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Send ULP Payload |
// //
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 14: Send and Send with Solicited Event, DDP Segment Format
<span class="grey">Recio, et al. Standards Track [Page 56]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-57" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h3"><a class="selflink" id="appendix-A.5" href="#appendix-A.5">A.5</a>. DDP Segment for Send with Invalidate and Send with SE and</span>
Invalidate
The following figure depicts a Send with Invalidate and Send with
Solicited and Invalidate Request, DDP Segment:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP Control | RDMA Control |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Invalidate STag |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| (Send) Queue Number |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| (Send) Message Sequence Number |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| (Send) Message Offset |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Send ULP Payload |
// //
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 15: Send with Invalidate and Send with SE and Invalidate,
DDP Segment Format
<span class="grey">Recio, et al. Standards Track [Page 57]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-58" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h3"><a class="selflink" id="appendix-A.6" href="#appendix-A.6">A.6</a>. DDP Segment for Terminate</span>
The following figure depicts a Terminate, DDP Segment:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP Control | RDMA Control |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Reserved (Not Used) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP (Terminate) Queue Number |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP (Terminate) Message Sequence Number |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP (Terminate) Message Offset |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Terminate Control | Reserved |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| DDP Segment Length (if any) | |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ +
| |
+ +
| Terminated DDP Header (if any) |
+ +
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
// //
| Terminated RDMA Header (if any) |
+ +
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 16: Terminate, DDP Segment Format
<span class="grey">Recio, et al. Standards Track [Page 58]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-59" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
<span class="h2"><a class="selflink" id="appendix-B" href="#appendix-B">Appendix B</a>. Ordering and Completion Table</span>
The following table summarizes the ordering relationships that are
defined in <a href="#section-5.5">Section 5.5</a>, "Ordering and Completions", from the
standpoint of the local peer issuing the two Operations. Note that
in the table that follows, Send includes Send, Send with Invalidate,
Send with Solicited Event, and Send with Solicited Event and
Invalidate.
------+-------+----------------+----------------+----------------
First | Later | Placement | Placement | Ordering
Op | Op | guarantee at | guarantee at | guarantee at
| | Remote Peer | Local Peer | Remote Peer
| | | |
------+-------+----------------+----------------+----------------
Send | Send | No placement | Not applicable | Completed in
| | guarantee. If | | order.
| | guarantee is | |
| | necessary, see | |
| | footnote 1. | |
------+-------+----------------+----------------+----------------
Send | RDMA | No placement | Not applicable | Not applicable
| Write | guarantee. If | |
| | guarantee is | |
| | necessary, see | |
| | footnote 1. | |
------+-------+----------------+----------------+----------------
Send | RDMA | No placement | RDMA Read | RDMA Read
| Read | guarantee | Response | Response
| | between Send | Payload will | Message will
| | Payload and | not be placed | not be
| | RDMA Read | at the local | generated until
| | Request Header | peer until the | Send has been
| | | Send Payload is| Completed
| | | placed at the |
| | | Remote Peer |
------+-------+----------------+----------------+----------------
RDMA | Send | No placement | Not applicable | Not applicable
Write | | guarantee. If | |
| | guarantee is | |
| | necessary, see | |
| | footnote 1. | |
<span class="grey">Recio, et al. Standards Track [Page 59]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-60" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
------+-------+----------------+----------------+----------------
RDMA | RDMA | No placement | Not applicable | Not applicable
Write | Write | guarantee. If | |
| | guarantee is | |
| | necessary, see | |
| | footnote 1. | |
------+-------+----------------+----------------+----------------
RDMA | RDMA | No placement | RDMA Read | Not applicable
Write | Read | guarantee | Response |
| | between RDMA | Payload will |
| | Write Payload | not be placed |
| | and RDMA Read | at the local |
| | Request Header | peer until the |
| | | RDMA Write |
| | | Payload is |
| | | placed at the |
| | | Remote Peer |
------+-------+----------------+----------------+----------------
RDMA | Send | No placement | Send Payload | Not applicable
Read | | guarantee | may be placed |
| | between RDMA | at the remote |
| | Read Request | peer before the|
| | Header and Send| RDMA Read |
| | payload | Response is |
| | | generated. |
| | | If guarantee is|
| | | necessary, see |
| | | footnote 2. |
------+-------+----------------+----------------+----------------
RDMA | RDMA | No placement | RDMA Write | Not applicable
Read | Write | guarantee | Payload may be |
| | between RDMA | placed at the |
| | Read Request | Remote Peer |
| | Header and RDMA| before the RDMA|
| | Write payload | Read Response |
| | | is generated. |
| | | If guarantee is|
| | | necessary, see |
| | | footnote 2. |
<span class="grey">Recio, et al. Standards Track [Page 60]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-61" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
------+-------+----------------+----------------+----------------
RDMA | RDMA | No placement | No placement | Second RDMA
Read | Read | guarantee of | guarantee of | Read Response
| | the two RDMA | the two RDMA | will not be
| | Read Request | Read Response | generated until
| | Headers | Payloads. | first RDMA Read
| | Additionally, | | Response is
| | there is no | | generated.
| | guarantee that | |
| | the Tagged | |
| | Buffers | |
| | referenced in | |
| | the RDMA Read | |
| | will be read in| |
| | order | |
Figure 17: Operation Ordering
Footnote 1: If the guarantee is necessary, a ULP may insert an RDMA
Read operation and wait for it to complete to act as a Fence.
Footnote 2: If the guarantee is necessary, a ULP may wait for the
RDMA Read operation to complete before performing the Send.
<span class="h2"><a class="selflink" id="appendix-C" href="#appendix-C">Appendix C</a>. Contributors</span>
Dwight Barron
Hewlett-Packard Company
20555 SH 249
Houston, TX 77070-2698 USA
Phone: 281-514-2769
EMail: dwight.barron@hp.com
Caitlin Bestler
Broadcom Corporation
16215 Alton Parkway
Irvine, CA 92619-7013 USA
Phone: 949-926-6383
EMail: caitlinb@broadcom.com
John Carrier
Cray, Inc.
411 First Avenue S, Suite 600
Seattle, WA 98104-2860 USA
Phone: 206-701-2090
EMail: carrier@cray.com
<span class="grey">Recio, et al. Standards Track [Page 61]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-62" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Ted Compton
EMC Corporation
Research Triangle Park, NC 27709 USA
Phone: 919-248-6075
EMail: compton_ted@emc.com
Uri Elzur
Broadcom Corporation
16215 Alton Parkway
Irvine, California 92619-7013 USA
Phone: +1 (949) 585-6432
EMail: Uri@Broadcom.com
Hari Ghadia
Gen10 Technology, Inc.
1501 W Shady Grove Road
Grand Prairie, TX 75050
Phone: (972) 301 3630
EMail: hghadia@gen10technology.com
Howard C. Herbert
Intel Corporation
MS CH7-404
5000 West Chandler Blvd.
Chandler, Arizona 85226
Phone: 480-554-3116
EMail: howard.c.herbert@intel.com
Mike Ko
IBM
650 Harry Rd.
San Jose, CA 95120
Phone: (408) 927-2085
EMail: mako@us.ibm.com
Mike Krause
Hewlett-Packard Company
43LN
19410 Homestead Road
Cupertino, CA 95014 USA
Phone: 408-447-3191
EMail: krause@cup.hp.com
<span class="grey">Recio, et al. Standards Track [Page 62]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-63" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Dave Minturn
Intel Corporation
MS JF1-210
5200 North East Elam Young Parkway
Hillsboro, Oregon 97124
Phone: 503-712-4106
EMail: dave.b.minturn@intel.com
Mike Penna
Broadcom Corporation
16215 Alton Parkway
Irvine, California 92619-7013 USA
Phone: +1 (949) 926-7149
EMail: MPenna@Broadcom.com
Jim Pinkerton
Microsoft, Inc.
One Microsoft Way
Redmond, WA 98052 USA
EMail: jpink@microsoft.com
Hemal Shah
Broadcom Corporation
5300 California Avenue
Irvine, CA 92617 USA
Phone: +1 (949) 926-6941
EMail: hemal@broadcom.com
Allyn Romanow
Cisco Systems
170 W Tasman Drive
San Jose, CA 95134 USA
Phone: +1 408 525 8836
EMail: allyn@cisco.com
Tom Talpey
Network Appliance
1601 Trapelo Road #16
Waltham, MA 02451 USA
Phone: +1 (781) 768-5329
EMail: thomas.talpey@netapp.com
<span class="grey">Recio, et al. Standards Track [Page 63]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-64" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Patricia Thaler
Broadcom Corporation
16215 Alton Parkway
Irvine, CA 92619-7013 USA
Phone: +1-916-570-2707
EMail: pthaler@broadcom.com
Jim Wendt
Hewlett-Packard Company
8000 Foothills Boulevard MS 5668
Roseville, CA 95747-5668 USA
Phone: +1 916 785 5198
EMail: jim_wendt@hp.com
Madeline Vega
IBM
11400 Burnet Rd. Bld.45-2L-007
Austin, TX 78758 USA
Phone: 512-838-7739
EMail: mvega1@us.ibm.com
Claudia Salzberg
IBM
11501 Burnet Rd. Bld.902-5B-014
Austin, TX 78758 USA
Phone: 512-838-5156
EMail: salzberg@us.ibm.com
<span class="grey">Recio, et al. Standards Track [Page 64]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-65" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Authors' Addresses
Renato J. Recio
IBM Corp.
11501 Burnett Road
Austin, TX 78758 USA
Phone: 512-838-3685
EMail: recio@us.ibm.com
Bernard Metzler
IBM Research GmbH
Zurich Research Laboratory
Saeumerstrasse 4
CH-8803 Rueschlikon, Switzerland
Phone: +41 44 724 8605
EMail: bmt@zurich.ibm.com
Paul R. Culley
Hewlett-Packard Company
20555 SH 249
Houston, TX 77070-2698 USA
Phone: 281-514-5543
EMail: paul.culley@hp.com
Jeff Hilland
Hewlett-Packard Company
20555 SH 249
Houston, TX 77070-2698 USA
Phone: 281-514-9489
EMail: jeff.hilland@hp.com
Dave Garcia
24100 Hutchinson Rd.
Los Gatos, CA 95033 USA
Phone: +1 (831) 247-4464
Email: Dave.Garcia@StanfordAlumni.org
<span class="grey">Recio, et al. Standards Track [Page 65]</span></pre>
<hr class='noprint'/><!--NewPage--><pre class='newpage'><span id="page-66" ></span>
<span class="grey"><a href="./rfc5040">RFC 5040</a> RDMA Protocol Specification October 2007</span>
Full Copyright Statement
Copyright (C) The IETF Trust (2007).
This document is subject to the rights, licenses and restrictions
contained in <a href="https://www.rfc-editor.org/bcp/bcp78">BCP 78</a>, and except as set forth therein, the authors
retain all their rights.
This document and the information contained herein are provided on an
"AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS
OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY, THE IETF TRUST AND
THE INTERNET ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS
OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF
THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED
WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Intellectual Property
The IETF takes no position regarding the validity or scope of any
Intellectual Property Rights or other rights that might be claimed to
pertain to the implementation or use of the technology described in
this document or the extent to which any license under such rights
might or might not be available; nor does it represent that it has
made any independent effort to identify any such rights. Information
on the procedures with respect to rights in RFC documents can be
found in <a href="https://www.rfc-editor.org/bcp/bcp78">BCP 78</a> and <a href="https://www.rfc-editor.org/bcp/bcp79">BCP 79</a>.
Copies of IPR disclosures made to the IETF Secretariat and any
assurances of licenses to be made available, or the result of an
attempt made to obtain a general license or permission for the use of
such proprietary rights by implementers or users of this
specification can be obtained from the IETF on-line IPR repository at
<a href="http://www.ietf.org/ipr">http://www.ietf.org/ipr</a>.
The IETF invites any interested party to bring to its attention any
copyrights, patents or patent applications, or other proprietary
rights that may cover technology that may be required to implement
this standard. Please address the information to the IETF at
ietf-ipr@ietf.org.
Recio, et al. Standards Track [Page 66]
</pre>
|