1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 1032 1033 1034 1035 1036 1037 1038 1039 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057 1058 1059 1060 1061 1062 1063 1064 1065 1066 1067 1068 1069 1070 1071 1072 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085 1086 1087 1088 1089 1090 1091 1092 1093 1094 1095 1096 1097 1098 1099 1100 1101 1102 1103 1104 1105 1106 1107 1108 1109 1110 1111 1112 1113 1114 1115 1116 1117 1118 1119 1120 1121 1122 1123 1124 1125 1126 1127 1128 1129 1130 1131 1132 1133 1134 1135 1136 1137 1138 1139 1140 1141 1142 1143 1144 1145 1146 1147 1148 1149 1150 1151 1152 1153 1154 1155 1156 1157 1158 1159 1160 1161 1162 1163 1164 1165 1166 1167 1168 1169 1170 1171 1172 1173 1174 1175 1176 1177 1178 1179 1180 1181 1182 1183 1184 1185 1186 1187 1188 1189 1190 1191 1192 1193 1194 1195 1196 1197 1198 1199 1200 1201 1202 1203 1204 1205 1206 1207 1208 1209 1210 1211 1212 1213 1214 1215 1216 1217 1218 1219 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239 1240 1241 1242 1243 1244 1245 1246 1247 1248 1249 1250 1251 1252 1253 1254 1255 1256 1257 1258 1259 1260 1261 1262 1263 1264 1265 1266 1267 1268 1269 1270 1271 1272 1273 1274 1275 1276 1277 1278 1279 1280 1281 1282 1283 1284 1285 1286 1287 1288 1289 1290 1291 1292 1293 1294 1295 1296 1297 1298 1299 1300 1301 1302 1303 1304 1305 1306 1307 1308 1309 1310 1311 1312 1313 1314 1315 1316 1317 1318 1319 1320 1321 1322 1323 1324 1325 1326 1327 1328 1329 1330 1331 1332 1333 1334 1335 1336 1337 1338 1339 1340 1341 1342 1343 1344 1345 1346 1347 1348 1349 1350 1351 1352 1353 1354 1355 1356 1357 1358 1359 1360 1361 1362 1363 1364 1365 1366 1367 1368 1369 1370 1371 1372 1373 1374 1375 1376 1377 1378 1379 1380 1381 1382 1383 1384 1385 1386 1387 1388 1389 1390 1391 1392 1393 1394 1395 1396 1397 1398 1399 1400 1401 1402 1403 1404 1405 1406 1407 1408 1409 1410 1411 1412 1413 1414 1415 1416 1417 1418 1419 1420 1421 1422 1423 1424 1425 1426 1427 1428 1429 1430 1431 1432 1433 1434 1435 1436 1437 1438 1439 1440 1441 1442 1443 1444 1445 1446 1447 1448 1449 1450 1451 1452 1453 1454 1455 1456 1457 1458 1459 1460 1461 1462 1463 1464 1465 1466 1467 1468 1469 1470 1471 1472 1473 1474 1475 1476 1477 1478 1479 1480 1481 1482 1483 1484 1485 1486 1487 1488 1489 1490 1491 1492 1493 1494 1495 1496 1497 1498 1499 1500 1501 1502 1503 1504 1505 1506 1507 1508 1509 1510 1511 1512 1513 1514 1515 1516 1517 1518 1519 1520 1521 1522 1523 1524 1525 1526 1527 1528 1529 1530 1531 1532 1533 1534 1535 1536 1537 1538 1539 1540 1541 1542 1543 1544 1545 1546 1547 1548 1549 1550 1551 1552 1553 1554 1555 1556 1557 1558 1559 1560 1561 1562 1563 1564 1565 1566 1567 1568 1569 1570 1571 1572 1573 1574 1575 1576 1577 1578 1579 1580 1581 1582 1583 1584 1585 1586 1587 1588 1589 1590 1591 1592 1593 1594 1595 1596 1597 1598 1599 1600 1601 1602 1603 1604 1605 1606 1607 1608 1609 1610 1611 1612 1613 1614 1615 1616 1617 1618 1619 1620 1621 1622 1623 1624 1625 1626 1627 1628 1629 1630 1631 1632 1633 1634 1635 1636 1637 1638 1639 1640 1641 1642 1643 1644 1645 1646 1647 1648 1649 1650 1651 1652 1653 1654 1655 1656 1657 1658 1659 1660 1661 1662 1663 1664 1665 1666 1667 1668 1669 1670 1671 1672 1673 1674 1675 1676 1677 1678 1679 1680 1681 1682 1683 1684 1685 1686 1687 1688 1689 1690 1691 1692 1693 1694 1695 1696 1697 1698 1699 1700 1701 1702 1703 1704 1705 1706 1707 1708 1709 1710 1711 1712 1713 1714 1715 1716 1717 1718 1719 1720 1721 1722 1723 1724 1725 1726 1727 1728 1729 1730 1731 1732 1733 1734 1735 1736 1737 1738 1739 1740 1741 1742 1743 1744 1745 1746 1747 1748 1749 1750 1751 1752 1753 1754 1755 1756 1757 1758 1759 1760 1761 1762 1763 1764 1765 1766 1767 1768 1769 1770 1771 1772 1773 1774 1775 1776 1777 1778 1779 1780 1781 1782 1783 1784 1785 1786 1787 1788 1789 1790 1791 1792 1793 1794 1795 1796 1797 1798 1799 1800 1801 1802 1803 1804 1805 1806 1807 1808 1809 1810 1811 1812 1813 1814 1815 1816 1817 1818 1819 1820 1821 1822 1823 1824 1825 1826 1827 1828 1829 1830 1831 1832 1833 1834 1835 1836 1837 1838 1839 1840 1841 1842 1843 1844 1845 1846 1847 1848 1849 1850 1851 1852 1853 1854 1855 1856 1857 1858 1859 1860 1861 1862 1863 1864 1865 1866 1867 1868 1869 1870 1871 1872 1873 1874 1875 1876 1877 1878 1879 1880 1881 1882 1883 1884 1885 1886 1887 1888 1889 1890 1891 1892 1893 1894 1895 1896 1897 1898 1899 1900 1901 1902 1903 1904 1905 1906 1907 1908 1909 1910 1911 1912 1913 1914 1915 1916 1917 1918 1919 1920 1921 1922 1923 1924 1925 1926 1927 1928 1929 1930 1931 1932 1933 1934 1935 1936 1937 1938 1939 1940 1941 1942 1943 1944 1945 1946 1947 1948 1949 1950 1951 1952 1953 1954 1955 1956 1957 1958 1959 1960 1961 1962 1963 1964 1965 1966 1967 1968 1969 1970 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 2025 2026 2027 2028 2029 2030 2031 2032 2033 2034 2035 2036 2037 2038 2039 2040 2041 2042 2043 2044 2045 2046 2047 2048 2049 2050 2051 2052 2053 2054 2055 2056 2057 2058 2059 2060 2061 2062 2063 2064 2065 2066 2067 2068 2069 2070 2071 2072 2073 2074 2075 2076 2077 2078 2079 2080 2081 2082 2083 2084 2085 2086 2087 2088 2089 2090 2091 2092 2093 2094 2095 2096 2097 2098 2099 2100 2101 2102 2103 2104 2105 2106 2107 2108 2109 2110 2111 2112 2113 2114 2115 2116 2117 2118 2119 2120 2121 2122 2123 2124 2125 2126 2127 2128 2129 2130 2131 2132 2133 2134 2135 2136 2137 2138 2139 2140 2141 2142 2143 2144 2145 2146 2147 2148 2149 2150 2151 2152 2153 2154 2155 2156 2157 2158 2159 2160 2161 2162 2163 2164 2165 2166 2167 2168 2169 2170 2171 2172 2173 2174 2175 2176 2177 2178 2179 2180 2181 2182 2183 2184 2185 2186 2187 2188 2189 2190 2191 2192 2193 2194 2195 2196 2197 2198 2199 2200 2201 2202 2203 2204 2205 2206 2207 2208 2209 2210 2211 2212 2213 2214 2215 2216 2217 2218 2219 2220 2221 2222 2223 2224 2225 2226 2227 2228 2229 2230 2231 2232 2233 2234 2235 2236 2237 2238 2239 2240 2241 2242 2243 2244 2245 2246 2247 2248 2249 2250 2251 2252 2253 2254 2255 2256 2257 2258 2259 2260 2261 2262 2263 2264 2265 2266 2267 2268 2269 2270 2271 2272 2273 2274 2275 2276 2277 2278 2279 2280 2281 2282 2283 2284 2285 2286 2287 2288 2289 2290 2291 2292 2293 2294 2295 2296 2297 2298 2299 2300 2301 2302 2303 2304 2305 2306 2307 2308 2309 2310 2311 2312 2313 2314 2315 2316 2317 2318 2319 2320 2321 2322 2323 2324 2325 2326 2327 2328 2329 2330 2331 2332 2333 2334 2335 2336 2337 2338 2339 2340 2341 2342 2343 2344 2345 2346 2347 2348 2349 2350 2351 2352 2353 2354 2355 2356 2357 2358 2359 2360 2361 2362 2363 2364 2365 2366 2367 2368 2369 2370 2371 2372 2373 2374 2375 2376 2377 2378 2379 2380 2381 2382 2383 2384 2385 2386 2387 2388 2389 2390 2391 2392 2393 2394 2395 2396 2397 2398 2399 2400 2401 2402 2403 2404 2405 2406 2407 2408 2409 2410 2411 2412 2413 2414 2415 2416 2417 2418 2419 2420 2421 2422 2423 2424 2425 2426 2427 2428 2429 2430 2431 2432 2433 2434 2435 2436 2437 2438 2439 2440 2441 2442 2443 2444 2445 2446 2447 2448 2449 2450 2451 2452 2453 2454 2455 2456 2457 2458 2459 2460 2461 2462 2463 2464 2465 2466 2467 2468 2469 2470 2471 2472 2473 2474 2475 2476 2477 2478 2479 2480 2481 2482 2483 2484 2485 2486 2487 2488 2489 2490 2491 2492 2493 2494 2495 2496 2497 2498 2499 2500 2501 2502 2503 2504 2505 2506 2507 2508 2509 2510 2511 2512 2513 2514 2515 2516 2517 2518 2519 2520 2521 2522 2523 2524 2525 2526 2527 2528 2529 2530 2531 2532 2533 2534 2535 2536 2537 2538 2539 2540 2541 2542 2543 2544 2545 2546 2547 2548 2549 2550 2551 2552 2553 2554 2555 2556 2557 2558 2559 2560 2561 2562 2563 2564 2565 2566 2567 2568 2569 2570 2571 2572 2573 2574 2575 2576 2577 2578 2579 2580 2581 2582 2583 2584 2585 2586 2587 2588 2589 2590 2591 2592 2593 2594 2595 2596 2597 2598 2599 2600 2601 2602 2603 2604 2605 2606 2607 2608 2609 2610 2611 2612 2613 2614 2615 2616 2617 2618 2619 2620 2621 2622 2623 2624 2625 2626 2627 2628 2629 2630 2631 2632 2633 2634 2635 2636 2637 2638 2639 2640 2641 2642 2643 2644 2645 2646 2647 2648 2649 2650 2651 2652 2653 2654 2655 2656 2657 2658 2659 2660 2661 2662 2663 2664 2665 2666 2667 2668 2669 2670 2671 2672 2673 2674 2675 2676 2677 2678 2679 2680 2681 2682 2683 2684 2685 2686 2687 2688 2689 2690 2691 2692 2693 2694 2695 2696 2697 2698 2699 2700 2701 2702 2703 2704 2705 2706 2707 2708 2709 2710 2711 2712 2713 2714 2715 2716 2717 2718 2719 2720 2721 2722 2723 2724 2725 2726 2727 2728 2729 2730 2731 2732 2733 2734 2735 2736 2737 2738 2739 2740 2741 2742 2743 2744 2745 2746 2747 2748 2749 2750 2751 2752 2753 2754 2755 2756 2757 2758 2759 2760 2761 2762 2763 2764 2765 2766 2767 2768 2769 2770 2771 2772 2773 2774 2775 2776 2777 2778 2779 2780 2781 2782 2783 2784 2785 2786 2787 2788 2789 2790 2791 2792 2793 2794 2795 2796 2797 2798 2799 2800 2801 2802 2803 2804 2805 2806 2807 2808 2809 2810 2811 2812 2813 2814 2815 2816 2817 2818 2819 2820 2821 2822 2823 2824 2825 2826 2827 2828 2829 2830 2831 2832 2833 2834 2835 2836 2837 2838 2839 2840 2841 2842 2843 2844 2845 2846 2847 2848 2849 2850 2851 2852 2853 2854 2855 2856 2857 2858 2859 2860 2861 2862 2863 2864 2865 2866 2867 2868 2869 2870 2871 2872 2873 2874 2875 2876 2877 2878 2879 2880 2881 2882 2883 2884 2885 2886 2887 2888 2889 2890 2891 2892 2893 2894 2895 2896 2897 2898 2899 2900 2901 2902 2903 2904 2905 2906 2907 2908 2909 2910 2911 2912 2913 2914 2915 2916 2917 2918 2919 2920 2921 2922 2923 2924 2925 2926 2927 2928 2929 2930 2931 2932 2933 2934 2935 2936 2937 2938 2939 2940 2941 2942 2943 2944 2945 2946 2947 2948 2949 2950 2951 2952 2953 2954 2955 2956 2957 2958 2959 2960 2961 2962 2963 2964 2965 2966 2967 2968 2969 2970 2971 2972 2973 2974 2975 2976 2977 2978 2979 2980 2981 2982 2983 2984 2985 2986 2987 2988 2989 2990 2991 2992 2993 2994 2995 2996 2997 2998 2999 3000 3001 3002 3003 3004 3005 3006 3007 3008 3009 3010 3011 3012 3013 3014 3015 3016 3017 3018 3019 3020 3021 3022 3023 3024 3025 3026 3027 3028 3029 3030 3031 3032 3033 3034 3035 3036 3037 3038 3039 3040 3041 3042 3043 3044 3045 3046 3047 3048 3049 3050 3051 3052 3053 3054 3055 3056 3057 3058 3059 3060 3061 3062 3063 3064 3065 3066 3067 3068 3069 3070 3071 3072 3073 3074 3075 3076 3077 3078 3079 3080 3081 3082 3083 3084 3085 3086 3087 3088 3089 3090 3091 3092 3093 3094 3095 3096 3097 3098 3099 3100 3101 3102 3103 3104 3105 3106 3107 3108 3109 3110 3111 3112 3113 3114 3115 3116 3117 3118 3119 3120 3121 3122 3123 3124 3125 3126 3127 3128 3129 3130 3131 3132 3133 3134 3135 3136 3137 3138 3139 3140 3141 3142 3143 3144 3145 3146 3147 3148 3149 3150 3151 3152 3153 3154 3155 3156 3157 3158 3159 3160 3161 3162 3163 3164 3165 3166 3167 3168 3169 3170 3171 3172 3173 3174 3175 3176 3177 3178 3179 3180 3181 3182 3183 3184 3185 3186 3187 3188 3189 3190 3191 3192 3193 3194 3195 3196 3197 3198 3199 3200 3201 3202 3203 3204 3205 3206 3207 3208 3209 3210 3211 3212 3213 3214 3215 3216 3217 3218 3219 3220 3221 3222 3223 3224 3225 3226 3227 3228 3229 3230 3231 3232 3233 3234 3235 3236 3237 3238 3239 3240 3241 3242 3243 3244 3245 3246 3247 3248 3249 3250 3251 3252 3253 3254 3255 3256 3257 3258 3259 3260 3261 3262 3263 3264 3265 3266 3267 3268 3269 3270 3271 3272 3273 3274 3275 3276 3277 3278 3279 3280 3281 3282 3283 3284 3285 3286 3287 3288 3289 3290 3291 3292 3293 3294 3295 3296 3297 3298 3299 3300 3301 3302 3303 3304 3305 3306 3307 3308 3309 3310 3311 3312 3313 3314 3315 3316 3317 3318 3319 3320 3321 3322 3323 3324 3325 3326 3327 3328 3329 3330 3331 3332 3333 3334 3335 3336 3337 3338 3339 3340 3341 3342 3343 3344 3345 3346 3347 3348 3349 3350 3351 3352 3353 3354 3355 3356 3357 3358 3359 3360 3361 3362 3363 3364 3365 3366 3367 3368 3369 3370 3371 3372 3373 3374 3375 3376 3377 3378 3379 3380 3381 3382 3383 3384 3385 3386 3387 3388 3389 3390 3391 3392 3393 3394 3395 3396 3397 3398 3399 3400 3401 3402 3403 3404 3405 3406 3407 3408 3409 3410 3411 3412 3413 3414 3415 3416 3417 3418 3419 3420 3421 3422 3423 3424 3425 3426 3427 3428 3429 3430 3431 3432 3433 3434 3435 3436 3437 3438 3439 3440 3441 3442 3443 3444 3445 3446 3447 3448 3449 3450 3451 3452 3453 3454 3455 3456 3457 3458 3459 3460 3461 3462 3463 3464 3465 3466 3467 3468 3469 3470 3471 3472 3473 3474 3475 3476 3477 3478 3479 3480 3481 3482 3483 3484 3485 3486 3487 3488 3489 3490 3491 3492 3493 3494 3495 3496 3497 3498 3499 3500 3501 3502 3503 3504 3505 3506 3507 3508 3509 3510 3511 3512 3513 3514 3515 3516 3517 3518 3519 3520 3521 3522 3523 3524 3525 3526 3527 3528 3529 3530 3531 3532 3533 3534 3535 3536 3537 3538 3539 3540 3541 3542 3543 3544 3545 3546 3547 3548 3549 3550 3551 3552 3553 3554 3555 3556 3557 3558 3559 3560 3561 3562 3563 3564 3565 3566 3567
|
Mesa 20.0.0 Release Notes / 2020-02-19
======================================
Mesa 20.0.0 is a new development release. People who are concerned with
stability and reliability should stick with a previous release or wait
for Mesa 20.0.1.
Mesa 20.0.0 implements the OpenGL 4.6 API, but the version reported by
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being
used. Some drivers don't support all the features required in OpenGL
4.6. OpenGL 4.6 is **only** available if requested at context creation.
Compatibility contexts may report a lower version depending on each
driver.
Mesa 20.0.0 implements the Vulkan 1.2 API, but the version reported by
the apiVersion property of the VkPhysicalDeviceProperties struct depends
on the particular driver being used.
SHA256 checksum
---------------
::
bb6db3e54b608d2536d4000b3de7dd3ae115fc114e8acbb5afff4b3bbed04b34 mesa-20.0.0.tar.xz
New features
------------
- OpenGL 4.6 on radeonsi.
- GL_ARB_gl_spirv on radeonsi.
- GL_ARB_spirv_extensions on radeonsi.
- GL_EXT_direct_state_access for compatibility profile.
- VK_AMD_device_coherent_memory on RADV.
- VK_AMD_mixed_attachment_samples on RADV.
- VK_AMD_shader_explicit_vertex_parameter on RADV.
- VK_AMD_shader_image_load_store_lod on RADV.
- VK_AMD_shader_fragment_mask on RADV.
- VK_EXT_subgroup_size_control on RADV/LLVM.
- VK_KHR_separate_depth_stencil_layouts on Intel, RADV.
- VK_KHR_shader_subgroup_extended_types on RADV.
- VK_KHR_swapchain_mutable_format on RADV.
- VK_KHR_shader_float_controls on RADV/ACO.
- GFX6 (Southern Islands) and GFX7 (Sea Islands) support on RADV/ACO.
- Wave32 support for GFX10 (Navi) on RADV/ACO.
- Compilation of Geometry Shaders on RADV/ACO.
- Vulkan 1.2 on Intel, RADV.
- GL_INTEL_shader_integer_functions2 and
VK_INTEL_shader_integer_functions2 on Intel.
Bug fixes
---------
- drisw crashes on calling NULL putImage on EGL surfaceless platform
(pbuffer EGLSurface)
- [radeonsi][vaapi][bisected] invalid VASurfaceID when playing
interlaced DVB stream in Kodi
- [RADV] GPU hangs while the cutscene plays in the game Assassin's
Creed Origins
- ACO: The Elder Scrolls Online crashes on startup (Navi)
- Broken rendering of glxgears on S/390 architecture (64bit, BigEndian)
- aco: sun flickering with Assassins Creeds Origins
- !1896 broke ext_image_dma_buf_import piglit tests with radeonsi
- aco: wrong geometry with Assassins Creed Origins on GFX6
- valgrind errors since commit a8ec4082a41
- OSMesa osmesa_choose_format returns a format not supported by
st_new_renderbuffer_fb
- Build error with VS on WIN
- Using EGL_KHR_surfaceless_context causes spurious "libEGL warning:
FIXME: egl/x11 doesn't support front buffer rendering."
- !3460 broke texsubimage test with piglit on zink+anv
- The screen is black when using ACO
- [Regression] JavaFX unbounded VRAM+RAM usage
- radv: implement VK_AMD_shader_explicit_vertex_parameter
- Civilization VI crashes when loading game (AMD Vega Mobile)
- [radeonsi] X-Server crashes when trying to start Guild Wars 2 with
the commits from !3421
- aco: implement GFX6 support
- Add support for VK_KHR_swapchain_mutable_format
- radv: The Surge 2 crashes in ac_get_elem_bits()
- [Regression] JavaFX unbounded VRAM+RAM usage
- Use the OpenCL dispatch defnitions from OpenCL_Headers
- [regression][ilk,g965,g45] various dEQP-GLES2.functional.shaders.\*
failures
- aco: Dead Rising 4 crashes in lower_to_hw_instr() on GFX6-GFX7
- libvulkan_radeon.so crash with \`free(): double free detected in
tcache 2\`
- Commit be08e6a causes crash in com.android.launcher3 (Launcher)
- anv: Regression causing issues for radv when there are no Intel
devices
- Mesa no longer compiles with GCC 10
- [Navi/aco] Guild Wars 2 - ring gfx timeout with commit 3bca0af2
- [radv/aco] Regression is causing a soft crash in The Witcher 3
- [bisected] [radeonsi] GPU hangs/resets while playing interlaced
content on Kodi with VAAPI
- [radeonsi] MSAA image not copied properly after image store through
texture view
- T-Rex and Manhattan onscreen performance issue on Android
- VkSamplerCreateInfo compareEnable not respected
- VkSamplerCreateInfo compareEnable not respected
- Freedreno drm softpin driver implementation leaks memory
- [POLARIS10] VRAM leak involving glTexImage2D with non-NULL data
argument
- [regression][bisected][ivb/byt] crucible test
func.push-constants.basic.q0 causes gpu hang
- MR 3096 broke lots of piglit ext_framebuffer_object tests on Raven
- Rise of the Tomb Raider benchmark crash on Dell XPS 7390 2-in-1 w/
Iris Plus Graphics (Ice Lake 8x8 GT2)
- Raven Ridge (2400G): Resident Evil 2 crashes my machine
- Common practice of glGetActiveUniform leads to O(N²) behavior in Mesa
- Rocket League ingame artifacts
- [radv] SteamVR direct mode no longer works
- [ANV] unused create parameters not properly ignored
- [Bisected] Mesa fails to start alacritty with the wayland backend
(AMD Vega).
- [iris] piglit test clip-distance-vs-gs-out fails due to VUE map
mismatch between VS <-> GS stages
- [radv] SteamVR direct mode no longer works
- Blocky corruption in The Surge 2
- radeonsi: Floating point exception on R9 270 gpu for a set of traces
- [RADV] [Navi] LOD artifacting in Halo - The Master Chief Collection
(Halo Reach)
- [CTS]
dEQP-VK.api.image_clearing.core.clear_color_image.2d.linear.single_layer.r32g32b32\_\*
fail on GFX6-GFX8
- Vulkan: Please consider adding another sample count to
sampledImageIntegerSampleCounts
- Navi10: Bitrate based encoding with VAAPI/RadeonSI unusable
- [RADV] create parameters not properly ignored
- [regression][bdw,gen9,hsw,icl][iris] gltcs failures on
mesa=8172b1fa03f
- Bugs in RadeonSI VAAPI implementation
- [GFX10] Glitch rendering Custom Avatars in Beat Saber
- intel/fs: Check for 16-bit immediates in
fs_visitor::lower_mul_dword_inst is too strict
- i965/iris: assert when destroy GL context with active query
- Visuals without alpha bits are not sRGB-capable
- swapchain throttling: wait for fence has 1ns timeout
- radeonsi: OpenGL app always produces page fault in gfxhub on Navi 10
- [regression]
KHR-GLES31.core.geometry_shader.api.program_pipeline_vs_gs_capture
fails for various drivers
- [CTS]
dEQP-VK.spirv_assembly.instruction.spirv1p4.entrypoint.tess_con_pc_entry_point
hangs on GFX10
- [RADV] SPIR-V warning when compiling shader using storage
multisampled image array
- [RADV] The Dead Rising 4 is causing a GPU hang with LLVM backend
- macOS u_thread.h:156:4: error: implicit declaration of function
'pthread_getcpuclockid'
- [Wine / Vulkan] Doom 2016 Hangs on Main Menu
- NULL resource when playing VP9 video through VDPAU on RX 570
- radeonsi: mpv --vo=vaapi incorrect rendering on gfx9+
- [BSW/BDW] skia lcdblendmode & lcdoverlap test failure
- Create a way to prefer iris vs i965 via driconf
- [Bisected] i965: CS:GO crashes in emit_deref_copy_load_store with
debug Mesa
- radv/aco Jedi Fallen Order hair rendering buggy
- Inaccurate information on https://docs.mesa3d.org/repository.html
about how to get git write access.
- [RADV] VK_KHR_timeline_semaphore balloons in runtime
- Shadow of Mordor has randomly dancing black shadows on Talion's face
- gen7 crucible failures func.push-constants.basic.q0 and
func.shader-subgroup-vote.basic.q0
- GL_EXT_disjoint_timer_query failing with GL_INVALID_ENUM
- Unreal 4 Elemental and MatineeFightScene demos misrender
- gputest gimark has unwanted black liquorice flakes
- triangle strip clipping with GL_FIRST_VERTEX_CONVENTION causes wrong
vertex's attribute to be broadcasted for flat interpolation
- [bisected][regression][g45,g965,ilk] piglit arb_fragment_program kil
failures
- glcts crashes since the enablement of ARB_shading_language_include
- Android build broken
- ld.lld: error: duplicate symbol (mesa-19.3.0-rc1)
- Divinity: Original Sin Enhanced Edition(Native) crash on start
- HSW. Tropico 6 and SuperTuxKart have shadows flickering
- GL_EXT_disjoint_timer_query failing with GL_INVALID_ENUM
- glxgears segfaults on POWER / Xvnc
- [regression][bdw,gen9,icl][iris] piglit failures on mesa
f9fd04aca15fd00889caa666ba38007268e67f5c
- Redundant builds of libmesa_classic and libmesa_gallium
- [IVB,BYT] [Regression] [Bisected] Core dump at launching
arb_compute_shader/linker/bug-93840.shader_test
- Vulkan drivers need access to format utils of gallium
- Disabling lower_fragdata_array causes shader-db to crash for some
drivers
- GL_EXT_disjoint_timer_query failing with GL_INVALID_ENUM
- Android build broken by commit 9020f51 "util/u_endian: Add error
checks"
- radv secure compile feature breaks compilation of RADV on armhf EABI
(19.3-rc1)
- radv_debug.c warnings when compiling on 32 bits : cast to pointer
from integer of different size
- Meson: Mesa3D build failure with standalone Mingw-w64 multilib
- [regression][bisected] KHR46 VertexArrayAttribFormat has unexpectedly
generated GL_INVALID_OPERATION
- textureSize(samplerExternalOES, int) missing in desktop mesa 19.1.7
implementation
- zink: implicly casting integers to pointers, warnings on 32-bit
compile
- Objects leaving trails in Firefox with antialias and
preserveDrawingBuffer in three.js WebGLRednerer with mesa 19.2
Changes
-------
Aaron Watry (1):
- clover/llvm: fix build after llvm 10 commit 1dfede3122ee
Adam Jackson (1):
- drisw: Cache the depth of the X drawable
Afonso Bordado (4):
- pan/midgard: Optimize comparisions with similar operations
- pan/midgard: Move midgard_is_branch_unit to helpers
- pan/midgard: Optimize branches with inverted arguments
- pan/midgard: Fix midgard_compile.h includes
Alan Coopersmith (1):
- intel/perf: adapt to platforms like Solaris without d_type in struct
dirent
Alejandro Piñeiro (4):
- v3d: adds an extra MOV for any sig.ld\*
- mesa/main/util: moving gallium u_mm to util, remove main/mm
- nir/opt_peephole_select: remove unused variables
- turnip: remove unused descriptor state dirty
Alexander van der Grinten (1):
- egl: Fix \_eglPointerIsDereferencable w/o mincore()
Alexander von Gluck IV (1):
- haiku/hgl: Fix build via header reordering
Alyssa Rosenzweig (223):
- pipe-loader: Build kmsro loader for with all kmsro targets
- pan/midgard: Remove OP_IS_STORE_VARY
- pan/midgard: Add a dummy source for loads
- pan/midgard: Refactor swizzles
- pan/midgard: Eliminate blank_alu_src
- pan/midgard: Use fp32 blend shaders
- pan/midgard: Validate tags when branching
- pan/midgard: Fix quadword_count handling
- pan/midgard: Compute bundle interference
- pan/midgard: Add bizarre corner case
- pan/midgard: offset_swizzle doesn't need dstsize
- pan/midgard: Extend offset_swizzle to non-32-bit
- pan/midgard: Extend swizzle packing for vec4/16-bit
- pan/midgard: Extend default_phys_reg to !32-bit
- panfrost/ci: Update T760 expectations
- pan/midgard: Fix printing of half-registers in texture ops
- pan/midgard: Disassemble half-steps correctly
- pan/midgard: Pass shader stage to disassembler
- pan/midgard: Switch base for vertex texturing on T720
- nir: Add load_output_u8_as_fp16_pan intrinsic
- pan/midgard: Identify ld_color_buffer_u8_as_fp16\*
- pan/midgard: Implement nir_intrinsic_load_output_u8_as_fp16_pan
- pan/midgard: Pack load/store masks
- panfrost: Select format-specific blending intrinsics
- pan/midgard: Add blend shader selection bits for MRT
- pan/midgard: Implement linearly-constrained register allocation
- pan/midgard: Integrate LCRA
- pan/midgard: Remove util/ra support
- pan/midgard: Compute spill costs
- pan/lcra: Use Chaitin's spilling heuristic
- pan/midgard: Copypropagate vector creation
- pan/midgard: Fix copypropagation for textures
- pan/midgard: Generalize texture registers across GPUs
- pan/midgard: Fix vertex texturing on early Midgard
- pan/midgard: Use texture, not textureLod, on early Midgard
- pan/midgard: Disassemble with old pipeline always on T720
- pan/midgard: Prioritize texture registers
- pan/midgard: Expand 64-bit writemasks
- pan/midgard: Implement i2i64 and u2u64
- pan/midgard: Fix mir_round_bytemask_down for !32b
- pan/midgard: Pack 64-bit swizzles
- pan/midgard: Use generic constant packing for 8/64-bit
- pan/midgard: Implement non-aligned UBOs
- pan/midgard: Expose more typesize helpers
- pan/midgard: Fix masks/alignment for 64-bit loads
- pan/midgard: Represent ld/st offset unpacked
- pan/midgard: Use shader stage in mir_op_computes_derivative
- panfrost: Stub out clover callbacks
- panfrost: Pass kernel inputs as uniforms
- panfrost: Disable tiling for GLOBAL resources
- panfrost: Set PIPE_COMPUTE_CAP_ADDRESS_BITS to 64
- pan/midgard: Introduce quirks checks
- panfrost: Add the lod_bias field
- nir: Add load_sampler_lod_paramaters_pan intrinsic
- pan/midgard: Implement load_sampler_lod_paramaters_pan
- pan/midgard: Add LOD bias/clamp lowering
- pan/midgard: Describe quirk MIDGARD_BROKEN_LOD
- pan/midgard: Enable LOD lowering only on buggy chips
- panfrost: Add lcra.c to Android.mk
- pan/midgard: Use lower_tex_without_implicit_lod
- panfrost: Add information about T720 tiling
- panfrost: Implement pan_tiler for non-hierarchy GPUs
- panfrost: Simplify draw_flags
- pan/midgard: Splatter on fragment out
- gitlab-ci: Remove non-default skips from Panfrost
- panfrost: Remove blend shader hack
- panfrost: Update SET_VALUE with information from igt
- panfrost: Rename SET_VALUE to WRITE_VALUE
- gallium/util: Support POLYGON in u_stream_outputs_for_vertices
- pan/midgard: Move spilling code out of scheduler
- pan/midgard: Split spill node selection/spilling
- pan/midgard: Simplify spillability test
- pan/midgard: Remove spill cost heuristic
- pan/midgard: Move bounds checking into LCRA
- pan/midgard: Remove consecutive_skip code
- pan/midgard: Remove code marked "TODO: remove me"
- pan/midgard: Dynamically allocate r26/27 for spills
- pan/midgard: Use no_spill bitmask
- pan/midgard: Don't use no_spill for memory spill src
- pan/midgard: Force alignment for csel_v
- pan/midgard: Don't try to free NULL in LCRA
- pan/midgard: Simplify and fix vector copyprop
- pan/midgard: Fix shift for TLS access
- panfrost: Describe thread local storage sizing rules
- panfrost: Rename unknown_address_0 -> scratchpad
- panfrost: Split stack_shift nibble from unk0
- panfrost: Add routines to calculate stack size/shift
- panfrost: Factor out panfrost_query_raw
- panfrost: Query core count and thread tls alloc
- panfrost: Route stack_size from compiler
- panfrost: Emit SFBD/MFBD after a batch, instead of before
- panfrost: Handle minor cppcheck issues
- pan/midgard: Remove unused ld/st packing hepers
- pan/midgard: Handle misc. cppcheck warnings
- panfrost: Calculate maximum stack_size per batch
- panfrost: Pass size to panfrost_batch_get_scratchpad
- pandecode: Add cast
- panfrost: Move nir_undef_to_zero to Midgard compiler
- panfrost: Move property queries to \_encoder
- panfrost: Add panfrost_model_name helper
- panfrost: Report GPU name in es2_info
- ci: Remove T760/T860 from CI temporarily
- panfrost: Pass blend RT number through
- pan/midgard: Add schedule barrier after fragment writeout
- pan/midgard: Writeout per render target
- pan/midgard: Fix liveness analysis with multiple epilogues
- pan/midgard: Set r1.w magic
- panfrost: Fix FBD issue
- ci: Reinstate Panfrost CI
- panfrost: Remove fbd_type enum
- panfrost: Pack invocation_shifts manually instead of a bit field
- panfrost: Remove asserts in panfrost_pack_work_groups_compute
- panfrost: Simplify sampler upload condition
- panfrost: Don't double-create scratchpad
- panfrost: Add PAN_MESA_DEBUG=precompile for shader-db
- panfrost: Let precompile imply shaderdb
- panfrost: Handle empty shaders
- pan/midgard: Use a reg temporary for mutiple writes
- pan/midgard: Hoist temporary coordinate for cubemaps
- pan/midgard: Set .shadow for shadow samplers
- pan/midgard: Set Z to shadow comparator for 2D
- pan/midgard: Add uniform/work heuristic
- pan/midgard: Implement textureOffset for 2D textures
- pan/midgard: Fix crash with txs
- pan/midgard: Lower txd with lower_tex
- panfrost: Decode shader types in pantrace shader-db
- pan/decode: Skip COMPUTE in blobber-db
- pan/decode: Prefix blobberdb with MESA_SHADER\_\*
- pan/decode: Append 0:0 spills:fills to blobber-db
- pan/midgard: Fix disassembler cycle/quadword counting
- pan/midgard: Bounds check lcra_restrict_range
- pan/midgard: Extend IS_VEC4_ONLY to arguments
- pan/midgard: Clamp LOD register swizzle
- pan/midgard: Expand swizzle for texelFetch
- pan/midgard: Fix fallthrough from offset to comparator
- pan/midgard: Do witchcraft on texture offsets
- pan/midgard: Generalize temp coordinate to non-2D
- pan/midgard: Implement shadow cubemaps
- pan/midgard: Enable lower_(un)pack\_\* lowering
- pan/midgard: Support loads from R11G11B10 in a blend shader
- pan/midgard: Add mir_upper_override helper
- pan/midgard: Compute destination override
- panfrost: Rename pan_instancing.c -> pan_attributes.c
- panfrost: Factor batch/resource out of instancing routines
- panfrost: Move instancing routines to encoder/
- panfrost: Factor out panfrost_compute_magic_divisor
- panfrost: Fix off-by-one in pan_invocation.c
- pan/decode: Fix reference computation for invocations
- panfrost: Slight cleanup of Gallium's pan_attribute.c
- panfrost: Remove pan_shift_odd
- pan/decode: Handle gl_VertexID/gl_InstanceID
- panfrost: Unset vertex_id_zero_based
- pan/midgard: Factor out emit_attr_read
- pan/midgard: Lower gl_VertexID/gl_InstanceID to attributes
- panfrost: Extend attribute_count for vertex builtins
- panfrost: Route gl_VertexID through cmdstream
- pan/midgard: Fix minor typo
- panfrost: Remove MALI_SPECIAL_ATTRIBUTE_BASE defines
- panfrost: Update information on fixed attributes/varyings
- panfrost: Remove MALI_ATTR_INTERNAL
- panfrost: Inline away MALI_NEGATIVE
- panfrost: Implement remaining texture wrap modes
- panfrost: Add pan_attributes.c to Android.mk
- panfrost: Add missing #include in common header
- panfrost: Remove mali_alt_func
- panfrost; Update comment about work/uniform_count
- panfrost: Remove 32-bit next_job path
- glsl: Set .flat for gl_FrontFacing
- pan/midgard: Promote tilebuffer reads to 32-bit
- pan/midgard: Use type-appropriate st_vary
- pan/midgard: Implement flat shading
- panfrost: Identify glProvokingVertex flag
- panfrost: Disable some CAPs we want lowered
- panfrost: Implement integer varyings
- panfrost: Remove MRT indirection in blend shaders
- panfrost: Respect glPointSize()
- pan/midgard: Convert fragment writeout to proper branches
- pan/midgard: Remove prepacked_branch
- panfrost: Handle RGB16F colour clear
- panfrost: Pack MRT blend shaders into a single BO
- pan/midgard: Fix memory corruption in constant combining
- pan/midgard: Use better heuristic for shader termination
- pan/midgard: Generalize IS_ALU and quadword_size
- pan/midgard: Generate MRT writeout loops
- pan/midgard: Remove old comment
- pan/midgard: Identity ld_color_buffer as 32-bit
- pan/midgard: Use upper ALU tags for MFBD writeout
- panfrost: Texture from Z32F_S8 as R32F
- panfrost: Support rendering to non-zero Z/S layers
- panfrost: Implement sRGB blend shaders
- panfrost: Cleanup tiling selection logic
- panfrost: Report MSAA 4x supported for dEQP
- panfrost: Handle PIPE_FORMAT_R10G10B10A2_USCALED
- panfrost: Respect constant buffer_offset
- panfrost: Adjust for mismatch between hardware/Gallium in arrays/cube
- pan/midgard: Account for z/w flip in texelFetch
- panfrost: Don't double-flip Z/W for 2D arrays
- pan/midgard: Support indirect UBO offsets
- panfrost: Fix linear depth textures
- pan/midgard: Bytemasks should round up, not round down
- panfrost: Identify un/pack colour opcodes
- pan/midgard: Fix recursive csel scheduling
- panfrost: Expose some functionality with dEQP flag
- panfrost: Compile tiling routines with -O3
- panfrost,lima: De-Galliumize tiling routines
- panfrost: Rework linear<--->tiled conversions
- panfrost: Add pandecode entries for ASTC/ETC formats
- panfrost: Fix crash in compute variant allocation
- panfrost: Drop mysterious zero=0xFFFF field
- panfrost: Don't use implicit mali_exception_status enum
- pan/decode: Remove last_size
- pan/midgard: Remove pack_color define
- pan/decode: Remove SHORT_SLIDE indirection
- panfrost: Fix 32-bit warning for \`indices\`
- pan/decode: Drop MFBD compute shader stuff
- pan/midgard: Record TEXTURE_OP_BARRIER
- pan/midgard: Disassemble barrier instructions
- pan/midgard: Validate barriers use a barrier tag
- pan/midgard: Handle tag 0x4 as texture
- pan/midgard: Remove float_bitcast
- pan/midgard: Fix missing prefixes
- pan/midgard: Don't crash with constants on unknown ops
- pan/midgard: Use fprintf instead of printf for constants
Andreas Baierl (14):
- lima: Beautify stream dumps
- lima: Parse VS and PLBU command stream while making a dump
- lima/streamparser: Fix typo in vs semaphore parser
- lima/streamparser: Add findings introduced with gl_PointSize
- lima/parser: Some fixes and cleanups
- lima/parser: Add RSW parsing
- lima/parser: Add texture descriptor parser
- lima: Rotate dump files after each finished pp frame
- lima: Fix dump file creation
- lima/parser: Fix rsw parser
- lima/parser: Fix VS cmd stream parser
- lima/parser: Make rsw alpha blend parsing more readable
- lima: Add stencil support
- lima: Fix alpha blending
Andres Rodriguez (1):
- vulkan/wsi: disable the hardware cursor
Andrii Simiklit (5):
- main: fix several 'may be used uninitialized' warnings
- glsl: fix an incorrect max_array_access after optimization of
ssbo/ubo
- glsl: fix a binding points assignment for ssbo/ubo arrays
- glsl/nir: do not change an element index to have correct block name
- mesa/st: fix a memory leak in get_version
Anthony Pesch (5):
- util: import xxhash
- util: move fnv1a hash implementation into its own header
- util/hash_table: replace \_mesa_hash_data's fnv1a hash function with
xxhash
- util/hash_table: added hash functions for integer types
- util/hash_table: update users to use new optimal integer hash
functions
Anuj Phogat (2):
- intel: Add device info for 1x4x6 Jasper Lake
- intel: Add pci-ids for Jasper Lake
Arno Messiaen (5):
- lima: fix stride in texture descriptor
- lima: add layer_stride field to lima_resource struct
- lima: introduce ppir_op_load_coords_reg to differentiate between
loading texture coordinates straight from a varying vs loading them
from a register
- lima: add cubemap support
- lima/ppir: add lod-bias support
Bas Nieuwenhuizen (33):
- radv: Fix timeout handling in syncobj wait.
- radv: Remove \_mesa_locale_init/fini calls.
- turnip: Remove \_mesa_locale_init/fini calls.
- anv: Remove \_mesa_locale_init/fini calls.
- radv: Fix disk_cache_get size argument.
- radv: Close all unnecessary fds in secure compile.
- radv: Do not change scratch settings while shaders are active.
- radv: Allocate cmdbuffer space for buffer marker write.
- radv: Enable VK_KHR_buffer_device_address.
- amd/llvm: Refactor ac_build_scan.
- radv: Unify max_descriptor_set_size.
- radv: Fix timeline semaphore refcounting.
- radv: Fix RGBX Android<->Vulkan format correspondence.
- amd/common: Fix tcCompatible degradation on Stoney.
- amd/common: Always use addrlib for HTILE tc-compat.
- radv: Limit workgroup size to 1024.
- radv: Expose all sample counts for integer formats as well.
- amd/common: Handle alignment of 96-bit formats.
- nir: Add clone/hash/serialize support for non-uniform tex
instructions.
- nir: print non-uniform tex fields.
- amd/common: Always initialize gfx9 mipmap offset/pitch.
- turnip: Use VK_NULL_HANDLE instead of NULL.
- meson: Enable -Werror=int-conversion.
- Revert "amd/common: Always initialize gfx9 mipmap offset/pitch."
- radv: Only use the gfx mipmap level offset/pitch for linear textures.
- spirv: Fix glsl type assert in spir2nir.
- radv: Emit a BATCH_BREAK when changing pixel shaders or
CB_TARGET_MASK.
- radv: Use new scanout gfx9 metadata flag.
- radv: Disable VK_EXT_sample_locations on GFX10.
- radv: Remove syncobj_handle variable in header.
- radv: Expose VK_KHR_swapchain_mutable_format.
- radv: Allow DCC & TC-compat HTILE with
VK_IMAGE_CREATE_EXTENDED_USAGE_BIT.
- radv: Do not set SX DISABLE bits for RB+ with unused surfaces.
Ben Crocker (1):
- llvmpipe: use ppc64le/ppc64 Large code model for JIT-compiled shaders
Bernd Kuhls (1):
- util/os_socket: Include unistd.h to fix build error
Boris Brezillon (21):
- panfrost: MALI_DEPTH_TEST is actually MALI_DEPTH_WRITEMASK
- panfrost: Destroy the upload manager allocated in
panfrost_create_context()
- panfrost: Release the ctx->pipe_framebuffer ref
- panfrost: Move BO cache related fields to a sub-struct
- panfrost: Try to evict unused BOs from the cache
- gallium: Fix the ->set_damage_region() implementation
- panfrost: Make sure we reset the damage region of RTs at flush time
- panfrost: Remove unneeded phi nodes
- panfrost/midgard: Fix swizzle for store instructions
- panfrost/midgard: Print the actual source register for store
operations
- panfrost/midgard: Use a union to manipulate embedded constants
- panfrost/midgard: Rework mir_adjust_constants() to make it type/size
agnostic
- panfrost/midgard: Make sure promote_fmov() only promotes 32-bit imovs
- panfrost/midgard: Factorize f2f and u2u handling
- panfrost/midgard: Add f2f64 support
- panfrost/midgard: Fix mir_print_instruction() for branch instructions
- panfrost/midgard: Add 64 bits float <-> int converters
- panfrost/midgard: Add missing lowering passes for type/size
conversion ops
- panfrost/midgard: Add a condense_writemask() helper
- panfrost/midgard: Prettify embedded constant prints
- panfrost: Fix the damage box clamping logic
Brian Ho (14):
- turnip: Update tu_query_pool with turnip-specific fields
- turnip: Implement vkCreateQueryPool for occlusion queries
- turnip: Implement vkCmdBeginQuery for occlusion queries
- turnip: Implement vkCmdEndQuery for occlusion queries
- turnip: Update query availability on render pass end
- turnip: Implement vkGetQueryPoolResults for occlusion queries
- turnip: Implement vkCmdResetQueryPool
- turnip: Implement vkCmdCopyQueryPoolResults for occlusion queries
- anv: Properly fetch partial results in vkGetQueryPoolResults
- anv: Handle unavailable queries in vkCmdCopyQueryPoolResults
- turnip: Enable occlusionQueryPrecise
- turnip: Free event->bo on vkDestroyEvent
- turnip: Fix vkGetQueryPoolResults with available flag
- turnip: Fix vkCmdCopyQueryPoolResults with available flag
Brian Paul (4):
- s/APIENTRY/GLAPIENTRY/ in teximage.c
- nir: fix a couple signed/unsigned comparison warnings in
nir_builder.h
- Call shmget() with permission 0600 instead of 0777
- nir: no-op C99 \_Pragma() with MSVC
C Stout (1):
- util/vector: Fix u_vector_foreach when head rolls over
Caio Marcelo de Oliveira Filho (24):
- spirv: Don't leak GS initialization to other stages
- glsl: Check earlier for MaxShaderStorageBlocks and MaxUniformBlocks
- glsl: Check earlier for MaxTextureImageUnits and MaxImageUniforms
- anv: Initialize depth_bounds_test_enable when not explicitly set
- spirv: Consider the sampled_image case in wa_glslang_179 workaround
- intel/fs: Lower 64-bit MOVs after lower_load_payload()
- intel/fs: Fix lowering of dword multiplication by 16-bit constant
- intel/vec4: Fix lowering of multiplication by 16-bit constant
- anv/gen12: Temporarily disable VK_KHR_buffer_device_address (and EXT)
- spirv: Implement SPV_KHR_non_semantic_info
- panfrost: Fix Makefile.sources
- anv: Drop unused function parameter
- anv: Ignore some CreateInfo structs when rasterization is disabled
- intel/fs: Only use SLM fence in compute shaders
- spirv: Drop EXT for PhysicalStorageBuffer symbols
- spirv: Handle PhysicalStorageBuffer in memory barriers
- nir: Add missing nir_var_mem_global to various passes
- intel/fs: Add FS_OPCODE_SCHEDULING_FENCE
- intel/fs: Add workgroup_size() helper
- intel/fs: Don't emit fence for shared memory if only one thread is
used
- intel/fs: Don't emit control barrier if only one thread is used
- anv: Always initialize target_stencil_layout
- intel/compiler: Add names for SHADER_OPCODE_[IU]SUB_SAT
- nir: Make nir_deref_path_init skip trivial casts
Chris Wilson (1):
- egl: Mention if swrast is being forced
Christian Gmeiner (24):
- drm-shim: fix EOF case
- etnaviv: rs: upsampling is not supported
- etnaviv: add drm-shim
- etnaviv: drop not used config_out function param
- etnaviv: use a more self-explanatory param name
- etnaviv: handle 8 byte block in tiling
- etnaviv: add support for extended pe formats
- etnaviv: fix integer vertex formats
- etnaviv: use NORMALIZE_SIGN_EXTEND
- etnaviv: fix R10G10B10A2 vertex format entries
- etnaviv: handle integer case for GENERIC_ATTRIB_SCALE
- etnaviv: remove dead code
- etnaviv: remove not used etna_bits_ones(..)
- etnaviv: drop compiled_rs_state forward declaration
- etnaviv: update resource status after flushing
- gallium: add PIPE_CAP_MAX_VERTEX_BUFFERS
- etnaviv: check if MSAA is supported
- etnaviv: gc400 does not support any vertex sampler
- etnaviv: use a better name for FE_VERTEX_STREAM_UNK14680
- etnaviv: move state based texture structs
- etnaviv: move descriptor based texture structs
- etnaviv: add deqp debug option
- etnaviv: drop default state for PE_STENCIL_CONFIG_EXT2
- etnaviv: drm-shim: add GC400
Connor Abbott (19):
- nir: Fix non-determinism in lower_global_vars_to_local
- radv: Rename ac_arg_regfile
- ac: Add a shared interface between radv, radeonsi, LLVM and ACO
- ac/nir, radv, radeonsi: Switch to using ac_shader_args
- radv: Move argument declaration out of nir_to_llvm
- aco: Constify radv_nir_compiler_options in isel
- aco: Use radv_shader_args in aco_compile_shader()
- aco: Split vector arguments at the beginning
- aco: Make num_workgroups and local_invocation_ids one argument each
- radv: Replace supports_spill with explict_scratch_args
- aco: Use common argument handling
- aco: Make unused workgroup id's 0
- nir: Maintain the algebraic automaton's state as we work.
- a6xx: Add more CP packets
- freedreno: Use new macros for CP_WAIT_REG_MEM and CP_WAIT_MEM_GTE
- freedreno: Fix CP_MEM_TO_REG flag definitions
- freedreno: Document CP_COND_REG_EXEC more
- freedreno: Document CP_UNK_A6XX_55
- freedreno: Document CP_INDIRECT_BUFFER_CHAIN
Daniel Ogorchock (2):
- panfrost: Fix panfrost_bo_access memory leak
- panfrost: Fix headers and gpu_headers memory leak
Daniel Schürmann (58):
- aco: fix immediate offset for spills if scratch is used
- aco: only use single-dword loads/stores for spilling
- aco: fix accidential reordering of instructions when scheduling
- aco: workaround Tonga/Iceland hardware bug
- aco: fix invalid access on Pseudo_instructions
- aco: preserve kill flag on moved operands during RA
- aco: rematerialize s_movk instructions
- aco: check if SALU instructions are predeceeded by exec when
calculating WQM needs
- aco: value number instructions using the execution mask
- aco: use s_and_b64 exec to reduce uniform booleans to one bit
- amd/llvm: Add Subgroup Scan functions for SI
- radv: Enable Subgroup Arithmetic and Clustered for SI
- aco: don't value-number instructions from within a loop with ones
after the loop.
- aco: don't split live-ranges of linear VGPRs
- aco: fix a couple of value numbering issues
- aco: refactor visit_store_fs_output() to use the Builder
- aco: Initial GFX7 Support
- aco: SI/CI - fix sampler aniso
- aco: fix SMEM offsets for SI/CI
- aco: implement nir_op_fquantize2f16 for SI/CI
- aco: only use scalar loads for readonly buffers on SI/CI
- aco: implement nir_op_isign on SI/CI
- aco: move buffer_store data to VGPR if needed
- aco: implement quad swizzles for SI/CI
- aco: recognize SI/CI SMRD hazards
- aco: fix disassembly of writelane instructions.
- aco: split read/writelane opcode into VOP2/VOP3 version for SI/CI
- aco: implement 64bit VGPR shifts for SI/CI
- aco: make 1/2*PI a literal constant on SI/CI
- aco: implement 64bit i2b for SI /CI
- aco: implement 64bit ine/ieq for SI/CI
- aco: disable disassembly for SI/CI due to lack of support by LLVM
- radv: only flush scalar cache for SSBO writes with ACO on GFX8+
- aco: flush denorms after fmin/fmax on pre-GFX9
- aco: don't use a scalar temporary for reductions on GFX10
- aco: implement (clustered) reductions for SI/CI
- aco: implement inclusive_scan for SI/CI
- aco: implement exclusive scan for SI/CI
- radv: disable Youngblood app profile if ACO is used
- aco: return to loop_active mask at continue_or_break blocks
- radv: Enable ACO on GFX7 (Sea Islands)
- aco: use soffset for MUBUF instructions on SI/CI
- aco: improve readfirstlane after uniform ssbo loads on GFX7
- aco: propagate temporaries into expanded vectors
- nir: fix printing of var_decl with more than 4 components.
- aco: compact various Instruction classes
- aco: compact aco::span<T> to use uint16_t offset and size instead of
pointer and size_t.
- aco: fix unconditional demote_to_helper
- aco: rework lower_to_cssa()
- aco: handle phi affinities transitively through parallelcopies
- aco: ignore parallelcopies to the same register on jump threading
- aco: fix combine_salu_not_bitwise() when SCC is used
- aco: reorder VMEM operands in ACO IR
- aco: fix register allocation with multiple live-range splits
- aco: simplify adjust_sample_index_using_fmask() & get_image_coords()
- aco: simplify gathering of MIMG address components
- docs: add new features for RADV/ACO.
- aco: fix image_atomic_cmp_swap
Daniel Stone (2):
- Revert "st/dri: do FLUSH_VERTICES before calling flush_resource"
- Revert "gallium: add st_context_iface::flush_resource to call
FLUSH_VERTICES"
Danylo Piliaiev (12):
- intel/blorp: Fix usage of uninitialized memory in key hashing
- i965/program_cache: Lift restriction on shader key size
- intel/blorp: Fix usage of uninitialized memory in key hashing
- intel/fs: Do not lower large local arrays to scratch on gen7
- i965: Unify CC_STATE and BLEND_STATE atoms on Haswell as a workaround
- glsl: Add varyings to "zero-init of uninitialized vars" workaround
- drirc: Add glsl_zero_init workaround for GpuTest
- iris/query: Implement PIPE_QUERY_GPU_FINISHED
- iris: Fix value of out-of-bounds accesses for vertex attributes
- i965: Do not set front_buffer_dirty if there is no front buffer
- st/mesa: Handle the rest renderbuffer formats from OSMesa
- st/nir: Unify inputs_read/outputs_written before serializing NIR
Dave Airlie (74):
- nir/serialize: pack function has name and entry point into flags.
- nir/serialize: fix serializing functions with no implementations.
- spirv: don't store 0 to cs.ptr_size for non kernel stages.
- spirv: get the correct type for function returns.
- spirv/nir/opencl: handle some multiply instructions.
- nir: add 64-bit ufind_msb lowering support. (v2)
- nouveau: request ufind_msb64 lowering in the frontend.
- vtn/opencl: add clz support
- nir: fix deref offset builder
- llvmpipe: initial query buffer object support. (v2)
- docs: add llvmpipe to ARB_query_buffer_object.
- gallivm: split out the flow control ir to a common file.
- gallivm: nir->tgsi info convertor (v2)
- gallivm: add popcount intrinsic wrapper
- gallivm: add cttz wrapper
- gallivm: add selection for non-32 bit types
- gallivm: add nir->llvm translation (v2)
- draw: add nir info gathering and building support
- gallium: add nir lowering passes for the draw pipe stages. (v2)
- gallivm: add swizzle support where one channel isn't defined.
- llvmpipe: add initial nir support
- nir/samplers: don't zero samplers_used/txf.
- llvmpipe/images: handle undefined atomic without crashing
- gallivm/llvmpipe: add support for front facing in sysval.
- llvmpipe: enable texcoord semantics
- gallium/scons: fix graw-xlib build on OSX.
- llvmpipe: add queries disabled flag
- llvmpipe: disable occlusion queries when requested by state tracker
- draw: add support for collecting primitives generated outside
streamout
- llvmpipe: enable support for primitives generated outside streamout
- aco: handle gfx7 int8/10 clamping on exports
- gallivm: add bitfield reverse and ufind_msb
- llvmpipe/nir: handle texcoord requirements
- gallivm: fix transpose for when first channel isn't created
- gallivm: fix perspective enable if usage_mask doesn't have 0 bit set
- gallivm/nir: cleanup code and call cmp wrapper
- gallivm/nir: copy compare ordering code from tgsi
- gallivm: add base instance sysval support
- gallivm/draw: add support for draw_id system value.
- gallivm: fixup base_vertex support
- llvmpipe: enable ARB_shader_draw_parameters.
- vtn: convert vload/store to single value loops
- vtn/opencl: add shuffle/shuffle support
- gallivm/nir: wrap idiv to avoid divide by 0 (v2)
- llvmpipe: switch to NIR by default
- nir: sanitize work group intrinsics to always be 32-bit.
- gallivm: add 64-bit const int creator.
- llvmpipe/gallivm: add kernel inputs
- gallivm: add support for 8-bit/16-bit integer builders
- gallivm: pick integer builders for alu instructions.
- gallivm/nir: allow 8/16-bit conversion and comparison.
- tgsi/mesa: handle KERNEL case
- gallivm/llvmpipe: add support for work dimension intrinsic.
- gallivm/llvmpipe: add support for block size intrinsic
- gallivm/llvmpipe: add support for global operations.
- llvmpipe: handle serialized nir as a shader type.
- llvmpipe: add support for compute shader params
- llvmpipe/nir: use nir_max_vec_components in more places
- gallivm: handle non-32 bit undefined
- llvmpipe: lower hadd/add_sat
- gallivm/nir: lower packing
- gallivm/nir: add vec8/16 support
- llvmpipe: add debug option to enable OpenCL support.
- gallivm: fixup const int64 builder.
- llvmpipe: enable ARB_shader_group_vote.
- gallium/util: add multi_draw_indirect to util_draw_indirect.
- llvmpipe: enable driver side multi draw indirect
- llvmpipe: add support for ARB_indirect_parameters.
- llvmpipe: add ARB_derivative_control support
- gallivm: fix gather component handling.
- llvmpipe: fix some integer instruction lowering.
- galllivm: fix gather offset casting
- gallivm: fix find lsb
- gallivm/nir: add missing break for isub.
David Heidelberg (1):
- .mailmap: use correct email address
David Stevens (1):
- virgl: support emulating planar image sampling
Denis Pauk (2):
- gallium/swr: Enable support bptc format.
- docs/features: mark GL_ARB_texture_compression_bptc as done for
llvmpipe, softpipe, swr
Dongwon Kim (3):
- gallium: enable INTEL_PERFORMANCE_QUERY
- iris: INTEL performance query implementation
- gallium: check all planes' pipe formats in case of multi-samplers
Drew Davenport (1):
- radeonsi: Clear uninitialized variable
Drew DeVault (1):
- st_get_external_sampler_key: improve error message
Duncan Hopkins (1):
- zink: make sure src image is transfer-src-optimal
Dylan Baker (69):
- Bump VERSION to 20.0.0-devel
- docs/new_features: Empty the feature list for the 20.0 cycle
- nir: correct use of identity check in python
- r200: use preprocessor for big vs little endian checks
- r100: Use preprocessor to select big vs little endian paths
- dri/osmesa: use preprocessor for selecting endian code paths
- util/u_endian: Use \_WIN32 instead of \_MSC_VER
- util/u_endian: set PIPE_ARCH_*_ENDIAN to 1
- mesa/main: replace uses of \_mesa_little_endian with preprocessor
- mesa/swrast: replace instances of \_mesa_little_endian with
preprocessor
- mesa/main: delete now unused \_mesa_little_endian
- gallium/osmesa: Use PIPE_ARCH_*_ENDIAN instead of little_endian
function
- util: rename PIPE_ARCH_*_ENDIAN to UTIL_ARCH_*_ENDIAN
- util/u_endian: Add error checks
- meson: Add dep_glvnd to egl deps when building with glvnd
- docs: add release notes for 19.2.3
- docs: add sha256 sum to 19.2.3 release notes
- docs: update calendar, add news item and link release notes for
19.2.2
- meson: gtest needs pthreads
- gallium/osmesa: Convert osmesa test to gtest
- osmesa/tests: Extend render test to cover other working cases
- util: Use ZSTD for shader cache if possible
- docs: Add release notes for 19.2.4
- docs: Add SHA256 sum for for 19.2.4
- docs: update calendar, add news item and link release notes for
19.2.4
- docs: Add relnotes for 19.2.5
- docs/relnotes/19.2.5: Add SHA256 sum
- docs: update calendar, add news item and link release notes for
19.2.5
- docs/release-calendar: Update for extended 19.3 rc period
- docs: Add release notes for 19.2.6
- docs: Add SHA256 sum for 19.2.6
- docs: update calendar, add news item and link release notes for
19.2.6
- gallium/auxiliary: Fix uses of gnu struct = {} extension
- meson: Add -Werror=gnu-empty-initializer to MSVC compat args
- docs: Add release notes for 19.2.7
- docs: Add SHA256 sums for 19.2.7
- docs: update calendar, add news item and link release notes for
19.2.7
- docs: Update mesa 19.3 release calendar
- meson/broadcom: libbroadcom_cle needs expat headers
- meson/broadcom: libbroadcom_cle also needs zlib
- docs: add release notes for 19.3.0
- docs/19.3.0: Add SHA256 sums
- docs: Update release notes, index, and calendar for 19.3.0
- dcos: add releanse notes for 19.3.1
- docs: Add release notes, update calendar, and add news for 19.3.1
- docs: add relnotes for 19.2.8
- docs/relnotes/19.2.8: Add SHA256 sum
- docs: Add release notes, news, and update calendar for 19.2.8
- docs: Add release notes for 19.3.2
- docs: add SHA256 sums for 19.3.2
- docs: Add release notes for 19.3.2, update calendar and home page
- docs: Update release calendar for 20.0
- docs: Add relnotes for 19.3.3 release
- docs: Add SHA 256 sums for 19.3.3
- docs: update news, calendar, and link release notes for 19.3.3
- VERSION: bump to 20.0.0-rc1
- bin/pick-ui: Add a new maintainer script for picking patches
- .pick_status.json: Update to 0d14f41625fa00187f690f283c1eb6a22e354a71
- .pick_status.json: Update to b550b7ef3b8d12f533b67b1a03159a127a3ff34a
- .pick_status.json: Update to 9afdcd64f2c96f3fcc1a28912987f2e8066aa995
- .pick_status.json: Update to 7eaf21cb6f67adbe0e79b80b4feb8c816a98a720
- VERSION: bump to 20.0-rc2
- .pick_status.json: Update to d8bae10bfe0f487dcaec721743cd51441bcc12f5
- .pick_status.json: Update to 689817c9dfde9a0852f2b2489cb0fa93ffbcb215
- .pick_status.json: Update to 23037627359e739c42b194dec54875aefbb9d00b
- VERSION: bump for 20.0.0-rc3
- .pick_status.json: Update to 2a98cf3b2ecea43cea148df7f77d2abadfd1c9db
- .pick_status.json: Update to 946eacbafb47c8b94d47e7c9d2a8b02fff5a22fa
- .pick_status.json: Update to bee5c9b0dc13dbae0ccf124124eaccebf7f2a435
Eduardo Lima Mitev (2):
- turnip: Remove failed command buffer from pool
- turnip: Fix issues in tu_compute_pipeline_create() that may lead to
crash
Elie Tournier (4):
- Docs: remove duplicate meson docs for windows
- docs: fix ascii html representation
- nir/algebraic: i2f(f2i()) -> trunc()
- nir/algebraic: sqrt(x)*sqrt(x) -> fabs(x)
Emmanuel Gil Peyrot (1):
- intel/compiler: Return early if read() failed
Eric Anholt (102):
- ci: Make lava inherit the ccache setup of the .build script.
- ci: Switch over to an autoscaling GKE cluster for builds.
- Revert "ci: Switch over to an autoscaling GKE cluster for builds."
- mesa/st: Add mapping of MESA_FORMAT_RGB_SNORM16 to gallium.
- gallium: Add defines for FXT1 texture compression.
- gallium: Add some more channel orderings of packed formats.
- gallium: Add an equivalent of MESA_FORMAT_BGR_UNORM8.
- gallium: Add equivalents of packed MESA_FORMAT_*UINT formats.
- mesa: Stop defining a full separate format for RGBA_UINT8.
- mesa/st: Test round-tripping of all compressed formats.
- mesa: Prepare for the MESA_FORMAT\_\* enum to be sparse.
- mesa: Redefine MESA_FORMAT\_\* in terms of PIPE_FORMAT_*.
- mesa/st: Gut most of st_mesa_format_to_pipe_format().
- mesa/st: Make st_pipe_format_to_mesa_format an effective no-op.
- u_format: Fix swizzle of A1R5G5B5.
- ci: Use several debian buster packages instead of hand-building.
- ci: Make the skip list regexes match the full test name.
- ci: Use cts_runner for our dEQP runs.
- ci: Enable all of GLES3/3.1 testing for softpipe.
- ci: Remove old commented copy of freedreno artifacts.
- ci: Disable flappy blit tests on a630.
- ci: Expand the freedreno blit skip regex to cover more cases.
- util: Move gallium's PIPE_FORMAT utils to /util/format/
- mesa: Move compile of common Mesa core files to a static lib.
- mesa/st: Simplify st_choose_matching_format().
- mesa: Don't put sRGB formats in the array format table.
- mesa/st: Reuse st_choose_matching_format from st_choose_format().
- util: Add a mapping from VkFormat to PIPE_FORMAT.
- turnip: Drop the copy of the formats table.
- ci: Move freedreno's parallelism to the runner instead of gitlab-ci
jobs.
- ci: Use a tag from the parallel-deqp-runner repo.
- nir: Add a scheduler pass to reduce maximum register pressure.
- nir: Refactor algebraic's block walk
- nir: Make algebraic backtrack and reprocess after a replacement.
- freedreno: Introduce a fd_resource_layer_stride() helper.
- freedreno: Introduce a fd_resource_tile_mode() helper.
- freedreno: Introduce a resource layout header.
- freedreno: Convert the slice struct to the new resource header.
- freedreno/a6xx: Log the tiling mode in resource layout debug.
- turnip: Disable timestamp queries for now.
- turnip: Fix unused variable warnings.
- turnip: Drop redefinition of VALIDREG now that it's in ir3.h.
- turnip: Reuse tu6_stage2opcode() more.
- turnip: Add basic SSBO support.
- turnip: Refactor the graphics pipeline create implementation.
- turnip: Add a helper function for getting tu_buffer iovas.
- turnip: Sanity check that we're adding valid BOs to the list.
- turnip: Move pipeline BO list adding to BindPipeline.
- turnip: Add support for compute shaders.
- ci: Disable egl_ext_device_drm tests in piglit.
- freedreno: Enable texture upload memory throttling.
- freedreno: Stop forcing ALLOW_MAPPED_BUFFERS_DURING_EXEC off.
- freedreno: Track the set of UBOs to be uploaded in UBO analysis.
- freedreno: Drop the extra offset field for mipmap slices.
- freedreno: Refactor the UBWC flags registers emission.
- freedreno: Move UBWC layout into a slices array like the non-UBWC
slices.
- tu: Move our image layout into a freedreno_layout struct.
- freedreno: Move a6xx's setup_slices() to a shareable helper function.
- freedreno: Switch the 16-bit workaround to match what turnip does.
- tu: Move UBWC layout into fdl6_layout() and use that function.
- turnip: Lower usub_borrow.
- turnip: Drop unused variable.
- turnip: Add support for descriptor arrays.
- turnip: Fix support for immutable samplers.
- ci: Fix caselist results archiving after parallel-deqp-runner rename.
- mesa: Fix detection of invalidating both depth and stencil.
- mesa/st: Deduplicate the NIR uniform lowering code.
- mesa/st: Move the vec4 type size function into core GLSL types.
- mesa/prog: Reuse count_vec4_slots() from ir_to_mesa.
- mesa/st: Move the dword slot counting function to glsl_types as well.
- i965: Reuse the new core glsl_count_dword_slots().
- nir: Fix printing of ~0 .locations.
- turnip: Refactor linkage state setup.
- mesa: Make atomic lowering put atomics above SSBOs.
- gallium: Pack the atomic counters just above the SSBOs.
- nir: Drop the ssbo_offset to atomic lowering.
- compiler: Add a note about how num_ssbos works in the program info.
- freedreno: Stop scattered remapping of SSBOs/images to IBOs.
- radeonsi: Remove a bunch of default handling of pipe caps.
- r600: Remove a bunch of default handling of pipe caps.
- r300: Remove a bunch of default handling of pipe caps.
- radeonsi: Drop PIPE_CAP_TGSI_ANY_REG_AS_ADDRESS.
- turnip: Fix some whitespace around binary operators.
- turnip: Refactor the intrinsic lowering.
- turnip: Add limited support for storage images.
- turnip: Disable UBWC on images used as storage images.
- turnip: Add support for non-zero (still constant) UBO buffer indices.
- turnip: Add support for uniform texel buffers.
- freedreno/ir3: Plumb the ir3_shader_variant into legalize.
- turnip: Add support for fine derivatives.
- turnip: Fix execution of secondary cmd bufs with nothing in primary.
- freedreno: Add some missing a6xx address declarations.
- freedreno: Fix OUT_REG() on address regs without a .bo supplied.
- turnip: Port krh's packing macros from freedreno to tu.
- turnip: Convert renderpass setup to the new register packing macros.
- turnip: Convert the rest of tu_cmd_buffer.c over to the new pack
macros.
- vulkan/wsi: Fix compiler warning when no WSI platforms are enabled.
- iris: Silence warning about AUX_USAGE_MC.
- mesa/st: Fix compiler warnings from INTEL_shader_integer_functions.
- ci: Enable -Werror on the meson-i386 build.
- tu: Fix binning address setup after pack macros change.
- Revert "gallium: Fix big-endian addressing of non-bitmask array
formats."
Eric Engestrom (58):
- meson: split out idep_xmlconfig_headers from idep_xmlconfig
- anv: add missing xmlconfig headers dependency
- radv: drop unnecessary xmlpool_options_h
- pipe-loader: drop unnecessary xmlpool_options_h
- loader: replace xmlpool_options_h with idep_xmlconfig_headers
- targets/omx: replace xmlpool_options_h with idep_xmlconfig_headers
- targets/va: replace xmlpool_options_h with idep_xmlconfig_headers
- targets/vdpau: replace xmlpool_options_h with idep_xmlconfig_headers
- targets/xa: replace xmlpool_options_h with idep_xmlconfig_headers
- targets/xvmc: replace xmlpool_options_h with idep_xmlconfig_headers
- dri: replace xmlpool_options_h with idep_xmlconfig_headers
- i915: replace xmlpool_options_h with idep_xmlconfig_headers
- nouveau: replace xmlpool_options_h with idep_xmlconfig_headers
- r200: replace xmlpool_options_h with idep_xmlconfig_headers
- radeon: replace xmlpool_options_h with idep_xmlconfig_headers
- meson: move idep_xmlconfig_headers to xmlpool/
- gitlab-ci: build a recent enough version of GLVND (ie. 1.2.0)
- meson: require glvnd 1.2.0
- meson: revert glvnd workaround
- meson: add variable to control the symbols checks
- meson: move the generic symbols check arguments to a common variable
- meson: add windows support to symbols checks
- meson: require \`nm\` again on Unix systems
- mesa/imports: let the build system detect strtok_r()
- egl: fix \_EGL_NATIVE_PLATFORM fallback
- egl: move #include of local headers out of Khronos headers
- gitlab-ci: build libdrm using meson instead of autotools
- gitlab-ci: auto-cancel CI runs when a newer commit is pushed to the
same branch
- CL: sync C headers with Khronos
- CL: sync C++ headers with Khronos
- vulkan: delete typo'd header
- egl: use EGL_CAST() macro in eglmesaext.h
- anv: add missing "fall-through" annotation
- vk_util: drop duplicate formats in vk_format_map[]
- meson: drop duplicate \`lib\` prefix on libiris_gen\*
- meson: drop \`intel_\` prefix on imgui_core
- docs: reword a bit and list HTTPS before FTP
- intel: add mi_builder_test for gen12
- intel/compiler: add ASSERTED annotation to avoid "unused variable"
warning
- intel/compiler: replace \`0\` pointer with \`NULL\`
- util/simple_mtx: don't set the canary when it can't be checked
- anv: drop unused #include
- travis: autodetect python version instead of hard-coding it
- util/format: remove left-over util_format_description_table
declaration
- util/format: add PIPE_FORMAT_ASTC_*x*x*_SRGB to
util_format_{srgb,linear}()
- util/format: add trivial srgb<->linear conversion test
- u_format: move format tests to util/tests/
- amd: fix empty-body issues
- nine: fix empty-body-issues
- meson: simplify install_megadrivers.py invocation
- mesa: avoid returning a value in a void function
- meson: use github URL for wraps instead of completely unreliable
wrapdb
- egl: drop confusing mincore() error message
- llvmpipe: drop LLVM < 3.4 support
- util/atomic: fix return type of p_atomic_add_return() fallback
- util/os_socket: fix header unavailable on windows
- freedreno/perfcntrs: fix fd leak
- util/disk_cache: check for write() failure in the zstd path
Erico Nunes (17):
- lima: fix nir shader memory leak
- lima: fix bo submit memory leak
- lima/ppir: enable lower_fdph
- gallium/util: add alignment parameter to util_upload_index_buffer
- lima: allocate separate bo to store varyings
- lima: refactor indexed draw indices upload
- vc4: move the draw splitting routine to shared code
- lima: split draw calls on 64k vertices
- lima/ppir: fix lod bias src
- lima/ppir: remove assert on ppir_emit_tex unsupported feature
- lima: set shader caps to optimize control flow
- lima/ppir: remove orphan load node after cloning
- lima/ppir: implement full liveness analysis for regalloc
- lima/ppir: handle write to dead registers in ppir
- lima/ppir: fix ssa undef emit
- lima/ppir: split ppir_op_undef into undef and dummy again
- lima/ppir: fix src read mask swizzling
Erik Faye-Lund (82):
- zink: heap-allocate samplers objects
- zink: emit line-width when using polygon line-mode
- anv: remove incorrect polygonMode=point early-out
- zink: use actual format for render-pass
- zink: always allow mutating the format
- zink: do not advertize coherent mapping
- zink: disable fragment-shader texture-lod
- zink: transition resources before resolving
- zink: always allow sampling of images
- zink: use u_blitter when format-reinterpreting
- zink/spirv: drop temp-array for component-count
- zink/spirv: support loading bool constants
- zink/spirv: implement bany_fnequal[2-4]
- zink/spirv: implement bany_inequal[2-4]
- zink/spirv: implement ball_iequal[2-4]
- zink/spirv: implement ball_fequal[2-4]
- zink: do advertize integer support in shaders
- zink/spirv: add support for nir_op_flrp
- zink: correct depth-stencil format
- nir: patch up deref-vars when lowering clip-planes
- zink: always allow transfer to/from buffers
- zink: implement buffer-to-buffer copies
- zink: remove no-longer-needed hack
- zink: move format-checking to separate source
- zink: move filter-helper to separate helper-header
- zink: move blitting to separate source
- zink: move drawing separate source
- st/mesa: unmap pbo after updating cache
- zink: use true/false instead of TRUE/FALSE
- zink: reject invalid sample-counts
- zink: fix crash when restoring sampler-states
- zink: delete query rather than allocating a new one
- zink: do not try to destroy NULL-fence
- zink: handle calloc-failure
- zink: avoid NULL-deref
- zink: avoid NULL-deref
- zink: avoid NULL-deref
- zink: error-check right variable
- zink: silence coverity error
- zink: enable PIPE_CAP_MIXED_COLORBUFFER_FORMATS
- zink: implement nir_texop_txd
- zink: implement txf
- zink: implement some more trivial opcodes
- zink: simplify front-face type
- zink: factor out builtin-var creation
- zink: implement load_vertex_id
- zink: use nir_fmul_imm
- zink: remove unused code-path in lower_pos_write
- nir/zink: move clip_halfz-lowering to common code
- etnaviv: use nir_lower_clip_halfz instead of open-coding
- st/mesa: use uint-samplers for sampling stencil buffers
- zink: fixup initialization of operand_mask / num_extra_operands
- util: initialize float-array with float-literals
- st/wgl: eliminate implicit cast warning
- gallium: fix a warning
- mesa/st: use float literals
- docs: fix typo in html tag name
- docs: fix paragraphs
- docs: open paragraph before closing it
- docs: use code-tag instead of pre-tag
- docs: use code-tags instead of pre-tags
- docs: use code-tags instead of pre-tags
- docs: move paragraph closing tag
- docs: remove double-closed definition-list
- docs: do not double-close link tag
- docs: do not use definition-list for sub-topics
- docs: use figure/figcaption instead of tables
- docs: remove trailing header
- docs: remove leading spaces
- docs: remove trailing newlines
- docs: use [1] instead of asterisk for footnote
- docs: remove pointless, stray newline
- docs: fixup indentation
- zink: implement nir_texop_txs
- zink: support offset-variants of texturing
- zink: avoid incorrect vector-construction
- zink: store image-type per texture
- zink: support sampling non-float textures
- zink: support arrays of samplers
- zink: set compareEnable when setting compareOp
- st/mesa: use uint-result for sampling stencil buffers
- Revert "nir: Add a couple trivial abs optimizations"
Florian Will (1):
- radv/winsys: set IB flags prior to submit in the sysmem path
Francisco Jerez (26):
- glsl: Fix software 64-bit integer to 32-bit float conversions.
- intel/fs/gen11+: Handle ROR/ROL in lower_simd_width().
- intel/fs/gen8+: Fix r127 dst/src overlap RA workaround for EOT
message payload.
- intel/fs: Fix nir_intrinsic_load_barycentric_at_sample for SIMD32.
- intel/fs/cse: Fix non-deterministic behavior due to inaccurate
liveness calculation.
- intel/fs: Make implied_mrf_writes() an fs_inst method.
- intel/fs: Try to vectorize header setup in lower_load_payload().
- intel/fs: Generalize fs_reg::is_contiguous() to register files other
than VGRF.
- intel/fs: Rework fs_inst::is_copy_payload() into multiple
classification helpers.
- intel/fs: Extend copy propagation dataflow analysis to copies with
FIXED_GRF source.
- intel/fs: Add partial support for copy-propagating FIXED_GRFs.
- intel/fs: Add support for copy-propagating a block of multiple
FIXED_GRFs.
- intel/fs: Allow limited copy propagation of a LOAD_PAYLOAD into
another.
- intel/fs/gen4-6: Allocate registers from aligned_pairs_class based on
LINTERP use.
- intel/fs/gen6: Constrain barycentric source of LINTERP during bank
conflict mitigation.
- intel/fs/gen6: Generalize aligned_pairs_class to SIMD16 aligned
barycentrics.
- intel/fs/gen6: Use SEL instead of bashing thread payload for unlit
centroid workaround.
- intel/fs: Split fetch_payload_reg() into separate helper for
barycentrics.
- intel/fs: Introduce barycentric layout lowering pass.
- intel/fs: Switch to standard vector layout for barycentrics at
optimization time.
- intel/fs/cse: Make HALT instruction act as CSE barrier.
- intel/fs/gen7: Fix fs_inst::flags_written() for
SHADER_OPCODE_FIND_LIVE_CHANNEL.
- intel/fs: Add virtual instruction to load mask of live channels into
flag register.
- intel/fs/gen12: Workaround unwanted SEND execution due to broken
NoMask control flow.
- intel/fs/gen12: Fixup/simplify SWSB annotations of SIMD32 scratch
writes.
- intel/fs/gen12: Workaround data coherency issues due to broken NoMask
control flow.
Fritz Koenig (1):
- freedreno: reorder format check
Georg Lehmann (3):
- Correctly wait in the fragment stage until all semaphores are
signaled
- Vulkan Overlay: Don't try to change the image layout to present twice
- Vulkan overlay: use the corresponding image index for each swapchain
Gert Wollny (12):
- r600: Disable eight bit three channel formats
- virgl: Increase the shader transfer buffer by doubling the size
- gallium/tgsi_from_mesa: Add 'extern "C"' to be able to include from
C++
- nir: make nir_get_texture_size/lod available outside nir_lower_tex
- gallium: tgsi_from_mesa - handle VARYING_SLOT_FACE
- r600: Add functions to dump the shader info
- r600: Make it possible to include r600_asm.h in a C++ file
- r600/sb: Correct SB disassambler for better debugging
- r600: Fix maximum line width
- r600: Make SID and unsigned value
- r600: Delete vertex buffer only if there is actually a shader state
- mesa/st: glsl_to_nir: don't lower atomics to SSBOs if driver supports
HW atomics
Guido Günther (2):
- etnaviv: drm: Don't miscalculate timeout
- freedreno/drm: Don't miscalculate timeout
Gurchetan Singh (11):
- drirc: set allow_higher_compat_version for Faster Than Light
- virgl/drm: update UAPI
- teximage: split out helper from EGLImageTargetTexture2DOES
- glapi / teximage: implement EGLImageTargetTexStorageEXT
- dri_util: add driImageFormatToSizedInternalGLFormat function
- i965: track if image is created by a dmabuf
- i965: refactor intel_image_target_texture_2d
- i965: support EXT_EGL_image_storage
- st/dri: track if image is created by a dmabuf
- st/mesa: refactor egl image binding a bit
- st/mesa: implement EGLImageTargetTexStorage
Hyunjun Ko (7):
- freedreno/ir3: cleanup by removing repeated code
- freedreno: support 16b for the sampler opcode
- freedreno/ir3: fix printing output registers of FS.
- freedreno/ir3: fixup when changing to mad.f16
- freedreno/ir3: enable half precision for pre-fs texture fetch
- turnip: fix invalid VK_ERROR_OUT_OF_POOL_MEMORY
- freedreno/ir3: put the conversion back for half const to the right
place.
Iago Toral Quiroga (32):
- v3d: rename vertex shader key (num)_fs_inputs fields
- mesa/st: make sure we remove dead IO variables before handing NIR to
backends
- glsl: add missing initialization of the location path field
- v3d: fix indirect BO allocation for uniforms
- v3d: actually root the first BO in a command list in the job
- v3d: add missing plumbing for VPM load instructions
- v3d: add debug assert
- v3d: enable debug options for geometry shader dumps
- v3d: remove unused variable
- v3d: add initial compiler plumbing for geometry shaders
- v3d: fix packet descriptions for geometry and tessellation shaders
- v3d: emit geometry shader state commands
- v3d: implement geometry shader instancing
- v3d: add 1-way SIMD packing definition
- v3d: compute appropriate VPM memory configuration for geometry shader
workloads
- v3d: we always have at least one output segment
- v3d: add support for adjacency primitives
- v3d: don't try to render if shaders failed to compile
- v3d: predicate geometry shader outputs inside non-uniform control
flow
- v3d: save geometry shader state for blitting
- v3d: support transform feedback with geometry shaders
- v3d: remove obsolete assertion
- v3d: do not limit new CL space allocations with branch to 4096 bytes
- v3d: support rendering to multi-layered framebuffers
- v3d: move layer rendering to a separate helper
- v3d: handle writes to gl_Layer from geometry shaders
- v3d: fix primitive queries for geometry shaders
- v3d: disable lowering of indirect inputs
- v3d: support precompiling geometry shaders
- v3d: expose OES_geometry_shader
- u_vbuf: don't try to delete NULL driver CSO
- v3d: fix bug when checking result of syncobj fence import
Ian Romanick (39):
- intel/compiler: Report the number of non-spill/fill SEND messages on
vec4 too
- nir/algebraic: Add the ability to mark a replacement as exact
- nir/algebraic: Mark other comparison exact when removing a == a
- intel/fs: Disable conditional discard optimization on Gen4 and Gen5
- nir/range-analysis: Add pragmas to help loop unrolling
- nir/range_analysis: Make sure the table validation only occurs once
- nir/opt_peephole_select: Don't count some unary operations
- intel/compiler: Increase nir_opt_peephole_select threshold
- nir/algebraic: Simplify some Inf and NaN avoidance code
- nir/algebraic: Rearrange bcsel sequences generated by
nir_opt_peephole_select
- intel/compiler: Fix 'comparison is always true' warning
- mesa: Silence 'left shift of negative value' warning in BPTC
compression code
- mesa: Silence unused parameter warning
- anv: Fix error message format string
- mesa: Extension boilerplate for INTEL_shader_integer_functions2
- glsl: Add new expressions for INTEL_shader_integer_functions2
- glsl_types: Add function to get an unsigned base type from a signed
type
- glsl: Add built-in functions for INTEL_shader_integer_functions2
- nir: Add new instructions for INTEL_shader_integer_functions2
- nir/algebraic: Add lowering for uabs_usub and uabs_isub
- nir/algebraic: Add lowering for 64-bit hadd and rhadd
- nir/algebraic: Add lowering for 64-bit usub_sat
- nir/algebraic: Add lowering for 64-bit uadd_sat
- nir/algebraic: Add lowering for 64-bit iadd_sat and isub_sat
- compiler: Translate GLSL IR to NIR for new
INTEL_shader_integer_functions2 expressions
- intel/fs: Don't lower integer multiplies that don't need lowering
- intel/fs: Add SHADER_OPCODE_[IU]SUB_SAT pseudo-ops
- intel/fs: Implement support for NIR opcodes for
INTEL_shader_integer_functions2
- nir/spirv: Translate SPIR-V to NIR for new
INTEL_shader_integer_functions2 opcodes
- spirv: Silence a bunch of unused parameter warnings
- spirv: Add support for IntegerFunctions2INTEL capability
- i965: Enable INTEL_shader_integer_functions2 on Gen8+
- gallium: Add a cap bit for OpenCL-style extended integer functions
- gallium: Add a cap bit for integer multiplication between 32-bit and
16-bit
- iris: Enable INTEL_shader_integer_functions2
- anv: Enable SPV_INTEL_shader_integer_functions2 and
VK_INTEL_shader_integer_functions2
- nir/algebraic: Optimize some 64-bit integer comparisons involving
zero
- relnotes: Add GL_INTEL_shader_integer_functions2 and
VK_INTEL_shader_integer_functions2
- intel/fs: Don't count integer instructions as being possibly coissue
Icecream95 (16):
- gallium/auxiliary: Reduce conversions in
u_vbuf_get_minmax_index_mapped
- gallium/auxiliary: Handle count == 0 in
u_vbuf_get_minmax_index_mapped
- panfrost: Add negative lod bias support
- panfrost: Compact the bo_access readers array
- panfrost: Dynamically allocate shader variants
- panfrost: Add ETC1/ETC2 texture formats
- panfrost: Add ASTC texture formats
- pan/midgard: Fix bundle dynarray leak
- pan/midgard: Fix a memory leak in the disassembler
- pan/midgard: Support disassembling to a file
- pan/bifrost: Support disassembling to a file
- pan/decode: Support dumping to a file
- pan/decode: Dump to a file
- pan/decode: Rotate trace files
- panfrost: Don't copy uniforms when the size is zero
- pan/midgard: Fix a liveness info leak
Icenowy Zheng (2):
- lima: support indexed draw with bias
- lima: fix lima_set_vertex_buffers()
Ilia Mirkin (7):
- gm107/ir: fix loading z offset for layered 3d image bindings
- nv50/ir: mark STORE destination inputs as used
- nv50,nvc0: fix destination coordinates of blit
- nvc0: add dummy reset status support
- gm107/ir: avoid combining geometry shader stores at 0x60
- nvc0: treat all draws without color0 broadcast as MRT
- nvc0: disable xfb's which don't have a stride
Italo Nicola (1):
- intel/compiler: remove old comment
Iván Briano (4):
- intel/compiler: Don't change hstride if not needed
- anv: Export filter_minmax support only when it's really supported
- anv: Export VK_KHR_buffer_device_address only when really supported
- anv: Enable Vulkan 1.2 support
James Xiong (3):
- iris: try to set the specified tiling when importing a dmabuf
- gallium: dmabuf support for yuv formats that are not natively
supported
- gallium: let the pipe drivers decide the supported modifiers
Jan Vesely (2):
- clover: Initialize Asm Parsers
- clover: Use explicit conversion from llvm::StringRef to std::string
Jan Zielinski (8):
- gallium/swr: Fix depth values for blit scenario
- swr/rasterizer: Add tessellator implementation to the rasterizer
- gallium/swr: Fix Windows build
- gallium/gallivm/tgsi: enable tessellation shaders
- gallium/gallivm: enable linking lp_bld_printf function with C++ code
- gallium/swr: implementation of tessellation shaders compilation
- gallium/swr: fix tessellation state save/restore
- docs: Update SWR tessellation support
Faith Ekstrand (212):
- util: Add a util_sparse_array data structure
- anv: Move refcount to anv_bo
- anv: Use a util_sparse_array for the GEM handle -> BO map
- anv: Fix a relocation race condition
- anv: Stop storing the GEM handle in anv_reloc_list_add
- anv: Declare the bo in the anv_block_pool_foreach_bo loop
- anv: Inline anv_block_pool_get_bo
- anv: Replace ANV_BO_EXTERNAL with anv_bo::is_external
- anv: Handle state pool relocations using "wrapper" BOs
- anv: Fix a potential BO handle leak
- anv: Rework anv_block_pool_expand_range
- anv: Use anv_block_pool_foreach_bo in get_bo_from_pool
- anv: Rework the internal BO allocation API
- anv: Choose BO flags internally in anv_block_pool
- anv/tests: Zero-initialize instances
- anv/tests: Initialize the BO cache and device mutex
- anv: Allocate block pool BOs from the cache
- anv: Use the query_slot helper in vkResetQueryPoolEXT
- anv: Allocate query pool BOs from the cache
- anv: Set more flags on descriptor pool buffers
- anv: Allocate descriptor buffers from the BO cache
- util: Add a free list structure for use with util_sparse_array
- anv: Allocate batch and fence buffers from the cache
- anv: Allocate scratch BOs from the cache
- anv: Allocate misc BOs from the cache
- anv: Drop anv_bo_init and anv_bo_init_new
- anv: Add a device parameter to anv_execbuf_add_bo
- anv: Set the batch allocator for compute pipelines
- anv: Use a bitset for tracking residency
- anv: Zero released anv_bo structs
- anv: Use the new BO alloc API for Android
- anv: Don't delete fragment shaders that write sample mask
- anv: Don't claim the null RT as a valid color target
- anv: Stop compacting render targets in the binding table
- anv: Move the RT BTI flush workaround to begin_subpass
- spirv: Remove the type from sampled_image
- spirv: Add a vtn_decorate_pointer helper
- spirv: Sort out the mess that is sampled image
- nir/builder: Add a nir_extract_bits helper
- nir: Add tests for nir_extract_bits
- intel/nir: Use nir_extract_bits in lower_mem_access_bit_sizes
- intel/fs: Add DWord scattered read/write opcodes
- intel/fs: refactor surface header setup
- intel/nir: Plumb devinfo through lower_mem_access_bit_sizes
- intel/fs: Implement the new load/store_scratch intrinsics
- intel/fs: Lower large local arrays to scratch
- anv: Lock around fetching sync file FDs from semaphores
- anv: Plumb timeline semaphore signal/wait values through from the API
- spirv: Fix the MSVC build
- anv/pipeline: Assume layout != NULL
- genxml: Mark everything in genX_pack.h always_inline
- anv: Input attachments are always single-plane
- anv: Flatten descriptor bindings in anv_nir_apply_pipeline_layout
- anv: Delete dead shader constant pushing code
- anv: Stop bounds-checking pushed UBOs
- anv: Pre-compute push ranges for graphics pipelines
- intel/compiler: Add a flag to avoid compacting push constants
- anv: Re-arrange push constant data a bit
- anv: Rework push constant handling
- anv: Use a switch statement for binding table setup
- anv: More carefully dirty state in BindDescriptorSets
- anv: More carefully dirty state in BindPipeline
- anv: Use an anv_state for the next binding table
- anv: Emit a NULL vertex for zero base_vertex/instance
- nir: Validate that variables are in the right lists
- iris: Re-enable param compaction
- Revert "i965/fs: Merge CMP and SEL into CSEL on Gen8+"
- vulkan/enum_to_str: Handle out-of-order aliases
- anv/entrypoints: Better handle promoted extensions
- vulkan: Update the XML and headers to 1.1.129
- anv: Push constants are relative to dynamic state on IVB
- anv: Set up SBE_SWIZ properly for gl_Viewport
- anv: Respect the always_flush_cache driconf option
- iris: Stop setting up fake params
- anv: Drop bo_flags from anv_bo_pool
- anv: Add a has_softpin boolean
- blorp: Pass the VB size to the VF cache workaround
- anv: Always invalidate the VF cache in BeginCommandBuffer
- anv: Apply cache flushes after setting index/draw VBs
- anv: Use PIPE_CONTROL flushes to implement the gen8 VF cache WA
- anv: Don't leak when set_tiling fails
- util/atomic: Add a \_return variant of p_atomic_add
- anv: Disallow allocating above heap sizes
- anv: Stop tracking VMA allocations
- anv: Set up VMA heaps independently from memory heaps
- anv: Stop advertising two heaps just for the VF cache WA
- anv: Add an explicit_address parameter to anv_device_alloc_bo
- util/vma: Factor out the hole splitting part of util_vma_heap_alloc
- util/vma: Add a function to allocate a particular address range
- anv: Add allocator support for client-visible addresses
- anv: Use a pNext loop in AllocateMemory
- anv: Implement VK_KHR_buffer_device_address
- util/atomic: Add p_atomic_add_return for the unlocked path
- vulkan/wsi: Provide the implicitly synchronized BO to vkQueueSubmit
- vulkan/wsi: Add a hooks for signaling semaphores and fences
- anv: Always add in EXEC_OBJECT_WRITE when specified in extra_flags
- anv: Use submit-time implicit sync instead of allocate-time
- anv: Add a fence_reset_reset_temporary helper
- anv: Use BO fences/semaphores for AcquireNextImage
- anv: Return VK_ERROR_OUT_OF_DEVICE_MEMORY for too-large buffers
- anv: Re-capture all batch and state buffers
- anv: Re-emit all compute state on pipeline switch
- ANV: Stop advertising smoothLines support on gen10+
- anv: Flush the queue on DeviceWaitIdle
- anv: Unconditionally advertise Vulkan 1.1
- anv: Bump the advertised patch version to 129
- i965: Enable GL_EXT_gpu_shader4 on Gen6+
- anv: Properly advertise sampledImageIntegerSampleCounts
- anv: Drop unneeded struct keywords
- blorp: Stop whacking Z24 depth to BGRA8
- blorp: Allow reading with HiZ
- i965/blorp: Don't resolve HiZ unless we're reinterpreting
- intel/blorp: Use the source format when using blorp_copy with HiZ
- anv: Allow HiZ in TRANSFER_SRC_OPTIMAL on Gen8-9
- i965: Allow HiZ for glCopyImageSubData sources
- intel/nir: Add a memory barrier before barrier()
- intel/disasm: Fix decoding of src0 of SENDS
- genxml: Remove a non-existant HW bit
- anv: Don't add dynamic state base address to push constants on Gen7
- anv: Flag descriptors dirty when gl_NumWorkgroups is used
- anv: Re-use flush_descriptor_sets in flush_compute_state
- intel/vec4: Support scoped_memory_barrier
- nir: Handle more barriers in dead_write and copy_prop
- nir: Handle barriers with more granularity in combine_stores
- llmvpipe: No-op implement more barriers
- nir: Add a new memory_barrier_tcs_patch intrinsic
- spirv: Add a workaround for OpControlBarrier on old GLSLang
- spirv: Add output memory semantics to OpControlBarrier in TCS
- nir/glsl: Emit memory barriers as part of barrier()
- intel/nir: Stop adding redundant barriers
- nir: Rename nir_intrinsic_barrier to control_barrier
- nir/lower_atomics_to_ssbo: Also lower barriers
- anv: Drop an unused variable
- intel/blorp: Fill out all the dwords of MI_ATOMIC
- anv: Don't over-advertise descriptor indexing features
- anv: Memset array properties
- vulkan/wsi: Add a driconf option to force WSI to advertise
BGRA8_UNORM first
- vulkan: Update the XML and headers to 1.2.131
- turnip: Pretend to support Vulkan 1.2
- anv: Bump the patch version to 131
- anv,nir: Lower quad_broadcast with dynamic index in NIR
- anv: Implement the new core version feature queries
- anv: Implement the new core version property queries
- relnotes: Add Vulkan 1.2
- anv: Drop some VK_IMAGE_TILING_OPTIMAL checks
- anv: Support modifiers in GetImageFormatProperties2
- vulkan/wsi: Move the ImageCreateInfo higher up
- vulkan/wsi: Use the interface from the real modifiers extension
- vulkan/wsi: Filter modifiers with ImageFormatProperties
- vulkan/wsi: Implement VK_KHR_swapchain_mutable_format
- anv/blorp: Rename buffer image stride parameters
- anv: Canonicalize buffer formats for image/buffer copies
- anv: Add an anv_physical_device field to anv_device
- anv: Take an anv_device in vk_errorf
- anv: Take a device in anv_perf_warn
- anv: Stop allocating WSI event fences off the instance
- anv: Drop the instance pointer from anv_device
- anv: Move the physical device dispatch table to anv_instance
- anv: Drop separate chipset_id fields
- anv: Re-arrange physical_device_init
- anv: Allow enumerating multiple physical devices
- anv/apply_pipeline_layout: Initialize the nir_builder before use
- intel/blorp: resize src and dst surfaces separately
- anv: Use TRANSFER_SRC_OPTIMAL for depth/stencil MSAA resolves
- anv: Add a layout_to_aux_state helper
- anv: Use isl_aux_state for HiZ resolves
- anv: Add a usage parameter to anv_layout_to_aux_usage
- anv: Allow HiZ in read-only depth layouts
- anv: Improve BTI change cache flushing
- intel/fs: Don't unnecessarily fall back to indirect sends on Gen12
- intel/disasm: Properly disassemble indirect SENDs
- intel/isl: Plumb devinfo into isl_genX(buffer_fill_state_s)
- intel/isl: Add a hack for the Gen12 A0 texture buffer bug
- anv: Rework the meaning of anv_image::planes[]::aux_usage
- anv: Replace aux_surface.isl.size_B checks with aux_usage checks
- intel/aux-map: Add some #defines
- intel/aux-map: Factor out some useful helpers
- anv: Delete a redundant calculation
- isl: Add a helper for calculating subimage memory ranges
- anv: Add another align_down helper
- anv: Make AUX table invalidate a PIPE\_\* bit
- anv: Make anv_vma_alloc/free a lot dumber
- anv: Rework CCS memory handling on TGL-LP
- intel/blorp: Add support for CCS_E copies with UNORM formats
- intel/isl: Allow CCS_E on more formats
- intel/genxml: Make SO_DECL::"Hole Flag" a Boolean
- anv: Insert holes for non-existant XFB varyings
- intel/blorp: Handle bit-casting UNORM and BGRA formats
- anv: Replace one more aux_surface.isl.size_B check
- intel/mi_builder: Force write completion on Gen12+
- anv: Set actual state pool sizes when we have softpin
- anv: Re-use one old BT block in reset_batch_bo_chain
- anv/block_pool: Ensure allocations have contiguous maps
- anv: Rename a variable
- genxml: Add a new 3DSTATE_SF field on gen12
- anv,iris: Set 3DSTATE_SF::DerefBlockSize to per-poly on Gen12+
- intel/genxml: Drop SLMEnable from L3CNTLREG on Gen11
- iris: Set SLMEnable based on the L3$ config
- iris: Store the L3$ configs in the screen
- iris: Use the URB size from the L3$ config
- i965: Re-emit l3 state before BLORP executes
- intel: Take a gen_l3_config in gen_get_urb_config
- intel/blorp: Always emit URB config on Gen7+
- iris: Consolodate URB emit
- anv: Emit URB setup earlier
- intel/common: Return the block size from get_urb_config
- intel/blorp: Plumb deref block size through to 3DSTATE_SF
- anv: Plumb deref block size through to 3DSTATE_SF
- iris: Plumb deref block size through to 3DSTATE_SF
- anv: Always fill out the AUX table even if CCS is disabled
- intel/fs: Write the address register with NoMask for MOV_INDIRECT
- anv/blorp: Use the correct size for vkCmdCopyBufferToImage
Jonathan Gray (4):
- winsys/amdgpu: avoid double simple_mtx_unlock()
- i965: update Makefile.sources for perf changes
- util/futex: use futex syscall on OpenBSD
- util/u_thread: don't restrict u_thread_get_time_nano() to \__linux_\_
Jonathan Marek (98):
- freedreno: add Adreno 640 ID
- freedreno/ir3: disable texture prefetch for 1d array textures
- freedreno/registers: fix a6xx_2d_blit_cntl ROTATE
- etnaviv: blt: use only for tiling, and add missing formats
- etnaviv: separate PE and RS formats, use only RS only for tiling
- etnaviv: blt: set TS dirty after clear
- turnip: add display wsi
- turnip: add x11 wsi
- turnip: implement CmdClearColorImage/CmdClearDepthStencilImage
- turnip: fix sRGB GMEM clear
- util: add missing R8G8B8A8_SRGB format to vk_format_map
- freedreno/regs: update UBWC related bits
- turnip: implement UBWC
- etnaviv: avoid using RS for 64bpp formats
- etnaviv: implement 64bpp clear
- etnaviv: blt: fix partial ZS clears with TS
- etnaviv: support 3d/array/integer formats in texture descriptors
- turnip: fix integer render targets
- freedreno/registers: add missing MH perfcounter enum for a2xx
- freedreno/perfcntrs: add a2xx MH counters
- freedreno/perfcntrs/fdperf: fix u64 print on 32-bit builds
- freedreno/perfcntrs/fdperf: add missing a20x compatible
- freedreno/perfcntrs/fdperf: add missing a2xx case in select_counter
- turnip: fix display wsi fence timing out
- turnip: don't skip unused attachments when setting up tiling config
- turnip: implement CmdClearAttachments
- turnip: don't set unused BLIT_DST_INFO bits for GMEM clear
- turnip: MSAA resolve directly from GMEM
- turnip: allow writes to draw_cs outside of render pass
- turnip: add function to allocate aligned memory in a substream cs
- turnip: improve emit_textures
- turnip: implement border color
- turnip: add hw binning
- turnip: fix incorrectly failing assert
- freedreno/ir3: add GLSL_SAMPLER_DIM_SUBPASS to tex_info
- freedreno/registers: add a6xx texture format for stencil sampler
- turnip: fix hw binning render area
- turnip: fix tile layout logic
- turnip: update tile_align_w/tile_align_h
- turnip: set load_layer_id to zero
- turnip: set FRAG_WRITES_SAMPMASK bit
- turnip: fix VK_IMAGE_ASPECT_STENCIL_BIT image view
- turnip: no 8x msaa on 128bpp formats
- turnip: add dirty bit for push constants
- turnip: subpass rework
- turnip: CmdClearAttachments fixes
- turnip: implement subpass input attachments
- etnaviv: remove sRGB formats from format table
- etnaviv: sRGB render target support
- etnaviv: set output mode and saturate bits
- etnaviv: update INT_FILTER choice for GLES3 formats
- etnaviv: disable integer vertex formats on pre-HALTI2 hardware
- etnaviv: remove swizzle from format table
- etnaviv: add missing formats
- etnaviv: add missing vs_needs_z_div handling to NIR backend
- turnip: use single substream cs
- turnip: use common blit path for buffer copy
- turnip: don't require src image to be set for clear blits
- turnip: implement CmdFillBuffer/CmdUpdateBuffer
- freedreno/ir3: lower mul_2x32_64
- turnip: fix emit_textures for compute shaders
- turnip: remove compute emit_border_color
- turnip: fix emit_ibo
- turnip: change emit_ibo to be like emit_textures
- turnip: remove duplicate A6XX_SP_CS_CONFIG_NIBO
- nir: add option to lower half packing opcodes
- freedreno/ir3: lower pack/unpack ops
- turnip: don't set LRZ enable at end of renderpass
- freedreno/ir3: update prefetch input_offset when packing inlocs
- turnip: add cache invalidate to fix input attachment cases
- turnip: don't set SP_FS_CTRL_REG0_VARYING if only fragcoord is used
- freedreno/ir3: fix vertex shader sysvals with pre_assign_inputs
- freedreno/registers: document vertex/instance id offset bits
- freedreno/ir3: support load_base_instance
- turnip: emit base instance vs driver param
- turnip: emit_compute_driver_params fixes
- turnip: compute gmem offsets at renderpass creation time
- turnip: implement secondary command buffers
- nir: fix assign_io_var_locations for vertex inputs
- turnip: minor warning fixes
- util/format: add missing vulkan formats
- turnip: disable B8G8R8 vertex formats
- etnaviv: fix incorrectly failing vertex size assert
- etnaviv: update headers from rnndb
- etnaviv: HALTI2+ instanced draw
- etnaviv: implement gl_VertexID/gl_InstanceID
- etnaviv: remove unnecessary vertex_elements_state_create error
checking
- st/mesa: don't lower YUV when driver supports it natively
- st/mesa: run st_nir_lower_tex_src_plane for lowered xyuv/ayuv
- freedreno/ir3: allow inputs with the same location
- turnip: remove tu_sort_variables_by_location
- turnip: fix array/matrix varyings
- turnip: hook up GetImageDrmFormatModifierPropertiesEXT
- turnip: set linear tiling for scanout images
- vulkan/wsi: remove unused image_get_modifier
- turnip: simplify tu_physical_device_get_format_properties
- etnaviv: implement UBOs
- turnip: hook up cmdbuffer event set/wait
Jordan Justen (7):
- iris: Add IRIS_DIRTY_RENDER_BUFFER state flag
- iris/gen11+: Move flush for render target change
- iris: Allow max dynamic pool size of 2GB for gen12
- intel: Remove unused Tigerlake PCI ID
- iris: Fix some indentation in iris_init_render_context
- iris: Emit CS Stall before Instruction Cache flush for gen12 WA
- anv: Emit CS Stall before Instruction Cache flush for gen12 WA
Jose Maria Casanova Crespo (1):
- v3d: Fix predication with atomic image operations
Juan A. Suarez Romero (3):
- nir/lower_double_ops: relax lower mod()
- Revert "nir/lower_double_ops: relax lower mod()"
- nir/spirv: skip unreachable blocks in Phi second pass
Kai Wasserbäch (4):
- nir: fix unused variable warning in nir_lower_vars_to_explicit_types
- nir: fix unused variable warning in
find_and_update_previous_uniform_storage
- nir: fix unused function warning in src/compiler/nir/nir.c
- intel/gen_decoder: Fix unused-but-set-variable warning
Karol Herbst (14):
- nv50/ir: fix crash in isUniform for undefined values
- nir/validate: validate num_components on registers and intrinsics
- nir/serialize: fix vec8 and vec16
- nir/tests: add serializer tests
- nir/tests: MSVC build fix
- spirv: handle UniformConstant for OpenCL kernels
- clover/nir: treat UniformConstant as global memory
- clover/nir: set spirv environment to OpenCL
- clover/spirv: allow Int64 Atomics for supported devices
- nir: handle nir_deref_type_ptr_as_array in
rematerialize_deref_in_block
- nv50/ir: implement global atomics and handle it for nir
- nir/serialize: cast swizzle before shifting
- aco: use NIR_MAX_VEC_COMPONENTS instead of 4
- nv50ir/nir: support vec8 and vec16
Kenneth Graunke (57):
- iris: Fix "Force Zero RTA Index Enable" setting again
- nir: Handle image arrays when setting variable data
- Revert "intel/blorp: Fix usage of uninitialized memory in key
hashing"
- iris: Properly move edgeflag_out from output list to global list
- iris: Wrap iris_fix_edge_flags in NIR_PASS
- mesa: Handle GL_COLOR_INDEX in \_mesa_format_from_format_and_type().
- iris: Change keybox parenting
- iris: Stop mutating the resource in get_rt_read_isl_surf().
- iris: Drop 'old_address' parameter from iris_rebind_buffer
- iris: Create an "iris_surface_state" wrapper struct
- iris: Maintain CPU-side SURFACE_STATE copies for views and surfaces.
- iris: Update SURFACE_STATE addresses when setting sampler views
- iris: Disable VF cache partial address workaround on Gen11+
- driconf, glsl: Add a vs_position_always_invariant option
- drirc: Set vs_position_always_invariant for Shadow of Mordor on Intel
- st/mesa: Add GL_TDFX_texture_compression_FXT1 support
- iris: Map FXT1 texture formats
- meson: Add a "prefer_iris" build option
- main: Change u_mmAllocMem align2 from bytes (old API) to bits (new
API)
- meson: Include iris in default gallium-drivers for x86/x86_64
- util: Detect use-after-destroy in simple_mtx
- intel/genxml: Add a partial TCCNTLREG definition
- iris: Enable Gen11 Color/Z write merging optimization
- anv: Enable Gen11 Color/Z write merging optimization
- intel/decoder: Make get_state_size take a full 64-bit address and a
base
- iris: Create smaller program keys without legacy features
- iris: Default to X-tiling for scanout buffers without modifiers
- iris: Alphabetize source files after iris_perf.c was added
- drirc: Final Fantasy VIII: Remastered needs
allow_higher_compat_version
- iris: Make helper functions to turn iris shader keys into brw keys.
- iris: Fix shader recompile debug printing
- iris: Avoid replacing backing storage for buffers with no contents
- intel: Drop Gen11 WaBTPPrefetchDisable workaround
- st/nir: Optionally unify inputs_read/outputs_written when linking.
- iris: Set nir_shader_compiler_options::unify_interfaces.
- st/mesa: Allow ASTC5x5 fallbacks separately from other ASTC LDR
formats.
- iris: Disable ASTC 5x5 support on Gen9 for now.
- iris: Delete remnants of the unimplemented ASTC 5x5 workaround
- iris: Allow HiZ for copy_region sources
- anv: Only enable EWA LOD algorithm when doing anisotropic filtering.
- Revert "nir: assert that nir_lower_tex runs after lowering derefs"
- i965: Simplify brw_get_renderer_string()
- iris: Simplify iris_get_renderer_string()
- intel: Use similar brand strings to the Windows drivers
- intel/compiler: Fix illegal mutation in get_nir_image_intrinsic_image
- iris: Fix export of fences that have already completed.
- st/mesa: Allocate full miplevels if MaxLevel is explicitly set
- iris: Drop some workarounds which are no longer necessary
- anv: Drop some workarounds that are no longer necessary
- intel: Fix aux map alignments on 32-bit builds.
- meson: Prefer 'iris' by default over 'i965'.
- loader: Check if the kernel driver is i915 before loading iris
- iris: Drop 'engine' from iris_batch.
- iris: Make iris_emit_default_l3_config pull devinfo from the batch
- iris: Support multiple chained batches.
- i965: Use brw_batch_references in tex_busy check
- loader: Fix leak of kernel driver name
Kristian Høgsberg (62):
- freedreno/registers: Fix typo
- freedreno/registers: Move SP_PRIMITIVE_CNTL and SP_VS_VPC_DST
- freedreno/registers: Add comments about primitive counters
- freedreno/a6xx: Fix primitive counters again
- freedreno/a6xx: Clear sysmem with CP_BLIT
- freedreno: Add nogmem debug option to force bypass rendering
- freedreno/a6xx: Fix layered texture type enum
- freedreno/a6x: Rename z/s formats
- freedreno/a6xx: Add register offset for STG/LDG
- freedreno/ir3: Emit link map as byte or dwords offsets as needed
- freedreno/ir3: Add load and store intrinsics for global io
- freedreno: Don't count primitives for patches
- freedreno/ir3: Add ir3 intrinsics for tessellation
- freedreno/ir3: Use imul24 in offset calculations
- freedreno/ir3: Add tessellation field to shader key
- freedreno/ir3: Extend geometry lowering pass to handle tessellation
- freedreno/ir3: Add new synchronization opcodes
- freedreno/ir3: End TES with chsh when using GS
- freedreno/ir3: Implement tess coord intrinsic
- freedreno/ir3: Implement TCS synchronization intrinsics
- freedreno/ir3: Setup inputs and outputs for tessellation stages
- freedreno/ir3: Don't assume binning shader is always VS
- freedreno/ir3: Pre-color TCS header and primitive ID inputs
- freedreno/ir3: Allocate const space for tessellation parameters
- freedreno/a6xx: Build the right draw command for tessellation
- freedreno/a6xx: Allocate and program tessellation buffer
- freedreno/a6xx: Emit constant parameters for tessellation stages
- freedreno/a6xx: Program state for tessellation stages
- freedreno: Use bypass rendering for tessellation
- freedreno/a6xx: Only set emit.hs/ds when we're drawing patches
- freedreno/blitter: Save tessellation state
- freedreno/a6xx: Only use merged regs and four quads for VS+FS
- freedreno/a6xx: Turn on tessellation shaders
- freedreno/ir3: Use regid() helper when setting up precolor regs
- freedreno/registers: Remove duplicate register definitions
- freedreno: New struct packing macros
- freedreno/registers: Add 64 bit address registers
- freedreno/a6xx: Drop stale include
- freedreno/a6xx: Include fd6_pack.h in a few files
- freedreno/a6xx: Convert emit_mrt() to OUT_REG()
- freedreno/a6xx: Convert emit_zs() to OUT_REG()
- freedreno/a6xx: Convert VSC pipe setup to OUT_REG()
- freedreno/a6xx: Convert gmem blits to OUT_REG()
- freedreno/a6xx: Convert some tile setup to OUT_REG()
- freedreno/a6xx: Silence warning for unused perf counters
- freedreno/a6xx: Document the CP_SET_DRAW_STATE enable bits
- freedreno/a6xx: Make DEBUG_BLIT_FALLBACK only dump fallbacks
- freedreno: Add debug flag for forcing linear layouts
- freedreno/a6xx: Program sampler swap based on resource tiling
- freedreno/a6xx: Pick blitter swap based on resource tiling
- freedreno/a6xx: Add fd_resource_swap() helper
- freedreno/a6xx: Use blitter for resolve blits
- freedreno/a6xx: RB6_R8G8B8 is actually 32 bit RGBX
- freedreno/a6xx: Use A6XX_SP_2D_SRC_FORMAT_MASK macro
- freedreno/a6xx: Handle srgb blits on the blitter
- freedreno/a6xx: Move handle_rgba_blit() up
- freedreno/a6xx: Rewrite compressed blits in a helper function
- freedreno/a6xx: Set up multisample sysmem MRTs correctly
- st/mesa: Lower vars to ssa and constant prop before
gl_nir_lower_buffers
- ir3: Set up full/half register conflicts correctly
- iris: Advertise PIPE_CAP_NATIVE_FENCE_FD
- iris: Print warning and return \*out = NULL when fd to syncobj fails
Krzysztof Raszkowski (10):
- gallium/swr: Fix GS invocation issues - Fixed proper setting
gl_InvocationID. - Fixed GS vertices output memory overflow.
- gallium/swr: Enable some ARB_gpu_shader5 extensions Enable / add to
features.txt: - Enhanced textureGather. - Geometry shader instancing.
- Geometry shader multiple streams.
- gallium/swr: Fix crash when use GL_TDFX_texture_compression_FXT1
format.
- gallivm: add TGSI bit arithmetic opcodes support
- gallium/swr: Fix glVertexPointer race condition.
- gallium/swr: Disable showing detected arch message.
- docs/GL4: update gallium/swr features
- gallium/swr: add option for static link
- gallium/swr: Fix gcc 4.8.5 compile error
- gallium/swr: simplify environmental variabled expansion code
Lasse Lopperi (1):
- freedreno/drm: Fix memory leak in softpin implementation
Laurent Carlier (1):
- egl: avoid local modifications for eglext.h Khronos standard header
file
Leo Liu (1):
- ac: add missing Arcturus to the info of pc lines
Lepton Wu (2):
- gallium: dri2: Use index as plane number.
- android: mesa: Revert "android: mesa: revert "Enable asm
unconditionally""
Lionel Landwerlin (60):
- intel/dev: set default num_eu_per_subslice on gen12
- intel/perf: add TGL support
- intel/perf: fix Android build
- mesa: check draw buffer completeness on
glClearBufferfi/glClearBufferiv
- vulkan: bump headers/registry to 1.1.127
- anv: Properly handle host query reset of performance queries
- anv: implement VK_KHR_separate_depth_stencil_layouts
- mesa: check framebuffer completeness only after state update
- anv: invalidate file descriptor of semaphore sync fd at vkQueueSubmit
- anv: remove list items on batch fini
- anv: detach batch emission allocation from device
- anv: expose timeout helpers outside of anv_queue.c
- anv: move queue init/finish to anv_queue.c
- anv: allow NULL batch parameter to anv_queue_submit_simple_batch
- anv: prepare driver to report submission error through queues
- anv: refcount semaphores
- anv: prepare the driver for delayed submissions
- anv/wsi: signal the semaphore in the acquireNextImage
- anv: implement VK_KHR_timeline_semaphore
- intel/dev: flag the Elkhart Lake platform
- intel/perf: add EHL performance query support
- intel/perf: fix invalid hw_id in query results
- intel/perf: set read buffer len to 0 to identify empty buffer
- intel/perf: take into account that reports read can be fairly old
- intel/perf: simplify the processing of OA reports
- intel/perf: fix improper pointer access
- anv: fix missing gen12 handling
- anv: fix incorrect VMA alignment for CCS main surfaces
- anv: fix fence underlying primitive checks
- anv: fix assumptions about temporary fence payload
- intel/perf: drop batchbuffer flushing at query begin
- i965/iris: perf-queries: don't invalidate/flush 3d pipeline
- anv: constify pipeline layout in nir passes
- anv: drop unused parameter from apply layout pass
- vulkan/wsi: error out when image fence doesn't signal
- mesa: avoid triggering assert in implementation
- i965/iris/perf: factor out frequency register capture
- loader: fix close on uninitialized file descriptor value
- anv: don't close invalid syncfd semaphore
- anv: fix intel perf queries availability writes
- anv: set stencil layout for input attachments
- iris: Implement Gen12 workaround for non pipelined state
- anv: Implement Gen12 workaround for non pipelined state
- anv: only use VkSamplerCreateInfo::compareOp if enabled
- anv: fix pipeline switch back for non pipelined states
- genxml: add new Gen11+ PIPE_CONTROL field
- iris: handle new PIPE_CONTROL field
- iris: implement another workaround for non pipelined states
- anv: implement another workaround for non pipelined states
- intel/perf: expose timestamp begin for mdapi
- intel/perf: report query split for mdapi
- anv: enable VK_KHR_swapchain_mutable_format
- anv: don't report error with other vendor DRM devices
- anv: ensure prog params are initialized with 0s
- anv/iris: warn gen12 3DSTATE_HS restriction
- intel: Implement Gen12 workaround for array textures of size 1
- isl: drop CCS row pitch requirement for linear surfaces
- isl: add gen12 comment about CCS for linear tiling
- anv: implement gen9 post sync pipe control workaround
- anv: set MOCS on push constants
Luis Mendes (1):
- radv: fix radv secure compile feature breaks compilation on armhf
EABI and aarch64
Marco Felsch (1):
- etnaviv: Fix assert when try to accumulate an invalid fd
Marek Olšák (245):
- glsl: encode/decode types using a union with bitfields for
readability
- glsl: encode vector_elements and matrix_columns better
- glsl: encode explicit_stride for basic types better
- glsl: encode array types better
- glsl: encode struct/interface types better
- st/mesa: call nir_opt_access only once
- st/mesa: call nir_lower_flrp only once per shader
- compiler: make variable::data::binding unsigned
- nir: pack nir_variable::\data::stream
- nir: pack nir_variable::\data::xfb\_\*
- radeonsi: use IR SHA1 as the cache key for the in-memory shader cache
- radeonsi: don't keep compute shader IR after compilation
- radeonsi: keep serialized NIR instead of nir_shader in
si_shader_selector
- nir: pack the rest of nir_variable::data
- nir/serialize: don't expand 16-bit variable state slots to 32 bits
- nir/serialize: store 32-bit object IDs instead of 64-bit
- nir/serialize: pack nir_variable flags
- mesa: expose SPIR-V extensions in the Compatibility profile too
- util: add blob_finish_get_buffer
- radeonsi/nir: call nir_serialize only once per shader
- radeonsi/nir: fix compute shader crash due to nir_binary == NULL
- glsl/linker: pass shader_info to analyze_clip_cull_usage directly
- compiler: pack shader_info from 160 bytes to 96 bytes
- st/mesa: fix Sanctuary and Tropics by disabling ARB_gpu_shader5 for
them
- st/mesa: rename DEBUG_TGSI -> DEBUG_PRINT_IR
- st/mesa: remove \\n being only printed in debug builds after printed
TGSI
- st/mesa: print TCS/TES/GS/CS TGSI in the right place & keep disk
cache enabled
- st/mesa: add ST_DEBUG=nir to print NIR shaders
- st/mesa: remove unused TGSI-only debug printing functions
- gallium/noop: call finalize_nir
- radeonsi/nir: remove dead function temps
- radeonsi/nir: call nir_lower_flrp only once per shader
- radeonsi/nir: don't lower fma, instead, fuse fma
- mesa: enable glthread for 7 Days To Die
- st/mesa: rename delete_basic_variant -> delete_common_variant
- st/mesa: decrease the size of st_fp_variant_key from 48 to 40 bytes
- st/mesa: start deduplicating some program code
- st/mesa: initialize affected_states and uniform storage earlier in
deserialize
- st/mesa: consolidate and simplify code flagging
program::affected_states
- st/mesa: trivially merge st_vertex_program into st_common_program
- st/mesa: rename st_common_program to st_program
- st/mesa: cleanups after unification of st_vertex/common program
- st/mesa: rename occurences of stcp to stp to correspond to st_program
- st/mesa: more cleanups after unification of st_vertex/common_program
- st/mesa: subclass st_vertex_program for VP-specific members
- st/mesa: call nir_sweep in st_finalize_nir
- st/mesa: keep serialized NIR instead of nir_shader in st_program
- st/mesa: call nir_serialize only once per shader
- nir: move data.image.access to data.access
- nir/print: only print image.format for image variables
- glsl_to_nir: rename image_access to mem_access
- nir: move data.descriptor_set above data.index for better packing
- nir: don't use GLenum16 in nir.h
- ac: add radeon_info::num_rings and move ring_type to amd_family.h
- ac: fill num_rings for remaining IPs
- winsys/amdgpu: detect noop dependencies on the same ring correctly
- nir: strip as we serialize to remove the nir_shader_clone call
- nir/serialize: do ctx = {0} instead of manual initializations
- util/blob: add 8-bit and 16-bit reads and writes
- nir/serialize: pack instructions better
- nir/serialize: pack src better and limit the object count to 1M from
1G
- nir/serialize: don't serialize var->data for temporaries
- nir/serialize: deduplicate serialized var types by reusing the last
unique one
- nir/serialize: try to store a diff in var data locations instead of
var data
- nir/serialize: pack load_const with non-64-bit constants better
- nir/serialize: pack 1-component constants into 20 bits if possible
- nir/serialize: pack nir_intrinsic_instr::const_index[] better
- nir/serialize: try to pack two alu srcs into 1 uint32
- nir/serialize: don't store deref types if not needed
- nir/serialize: don't serialize mode for deref non-cast instructions
- nir/serialize: try to put deref->var index into the unused bits of
the header
- nir/serialize: cleanup - fold nir_deref_type_var cases into switches
- nir/serialize: try to pack both deref array src into 32 bits
- nir/serialize: remove up to 3 consecutive equal ALU instruction
headers
- nir/serialize: reuse the writemask field for 2 src X swizzles of SSA
ALU
- nir/serialize: serialize swizzles for vec8 and vec16
- nir/serialize: serialize writemask for vec8 and vec16
- nir/serialize: don't serialize redundant
nir_intrinsic_instr::num_components
- nir/serialize: use 3 unused bits in intrinsic for
packed_const_indices
- nir/serialize: support any num_components for remaining instructions
- ac: set swizzled bit in cache policy as a hint not to merge
loads/stores
- radeonsi: initialize the per-context compiler on demand
- radeonsi/nir: don't run si_nir_opts again if there is no change
- st/mesa: don't serialize all streamout state if there are no SO
outputs
- st/mesa: don't use redundant stp->state.ir.nir
- st/mesa: don't call ProgramStringNotify in glsl_to_nir
- st/mesa: propagate gl_PatchVerticesIn from TCS to TES before linking
for NIR
- st/mesa: simplify looping over linked shaders when linking NIR
- st/mesa: don't use \*\* in the st_nir_link_shaders signature
- st/mesa: add st_variant base class to simplify code for shader
variants
- ac/nir: don't rely on data.patch for tess factors
- radeonsi/nir: implement subgroup system values for SPIR-V
- radeonsi: simplify the interface of
get_dw_address_from_generic_indices
- radeonsi: simplify get_tcs_tes_buffer_address_from_generic_indices
- radeonsi/nir: validate is_patch because SPIR-V doesn't set it for
tess factors
- radeonsi/nir: don't rely on data.patch for tess factors
- radeonsi/nir: fix location_frac handling for TCS outputs
- radeonsi/nir: support interface output types to fix SPIR-V xfb
piglits
- radeonsi: enable SPIR-V and GL 4.6 for NIR
- util/driconfig: print ATTENTION if MESA_DEBUG=silent is not set
- radeonsi/gfx10: simplify some duplicated NGG GS code
- radeonsi/gfx10: fix the vertex order for triangle strips emitted by a
GS
- llvmpipe: implement TEX_LZ and TXF_LZ opcodes
- gallivm: implement LOAD with CONSTBUF but don't enable it for
llvmpipe
- st/mesa: support UBOs for Selection/Feedback/RasterPos
- st/mesa: save currently bound vertex samplers and sampler views in
st_context
- st/mesa: support samplers for Selection/Feedback/RasterPos
- st/mesa: support SSBOs for Selection/Feedback/RasterPos
- st/mesa: support shader images for Selection/Feedback/RasterPos
- st/mesa: use a separate VS variant for the draw module
- st/mesa: remove st_vp_variant::num_inputs
- st/mesa: remove struct st_vp_variant in favor of st_common_variant
- st/mesa: don't generate VS TGSI if NIR is enabled
- draw, st/mesa: generate TGSI for ffvp/ARB_vp if draw lacks LLVM
- st/mesa: release the draw shader properly to fix driver crashes
(iris)
- st/dri: assume external consumers of back buffers can write to the
buffers
- radeonsi: enable NIR by default and document GL 4.6 support
- radeonsi/gfx10: disable vertex grouping
- radeonsi/gfx10: simplify the tess_turns_off_ngg condition
- radeonsi: don't rely on CLEAR_STATE to set PA_SC_GENERIC_SCISSOR\_\*
- ac: fix ac_get_i1_sgpr_mask for Wave32
- ac: fix the return value in cull_bbox when bbox culling is disabled
- radeonsi: deduplicate ES and GS thread enablement code
- radeonsi: disallow compute-based culling if polygon mode is enabled
- radeonsi: set is_monolithic for VS prologs when the shader is really
monolithic
- radeonsi: don't wrap the VS prolog in if (ES thread) .. endif
- radeonsi/gfx10: don't insert NGG streamout atomics if they are never
used
- radeonsi: allow generating VS prologs with 0 inputs
- radeonsi: fix determining whether the VS prolog is needed
- radeonsi: reset more fields in si_llvm_context_set_ir to fix reusing
ctx
- radeonsi/gfx10: fix ngg_get_ordered_id
- amd/addrlib: update to the latest version
- ac/surface: fix an assertion failure on gfx9 in CMASK computation
- radeonsi/gfx10: don't declare any LDS for NGG if it's not used
- radeonsi/gfx10: enable NGG passthrough for eligible shaders
- radeonsi/gfx10: improve performance for TES using PrimID but not
exporting it
- Revert "u_vbuf: Regard non-constant vbufs with non-instance elements
as free"
- winsys/radeon: initialize pte_fragment_size
- radeonsi: preserve the scanout flag for shared resources on gfx9 and
gfx10
- radeonsi: ignore PIPE_BIND_SCANOUT for imported textures
- radeonsi: remove the "display_dcc_offset == 0" assertion
- radeonsi: rename SDMA debug flags
- radeonsi: remove broken and unused SI SDMA image copy code
- radeonsi: add AMD_DEBUG=nodmaclear for debugging
- radeonsi: add AMD_DEBUG=nodmacopyimage for debugging
- radeonsi: rename dma_cs -> sdma_cs
- radeonsi: move SI and CIK+ SDMA code into 1 common function for
cleanups
- radeonsi: disable SDMA on gfx8 to fix corruption on RX 580
- radeonsi: remove TGSI
- gallium: put u_vbuf_get_caps return values into u_vbuf_caps
- gallium/cso_context: move non-vbuf vertex buffer and element code
into helpers
- gallium: bypass u_vbuf if it's not needed (no fallbacks and no user
VBOs)
- ac/gpu_info: always use distributed tessellation on gfx10
- radeonsi: fix monolithic pixel shaders with two-sided colors and
SampleMaskIn
- radeonsi: fix context roll tracking in si_emit_shader_vs
- radeonsi: test polygon mode enablement accurately
- radeonsi: determine accurately if line stippling is enabled for
performance
- radeonsi: clean up messy si_emit_rasterizer_prim_state
- ac: unify build_sendmsg_gs_alloc_req
- ac: unify primitive export code
- ac/gpu_info: add pc_lines and use it in radeonsi
- ac: add 128-bit bitcount
- ac: add ac_build_s_endpgm
- radeonsi/gfx9: force the micro tile mode for MSAA resolve correctly
on gfx9
- radeonsi: rename desc_list_byte_size -> vb_desc_list_alloc_size
- radeonsi: add si_context::num_vertex_elements
- radeonsi: don't allow draw calls with uninitialized VS inputs
- radeonsi: simplify si_set_vertex_buffers
- ac,radeonsi: increase the maximum number of shader args and return
values
- radeonsi: put up to 5 VBO descriptors into user SGPRs
- radeonsi: don't enable VBOs in user SGPRs if compute-based culling
can be used
- radeonsi: fix assertion and other failures in
si_emit_graphics_shader_pointers
- radeonsi: actually enable VBOs in user SGPRs
- radeonsi: don't adjust depth and stencil PS output locations
- radeonsi: rename DBG_NO_TGSI -> DBG_NO_NIR
- radeonsi: remove TGSI from comments
- radeonsi: rename si_shader_info -> si_shader_binary_info
- radeonsi: fork tgsi_shader_info and tgsi_tessctrl_info
- radeonsi: merge si_tessctrl_info into si_shader_info
- radeonsi: clean up si_shader_info
- radeonsi: rename si_compile_tgsi_main -> si_build_main_function
- radeonsi: rename si_shader_create -> si_create_shader_variant for
clarity
- radeonsi: fold si_create_function into si_llvm_create_func
- radeonsi: remove always constant ballot_mask_bits from
si_llvm_context_init
- radeonsi: move PS LLVM code into si_shader_llvm_ps.c
- radeonsi: separate code computing info for small primitive culling
- ac/cull: don't read Position.Z if it's not needed for culling
- radeonsi: make si_insert_input\_\* functions non-static
- radeonsi: move VS_STATE.LS_OUT_PATCH_SIZE a few bits higher to make
space there
- radeonsi/gfx10: separate code for getting edgeflags from the
gs_invocation_id VGPR
- radeonsi/gfx10: separate code for determining the number of vertices
for NGG
- radeonsi: fix si_build_wrapper_function for compute-based primitive
culling
- radeonsi: work around an LLVM crash when using
llvm.amdgcn.icmp.i64.i1
- radeonsi: move si_insert_input\_\* functions
- radeonsi: move tessellation shader code into si_shader_llvm_tess.c
- radeonsi: remove llvm_type_is_64bit
- radeonsi: move geometry shader code into si_shader_llvm_gs.c
- radeonsi: move code for shader resources into
si_shader_llvm_resources.c
- radeonsi: remove useless #includes
- radeonsi: merge si_compile_llvm and si_llvm_compile functions
- gallium: add st_context_iface::flush_resource to call FLUSH_VERTICES
- st/dri: do FLUSH_VERTICES before calling flush_resource
- Revert "radeonsi: unbind image before compute clear"
- radeonsi: clean up how internal compute dispatches are handled
- radeonsi: don't invoke decompression inside internal launch_grid
- radeonsi: fix doubles and int64
- radeonsi: turn an assertion into return in si_nir_store_output_tcs
- ac: add prefix bitcount functions
- ac: add ac_build_readlane without optimization barrier
- radeonsi/gfx10: update comments and remove invalid TODOs
- radeonsi/gfx10: correct VS PrimitiveID implementation for NGG
- radeonsi/gfx10: move s_sendmsg gs_alloc_req to the beginning of
shaders
- radeonsi/gfx10: export primitives at the beginning of VS/TES
- radeonsi/gfx10: merge main and pos/param export IF blocks into one if
possible
- radeonsi/gfx10: don't initialize VGPRs not used by NGG passthrough
- radeonsi/gfx10: move GE_PC_ALLOC setting to shader states
- radeonsi/gfx10: implement NGG culling for 4x wave32 subgroups
- ac: add helper ac_build_triangle_strip_indices_to_triangle
- radeonsi/gfx10: rewrite late alloc computation
- radeonsi/gfx10: enable GS fast launch for triangles and strips with
NGG culling
- radeonsi: use ctx->ac. for types and integer constants
- radeonsi: move non-LLVM code out of si_shader_llvm.c
- radeonsi: move VS shader code into si_shader_llvm_vs.c
- radeonsi: move si_shader_llvm_build.c content into si_shader_llvm.c
- radeonsi: minor cleanup in si_shader_internal.h
- radeonsi: move si_nir_build_llvm into si_shader_llvm.c
- radeonsi: fold si_shader_context_set_ir into si_build_main_function
- radeonsi: move more LLVM functions into si_shader_llvm.c
- radeonsi: make si_compile_llvm return bool
- radeonsi: make si_compile_shader return bool
- radeonsi: change prototypes of si_is_multi_part_shader &
si_is_merged_shader
- radeonsi: separate LLVM compilation from non-LLVM code
- util/simple_mtx: add a missing include to get ASSERTED
- gallium/util: add a cache of live shaders for shader CSO
deduplication
- radeonsi: use the live shader cache
- radeonsi: restructure si_shader_cache_load_shader
- radeonsi: print shader cache stats with AMD_DEBUG=cache_stats
- radeonsi: expose shader cache stats to the HUD
- radeonsi: make screen available to shader part compilation
- radeonsi: fix a regression since the addition of si_shader_llvm_vs.c
- Revert "winsys/amdgpu: Close KMS handles for other DRM file
descriptions"
- Revert "winsys/amdgpu: Re-use amdgpu_screen_winsys when possible"
- radeonsi: don't report that multi-plane formats are supported
- radeonsi: fix the DCC MSAA bug workaround
- radeonsi: don't wait for shader compilation to finish when destroying
a context
Marek Vasut (5):
- etnaviv: Replace bitwise OR with logical OR
- etnaviv: tgsi: Fix gl_FrontFacing support
- etnaviv: Report correct number of vertex buffers
- etnaviv: Do not filter out PIPE_FORMAT_S8_UINT_Z24_UNORM on
pre-HALTI2
- etnaviv: Destroy rsc->pending_ctx set in etna_resource_destroy()
Mark Janes (3):
- Revert "st/mesa: call nir_serialize only once per shader"
- Revert "st/mesa: keep serialized NIR instead of nir_shader in
st_program"
- iris: separating out common perf code
Markus Wick (3):
- mapi/glapi: Generate sizeof() helpers instead of fixed sizes.
- mesa/glthread: Implement ARB_multi_bind.
- drirc: Enable glthread for dolphin/citra/yuzu.
Martin Fuzzey (1):
- etnaviv: update Android build files
Mathias Fröhlich (1):
- egl: Implement getImage/putImage on pbuffer swrast.
Matt Turner (19):
- intel/compiler: Use ARRAY_SIZE()
- intel/compiler: Extract GEN\_\* macros into separate file
- intel/compiler: Split has_64bit_types into float/int
- intel/compiler: Don't disassemble align1 3-src operands on Gen < 10
- intel/compiler: Limit compaction unit tests to specific gens
- intel/compiler: Add NF some more places
- intel/compiler: Add a INVALID_{,HW_}REG_TYPE macros
- intel/compiler: Split hw_type tables
- intel/compiler: Handle invalid inputs to brw_reg_type_to_*()
- intel/compiler: Handle invalid compacted immediates
- intel/compiler: Factor out brw_validate_instruction()
- intel/compiler: Validate some instruction word encodings
- intel/compiler: Add unit tests for new EU validation checks
- intel/compiler: Validate fuzzed instructions
- intel/compiler: Test compaction on Gen <= 12
- gitlab-ci: Skip ext_timer_query/time-elapsed
- intel/compiler: Move Gen4/5 rounding to visitor
- util: Explain BITSET_FOREACH_SET params
- util: Remove tmp argument from BITSET_FOREACH_SET macro
Mauro Rossi (9):
- android: aco: fix Lower to CSSA
- android: radeonsi: fix build error due to wrong u_format.csv file
path
- android: util/format: fix include path list
- android: radeonsi: fix build after vl refactoring (v2)
- android: nir: add a load/store vectorization pass
- android: util: Add a mapping from VkFormat to PIPE_FORMAT.
- android: radv: fix vk_format_table.c generated source build
- android: radeonsi,ac: fix building error due to ac changes
- android: radv: build radv_shader_args.c
Michel Dänzer (36):
- gitlab-ci: Set arm job CCACHE_DIR properly
- gitlab-ci: Use separate arm64 build/test docker images
- gitlab-ci: Don't build libdrm for ARM
- gitlab-ci: Use ninja -j4 for building dEQP
- gitlab-ci: Move artifact preparation to separate script
- gitlab-ci: Share dEQP build process between x86 & ARM test image
scripts
- gitlab-ci: Sort packages in debian-install.sh
- gitlab-ci: Run piglit tests with llvmpipe
- gitlab-ci: Use separate docker images for x86 build/test jobs
- gitlab-ci: Delete install/bin from artifacts as well
- gitlab-ci: Document that ci-templates refs must be in sync
- gitlab-ci: Use functional container job names
- gitlab-ci: Rename container install scripts to match job names
(better)
- gitlab-ci: Organize images using new REPO_SUFFIX templates feature
- gitlab-ci: Directly use host-mapped directory for ccache
- gitlab-ci: Stop reporting piglit test results via JUnit
- gitlab-ci: Stop storing piglit test results as JUnit
- gitlab-ci: Put HTML summary in artifacts for failed piglit jobs
- gitlab-ci: Update to current ci-templates master
- gitlab-ci: Run piglit glslparser & quick_shader tests separately
- glsl/tests: Use splitlines() instead of strip()
- gitlab-ci: Use the common run policy for LAVA jobs as well again
- gitlab-ci: Overhaul job run policy
- gitlab-ci: Don't exclude any piglit quick_shader tests
- gitlab-ci: Test against LLVM / clang 9 on x86
- gitlab-ci: Stop using manual jobs for merge requests
- gitlab-ci: Set GIT_STRATEGY to none for the dummy job
- gitlab-ci: Use single if for manual job rules entry
- winsys/amdgpu: Keep a list of amdgpu_screen_winsyses in amdgpu_winsys
- winsys/amdgpu: Keep track of retrieved KMS handles using hash tables
- winsys/amdgpu: Only re-export KMS handles for different DRM FDs
- util: Add os_same_file_description helper
- winsys/amdgpu: Re-use amdgpu_screen_winsys when possible
- winsys/amdgpu: Close KMS handles for other DRM file descriptions
- winsys/amdgpu: Re-use amdgpu_screen_winsys when possible
- winsys/amdgpu: Close KMS handles for other DRM file descriptions
Michel Zou (3):
- Meson: Check for dladdr with MinGW
- disk_cache_get_function_timestamp: check for dladdr
- Meson: Add llvm>=9 modules
Miguel Casas-Sanchez (1):
- i965: Ensure that all 2101010 image imports can pass framebuffer
completeness.
Nanley Chery (3):
- gallium/dri2: Fix creation of multi-planar modifier images
- gallium: Store the image format in winsys_handle
- iris: Fix import of multi-planar surfaces with modifiers
Nataraj Deshpande (1):
- egl/android: Restrict minimum triple buffering for android
color_buffers
Nathan Kidd (1):
- llvmpipe: Check thread creation errors
Neha Bhende (3):
- st/mesa: release tgsi tokens for shader states
- svga: fix size of format_conversion_table[]
- svga: Use pipe_shader_state_from_tgsi to set shader state
Neil Armstrong (3):
- Add support for T820 CI Jobs
- ci: Remove T820 from CI temporarily
- gitlab-ci/lava: add pipeline information in the lava job name
Neil Roberts (9):
- nir/opcodes: Add a helper function to generate the comparison binops
- nir/opcodes: Add a helper function to generate reduce opcodes
- nir: Add a 16-bit bool type
- nir: Add a 8-bit bool type
- nir/lower_alu_to_scalar: Support lowering 8- and 16-bit reduce ops
- freedreno/ir3: Support 16-bit comparison instructions
- freedreno/ir3: Add implementation of nir_op_b16csel
- freedreno/ir3: Implement f2b16 and i2b16
- freedreno/ir3: Enabling lowering 16-bit flrp
Paul Cercueil (5):
- kmsro: Extend to include ingenic-drm
- u_vbuf: Mark vbufs incompatible if more were requested than HW
supports
- u_vbuf: Only create driver CSO if no incompatible elements
- u_vbuf: Regard non-constant vbufs with non-instance elements as free
- u_vbuf: Return true in u_vbuf_get_caps if nb of vbufs is below
minimum
Paul Gofman (1):
- state_tracker: Handle texture view min level in st_generate_mipmap()
Paulo Zanoni (2):
- intel/compiler: remove the operand restriction for src1 on GLK
- intel/compiler: fix nir_op_{i,u}*32 on ICL
Peng Huang (1):
- radeonsi: make si_fence_server_signal flush pipe without work
Philipp Sieweck (1):
- svga: check return value of define_query_vgpu{9,10}
Pierre Moreau (4):
- compiler/spirv: Fix uses of gnu struct = {} extension
- include/CL: Update OpenCL headers to latest
- clover: Use the dispatch table type from the OpenCL headers
- clover/meson: Define OpenCL header macros
Pierre-Eric Pelloux-Prayer (54):
- radeonsi: tell the shader disk cache what IR is used
- mesa: enable msaa in clear_with_quad if needed
- mesa: pass vao as a function paramter
- mesa: add EXT_dsa glVertexArray\* functions declarations
- mesa: rework \_mesa_lookup_vao_err to allow usage from EXT_dsa
- mesa: add vao/vbo lookup helper for EXT_dsa
- mesa: add EXT_dsa glVertexArray\* functions implementation
- mesa: add gl_vertex_array_object parameter to client state helpers
- mesa: add EXT_dsa glEnableVertexArrayEXT / glDisableVertexArrayEXT
- mesa: add EXT_dsa EnableVertexArrayAttribEXT /
DisableVertexArrayAttribEXT
- mesa: extract helper function from \_mesa_GetPointerv
- mesa: add EXT_dsa glGetVertexArray\* 4 functions
- mesa: fix call to \_mesa_lookup_vao_err
- radeonsi: fix shader disk cache key
- radeonsi: enable mesa_glthread for GfxBench
- mesa: update features.txt to reflect EXT_dsa status
- mesa: add ARB_framebuffer_no_attachments named functions
- mesa: add ARB_vertex_attrib_64bit VertexArrayVertexAttribLOffsetEXT
- mesa: add ARB_clear_buffer_object named functions
- mesa: add ARB_gpu_shader_fp64 selector-less functions
- mesa: add ARB_instanced_arrays EXT_dsa function
- mesa: add ARB_texture_buffer_range glTextureBufferRangeEXT function
- mesa: implement ARB_texture_storage_multisample + EXT_dsa functions
- mesa: extend vertex_array_attrib_format to support EXT_dsa
- mesa: add ARB_vertex_attrib_binding glVertexArray\* functions
- mesa: add ARB_sparse_buffer NamedBufferPageCommitmentEXT function
- mesa: enable EXT_direct_state_access
- mesa: fix warning in 32 bits build
- radeonsi: implement sdma for GFX9
- radeonsi: display cs blit count for AMD_DEBUG=testdma
- radeonsi: use gfx9.surf_offset to compute texture offset
- radeonsi: fix multi plane buffers creation
- radeonsi: dcc dirty flag
- st/mesa: add a notify_before_flush callback param to flush
- st/dri: use st->flush callback to flush the backbuffer
- radeonsi: disable dcc for 2x MSAA surface and bpe < 4
- gallium: refuse to create buffers larger than UINT32_MAX
- radeon/vcn2: enable rate control for hevc encoding
- radeonsi: check ctx->sdma_cs before using it
- radeonsi: release saved resources in si_retile_dcc
- radeonsi: release saved resources in si_compute_expand_fmask
- radeonsi: release saved resources in si_compute_clear_render_target
- radeonsi: release saved resources in si_compute_copy_image
- radeonsi: release saved resources in si_compute_clear_12bytes_buffer
- radeonsi: release saved resources in si_compute_do_clear_or_copy
- radeonsi: fix fmask expand compute shader
- radeonsi: make sure fmask expand is done if needed
- radeonsi: unbind image before compute clear
- radeonsi: drop the negation from fmask_is_not_identity
- util: call bind_sampler_states before setting sampler_views
- radeonsi: move AMD_DEBUG tests to AMD_TEST
- docs: document AMD_DEBUG variable
- radeonsi: stop using the VM_ALWAYS_VALID flag
- radeonsi/ngg: add VGT_FLUSH when enabling fast launch
Prodea Alexandru-Liviu (2):
- Meson: Remove lib prefix from graw and osmesa when building with
Mingw. Also remove version sufix from osmesa swrast on Windows.
- Appveyor: Quickly fix meson build. As this required use of Python
3.8, mako module also had to be updated.
Qiang Yu (3):
- lima: sync lima_drm.h with kernel
- lima: create heap buffer with new interface if available
- lima: add noheap debug option
Rafael Antognolli (23):
- intel/isl: Add MOCS settings to isl_device.
- anv: Use mocs settings from isl_dev.
- iris: Use mocs from isl_dev.
- intel: Add workaround for stencil state.
- intel/genxml: Add 3DSTATE_CONSTANT_ALL packet.
- intel/aubinator: Decode 3DSTATE_CONSTANT_ALL.
- intel/blorp: Use 3DSTATE_CONSTANT_ALL to setup push constants.
- iris: Rework push constants emitting code.
- iris: Use 3DSTATE_CONSTANT_ALL when possible.
- anv: Move gen8+ push constant packet workaround.
- anv: Add get_push_range_address() helper.
- anv: Move code for emitting push constants into its own function.
- anv: Use 3DSTATE_CONSTANT_ALL when possible.
- iris: Add restriction to 3DSTATE_CONSTANT\_ packets.
- util/os_socket: Add socket related functions.
- vulkan/overlay: Add a control socket.
- vulkan/overlay: Add support for a control socket.
- vulkan/overlay: Add a command to start capturing data to a file.
- vulkan/overlay: Add basic overlay control script.
- vulkan/overlay: Update docs.
- iris: Implement WA for push constants.
- utils/os_socket: Define ssize_t on windows.
- intel: Load the driver even if I915_PARAM_REVISION is not found.
Rhys Perry (131):
- radv: adjust loop unrolling heuristics for int64
- aco: add Instruction::usesModifiers() and add more checks in the
optimizer
- radv: fix radv_nir_get_max_workgroup_size when nir=NULL
- aco: use DPP instead of exec modification when lowering GFX10
shuffles
- aco: fix shuffle with uniform operands
- nir/divergence: improve DA of shuffle
- aco: fix read_invocation with VGPR lane index
- aco: don't propagate vgprs into v_readlane/v_writelane
- aco: combine read_invocation and shuffle implementations
- radv: enable FP16/FP64 denormals earlier and only for LLVM
- aco: don't combine literals into v_cndmask_b32/v_subb/v_addc
- aco: fix 64-bit fsign with 0
- aco: implement VK_KHR_shader_float_controls
- aco: refactor reduction lowering helpers
- aco: implement 64-bit integer reductions
- radv/aco: enable VK_KHR_shader_subgroup_extended_types
- nir: make nir_variable::{num_members,num_state_slots} a uint16_t
- nir: add nir_variable::index and nir_index_vars
- nir/large_constants: use nir_index_vars and nir_variable::index
- docs: update features.txt for RADV
- aco: improve waitcnt insertion around loops
- aco: fix copy+paste error
- aco: fix waitcnts for barriers at block ends
- nir: add nir_num_variable_modes and nir_var_mem_push_const
- radv: set alignment for load_ssbo/store_ssbo in meta shaders
- nir: add a load/store vectorization pass
- nir: add load/store vectorizer tests
- aco: enable load/store vectorizer
- aco: allow constant offsets for global/scratch instructions on GFX10
- aco: set dlc/glc correctly for image loads
- aco: propagate p_wqm on an image_sample's coordinate p_create_vector
- aco: fix i2i64
- aco: fix incorrect cast in parse_wait_instr()
- aco: add v_nop inbetween exec write and VMEM/DS/FLAT
- aco: improve WAR hazard workaround with >64bit stores
- aco: fix GFX10 opcodes for some global/flat atomics
- aco: fix assembly of FLAT/GLOBAL atomics
- aco: fix SADDR with FLAT on GFX10
- aco: don't enable store_global for helper invocations
- aco: improve FLAT/GLOBAL scheduling
- aco: implement global atomics
- ac/llvm: fix pointer type for global atomics
- ac/llvm: improve sync scope for global atomics
- radv: set writes_memory for global memory stores/atomics
- aco: validate the CFG
- aco: handle loop exit and IF merge phis with break/discard
- aco: fix block_kind_discard s_andn2 definition to exec
- nir/lower_io_to_vector: don't create arrays when not needed
- nir/load_store_vectorize: fix combining stores with aliasing loads
between
- aco/wave32: fix comparison optimizations
- aco: improve jump threading with wave32
- aco: fix vgpr alloc granule with wave32
- aco: limit register usage for large work groups
- aco: set vm for pos0 exports on GFX10
- aco: fix imageSize()/textureSize() with large buffers on GFX8
- aco: fix uninitialized data in the binary
- aco: handle VOP3 modifiers when combining a constant comparison's NaN
test
- aco: handle omod successors with the constant in the first operand
- aco: check usesModifiers() when identifying a neg/abs
- aco: better handle neg/abs of sgprs
- aco: set exec_potentially_empty for demotes
- aco: don't DCE atomics with return values
- aco: disable add combining for ds_swizzle_b32
- aco: check if multiplication/clamp is live when applying output
modifier
- nir/divergence: handle load_primitive_id in GS
- nir/lower_gs_intrinsics: add option for per-stream counts
- aco: update IR validator
- aco: apply literals to split mads
- aco: combine two sgprs into a VALU if they're the same
- aco: improve can_use_VOP3()
- aco: rewrite literal combining
- aco: rewrite apply_sgprs()
- aco: add check_vop3_operands()
- aco: be more careful with literals in combine_salu_{n2,lshl_add}
- aco: follow through temporary when merging tests into constant
comparisons
- aco: allow applying two sgprs to an instruction
- aco: allow an extra SGPR with multiple uses to be applied to VOP3
- aco: take advantage of GFX10's constant bus limit and VOP3 literals
- aco: improve creation of v_madmk_f32/v_madak_f32
- aco: fix clamp optimization
- aco: improve clamp optimization
- aco: add min(-max(), ) and max(-min(), ) optimization
- aco: don't move literal to reg when making an instruction VOP3 on
GFX10
- aco: allow input modifiers on v_cndmask_b32
- aco: replace extract_vector with copies
- aco: improve readfirstlane after uniform LDS loads
- aco: add integer min/max to can_swap_operands
- nir/sink,nir/move: move/sink load_per_vertex_input
- nir/sink,nir/move: move/sink nir_op_mov
- nir/algebraic: a & ~(a >> 31) -> imax(a, 0)
- aco: fix stack buffer overflow in apply_sgprs()
- aco: fix fall-through test in try_remove_simple_block() with
back-edges
- aco: fix operand kill flags when a temporary is used more than once
- aco: fix off-by-one error when initializing sgpr_live_in
- radv: move gs copy shader creation before other variants
- aco: improve support for s_sendmsg
- radv/aco,aco: implement GS on GFX9+
- aco: implement GS on GFX7-8
- radv/aco: allow ACO for GS
- aco: explicitly mark end blocks for exports
- aco: remove needs_instance_id
- aco: implement GS copy shaders
- radv/aco: use ACO for GS copy shaders
- aco: use nir_move_copies
- aco: fix WaR check for >64-bit FLAT/GLOBAL instructions
- aco: fix operand to scc when selecting SGPR ufind_msb/ifind_msb
- aco: always add sgprs to sgpr_ids when choosing literals
- aco: fix literal application with v_cndmask_b32/v_addc_co_u32/etc
- amd/common,radv: move vertex_format_table to ac_shader_util.{h,c}
- aco: rework vertex fetching a bit
- aco: skip unused channels at the start when fetching vertices
- aco: handle unaligned vertex fetch on GFX10
- aco: value-number MUBUF instructions
- aco: use MUBUF in some situations instead of splitting vertex fetches
- aco: fix rebase error from GS copy shader support
- aco: ensure predecessors' p_logical_end is in WQM when a p_phi is in
WQM
- aco: run p_wqm instructions in WQM
- nir/algebraic: add patterns for a >> #b << #b
- nir/algebraic: add some half packing optimizations
- aco: fix target calculation when vgpr spilling introduces sgpr
spilling
- aco: don't consider loop header blocks branch blocks in
add_coupling_code
- aco: don't update demand in add_coupling_code() for loop headers
- aco: only create parallelcopy to restore exec at loop exit if needed
- aco: don't always add logical edges from continue_break blocks to
headers
- aco: error when block has no logical preds but VGPRs are live at the
start
- aco: set exec_potentially_empty after continues/breaks in nested IFs
- aco: improve assertion at the end of spiller
- aco: fill reg_demand with sensible information in add_coupling_code()
- aco: parallelcopy exec mask before s_wqm
- aco: fix exec mask consistency issues
- aco: fix gfx10_wave64_bpermute
Ricardo Garcia (1):
- anv: Unify GetDeviceQueue and GetDeviceQueue2
Rob Clark (89):
- freedreno/ir3: split pre-coloring to it's own function
- freedreno/ir3: use SSA flag on dest register too
- freedreno/ir3: ir3_print tweaks
- freedreno/ir3/ra: move regs_count==0 check
- freedreno/ir3/ra: remove ir print after livein/out
- freedreno/ir3: remove obsolete comment
- freedreno/a3xx: fix SP_FS_MRT_REG.HALF_PRECISION
- freedreno/a4xx: fix SP_FS_MRT_REG.HALF_PRECISION
- freedreno/ir3: sync disasm changes from envytools
- freedreno/ir3: also track # of nops for shader-db
- freedreno: fix eglDupNativeFenceFD error
- freedreno/ir3: fix valgrind complaint with STLW
- freedreno/ir3: remove half-precision output
- freedreno/ir3: rename fanin/fanout to collect/split
- freedreno/ir3: remove impossible condition
- freedreno/ir3: add input/output iterators
- freedreno/ir3: show input/output wrmask's in disasm
- freedreno/ir3: helper to print ir if debug enabled
- freedreno/ir3: remove first-vertex sysval
- freedreno/ir3: simplify creating sysval inputs
- freedreno/ir3: re-work shader inputs/outputs
- freedreno/ir3: only tex instructions have wrmask
- freedreno/ir3: fix gpu hang with pre-fs-tex-fetch
- freedreno/ir3: legalize cleanups
- freedreno/ir3: remove unused parameter
- freedreno/perfcntrs: small cleanup
- freedreno/perfcntrs: remove gallium dependencies
- freedreno/perfcntrs: move to shared location
- freedreno/perfcntrs: add accessor to get per-gen tables
- freedreno/perfctrs/a2xx: move CP to be first group
- freedreno/perfcntrs/a6xx: remove RBBM counters
- freedreno/perfcntrs: add fdperf
- freedreno/perfctrs/fdperf: periodically restore counters
- gitlab-ci: update deqp build so we can generate xml
- gitlab-ci/deqp: preserve full list of unexpected results
- gitlab-ci/deqp: preserve caselists for blocks with fails
- gitlab-ci/deqp: detect and report flakes
- gitlab-ci: bump arm test container
- gitlab-ci/deqp: generate xml results for fails/flakes
- gitlab-ci/deqp: generate junit results
- gitlab-ci/freedreno/a6xx: remove most of the flakes
- freedreno: use rsc->slice accessor everywhere
- freedreno: switch to layout helper
- gitlab-ci: disable junit results for deqp
- freedreno/ir3: remove store_output lowered to store_shared_ir3
- freedreno/ir3: fix neverball assert in case of unused VS inputs
- nir/lower_clip: Fix incorrect driver loc for clipdist outputs
- freedreno/fdperf: use drmOpen()
- freedreno/a6xx: disable LRZ when blending
- freedreno/a5xx+a6xx: split LRZ layout to per-gen
- freedreno/a6xx: fix LRZ layout
- freedreno/a6xx: fix LRZ logic
- freedreno/a6xx: enable LRZ by default
- spirv: add OpLifetime\*
- freedreno/ir3: add last-baryf shaderdb stat
- freedreno/ir3: add scheduler traces
- freedreno/ir3: add iterator macros
- freedreno/a6xx: fix OUT_REG() vs growable cmdstream
- nir+vtn: vec8+vec16 support
- freedreno/ir3: fix flat shading again
- nir: assert that nir_lower_tex runs after lowering derefs
- mesa/st: lower samplers before nir_lower_tex
- freedreno/ir3: rename instructions
- gitlab-ci: fix missing caselist.css/xsl
- freedreno/a6xx: limit scratch/debug markers to debug builds
- freedreno/a6xx: cleanup rasterizer state
- freedreno/a6xx: separate rast stateobj for prim restart
- freedreno/a6xx: drop a few more per-draw registers
- freedreno/a6xx: move dynamic program state to streaming stateobj
- freedreno/a6xx: add PROG_FB_RAST stateobj
- freedreno/drm: fix invalid-cmdstream-size with older kernels
- freedreno: use PIPE_CAP_RGB_OVERRIDE_DST_ALPHA_BLEND
- mesa/st: random whitespace cleanup
- freedreno/a6xx: remove special handling based on MRT format
- freedreno/a6xx: convert blend state to stateobj
- freedreno: extract vsc pipe bo from GMEM state
- freedreno: consolidate GMEM state
- freedreno: constify fd_tile
- freedreno: constify fd_vsc_pipe
- freedreno/a6xx: constify gmem state
- freedreno/a5xx: constify gmem state
- freedreno/a4xx: constify gmem state
- freedreno/a3xx: constify gmem state
- freedreno/a2xx: constify gmem state
- freedreno: get GMEM state from batch
- freedreno: add gmem state cache
- freedreno: add gmem_lock
- freedreno: remove flush-queue
- freedreno: allow ctx->batch to be NULL
Robert Foss (5):
- nir: Build nir_lower_point_size.c in libmesa_nir
- android: Add panfrost support to build scripts
- android: Fix u_format_table.c being generated twice
- panfrost: Prefix schedule_program to prevent collision
- android: Fix whitespace issue
Rohan Garg (1):
- gitlab-ci: Use lavacli from packages
Roland Scheidegger (3):
- gallium/scons: fix graw_gdi build
- util/atomic: Fix p_atomic_add for unlocked and msvc paths
- winsys/svga: use new ioctl for logging
Roman Stratiienko (2):
- Android: Fix build issue without LLVM
- panfrost: Fix Android build
Ross Zwisler (1):
- intel: limit shader geometry on BDW GT1
Sagar Ghuge (1):
- intel/compiler: Clear accumulator register before EOT
Samuel Iglesias Gonsálvez (1):
- main: fix coverity error in \_mesa_program_resource_find_name()
Samuel Pitoiset (202):
- radv: declare NGG scratch for VS or TES and only on GFX10
- radv: fix compute pipeline keys when optimizations are disabled
- docs: document all RADV environment variables
- radv: add a note about perftest/debug options
- radv: fix 32-bit compiler warnings
- nir: fix packing of nir_variable
- radv/gfx10: enable wave32 for compute based on shader's wavesize
- radv: hardcode the number of waves for the GFX6 LS-HS bug
- radv: determine shaders wavesize at pipeline level
- radv: rely on shader's wavesize when computing NGG info
- radv: implement VK_EXT_subgroup_size_control
- radv/gfx10: fix primitive indices orientation for NGG GS
- ac: handle pointer types to LDS in ac_get_elem_bits()
- gitlab-ci: build a specific libdrm version for ARM64
- gitlab-ci: build RADV on ARM64
- ac: fix build with recent LLVM
- radv: remove useless RADV_DEBUG=unsafemath debug option
- radv: make sure to not clear the ds attachment after resolves
- ac: add radeon_info::has_l2_uncached
- radv: implement VK_AMD_device_coherent_memory
- spirv: fix lowering of OpGroupNonUniformAllEqual
- ac: remove useless cast in ac_build_set_inactive()
- ac: add 8-bit and 16-bit supports to ac_build_shuffle()
- ac: add 8-bit and 16-bit supports to ac_build_readlane()
- ac: add 8-bit and 16-bit supports to ac_build_set_inactive()
- ac: add 8-bit and 16-bit supports to ac_build_dpp()
- ac: add 8-bit and 16-bit supports to ac_build_swizzle()
- ac: add 8-bit and 16-bit supports to get_reduction_identity()
- ac: add 8-bit and 16-bit supports to ac_build_wwm()
- ac: add 8-bit and 16-bit supports to ac_build_optimization_barrier()
- ac: add 16-bit float support to ac_build_alu_op()
- radv: advertise VK_KHR_shader_subgroup_extended_types on GFX8-GFX9
- radv: enable VK_KHR_shader_subgroup_extended_types on GFX6-GFX7
- docs: add missing new features for RADV
- pipe-loader: check that the pointer to driconf_xml isn't NULL
- gitlab-ci: move building piglit into a separate script
- gitlab-ci: fix ldd check for Vulkan drivers
- gitlab-ci: add a job that only build things needed for testing
- gitlab-ci: do not build with debugoptimized for meson-main
- gitlab-ci: build swr in meson-main
- gitlab-ci: build GLVND in meson-clang
- gitlab-ci: remove now useless meson-swr-glvnd build job
- gitlab-ci: reduce the number of scons build
- radv: disable subgroup shuffle operations on GFX10
- ac/llvm: fix the local invocation index for wave32
- meson: only build imgui when needed
- radv: set the image view aspect mask during subpass transitions
- radv: set the image view aspect mask before resolves
- radv: rework creation of decompress/resummarize meta pipelines
- radv: create decompress pipelines for separate depth/stencil layouts
- radv: select the depth decompress path based on the aspect mask
- ac/llvm: fix warning in ac_build_canonicalize()
- radv: fix reporting subgroup size with
VK_KHR_pipeline_executable_properties
- radv: fix enabling sample shading with SampleID/SamplePosition
- radv/gfx10: fix implementation of exclusive scans
- ac: add 8-bit and 16-bit supports to ac_build_permlane16()
- radv: enable VK_KHR_shader_subgroup_extended_types on GFX10
- ac/llvm: convert src operands to pointers if necessary
- radv: add more constants to avoid using magic numbers
- radv,ac/nir: lower deref operations for shared memory
- aco: drop useless lowering of deref operations for shared memory
- ac/llvm: fix atomic var operations if source isn't a deref
- radv: remove dead shader input/output variables
- radv: simplify a check in radv_fixup_vertex_input_fetches()
- radv/gfx10: fix the vertex order for triangle strips emitted by a GS
- gitlab-ci: rename build-deqp.sh to build-deqp-gl.sh
- gitlab-ci: add a gl suffix to the x86 test image and all test jobs
- gitlab-ci: add a new job that builds a base test image for VK
- gitlab-ci: build cts_runner in the x86 test image for VK
- gitlab-ci: build dEQP VK 1.1.6 in the x86 test image for VK
- gitlab-ci: add a new base test job for VK
- gitlab-ci: allow to run dEQP Vulkan with DEQP_VER
- gitlab-ci: configure the Vulkan ICD export with VK_DRIVER
- gitlab-ci: build RADV in meson-testing
- gitlab-ci: add a job that runs Vulkan CTS with RADV conditionally
- radv: do not use VK_TRUE/VK_FALSE
- radv: move emission of two PA_SC\_\* registers to the pipeline CS
- radv: fix possibly wrong PA_SC_AA_CONFIG value for conservative rast
- radv: synchronize after performing a separate depth/stencil fast
clears
- radv: do not init HTILE as compressed state when dst layout allows it
- radv: initialize HTILE for separate depth/stencil aspects
- radv: implement VK_KHR_separate_depth_stencil_layouts
- gitlab-ci: set RADV_DEBUG=checkir for RADV test jobs
- ac/nir: fix out-of-bound access when loading constants from global
- radv: enable SpvCapabilityImageMSArray
- radv: handle unaligned vertex fetches on GFX6/GFX10
- radv/gfx10: fix ngg_get_ordered_id
- radv/gfx10: fix the out-of-bounds check for vertex descriptors
- ac: declare an enum for the OOB select field on GFX10
- radv: init a default multisample state for the resolve FS path
- radv: ignore pMultisampleState if rasterization is disabled
- radv: ignore pTessellationState if the pipeline doesn't use tess
- radv: ignore pDepthStencilState if rasterization is disabled
- radv: tidy up radv_pipeline_init_blend_state()
- radv: ignore pColorBlendState if rasterization is disabled
- radv: rely on pipeline layout when creating push descriptors with
template
- radv: return the correct pitch for linear mipmaps on GFX10
- radv: record number of color/depth samples for each subpass
- radv: implement VK_AMD_mixed_attachment_samples
- ac/surface: use uint16_t for mipmap level pitches
- radv: do not fill keys from fragment shader twice
- spirv: add SpvCapabilityImageReadWriteLodAMD
- spirv,nir: add new lod parameter to image_{load,store} intrinsics
- amd/llvm: handle nir_intrinsic_image_deref_{load,store} with lod
- aco: handle nir_intrinsic_image_deref_{load,store} with lod
- radv: advertise VK_AMD_shader_image_load_store_lod
- radv/gfx10: disable vertex grouping
- radv/gfx10: determine if a pipeline is eligible for NGG passthrough
- radv/gfx10: do not declare LDS for NGG if useless
- radv/gfx10: add support for NGG passthrough mode
- radv/gfx10: improve performance for TES using PrimID but not
exporting it
- radv: only use VkSamplerCreateInfo::compareOp if enabled
- radv/gfx10: enable all CUs if NGG is never used
- radv/gfx10: simplify some duplicated NGG GS code
- vulkan/overlay: Fix for Vulkan 1.2
- radv: update VK_EXT_descriptor_indexing for Vulkan 1.2
- radv: update VK_EXT_host_query_reset for Vulkan 1.2
- radv: update VK_EXT_sampler_filter_minmax for Vulkan 1.2
- radv: update VK_EXT_scalar_block_layout for Vulkan 1.2
- radv: update VK_KHR_8bit_storage for Vulkan 1.2
- radv: update VK_KHR_buffer_device_address for Vulkan 1.2
- radv: update VK_KHR_create_renderpass2 for Vulkan 1.2
- radv: update VK_KHR_depth_stencil_resolve for Vulkan 1.2
- radv: update VK_KHR_draw_indirect_count for Vulkan 1.2
- radv: update VK_KHR_driver_properties for Vulkan 1.2
- radv: update VK_KHR_image_format_list for Vulkan 1.2
- radv: update VK_KHR_imageless_framebuffer for Vulkan 1.2
- radv: update VK_KHR_shader_atomic_int64 for Vulkan 1.2
- radv: update VK_KHR_shader_float16_int8 for Vulkan 1.2
- radv: update VK_KHR_shader_float_controls for Vulkan 1.2
- radv: update VK_KHR_shader_subgroup_extended_types for Vulkan 1.2
- radv: update VK_KHR_uniform_buffer_standard_layout for Vulkan 1.2
- radv: update VK_KHR_timeline_semaphore for Vulkan 1.2
- radv: implement Vulkan 1.1 features and properties
- radv: implement Vulkan 1.2 features and properties
- radv: enable Vulkan 1.2
- aco: fix emitting SMEM instructions with no operands on GFX6-GFX7
- aco: do not select 96-bit/128-bit variants for ds_read/ds_write on
GFX6
- aco: do not combine additions of DS instructions on GFX6
- aco: implement stream output with vec3 on GFX6
- aco: fix emitting slc for MUBUF instructions on GFX6-GFX7
- aco: print assembly with CLRXdisasm for GFX6-GFX7 if found on the
system
- aco: fix constant folding of SMRD instructions on GFX6
- aco: do not use the vec3 variant for stores on GFX6
- aco: do not use the vec3 variant for loads on GFX6
- aco: add new addr64 bit to MUBUF instructions on GFX6-GFX7
- aco: implement nir_intrinsic_load_barycentric_at_sample on GFX6
- radv: fix double free corruption in radv_alloc_memory()
- radv: add explicit external subpass dependencies to meta operations
- radv: handle missing implicit subpass dependencies
- spirv: add SpvCapabilityFragmentMaskAMD
- nir: add two new texture ops for multisample fragment color/mask
fetches
- spirv: add support for SpvOpFragment{Mask}FetchAMD operations
- nir/lower_input_attachments: lower nir_texop_fragment_{mask}_fetch
- ac/nir: add support for nir_texop_fragment_{mask}_fetch
- aco: add support for nir_texop_fragment_{mask}_fetch
- radv: advertise VK_AMD_shader_fragment_mask
- aco: fix printing assembly with CLRXdisasm on GFX6
- aco: fix wrong IR in nir_intrinsic_load_barycentric_at_sample
- aco: implement nir_intrinsic_store_global on GFX6
- aco: implement nir_intrinsic_load_global on GFX6
- aco: implement nir_intrinsic_global_atomic\_\* on GFX6
- aco: implement 64-bit nir_op_ftrunc on GFX6
- aco: implement 64-bit nir_op_fceil on GFX6
- aco: implement 64-bit nir_op_fround_even on GFX6
- aco: implement 64-bit nir_op_ffloor on GFX6
- aco: implement nir_op_f2i64/nir_op_f2u64 on GFX6
- ac/llvm: fix missing casts in ac_build_readlane()
- aco: combine MRTZ (depth, stencil, sample mask) exports
- aco: fix a hardware bug for MRTZ exports on GFX6
- aco: fix a hazard with v_interp\_\* and v_{read,readfirst}lane\_\* on
GFX6
- aco: copy the literal offset of SMEM instructions to a temporary
- radv: enable ACO support for GFX6
- radv: print NIR shaders after lowering FS inputs/outputs
- radv: do not allow sparse resources with multi-planar formats
- radv: enable VK_AMD_shader_fragment_mask on GFX6-GFX7
- compiler: add a new explicit interpolation mode
- spirv: add support for SpvDecorationExplicitInterpAMD
- compiler: add PERSP to the existing barycentric system values
- compiler: add new SYSTEM_VALUE_BARYCENTRIC\_\*
- spirv: add support for SpvBuiltInBaryCoord\*
- nir: add nir_intrinsic_load_barycentric_model
- nir: lower SYSTEM_VALUE_BARYCENTRIC\_\* to nir_load_barycentric()
- nir: add nir_intrinsic_interp_deref_at_vertex
- nir: lower interp_deref_at_vertex to load_input_vertex
- spirv: implement SPV_AMD_shader_explicit_vertex_parameter
- ac/llvm: implement VK_AMD_shader_explicit_vertex_parameter
- aco: implement VK_AMD_shader_explicit_vertex_parameter
- radv: gather which input PS variables use an explicit interpolation
mode
- radv: implement VK_AMD_shader_explicit_vertex_parameter
- radv: bump conformance version to 1.2.0.0
- radv: remove the non conformant VK implementation warning on GFX10
- aco: fix VS input loads with MUBUF on GFX6
- radv/gfx10: add a separate flag for creating a GDS OA buffer
- radv/gfx10: implement NGG GS queries
- radv/gfx10: re-enable NGG GS
- radv: refactor physical device properties
- aco: fix MUBUF VS input loads when expanding vec3 to vec4 on GFX6
- aco: do not use ds_{read,write}2 on GFX6
- aco: fix waiting for scalar stores before "writing back" data on
GFX8-GFX9
- aco: fix creating v_madak if v_mad_f32 has two sgpr literals
- nir: do not use De Morgan's Law rules for flt and fge
Samuel Thibault (3):
- loader: #define PATH_MAX when undefined (eg. Hurd)
- util: Do not fail to build on unknown pthread_setname_np
- meson: Do not require libdrm for DRI2 on hurd
Satyajit Sahu (1):
- radeon/vcn: Handle crop parameters for encoder
Sonny Jiang (1):
- radeonsi: use compute shader for clear 12-byte buffer
Stephan Gerhold (1):
- kmsro: Add "mcde" entry point
Tapani Pälli (33):
- nir: fix couple of compile warnings
- util/android: fix android build errors
- Revert "egl: implement new functions from
EGL_EXT_image_flush_external"
- Revert "egl: handle EGL_IMAGE_EXTERNAL_FLUSH_EXT"
- Revert "st/dri: add support for EGL_EXT_image_flush_external"
- Revert "st/dri: assume external consumers of back buffers can write
to the buffers"
- Revert "dri_interface: add interface for
EGL_EXT_image_flush_external"
- mesa: allow bit queries for EXT_disjoint_timer_query
- Revert "mesa: allow bit queries for EXT_disjoint_timer_query"
- mesa: allow bit queries for EXT_disjoint_timer_query
- gitlab-ci: update Piglit commit, update skips
- mapi: add GetInteger64vEXT with EXT_disjoint_timer_query
- glsl: handle max uniform limits with lower_const_arrays_to_uniforms
- gitlab-ci: bump piglit checkout commit
- glsl: additional interface redeclaration check for SSO programs
- intel/compiler: add newline to limit_dispatch_width message
- intel/compiler: force simd8 when dual src blending on gen8
- dri: add \__DRI_IMAGE_FORMAT_SXRGB8
- i965: expose MESA_FORMAT_B8G8R8X8_SRGB visual
- mesa/st/i965: add a ProgramResourceHash for quicker resource lookup
- mesa: create program resource hash in a single place
- iris: set depth stall enabled when depth flush enabled on gen12
- anv: set depth stall enabled when depth flush enabled on gen12
- isl/gen12: add reminder comment about missing WA with 3D surfaces
- anv: fix assert in GetImageDrmFormatModifierPropertiesEXT
- anv: add assert for isl_mod_info in choose_isl_tiling_flags
- anv: initialize clear_color_is_zero_one
- egl/android: fix buffer_count for applications setting max count
- anv/android: setup gralloc1 usage from gralloc0 usage manually
- anv/android: make format_supported_with_usage static
- intel/vec4: fix valgrind errors with vf_values array
- glsl: fix a memory leak with resource_set
- iris: fix aux buf map failure in 32bits app on Android
Thomas Hellstrom (4):
- winsys/svga: Enable transhuge pages for buffer objects
- svga: Avoid discard DMA uploads
- gallium/util: Increase the debug_flush map depth
- svga: Fix banded DMA upload
Thong Thai (8):
- st/va: Convert interlaced NV12 to progressive
- util/format: Add the P010 format used for 10-bit videos
- gallium: Add PIPE_FORMAT_P010 support
- st/va: Add support for P010, used for 10-bit videos
- radeon: Use P010 for decoding of 10-bit videos
- r600: Remove HEVC related code since HEVC is not supported
- mesa: Prevent \_MaxLevel from being less than zero
- Revert "st/va: Convert interlaced NV12 to progressive"
Timothy Arceri (66):
- glsl: just use NIR to lower outputs when driver can't read outputs
- glsl: disable lower_fragdata_array() for NIR drivers
- mesa: add ARB_shading_language_include stubs
- glsl: add infrastructure for ARB_shading_language_include
- mesa: add ARB_shading_language_include infrastructure to
gl_shared_state
- mesa: add helper to validate tokenise shader include path
- mesa: add \_mesa_lookup_shader_include() helper
- mesa: add copy_string() helper
- mesa: add glNamedStringARB() support
- mesa: implement glGetNamedStringARB()
- mesa: make error checking optional in \_mesa_lookup_shader_include()
- mesa: implement glIsNamedStringARB()
- mesa: implement glGetNamedStringivARB()
- mesa: split \_mesa_lookup_shader_include() in two
- mesa: implement glDeleteNamedStringARB()
- glsl: add ARB_shading_language_include support to #line
- glsl: pass gl_context to glcpp_parser_create()
- glsl: add preprocessor #include support
- glsl: error if #include used while extension is disabled
- glsl: add can_skip_compile() helper
- glsl: delay compilation skip if shader contains an include
- mesa: add support cursor support for relative path shader includes
- mesa: add shader include lookup support for relative paths
- mesa: implement glCompileShaderIncludeARB()
- mesa: enable ARB_shading_language_include
- gitlab-ci: bump piglit checkout commit
- gitlab-ci: update for arb_shading_language_include
- compiler: move build definition of pp_standalone_scaffolding.c
- radv: add some infrastructure for fresh forks for each secure compile
- radv: add a secure_compile_open_fifo_fds() helper
- radv: create a fresh fork for each pipeline compile
- docs: update source code repository documentation
- glsl: move calculate_array_size_and_stride() to link_uniforms.cpp
- glsl: don't set uniform block as used when its not
- glsl: make use of active_shader_mask when building resource list
- glsl/nir: iterate the system values list when adding varyings
- docs: remove mailing list as way of submitting patches
- glsl: move nir_remap_dual_slot_attributes() call out of glsl_to_nir()
- glsl: copy the how_declared field when converting to nir
- nir: add some fields to nir_variable_data
- glsl: copy the new data fields when converting to nir
- glsl: add support for named varyings in
nir_build_program_resource_list()
- glsl: add subroutine support to nir_build_program_resource_list()
- st/glsl_to_nir: call gl_nir_lower_buffers() a little later
- st/glsl_to_nir: use nir based program resource list builder
- st/glsl_to_nir: fix SSO validation regression
- glsl: rename gl_nir_link() to gl_nir_link_spirv()
- glsl: add gl_nir_link_check_atomic_counter_resources()
- glsl: add new gl_nir_link_glsl() helper
- glsl: reorder link_and_validate_uniforms() calls
- mesa: add new UseNIRGLSLLinker constant
- glsl: use nir linker to link atomics
- glsl: add check_image_resources() for the nir linker
- glsl: use nir version of check_image_resources() for nir linker
- glsl: move check_subroutine_resources() into the shared util code
- glsl: call check_subroutine_resources() from the nir linker
- glsl: move uniform resource checks into the common linker code
- glsl: call uniform resource checks from the nir linker
- glsl: move calculate_subroutine_compat() to shared linker code
- glsl: call calculate_subroutine_compat() from the nir linker
- glsl: fix potential bug in nir uniform linker
- glsl: remove bogus assert in nir uniform linking
- glsl: fix check for matrices in blocks when using nir uniform linker
- glsl: count uniform components and storage better in nir linking
- glsl_to_nir: update interface type properly
- glsl: fix gl_nir_set_uniform_initializers() for image arrays
Timur Kristóf (39):
- ac: Handle invalid GFX10 format correctly in ac_get_tbuffer_format.
- aco: Make sure not to mistakenly propagate 64-bit constants.
- aco: Treat all booleans as per-lane.
- aco: Optimize out trivial code from uniform bools.
- aco: Fix operand of s_bcnt1_i32_b64 in emit_boolean_reduce.
- aco: Remove superfluous argument from emit_boolean_logic.
- aco: Remove lower_linear_bool_phi, it is not needed anymore.
- aco: Optimize load_subgroup_id to one bit field extract instruction.
- aco/wave32: Change uniform bool optimization to work with wave32.
- aco/wave32: Replace hardcoded numbers in spiller with wave size.
- aco/wave32: Introduce emit_mbcnt which takes wave size into account.
- aco/wave32: Add wave size specific opcodes to aco_builder.
- aco/wave32: Use lane mask regclass for exec/vcc.
- aco/wave32: Fix load_local_invocation_index to support wave32.
- aco/wave32: Use wave_size for barrier intrinsic.
- aco/wave32: Allow setting the subgroup ballot size to 64-bit.
- aco/wave32: Fix reductions.
- aco: Fix uniform i2i64.
- ac/llvm: Fix ac_build_reduce in wave32 mode.
- aco/wave32: Set the definitions of v_cmp instructions to the lane
mask.
- aco: Implement 64-bit constant propagation.
- aco: Allow optimizing vote_all and nir_op_iand.
- aco: Don't skip combine_instruction when definitions[1] is used.
- aco: Optimize out s_and with exec, when used on uniform bitwise
values.
- aco: Flip s_cbranch / s_cselect to optimize out an s_not if possible.
- nouveau/nvc0: add extern keyword to nvc0_miptree_vtbl.
- intel/compiler: Fix array bounds warning on GCC 10.
- radeon: Move si_get_pic_param to radeon_vce.c
- r600: Move get_pic_param to radeon_vce.c
- gallium: Fix a couple of multiple definition warnings.
- radeon: Fix multiple definition error with radeon_debug
- aco: Fix -Wstringop-overflow warnings in aco_span.
- aco: Fix maybe-uninitialized warnings.
- aco: Fix signedness compare warning.
- aco: Make a better guess at which instructions need the VCC hint.
- aco: Transform uniform bitwise instructions to 32-bit if possible.
- aco/gfx10: Fix VcmpxExecWARHazard mitigation.
- aco: Fix the meaning of is_atomic.
- aco/optimizer: Don't combine uniform bool s_and to s_andn2.
Tomasz Pyra (1):
- gallium/swr: Fix arb_transform_feedback2
Tomeu Vizoso (38):
- gitlab-ci: Disable lima jobs
- gitlab-ci: Run only LAVA jobs in special-named branches
- panfrost: Add checksum fields to SFBD descriptor
- panfrost: Set 0x10 bit on mali_shader_meta.unknown2_4 on T720
- panfrost: Rework format encoding on SFBD
- panfrost: Take into account texture layers in SFBD
- panfrost: Decode blend shaders for SFBD
- panfrost: Generate polygon list manually for SFBD
- panfrost: Print the right zero field
- panfrost: Pipe the GPU ID into compiler and disassembler
- panfrost: Set depth and stencil for SFBD based on the format
- panfrost: Multiply offset_units by 2
- panfrost: Make sure the shader descriptor is in sync with the GL
state
- gitlab-ci: Remove limit on kernel logging
- panfrost: Just print tiler fields as-is for Tx20
- panfrost: Rework buffers in SFBD
- gitlab-ci: Fix dir name for VK-GL-CTS sources
- panfrost: Don't print the midgard_blend_rt structs on SFBD
- panfrost: Add quirks system to cmdstream
- panfrost: Simplify shader patching
- panfrost: White list the Mali T720
- gitlab-ci: Test Panfrost on T720 GPUs
- panfrost: Add PAN_MESA_DEBUG=sync
- panfrost: Hold a reference to sampler views
- pan/midgard: Remove undefined behavior
- nir: Don't copy empty array
- util: Don't access members of NULL pointers
- panfrost: Don't lose bits!
- st/mesa: Don't access members of NULL pointers
- panfrost: Handle Z24_UNORM_S8_UINT as MALI_Z32_UNORM
- panfrost: Increase PIPE_SHADER_CAP_MAX_OUTPUTS to 16
- panfrost: Dynamically allocate array of texture pointers
- panfrost: Map with size of first layer for 3D textures
- panfrost: Store internal format
- gitlab-ci: Update kernel for LAVA to 5.5-rc1 plus fixes
- gitlab-ci: Switch LAVA jobs to use shared dEQP runner
- gitlab-ci: Upgrade kernel for LAVA jobs to v5.5-rc5
- gitlab-ci: Consolidate container and build stages for LAVA
Urja Rannikko (4):
- panfrost: free last_read/write tables in mir_create_dependency_graph
- panfrost: free allocations in schedule_block
- panfrost: add lcra_free() to free lcra state
- panfrost: free spill cost table in mir_spill_register
Vasily Khoruzhick (31):
- lima: add debug prints for BO cache
- lima: align size before trying to fetch BO from cache
- lima: ignore flags while looking for BO in cache
- lima: set dithering flag when necessary
- lima: add support for gl_PointSize
- lima: enable tiling
- lima: handle DRM_FORMAT_MOD_INVALID in resource_from_handle()
- lima: expose tiled format modifier in query_dmabuf_modifiers()
- lima: use single BO for GP outputs
- lima: drop suballocator
- lima: fix allocation of GP outputs storage for indexed draw
- lima: postpone PP stream generation
- lima: don't reload and redraw tiles that were not updated
- lima: fix PP stream terminator size
- lima: use linear layout for shared buffers if modifier is not
specified
- lima: add debug flag to disable tiling
- lima: drop support for R8G8B8 format
- lima: fix PLBU_CMD_PRIMITIVE_SETUP command
- lima: fix viewport clipping
- lima: implement polygon offset
- lima: fix PIPE_CAP\_\* to mark features that aren't supported yet
- lima: add new findings to texture descriptor
- lima: fix handling of reverse depth range
- ci: lava: pass CI_NODE_INDEX and CI_NODE_TOTAL to lava jobs
- ci: Re-enable CI for lima on mali450
- lima: implement invalidate_resource()
- nir: don't emit ishl in \_nir_mul_imm() if backend doesn't support
bitops
- lima: use imul for calculations with intrinsic src
- lima: ppir: don't delete root ld_tex nodes without successors in
current block
- lima: ppir: always create move and update ld_tex successors for all
blocks
- lima: disable early-z if fragment shader uses discard
Vinson Lee (9):
- swr: Fix build with llvm-10.0.
- panfrost: Fix gnu-empty-initializer build errors.
- scons: Bump C standard to gnu11 on macOS 10.15.
- util/u_thread: Restrict u_thread_get_time_nano on macOS.
- swr: Fix build with llvm-10.0.
- swr: Fix build with llvm-10.0.
- lima: Fix build with GCC 10.
- swr: Fix GCC 4.9 checks.
- panfrost: Remove unused anonymous enum variables.
Wladimir J. van der Laan (2):
- u_vbuf: add logic to use a limited number of vbufs
- u_vbuf: use single vertex buffer if it's not possible to have
multiple
X512 (1):
- util/u_thread: Fix build under Haiku
Yevhenii Kolesnikov (5):
- glsl: Enable textureSize for samplerExternalOES
- meson: Fix linkage of libgallium_nine with libgalliumvl
- meta: Cleanup function for DrawTex
- main: allow external textures for BindImageTexture
- meta: Add cleanup function for Bitmap
Zebediah Figura (1):
- Revert "draw: revert using correct order for prim decomposition."
luc (1):
- zink: confused compilation macro usage for zink in target helpers.
|