1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 1032 1033 1034 1035 1036 1037 1038 1039 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057 1058 1059 1060 1061 1062 1063 1064 1065 1066 1067 1068 1069 1070 1071 1072 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085 1086 1087 1088 1089 1090 1091 1092 1093 1094 1095 1096 1097 1098 1099 1100 1101 1102 1103 1104 1105 1106 1107 1108 1109 1110 1111 1112 1113 1114 1115 1116 1117 1118 1119 1120 1121 1122 1123 1124 1125 1126 1127 1128 1129 1130 1131 1132 1133 1134 1135 1136 1137 1138 1139 1140 1141 1142 1143 1144 1145 1146 1147 1148 1149 1150 1151 1152 1153 1154 1155 1156 1157 1158 1159 1160 1161 1162 1163 1164 1165 1166 1167 1168 1169 1170 1171 1172 1173 1174 1175 1176 1177 1178 1179 1180 1181 1182 1183 1184 1185 1186 1187 1188 1189 1190 1191 1192 1193 1194 1195 1196 1197 1198 1199 1200 1201 1202 1203 1204 1205 1206 1207 1208 1209 1210 1211 1212 1213 1214 1215 1216 1217 1218 1219 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239 1240 1241 1242 1243 1244 1245 1246 1247 1248 1249 1250 1251 1252 1253 1254 1255 1256 1257 1258 1259 1260 1261 1262 1263 1264 1265 1266 1267 1268 1269 1270 1271 1272 1273 1274 1275 1276 1277 1278 1279 1280 1281 1282 1283 1284 1285 1286 1287 1288 1289 1290 1291 1292 1293 1294 1295 1296 1297 1298 1299 1300 1301 1302 1303 1304 1305 1306 1307 1308 1309 1310 1311 1312 1313 1314 1315 1316 1317 1318 1319 1320 1321 1322 1323 1324 1325 1326 1327 1328 1329 1330 1331 1332 1333 1334 1335 1336 1337 1338 1339 1340 1341 1342 1343 1344 1345 1346 1347 1348 1349 1350 1351 1352 1353 1354 1355 1356 1357 1358 1359 1360 1361 1362 1363 1364 1365 1366 1367 1368 1369 1370 1371 1372 1373 1374 1375 1376 1377 1378 1379 1380 1381 1382 1383 1384 1385 1386 1387 1388 1389 1390 1391 1392 1393 1394 1395 1396 1397 1398 1399 1400 1401 1402 1403 1404 1405 1406 1407 1408 1409 1410 1411 1412 1413 1414 1415 1416 1417 1418 1419 1420 1421 1422 1423 1424 1425 1426 1427 1428 1429 1430 1431 1432 1433 1434 1435 1436 1437 1438 1439 1440 1441 1442 1443 1444 1445 1446 1447 1448 1449 1450 1451 1452 1453 1454 1455 1456 1457 1458 1459 1460 1461 1462 1463 1464 1465 1466 1467 1468 1469 1470 1471 1472 1473 1474 1475 1476 1477 1478 1479 1480 1481 1482 1483 1484 1485 1486 1487 1488 1489 1490 1491 1492 1493 1494 1495 1496 1497 1498 1499 1500 1501 1502 1503 1504 1505 1506 1507 1508 1509 1510 1511 1512 1513 1514 1515 1516 1517 1518 1519 1520 1521 1522 1523 1524 1525 1526 1527 1528 1529 1530 1531 1532 1533 1534 1535 1536 1537 1538 1539 1540 1541 1542 1543 1544 1545 1546 1547 1548 1549 1550 1551 1552 1553 1554 1555 1556 1557 1558 1559 1560 1561 1562 1563 1564 1565 1566 1567 1568 1569 1570 1571 1572 1573 1574 1575 1576 1577 1578 1579 1580 1581 1582 1583 1584 1585 1586 1587 1588 1589 1590 1591 1592 1593 1594 1595 1596 1597 1598 1599 1600 1601 1602 1603 1604 1605 1606 1607 1608 1609 1610 1611 1612 1613 1614 1615 1616 1617 1618 1619 1620 1621 1622 1623 1624 1625 1626 1627 1628 1629 1630 1631 1632 1633 1634 1635 1636 1637 1638 1639 1640 1641 1642 1643 1644 1645 1646 1647 1648 1649 1650 1651 1652 1653 1654 1655 1656 1657 1658 1659 1660 1661 1662 1663 1664 1665 1666 1667 1668 1669 1670 1671 1672 1673 1674 1675 1676 1677 1678 1679 1680 1681 1682 1683 1684 1685 1686 1687 1688 1689 1690 1691 1692 1693 1694 1695 1696 1697 1698 1699 1700 1701 1702 1703 1704 1705 1706 1707 1708 1709 1710 1711 1712 1713 1714 1715 1716 1717 1718 1719 1720 1721 1722 1723 1724 1725 1726 1727 1728 1729 1730 1731 1732 1733 1734 1735 1736 1737 1738 1739 1740 1741 1742 1743 1744 1745 1746 1747 1748 1749 1750 1751 1752 1753 1754 1755 1756 1757 1758 1759 1760 1761 1762 1763 1764 1765 1766 1767 1768 1769 1770 1771 1772 1773 1774 1775 1776 1777 1778 1779 1780 1781 1782 1783 1784 1785 1786 1787 1788 1789 1790 1791 1792 1793 1794 1795 1796 1797 1798 1799 1800 1801 1802 1803 1804 1805 1806 1807 1808 1809 1810 1811 1812 1813 1814 1815 1816 1817 1818 1819 1820 1821 1822 1823 1824 1825 1826 1827 1828 1829 1830 1831 1832 1833 1834 1835 1836 1837 1838 1839 1840 1841 1842 1843 1844 1845 1846 1847 1848 1849 1850 1851 1852 1853 1854 1855 1856 1857 1858 1859 1860 1861 1862 1863 1864 1865 1866 1867 1868 1869 1870 1871 1872 1873 1874 1875 1876 1877 1878 1879 1880 1881 1882 1883 1884 1885 1886 1887 1888 1889 1890 1891 1892 1893 1894 1895 1896 1897 1898 1899 1900 1901 1902 1903 1904 1905 1906 1907 1908 1909 1910 1911 1912 1913 1914 1915 1916 1917 1918 1919 1920 1921 1922 1923 1924 1925 1926 1927 1928 1929 1930 1931 1932 1933 1934 1935 1936 1937 1938 1939 1940 1941 1942 1943 1944 1945 1946 1947 1948 1949 1950 1951 1952 1953 1954 1955 1956 1957 1958 1959 1960 1961 1962 1963 1964 1965 1966 1967 1968 1969 1970 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 2025 2026 2027 2028 2029 2030 2031 2032 2033 2034 2035 2036 2037 2038 2039 2040 2041 2042 2043 2044 2045 2046 2047 2048 2049 2050 2051 2052 2053 2054 2055 2056 2057 2058 2059 2060 2061 2062 2063 2064 2065 2066 2067 2068 2069 2070 2071 2072 2073 2074 2075 2076 2077 2078 2079 2080 2081 2082 2083 2084 2085 2086 2087 2088 2089 2090 2091 2092 2093 2094 2095 2096 2097 2098 2099 2100 2101 2102 2103 2104 2105 2106 2107 2108 2109 2110 2111 2112 2113 2114 2115 2116 2117 2118 2119 2120 2121 2122 2123 2124 2125 2126 2127 2128 2129 2130 2131 2132 2133 2134 2135 2136 2137 2138 2139 2140 2141 2142 2143 2144 2145 2146 2147 2148 2149 2150 2151 2152 2153 2154 2155 2156 2157 2158 2159 2160 2161 2162 2163 2164 2165 2166 2167 2168 2169 2170 2171 2172 2173 2174 2175 2176 2177 2178 2179 2180 2181 2182 2183 2184 2185 2186 2187 2188 2189 2190 2191 2192 2193 2194 2195 2196 2197 2198 2199 2200 2201 2202 2203 2204 2205 2206 2207 2208 2209 2210 2211 2212 2213 2214 2215 2216 2217 2218 2219 2220 2221 2222 2223 2224 2225 2226 2227 2228 2229 2230 2231 2232 2233 2234 2235 2236 2237 2238 2239 2240 2241 2242 2243 2244 2245 2246 2247 2248 2249 2250 2251 2252 2253 2254 2255 2256 2257 2258 2259 2260 2261 2262 2263 2264 2265 2266 2267 2268 2269 2270 2271 2272 2273 2274 2275 2276 2277 2278 2279 2280 2281 2282 2283 2284 2285 2286 2287 2288 2289 2290 2291 2292 2293 2294 2295 2296 2297 2298 2299 2300 2301 2302 2303 2304 2305 2306 2307 2308 2309 2310 2311 2312 2313 2314 2315 2316 2317 2318 2319 2320 2321 2322 2323 2324 2325 2326 2327 2328 2329 2330 2331 2332 2333 2334 2335 2336 2337 2338 2339 2340 2341 2342 2343 2344 2345 2346 2347 2348 2349 2350 2351 2352 2353 2354 2355 2356 2357 2358 2359 2360 2361 2362 2363 2364 2365 2366 2367 2368 2369 2370 2371 2372 2373 2374 2375 2376 2377 2378 2379 2380 2381 2382 2383 2384 2385 2386 2387 2388 2389 2390 2391 2392 2393 2394 2395 2396 2397 2398 2399 2400 2401 2402 2403 2404 2405 2406 2407 2408 2409 2410 2411 2412 2413 2414 2415 2416 2417 2418 2419 2420 2421 2422 2423 2424 2425 2426 2427 2428 2429 2430 2431 2432 2433 2434 2435 2436 2437 2438 2439 2440 2441 2442 2443 2444 2445 2446 2447 2448 2449 2450 2451 2452 2453 2454 2455 2456 2457 2458 2459 2460 2461 2462 2463 2464 2465 2466 2467 2468 2469 2470 2471 2472 2473 2474 2475 2476 2477 2478 2479 2480 2481 2482 2483 2484 2485 2486 2487 2488 2489 2490 2491 2492 2493 2494 2495 2496 2497 2498 2499 2500 2501 2502 2503 2504 2505 2506 2507 2508 2509 2510 2511 2512 2513 2514 2515 2516 2517 2518 2519 2520 2521 2522 2523 2524 2525 2526 2527 2528 2529 2530 2531 2532 2533 2534 2535 2536 2537 2538 2539 2540 2541 2542 2543 2544 2545 2546 2547 2548 2549 2550 2551 2552 2553 2554 2555 2556 2557 2558 2559 2560 2561 2562 2563 2564 2565 2566 2567 2568 2569 2570 2571 2572 2573 2574 2575 2576 2577 2578 2579 2580 2581 2582 2583 2584 2585 2586 2587 2588 2589 2590 2591 2592 2593 2594 2595 2596 2597 2598 2599 2600 2601 2602 2603 2604 2605 2606 2607 2608 2609 2610 2611 2612 2613 2614 2615 2616 2617 2618 2619 2620 2621 2622 2623 2624 2625 2626 2627 2628 2629 2630 2631 2632 2633 2634 2635 2636 2637 2638 2639 2640 2641 2642 2643 2644 2645 2646 2647 2648 2649 2650 2651 2652 2653 2654 2655 2656 2657 2658 2659 2660 2661 2662 2663 2664 2665 2666 2667 2668 2669 2670 2671 2672 2673 2674 2675 2676 2677 2678 2679 2680 2681 2682 2683 2684 2685 2686 2687 2688 2689 2690 2691 2692 2693 2694 2695 2696 2697 2698 2699 2700 2701 2702 2703 2704 2705 2706 2707 2708 2709 2710 2711 2712 2713 2714 2715 2716 2717 2718 2719 2720 2721 2722 2723 2724 2725 2726 2727 2728 2729 2730 2731 2732 2733 2734 2735 2736 2737 2738 2739 2740 2741 2742 2743 2744 2745 2746 2747 2748 2749 2750 2751 2752 2753 2754 2755 2756 2757 2758 2759 2760 2761 2762 2763 2764 2765 2766 2767 2768 2769 2770 2771 2772 2773 2774 2775 2776 2777 2778 2779 2780 2781 2782 2783 2784 2785 2786 2787 2788 2789 2790 2791 2792 2793 2794 2795 2796 2797 2798 2799 2800 2801 2802 2803 2804 2805 2806 2807 2808 2809 2810 2811 2812 2813 2814 2815 2816 2817 2818 2819 2820 2821 2822 2823 2824 2825 2826 2827 2828 2829 2830 2831 2832 2833 2834 2835 2836 2837 2838 2839 2840 2841 2842 2843 2844 2845 2846 2847 2848 2849 2850 2851 2852 2853 2854 2855 2856 2857 2858 2859 2860 2861 2862 2863 2864 2865 2866 2867 2868 2869 2870 2871 2872 2873 2874 2875 2876 2877 2878 2879 2880 2881 2882 2883 2884 2885 2886 2887 2888 2889 2890 2891 2892 2893 2894 2895 2896 2897 2898 2899 2900 2901 2902 2903 2904 2905 2906 2907 2908 2909 2910 2911 2912 2913 2914 2915 2916 2917 2918 2919 2920 2921 2922 2923 2924 2925 2926 2927 2928 2929 2930 2931 2932 2933 2934 2935 2936 2937 2938 2939 2940 2941 2942 2943 2944 2945 2946 2947 2948 2949 2950 2951 2952 2953 2954 2955 2956 2957 2958 2959 2960 2961 2962 2963 2964 2965 2966 2967 2968 2969 2970 2971 2972 2973 2974 2975 2976 2977 2978 2979 2980 2981 2982 2983 2984 2985 2986 2987 2988 2989 2990 2991 2992 2993 2994 2995 2996 2997 2998 2999 3000 3001 3002 3003 3004 3005 3006 3007 3008 3009 3010 3011 3012 3013 3014 3015 3016 3017 3018 3019 3020 3021 3022 3023 3024 3025 3026 3027 3028 3029 3030 3031 3032 3033 3034 3035 3036 3037 3038 3039 3040 3041 3042 3043 3044 3045 3046 3047 3048 3049 3050 3051 3052 3053 3054 3055 3056 3057 3058 3059 3060 3061 3062 3063 3064 3065 3066 3067 3068 3069 3070 3071 3072 3073 3074 3075 3076 3077 3078 3079 3080 3081 3082 3083 3084 3085 3086 3087 3088 3089 3090 3091 3092 3093 3094 3095 3096 3097 3098 3099 3100 3101 3102 3103 3104 3105 3106 3107 3108 3109 3110 3111 3112 3113 3114 3115 3116 3117 3118 3119 3120 3121 3122 3123 3124 3125 3126 3127 3128 3129 3130 3131 3132 3133 3134 3135 3136 3137 3138 3139 3140 3141 3142 3143 3144 3145 3146 3147 3148 3149 3150 3151 3152 3153 3154 3155 3156 3157 3158 3159 3160 3161 3162 3163 3164 3165 3166 3167 3168 3169 3170 3171 3172 3173 3174 3175 3176 3177 3178 3179 3180 3181 3182 3183 3184 3185 3186 3187 3188 3189 3190 3191 3192 3193 3194 3195 3196 3197 3198 3199 3200 3201 3202 3203 3204 3205 3206 3207 3208 3209 3210 3211 3212 3213 3214 3215 3216 3217 3218 3219 3220 3221 3222 3223 3224 3225 3226 3227 3228 3229 3230 3231 3232 3233 3234 3235 3236 3237 3238 3239 3240 3241 3242 3243 3244 3245 3246 3247 3248 3249 3250 3251 3252 3253 3254 3255 3256 3257 3258 3259 3260 3261 3262 3263 3264 3265 3266 3267 3268 3269 3270 3271 3272 3273 3274 3275 3276 3277 3278 3279 3280 3281 3282 3283 3284 3285 3286 3287 3288 3289 3290 3291 3292 3293 3294 3295 3296 3297 3298 3299 3300 3301 3302 3303 3304 3305 3306 3307 3308 3309 3310 3311 3312 3313 3314 3315 3316 3317 3318 3319 3320 3321 3322 3323 3324 3325 3326 3327 3328 3329 3330 3331 3332 3333 3334 3335 3336 3337 3338 3339 3340 3341 3342 3343 3344 3345 3346 3347 3348 3349 3350 3351 3352 3353 3354 3355 3356 3357 3358 3359 3360 3361 3362 3363 3364 3365 3366 3367 3368 3369 3370 3371 3372 3373 3374 3375 3376 3377 3378 3379 3380 3381 3382 3383 3384 3385 3386 3387 3388 3389 3390 3391 3392 3393 3394 3395 3396 3397 3398 3399 3400 3401 3402 3403 3404 3405 3406 3407 3408 3409 3410 3411 3412 3413 3414 3415 3416 3417 3418 3419 3420 3421 3422 3423 3424 3425 3426 3427 3428 3429 3430 3431 3432 3433 3434 3435 3436 3437 3438 3439 3440 3441 3442 3443 3444 3445 3446 3447 3448 3449 3450 3451 3452 3453 3454 3455 3456 3457 3458 3459 3460 3461 3462 3463 3464 3465 3466 3467 3468 3469 3470 3471 3472 3473 3474 3475 3476 3477 3478 3479 3480 3481 3482 3483 3484 3485 3486 3487 3488 3489 3490 3491 3492 3493 3494 3495 3496 3497 3498 3499 3500 3501 3502 3503 3504 3505 3506 3507 3508 3509 3510 3511 3512 3513 3514 3515 3516 3517 3518 3519 3520 3521 3522 3523 3524 3525 3526 3527 3528 3529 3530 3531 3532 3533 3534 3535 3536 3537 3538 3539 3540 3541 3542 3543 3544 3545 3546 3547 3548 3549 3550 3551 3552 3553 3554 3555 3556 3557 3558 3559 3560 3561 3562 3563 3564 3565 3566 3567 3568 3569 3570 3571 3572 3573 3574 3575 3576 3577 3578 3579 3580 3581 3582 3583 3584 3585 3586 3587 3588 3589 3590 3591 3592 3593 3594 3595 3596 3597 3598 3599 3600 3601 3602 3603 3604 3605 3606 3607 3608 3609 3610 3611 3612 3613 3614 3615 3616 3617 3618 3619 3620 3621 3622 3623 3624 3625 3626 3627 3628 3629 3630 3631 3632 3633 3634 3635 3636 3637 3638 3639 3640 3641 3642 3643 3644 3645 3646 3647 3648 3649 3650 3651 3652 3653 3654 3655 3656 3657 3658 3659 3660 3661 3662 3663 3664 3665 3666 3667 3668 3669 3670 3671 3672 3673 3674 3675 3676 3677 3678 3679 3680 3681 3682 3683 3684 3685 3686 3687 3688 3689 3690 3691 3692 3693 3694 3695 3696 3697 3698 3699 3700 3701 3702 3703 3704 3705 3706 3707 3708 3709 3710 3711 3712 3713 3714 3715 3716 3717 3718 3719 3720 3721 3722 3723 3724 3725 3726 3727 3728 3729 3730 3731 3732 3733 3734 3735 3736 3737 3738 3739 3740 3741 3742 3743 3744 3745 3746 3747 3748 3749 3750 3751 3752 3753 3754 3755 3756 3757 3758 3759 3760 3761 3762 3763 3764 3765 3766 3767 3768 3769 3770 3771 3772 3773 3774 3775 3776 3777 3778 3779 3780 3781 3782 3783 3784 3785 3786 3787 3788 3789 3790 3791 3792 3793 3794 3795 3796 3797 3798 3799 3800 3801 3802 3803 3804 3805 3806 3807 3808 3809 3810 3811 3812 3813 3814 3815 3816 3817 3818 3819 3820 3821 3822 3823 3824 3825 3826 3827 3828 3829 3830 3831 3832 3833 3834 3835 3836 3837 3838 3839 3840 3841 3842 3843 3844 3845 3846 3847 3848 3849 3850 3851 3852 3853 3854 3855 3856 3857 3858 3859 3860 3861 3862 3863 3864 3865 3866 3867 3868 3869 3870 3871 3872 3873 3874 3875 3876 3877 3878 3879 3880 3881 3882 3883 3884 3885 3886 3887 3888 3889 3890 3891 3892 3893 3894 3895 3896 3897 3898 3899 3900 3901 3902 3903 3904 3905 3906 3907 3908 3909 3910 3911 3912 3913 3914 3915 3916 3917 3918 3919 3920 3921 3922 3923 3924 3925 3926 3927 3928 3929 3930 3931 3932 3933 3934 3935 3936 3937 3938 3939 3940 3941 3942 3943 3944 3945 3946 3947 3948 3949 3950 3951 3952 3953 3954 3955 3956 3957 3958 3959 3960 3961 3962 3963 3964 3965 3966 3967 3968 3969 3970 3971 3972 3973 3974 3975 3976 3977 3978 3979 3980 3981 3982 3983 3984 3985 3986 3987 3988 3989 3990 3991 3992 3993 3994 3995 3996 3997 3998 3999 4000 4001 4002 4003 4004 4005 4006 4007 4008 4009 4010 4011 4012 4013 4014 4015 4016 4017 4018 4019 4020 4021 4022 4023 4024 4025 4026 4027 4028 4029 4030 4031 4032 4033 4034 4035 4036 4037 4038 4039 4040 4041 4042 4043 4044 4045 4046 4047 4048 4049 4050 4051 4052 4053 4054 4055 4056 4057 4058 4059 4060 4061 4062 4063 4064 4065 4066 4067 4068 4069 4070 4071 4072 4073 4074 4075 4076 4077 4078 4079 4080 4081 4082 4083 4084 4085 4086 4087 4088 4089 4090 4091 4092 4093 4094 4095 4096 4097 4098 4099 4100 4101 4102 4103 4104 4105 4106 4107 4108 4109 4110 4111 4112 4113 4114 4115 4116 4117 4118 4119 4120 4121 4122 4123 4124 4125 4126 4127 4128 4129 4130 4131 4132 4133 4134 4135 4136 4137 4138 4139 4140 4141 4142 4143 4144 4145 4146 4147 4148 4149 4150 4151 4152 4153 4154 4155 4156 4157 4158 4159 4160 4161 4162 4163 4164 4165 4166 4167 4168 4169 4170 4171 4172 4173 4174 4175 4176 4177 4178 4179 4180 4181 4182 4183 4184 4185 4186 4187 4188 4189 4190 4191 4192 4193 4194 4195 4196 4197 4198 4199 4200 4201 4202 4203 4204 4205 4206 4207 4208 4209 4210 4211 4212 4213 4214 4215 4216 4217 4218 4219 4220 4221 4222 4223 4224 4225 4226 4227 4228 4229 4230 4231 4232 4233 4234 4235 4236 4237 4238 4239 4240 4241 4242 4243 4244 4245 4246 4247 4248 4249 4250 4251 4252 4253 4254 4255 4256 4257 4258 4259 4260 4261 4262 4263 4264 4265 4266 4267 4268 4269 4270 4271 4272 4273 4274 4275 4276 4277 4278 4279 4280 4281 4282 4283 4284 4285 4286 4287 4288 4289 4290 4291 4292 4293 4294 4295 4296 4297 4298 4299 4300 4301 4302 4303 4304 4305 4306 4307 4308 4309 4310 4311 4312 4313 4314 4315 4316 4317 4318 4319 4320 4321 4322 4323 4324 4325 4326 4327 4328 4329 4330 4331 4332 4333 4334 4335 4336 4337 4338 4339 4340 4341 4342 4343 4344 4345 4346 4347 4348 4349 4350 4351 4352 4353 4354 4355 4356 4357 4358 4359 4360 4361 4362 4363 4364 4365 4366 4367 4368 4369 4370 4371 4372 4373 4374 4375 4376 4377 4378 4379 4380 4381 4382 4383 4384 4385 4386 4387 4388 4389 4390 4391 4392 4393 4394 4395 4396 4397 4398 4399 4400 4401 4402 4403 4404 4405 4406 4407 4408 4409 4410 4411 4412 4413 4414 4415 4416 4417 4418 4419 4420 4421 4422 4423 4424 4425 4426 4427 4428 4429 4430 4431 4432 4433 4434 4435 4436 4437 4438 4439 4440 4441 4442 4443 4444 4445 4446 4447 4448 4449 4450 4451 4452 4453 4454 4455 4456 4457 4458 4459 4460 4461 4462 4463 4464 4465 4466 4467 4468 4469 4470 4471 4472 4473 4474 4475 4476 4477 4478 4479 4480 4481 4482 4483 4484 4485 4486 4487 4488 4489 4490 4491 4492 4493 4494 4495 4496 4497 4498 4499 4500 4501 4502 4503 4504 4505 4506 4507 4508 4509 4510 4511 4512 4513 4514 4515 4516 4517 4518 4519 4520 4521 4522 4523 4524 4525 4526 4527 4528 4529 4530 4531 4532 4533 4534 4535 4536 4537 4538 4539 4540 4541 4542 4543 4544 4545 4546 4547 4548 4549 4550 4551 4552 4553 4554 4555 4556 4557 4558 4559 4560 4561 4562 4563 4564 4565 4566 4567 4568 4569 4570 4571 4572 4573 4574 4575 4576 4577 4578 4579 4580 4581 4582 4583 4584 4585 4586 4587 4588 4589 4590 4591 4592 4593 4594 4595 4596 4597 4598 4599 4600 4601 4602 4603 4604 4605 4606 4607 4608 4609 4610 4611 4612 4613 4614 4615 4616 4617 4618 4619 4620 4621 4622 4623 4624 4625 4626 4627 4628 4629 4630 4631 4632 4633 4634 4635 4636 4637 4638 4639 4640 4641 4642 4643 4644 4645 4646 4647 4648 4649 4650 4651 4652 4653 4654 4655 4656 4657 4658 4659 4660 4661 4662 4663 4664 4665 4666 4667 4668 4669 4670 4671 4672 4673 4674 4675 4676 4677 4678 4679 4680 4681 4682 4683 4684 4685 4686 4687 4688 4689 4690 4691 4692 4693 4694 4695 4696 4697 4698 4699 4700 4701 4702 4703 4704 4705 4706 4707 4708 4709 4710 4711 4712 4713 4714 4715 4716 4717 4718 4719 4720 4721 4722 4723 4724 4725 4726 4727 4728 4729 4730 4731 4732 4733 4734 4735 4736 4737 4738 4739 4740 4741 4742 4743 4744 4745 4746 4747 4748 4749 4750 4751 4752 4753 4754 4755 4756 4757 4758 4759 4760 4761 4762 4763 4764 4765 4766 4767 4768 4769 4770 4771 4772 4773 4774 4775 4776 4777 4778 4779 4780 4781 4782 4783 4784 4785 4786 4787 4788 4789 4790 4791 4792 4793 4794 4795 4796 4797 4798 4799 4800 4801 4802 4803 4804 4805 4806 4807 4808 4809 4810 4811 4812 4813 4814 4815 4816 4817 4818 4819 4820 4821 4822 4823 4824 4825 4826 4827 4828 4829 4830 4831 4832 4833 4834 4835 4836 4837 4838 4839 4840 4841 4842 4843 4844 4845 4846 4847 4848 4849 4850 4851 4852 4853 4854 4855 4856 4857 4858 4859 4860 4861 4862 4863 4864 4865 4866 4867 4868 4869 4870 4871 4872 4873 4874 4875 4876 4877 4878 4879 4880 4881 4882 4883 4884 4885 4886 4887 4888 4889 4890 4891 4892 4893 4894 4895 4896 4897 4898 4899 4900 4901 4902 4903 4904 4905 4906 4907 4908 4909 4910 4911 4912 4913 4914 4915 4916 4917 4918 4919 4920 4921 4922 4923 4924 4925 4926 4927 4928 4929 4930 4931 4932 4933 4934 4935 4936 4937 4938 4939 4940 4941 4942 4943 4944 4945 4946 4947 4948 4949 4950 4951 4952 4953 4954 4955 4956 4957 4958 4959 4960 4961 4962 4963 4964 4965 4966 4967 4968 4969 4970 4971 4972 4973 4974 4975 4976 4977 4978 4979 4980 4981 4982 4983 4984 4985 4986 4987 4988 4989 4990 4991 4992 4993 4994 4995 4996 4997 4998 4999 5000 5001 5002 5003 5004 5005 5006 5007 5008 5009 5010 5011 5012 5013 5014 5015 5016 5017 5018 5019 5020 5021 5022 5023 5024 5025 5026 5027 5028 5029 5030 5031 5032 5033 5034 5035 5036 5037 5038 5039 5040 5041 5042 5043 5044 5045 5046 5047 5048 5049 5050 5051 5052 5053 5054 5055 5056 5057 5058 5059 5060 5061 5062 5063 5064 5065 5066 5067 5068 5069 5070 5071 5072 5073 5074 5075 5076 5077 5078 5079 5080 5081 5082 5083 5084 5085 5086 5087 5088 5089 5090 5091 5092 5093 5094 5095 5096 5097 5098 5099 5100 5101 5102 5103 5104 5105 5106 5107 5108 5109 5110 5111 5112 5113 5114 5115 5116 5117 5118 5119 5120 5121 5122 5123 5124 5125 5126 5127 5128 5129 5130 5131 5132 5133 5134 5135 5136 5137 5138 5139 5140 5141 5142 5143 5144 5145 5146 5147 5148 5149 5150 5151 5152 5153 5154 5155 5156 5157 5158 5159 5160 5161 5162 5163 5164 5165 5166 5167 5168 5169 5170 5171 5172 5173 5174 5175 5176 5177 5178 5179 5180 5181 5182 5183 5184 5185 5186 5187 5188 5189 5190 5191 5192 5193 5194 5195 5196 5197 5198 5199 5200 5201 5202 5203 5204 5205 5206 5207 5208 5209 5210 5211 5212 5213 5214 5215 5216 5217 5218 5219 5220 5221 5222 5223 5224 5225 5226 5227 5228 5229 5230 5231 5232 5233 5234 5235 5236 5237 5238 5239 5240 5241 5242 5243 5244 5245 5246 5247 5248 5249 5250 5251 5252 5253 5254 5255 5256 5257 5258 5259 5260 5261 5262 5263 5264 5265 5266 5267 5268 5269 5270 5271 5272 5273 5274 5275 5276 5277 5278 5279 5280 5281 5282 5283 5284 5285 5286 5287 5288 5289 5290 5291 5292 5293 5294 5295 5296 5297 5298 5299 5300 5301 5302 5303 5304 5305 5306 5307 5308 5309 5310 5311 5312 5313 5314 5315 5316 5317 5318 5319 5320 5321 5322 5323 5324 5325 5326 5327 5328 5329 5330 5331 5332 5333 5334 5335 5336 5337 5338 5339 5340 5341 5342 5343 5344 5345 5346 5347 5348 5349 5350 5351 5352 5353 5354 5355 5356 5357 5358 5359 5360 5361 5362 5363 5364 5365 5366 5367 5368 5369 5370 5371 5372 5373 5374 5375 5376 5377 5378 5379 5380 5381 5382 5383 5384 5385 5386 5387 5388 5389 5390 5391 5392 5393 5394 5395 5396 5397 5398 5399 5400 5401 5402 5403 5404 5405 5406 5407 5408 5409 5410 5411 5412 5413 5414 5415 5416 5417 5418 5419 5420 5421 5422 5423 5424 5425 5426 5427 5428 5429 5430 5431 5432 5433 5434 5435 5436 5437 5438 5439 5440 5441 5442 5443 5444 5445 5446 5447 5448 5449 5450 5451 5452 5453 5454 5455 5456 5457 5458 5459 5460 5461 5462 5463 5464 5465 5466 5467 5468 5469 5470 5471 5472 5473 5474 5475 5476 5477 5478 5479 5480 5481 5482 5483 5484 5485 5486 5487 5488 5489 5490 5491 5492 5493 5494 5495 5496 5497 5498 5499 5500 5501 5502 5503 5504 5505 5506 5507 5508 5509 5510 5511 5512 5513 5514 5515 5516 5517 5518 5519 5520 5521 5522 5523 5524 5525 5526 5527 5528 5529 5530 5531 5532 5533 5534 5535 5536 5537 5538 5539 5540 5541 5542 5543 5544 5545 5546 5547 5548 5549 5550 5551 5552 5553 5554 5555 5556 5557 5558 5559 5560 5561 5562 5563 5564 5565 5566 5567 5568 5569 5570 5571 5572 5573 5574 5575 5576 5577 5578 5579 5580 5581 5582 5583 5584 5585 5586 5587 5588 5589 5590 5591 5592 5593 5594 5595 5596 5597 5598 5599 5600 5601 5602 5603 5604 5605 5606 5607 5608 5609 5610 5611 5612 5613 5614 5615 5616 5617 5618 5619 5620 5621 5622 5623 5624 5625 5626 5627 5628 5629 5630 5631 5632 5633 5634 5635 5636 5637 5638 5639 5640 5641 5642 5643 5644 5645 5646 5647 5648 5649 5650 5651 5652 5653 5654 5655 5656 5657 5658 5659 5660 5661 5662 5663 5664 5665 5666 5667 5668 5669 5670 5671 5672 5673 5674 5675 5676 5677 5678 5679 5680 5681 5682 5683 5684 5685 5686 5687 5688 5689 5690 5691 5692 5693 5694 5695 5696 5697 5698 5699 5700 5701 5702 5703 5704 5705 5706 5707 5708 5709 5710 5711 5712 5713 5714 5715 5716 5717 5718 5719 5720 5721 5722 5723 5724 5725 5726 5727 5728 5729 5730 5731 5732 5733 5734 5735 5736 5737 5738 5739 5740 5741 5742 5743 5744 5745 5746 5747 5748 5749 5750 5751 5752 5753 5754 5755 5756 5757 5758 5759 5760 5761 5762 5763 5764 5765 5766 5767 5768 5769 5770 5771 5772 5773 5774 5775 5776 5777 5778 5779 5780 5781 5782 5783 5784 5785 5786 5787 5788 5789 5790 5791 5792 5793 5794 5795 5796 5797 5798 5799 5800 5801 5802 5803 5804 5805 5806 5807 5808 5809 5810 5811 5812 5813 5814 5815 5816 5817 5818 5819 5820 5821 5822 5823 5824 5825 5826 5827 5828 5829 5830 5831 5832 5833 5834 5835 5836 5837 5838 5839 5840 5841 5842 5843 5844 5845 5846 5847 5848 5849 5850 5851 5852 5853 5854 5855 5856 5857 5858 5859 5860 5861 5862 5863 5864 5865 5866 5867 5868 5869 5870 5871 5872 5873 5874 5875 5876 5877 5878 5879 5880 5881 5882 5883 5884 5885 5886 5887 5888 5889 5890 5891 5892 5893 5894 5895 5896 5897 5898 5899 5900 5901 5902 5903 5904 5905 5906 5907 5908 5909 5910 5911 5912 5913 5914 5915 5916 5917 5918 5919 5920 5921 5922 5923 5924 5925 5926 5927 5928 5929 5930 5931 5932 5933 5934 5935 5936 5937 5938 5939 5940 5941 5942 5943 5944 5945 5946 5947 5948 5949 5950 5951 5952 5953 5954 5955 5956 5957 5958 5959 5960 5961 5962 5963 5964 5965 5966 5967 5968 5969 5970 5971 5972 5973 5974 5975 5976 5977 5978 5979 5980 5981 5982 5983 5984 5985 5986 5987 5988 5989 5990 5991 5992 5993 5994 5995 5996 5997 5998 5999 6000 6001 6002 6003 6004 6005 6006 6007 6008 6009 6010 6011 6012 6013 6014 6015 6016 6017 6018 6019 6020 6021 6022 6023 6024 6025 6026 6027 6028 6029 6030 6031 6032 6033 6034 6035 6036 6037 6038 6039 6040 6041 6042 6043 6044 6045 6046 6047 6048 6049 6050 6051 6052 6053 6054 6055 6056 6057 6058 6059 6060 6061 6062 6063 6064 6065 6066 6067 6068 6069 6070 6071 6072 6073 6074 6075 6076 6077 6078 6079 6080 6081 6082 6083 6084 6085 6086 6087 6088 6089 6090 6091 6092 6093 6094 6095 6096 6097 6098 6099 6100 6101 6102 6103 6104 6105 6106 6107 6108 6109 6110 6111 6112 6113 6114 6115 6116 6117 6118 6119 6120 6121 6122 6123 6124 6125 6126 6127 6128 6129 6130 6131 6132 6133 6134 6135 6136 6137 6138 6139 6140 6141 6142 6143 6144 6145 6146 6147 6148 6149 6150 6151 6152 6153 6154 6155 6156 6157 6158 6159 6160 6161 6162 6163 6164 6165 6166 6167 6168 6169 6170 6171 6172 6173 6174 6175 6176 6177 6178 6179 6180 6181 6182 6183 6184 6185 6186 6187 6188 6189 6190 6191 6192 6193 6194 6195 6196 6197 6198 6199 6200 6201 6202 6203 6204 6205 6206 6207 6208 6209 6210 6211 6212 6213 6214 6215 6216 6217 6218 6219 6220 6221 6222 6223 6224 6225 6226 6227 6228 6229 6230 6231 6232 6233 6234 6235 6236 6237 6238 6239 6240 6241 6242 6243 6244 6245 6246 6247 6248 6249 6250 6251 6252 6253 6254 6255 6256 6257 6258 6259 6260 6261 6262 6263 6264 6265 6266 6267 6268 6269 6270 6271 6272 6273 6274 6275 6276 6277 6278 6279 6280 6281 6282 6283 6284 6285 6286 6287 6288 6289 6290 6291 6292 6293 6294 6295 6296 6297 6298 6299 6300 6301 6302 6303 6304 6305 6306 6307 6308 6309 6310 6311 6312 6313 6314 6315 6316 6317 6318 6319 6320 6321 6322 6323 6324 6325 6326 6327 6328 6329 6330 6331 6332 6333 6334
|
Mesa 23.3.0 Release Notes / 2023-11-29
======================================
Mesa 23.3.0 is a new development release. People who are concerned
with stability and reliability should stick with a previous release or
wait for Mesa 23.3.1.
Mesa 23.3.0 implements the OpenGL 4.6 API, but the version reported by
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
4.6 is **only** available if requested at context creation.
Compatibility contexts may report a lower version depending on each driver.
Mesa 23.3.0 implements the Vulkan 1.3 API, but the version reported by
the apiVersion property of the VkPhysicalDeviceProperties struct
depends on the particular driver being used.
SHA256 checksum
---------------
::
50f729dd60ed6335b989095baad81ef5edf7cfdd4b4b48b9b955917cb07d69c5 mesa-23.3.0.tar.xz
New drivers
-----------
- NVK: A Vulkan driver for Nvidia hardware
New features
------------
- VK_EXT_pipeline_robustness on ANV
- VK_KHR_maintenance5 on RADV
- OpenGL ES 3.1 on Asahi
- GL_ARB_compute_shader on Asahi
- GL_ARB_shader_atomic_counters on Asahi
- GL_ARB_shader_image_load_store on Asahi
- GL_ARB_shader_image_size on Asahi
- GL_ARB_shader_storage_buffer_object on Asahi
- GL_ARB_sample_shading on Asahi
- GL_OES_sample_variables on Asahi
- GL_OES_shader_multisample_interpolation on Asahi
- GL_OES_gpu_shader5 on Asahi
- EGL_ANDROID_blob_cache works when disk caching is disabled
- VK_KHR_cooperative_matrix on RADV/GFX11+
Bug fixes
---------
- crash in si_update_tess_io_layout_state during _mesa_ReadPixels (radeonsi_dri, mesa 23.2.1)
- mesa: vertex attrib regression
- [RADV] War Thunder has some grass flickering.
- radv: satisfactory broken shader
- RADV problem with R7 M440 in some games
- gpu driver crashes when opening ingame map playing dead space 2023
- [anv] Valheim water misrendering
- EGL/v3d: EGL applications under a X compositor doesn't work
- RADV: trunc_coord breaks ambient occlusion in Dirt Rally and other games
- radv: Mass Effect Legendary Edition: a line going across the screen is visible in some areas with Ambient Occlusion enabled
- anv: DIRT5 gfx11_generated_draws_spv_source triggers "assert(!copy_value_is_divergent(src) || copy_value_is_divergent(dest));"
- panfrost: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12 buffers
- [RADV][TONGA] - BeamNG.drive (284160) - Artifacts are present when looking at the skybox.
- LEGO Star Wars: The Skywalker Saga graphical glitches (DXVK) on R9 380
- [radv] Crypt not rendering properly
- Leaks of DescriptorSet debug names
- [Tracing flake] Missing geometry in trace\@freedreno-a630\@freedoom\@freedoom-phase2-gl-high.trace
- Unreal Engine 5.2 virtual shadow maps have glitchy/lazy tile updates
- RADV: Visual glitches in Unreal Engine 5.2.1 when using material with anisotropy and light channel 2
- radv: Regression with UE5 test
- SIGSEGV with MESA_VK_TRACE=rgp and compute only queue
- [ANV] Corruptions in Battlefield 4
- anv regression w/ commit e488773b29d97 ("anv: Fast clear depth/stencil surface in vkCmdClearAttachments")
- ir3: dEQP-GLES31.functional.synchronization.inter_invocation.image_atomic_read_write crash on a6xx gen4
- Zink + Venus: driver can't handle INVALID<->LINEAR!
- Anv: Particles have black square artifacts on Counter Strike 2 on Skylake
- Lords of the Fallen 2023 Red Eye mode crashing game and desktop
- [radeonsi] [vulkan] [23.3-rc1 regression] Video output corrupted in QMplay2 with Vulkan renderer
- [BISECTED] ac/radeon commit somehow breaks nv12 surface from HEVC decode
- Parsec displays completely green screen with hardware decoder selected while using Mesa 23.3 and Mesa 24
- H264 to H264 transcode output corruption with gst-vaapi
- opencl-jpeg-encoder does not work with nouveau/rusticl, works with nouveau/clover
- [R600] X-plane 11 demo (Linux Native) crashes upon launch on HD5870 and HD6970
- Ubuntu 23.10 build error with rusticl_opencl_bindings.rs
- Rusticl fails to build
- ANV not handling VkMutableDescriptorTypeCreateInfoEXT::pMutableDescriptorTypeLists[i] being out of range
- tu: Wolfenstein: The New Order misrenders on a740
- DRI_PRIME fails with ACO only radeonsi
- nir_to_tgsi: Incorrect handling of indirect array access
- ANV gen9 32 bit vulkan asserts on many cts tests
- GPU hang observed while launching 3DMark Wildlife Unlimited on MTL
- ac/gpu_info: Query maximum submitted IBs from the kernel
- RADV: regression in 23.2.1 causing GPU hang with RDNA1 in various UE5 games
- GPU page faults reported while playing Talos Principle 2 (demo)
- No CCS_E scanout on tgl+ with ANV
- anv: Modifier tests assert-fail on TGL+
- ci: zink-tu jobs no longer included in manual pipelines
- [ANV][A770] GravityMark segfaults and buffer allocation errors
- etnaviv: gc2000 gles2 regression
- ci_run_n_monitor: pipeline finding unreliable
- nvk: Implement VK_EXT_dynamic_rendering_unused_attachments
- anv: jsl timeline semaphores flaky
- anv: OOB access in vkDestroyDevice?
- nvk: Implement VK_EXT_primitive_topology_list_restart
- nvk: Implement VK_EXT_image_sliced_view_of_3d
- nvk: Implement VK_KHR_workgroup_memory_explicit_layout
- util/macros: BITFIELD64_RANGE raises an error with mesa-clang if we try to set last bit
- r300/r400 regression; can't compile \`if/then` in shaders
- iris: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12/P010 buffers
- nvk: Implement VK_EXT_depth_bias_control
- ICL/zink: gpu hang on 'piglit.object namespace pollution.framebuffer with gldrawpixels'
- [R600] Wolfenstein: The New Order text glitch on menu
- need extension to request image/texture not use data dependent compression
- rusticl: segfault in clCreateKernel on AMD Instinct MI100
- !25587 broke xserver
- GPU Hang in Deep Rock Galactic on DG2
- intel: Wrong length for 3DSTATE_3D_MODE on gfx125
- [radeonsi] Wargame: Red Dragon /w OpenGL stopped working with ACO
- traces job reference images missing again sometimes
- Vulkan Texture/Polygon Glitches in Games
- freedreno: dmabuf modify query ignores format
- virgl: removing PIPE_CAP_CLEAR_TEXTURE completely breaks virglrenderer
- Turnip build error on termux
- failiure in amd llvm helper
- failiure in amd llvm helper
- radv_amdgpu_cs_submit: Assertion \`chunk_data[request->number_of_ibs - 1].ib_data.ip_type == request->ip_type' failed.
- hasvk: subgroups regression
- radeonsi: broken hardware decoding (vaapi/vulkan) on RDNA2 gpu (bisected)
- aco: SwizzleInvocationsMaskedAMD behavior is not correct for reads from inactive lanes
- anv: dEQP-VK.ssbo.phys.layout.random.16bit.scalar.13 slow
- [RDNA3] CS:GO - excessive power consumption and lower performance in Vulkan while MSAA is set to 4x or 8x
- [ICL] piglit.spec.arb_gl_spirv.execution.ssbo.unsized-array regression
- radv: Counter Strike 2 has multiple bugs while rendering smoke grenade effect
- Doom Eternal freezing on NAVI31 with current git
- iris CTS blend test fail with MSAA config on DG2
- anv: 32bit mesa asserts
- RADV: Randomly dissapearing objects in Starfield with RX 5xx and Vega graphics
- anv: missing barrier handling on video engines
- radv: Star Wars The Old Republic hang when DCC is enabled
- radv: Resident Evil 6 hangs 7900XTX GPU when DCC is enabled if in Options go to Display settings
- radv: Resident Evil 6 Benchmark Tool hangs 7900 XTX GPU when DCC is enabled immediately after splash screen
- ANV: fp64 shader leaked
- v3d: noop drm-shim raises some warnings
- freedreno: crashdec/etc chip_id support
- intel: compute dispatches with variable workgroup size have ralloc_asprintf CPU overhead
- ci build issues with builtin types
- freedreno: running angle perf traces with GALLIUM_THREAD=0 crashes
- RadeonSI: glClear() causes clear texture for some frames on RX580
- radeonsi: corruption when seeking video decoded with vaapi in mpv
- Zink/HasVK regression bisected to "gallium: move vertex stride to CSO"
- [radv] [Path Of Exile] - one setting in the workaround file breaks shadows/lighting rendering. Other workaround settings seems obsolete.
- radv: images don't always have extents in RGP
- shader_test causing a crash in compiler
- D3D12: Video decoding requirements are too restrictive. ID3D12VideoDevice3 should not be required.
- Crash in st_ReadPixels
- [regression] intel build issue on i386
- [ANV] [DG2/A770] The Spirit and The Mouse, miscellaneous issues with Mesa Git
- zink on hasvk regression: Assertion \`(dyn)->vi_binding_strides[first_binding + i] == (strides[i])' failed.
- Penumbra: Overture hangs on new game loading screen
- [r300, RV516] Some deqp-gles2\@performance\@shader\@control_statement vertex tests cause hard lockup & reboot in mesa 22.3.1 (regression over 22.1.7) on a Radeon X1550
- v3dv: Add a feature that implicitly copies the linear image to the tiled image prior to sampling from it
- radv: Regression from 266b2cfe5bf3feda16747c50c1638fb5a0426958
- h264 encoding picture showed randomly repeated frames.
- Mesa CI: NAVI10 hangs when running VKCTS on Linux 6.1
- zink: no uniform buffer objects support for v3dv?
- v3dv: Request for VkImageDrmFormatModifierExplicitCreateInfoEXT::pPlaneLayouts support
- [ANV] [DG2/A770] The Spirit and The Mouse, occasional flickering geometry
- [Google][Rex][anv] GLES dEQP test fails in anv when run via ANGLE-on-Venus on ChromeOS ARCVM.
- VAAPI on VCN: bad stream may crash whole gfx system
- Crash after GPU reset
- Bifrost PanVK should not be in CI
- [Intel][Vulkan][Gen12] vkCmdCopyImage() generates garbage data when the destination texture is bound to a piece of used device memory
- mesa: new glcts fails
- tu: GPL support is broken
- lavapipe: ycbcr regression
- aco: Assertion when compiling CP2077 shader
- anv: flakiness on tgl+ with samplemask handling
- [RADV] Dead by Daylight memory leak (shader-related?) on 23.1.6
- r300: optionally convert MULs into output modifier for the following MUL or DOT instructions
- r300: better 1-x presubtract pattern matching
- gpu hang on DG2 when running KHR-GLES31.core.texture_cube_map_array.image_op_tess*
- KHR-GLES31.core.texture_cube_map_array.image_op_tessellation_evaluation_sh fail on GFX12+
- wsi: deadlocks when DISPLAY is changed
- hasvk: Incompatible with minigbm/gralloc4 on Android
- VAAPI: AMDGPU crash on RX 6900 XT on corrupted video
- lavapipe/llvmpipe: shader unregister crash
- [ANV] [DG2/A380] Corruption in Borderlands 3
- blorp regression on dg2
- decouple -Dshader-cache= from EGL_ANDROID_blob_cache
- radv: commit 81641b01555faa4dd1dfc7de2513ad8d63e77ab7 leaded to artifacts in Quake II RTX
- [radv] Colors are distorted in Cyberpunk 2077 with ray tracing enabled
- Forza Horizon 5 stuttering since mesa 23.1.4 / 9b008673 revert as a FIX
- ubsan + gtest build fails
- glCopyTexSubImage2D is very slow on Intel
- NVE4 (GeForce 710) fails to get vdpau in mesa git
- [RADV] red and pink tinted shadows in Overwatch 2 on 7900 XTX
- nouveau prevents hardware acceleration with Chromium (Wayland)
- Corrupt text rendering in Blender
- DRI2 gallium frontend is using bad format type
- regression - MR 23089 - Hellblade RT crashing
- Incorrect vlVaCreateBuffer/vlVaMapBuffer behavior for buffer type VAEncCodedBufferType in Gallium
- Issue with clang-format
- Follow-up from "Draft: intel: Disable color fast-clears for blorp_copy"
- nightly VA-API build: new timeout
- r600: retire the SB optimizer
- ci: do not download perfetto on-fly in build jobs
- Shared Memory Leak With Qt OpenGL Applications
- OpenGL, SIGSEGV when program pipeline objects has separated vertex shader progam and separated fragment shader progam with in/out
- vaDeriveImage returns VA_STATUS_ERROR_OPERATION_FAILED
- 975a8ecc881873744d851ab0ef45ad7698eaa0ef "frontends/va: use resources instead of views" cause radeonsi can't play video.
- zink: reduce pipeline hash size
- Rusticl,radeonsi: ac_rtld error(2): too much LDS
- aco, radv Rage 2 menu corruption - bisected
- radv, aco: World War Z character texture regression on 7900xtx
- android: De-stage drm_gralloc support from mesa3d
- Cyberpunk screen goes black at game launch on integrated Gfx
- lavapipe/llvmpipe: regressions since descriptor rewrite
- intel: State cache invalidation after BLORP binding table setup ought to be unnecessary on ICL.
- ci: HW job logs have spam at the end
- kernel crash seen on AMD Raven device
- crocus: regression crashing in doubles/ubo tests
- turnip: object management CTS crashes
- a618: multiple assertions with different kernel config on u_vector_add
- [anv] Death Stranding crashes
- Can no longer build Clover without llvmspirvlib
- [radeonsi][vaapi] segfault in vl_video_buffer_sampler_view_components() when using vaapisink receiving I420 format
- Baldurs Gate 3 (DX11) - Graphical corruption on RDNA3 (ACO regression)
- [AMDGPU] Compiling large Blender Eevee shader node trees is unusably slow
- Building llvmpipe with LP_USE_TEXTURE_CACHE set fails since 23.2.0-rc1: error C2039: dynamic_state is not member of lp_build_sampler_soa in lp_tex_sample.c
- r300: calculate some cycles estimate for shader-db
- intel: Deathloop and other DX12 games fail assert(validated) with invalid SEL instruction
- GTF-GL46.gtf21.GL.build.CorrectFull_vert regressed on intel platforms
- error message when encoding via VAAPI AMD
- gpu hangs on dg2 with mesh shading enabled on vkcts
- radeonsi: Deadlock when creating a new GL context in parallel with linking a shader on another GL context
- robustness2 raygen tests intermittently fail in Intel Mesa CI
- ci/ci_run_n_monitor.py: KeyError: 'clang-format'
- glthread: huge performance regression
- DirectX games do not launch on Intel HD Graphics 4000 (IVB GT2) [bisected]
- rusticl: fails to build for iris + radeonsi
Changes
-------
Adam Jackson (3):
- egl: Implement EGL_EXT_explicit_device
- mesa: Implement and advertise GL_MESA_sampler_objects
- docs: Mention 'meson devenv' in the pre-install test instructions
Aditya Swarup (6):
- isl: enable Tile64 for 3D images
- intel/isl: Unittest for linear to Ytile conversion
- intel/isl: Convert linear texture to Tile4 format
- intel/isl: Convert Tile4 texture to linear format
- intel/isl: Linear to Tile-4 conversion unittest
- Revert "iris: Disable tiled memcpy for Tile4"
Alba Mendez (1):
- meson: support installation tags
Alejandro Piñeiro (61):
- v3dv: re-enable sync_fd import/export on the simulator
- broadcom(cle,clif,common,simulator): add 7.1 version on the list of versions to build
- broadcom/cle: update the packet definitions for new generation v71
- broadcom/common: add some common v71 helpers
- broadcom/qpu: add comments on waddr not used on V3D 7.x
- broadcom/qpu: set V3D 7.x names for some waddr aliasing
- broadcom/compiler: rename small_imm to small_imm_b
- broadcom/compiler: add small_imm a/c/d on v3d_qpu_sig
- broadcom/qpu: add v71 signal map
- broadcom/qpu: define v3d_qpu_input, use on v3d_qpu_alu_instr
- broadcom/qpu: add raddr on v3d_qpu_input
- broadcom/qpu: defining shift/mask for raddr_c/d
- broadcom/commmon: add has_accumulators field on v3d_device_info
- broadcom/qpu: add qpu_writes_rf0_implicitly helper
- broadcom/qpu: add pack/unpack support for v71
- broadcom/compiler: phys index depends on hw version
- broadcom/compiler: don't favor/select accum registers for hw not supporting it
- broadcom/vir: implement is_no_op_mov for v71
- broadcom/compiler: update vir_to_qpu::set_src for v71
- broadcom/qpu_schedule: add process_raddr_deps
- broadcom/qpu: update disasm_raddr for v71
- broadcom/qpu: return false on qpu_writes_accumulatorXX helpers for v71
- broadcom/compiler: add support for varyings on nir to vir generation for v71
- broadcom/compiler: payload_w is loaded on rf3 for v71
- broadcom/qpu_schedule: update write deps for v71
- broadcom/compiler: update register classes to not include accumulators on v71
- broadcom/qpu: implement switch rules for fmin/fmax fadd/faddnf for v71
- broadcom/compiler: update one TMUWT restriction for v71
- broadcom/compiler: update ldunif/ldvary comment for v71
- broadcom/compiler: update payload registers handling when computing live intervals
- broadcom/qpu: new packing/conversion v71 instructions
- v3dv/meson: add v71 hw generation
- v3dv: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71
- v3dv/cmd_buffer: emit TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1 for v71
- v3dvx/cmd_buffer: emit CLEAR_RENDER_TARGETS for v71
- v3dv/cmd_buffer: emit CLIPPER_XY_SCALING for v71
- v3dv/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71
- v3dv/cmd_buffer: just don't fill up early-z fields for CFG_BITS for v71
- v3dv: default vertex attribute values are gen dependant
- v3dv/pipeline: default vertex attributes values are not needed for v71
- v3dv/pipeline: handle GL_SHADER_STATE_RECORD changed size on v71
- v3dv: no specific separate_segments flag for V3D 7.1
- v3dv: add support for TFU jobs in v71
- v3d: add v71 hw generation
- v3d: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71
- v3d: TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1
- v3d: emit CLEAR_RENDER_TARGETS for v71
- v3d: just don't fill up early-z fields for CFG_BITS for v71
- v3d: emit CLIPPER_XY_SCALING for v71
- v3d: no specific separate_segments flag for V3D 7.1
- v3d: default vertex attributes values are not needed for v71
- v3d/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71
- v3d: handle new texture state transfer functions in v71
- v3d: handle new TEXTURE_SHADER_STATE v71 YCbCr fields
- v3d: setup render pass color clears for any format bpp in v71
- v3d: GFX-1461 does not affect V3D 7.x
- v3d: don't convert floating point border colors in v71
- v3d: handle Z clipping in v71
- v3d: add support for TFU blit in v71
- v3dv: implement depthBounds support for v71
- doc/features: update after last v3d changes
Alex Denes (1):
- virgl: link VA driver with build-id
Alexander Orzechowski (1):
- radeonsi: Set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET for auxiliary contexts
Alyssa Rosenzweig (431):
- zink: Switch to register intrinsics
- gallium/trace: Collect enums from multiple files
- gallium,util: Move blend enums to util/
- gallium,util: Move util_blend_dst_alpha_to_one
- util/blend: Add helpers for normalizing inverts
- vulkan: Add helpers for blend enum translation
- lvp: Use common blend/logicop translation
- nir/lower_blend: Use util enums
- panfrost: Convert to PIPE_BLEND enums internally
- gallium: Remove pipe->compiler BLEND enum translation
- compiler: Remove blend enums duplicating util
- nir/legacy: Fix fneg(load_reg) case
- nir/legacy: Fix handling of fsat(fabs)
- ntt: Switch to new-style registers and modifiers
- ir3: Convert to register intrinsics
- nir: Add fence_{pbe,mem}_to_tex(_pixel)_agx intrinsics
- nir: Devendor load_sample_mask
- nir: Promote tess_coord_r600 to tess_coord_xy
- nir: Add nir_lower_tess_coord_z pass
- r600: Use nir_lower_tess_coord_xy
- ir3: Use nir_lower_tess_coord_z
- nir: Initialize workgroup_size in builder_init_simple_shader
- v3dv: Rely on nir_builder setting workgroup size
- radv: Rely on workgroup_size initialization
- panfrost: Fix transform feedback on v9
- r600/sfn: Remove nir_register unit tests
- panfrost: Lower vertex_id for XFB
- panfrost: Fix transform feedback on v9 harder
- asahi: Augment fake drm_asahi_params_global
- asahi: Use nir_builder_at more
- asahi: Remove unused #define
- asahi: Refactor PBE upload routine
- asahi: Extract shader_initialize helper
- asahi: Serialize NIR in memory
- asahi: Identify background/EOT counts
- asahi,agx: Set coherency bit for clustered targets
- ail: Page-align layers for writable images
- asahi: Mark writeable images as such
- asahi: Reallocate to set the writeable image flag
- asahi: Add agx_batch_track_image helper
- asahi: Add texture/image indexing lowering pass
- asahi: Upload at most the max texture state registers
- asahi: Upload image descriptors
- asahi: Make clear the non-sRGBness of EOT images
- asahi: Don't restrict sampler views
- asahi: Forbid 2D Linear with images
- agx: Add try_coalesce_with helper
- agx: Try to allocate phis compatibly with sources
- agx: Try to allocate phi sources with phis
- agx: Try to allocate phi sources with loop phis
- agx: Vectorize 16-bit parallel copies
- agx: Reduce un/packs with mem access lowering
- agx: Fix bogus assert
- asahi: Augment PBE descriptor for software access
- asahi: Extend PBE packing for image support
- asahi: Use nir_lower_robust_access
- agx: Legalize image LODs to be 16-bit
- agx: Lower image size to txs
- agx: Generalize texture/PBE packing
- agx: Add image write instruction
- agx: Model texture bindless base
- agx: Handle bindless properly for txs lowering
- agx: Pack bindless textures
- agx: Translate texture bindless handles
- agx: Translate image_store from NIR
- agx: Handle frag side effects without render targets
- agx: Wait for outstanding stores before barriers
- agx: Implement image barriers
- agx: Handle early_fragment_tests
- agx: Add interleave opcode
- agx: Extract coords_for_buffer_texture helper
- agx: Extract texture_descriptor_ptr_for_* helpers
- agx: Lower image atomics
- agx: Lower buffer images
- asahi,agx: Fix txf sampler
- agx: Add image_load opcode
- agx: Extract texture write mask handling
- agx: Implement image_load
- agx: Emit global memory barriers for images
- agx: Don't emit silly barriers
- agx: Implement fence_*_to_tex_agx intrinsics
- agx: Add simple image fencing pass
- agx: Require tag writes with side effects
- agx: Plumb in coverage mask
- asahi: Extract sampler_view_for_surface
- asahi: Introduce concept of spilled render targets
- asahi: Add agx_tilebuffer_spills query
- asahi: Do not support masking with spilled RTs
- asahi: Ignore spilled render targets in EOT shaders
- asahi: Ignore spilled render targets with partial renders
- asahi: Extract some tilebuffer lowering code
- asahi: Lower tilebuffer access for spilled RTs
- asahi: Lower multisample image stores
- asahi: Permit meta shaders to use preambles
- asahi: Ignore spilled render targets for background load
- asahi: Offset clear colour uniform by 4
- asahi: Execute preambles for background programs
- asahi: Advertise Z16_UNORM
- ir2: Switch to nir_legacy
- intel/fs: Don't read reg.base_offset
- panfrost: Remove unused helpers
- nir: Remove nir_lower_locals_to_regs
- nir: Rename lower_locals_to_reg_intrinsics back
- nir: Remove register arrays
- asahi: Don't depend on glibc to decode
- pan/bi: Remove leftover include
- nir/trivialize: Handle more RaW hazards
- panfrost: Disable blending for no-op logic ops
- nir/lower_blend: Fix 32-bit logicops
- nir/lower_blend: Optimize out PIPE_LOGICOP_NOOP
- clang-format: Ignore original panfrost commit
- nir/schedule: Assume no old-style registers
- gallium/u_simple_shaders: Optimize out ffloors
- gallium/u_transfer_helper: Remove dead forward decl
- nir/loop_analyze: Drop unused inverse_comparison
- nir/passthrough_gs: Drop unused array_size_for_prim
- panfrost: Add missing static inline annotation
- pan/decode: Drop unused debug function
- pan/mdg: Add missing static inline annotation
- panfrost: Drop unused decode_position for samples
- panfrost: Only define pan_blitter_get_blend_shaders for midgard
- panfrost: Add missing inline
- panfrost: Gate overdraw_alpha on Bifrost+
- nir: Rename scoped_barrier -> barrier
- nir: Remove lower_to_source_mods
- nir: Remove lower_vec_to_movs
- nir: Remove reg_intrinsics parameter to convert_from_ssa
- nir: Remove register load/store builders
- r600/sfn: Stop referencing legacy functionality
- r600/sfn: Ignore instruction write masks
- nouveau/codegen: Drop writemask check
- vc4,broadcom/compiler: Drop write_mask handling
- zink: Collapse is_ssa check
- nir: Add {...} before case
- nir/from_ssa: Drop legacy reg support
- nir/schedule: Drop nir_schedule_dest_pressure
- nir: Drop NIR reg create/destroy
- nir: Remove nir_index_local_regs and callers
- nir/schedule: Drop more nir_register handling
- nir: Remove nir_foreach_register
- nir: remove nir_{src,dest}_for_reg
- ntt: Drop nir_register reference
- nir/print: Assume SSA
- nir/clone: Assume SSA
- nir/serialize: Drop legacy NIR
- nir/validate: Assume SSA
- nir: Remove impl->{registers,reg_alloc}
- nir: Remove nir_alu_dest::saturate
- treewide: Drop is_ssa asserts
- nir: Collapse some SSA checks
- treewide: Remove more is_ssa asserts
- nir: Remove reg-only dest manipulation
- nir: Remove stale todo
- nir/print: Drop legacy NIR
- nir: Drop nir_alu_src::{negate,abs}
- treewide: sed out more is_ssa
- pan/mdg: Assume SSA
- treewide: Drop some is_ssa if's
- nir: Drop trivial reg handling
- aco: Remove is_ssa check
- intel: Collapse is_ssa checks
- llvmpipe: Assume SSA
- ir3: Collapse is_ssa checks
- lima: Collapse is_ssa checks
- radeonsi: Collapse SSA check
- nir/gather_ssa_types: Collapse SSA checks
- nir/worklist: Assume SSA
- nir/range_analysis: Assume SSA
- treewide: Collapse more SSA checks
- nir/instr_set: Assume SSA
- nir: Collapse more SSA checks
- nir: Remove def_is_register
- nir: Do not init dests
- nir: Initialize source as a NULL SSA def
- nir: Collapse more SSA checks
- nir: Remove nir_{src,dest}::is_ssa
- nir: Drop nir_register
- nir/from_ssa: Remove pointless union
- ir3: Drop write_mask handling
- rogue: Stop reading write masks
- etnaviv: Don't use alu->dest.write_mask
- etnaviv: What if we just didn't have a compiler?
- intel/vec4: Don't use legacy write mask
- ntt: Evaluate write_mask check
- nir: Remove nir_alu_dest::write_mask
- nir: Remove nir_foreach_def
- lima: Clean up after deleting asserts
- nir: Remove no-op remove_def_cb
- nir: Drop no-op all_srcs_are_ssa
- nir: Simplify alu_instr_is_copy
- nir: Add load_coefficients_agx intrinsic
- agx: Implement nir_intrinsic_load_coefficients_agx
- agx: Allow more varying slots
- agx: Set lower_fisnormal
- agx: Forcibly vectorize pointcoord coeffs
- agx: Add interpolateAtOffset lowering pass
- agx: Lower flat shading in NIR
- asahi: Stub num_dies
- asahi: Move a bunch of helpers to common
- agx: Lower 8-bit ALU
- agx: Handle 8-bit vecs
- asahi,agx: Respect no16 even for I/O
- agx: Don't lower load_local_invocation_index
- agx/dce: Use the helper
- agx: Fix atomics with no destination
- agx: Fix shader info with sample mask writes
- agx: Do not move bindless handles
- agx: Put else instructions in the right block
- agx: Use unconditional else instruction
- agx: Optimize out pointless else instructions
- agx: Fix length bit confusion
- agx: Require an immediate for \`nest`
- agx: Use compressed fadd/fmul encodings
- agx: Optimize swaps of 2x16 channels
- agx: Optimize logical_end removal
- agx: Fix AGX_MESA_DEBUG=demand
- agx: Maintain ctx->max_reg while assigning regs
- agx: Allow 64-bit memory regs
- agx: Fix accounting for phis
- agx: Set phi sources in predecessors
- agx: Stop setting registers after the shader
- agx: Use agx_replace_src
- agx: Assert invariant stated in the comment
- agx: Don't use ssa_to_reg across blocks
- agx: Don't reuse ssa_to_reg across blocks
- agx: Remove unused allocation
- agx: Stop setting forwarding bit
- agx: Handle blocks with no predecessors
- agx: Lower f2u8/f2i8
- agx: Handle conversions to 8-bit
- agx: Fix uadd_sat packing
- agx: Fix 64-bit immediate moves
- agx: Lower f2f16_rtz
- agx: Handle f2f16_rtne like f2f16
- agx: Handle <32-bit local memory access
- agx: Do not allow creating vec8
- asahi: Legalize compression before blitting
- nir: Drop "SSA" from NIR language
- agx: Stop passing nir_dest around
- agx: Remove agx_nir_ssa_index
- pan/mdg: Don't reference nir_dest
- pan/bi: Don't reference nir_dest
- asahi: Do not reference nir_dest
- panfrost: Do not reference nir_dest
- zink: Do not reference nir_dest
- ir3: Do not reference nir_dest
- dxil: Do not reference nir_dest
- nir: Drop nir_dest_init
- panfrost: Pack stride at CSO create time on v9
- lvp,nir/lower_input_attachments: Use nir_trim_vector
- broadcom/compiler: Use nir_trim_vector explicitly
- nir: Assert that nir_ssa_for_src components matches
- nir: Add nir_shader_intrinsics_pass
- nir: Lower fquantize2f16
- agx: Lower fquantize2f16
- nir/lower_helper_writes: Consider bindless images
- nir/passthrough_gs: Correctly set vertices_in
- nir/passthrough_gs: Fix array size
- nir/print: Print access qualifiers for intrinsics
- nir/lower_gs_intrinsics: Remove end primitive for points
- panfrost/ci: Disable T720
- nir: Add load_sysval_agx intrinsic
- agx: Fix extraneous bits with b2b32
- agx: Use more barriers
- asahi: Copy CSO stride
- agx: Assert vertex_id, instance_id are VS-only
- asahi: Keep drawoverhead from OOMing itself
- agx: Don't blow up when lowering textures twice
- agx/lower_vbo: Handle nonzero component
- agx: Allow loop headers without later preds
- agx: Handle b2i8
- agx: Convert 8-bit comparisons
- agx: Implement imul_high
- asahi: Advertise OpenGL ES 3.1!
- asahi/decode: Turn assert into error
- asahi: Report local_size from compiler
- asahi: Use local_size from compiler directly
- asahi: Pass layer stride in pixels, not elements
- agx: Clear sample count after lowering MSAA
- agx: Clear image_array after lowering
- asahi: Preserve atomic ops when rewriting image to bindless
- agx: Use 16-bit reg for pixel_coord
- asahi: Generalize query logic
- asahi: Simplify occlusion query batch tracking
- asahi: Refactor agx_get_query_result
- asahi: Only touch batch->occlusion_queries for occlusion
- asahi: Sync when beginning a query
- asahi: Add non-occlusion query tracking
- asahi: Add get_query_address helper
- agx/fence_images: Use intrinsics_pass
- agx: Do not fence write-only images
- asahi: Add missing LOD source for agx_meta's txfs
- agx: Do some texture lowering early
- agx: Add helper returning if a descriptor crawl is needed
- nir,asahi: Remove texture_base_agx
- asahi: Move UBO lowering into GL driver
- asahi: Add sysval tables for each shader stage
- asahi: Split out per-stage sysvals
- asahi: Collapse grid_info
- asahi: Extract agx_upload_textures
- asahi: Upload a single draw_uniforms per draw
- asahi: Add real per-stage dirty flags
- asahi: Extract sampler upload
- asahi: Put unuploaded uniforms on the batch
- asahi: Decouple sysval lowering from uniform assignment
- asahi: Use finer dirty tracking for blend constant
- asahi: Use proper dirty tracking for VBOs
- asahi: Dirty track VBOs + blend const separately
- asahi: Dirty the shader stage when the shader changes
- asahi: Fix shader stage dirtying
- treewide: Use nir_shader_intrinsic_pass sometimes
- treewide: Also handle struct nir_builder form
- nir/lower_shader_calls: Fix warning with clang
- nir: Add nir_before/after_impl cursors
- treewide: Use nir_before/after_impl in easy cases
- treewide: Use nir_before/after_impl for more elaborate cases
- radv: Use before/after_cf_list for entrypoints
- ci: Disable known broken Bifrost Vulkan job
- ci: Disable WHL jobs
- nir/opt_if: Simplify if's with general conditions
- asahi: Fixes for clang-warnings
- agx: Fix jmp_exec_none encoding
- agx/validate: Print to stderr
- agx: Annotate opcodes with a scheduling class
- agx: Add schedule-specialized get_sr variants
- agx: Include schedule class in the opcode info
- agx: Schedule for register pressure
- agx: Lower pack_32_4x8_split
- asahi: Force translucency for ignored render targets
- agx: Remove logical_end instructions
- agx: Lower pseudo-ops later
- agx: Expand nest
- agx: Lower nest later
- agx: Split nest instruction into begin_cf + break
- agx: Add break_if_*cmp instructions
- agx: Add agx_first/last_instr helpers
- agx: Use agx_first_instr
- agx: Detect conditional breaks
- agx: Omit push_exec at top level
- agx: Omit while_icmp without continue
- agx: Add helper to determine if a NIR loop uses continue
- agx: Only use nest by 1 for loops w/o continue
- agx: Add pseudo-instructions for icmp/fcmp
- agx: Generate unfused comparison pseudo ops
- agx: Fuse conditions into if's
- agx: Fuse compares into selects
- agx: Add unit test for if_cmp fusing
- agx: Add unit test for cmp+sel fusing
- asahi: Translate cube array dimension
- ail: Force page-alignment for layered attachments
- agx: Handle cube arrays when clamping arrays
- agx: Lower coordinates for cube map array images
- agx: Run opt_idiv_const after lowering texture
- asahi: Forbid linear 1D Array images
- asahi: Handle linear 1D Arrays
- asahi: Conditionally expose cube arrays
- gallium,mesa/st: Add PIPE_CONTEXT_NO_LOD_BIAS flag
- asahi: Skip LOD bias lowering for GLES
- nir: Add nir_function_instructions_pass helper
- nir: Add NIR_OP_IS_DERIVATIVE property
- nir: Hoist nir_op_is_derivative
- nir/opt_preamble: Use nir_op_is_derivative
- nir/opt_gcm: Use nir_op_is_derivative more
- nir/gather_info: Use nir_op_is_derivative
- nir/opt_sink: Sink load_constant_agx
- nir/opt_sink: Sink load_local_pixel_agx
- nir/opt_sink: Sink frag coord instructions
- nir/opt_sink: Do not move derivatives
- nir/opt_sink: Move ALU with constant sources
- nir/opt_sink: Also consider load_preamble as const
- agx: Enable sinking ALU
- treewide: Drop nir_ssa_for_src users
- treewide: Remove remaining nir_ssa_for_src
- nir: Remove nir_ssa_for_src
- asahi: Clamp index buffer extent to what's read
- agx: Align the reg file for 256-bit vectors
- agx: Hoist sample_mask/zs_emit
- agx: Set PIPE_SHADER_CAP_CONT_SUPPORTED
- agx: Augment if/else/while_cmp with a target
- agx: Add jumps to block ends
- agx: Add agx_prev_block helper
- agx: Insert jmp_exec_none instructions
- nir: Add layer_id_written_agx sysval
- nir: Support arrays in block_image_store_agx
- agx/nir_lower_texture: Allow disabling layer clamping
- agx: Pack block image store dim correctly
- agx: Handle layered block image stores
- agx: Add pass to lower layer ID writes
- asahi: Add helper to get layer id in internal program
- asahi,agx: Select layered rendering outputs
- agx: Support packed layered rendering writes
- agx/tilebuffer: Support layered layouts
- agx/lower_tilebuffer: Support spilled layered RTs
- asahi: Use layered layouts
- asahi: Expose VS_LAYER_VIEWPORT behind a flag
- asahi: Account for layering for attachment views
- asahi: Assume LAYER is flat-shaded
- asahi: Add pass to predicate layer ID reads
- asahi: Predicate layer ID reads
- asahi: Write to cubes/etc attachments as 2D array
- asahi: Use a 2D Array texture for array render targets
- asahi: Generate layered EOT programs
- asahi: Handle layered background programs
- lima/pp: Do not use union undefined behaviour
- nir: Add trivial nir_src_* getters
- nir: Use set_parent_instr internally
- nir: Use getters for nir_src::parent_*
- nir: Assert the nir_src union is used safely
- nir: Use a tagged pointer for nir_src parents
- nir: Add ACCESS_CAN_SPECULATE
- ir3: Set CAN_SPECULATE before opt_preamble
- ir3: Model cost of phi nodes for opt_preamble
- nir/opt_preamble: Walk cf_list manually
- nir/opt_preamble: Preserve IR when replacing phis
- nir/opt_preamble: Unify foreach_use logic
- nir/opt_preamble: Move phis for movable if's
- nir/opt_preamble: Respect ACCESS_CAN_SPECULATE
- freedreno/ci: Minetest
- r600/sfn: Handle load_global_constant
- nir/opt_phi_precision: Work with libraries
- nir/legalize_16bit_sampler_srcs: Use instr_pass
- nir/print: Handle KERNEL
- nir/lower_io: Use load_global_constant for OpenCL
- nir/opt_algebraic: Reduce int64
- nir/opt_algebraic: Optimize LLVM booleans
- nir/trivialize_registers: Handle obscure load hazard
- hasvk: Support builiding on non-Intel
- crocus: Support building on non-Intel
- meson: Add vulkan-drivers=all option
- meson: Add gallium-drivers=all option
- agx: Fix fragment side effects scheduling
Amber (7):
- ir3: make wave_granularity configurable
- turnip: Add support for devices not supporting double thread size.
- turnip: make sampler_minmax support configurable.
- freedreno, turnip: set correct reg_size_vec4 for a6xx_gen1_low
- ir3: handle non-uniform case for atomic image/ssbo intrinsics
- freedreno: Add support for devices not supporting double thread size.
- turnip: Add debug option to allow non-conforming features.
Andrew Randrianasulu (1):
- nv50/ir: Remove few nvc0 specific defines from nv50-specific header.
Antonio Gomes (9):
- rusticl/kernel: Removing unnecessary clone in kernel launch
- rusticl/kernel: Add CsoWrapper
- rusticl/compiler: Add NirPrintfInfo
- rusticl: Move Cso to Program
- rusticl/compiler: Remove unnecessary functions
- rusticl: Move NirKernelBuild to ProgramDevBuild
- rusticl/program: New helper functions to NirKernelBuild
- rusticl/core: Delete KernelDevState and KernelDevStateInner
- rusticl/core: Make convert_spirv_to_nir output pair (KernelInfo, NirShader)
Asahi Lina (29):
- docs/tgsi: Specify that depth texture fetches are replicated
- asahi: Add synctvb debug flag
- asahi: Add smalltile debug option
- asahi: Add nomsaa debug flag
- asahi: decode: Add a params argument to pass through
- asahi: Add extra CDM header block for G14X
- asahi: wrap: Handle freeing shmems
- asahi: decode: Refactor to always copy GPU mem to local buffers
- asahi: decode: Add a function to construct decode_params from a chip_id
- asahi: Add a shared library interface for decode
- asahi: Add a noshadow debug flag
- asahi: Do not overallocate BOs by more than 2x
- asahi: Fix race in BO stats accounting
- asahi: Always use resource size, not BO size
- asahi: Print info about shadowed resources
- asahi: Impose limits on resource shadowing
- asahi: Force linear for SHARED buffers with no/implicit modifier
- asahi: Enable explicit coherency for G14D (multi-die)
- asahi: Handle non-written RTs correctly
- asahi: Fix incorrect BO bitmap reallocations
- asahi: Allocate staging resources as staging
- asahi: cmdbuf: Identify call/ret bits
- asahi: decode: Implement VDM call/ret
- asahi: decode: Do not assert on buffer overruns
- asahi: Fix VDM pipeline field width
- asahi: Add scaffolding for supporting driconf options
- asahi: Add and support the no_fp16 driconf flag
- driconf: Disable fp16 for browsers
- asahi: Allow no16 flag for disk cache
Bas Nieuwenhuizen (16):
- aco: fix nir_op_vec8/16 with 16-bit elements.
- aco: Fix some constant patterns in 16-bit vec4 construction with s_pack.
- nir: Fix 16-component nir_replicate.
- radv: Expose VK_EXT_external_memory_acquire_unmodified.
- util/perf: Add gpuvis integration.
- egl,venus,vulkan,turnip,freedreno: Update CPU trace init to init more than perfetto.
- vulkan: Add CPU tracing for vkWaitForFences.
- docs: Add documentation for gpuvis.
- vulkan: Add trace points for more Vulkan waiting functions.
- radv: Use a double jump to limit nops in DGC for dynamic sequence count.
- nir: Add AMD cooperative matrix intrinsics.
- aco: Add WMMA instructions.
- aco: Make RA understand WMMA instructions.
- radv: Don't transparently use wave32 with cooperative matrices.
- radv: Add cooperative matrix lowering.
- radv: Expose VK_KHR_cooperative_matrix.
Benjamin Cheng (10):
- radv/video: use app provided hevc scaling list order
- radv/video: copy from correct H264 scaling lists
- anv/video: copy from correct H264 scaling lists
- vulkan/video: add helper to derive H264 scaling lists
- radv/video: use vk_video_derive_h264_scaling_list
- anv/video: use vk_video_derive_h264_scaling_list
- util/vl: extract gallium vl scanning data to shared code
- radv/video: send h264 scaling list in raster order
- anv/video: send h264 scaling list in raster order
- radv/video: find SPS with pps_seq_parameter_set_id
Benjamin Lee (1):
- nvk: Fix segfault when opening DRI device file returns error
Biswapriyo Nath (1):
- radv/video: Match function definitions to declarations
Boris Brezillon (1):
- panfrost: Flag the right shader when updating images
Boyuan Zhang (3):
- virgl: Add vp9 picture desc
- virgl: Implement vp9 hardware decode
- radeonsi/vcn: disable tmz ctx buffer for VCN_2_2_0
Caio Oliveira (134):
- nir: Use instructions_pass() for nir_fixup_deref_modes()
- meson: Ensure that LLVMSPIRVLib is not required for Clover
- nir: Let nir_fixup_deref_modes() fix deref_casts when possible
- nir: Add nir_opt_reuse_constants()
- radv: Use nir_opt_reuse_constants()
- compiler/types: Use ralloc for the key in array_types
- compiler/types: Use smaller keys for array_types table
- compiler/types: Extract get_explicit_matrix_instance() function
- compiler/types: Use smaller keys for explicit_matrix_types table
- anv/tests: Refactor state_pool_test_helper to not use macros for parametrization
- anv/tests: Link a single anv_tests binary using gtest
- anv/tests: Propagate failures to gtest
- hasvk/tests: Refactor state_pool_test_helper to not use macros for parametrization
- hasvk/tests: Link a single hasvk_tests binary using gtest
- hasvk/tests: Propagate failures to gtest
- util: Add convenience macros for linear allocator
- compiler/types: Use right hash for function types
- compiler/types: Don't duplicate empty string
- compiler/types: Constify a couple of pointers in glsl_type
- compiler/types: Remove unused GLSL_TYPE_FUNCTION and related functions
- compiler/types: Move GLSL specific builtin structs into glsl/
- glsl: Add missing glsl_types initialization to test_optpass
- glsl: Don't create struct type builtins
- compiler/types: Add extra level of macro to builtin_macros
- compiler/types: Use designated initializer syntax to specify builtins
- compiler/types: Move local cache details to implementation file
- compiler/types: Add a mem_ctx for the glsl_type_cache
- compiler/types: Use type cache mem_ctx for hash tables
- compiler/types: Don't store a mem_ctx per type
- compiler/types: Simplify clearing the glsl_type_cache
- compiler/types: Move static asserts about glsl_type to a central place
- compiler/types: Store builtin types directly as data
- compiler/types: Use a linear (arena) allocator for glsl_types
- compiler/types: Make struct glsl_type visible to C code
- compiler/types: Add workaround to use builtin_type_macros.h in C
- compiler/types: Move builtin type initialization to C
- glsl: Annotate _mesa_glsl_error() with PRINTFLIKE
- compiler/types: Fix array name dimension flipping for unsized arrays
- compiler/types: Use Python to generate code for builtin types
- compiler/types: Use glsl_get_type_name() to access the type name
- compiler/types: Change glsl_type::name to be an uintptr_t
- compiler/types: Use a string table for builtin type names
- intel/compiler/xe2: Account for reg_unit() in TCS intrinsics
- intel/compiler/xe2: Account for reg_unit() in TES intrinsics
- intel/fs/xe2+: Update BS payload setup for Xe2 reg size.
- intel/fs/xe2+: Update TASK/MESH payload setup for Xe2 reg size.
- compiler: Use a meson dependency for libcompiler
- meson: Remove unnecessary inc_compiler mentions
- rusticl: Ensure NIR generated headers will be available
- clover: Hide SPIR-V related code behind HAVE_CLOVER_SPIRV
- clover: Only compile/depend libclspirv and libclnir when using SPIR-V support
- compiler: Only enable mesaclc helper if we have OpenCL SPIR-V support
- intel/compiler: Don't allocate memory for SIMD select error handling
- microsoft/compiler: Fix printf formatting string issues
- util: Add more PRINTFLIKE and MALLOCLIKE annotations
- util: Remove ralloc_parent from linear_header
- util: Use linear parent to (r)allocated extra nodes
- util: Remove size from linear_parent creation
- util: Make DECLARE_LINEAR_ALLOC_* macros assume no destructors
- util: Use an opaque type for linear context
- util: Remove usages of linear_realloc()
- util: Remove linear_realloc()
- util: Remove size information from child allocations
- util: Remove per-buffer header in linear alloc for release mode
- util: Add a few basic tests for linear_alloc
- util: Fix bookkeeping of linear node sizes
- intel/compiler: Don't store stage name and abbrev
- intel/compiler/xe2: URB fence uses LSC now
- intel/compiler/xe2: Fix URB writes in TCS
- intel/compiler/xe2: Update TCS ICP handle code to support SIMD16
- compiler/types: Add support for Cooperative Matrix types
- nir: Add new intrinsics for Cooperative Matrix
- nir: Handle cooperative matrix in various passes
- spirv: Expose some memory related functions in vtn_private.h
- spirv: Let vtn_ssa_value hold references to variables
- spirv: Implement SPV_KHR_cooperative_matrix
- compiler/types: Remove private related declarations
- compiler/types: Remove use of new/delete
- compiler/types: Remove use of references
- compiler/types: Remove use of auto
- compiler/types: Use C compatible cast syntax
- compiler/types: Spell struct and enum in type names
- compiler/types: Add void parameter to ensure these are valid C prototypes
- intel/fs: Tweak default case of fs_inst::size_read()
- compiler/types: Move the C++ inline functions in glsl_type out of the struct body
- compiler/types: Move C declarations into glsl_types.h
- compiler/types: Flip wrapping of base_type checks
- compiler/types: Flip wrapping of various type identification checks
- compiler/types: Flip wrapping of convenience accessors for vector types
- compiler/types: Flip wrapping of basic "get type" functions
- rusticl: Add Rust bindings for inline glsl_types functions
- util: Add size to ralloc_header in debug mode
- util: Add a canary to identify gc_ctx in debug mode
- util: Add function print information about a ralloc tree
- util: Avoid waste space when linear alloc'ing large sizes
- spirv: Expose stage enum conversion in vtn_private.h
- spirv: Change spirv2nir to use the shorter shader name abbreviations
- spirv: List entry-points in spirv2nir when unsure what to use
- spirv: Let spirv2nir find out the shader to use
- intel/compiler: Don't emit calls to validate() in release build
- compiler/types: Flip wrapping of "type contains?" predicate functions
- compiler/types: Flip wrapping of array related functions
- compiler/types: Flip wrapping of cmat related functions
- compiler/types: Flip wrapping of CL related functions
- compiler/types: Flip wrapping of size related functions
- compiler/types: Flip wrapping of struct related functions
- compiler/types: Flip wrapping of interface related functions
- compiler/types: Flip wrapping of layout related functions
- compiler/types: Flip wrapping of record_compare
- compiler/types: Flip wrapping of get_instance()
- compiler/types: Flip wrapping of texture/sampler/image get instance functions
- compiler/types: Flip wrapping of various get instance functions
- compiler/types: Flip wrapping of get row/column type helpers
- compiler/types: Flip wrapping of remaining non-trivial type getters
- compiler/types: Flip wrapping of remaining small data getters
- compiler/types: Flip wrapping of numeric type conversion functions
- compiler/types: Move remaining code from nir_types to glsl_types
- rusticl: Add bindings for glsl_vector_type()
- compiler/types: Add more glsl_contains_*() functions and use them in C++
- compiler/types: Add glsl_get_mul_type() and use it in C++
- compiler/types: Add glsl_type_compare_no_precision() and use it in C++
- compiler/types: Add glsl_type_uniform_locations() and use it in C++
- compiler/types: Add glsl_get_std430_array_stride() and use it in C++
- compiler/types: Add glsl_get_explicit_*() functions and use them in C++
- compiler/types: Implement glsl_type::field_type() in terms of existing functions
- compiler/types: Add glsl_simple_explicit_type() and simplify glsl_simple_type()
- compiler/types: Add remaining type extraction functions and use them in C++
- compiler/types: Use C instead of C++ constants for builtin types
- compiler/types: Remove usages of C++ members in glsl_types.cpp
- compiler/types: Annotate extern "C" only once in glsl_types.cpp
- compiler/types: Rename glsl_types.cpp to glsl_types.c
- compiler/types: Remove warnings about potential fallthrough
- compiler/types: Move comments and reorganize declarations
- anv: Fix leak when compiling internal kernels
Carsten Haitzler (2):
- kmsro: Add hdlcd DPU
- panfrost: Add GPU variant of G57 to the set of known ids
Charles Giessen (1):
- panvk: Use 1.0 in ICD Manifest json
Charmaine Lee (8):
- svga: set clear_texture to NULL for vgpu9
- svga: fix stride used in vertex declaration
- svga: fix persistent mapped surface update to constant buffer
- svga: restrict use of rawbuf for constant buffer access to GL43 device
- svga: fix immediates used in rawbuf for constant buffer
- svga: use srv raw buffer for accessing readonly shader buffer
- svga: sync resource content from backing resource before image upload
- svga: ignore sampler view resource if not used by shaders
Chia-I Wu (38):
- radv: fix separate depth/stencil layouts in fb state
- radv: fix separate depth/stencil layouts in resolve meta
- radv: refactor depth clear in clear meta
- radv: fix separate depth/stencil layouts in clear meta
- amd/ci: update radv-stoney-aco-fails.txt for depth/stencil clear
- radv: disable tc-compat htile for layered images on gfx8
- amd/ci: update radv-stoney-aco-fails.txt for depth/stencil resolve
- winsys/amdgpu: fix a race between import and destroy
- ac/surface: limit RADEON_SURF_NO_TEXTURE to color surfaces
- winsys/radeon: fix a race between bo import and destroy
- vulkan/runtime: add a helper for ETC2 emulation
- radv: use vk_tecompress_etc2 from the runtime
- vulkan/runtime: fix image type check for ETC2 emulation
- vulkan/runtime: fix a harmless typo for ETC2 emulation
- vulkan/runtime, radv: remove 1D support from ETC2 emulation
- radv: add radv_is_format_emulated
- radv: simplify view format override for emulated formats
- radv: hard code format features for emulated formats
- mesa: make astc_decoder.glsl vk-compatible
- radv, drirc: rename radv_require_{etc2,astc}
- anv: remove unused field from anv_image_view
- anv: add anv_image_view_{init,finish}
- anv: support image views with surface state stream
- anv: add anv_push_descriptor_set_{init,finish}
- anv: support alternative push descriptor sets
- anv: add anv_descriptor_set_write
- anv: add anv_cmd_buffer_{save,restore}_state
- anv: add anv_is_format_emulated
- anv: add a hidden plane for emulated formats
- anv: decompress on upload for emulated formats
- anv: fix up image views for emulated formats
- anv: fix up blit src for emulated formats
- anv: advertise emulated formats
- anv: add support for vk_require_astc driconf
- util: improve BITFIELD_MASK and BITFIELD64_MASK on clang
- anv: prep for gen9 astc workaround
- anv: add gen9 astc workaround
- radv: fix image view extent override for astc
Chris Spencer (9):
- radv: initialize result when pipeline cache creation fails
- anv/android: Fix importing hardware buffers with planar formats
- anv/android: Add support for AHARDWAREBUFFER_FORMAT_YV12
- anv: Advertise Vulkan 1.3 on Android 13
- anv: Don't reject Android image format if external props not supplied
- android: Add explanatory comment to u_gralloc
- anv/android: Enable shared presentable image support
- anv/video: use correct enum value for max level IDC
- radv/video: use correct enum value for max level IDC
Christian Gmeiner (41):
- nir/print: print instr pass_flags
- etnaviv: move nir texture lowerings into one pass
- nir: add enta specific intrinsic used for txs lowering
- etnaviv: nir: support intrinsic used for txs lowering
- etnaviv: nir: lower nir_texop_txs
- ci/etnaviv: update ci expectations
- etnaviv: make use of BITFIELD_BIT(..) macro
- etnaviv: name the enum used for pass_flags
- etnaviv: add is_dead_instruction(..) helper
- etnaviv: extend etna_pass_flags with source modifiers
- etnaviv: do not clear all pass_flags before RA
- etnaviv: nir: look at parent instr in lower_alu(..)
- etnaviv: nir: add etna_nir_lower_to_source_mods(..)
- etnaviv: nir: switch to etna_nir_lower_to_source_mods(..)
- etnaviv: nir: convert to new-style NIR registers
- freedreno/regs: remove double assignment of self.current_domain
- freedreno/regs: remove not used variable
- freedreno/regs: remove dead code
- freedreno/regs: python does not need ';'
- etnaviv: switch to log2f(..)
- etnaviv: switch to U_FIXED(..) macro
- etnaviv: switch to S_FIXED(..) macro
- etnaviv: fix null pointer dereference
- etnaviv: switch to float_to_ubyte(..)
- ci/etnaviv: update ci expectation
- etnaviv: unbreak cmdline compiler
- agx/lower_address: Use intrinsics_pass
- agx/lower_address: Remove not used has_offset
- isaspec: python does not need ';'
- docs: Move isaspec out of drivers/freedreno
- isaspec: Add support for templates
- isaspec: encode: Correct used regex
- isaspec: Add method to get all instrustions
- isaspec: Add support for custom meta information
- isaspec: Add BitSetEnumValue object
- spirv: Don't use libclc for rotate
- docs: update etnaviv extensions
- etnaviv: drm: Be able to mark end of context init
- etnaviv: Skip 'empty' cmd streams
- ci: Bump PyYAML to 6.0.1
- etnaviv: Don't leak disk_cache
Collabora's Gfx CI Team (2):
- Uprev Piglit to ed58dfbd12be34fa3dab97a7a2987b890e0637f1
- Uprev Piglit to f7db20b03de6896d013826c0a731bc4417c1a5a0
Cong Liu (2):
- r300: Fix out-of-bounds access in ntr_emit_store_output()
- virgl:Fix ITEM_CPY macro pointer copy bug
Connor Abbott (83):
- afuc: Rework and significantly expand README.rst
- tu: Fix vk2tu_*_stage flag type
- tu: Fix and simplify execution dependency handling
- tu, freedreno/a6xx: Remove has_ccu_flush_bug
- ir3: Handle GS stream "mixing" with non-point output primitives
- tu: Disable transformFeedbackPreservesProvokingVertex
- isaspec: Add "displayname" for altering {NAME} when decoding
- isaspec: Add support for "absolute" branches
- isaspec: Add support for function and entrypoint labels
- isaspec: Add "custom" field type
- isaspec: Add callback after decoding an instruction
- isaspec: Rename isa_decode() to isa_disasm()
- isaspec: Add initial decoding support
- afuc: Fix xmov lexer typo
- afuc: Convert to isaspec
- afuc: Add setbit/clrbit
- afuc: Fix writing $00
- freedreno/afuc: Initial a7xx support
- ir3: Parse (eq) flag
- ir3, freedreno, tu: Plumb through SP_FS_PREFETCH_CNTL::ENDOFQUAD
- tu: Add missing last_baryf statistic
- freedreno, tu, ir3: Add last_helper statistic
- ir3: Gather pixlod status earlier
- ir3: Implement helper invocation optimization
- vk/graphic_state, tu: Use dynamic blend count from subpass
- freedreno/a7xx: Add CP_RESET_CONTEXT_STATE
- vk/graphics_state: Fix copying MS locations pipeline state
- tu: Remove MSAA draw state
- tu: Merge SAMPLE_LOCATIONS and SAMPLE_LOCATIONS_ENABLE draw states
- tu: Merge PC_RASTER_CNTL into RAST draw state
- tu: Stop reusing base Vulkan dynamic state enums
- tu: Merge depth/stencil draw states
- tu: Rename PrimID-related registers
- tu, freedreno/a6xx: Don't use VS for PrimID passthru state
- tu: Pull entangled shader state into program config
- ir3: Add ir3_find_input_loc() helper
- tu: Split up tu6_emit_vpc()
- freedreno, ir3, tu: Constify various uses of ir3_shader_variant
- ir3: Add helper to determine when variant exceeds safe constlen
- tu: Split program draw state into per-shader states
- tu: Fix per-view viewport state propagation
- tu: Fix tu6_emit_*_fdm size call
- tu: Fix assert in FDM state emission
- tu: Actually emit patchpoint for viewports with FDM
- nir/lower_subgroups: Don't do multiple lowerings at once
- nir/spirv: Add inverse_ballot intrinsic
- amd: Use inverse ballot intrinsic if available
- tu: Create singleton "empty" shaders
- tu: Start tracking shaders independently of pipeline
- tu: Move FS-specific pipeline information to the shader
- tu: Use shader directly for VS/TCS output size and patch size
- tu: Rewrite tessellation modes handling
- tu: Rework passing shared consts
- tu: Decouple program state from the pipeline
- tu: Use pipeline feedback loop flag indirectly
- tu: Rewrite remaining pipeline LRZ handling
- tu: Don't reference pipeline for some draw states
- tu: Make compute dispatch use the shader
- tu: Don't use pipeline for dynamic draw states
- tu: Don't use pipeline for bandwidth validity
- tu: Don't use pipeline for per_view_viewport
- tu: Don't use pipeline for active stages
- tu: Remove pipeline from state
- zink: Rework color clamping and conversion
- freedreno/fdl: Use A8_UNORM HW format for sampling
- tu: Support clearing A8_UNORM
- freedreno/fdl: Support PIPE_FORMAT_R5G5B5A1_UNORM on a6xx
- tu/clear_blit: Fix staging image view layer count
- tu/clear_blit: Allow VK_REMAINING_ARRAY_LAYERS as layerCount
- tu: Allow VK_WHOLE_SIZE in tu_CmdBindVertexBuffers2EXT pSizes
- tu: Implement vkCmdBindIndexBuffer2KHR
- tu: Implement vkGetImageSubresourceLayout2KHR and vkGetDeviceImageSubresourceLayoutKHR
- tu: Implement vkGetRenderingAreaGranularityKHR
- tu: Use new buffer usage flags
- tu: Support VkPipelineCreateFlags2CreateInfoKHR
- tu: Check for DEVICE_LOST in vkGetEventStatus()
- tu: Add maintenance5 properties
- freedreno/ci: Skip dEQP-VK.info.device_extensions
- tu: Expose VK_KHR_maintenance5
- freedreno/ci: Remove minetest trace
- v3d/ci: Remove minetest trace
- ir3/ra: Don't swap killed sources for early-clobber destination
- tu: Fix re-emitting VS param state after it is re-enabled
Corentin Noël (16):
- ci: Add locked flag to bindgen-cli installation
- virgl: Do not expose EXT_texture_mirror_clamp when using a GLES host
- ci: disable Collabora's LAVA lab for maintenance
- llvmpipe: make sure to initialize the lp_setup_context slots with the default values
- virgl: Cover all the formats defined in the virgl definition
- mesa: Ensure that the baselevel will never exceed the maximal supported number
- ci: Uprev virglrenderer
- freedreno/drm/virtio: Use MESA_TRACE_SCOPE instead of _BEGIN/_END
- tu: Use MESA_TRACE_SCOPE instead of _BEGIN/_END
- aux/tc: Use MESA_TRACE_SCOPE instead of _BEGIN/_END
- venus: Change the only occurrence of VN_TRACE_BEGIN/END to VN_TRACE_SCOPE
- util: Avoid the use of MESA_TRACE_BEGIN/END
- util/perf: Remove the tracing categories
- util: Remove MESA_TRACE_BEGIN/END
- mesa/bufferobj: ensure that very large width+offset are always rejected
- frontends/va: Remove wrong use of ProfileToPipe
Daniel Schürmann (9):
- nir/opt_move: fix handling of if-condition
- aco: append p_logical_end after monolithic RT shaders
- aco/insert_exec_mask: set Exact mode after p_discard_if when necessary
- aco: don't optimize cross-lane instructions across p_wqm
- aco: make p_wqm a marker instruction without Operands/Definitions
- aco: don't insert a copy when emitting p_wqm
- aco: insert a single p_end_wqm after the last derivative calculation
- aco/insert_exec_mask: Simplify WQM handling (1/2)
- aco/insert_exec_mask: Simplify WQM handling (2/2)
Daniel Stone (23):
- dri: Support 1555/4444 formats
- egl/dri2: Don't look up image extension twice
- egl/wayland: Always initialise fd_display_gpu
- egl/wayland: Add image loader extension for swrast
- egl/wayland: Never use DRI2_LOADER extension
- egl/wayland: Assume modern DRI interface versions
- egl/drm: Use IMAGE_DRIVER instead of DRI2_LOADER
- egl/drm: Assume modern DRI interface versions
- ci: Disable nouveau CI
- panfrost/vk: Use correct sampler dimensions for MSAA
- ci: Declare stages before jobs
- ci/radeonsi: Add new flake
- ci/d3d12: Add new flake
- ci/intel: Add new skqp flake
- ci/zink: Add new zink-lvp flakes
- ci/radeonsi: Skip more really slow tests
- ci/zink: Add another conversion fail on a618
- ci: Move farm-disable rules before anything else
- ci: Always set user container jobs to manual
- ci: Use container rules for containers
- ci: Only look at file changes for MRs
- ci: Fix pre-merge pipelines with no code changes
- ci: Try really hard to print final result string
Daniel van Vugt (1):
- glx: Increment dpy->request before issuing an error that had no request
Danylo Piliaiev (71):
- freedreno/cffdec: Decode CP_DRAW_AUTO
- freedreno, turnip: Clarify some RB_CCU_CNTL fields
- freedreno,turnip: Make number of VSC pipes configurable
- freedreno,turnip: Make CS shared memory size configurable
- freedreno,turnip: Make VS input attr/binding count configurable
- freedreno: Add A605, A608, A610, A612 GPUs definition
- turnip: Make multiview support configurable per generation
- ir3: Make FS tex prefetch optimization optional
- ir3: Use NIR info to enable per sample shading
- freedreno/regs: Rename SP_FS_CTRL_REG0.DIFF_FINE into LODPIXMASK
- ir3: Fix FS quad ops returning wrong values from helper invocations
- tu,freedreno: Forbid blit event for R8G8_SRGB due to gpu faults
- radv: fix unused non-xfb shader outputs not being removed
- vulkan/nir: Add common helper to check if output is XFB
- radv: Use common nir_vk_is_not_xfb_output
- turnip: Use common nir_vk_is_not_xfb_output
- freedreno/regs: Define unknown SP_FS_PREFETCH_CNTL fields
- freedreno/registers: Refactor gen_header.py to allow more options
- freedreno/registers: Generate python files with reg offsets
- freedreno: Add a list of raw magic regs
- freedreno: Fully define a730 and a740 device properties
- ir3/tests: Use fd_dev_info to infer GPU generation
- freedreno/computerator: Fix remaining issues with A7XX
- isaspec: Make possible to obtain gpu_id in <expr> blocks
- ir3/a7xx: cat5 mode1 has swapped tex/samp ids
- ir3/a7xx: Don't multiply global mem instruction's offset by 4
- ir3/a7xx: insert lock/unlock at the end of every compute shader
- ir3/a7xx: Add ccinv instruction
- ir3/a7xx: Use ccinv for data synchronization
- ir3/a7xx: Disable shared consts for a7xx
- tu/common: Generalize TU_GENX macro
- tu: Basic a7xx support
- freedreno/fdl: Set LOSSLESSCOMPEN for image when ubwc is enabled on a7xx
- tu/a7xx: Fix geometry shaders
- tu/a7xx: Fix tesselation shaders
- tu/a7xx: Fix multiview
- tu/a7xx: Fix flat shading
- tu/a7xx: Fix occlusion query
- tu/a7xx: Fix 3d blits after multiview usage
- tu/a7xx: Fix CmdDrawIndirectByteCountEXT
- tu/a7xx: Disable LRZ
- ir3/lower_tex_prefetch: Fix crash with lowered load_barycentric_at_offset
- tu: Exclude SP_UNKNOWN_AE73 from reg stomping
- tu: Call tu_cs_dbg_stomp_regs with appropriate GPU gen
- freedreno/replay: Add limited support for KGSL
- freedreno/rddecompiler: Update to handle a7xx
- freedreno/replay: Add "print" instr to ir3 asm to be used in replay
- freedreno/replay: Add "gpu_print" function for command streams
- tu/perfetto: Remove now unnecessary tu_perfetto_util
- tu/perfetto: Allow gpu time to be passed into tu_perfetto_submit
- tu/kgsl: Fix memory leak of tmp allocations during submissions
- tu/kgsl: Support u_trace and perfetto
- tu/a7xx: Correctly record timestamps for u_trace
- tu/virtio: Fix incorrect call to tu_perfetto_submit
- ci: Compile Turnip's virtio kmd in debian-arm64
- freedreno/registers: Refine a7xx push consts registers
- ir3,tu: Refactor push consts info plumbing
- freedreno: Make possible to specify A7XX feature flags
- turnip,ir3: Implement A7XX push consts load via preamble
- tu: Add push_consts_per_stage debug option
- tu: Fix VK_FORMAT_A8_UNORM_KHR using UBWC when !has_8bpp_ubwc
- tu/kgsl: Fix field order in kgsl_command_object init
- tu: Fix stale tu_render_pass_attachment::store_stencil with dyn rendering
- tu: Zero init tu_render_pass and tu_subpass for dynamic rendering
- tu: Disable preamble push consts when they are not used
- ir3: Fix values of #wrmask not being compatible with ir3 parser
- tu: Count a whole push consts range in constlen for PREAMBLE push consts
- freedreno/rddecompiler: Use fd_dev_gen to pass gpu_id to ir3 disasm
- freedreno/rddecompiler: Decompile repeated IBs
- freedreno: Fix field size of A6XX_TEX_CONST[3].ARRAY_PITCH
- tu: Fix reading of stale (V)PC_PRIMITIVE_CNTL_0
Dave Airlie (163):
- ci: remove binding model from the asan skips for lavapipe.
- gallivm: fix atomic global temporary storage.
- llvmpipe: fix fragdata/lastfragdata heuristic a bit more.
- nvk: add missing finish calls
- nvk: add some initial wsi framework.
- nvk: fix header guards to be less generic.
- nvk: add bind buffer memory
- nvk: Add initial queue
- nvk: add cmd buffer framework
- nvk: Reset pushbufs on command buffer reset
- nvk: reindent descriptor sets to mesa std.
- nvk: add initial descriptor pool framework.
- nvk: some boilerplate for descriptor sets
- nvk: add descriptor set bo allocation.
- nvk: implement buffer address.
- nvk: descriptor set freeing fix
- nvk: move to new command stream generator.
- nvk: port the blit and copy code to new command submission.
- nouveau/ws: drop the old push generators.
- nvk: link in codegen without gallium bits.
- nvk: Initial wiring in of the compiler
- nvk: Basic descriptor binding
- nouveau/vk: add support for compute classes to generator.
- nvk: retrieve gpc/mp counts from kernel.
- nvk: add support for preamble and tls allocation.
- nvk: add record result to cmd_buffer.
- nvk: add command stream upload buffer.
- nouveau/winsys: Add m2mf/compute objects
- nvk: add some basic format wrapping framework
- nvk: add some compute limits
- nvk: add basic nve4+ compute support.
- nvk: fix empty cmd submission.
- nouveau/ws: add a push reset just for references.
- nouveau/classes: add 906f header support.
- nvk: add initial 8/16 byte clears.
- nvk: fix pipeline pushbuf sizing
- nvk: increase graphics cpu push buffer
- nvk: fix depth emission ordering.
- nvk: add some limits/features from binary driver.
- nvk: add indexed draw support.
- nvk: assign vertex locations according to input attrib index
- nvk: lower io to temps to avoid output reads in vertex shaders
- nvk: handle NULL to destroy descriptor pool
- nvk: add basic primitive restart
- nvk: fix copy lower address extraction
- nvk: fix multiple pipelines failure allocation case.
- nvk: init dev->physical_device earlier.
- nvk/winsys: store device ptr into bo instead of ptr
- nvk: set the device fd
- nil: Fix image align and size constraints
- nvk: Report image alignments from NIL
- nouveau/winsys: allocate unique object handles across channels.
- nvk/nil: don't ask for compressed image kind
- nvk/barrier: handle host bit.
- nvk: add compute support for ampere
- nvk: add min_lod to spirv caps.
- nvk: fix r32_sint format support
- nvk: expose EXT_sampler_filter_minmax
- nvk: fix transform feedback crash when optimiser removes things.
- nvk: merge tess info between tcs/tes.
- nvk: introduce an optimisation loop.
- nvk: add support for D32_SFLOAT_S8_UINT
- nvk/query: fix push buffer size for copy pool results.
- nvk: init image fields for requirements
- nvk: handle alignments in device memory
- nvk/tess: don't emit patch control points in pipeline
- nvk: align geometry clip setting with nvc0
- nvk: fix independent color write masks.
- nvk: enable rgb32 texel buffer support
- nvk: enable EXT_depth_clip_control
- nvk: enable EXT_depth_clip_enable
- nvk: always sync internal cmd bufs for vma lifetimes.
- nouveau/winsys: add support for the vma bind interfaces
- nvk: Add support for sparse buffers
- nvk: Add support for sparse images
- nvk/queue: add support for syncobjs and sparse binds
- nvk: Handle pre-turing indirect buffers with sparse
- nvk: enable sparse features
- nvk: enable a bunch of external fence/semaphore bits
- nvk: enable sparse residency buffer on maxwell+
- nvk: add new internal bo allocation flag.
- docs: add two nvk exts to features.txt
- zink: use fprintf instead of printf to align the requirements warnings
- nvk: align sampler allocation counts with nvidia.
- zink: turn off threaded cpu access if not visible.
- nvk: add gart forced cmd pool side buffer.
- nvk: add cond render upload buffer.
- nvk: enable KHR_shader_clock.
- nvk: NOUVEAU_WS_BO_LOCAL is a trap.
- gallivm: drop unused info parameter
- llvmpipe/fs: drop cbuf 0 since it's lowered now.
- gallivm/nir: avoid using params->info
- llvmpipe/fs: move some tgsi checks in nir path to nir code.
- llvmpipe/cs: convert to using tgsi->nir
- llvmpipe/cs: drop tgsi for compute/mesh/task shader internals.
- lavapipe: use vk_buffer common code.
- lavapipe: use vk_buffer_range common code.
- llvmpipe/fs: switch to using tgsi->nir instead of handling tgsi
- llvmpipe/analyse: drop TGSI path.
- llvmpipe/fs: start using nir info in some places.
- llvmpipe/fs: drop the simple shader logic
- llvmpipe/fs: rewrite output finding using nir.
- nvk: add build_id linker argument.
- nir/gather: add support for fbfetch and bindless image loads.
- llvmpipe/cs: further cleanups after tgsi removal.
- llvmpipe: move to nir lowering for fquantize2f16
- rusticl: don't store ptrs to nir_variables across opt passes.
- llvmpipe: enable f16 paths on aarch64.
- clover/llvm: move to modern pass manager.
- nir: use a _clone so users calling their variable clone don't get a warning
- nir: rename nir_inline_functions.c to nir_functions.c
- nir: use nir_function_instructions_pass in the inliner.
- nir: move the libclc lowering over to functions file.
- nir/functions: use helper to get function for a name.
- nir/functions: put link state into a struct
- nir/functions: move linker pass to new helper
- nir: add nir function clone
- nir: don't inline linked functions
- gallivm/nir: split prepasses out to make per-function work easier.
- gallivm: rework translator to allow per-impl work.
- spirv/nir: parse function control and store in nir.
- nir: add driver_functions option to avoid inlining.
- nir: add a function usage tracker
- rusticl: use cleanup funcs
- gallivm: add support for function calling
- llvmpipe/cs: add support for function calls.
- llvmpipe: enable driver functions.
- radv: don't emit event code on video queues.
- spirv: use a pointer sized int type for opencl event_t
- clover: fix parameter arguments since recent translator changes.
- radv/video: take db alignment into account when allocating images.
- ac,radeonsi: move vcn enc structs to common
- ac,radeonsi: move vcn enc av1 default cdf file to common
- nir: add a deref slot counter that handles compact
- llvmpipe/linear: drop tgsi path.
- gallivm: drop tgsi aos paths.
- llvmpipe/nir: call gather info to update inputs read properly
- llvmpipe/fs: start converting interp/input paths to nir.
- llvmpipe/fs: start converting dervied state to nir based.
- llvmpipe/linear: convert to using nir for output.
- llvmpipe/linear: move to nir inputs
- draw/mesh: reset some user state values on mesh draws.
- llvmpipe/fs: fix regression in sample mask handling from tgsi removal.
- llvmpipe: reset viewport_index_slot in fb bind
- llvmpipe/cs: migrate to generic jit texture from pipe code.
- llvmpipe/cs: migrate cs image handle to common jit code.
- lavapipe: fix some whitespace in advance of other changes.
- lavapipe: fix subresource layers asserts
- lavapipe: support host image copying on compressed texture formats
- llvmpipe: don't create texture functions for planar textures.
- lavapipe: don't emit blit src/dst for subsampled formats.
- llvmpipe: don't support planar formats for buffers.
- lavapipe: convert sampler to use vk base class.
- lavapipe: cleanup copy code to use a local region variable.
- lavapipe: start introducing planes structure.
- lavapipe: allocate image and image view planes.
- lavapipe: handle planes in copies
- lavapipe: handle planes in get image sub resource
- lavapipe: add descriptor sets bindings for planar images
- lavapipe: handle planes in texture lowering.
- lavapipe: expose planar ycbcr formats and new ycbcr features
- lavapipe + docs: update ycbcr extension enables.
- intel-clc: avoid using spirv-linker.
David Heidelberg (82):
- ci/freedreno: update a530 flakes
- ci: build kernel in gfx-ci/linux and just use binaries in Mesa3D CI
- ci: update kernel to 6.3.13
- ci/freedreno: add fails introduced by upreving to 6.3.13
- Revert "lima/ci: temporarily disable deqp-egl tests due to timeouts"
- ci/radeonsi: stoney arb_timer_query got fixed between kernel 6.3.1..13
- ci/lima: EGL testing was disabled when fp16 fail was removed
- ci/freedreno: fix unexpectedpass flake on a630
- ci/freedreno: add another a530 flakes
- ci: add quirk for GitLab assuming changes is always true for scheduled runs
- ci/microsoft: when re-enabling Windows Farm, always run the container
- ci/freedreno: add a530 flakes, remove one fail which recently started passing
- ci/panfrost: introduce OpenGL testing with Mali-G57 MP5 on Asurada chromebook
- ci/freedreno: cover all texture gather flakes
- ci/freedreno: add a530 flake vs-lessthanequal-uvec4-uvec4
- ci/farms: always compare the code against main repository
- Revert "ci/farms: always compare the code against main repository"
- ci/kernel: add amd patch to prevent crashes when starting X
- ci/kdl: remove extra-verbose ls command
- ci/nouveau: add 20 minutes timeout to gk20a and align gm20b
- ci/freedreno: document another mapbuffer flake on a530
- ci/amd: fix timeouting radeonsi-raven-va-full job
- docs/ci: default to port 80 for the caching proxy
- docs/ci: update to systemd and used version of the trace for testing
- docs/ci: remove default nginx config, which we don't need for proxy
- bin/ci: handle errors more gracefully in update_traces_checksum script
- ci/freedreno: document another flakes on Adreno 530
- ci: add perfetto into mesa git-cache
- ci/panfrost: re-enable t760 and t860 traces as a nightly job
- CI: Re-enable G52 Vulkan testing
- ci/panfrost: t760-gles is nightly job, test also GLES 3 and 3.1
- ci/zink: Add flake seen in the wild
- ci/build: limit debian-build-testing to 30 minutes
- ci/amd: add glx\@glx-visuals-depth flake to raven
- ci/freedreno: document vs-nested-return-sibling-loop2 flake on Adreno 530
- ci/farms: enabled Microsoft job only when conditions are met
- ci/deqp: really remove the uncompressed results.csv file
- ci/baremetal: do not install curl, it's already there
- ci/baremetal: shorten BM_KERNEL to filename and BM_DTB to name only
- ci/freedreno: document another a530 flake batch
- ci: remove LAVA prefix from variables which can be used also elsewhere
- ci/zink: drop a630, which we currently have very low amount available
- ci/freedreno: the tag belongs to the apq8016 only
- ci/freedreno: switch references, the farm-rules takes care about this
- ci/freedreno: handle disabling farm properly for each FD/Collabora farm
- ci/freedreno: another batch of Adreno 530 flakes
- gtest: backport ansi color fix
- ci: disable Material Testers.x86_64_2020.04.08_13.38_frame799.rdc trace
- panfrost/ci: revert Disable T720
- ci/piglit: add extra space on top to prevent single quote getting into URL
- ci/freedreno: There is only one King of Town.
- ci: switch to 6.4 kernel, improving Adreno 660 reliability
- ci/iris: add GL46.arrays_of_arrays_gl.SizedDeclarationsPrimitive timeout
- ci/panfrost: add G52 flakes
- ci/panfrost: we have enough device, parallelize Vulkan tests
- ci/virgl: flakes in functional.draw_buffers_indexed group
- ci/freedreno: add another a530 flake
- ci/panfrost: add G52 simple_tests.partial_image_pot_same_format_noclear flake
- panvk: architecture isn't invalid, just unsupported
- panvk: catch unsupported arch in the panvk_physical_device_init
- Revert "ci: disable a660 jobs"
- docs: add LAVA farm informations
- ci: disable Google Freedreno farm, currently timeouting on all jobs
- Revert "ci: disable Google Freedreno farm, currently timeouting on all jobs"
- ci/farms: no need to check RUNNER_TAG for Collabora farm
- ci/traces: extend no-output timeout by 5 minutes
- ci/venus: add fragment.32B_in_memory_with_vec4_s32 flake
- iris: do not mention specifically clover for OpenCL support
- ci/freedreno: disable broke cheza (Adreno 630) runners
- ci/bare-metal: correct workaround for R8152 issue while retrieving TFTP data
- ci/bare-metal: drop unused imports, sort, use SPDX license
- ci/lima: farm is down, disable for now
- ci: do not report failed job when flakes reporting fails
- ci/freedreno: re-enable Cheza (Adreno 630) runners
- ci/traces: upload only missing trace images
- ci/traces: keep images for every job except the performance testing
- ci/traces: rename upload function to reflect it works with S3
- ci/traces: always export piglit EXTRA_ARGS
- ci: ci_marge_queue.py
- ci/freedreno: fix copy paste causing a618_gl being run only in manual pipeline
- ci/freedreno: disable Adreno 660 Vulkan pre-merge
- ci/traces: drop the freedoom-phase2-gl-high.trace
David Rosca (70):
- radeonsi: Use DIV_ROUND_UP instead of ALIGN_POT
- frontends/va: Skip processing buffers already converted with EFC
- frontends/va: Don't use EFC with scaling or filtering enabled
- radeonsi/vcn: Don't use chroma in AV1 encode with RGB input
- frontends/va: Parse H264 SPS for video signal parameters
- frontends/va: Parse HEVC SPS for video signal parameters
- frontends/va: Add postproc support for converting to full range
- radeonsi/vcn: Set H264 video signal parameters in bitstream
- radeonsi/vcn: Set HEVC video signal parameters in bitstream
- radeonsi/vcn: Enable full/limited range support for H264/HEVC/AV1
- radeonsi/vcn: Fix setting color range in AV1 bitstream
- gallium/auxiliary/vl: Fix RGB->YCbCr full range matrix
- gallium/auxiliary/vl: Handle UV subsampling in compute_shader_yuv
- gallium/auxiliary/vl: Fix blurry output of compute_shader_yuv
- frontends/va: Add YUV420 to NV12 postproc conversion
- gallium/auxiliary/vl: Fix chroma and blurry output of cs video_buffer
- gallium/auxiliary/vl: Fix chroma offset of compute_shader_weave
- frontends/va: Also map VAImageBufferType for reading
- frontends/va: Alloc interlaced surface for interlaced pics
- frontends/vdpau: Alloc interlaced surface for interlaced pics
- radeonsi: Don't prefer interlaced for video decode
- ci/amd: Skip VAAPI CreateSurfacesWithConfigAttribs/1121 test
- frontends/va: Don't allow multi-plane derive without driver support
- frontends/va: Init view_resources array in vlVaPut/GetImage
- radeonsi: Copy all planes with multi-plane staging textures
- radeonsi: Enable PIPE_VIDEO_CAP_SUPPORTS_CONTIGUOUS_PLANES_MAP
- ci/amd: Skip all VAAPI tests that creates too many huge surfaces
- radeonsi/vcn: Update rate control when framerate changes with HEVC
- frontends/va: Ignore requested size when creating VAEncCodedBufferType
- gallium/auxiliary/vl: Set correct csc matrix in set_buffer_layer
- radeonsi/vcn: Fix leaking fences in decode
- gallium/auxiliary/vl: Add BT.709 full csc matrix
- frontends/va: Set csc matrix in postproc
- gallium/auxiliary/vl: Don't set csc matrix in video_buffer/rgb_to_yuv_layer
- frontends/va: Add BT.709 as supported postproc color standard
- Revert "radeonsi/vcn: add an exception of field case for h264 decoding"
- gallium/auxiliary/vl: Set vertex element src_stride in vl_deint_filter
- gallium/auxiliary: Fix util_compute_blit half texel offset with scaling
- gallium/auxiliary/vl: Map range when updating constants
- gallium/auxiliary/vl: Clamp coordinates in compute shaders
- gallium/auxiliary/vl: Support chroma sample location in compute shaders
- frontends/va: Support chroma sample location in postproc
- frontends/va: Flush after unmapping VAImageBufferType
- frontends/va: Parse chroma sample location in H264/HEVC SPS
- radeonsi/vcn: Set H264/HEVC chroma sample location in bitstream
- radeonsi/vcn: Don't hang GPU when using DCC surface as encoder input
- frontends/va: Track surfaces in context
- frontends/va: Destroy fences when destroying surface or context
- radeonsi/vcn: Implement destroy_fence vfunc
- frontends/va: Process VAEncSequenceParameterBufferType first in vaRenderPicture
- frontends/va: Set default rate control values once when creating encoder
- gallium/auxiliary/vl: Add RGB to YUV compute shader
- gallium/auxiliary/vl: Use chroma offset in YUV to RGB weave compute shader
- gallium/auxiliary/vl: Fix YUV to RGB bob compute shader deinterlacing
- gallium/auxiliary/vl: Only map the shader constants buffer in render
- frontends/va: Add High Quality preset mode
- radeonsi/vcn: Add High Quality encoding preset for AV1
- radeonsi: Fix plane size in si_copy_multi_plane_texture
- frontends/va: Implement vaMapBuffer2
- frontends/va: Fix locking in vlVaBeginPicture
- frontends/va: Parse H264 SPS for max_num_reorder_frames
- util/vl: Fix vl_rbsp parser with bitstreams without emulation bytes
- frontends/va: Fix parsing packed headers without emulation bytes
- radeonsi/vcn: Add encode support for H264 B-frames
- frontends/va: Map decoder and postproc surfaces for reading
- radeonsi: Fix offset for linear surfaces on GFX < 9
- gallium/auxiliary/vl: Fix coordinates clamp in compute shaders
- gallium/auxiliary: Fix coordinates clamp in util_compute_blit
- gallium/auxiliary/vl: Scale dst_rect x0/y0 when rendering chroma plane
- util/rbsp: Fill bits twice if reading more than 16 bits
Derek Foreman (2):
- vulkan/wsi: Allow binding presentation_timing when software rendering
- vulkan/wsi: warn about unset present_mode in PresentModeCompatibilityExt
Dmitry Baryshkov (3):
- gallium: move kmsro definition to the bottom of the file
- gallium: unbreak kmsro/freedreno case
- tu: Pass real size of prime buffers to allocator
Dmitry Osipenko (3):
- util/cache_test: Re-add test for disabled cache
- util/cache_test: Fix disabled cache test using SHADER_CACHE_DISABLE_BY_DEFAULT
- util/cache_test: Add test for get/put() with disabled cache
Dor Askayo (1):
- nouveau: add exported GEM handles to the global list
Dr. David Alan Gilbert (6):
- rusticl/core: Add profiling time storage (queued) to event
- rusticl: Wire the 'queued' profiling time up
- rusticl: Wire the 'submit' profiling time up
- rusticl: Wrap pipe queries
- rusticl: Wrap pipe query reads
- rusticl: Wire the 'start' and 'end' profilng times up
Dylan Baker (4):
- VERSION: bump to 23.3.0-devel
- docs: Update release calendar for 23.2.0-rc1
- docs: truncate feature list for 23.3-devel
- meson: use a single dependency call for lua
Echo J (5):
- nvk: Fix some cast defines
- nvk: Add A8B8G8R8_*_PACK32 format support
- nvk: Add bufferImageGranularity limit
- nvk: Reset offset value in ResetDescriptorPool
- nil: Add A4B4G4R4_UNORM format support
Emma Anholt (111):
- ci/radv: Clarify when the ANGLE GS failures started happening.
- ci: Uprev ANGLE to 0518a3ff4d4e ("Android: Simplify power metrics collection")
- ci/tgl: Improve the info for ANGLE's MSAA regression on TGL.
- ci/tu: Add more crash cases for the multithreading bugs caught on a630.
- ci/tu: Mark descriptor_buffer.basic.limits as failing in gmem too.
- ci/tu: Drop some xfails for !24086
- tu: Fix data race in userspace VMA management.
- ci/a5xx: Add another GPU hanging piglit test to the skips.
- Revert "ci: Disable nouveau CI"
- nvk: Avoid strict aliasing warning in the pushbuffer encoding.
- nvk: Fix uninitialized result usage in NVK_DEBUG_ZERO_MEMORY.
- nvk: Fix unused result warnings in pushbuf resets.
- nvk: Remove duplicate (disabled) point sprite setup.
- nvk: Fix missing init of the stages to sync against.
- nvk: Use depth_clamp_enable to select PIXEL_*_Z_CLAMP.
- nouveau/winsys: Fix an undefined use in the error path.
- nvk: Quiet a compiler warning.
- nvk: Clean up redundant vendor checking for physical device creation.
- nvk: Add support for probing as a platform device.
- nvk: Disable shaderStorageImageReadWithoutFormat pre-Maxwell.
- freedreno/a5xx: Fix border color structure size.
- freedreno/a5xx: Skip emitting unused texture descriptors for images.
- freedreno/ir3: Move pvtmem per-fiber size alignment to the compiler.
- ci/freedreno: Drop a bunch of stale a530 xfails.
- ci/freedreno: Sort another a530 xfail with its friends.
- ci/freedreno: Update comments for some a530 xfails.
- ci/freedreno: Add some more db820c xfails.
- freedreno/devices: Move fibers_per_sp to the common info struct.
- freedreno/devices: Set num_sp_cores explicitly for pre-gen6.
- freedreno/a6xx: Move pvtmem allocation to ir3_gallium.
- freedreno/a3xx: Add the shift for MEMSIZEPERITEM according to db410c docs.
- freedreno/a5xx: Refactor SHADER_OBJ emit to a helper function.
- freedreno/a5xx: Set num_sp_cores and set PC/VFD_POWER_CNTL accordingly.
- freedreno/a5xx: Add private mem support.
- freedreno/cffdec: Fix decode on pixel 2 blob's COMPUTE_CHECKPOINT
- ci/freedreno: Add a regression test for decoding a540 blob's compute shaders.
- freedreno: Fix crashdec pre-a6xx.
- freedreno/a5xx: Skip SSBO emit when none are enabled.
- vulkan/util: Make multialloc succeed with 0 allocations.
- turnip: Track the first/last subpass an attachment is used in.
- turnip: Skip emitting empty CP_COND_REG_EXEC.
- turnip: Save the renderpass's clear values in the cmdbuf state.
- turnip: Move gmem clears and loads to the first subpass that uses them.
- turnip: Move sysmem clears to the first subpass that uses them.
- ci/freedreno: Skip some tests on a5xx that destabilize other tests.
- freedreno/a3-5xx: Don't try to emit ISAM for SSBO loads.
- ci/turnip: Add a660 VK coverage.
- disk_cache: Disable the "List" test for RO disk cache.
- blorp: Disable unaligned partial HIZ fast clears for HIZ_CCS too.
- intel/fs: Move defin/defout setup to the start of the loop.
- intel/fs: Move the defin[]/defout[] screening up to livein[]/liveout[] setup.
- intel/fs: Simplify compute_start_end().
- ci/freedreno: Add another excessive-constlen UBO skip.
- ci/anv: Drop DEQP_VER:vk setting.
- ci/anv: Drop "-vk" from the job name.
- ci/anv: Add a manual full VK run for TGL.
- ci/anv: Add testing on JSL.
- freedreno: Build drm subdir before perfcntrs, which uses it.
- ci/intel: Add various updates from our nightly runs.
- ci/virgl: Disable virgl-iris-traces.
- ci/zink: Add a few updates for anv/tgl from the nightly runs.
- ci/fastboot: Use a case insensitive match for a fastboot line.
- ci/etnaviv: Skip some tests that hang the GPU and knock out other tests.
- ci/etnaviv: Drop some gc2k flakes that I think are resolved.
- ci/anv: Drop incorrect xfail addition for TGL
- ci/anv: Drop the 16bit.scalar.13 skip.
- ci/etnaviv: Minor xfail/flake polishing.
- ci/etnaviv: Skip a GLES2 test that times out the asan job.
- ci/zink: Skip more doubles tests on anv that flake at 3 minute timeouts.
- ci/docker: Clear the results file before starting a new deqp test run.
- ci/crocus: Add a related flake to a known one.
- ci/etnaviv: return gl-1.4-tex1d-2dborder as a known flake
- ci/crocus: Add known piglit flakes
- ci/hasvk: Add a bunch of new CTS border color fails.
- i915: Re-clang-format and enforce it in CI.
- i915: Print the relevant counts vs limits when throwing errors.
- i915: Don't log I915_DEBUG=fs output for blit shaders.
- i915: Save fragment program compile error messages in the fragment shader.
- i915: Do a test compile at glLinkShader() time.
- i915: Make exceeding tex indirect count fatal.
- i915: Use nir_group_loads() to reduce texture indirection phases.
- ci/crocus: Generalize the drawarrays-vertex-count flakes.
- ci/zink: Skip 3-minute-long glx-visuals timeouts.
- ci/zink: Skip dmat[34] op tests in general, as well
- ci/crocus: Disable flaky unvanquished-ultra trace
- nir/print: Decode system values in the variable declarations.
- ci/zink: Add a TGL flake that's showed up in nightlies recently.
- ci/radeonsi: Drop an xfail for vangogh.
- i915: Make I915_DEBUG=fs log shaders that fail to link due to CF.
- nir: Flatten ifs with discards in nir_opt_peephole_select for HW without CF.
- glsl: Remove lower_discard().
- ci/zink: Only test half of piglit pre-merge on anv.
- ci: Stop doing internal retries in bare-metal.
- ci/bare-metal: Drop the 2 vs 1 exit code from poe_run.
- ci/bare-metal: Default our boards to a 20-minute timeout for the whole job.
- ci/iris: Drop parallel on kbl piglit to 2.
- ci/freedreno: Fold a630_egl into a630_gl.
- ci/freedreno: Move skqp testing to a618.
- ci/zink: Cut zink-lvp coverage in half.
- ci/freedreno: Generalize the implicit_unmap timeouts.
- ci_run_n_monitor: Poll mesa/mesa and user/mesa for pipelines at the same time.
- glx: Delete support for GLX_OML_swap_method.
- ci: drop skip for glx-swap-copy.
- dri: Drop a duplicate mesa vs pipe format table.
- docs/ci: Drop old instructions for farm disabling
- docs/ci: Add some links in the CI docs to how to track job flakes
- glsl: Remove int64 div/mod lowering.
- llvmpipe: Set nir_lower_dround_even.
- nir: Add nir_lower_dsign as 64-bit fsign lowering.
- glsl: Retire dround lowering.
- ci_run_n_monitor: Always resolve --rev arguments for looking up pipelines.
Eric Engestrom (194):
- ci: avoid running hardware jobs if lint fails - now on LAVA too!
- ci: avoid running hardware jobs if lint fails - now on Windows too!
- ci: replace copy of nouveau rules with reference
- ci: drop leftover kernel configs
- ci: use !reference for scheduled_pipeline retry rule
- ci: add .llvmpipe-manual-rules and use it
- ci: add .gallium-core-rules and use it instead of gallium_core_file_list anchor
- ci: replace llvmpipe_file_list anchor with reference
- ci: replace softpipe_file_list anchor with reference
- ci: replace lavapipe_file_list anchor with reference
- ci: replace iris_file_list anchor with reference
- ci: replace radv_file_list anchor with reference
- ci: replace radeonsi_file_list anchor with reference
- ci: replace virgl_file_list anchor with reference
- ci: move etnaviv files rules to src/etnaviv/ci/gitlab-ci.yml
- ci: move freedreno files rules to src/freedreno/ci/gitlab-ci.yml
- ci: move nouveau files rules to src/gallium/drivers/nouveau/ci/gitlab-ci.yml
- ci: move panfrost files rules to src/panfrost/ci/gitlab-ci.yml
- ci: move broadcom files rules to src/broadcom/ci/gitlab-ci.yml
- ci: move lima files rules to src/gallium/drivers/lima/ci/gitlab-ci.yml
- ci: move amd files rules to src/amd/ci/gitlab-ci.yml
- ci: move microsoft files rules to src/microsoft/ci/gitlab-ci.yml
- ci: move zink files rules to src/gallium/drivers/zink/ci/gitlab-ci.yml
- ci: move virtio files rules to src/virtio/ci/gitlab-ci.yml
- ci: move intel files rules to src/intel/ci/gitlab-ci.yml
- ci: move virgl files rules to src/gallium/drivers/virgl/ci/gitlab-ci.yml
- ci: move llvmpipe files rules to src/gallium/drivers/llvmpipe/ci/gitlab-ci.yml
- ci: move softpipe files rules to src/gallium/drivers/softpipe/ci/gitlab-ci.yml
- ci: move lavapipe files rules to src/gallium/drivers/lavapipe/ci/gitlab-ci.yml
- ci: delete install.tar after extracting it to avoid re-uploading it
- docs: add release notes for 23.1.4
- docs: add sha256sum for 23.1.4
- docs: update calendar for 23.1.4
- asahi: drop unused include paths
- ci/lint: deduplicate formatting check jobs
- ci/lint: also print a diff for rust format issues
- ci: allow hw jobs even if lint jobs fail for non-Marge pipelines
- ci: print rustfmt's version
- ci: print clang-format's version
- bin/ci_run_n_monitor: get git sha from pipeline if specified, instead of requiring --rev to match
- lavapipe/ci: use tighter changes: rules
- ci: add a 10min job timeout to formatting checks
- ci: reduce bare-metal retries of poe_run to only 3 attempts
- broadcom/ci: reduce vc4-rpi3-gl timeout to 30min (instead of 1h)
- broadcom/ci: reduce v3d-rpi4-gl timeout to 30min (instead of 1h)
- broadcom/ci: reduce v3d-rpi4-traces timeout to 30min (instead of 1h)
- broadcom/ci: reduce v3dv-rpi4-vk timeout to 30min (instead of 1h)
- ci: add .core-rules to .gallium-core-rules
- ci: drop rule for non-existent src/include/
- docs: add release notes for 23.1.5
- docs: add sha256sum for 23.1.5
- docs: update calendar for 23.1.5
- ci: include some timing information in the git cache download script
- docs/ci: stop trying to enumerate drivers that are tested using VK-GL-CTS
- docs/ci: in paragraph about the CI being overwhelmed, mention our tool to help with that
- docs/ci: drop mention of build systems variants in the CI
- docs/ci: expand the description of test suites
- bin: add wrapper to run scripts in a python venv
- bin/ci/ci_run_n_monitor: use venv wrapper
- bin/ci/gitlab_gql: use venv wrapper
- bin/ci/update_traces_checksum: use venv wrapper
- bin/pick-ui: use venv wrapper
- ci: include mold in x86_64_test-base & rootfs images
- ci: use mold to build deqp
- zink/ci: set the default timeout for zink jobs to 30min instead of 1h
- egl: make _eglFilterConfigArray static
- egl: fixup _eglFilterConfigArray() params and drop _eglFallbackMatch() wrapper
- ci: build nvk
- ci: document max image tag length
- docs/radv: mark VK_EXT_tooling_info as implemented
- docs/radv: mark VK_INTEL_shader_integer_functions2 as implemented
- git-blame-ignore-revs: repeat instruction on how to enable to avoid having to look for it
- git-blame-ignore-revs: add radv formatting commit
- git-blame-ignore-revs: add pvr formatting commit
- meson: fix indentation
- docs/v3dv: mark direct display extensions as implemented
- ci: reorder vk drivers alphabetically in debian-vulkan job
- ci: build hasvk in debian-vulkan job
- ci/zink+radv: set a timeout of 2x the normal runtime
- amd/ci: drop duplicate test expectations
- panfrost: upcast uint8/uint16 before shifting them beyond their range
- ci/a530: document piglit flake
- docs: add release notes for 23.1.6
- docs: add sha256sum for 23.1.6
- docs: update calendar for 23.1.6
- docs: add one more 23.1.x release
- ci: rename \*.log to \*.txt to work around gitlab bug
- ci/freedreno: reuse freedreno_gl_file_list instead of re-definining it
- egl: bump extension string length
- vc4: drop duplicate .lower_ldexp
- zink: fix format in zink_make_{image,texture}_handle_resident()
- v3dv: fix VK_PIPELINE_ROBUSTNESS_{BUFFER,IMAGE}_BEHAVIOR_DEVICE_DEFAULT_EXT copy/paste typo
- v3dv: fix copy/pasted type of \`sample`
- v3dv: fix shader stage name in error message
- v3d/qpu: fix type of function argument
- ci/deqp: backport fix for dEQP-EGL.functional.wide_color.*_888_colorspace_*
- ci/farm-rules: fix missing valve-infra jobs in scheduled pipelines
- bin/ci_run_n_monitor: error out if both --project and --pipeline-url are passed
- ci: document farm rules
- ci/b2c: skip install.tar extraction if the tarball is not present
- ci/b2c: don't allow failures in test script preparation
- ci/b2c: assert that install folder is present whether or not the tarball was extracted
- ci/amd: split the polaris10 rules into one for each farm
- ci: skip containers & build jobs when disabling a farm
- docs: add release notes for 23.1.7
- docs: add sha256sum for 23.1.7
- docs: update calendar for 23.1.7
- docs: add one more 23.1.x release
- ci: taking igalia farm offline
- ci/b2c: drop logic to remove install.tar
- ci: drop clover leftover
- Revert "ci: taking igalia farm offline"
- bin/ci_run_n_monitor: print in which repo we're looking for the pipeline
- bin/ci_run_n_monitor: automatically pick MR pipelines when they exist
- ci: remove duplicate fork pipeline in MRs
- ci_run_n_monitor: add comment to explain "MR > fork" logic
- ci: don't run everything just because a farm gets re-enabled
- ci/windows: centralize definition of windows runners tags
- ci/windows: add windows docker runner tags to .windows-docker-vs2019
- ci/windows: drop build rules from test jobs
- ci: document which image tags need to be bumped when updating piglit
- ci: document which image tags need to be bumped when updating {alpine,debian,fedora}/x86_64
- ci/farm-rules: rename .disable-farm-mr-rules to make it clear it's only about MRs
- ci/farm-rules: re-add "run every container and build job when a farm gets re-enabled"
- ci/zink: drop redundant \`MESA_LOADER_DRIVER_OVERRIDE: zink`
- docs: add release notes for 23.1.8
- docs: add sha256sum for 23.1.8
- docs: update calendar for 23.1.8
- docs: add another 23.1.x
- ci: limit build jobs to 30min so that they can retry when they go wrong
- docs: drop outdated and redundant note about the minimum meson version
- ci/zink+radv: specify that zink-radv-navi10-valve should run in the mupuf farm
- ci/zink+radv: bump the timeout of zink-radv-navi10-valve by 10 minutes
- docs: add calendar for 23.3
- ci: unify container and build jobs rules
- docs/meson: drop mention that our meson is ready
- ci/docs: drop extra overwritten rules
- ci/zink+radv: document flake
- docs: document the merging process and what is allowed or not
- ci: drop unused shader-db clone + build from alpine image
- ci: drop unused shader-db clone + build from fedora image
- ci: move shader-db clone/build into its own script
- ci/deqp-runner: fix indentation
- ci/deqp-runner: restore exit-on-error after getting deqp-runner's exit code
- ci: fix shebang in build-deqp-runner.sh
- docs: add release notes for 23.1.9
- docs: add sha256sum for 23.1.9
- docs: update calendar for 23.1.9
- ci: drop unused ephemeral packages in alpine image
- docs/ci: rewrite the "farm maintenance ^ other change" rule to mean what we actually meant
- ci: skip dEQP-VK.api.driver_properties.conformance_version for everyone
- pick-ui: use assignment expressions
- pick-ui: use more expressive variable names
- pick-ui: add \`Backport-to: XX.Y` nomination
- v3d/ci: move traces job to wayland
- ci: print deqp version in the job log
- ci/b2c: move to the shiny new \`gfx-ci/ci-tron` repo
- ci/b2c: use latest mesa-trigger image
- include/dri_interface.h: restore define mistakenly removed in !25587
- ci_run_n_monitor: dependency jobs must always be started
- util/xmlconfig: drop driInjectDataDir() now that DRIRC_CONFIGDIR is always supported
- util/xmlconfig: inline datadir
- ci/b2c: change artifacts path to match baremetal and LAVA
- VERSION: bump for rc1
- .pick_status.json: Update to e64a97694ac9dc97f65e1a8e91a5c9789109fd2c
- .pick_status.json: Update to 4cdd094ae1e97d857a6b9dbc291d7bbe6ea266ac
- .pick_status.json: Update to e4a1bc70dd739ca8addddc940af08312b038e288
- .pick_status.json: Update to faed5d647f2416bb0ce3a9d33a3955169c70dc52
- VERSION: bump for 23.3.0-rc2
- .pick_status.json: Update to 1f1ec1c6bcc2a32a3c1df8c2cc7a2f4e7139b7ec
- .pick_status.json: Mark 8dda860f83ac30d042dc6beb4438cc925d1fd130 as denominated
- .pick_status.json: Update to 7d6f9ccfbeab050c26775d5e03578a01526cbfcb
- .pick_status.json: Update to aa33ca0a52591961f8ae01dc253354462ed17c18
- .pick_status.json: Update to a77ea9555aa00cc12f3d1c440252e940ff552500
- .pick_status.json: Mark 227300345ed38377190b0eaf08694d5c42ee7e60 as denominated
- VERSION: bump for 23.3.0-rc3
- .pick_status.json: Update to 56451ce773c11094a8c08fdc6b500bb8bdcf37e1
- .pick_status.json: Mark fa7ec4226bdf48bf63438e303af83ecd58ec95f2 as denominated
- .pick_status.json: Update to 08f851f4361cfbdb211dc70d03cf3ebff331c3ee
- .pick_status.json: Update to 03a7cb261828b350dd9b56bd74850197ca9eba33
- .pick_status.json: Mark fcfa68a632e5711cc657b103c9a0384928e9bf49 as denominated
- VERSION: bump for 23.3.0-rc4
- .pick_status.json: Update to f05688aa3299a27430119b27e45181a6f415bff8
- egl/dri2: increase NUM_ATTRIBS to fit all the attributes
- .pick_status.json: Update to f39ed0063b4cd3e5a71efad2d43ce31f574c698d
- .pick_status.json: Update to b07a58157d0b110dbc09a42cffe7046c3200dd3b
- VERSION: bump for 23.3.0-rc5
- .pick_status.json: Update to f843b14c171299e1696ca6d971ccaa496f60c3ab
- intel/perf: fix regex escaping
- intel/ci: fix .hasvk-manual-rules
- VERSION: bump for 23.3.0
- Revert "VERSION: bump for 23.3.0"
- docs: add release notes for 23.3.0
- Revert "docs: add release notes for 23.3.0"
Erico Nunes (10):
- lima/ppir: don't optimize loads with different block successors
- lima/ppir: convert to nir_legacy
- lima/gpir: switch to register intrinsics
- egl/drm: fix EGL_EXT_buffer_age with gbm contexts
- lima: fix plbu block stride calculation
- ci: disable lima LAVA lab for maintance
- Revert "ci: disable lima LAVA lab for maintance"
- v3dv: allow headless device without display device
- Revert "ci/lima: farm is down, disable for now"
- v3dv: Rework to remove drm authentication for wsi
Erik Faye-Lund (30):
- meson: report with_glvnd in summary
- docs: upgrade bootstrap to 5.3.1
- docs: expand mobile-menu without js
- panfrost: delete stale editorconfig file
- docs/panfrost: link to lima
- docs/panfrost: use code-blocks with wrapping for long blocks
- docs/panfrost: use math-role to denote powers of two
- docs: fix linkcheck
- docs: update a few links to https
- docs: update anchor for link
- docs: update link to git-wiki
- docs: link to upstream etnaviv
- docs: apply some trivial redirects
- docs: use doc-role when linking to lists article
- docs: keep up with intels ever-moving documentation
- docs: mark some redirects as allowed
- docs: only link to old docs from html
- docs: use html_static_path for static files
- ci/etnaviv: update ci expectation
- ci/etnaviv: allow failure on failing test
- zink: fix wording of warning
- ci/etnaviv: move failure to flake
- meson: add wayland-protocols from meson wrapdb
- util/xmlconfig: add an env-var for overriding drirc search dir
- meson: add src/util to the drirc search path
- docs/relnotes: remove cruft from end of lines
- docs/ci: escape at-symbols
- docs/relnotes: escape some at-symbols
- bin/gen_release_notes: escape at-symbols
- panfrost: use perf_debug instead of open-coding
Faith Ekstrand (809):
- nv50/ir: Convert to new-style NIR registers
- nv50/ir: Support vector movs
- intel/fs: Add support for new-style registers
- intel/vec4: Assume get_nir_dest() provides a sane write-mask
- intel/vec4: Add support for new-style registers
- intel: Switch to intrinsic-based registers
- intel/fs: Drop support for nir_register
- intel/vec4: Drop support for nir_register
- anv,hasvk,iris: sampler_prog_key::swizzles is only used on crocus
- nir: Properly handle divergence for load_reg
- nir/trivialize: Maintain divergence information
- nir/trivialize: Trivialize cross-block loads
- vc4: Convert to new-style NIR registers
- nir/schedule: Support load/store_reg
- broadcom/compiler: Convert to new-style NIR registers
- intel/fs: Use write masks from store_reg intrinsics
- intel/fs: Rework the overlapping mov/vec case
- intel/fs: Assume NIR is in SSA form
- nir: Add a backend_flags field to nir_tex_instr
- intel/fs: Add a parameter to speed up register spilling
- nir/builder: Allow tex helpers on image types
- nir/builder: Add a nir_txs_deref() helper
- vulkan: Add a core vk_buffer_view struct
- vulkan: Add a more direct way to use a NIR shader
- vulkan: Add a vk_query_pool base object
- vulkan: Add common vkCmdBegin/EndQuery wrappers
- vulkan/format: Add the remaining 1-plane YCbCr formats
- vulkan: Add a core vk_sampler struct
- nv50/nir: Lower to scratch AFTER optimization
- nouveau: Allow GLSL_SAMPLER_DIM_SUBPASS*
- nouveau/nir: Implement support for compact arrays
- nouveau/codegen: Handle/indirect goes before sample index
- nouveau/codegen: Use a NULL format for PIPE_FORMAT_NONE for images
- nouveau/codegen: Don't convertSurfaceFormat for unknown formats
- nv50/ir: Run nir_divergence_analysis before out-of-SSA
- anv: Use vk_sampler
- anv: Use vk_buffer_view
- vulkan: Add init/finish helpers for vk_query_pool
- anv: Use vk_query_pool
- anv: Use the common versions of vkBegin/EndQuery()
- nir/builder: Don't assume we have compiler options
- Revert "mesa, compiler: Move gl_texture_index to glsl_types.h"
- Revert "compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h"
- vulkan: Use VkBufferUsageFlags2 in vk_buffer
- clang-format: Set ColumnLimit to 78
- nvk: Implement EnumerateInstanceVersion
- nvk: Add stub implementations of VkImage and VkImageView
- nvk: Add stub implementation of VkSampler
- nvk: Add a stub implementation of VkBuffer
- nvk: Implement VkDescriptorSetLayout
- nvk: Implement VkPipelineLayout
- nvk: Add initial descriptor set lowering
- nvk: Implement vkUpdateDescriptorSets
- nvk: Expose nvk_descriptor_stride_align_for_type
- nvk: Re-format descriptor set layouts
- nvk: Re-format pipeline layouts
- nvk: Re-format descriptor sets some more
- nvk/buffer: Take an offset in nvk_buffer_address
- nvk/buffer: Add a push_buffer_ref helper
- nvk/copy: Use nvk_buffer_address in CmdCopyBuffer
- nvk/image: Add image address helpers
- nvk/copy: Use nvk_image_base_address()
- nvk: Add an nvk_device_physical helper
- nvk: Add a skeleton for pipelines
- nvk: Re-arrange nvk_descriptor_set.h a bit
- nvk: Reformat nvk_nir_lower_descriptors
- nvk: Add a couple descriptor set address helpers
- nvk: Move nvk_cmd_pool cast definitions
- nvk: Rework whitespace in nvk_cmd_buffer.c
- nvk: Add a root descriptor table
- nvk: Fetch descriptor set addresses from the root table
- nvk: Re-arrange nir_lower_explicit_io a bit
- nvk: Lower load_global_constant_offset
- nvk: Drop image_view_init
- nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable
- nvk: Allow R32_UINT
- nvk: Mark nvk_push_descriptor_set_ref() inline
- nvk: Add a descriptor table data structure
- nvk: Copy in the nouveau TIC format table
- nvk/image_view: Reformat and fix Create/DestroyImageView
- nvk: Add an image descriptor table to the device
- nvk: Fill out TIC table entries for image views
- nvk: Set b->cursor when lowering image intrinsics
- nvk: Unify descriptor loading in lower_descriptors
- nvk: Re-format nvk_image_view.h a bit
- nvk: Re-format nvk_buffer.c a bit
- nvk: Add a stub implementation of buffer views
- nvk: Make texture descriptors a bit more acceptable to codegen
- nvk: GART os host-cache-coherent
- nvk: Reserve a null image descriptor
- nvk: Rework descriptor writes
- nouveau: Add stubs for an image layout library called NIL
- nil: Create images
- nil: Add the TIC format table from nouveau
- nil: Add a nil_view and code to fill out TIC entries
- nvk: Add an nvk_get_format helper
- nvk: Use helpers for push_ref
- nvk: Align arguments consistently in copy/blit code
- nvk: Move Fill/UpdateBuffer to nvk_cmd_copy
- Revert "nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable"
- nvk: Manually offset for array layers in copy/blit
- nvk: Convert to using NIL for image layout
- nvk: Re-indent image entrypoints
- nvk: Implement VkGetImageSubresourceLyout
- nvk: Reset and properly clean up command buffer upload areas
- nvk: Rework format features queries
- nvk: Add a more competent GetPhysicalDeviceImageFormatProperties
- nvk: Support compressed images in copy commands
- nvk: Drop vk_sync BO refs after push_submit
- nil: Drop miptail support for now
- nil: Don't minify image dimensions when setting up TIC
- nil: Refactor TIC image extent setup
- nil: Fix image array layer alignments
- nvk: Teture pool sizes are maximums not sizes
- nvk: Re-format nvk_sampler.c
- nvk: Implement samplers
- nil: Add a helper for filling out buffer TIC entries
- nvk: Move is_storage_image_format to nvk_format.c
- nvk: Implement buffer views
- nvk: Advertise KHR_dedicated_allocation
- nvk: Use the correct root descriptor table size for CmdDispatch
- nvk: Add support for dynamic buffers
- nvk: Better advertise image format features
- nvk: Advertise descriptor array indexing
- nvk: Advertise non-zero descriptor set limits
- nvk: Use a descriptor type instead of a hand-rolled thing
- nvk: Handle cube storage images properly
- nvk: Load the requested descriptor size
- nvk: Implement push constants
- nvk: Properly indent a comment
- nvk: Fix descriptor offset alignment
- nvk: Use a switch for descriptor types in load_descriptor
- nvk: Support inline uniform blocks
- nvk: Delete the storage TIC in nvk_image_view_destroy
- nvk: Assert that we don't double-free descriptors
- nvk: Initial vkCmdClearImage support
- nvk: Unconditionally zero image format properties
- nvk: No-op sparse image format properties
- nvk: Advertise minUniformBufferOffsetAlignment
- nvk: Rework OOM handling for descriptor pools
- nvk: Bind immutable samplers on descriptor set creation
- nvk: Padd shader BOs by 4K to avoid I-cache overflow
- nvk: Include nvk_private.h in everything
- nvk: Make image/buffer address helpers const
- nouveau/push: Add a P_INLINE_FLOAT helper
- nvk: Init WSI after setting up supported_sync_types
- nouveau/parser: Fix an integer overflow and a typo
- nouveau/parser: Properly dump most arrays used by 3D
- nouveau/parser: Better dump float data
- nouveau/parser: Handle arrays properly in P_IMMD()
- nouveau/push: Make P_IMMD more versatile
- nouveau: Null terminate the debug flag list
- nouveau: Generate 3D headers
- nvk: Add graphics state to command buffers
- nvk: Split pipeline binding into helpers
- nvk: Switch to vk_pipeline_shader_stage_to_nir
- nvk: Don't free the NIR in nvk_compile_nir
- nvk: Add an nvk_shader_address helper
- nvk: Free pipeline shader BOs
- nvk: Expose pipeline alloc/free functions
- nvk: Make shader_upload take an nvk_device
- nvk/shader: Assign I/O locations and gather info
- nvk/shader: Populate headers for vertex and fragment shaders
- nvk: Add a nvk_cmd_buffer_device() helper
- nvk: Import 3D context init code from nouveau
- nil/format: Add helpers for render formats
- nvk: Add boilerplate for Begin/EndRendering
- nvk: Misc. additional state setup
- nvk: Emit dynamic graphics state
- nvk: Implement push constants and descriptors for graphics
- nouveau: Add CPU push buffers
- nvk: Graphics pipelines
- nvk: Implement vkCmdDraw()
- nvk: Color attachments clears via image clears
- vulkan/meta: Add the start of a meta framework
- vulkan/meta: Add an object tracking list
- vulkan/meta: Add a concept of rect pipelines
- vulkan/meta: Implement attachment clears
- vulkan/meta: Implement start-of-rendering clears
- vulkan/meta: Add implementations of Clear*Image
- nvk: Add an attachment format even for secondaries
- nvk: Add an addr field to nvk_buffer
- nvk: Expose a bind_vertex_buffer helper
- nvk: Use vk_meta for CmdClearAttachments
- nvk: Stop using vk_cmd_set_dynamic_graphics_state in meta_end()
- nvk: Enable all the dynamic state features
- nouveau: Fix pushbuf ref reset for user command buffers
- nvk: add linear image creation support.
- nvk: Use max alignment for descriptor pool sizes
- nil: Switch to using the new headers for TIC entries
- nvk: Use meta for CmdClear*Image
- nvk: Zero client memory objects
- nvk: Bind texture and sampler header pools for 3D
- nvk: Use the new headers for samplers
- nvk: Implement nir_intrinsic_load_frag_coord
- vulkan/meta_clear: Populate VkRenderingInfo::renderArea
- nvk: Don't assert when there are no attachments
- nvk: Track and reference all device memory objects
- vulkan: Allow scissors or viewports to be set without counts
- nvk/copy: Mape bpp part of nouveau_copy_buffer
- nvk: Implement copies for D24_UNORM_S8_UINT images
- nvk: Drop sample locations structs
- nvk/meta: Save and restore VI state
- nvk: Re-initialize dynamic_graphics_state.vi when recycling
- nvk: Move the vertex format table into nvk_format.h
- nvk: Advertise vertex buffer format featues
- nvk: Clean up try_create_physical_device error handling
- nouveau/parser: Dump more fields as float
- nvk: Depth bounds need fui()
- nouveau: Add class information to nouveau_ws_device
- nil: Properly depend on nouveau winsys and nvidia-headers
- nil: Use nvidia headers for texture format enums
- nil: Use the nvidia headers for render target format enums
- nil: Use nvidia headers for ZS format enums
- nil: Rename rt to czt in the format info struct
- nil: Rename rendering to color_target
- nil: Re-introduce the format capabilities
- nil: Add more format support helpers
- nvk: Advertise more format features
- nvk: Clear dynamic state dirty after flushing it all
- vulkan/meta: Make stencil reference dynamic for clears
- nvk: Depth buffers don't allow Z-tiling
- nvk: Disable sparse Z on Maxwell+
- nil: Compute PTE kinds and tile modes for images
- nouveau: Add a function to allocate a tiled buffer
- nvk: Add internal helpers for device memory allocation
- nvk: Do internal dedicated allocations for ZS images
- nvk: Fix depth/stencil render pass clears
- nvk: Fix viewport Z scale
- nvk: Enable two-sided stencil
- nvk: Flip the front-face setting
- nvk: Advertise depth/stencil support
- nvk: Don't destroy NULL descriptor pool BOs
- nvk: Call nir_lower_input_attachments
- nvk: Set GEOMETRY_SHADER_SELECTS_LAYER properly
- nvk: Return OUT_OF_DEVICE_MEMORY if bo_new fails
- nil: Add a PTE kind for Z32_FLOAT
- nvk: Add nvk_queue_init/finish() helpers
- nvk: Align descriptor buffers to NVK_MIN_UBO_ALIGNMENT
- nvk: Re-flow a couple function prototypes
- nvk: Assert samples == 1
- nvk: Allocate descriptors for input attachments
- nvk: Wire up early z and post depth coverage
- nvk: Save/restore push constants around meta ops
- nouveau/parser: Add array and float tags for clear values
- nvk: Use hardware clears for attachment clears
- nvk: Add image_view_init/finish functions
- nvk: Implement vkCmdClear*Image directly
- nvk: Use a UINT format to clear non-renderable images
- nvk: Don't advertise tiling on non-power-of-two formats
- nvk: Fix max anisotropy
- nvk: Assert on CmdExecuteCommands
- nvk: VkSamplerCreateInfo::mipLodBias is signed
- nvk: Fix border color alpha
- nil/format: Depth/stencil formats appear as red
- nil: Fix max mip level
- nil: Fix nonnormalized coordinates
- nvk: Set up clip and cull distances
- nvk: Fix dynamic buffer descriptor copies
- nvk: Inline nouveau_copy_linear
- nvk/copy: Rename push to p
- nvk/blit: Rename push to p
- nvk/dispatch: Rename push to p
- nvk: Drop most buffer tracking
- nvk: Rework TLS/SLM and image/sampler table handling
- nvk: Invalidate texture header and sampler caches each submit
- nvk/sampler: Free descriptor table entries
- nvk: Rework nvk_descriptor_table_add/remove
- nvk: Implement descriptor table growing
- nvk: Zero unused descriptors
- nvk: Add some asserts for nv50 compiler image restrictions
- nvk: Update to the new command buffer infrastructure
- nvk: Split nvk_queue into its own file
- nvk: Start every command buffer with a nop
- nvk: Initialize fixed draw/default state once
- nouveau/parser: Convert to mako
- nouveau/parser: Use more idiomatic python
- nouveau/parser: Put the dump helpers in C files
- nvk: Use f for extension features
- nvk: Drop a TODO
- nvk: Use VK_IMAGE_USAGE_*_ATTACHMENT_BIT for image clears
- nvk: Increase the graphics pipeline push space
- nil: Don't claim texture support for 2-bit SNORM
- nouveau/push: Fix a void pointer arithmetic bug
- nouveau/parser: Add more arrays
- nouveau/mme: Add basic structures for the Turing+ MME
- nouveau/mme: Add isaspec XML for the Turing+ MME
- nouveau/mme: Add an assembler and disassembler for the Turring+ MME
- nouveau/mme: Add a builder for the Turing+ MME
- nouveau/mme: Add a tiny simulator for the Turing+ MME
- nouveau/mme: Add an isaspec-based dumper
- nouveau/mme: Make the winsys headers C++ safe
- nouveau/mme: Add unit tests for the Turing+ MME simulator
- nvk: Add MME infrastructure
- nvk: Use MME for clears
- nouveau/mme: Add helper macros for setting fields
- nvk: Use MME for vkCmdDraw[Indexed]()
- nvk: Implement vkCmdDraw[Indexed]Indirect()
- nvk: Use p for the nouveau_ws_push_buffer in zero_vram
- nouveau: Add an nv_push struct
- nouveau: Rename the fields of vk_push
- nouveau: Move nv_push and helpers to their own header
- nouveau/parser: Take a FILE* in DUMP_*_MTHD_DATA
- nouveau: Move push validate to nv_push.c
- nouveau: Move push dumping to nv_push.c
- nvk: Use nv_push directly for graphics pipelines
- nouveau: Add a nouveau_ws_bo_new_mapped helper
- nvk: Use bo_new_mapped for the zero page
- nvk: Always allocate empty_push
- nvk: Move queue_sumbit to nvk_queue_drm_nouveau.c
- nvk: Submit pushbufs directly
- nvk: Use a regular BO for the empty push
- nvk: Use a regular BO for the queue state push
- nvk: Add an nvk_queue_submit_simple helper
- nvk: Initialize the queue later in device setup
- nvk: Use submit_simple for draw state init
- nvk: Use queue_submit_simple for zero_vram
- nvk: Break nvk_cmd_pool into its own file
- nvk: Use cmd instead of cmd_buffer
- nvk: Add BO recycling to the command pool
- nvk: Return VkResult from nvk_cmd_buffer_upload_alloc
- nvk: memcpy root descriptors for compute instead of doing a DMA
- nvk: Fully populate QMDs before uploading
- nvk: Constant buffer alignment is actually 64B
- nvk: Rework side-band data upload
- nvk: Add an nvk_cmd_buffer_push helper
- nvk: Add an nvk_cmd_buffer_ref_bo helper
- nvk: Allocate upload buffers from the command pool
- nvk: Use nvk_cmd_bo for push bufs
- nvk: Implement vkCmdExecuteCommands()
- nvk: Remove remaining references to nouveau_push.h
- nouveau: Use DRM interfaces directly in MME tests
- nouveau: Drop nouveau_ws_push
- nvk: Re-indent vk_instance.c
- nvk: Use vk_object_zalloc/free for descriptor pools/sets
- nvk: Fix up whitespace in nvk_descriptor_set.c
- nvk: Implement VK_KHR_push_descriptor
- nvk: Reference descriptor set layouts in the sets themselves
- nvk: Embed a nv_device_info in nvk_physical_device
- nvk: Add an nvk_queue_submit wrapper
- nvk: Also store the push BO map in nvk_queue_state
- nvk: Bring back push sync and dumping
- nvk: drop nvk_nir.h
- nvk: Add lowering for load_global_constant_bounded
- nvk: Properly implement robustBufferAccess
- vulkan/meta: Add key types
- vulkan/meta: Add a helper for image view types
- vulkan/meta: Add a create_sampler helper
- vulkan/meta: Fixes for clear
- vulkan/meta: Implement vkCmdBlitImage()
- nvk: Support load_layer_id
- nvk/meta: Save/restore descriptor set 0
- nvk: Use meta for doing blits with the 3D hardware
- nvk: WFI in pipeline barriers
- util/vma: Allow initializing zero-size heaps
- nvk: Rework nvk_queue_submit_simple()
- nvk: Add a heap data structure
- nvk: Return a VkResult from nvk_shader_upload()
- nvk: Add a shader heap to nvk_device
- nvk: Allocate shaders from a heap
- nvk: Rework whitespace in nvk_device_memory.c
- nvk: Style fixes in nvk_physical_device.c
- nvk: Reset semaphore syncs on wait
- nvk/wsi: Style fixes
- nvk/wsi: Use the common present implementation
- nouveau/parser: Parse all fields in each method
- nvk: Add a query pool object
- nvk: Implement timestamp queries
- nvk: Implement pipeline statistics and occlusion queries
- nouveau/mme: Allow ZERO as the destinatio nof mme_load_to
- nouveau/mme: Assert on OOB registers
- nouveau/mme: Add support for freeing registers
- nouveau/mme: Add a couple helpers for working 64-bit addresses
- nouveau/mme: Add a helper for MME_DMA_READ_FIFOED
- nvk: Use mme_tu104_read_fifoed()
- nvk: Implement vkCmdCopyQueryPoolResults()
- nvk: Handle large command buffer uploads better
- nvk: Use a normal DMA for CmdUpdateBuffer
- nouveau/parser: Handle 6F methods
- nvk: Use mme_load_addr64()
- nvk: Use poll for BO waits
- nvk: Events
- nvk: Don't crash if we fail to allocate a push BO
- nvk: Stop leaking command pool BOs
- nvk: Enable VK_KHR_create_renderpass2
- nvk: Advertise VK_KHR_imageless_framebuffer
- nvk: Flush the current pushbuf before allocating a new one
- nvk: Advertise VK_KHR_separate_depth_stencil_layout
- nvk: Tell WSI we don't support legacy scanout
- nouveau: Add PCI information to nv_device_info
- nvk: Implement VK_EXT_pci_bus_info
- nvk: Bind 3D images as 3D for clears
- nvk: Support copies between 3D and 2D images
- nil: Add a helper for getting 2D views of 3D images
- nvk: Support 2D views of 3D images
- nvk: Advertise VK_KHR_maintenance1
- nvk: Use 2D array views for 3D storage images
- nil: Fix include guards in nil_image.h
- nvk: Advertise custom border color features
- vulkan: Add a helper for swizzling color values
- nvk: Implement VK_EXT_border_color_swizzle
- nvk: Advertise VK_EXT_extended_dynamic_state3
- nvk: Move more states to dynamic
- nvk: Advertise VK_KHR_storage_buffer_storage_class
- nvk: Add a helper for pushing descriptors
- nouveau/headers: Add generated headers to dependencies
- nvk: Implement VK_EXT/KHR_buffer_device_address
- nvk: Break the guts of CmdDispatch into a helper
- nvk: Implement DispatchIndirect
- nouveau/mme: Add a mul64 helper
- nvk: Implement CS invocations statistics queries
- nil: Use ONE for the anixotropic coarse spread function
- nil: Properly support MSAA
- nil: Add an offset4d struct and some helpers
- nouveau/parser: Sort METHOD_ARRAY_SIZES
- nouveau/parser: Handle SET_ANTI_ALIAS_SAMPLE_POSITIONS
- nvk: Stop asserting on MSAA
- nvk: Handle zero color attachments better
- nvk: Handle multisampled render targets properly
- nvk: Support copies of MSAA images
- nvk: Use the right view format for stencil texturing
- nvk: Pass through a shader key for fragment shaders and MSAA
- nvk: Set correct multisample regs for graphics pipelines
- nvk: Stop creating a new upload BO every time
- nvk: Fill out sample locations on Maxwell B+
- vulkan/meta: Bind whole LODs of 3D blit destinations
- vulkan/meta: Add a helper for building texture ops
- vulkan/meta: Break the guts of blit into a helper
- vulkan/meta: Support writing stencil as iterative discard
- vulkan/meta: Rename vk_meta_blit.c to vk_meta_blit_resolve.c
- vulkan/meta: Add support for MSAA resolves
- nvk/meta: Fix restore for descriptor set 0
- nvk: Use meta for MSAA resolves
- nvk: Replace gl_SamplePosition with fract(gl_FragCoord.xy)
- nvk: Stop advertising higher framebufferNoAttachmentsSampleCounts
- nvk: Advertise MSAA via image format properties
- nvk: Advertise VK_KHR_depth_stencil_resolve
- nvk: Assert that descriptor buffer access stays in-bounds
- nvk: Add a bo size to nvk_descriptor_set
- nvk/format: Style fix for VkFormatProperties3KHR
- nvk: Support VK_FORMAT_B10G11R11_UFLOAT_PACK32 for vertex buffers
- nvk: Add a devenv ICD json file
- nvk: Advertise EXT_vertex_attribute_divisor
- nvk: Lower image_size to txs
- nvk: Fix a comment
- nvk: Add an nvk_buffer_addr_range helper
- nvk: Use nvk_buffer_addr_range for buffer descriptors
- nvk: Re-order Vulkan 1.0 feature bits
- nvk: Enable inheritedQueries
- nvk: Enable VK_EXT_provoking_vertex
- nvk: Advertise samplerMirrorClampToEdge via 1.2 features
- nvk: Advertise VK_KHR_bind_memory2
- nvk: Enable KHR_dynamic_rendering
- nvk: Advertise KHR_uniform_buffer_standard_layout
- nvk: Advertise EXT_index_type_uint8
- nvk: Advertise VK_EXT_separate_stencil_usage
- nvk: Capitalize NVK in user exposed strings
- nvk: Rename grid_size to group_count
- nvk: Lower load_num_workgroups ourselves
- nvk: Drop block_size from the root descriptor table
- nvk: Add a helper for loading resource_index-based descriptors
- nvk: Set maxMemoryAllocationCount
- nouveau/winsys: Take a drmDevicePtr in nouveau_ws_device_new()
- nouveau/winsys: Add an info to nouveau_ws_device
- nouveau/winsys: Move device type into nv_device_info
- nouveau/nil: Take an nv_device_info for image functions
- nouveau/nil: Use nv_device_info for format queries
- nouveau/mme: Invoke SET_OBJECT in the tests
- nouveau/mme: Make alu_op_to_str static
- nouveau/mme: Move mme_value into its own header
- nouveau/mme: Add a mme_reg_alloc struct
- nouveau/mme: Add an intermediate MME_ALU_OP enum
- nouveau/mme: Add an intermediate MME_CMP_OP enum
- nouveau/mme: Use mme_mov() for temp copies of register IMM32 sources
- nouveau/mme: Make helpers less Turing specific
- nouveau/mme: Break the Turing builder guts into a separate header
- nouveau/mme: Move the guts of mme_merge_to() into mme_tu104_builder.c
- nouveau/mme: Move the guts of mme_state_arr_to() into mme_tu104_builder.c
- nouveau/mme: Drop the implicit_imm parameter from mme_alu_to()
- nouveau/mme: Move the cf_stack struct to mme_builder.h
- nouveau/mme: Prepare the builder for multiple GPU generations
- nouveau/mme: Take an nv_device_info in mme_builder_init
- Support immediates in MERGE
- Add add immediate optimizations
- nvk: Add support for contiguous heaps to nvk_heap
- nvk: Use a contiguous shader heap pre-Volta
- nvk: Disable indirect draw/dispatch and query copy MMEs for now
- nvk: Free a couple regs in nvk_mme_build_draw_*()
- nvk: Properly align root descriptor tables for pre-Pascal
- nvk: Compile all NIR before running codegen
- vulkan/meta: Insert a geometry shader when needed
- nvk: Use a GS for layerered rendering pre-MaxwellB
- nvk: Handle zero-size index and vertex buffers pre-Turing
- nvk: Cosmetic clean-ups to Create/DestroyDevice
- nil: Only choose a PTE kind for tiled images
- nouveau/mme: Fix is_int18 for negative numbers
- nouveau/mme: Don't swap x and y in mme_fermi_merge_to()
- nouveau/mme: Take a const nv_device_info in mme_builder_init
- nouveau/mme: Unify some of the test framework
- nouveau/mme: Add some generic builder tests
- nouveau/mme: Add builder tests for SUB
- nouveau/mme: Use a uint32_t for size in mme_fermi_bfe()
- nouveau/mme: nouveau/mme: Add builder tests for SLL and SRL
- nvk/drm: Take a byte offset/range in push_add_push
- nvk: Rework nvk_cmd_push a bit
- nvk: Add a helper for pushing indirect data
- nvk: Make some MME builder names more consistent
- nouveau/mme: Don't allow WaW dependencies in the same Turing instruction
- nvk: Reduce register pressure in nvk_mme_build_draw*()
- nouveau/push: Add an NV_PUSH_MAX_COUNT #define
- nvk: Implement Draw*Indirect on pre-Turing
- vulkan/meta: Use the new NIR texture helpers
- nvk: Add a build test for MMEs
- nvk: Don't over-size push descriptor sets
- nvk: Return VK_ERROR_INCOMPATIBLE_DRIVER if the PCI vendor isn't NVIDIA
- nvk: Bump init context batch size
- nouveau/mme: Fix nested while instructions on Turing+
- nouveau/mme: Add a helper to dump instructions
- nvk: Rework extension enables
- nvk: Rework features enables
- nvk: Advertise shaderImageGatherExtended
- nouveau/mme: Add a bfe helper
- nouveau/mme: Ensure that zero-initizlied mme_value is ZERO
- nvk: De-duplicate MME code for setting draw params
- nvk: Clamp viewport clip to max range
- nvk: Use the same lock for the submit and the memory objects list
- nvk: Advertise ICD/loader interface version 4
- nvk: Add instace WSI entrypoints
- nouveau/mme: Use ADD for ine with an immediate
- nouveau/mme: Fix while loops pre-Turing
- nvk: Add begin to mme_scratch
- nvk: Use the new load/store_scratch helpers for DRAW_PAD_DW
- nouveau/mme: Add a helper for re-allocating registers
- nvk: Rework spill helpers and DRAW_COUNT spilling
- nvk: Spill DRAW_IDX pre-Turing
- nvk: Break the inner MME draw loop into a helper
- nvk: Increase the push runout to 512 dwords
- nil: Add a nil_image_for_level helper
- nil: Add an image_level_as_uncompressed helper
- nvk: Implement uncompressed views of compressed images
- nvk: Set pointClippingBehavior
- nvk: Expose VK_KHR_maintenance2
- nvk: Add a separate #define for SSBO alignment
- nvk: Set spirv_to_nir_options::min_*_alignment
- nvk: Use vk_device_memory
- nvk: Implement VK_KHR_map_memory2
- nvk: Sort SPIR-V caps
- nvk: Advertise EXT_shader_viewport_index_layer on MaxwellB+
- nvk: Only use view_id for layer in multiview
- nvk/heap: Set the right pitch for heap resize copies
- nvk: Advertise shaderStorageImageReadWithoutFormat
- nvk: Fix the NO_PREFETCH assert for CmdDrawIndirect
- nvk: Advertise KHR_spirv_1_4
- nvk: s/device/dev in nvk_image.c
- nvk: Add helpers for binding image planes
- nvk: Take an nvk_image_plane in nouveau_copy_rect_image
- nvk: Use the max descriptor alignemtn in GetDescriptorSetLayoutSupport
- nvk: Use NVIDIA_VENDOR_ID in pdev try_create()
- nvk: Use abbreviated names in nvk_device_memory.c
- nvk: Add device and driver UUIDs
- nvk: Add external memory queries
- nvk: Dedicated allocations override internal
- nvk: Require dedicated allocations for external images
- nouveau/winsys: Add dma-buf import support
- nvk: Support dma-buf import
- nvk: Support dma-buf export
- nvk: Enable external memory extensions
- nvk: Reformat nvk_buffer.c
- nvk: Add a buffer alignment helper
- nvk: Add an addr field to nvk_image_plane
- nvk: Use canonical variable names in nvk_physical_device.c
- nvk: Use canonical variable names in nvk_shader.c
- nvk: Use canonical variable names in nvk_bo_sync.c
- nvk: Use canonical variable names in nvk_sampler.c
- nvk: Drop nvk_physical_device::instance
- nvk: Only advertise EXT_pci_bus_info on discrete GPUs
- nouveau: Put PCI info in a pci substruct in nv_device_info
- nouveau: Stop using hex for SM numbers
- nvk: Set deviceType based on nv_device_info::type
- nouveau: Move more stuff into nv_device_info
- nouveau: Move gart_size to nv_device_info
- nvk: Use nv_device_info for class checks
- nvk: Rename nvk_device::ctx to ws_ctx
- nvk: Add a ws_dev to nvk_device and use it
- nvk: Move the winsys device to nvk_device
- nvk: Don't enumerate pre-Kepler GPUs
- nvk: Implement VK_EXT_physical_device_drm
- nvk: Require an environment variable for poorly tested hardware
- nvk: Use the new core vk_sampler struct
- Revert "vulkan: Allow scissors or viewports to be set without counts"
- vulkan/meta: Add a get_pipeline_layout helper
- vulkan/meta: Use vk_meta_get_pipeline_layout in blit/resolve
- nvk: Bind 3D depth/stencil images as 2D arrays
- nvk: Flush more state on VI_BINDINGS_VALID dirty
- nvk: Don't skip zero-size bindings in GetDescriptorSetLayoutSupport
- docs: Add a docs page for NVK
- docs: Add NVK to features.txt
- docs/relnotes: Stick something about NVK in new_features.txt
- nouveau: Drop GART size from nv_device_info
- nil: Add a nil_image_level_extent_px() helper
- nvk: Use the new NIL helper for image level extents for copies
- nvk: Improve image format properties and limits
- nvk: Rework multi-plane format features a bit
- nvk: Use nvk_root_descriptor_offset for drawInfoBase
- nvk: Add a root_desc_addr to the root descriptor table
- nvk: Add support for variable pointers
- nvk: Enable the SPIR-V DeviceGroup capability
- nvk: Separate the MME query copy code out a bit
- nvk: Implement CopyQueryPoolResults with a compute shader
- nvk: Misc. style nits
- nvk: Rework memory requirements to handle aspects correctly
- nvk: Implement the maintenance5 image layout queries
- nvk: Use VkBufferUsageFlags2
- nvk: Implement CmdBindIndexBuffer2KHR
- nvk: Implement GetRenderingAreaGranularityKHR
- nvk: Decorate CmdBegin/EndRendering entrypoints
- nouveau: Move shader topology info to nv_device_info
- drm-uapi: Import nouveau_drm.h
- nouveau/winsys: Use the imported nouveau_drm.h headers
- nvk: Use the imported nouveau_drm.h headers
- nouveau/shim: Use the imported nouveau_drm.h headers
- nouveau/mme: Support the new UAPI
- nvk: Use an empty EXEC for the empty submit case
- nouveau/winsys: Allow nouveau_ws_device_new() without VM_BIND
- nvk: Print an error message if VM_BIND support is missing
- nvk: Enable the new UAPI
- nvk: Use more consistent device variable names
- nvk: Call nir_lower_int64
- nir/gl: Move glsl_type::sampler_target() into a helper in its one caller
- nvk: Remove plane sources from tex instructions
- nvk: Use common physical device properties
- nv50/ir: Rework conversions for texture array indices
- clang-format: Add nir_foreach_reg_*
- clang-format: nir_foreach_src is not a foreach macro
- clang-format: Set the default ColumnLimit to 0
- nir: Re-align a couple enums and add clang-format comments
- nir: Don't clang-format const_value helpers
- nir: Don't clang-format a couple typedefs
- nir: Don't clang-format debug print setup
- nir: More manual formatting
- nir: Pretty format type mapping helpers
- nir: Wrap pass macros in braces
- nir: Add a do to the do/while in nir_const_value_t_array()
- nir: Add a .clang-format file
- nir: clang-format src/compiler/nir/\*.[ch]
- nvk: Don't use nir_ssa_for_src()
- nir: Drop most instances of nir_ssa_dest_init()
- nir: Drop more instances of nir_ssa_dest_init()
- nir/clone: Clone nir_def nor nir_dest
- nir/serialize: [De]serialize nir_def nor nir_dest
- nir: Drop nir_ssa_dest_init()
- nir: Drop nir_ssa_dest_init_for_type()
- nir: nir_foreach_ssa_def() -> nir_foreach_def()
- st,zink,sfn: Use nir_foreach_def instead of nir_foreach_dest
- dxil: Use nir_foreach_def() instead of nir_foreach_dest()
- nir/from_ssa: Use nir_foreach_def() instead of nir_foreach_dest()
- nir: Drop nir_foreach_dest()
- intel/vec4: Stop passing around nir_dest
- intel/fs: Stop passing around nir_dest and nir_alu_dest
- broadcom: Stop using nir_dest directly
- vc4: Stop passing around nir_dest
- nir,ntt,a2xx,lima: Stop using nir_dest directly
- lima: Stop using nir_dest directly
- etnaviv: Stop passing around nir_dest
- r600/sfn: Stop passing around nir_dest and nir_alu_dest
- nv50/ir: Stop passing around nir_dest and nir_alu_dest
- nir/gather_types: Stop passing around nir_dest
- nir/dce: Stop passing around nir_dest
- nir/propagate_invariant: Stop passing around nir_dest
- nir/validate: Replace all dest validation with validate_def
- nir/print: Replace all dest printing with print_def
- nir: Get rid of nir_dest_bit_size()
- nir: Get rid of nir_dest_num_components()
- nir: Get rid of nir_dest_is_divergent()
- nir: Drop nir_alu_dest
- nir: Drop nir_dest
- util/format: 8-bit interleaved YUV formats are UNORM
- gallivm: Support G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM
- blorp: Use R8G8_UINT for YCRCB_* formats with CCS
- anv: Disable CCS_E for ISL_FORMAT_YCRCB_*
- vulkan/format: Use correct swizzle for 1-plane YCbCr formats
- gallivm: Drop the Vulkan YUV format hacks
- nir: Rename nir_instr_type_ssa_undef to nir_instr_type_undef
- nir s/nir_get_ssa_scalar/nir_get_scalar/
- nir: s/live_ssa_def/live_def/
- nir: s/nir_instr_ssa_def/nir_instr_def/
- nir: Rework nir_scalar_chase_movs a bit
- nir: Fix nir_op_mov handling in nir_collect_src_uniforms
- nir: Handle nir_op_mov properly in opt_shrink_vectors
- nir: Don't handle nir_op_mov in get_undef_mask in opt_undef
- nir: Clean up nir_op_is_vec() and its callers
- nir/large_constants: Use nir_component_mask_t
- nir/large_constants: Add read/write_const_values helpers
- nir/opt_large_constants: Add Small constant handling
- spirv: Re-emit constants at their uses
- nir: Take a nir_def * in nir_tex_instr_add_src()
- nir: Take a nir_def * in nir_phi_instr_add_src()
- nir/opt_undef: Don't rewrite a bcsel to mov
- nir: Add a nir_instr_clear_src() helper and use it
- nir: Add and use a nir_instr_init_src() helper
- nir: Drop nir_if_rewrite_condition()
- nir: Drop most uses of nir_instr_rewrite_src_ssa()
- nir: Drop nir_instr_rewrite_src_ssa()
- nir: Drop most uses if nir_instr_rewrite_src()
- nir: Drop nir_instr_rewrite_src()
- nir: Drop nir_push_if_src()
- nir: Fix metadata in nir_lower_is_helper_invocation
- nir: Use nir_shader_intrinsic_pass() a few places
- drm-uapi: Sync nouveau_drm.h
- nvk: Plumb no_prefetch through to the DRM back-end
- nouveau/mme: Fix a compile warning
- intel/isl: Rename ISL_TILING_Yf/s to ISL_TILING_SKL_Yf/s
- intel/isl: Add ICL variants of Yf and Ys tiling
- intel/isl: Implement correct tile size calculations for Ys/Yf
- intel/isl: Use the depth field of phys_level0_sa for GFX4_2D 3D surfaces
- intel/isl: Fill out the correct phys_total_extent for Ys/Yf/Tile64
- intel/isl: Indent uncompressed surface code
- intel/isl: Support Ys, Yf & Tile64 in isl_surf_get_uncompressed_surf
- intel/isl: Support Yf/Ys tiling in surf_fill_state
- intel/isl: Support Yf/Ys tiling in emit_depth_stencil_hiz
- intel/isl: Add initial data-structure support for miptails
- intel/isl: Add support for computing offsets with miptails
- intel/isl: Support miptails in isl_surf_get_uncompressed_surf
- intel/isl: Start using miptails
- intel/isl: Disallow CCS on 3D surfaces with miptails
- intel/isl: Allow Ys tiling
- anv: Align memory VA to support for Ys, Tile64 tiled images
- nvk: Clean up includes
- nvk: Add include guards to nvk_bo_sync.h
- nvk: SPDX everything
- nouveau/nil: SPDX everything
- nouveau/mme: SPDX everything
- nvk: Don't add a dummy attachment when gl_SampleMask is written
- nvk: Set the discard bit for Z/S self-deps
- nvk: Invalidate the texture cache in PipelineBarrier
- nvk: Lower interp_at_sample to interp_at_offset
- nvk: Disable statistics around meta ops
- nvk: Clean up viewport math
- nvk: Fix depth clipping parameters
- nvk: Enable dynamic clip/clamp enable
- nvk: Set GUARDBAND_Z_SCALE_1 when Z-clipping
- r600: Use more auto-generated nir_builder helpers
- r600: Use nir_builder helpers for load/store_shared_r600
- nvk: Re-order physical device limits
- nvk: Advertise maxMemoryAllocationCount = 4096
- nvk: Advertise discreteQueuePriorities = 2
- nvk: Rip out old UAPI support
- nvk/drm: Drop the push_add_push_bo() helper
- nvk/drm: Drop the push_add_bo() helper
- nvk: Drop command buffer BO tracking
- nvk: Drop memory object tracking
- nvk: Drop the device-level mutex
- nvk: Get rid of the tiled memory allocation helpers
- nvk/drm: Restructure nvk_queue_submit_drm_nouveau()
- nvk/drm: Split exec as needed for large command buffers
- nvk: Don't store the descriptor pool BO in the set
- nvk: Store a 20-bit driver_build_sha in nvk_instance
- nvk: Hook up the disk cache
- nvk: Re-structure early shader compilation a bit
- nvk: Add a default pipeline cache
- nvk: Cache NIR shaders
- nvk: Init pipelineCacheUUID
- drm-uapi: Sync nouveau_drm.h
- nvk: Take GETPARAM_EXEC_PUSH_MAX into account
- nvk: Handle zero-sized sparse buffers
- nvk: Use align() and align64() instead of ALIGN_POT
- nouveau: Generate headers for Maxwell B compute
- nvk: Add a nvk_cmd_buffer_compute_cls() helper
- nvk: Invalidate sampler/texture header caches in BeginCommandBuffer()
- nvk: Invalidate SKED caches at the top of command buffers
- nvk: Advertise more inline uniform block limits
- nvk: Emit MME_DMA_SYSMEMBAR before indirect draw/dispatch
- nvk: Set max descriptors to 2^20 for most descriptor types
- nvk: Reset descriptor pool allocator when all sets are destroyed
- nil/format: Use A for alpha blend
- nil/format: Advertise R10G10B10A2_UINT texture buffer support
- nvk: Disable depth or stencil tests when unbound
- nvk: Always emit at least one color attachment
- nvk: Improve address space and buffer size limits
- nvk: Always set pixel_min/max_Z to CLAMP
- nvk: Use nouveau_ws_bo_unmap() instead of munmap()
- nvk: Free the disk cache
- nvk: Add an nvk_shader_finish() helper
- nvk: Handle unbinding images and buffers
- nvk: Clean up the disk cache on physical device create fail path
- vulkan/wsi: Allow for larger linear images
- nvk: Add a nvk_cmd_buffer_dirty_render_pass() helper
- nvk: Re-sort device features
- nvk: Implement VK_EXT_depth_bias_control
- nvk: Advertise VK_KHR_workgroup_memory_explicit_layout
- nvk: Implement VK_EXT_image_sliced_view_of_3d
- nvk: Advertise VK_EXT_primitive_topology_list_restart
- nvk: Advertise VK_EXT_attachment_feedback_loop_layout
- features: Mark VK_EXT_attachment_feedback_loop_layout done for NVK
- nvk: Re-arrange Vulkan 1.2 features to match the header
- nvk: Advertise shaderOutputLayer and shaderOutputViewportIndex
- nvk: Enable descriptorIndexing
- nvk: Implement VK_EXT_dynamic_rendering_unused_attachments
- nir: Add a nir_ssa_def_all_uses_are_fsat() helper
- nir: Add convert_alu_types to divergence analysis
- nir/lower_tex: Add a lower_txd_clamp option
- nir: Add a load_sysval_nv intrinsic
- nir: Add NV-specific texture opcodes
- nir: Add an load_barycentric_at_offset_nv intrinsic
- nir: Add a range to most I/O intrinsics
- nir: Add NVIDIA-specific I/O intrinsics
- nir/lower_bit_size: Fix subgroup lowering for floats
- nir: add deref follower builder for casts.
- nir: Handle wildcards with casts in copy_prop_vars
Felix DeGrood (12):
- anv: save a shader source uint32_t hash in gfx/compute pipelines
- anv: Add Source hash field to VkPipelineExecutableStatisticKHR
- iris: save shader source sha1 in ish
- mesa: propagate shader source sha1 from gl_shader to nir_shader
- intel: use shader source hash in INTEL_MEASURE
- intel/compiler: use shader source hash in shader dump code
- anv: add fake sparse support
- anv: enable fake sparse for Elden Ring
- anv: debug messaging for sparse texture usage
- anv: fix frame count reporting in INTEL_MEASURE
- anv: set ComputeMode.PixelAsyncComputeThreadLimit = 4
- anv: remove CS_FLUSH from query regression
Feng Jiang (9):
- virgl: Only PIPE_BUFFER with VIRGL_BIND_CUSTOM flag is considered busy during creation
- meson: Export winsys function symbols for target va
- frontends/va: Add slice_count to AV1 slice_parameter
- virgl/video: Add definition of virgl_av1_picture_desc
- virgl/video: Add support for AV1 decoding
- virgl/video: Enable AV1 decoding
- meson: Rename dri-vdpau.dyn to dri.dyn
- CODEOWNERS: Add \@flynnjiang for VirGL video
- meson: Move video to separate section in meson configuration summary
Filip Gawin (1):
- crocus: Avoid fast-clear with incompatible view
Flora Cui (1):
- radeonsi: limit CP DMA to skip holes in sparse bo
Francisco Jerez (29):
- intel/fs/ra: Define REG_CLASS_COUNT constant specifying the number of register classes.
- intel/vec4/ra: Define REG_CLASS_COUNT constant specifying the number of register classes.
- intel/compiler: Make MAX_VGRF_SIZE macro depend on devinfo and update it for Xe2.
- intel/fs/ra/xe2: Scale up register allocation granularity by 2x on Xe2+ platforms.
- intel/eu/xe2+: Fix encoding of various message descriptors for change in register size.
- intel/fs: Fix signedness of payload_node_count argument of calculate_payload_ranges().
- intel/fs/xe2+: Fix payload node live range calculations for change in register size.
- intel/fs/xe2+: Fix grf_count in post-RA scheduling for updated register file size.
- intel/fs/xe2+: Fixes for increased accumulator register width.
- intel/fs/xe2+: Scale MAX_SAMPLER_MESSAGE_SIZE by native register size.
- intel/eu/xe2+: Update validation of GRF region size to account for Xe2 reg size
- intel/fs/xe2+: Allow increased SIMD width for various get_fpu_lowered_simd_width() restrictions.
- intel/compiler/xe2+: Represent dispatch_grf_start_reg in native GRF units.
- intel/fs/xe2+: Update encoding of FB write message payload.
- intel/fs/xe2+: Round up fs_builder::vgrf() size calculation to HW register unit.
- intel/fs/xe2+: Scale BRW_MAX_MSG_LENGTH by native register size.
- intel/fs/xe2+: Fix payload layout of sampler messages for Xe2 reg size
- intel/fs/xe2+: Update GS payload setup for Xe2 reg size.
- intel/fs/xe2+: Update TCS payload setup for Xe2 reg size.
- intel/fs/xe2+: Update TES payload setup for Xe2 reg size.
- intel/fs: Lower unsupported regioning with non-trivial 2D regions on FIXED_GRFs.
- intel/fs/xe2+: Update regioning lowering offset alignment checks for Xe2 regs.
- intel/fs/xe2+: Fix execution width of SHADER_OPCODE_GET_BUFFER_SIZE for SIMD16 EU.
- intel/fs/xe2+: Fix calculation of spill message width for Xe2 regs.
- intel/xe2+: Round up size to reg_unit() in fs_reg_alloc::alloc_spill_reg().
- intel/fs/xe2+: Fix URB writes with 0 data components.
- intel/fs: Specify number of data components of logical URB writes via control immediate.
- intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB writes.
- intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB reads.
Frank Binns (10):
- pvr: clang-format fixes
- pvr: skip setting up SPM consts buffer when no const shared regs are used
- pvr: cleanup SPM EOT dynarray after upload
- pvr: treat VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT as not supported
- pvr: remove pvr_pbe_get_src_pos()
- pvr: fix attachments segfault in pvr_is_stencil_store_load_needed()
- pvr: fix allocation size of clear colour consts shared regs buffer
- pvr: change a few places to use PVR_DW_TO_BYTES()
- pvr: fix setup of load op unresolved msaa mask
- pvr: emit PPP state when vis_test dirty bit is set
Friedrich Vock (19):
- radv/ci: Set DRIVER_NAME in LAVA raven vkcts jobs
- radv: Handle VK_SUBOPTIMAL_KHR in trace layers
- ac/msgpack: make fixstrs a const char
- ac/sqtt,radv: Split internal and API hash in PSO correlations
- ac/rgp: Write lds_size metadata
- ac/rgp: Add metadata for separate-compiled RT stages
- radv/sqtt: Move record filling to helper function
- radv/sqtt: Unregister records based on hash
- radv/sqtt: Write LDS size metadata in code objects
- radv/sqtt: Handle separately-compiled RT pipelines
- ac/sqtt,radv/sqtt: Add and use marker for separate RT compilation
- nir/load_store_vectorize: Handle intrinsics with constant base
- radv/rt: Pre-initialize instance address
- radv: Initialize shader freelist on allocation
- radv: Fix check in insert_block
- radv/rt: Reject hits within 10ULP of previous hits in emulated RT
- radv/rra: Recognize LPDDR memory
- radv/rmv: Recognize LPDDR memory
- vulkan: Don't use set_foreach_remove when destroying pipeline caches
Ganesh Belgur Ramachandra (5):
- radeonsi: stores bottom_edge_rule option in the rasterizer state
- radeonsi: sets OPTIMAL_BIN_SELECTION to 0 if using bottom_edge_rule
- radeonsi: "clear_render_target" shader in nir
- radeonsi: "clear_render_target_1d_array" shader in nir
- radeonsi: "clear_12bytes_buffer" shader in nir
Georg Lehmann (39):
- aco/gfx11: fix get_gfx11_true16_mask with v_cmp_class_f16
- aco: improve get_gfx11_true16_mask description
- aco: combine a & ~b to bfi(b, 0, a)
- aco/gfx11: use v_cmp_class_f16 with opsel for bitnz/bitz
- aco: fix non constant 16bit bitnz/bitz
- ac/nir: handle more special cases in ac_nir_unpack_arg
- aco: use s_bitreplicate_b64_b32 to set exec to 0xffff0000ffff0000
- nir/opt_intrinsics: optimize (exclusive_scan(op, a) op a) to inclusive scan
- aco: always use rtne for fquantize2f16
- nir/opt_if: also rewrite uniform uses for read_invocation
- nir: unify lower_bitfield_insert with has_{bfm,bfi,bitfield_select}
- nir: unify lower_bitfield_extract with has_bfe
- nir: unify lower_find_msb with has_{find_msb_rev,uclz}
- aco: fix u2f16 with 32bit input
- aco: combine a | ~b to bfi(b, a, -1)
- aco: use v_cvt_f32_ubyte for signed casts too
- nir: add nir_scalar intrinsic helpers
- nir: add nir_scalar_equal
- aco: implement some exclusive scans with inclusive scans
- aco/gfx11: don't use bfe for local_invocation_id if the others are always 0
- nir/opt_algebraic: remove broken fddx/fddy patterns
- aco: simplify masked swizzle dpp selection by removing or_mask first
- aco: fix p_extract with v1 dst and s1 operand
- aco: implement 64bit div find_lsb
- nir: scalarize masked_swizzle_amd created from shuffle_xor
- aco/optimizer: check if we can use omod before labeling it
- aco/optimizer: copy propagate to output modifier instructions
- aco: remove -0.0 for 32 bit fsign with mul_legacy/omod when denorms are flushed
- nir: make quad intrinsic dst bit size match src0
- nir/lower_subgroups: use intrinsic builder more
- aco: assume new generations are unsupported by clrx
- aco: assume newer generation will use GFX11 wait_imm packing
- aco: print final ir instead if printing asm is unsupported
- aco/gfx11: optimize dual source export
- aco/gfx11: apply clamp/omod to vinterp
- aco: support v_fma_f32_dpp as fma_mix
- aco/gfx11: support vinterp as fma_mix
- aco: add missing scc def for SALU quad broadcast
- aco/sched: treat p_dual_src_export_gfx11 like export
George Ouzounoudis (38):
- nouveau/codegen: Support compact clip distances with arrayed_io
- nouveau/codegen: Handle nir op amul
- nouveau/codegen: Fix compact patch varyings in case of NIR
- nouveau/codegen: Add capability to pre-specify tessellation domain
- nvk: Do not increment instance id across draws
- nvk: Add a macro for root descriptor table byte offsets
- nvk: Set base vertex state in sequential mme draw
- nvk: Support base instance in instanced draw calls
- nvk: Switch point rasterization to point sprites
- nvk: Support large points
- nvk: Compile geometry shaders
- nouveau/mme: Keep device info in mme_builder
- nvk: Simplify mme build function argument
- nvk: Support VK_KHR_shader_draw_parameters
- nvk: Support for vertex shader transform feedback
- nvk: Support transform feedback indirect draws
- nvk: Support transform feedback geometry streams
- nvk: Support transform feedback queries
- nvk: Support vertex shader transform feedback on Fermi
- nvk: Disable PRIMITIVE_RESTART_VERTEX_ARRAY by default
- nvk: Fix geometry shader active stream mask
- nvk: Support geometry shaders
- nvk: Basic tessellation shader support
- nvk: Assign locations correctly for arrayed IO
- nvk: Enable multiview with tessellation shader
- nvk: Fix cases where execution mode is specified in the tesc shader.
- nvk: Respect tessellation domain origin state
- nvk: Lower io to temporaries for tessellation evaluation nir
- nvk: Support VkDescriptorSetVariableDescriptorCountLayoutSupport
- nvk: Handle cases of descriptor bindings with variable counts
- nvk: Add nir non-uniform optimization pass
- nvk: Enable descriptor indexing
- nvk: Do not keep redundant info for tessellation domain
- nouveau/codegen: Do not keep redundant info for tessellation domain
- nvk: Enable dynamic line rasterization mode state
- nvk: Fix support for VK_EXT_sample_locations
- nvk: Support dynamic state for enabling sample locations
- nouveau/codegen: Add a 4th optimization level for MemoryOpts
Gert Wollny (63):
- r600/sfn: Switch to register intrinsics
- r600/sfn/tests: add simple copy-prop test with register source
- r600/sfn: Allow for larger ALU CF's
- r600/sfn: Handle indirect array load/store dependencies better
- r600/sfn: Increase LDS fetch schedule priority
- r600/sfn: Add peephole optimization to move a dest to the previous op
- r600/sfn: reorder the value factory class member declaration a bit
- r600/sfn: Add some tests for proper register access
- r600/sfn: Print more info if scheduling fails
- r600/sfn: remove debug output leftovers
- r600/sfn: Fix use of multiple IDX with kcache
- r600/sfn: Always check arrays writes before allowing copy propagation
- r600/sfn: set block sizes based on chip class
- r600/sfn: Fix typo with block type
- r600/sfn: override slot count for IfInstr
- r600/sfn: Add method to convert to AluGroup directly
- r600/sfn: Add flags to check whether a group starts CF and can do that
- r600/sfn: make remaining slots a signed value
- r600/sfn: on Cayman loading an index register needs only one slot
- r600/sfn: Splizt ALU blocks in scheduler to fit into 128 slots
- r600/sfn: rework checks for ALU CF emission
- r600/sfn: Schedule AR uses befor possible groups
- r600: Explicitly force new CF in gs copy shader
- r600: Assert when backend wants to create a new ALU CF
- r600: don't check possible size of ALU CF
- r600: don't use sb disasm to disassamble copy shader
- r600: Force CF when emitting a NOP on R600 in gs copy shader
- r600/sfn: Don't try to propagate to vec4 with more than one use
- r600/sfn: Only switch to other CF if no AR uses are pending
- r600/sfn: AR loads should depend on all previous non ALU instructions
- r600/sfn: Renumber shader blocks in scheduler
- r600/sfn: Track whether a register is ALU clause local
- r600/sfn: Use clause local registers in RA
- r600/sfn: Take source uses into account when switching channels
- r600/sfn: take number of dest values into account
- r600: retire SB optimizer
- r600/sfn: work around injecting extra CF's to handle hardware bugs
- r600: use correct cso pointer for fetch shader
- r600/sfn: Make use of four clause local registers
- r600/sfn: drop unused ControlFlowInstr type enum
- r600/sfn: factor out resource as extra class
- r600/sfn: Simplify dependency chain for index loads on EG
- r600: print texture resource index mode separately
- r600/sfn: Make address split pass obligatory
- r600/sfn: rename method resource_base to resource_id
- r600/sfn: Add old address to update_indirect_addr
- r600/sfn: Sepeate resource and sampler in texture instructions
- r600/sfn: get rid of the method to get the index mode
- r600/sfn: sort the uniforms of the right shader
- r600/sfn: Fix use of scheduled_shader vs shader
- virgl: report MIRROR_CLAMP features better
- ci: Upref virglrenderer
- copyimage: check requested slice early when cube maps are involved
- mesa: check numlevels and numlayers when creating a texture view
- virgl: Use common clear_texture if host doesn't support the feature
- r600/sfn: don't remove texture sources by using the enum value
- r600: drop egcm_load_index_reg
- r600/sfn: Don't override a chgr pinning during copy propagation
- r600/sfn: When simplifying src vec4 pinnings, also check all uses
- virgl: Fix logic for reporting PIPE_MIRROR_CLAMP
- r600: Add callbacks for get_driver_uuid and get_device_uuid
- r600: Link with libgalliumvl, when enabling rusticl this is needed
- r600/sfn: Fixup component count only if intrinsic has it
Guilherme Gallo (5):
- bin/ci: Ensure that all jobs have nodes in DAG
- ci/radeonsi: Update flake list
- ci/freedreno: Add a new flake
- ci/zink: Found some flakes
- ci/anv: Catch some flakes
Hannes Mann (1):
- vulkan/wsi/wayland: Fix detection of tearing control protocol
Hans-Kristian Arntzen (2):
- wsi/x11: Fix potential deadlock in present ID.
- wsi/x11: Don't allow signal_present_id to rewind.
Helen Koike (21):
- ci: re-add EXTRA_LOCAL_PACKAGES to rootfs
- ci: add EXTRA_LOCAL_PACKAGES to apt-get install
- docs/ci: Add docs for EXTRA_LOCAL_PACKAGES
- ci: disable duplicated pipelines triggered by marge
- ci: add --project option to ci_run_n_monitor.py
- ci/android: remove strace output from cuttlefish-runner.sh
- ci: add locked flag to bindgen-cli on x86_64_build.sh
- ci: separate hiden jobs to -inc.yml files
- ci/ci_run_n_monitor: add docs for multiple targets
- ci/ci_run_n_monitor: print stress test results per job
- ci/ci_run_n_monitor: simplify with defaultdict
- ci/ci_run_n_monitor: merge print_job_status_change with print_job_status
- ci/ci_run_n_monitor: make --target mandatory
- ci/ci_run_n_monitor: merge enable_job with retry_job
- ci/ci_run_n_monitor: simplify enable/cancel logic in monitor_pipeline()
- ci/ci_run_n_monitor: allow <user>/<project> in --project
- ci/ci_run_n_monitor: limit repetitions on --stress
- ci/marge_queue: add missing python-dateutils to requirements.txt
- ci/ci_run_n_monitor: keep monitoring if a job is still running
- ci/marge_queue: add pretty_dutation()
- ci/ci_run_n_monitor: print job duration time
Honglei Huang (7):
- virgl/video: Add support for mpeg12 decoding
- virgl/video: Add support for vc1 decoding
- virgl/video: Add support for jpeg decoding
- virgl/video: Add support for hevc10bit decoding.
- virgl/video: Add more pipe type in virgl formats convert table
- virgl/video: Add jpeg buf start code check
- virgl: Enable vp9 hardware decode
Hyunjun Ko (3):
- anv: use ycbcr_info for P010 format
- anv: don't use cmd_buffer after destroyed.
- anv: don't flush_llc on gen9
Iago Toral Quiroga (100):
- nir/trivialize: Move decl_reg to the start of the block
- v3dv: stop incrementing UBO indices by one
- nir/lower_robustness: drop skip_ubo_0 option
- v3dv: fix incorrect key setup
- broadcom/compiler: stop asserting on Vulkan environment
- broadcom/compiler: use NIR's lowering for dispatch base
- broadcom/compiler: move uniform offset lowering from compiler to GL driver
- broadcom/compiler: move vulkan's point coord lowering to the driver
- v3dv: don't set lower_wpos_pntc for Vulkan
- broadcom/compiler: always clamp results from logic ops
- broadcom/compiler: drop execution environment from the shader key
- v3dv: drop cpu path for buffer to image copies
- v3dv: remove unused code
- nir/lower_tex: copy backend_flags field when copying a tex instruction
- nir/lower_tex: use a callback to check sampler return size packing
- squash! v3dv,broadcom/compiler: don't abuse sampler index
- v3dv: assert that only tex instructions with sampler state have a sampler src
- v3d: fix texture packing lowering
- v3d,v3dv: use fquantize2f16 lowering in NIR
- v3dv: be more precise in vkGetImageSubresourceLayout
- v3dv: handle pPlaneLayouts in VkImageDrmFormatModifierExplicitCreateInfoEXT
- v3dv: bump up MAX_UNIFORM_BUFFERS to 16
- v3dv: add support for sampling simple 2D linear textures
- v3dv: expand sampling from linear image hack to support multi-planar images
- v3dv: don't assume that bound descriptors have been written
- v3dv: only handle Android Hardware Buffer on Android
- v3dv: we can sample from 1D array too
- broadcom/compiler: add a couple of shader key helpers
- v3d: compute nir sha1 for uncompiled shader state
- v3d: use pre-computed shader sha1 for disk cache
- v3d: fix RAM shader cache
- v3d: get rid of shader_state pointer in v3d_key
- broadcom/simulator: reset CFG7 for compute dispatch in v71
- broadcom/common: retrieve V3D revision number
- broadcom/compiler: update node/temp translation for v71
- broadcom/compiler: implement "reads/writes too soon" checks for v71
- broadcom/compiler: implement read stall check for v71
- broadcom/compiler: add a v3d71_qpu_writes_waddr_explicitly helper
- broadcom/compiler: prevent rf2-3 usage in thread end delay slots for v71
- broadcom/qpu: add new ADD opcodes for FMOV/MOV in v71
- broadcom/qpu: fix packing/unpacking of fmov variants for v71
- broadcom/compiler: make vir_write_rX return false on platforms without accums
- broadcom/compiler: rename vir_writes_rX to vir_writes_rX_implicitly
- broadcom/compiler: only handle accumulator classes if present
- broadcom/compiler: don't assign rf0 to temps across implicit rf0 writes
- broadcom/compiler: CS payload registers have changed in v71
- broadcom/compiler: don't schedule rf0 writes right after ldvary
- broadcom/compiler: allow instruction merges in v71
- broadcom/qpu: add MOV integer packing/unpacking variants
- broadcom/qpu: fail packing on unhandled mul pack/unpack
- broadcom/compiler: generalize check for shaders using pixel center W
- broadcom/compiler: v71 isn't affected by double-rounding of viewport X,Y coords
- broadcom/compiler: update peripheral access restrictions for v71
- broadcom/qpu: add packing for fmov on ADD alu
- broadcom/compiler: handle rf0 flops storage restriction in v71
- broadcom/compiler: enable ldvary pipelining on v71
- broadcom/compiler: try to use ldunif(a) instead of ldunif(a)rf in v71
- broadcom/compiler: don't assign rf0 to temps that conflict with ldvary
- broadcom/compiler: convert mul to add when needed to allow merge
- broadcom/compiler: implement small immediates for v71
- broadcom/compiler: update thread end restrictions for v7.x
- broadcom/compiler: update ldvary thread switch delay slot restriction for v7.x
- broadcom/compiler: lift restriction for branch + msfign after setmsf for v7.x
- broadcom/compiler: start allocating from RF 4 in V7.x
- broadcom/compiler: validate restrictions after TLB Z write
- broadcom/compiler: lift restriction on vpmwt in last instruction for V3D 7.x
- broadcom/compiler: fix up copy propagation for v71
- broadcom/compiler: don't allocate spill base to rf0 in V3D 7.x
- broadcom/compiler: improve allocation for final program instructions
- broadcom/compiler: don't assign registers to unused nodes/temps
- broadcom/compiler: only assign rf0 as last resort in V3D 7.x
- v3dv: expose V3D revision number in device name
- v3dv/device: handle new rpi5 device (bcm2712)
- v3dv: setup render pass color clears for any format bpp in v71
- v3dv: setup TLB clear color for meta operations in v71
- v3dv: fix up texture shader state for v71
- v3dv: handle new texture state transfer functions in v71
- v3dv: implement noop job for v71
- v3dv: handle render pass global clear for v71
- v3dv: GFX-1461 does not affect V3D 7.x
- broadcom/compiler: update thread end restrictions validation for v71
- v3dv: handle early Z/S clears for v71
- v3dv: handle RTs with no color targets in v71
- v3dv: don't convert floating point border colors in v71
- v3dv: handle Z clipping in v71
- v3dv: make v3dv_viewport_compute_xform depend on the V3D version
- v3dv: fix depth clipping then Z scale is too small in V3D 7.x
- v3d/v3dv: fix texture state array stride packing for V3D 7.1.5
- v3d,v3dv: support up to 8 render targets in v7.1+
- v3d,v3dv: don't use max internal bpp for tile sizing in V3D 7.x
- v3d,v3dv: propagate NaNs bits in shader state records are reserved in v7.x
- v3dv: use new texture shader state rb_swap and reverse fields in v3d 7.x
- v3dv: fix color write mask for v3d 7.x
- v3d,v3dv: fix depth bias for v3d 7.x
- v3d,v3dv: fix compute for V3D 7.1.6+
- v3dv: expose fullDrawIndexUint32 in V3D 7.x
- v3dv: expose depthClamp in V3D 7.x
- v3dv: expose scalarBlockLayout on V3D 7.x
- v3dv: fix confusing nomenclature about DRM nodes
- v3d,v3dv: fix MMU error from hardware prefetch after ldunifa
Ian Douglas Scott (1):
- egl/wayland: Don't segfault if \`create_wl_buffer` returns \`NULL`
Ian Romanick (38):
- intel/fs: Always do opt_algebraic after opt_copy_propagation makes progress
- intel/fs: Constant fold SHL
- intel/fs: Constant fold OR and AND
- util/rb-tree: Return the actual first node from rb_tree_search
- util/rb-tree: Fix typo in comment
- nir/builder: Add nir_extract_i8_imm and nir_extract_u8_imm helpers
- nir/algebraic: Remove redundant pack / unpack lowering patterns
- intel/fs: Completely re-write the combine constants pass
- intel/fs: Combine constants for SEL instructions too
- intel/fs: Combine constants for integer instructions too
- intel/fs: New VGRF packing scheme for constant combining
- intel/compiler: Combine control barriers with identical memory semantics
- intel/compiler: Don't evict for workgroup-scope fences
- glsl/list: Clean up an inappropriate comment
- util/rb-tree: Work around C++'s dislike of offsetof
- util/rb-tree: Inline rb_tree_init
- intel/fs: Don't continue fixed point iteration just because liveout changes
- intel/fs: Don't try to copy propagate into a source again after progress is made
- intel/fs: Make try_constant_propagate and try_copy_propagate file private
- intel/fs: Move src.file checks out of try_constant_propagate and try_copy_propagate
- intel/fs: Don't loop in try_constant_propagate
- intel/fs: Simplify check in can_propagate_from
- intel/fs: Make opt_copy_propagation_local file private
- intel/fs: Encapsulate per-block ACP in a structure
- intel/fs: Use rb_tree to store ACP entries by source
- intel/fs: Use rb_tree to store ACP entries by destination
- intel/fs: Use rb_tree for copy prop dataflow
- intel/fs: Merge copy prop dataflow loops
- intel/compiler/xe2: Update fs_visitor::setup_vs_payload to account for Xe2 reg size
- intel/compiler/xe2: Use SIMD16 for nir_intrinsic_image_size
- intel/compiler/xe2: TXD is lowered to SIMD16 in SIMD32 mode
- nir/rematerialize: Rematerialize ALUs used only by compares with zero
- intel/compiler/xe2: Handle new URB read messages
- intel/compiler/xe2: Handle new URB write messages
- intel/compiler/xe2: Update fs_visitor::emit_urb_writes to not assume SIMD8
- spirv: Track when a shader has a cooperative matrix
- intel/fs: Add DP4A to get_lowered_simd_width
- nir/split_vars: Don't split arrays of cooperative matrix types
Igor Torrente (4):
- zink: Fix enumerate devices when running compositor
- zink: Removes \`disable_xcb_surface`
- zink: Fix one addicional case when running a compositor
- zink: fix for startup crash of weston running on top of zink + venus
Illia Abernikhin (2):
- state_tracker: moving initialisation of whandle out from if statement whandle initialization inside if statement but used also outside
- i915: change format in dbg string Actually, uintptr_t is of type unsigned long, but the debug line uses the %d format specifier, which expects an int.
Illia Polishchuk (7):
- iris: remove NULL check for already dereferenced pointer earlier
- s/Intel: fix/anv: fix: potentially overflowing expression in genX
- glx: fix dead code when gc var cannot be null due to earlier check
- state_tracker: fix dereference before null check
- anv, drirc: Add workaround to speed up Cyberpunk 2077 reg allocation
- zink: move find_sampler_var from zink to nir core
- nir: fix invalid sampler search by texture id
Italo Nicola (24):
- mesa/main: account for RTT samples when updating framebuffer
- mesa/main: allow readpix/teximage to read from implicitly multisampled fbos
- panfrost/genxml: fix Surface With Stride descriptor alignment
- panfrost/genxml: add Multiplanar Surface descriptor
- panfrost: refactor (un)packing of surface descriptors
- pan/decode: decode Multiplanar Surface descriptors
- panfrost: prepare pan_image_view for multiplanar formats
- panfrost: prepare the driver to support YUYV and variants
- panfrost: advertise support for YUYV and variants
- panfrost: mandate proper alignment requirement depending format and arch
- panfrost: add PAN_MESA_DEBUG=yuv for debugging yuv sampler
- gallium/st: add non-CSC lowering of I420 as PIPE_FORMAT_R8_G8_B8_420
- gallium/st: add non-CSC lowering of YV12 as PIPE_FORMAT_R8_B8_G8_420
- pan/bi: add support for I420 and YV12 sampling
- gallium/st: lower NV21 to R8_B8G8 instead of G8_B8R8
- panfrost: fix invalid memory access in get_equation_str()
- pan/decode: handle more than one panfrost_device
- panfrost/ci: updated CI expectations
- egl: reenable partial redraw with a warning when using gallium hud
- pan/genxml: add Width/Height fields to v9+ Plane descriptor
- panfrost: rename _needs_multiplanar_descriptor to _is_yuv
- panfrost: prepare v9+ to support YUV sampling
- panfrost: use centered YUV chroma siting
- panfrost: advertise YUV formats for valhall
Iván Briano (23):
- anv: ensure CFE_STATE is emitted for ray tracing pipelines
- iris: ensure mesh is disabled on context init
- anv: ensure mesh is disabled on context init
- anv: implement Wa_14019750404
- intel/compiler: call brw_nir_adjust_payload from brw_postprocess_nir
- anv,hasvk: respect provoking vertex setting on geometry shaders
- anv: fix missing 3DSTATE_SBE_CLIP emission
- anv: ensure pipelines have all state
- anv: tell blorp to do mesh stuff only if it's enabled
- blorp: fix hangs with mesh enabled
- anv: use a simpler MUE layout for fast linked libraries
- anv: track what kind of pipeline a fragment shader may be used with
- intel/fs: read viewport and layer from the FS payload
- intel/fs: handle URB setup for fast linked mesh pipelines
- anv: enable VK_EXT_mesh_shader where supported
- intel/fs: use ffsll so we don't explode on 32 bits
- vulkan/runtime: add internal parameter to vk_spirv_to_nir
- nir/lower_int64: respect rounding mode when casting to float
- intel/compiler: round f2f16 correctly for RTNE case
- util: add double_to_float16 helpers
- nir: round f2f16{_rtne/_rtz} correctly for constant expressions
- anv: advertise VK_KHR_global_priority_queue
- anv: use the right vertexOffset on CmdDrawMultiIndexed
Jani Nikula (1):
- docs/vulkan: fixup some typos
Janne Grunau (4):
- asahi: toggle more barrier bits after transform feedback
- asahi,agx: Fix stack buffer overflow in agx_link_varyings_vs_fs
- asahi,agx: Upload constant buffers immediately
- asahi: decode: Fix uint64_t format modifiers in agxdecode_stateful()
Jesse Natalie (2):
- nir_lower_mem_access_bit_sizes: Fix write-mask-constrained 3-byte stores as atomics
- d3d12: Fix multidimensional array ordering
Jianxun Zhang (1):
- intel/common: Only set op mask on instructions in decoder
Jonathan Marek (2):
- freedreno: move redump.h to common code + cleanup
- tu: add a TU_DEBUG=rd option for cmdstream dumping
Jordan Justen (73):
- isl: Add ISL_SURF_USAGE_STREAM_OUT_BIT
- anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS
- genxml/hsw: Add additional MOCS field enumerations
- genxml/chv: Add MEMORY_OBJECT_CONTROL_STATE_CHV to document compared to BDW
- isl/dev: Add uncached MOCS value
- isl: Set MOCS to uncached for MTL stream-out
- intel/isl: Use intel_needs_workaround() for MTL CCS WA
- intel/compiler: Use nir SUBGROUP_INVOCATION for RT TOPOLOGY_ID
- intel/dev: Add LNL platform enum
- intel/dev: Support xe2 device init (for intel_device_info_test)
- intel/tools: Use 'env bash' to find bash executable
- intel/decoder: Fix xml filename when verx10 % 10 is not 0
- intel/decoder: Add intel_spec_load_common()
- intel/decoder: Make intel_spec_load_filename() have separate dir and name strings
- intel/genxml: Align "Texture Coordinate Mode" naming
- intel/genxml: Split some genxml sorting code into a intel_genxml module
- intel/genxml: Convert gen_bits_header to use ElementTree
- intel/genxml: Convert gen_pack_header to use ElementTree
- intel/genxml: Add GenXml class into intel_genxml module
- intel/genxml: Add filter_engines() to GenXml class
- intel/genxml: Move sorting & writing into GenXml class
- intel/genxml: Don't rewrite sorted xml if the contents didn't change
- intel/genxml: Add final newline to output when saving xml
- intel/genxml: Update xml with gen_sort_tags.py output
- intel/dev: Use RPL-U name on RPL-U devices
- intel/dev: Add more RPL PCI IDs
- anvil,hasvk: Rename need_clflush to need_flush
- intel/common: Move intel_clflush.h to intel_mem.h/intel_mem.c
- anvil,hasvk: Replace intel_clflush_range with intel_flush_range
- intel/common: Add intel_flush_range_no_fence
- anvil,hasvk: Use intel_flush_range_no_fence to flush command buffers
- util/u_cpu_detect: Drop unused has_tsc
- util/u_cpu_detect: Detect clflushopt support
- meson: Check for the __builtin_ia32_clflushopt function
- intel/clflush: Add support for clflushopt instruction
- intel/dev/xe: Move placeholder subslice info into XEHP_FEATURES
- intel/genxml: Ignore tail leading/trailing whitespace in node_validator()
- intel/genxml: Fix comparing xml when node counts differ
- intel/dev: Update device string for MTL PCI ID 0x7d55
- intel/genxml: Support importing from another genxml file
- intel/genxml: Add support for excluding items when importing
- intel/genxml: Add all xml files as pack dependencies
- intel/genxml: Add GenXml.optimize_xml_import()
- intel/genxml: Drop assertion to allow for importing
- intel/genxml: Add GenXml.add_xml_imports method
- intel/genxml: Add GenXml.flatten_xml() method
- intel/genxml: Add genxml_import.py script
- intel/decoder: ralloc_steal() values from spec context for fields and enums
- intel/decoder: Implement support for importing genxml
- intel/genxml: Start Xe2 support
- intel/genxml: Auto-import genxml files using genxml_import.py
- intel/common: Add sse2_args for 32-bit build when -Dsse2=false was set
- intel/compiler/fs: Support Xe2 reg size in assign_curb_setup
- intel/compiler: Update opt_split_sends() for Xe2 reg size
- intel/compiler: Update emit_rt_lsc_fence() for Xe2
- intel/compiler: Update lower_trace_ray_logical_send() for Xe2
- intel/compiler: Update ray-tracing intrinsic lowering for Xe2
- intel/compiler: Update RT stack_id access for Xe2
- intel/fs: Update SSBO & shared uniform block loads for Xe2
- intel/genxml: Build with gen20.xml
- intel/isl: Build for Xe2
- iris: Build for Xe2
- anv/blorp: Use anv_genX to set device->blorp.exec
- anv: Disable Ray Tracing on xe2 until our compiler supports Xe2 RT
- anv: Build for Xe2
- anv: Print warning that Xe2 is not supported rather than failing
- intel/compiler: Add enum xe2_lsc_cache_store
- intel/compiler: Use enum xe2_lsc_cache_store on xe2
- intel/compiler: Add enum xe2_lsc_cache_load
- intel/compiler: Use enum xe2_lsc_cache_load on xe2
- anv/batch: Check if batch already has an error in anv_queue_submit_simple_batch()
- anv/batch: Assert that extend_cb is non-NULL if the batch is out of space
- intel/dev: Add 0x56ba-0x56bd DG2 PCI IDs
Jose Maria Casanova Crespo (2):
- vc4: mark buffers as initialized at vc4_texture_subdata
- vc4: Fix mask RGBA validation at YUV blit
José Expósito (3):
- zink: Fix crash on zink_create_screen error path
- zink: fix dereference before NULL check
- zink: allow software rendering only if selected
José Roberto de Souza (51):
- anv: Use workaround framework to Wa_14016118574
- intel/aux_map: Nuke format_enum
- intel/aux_map: Use get_aux_entry() in remove_mapping()
- intel/aux_map: Replace magic number by INTEL_AUX_MAP_ENTRY_VALID_BIT
- intel/aux_map: Rename some variables to improve readability
- intel/aux_map: Mask out bits above index 47 in intel_aux_get_meta_address_mask()
- intel/aux_map: Convert l1_entry_addr_out to canonical
- intel/aux_map: Drop magic sub table size number
- intel/aux_map: Add function and macro to return l2 and l1 table masks
- anv: Add gem_create_userptr() to KMD backend
- anv: Replace handle by anv_bo in the gem_close()
- anv: Add support for userptr in Xe KMD
- intel: Sync xe_drm.h
- intel/dev/xe: Add support for small-bar setups
- anv: Request Xe KMD to place BOs to CPU visible VRAM when required
- iris: Request Xe KMD to place BOs to CPU visible VRAM when required
- iris/xe: Call iris_lost_context_state() when batch engine is replaced
- intel/dev: Port intel_dev_info tool to Xe KMD
- iris: Replace I915_EXEC_FENCE_SIGNAL by IRIS_BATCH_FENCE_SIGNAL in common code
- intel: Move i915_drm.h specific code from common/intel_gem.h to common/i915/intel_gem.h
- intel/common: Move functions inside of C++ ifdef
- intel: Rename intel_gem_add_ext() to intel_i915_gem_add_ext()
- iris: Move i915_gem_set_domain() call to i915 backend
- iris: Move iris_bufmgr_bo_close() to kmd backend
- iris: Add gem_create_userptr() to KMD backend
- iris: Add support for userptr in Xe KMD
- intel/genxml/gen125: Add missing fields in MI_MATH
- iris: Set MI_MATH MOCS field
- anv: Set MI_MATH MOCS field
- intel/tests/mi_builder: Set MI_MATH MOCS field
- intel/genxml/gen125: Set MI_MATH MOCS field as non-zero
- anv: Nuke unused READ_ONCE() from anv_batch_chain.c
- anv: Remove VkAllocationCallbacks parameter from reloc functions
- anv: Return earlier in anv_reloc_list functions
- intel: Sync xe_drm.h and rename engine to exec_queue
- anv: Override vendorID for Hogwarts Legacy
- intel/isl: Remove unknown workaround
- intel/isl: Remove Wa_22011186057
- anv: Update Wa_16014390852 for MTL
- intel: Sync xe_drm.h
- anv: Move i915 specific gem_set_caching to backend
- anv: Move i915 specific code from common anv_gem.c
- anv: Move bo_alloc_flags_to_bo_flags() to backend
- anv: Move i915 handling of imported bos bo_flags
- anv: Remove i915_drm.h include from common code
- iris: Lock bufmgr->lock before call vma_free() in error path
- iris: Nuke useless flags from iris_fine_fence_new()
- intel: Prepare implementation of Wa_18019816803 and Wa_16013994831 for future platforms
- intel: Sync xe_drm.h
- anv: Switch Xe KMD vm bind to sync
- anv: Add missing ANV_BO_ALLOC_EXTERNAL flags when calling anv_device_import_bo()
Juan A. Suarez Romero (7):
- broadcom/ci: update expected results
- vc4/ci: update expected results
- v3d/shim: include new ioctl parameters
- v3dv/ci: update expected list
- broadcom: add performance counters for V3D 7.x
- broadcom/simulator: add per-hw version calls
- v3d/vc4/ci: add new fails/timeout
Julia Tatz (10):
- gallium/dri: fix dri2_from_names
- aux/trace: skip multi-line comments in enums2names
- aux/trace: deduplicate enum dump macro work
- aux/trace: move trace_sample_view logic
- aux/trace: fix set_hw_atomic_buffers method name
- aux/trace: add screen video methods
- aux/trace: add context video methods
- aux/trace: wrap video_codec & video_buffer
- aux/trace: unwrap refrence frames in picture_desc
- aux/trace: trace video_buffer method return vals
Julia Zhang (1):
- radeonsi: modify algorithm of skipping holes of sparse bo
Julian Hagemeister (1):
- Gallium: Fix shared memory segment leak
Juston Li (10):
- zink: remove venus from renderpass optimizations
- venus: sync protocol for VK_EXT_vertex_input_dynamic_state
- venus: implement VK_EXT_vertex_input_dynamic_state
- venus: set lvp queries as saturate on overflow
- venus: add helper function to get cmd handle
- venus: refactor out common cmd feedback functions
- venus: support deferred query feedback recording
- venus: track/recycle appended query feedback cmds
- venus: append query feedback at submission time
- venus: switch to unconditionally deferred query feedback
Kai Wasserbäch (3):
- fix: clover: LLVM 18 renamed/moved CGFT_*, update compat layer
- fix: clover: LLVM 18: s/CodeGenOpt::/CodeGenOptLevel::/
- fix: clover: warning: ignoring return value of ‘int posix_memalign(…)’ [-Wunused-result]
Karmjit Mahil (29):
- pvr: Remove mrt setup from SPM EOT
- pvr: Compile SPM EOT shader
- pvr: Use the SPM EOT on barrier stores
- pvr: Remove some magic numbers and increments from km stream
- pvr: Restructure \`rogue_kmd_stream.xml`
- pvr: Submit PR commands
- pvr: Use the correct size for the unified store allocation
- pvr: Allow query stage for barrier sub cmds
- pvr: Fix occlusion query unaccounted for user fences
- pvr: Fix writing query availability write out
- pvr: Fix packing issue with max_{x,y}_clip
- pvr: Fix csb relocation status assert on \`pvr_csb_finish()`
- pvr: Fix \`for` loop itarator usage
- pvr: Fix dynamic desc offset storage
- pvr: Fix cubemap layer stride
- pvr: Use the render passes' attachments array to setup ISP state
- pvr: Adjust EOT PBE state to account for the iview's base array layer
- pvr: Fix MRT index in PBE state
- pvr: Fix pbe_emit assert
- pvr: Fix OOB access of pbe_{cs,reg}_words
- pvr: Order tile buffer EOT emits to be last
- pvr: Fix subpass sample count on ds attachment only
- pvr: Refactor subpass ds and sample count setup
- pvr: Fix SPM load shader sample rate
- pvr: Fix PPP_SCREEN sizes
- vulkan: Add \`vk_subpass_dependency_is_fb_local()` helper
- tu: Use common \`vk_subpass_dependency_is_fb_local()`
- pvr: Don't merge subpasses on framebuffer-global dependancy
- pvr: Only setup the bgobj to load if we have a load_op
Karol Herbst (213):
- nvc0: initial Ada enablement
- rusticl/mesa: make svm_migrate optional
- llvmpipe: enable system SVM
- nvc0: fix num_gprs for Volta+
- rusticl: fix warnings with newer rustc
- gm107/ir: fix SULDP for loads without a known format
- nv50/ir/nir: fix txq emission on MS textures
- nv50/ir/nir: Fix zero source handling of tex instructions.
- rusticl/kernel: only handle function_temp memory before lowering printf
- meson,ci: bump meson req for rusticl to 1.2
- rusticl/nir: add helper functions we need for a NIR_PASS macro
- rusticl/nir: add a nir_pass macro
- rusticl/nir: use the new nir_pass macro
- rusticl/kernel: rename res to internal_args inside lower_and_optimize_nir_late
- rusticl/kernel: merge lower_and_optimize_nir_pre_inputs and lower_and_optimize_nir_late
- rusticl/kernel: move things around in lower_and_optimize_nir
- rusticl/kernel: get rid of initial function_temp type lowering
- rusticl/kernel: mark can_remove_var as unsafe and document it
- n50/compute: submit initial compute state in nv50_screen_create
- nvk: add vulkan skeleton
- nouveau/winsys: add the new winsys implementation
- nvk: use winsys lib
- nvk: fix nvk_buffer include guards
- nouveau/headers: add script to sync in-tree headers with open-gpu-doc
- nouveau/headers: initial sync of headers
- nvk: implement GetPhysicalDeviceQueueFamilyProperties2 to make the CTS happy
- nvk: advertize memory heaps and types
- nouveau/ws: reorganize a little
- nouveau/ws: dup the fd
- nouveau/ws: add a field for the SM version
- nvk: set nonCoherentAtomSize as the CTS divides with this value
- nouveau/ws: add bo API
- nvk: add basic device memory support
- nouveau/headers: add nvtypes.h
- nouveau/headers: typedef Nv void types
- nouveau/headers: add host classes
- nouveau/ws: add context support
- nouveau/ws: add a cmd buffer
- novueau/bo: refcount it
- novueau/bo: add nouveau_ws_bo_wait
- nvk: allocate a GPU context for each VkDevice
- nvk: add nvk_bo_sync
- nvk: add nvk_CmdPipelineBarrier2 stub
- nvk: impl nvk_CmdCopyBuffer
- nouveau/ws: fix setting push bo domains
- nouveau/ws: PUSH_IMMD only works with 16 bit values
- nouveau/ws: set GPU object class
- nouveau/ws: bind 2D class
- nvk: use fermi class definitions
- nvk: add basic support for images
- nvk: simple format table
- nvk: add support for blits
- nvk: report maxMipLevels as 1
- nvk: optimize blit command buffer gen
- nvk: implement CmdFillBuffer
- nvk: implement CmdUpdateBuffer
- nvk: implement CmdCopyBuffer2
- nvk: advertise VK_KHR_copy_commands2
- nvk: implicitly reset the command buffer
- nouveau/ws: handle 0inc inside nvk_push_val as well
- nvk: reduce pitch even further in CmdFillBuffer
- nvk: support multiple miplevels
- nvk: support array blits over multiple layers
- nvk: tiling prep work for VK_EXT_image_2d_view_of_3d
- nouveau/ws: make sure we don't submit nonsense
- nouveau/ws: assert on broken channel
- nvk/blit: assert that formats are supported
- nouveau/headers: Generate parser functions
- nouveau/ws: initial debugging options for command submissions
- nouveau/ws: depend on generated class header files
- nouveau/ws: get rid of libdrm
- nouveau/ws: use new NVIF interface to query oclasses
- nvk: set deviceName
- nouveau/headers: add path for 3D headers
- nouveau/headers: initial 3D headers import
- nouveau/ws: allocate 3D subchan
- nouveau/ws: allocate copy subchan as well
- nouveau/ws: add API to query if the context was killed
- nouveau/ws: add a bo unmap helper function
- nvk: clean up bo mappings
- nouveau/ws: bound check nouveau_ws_push_append
- nouveau/ws: rework refing push buffer bos
- nouveau/ws: push chaining
- nvk: fix OOB read inside nvk_get_va_format
- nvk: alloc a zero page and use it for vertex runouts
- nvk: fix zero page refing
- nvk: support exporting buffers
- nvk: fix some class version checks
- nvk: properly align shaders pre Turing
- nvk: rework QMD handling to support pre Turing
- nvk: align desc root table
- nvk: Use SET_PIPELINE_PROGRAM pre-Volta
- nvk: properly align slm size
- nvk: use remaps for image copies
- nvk: reduce pitch for FillBuffer
- nvk: bind more subchans in init_context_state
- nvk: support pre Maxwell Texture Headers
- nvk/device: fix order of error handling
- nvk: allocate VAB memory area
- nvk: wire up M2MF for Fermi
- nouveau/mme: add test for BEQ with magic exit offset
- nouveau/mme: add a macro exit helper
- nvk: Add a macro to set MMIO registers via falcons
- nouveau/winsys: fix SM value for Ada
- nvk: fix num_gprs for Volta+
- nvk: replace mp with tpc
- nvk: properly calculate SLM region by taking per arch limits into account
- nouveau: fix max_warps_per_mp_for_sm for builds with asserts disabled
- nvk: enable fp helper invocations loads on more gens
- nv50/ir: use own info struct for sys vals
- nv50/ir: convert system values to gl_system_value
- nouveau/mme: fix OOB access inside while_ine builder test
- nouveau/mme: fix OOB inside tu104 simulator
- clc: use CLANG_RESOURCE_DIR for clang's resource path
- nv50: fix code uploads bigger than 0x10000 bytes
- nouveau: take glsl_type ref unconditionally
- rusticl/kernel: optimize nir between lowering io and explicit types
- nv50: limit max code uploads to 0x8000
- zink: fix source type in load/store scratch
- zink: fix global stores
- zink: update some compute caps
- rusticl: add debug option to sync every event
- rusticl/device: _MAX_CONST_BUFFER0_SIZE is unsigned
- ci: disable a660 jobs
- nir: make workgroup_id 32 bit only
- nir: make num_workgroups 32 bit only
- ac: drop 64 bit handling for cl workgroup intrinsics
- gallivm/nir: drop 64 bit handling for cl workgroup intrinsics
- intel/compiler: drop 64 bit handling for cl workgroup intrinsics
- panfrost: drop 64 bit handling for cl workgroup intrinsics
- rusticl: reduce global_invocation_id_zero_base to 32 bit
- panfrost: drop pan_nir_lower_64bit_intrin
- rusticl/disk_cache: fix stack corruption
- rusticl/query: fix use-after-free, but also fix incorrect usage of unsafe
- rusticl/event: disable profiling for devices without timestamps
- rusticl/queue: properly implement clCreateCommandQueueWithProperties
- rusticl/memory: do not verify pitch for IMAGE1D_BUFFER
- rusticl/memory: only specify PIPE_BIND_SHADER_IMAGE where supported
- asahi: fetch available system memory
- asahi: lower hadd
- asahi: handle kernels
- asahi: handle load_workgroup_size
- asahi: handle load_global_invocation_id_zero_base
- asahi: implement get_compute_state_info
- asahi: implement set_global_binding
- asahi: implement clear_buffer
- asahi: gracefully handle allocating linear images
- asahi: handle images in is_format_supported
- rusticl/memory: fallback if allocating linear images fails
- rusticl: enable asahi
- rusticl/mesa: create contexts with PIPE_CONTEXT_NO_LOD_BIAS
- docs/features: cl_khr_3d_image_writes needs driver support
- rusticl/mesa: fix \`set_constant_buffer` when passing an empty buffer
- rusticl/kernel: skip adding global id offsets if not used
- meson/rusticl: add sha1_h
- rusticl/mesa/context: fix clear_sampler_views
- nir: add nir_lower_alu_vec8_16_srcs pass
- zink: lower vec8/16
- rusticl/mesa: create COMPUTE_ONLY contexts
- rusticl: fix clippys bool_to_int_with_if
- rusticl/memory: fix potential use-after-free in clEnqueueSVMMemFill
- nir/load_libclc: fix libclc memory leak
- rusticl/kernel: Fix creation from programs not built for every device
- ci: add half-life 2 freedreno flake
- zink: implement get_compute_state_info
- zink: copy has_variable_shared_mem cs property
- zink: pass entire pipe_grid_info into zink_program_update_compute_pipeline_state
- zink: refactor spec constant handling
- zink: variable shared mem support
- zink: support more nir opcodes
- zink: make spirv_builder_emit_*op compatible with spec constants
- zink: support samplers with unnormalized_coords
- zink: implement remaining pack ops via bitcast
- zink: fix RA textures
- zink: fix load/store scratch offsets
- rusticl/mesa/screen,device: add driver_name
- rusticl: enable zink
- pipe-loader: allow to load multiple zink devices
- rusticl: bump rustc version to 1.66
- rusticl/mesa/nir: mark more methods as mut
- rusticl/mesa/nir: Mark NirShader and NirPrintfInfo as Send and Sync
- rusticl/mesa: mark PipeResource as Send and Sync
- rusticl/mesa: mark PipeTransfer as Send
- rusticl/cl: mark _cl_image_desc as Send and Sync
- rusticl/queue: get rid of pointless Option around our worker thread handle
- rusticl/queue: make it Sync
- rusticl/kernel: get rid of Arcs in KernelDevStateVariant
- rusticl/memory: use get_mut instead of lock in drop
- zink: implement PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS
- rusticl/api: remove cl_closure macro
- zink: implement load_global_constant
- zink: properly emit PhysicalStorageBufferAddresses cap
- nir/lower_mem_access_bit_sizes: fix invalid shift bit_size
- rusticl/device: restrict 1Dbuffer images for RGB and RGBx
- rusticl/memory: use PIPE_BUFFER for IMAGE1D_BUFFER images
- rusticl/format: disable all sRGB formats
- asahi: flush denorms on exact fmin/fmax
- zink: wrap shared memory blocks in a struct
- zink: properly alias shared memory
- zink: fix zink_destroy_screen for early screen creation fails
- docs/features: remove empty lines confusing mesamatrix
- rusticl/device: restrict image_buffer_size
- rusticl/device: restrict param_max_size further
- rusticl/mem: properly set pipe_image_view::access
- zink: lower fisnormal as it requires the Kernel Cap
- radv: fix buffers in vkGetDescriptorEXT with size not aligned to 4
- rusticl/queue: Only take a weak ref to the last Event
- rusticl/mesa: pass PIPE_BIND_LINEAR in resource_create_texture_from_user
- zink: deallocate global_bindings array
- rusticl/mesa/screen: do not derefence the entire pipe_screen struct
- nvc0: implement PIPE_CAP_TIMER_RESOLUTION
- rusticl/queue: do not send empty lists of event to worker queue
- rusticl/queue: fix implicit flushing of queue dependencies
Kenneth Graunke (21):
- iris: Re-emit 3DSTATE_DS for each primitive (workaround 14019750404)
- intel/compiler: Fix sparse cube map array coordinate lowering
- intel/compiler: Respect NIR_DEBUG_PRINT_INTERNAL for DEBUG_OPTIMIZER
- intel/fs: Account for payload GRFs when calculating register pressure
- intel/compiler: Move SCHEDULE_NONE handling into schedule_instructions()
- intel/fs: Index scheduler mode string table by mode enum
- intel/fs: Make helpers for saving/restoring instruction order
- intel/fs: Pick the lowest register pressure schedule when spilling
- intel/fs: Dump IR for pre-RA scheduler modes in DEBUG_OPTIMIZER
- iris: Check prog[] instead of uncompiled[] for BLORP state skipping
- nir: Fix function parameter indentation in nir_opt_barriers.c
- nir: Add an optimization pass to reduce barrier modes
- nir: Reduce the scope of shared memory barriers
- lavapipe: Don't delete control barriers
- virgl, nir_to_tgsi: Add a hack for promoting partial memory barriers
- dxil: Set UAV_FENCE_THREAD_GROUP any time global isn't required
- glsl: Use nir_opt_barrier_modes() to drop unnecessary barriers
- anv: Use nir_opt_barrier_modes() to drop unnecessary barriers
- mesa: Fix zeroing of new ParameterValues array entries when growing
- intel/fs: Fix Xe2 URB read/lowering with per-slot offsets
- anv: Add support for a transfer queue on Alchemist
Kevron Rees (1):
- Force vk vendor for spider-man remastered
Konrad Dybcio (5):
- freedreno: Set magic writes per-GPU, using existing data
- freedreno: Include speedbin fallback in 740 chipid to fix probing
- freedreno: Include speedbin fallback in 730 chipid to fix probing
- freedreno: Include speedbin fallback in 690 chipid to fix probing
- freedreno: Add Adreno 643
Konstantin Seurer (95):
- radv: Stop using the misleading round_up_u* functions
- radv/meta_buffer: Stop setting RADV_META_SAVE_DESCRIPTORS
- radv/meta_buffer: Rename size_minus16 to max_offset
- llvmpipe: Fix compiling with LP_USE_TEXTURE_CACHE
- nir/tests: Refactor boilerplate into a common header
- nir/tests: Use a single binary
- draw: Do not restart the primitive_id at 0
- gallivm: Fix subsampled format sampling under Vulkan
- gallivm: Ignore nir_tex_src_plane
- lavapipe: Remove dummy sampler ycbcr conversion
- lavapipe: Store immutable_samplers as lvp_sampler array
- lavapipe: Fix binding immutable samplers with desc buffers
- lavapipe: Implement samplerYcbcrConversion
- lavapipe: Advertise samplerYcbcrConversion
- llvmpipe: Zero extend vectors in widen_to_simd_width
- vulkan: Add a generated vk_properties struct
- radv: Use common physical device properties
- clang-format: Disable formatting by default
- lavapipe: Use common physical device properties
- nir/from_ssa: Don't insert store_reg instructions before phis
- gallivm: Run nir_convert_to_lcssa before nir_convert_from_ssa
- lavapipe/ci: Remove descriptor_indexing fails
- radv/rt: Rename shader_pc and next_shader
- radv/rt: Rename traversal_shader to traversal_shader_addr
- nir/opt_large_constants: Handle small float arrays
- bin: Update spirv sources
- vulkan: Allow beta extensions for physical device features
- vulkan: Allow beta extensions for physical device properties
- vulkan Add enqueue entrypoint for CmdDispatchGraphAMDX
- nir: Add shader enqueue data structures and handling
- spirv: Update headers and grammer JSON
- spirv: Implement SPV_AMDX_shader_enqueue
- lavapipe: Add lvp_pipeline_type
- lavapipe: Implement exec graph pipelines
- lavapipe: Implement AMDX_shader_enqueue commands
- lavapipe: Advertise AMDX_shader_enqueue
- radv: Add internal_nodes_offset to scratch_layout
- radv: Remove leaf_args::dst_offset
- radv/rt: Remove some dead code
- radv/rt: Do not apply stack_ptr for non-recursive stages
- radv/rt: Add and use radv_build_traversal
- radv/rt: Insert rt_return_amd before lowering shader calls
- radv/rt: Split stage initialization and hashing
- aco: Do not fixup registers if there are no shader calls
- radv: Stop updating the stack_size in insert_rt_case
- lavapipe: Lock around CSO destroys
- vulkan/wsi/x11: Implement capture hotkey using the keymap
- venus: Use the common GetPhysicalDeviceFeatures2 implementation
- nir/lower_shader_calls: Limit the remat chain length
- lavapipe: Avoid lowering shaders twice
- lavapipe: Fix the locking around cso destruction
- aco/validate: Handle p_wqm like p_parallelcopy
- aco: Use bytes() instead of size() in emit_wqm
- aco: Unify demote and demote_if selection
- radv: Only generate debug info if required
- aco/lower_to_cssa: Fix typo
- radv: Don't use the depth image view for depth bias emission
- radv/rt: Store NIR shaders separately
- radv/rt: Add monolithic raygen lowering
- radv/rt: Enable monolithic pipelines
- radv/ci: Document new flake
- vulkan/properties: Handle unsized arrays properly
- radv: Remove dead radix_sort_vk_get_memory_requirements call
- radv/radix_sort: Vendor the radix sort dispatch code
- radv: Perform multiple sorts in parallel
- radv/ci: Improve ray tracing skips
- ac/llvm: Fix typed loads with 16bit formats
- ac/llvm: Use the correct return type for uadd_carry and usub_borrow
- ac/llvm: Use float types for float atomics
- radv: Don't advertise features requiring PS epilogs with LLVM
- radv: Update navi21 llvm fails
- radv/rt: Handle stages without nir properly
- radv: Remove ray tracing shader module identifier skips
- radv/bvh: Treat instances with mask == 0 as inactive
- radv/ray_queries: Skip cull_mask handling if it is FF
- radv/rt: Skip cull_mask handling if it is FF
- aco/spill: Make sure that offset stays in bounds
- nir: Add nir_cf_node_cf_tree_prev
- nir: Add nir_foreach_block_in_cf_node_reverse
- nir: Add nir_rematerialize_deref_in_use_blocks
- nir/lcssa: Fix rematerializing derefs
- nir/deref: Layer rematerialization helpers
- lavapipe/ci: Fix asan expectations
- hasvk: Use the common GetPhysicalDeviceFeatures2 implementation
- vulkan: Remove vk_get_physical_device_core_1_*_feature_ext
- radv/bvh/ploc: Load child bounds from LDS
- radv: Merge the sync_data and header initialization
- radv: Do not sync after radv_update_buffer_cp
- zink: Initialize primitive types to an invalid value
- nir/passthrough_gs: Support edge flags with points
- zink: Enable edge flags with points
- mesa: Fix glBegin/End when LINE_LOOP is not supported
- llvmpipe: Compile a nop texture function for unsupported configurations
- radv/rt: Use nir_shader_instructions_pass for lower_rt_instructions
- radv/sqtt: Fix tracing acceleration structure commands
Lang Yu (5):
- amd/common: add AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 property
- radeonsi: use AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 to determine wave size
- radeonsi: use wave size to determine index stride
- amd/common: add missing stuff for gfx11.5
- amd/radeonsi: add missing stuff for gfx11.5
Leandro Ribeiro (13):
- egl: rewrite outdated comment in _eglFindDevice()
- egl: remove unused parameter from _eglAddDRMDevice()
- egl: simplify _eglAddDRMDevice()
- egl: make explicit that we don't support render nodes for software EGLDevice
- egl: move is_render_node flag to platform_wayland
- loader: rename loader_open_render_node() to loader_open_render_node_platform_device()
- loader: add driver list as parameter in loader_open_render_node_platform_device()
- pipe-loader: add pipe_loader_get_compatible_render_capable_device_fd()
- dri: add queryCompatibleRenderOnlyDeviceFd() to __DRI_MESA extension
- kmsro: try to use only compatible render-capable devices
- loader: add loader_is_device_render_capable()
- egl/drm: get compatible render-only device fd for kms-only device
- egl: error out if we can't find an EGLDevice in _eglFindDevice()
Leo Liu (4):
- radeonsi: add AV1 profile to supported profile
- radeonsi/vcn: fix the incorrect dt_size
- Revert "frontends/va: Also map VAImageBufferType for reading"
- ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG
Lina Versace (14):
- docs: Add row for VK_KHR_maintenance5
- intel/pci_ids: Consistently use lowercase
- venus: Sync protocol for VK_EXT_graphics_pipeline_library
- venus: Erase pViewports and pScissors in fewer cases
- venus: Fix crash when VkGraphicsPipelineCreateInfo::layout is missing
- venus: Fix subpass attachments
- venus: Drop incorrectly-used always-true pipeline vars
- venus: Use VkImageAspectFlags in vn_subpass
- venus: Add enum vn_pipeline_type
- venus: Renames for VkGraphicsPipelineCreateInfo fixes
- venus: Refactor pipeline fixup into two stages
- venus: Do pipeline fixes for VK_EXT_graphics_pipeline_library
- venus: Enable VK_EXT_graphics_pipeline_library behind debug flag
- venus: Fix -Wmaybe-uninitialized
LingMan (22):
- rusticl/memory: fix potential use-after-free in clEnqueueSVMFree
- rusticl: Rename XyzCB aliases to FuncXyzCB
- rusticl: add structs to hold the C callbacks
- rusticl: use CreateContextCB
- rusticl: use DeleteContextCB
- rusticl: use EventCB
- rusticl: use MemCB
- rusticl: use ProgramCB
- rusticl: use SVMFreeCb
- rusticl: Make EventSig take ownership of its environment
- rusticl: add a safe abstraction to execute a DeleteContextCB
- rusticl: add a safe abstraction to execute an EventCB
- rusticl: add a safe abstraction to execute a MemCB
- rusticl: add a safe abstraction to execute an SVMFreeCb
- rusticl: add a safe abstraction to execute a CreateContextCB
- rusticl: add a safe abstraction to execute a ProgramCB
- rusticl/api: drop a few include paths
- rusticl: mark the fields of callback structs private
- rusticl: drop an \`#[allow(dead_code)]` marker
- rusticl/core: don't take a lock while dropping \`Context`
- rusticl: Show an error message if the build is attempted with an outdated bindgen version
- rusticl: Show an error message if the version of bindgen can't be detected
Lionel Landwerlin (169):
- anv: hide exec_flags selection inside the i915 backend
- isl: add a tool to query surface parameters
- intel/fs: fix missing predicate on SEL instruction
- intel/compiler: rework input parameters
- ci/a530: switch a few tests to flakes to unblock CI
- vulkan: bump header register to 1.3.258
- intel/fs: don't try to rebuild sequences of non ssa values
- intel/vec4: fix log_data pointer
- intel/fs: consider UNDEF as non-partial write
- intel/fs: add more UNDEFs around SEND messages
- isl: add ability to store buffer size in unused RENDER_SURFACE_STATE fields
- anv: simplify buffer address+size loads from descriptor buffer
- intel/fs: add support for sparse accesses
- intel/nir: handle image_sparse_load in storage format lowering
- intel/nir: add lower for sparse images & textures
- anv: wire image sparse loads
- blorp: switch blorp_update_clear_color to early return
- blorp: update and move fast clear PIPE_CONTROLs to drivers
- anv: fix 3DSTATE_RASTER::APIMode field setting
- anv: enable EDS3 ConservativeRasterizationMode
- vulkan: skip non required extension structures
- vulkan/runtime: add a layered implementation of vkCmdBindIndexBuffer
- anv: enable INTEL_DEBUG=nofc
- anv: fake non intel vendorID for Death Stranding
- hasvk: fix null descriptor handling with A64 messages
- anv: remove descriptor array bounds checking
- hasvk: remove descriptor array bounds checking
- anv/hasvk: track robustness per pipeline stage
- anv: implement VK_EXT_pipeline_robustness
- intel/fs: track more steps with INTEL_DEBUG=optimizer
- intel/fs: add variable for output of debug backend optimizer
- intel/decoder: constify some input parameters
- blorp: drop programming of 3DSTATE_(MESH|TASK)_SHADER
- anv: emit 3DSTATE_GS only once per pipeline
- intel/decoder: add options to decode surfaces/samplers
- anv: get rid of genX(emit_multisample)
- anv: move genX(rasterization_mode) to gfx8_cmd_buffer.c
- anv: don't try to access dynamic buffers from surface states
- iris: ensure stalling pipe control before fast clear
- intel/compiler: disable per-sample interpolation modes with non-per-sample dispatch
- intel/compiler: fix dynamic alpha-to-coverage handling
- intel/fs: implement dynamic interpolation mode for dynamic persample shaders
- intel/fs: move lower of non-uniform at_sample barycentric to NIR
- zink+anv: add regression testing with pipeline libraries
- anv: implement vkCmdBindIndexBuffer2KHR
- anv: handle new VkBufferViewUsageCreateInfoKHR
- anv: add vkGetRenderingAreaGranularityKHR()
- anv: implement GetDeviceImageSubresourceLayoutKHR/GetImageSubresourceLayout2KHR
- anv: add maintenance5 A8_UNORM/A1B5G5R5_UNORM support
- anv: deal with new pipeline flags
- anv: enable KHR_maintenance5
- anv: add missing ISL storage usage
- genxml/gfx11: remove Tiled Resource Mode field from HIER_DEPTH_BUFFER
- genxml/gfx12: rename Tiled Resource Mode
- isl: program 3DSTATE_HIER_DEPTH_BUFFER_BODY::TiledMode as documented
- intel/isl: Disallow Yf, Ys and Tile64 for 3D depth/stencil surfaces
- isl: disable Yf/Ys/Tile64 tilings for 1D images
- isl: add a usage flag to request 2D/3D compatible views
- isl: disallow TileYs/Yf on 3D storage images on Gfx9/11
- intel/isl: Add a max_miptail_levels field to isl_tile_info
- isl: make isl_surf_get_uncompressed_surf robust to argument accesses
- isl: add Gfx12/12.5 restriction on 3D surfaces & compression
- isl: disallow miptails on planar formats
- isl: disable miptails on gfx12 with yuv formats
- isl: disable CCS on Ys/Yf
- blorp: allow 3D blits/copies on Ys/Yf/Tile64 tiling
- intel/aux_map: correctly program tiling mode for Ys
- isl: reorder tiling selection
- anv: enable standard Y tiles
- isl/tilememcpy_test: add multiple tile testing
- anv: rename total_batch_size
- anv: reuse cmd_buffer::total_batch_size
- intel/measure: track batch buffer sizes
- intel/nir: rerun lower_tex if it lowers something
- intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL
- hasvk: add state cache invalidation back before fast clears
- blorp: remove unused variable
- anv: remove ReorderMode from pipeline 3DSTATE_GS emission
- anv: change anv_batch_emit_merge to also do packing
- intel/anv: batch stats util
- intel/decoder: implement accumulated prints
- anv: move all dynamic state emission to cmd_buffer_flush_dynamic_state
- anv: rename files to represent their usage
- anv: categorize partial/final pipeline instruction
- anv: split 3DSTATE_TE packing between static & dynamic parts
- anv: split 3DSTATE_VFG emission
- anv: add a flag tracking occlusion query count change
- anv: split pipeline programming into instructions
- vulkan/runtime: add helper to name dirty states
- anv: add new low level emission & dirty state tracking
- anv: remove unused state emission
- anv: split BLEND_STATE packing from BLEND_STATE_POINTERS emit
- docs: update Anv documentation about dynamic state emission
- anv: create individual logical engines on i915 when possible
- anv: Copy/Clear MSAA images over companion RCS while we are on compute
- pps-producer: add ability to select device with DRI_PRIME
- anv: remove aux checking asserts
- anv: bound image usages to the associated queue family
- anv: fix 3DSTATE_VFG emission
- anv: emit 3DSTATE_URB_ALLOC_(MESH|TASK) only when mesh shaders are enabled
- anv: ensure mesh pipeline have all pre-rasterization stages disabled
- anv: ensure partially packed instructions are emitted in the pipeline
- anv: fix missing 3DSTATE_SBE_MESH emission
- anv: fix utrace timestamp buffer copies
- anv: add a memcpy compute internal kernel
- anv: add simple shader support without a command buffer
- anv: move simple shaders code to its own object
- anv: move utrace flush out of backends
- anv: enable utrace timestamp buffer copies on compute engine
- intel: don't assume Linux minor dev node
- intel/ds: lock submissions to u_trace_context
- util/u_trace: count number of tracepoints
- intel/ds: track number of tracepoint timestamp copies
- anv/utrace: trace CPU on timestamp buffer readiness
- intel/ds: avoid dropping traces when running out of shared memory
- anv/iris: widen Wa_14015946265 to Gfx11+
- anv: add missing workaround for 3DSTATE_LINE_STIPPLE
- iris: add missing workaround for 3DSTATE_LINE_STIPPLE
- intel/fs: handle ishl in surface/sampler rematerialization
- intel/fs: handle add3 in surface/sampler rematerialization
- intel/fs: switch from SIMD 1 to 8 instructions surface/sampler rematerialization
- anv: fix internal compute copy shader build
- anv: reduce working temporary memory for BVH builds
- anv: move bo_pool allocation flags to init caller
- anv: use buffer pools for BVH build buffers
- intel/ds: track acceleration RT commands
- anv: fix index buffer size programming
- anv: implement INTEL_DEBUG=reemit
- anv: add missing workaround handling in simple shader
- anv: fix a couple of missing input for 3DSTATE_RASTER programming
- anv: flag 3DSTATE_RASTER as dirty after simple shader primitive
- vulkan: bump headers/registry to 1.3.267
- anv: rename primary in container in ExecuteCommands()
- anv: add support for VK_EXT_nested_command_buffer
- anv: simplify push descriptors
- anv: fixup spirv cap for ImageReadWithoutFormat on Gfx12.5
- Revert "intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL"
- anv: update batch chaining to Gfx9 commands
- anv: workaround Gfx11 with optimized state emission
- u_trace: generate tracepoint index parameter in perfetto callbacks
- u_trace: generate tracepoint name array in perfetto header
- intel/ds: provide names for different events of a timeline's row
- anv: reuse local variable for gfx state
- anv: track render targets & render area changes separately
- anv: don't uninitialize bvh_bo_pool is not initialized
- anv: uninitialize queues before utrace
- anv: move generation shader return instruction to last draw lane
- anv: fix generated draws gl_DrawID with more than 8192 indirect draws
- anv: extract out draw call generation
- anv: identify internal shader in NIR
- anv: avoid MI commands to copy draw indirect count
- anv: move generation batch fields to a sub-struct
- util/glsl2spirv: add ability to pass defines
- anv: factor out host/gpu internal shaders interfaces
- anv: index indirect data buffer with absolute offset
- anv: add ring buffer mode to generated draw optimization
- anv: merge gfx9/11 indirect draw generation shaders
- anv: document the draw indirect optimization ring mode
- anv: fixup 32bit build of internal shaders
- anv: fix uninitialized use of compute initialization batch
- intel/fs: fix dynamic interpolation mode selection
- anv/meson: add missing dependency on the interface header
- anv: fix corner case of mutable descriptor pool creation
- isl: disable MCS compression on R9G9B9E5
- intel/fs: rerun divergence analysis prior to convert_from_ssa
- intel/nir/rt: fix reportIntersection() hitT handling
- anv: fix CC_VIEWPORT pointer dirty after blorp/simple-shaders
- anv: fix dirty state tracking for 3DSTATE_PUSH_CONSTANT_ALLOC
- intel/perf: fix querying of configurations
Louis-Francis Ratté-Boulianne (15):
- panfrost: Fix error in comment
- panfrost: Add methods to determine slice and body alignment
- panfrost: Add method to get size of AFBC subblocks
- panfrost: Precalculate stride and nr of blocks for AFBC layouts
- panfrost: Add panfrost_batch_write_bo
- panfrost: Make panfrost_resource_create_with_modifier public
- panfrost: Split out internal of \`panfrost_launch_grid`
- panfrost: Add infrastructure for internal AFBC compute shaders
- panfrost: Add method to get size of AFBC superblocks valid data
- panfrost: Add support for AFBC packing
- panfrost: Legalize resource when attaching to a batch
- panfrost: Don't force constant modifier after converting
- panfrost: Add debug flag to force packing of AFBC textures on upload
- panfrost: Add some debug utility methods for resources
- panfrost: Add env variable for max AFBC packing ratio
Lucas Stach (33):
- ci/etnaviv: update ci expectation
- etnaviv: move resource seqnos to level
- etnaviv: flush destination before executing blit
- etnaviv: optimize resource copies by skipping clean levels
- etnaviv: add helper to mark resource level as flushed
- etnaviv: add helper to mark resource level as changed
- etnaviv: add helper to transfer resource level age to another
- etnaviv: add helper to get TS validity
- etnaviv: add helper to set TS validity
- etnaviv: move TS meta into etna_resource_level
- etnaviv: add tile status buffer status into TS metadata
- etnaviv: optimize sampler source update
- etnaviv: allow sampler TS even if the resource is flushed
- etnaviv: keep blit destination tile status valid if possible
- etnaviv: optimize render resource update
- etnaviv: optimize transfers when whole resource level is discarded
- etnaviv: split etna_copy_resource_box levels parameter in src/dst
- etnaviv: don't allocate full resource as transfer staging
- etnaviv: check for valid TS as condition to create the staging resource
- etnaviv: reword comment about staging resource usage
- etnaviv: remove huge outdated comment
- etnaviv: move buffer range tracking into the PIPE_MAP_WRITE clause
- etnaviv: remove superfluous braces
- etnaviv: remove always true assert in etna_transfer_unmap
- etnaviv: remove bogus comment about replacing resource storage
- etnaviv: initialize VIVS_GL_BUG_FIXES
- etnaviv: fix read staging buffer leak
- Revert "ci/etnaviv: allow failure on failing test"
- mesa: enable NV_texture_barrier in GLES2+ (again)
- etnaviv: use correct blit box sizes when copying resource
- etnaviv: zero shared TS metadata block
- Revert "etnaviv: use correct blit box sizes when copying resource"
- mesa: add GL_APPLE_sync support
Luigi Santivetti (1):
- pvr: do not claim support for ASTC texture compression
M Henning (31):
- nv50/ir: Drop nir_jump_return handling
- nv50/ir: Remove ArgumentMovesPass
- nv50/ir: Remove Function.stackPtr
- nv50/ir: Remove dead loop from assignSlot
- nv50/ir: Remove SpillSlot
- nvc0: Keep nir directly in nvc0_program
- nv50: Keep nir directly in nv50_program
- nouveau: Delete nv50_ir_from_tgsi.cpp
- nouveau: Drop tgsi support from nv50_ir_prog_info
- nouveau: Drop ConverterCommon::Subroutine
- nouveau: Drop BuildUtil::DataArray
- nouveau: Drop BuildUtil::Location
- nouveau: Delete the nouveau_compiler tool
- nv/codegen: Call nir_shader_gather_info
- nv/codegen: Implement nir_op_fquantize2f16
- nvk: Remove reference to genUserClip
- nv/codegen: Use nir_lower_clip
- nv50_ir_from_nir: Use nir's lower_fpow
- nv/codegen: Delete OP_POW
- nv/codegen: Fix an uninitialized variable warning
- nv/codegen: Delete OP_WRSV
- nv/codegen: Delete OP_EXP, OP_LOG
- nv/codegen: Remove fragCoord variable.
- nv/codegen: Merge from_common into from_nir
- nv/codegen: Remove unused clipVertexOutput var
- nv50_ir_ra: Delete unused functions
- nv/codegen: Delete unused OP_CONSTRAINT
- nv/codegen: Delete periodicMask32
- nv/codegen: Remove Function::buildDefSets
- nv/codegen: Change copy-constructor call to assign
- nv/codegen: Delete copy and assign
Maaz Mombasawala (2):
- svga: Make surfaces shareable at creation.
- svga: Unify gmr and mob surface pool managers
Marcin Ślusarz (16):
- iris: avoid duplicating validation entries
- hasvk: remove dead code & comments related to mesh shading
- anv: drop support for VK_NV_mesh_shader
- intel/compiler: remove NV_mesh_shader support
- intel/compiler: remove redundant code
- anv: drop unused function
- anv: merge cases leading to the same code
- intel/compiler/mesh: compactify MUE layout
- intel/compiler,anv: put some vertex and primitive data in headers
- intel/compiler: load debug mesh compaction options once
- intel/compiler/test: fix crashes when TEST_DEBUG is set
- intel/compiler: add lsc_msg_desc_wcmask
- intel/compiler: add initial support for URB_LOGICAL_SRC_CHANNEL_MASK to lower_urb_write_logical_send_xe2
- intel/compiler/mesh: fix position of output URB handle for xe2
- intel/compiler/mesh: implement IO for xe2
- intel/compiler: mask GS URB handles at thread payload construction
Marek Olšák (125):
- Revert "ac/nir/ngg: Follow intrinsic sources when analyzing before culling."
- glthread: determine global locking once every 64 batches to fix get_time perf
- mesa: fix 38% decrease in display list performance of Viewperf2020/NX8_StudioAA
- freedreno,lima,zink: update CI fixes and flakes
- util/u_queue: fix util_queue_finish deadlock by merging lock and finish_lock
- util/u_queue: always enable UTIL_QUEUE_INIT_SCALE_THREADS, remove the flag
- radeonsi: fix a CDNA regression breaking compute
- glthread: sync for VDPAU sync functions
- radeonsi: turn sh_base[PIPE_SHADER_VERTEX] into a constant in emit_draw_packets
- radeonsi: restructure the loop for non-indexed multi draws
- radeonsi: cosmetic changes to radeon_opt_* macros
- radeonsi: handle draw user SGPRs as tracked registers
- radeonsi: update obsolete comments about compiler queues
- radeonsi: remove si_compute.h, move the contents into si_pipe.h
- radeonsi: move si_update/emit_tess_io_layout_state into si_state_shaders.cpp
- radeonsi: move si_emit_spi_map into si_state_shaders.cpp
- radeonsi: move si_emit_rasterizer_prim_state out of si_emit_all_states
- radeonsi: remove splitting IBs that use too much memory
- radeonsi: add padding to si_resource to fix Viewperf2020/catiav5test1 perf
- radeonsi: remove unused check_mem parameter from si_sampler_view_add_buffer
- radeonsi: remove the draw counter with primitive restart from the HUD
- radeonsi: always inline si_prefetch_shaders
- radeonsi: specialize si_draw_rectangle using a C++ template
- radeonsi: add index parameter into si_atom::emit
- radeonsi: split direct pm4 emission from si_pm4_emit
- radeonsi: move code around si_pm4_emit_state into si_pm4_emit_state
- radeonsi: merge pm4 state and atom emit loops into one
- radeonsi: add a simple version of si_pm4_emit_state for non-shader states
- radeonsi: handle deferred cache flushes as a state (si_atom)
- radeonsi: remove render condition logic from si_draw by reordering atoms
- radeonsi: abort when failing to upload descriptors instead of skipping draws
- radeonsi: rename shader_pointers state -> gfx_shader_pointers
- radeonsi: merge si_upload_*_descriptors into si_emit_*_shader_pointers
- radeonsi: convert si_gfx_resources_add_all_to_bo_list to a state atom
- radeonsi/ci: update gfx11 failures
- radeonsi: move GE_CNTL emission from si_draw into si_emit_vgt_pipeline_state
- radeonsi: use num_patches_per_workgroup directly in si_get_ia_multi_vgt_param
- radeonsi: enable shader culling by default because it helps Viewperf
- radeonsi: rewrite how occlusion query precision is determined for performance
- radeonsi: set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET on aux_context explicitly
- radeon_winsys: move allow_context_lost from cs_create to ctx_create
- winsys/amdgpu: rework how SW reset status is generated and reported
- radeon_winsys: add a ctx_set_sw_reset_status callback
- radeonsi: don't abort for descriptor failures, let the winsys handle it
- radeonsi: don't use threadID.yz/blockID.yz for copy_image if those are always 0
- radeonsi: don't use threadID.yz/blockID.yz for compute_blit if they're always 0
- nir: fix constant evaluation of fddx/fddy sourcing Inf & NaN constant
- nir/algebraic: collapse ALU opcodes sourcing NaN
- ac/gpu_info: add the /dev/dri/ filename into radeon_info
- Revert "ac: don't call ac_query_pci_bus_info from ac_query_gpu_info"
- ac: implement AMD_FORCE_FAMILY properly, remove SI_FORCE_FAMILY
- ac: document ac_shader_args::gs_vtx_offset
- ac: minor updates to packet documentation and definitions
- ac: change offsets of DMA_DATA dwords to prevent reg offset conflicts
- ac: improve the IB parser
- ac: update gfx11 shadowed register tables
- ac: add a standalone IB parser program
- ac/surface: trivial non-functional changes
- ac/surface: add radeon_surf::u::gfx9::uses_custom_pitch
- radeonsi: allow setting any index in radeon_set_sh_reg_idx
- radeonsi: rename uses_subgroup_info to uses_tg_size
- radeonsi: improve the heuristic when to use Wave32 for compute shaders
- radeonsi: simplify/merge emit_shader_ngg functions
- radeonsi: don't pass gl_Layer to PS for blit shaders
- radeonsi/gfx11: pass attribute ring addr via SGPR instead of memory for blits
- radeonsi: fix templated si_draw_rectangle callback for Navi14
- nir: replace undef only used by ALU opcodes with 0 or NaN
- nir: remove nir_op_unpack_64 handling from nir_opt_undef
- ac/llvm: don't convert undef to 0 because nir_opt_undef does it now
- meson: use llvm-config instead of cmake to fix linking errors with meson 1.2.1
- gallivm: fix build with LLVM 18
- amd/llvm: fix build with LLVM 18
- radeonsi: fix compute-only contexts
- ac/llvm: replace removed amdgcn.ldexp for LLVM 18
- ac/perfcounter: remove a bogus assert to fix an assertion failure on gfx11
- ac/llvm: set !fpmath 3.0 for llvm.sqrt
- ac/gpu_info: don't align IBs to the GL2 cache line size
- ac/llvm: fix flat PS input corruption
- amd: rename GFX110x to NAVI31-33
- ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments
- ac/gpu_info: pad IBs according to ib_size_alignment
- winsys/amdgpu: pad gfx and compute IBs with a single NOP packet
- Revert "radeonsi: specialize si_draw_rectangle using a C++ template"
- radeonsi/ci: update navi10 results
- gallium/util: fix GALLIUM_TESTS=1 by using cso_set_vertex_buffers_and_elements
- gallium/util: add more tests for compute-only contexts
- radeonsi: add another aux context for uploading shaders
- radeonsi: upload shaders via a staging buffer so as not to map VRAM directly
- ac/surface: don't require exact pitch for gfx6-8 tiled imports
- Revert "ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG"
- Revert "radv/amdgpu: fix alignment of command buffers"
- Revert "radv: fix alignment of DGC command buffers"
- Revert "winsys/amdgpu: pad gfx and compute IBs with a single NOP packet"
- Revert "ac/gpu_info: pad IBs according to ib_size_alignment"
- Revert "ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments"
- nir: sort variables by location in nir_lower_io_passes to work around a bug
- nir: recompute IO bases after DCE in nir_lower_io_passes
- nir: add dual-slot input information into load_input intrinsics
- nir: take dual slot input info into account when computing IO driver locations
- nir: gather dual slot input information
- nir: expose reusable linking helpers for cloning uniform loads
- nir: handle nir_var_mem_ubo in nir_clone_uniform_variable
- ac/gpu_info: split ib_alignment as ip[type].ib_alignment
- ac/gpu_info: move ib_pad_dw_mask into ip[]
- ac/gpu_info: drop the hack unifying all IB alignments
- ac/gpu_info: conservatively decrease IB alignment and padding to 256B
- ac/gpu_info: set gfx and compute IB padding to only 8 dwords
- winsys/amdgpu: properly pad the IB in amdgpu_submit_gfx_nop
- winsys/amdgpu: correctly pad noop IBs for RADEON_NOOP=1
- winsys/amdgpu: pad gfx and compute IBs with only 1 NOP
- ac/gpu_info: don't allow register shadowing with SR-IOV due to bad performance
- radeonsi: disable register shadowing without SR-IOV to fix bad performance
- winsys/amdgpu: don't send CP_GFX_SHADOW chunk if shadow address is not set
- radeonsi/ci: update gfx1100 results
- nir: split FLOAT_CONTROLS_SIGNED_ZERO_INF_NAN_PRESERVE_FP* flags
- nir/algebraic: use only signed_zero_preserve_* for addition by 0 patterns, etc.
- mesa: don't pass Infs to the shader via gl_Fog.scale
- radeonsi/ci: update the runner for new build scripts
- radeonsi/ci: enable GTF tests in the runner
- radeonsi/ci: enable GLES CTS in the runner
- radeonsi/ci: update failures and flakes
- amd/common: update DCC for gfx11.5
- radeonsi: initialize perfetto in the right place
- radeonsi/gfx11: don't set OREO_MODE to fix rare corruption
- nir: fix gathering TESS_LEVEL_INNER/OUTER usage with lowered IO
Marek Vasut (1):
- etnaviv: Fully replicate back stencil config
Mark Collins (10):
- tu/a7xx: Adapt r3d blits for A7xx
- freedreno/rnn: Remove %n usage in fprintf
- freedreno: Only add drm/computerator when system_has_kms_drm
- freedreno/decode: Support building replay for multiple KMDs
- freedreno+meson: Add lua+libarchive+libxml from Meson WrapDB
- meson: Warn about side-effects from DRM for FD KMDs
- meson: Update libarchive to v3.7.2-2
- freedreno/common: Add max_sets property to A6xxGPUInfo
- tu: Support higher descriptor set count for A7XX
- tu,util/driconf: Add option to not reserve descriptor set
Mark Janes (1):
- intel: allow reduced memory usage for INTEL_MEASURE
Martin Roukala (né Peres) (22):
- radv/ci: drop the auto-reboot-on-hang for vkcts-navi10
- radv/ci: use the default kernel on vkcts-navi10
- zink/ci: automatically reboot when hitting a kernel BUG on vangogh
- zink/ci: document more flakes seen on vangogh
- radv/ci: move vkcts-navi10 testing to KWS
- radv/ci: add more tests to the navi10 vkcts flake list
- radv/ci: increase the parallelism of the vkcts-navi21 job
- radv/ci: add more tests to the navi21 vkcts flake list
- radv/ci/vkcts-navi21: catch all the line_stipple_(enable|params) flakes
- radv/ci/vkcts-navi21: document more flakes
- radv/ci/vkcts-navi10: catch all the line-related flakes
- radv/ci: update the vkcts gfx1100 flake/fail lists
- radv/ci: add a manual job to run vkcts on navi31
- radv/ci: add a manual job for vkd3d-proton on navi31
- ci/vkcts-vangogh: mark dEQP-VK.dynamic_rendering.primary_cmd_buff.basic.* as flake
- ci/vkcts-navi21: mark more of the RT handles checks as flakes
- ci: make B2C_JOB_VOLUME_EXCLUSIONS to all .b2c-test jobs
- zink/ci: remove 19 tests from the zink-radv-polaris10-fails list
- ci/b2c: switch containers to a back-up ahead of valve-infra renaming
- zink/ci: remove 42 tests from the zink-radv-polaris10-fails list
- radv/ci: tighten the vkcts-navi21 timeouts
- zink/ci: tighten the zink-radv-vangogh timeouts
Martin Stransky (1):
- llvmpipe: fix UAF in lp_scene_is_resource_referenced.
Mary (6):
- nouveau/mme: Add initial Fermi definition
- nouveau/mme: Add Fermi builder
- nouveau/mme: Add Fermi simulator
- nouveau/mme: Add Fermi hardware tests
- agx: Move nir_lower_fragcolor out of agx_preprocess_nir
- agx: Ensure to lower 1D image load/store to 2D
Mary Guillemard (4):
- nir: Add NVIDIA-specific geometry shader opcodes
- venus: skip bind sparse info when checking for feedback query
- zink: Check for VK_EXT_extended_dynamic_state3 before setting A2C
- venus: Do not submit batch manually when no feedback is required
Matt Coster (21):
- pvr: Pad rogue_regarray_cache_key union members to avoid UB
- pvr: Clean up extension tables
- pvr: Refactor pvr_GetPhysicalDeviceProperties2()
- docs: Fixup imagination/pvr extension support
- pvr: Add VK_KHR_get_display_properties2
- pvr: Add VK_KHR_get_memory_requirements2
- pvr: Add VK_KHR_get_surface_capabilities2
- pvr: Print VkStructureType name on pvr_debug_ignored_stype()
- pvr: Add VK_KHR_copy_commands2
- pvr: Don't override commands copied to new buffer when extending cs
- pvr: Do not require TA_STATE_HEADER.pres_ispctl_dbsc for {db,sc}enable
- pvr: Zero tail of cs buffers after linking when dumping cs
- pvr: Cleanup comments in pvr_physical_device_get_supported_*()
- pvr: Don't rely on GNU void pointer arithmetic
- pvr: Force compile error on GNU void pointer arithmetic
- pvr: Switch to common pipeline cache implementation
- pvr: Use vk_sampler base
- pvr: Clean up & fix sampler border color support
- pvr: Don't pass pvr_physical_device when only device info is needed
- pvr: Minor refactor of pvr_device.c
- pvr: Use common physical device properties
Matt Turner (10):
- Revert "intel/fs: only avoid SIMD32 if strictly inferior in throughput"
- intel: Rearrange for next commit
- intel: Consider with_intel_clc in with_any_intel
- intel: Only build blorp if drivers are enabled
- intel: Only build ds if drivers are enabled
- intel: Only build perf if drivers or tools are enabled
- intel: Allow using intel_clc from the system
- intel: Limit Intel Vulkan RT to x86_64
- r600: Add missing dep on git_sha1.h
- util: Include stdint.h in libdrm.h
Mauro Rossi (7):
- nouveau/ws: fix building error in nouveau_ws_push_dump()
- vulkan/meta: fix gnu-empty-initializer build error
- nouveau/mme: fix print inst for case MME_FERMI_OP_MERGE
- anv/android: remove numFds check
- hasvk/android: remove numFds check
- Android.mk: filter out cflags to build with Android 14 bundled clang
- Android.mk: disable android-libbacktrace to build with Android 14
Mike Blumenkrantz (293):
- ci: bump VVL to 1.3.257
- zink: set pipeline dynamic state count after all dynamic states are set
- zink: set feedback attachments on batch init
- zink: be even dumber about buffer refs when replacing storage
- zink: emit SpvCapabilitySampleMaskPostDepthCoverage with SpvExecutionModePostDepthCoverage
- zink: fix the fix for separate shader program refcounting
- kopper: handle pixmap creation failure more gracefully
- glxsw: check geometry of drawables on creation
- kopper: move pixmap param for drawable creation to info struct
- glx/dri3: split out modifier check
- glx/sw: check for modifier support in the kopper path
- kopper: pass modifier availability to drawable creation
- kopper: determine modifier support per-drawable
- zink: don't clobber descriptor mode on multiple screen creation
- nir: fix slot calculations for compact variables with location_frac
- lavapipe: use the component offset directly for xfb
- nir: add a helper for calculating variable slots
- radv: bump max xfb output to 128
- ir3: bump max xfb output to 128
- gallium: bump PIPE_MAX_SO_OUTPUTS to 128
- zink: add feedback loop exts to optimal profile
- glsl: only explicitly check GS components in PSIZ injection with output variables
- lavapipe: statically allocate fb attachment array
- lavapipe: zero fb attachment array at rp start
- lavapipe: don't check geometry for fb attachments
- lavapipe: be slightly more permissive for bad apps (and cts) with dynrender
- lavapipe: VK_EXT_host_image_copy
- zink: better handle separate shader dsl creation when no bindings exist
- zink: force image barriers after dmabuf import
- ci: bump VVL to 1.3.261
- zink: use VK_WHOLE_SIZE when binding null db buffer descriptors
- zink: unset line stipple ds3 state flags when stipple not available
- nir/lower_io_to_scalar: fix 64bit io splitting
- nir/linking_helpers: force type matching in does_varying_match
- nir/print: print location names for (some) tess slots
- nir/print: always group variables by type when printing
- zink: add batch refs for transient images
- zink: fix zs resolve attachment indexing
- zink: don't add VK_IMAGE_USAGE_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT for transient images
- zink: don't append msrtss to dynamic render if not supported
- zink: set msrtss depth resolve mode when enabled
- zink: hook up VK_KHR_workgroup_memory_explicit_layout
- zink: propagate have_workgroup_memory_explicit_layout to ntv
- zink: use SPV_KHR_workgroup_memory_explicit_layout when available
- zink: add more locking for pipeline cache
- zink: add VK_PIPELINE_CACHE_CREATE_EXTERNALLY_SYNCHRONIZED_BIT_EXT
- aux/trace: fix winsys handle dumping
- zink: generated tcs is on the tes, not the vs
- zink: apply ZINK_DEBUG=noopt to linked separate shaders
- gallivm: handle A8_UNORM image stores
- llvmpipe: enable A8_UNORM for shader images
- llvmpipe: export PIPE_CAP_IMAGE_LOAD_FORMATTED
- lavapipe: GetRenderingAreaGranularityKHR
- llvmpipe: block weird uses of subsampled formats in buffers
- llvmpipe: fix early depth + alpha2coverage + occlusion query interaction
- lavapipe: fix BindVertexBuffers2 buffer size handling
- lavapipe: fix resolves where src image has a layer offset
- lavapipe: block yuv formats from getting blit feature flags
- lavapipe: BindIndexBuffer2
- lavapipe: GetDeviceImageSubresourceLayoutKHR
- lavapipe: VK_REMAINING_ARRAY_LAYERS for copy ops
- lavapipe: maintenance5
- zink: fix xfb buffer array sizing to use buffer limit, not output
- zink: move ZINK_DEBUG=nir printing to just before compile
- draw: fix so debug offset printing
- zink: reindex ssa defs before dumping debug shaders
- lavapipe: zero-init pipe_sampler_state
- zink: explicitly set non-optimal last_vertex_stage shader key on ctx create
- zink: fix big tcs output io
- zink: don't try to replace separate shader prog in noopt mode
- zink: pre-convert mode in fixup_io_locations
- zink: add a special separate shader i/o mode for legacy variables
- nir: minor fixes for io_to_scalar
- nir/lower_io: add a new doubles-only 64bit lowering option
- nir: add a filter cb to lower_io_to_scalar
- d3d10umd: use cso_context to set vertex buffers and elements
- virgl: move virgl_vertex_elements_state to header
- virgl: fix some indentation
- nouveau: calloc vertex csos
- gallium: move vertex stride to CSO
- zink: fix null config screen creation
- zink: fix crash in lower_pv_mode_gs_store
- u/draw: skip zero-sized indirect draws
- lavapipe: handle VkPipelineCreateFlagBits2KHR
- lavapipe: handle VkBufferUsageFlags2KHR
- zink: ci updates
- zink: track start/stop of a couple query types
- zink: require EDS1 for CWE usage
- zink: unset primgen suspended flag when ending a primgen query
- zink: rework rast-discard for primgen queries
- zink: rip out some awkward parts of the old non-cwe path
- zink: drop CWE requirement for renderpass tracking with primgen queries
- nir/zink: fix gs emulation xfb_info sizing
- zink: move fragcolor lowering further along the compile process
- zink: add a mode param to find_var_with_location_frac
- zink: use lowered io (kinda) for i/o vars
- zink: stop lowering indirect derefs
- ntt: handle interp intrinsics as derefs
- zink: delete split_blocks pass
- zink: delete lower_64bit_vertex_attribs pass
- zink: fix clip/cull dist xfb inlining
- zink: delete all the extra gross xfb handling
- zink: stop using pipe_stream_output
- zink: remove pipe_stream_output from function params
- zink: ci updates
- aux/trace: print bindless handles as pointers
- zink: remove unused param from create_ici
- zink: split create_ici to init and eval
- zink: add maintenance extensions to profile
- zink: use maintenance5
- zink: use real A8_UNORM when possible
- vk/graphics: fix CWE handling with DS3
- Revert "vk/wsi/x11: handle geometry updating more asynchronously"
- r600: store the mask of buffers used by a vertex state
- r600: better tracking for vertex buffer emission
- zink: wait on async fence during ctx program removal
- zink: handle patch variable locations for separate shaders better
- zink: don't start multiple cache jobs for the same program
- zink: use the "set" optimal key for prog last_variant_hash for consistency
- zink: sanitize optimal keys
- zink: copy some cs shader properties to the program struct
- zink: handle global atomic intrinsics
- zink: use Aligned with global load/store ops
- zink: fix rewrite_read_as_0 filtering
- rusticl: fixes for zink shader images
- zink: pass KERNEL shaders through successfully
- zink: add a618 flake
- zink: break out ds3 state resetting
- zink: be consistent with ds3 state resetting for blits
- zink: fix optimal_keys warning message
- zink: force-reset unordered flags for buffer barriers on non-matching batch access
- zink: reset unordered flags for image barriers on non-matching batch access
- zink: make image barrier init functions void return
- zink: simplify some image barrier conditionals
- zink: remove sync TODO
- zink: add lavapipe flake
- ci: disable nouveau shaderdb
- egl/dri3: only set driver_name if not already set
- egl: call dri3_x11_connect() for zink
- egl: bind dri2_set_WL_bind_wayland_display for zink when necessary
- zink: be more precise about flagging rp changes around unordered u_blitter
- zink: don't block reordering during ref updates in unordered blits
- lavapipe: update vbo indices before propagating stride
- lavapipe: fix pipeline stride propagation
- zink: fix linear modifier dmabuf imports
- zink: polaris ci updates
- aux/tc: handle stride mismatch during rp-optimized subdata
- zink: always add a per-prog ref for gpl libs
- zink: use a pointer to simplify submit struct mechanics
- zink: make zink_resource_image_barrier2_init public
- zink: add a third submitinfo (unused for now)
- zink: make submitinfo handling easier to manage with enum
- zink: add another submitinfo for fd semaphore waits
- zink: add a screen cache for fd semaphores
- zink: add a util for getting cached fd semaphores
- zink: hook up cached fd semaphore usage for batch signal/waits
- zink: handle implicit sync for dmabufs
- zink: handle multi-plane implicit sync
- zink: ci updates
- zink: set is_xfb=false for all i/o variables
- zink: reorder bindless io lowering
- zink: fix typing on bindless io lowering
- zink: delete some bindless io lowering code
- zink: use nir_io_semantics::num_slots for indirect var creation
- zink: simplify an arrayed io check during variable creation
- zink: use explicit stride from types instead of copying old_var stride
- zink: use MAX_PATCH_VERTICES directly for arrayed io var sizing
- zink: use explicit sizing for builtins when creating variables
- zink: create new vars without copying existing ones
- zink: add a new linker pass to handle mismatched i/o components
- zink: use right function to get src_type in eliminate_io_wrmasks
- zink: re-rework i/o variable handling to make having variables entirely optional
- ci: bump VVL to 1.3.263
- zink: simplify redundant is_buffer check
- zink: use VkFormatProperties3
- lavapipe: handle VkHostImageCopyDevicePerformanceQueryEXT
- lavapipe: don't advertise UNDEFINED layout for HIC
- zink: hook up VK_EXT_host_image_copy
- zink: move mem type detection up in file
- zink: disable HIC without resizable BAR
- zink: add a fixup method for extra driver props
- zink: fix some off-by-one indentation
- zink: use some return codes for check_ici errors
- zink: check/use suboptimal HIC during ici init
- zink: use HIC for image subdata when possible
- zink: slightly refactor psiz deletion during linking
- zink: delete all psiz=1.0 stores if maintenance5 is present
- nir/inline_uniforms: fix oob access with nir_find_inlinable_uniforms
- zink: add ZINK_DEBUG=quiet
- zink: imply ZINK_DEBUG=quiet if ZINK_DEBUG=optimal_keys is set on turnip
- zink: set optimal_keys for turnip jobs
- aux/tc: fix staging buffer sizing for texture_subdata
- aux/tc: fix address calc for segmented texture subdata
- zink: ci updates
- lavapipe: KHR_map_memory2
- zink: slightly refactor pipeline compile selection
- zink: add a flag for combined pipeline compile for doing FAIL_ON_PIPELINE_COMPILE_REQUIRED
- zink: remove an intermediate variable in pipeline compile selection
- zink: use FAIL_ON_PIPELINE_COMPILE_REQUIRED for GPL path
- zink: pass a stage mask to pipeline create functions
- glsl: check for xfb setting xfb info
- zink: don't warn about missing scalarBlockLayout on v3dv
- aux/tc: fix renderpass tracking fb state clobber scenario
- vk/enum2str: add more max enum vendors
- aux/tc: fix rp info handling around tc_sync calls
- aux/tc: don't use pipe_buffer_create_with_data() for rp-optimized subdata
- zink: flag db maps as unsynchronized
- lavapipe: clamp cache uuid size
- lavapipe: EXT_load_store_op_none
- tu: handle unused color attachments without crashing
- zink: use much bigger dummy surfaces
- zink: propagate rp_tc_info_updated across unordered blits
- zink: use null attachments for null attachments with dynamic render
- egl/swrast: expose EXT_swap_buffers_with_damage and EXT_present_opaque
- egl/wayland: split out wl drm extension init
- egl/wayland: use more registry listeners to better handle device init
- egl/wayland: enable WL_bind_wayland_display for zink
- zink: delete injected pointsize during shader creation
- zink: require maintenance5 for shobj
- zink: delete a non-maintenance5 workaround for shobj use
- lavapipe: set separate_shaders for shader objects
- zink: set workgroup_memory_explicit_layout for shader validation
- zink: add a ZINK_DEBUG=validation alias
- zink: fix semaphore signal ordering
- zink: move swapchain fence to swapchain object
- zink: avoid UAF on wayland async present with to-be-retired swapchain
- zink: always trace_screen_unwrap in acquire
- lavapipe: fix variable descriptor count support handling
- lavapipe: always set independent blend
- lavapipe: more vertex stride fixups
- lavapipe: set default viewport and scissor count for cmdbufs
- lavapipe: set default min sample shading to 1
- glx: XFree visual info
- radv: fix external handle type queries for dmabuf/fd
- zink: fix crashing in image rebinds
- zink: move push descriptor disable to driver workarounds
- zink: move v3dv scalarBlockLayout workaround
- zink: fix end-of-batch barrier pipeline stages
- zink: guarantee egl syncobj lifetime
- aux/trace: dump enum names for map usage
- gallium: add PIPE_MAP_NONE
- Revert "egl/wayland: Add image loader extension for swrast"
- egl/wayland: don't block in swrast when updating buffers for zink
- egl/wayland: return sooner from swrast_update_buffers() if zink
- zink: don't check submit count for unflushed usage
- egl: don't set ForceSoftware for all zink loading
- zink: error at handle export on missing EXT_image_drm_format_modifier
- gbm: delete some zink handling
- zink: apply ZINK_DEBUG=quiet to all missing feature warnings
- zink: set ZINK_DEBUG=quiet for polaris jobs
- lavapipe: don't block begin/end cmdbuf pipeline barriers
- ci: add a630 trace flakes
- zink: shrink vectors during optimization
- zink: always clamp shader stage in descriptor handling
- zink: add set_global_binding
- zink: eliminate samplers from no-sampler CL texops
- zink: add some checks to determine whether queue is init on screen destroy
- zink: don't destroy any simple_mtx_t objects during screen destroy
- zink: don't destroy uninitialized disk cache thread
- zink: reorder glsl_type_singleton_init_or_ref call
- zink: use screen destructor for creation fails
- zink: fix readback_present locking
- zink: add automatic swapchain readback using heuristics
- lavapipe: VK_EXT_nested_command_buffer
- zink: ignore unacquired swapchain images during end-of-frame flush
- nir/lower_fragcolor: preserve location_frac
- zink: update pointer for GPL pipeline cache entry formats
- zink: fix legacy depth texture rewriting for single component reads
- egl: unify dri2_egl_display creation
- egl: init dri3 version info during screen creation
- egl/glx: don't load non-sw zink without dri3 support
- egl: add automatic zink fallback loading between hw and sw drivers
- glx: add automatic zink fallback loading between hw and sw drivers
- ci: don't set GALLIUM_DRIVER for zink
- egl/wayland: only add more registry listeners for hardware devices
- zink: only increment image_rebind_counter on image export if binds exist
- zink: check for sampler view existence during zink_rebind_all_images()
- zink: use weston for anv ci
- zink: blow up broken xservers more reliably
- zink: delete some dead modifier handling
- ci: skip implicit modifier piglits for zink
- zink: don't block large vram allocations
- zink: add copy box locking
- zink: emit SpvCapabilitySampleRateShading with SampleId
- zink: always set VK_EXTERNAL_MEMORY_HANDLE_TYPE_HOST_ALLOCATION_BIT_EXT for usermem
- zink: clamp resolve extents to src/dst geometry
- zink: only emit xfb execution mode for last vertex stage
- aux/u_transfer_helper: set rendertarget bind for msaa staging resource
- zink: unset explicit_xfb_buffer for non-xfb shaders
- mesa/st/texture: match width+height for texture downloads of cube textures
- zink: add more locking for compute pipelines
- radv: correctly return oom from the device when failing to create a cs
- zink: check for cbuf0 writes before setting A2C
Mohamed Ahmed (19):
- vulkan/util: Support 10-bit and 12-bit color formats in ycbcr_info in vk_format.c
- vulkan/util: Support VK_EXT_ycbcr_2plane_444_formats color formats in vk_format.c
- vulkan/util: Use ycbcr_info for multiplane helpers in vk_format.c
- nvk: implement vkGetDeviceImageMemoryRequirementsKHR()
- nvk: add stub for vkGetDeviceImageSparseMemoryRequirementsKHR()
- nvk: implement vkGetDeviceBufferMemoryRequirementsKHR()
- nvk: advertise VK_KHR_maintenance4
- nvk: advertise DemoteToHelperInvocation
- nvk: Enable multiplane images and image views
- nouveau/nvk: Add YCbCr sampler NIR lowering pass
- nouveau/nvk: Support multi-plane descriptors in nvk_nir_lower_descriptors.c
- nouveau/nvk: Create helper function for sampler creation
- nouveau/nvk: Add multiple sampler planes for CONVERSION_SEPARATE_RECONSTRUCTION_FILTER_BIT
- nouveau/nvk: Enable VK_KHR_sampler_ycbcr
- util/format: Add G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM formats
- vulkan/format: Translate G8B8G8R8_422_UNORM and B8G8R8G8_422_UNORM properly
- nvk: Enable SEPARATE_RECONSTRUCTION_FILTER_BIT for multi-planar formats only
- nvk: Enable MIDPOINT_CHROMA_SAMPLES_BIT for multi-planar formats only
- nil: Add support for G8B8_G8R8_UNORM and B8G8_R8G8_UNORM
Nanley Chery (33):
- iris: Remap DRM_FORMAT_MOD_INVALID more often during import
- anv: Don't support ASTC images with modifiers
- intel: Add and use isl_drm_modifier_get_plane_count
- anv: Handle explicit surface layout of DG2_RC_CCS
- anv: Reduce accesses of isl_mod_info->aux_usage
- iris: Reduce accesses of mod_info->aux_usage
- crocus: Delete modifier with aux code
- hasvk: Delete modifier with aux code
- iris: Swap stencil and modifier aux assignment order
- intel: Describe modifier compression with booleans
- intel/isl: Move the Tile4 modifier score case down
- intel/isl: Add a score for DG2_RC_CCS
- intel/blorp: Ambiguate after CCS resolves on gfx7-8
- iris: Reorder render_aux_usage parameters
- iris: Pass the render format to prepare_render
- iris: Create BLORP surfaces after resource preparation
- iris: Handle clear color compatibility in prepare_render
- iris: Sample more texture view fast-clears on gfx11+
- iris: Fix aux usage tracking in prepare_render
- iris: Fix iris_copy_region calls involving FCV_CCS_E
- iris: Drop get_copy_region_aux_settings
- iris: Inline iris_can_sample_mcs_with_clear
- anv: Initialize the clear color more often for FCV
- intel: Return a bool from intel_aux_map_add_mapping
- anv: Move scope of CCS binding determination
- anv: Allocate space for aux-map CCS in image bindings
- anv: Wrap aux surface image binding queries
- anv: Refactor CCS disabling at image bind time
- anv: Place images into the aux-map when safe to do so
- anv: Loosen anv_bo_allows_aux_map
- anv: Meet CCS alignment reqs with dedicated allocs
- anv: Delete implicit CCS code
- intel/isl: Add scores for GEN12_RC_CCS and MTL_RC_CCS
Neal Gompa (1):
- asahi: Fix 32-bit x86 build with correct data type for overflow error message
Neha Bhende (1):
- ntt: lower indirect tesslevels in ntt
Paul Gofman (2):
- driconf: add a workaround for Captain Lycop: Invasion of the Heters
- driconf: add a workaround for Rainbow Six Extraction
Paulo Zanoni (15):
- anv: rename the vm_bind vfuncs
- anv: add a new vm_bind vfunc
- anv/xe: make vm_binds async
- anv/xe: return failure in case waiting for the vm_bind syncobj fails
- anv: remove misleading comment about batch_len
- iris: assert bufmgr->bo_deps_lock is held
- iris: avoid stack overflow in iris_bo_wait_syncobj()
- iris: assert(bo->deps) after realloc()
- intel/isl: add ISL_SURF_USAGE_SPARSE_BIT
- intel/isl: simplify the check for maximum surface size
- anv/sparse: add the initial code for Sparse Resources
- anv/sparse: get ready to issue a single vm_bind ioctl per non-opaque bind
- anv/sparse: add INTEL_DEBUG=sparse
- anv: enable sparse resources by default
- vulkan: fix potential memory leak in create_rect_list_pipeline()
Pavel Ondračka (44):
- r300: update RV370 failures
- r300: check for index overflow when translating from TGSI
- r300: source register index is always unsigned
- r300: bump the RC_MAX_INDEX_BITS
- r300: normal instruction can't have presubtract op
- r300: add a helper for checking number of temporary sources
- r300: cycles estimate for shader-db
- r300: fix cycles calculation
- r300: don't abort on flow control when using draw for vs
- r300: add dEQP baseline for RV370 with forced swtcl
- r300: copy ntt to r300 compiler
- r300: add lower_sqrt to nir option
- r300: remove unused intrinsics in ntr
- r300: remove irrelevant opcodes in ntr
- r300: remove unused integer support in ntr
- r300: remove ntr_tgsi_usage_mask
- r300: remove more unused 64-bit pieces from ntr
- r300: simplify vectorization rules
- r300: remove more ntr unused helpers
- r300: remove the unneeded ntr_lower_vec_to_reg callback
- r300: remove unneeded 64bit and atomic lowering passes
- r300: remove unused ntr default settings
- r300: remove ntr default options
- r300: simplify ntr_emit_load_ubo
- r300: simplify ntr_emit_load_input
- r300: remove some virglrenderer specifics from ntr
- r300: simplify ntr_setup_uniforms
- r300: simplify ntr_output_decl
- r300: simplify ntr_try_store_in_tgsi_output
- r300: remove some unsupported texture opcodes
- r300: remove unused barrier code from ntr
- r300: simplify ntr_get_gl_varying_semantic
- r300: remove the nrt main optimization loop
- r300: reorder for easier presubtract 1-x pattern recognition
- r300: exit early in presubtract is not supported
- r300: implement bias presubtract
- r300: convert x * 2 into x + x for presubtract
- r300: move power of two multipliers down
- r300: there is no limitation on presubtract source file
- r300: use w channel for scalar opcodes if possible
- r300: reduce number of iterations for vertex shader loops
- r300: enable nir_move_vec_src_uses_to_dest
- nir/move_vec_src_uses_to_dest: skip reuse if vec is used only once in store_output
- nir/move_vec_src_uses_to_dest: allow to skip reuse of constant sources
Philipp Zabel (1):
- etnaviv: fix segfault after compile failure
Pierre-Eric Pelloux-Prayer (18):
- radeonsi/sdma: use multiple commands if required
- radv/sdma: use multiple commands if required
- radv/sdma: use correct limits for gfx10.3
- glx: drop the 'libGL' log prefix
- loader: refactor DRI_PRIME handling code
- loader: extend DRI_PRIME to support =N
- loader: add DRI_PRIME_DEBUG env var
- device_select_layer: support DRI_PRIME=n
- docs: update DRI_PRIME documentation
- device_select: add shortcut for MESA_VK_DEVICE_SELECT_FORCE_DEFAULT_DEVICE
- st/mesa: check renderbuffer before using it
- radeonsi: emit framebuffer state after allocating cmask
- amd/common: update addrlib for gfx11.5
- amd/common: add registers for gfx11.5
- ac/nir: extract must_wait_attr_ring helper
- amd, radeonsi: Add code to enable gfx11.5
- mesa: restore call to _mesa_set_varying_vp_inputs from set_vertex_processing_mode
- radeonsi: check sctx->tess_rings is valid before using it
Piotr Kocia (2):
- nir: Remove dead nir_const_value variables
- glsl: ir_function_param_visitor::visit_enter always true condition
Qiang Yu (77):
- aco,radv: replace tess_input_vertices shader info param
- radeonsi: aco does not pass LS outputs to HS by arg
- radeonsi: extract si_get_prev_stage_nir_shader to be shared with aco
- radeonsi: init aco shader info for merged LS/HS
- radeonsi: simplify si_build_wrapper_function
- radeonsi: move vertex shader vb desc input sgpr args to last
- radeonsi: remove param type check in wrapper function
- radeonsi: refine si_llvm_ls_build_end
- radeonsi: refine si_llvm_es_build_end
- radeonsi: aco compile support merged mono shader
- radeonsi: calculate lds size for merged shaders
- radeonsi: enable aco compile for mono merged LS/HS
- radeonsi: enable aco compile for mono merged ES/GS
- aco: extract aco_compile_shader_part from aco_compile_ps_epilog
- aco: add p_end_with_regs pseudo instruction
- aco: move jump to epilog out of ic_merged_wave_info
- aco: add tcs end regs for epilog usage
- aco: allow tcs with epilog to keep nir store output instruction
- aco: add pending_lds_access option for insert waitcnt
- aco: add tcs epilog generation for radeonsi
- aco: don't emit s_endpgm for tcs with epilog
- aco: skip scratch init when no scratch arg provide
- aco,radeonsi: save const addr to symbol
- ac/nir/tess: move tess factor output out of control flow
- aco: use semantic location as io temp index
- radeonsi: add exec_size to shader binary
- radeonsi: support upload multi part shader binary
- radeonsi: share si_get_tcs_out_patch_stride with aco
- radeonsi: fill part mode tcs aco shader info
- radeonsi: extract si_llvm_build_shader_part
- radeonsi: remove separate_prolog arg from prolog/epilog build
- radeonsi: add si_get_tcs_epilog_args
- radeonsi: change si_fill_aco_options args
- radeonsi: add si_aco_build_shader_part
- radeonsi: part mode standalone tcs support aco compile
- radeonsi: remove unused arg of get_tcs_tes_buffer_address
- aco: simplify setup_tcs_info
- aco: pass sw_stage when setup_isel_context
- aco: prepare fix_ls_vgpr_init_bug to be used by gl vs prolog
- aco: add vs prolog instruction selection for radeonsi
- aco: add aco compile interface for radeonsi vs prolog
- aco: do not fix_exports when program is prolog
- radeonsi: fill aco_shader_info->is_monolithic
- radeonsi: remove is_monolithic from vs prolog key
- radeonsi: extract si_get_vs_prolog_args to be shared with aco
- radeonsi: fix aco options has_ls_vgpr_init_bug setup
- radeonsi: add vs prolog aco build
- radeonsi: set vs has prolog aco shader info
- radeonsi: enable aco compile for part mode standalone vs
- aco,radv,radeonsi: rename is_monolithic to merged_shader_compiled_separately
- ac,radeonsi: move ps arg pos_fixed_pt to ac_shader_args
- aco: do not eliminate final exec write when p_end_with_regs block
- aco: remove p_end_with_regs from needs_exact()
- aco: add ps prolog generation for radeonsi
- aco: handle ps outputs from radeonsi
- aco: add create_fs_end_for_epilog for radeonsi
- aco,radv: remove unused ps epilog info fields
- aco,radv: rename ps epilog info inputs to colors
- aco: simplify export_fs_mrt_color
- aco,radv: add radeonsi spec ps epilog code
- aco: compact ps expilog color export for radeonsi
- aco,radv,radeonsi: pass spi ps input ena and addr
- aco: do not fix_exports when program has epilog
- aco: fix assertion fail when program contains empty block
- aco: create exit block for p_end_with_regs to branch to
- aco: wait memory ops done before go to next shader part
- radeonsi: reduce sgpr count for scratch_offset when aco
- radeonsi: init spi_ps_input_addr for part mode ps
- radeonsi: extract si_prolog_get_internal_binding_slot
- radeonsi: extract si_get_ps_prolog_args to be shared with aco
- ac,radeonsi: remove unused ps prolog key fields
- radeonsi: add ps prolog shader part build
- radeonsi: extract si_get_ps_epilog_args to be shared with aco
- radeonsi: fill aco shader info for ps part
- radeonsi: add ps epilog shader part build
- radeonsi: enable aco compile for part mode ps
- radeonsi: disable disk cache when use aco
Rebecca Mckeever (32):
- vulkan/runtime: Add helper functions for VK_EXT_host_image_copy
- nouveau/codegen: Support nir_intrinsic_load_workgroup_id_zero_base
- nouveau/codegen: Set lower_device_index_to_zero
- nvk: Convert system values for gl_PointCoord and PointCoord into inputs
- nvk: Add base_group to root descriptor table
- nvk: Lower base_workgroup_id
- nvk: Implement nvk_CmdDispatchBase and delete nvk_CmdDispatch
- nvk: Advertise KHR_device_group
- nvk: Add VK_FORMAT_B4G4R4A4_UNORM_PACK16 format to nil_format_info table
- nvk: Add A4B4G4R4 formats to nil_format_info table
- nvk: Advertise EXT_4444_formats
- nvk: Enable shadow sampling
- nvk: Implement VK_EXT_non_seamless_cube_map
- nouveau/nil: Add macros for ufixed
- nvk: Implement VK_EXT_image_view_min_lod
- nvk: Update mutable descriptor struct type
- nvk: Replace asserts with conditional that sets type_list = NULL
- nvk: Implement nvk_GetDescriptorSetLayoutSupport
- nvk: Enable VK_KHR_maintenance3
- nvk: Advertise VK_EXT_mutable_descriptor_type
- nvk: Set image index to zero for NULL nvk_buffer_view
- nvk: Advertise VK_EXT_image_robustness
- nvk: Advertise VK_EXT_robustness2
- nvk: Add view_index to root descriptor table
- nvk: Lower nir_intrinsic_load_view_index
- nvk: Add draw support for multiview
- nvk: Add query support for multiview
- nvk: Add input attachments support for multiview
- nvk: Advertise VK_KHR_multiview
- nvk: Load view_mask to shadow scratch in nvk_CmdBeginRendering
- nvk: Combine CLEAR_VIEWS and CLEAR_LAYERS MME macros
- nvk: Move code inside view mask loops to a helper function
Rhys Perry (89):
- ac/llvm: fix AC_TM_CHECK_IR
- radv: fix radv_get_ballot_bit_size with CS
- ac/llvm: fix wave32 ac_build_mbcnt_add with 64-bit mask
- ac/llvm: skip ballot zext for 32-bit dest with wave32-as-wave64
- radv: add conformant_trunc_coord to cache UUID
- radv: don't unset TRUNC_COORD if conformant_trunc_coord=true
- ac/nir: always round cube array layers
- nir/unsigned_upper_bound: fix phi(bcsel)
- nir/tests: add test for unsigned_upper_bound with loop header phis
- nir/opt_dead_cf: remove nodes after a jump earlier
- nir/tests: add nir_opt_dead_cf_test.jump_before_constant_if
- aco: insert s_nop before VGPR deallocation
- nir/lower_shader_calls: vectorize stack access for all shaders
- radv: workaround WWZ exporting index=1 through location=1
- radv: correctly skip MRT output NaN fixup for meta shaders
- radv: don't set vertex_attribute_strides on GFX8+
- radv/ci: skip some mesh shader tests on GFX1100
- aco: summarize register demand after handling branches
- aco: don't create sendmsg(dealloc_vgprs) if scratch is used
- radv: disable 64-bit color attachments
- radv: fix 128bpp comp-to-single clears
- radv: support 128bpp comp-to-single with all colors
- radv/gfx11: re-enable 0001/1110 clear values
- nir/lower_shader_calls: fix align_offset
- nir/opt_load_store_vectorize: support scratch access
- radv: vectorize RT stack access
- radv: vectorize scratch access
- aco: fix p_bpermute_gfx6 with input at non-zero byte
- aco: fix p_bpermute_gfx6's exec save/restore with wave32
- aco: clarify bpermute pseudo opcode names
- aco: add adjust_bpermute_dst helper
- aco/spill: skip p_branch in process_block
- aco/spill: add all live-in to merge block spill candidates
- nir/lower_system_values change num_workgroups to uint32_t
- radv: optimize mesh workgroup ID using ts_mesh_dispatch_dimensions
- radv: use shortcut_1d_workgroup_id
- aco: remove fast path in insert_exec_mask's process_instructions
- aco/optimizer_postRA: check overwritten_subdword in is_overwritten_since()
- aco: check logical_phi_info at p_logical_end when eliminating exec writes
- aco: remove unused p_logical_end check when optimizing branching sequence
- radv: disable mesh dispatch XYZ_DIM when possible
- nir/deref: remove rematerialize_deref_in_block cache
- aco: reset prefetch in the correct block after removing the exit
- aco/waitcnt: replace wait_cnt::\*_cnt with booleans
- aco/waitcnt: add print helpers
- nir/lower_int64: fix find_lsb(0)
- nir/algebraic: optimize u2u32(a >> 32)
- aco/optimizer_postRA: don't combine DPP across exec on GFX8/9
- aco: don't combine DPP into v_cmpx
- aco: disable zero offset optimization for strict WQM coords
- nir/constant_folding: remove zero texel offset
- aco: remove zero offset optimization
- aco: shrink DPP8_instruction
- aco: add fetch_inactive field to DPP instructions
- nir: add fetch inactive index to quad_swizzle_amd/masked_swizzle_amd
- aco: disable FI for quad/masked swizzle
- aco: fix LdsDirectVMEMHazard WaW with the wrong waitcnt
- aco: only mitigate VcmpxExecWARHazard when necessary
- aco: fix s_setreg hazards
- aco: consider exec_hi in reads_exec()
- aco: resolve all possible hazards at the end of shader parts
- aco/tests: test that hazards are resolved at the end of shader parts
- radv: skip zero-sized memcpy
- ac/nir: fix out-of-bounds access in ac_nir_export_position
- radv: fix signed integer overflow
- Revert "radv: pre-init surface info"
- nir: improve ms_cross_invocation_output_access with local_invocation_id
- aco,nir: add export_row_amd intrinsic
- ac/nir: add row parameter to helpers
- ac/nir: remove dead code
- ac/nir: refactor mesh vertex/primitive export
- ac/nir: implement mesh shader gs_fast_launch=2
- ac/nir: optimize mesh shader local_invocation_index
- radv: implement mesh shader gs_fast_launch=2
- ac/nir: add emit_ms_outputs helper
- ac/nir,radv: pass workgroup size to ac_nir_lower_ngg_ms
- ac/nir: implement mesh shader multi-row export
- radv: implement mesh shader multi-row export
- radv: enable mesh shader gs_fast_launch=2 and multi-row export
- nir/serialize: fix signed integer overflow
- nir/lower_shader_calls: skip zero-sized qsort
- util: skip zero-sized SHA1Update
- radv: call lower_array_deref_of_vec before lower_io_arrays_to_elements
- radv: skip radv_remove_varyings for mesh shaders
- radv: disable gs_fast_launch=2 by default
- docs: fix RADV_THREAD_TRACE_CACHE_COUNTERS default
- radv: add radv_disable_trunc_coord option
- radv: enable radv_disable_trunc_coord for vkd3d-proton/DXVK
- ac/nir: fix partial mesh shader output writes on GFX11
Rob Clark (60):
- freedreno: move virtgpu msm_proto.h to common
- freedreno/drm/virtio: Remove unused header
- tu/msm: staticify a couple things
- tu/knl: Remove some random const'ness
- drm-uapi: Update virtgpu header
- freedreno: Update virtgpu proto
- freedreno/drm/virtio: Use global_faults
- tu: close submitqueues before device_finish()
- tu/drm: Factor out shared helpers
- tu/drm: Add missing error path cleanup
- tu/drm: Split out helper for iova alloc
- tu: Add virtgpu support
- util: Decouple disk cache from EGL_ANDROID_blob_cache
- docs: Followup to !24636
- tu: Workaround bionic _SC_LEVEL1_DCACHE_LINESIZE
- ir3+tu: Simplify ir3_find_sysval_regid callers
- freedreno/a6xx: Drop unused screen args
- freedreno/a6xx: Re-work fd6_emit_shader
- freedreno/a6xx: Re-write the function-of-doom
- freedreno: Implement ATI_meminfo
- freedreno/a6xx: ARB_post_depth_coverage
- freedreno/a6xx: ARB_sample_locations
- freedreno/a6xx: ARB_texture_filter_minmax
- freedreno/a6xx: EXT_demote_to_helper_invocation
- freedreno/a6xx: EXT_shader_image_load_formatted
- freedreno/a6xx: EXT_depth_bounds_test
- freedreno/a6xx: Use pipe_blit_info::sample0_only
- freedreno/a6xx: Handle PIPE_BIND_BLENDABLE
- freedreno/a6xx: ARB_shader_viewport_layer_array
- tu: Fix heap size
- freedreno: Fix crash with debug msgs enabled
- freedreno/layout: Handle 565/etc MSAA special case
- freedreno/decode: Fix printing chip-id
- freedreno/a6xx: Add L8_SRGB
- freedreno: Add reformatting commits to .git-blame-ignore-revs
- freedreno/fence: Hold a strong ref to batch
- freedreno/decode: Lookup device info
- freedreno/decode: Use info->chip to decode
- freedreno/decode: Remove gpu_id
- freedreno: Indentation fix
- freedreno: Use explicit QCOM_TILED3 modifier
- freedreno/a6xx: Remove dummy packet for globals
- freedreno: Fix streamout offset_buf dirtiness
- freedreno: Fix user const buffer dirtiness
- freedreno/batch: Move query_buf allocation
- freedreno: Add private-BO tracking
- freedreno: Add missing indirect_draw_count tracking
- freedreno: Move/add some attach_bo()
- freedreno: Add attach-bo debugging
- freedreno: Rework supported-modifiers handling
- mesa: Introduce MESA_texture_const_bandwidth
- mesa: Implement MESA_texture_const_bandwidth
- freedreno: Add PIPE_CAP_HAS_CONST_BW support
- panfrost: Add PIPE_CAP_HAS_CONST_BW support
- iris: Add PIPE_CAP_HAS_CONST_BW support
- radeonsi: Add PIPE_CAP_HAS_CONST_BW support
- tu/msm: Fix timeline semaphore support
- tu/virtio: Fix timeline semaphore support
- freedreno/drm: Fix race in zombie import
- freedreno: Always attach bo to submit
Robert Foss (9):
- egl: Expose access to DeviceList
- egl: Rename _eglRefreshDeviceList() to _eglDeviceRefreshList()
- egl: Refresh DeviceList during eglInitialize()
- egl/surfaceless: Use EGL DeviceList instead of drmGetDevices2()
- egl/android: Use EGL DeviceList instead drmGetDevices2()
- egl: Rename _eglAddDevice() to _eglFindDevice()
- egl: Rename _eglAddDevice() to _eglFindDevice()
- egl: Fix attrib_list[0] == EGL_NONE check
- egl: Always set _EGLDisplay->Device during eglGetPlatformDisplay()
Robert Mader (6):
- egl/wayland: wait for compositor to release shm buffers
- iris: Support parameter queries for main planes
- util: Add new helpers for pipe resources
- panfrost: Support parameter queries for main planes
- vc4/resource: Support offset query for multi-planar planes
- v3d/resource: Support offset query for multi-planar planes
Rohan Garg (33):
- iris: migrate WA 14013910100 to use the WA framework
- iris: migrate WA 14016118574 to use the WA framework
- iris: fix iris for WA 16013000631
- intel/perf: add perf query support for Intel Raptorlake
- intel/genxml: set a default value for "Pixel Position Offset Enable" in genxml
- anv: use the WA infrastructure where possible when generating state
- anv: use the correct GFX_VERx10 macro for WA
- anv,iris: program the maximum number of threads on compute queue init
- anv: drop CFE state validation checks
- iris: track reset signalling instead of replacing the context
- iris: allow for a unsynchronized device reset query
- anv: partially revert 2e8b1f6d
- anv: emitting 3DSTATE_PRIMITIVE_REPLICATION is required on Gen12+
- anv: use the pre defined _3DPRIMITIVE_DIRECT macro
- anv: drop dead ifdef
- iris: use the correct WA macros and lineage numbers
- anv: use the lineage number for WA
- crocus: add a __gen_get_batch_address declaration
- crocus: fix GFX_VERx10 macro
- blorp: drop undefined macro
- iris: migrate preemption streamwout wa to WA infra
- intel/genxml: update PIPE_CONTROL instruction for dg2
- anv: define clear color localy within can_fast_clear_color_att
- intel/compiler: Adjust CS payload registers for new register width on Xe2+
- intel/compiler: Adjust fence message lengths for new register width on Xe2+
- intel/compiler: Adjust barrier emission for Xe2+
- intel/genxml: fix 3DSTATE_3D_MODE length to align with BSpec
- anv: ensure that FCV_CCS_E fast clears are properly tracked
- anv: enable FCV for Gen12.5
- anv: fix debug string for PC flush
- anv: cleanup includes
- anv: turn off non zero fast clears for CCS_E
- anv: selectively enable FCV optimization for DG2
Roland Scheidegger (1):
- lavapipe: further limit accurate_a0 hack
Roman Stratiienko (22):
- egl: android: Remove legacy name-based shared buffers support
- util: Add NONNULL macro
- android: Introduce the Android buffer info abstraction
- android: Fix num_planes assignment in u_gralloc_fallback
- v3dv/android: Use u_gralloc code
- v3dv/android: Enable shared presentable image support
- v3dv: Migrate to vk_device_memory
- v3dv/android: Skip swapchain binding
- v3dv: Rely on the internal tiled flag instead of the common vk structure
- v3dv/android: Add a helper function to support explicit layouts
- v3dv/android: Rework Android native buffer importing logic
- v3dv: Use format stored in vk_image and vk_image_view after init
- v3dv: Split v3dv_image_init to use layout setting logic separately
- v3dv/android: Add AHardwareBuffer support
- v3dv: Enable VK API v1.2 for Android
- panvk: Add Android ICD loader entry point
- u_gralloc: Remove inline modifiers from the functions
- u_gralloc: Remove usage of NONNULL macro
- Revert "util: Add NONNULL macro"
- u_gralloc: Add a function that returns gralloc type
- dri: Remove __driDriverExtensions leftovers
- v3d: Don't implicitly clear the content of the imported buffer
Ruijing Dong (2):
- frontends/va: checking va version for av1enc support
- radeonsi/vcn: change max_poc to fixed value for hevc encoder.
Ryan Neph (1):
- vulkan/android: add missed STACK_ARRAY_FINISH()
Sagar Ghuge (34):
- intel/compiler: Look at 2 register worth of data instead of 4
- isl: Disable MCS compression just on ACM platform
- intel: Add env variable to add break point on/before draw
- anv: Add GPU breakpoint before/after specific draw call
- iris: Add GPU breakpoint before/after draw call
- blorp: Implement blorp hooks to emit breakpoint
- docs: Add INTEL_DEBUG_BKP_BEFORE/AFTER_DRAW_COUNT
- intel/isl: Enable INTEL_DEBUG=noccs/nohiz in ISL helpers
- anv,hasvk: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks
- iris,crocus: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks
- blorp: Drop unnecessary assertions in blorp_can_hiz_clear_depth
- anv: Add helper to create companion RCS command buffer
- anv: Split out End/Destroy/Reset cmd buffer code into helper
- anv: Handle companion RCS in end/destory/reset code path
- intel: Add helper to create/destroy i915 VM
- intel: Pass virtual memory address space ID while creating context
- anv: Create companion RCS engine
- anv: Move compute specfic bits under compute queue init
- anv: Execute RCS init batch on companion RCS context/engine
- anv: Setup companion RCS command buffer submission
- anv: Execute an empty batch to sync main and companion RCS batch
- anv: Add secondary companion RCS cmd buffer to primary
- anv: Skip layout transition on the compute queue
- anv: Extract batch print code to anv_print_batch helper
- iris: Enable always flush cache with DEBUG_STALL option
- intel/genxml: Add STATE_COMPUTE_MODE instruction
- anv: Program and emit STATE_COMPUTE_MODE
- anv: Enable barrier handling on video engines
- isl: Use 16-bit instead of 8-bits for surface format info fields
- anv: Handle end of pipe with MI_FLUSH_DW on transfer queue
- anv: Enable transfer queue only on ACM+ platforms
- blorp: Use the correct miptail start LOD for surfaces
- anv: Write timestamp using MI_FLUSH_DW on blitter
- anv: Flush data cache while clearing depth using HIZ_CCS_WT
Saleemkhan Jamadar (1):
- radeonsi/vcn: set jpeg reg version for gfx 1150
Samuel Holland (3):
- Android.mk: Allow building only Vulkan drivers
- Android.mk: Explicitly enable/disable LLVM support
- Android.mk: Only link LLVM for radeonsi, not amd_vk
Samuel Pitoiset (299):
- radv: remove support for VK_INDIRECT_COMMANDS_TOKEN_TYPE_STATE_FLAGS_NV
- radv: make radv_get_pa_su_sc_mode_cntl() static
- zink/ci: update list of expected failures for NAVI10
- radv: stop using a pipeline for emitting VGT_VERTEX_REUSE_BLOCK_CNTL
- radv: remove unused param in radv_pipeline_emit_vgt_gs_out()
- radv: pass a shaders array for computing ia_multi_vgt_param
- radv: bind the pre-compiled PS epilog to the cmdbuf state
- radv: stop using an array of binaries when compiling a compute shader
- radv: add radv_compile_cs() to compile a compute shader
- radv: remove the pipeline dependency for creating a GS copy shader
- radv: add a helper to compute the ESGS itemsize
- radv: use the number of GS linked inputs to compute the ESGS itemsize
- radv: determine ES info for VS/TES with GS earlier
- radv: determine as_ls earlier by using the next stage
- radv: simplify getting next VS stage for VS prologs
- radv: use next_stage for determining the stage to lower NGG
- radv/amdgpu: fix dumping CS with the chained IBs path
- radv/amdgpu: rename old_ib to ib in radv_amdgpu_winsys_cs_dump()
- radv: pass submit info to radv_check_gpu_hangs()
- radv: initialize stage/next_stage earlier
- radv: set next_stage to MESA_SHADER_NONE if there is no FS
- radv: rework considering force VRS without relying on graphics pipeline
- radv: stop passing radv_graphics_pipeline to radv_fill_shader_info()
- radv: move removing all varyings when the FS is a noop
- radv: rename graphics pipeline linking helpers
- radv: simplify lowering NGG GS intrinsics
- radv: rework determining the NGG stage without a graphics pipeline
- radv: cleanup pipeline compute emit helpers
- radv: rename radv_pipeline_stage to radv_shader_stage
- radv: rename NGG query state to be more generic
- radv: declare the shader query user SGPR for emulating GS counters
- radv: enable pipelinestat query emulation for legacy GS
- radv: simplify the NGG vs legacy pipelinestat query path
- radv: rename RADV_SHADER_QUERY_PIPELINE_STAT_OFFSET
- radv: implement nir_intrinsic_atomic_add_gs_invocation_count_amd
- radv: emulate GEOMETRY_SHADER_INVOCATIONS query on RDNA1-2
- radv: track whether inputs/outputs are linked per shader stage
- radv: add support for VS/TES as ES without shaders IO linking
- radv: use next_stage to determine if the layer should be exported
- radv: use next stage to determine if primID/clip dist should be exported
- radv: compute the legacy GS info earlier
- radv: stop copying some NIR info fields from TES to TCS
- radv: stop lowering patch vertices for TES
- radv: do not always copy the number of tess patches to TES
- radv: initialize tcs.tes_{patch}_inputs_read to a default value
- radv: prevent linking TCS<->TES when TES is NULL
- radv: use a packed user SGPR for the TES state
- radv: stop checking if patch control points is dynamic everywhere
- radv: copy the number of TCS vertices out to TES shader info
- radv: add support for dynamic TCS vertices out for TES
- radv: remove radv_shader_info::tes::num_linked_patch_inputs
- amd,radeonsi: move si_shader_io_get_unique_index_patch() to common code
- radv: allow to use fixed IO locations for VS<->TCS<->TES without linking
- aco: add aco_shader_info::tcs::has_epilog
- aco: add infra for compiling TCS epilogs
- radv,aco: move has_epilog to radv_shader_info
- radv: assume a TCS needs an epilog unless it's linked with a TES
- radv: do not write tess factors in main TCS when it has an epilog
- radv: track if TES reads tess factors differently
- radv: declare new argument for the TCS epilog PC
- radv: add radv_tcs_epilog_key
- radv: add infra for creating TCS epilogs
- radv: add support for a TCS epilogs cache in the device
- radv: add support for emitting TCS epilogs in cmdbuf
- radv: remove unnecessary check in radv_pipeline_nir_to_asm()
- radv: stop passing a graphics pipeline to radv_pipeline_nir_to_asm()
- radv: inline radv_pipeline_get_nir() in radv_graphics_pipeline_compile()
- radv: add a struct for the retained shaders and GPL
- radv: add radv_graphics_shaders_compile() to compile graphics shaders
- radv: remove redundant check in radv_cmd_buffer_after_draw()
- radv: track if patch control points is dynamic from the cmdbuf state
- radv: re-emit binning state if the framebuffer is dirty
- radv: track if vertex binding stride is dynamic from the cmdbuf state
- vulkan: bump header register to 1.3.261
- vulkan/runtime: add common implementation for GetImageSubresourceLayout()
- vulkan/format: add VK_FORMAT_{A8_UNORM,A1B5G5R5_UNORM_PACK16}_KHR
- radv: use the RT prolog scratch size directly for tracing rays
- radv: add a helper to get the maximum number of scratch waves per shader
- radv: update the number of scratch waves for RT prolog at bind time
- radv: update cmdbuf scratch size info when shaders are bound
- vulkan: add init/finish helpers for vk_buffer_view
- radv: use vk_buffer_view
- radv: use vk_sampler
- radv: use common vkCmdBegin/EndQuery wrappers
- radv: use vk_query
- zink: fix setting VkShaderCreateInfoEXT::nextStage
- radv/rt: fix capture/replay support
- vulkan/render_pass: add common vkGetRenderingAreaGranularityKHR()
- radv: implement vkCmdBindIndexBuffer2KHR()
- radv: allow VK_WHOLE_SIZE for pSizes in vkCmdBindVertexBuffers2()
- radv/rmv: remove unused pipeline create flags when logging pipelines
- radv: store pipeline create flags to radv_pipeline::create_flags
- radv: add support for VkPipelineCreateFlags2CreateInfoKHR
- radv: add support for VkBufferUsageFlags2CreateInfoKHR
- radv: allow VK_REMAINING_ARRAY_LAYERS with VkImageSubresourceLayers
- radv: implement radv_Get{Device}ImageSubresourceLayout2KHR()
- radv: advertise VK_KHR_maintenance5
- radv: remove useless NULL for pipeline layout during shader info pass
- radv: introduce radv_shader_layout for per-stage descriptor layout
- radv: stop passing redundant stage to radv_shader_stage_init()
- radv: re-introduce radv_pipeline_stage_init()
- radv: add support for loading the LSHS vertex stride from a SGPR
- radv: use the number of VS outputs for computing the tessellation info
- vulkan: ignore VkPipelineColorWriteCreateInfoEXT if the state is dynamic
- radv: reduce TCS_OFFCHIP_LAYOUT_NUM_PATCHES to 6-bits
- radv: add missing comment about TCS_OFFCHIP_LAYOUT_LSHS_VERTEX_STRIDE
- radv: fix emitting TCS epilogs for GFX6-9
- radv: remove radv_cmd_buffer::cached_vertex_formats
- radv: remove unused param from radv_pipeline_init_multisample_state()
- radv: simplify declaring VS specific input SGPRs
- radv: stop copying if VS or TES uses the InvocationID built-in
- Revert "radv/amdgpu: workaround a kernel bug when replacing sparse mappings"
- Revert "radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build"
- radv/amdgpu: allow to execute external IBs on the compute queue
- radv/amdgpu: add support for submitting external IBs with the chained path
- zink/ci: update list of expected failures for NAVI10
- radv: use the maximum possible workgroup size for TCS epilogs
- radv: stop declaring the scratch offset argument for TCS epilogs
- radv: declare shader arguments for TCS epilogs
- radv: add tcs_out_patch_fits_subgroup to radv_tcs_epilog_key
- aco: fix jumping from main TCS to epilog on GFX9+
- aco: adjust TCS epilogs for RADV
- aco: allow SGPRs operands with p_jump_to_epilog
- aco: implement create_tcs_jump_to_epilog()
- radv: track the pipeline bind point for indirect commands layout
- radv: prepare radv_get_sequence_size() for DGC compute
- radv: prepare radv_prepare_dgc() for DGC compute
- radv: implement NV_device_generated_commands_compute
- radv: allow DGC on the compute queue
- radv: advertise NV_device_generated_commands_compute
- aco: rework printing shader stages
- radv: fix the per-patch data offset when TES isn't linked with TCS
- radv: stop declaring unused SGPR arguments for PS epilogs
- radv: add radv_shader_info::is_monolithic
- radv: use info->uses_view_index directly when declaring shader arguments
- radv: do not inline push constants for non-monolithic shaders
- radv: force indirect descriptor sets for non-monolithic shaders
- radv: always declare some arguments for non-monolithic VS/TCS shaders
- radv: add a new shader argument for non-monolithic shaders PC
- ac: allow to mark shader arguments as preserved
- radv: preserve shader arguments for non-monolithic VS/TCS on GFX9+
- aco: disable shared VGPRs for non-monolithic shaders on GFX9+
- aco: ensure to initialize exec manually for VS as LS on GFX9+
- aco: add support for compiling VS+TCS separately on GFX9+
- radv: always declare some arguments for non-monolithic {VS,TES}/GS shaders
- radv: preserve shader arguments for non-monolithic {VS,TES}/GS on GFX9+
- aco: ensure to initialize exec manually for non-monolithic {VS,TES}/GS on GFX9+
- aco: add support for compiling {VS,TES}+GS separately on GFX9+
- radv,aco: remove unused clip/cull distances variables
- radv: rename tcs_shader to tcs in radv_emit_tcs_epilog_state()
- radv: small cleanups in radv_emit_patch_control_points()
- radv: fix emitting TCS epilogs if TES and GS are linked on GFX9+
- radv: remove the pipeline dependency for emitting VGT_GS_MODE
- aco: fix emitting TCS epilogs end on GFX9+
- radv: re-order IO slot layout for stages that aren't linked
- amd/ci: update list of failures/flakes for glcts-vangogh-valve
- ci: uprev vkd3d-proton
- ci: uprev Fossilize
- ci: add comment explaining which image tags to update for Fossilize
- radv: preserve shader argument for separate compilation of NGG shaders
- aco: flag blocks with long-jump as export_end for separate compilation
- aco: adjust fix_exports() for VS/TES as NGG and non-monolithic shaders
- aco: allow separate compilation of NGG shaders
- zink/ci: add zink-radv-polaris10-valve
- radv/ci: re-enable vkcts-polaris10-valve
- radv: fix capturing indirect dispatches with SQTT
- radv/ci: re-enable vkd3d-polaris10-valve
- ci: do not fail vkd3d-proton job when the expectations match
- radv/amdgpu: fix executing secondaries without IB2
- radv/amdgpu: do not copy the original chain link for IBs
- radv: avoid emitting SQTT markers for DGC calls
- radv: add support for DGC with SQTT
- zink/ci: merge GLCTS testing with GLESx for RADV
- zink/ci: merge piglit testing with deqp-runner for RADV
- radv: fix interactions with primitives generated queries and pipeline stats
- radv: skip DGC calls when the indirect sequence count is zero with a predicate
- radv: avoid emitting THREAD_TRACE_MARKER for predicated draws/dispatches
- radv: adjust next stage for VS prologs and merged shaders compiled separately
- radv: adjust emitted prolog regs for merged shaders compiled separately
- radv: do not use pre-compiled prologs when VS is compiled separately
- radv: remove useless PIPELINE_CREATE_2_LIBRARY_BIT check for retained shaders
- radv: fix enabling DGCC
- radv: fix emitting SQTT userdata when CAM is needed
- radv: fix capturing RGP on RDNA3 with more than one Shader Engine
- zink/ci: update list of expected failures for POLARIS10/NAVI10
- radv: set THREAD_TRACE_TOKEN_MASK.BOP_EVENTS_TOKEN_INCLUDE on GFX10.3+
- radv: disable unsupported hw shader stages for RGP on GFX11+
- radv: fix instruction timing on GFX11
- ac/rgp: use correct API stage string for mesh/task shaders
- radv: set THREAD_TRACE_MARKER_ENABLE for mesh/task draws
- radv: emit relocation for mesh/task shaders
- issue_templates/Bug Report: fix outdated URL for GFXReconstruct
- ac,radv,radeonsi: rework SPM counters configuration and share it
- ac/perfcounter: add new SQ_WGP block for GFX11+
- ac/spm: add SPM counters configuration for GFX11
- radv: enable the PKT3 CAM bit for some SPM register writes
- radv,radeonsi: use AC_SPM_SEGMENT_TYPE_xxx instead of magic values
- ac/spm: remove useless SPM block setting for GFX9 and older GPUs
- ac/spm: add SPM block definition for GFX10-GFX10.3
- ac/gpu_info: init num_cu_per_sh from the kernel
- ac/perfcounter: set the number of instances of GL1C to 4
- ac/perfcounter: compute the number of global instances of TCP,SQ,GL1C and GL2C
- ac/spm: fix checking if the counter instance is valid
- ac/spm: rework how segment muxsel RAM are filled
- ac/spm: initialize and set instance mapping for counters
- radv: reserve more CS space in SQTT/SPM paths
- ac/spm: use block flags to initialize instance mapping
- ac/spm: select correct segment type for per-SE blocks
- radv,radeonsi: make sure to emit GRBM_GFX_INDEX before SQ select registers
- ac/spm: fix number of instances of GL2C
- ac,radv,radeonsi: prepare support for multi-instance SPM SQ counters
- ac,radv,radeonsi: prepare support for multi-instance SPM generic counters
- ac/spm: move the counter instance to ac_spm_counter_create_info
- ac/spm: enable support for multi-instance counters
- radv: fix checking if RGP is enabled with others tracing tools
- radv: fix missing ISA with RGP and GPL
- ac/perfcounter: add SG_WQP group for GFX11
- ac/perfcounter: add GFX11 groups
- drirc: remove Path of Exile workarounds
- radv: remove drirc workarounds for Path Of Exile
- radv: remove absolute_depth_bias workaround
- ac/gpu_info: define AMD_MAX_WGP
- ac/spm: add new segment types for GFX11
- ac/spm: add support for GFX11
- radv: add SPM support for GFX11
- radv: enable cache counters for RGP on GFX11
- ci: update to vulkan-cts-1.3.6.3
- radv/ci: skip dEQP-VK.robustness.* on Vangogh due to weird GPU hangs
- nir: rename atomic_add_gs_invocation_count_amd to make it more generic
- ac/nir: add lowering for mesh shader queries
- ac/nir: add lowering for task shader queries
- radv: add GDS counters offset for mesh/task queries
- radv: adjust lowering of intrinsic queries for mesh/task shaders
- radv: enable lowering of mesh/task shader queries when enabled
- radv: declare shader_query_state for mesh/task shaders
- radv: stop skip emitting CB states when there is no color attachment
- radv: re-enable DCC with mipmaps on GFX11
- radv: fix COMPUTE_SHADER_INVOCATIONS query on compute queue
- radv: emit missing PA_{SC,SU}_LINE_STIPPLE_xxx regs in gfx preamble
- radv: fix alignment of DGC command buffers
- radv/ci: update list of expected failures on PITCAIRN
- radv/ci: update list of flakes for NAVI10/VEGA10
- radv/amdgpu: fix alignment of command buffers
- radv: enable DCC for MSAA images on GFX11
- zink/ci: update list of expectations for zink-anv-tgl
- zink/ci: bump zink-anv-tgl-full timeout to 1h45m
- radv/ci: rename GFX1100 lists to NAVI31
- radv: fix emulated geometry shader primitives/invocations queries
- radv/ci: remove duplicate skipped tests for RAVEN/STONEY
- radv/ci: exclude dEQP-VK.texture.explicit_lod.2d.sizes.128x128_* for all jobs
- radv: fix synchronization with emulated GS primitives/invocations queries
- radv/ci: remove no longer existing test for VANGOGH
- radv/ci: cleanup list of expected failures for NAVI10/NAVI21/VEGA10
- radv: always write the sample positions when a new descriptor BO is created
- radv: fill the scratch BO in radv_fill_shader_rings()
- radv: fix gang submissions with chaining
- radv: fix re-emitting streamout descriptors for NGG streamout
- radv: fix IB alignment
- zink: use warn_missing_feature for missing modifier support
- radv: fix destroying GDS/OA BOs
- radv: allocate only 1 GDS OA counter for gfx10 NGG streamout
- ac/nir: only consider overflow for valid feedback buffers
- radv/ci: update list of expected failures on RAVEN
- radv/ci: update list of flakes for VANGOGH
- radv/ci: update list of flakes for STONEY
- radv: disable primitive restart for non-indexed draws on GFX11
- radv: enable radv_disable_aniso_single_level=true for Zink too
- amd/llvm,aco,radv: implement NGG streamout with GDS_STRMOUT registers on GFX11
- radv: mark GDS as needed for XFB queries with NGG streamout on GFX11
- radv: skip GDS allocation for NGG streamout on GFX11
- zink/ci: remove expected failures that are skipped for RADV
- ci: update CTS to vulkan-cts-1.3.7.0
- ci: bump the number of tests per group from 500 to 5000 for Vulkan drivers
- ci: bump DEQP_FRACTION for some jobs
- radv: set ENABLE_PING_PONG_BIN_ORDER for GFX11.5
- radv: initialize video decoder for GFX11.5
- ac/gpu_info: query the maximum number of IBs per submit from the kernel
- Revert "radv: fix finding shaders by PC"
- radv: fix missing predicate bit for WRITE_DATA helper
- ac/gpu_info: fix querying the maximum number of IBs per ring
- radv: remove outdated RADV_DEBUG=vmfaults support
- amd: update amdgpu_drm.h
- amd: add has_gpuvm_fault_query
- radv/amdgpu: add support quering the last GPUVM fault
- radv: query and report the last GPUVM fault with RADV_DEBUG=hang
- radv: report the last GPUVM fault when a device lost is detected
- ac/gpu_info: remove bogus assertion about number of COMPUTE/SDMA queues
- radv: fix a synchronization issue with primitives generated query on RDNA1-2
- radv: bind the non-dynamic graphics state from the pipeline unconditionally
- radv: fix compute shader invocations query on compute queue on GFX6
- radv: emit COMPUTE_PIPELINESTAT_ENABLE for CS invocations on ACE
- nir: fix inserting the break instruction for partial loop unrolling
- radv: fix registering queues for RGP with compute only
- radv: set radv_zero_vram=true for Unreal Engine 4/5
- radv: fix a descriptor leak with debug names and host base descriptor set
- radv: add a missing async compute workaround for Tonga/Iceland
- radv: disable TC-compatible HTILE on Tonga and Iceland
- radv: set radv_invariant_geom=true for War Thunder
- radv: do not set OREO_MODE to fix rare corruption on GFX11
Saroj Kumar (4):
- radeonsi: Add perfetto support in radeonsi
- radeonsi: Add u_trace init code in radeonsi
- radeonsi: Add tracepoints in radeonsi driver
- radeonsi: fixes compilaton error when perfetto is disabled
Sathishkumar S (2):
- radeonsi/vcn: support variable number of bs_bufs
- radeonsi/vcn: num bs_bufs must be proportional to num jpeg engines
Semjon Kravtsenko (1):
- glx: Assign unique serial number to GLXBadFBConfig error
Seppo Yli-Olli (1):
- zink: Fix SyntaxWarning in zink_extensions script
Sergi Blanch Torne (7):
- Introduce ci-kdl builder and launcher.
- Integrate ci-kdl in the building process and launch process.
- ci: disable Collabora's LAVA lab for maintance
- Revert "ci: disable Collabora's LAVA lab for maintance"
- Revert "ci: disable Collabora's LAVA lab for maintance"
- ci: disable Collabora's LAVA lab for maintance
- Revert "ci: disable Collabora's LAVA lab for maintance"
Sid Pranjale (1):
- nvk: Enable VK_EXT_load_store_op_none
Sil Vilerino (20):
- util: Blake3 - Identify arm64ec as aarch64 instead of x64
- d3d12: Fix Map/Unmap of YUV resources
- d3d12: Fix H264 interlaced decode
- d3d12: Video Decode - Remove unnecessary copy for texture array case
- util/vl_vlc: Use UINT64_MAX instead of ~0UL with MSVC compiler
- d3d12: Extend video screen AV1 encode tile support checking
- aux/tc: Add ASSERTED to unreferenced release build variable
- d3d12: Video - Relax ID3D12VideoDevice QI version for decode, process
- frontends/va: Add profile param when querying PIPE_VIDEO_CAP_ENC_QUALITY_LEVEL
- d3d12: Upgrade to D3D12 Agility SDK 1.611 Video interface
- d3d12: Fixes AV1 tx_mode_support reporting and unsupported tx_mode overriding
- d3d12: Video Decode - Wait for GPU completion before destroying decoder in-flight objects
- d3d12: Do not destroy codec when destroying video buffer
- d3d12: AV1 encode - Add lower resolution fallback check for uniform tile support
- d3d12: AV1 encode - add fallback for app passing unsupported pic_params.InterpolationFilter
- d3d12: AV1 Encode - Fix VAConfigAttribEncMaxRefFrames reporting
- frontend/va: Add support for VAConfigAttribEncMaxTileRows/Cols
- d3d12: Add support for PIPE_VIDEO_CAP_ENC_MAX_TILE_ROWS/COLS
- d3d12: Allocate d3d12_video_buffer with higher alignment for compatibility
- d3d12: d3d12_video_buffer_create_impl - Fix resource importing
Simon Ser (7):
- wayland: enable use of wayland-protocols as a subproject
- vulkan/wsi/wayland: add support for IMMEDIATE
- vulkan/wsi/wayland: fix unset present_mode
- radv/winsys: check amdgpu_create_bo_from_user_mem() for EINVAL
- egl: extract EGLDevice setup in dedicated function
- egl: move dri2_setup_device() after dri2_setup_extensions()
- egl: ensure a render node is passed to _eglFindDevice()
Simon Zeni (1):
- nouveau/winsys: use mmap instead of mmap64 in nouveau_bo
SoroushIMG (1):
- pvr: fix mipmap size calculation for bc formats
Sviatoslav Peleshko (9):
- dri: Use RGB internal formats for RGBX formats
- intel/isl: Don't over-allocate CLEAR_COLOR size to use whole cache line
- anv: Do fast clear color initialization more delicately
- zink: Change zink_vertex_elements_hw_state::b.strides to VkDeviceSize
- intel/fs: Check if the whole ubo load range is in the push const range
- zink: Store zink_vertex_elements_hw_state::b.strides by binding id
- intel/fs: Fix "packed word exception" condition for register regioning
- intel/eu/validate: Validate "packed word exception" stricter
- nir/loop_analyze: Fix inverted condition handling in iterations calculation
Sylvain Munaut (9):
- egl/dri2: Add a couple of missing mutex release in error path
- mesa: Enable ARB_texture_border_clamp in GL Core
- include: Fix the PFN declarations to be pointers as they should
- glx: Add missing MesaGLInteropGLXFlushObjects
- glx: Export the MESA GL Interop functions through glXGetProcAddress
- egl: Export the MESA GL Interop functions through eglGetProcAddress
- glx: Remove MESA_depth_float_bit from enum
- glx: Advertise GLX_MESA_gl_interop extension if support present
- egl: Advertise EGL_MESA_gl_interop extension if support present
Tapani Pälli (34):
- intel/blorp: add a new flag to communicate PSS sync need
- anv: implement required PSS sync for Wa_18019816803
- iris: implement required PSS sync for Wa_18019816803
- vulkan/runtime: change assert to match specification needs
- anv: remove assert, size is asserted in the runtime
- anv: refactor batch_set_preemption to use batch_emit_pipe_control
- anv: implement a dummy depth flush for Wa_14016712196
- iris: implement a dummy depth flush for Wa_14016712196
- mesa: fix some TexParameter and SamplerParameter cases
- mesa: remove GL_UNSIGNED_BYTE as supported for snorm reads
- ci: add a fix for KHR-GLES3.packed_pixels.*snorm tests
- anv: implement Wa_14018912822
- iris: implement Wa_14018912822
- driconf: use lower_depth_range_rate for The Spirit and The Mouse
- mesa: disable snorm readpix clamping with EXT_render_snorm
- iris: modify Wa_14014414195 to use intel_needs_workaround
- mesa: some cleanups for texparam extension checks
- iris: avoid issues with undefined clip distance
- crocus: avoid issues with undefined clip distance
- anv: refactor to fix pipe control debugging
- anv: fix a leak of fp64_nir shader
- iris: use intel_needs_workaround for Wa_14014414195 part 2
- iris: correct dst alpha blend factor in Wa_14018912822
- iris/anv: move Wa_14018912822 as a drirc workaround
- iris: flush data cache when flushing HDC on GFX < 12
- anv: HDC flush is available only for GFX_VER 12+
- iris: HDC flush is available only for GFX_VER 12+
- intel/genxml: remove HDC from gen11.xml, it is not available
- mesa/st: ignore StencilSampling if stencil not part of the format
- intel/dev: expand existing fix for all gfx12 with small EU count
- egl: fix leaking drmDevicePtr in _eglFindDevice
- iris: add data cache flush for pre hiz op
- anv/drirc: add option to disable FCV optimization
- drirc: Set limit_trig_input_range option for Valheim
Tatsuyuki Ishi (8):
- radv/amdgpu: Remove unused bo_list variable from cs_submit.
- radv/winsys: Remove unused struct radv_winsys_bo_list.
- radv/amdgpu: Do not pass in a BO handle when clearing PRT VA region.
- radv: Fix IB size for RADV_DEBUG=hang.
- radv: Fix dumping vertex descriptors with RADV_DEBUG=hang.
- radv/amdgpu: Use rwlock to protect access to virtual BOs.
- zink: Fix missing sparse buffer bind synchronization.
- zink: Fix waiting for texture commit semaphores.
Thomas H.P. Andersen (65):
- tgsi: remove unused tgsi_shader_info.num_tokens
- tgsi: remove unused tgsi_shader_info.array_max
- tgsi: remove unused tgsi_shader_info.num_memory_instructions
- tgsi: remove unused tgsi_shader_info.colors_read
- tgsi: remove unused tgsi_shader_info.colors_written
- tgsi: remove unused tgsi_shader_info.reads_position
- tgsi: remove unused tgsi_shader_info.reads_samplemask
- svga: remove unused struct field
- tgsi: remove unused tgsi_shader_info.reads_tess_factors
- tgsi: remove unused tgsi_shader_info fields
- tgsi: remove unused tgsi_shader_info fields
- tgsi: remove unused tgsi_shader_info.uses_drawid
- tgsi: remove unused tgsi_shader_info fields
- tgsi: remove unused tgsi_shader_info.uses_subgroup_info
- tgsi: remove unused tgsi_shader_info.writes_primid
- tgsi: remove unused tgsi_shader_info.uses_doubles
- tgsi: remove unused tgsi_shader_info.uses_derivatives
- tgsi: remove unused tgsi_shader_info.uses_bindless_samplers
- tgsi: remove unused tgsi_shader_info.uses_bindless_images
- tgsi: remove unused tgsi_shader_info.clipdist_writemask
- tgsi: remove unused tgsi_shader_info.culldist_writemask
- tgsi: remove unused tgsi_shader_info.images_load
- tgsi: remove unused tgsi_shader_info.images_store
- tgsi: remove unused tgsi_shader_info.images_atomic
- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_load
- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_store
- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_atomic
- tgsi: remove unused tgsi_shader_info.uses_bindless_image_load
- tgsi: remove unused tgsi_shader_info.uses_bindless_image_store
- tgsi: remove unused tgsi_shader_info.uses_bindless_image_atomic
- tgsi: remove unused tgsi_shader_info.indirect_files_read
- tgsi: remove unused tgsi_shader_info.indirect_files_written
- tgsi: remove unused tgsi_shader_info.const_buffers_indirect
- tgsi: remove unused tgsi_shader_info.max_depth
- tgsi: drop two unused functions
- nvk: use common physical device enumeration
- nvk: fix implicit-fallthrough warnings with clang
- nvk: delete commented code
- nvk: fix mem leaks
- nvk: use common descriptor set layout code
- nvk: use common pipeline layout code
- nvk: advertise KHR_shader_non_semantic_info
- nvk: advertise KHR_image_format_list
- nvk: advertise EXT_private_data
- nvk: advertise KHR_sampler_mirror_clamp_to_edge
- nvk: KHR_descriptor_update_template
- nvk: CmdPushDescriptorSetWithTemplateKHR
- nvk: drop dead assignment
- nvk: drop dead assignment
- nvk: fix initialization override
- nvk: sort extensions
- nvk: advertize KHR_relaxed_block_layout
- nvk: add check for VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT
- nvk: advertise EXT_image_2d_view_of_3d
- nvk: fix maxPushDescriptors
- nvk: call correct macro to clear views
- nouveau/mme: use fermi enum in fermi builder
- nvk: add warning on non-nouveau drm driver
- nvk: Implement VK_KHR_draw_indirect_count on Turing+
- nvk: set device info before use in nvk_get_device_extensions
- nvk: simplify code by using new helpers
- nvk: remove duplicated device features
- nvk: EXT_conditional_rendering
- nvk: advertise VK_EXT_tooling_info
- nvk: set optimization level to 3
Thong Thai (3):
- radeonsi: enable vcn encoder rgb input support
- Update radeon_vcn_enc.c
- frontends/va/config: report max width and height for encoding/decoding
Timothy Arceri (27):
- glsl: fix validation of ES vertex attribs
- nir/opt_copy_prop_vars: don't clone copies if branch empty
- nir/opt_copy_prop_vars: speedup cloning of copy tables
- nir/opt_copy_prop_vars: remove var hash entry on kill alias
- nir/opt_copy_prop_vars: skip cloning of copies arrays until needed
- nir/opt_copy_prop_vars: drop reuse of dynamic arrays
- glsl: fix spirv sso validation
- glsl: mark structs containing images as bindless
- util: add radeonsi workaround for Nowhere Patrol
- glsl: fix out params in glsl to nir
- glsl_to_nir: add more unhandled function types
- nir: replace use of nir_src_copy()
- nir: remove unused nir_src_copy()
- nir: remove unused param from nir_alu_src_copy()
- glsl: remove field from gl_shader_program
- glsl: move get_varying_type() declaration earlier
- glsl: add nir version of validate_first_and_last_interface_explicit_locations()
- glsl: switch to nir validate_first_and_last_interface_explicit_locations()
- glsl: remove unused validate_first_and_last_interface_explicit_locations()
- nir: fix typo in comment
- nir: copy explicit_invariant flag to nir vars
- glsl: move interpolation_string() to linker_util
- glsl: move is_gl_identifier() to linker_util
- nir: add used field to nir variables
- glsl: implement cross_validate_outputs_to_inputs() in nir linker
- glsl: switch to nir linkers cross_validate_outputs_to_inputs()
- glsl: remove now unused varying linker code
Timur Kristóf (39):
- aco: Fix subgroup_id intrinsic on GFX10.3+.
- ac/nir: Simplify arg unpacking when shift is zero.
- ac/nir: Add new pass to lower intrinsics to shader args.
- radv: Move radv_select_hw_stage to radv_shader_info.
- radv: Use ac_nir_lower_intrinsics_to_args.
- radeonsi: Move si_select_hw_stage to si_shader_info.
- radeonsi: Use ac_nir_lower_intrinsics_to_args.
- aco: Remove subgroup_id and num_subgroups intrinsics.
- ac/llvm: Remove subgroup_id and num_subgroups intrinsics.
- aco: Refactor select_program to smaller functions.
- nir/opt_dead_cf: Remove if branches with undef condition.
- ac/nir: Add done arg to ac_nir_export_position.
- ac/nir: Slightly refactor how pos0 exports are added when missing.
- ac/nir/ngg: Wait for attribute stores before VS/TES/GS pos0 export.
- ac/nir/ngg: Refactor mesh shader primitive export.
- ac/nir/ngg: Wait for attribute ring stores in mesh shaders.
- ac/nir/ngg: Extract nogs_export_vertex_params function.
- ac/gpu_info: Add some SDMA related information.
- ac: Clarify SDMA opcode defines.
- ac: Add amd_ip_type argument to ac_parse_ib and ac_parse_ib_chunk.
- ac: Rename ac_do_parse_ib to parse_pkt3_ib.
- ac: Print IP type for IBs.
- ac: Add rudimentary implementation of printing SDMA IBs.
- radv: Rename SDMA file to radv_sdma.c
- radv: Use const device argument in radv_sdma_copy_buffer.
- radv: Use const on vi_alpha_is_on_msb arguments.
- radv: Only call si_cp_dma_wait_for_idle on GFX and ACE queues.
- radv: Move radv_cp_wait_mem to radv_cs.h and add queue family argument.
- radv: Refactor WRITE_DATA helper function.
- radv: Use new WRITE_DATA helper in more places.
- radv: Add queue family argument to some functions.
- radv: Wait for bottom of pipe in ACE gang wait postamble.
- radv: Simplify gang CS and semaphore initialization.
- radv: Allow gang submit use cases other than task shaders.
- radv: Slightly refactor gang semaphore functions.
- radv: Add gang follower semaphore functions.
- radv: Support SDMA in radv_cs_write_data_head.
- radv: Support SDMA in radv_cp_wait_mem.
- radv: Support SDMA in si_cs_emit_write_event_eop.
Vignesh Raman (4):
- ci: add Vignesh Raman into restricted traces access list
- Do explicit cast to suppress clang warnings
- ci: enforce -Wimplicit-const-int-float-conversion for clang
- ci: Uprev crosvm
Vinson Lee (8):
- nvk: Fix assert
- lavapipe: Fix struct initialization
- intel/decoder: Fix memory leak on error path
- nv50: Remove unused value
- vk/wsi/x11: Remove dead code
- freedreno/replay: Fix implicit-function-declaration error
- anv: Fix transfer type assert
- broadcom/qpu: Remove duplicate variable opcode
Vitaliy Triang3l Kuzmin (3):
- r600/asm: Fix AR force_add_cf setting if a clause is not open
- r600/asm: Make sure MOVA and SET_CF_IDX are in the same clause
- r600: Replace R600_BIG_ENDIAN with UTIL_ARCH_BIG_ENDIAN
Vlad Schiller (15):
- pvr: Implement VK_EXT_tooling_info
- pvr: Add 'info' PVR_DEBUG flag
- pvr: Implement VK_KHR_format_feature_flags2
- pvr: Remove PVR_WINSYS_BO_FLAG_ZERO_ON_ALLOC flag
- pvr: Add VK_KHR_driver_properties
- pvr: Use correct index when writing query availability data
- pvr: Enable VK_EXT_scalar_block_layout
- pvr: Enable KHR_image_format_list
- pvr: Enable VK_KHR_uniform_buffer_standard_layout
- pvr: Implement VK_KHR_external_fence
- pvr: Implement VK_KHR_external_semaphore
- pvr: Enable VK_KHR_bind_memory2 extension
- pvr: Implement VK_EXT_texel_buffer_alignment
- pvr: Implement VK_EXT_host_query_reset
- pvr: Fix VK_EXT_texel_buffer_alignment
WinLinux1028 (1):
- radeonsi: prefix function with si\_ to prevent name collision
Xaver Hugl (1):
- vulkan wsi: add support for PresentOptionAsyncMayTear
Yiwei Zhang (46):
- venus: handle query feedback creation failure
- venus: ensure consistency of query overflow behavior
- venus: add a missing barrier before copying query feedback
- venus: refactor query feedback cmd record
- venus: reduce to use 4K mem suballoc align on platforms known to fit
- turnip: flush cache for dstBuffer in vkCmdCopyQueryPoolResults
- lvp: avoid reading immutable sampler from desc write info
- ci/venus: update venus-lavapipe expectations
- venus: fix a cmd builder render_pass state leak across reset
- venus: fix cmd state leak across implicit reset
- venus: log and doc the broken query feedback in suspended render pass
- venus: move transient storage from cmd to pool
- venus: remove redundant fb tracking from cmd builder
- venus: use tracked queue_family_index from the cmd pool
- venus: cleanup vn_cmd_begin_render_pass usage
- venus: add helpers to track subpass view mask
- venus: avoid redundant tracking of render pass
- venus: refactor more cmd states into cmd builder
- venus: use in_render_pass to skip present_src counting
- ci/venus: remove fixed tests that no longer run
- ci/venus: reenable pipeline cts
- venus: suppress a false logging
- venus: add no_sparse debug option to disable sparse resource support
- venus: set deviceMemoryReport feature
- venus: expose at least one cached memory type
- venus: expose KHR_external_fence/sempahore_fd extensions
- venus: fix a device memory report leak
- vulkan: remove a dup entry from vk_image_usage_to_ahb_usage
- vulkan/android: improve vkQueueSignalReleaseImageANDROID
- vulkan/android: add missing AHARDWAREBUFFER_USAGE_GPU_DATA_BUFFER usage
- vulkan/android: drop vk_buffer dependency from common AHB impl
- venus: use common vk_queue object
- venus: use common ANB implementation
- venus: use more common vk_queue related implementations
- venus: drop device, family, index, flags tracking from vn_queue
- venus: fix re-export of imported classic 3d resources
- venus: remove redundant bo roundtrip and add more docs
- venus: track VkPhysicalDeviceMemoryProperties instead
- venus: refactor vn_device_memory to prepare for async alloc
- venus: make device memory alloc async
- venus: enable Vulkan 1.3 for Android 13 and above
- zink: sync queue access for vkQueueWaitIdle
- venus: properly expose KHR_external_fence/sempahore_fd
- ci/venus: mark more flaky tests after recent cts uprev
- venus: fix query feedback batch leak and race upon submission
- zink: apply can_do_invalid_linear_modifier to Venus
Yogesh Mohan Marimuthu (12):
- gallium: remove start_slot parameter from pipe_context::set_vertex_buffers
- ac/surface: add astc block size to bpe_to_format() function
- util: move ASTCLutHolder from mesa/main to util
- vulkan/formats,zink: move vk_format_from_pipe_format() function
- vulkan/runtime: add compute astc decoder helper functions
- vulkan add 3D texture support for compute astc decoder
- radv: integrate meta astc compute decoder to radv
- radeonsi: add more documentation for dpbb debug env variable
- docs: remove document for unused variable dfsm from AMD_DEBUG
- radeonsi: correct old comment in si_emit_framebuffer_state()
- radeonsi: In gfx6_init_gfx_preamble_state() use gfx_level only from sctx
- radeonsi: add radeonsi to GL_RENDERER string
Yonggang Luo (43):
- lima: Convert to use nir_foreach_function_impl when possible
- freedreno: Switch to use nir_foreach_function_impl in tu_shader.cc
- zink: Convert to use nir_foreach_function_impl when possible
- lavapipe: Convert to use nir_foreach_function_impl
- lavapipe: fixes indent of function lvp_inline_uniforms
- microsoft/compiler: convert to use nir_foreach_function_with_impl in function emit_module
- microsoft/clc/compiler: Convert to use nir_foreach_function_impl when possible
- radeonsi: Convert to use nir_foreach_function_impl
- ac: Switch to use nir_foreach_function_impl in function analyze_shader_before_culling
- util: Move pipe_swizzle from p_defines.h to u_formats.h
- util: Move PIPE_MASK_* from p_defines.h to u_formats.h
- util: Move pipe_color_union from p_defines.h into u_formats.h
- util: Move u_pack_color.h and dbughelp.h into src/util from/src/gallium/auxiliary/util/
- util: Remove include "pipe/\*.h" in src/util/* files
- util:Move only gallium used u_debug_refcnt.* and u_debug_describe.* into src/gallium/auxiliary/util/
- util/meson: Getting mesa util core to be self contained
- pvr: decouple vulkan driver and compiler from gallium
- freedreno: decouple compiler and vulkan driver from gallium
- glx: decouple from gallium
- meson: Remove arm_neon_workaround
- nouveau/drm-shim: Decouple from gallium
- ac/radv: decouple radv vulkan driver and compiler from gallium
- etnaviv: decouple drm from gallium
- asahi: decouple layout from gallium
- compiler: Move WRITEMASK_* from prog_instruction.h into shader_enums.h
- intel/blorp: Use float directly to avoid #include "mesa/main/format_utils.h"
- intel/blorp: brw_sampler_prog_key_data::swizzles is only and should only accessed in crocus
- intel/brw: Define and use BRW_SWIZZLE_* instead of SWIZZLE_*
- crocus: #include "program/prog_instruction.h" for SWIZZLE_*
- intel/compiler,intel/blorp,intel/vulkan: decouple vulkan driver and compiler from gallium
- util/treewide: Use alignas(x) instead __attribute__((aligned(x)))
- v3dv: Use alignas(8) over 64 bit atomic value
- svga: use alignas over struct MKSGuestStatInfoEntry
- radv: Fixes mingw linkage error undefined reference to \`radv_GetCalibratedTimestampsEXT'
- v3d: Use DIV_ROUND_UP instead div_round_up
- freedreno: Use shared DIV_ROUND_UP instead div_round_up
- sfn: Use 4 instead of ATOMIC_COUNTER_SIZE
- intel/brw: use 4 instead of MAX_VERTEX_STREAMS to avoid #include "mesa/main/config.h"
- d3d12: replace use of MAX_VERTEX_STREAMS with PIPE_MAX_VERTEX_STREAMS
- compiler: use 4 instead ATOMIC_COUNTER_SIZE in glsl_types.h to avoid #include "mesa/main/config.h"
- compiler/glsl: Move glsl_print_type from glsl_types.* to ir_print_visitor.cpp
- util: Deduplicate macros between u_math.h and macros.h
- nvk: Should use alignment instead of align
Yusuf Khan (4):
- nouveau/ws: remove the drm.h header
- nvk: implement GetDeviceMemoryCommitment
- nvk: support GetImageSparseMemoryRequirements2
- nvk: expose KHR_driver_properties
Zhang Ning (1):
- Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR"
antonino (14):
- virgl: add ci flake
- freedreno: add ci flake
- zink: remove unused indices from \`nir_load_push_constant` calls
- zink/nir: add a zink specific intrinsic for push constants
- vulkan/wsi: add \`vk_wsi_force_swapchain_to_current_extent` driconf
- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "The Talos Principle"
- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "Serious Sam Fusion"
- vulkan: Extend vkGet/SetPrivateDataEXT handling to all platforms
- vulkan: Extend vkGet/SetPrivateDataEXT handling to VkSurface
- vulkan: Handle vkSetDebugUtilsObjectNameEXT on WSI objects
- zink: store bindless var when creating it to avoid creating it again
- nir: fix several crashes in \`nir_lower_tex`
- nir: don't take the derivative of the array index in \`nir_lower_tex`
- vulkan: use instance allocator for \`object_name` in some objects
cheyang (1):
- isaspec : fix isaspec build error in aosp
georgeouzou (1):
- nvk: Support VK_EXT_line_rasterization
jazzfool (1):
- zink: Hash only first 32 bits of zink_gfx_pipeline_state with full DS3
lorn10 (1):
- docs: Update Clover's env variable documentation
norablackcat (2):
- spirv/nir_to_spirv: add expect assume op codes
- rusticl: add cl_khr_expect_assume
timmac-qmc (1):
- glsl: fix potential crash with DisableUniformArrayResize
twisted89 (1):
- util/driconf: add workarounds for the Chronicles of Riddick
wangra (1):
- tu/kgsl: Fix bitfield of DITHER_MODE_MRT6
xurui (1):
- glx: There is no need to psc++
|