1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 1032 1033 1034 1035 1036 1037 1038 1039 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057 1058 1059 1060 1061 1062 1063 1064 1065 1066 1067 1068 1069 1070 1071 1072 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085 1086 1087 1088 1089 1090 1091 1092 1093 1094 1095 1096 1097 1098 1099 1100 1101 1102 1103 1104 1105 1106 1107 1108 1109 1110 1111 1112 1113 1114 1115 1116 1117 1118 1119 1120 1121 1122 1123 1124 1125 1126 1127 1128 1129 1130 1131 1132 1133 1134 1135 1136 1137 1138 1139 1140 1141 1142 1143 1144 1145 1146 1147 1148 1149 1150 1151 1152 1153 1154 1155 1156 1157 1158 1159 1160 1161 1162 1163 1164 1165 1166 1167 1168 1169 1170 1171 1172 1173 1174 1175 1176 1177 1178 1179 1180 1181 1182 1183 1184 1185 1186 1187 1188 1189 1190 1191 1192 1193 1194 1195 1196 1197 1198 1199 1200 1201 1202 1203 1204 1205 1206 1207 1208 1209 1210 1211 1212 1213 1214 1215 1216 1217 1218 1219 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239 1240 1241 1242 1243 1244 1245 1246 1247 1248 1249 1250 1251 1252 1253 1254 1255 1256 1257 1258 1259 1260 1261 1262 1263 1264 1265 1266 1267 1268 1269 1270 1271 1272 1273 1274 1275 1276 1277 1278 1279 1280 1281 1282 1283 1284 1285 1286 1287 1288 1289 1290 1291 1292 1293 1294 1295 1296 1297 1298 1299 1300 1301 1302 1303 1304 1305 1306 1307 1308 1309 1310 1311 1312 1313 1314 1315 1316 1317 1318 1319 1320 1321 1322 1323 1324 1325 1326 1327 1328 1329 1330 1331 1332 1333 1334 1335 1336 1337 1338 1339 1340 1341 1342 1343 1344 1345 1346 1347 1348 1349 1350 1351 1352 1353 1354 1355 1356 1357 1358 1359 1360 1361 1362 1363 1364 1365 1366 1367 1368 1369 1370 1371 1372 1373 1374 1375 1376 1377 1378 1379 1380 1381 1382 1383 1384 1385 1386 1387 1388 1389 1390 1391 1392 1393 1394 1395 1396 1397 1398 1399 1400 1401 1402 1403 1404 1405 1406 1407 1408 1409 1410 1411 1412 1413 1414 1415 1416 1417 1418 1419 1420 1421 1422 1423 1424 1425 1426 1427 1428 1429 1430 1431 1432 1433 1434 1435 1436 1437 1438 1439 1440 1441 1442 1443 1444 1445 1446 1447 1448 1449 1450 1451 1452 1453 1454 1455 1456 1457 1458 1459 1460 1461 1462 1463 1464 1465 1466 1467 1468 1469 1470 1471 1472 1473 1474 1475 1476 1477 1478 1479 1480 1481 1482 1483 1484 1485 1486 1487 1488 1489 1490 1491 1492 1493 1494 1495 1496 1497 1498 1499 1500 1501 1502 1503 1504 1505 1506 1507 1508 1509 1510 1511 1512 1513 1514 1515 1516 1517 1518 1519 1520 1521 1522 1523 1524 1525 1526 1527 1528 1529 1530 1531 1532 1533 1534 1535 1536 1537 1538 1539 1540 1541 1542 1543 1544 1545 1546 1547 1548 1549 1550 1551 1552 1553 1554 1555 1556 1557 1558 1559 1560 1561 1562 1563 1564 1565 1566 1567 1568 1569 1570 1571 1572 1573 1574 1575 1576 1577 1578 1579 1580 1581 1582 1583 1584 1585 1586 1587 1588 1589 1590 1591 1592 1593 1594 1595 1596 1597 1598 1599 1600 1601 1602 1603 1604 1605 1606 1607 1608 1609 1610 1611 1612 1613 1614 1615 1616 1617 1618 1619 1620 1621 1622 1623 1624 1625 1626 1627 1628 1629 1630 1631 1632 1633 1634 1635 1636 1637 1638 1639 1640 1641 1642 1643 1644 1645 1646 1647 1648 1649 1650 1651 1652 1653 1654 1655 1656 1657 1658 1659 1660 1661 1662 1663 1664 1665 1666 1667 1668 1669 1670 1671 1672 1673 1674 1675 1676 1677 1678 1679 1680 1681 1682 1683 1684 1685 1686 1687 1688 1689 1690 1691 1692 1693 1694 1695 1696 1697 1698 1699 1700 1701 1702 1703 1704 1705 1706 1707 1708 1709 1710 1711 1712 1713 1714 1715 1716 1717 1718 1719 1720 1721 1722 1723 1724 1725 1726 1727 1728 1729 1730 1731 1732 1733 1734 1735 1736 1737 1738 1739 1740 1741 1742 1743 1744 1745 1746 1747 1748 1749 1750 1751 1752 1753 1754 1755 1756 1757 1758 1759 1760 1761 1762 1763 1764 1765 1766 1767 1768 1769 1770 1771 1772 1773 1774 1775 1776 1777 1778 1779 1780 1781 1782 1783 1784 1785 1786 1787 1788 1789 1790 1791 1792 1793 1794 1795 1796 1797 1798 1799 1800 1801 1802 1803 1804 1805 1806 1807 1808 1809 1810 1811 1812 1813 1814 1815 1816 1817 1818 1819 1820 1821 1822 1823 1824 1825 1826 1827 1828 1829 1830 1831 1832 1833 1834 1835 1836 1837 1838 1839 1840 1841 1842 1843 1844 1845 1846 1847 1848 1849 1850 1851 1852 1853 1854 1855 1856 1857 1858 1859 1860 1861 1862 1863 1864 1865 1866 1867 1868 1869 1870 1871 1872 1873 1874 1875 1876 1877 1878 1879 1880 1881 1882 1883 1884 1885 1886 1887 1888 1889 1890 1891 1892 1893 1894 1895 1896 1897 1898 1899 1900 1901 1902 1903 1904 1905 1906 1907 1908 1909 1910 1911 1912 1913 1914 1915 1916 1917 1918 1919 1920 1921 1922 1923 1924 1925 1926 1927 1928 1929 1930 1931 1932 1933 1934 1935 1936 1937 1938 1939 1940 1941 1942 1943 1944 1945 1946 1947 1948 1949 1950 1951 1952 1953 1954 1955 1956 1957 1958 1959 1960 1961 1962 1963 1964 1965 1966 1967 1968 1969 1970 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 2025 2026 2027 2028 2029 2030 2031 2032 2033 2034 2035 2036 2037 2038 2039 2040 2041 2042 2043 2044 2045 2046 2047 2048 2049 2050 2051 2052 2053 2054 2055 2056 2057 2058 2059 2060 2061 2062 2063 2064 2065 2066 2067 2068 2069 2070 2071 2072 2073 2074 2075 2076 2077 2078 2079 2080 2081 2082 2083 2084 2085 2086 2087 2088 2089 2090 2091 2092 2093 2094 2095 2096 2097 2098 2099 2100 2101 2102 2103 2104 2105 2106 2107 2108 2109 2110 2111 2112 2113 2114 2115 2116 2117 2118 2119 2120 2121 2122 2123 2124 2125 2126 2127 2128 2129 2130 2131 2132 2133 2134 2135 2136 2137 2138 2139 2140 2141 2142 2143 2144 2145 2146 2147 2148 2149 2150 2151 2152 2153 2154 2155 2156 2157 2158 2159 2160 2161 2162 2163 2164 2165 2166 2167 2168 2169 2170 2171 2172 2173 2174 2175 2176 2177 2178 2179 2180 2181 2182 2183 2184 2185 2186 2187 2188 2189 2190 2191 2192 2193 2194 2195 2196 2197 2198 2199 2200 2201 2202 2203 2204 2205 2206 2207 2208 2209 2210 2211 2212 2213 2214 2215 2216 2217 2218 2219 2220 2221 2222 2223 2224 2225 2226 2227 2228 2229 2230 2231 2232 2233 2234 2235 2236 2237 2238 2239 2240 2241 2242 2243 2244 2245 2246 2247 2248 2249 2250 2251 2252 2253 2254 2255 2256 2257 2258 2259 2260 2261 2262 2263 2264 2265 2266 2267 2268 2269 2270 2271 2272 2273 2274 2275 2276 2277 2278 2279 2280 2281 2282 2283 2284 2285 2286 2287 2288 2289 2290 2291 2292 2293 2294 2295 2296 2297 2298 2299 2300 2301 2302 2303 2304 2305 2306 2307 2308 2309 2310 2311 2312 2313 2314 2315 2316 2317 2318 2319 2320 2321 2322 2323 2324 2325 2326 2327 2328 2329 2330 2331 2332 2333 2334 2335 2336 2337 2338 2339 2340 2341 2342 2343 2344 2345 2346 2347 2348 2349 2350 2351 2352 2353 2354 2355 2356 2357 2358 2359 2360 2361 2362 2363 2364 2365 2366 2367 2368 2369 2370 2371 2372 2373 2374 2375 2376 2377 2378 2379 2380 2381 2382 2383 2384 2385 2386 2387 2388 2389 2390 2391 2392 2393 2394 2395 2396 2397 2398 2399 2400 2401 2402 2403 2404 2405 2406 2407 2408 2409 2410 2411 2412 2413 2414 2415 2416 2417 2418 2419 2420 2421 2422 2423 2424 2425 2426 2427 2428 2429 2430 2431 2432 2433 2434 2435 2436 2437 2438 2439 2440 2441 2442 2443 2444 2445 2446 2447 2448 2449 2450 2451 2452 2453 2454 2455 2456 2457 2458 2459 2460 2461 2462 2463 2464 2465 2466 2467 2468 2469 2470 2471 2472 2473 2474 2475 2476 2477 2478 2479 2480 2481 2482 2483 2484 2485 2486 2487 2488 2489 2490 2491 2492 2493 2494 2495 2496 2497 2498 2499 2500 2501 2502 2503 2504 2505 2506 2507 2508 2509 2510 2511 2512 2513 2514 2515 2516 2517 2518 2519 2520 2521 2522 2523 2524 2525 2526 2527 2528 2529 2530 2531 2532 2533 2534 2535 2536 2537 2538 2539 2540 2541 2542 2543 2544 2545 2546 2547 2548 2549 2550 2551 2552 2553 2554 2555 2556 2557 2558 2559 2560 2561 2562 2563 2564 2565 2566 2567 2568 2569 2570 2571 2572 2573 2574 2575 2576 2577 2578 2579 2580 2581 2582 2583 2584 2585 2586 2587 2588 2589 2590 2591 2592 2593 2594 2595 2596 2597 2598 2599 2600 2601 2602 2603 2604 2605 2606 2607 2608 2609 2610 2611 2612 2613 2614 2615 2616 2617 2618 2619 2620 2621 2622 2623 2624 2625 2626 2627 2628 2629 2630 2631 2632 2633 2634 2635 2636 2637 2638 2639 2640 2641 2642 2643 2644 2645 2646 2647 2648 2649 2650 2651 2652 2653 2654 2655 2656 2657 2658 2659 2660 2661 2662 2663 2664 2665 2666 2667 2668 2669 2670 2671 2672 2673 2674 2675 2676 2677 2678 2679 2680 2681 2682 2683 2684 2685 2686 2687 2688 2689 2690 2691 2692 2693 2694 2695 2696 2697 2698 2699 2700 2701 2702 2703 2704 2705 2706 2707 2708 2709 2710 2711 2712 2713 2714 2715 2716 2717 2718 2719 2720 2721 2722 2723 2724 2725 2726 2727 2728 2729 2730 2731 2732 2733 2734 2735 2736 2737 2738 2739 2740 2741 2742 2743 2744 2745 2746 2747 2748 2749 2750 2751 2752 2753 2754 2755 2756 2757 2758 2759 2760 2761 2762 2763 2764 2765 2766 2767 2768 2769 2770 2771 2772 2773 2774 2775 2776 2777 2778 2779 2780 2781 2782 2783 2784 2785 2786 2787 2788 2789 2790 2791 2792 2793 2794 2795 2796 2797 2798 2799 2800 2801 2802 2803 2804 2805 2806 2807 2808 2809 2810 2811 2812 2813 2814 2815 2816 2817 2818 2819 2820 2821 2822 2823 2824 2825 2826 2827 2828 2829 2830 2831 2832 2833 2834 2835 2836 2837 2838 2839 2840 2841 2842 2843 2844 2845 2846 2847 2848 2849 2850 2851 2852 2853 2854 2855 2856 2857 2858 2859 2860 2861 2862 2863 2864 2865 2866 2867 2868 2869 2870 2871 2872 2873 2874 2875 2876 2877 2878 2879 2880 2881 2882 2883 2884 2885 2886 2887 2888 2889 2890 2891 2892 2893 2894 2895 2896 2897 2898 2899 2900 2901 2902 2903 2904 2905 2906 2907 2908 2909 2910 2911 2912 2913 2914 2915 2916 2917 2918 2919 2920 2921 2922 2923 2924 2925 2926 2927 2928 2929 2930 2931 2932 2933 2934 2935 2936 2937 2938 2939 2940 2941 2942 2943 2944 2945 2946 2947 2948 2949 2950 2951 2952 2953 2954 2955 2956 2957 2958 2959 2960 2961 2962 2963 2964 2965 2966 2967 2968 2969 2970 2971 2972 2973 2974 2975 2976 2977 2978 2979 2980 2981 2982 2983 2984 2985 2986 2987 2988 2989 2990 2991 2992 2993 2994 2995 2996 2997 2998 2999 3000 3001 3002 3003 3004 3005 3006 3007 3008 3009 3010 3011 3012 3013 3014 3015 3016 3017 3018 3019 3020 3021 3022 3023 3024 3025 3026 3027 3028 3029 3030 3031 3032 3033 3034 3035 3036 3037 3038 3039 3040 3041 3042 3043 3044 3045 3046 3047 3048 3049 3050 3051 3052 3053 3054 3055 3056 3057 3058 3059 3060 3061 3062 3063 3064 3065 3066 3067 3068 3069 3070 3071 3072 3073 3074 3075 3076 3077 3078 3079 3080 3081 3082 3083 3084 3085 3086 3087 3088 3089 3090 3091 3092 3093 3094 3095 3096 3097 3098 3099 3100 3101 3102 3103 3104 3105 3106 3107 3108 3109 3110 3111 3112 3113 3114 3115 3116 3117 3118 3119 3120 3121 3122 3123 3124 3125 3126 3127 3128 3129 3130 3131 3132 3133 3134 3135 3136 3137 3138 3139 3140 3141 3142 3143 3144 3145 3146 3147 3148 3149 3150 3151 3152 3153 3154 3155 3156 3157 3158 3159 3160 3161 3162 3163 3164 3165 3166 3167 3168 3169 3170 3171 3172 3173 3174 3175 3176 3177 3178 3179 3180 3181 3182 3183 3184 3185 3186 3187 3188 3189 3190 3191 3192 3193 3194 3195 3196 3197 3198 3199 3200 3201 3202 3203 3204 3205 3206 3207 3208 3209 3210 3211 3212 3213 3214 3215 3216 3217 3218 3219 3220 3221 3222 3223 3224 3225 3226 3227 3228 3229 3230 3231 3232 3233 3234 3235 3236 3237 3238 3239 3240 3241 3242 3243 3244 3245 3246 3247 3248 3249 3250 3251 3252 3253 3254 3255 3256 3257 3258 3259 3260 3261 3262 3263 3264 3265 3266 3267 3268 3269 3270 3271 3272 3273 3274 3275 3276 3277 3278 3279 3280 3281 3282 3283 3284 3285 3286 3287 3288 3289 3290 3291 3292 3293 3294 3295 3296 3297 3298 3299 3300 3301 3302 3303 3304 3305 3306 3307 3308 3309 3310 3311 3312 3313 3314 3315 3316 3317 3318 3319 3320 3321 3322 3323 3324 3325 3326 3327 3328 3329 3330 3331 3332 3333 3334 3335 3336 3337 3338 3339 3340 3341 3342 3343 3344 3345 3346 3347 3348 3349 3350 3351 3352 3353 3354 3355 3356 3357 3358 3359 3360 3361 3362 3363 3364 3365 3366 3367 3368 3369 3370 3371 3372 3373 3374 3375 3376 3377 3378 3379 3380 3381 3382 3383 3384 3385 3386 3387 3388 3389 3390 3391 3392 3393 3394 3395 3396 3397 3398 3399 3400 3401 3402 3403 3404 3405 3406 3407 3408 3409 3410 3411 3412 3413 3414 3415 3416 3417 3418 3419 3420 3421 3422 3423 3424 3425 3426 3427 3428 3429 3430 3431 3432 3433 3434 3435 3436 3437 3438 3439 3440 3441 3442 3443 3444 3445 3446 3447 3448 3449 3450 3451 3452 3453 3454 3455 3456 3457 3458 3459 3460 3461 3462 3463 3464 3465 3466 3467 3468 3469 3470 3471 3472 3473 3474 3475 3476 3477 3478 3479 3480 3481 3482 3483 3484 3485 3486 3487 3488 3489 3490 3491 3492 3493 3494 3495 3496 3497 3498 3499 3500 3501 3502 3503 3504 3505 3506 3507 3508 3509 3510 3511 3512 3513 3514 3515 3516 3517 3518 3519 3520 3521 3522 3523 3524 3525 3526 3527 3528 3529 3530 3531 3532 3533 3534 3535 3536 3537 3538 3539 3540 3541 3542 3543 3544 3545 3546 3547 3548 3549 3550 3551 3552 3553 3554 3555 3556 3557 3558 3559 3560 3561 3562 3563 3564 3565 3566 3567 3568 3569 3570 3571 3572 3573 3574 3575 3576 3577 3578 3579 3580 3581 3582 3583 3584 3585 3586 3587 3588 3589 3590 3591 3592 3593 3594 3595 3596 3597 3598 3599 3600 3601 3602 3603 3604 3605 3606 3607 3608 3609 3610 3611 3612 3613 3614 3615 3616 3617 3618 3619 3620 3621 3622 3623 3624 3625 3626 3627 3628 3629 3630 3631 3632 3633 3634 3635 3636 3637 3638 3639 3640 3641 3642 3643 3644 3645 3646 3647 3648 3649 3650 3651 3652 3653 3654 3655 3656 3657 3658 3659 3660 3661 3662 3663 3664 3665 3666 3667 3668 3669 3670 3671 3672 3673 3674 3675 3676 3677 3678 3679 3680 3681 3682 3683 3684 3685 3686 3687 3688 3689 3690 3691 3692 3693 3694 3695 3696 3697 3698 3699 3700 3701 3702 3703 3704 3705 3706 3707 3708 3709 3710 3711 3712 3713 3714 3715 3716 3717 3718 3719 3720 3721 3722 3723 3724 3725 3726 3727 3728 3729 3730 3731 3732 3733 3734 3735 3736 3737 3738 3739 3740 3741 3742 3743 3744 3745 3746 3747 3748 3749 3750 3751 3752 3753 3754 3755 3756 3757 3758 3759 3760 3761 3762 3763 3764 3765 3766 3767 3768 3769 3770 3771 3772 3773 3774 3775 3776 3777 3778 3779 3780 3781 3782 3783 3784 3785 3786 3787 3788 3789 3790 3791 3792 3793 3794 3795 3796 3797 3798 3799 3800 3801 3802 3803 3804 3805 3806 3807 3808 3809 3810 3811 3812 3813 3814 3815 3816 3817 3818 3819 3820 3821 3822 3823 3824 3825 3826 3827 3828 3829 3830 3831 3832 3833 3834 3835 3836 3837 3838 3839 3840 3841 3842 3843 3844 3845 3846 3847 3848 3849 3850 3851 3852 3853 3854 3855 3856 3857 3858 3859 3860 3861 3862 3863 3864 3865 3866 3867 3868 3869 3870 3871 3872 3873 3874 3875 3876 3877 3878 3879 3880 3881 3882 3883 3884 3885 3886 3887 3888 3889 3890 3891 3892 3893 3894 3895 3896 3897 3898 3899 3900 3901 3902 3903 3904 3905 3906 3907 3908 3909 3910 3911 3912 3913 3914 3915 3916 3917 3918 3919 3920 3921 3922 3923 3924 3925 3926 3927 3928 3929 3930 3931 3932 3933 3934 3935 3936 3937 3938 3939 3940 3941 3942 3943 3944 3945 3946 3947 3948 3949 3950 3951 3952 3953 3954 3955 3956 3957 3958 3959 3960 3961 3962 3963 3964 3965 3966 3967 3968 3969 3970 3971 3972 3973 3974 3975 3976 3977 3978 3979 3980 3981 3982 3983 3984 3985 3986 3987 3988 3989 3990 3991 3992 3993 3994 3995 3996 3997 3998 3999 4000 4001 4002 4003 4004 4005 4006 4007 4008 4009 4010 4011 4012 4013 4014 4015 4016 4017 4018 4019 4020 4021 4022 4023 4024 4025 4026 4027 4028 4029 4030 4031 4032 4033 4034 4035 4036 4037 4038 4039 4040 4041 4042 4043 4044 4045 4046 4047 4048 4049 4050 4051 4052 4053 4054 4055 4056 4057 4058 4059 4060 4061 4062 4063 4064 4065 4066 4067 4068 4069 4070 4071 4072 4073 4074 4075 4076 4077 4078 4079 4080 4081 4082 4083 4084 4085 4086 4087 4088 4089 4090 4091 4092 4093 4094 4095 4096 4097 4098 4099 4100 4101 4102 4103 4104 4105 4106 4107 4108 4109 4110 4111 4112 4113 4114 4115 4116 4117 4118 4119 4120 4121 4122 4123 4124 4125 4126 4127 4128 4129 4130 4131 4132 4133 4134 4135 4136 4137 4138 4139 4140 4141 4142 4143 4144 4145 4146 4147 4148 4149 4150 4151 4152 4153 4154 4155 4156 4157 4158 4159 4160 4161 4162 4163 4164 4165 4166 4167 4168 4169 4170 4171 4172 4173 4174 4175 4176 4177 4178 4179 4180 4181 4182 4183 4184 4185 4186 4187 4188 4189 4190 4191 4192 4193 4194 4195 4196 4197 4198 4199 4200 4201 4202 4203 4204 4205 4206 4207 4208 4209 4210 4211 4212 4213 4214 4215 4216 4217 4218 4219 4220 4221 4222 4223 4224 4225 4226 4227 4228 4229 4230 4231 4232 4233 4234 4235 4236 4237 4238 4239 4240 4241 4242 4243 4244 4245 4246 4247 4248 4249 4250 4251 4252 4253 4254 4255 4256 4257 4258 4259 4260 4261 4262 4263 4264 4265 4266 4267 4268 4269 4270 4271 4272 4273 4274 4275 4276 4277 4278 4279 4280 4281 4282 4283 4284 4285 4286 4287 4288 4289 4290 4291 4292 4293 4294 4295 4296 4297 4298 4299 4300 4301 4302 4303 4304 4305 4306 4307 4308 4309 4310 4311 4312 4313 4314 4315 4316 4317 4318 4319 4320 4321 4322 4323 4324 4325 4326 4327 4328 4329 4330 4331 4332 4333 4334 4335 4336 4337 4338 4339 4340 4341 4342 4343 4344 4345 4346 4347 4348 4349 4350 4351 4352 4353 4354 4355 4356 4357 4358 4359 4360 4361 4362 4363 4364 4365 4366 4367 4368 4369 4370 4371 4372 4373 4374 4375 4376 4377 4378 4379 4380 4381 4382 4383 4384 4385 4386 4387 4388 4389 4390 4391 4392 4393 4394 4395 4396 4397 4398 4399 4400 4401 4402 4403 4404 4405 4406 4407 4408 4409 4410 4411 4412 4413 4414 4415 4416 4417 4418 4419 4420 4421 4422 4423 4424 4425 4426 4427 4428 4429 4430 4431 4432 4433 4434 4435 4436 4437 4438 4439 4440 4441 4442 4443 4444 4445 4446 4447 4448 4449 4450 4451 4452 4453 4454 4455 4456 4457 4458 4459 4460 4461 4462 4463 4464 4465 4466 4467 4468 4469 4470 4471 4472 4473 4474 4475 4476 4477 4478 4479 4480 4481 4482 4483 4484 4485 4486 4487 4488 4489 4490 4491 4492 4493 4494 4495 4496 4497 4498 4499 4500 4501 4502 4503 4504 4505 4506 4507 4508 4509 4510 4511 4512 4513 4514 4515 4516 4517 4518 4519 4520 4521 4522 4523 4524 4525 4526 4527 4528 4529 4530 4531 4532 4533 4534 4535 4536 4537 4538 4539 4540 4541 4542 4543 4544 4545 4546 4547 4548 4549 4550 4551 4552 4553 4554 4555 4556 4557 4558 4559 4560 4561 4562 4563 4564 4565 4566 4567 4568 4569 4570 4571 4572 4573 4574 4575 4576 4577 4578 4579 4580 4581 4582 4583 4584 4585 4586 4587 4588 4589 4590 4591 4592 4593 4594 4595 4596 4597 4598 4599 4600 4601 4602 4603 4604 4605 4606 4607 4608 4609 4610 4611 4612 4613 4614 4615 4616 4617 4618 4619 4620 4621 4622 4623 4624 4625 4626 4627 4628 4629 4630 4631 4632 4633 4634 4635 4636 4637 4638 4639 4640 4641 4642 4643 4644 4645 4646 4647 4648 4649 4650 4651 4652 4653 4654 4655 4656 4657 4658 4659 4660 4661 4662 4663 4664 4665 4666 4667 4668 4669 4670 4671 4672 4673 4674 4675 4676 4677 4678 4679 4680 4681 4682 4683 4684 4685 4686 4687 4688 4689 4690 4691 4692 4693 4694 4695 4696 4697 4698 4699 4700 4701 4702 4703 4704 4705 4706 4707 4708 4709 4710 4711 4712 4713 4714 4715 4716 4717 4718 4719 4720 4721 4722 4723 4724 4725 4726 4727 4728 4729 4730 4731 4732 4733 4734 4735 4736 4737 4738 4739 4740 4741 4742 4743 4744 4745 4746 4747 4748 4749 4750 4751 4752 4753 4754 4755 4756 4757 4758 4759 4760 4761 4762 4763 4764 4765 4766 4767 4768 4769 4770 4771 4772 4773 4774 4775 4776 4777 4778 4779 4780 4781 4782 4783 4784 4785 4786 4787 4788 4789 4790 4791 4792 4793 4794 4795 4796 4797 4798 4799 4800 4801 4802 4803 4804 4805 4806 4807 4808 4809 4810 4811 4812 4813 4814 4815 4816 4817 4818 4819 4820 4821 4822 4823 4824 4825 4826 4827 4828 4829 4830 4831 4832 4833 4834 4835 4836 4837 4838 4839 4840 4841 4842 4843 4844 4845 4846 4847 4848 4849 4850 4851 4852 4853 4854 4855 4856 4857 4858 4859 4860 4861 4862 4863 4864 4865 4866 4867 4868 4869 4870 4871 4872 4873 4874 4875 4876 4877 4878 4879 4880 4881 4882 4883 4884 4885 4886 4887 4888 4889 4890 4891 4892 4893 4894 4895 4896 4897 4898 4899 4900 4901 4902 4903 4904 4905 4906 4907 4908 4909 4910 4911 4912 4913 4914 4915 4916 4917 4918 4919 4920 4921 4922 4923 4924 4925 4926 4927 4928 4929 4930 4931 4932 4933 4934 4935 4936 4937 4938 4939 4940 4941 4942 4943 4944 4945 4946 4947 4948 4949 4950 4951 4952 4953 4954 4955 4956 4957 4958 4959 4960 4961 4962 4963 4964 4965 4966 4967 4968 4969 4970 4971 4972 4973 4974 4975 4976 4977 4978 4979 4980 4981 4982 4983 4984 4985 4986 4987 4988 4989 4990 4991 4992 4993 4994 4995 4996 4997 4998 4999 5000 5001 5002 5003 5004 5005 5006 5007 5008 5009 5010 5011 5012 5013 5014 5015 5016 5017 5018 5019 5020 5021 5022 5023 5024 5025 5026 5027 5028 5029 5030 5031 5032 5033 5034 5035 5036 5037 5038 5039 5040 5041 5042 5043 5044 5045 5046 5047 5048 5049 5050 5051 5052 5053 5054 5055 5056 5057 5058 5059 5060 5061 5062 5063 5064 5065 5066 5067 5068 5069 5070 5071 5072 5073 5074 5075 5076 5077 5078 5079 5080 5081 5082 5083 5084 5085 5086 5087 5088 5089 5090 5091 5092 5093 5094 5095 5096 5097 5098 5099 5100 5101 5102 5103 5104 5105 5106 5107 5108 5109 5110 5111 5112 5113 5114 5115 5116 5117 5118 5119 5120 5121 5122 5123 5124 5125 5126 5127 5128 5129 5130 5131 5132 5133 5134 5135 5136 5137 5138 5139 5140 5141 5142 5143 5144 5145 5146 5147 5148 5149 5150 5151 5152 5153 5154 5155 5156 5157 5158 5159 5160 5161 5162 5163 5164 5165 5166 5167 5168 5169 5170 5171 5172 5173 5174 5175 5176 5177 5178 5179 5180 5181 5182 5183 5184 5185 5186 5187 5188 5189 5190 5191 5192 5193 5194 5195 5196 5197 5198 5199 5200 5201 5202 5203 5204 5205 5206 5207 5208 5209 5210 5211 5212 5213 5214 5215 5216 5217 5218 5219 5220 5221 5222 5223 5224 5225 5226 5227 5228 5229 5230 5231 5232 5233 5234 5235 5236 5237 5238 5239 5240 5241 5242 5243 5244 5245 5246 5247 5248 5249 5250 5251 5252 5253 5254 5255 5256 5257 5258 5259 5260 5261 5262 5263 5264 5265 5266 5267 5268 5269 5270 5271 5272 5273 5274 5275 5276 5277 5278 5279 5280 5281 5282 5283 5284 5285 5286 5287 5288 5289 5290 5291 5292 5293 5294 5295 5296 5297 5298 5299 5300 5301 5302 5303 5304 5305 5306 5307 5308 5309 5310 5311 5312 5313 5314 5315 5316 5317 5318 5319 5320 5321 5322 5323 5324 5325 5326 5327 5328 5329 5330 5331 5332 5333 5334 5335 5336 5337 5338 5339 5340 5341 5342 5343 5344 5345 5346 5347 5348 5349 5350 5351 5352 5353 5354 5355 5356 5357 5358 5359 5360 5361 5362 5363 5364 5365 5366 5367 5368 5369 5370 5371 5372 5373 5374 5375 5376 5377 5378 5379 5380 5381 5382 5383 5384 5385 5386 5387 5388 5389 5390 5391 5392 5393 5394 5395 5396 5397 5398 5399 5400 5401 5402 5403 5404 5405 5406 5407 5408 5409 5410 5411 5412 5413 5414 5415 5416 5417 5418 5419 5420 5421 5422 5423 5424 5425 5426 5427 5428 5429 5430 5431 5432 5433 5434 5435 5436 5437 5438 5439 5440 5441 5442 5443 5444 5445 5446 5447 5448 5449 5450 5451 5452 5453 5454 5455 5456 5457 5458 5459 5460 5461 5462 5463 5464 5465 5466 5467 5468 5469 5470 5471 5472 5473 5474 5475 5476 5477 5478 5479 5480 5481 5482 5483 5484 5485 5486 5487 5488 5489 5490 5491 5492 5493 5494 5495 5496 5497 5498 5499 5500 5501 5502 5503 5504 5505 5506 5507 5508 5509 5510 5511 5512 5513 5514 5515 5516 5517 5518 5519 5520 5521 5522 5523 5524 5525 5526 5527 5528 5529 5530 5531 5532 5533 5534 5535 5536 5537 5538 5539 5540 5541 5542 5543 5544 5545 5546 5547 5548 5549 5550 5551 5552 5553 5554 5555 5556 5557 5558 5559 5560 5561 5562 5563 5564 5565 5566 5567 5568 5569 5570 5571 5572 5573 5574 5575 5576 5577 5578 5579 5580 5581 5582 5583 5584 5585 5586 5587 5588 5589 5590 5591 5592 5593 5594 5595 5596 5597 5598 5599 5600 5601 5602 5603 5604 5605 5606 5607 5608 5609 5610 5611 5612 5613 5614 5615 5616 5617 5618 5619 5620 5621 5622 5623 5624 5625 5626 5627 5628 5629 5630 5631 5632 5633 5634 5635 5636 5637 5638 5639 5640 5641 5642 5643 5644 5645 5646 5647 5648 5649 5650 5651 5652 5653 5654 5655 5656 5657 5658 5659 5660 5661 5662 5663 5664 5665 5666 5667 5668 5669 5670 5671 5672 5673 5674 5675 5676 5677 5678 5679 5680 5681 5682 5683 5684 5685 5686 5687 5688 5689 5690 5691 5692 5693 5694 5695 5696 5697 5698 5699 5700 5701 5702 5703 5704 5705 5706 5707 5708 5709 5710 5711 5712 5713 5714 5715 5716 5717 5718 5719 5720 5721 5722 5723 5724 5725 5726 5727 5728 5729 5730 5731 5732 5733 5734 5735 5736 5737 5738 5739 5740 5741 5742 5743 5744 5745 5746 5747 5748 5749 5750 5751 5752 5753 5754 5755 5756 5757 5758 5759 5760 5761 5762 5763 5764 5765 5766 5767 5768 5769 5770 5771 5772 5773 5774 5775 5776 5777 5778 5779 5780 5781 5782 5783 5784 5785 5786 5787 5788 5789 5790 5791 5792 5793 5794 5795 5796 5797 5798 5799 5800 5801 5802 5803 5804 5805 5806 5807 5808 5809 5810 5811 5812 5813 5814 5815 5816 5817 5818 5819 5820 5821 5822 5823 5824 5825 5826 5827 5828 5829 5830 5831 5832 5833 5834 5835 5836 5837 5838 5839 5840 5841 5842 5843 5844 5845 5846 5847 5848 5849 5850 5851 5852 5853 5854 5855 5856 5857 5858 5859 5860 5861 5862 5863 5864 5865 5866 5867 5868 5869 5870 5871 5872 5873 5874 5875 5876 5877 5878 5879 5880 5881 5882 5883 5884 5885 5886 5887 5888 5889 5890 5891 5892 5893 5894 5895 5896 5897 5898 5899 5900 5901 5902 5903 5904 5905 5906 5907 5908 5909 5910 5911 5912 5913 5914 5915 5916 5917 5918 5919 5920 5921 5922 5923 5924 5925 5926 5927 5928 5929 5930 5931 5932 5933 5934 5935 5936 5937 5938 5939 5940 5941 5942 5943 5944 5945 5946 5947 5948 5949 5950 5951 5952 5953 5954 5955 5956 5957 5958 5959 5960 5961 5962 5963 5964 5965 5966 5967 5968 5969 5970 5971 5972 5973 5974 5975 5976 5977 5978 5979 5980 5981 5982 5983 5984 5985 5986 5987 5988 5989 5990 5991 5992 5993 5994 5995 5996 5997 5998 5999 6000 6001 6002 6003 6004 6005 6006 6007 6008 6009 6010 6011 6012 6013 6014 6015 6016 6017 6018 6019 6020 6021 6022 6023 6024 6025 6026 6027 6028 6029 6030 6031 6032 6033 6034 6035 6036 6037 6038 6039 6040 6041 6042 6043 6044 6045 6046 6047 6048 6049 6050 6051 6052 6053 6054 6055 6056 6057 6058 6059 6060 6061 6062 6063 6064 6065 6066 6067 6068 6069 6070 6071 6072 6073 6074 6075 6076 6077 6078 6079 6080 6081 6082 6083 6084 6085 6086 6087 6088 6089 6090 6091 6092 6093 6094 6095 6096 6097 6098 6099 6100 6101 6102 6103 6104 6105 6106 6107 6108 6109 6110 6111 6112 6113 6114 6115 6116 6117 6118 6119 6120 6121 6122 6123 6124 6125 6126 6127 6128 6129 6130 6131 6132 6133 6134 6135 6136 6137 6138 6139 6140 6141 6142 6143 6144 6145 6146 6147 6148 6149 6150 6151 6152 6153 6154 6155 6156 6157 6158 6159 6160 6161 6162 6163 6164 6165 6166 6167 6168 6169 6170 6171 6172 6173 6174 6175 6176 6177 6178 6179 6180 6181 6182 6183 6184 6185 6186 6187 6188 6189 6190 6191 6192 6193 6194 6195 6196 6197 6198 6199 6200 6201 6202 6203 6204 6205 6206 6207 6208 6209 6210 6211 6212 6213 6214 6215 6216 6217 6218 6219 6220 6221 6222 6223 6224 6225 6226 6227 6228 6229 6230 6231 6232 6233 6234 6235 6236 6237 6238 6239 6240 6241 6242 6243 6244 6245 6246 6247 6248 6249 6250 6251 6252 6253 6254 6255 6256 6257 6258 6259 6260 6261 6262 6263 6264 6265 6266 6267 6268 6269 6270 6271 6272 6273 6274 6275 6276 6277 6278 6279 6280 6281 6282 6283 6284 6285 6286 6287 6288 6289 6290 6291 6292 6293 6294 6295 6296 6297 6298 6299 6300 6301 6302 6303 6304 6305 6306 6307 6308 6309 6310 6311 6312 6313 6314 6315 6316 6317 6318 6319 6320 6321 6322 6323 6324 6325 6326 6327 6328 6329 6330 6331 6332 6333 6334 6335 6336 6337 6338 6339 6340 6341 6342 6343 6344 6345 6346 6347 6348 6349 6350 6351 6352 6353 6354 6355 6356 6357 6358 6359 6360 6361 6362 6363 6364 6365 6366 6367 6368 6369 6370 6371 6372 6373 6374 6375 6376 6377 6378 6379 6380 6381 6382 6383 6384 6385 6386 6387 6388 6389 6390 6391 6392 6393 6394 6395 6396 6397 6398 6399 6400 6401 6402 6403 6404 6405 6406 6407 6408 6409 6410 6411 6412 6413 6414 6415 6416 6417 6418 6419 6420 6421 6422 6423 6424 6425 6426 6427 6428 6429 6430 6431 6432 6433 6434 6435 6436 6437 6438 6439 6440 6441 6442 6443 6444 6445 6446 6447 6448 6449 6450 6451 6452 6453 6454 6455 6456 6457 6458 6459 6460 6461 6462 6463 6464 6465 6466 6467 6468 6469 6470 6471 6472 6473 6474 6475 6476 6477 6478 6479 6480 6481 6482 6483 6484 6485 6486 6487 6488 6489 6490 6491 6492 6493 6494 6495 6496 6497 6498 6499 6500 6501 6502 6503 6504 6505 6506 6507 6508 6509 6510 6511 6512 6513 6514 6515 6516 6517 6518 6519 6520 6521 6522 6523 6524 6525 6526 6527 6528 6529 6530 6531 6532 6533 6534 6535 6536 6537 6538 6539 6540 6541 6542 6543 6544 6545 6546 6547 6548 6549 6550 6551 6552 6553 6554 6555 6556 6557 6558 6559 6560 6561 6562 6563 6564 6565 6566 6567 6568 6569 6570 6571 6572 6573 6574 6575 6576 6577 6578 6579 6580 6581 6582 6583 6584 6585 6586 6587 6588 6589 6590 6591 6592 6593 6594 6595 6596 6597 6598 6599 6600 6601 6602 6603 6604 6605 6606 6607 6608 6609 6610 6611 6612 6613 6614 6615 6616 6617 6618 6619 6620 6621 6622 6623 6624 6625 6626 6627 6628 6629 6630 6631 6632 6633 6634 6635 6636 6637 6638 6639 6640 6641 6642 6643 6644 6645 6646 6647 6648 6649 6650 6651 6652 6653 6654 6655 6656 6657 6658 6659 6660 6661 6662 6663 6664 6665 6666 6667 6668 6669 6670 6671 6672 6673 6674 6675 6676 6677 6678 6679 6680 6681 6682 6683 6684 6685 6686 6687 6688 6689 6690 6691 6692 6693 6694 6695 6696 6697 6698 6699 6700 6701 6702 6703 6704 6705 6706 6707 6708 6709 6710 6711 6712 6713 6714 6715 6716 6717 6718 6719 6720 6721 6722 6723 6724 6725 6726 6727 6728 6729 6730 6731 6732 6733 6734 6735 6736 6737 6738 6739 6740 6741 6742 6743 6744 6745 6746 6747 6748 6749 6750 6751 6752 6753 6754 6755 6756 6757 6758 6759 6760 6761 6762 6763 6764 6765 6766 6767 6768 6769 6770 6771 6772 6773 6774 6775 6776 6777 6778 6779 6780 6781 6782 6783 6784 6785 6786 6787 6788 6789 6790 6791 6792 6793 6794 6795 6796 6797 6798 6799 6800 6801 6802 6803 6804 6805 6806 6807 6808 6809 6810 6811 6812 6813 6814 6815 6816 6817 6818 6819 6820 6821 6822 6823 6824 6825 6826 6827 6828 6829 6830 6831 6832 6833 6834 6835 6836 6837 6838 6839 6840 6841 6842 6843 6844 6845 6846 6847 6848 6849 6850 6851 6852 6853 6854 6855 6856 6857 6858 6859 6860 6861 6862 6863 6864 6865 6866 6867 6868 6869 6870 6871 6872 6873 6874 6875 6876 6877 6878 6879 6880 6881 6882 6883 6884 6885 6886 6887 6888 6889 6890 6891 6892 6893 6894 6895 6896 6897 6898 6899 6900 6901 6902 6903 6904 6905 6906 6907 6908 6909 6910 6911 6912 6913 6914 6915 6916 6917 6918 6919 6920 6921 6922 6923 6924 6925 6926 6927 6928 6929 6930 6931 6932 6933 6934 6935 6936 6937 6938 6939 6940 6941 6942 6943 6944 6945 6946 6947 6948 6949 6950 6951 6952 6953 6954 6955 6956 6957 6958 6959 6960 6961 6962 6963 6964 6965 6966 6967 6968 6969 6970 6971 6972 6973 6974 6975 6976 6977 6978 6979 6980 6981 6982 6983 6984 6985 6986 6987 6988 6989 6990 6991 6992 6993 6994 6995 6996 6997 6998 6999 7000 7001 7002 7003 7004 7005 7006 7007 7008 7009 7010 7011 7012 7013 7014 7015 7016 7017 7018 7019 7020 7021 7022 7023 7024 7025 7026 7027 7028 7029 7030 7031 7032 7033 7034 7035 7036 7037 7038 7039 7040 7041 7042 7043 7044 7045 7046 7047 7048 7049 7050 7051 7052 7053 7054 7055 7056 7057 7058 7059 7060 7061 7062 7063 7064 7065 7066 7067 7068 7069 7070 7071 7072 7073 7074 7075 7076 7077 7078 7079 7080 7081 7082 7083 7084 7085 7086 7087 7088 7089 7090 7091 7092 7093 7094 7095 7096 7097 7098 7099 7100 7101 7102 7103 7104 7105 7106 7107 7108 7109 7110 7111 7112 7113 7114 7115 7116 7117 7118 7119 7120 7121 7122 7123 7124 7125 7126 7127 7128 7129 7130 7131 7132 7133 7134 7135 7136 7137 7138 7139 7140 7141 7142 7143 7144 7145 7146 7147 7148 7149 7150 7151 7152 7153 7154 7155 7156 7157 7158 7159 7160 7161 7162 7163 7164 7165 7166 7167 7168 7169 7170 7171 7172 7173 7174 7175 7176 7177 7178 7179 7180 7181 7182 7183 7184 7185 7186 7187 7188 7189 7190 7191 7192 7193 7194 7195 7196 7197 7198 7199 7200 7201 7202 7203 7204 7205 7206 7207 7208 7209 7210 7211 7212 7213 7214 7215 7216 7217 7218 7219 7220 7221 7222 7223 7224 7225 7226 7227 7228 7229 7230 7231 7232 7233 7234 7235 7236 7237 7238 7239 7240 7241 7242 7243 7244 7245 7246 7247 7248 7249 7250 7251 7252 7253 7254 7255 7256 7257 7258 7259 7260 7261 7262 7263 7264 7265 7266 7267 7268 7269 7270 7271 7272 7273 7274 7275 7276 7277 7278 7279 7280 7281 7282 7283 7284 7285 7286 7287 7288 7289 7290 7291 7292 7293 7294 7295 7296 7297 7298 7299 7300 7301 7302 7303 7304 7305 7306 7307 7308 7309 7310 7311 7312 7313 7314 7315 7316 7317 7318 7319 7320 7321 7322 7323 7324 7325 7326 7327 7328 7329 7330 7331 7332 7333 7334 7335 7336 7337 7338 7339 7340 7341 7342 7343 7344 7345 7346 7347 7348 7349 7350 7351 7352 7353 7354 7355 7356 7357 7358 7359 7360 7361 7362 7363 7364 7365 7366 7367 7368 7369 7370 7371 7372 7373 7374 7375 7376 7377 7378 7379 7380 7381 7382 7383 7384 7385 7386 7387 7388 7389 7390 7391 7392 7393 7394 7395 7396 7397 7398 7399 7400 7401 7402 7403 7404 7405 7406 7407 7408 7409 7410 7411 7412 7413 7414 7415 7416 7417 7418 7419 7420 7421 7422 7423 7424 7425 7426 7427 7428 7429 7430 7431 7432 7433 7434 7435 7436 7437 7438 7439 7440 7441 7442 7443 7444 7445 7446 7447 7448 7449 7450 7451 7452 7453 7454 7455 7456 7457 7458 7459 7460 7461 7462 7463 7464 7465 7466 7467 7468 7469 7470 7471 7472 7473 7474 7475 7476 7477 7478 7479 7480 7481 7482 7483 7484 7485 7486 7487 7488 7489 7490 7491 7492 7493 7494 7495 7496 7497 7498 7499 7500 7501 7502 7503 7504 7505 7506 7507 7508 7509 7510 7511 7512 7513 7514 7515 7516 7517 7518 7519 7520 7521 7522 7523 7524 7525 7526 7527 7528 7529 7530 7531 7532 7533 7534 7535 7536 7537 7538 7539 7540 7541 7542 7543 7544 7545 7546 7547 7548 7549 7550 7551 7552 7553 7554 7555 7556 7557 7558 7559 7560 7561 7562 7563 7564 7565 7566 7567 7568 7569 7570 7571 7572 7573 7574 7575 7576 7577 7578 7579 7580 7581 7582 7583 7584 7585 7586 7587 7588 7589 7590 7591 7592 7593 7594 7595 7596 7597 7598 7599 7600 7601 7602 7603 7604 7605 7606 7607 7608 7609 7610 7611 7612 7613 7614 7615 7616 7617 7618 7619 7620 7621 7622 7623 7624 7625 7626 7627 7628 7629 7630 7631 7632 7633 7634 7635 7636 7637 7638 7639 7640 7641 7642 7643 7644 7645 7646 7647 7648 7649 7650 7651 7652 7653 7654 7655 7656 7657 7658 7659 7660 7661 7662 7663 7664 7665 7666 7667 7668 7669 7670 7671 7672 7673 7674 7675 7676 7677 7678 7679 7680 7681 7682 7683 7684 7685 7686 7687 7688 7689 7690 7691 7692 7693 7694 7695 7696 7697 7698 7699 7700 7701 7702 7703 7704 7705 7706 7707 7708 7709 7710 7711 7712 7713 7714 7715 7716 7717 7718 7719 7720 7721 7722 7723 7724 7725 7726 7727 7728 7729 7730 7731 7732 7733 7734 7735 7736 7737 7738 7739 7740 7741 7742 7743 7744 7745 7746 7747 7748 7749 7750 7751 7752 7753 7754 7755 7756 7757 7758 7759 7760 7761 7762 7763 7764 7765 7766 7767 7768 7769 7770 7771 7772 7773 7774 7775 7776 7777 7778 7779 7780 7781 7782 7783 7784 7785 7786 7787 7788 7789 7790 7791 7792 7793 7794 7795 7796 7797 7798 7799 7800 7801 7802 7803 7804 7805 7806 7807 7808 7809 7810 7811 7812 7813 7814 7815 7816 7817 7818 7819 7820 7821 7822 7823 7824 7825 7826 7827 7828 7829 7830 7831 7832 7833 7834 7835 7836 7837 7838 7839 7840 7841 7842 7843 7844 7845 7846 7847 7848 7849 7850 7851 7852 7853 7854 7855 7856 7857 7858 7859 7860 7861 7862 7863 7864 7865 7866 7867 7868 7869 7870 7871 7872 7873 7874 7875 7876 7877 7878 7879 7880 7881 7882 7883 7884 7885 7886 7887 7888 7889 7890 7891 7892 7893 7894 7895 7896 7897 7898 7899 7900 7901 7902 7903 7904 7905 7906 7907 7908 7909 7910 7911 7912 7913 7914 7915 7916 7917 7918 7919 7920 7921 7922 7923 7924 7925 7926 7927 7928 7929 7930 7931 7932 7933 7934 7935 7936 7937 7938 7939 7940 7941 7942 7943 7944 7945 7946 7947 7948 7949 7950 7951 7952 7953 7954 7955 7956 7957 7958 7959 7960 7961 7962 7963 7964 7965 7966 7967 7968 7969 7970 7971 7972 7973 7974 7975 7976 7977 7978 7979 7980 7981 7982 7983 7984 7985 7986 7987 7988 7989 7990 7991 7992 7993 7994 7995 7996 7997 7998 7999 8000 8001 8002 8003 8004 8005 8006 8007 8008 8009 8010 8011 8012 8013 8014 8015 8016 8017 8018 8019 8020 8021 8022 8023 8024 8025 8026 8027 8028 8029 8030 8031 8032 8033 8034 8035 8036 8037 8038 8039 8040 8041 8042 8043 8044 8045 8046 8047 8048 8049 8050 8051 8052 8053 8054 8055 8056 8057 8058 8059 8060 8061 8062 8063 8064 8065 8066 8067 8068 8069 8070 8071 8072 8073 8074 8075 8076 8077 8078 8079 8080 8081 8082 8083 8084 8085 8086 8087 8088 8089 8090 8091 8092 8093 8094 8095 8096 8097 8098 8099 8100 8101 8102 8103 8104 8105 8106 8107 8108 8109 8110 8111 8112 8113 8114 8115 8116 8117 8118 8119 8120 8121 8122 8123 8124 8125 8126 8127 8128 8129 8130 8131 8132 8133 8134 8135 8136 8137 8138 8139 8140 8141 8142 8143 8144 8145 8146 8147 8148 8149 8150 8151 8152 8153 8154 8155 8156 8157 8158 8159 8160 8161 8162 8163 8164 8165 8166 8167 8168 8169 8170 8171 8172 8173 8174 8175 8176 8177 8178 8179 8180 8181 8182 8183 8184 8185 8186 8187 8188 8189 8190 8191 8192 8193 8194 8195 8196 8197 8198 8199 8200 8201 8202 8203 8204 8205 8206 8207 8208 8209 8210 8211 8212 8213 8214 8215 8216 8217 8218 8219 8220 8221 8222 8223 8224 8225 8226 8227 8228 8229 8230 8231 8232 8233 8234 8235 8236 8237 8238 8239 8240 8241 8242 8243 8244 8245 8246 8247 8248 8249 8250 8251 8252 8253 8254 8255 8256 8257 8258 8259 8260 8261 8262 8263 8264 8265 8266 8267 8268 8269 8270 8271 8272 8273 8274 8275 8276 8277 8278 8279 8280 8281 8282 8283 8284 8285 8286 8287 8288 8289 8290 8291 8292 8293 8294 8295 8296 8297 8298 8299 8300 8301 8302 8303 8304 8305 8306 8307 8308 8309 8310 8311 8312 8313 8314 8315 8316 8317 8318 8319 8320 8321 8322 8323 8324 8325 8326 8327 8328 8329 8330 8331 8332 8333 8334 8335 8336 8337 8338 8339 8340 8341 8342 8343 8344 8345 8346 8347 8348 8349 8350 8351 8352 8353 8354 8355 8356 8357 8358 8359 8360 8361 8362 8363 8364 8365 8366 8367 8368 8369 8370 8371 8372 8373 8374 8375 8376 8377 8378 8379 8380 8381 8382 8383 8384 8385 8386 8387 8388 8389 8390 8391 8392 8393 8394 8395 8396 8397 8398 8399 8400 8401 8402 8403 8404 8405 8406 8407 8408 8409 8410 8411 8412 8413 8414 8415 8416 8417 8418 8419 8420 8421 8422 8423 8424 8425 8426 8427 8428 8429 8430 8431 8432 8433 8434 8435 8436 8437 8438 8439 8440 8441 8442 8443 8444 8445 8446 8447 8448 8449 8450 8451 8452 8453 8454 8455 8456 8457 8458 8459 8460 8461 8462 8463 8464 8465 8466 8467 8468 8469 8470 8471 8472 8473 8474 8475 8476 8477 8478 8479 8480 8481 8482 8483 8484 8485 8486 8487 8488 8489 8490 8491 8492 8493 8494 8495 8496 8497 8498 8499 8500 8501 8502 8503 8504 8505 8506 8507 8508 8509 8510 8511 8512 8513 8514 8515 8516 8517 8518 8519 8520 8521 8522 8523 8524 8525 8526 8527 8528 8529 8530 8531 8532 8533 8534 8535 8536 8537 8538 8539 8540 8541 8542 8543 8544 8545 8546 8547 8548 8549 8550 8551 8552 8553 8554 8555 8556 8557 8558 8559 8560 8561 8562 8563 8564 8565 8566 8567 8568 8569 8570 8571 8572 8573 8574 8575 8576 8577 8578 8579 8580 8581 8582 8583 8584 8585 8586 8587 8588 8589 8590 8591 8592 8593 8594 8595 8596 8597 8598 8599 8600 8601 8602 8603 8604 8605 8606 8607 8608 8609 8610 8611 8612 8613 8614 8615 8616 8617 8618 8619 8620 8621 8622 8623 8624 8625 8626 8627 8628 8629 8630 8631 8632 8633 8634 8635 8636 8637 8638 8639 8640 8641 8642 8643 8644 8645 8646 8647 8648 8649 8650 8651 8652 8653 8654 8655 8656 8657 8658 8659 8660 8661 8662 8663 8664 8665 8666 8667 8668 8669 8670 8671 8672 8673 8674 8675 8676 8677 8678 8679 8680 8681 8682 8683 8684 8685 8686 8687 8688 8689 8690 8691 8692 8693 8694 8695 8696 8697 8698 8699 8700 8701 8702 8703 8704 8705 8706 8707 8708 8709 8710 8711 8712 8713 8714 8715 8716 8717 8718 8719 8720 8721 8722 8723 8724 8725 8726 8727 8728 8729 8730 8731 8732 8733 8734 8735 8736 8737 8738 8739 8740 8741 8742 8743 8744 8745 8746 8747 8748 8749 8750 8751 8752 8753 8754 8755 8756 8757 8758 8759 8760 8761 8762 8763 8764 8765 8766 8767 8768 8769 8770 8771 8772 8773 8774 8775 8776 8777 8778 8779 8780 8781 8782 8783 8784 8785 8786 8787 8788 8789 8790 8791 8792 8793 8794 8795 8796 8797 8798 8799 8800 8801 8802 8803 8804 8805 8806 8807 8808 8809 8810 8811 8812 8813 8814 8815 8816 8817 8818 8819 8820 8821 8822 8823 8824 8825 8826 8827 8828 8829 8830 8831 8832 8833 8834 8835 8836 8837 8838 8839 8840 8841 8842 8843 8844 8845 8846 8847 8848 8849 8850 8851 8852 8853 8854 8855 8856 8857 8858 8859 8860 8861 8862 8863 8864 8865 8866 8867 8868 8869 8870 8871 8872 8873 8874 8875 8876 8877 8878 8879 8880 8881 8882 8883 8884 8885 8886 8887 8888 8889 8890 8891 8892 8893 8894 8895 8896 8897 8898 8899 8900 8901 8902 8903 8904 8905 8906 8907 8908 8909 8910 8911 8912 8913 8914 8915 8916 8917 8918 8919 8920 8921 8922 8923 8924 8925 8926 8927 8928 8929 8930 8931 8932 8933 8934 8935 8936 8937 8938 8939 8940 8941 8942 8943 8944 8945 8946 8947 8948 8949 8950 8951 8952 8953 8954 8955 8956 8957 8958 8959 8960 8961 8962 8963 8964 8965 8966 8967 8968 8969 8970 8971 8972 8973 8974 8975 8976 8977 8978 8979 8980 8981 8982 8983 8984 8985 8986 8987 8988 8989 8990 8991 8992 8993 8994 8995 8996 8997 8998 8999 9000 9001 9002 9003 9004 9005 9006 9007 9008 9009 9010 9011 9012 9013 9014 9015 9016 9017 9018 9019 9020 9021 9022 9023 9024 9025 9026 9027 9028 9029 9030 9031 9032 9033 9034 9035 9036 9037 9038 9039 9040 9041 9042 9043 9044 9045 9046 9047 9048 9049 9050 9051 9052 9053 9054 9055 9056 9057 9058 9059 9060 9061 9062 9063 9064 9065 9066 9067 9068 9069 9070 9071 9072 9073 9074 9075 9076 9077 9078 9079 9080 9081 9082 9083 9084 9085 9086 9087 9088 9089 9090 9091 9092 9093 9094 9095 9096 9097 9098 9099 9100 9101 9102 9103 9104 9105 9106 9107 9108 9109 9110 9111 9112 9113 9114 9115 9116 9117 9118 9119 9120 9121 9122 9123 9124 9125 9126 9127 9128 9129 9130 9131 9132 9133 9134 9135 9136 9137 9138 9139 9140 9141 9142 9143 9144 9145 9146 9147 9148 9149 9150 9151 9152 9153 9154 9155 9156 9157 9158 9159 9160 9161 9162 9163 9164 9165 9166 9167 9168 9169 9170 9171 9172 9173 9174 9175 9176 9177 9178 9179 9180 9181 9182 9183 9184 9185 9186 9187 9188 9189 9190 9191 9192 9193 9194 9195 9196 9197 9198 9199 9200 9201 9202 9203 9204 9205 9206 9207 9208 9209 9210 9211 9212 9213 9214 9215 9216 9217 9218 9219 9220 9221 9222 9223 9224 9225 9226 9227 9228 9229 9230 9231 9232 9233 9234 9235 9236 9237 9238 9239 9240 9241 9242 9243 9244 9245 9246 9247 9248 9249 9250 9251 9252 9253 9254 9255 9256 9257 9258 9259 9260 9261 9262 9263 9264 9265 9266 9267 9268 9269 9270 9271 9272 9273 9274 9275 9276 9277 9278 9279 9280 9281 9282 9283 9284 9285 9286 9287 9288 9289 9290 9291 9292 9293 9294 9295 9296 9297 9298 9299 9300 9301 9302 9303 9304 9305 9306 9307 9308 9309 9310 9311 9312 9313 9314 9315 9316 9317 9318 9319 9320 9321 9322 9323 9324 9325 9326 9327 9328 9329 9330 9331 9332 9333 9334 9335 9336 9337 9338 9339 9340 9341 9342 9343 9344 9345 9346 9347 9348 9349 9350 9351 9352 9353 9354 9355 9356 9357 9358 9359 9360 9361 9362 9363 9364 9365 9366 9367 9368 9369 9370 9371 9372 9373 9374 9375 9376 9377 9378 9379 9380 9381 9382 9383 9384 9385 9386 9387 9388 9389 9390 9391 9392 9393 9394 9395 9396 9397 9398 9399 9400 9401 9402 9403 9404 9405 9406 9407 9408 9409 9410 9411 9412 9413 9414 9415 9416 9417 9418 9419 9420 9421 9422 9423 9424 9425 9426 9427 9428 9429 9430 9431 9432 9433 9434 9435 9436 9437 9438 9439 9440 9441 9442 9443 9444 9445 9446 9447 9448 9449 9450 9451 9452 9453 9454 9455 9456 9457 9458 9459 9460 9461 9462 9463 9464 9465 9466 9467 9468 9469 9470 9471 9472 9473 9474 9475 9476 9477 9478 9479 9480 9481 9482 9483 9484 9485 9486 9487 9488 9489 9490 9491 9492 9493 9494 9495 9496 9497 9498 9499 9500 9501 9502 9503 9504 9505 9506 9507 9508 9509 9510 9511 9512 9513 9514 9515 9516 9517 9518 9519 9520 9521 9522 9523 9524 9525 9526 9527 9528 9529 9530 9531 9532 9533 9534 9535 9536 9537 9538 9539 9540 9541 9542 9543 9544 9545 9546 9547 9548 9549 9550 9551 9552 9553 9554 9555 9556 9557 9558 9559 9560 9561 9562 9563 9564 9565 9566 9567 9568 9569 9570 9571 9572 9573 9574 9575 9576 9577 9578 9579 9580 9581 9582 9583 9584 9585 9586 9587 9588 9589 9590 9591 9592 9593 9594 9595 9596 9597 9598 9599 9600 9601 9602 9603 9604 9605 9606 9607 9608 9609 9610 9611 9612 9613 9614 9615 9616 9617 9618 9619 9620 9621 9622 9623 9624 9625 9626 9627 9628 9629 9630 9631 9632 9633 9634 9635 9636 9637 9638 9639 9640 9641 9642 9643 9644 9645 9646 9647 9648 9649 9650 9651 9652 9653 9654 9655 9656 9657 9658 9659 9660 9661 9662 9663 9664 9665 9666 9667 9668 9669 9670 9671 9672 9673 9674 9675 9676 9677 9678 9679 9680 9681 9682 9683 9684 9685 9686 9687 9688 9689 9690 9691 9692 9693 9694 9695 9696 9697 9698 9699 9700 9701 9702 9703 9704 9705 9706 9707 9708 9709 9710 9711 9712 9713 9714 9715 9716 9717 9718 9719 9720 9721 9722 9723 9724 9725 9726 9727 9728 9729 9730 9731 9732 9733 9734 9735 9736 9737 9738 9739 9740 9741 9742 9743 9744 9745 9746 9747 9748 9749 9750 9751 9752 9753 9754 9755 9756 9757 9758 9759 9760 9761 9762 9763 9764 9765 9766 9767 9768 9769 9770 9771 9772 9773 9774 9775 9776 9777 9778 9779 9780 9781 9782 9783 9784 9785 9786 9787 9788 9789 9790 9791 9792 9793 9794 9795 9796 9797 9798 9799 9800 9801 9802 9803 9804 9805 9806 9807 9808 9809 9810 9811 9812 9813 9814 9815 9816 9817 9818 9819 9820 9821 9822 9823 9824 9825 9826 9827 9828 9829 9830 9831 9832 9833 9834 9835 9836 9837 9838 9839 9840 9841 9842 9843 9844 9845 9846 9847 9848 9849 9850 9851 9852 9853 9854 9855 9856 9857 9858 9859 9860 9861 9862 9863 9864 9865 9866 9867 9868 9869 9870 9871 9872 9873 9874 9875 9876 9877 9878 9879 9880 9881 9882 9883 9884 9885 9886 9887 9888 9889 9890 9891 9892 9893 9894 9895 9896 9897 9898 9899 9900 9901 9902 9903 9904 9905 9906 9907 9908 9909 9910 9911 9912 9913 9914 9915 9916 9917 9918 9919 9920 9921 9922 9923 9924 9925 9926 9927 9928 9929 9930 9931 9932 9933 9934 9935 9936 9937 9938 9939 9940 9941 9942 9943 9944 9945 9946 9947 9948 9949 9950 9951 9952 9953 9954 9955 9956 9957 9958 9959 9960 9961 9962 9963 9964 9965 9966 9967 9968 9969 9970 9971 9972 9973 9974 9975 9976 9977 9978 9979 9980 9981 9982 9983 9984 9985 9986 9987 9988 9989 9990 9991 9992 9993 9994 9995 9996 9997 9998 9999 10000 10001 10002 10003 10004 10005 10006 10007 10008 10009 10010 10011 10012 10013 10014 10015 10016 10017 10018 10019 10020 10021 10022 10023 10024 10025 10026 10027 10028 10029 10030 10031 10032 10033 10034 10035 10036 10037 10038 10039 10040 10041 10042 10043 10044 10045 10046 10047 10048 10049 10050 10051 10052 10053 10054 10055 10056 10057 10058 10059 10060 10061 10062 10063 10064 10065 10066 10067 10068 10069 10070 10071 10072 10073 10074 10075 10076 10077 10078 10079 10080 10081 10082 10083 10084 10085 10086 10087 10088 10089 10090 10091 10092 10093 10094 10095 10096 10097 10098 10099 10100 10101 10102 10103 10104 10105 10106 10107 10108 10109 10110 10111 10112 10113 10114 10115 10116 10117 10118 10119 10120 10121 10122 10123 10124 10125 10126 10127 10128 10129 10130 10131 10132 10133 10134 10135 10136 10137 10138 10139 10140 10141 10142 10143 10144 10145 10146 10147 10148 10149 10150 10151 10152 10153 10154 10155 10156 10157 10158 10159 10160 10161 10162 10163 10164 10165 10166 10167 10168 10169 10170 10171 10172 10173 10174 10175 10176 10177 10178 10179 10180 10181 10182 10183 10184 10185 10186 10187 10188 10189 10190 10191 10192 10193 10194 10195 10196 10197 10198 10199 10200 10201 10202 10203 10204 10205 10206 10207 10208 10209 10210 10211 10212 10213 10214 10215 10216 10217 10218 10219 10220 10221 10222 10223 10224 10225 10226 10227 10228 10229 10230 10231 10232 10233 10234 10235 10236 10237 10238 10239 10240 10241 10242 10243 10244 10245 10246 10247 10248 10249 10250 10251 10252 10253 10254 10255 10256 10257 10258 10259 10260 10261 10262 10263 10264 10265 10266 10267 10268 10269 10270 10271 10272 10273 10274 10275 10276 10277 10278 10279 10280 10281 10282 10283 10284 10285 10286 10287 10288 10289 10290 10291 10292 10293 10294 10295 10296 10297 10298 10299 10300 10301 10302 10303 10304 10305 10306 10307 10308 10309 10310 10311 10312 10313 10314 10315 10316 10317 10318 10319 10320 10321 10322 10323 10324 10325 10326 10327 10328 10329 10330 10331 10332 10333 10334 10335 10336 10337 10338 10339 10340 10341 10342 10343 10344 10345 10346 10347 10348 10349 10350 10351 10352 10353 10354 10355 10356 10357 10358 10359 10360 10361 10362 10363 10364 10365 10366 10367 10368 10369 10370 10371 10372 10373 10374 10375 10376 10377 10378 10379 10380 10381 10382 10383 10384 10385 10386 10387 10388 10389 10390 10391 10392 10393 10394 10395 10396 10397 10398 10399 10400 10401 10402 10403 10404 10405 10406 10407 10408 10409 10410 10411 10412 10413 10414 10415 10416 10417 10418 10419 10420 10421 10422 10423 10424 10425 10426 10427 10428 10429 10430 10431 10432 10433 10434 10435 10436 10437 10438 10439 10440 10441 10442 10443 10444 10445 10446 10447 10448 10449 10450 10451 10452 10453 10454 10455 10456 10457 10458 10459 10460 10461 10462 10463 10464 10465 10466 10467 10468 10469 10470 10471 10472 10473 10474 10475 10476 10477 10478 10479 10480 10481 10482 10483 10484 10485 10486 10487 10488 10489 10490 10491 10492 10493 10494 10495 10496 10497 10498 10499 10500 10501 10502 10503 10504 10505 10506 10507 10508 10509 10510 10511 10512 10513 10514 10515 10516 10517 10518 10519 10520 10521 10522 10523 10524 10525 10526 10527 10528 10529 10530 10531 10532 10533 10534 10535 10536 10537 10538 10539 10540 10541 10542 10543 10544 10545 10546 10547 10548 10549 10550 10551 10552 10553 10554 10555 10556 10557 10558 10559 10560 10561 10562 10563 10564 10565 10566 10567 10568 10569 10570 10571 10572 10573 10574 10575 10576 10577 10578 10579 10580 10581 10582 10583 10584 10585 10586 10587 10588 10589 10590 10591 10592 10593 10594 10595 10596 10597 10598 10599 10600 10601 10602 10603 10604 10605 10606 10607 10608 10609 10610 10611 10612 10613 10614 10615 10616 10617 10618 10619 10620 10621 10622 10623 10624 10625 10626 10627 10628 10629 10630 10631 10632 10633 10634 10635 10636 10637 10638 10639 10640 10641 10642 10643 10644 10645 10646 10647 10648 10649 10650 10651 10652 10653 10654 10655 10656 10657 10658 10659 10660 10661 10662 10663 10664 10665 10666 10667 10668 10669 10670 10671 10672 10673 10674 10675 10676 10677 10678 10679 10680 10681 10682 10683 10684 10685 10686 10687 10688 10689 10690 10691 10692 10693 10694 10695 10696 10697 10698 10699 10700 10701 10702 10703 10704 10705 10706 10707 10708 10709 10710 10711 10712 10713 10714 10715 10716 10717 10718 10719 10720 10721 10722 10723 10724 10725 10726 10727 10728 10729 10730 10731 10732 10733 10734 10735 10736 10737 10738 10739 10740 10741 10742 10743 10744 10745 10746 10747 10748 10749 10750 10751 10752 10753 10754 10755 10756 10757 10758 10759 10760 10761 10762 10763 10764 10765 10766 10767 10768 10769 10770 10771 10772 10773 10774 10775 10776 10777 10778 10779 10780 10781 10782 10783 10784 10785 10786 10787 10788 10789 10790 10791 10792 10793 10794 10795 10796 10797 10798 10799 10800 10801 10802 10803 10804 10805 10806 10807 10808 10809 10810 10811 10812 10813 10814 10815 10816 10817 10818 10819 10820 10821 10822 10823 10824 10825 10826 10827 10828 10829 10830 10831 10832 10833 10834 10835 10836 10837 10838 10839 10840 10841 10842 10843 10844 10845 10846 10847 10848 10849 10850 10851 10852 10853 10854 10855 10856 10857 10858 10859 10860 10861 10862 10863 10864 10865 10866 10867 10868 10869 10870 10871 10872 10873 10874 10875 10876 10877 10878 10879 10880 10881 10882 10883 10884 10885 10886 10887 10888 10889 10890 10891 10892 10893 10894 10895 10896 10897 10898 10899 10900 10901 10902 10903 10904 10905 10906 10907 10908 10909 10910 10911 10912 10913 10914 10915 10916 10917 10918 10919 10920 10921 10922 10923 10924 10925 10926 10927 10928 10929 10930 10931 10932 10933 10934 10935 10936 10937 10938 10939 10940 10941 10942 10943 10944 10945 10946 10947 10948 10949 10950 10951 10952 10953 10954 10955 10956 10957 10958 10959 10960 10961 10962 10963 10964 10965 10966 10967 10968 10969 10970 10971 10972 10973 10974 10975 10976 10977 10978 10979 10980 10981 10982 10983 10984 10985 10986 10987 10988 10989 10990 10991 10992 10993 10994 10995 10996 10997 10998 10999 11000 11001 11002 11003 11004 11005 11006 11007 11008 11009 11010 11011 11012 11013 11014 11015 11016 11017 11018 11019 11020 11021 11022 11023 11024 11025 11026 11027 11028 11029 11030 11031 11032 11033 11034 11035 11036 11037 11038 11039 11040 11041 11042 11043 11044 11045 11046 11047 11048 11049 11050 11051 11052 11053 11054 11055 11056 11057 11058 11059 11060 11061 11062 11063 11064 11065 11066 11067 11068 11069 11070 11071 11072 11073 11074 11075 11076 11077 11078 11079 11080 11081 11082 11083 11084 11085 11086 11087 11088 11089 11090 11091 11092 11093 11094 11095 11096 11097 11098 11099 11100 11101 11102 11103 11104 11105 11106 11107 11108 11109 11110 11111 11112 11113 11114 11115 11116 11117 11118 11119 11120 11121 11122 11123 11124 11125 11126 11127 11128 11129 11130 11131 11132 11133 11134 11135 11136 11137 11138 11139 11140 11141 11142 11143 11144 11145 11146 11147 11148 11149 11150 11151 11152 11153 11154 11155 11156 11157 11158 11159 11160 11161 11162 11163 11164 11165 11166 11167 11168 11169 11170 11171 11172 11173 11174 11175 11176 11177 11178 11179 11180 11181 11182 11183 11184 11185 11186 11187 11188 11189 11190 11191 11192 11193 11194 11195 11196 11197 11198 11199 11200 11201 11202 11203 11204 11205 11206 11207 11208 11209 11210 11211 11212 11213 11214 11215 11216 11217 11218 11219 11220 11221 11222 11223 11224 11225 11226 11227 11228 11229 11230 11231 11232 11233 11234 11235 11236 11237 11238 11239 11240 11241 11242 11243 11244 11245 11246 11247 11248 11249 11250 11251 11252 11253 11254 11255 11256 11257 11258 11259 11260 11261 11262 11263 11264 11265 11266 11267 11268 11269 11270 11271 11272 11273 11274 11275 11276 11277 11278 11279 11280 11281 11282 11283 11284 11285 11286 11287 11288 11289 11290 11291 11292 11293 11294 11295 11296 11297 11298 11299 11300 11301 11302 11303 11304 11305 11306 11307 11308 11309 11310 11311 11312 11313 11314 11315 11316 11317 11318 11319 11320 11321 11322 11323 11324 11325 11326 11327 11328 11329 11330 11331 11332 11333 11334 11335 11336 11337 11338 11339 11340 11341 11342 11343 11344 11345 11346 11347 11348 11349 11350 11351 11352 11353 11354 11355 11356 11357 11358 11359 11360 11361 11362 11363 11364 11365 11366 11367 11368 11369 11370 11371 11372 11373 11374 11375 11376 11377 11378 11379 11380 11381 11382 11383 11384 11385 11386 11387 11388 11389 11390 11391 11392 11393 11394 11395 11396 11397 11398 11399 11400 11401 11402 11403 11404 11405 11406 11407 11408 11409 11410 11411 11412 11413 11414 11415 11416 11417 11418 11419 11420 11421 11422 11423 11424 11425 11426 11427 11428 11429 11430 11431 11432 11433 11434 11435 11436 11437 11438 11439 11440 11441 11442 11443 11444 11445 11446 11447 11448 11449 11450 11451 11452 11453 11454 11455 11456 11457 11458 11459 11460 11461 11462 11463 11464 11465 11466 11467 11468 11469 11470 11471 11472 11473 11474 11475 11476 11477 11478 11479 11480 11481 11482 11483 11484 11485 11486 11487 11488 11489 11490 11491 11492 11493 11494 11495 11496 11497 11498 11499 11500 11501 11502 11503 11504 11505 11506 11507 11508 11509 11510 11511 11512 11513 11514 11515 11516 11517 11518 11519 11520 11521 11522 11523 11524 11525 11526 11527 11528 11529 11530 11531 11532 11533 11534 11535 11536 11537 11538 11539 11540 11541 11542 11543 11544 11545 11546 11547 11548 11549 11550 11551 11552 11553 11554 11555 11556 11557 11558 11559 11560 11561 11562 11563 11564 11565 11566 11567 11568 11569 11570 11571 11572 11573 11574 11575 11576 11577 11578 11579 11580 11581 11582 11583 11584 11585 11586 11587 11588 11589 11590 11591 11592 11593 11594 11595 11596 11597 11598 11599 11600 11601 11602 11603 11604 11605 11606 11607 11608 11609 11610 11611 11612 11613 11614 11615 11616 11617 11618 11619 11620 11621 11622 11623 11624 11625 11626 11627 11628 11629 11630 11631 11632 11633 11634 11635 11636 11637 11638 11639 11640 11641 11642 11643 11644 11645 11646 11647 11648 11649 11650 11651 11652 11653 11654 11655 11656 11657 11658 11659 11660 11661 11662 11663 11664 11665 11666 11667 11668 11669 11670 11671 11672 11673 11674 11675 11676 11677 11678 11679 11680 11681 11682 11683 11684 11685 11686 11687 11688 11689 11690 11691 11692 11693 11694 11695 11696 11697 11698 11699 11700 11701 11702 11703 11704 11705 11706 11707 11708 11709 11710 11711 11712 11713 11714 11715 11716 11717 11718 11719 11720 11721 11722 11723 11724 11725 11726 11727 11728 11729 11730 11731 11732 11733 11734 11735 11736 11737 11738 11739 11740 11741 11742 11743 11744 11745 11746 11747 11748 11749 11750 11751 11752 11753 11754 11755 11756 11757 11758 11759 11760 11761 11762 11763 11764 11765 11766 11767 11768 11769 11770 11771 11772 11773 11774 11775 11776 11777 11778 11779 11780 11781 11782 11783 11784 11785 11786 11787 11788 11789 11790 11791 11792 11793 11794 11795 11796 11797 11798 11799 11800 11801 11802 11803 11804 11805 11806 11807 11808 11809 11810 11811 11812 11813 11814 11815 11816 11817 11818 11819 11820 11821 11822 11823 11824 11825 11826 11827 11828 11829 11830 11831 11832 11833 11834 11835 11836 11837 11838 11839 11840 11841 11842 11843 11844 11845 11846 11847 11848 11849 11850 11851 11852 11853 11854 11855 11856 11857 11858 11859 11860 11861 11862 11863 11864 11865 11866 11867 11868 11869 11870 11871 11872 11873 11874 11875 11876 11877 11878 11879 11880 11881 11882 11883 11884 11885 11886 11887 11888 11889 11890 11891 11892 11893 11894 11895 11896 11897 11898 11899 11900 11901 11902 11903 11904 11905 11906 11907 11908 11909 11910 11911 11912 11913 11914 11915 11916 11917 11918 11919 11920 11921 11922 11923 11924 11925 11926 11927 11928 11929 11930 11931 11932 11933 11934 11935 11936 11937 11938 11939 11940 11941 11942 11943 11944 11945 11946 11947 11948 11949 11950 11951 11952 11953 11954 11955 11956 11957 11958 11959 11960 11961 11962 11963 11964 11965 11966 11967 11968 11969 11970 11971 11972 11973 11974 11975 11976 11977 11978 11979 11980 11981 11982 11983 11984 11985 11986 11987 11988 11989 11990 11991 11992 11993 11994 11995 11996 11997 11998 11999 12000 12001 12002 12003 12004 12005 12006 12007 12008 12009 12010 12011 12012 12013 12014 12015 12016 12017 12018 12019 12020 12021 12022 12023 12024 12025 12026 12027 12028 12029 12030 12031 12032 12033 12034 12035 12036 12037 12038 12039 12040 12041 12042 12043 12044 12045 12046 12047 12048 12049 12050 12051 12052 12053 12054 12055 12056 12057 12058 12059 12060 12061 12062 12063 12064 12065 12066 12067 12068 12069 12070 12071 12072 12073 12074 12075 12076 12077 12078 12079 12080 12081 12082 12083 12084 12085 12086 12087 12088 12089 12090 12091 12092 12093 12094 12095 12096 12097 12098 12099 12100 12101 12102 12103 12104 12105 12106 12107 12108 12109 12110 12111 12112 12113 12114 12115 12116 12117 12118 12119 12120 12121 12122 12123 12124 12125 12126 12127 12128 12129 12130 12131 12132 12133 12134 12135 12136 12137 12138 12139 12140 12141 12142 12143 12144 12145 12146 12147 12148 12149 12150 12151 12152 12153 12154 12155 12156 12157 12158 12159 12160 12161 12162 12163 12164 12165 12166 12167 12168 12169 12170 12171 12172 12173 12174 12175 12176 12177 12178 12179 12180 12181 12182 12183 12184 12185 12186 12187 12188 12189 12190 12191 12192 12193 12194 12195 12196 12197 12198 12199 12200 12201 12202 12203 12204 12205 12206 12207 12208 12209 12210 12211 12212 12213 12214 12215 12216 12217 12218 12219 12220 12221 12222 12223 12224 12225 12226 12227 12228 12229 12230 12231 12232 12233 12234 12235 12236 12237 12238 12239 12240 12241 12242 12243 12244 12245 12246 12247 12248 12249 12250 12251 12252 12253 12254 12255 12256 12257 12258 12259 12260 12261 12262 12263 12264 12265 12266 12267 12268 12269 12270 12271 12272 12273 12274 12275 12276 12277 12278 12279 12280 12281 12282 12283 12284 12285 12286 12287 12288 12289 12290 12291 12292 12293 12294 12295 12296 12297 12298 12299 12300 12301 12302 12303 12304 12305 12306 12307 12308 12309 12310 12311 12312 12313 12314 12315 12316 12317 12318 12319 12320 12321 12322 12323 12324 12325 12326 12327 12328 12329 12330 12331 12332 12333 12334 12335 12336 12337 12338 12339 12340 12341 12342 12343 12344 12345 12346 12347 12348 12349 12350 12351 12352 12353 12354 12355 12356 12357 12358 12359 12360 12361 12362 12363 12364 12365 12366 12367 12368 12369 12370 12371 12372 12373 12374 12375 12376 12377 12378 12379 12380 12381 12382 12383 12384 12385 12386 12387 12388 12389 12390 12391 12392 12393 12394 12395 12396 12397 12398 12399 12400 12401 12402 12403 12404 12405 12406 12407 12408 12409 12410 12411 12412 12413 12414 12415 12416 12417 12418 12419 12420 12421 12422 12423 12424 12425 12426 12427 12428 12429 12430 12431 12432 12433 12434 12435 12436 12437 12438 12439 12440 12441 12442 12443 12444 12445 12446 12447 12448 12449 12450 12451 12452 12453 12454 12455 12456 12457 12458 12459 12460 12461 12462 12463 12464 12465 12466 12467 12468 12469 12470 12471 12472 12473 12474 12475 12476 12477 12478 12479 12480 12481 12482 12483 12484 12485 12486 12487 12488 12489 12490 12491 12492 12493 12494 12495 12496 12497 12498 12499 12500 12501 12502 12503 12504 12505 12506 12507 12508 12509 12510 12511 12512 12513 12514 12515 12516 12517 12518 12519 12520 12521 12522 12523 12524 12525 12526 12527 12528 12529 12530 12531 12532 12533 12534 12535 12536 12537 12538 12539 12540 12541 12542 12543 12544 12545 12546 12547 12548 12549 12550 12551 12552 12553 12554 12555 12556 12557 12558 12559 12560 12561 12562 12563 12564 12565 12566 12567 12568 12569 12570 12571 12572 12573 12574 12575 12576 12577 12578 12579 12580 12581 12582 12583 12584 12585 12586 12587 12588 12589 12590 12591 12592 12593 12594 12595 12596 12597 12598 12599 12600 12601 12602 12603 12604 12605 12606 12607 12608 12609 12610 12611 12612 12613 12614 12615 12616 12617 12618 12619 12620 12621 12622 12623 12624 12625 12626 12627 12628 12629 12630 12631 12632 12633 12634 12635 12636 12637 12638 12639 12640 12641 12642 12643 12644 12645 12646 12647 12648 12649 12650 12651 12652 12653 12654 12655 12656 12657 12658 12659 12660 12661 12662 12663 12664 12665 12666 12667 12668 12669 12670 12671 12672 12673 12674 12675 12676 12677 12678 12679 12680 12681 12682 12683 12684 12685 12686 12687 12688 12689 12690 12691 12692 12693 12694 12695 12696 12697 12698 12699 12700 12701 12702 12703 12704 12705 12706 12707 12708 12709 12710 12711 12712 12713 12714 12715 12716 12717 12718 12719 12720 12721 12722 12723 12724 12725 12726 12727 12728 12729 12730 12731 12732 12733 12734 12735 12736 12737 12738 12739 12740 12741 12742 12743 12744 12745 12746 12747 12748 12749 12750 12751 12752 12753 12754 12755 12756 12757 12758 12759 12760 12761 12762 12763 12764 12765 12766 12767 12768 12769 12770 12771 12772 12773 12774 12775 12776 12777 12778 12779 12780 12781 12782 12783 12784 12785 12786 12787 12788 12789 12790 12791 12792 12793 12794 12795 12796 12797 12798 12799 12800 12801 12802 12803 12804 12805 12806 12807 12808 12809 12810 12811 12812 12813 12814 12815 12816 12817 12818 12819 12820 12821 12822 12823 12824 12825 12826 12827 12828 12829 12830 12831 12832 12833 12834 12835 12836 12837 12838 12839 12840 12841 12842 12843 12844 12845 12846 12847 12848 12849 12850 12851 12852 12853 12854 12855 12856 12857 12858 12859 12860 12861 12862 12863 12864 12865 12866 12867 12868 12869 12870 12871 12872 12873 12874 12875 12876 12877 12878 12879 12880 12881 12882 12883 12884 12885 12886 12887 12888 12889 12890 12891 12892 12893 12894 12895 12896 12897 12898 12899 12900 12901 12902 12903 12904 12905 12906 12907 12908 12909 12910 12911 12912 12913 12914 12915 12916 12917 12918 12919 12920 12921 12922 12923 12924 12925 12926 12927 12928 12929 12930 12931 12932 12933 12934 12935 12936 12937 12938 12939 12940 12941 12942 12943 12944 12945 12946 12947 12948 12949 12950 12951 12952 12953 12954 12955 12956 12957 12958 12959 12960 12961 12962 12963 12964 12965 12966 12967 12968 12969 12970 12971 12972 12973 12974 12975 12976 12977 12978 12979 12980 12981 12982 12983 12984 12985 12986 12987 12988 12989 12990 12991 12992 12993 12994 12995 12996 12997 12998 12999 13000 13001 13002 13003 13004 13005 13006 13007 13008 13009 13010 13011 13012 13013 13014 13015 13016 13017 13018 13019 13020 13021 13022 13023 13024 13025 13026 13027 13028 13029 13030 13031 13032 13033 13034 13035 13036 13037 13038 13039 13040 13041 13042 13043 13044 13045 13046 13047 13048 13049 13050 13051 13052 13053 13054 13055 13056 13057 13058 13059 13060 13061 13062 13063 13064 13065 13066 13067 13068 13069 13070 13071 13072 13073 13074 13075 13076 13077 13078 13079 13080 13081 13082 13083 13084 13085 13086 13087 13088 13089 13090 13091 13092 13093 13094 13095 13096 13097 13098 13099 13100 13101 13102 13103 13104 13105 13106 13107 13108 13109 13110 13111 13112 13113 13114 13115 13116 13117 13118 13119 13120 13121 13122 13123 13124 13125 13126 13127 13128 13129 13130 13131 13132 13133 13134 13135 13136 13137 13138 13139 13140 13141 13142 13143 13144 13145 13146 13147 13148 13149 13150 13151 13152 13153 13154 13155 13156 13157 13158 13159 13160 13161 13162 13163 13164 13165 13166 13167 13168 13169 13170 13171 13172 13173 13174 13175 13176 13177 13178 13179 13180 13181 13182 13183 13184 13185 13186 13187 13188 13189 13190 13191 13192 13193 13194 13195 13196 13197 13198 13199 13200 13201 13202 13203 13204 13205 13206 13207 13208 13209 13210 13211 13212 13213 13214 13215 13216 13217 13218 13219 13220 13221 13222 13223 13224 13225 13226 13227 13228 13229 13230 13231 13232 13233 13234 13235 13236 13237 13238 13239 13240 13241 13242 13243 13244 13245 13246 13247 13248 13249 13250 13251 13252 13253 13254 13255 13256 13257 13258 13259 13260 13261 13262 13263 13264 13265 13266 13267 13268 13269 13270 13271 13272 13273 13274 13275 13276 13277 13278 13279 13280 13281 13282 13283 13284 13285 13286 13287 13288 13289 13290 13291 13292 13293 13294 13295 13296 13297 13298 13299 13300 13301 13302 13303 13304 13305 13306 13307 13308 13309 13310 13311 13312 13313 13314 13315 13316 13317 13318 13319 13320 13321 13322 13323 13324 13325 13326 13327 13328 13329 13330 13331 13332 13333 13334 13335 13336 13337 13338 13339 13340 13341 13342 13343 13344 13345 13346 13347 13348 13349 13350 13351 13352 13353 13354 13355 13356 13357 13358 13359 13360 13361 13362 13363 13364 13365 13366 13367 13368 13369 13370 13371 13372 13373 13374 13375 13376 13377 13378 13379 13380 13381 13382 13383 13384 13385 13386 13387 13388 13389 13390 13391 13392 13393 13394 13395 13396 13397 13398 13399 13400 13401 13402 13403 13404 13405 13406 13407 13408 13409 13410 13411 13412 13413 13414 13415 13416 13417 13418 13419 13420 13421 13422 13423 13424 13425 13426 13427 13428 13429 13430 13431 13432 13433 13434 13435 13436 13437 13438 13439 13440 13441 13442 13443 13444 13445 13446 13447 13448 13449 13450 13451 13452 13453 13454 13455 13456 13457 13458 13459 13460 13461 13462 13463 13464 13465 13466 13467 13468 13469 13470 13471 13472 13473 13474 13475 13476 13477 13478 13479 13480 13481 13482 13483 13484 13485 13486 13487 13488 13489 13490 13491 13492 13493 13494 13495 13496 13497 13498 13499 13500 13501 13502 13503 13504 13505 13506 13507 13508 13509 13510 13511 13512 13513 13514 13515 13516 13517 13518 13519 13520 13521 13522 13523 13524 13525 13526 13527 13528 13529 13530 13531 13532 13533 13534 13535 13536 13537 13538 13539 13540 13541 13542 13543 13544 13545 13546 13547 13548 13549 13550 13551 13552 13553 13554 13555 13556 13557 13558 13559 13560 13561 13562 13563 13564 13565 13566 13567 13568 13569 13570 13571 13572 13573 13574 13575 13576 13577 13578 13579 13580 13581 13582 13583 13584 13585 13586 13587 13588 13589 13590 13591 13592 13593 13594 13595 13596 13597 13598 13599 13600 13601 13602 13603 13604 13605 13606 13607 13608 13609 13610 13611 13612 13613 13614 13615 13616 13617 13618 13619 13620 13621 13622 13623 13624 13625 13626 13627 13628 13629 13630 13631 13632 13633 13634 13635 13636 13637 13638 13639 13640 13641 13642 13643 13644 13645 13646 13647 13648 13649 13650 13651 13652 13653 13654 13655 13656 13657 13658 13659 13660 13661 13662 13663 13664 13665 13666 13667 13668 13669 13670 13671 13672 13673 13674 13675 13676 13677 13678 13679 13680 13681 13682 13683 13684 13685 13686 13687 13688 13689 13690 13691 13692 13693 13694 13695 13696 13697 13698 13699 13700 13701 13702 13703 13704 13705 13706 13707 13708 13709 13710 13711 13712 13713 13714 13715 13716 13717 13718 13719 13720 13721 13722 13723 13724 13725 13726 13727 13728 13729 13730 13731 13732 13733 13734 13735 13736 13737 13738 13739 13740 13741 13742 13743 13744 13745 13746 13747 13748 13749 13750 13751 13752 13753 13754 13755 13756 13757 13758 13759 13760 13761 13762 13763 13764 13765 13766 13767 13768 13769 13770 13771 13772 13773 13774 13775 13776 13777 13778 13779 13780 13781 13782 13783 13784 13785 13786 13787 13788 13789 13790 13791 13792 13793 13794 13795 13796 13797 13798 13799 13800 13801 13802 13803 13804 13805 13806 13807 13808 13809 13810 13811 13812 13813 13814 13815 13816 13817 13818 13819 13820 13821 13822 13823 13824 13825 13826 13827 13828 13829 13830 13831 13832 13833 13834 13835 13836 13837 13838 13839 13840 13841 13842 13843 13844 13845 13846 13847 13848 13849 13850 13851 13852 13853 13854 13855 13856 13857 13858 13859 13860 13861 13862 13863 13864 13865 13866 13867 13868 13869 13870 13871 13872 13873 13874 13875 13876 13877 13878 13879 13880 13881 13882 13883 13884 13885 13886 13887 13888 13889 13890 13891 13892 13893 13894 13895 13896 13897 13898 13899 13900 13901 13902 13903 13904 13905 13906 13907 13908 13909 13910 13911 13912 13913 13914 13915 13916 13917 13918 13919 13920 13921 13922 13923 13924 13925 13926 13927 13928 13929 13930 13931 13932 13933 13934 13935 13936 13937 13938 13939 13940 13941 13942 13943 13944 13945 13946 13947 13948 13949 13950 13951 13952 13953 13954 13955 13956 13957 13958 13959 13960 13961 13962 13963 13964 13965 13966 13967 13968 13969 13970 13971 13972 13973 13974 13975 13976 13977 13978 13979 13980 13981 13982 13983 13984 13985 13986 13987 13988 13989 13990 13991 13992 13993 13994 13995 13996 13997 13998 13999 14000 14001 14002 14003 14004 14005 14006 14007 14008 14009 14010 14011 14012 14013 14014 14015 14016 14017 14018 14019 14020 14021 14022 14023 14024 14025 14026 14027 14028 14029 14030 14031 14032 14033 14034 14035 14036 14037 14038 14039 14040 14041 14042 14043 14044 14045 14046 14047 14048 14049 14050 14051 14052 14053 14054 14055 14056 14057 14058 14059 14060 14061 14062 14063 14064 14065 14066 14067 14068 14069 14070 14071 14072 14073 14074 14075 14076 14077 14078 14079 14080 14081 14082 14083 14084 14085 14086 14087 14088 14089 14090 14091 14092 14093 14094 14095 14096 14097 14098 14099 14100 14101 14102 14103 14104 14105 14106 14107 14108 14109 14110 14111 14112 14113 14114 14115 14116 14117 14118 14119 14120 14121 14122 14123 14124 14125 14126 14127 14128 14129 14130 14131 14132 14133 14134 14135 14136 14137 14138 14139 14140 14141 14142 14143 14144 14145 14146 14147 14148 14149 14150 14151 14152 14153 14154 14155 14156 14157 14158 14159 14160 14161 14162 14163 14164 14165 14166 14167 14168 14169 14170 14171 14172 14173 14174 14175 14176 14177 14178 14179 14180 14181 14182 14183 14184 14185 14186 14187 14188 14189 14190 14191 14192 14193 14194 14195 14196 14197 14198 14199 14200 14201 14202 14203 14204 14205 14206 14207 14208 14209 14210 14211 14212 14213 14214 14215 14216 14217 14218 14219 14220 14221 14222 14223 14224 14225 14226 14227 14228 14229 14230 14231 14232 14233 14234 14235 14236 14237 14238 14239 14240 14241 14242 14243 14244 14245 14246 14247 14248 14249 14250 14251 14252 14253 14254 14255 14256 14257 14258 14259 14260 14261 14262 14263 14264 14265 14266 14267 14268 14269 14270 14271 14272 14273 14274 14275 14276 14277 14278 14279 14280 14281 14282 14283 14284 14285 14286 14287 14288 14289 14290 14291 14292 14293 14294 14295 14296 14297 14298 14299 14300 14301 14302 14303 14304 14305 14306 14307 14308 14309 14310 14311 14312 14313 14314 14315 14316 14317 14318 14319 14320 14321 14322 14323 14324 14325 14326 14327 14328 14329 14330 14331 14332 14333 14334 14335 14336 14337 14338 14339 14340 14341 14342 14343 14344 14345 14346 14347 14348 14349 14350 14351 14352 14353 14354 14355 14356 14357 14358 14359 14360 14361 14362 14363 14364 14365 14366 14367 14368 14369 14370 14371 14372 14373 14374 14375 14376 14377 14378 14379 14380 14381 14382 14383 14384 14385 14386 14387 14388 14389 14390 14391 14392 14393 14394 14395 14396 14397 14398 14399 14400 14401 14402 14403 14404 14405 14406 14407 14408 14409 14410 14411 14412 14413 14414 14415 14416 14417 14418 14419 14420 14421 14422 14423 14424 14425 14426 14427 14428 14429 14430 14431 14432 14433 14434 14435 14436 14437 14438 14439 14440 14441 14442 14443 14444 14445 14446 14447 14448 14449 14450 14451 14452 14453 14454 14455 14456 14457 14458 14459 14460 14461 14462 14463 14464 14465 14466 14467 14468 14469 14470 14471 14472 14473 14474 14475 14476 14477 14478 14479 14480 14481 14482 14483 14484 14485 14486 14487 14488 14489 14490 14491 14492 14493 14494 14495 14496 14497 14498 14499 14500 14501 14502 14503 14504 14505 14506 14507 14508 14509 14510 14511 14512 14513 14514 14515 14516 14517 14518 14519 14520 14521 14522 14523 14524 14525 14526 14527 14528 14529 14530 14531 14532 14533 14534 14535 14536 14537 14538 14539 14540 14541 14542 14543 14544 14545 14546 14547 14548 14549 14550 14551 14552 14553 14554 14555 14556 14557 14558 14559 14560 14561 14562 14563 14564 14565 14566 14567 14568 14569 14570 14571 14572 14573 14574 14575 14576 14577 14578 14579 14580 14581 14582 14583 14584 14585 14586 14587 14588 14589 14590 14591 14592 14593 14594 14595 14596 14597 14598 14599 14600 14601 14602 14603 14604 14605 14606 14607 14608 14609 14610 14611 14612 14613 14614 14615 14616 14617 14618 14619 14620 14621 14622 14623 14624 14625 14626 14627 14628 14629 14630 14631 14632 14633 14634 14635 14636 14637 14638 14639 14640 14641 14642 14643 14644 14645 14646 14647 14648 14649 14650 14651 14652 14653 14654 14655 14656 14657 14658 14659 14660 14661 14662 14663 14664 14665 14666 14667 14668 14669 14670 14671 14672 14673 14674 14675 14676 14677 14678 14679 14680 14681 14682 14683 14684 14685 14686 14687 14688 14689 14690 14691 14692 14693 14694 14695 14696 14697 14698 14699 14700 14701 14702 14703 14704 14705 14706 14707 14708 14709 14710 14711 14712 14713 14714 14715 14716 14717 14718 14719 14720 14721 14722 14723 14724 14725 14726 14727 14728 14729 14730 14731 14732 14733 14734 14735 14736 14737 14738 14739 14740 14741 14742 14743 14744 14745 14746 14747 14748 14749 14750 14751 14752 14753 14754 14755 14756 14757 14758 14759 14760 14761 14762 14763 14764 14765 14766 14767 14768 14769 14770 14771 14772 14773 14774 14775 14776 14777 14778 14779 14780 14781 14782 14783 14784 14785 14786 14787 14788 14789 14790 14791 14792 14793 14794 14795 14796 14797 14798 14799 14800 14801 14802 14803 14804 14805 14806 14807 14808 14809 14810 14811 14812 14813 14814 14815 14816 14817 14818 14819 14820 14821 14822 14823 14824 14825 14826 14827 14828 14829 14830 14831 14832 14833 14834 14835 14836 14837 14838 14839 14840 14841 14842 14843 14844 14845 14846 14847 14848 14849 14850 14851 14852 14853 14854 14855 14856 14857 14858 14859 14860 14861 14862 14863 14864 14865 14866 14867 14868 14869 14870 14871 14872 14873 14874 14875 14876 14877 14878 14879 14880 14881 14882 14883 14884 14885 14886 14887 14888 14889 14890 14891 14892 14893 14894 14895 14896 14897 14898 14899 14900 14901 14902 14903 14904 14905 14906 14907 14908 14909 14910 14911 14912 14913 14914 14915 14916 14917 14918 14919 14920 14921 14922 14923 14924 14925 14926 14927 14928 14929 14930 14931 14932 14933 14934 14935 14936 14937 14938 14939 14940 14941 14942 14943 14944 14945 14946 14947 14948 14949 14950 14951 14952 14953 14954 14955 14956 14957 14958 14959 14960 14961 14962 14963 14964 14965 14966 14967 14968 14969 14970 14971 14972 14973 14974 14975 14976 14977 14978 14979 14980 14981 14982 14983 14984 14985 14986 14987 14988 14989 14990 14991 14992 14993 14994 14995 14996 14997 14998 14999 15000 15001 15002 15003 15004 15005 15006 15007 15008 15009 15010 15011 15012 15013 15014 15015 15016 15017 15018 15019 15020 15021 15022 15023 15024 15025 15026 15027 15028 15029 15030 15031 15032 15033 15034 15035 15036 15037 15038 15039 15040 15041 15042 15043 15044 15045 15046 15047 15048 15049 15050 15051 15052 15053 15054 15055 15056 15057 15058 15059 15060 15061 15062 15063 15064 15065 15066 15067 15068 15069 15070 15071 15072 15073 15074 15075 15076 15077 15078 15079 15080 15081 15082 15083 15084 15085 15086 15087 15088 15089 15090 15091 15092 15093 15094 15095 15096 15097 15098 15099 15100 15101 15102 15103 15104 15105 15106 15107 15108 15109 15110 15111 15112 15113 15114 15115 15116 15117 15118 15119 15120 15121 15122 15123 15124 15125 15126 15127 15128 15129 15130 15131 15132 15133 15134 15135 15136 15137 15138 15139 15140 15141 15142 15143 15144 15145 15146 15147 15148 15149 15150 15151 15152 15153 15154 15155 15156 15157 15158 15159 15160 15161 15162 15163 15164 15165 15166 15167 15168 15169 15170 15171 15172 15173 15174 15175 15176 15177 15178 15179 15180 15181 15182 15183 15184 15185 15186 15187 15188 15189 15190 15191 15192 15193 15194 15195 15196 15197 15198 15199 15200 15201 15202 15203 15204 15205 15206 15207 15208 15209 15210 15211 15212 15213 15214 15215 15216 15217 15218 15219 15220 15221 15222 15223 15224 15225 15226 15227 15228 15229 15230 15231 15232 15233 15234 15235 15236 15237 15238 15239 15240 15241 15242 15243 15244 15245 15246 15247 15248 15249 15250 15251 15252 15253 15254 15255 15256 15257 15258 15259 15260 15261 15262 15263 15264 15265 15266 15267 15268 15269 15270 15271 15272 15273 15274 15275 15276 15277 15278 15279 15280 15281 15282 15283 15284 15285 15286 15287 15288 15289 15290 15291 15292 15293 15294 15295 15296 15297 15298 15299 15300 15301 15302 15303 15304 15305 15306 15307 15308 15309 15310 15311 15312 15313 15314 15315 15316 15317 15318 15319 15320 15321 15322 15323 15324 15325 15326 15327 15328 15329 15330 15331 15332 15333 15334 15335 15336 15337 15338 15339 15340 15341 15342 15343 15344 15345 15346 15347 15348 15349 15350 15351 15352 15353 15354 15355 15356 15357 15358 15359 15360 15361 15362 15363 15364 15365 15366 15367 15368 15369 15370 15371 15372 15373 15374 15375 15376 15377 15378 15379 15380 15381 15382 15383 15384 15385 15386 15387 15388 15389 15390 15391 15392 15393 15394 15395 15396 15397 15398 15399 15400 15401 15402 15403 15404 15405 15406 15407 15408 15409 15410 15411 15412 15413 15414 15415 15416 15417 15418 15419 15420 15421 15422 15423 15424 15425 15426 15427 15428 15429 15430 15431 15432 15433 15434 15435 15436 15437 15438 15439 15440 15441 15442 15443 15444 15445 15446 15447 15448 15449 15450 15451 15452 15453 15454 15455 15456 15457 15458 15459 15460 15461 15462 15463 15464 15465 15466 15467 15468 15469 15470 15471 15472 15473 15474 15475 15476 15477 15478 15479 15480 15481 15482 15483 15484 15485 15486 15487 15488 15489 15490 15491 15492 15493 15494 15495 15496 15497 15498 15499 15500 15501 15502 15503 15504 15505 15506 15507 15508 15509 15510 15511 15512 15513 15514 15515 15516 15517 15518 15519 15520 15521 15522 15523 15524 15525 15526 15527 15528 15529 15530 15531 15532 15533 15534 15535 15536 15537 15538 15539 15540 15541 15542 15543 15544 15545 15546 15547 15548 15549 15550 15551 15552 15553 15554 15555 15556 15557 15558 15559 15560 15561 15562 15563 15564 15565 15566 15567 15568 15569 15570 15571 15572 15573 15574 15575 15576 15577 15578 15579 15580 15581 15582 15583 15584 15585 15586 15587 15588 15589 15590 15591 15592 15593 15594 15595 15596 15597 15598 15599 15600 15601 15602 15603 15604 15605 15606 15607 15608 15609 15610 15611 15612 15613 15614 15615 15616 15617 15618 15619 15620 15621 15622 15623 15624 15625 15626 15627 15628 15629 15630 15631 15632 15633 15634 15635 15636 15637 15638 15639 15640 15641 15642 15643 15644 15645 15646 15647 15648 15649 15650 15651 15652 15653 15654 15655 15656 15657 15658 15659 15660 15661 15662 15663 15664 15665 15666 15667 15668 15669 15670 15671 15672 15673 15674 15675 15676 15677 15678 15679 15680 15681 15682 15683 15684 15685 15686 15687 15688 15689 15690 15691 15692 15693 15694 15695 15696 15697 15698 15699 15700 15701 15702 15703 15704 15705 15706 15707 15708 15709 15710 15711 15712 15713 15714 15715 15716 15717 15718 15719 15720 15721 15722 15723 15724 15725 15726 15727 15728 15729 15730 15731 15732 15733 15734 15735 15736 15737 15738 15739 15740 15741 15742 15743 15744 15745 15746 15747 15748 15749 15750 15751 15752 15753 15754 15755 15756 15757 15758 15759 15760 15761 15762 15763 15764 15765 15766 15767 15768 15769 15770 15771 15772 15773 15774 15775 15776 15777 15778 15779 15780 15781 15782 15783 15784 15785 15786 15787 15788 15789 15790 15791 15792 15793 15794 15795 15796 15797 15798 15799 15800 15801 15802 15803 15804 15805 15806 15807 15808 15809 15810 15811 15812 15813 15814 15815 15816 15817 15818 15819 15820 15821 15822 15823 15824 15825 15826 15827 15828 15829 15830 15831 15832 15833 15834 15835 15836 15837 15838 15839 15840 15841 15842 15843 15844 15845 15846 15847 15848 15849 15850 15851 15852 15853 15854 15855 15856 15857 15858 15859 15860 15861 15862 15863 15864 15865 15866 15867 15868 15869 15870 15871 15872 15873 15874 15875 15876 15877 15878 15879 15880 15881 15882 15883 15884 15885 15886 15887 15888 15889 15890 15891 15892 15893 15894 15895 15896 15897 15898 15899 15900 15901 15902 15903 15904 15905 15906 15907 15908 15909 15910 15911 15912 15913 15914 15915 15916 15917 15918 15919 15920 15921 15922 15923 15924 15925 15926 15927 15928 15929 15930 15931 15932 15933 15934 15935 15936 15937 15938 15939 15940 15941 15942 15943 15944 15945 15946 15947 15948 15949 15950 15951 15952 15953 15954 15955 15956 15957 15958 15959 15960 15961 15962 15963 15964 15965 15966 15967 15968 15969 15970 15971 15972 15973 15974 15975 15976 15977 15978 15979 15980 15981 15982 15983 15984 15985 15986 15987 15988 15989 15990 15991 15992 15993 15994 15995 15996 15997 15998 15999 16000 16001 16002 16003 16004 16005 16006 16007 16008 16009 16010 16011 16012 16013 16014 16015 16016 16017 16018 16019 16020 16021 16022 16023 16024 16025 16026 16027 16028 16029 16030 16031 16032 16033 16034 16035 16036 16037 16038 16039 16040 16041 16042 16043 16044 16045 16046 16047 16048 16049 16050 16051 16052 16053 16054 16055 16056 16057 16058 16059 16060 16061 16062 16063 16064 16065 16066 16067 16068 16069 16070 16071 16072 16073 16074 16075 16076 16077 16078 16079 16080 16081 16082 16083 16084 16085 16086 16087 16088 16089 16090 16091 16092 16093 16094 16095 16096 16097 16098 16099 16100 16101 16102 16103 16104 16105 16106 16107 16108 16109 16110 16111 16112 16113 16114 16115 16116 16117 16118 16119 16120 16121 16122 16123 16124 16125 16126 16127 16128 16129 16130 16131 16132 16133 16134 16135 16136 16137 16138 16139 16140 16141 16142 16143 16144 16145 16146 16147 16148 16149 16150 16151 16152 16153 16154 16155 16156 16157 16158 16159 16160 16161 16162 16163 16164 16165 16166 16167 16168 16169 16170 16171 16172 16173 16174 16175 16176 16177 16178 16179 16180 16181 16182 16183 16184 16185 16186 16187 16188 16189 16190 16191 16192 16193 16194 16195 16196 16197 16198 16199 16200 16201 16202 16203 16204 16205 16206 16207 16208 16209 16210 16211 16212 16213 16214 16215 16216 16217 16218 16219 16220 16221 16222 16223 16224 16225 16226 16227 16228 16229 16230 16231 16232 16233 16234 16235 16236 16237 16238 16239 16240 16241 16242 16243 16244 16245 16246 16247 16248 16249 16250 16251 16252 16253 16254 16255 16256 16257 16258 16259 16260 16261 16262 16263 16264 16265 16266 16267 16268 16269 16270 16271 16272 16273 16274 16275 16276 16277 16278 16279 16280 16281 16282 16283 16284 16285 16286 16287 16288 16289 16290 16291 16292 16293 16294 16295 16296 16297 16298 16299 16300 16301 16302 16303 16304 16305 16306 16307 16308 16309 16310 16311 16312 16313 16314 16315 16316 16317 16318 16319 16320 16321 16322 16323 16324 16325 16326 16327 16328 16329 16330 16331 16332 16333 16334 16335 16336 16337 16338 16339 16340 16341 16342 16343 16344 16345 16346 16347 16348 16349 16350 16351 16352 16353 16354 16355 16356 16357 16358 16359 16360 16361 16362 16363 16364 16365 16366 16367 16368 16369 16370 16371 16372 16373 16374 16375 16376 16377 16378 16379 16380 16381 16382 16383 16384 16385 16386 16387 16388 16389 16390 16391 16392 16393 16394 16395 16396 16397 16398 16399 16400 16401 16402 16403 16404 16405 16406 16407 16408 16409 16410 16411 16412 16413 16414 16415 16416 16417 16418 16419 16420 16421 16422 16423 16424 16425 16426 16427 16428 16429 16430 16431 16432 16433 16434 16435 16436 16437 16438 16439 16440 16441 16442 16443 16444 16445 16446 16447 16448 16449 16450 16451 16452 16453 16454 16455 16456 16457 16458 16459 16460 16461 16462 16463 16464 16465 16466 16467 16468 16469 16470 16471 16472 16473 16474 16475 16476 16477 16478 16479 16480 16481 16482 16483 16484 16485 16486 16487 16488 16489 16490 16491 16492 16493 16494 16495 16496 16497 16498 16499 16500 16501 16502 16503 16504 16505 16506 16507 16508 16509 16510 16511 16512 16513 16514 16515 16516 16517 16518 16519 16520 16521 16522 16523 16524 16525 16526 16527 16528 16529 16530 16531 16532 16533 16534 16535 16536 16537 16538 16539 16540 16541 16542 16543 16544 16545 16546 16547 16548 16549 16550 16551 16552 16553 16554 16555 16556 16557 16558 16559 16560 16561 16562 16563 16564 16565 16566 16567 16568 16569 16570 16571 16572 16573 16574 16575 16576 16577 16578 16579 16580 16581 16582 16583 16584 16585 16586 16587 16588 16589 16590 16591 16592 16593 16594 16595 16596 16597 16598 16599 16600 16601 16602 16603 16604 16605 16606 16607 16608 16609 16610 16611 16612 16613 16614 16615 16616 16617 16618 16619 16620 16621 16622 16623 16624 16625 16626 16627 16628 16629 16630 16631 16632 16633 16634 16635 16636 16637 16638 16639 16640 16641 16642 16643 16644 16645 16646 16647 16648 16649 16650 16651 16652 16653 16654 16655 16656 16657 16658 16659 16660 16661 16662 16663 16664 16665 16666 16667 16668 16669 16670 16671 16672 16673 16674 16675 16676 16677 16678 16679 16680 16681 16682 16683 16684 16685 16686 16687 16688 16689 16690 16691 16692 16693 16694 16695 16696 16697 16698 16699 16700 16701 16702 16703 16704 16705 16706 16707 16708 16709 16710 16711 16712 16713 16714 16715 16716 16717 16718 16719 16720 16721 16722 16723 16724 16725 16726 16727 16728 16729 16730 16731 16732 16733 16734 16735 16736 16737 16738 16739 16740 16741 16742 16743 16744 16745 16746 16747 16748 16749 16750 16751 16752 16753 16754 16755 16756 16757 16758 16759 16760 16761 16762 16763 16764 16765 16766 16767 16768 16769 16770 16771 16772 16773 16774 16775 16776 16777 16778 16779 16780 16781 16782 16783 16784 16785 16786 16787 16788 16789 16790 16791 16792 16793 16794 16795 16796 16797 16798 16799 16800 16801 16802 16803 16804 16805 16806 16807 16808 16809 16810 16811 16812 16813 16814 16815 16816 16817 16818 16819 16820 16821 16822 16823 16824 16825 16826 16827 16828 16829 16830 16831 16832 16833 16834 16835 16836 16837 16838 16839 16840 16841 16842 16843 16844 16845 16846 16847 16848 16849 16850 16851 16852 16853 16854 16855 16856 16857 16858 16859 16860 16861 16862 16863 16864 16865 16866 16867 16868 16869 16870 16871 16872 16873 16874 16875 16876 16877 16878 16879 16880 16881 16882 16883 16884 16885 16886 16887 16888 16889 16890 16891 16892 16893 16894 16895 16896 16897 16898 16899 16900 16901 16902 16903 16904 16905 16906 16907 16908 16909 16910 16911 16912 16913 16914 16915 16916 16917 16918 16919 16920 16921 16922 16923 16924 16925 16926 16927 16928 16929 16930 16931 16932 16933 16934 16935 16936 16937 16938 16939 16940 16941 16942 16943 16944 16945 16946 16947 16948 16949 16950 16951 16952 16953 16954 16955 16956 16957 16958 16959 16960 16961 16962 16963 16964 16965 16966 16967 16968 16969 16970 16971 16972 16973 16974 16975 16976 16977 16978 16979 16980 16981 16982 16983 16984 16985 16986 16987 16988 16989 16990 16991 16992 16993 16994 16995 16996 16997 16998 16999 17000 17001 17002 17003 17004 17005 17006 17007 17008 17009 17010 17011 17012 17013 17014 17015 17016 17017 17018 17019 17020 17021 17022 17023 17024 17025 17026 17027 17028 17029 17030 17031 17032 17033 17034 17035 17036 17037 17038 17039 17040 17041 17042 17043 17044 17045 17046 17047 17048 17049 17050 17051 17052 17053 17054 17055 17056 17057 17058 17059 17060 17061 17062 17063 17064 17065 17066 17067 17068 17069 17070 17071 17072 17073 17074 17075 17076 17077 17078 17079 17080 17081 17082 17083 17084 17085 17086 17087 17088 17089 17090 17091 17092 17093 17094 17095 17096 17097 17098 17099 17100 17101 17102 17103 17104 17105 17106 17107 17108 17109 17110 17111 17112 17113 17114 17115 17116 17117 17118 17119 17120 17121 17122 17123 17124 17125 17126 17127 17128 17129 17130 17131 17132 17133 17134 17135 17136 17137 17138 17139 17140 17141 17142 17143 17144 17145 17146 17147 17148 17149 17150 17151 17152 17153 17154 17155 17156 17157 17158 17159 17160 17161 17162 17163 17164 17165 17166 17167 17168 17169 17170 17171 17172 17173 17174 17175 17176 17177 17178 17179 17180 17181 17182 17183 17184 17185 17186 17187 17188 17189 17190 17191 17192 17193 17194 17195 17196 17197 17198 17199 17200 17201 17202 17203 17204 17205 17206 17207 17208 17209 17210 17211 17212 17213 17214 17215 17216 17217 17218 17219 17220 17221 17222 17223 17224 17225 17226 17227 17228 17229 17230 17231 17232 17233 17234 17235 17236 17237 17238 17239 17240 17241 17242 17243 17244 17245 17246 17247 17248 17249 17250 17251 17252 17253 17254 17255 17256 17257 17258 17259 17260 17261 17262 17263 17264 17265 17266 17267 17268 17269 17270 17271 17272 17273 17274 17275 17276 17277 17278 17279 17280 17281 17282 17283 17284 17285 17286 17287 17288 17289 17290 17291 17292 17293 17294 17295 17296 17297 17298 17299 17300 17301 17302 17303 17304 17305 17306 17307 17308 17309 17310 17311 17312 17313 17314 17315 17316 17317 17318 17319 17320 17321 17322 17323 17324 17325 17326 17327 17328 17329 17330 17331 17332 17333 17334 17335 17336 17337 17338 17339 17340 17341 17342 17343 17344 17345 17346 17347 17348 17349 17350 17351 17352 17353 17354 17355 17356 17357 17358 17359 17360 17361 17362 17363 17364 17365 17366 17367 17368 17369 17370 17371 17372 17373 17374 17375 17376 17377 17378 17379 17380 17381 17382 17383 17384 17385 17386 17387 17388 17389 17390 17391 17392 17393 17394 17395 17396 17397 17398 17399 17400 17401 17402 17403 17404 17405 17406 17407 17408 17409 17410 17411 17412 17413 17414 17415 17416 17417 17418 17419 17420 17421 17422 17423 17424 17425 17426 17427 17428 17429 17430 17431 17432 17433 17434 17435 17436 17437 17438 17439 17440 17441 17442 17443 17444 17445 17446 17447 17448 17449 17450 17451 17452 17453 17454 17455 17456 17457 17458 17459 17460 17461 17462 17463 17464 17465 17466 17467 17468 17469 17470 17471 17472 17473 17474 17475 17476 17477 17478 17479 17480 17481 17482 17483 17484 17485 17486 17487 17488 17489 17490 17491 17492 17493 17494 17495 17496 17497 17498 17499 17500 17501 17502 17503 17504 17505 17506 17507 17508 17509 17510 17511 17512 17513 17514 17515 17516 17517 17518 17519 17520 17521 17522 17523 17524 17525 17526 17527 17528 17529 17530 17531 17532 17533 17534 17535 17536 17537 17538 17539 17540 17541 17542 17543 17544 17545 17546 17547 17548 17549 17550 17551 17552 17553 17554 17555 17556 17557 17558 17559 17560 17561 17562 17563 17564 17565 17566 17567 17568 17569 17570 17571 17572 17573 17574 17575 17576 17577 17578 17579 17580 17581 17582 17583 17584 17585 17586 17587 17588 17589 17590 17591 17592 17593 17594 17595 17596 17597 17598 17599 17600 17601 17602 17603 17604 17605 17606 17607 17608 17609 17610 17611 17612 17613 17614 17615 17616 17617 17618 17619 17620 17621 17622 17623 17624 17625 17626 17627 17628 17629 17630 17631 17632 17633 17634 17635 17636 17637 17638 17639 17640 17641 17642 17643 17644 17645 17646 17647 17648 17649 17650 17651 17652 17653 17654 17655 17656 17657 17658 17659 17660 17661 17662 17663 17664 17665 17666 17667 17668 17669 17670 17671 17672 17673 17674 17675 17676 17677 17678 17679 17680 17681 17682 17683 17684 17685 17686 17687 17688 17689 17690 17691 17692 17693 17694 17695 17696 17697 17698 17699 17700 17701 17702 17703 17704 17705 17706 17707 17708 17709 17710 17711 17712 17713 17714 17715 17716 17717 17718 17719 17720 17721 17722 17723 17724 17725 17726 17727 17728 17729 17730 17731 17732 17733 17734 17735 17736 17737 17738 17739 17740 17741 17742 17743 17744 17745 17746 17747 17748 17749 17750 17751 17752 17753 17754 17755 17756 17757 17758 17759 17760 17761 17762 17763 17764 17765 17766 17767 17768 17769 17770 17771 17772 17773 17774 17775 17776 17777 17778 17779 17780 17781 17782 17783 17784 17785 17786 17787 17788 17789 17790 17791 17792 17793 17794 17795 17796 17797 17798 17799 17800 17801 17802 17803 17804 17805 17806 17807 17808 17809 17810 17811 17812 17813 17814 17815 17816 17817 17818 17819 17820 17821 17822 17823 17824 17825 17826 17827 17828 17829 17830 17831 17832 17833 17834 17835 17836 17837 17838 17839 17840 17841 17842 17843 17844 17845 17846 17847 17848 17849 17850 17851 17852 17853 17854 17855 17856 17857 17858 17859 17860 17861 17862 17863 17864 17865 17866 17867 17868 17869 17870 17871 17872 17873 17874 17875 17876 17877 17878 17879 17880 17881 17882 17883 17884 17885 17886 17887 17888 17889 17890 17891 17892 17893 17894 17895 17896 17897 17898 17899 17900 17901 17902 17903 17904 17905 17906 17907 17908 17909 17910 17911 17912 17913 17914 17915 17916 17917 17918 17919 17920 17921 17922 17923 17924 17925 17926 17927 17928 17929 17930 17931 17932 17933 17934 17935 17936 17937 17938 17939 17940 17941 17942 17943 17944 17945 17946 17947 17948 17949 17950 17951 17952 17953 17954 17955 17956 17957 17958 17959 17960 17961 17962 17963 17964 17965 17966 17967 17968 17969 17970 17971 17972 17973 17974 17975 17976 17977 17978 17979 17980 17981 17982 17983 17984 17985 17986 17987 17988 17989 17990 17991 17992 17993 17994 17995 17996 17997 17998 17999 18000 18001 18002 18003 18004 18005 18006 18007 18008 18009 18010 18011 18012 18013 18014 18015 18016 18017 18018 18019 18020 18021 18022 18023 18024 18025 18026 18027 18028 18029 18030 18031 18032 18033 18034 18035 18036 18037 18038 18039 18040 18041 18042 18043 18044 18045 18046 18047 18048 18049 18050 18051 18052 18053 18054 18055 18056 18057 18058 18059 18060 18061 18062 18063 18064 18065 18066 18067 18068 18069 18070 18071 18072 18073 18074 18075 18076 18077 18078 18079 18080 18081 18082 18083 18084 18085 18086 18087 18088 18089 18090 18091 18092 18093 18094 18095 18096 18097 18098 18099 18100 18101 18102 18103 18104 18105 18106 18107 18108 18109 18110 18111 18112 18113 18114 18115 18116 18117 18118 18119 18120 18121 18122 18123 18124 18125 18126 18127 18128 18129 18130 18131 18132 18133 18134 18135 18136 18137 18138 18139 18140 18141 18142 18143 18144 18145 18146 18147 18148 18149 18150 18151 18152 18153 18154 18155 18156 18157 18158 18159 18160 18161 18162 18163 18164 18165 18166 18167 18168 18169 18170 18171 18172 18173 18174 18175 18176 18177 18178 18179 18180 18181 18182 18183 18184 18185 18186 18187 18188 18189 18190 18191 18192 18193 18194 18195 18196 18197 18198 18199 18200 18201 18202 18203 18204 18205 18206 18207 18208 18209 18210 18211 18212 18213 18214 18215 18216 18217 18218 18219 18220 18221 18222 18223 18224 18225 18226 18227 18228 18229 18230 18231 18232 18233 18234 18235 18236 18237 18238 18239 18240 18241 18242 18243 18244 18245 18246 18247 18248 18249 18250 18251 18252 18253 18254 18255 18256 18257 18258 18259 18260 18261 18262 18263 18264 18265 18266 18267 18268 18269 18270 18271 18272 18273 18274 18275 18276 18277 18278 18279 18280 18281 18282 18283 18284 18285 18286 18287 18288 18289 18290 18291 18292 18293 18294 18295 18296 18297 18298 18299 18300 18301 18302 18303 18304 18305 18306 18307 18308 18309 18310 18311 18312 18313 18314 18315 18316 18317 18318 18319 18320 18321 18322 18323 18324 18325 18326 18327 18328 18329 18330 18331 18332 18333 18334 18335 18336 18337 18338 18339 18340 18341 18342 18343 18344 18345 18346 18347 18348 18349 18350 18351 18352 18353 18354 18355 18356 18357 18358 18359 18360 18361 18362 18363 18364 18365 18366 18367 18368 18369 18370 18371 18372 18373 18374 18375 18376 18377 18378 18379 18380 18381 18382 18383 18384 18385 18386 18387 18388 18389 18390 18391 18392 18393 18394 18395 18396 18397 18398 18399 18400 18401 18402 18403 18404 18405 18406 18407 18408 18409 18410 18411 18412 18413 18414 18415 18416 18417 18418 18419 18420 18421 18422 18423 18424 18425 18426 18427 18428 18429 18430 18431 18432 18433 18434 18435 18436 18437 18438 18439 18440 18441 18442 18443 18444 18445 18446 18447 18448 18449 18450 18451 18452 18453 18454 18455 18456 18457 18458 18459 18460 18461 18462 18463 18464 18465 18466 18467 18468 18469 18470 18471 18472 18473 18474 18475 18476 18477 18478 18479 18480 18481 18482 18483 18484 18485 18486 18487 18488 18489 18490 18491 18492 18493 18494 18495 18496 18497 18498 18499 18500 18501 18502 18503 18504 18505 18506 18507 18508 18509 18510 18511 18512 18513 18514 18515 18516 18517 18518 18519 18520 18521 18522 18523 18524 18525 18526 18527 18528 18529 18530 18531 18532 18533 18534 18535 18536 18537 18538 18539 18540 18541 18542 18543 18544 18545 18546 18547 18548 18549 18550 18551 18552 18553 18554 18555 18556 18557 18558 18559 18560 18561 18562 18563 18564 18565 18566 18567 18568 18569 18570 18571 18572 18573 18574 18575 18576 18577 18578 18579 18580 18581 18582 18583 18584 18585 18586 18587 18588 18589 18590 18591 18592 18593 18594 18595 18596 18597 18598 18599 18600 18601 18602 18603 18604 18605 18606 18607 18608 18609 18610 18611 18612 18613 18614 18615 18616 18617 18618 18619 18620 18621 18622 18623 18624 18625 18626 18627 18628 18629 18630 18631 18632 18633 18634 18635 18636 18637 18638 18639 18640 18641 18642 18643 18644 18645 18646 18647 18648 18649 18650 18651 18652 18653 18654 18655 18656 18657 18658 18659 18660 18661 18662 18663 18664 18665 18666 18667 18668 18669 18670 18671 18672 18673 18674 18675 18676 18677 18678 18679 18680 18681 18682 18683 18684 18685 18686 18687 18688 18689 18690 18691 18692 18693 18694 18695 18696 18697 18698 18699 18700 18701 18702 18703 18704 18705 18706 18707 18708 18709 18710 18711 18712 18713 18714 18715 18716 18717 18718 18719 18720 18721 18722 18723 18724 18725 18726 18727 18728 18729 18730 18731 18732 18733 18734 18735 18736 18737 18738 18739 18740 18741 18742 18743 18744 18745 18746 18747 18748 18749 18750 18751 18752 18753 18754 18755 18756 18757 18758 18759 18760 18761 18762 18763 18764 18765 18766 18767 18768 18769 18770 18771 18772 18773 18774 18775 18776 18777 18778 18779 18780 18781 18782 18783 18784 18785 18786 18787 18788 18789 18790 18791 18792 18793 18794 18795 18796 18797 18798 18799 18800 18801 18802 18803 18804 18805 18806 18807 18808 18809 18810 18811 18812 18813 18814 18815 18816 18817 18818 18819 18820 18821 18822 18823 18824 18825 18826 18827 18828 18829 18830 18831 18832 18833 18834 18835 18836 18837 18838 18839 18840 18841 18842 18843 18844 18845 18846 18847 18848 18849 18850 18851 18852 18853 18854 18855 18856 18857 18858 18859 18860 18861 18862 18863 18864 18865 18866 18867 18868 18869 18870 18871 18872 18873 18874 18875 18876 18877 18878 18879 18880 18881 18882 18883 18884 18885 18886 18887 18888 18889 18890 18891 18892 18893 18894 18895 18896 18897 18898 18899 18900 18901 18902 18903 18904 18905 18906 18907 18908 18909 18910 18911 18912 18913 18914 18915 18916 18917 18918 18919 18920 18921 18922 18923 18924 18925 18926 18927 18928 18929 18930 18931 18932 18933 18934 18935 18936 18937 18938 18939 18940 18941 18942 18943 18944 18945 18946 18947 18948 18949 18950 18951 18952 18953 18954 18955 18956 18957 18958 18959 18960 18961 18962 18963 18964 18965 18966 18967 18968 18969 18970 18971 18972 18973 18974 18975 18976 18977 18978 18979 18980 18981 18982 18983 18984 18985 18986 18987 18988 18989 18990 18991 18992 18993 18994 18995 18996 18997 18998 18999 19000 19001 19002 19003 19004 19005 19006 19007 19008 19009 19010 19011 19012 19013 19014 19015 19016 19017 19018 19019 19020 19021 19022 19023 19024 19025 19026 19027 19028 19029 19030 19031 19032 19033 19034 19035 19036 19037 19038 19039 19040 19041 19042 19043 19044 19045 19046 19047 19048 19049 19050 19051 19052 19053 19054 19055 19056 19057 19058 19059 19060 19061 19062 19063 19064 19065 19066 19067 19068 19069 19070 19071 19072 19073 19074 19075 19076 19077 19078 19079 19080 19081 19082 19083 19084 19085 19086 19087 19088 19089 19090 19091 19092 19093 19094 19095 19096 19097 19098 19099 19100 19101 19102 19103 19104 19105 19106 19107 19108 19109 19110 19111 19112 19113 19114 19115 19116 19117 19118 19119 19120 19121 19122 19123 19124 19125 19126 19127 19128 19129 19130 19131 19132 19133 19134 19135 19136 19137 19138 19139 19140 19141 19142 19143 19144 19145 19146 19147 19148 19149 19150 19151 19152 19153 19154 19155 19156 19157 19158 19159 19160 19161 19162 19163 19164 19165 19166 19167 19168 19169 19170 19171 19172 19173 19174 19175 19176 19177 19178 19179 19180 19181 19182 19183 19184 19185 19186 19187 19188 19189 19190 19191 19192 19193 19194 19195 19196 19197 19198 19199 19200 19201 19202 19203 19204 19205 19206 19207 19208 19209 19210 19211 19212 19213 19214 19215 19216 19217 19218 19219 19220 19221 19222 19223 19224 19225 19226 19227 19228 19229 19230 19231 19232 19233 19234 19235 19236 19237 19238 19239 19240 19241 19242 19243 19244 19245 19246 19247 19248 19249 19250 19251 19252 19253 19254 19255 19256 19257 19258 19259 19260 19261 19262 19263 19264 19265 19266 19267 19268 19269 19270 19271 19272 19273 19274 19275 19276 19277 19278 19279 19280 19281 19282 19283 19284 19285 19286 19287 19288 19289 19290 19291 19292 19293 19294 19295 19296 19297 19298 19299 19300 19301 19302 19303 19304 19305 19306 19307 19308 19309 19310 19311 19312 19313 19314 19315 19316 19317 19318 19319 19320 19321 19322 19323 19324 19325 19326 19327 19328 19329 19330 19331 19332 19333 19334 19335 19336 19337 19338 19339 19340 19341 19342 19343 19344 19345 19346 19347 19348 19349 19350 19351 19352 19353 19354 19355 19356 19357 19358 19359 19360 19361 19362 19363 19364 19365 19366 19367 19368 19369 19370 19371 19372 19373 19374 19375 19376 19377 19378 19379 19380 19381 19382 19383 19384 19385 19386 19387 19388 19389 19390 19391 19392 19393 19394 19395 19396 19397 19398 19399 19400 19401 19402 19403 19404 19405 19406 19407 19408 19409 19410 19411 19412 19413 19414 19415 19416 19417 19418 19419 19420 19421 19422 19423 19424 19425 19426 19427 19428 19429 19430 19431 19432 19433 19434 19435 19436 19437 19438 19439 19440 19441 19442 19443 19444 19445 19446 19447 19448 19449 19450 19451 19452 19453 19454 19455 19456 19457 19458 19459 19460 19461 19462 19463 19464 19465 19466 19467 19468 19469 19470 19471 19472 19473 19474 19475 19476 19477 19478 19479 19480 19481 19482 19483 19484 19485 19486 19487 19488 19489 19490 19491 19492 19493 19494 19495 19496 19497 19498 19499 19500 19501 19502 19503 19504 19505 19506 19507 19508 19509 19510 19511 19512 19513 19514 19515 19516 19517 19518 19519 19520 19521 19522 19523 19524 19525 19526 19527 19528 19529 19530 19531 19532 19533 19534 19535 19536 19537 19538 19539 19540 19541 19542 19543 19544 19545 19546 19547 19548 19549 19550 19551 19552 19553 19554 19555 19556 19557 19558 19559 19560 19561 19562 19563 19564 19565 19566 19567 19568 19569 19570 19571 19572 19573 19574 19575 19576 19577 19578 19579 19580 19581 19582 19583 19584 19585 19586 19587 19588 19589 19590 19591 19592 19593 19594 19595 19596 19597 19598 19599 19600 19601 19602 19603 19604 19605 19606 19607 19608 19609 19610 19611 19612 19613 19614 19615 19616 19617 19618 19619 19620 19621 19622 19623 19624 19625 19626 19627 19628 19629 19630 19631 19632 19633 19634 19635 19636 19637 19638 19639 19640 19641 19642 19643 19644 19645 19646 19647 19648 19649 19650 19651 19652 19653 19654 19655 19656 19657 19658 19659 19660 19661 19662 19663 19664 19665 19666 19667 19668 19669 19670 19671 19672 19673 19674 19675 19676 19677 19678 19679 19680 19681 19682 19683 19684 19685 19686 19687 19688 19689 19690 19691 19692 19693 19694 19695 19696 19697 19698 19699 19700 19701 19702 19703 19704 19705 19706 19707 19708 19709 19710 19711 19712 19713 19714 19715 19716 19717 19718 19719 19720 19721 19722 19723 19724 19725 19726 19727 19728 19729 19730 19731 19732 19733 19734 19735 19736 19737 19738 19739 19740 19741 19742 19743 19744 19745 19746 19747 19748 19749 19750 19751 19752 19753 19754 19755 19756 19757 19758 19759 19760 19761 19762 19763 19764 19765 19766 19767 19768 19769 19770 19771 19772 19773 19774 19775 19776 19777 19778 19779 19780 19781 19782 19783 19784 19785 19786 19787 19788 19789 19790 19791 19792 19793 19794 19795 19796 19797 19798 19799 19800 19801 19802 19803 19804 19805 19806 19807 19808 19809 19810 19811 19812 19813 19814 19815 19816 19817 19818 19819 19820 19821 19822 19823 19824 19825 19826 19827 19828 19829 19830 19831 19832 19833 19834 19835 19836 19837 19838 19839 19840 19841 19842 19843 19844 19845 19846 19847 19848 19849 19850 19851 19852 19853 19854 19855 19856 19857 19858 19859 19860 19861 19862 19863 19864 19865 19866 19867 19868 19869 19870 19871 19872 19873 19874 19875 19876 19877 19878 19879 19880 19881 19882 19883 19884 19885 19886 19887 19888 19889 19890 19891 19892 19893 19894 19895 19896 19897 19898 19899 19900 19901 19902 19903 19904 19905 19906 19907 19908 19909 19910 19911 19912 19913 19914 19915 19916 19917 19918 19919 19920 19921 19922 19923 19924 19925 19926 19927 19928 19929 19930 19931 19932 19933 19934 19935 19936 19937 19938 19939 19940 19941 19942 19943 19944 19945 19946 19947 19948 19949 19950 19951 19952 19953 19954 19955 19956 19957 19958 19959 19960 19961 19962 19963 19964 19965 19966 19967 19968 19969 19970 19971 19972 19973 19974 19975 19976 19977 19978 19979 19980 19981 19982 19983 19984 19985 19986 19987 19988 19989 19990 19991 19992 19993 19994 19995 19996 19997 19998 19999 20000 20001 20002 20003 20004 20005 20006 20007 20008 20009 20010 20011 20012 20013 20014 20015 20016 20017 20018 20019 20020 20021 20022 20023 20024 20025 20026 20027 20028 20029 20030 20031 20032 20033 20034 20035 20036 20037 20038 20039 20040 20041 20042 20043 20044 20045 20046 20047 20048 20049 20050 20051 20052 20053 20054 20055 20056 20057 20058 20059 20060 20061 20062 20063 20064 20065 20066 20067 20068 20069 20070 20071 20072 20073 20074 20075 20076 20077 20078 20079 20080 20081 20082 20083 20084 20085 20086 20087 20088 20089 20090 20091 20092 20093 20094 20095 20096 20097 20098 20099 20100 20101 20102 20103 20104 20105 20106 20107 20108 20109 20110 20111 20112 20113 20114 20115 20116 20117 20118 20119 20120 20121 20122 20123 20124 20125 20126 20127 20128 20129 20130 20131 20132 20133 20134 20135 20136 20137 20138 20139 20140 20141 20142 20143 20144 20145 20146 20147 20148 20149 20150 20151 20152 20153 20154 20155 20156 20157 20158 20159 20160 20161 20162 20163 20164 20165 20166 20167 20168 20169 20170 20171 20172 20173 20174 20175 20176 20177 20178 20179 20180 20181 20182 20183 20184 20185 20186 20187 20188 20189 20190 20191 20192 20193 20194 20195 20196 20197 20198 20199 20200 20201 20202 20203 20204 20205 20206 20207 20208 20209 20210 20211 20212 20213 20214 20215 20216 20217 20218 20219 20220 20221 20222 20223 20224 20225 20226 20227 20228 20229 20230 20231 20232 20233 20234 20235 20236 20237 20238 20239 20240 20241 20242 20243 20244 20245 20246 20247 20248 20249 20250 20251 20252 20253 20254 20255 20256 20257 20258 20259 20260 20261 20262 20263 20264 20265 20266 20267 20268 20269 20270 20271 20272 20273 20274 20275 20276 20277 20278 20279 20280 20281 20282 20283 20284 20285 20286 20287 20288 20289 20290 20291 20292 20293 20294 20295 20296 20297 20298 20299 20300 20301 20302 20303 20304 20305 20306 20307 20308 20309 20310 20311 20312 20313 20314 20315 20316 20317 20318 20319 20320 20321 20322 20323 20324 20325 20326 20327 20328 20329 20330 20331 20332 20333 20334 20335 20336 20337 20338 20339 20340 20341 20342 20343 20344 20345 20346 20347 20348 20349 20350 20351 20352 20353 20354 20355 20356 20357 20358 20359 20360 20361 20362 20363 20364 20365 20366 20367 20368 20369 20370 20371 20372 20373 20374 20375 20376 20377 20378 20379 20380 20381 20382 20383 20384 20385 20386 20387 20388 20389 20390 20391 20392 20393 20394 20395 20396 20397 20398 20399 20400 20401 20402 20403 20404 20405 20406 20407 20408 20409 20410 20411 20412 20413 20414 20415 20416 20417 20418 20419 20420 20421 20422 20423 20424 20425 20426 20427 20428 20429 20430 20431 20432 20433 20434 20435 20436 20437 20438 20439 20440 20441 20442 20443 20444 20445 20446 20447 20448 20449 20450 20451 20452 20453 20454 20455 20456 20457 20458 20459 20460 20461 20462 20463 20464 20465 20466 20467 20468 20469 20470 20471 20472 20473 20474 20475 20476 20477 20478 20479 20480 20481 20482 20483 20484 20485 20486 20487 20488 20489 20490 20491 20492 20493 20494 20495 20496 20497 20498 20499 20500 20501 20502 20503 20504 20505 20506 20507 20508 20509 20510 20511 20512 20513 20514 20515 20516 20517 20518 20519 20520 20521 20522 20523 20524 20525 20526 20527 20528 20529 20530 20531 20532 20533 20534 20535 20536 20537 20538 20539 20540 20541 20542 20543 20544 20545 20546 20547 20548 20549 20550 20551 20552 20553 20554 20555 20556 20557 20558 20559 20560 20561 20562 20563 20564 20565 20566 20567 20568 20569 20570 20571 20572 20573 20574 20575 20576 20577 20578 20579 20580 20581 20582 20583 20584 20585 20586 20587 20588 20589 20590 20591 20592 20593 20594 20595 20596 20597 20598 20599 20600 20601 20602 20603 20604 20605 20606 20607 20608 20609 20610 20611 20612 20613 20614 20615 20616 20617 20618 20619 20620 20621 20622 20623 20624 20625 20626 20627 20628 20629 20630 20631 20632 20633 20634 20635 20636 20637 20638 20639 20640 20641 20642 20643 20644 20645 20646 20647 20648 20649 20650 20651 20652 20653 20654 20655 20656 20657 20658 20659 20660 20661 20662 20663 20664 20665 20666 20667 20668 20669 20670 20671 20672 20673 20674 20675 20676 20677 20678 20679 20680 20681 20682 20683 20684 20685 20686 20687 20688 20689 20690 20691 20692 20693 20694 20695 20696 20697 20698 20699 20700 20701 20702 20703 20704 20705 20706 20707 20708 20709 20710 20711 20712 20713 20714 20715 20716 20717 20718 20719 20720 20721 20722 20723 20724 20725 20726 20727 20728 20729 20730 20731 20732 20733 20734 20735 20736 20737 20738 20739 20740 20741 20742 20743 20744 20745 20746 20747 20748 20749 20750 20751 20752 20753 20754 20755 20756 20757 20758 20759 20760 20761 20762 20763 20764 20765 20766 20767 20768 20769 20770 20771 20772 20773 20774 20775 20776 20777 20778 20779 20780 20781 20782 20783 20784 20785 20786 20787 20788 20789 20790 20791 20792 20793 20794 20795 20796 20797 20798 20799 20800 20801 20802 20803 20804 20805 20806 20807 20808 20809 20810 20811 20812 20813 20814 20815 20816 20817 20818 20819 20820 20821 20822 20823 20824 20825 20826 20827 20828 20829 20830 20831 20832 20833 20834 20835 20836 20837 20838 20839 20840 20841 20842 20843 20844 20845 20846 20847 20848 20849 20850 20851 20852 20853 20854 20855 20856 20857 20858 20859 20860 20861 20862 20863 20864 20865 20866 20867 20868 20869 20870 20871 20872 20873 20874 20875 20876 20877 20878 20879 20880 20881 20882 20883 20884 20885 20886 20887 20888 20889 20890 20891 20892 20893 20894 20895 20896 20897 20898 20899 20900 20901 20902 20903 20904 20905 20906 20907 20908 20909 20910 20911 20912 20913 20914 20915 20916 20917 20918 20919 20920 20921 20922 20923 20924 20925 20926 20927 20928 20929 20930 20931 20932 20933 20934 20935 20936 20937 20938 20939 20940 20941 20942 20943 20944 20945 20946 20947 20948 20949 20950 20951 20952 20953 20954 20955 20956 20957 20958 20959 20960 20961 20962 20963 20964 20965 20966 20967 20968 20969 20970 20971 20972 20973 20974 20975 20976 20977 20978 20979 20980 20981 20982 20983 20984 20985 20986 20987 20988 20989 20990 20991 20992 20993 20994 20995 20996 20997 20998 20999 21000 21001 21002 21003 21004 21005 21006 21007 21008 21009 21010 21011 21012 21013 21014 21015 21016 21017 21018 21019 21020 21021 21022 21023 21024 21025 21026 21027 21028 21029 21030 21031 21032 21033 21034 21035 21036 21037 21038 21039 21040 21041 21042 21043 21044 21045 21046 21047 21048 21049 21050 21051 21052 21053 21054 21055 21056 21057 21058 21059 21060 21061 21062 21063 21064 21065 21066 21067 21068 21069 21070 21071 21072 21073 21074 21075 21076 21077 21078 21079 21080 21081 21082 21083 21084 21085 21086 21087 21088 21089 21090 21091 21092 21093 21094 21095 21096 21097 21098 21099 21100 21101 21102 21103 21104 21105 21106 21107 21108 21109 21110 21111 21112 21113 21114 21115 21116 21117 21118 21119 21120 21121 21122 21123 21124 21125 21126 21127 21128 21129 21130 21131 21132 21133 21134 21135 21136 21137 21138 21139 21140 21141 21142 21143 21144 21145 21146 21147 21148 21149 21150 21151 21152 21153 21154 21155 21156 21157 21158 21159 21160 21161 21162 21163 21164 21165 21166 21167 21168 21169 21170 21171 21172 21173 21174 21175 21176 21177 21178 21179 21180 21181 21182 21183 21184 21185 21186 21187 21188 21189 21190 21191 21192 21193 21194 21195 21196 21197 21198 21199 21200 21201 21202 21203 21204 21205 21206 21207 21208 21209 21210 21211 21212 21213 21214 21215 21216 21217 21218 21219 21220 21221 21222 21223 21224 21225 21226 21227 21228 21229 21230 21231 21232 21233 21234 21235 21236 21237 21238 21239 21240 21241 21242 21243 21244 21245 21246 21247 21248 21249 21250 21251 21252 21253 21254 21255 21256 21257 21258 21259 21260 21261 21262 21263 21264 21265 21266 21267 21268 21269 21270 21271 21272 21273 21274 21275 21276 21277 21278 21279 21280 21281 21282 21283 21284 21285 21286 21287 21288 21289 21290 21291 21292 21293 21294 21295 21296 21297 21298 21299 21300 21301 21302 21303 21304 21305 21306 21307 21308 21309 21310 21311 21312 21313 21314 21315 21316 21317 21318 21319 21320 21321 21322 21323 21324 21325 21326 21327 21328 21329 21330 21331 21332 21333 21334 21335 21336 21337 21338 21339 21340 21341 21342 21343 21344 21345 21346 21347 21348 21349 21350 21351 21352 21353 21354 21355 21356 21357 21358 21359 21360 21361 21362 21363 21364 21365 21366 21367 21368 21369 21370 21371 21372 21373 21374 21375 21376 21377 21378 21379 21380 21381 21382 21383 21384 21385 21386 21387 21388 21389 21390 21391 21392 21393 21394 21395 21396 21397 21398 21399 21400 21401 21402 21403 21404 21405 21406 21407 21408 21409 21410 21411 21412 21413 21414 21415 21416 21417 21418 21419 21420 21421 21422 21423 21424 21425 21426 21427 21428 21429 21430 21431 21432 21433 21434 21435 21436 21437 21438 21439 21440 21441 21442 21443 21444 21445 21446 21447 21448 21449 21450 21451 21452 21453 21454 21455 21456 21457 21458 21459 21460 21461 21462 21463 21464 21465 21466 21467 21468 21469 21470 21471 21472 21473 21474 21475 21476 21477 21478 21479 21480 21481 21482 21483 21484 21485 21486 21487 21488 21489 21490 21491 21492 21493 21494 21495 21496 21497 21498 21499 21500 21501 21502 21503 21504 21505 21506 21507 21508 21509 21510 21511 21512 21513 21514 21515 21516 21517 21518 21519 21520 21521 21522 21523 21524 21525 21526 21527 21528 21529 21530 21531 21532 21533 21534 21535 21536 21537 21538 21539 21540 21541 21542 21543 21544 21545 21546 21547 21548 21549 21550 21551 21552 21553 21554 21555 21556 21557 21558 21559 21560 21561 21562 21563 21564 21565 21566 21567 21568 21569 21570 21571 21572 21573 21574 21575 21576 21577 21578 21579 21580 21581 21582 21583 21584 21585 21586 21587 21588 21589 21590 21591 21592 21593 21594 21595 21596 21597 21598 21599 21600 21601 21602 21603 21604 21605 21606 21607 21608 21609 21610 21611 21612 21613 21614 21615 21616 21617 21618 21619 21620 21621 21622 21623 21624 21625 21626 21627 21628 21629 21630 21631 21632 21633 21634 21635 21636 21637 21638 21639 21640 21641 21642 21643 21644 21645 21646 21647 21648 21649 21650 21651 21652 21653 21654 21655 21656 21657 21658 21659 21660 21661 21662 21663 21664 21665 21666 21667 21668 21669 21670 21671 21672 21673 21674 21675 21676 21677 21678 21679 21680 21681 21682 21683 21684 21685 21686 21687 21688 21689 21690 21691 21692 21693 21694 21695 21696 21697 21698 21699 21700 21701 21702 21703 21704 21705 21706 21707 21708 21709 21710 21711 21712 21713 21714 21715 21716 21717 21718 21719 21720 21721 21722 21723 21724 21725 21726 21727 21728 21729 21730 21731 21732 21733 21734 21735 21736 21737 21738 21739 21740 21741 21742 21743 21744 21745 21746 21747 21748 21749 21750 21751 21752 21753 21754 21755 21756 21757 21758 21759 21760 21761 21762 21763 21764 21765 21766 21767 21768 21769 21770 21771 21772 21773 21774 21775 21776 21777 21778 21779 21780 21781 21782 21783 21784 21785 21786 21787 21788 21789 21790 21791 21792 21793 21794 21795 21796 21797 21798 21799 21800 21801 21802 21803 21804 21805 21806 21807 21808 21809 21810 21811 21812 21813 21814 21815 21816 21817 21818 21819 21820 21821 21822 21823 21824 21825 21826 21827 21828 21829 21830 21831 21832 21833 21834 21835 21836 21837 21838 21839 21840 21841 21842 21843 21844 21845 21846 21847 21848 21849 21850 21851 21852 21853 21854 21855 21856 21857 21858 21859 21860 21861 21862 21863 21864 21865 21866 21867 21868 21869 21870 21871 21872 21873 21874 21875 21876 21877 21878 21879 21880 21881 21882 21883 21884 21885 21886 21887 21888 21889 21890 21891 21892 21893 21894 21895 21896 21897 21898 21899 21900 21901 21902 21903 21904 21905 21906 21907 21908 21909 21910 21911 21912 21913 21914 21915 21916 21917 21918 21919 21920 21921 21922 21923 21924 21925 21926 21927 21928 21929 21930 21931 21932 21933 21934 21935 21936 21937 21938 21939 21940 21941 21942 21943 21944 21945 21946 21947 21948 21949 21950 21951 21952 21953 21954 21955 21956 21957 21958 21959 21960 21961 21962 21963 21964 21965 21966 21967 21968 21969 21970 21971 21972 21973 21974 21975 21976 21977 21978 21979 21980 21981 21982 21983 21984 21985 21986 21987 21988 21989 21990 21991 21992 21993 21994 21995 21996 21997 21998 21999 22000 22001 22002 22003 22004 22005 22006 22007 22008 22009 22010 22011 22012 22013 22014 22015 22016 22017 22018 22019 22020 22021 22022 22023 22024 22025 22026 22027 22028 22029 22030 22031 22032 22033 22034 22035 22036 22037 22038 22039 22040 22041 22042 22043 22044 22045 22046 22047 22048 22049 22050 22051 22052 22053 22054 22055 22056 22057 22058 22059 22060 22061 22062 22063 22064 22065 22066 22067 22068 22069 22070 22071 22072 22073 22074 22075 22076 22077 22078 22079 22080 22081 22082 22083 22084 22085 22086 22087 22088 22089 22090 22091 22092 22093 22094 22095 22096 22097 22098 22099 22100 22101 22102 22103 22104 22105 22106 22107 22108 22109 22110 22111 22112 22113 22114 22115 22116 22117 22118 22119 22120 22121 22122 22123 22124 22125 22126 22127 22128 22129 22130 22131 22132 22133 22134 22135 22136 22137 22138 22139 22140 22141 22142 22143 22144 22145 22146 22147 22148 22149 22150 22151 22152 22153 22154 22155 22156 22157 22158 22159 22160 22161 22162 22163 22164 22165 22166 22167 22168 22169 22170 22171 22172 22173 22174 22175 22176 22177 22178 22179 22180 22181 22182 22183 22184 22185 22186 22187 22188 22189 22190 22191 22192 22193 22194 22195 22196 22197 22198 22199 22200 22201 22202 22203 22204 22205 22206 22207 22208 22209 22210 22211 22212 22213 22214 22215 22216 22217 22218 22219 22220 22221 22222 22223 22224 22225 22226 22227 22228 22229 22230 22231 22232 22233 22234 22235 22236 22237 22238 22239 22240 22241 22242 22243 22244 22245 22246 22247 22248 22249 22250 22251 22252 22253 22254 22255 22256 22257 22258 22259 22260 22261 22262 22263 22264 22265 22266 22267 22268 22269 22270 22271 22272 22273 22274 22275 22276 22277 22278 22279 22280 22281 22282 22283 22284 22285 22286 22287 22288 22289 22290 22291 22292 22293 22294 22295 22296 22297 22298 22299 22300 22301 22302 22303 22304 22305 22306 22307 22308 22309 22310 22311 22312 22313 22314 22315 22316 22317 22318 22319 22320 22321 22322 22323 22324 22325 22326 22327 22328 22329 22330 22331 22332 22333 22334 22335 22336 22337 22338 22339 22340 22341 22342 22343 22344 22345 22346 22347 22348 22349 22350 22351 22352 22353 22354 22355 22356 22357 22358 22359 22360 22361 22362 22363 22364 22365 22366 22367 22368 22369 22370 22371 22372 22373 22374 22375 22376 22377 22378 22379 22380 22381 22382 22383 22384 22385 22386 22387 22388 22389 22390 22391 22392 22393 22394 22395 22396 22397 22398 22399 22400 22401 22402 22403 22404 22405 22406 22407 22408 22409 22410 22411 22412 22413 22414 22415 22416 22417 22418 22419 22420 22421 22422 22423 22424 22425 22426 22427 22428 22429 22430 22431 22432 22433 22434 22435 22436 22437 22438 22439 22440 22441 22442 22443 22444 22445 22446 22447 22448 22449 22450 22451 22452 22453 22454 22455 22456 22457 22458 22459 22460 22461 22462 22463 22464 22465 22466 22467 22468 22469 22470 22471 22472 22473 22474 22475 22476 22477 22478 22479 22480 22481 22482 22483 22484 22485 22486 22487 22488 22489 22490 22491 22492 22493 22494 22495 22496 22497 22498 22499 22500 22501 22502 22503 22504 22505 22506 22507 22508 22509 22510 22511 22512 22513 22514 22515 22516 22517 22518 22519 22520 22521 22522 22523 22524 22525 22526 22527 22528 22529 22530 22531 22532 22533 22534 22535 22536 22537 22538 22539 22540 22541 22542 22543 22544 22545 22546 22547 22548 22549 22550 22551 22552 22553 22554 22555 22556 22557 22558 22559 22560 22561 22562 22563 22564 22565 22566 22567 22568 22569 22570 22571 22572 22573 22574 22575 22576 22577 22578 22579 22580 22581 22582 22583 22584 22585 22586 22587 22588 22589 22590 22591 22592 22593 22594 22595 22596 22597 22598 22599 22600 22601 22602 22603 22604 22605 22606 22607 22608 22609 22610 22611 22612 22613 22614 22615 22616 22617 22618 22619 22620 22621 22622 22623 22624 22625 22626 22627 22628 22629 22630 22631 22632 22633 22634 22635 22636 22637 22638 22639 22640 22641 22642 22643 22644 22645 22646 22647 22648 22649 22650 22651 22652 22653 22654 22655 22656 22657 22658 22659 22660 22661 22662 22663 22664 22665 22666 22667 22668 22669 22670 22671 22672 22673 22674 22675 22676 22677 22678 22679 22680 22681 22682 22683 22684 22685 22686 22687 22688 22689 22690 22691 22692 22693 22694 22695 22696 22697 22698 22699 22700 22701 22702 22703 22704 22705 22706 22707 22708 22709 22710 22711 22712 22713 22714 22715 22716 22717 22718 22719 22720 22721 22722 22723 22724 22725 22726 22727 22728 22729 22730 22731 22732 22733 22734 22735 22736 22737 22738 22739 22740 22741 22742 22743 22744 22745 22746 22747 22748 22749 22750 22751 22752 22753 22754 22755 22756 22757 22758 22759 22760 22761 22762 22763 22764 22765 22766 22767 22768 22769 22770 22771 22772 22773 22774 22775 22776 22777 22778 22779 22780 22781 22782 22783 22784 22785 22786 22787 22788 22789 22790 22791 22792 22793 22794 22795 22796 22797 22798 22799 22800 22801 22802 22803 22804 22805 22806 22807 22808 22809 22810 22811 22812 22813 22814 22815 22816 22817 22818 22819 22820 22821 22822 22823 22824 22825 22826 22827 22828 22829 22830 22831 22832 22833 22834 22835 22836 22837 22838 22839 22840 22841 22842 22843 22844 22845 22846 22847 22848 22849 22850 22851 22852 22853 22854 22855 22856 22857 22858 22859 22860 22861 22862 22863 22864 22865 22866 22867 22868 22869 22870 22871 22872 22873 22874 22875 22876 22877 22878 22879 22880 22881 22882 22883 22884 22885 22886 22887 22888 22889 22890 22891 22892 22893 22894 22895 22896 22897 22898 22899 22900 22901 22902 22903 22904 22905 22906 22907 22908 22909 22910 22911 22912 22913 22914 22915 22916 22917 22918 22919 22920 22921 22922 22923 22924 22925 22926 22927 22928 22929 22930 22931 22932 22933 22934 22935 22936 22937 22938 22939 22940 22941 22942 22943 22944 22945 22946 22947 22948 22949 22950 22951 22952 22953 22954 22955 22956 22957 22958 22959 22960 22961 22962 22963 22964 22965 22966 22967 22968 22969 22970 22971 22972 22973 22974 22975 22976 22977 22978 22979 22980 22981 22982 22983 22984 22985 22986 22987 22988 22989 22990 22991 22992 22993 22994 22995 22996 22997 22998 22999 23000 23001 23002 23003 23004 23005 23006 23007 23008 23009 23010 23011 23012 23013 23014 23015 23016 23017 23018 23019 23020 23021 23022 23023 23024 23025 23026 23027 23028 23029 23030 23031 23032 23033 23034 23035 23036 23037 23038 23039 23040 23041 23042 23043 23044 23045 23046 23047 23048 23049 23050 23051 23052 23053 23054 23055 23056 23057 23058 23059 23060 23061 23062 23063 23064 23065 23066 23067 23068 23069 23070 23071 23072 23073 23074 23075 23076 23077 23078 23079 23080 23081 23082 23083 23084 23085 23086 23087 23088 23089 23090 23091 23092 23093 23094 23095 23096 23097 23098 23099 23100 23101 23102 23103 23104 23105 23106 23107 23108 23109 23110 23111 23112 23113 23114 23115 23116 23117 23118 23119 23120 23121 23122 23123 23124 23125 23126 23127 23128 23129 23130 23131 23132 23133 23134 23135 23136 23137 23138 23139 23140 23141 23142 23143 23144 23145 23146 23147 23148 23149 23150 23151 23152 23153 23154 23155 23156 23157 23158 23159 23160 23161 23162 23163 23164 23165 23166 23167 23168 23169 23170 23171 23172 23173 23174 23175 23176 23177 23178 23179 23180 23181 23182 23183 23184 23185 23186 23187 23188 23189 23190 23191 23192 23193 23194 23195 23196 23197 23198 23199 23200 23201 23202 23203 23204 23205 23206 23207 23208 23209 23210 23211 23212 23213 23214 23215 23216 23217 23218 23219 23220 23221 23222 23223 23224 23225 23226 23227 23228 23229 23230 23231 23232 23233 23234 23235 23236 23237 23238 23239 23240 23241 23242 23243 23244 23245 23246 23247 23248 23249 23250 23251 23252 23253 23254 23255 23256 23257 23258 23259 23260 23261 23262 23263 23264 23265 23266 23267 23268 23269 23270 23271 23272 23273 23274 23275 23276 23277 23278 23279 23280 23281 23282 23283 23284 23285 23286 23287 23288 23289 23290 23291 23292 23293 23294 23295 23296 23297 23298 23299 23300 23301 23302 23303 23304 23305 23306 23307 23308 23309 23310 23311 23312 23313 23314 23315 23316 23317 23318 23319 23320 23321 23322 23323 23324 23325 23326 23327 23328 23329 23330 23331 23332 23333 23334 23335 23336 23337 23338 23339 23340 23341 23342 23343 23344 23345 23346 23347 23348 23349 23350 23351 23352 23353 23354 23355 23356 23357 23358 23359 23360 23361 23362 23363 23364 23365 23366 23367 23368 23369 23370 23371 23372 23373 23374 23375 23376 23377 23378 23379 23380 23381 23382 23383 23384 23385 23386 23387 23388 23389 23390 23391 23392 23393 23394 23395 23396 23397 23398 23399 23400 23401 23402 23403 23404 23405 23406 23407 23408 23409 23410 23411 23412 23413 23414 23415 23416 23417 23418 23419 23420 23421 23422 23423 23424 23425 23426 23427 23428 23429 23430 23431 23432 23433 23434 23435 23436 23437 23438 23439 23440 23441 23442 23443 23444 23445 23446 23447 23448 23449 23450 23451 23452 23453 23454 23455 23456 23457 23458 23459 23460 23461 23462 23463 23464 23465 23466 23467 23468 23469 23470 23471 23472 23473 23474 23475 23476 23477 23478 23479 23480 23481 23482 23483 23484 23485 23486 23487 23488 23489 23490 23491 23492 23493 23494 23495 23496 23497 23498 23499 23500 23501 23502 23503 23504 23505 23506 23507 23508 23509 23510 23511 23512 23513 23514 23515 23516 23517 23518 23519 23520 23521 23522 23523 23524 23525 23526 23527 23528 23529 23530 23531 23532 23533 23534 23535 23536 23537 23538 23539 23540 23541 23542 23543 23544 23545 23546 23547 23548 23549 23550 23551 23552 23553 23554 23555 23556 23557 23558 23559 23560 23561 23562 23563 23564 23565 23566 23567 23568 23569 23570 23571 23572 23573 23574 23575 23576 23577 23578 23579 23580 23581 23582 23583 23584 23585 23586 23587 23588 23589 23590 23591 23592 23593 23594 23595 23596 23597 23598 23599 23600 23601 23602 23603 23604 23605 23606 23607 23608 23609 23610 23611 23612 23613 23614 23615 23616 23617 23618 23619 23620 23621 23622 23623 23624 23625 23626 23627 23628 23629 23630 23631 23632 23633 23634 23635 23636 23637 23638 23639 23640 23641 23642 23643 23644 23645 23646 23647 23648 23649 23650 23651 23652 23653 23654 23655 23656 23657 23658 23659 23660 23661 23662 23663 23664 23665 23666 23667 23668 23669 23670 23671 23672 23673 23674 23675 23676 23677 23678 23679 23680 23681 23682 23683 23684 23685 23686 23687 23688 23689 23690 23691 23692 23693 23694 23695 23696 23697 23698 23699 23700 23701 23702 23703 23704 23705 23706 23707 23708 23709 23710 23711 23712 23713 23714 23715 23716 23717 23718 23719 23720 23721 23722 23723 23724 23725 23726 23727 23728 23729 23730 23731 23732 23733 23734 23735 23736 23737 23738 23739 23740 23741 23742 23743 23744 23745 23746 23747 23748 23749 23750 23751 23752 23753 23754 23755 23756 23757 23758 23759 23760 23761 23762 23763 23764 23765 23766 23767 23768 23769 23770 23771 23772 23773 23774 23775 23776 23777 23778 23779 23780 23781 23782 23783 23784 23785 23786 23787 23788 23789 23790 23791 23792 23793 23794 23795 23796 23797 23798 23799 23800 23801 23802 23803 23804 23805 23806 23807 23808 23809 23810 23811 23812 23813 23814 23815 23816 23817 23818 23819 23820 23821 23822 23823 23824 23825 23826 23827 23828 23829 23830 23831 23832 23833 23834 23835 23836 23837 23838 23839 23840 23841 23842 23843 23844 23845 23846 23847 23848 23849 23850 23851 23852 23853 23854 23855 23856 23857 23858 23859 23860 23861 23862 23863 23864 23865 23866 23867 23868 23869 23870 23871 23872 23873 23874 23875 23876 23877 23878 23879 23880 23881 23882 23883 23884 23885 23886 23887 23888 23889 23890 23891 23892 23893 23894 23895 23896 23897 23898 23899 23900 23901 23902 23903 23904 23905 23906 23907 23908 23909 23910 23911 23912 23913 23914 23915 23916 23917 23918 23919 23920 23921 23922 23923 23924 23925 23926 23927 23928 23929 23930 23931 23932 23933 23934 23935 23936 23937 23938 23939 23940 23941 23942 23943 23944 23945 23946 23947 23948 23949 23950 23951 23952 23953 23954 23955 23956 23957 23958 23959 23960 23961 23962 23963 23964 23965 23966 23967 23968 23969 23970 23971 23972 23973 23974 23975 23976 23977 23978 23979 23980 23981 23982 23983 23984 23985 23986 23987 23988 23989 23990 23991 23992 23993 23994 23995 23996 23997 23998 23999 24000 24001 24002 24003 24004 24005 24006 24007 24008 24009 24010 24011 24012 24013 24014 24015 24016 24017 24018 24019 24020 24021 24022 24023 24024 24025 24026 24027 24028 24029 24030 24031 24032 24033 24034 24035 24036 24037 24038 24039 24040 24041 24042 24043 24044 24045 24046 24047 24048 24049 24050 24051 24052 24053 24054 24055 24056 24057 24058 24059 24060 24061 24062 24063 24064 24065 24066 24067 24068 24069 24070 24071 24072 24073 24074 24075 24076 24077 24078 24079 24080 24081 24082 24083 24084 24085 24086 24087 24088 24089 24090 24091 24092 24093 24094 24095 24096 24097 24098 24099 24100 24101 24102 24103 24104 24105 24106 24107 24108 24109 24110 24111 24112 24113 24114 24115 24116 24117 24118 24119 24120 24121 24122 24123 24124 24125 24126 24127 24128 24129 24130 24131 24132 24133 24134 24135 24136 24137 24138 24139 24140 24141 24142 24143 24144 24145 24146 24147 24148 24149 24150 24151 24152 24153 24154 24155 24156 24157 24158 24159 24160 24161 24162 24163 24164 24165 24166 24167 24168 24169 24170 24171 24172 24173 24174 24175 24176 24177 24178 24179 24180 24181 24182 24183 24184 24185 24186 24187 24188 24189 24190 24191 24192 24193 24194 24195 24196 24197 24198 24199 24200 24201 24202 24203 24204 24205 24206 24207 24208 24209 24210 24211 24212 24213 24214 24215 24216 24217 24218 24219 24220 24221 24222 24223 24224 24225 24226 24227 24228 24229 24230 24231 24232 24233 24234 24235 24236 24237 24238 24239 24240 24241 24242 24243 24244 24245 24246 24247 24248 24249 24250 24251 24252 24253 24254 24255 24256 24257 24258 24259 24260 24261 24262 24263 24264 24265 24266 24267 24268 24269 24270 24271 24272 24273 24274 24275 24276 24277 24278 24279 24280 24281 24282 24283 24284 24285 24286 24287 24288 24289 24290 24291 24292 24293 24294 24295 24296 24297 24298 24299 24300 24301 24302 24303 24304 24305 24306 24307 24308 24309 24310 24311 24312 24313 24314 24315 24316 24317 24318 24319 24320 24321 24322 24323 24324 24325 24326 24327 24328 24329 24330 24331 24332 24333 24334 24335 24336 24337 24338 24339 24340 24341 24342 24343 24344 24345 24346 24347 24348 24349 24350 24351 24352 24353 24354 24355 24356 24357 24358 24359 24360 24361 24362 24363 24364 24365 24366 24367 24368 24369 24370 24371 24372 24373 24374 24375 24376 24377 24378 24379 24380 24381 24382 24383 24384 24385 24386 24387 24388 24389 24390 24391 24392 24393 24394 24395 24396 24397 24398 24399 24400 24401 24402 24403 24404 24405 24406 24407 24408 24409 24410 24411 24412 24413 24414 24415 24416 24417 24418 24419 24420 24421 24422 24423 24424 24425 24426 24427 24428 24429 24430 24431 24432 24433 24434 24435 24436 24437 24438 24439 24440 24441 24442 24443 24444 24445 24446 24447 24448 24449 24450 24451 24452 24453 24454 24455 24456 24457 24458 24459 24460 24461 24462 24463 24464 24465 24466 24467 24468 24469 24470 24471 24472 24473 24474 24475 24476 24477 24478 24479 24480 24481 24482 24483 24484 24485 24486 24487 24488 24489 24490 24491 24492 24493 24494 24495 24496 24497 24498 24499 24500 24501 24502 24503 24504 24505 24506 24507 24508 24509 24510 24511 24512 24513 24514 24515 24516 24517 24518 24519 24520 24521 24522 24523 24524 24525 24526 24527 24528 24529 24530 24531 24532 24533 24534 24535 24536 24537 24538 24539 24540 24541 24542 24543 24544 24545 24546 24547 24548 24549 24550 24551 24552 24553 24554 24555 24556 24557 24558 24559 24560 24561 24562 24563 24564 24565 24566 24567 24568 24569 24570 24571 24572 24573 24574 24575 24576 24577 24578 24579 24580 24581 24582 24583 24584 24585 24586 24587 24588 24589 24590 24591 24592 24593 24594 24595 24596 24597 24598 24599 24600 24601 24602 24603 24604 24605 24606 24607 24608 24609 24610 24611 24612 24613 24614 24615 24616 24617 24618 24619 24620 24621 24622 24623 24624 24625 24626 24627 24628 24629 24630 24631 24632 24633 24634 24635 24636 24637 24638 24639 24640 24641 24642 24643 24644 24645 24646 24647 24648 24649 24650 24651 24652 24653 24654 24655 24656 24657 24658 24659 24660 24661 24662 24663 24664 24665 24666 24667 24668 24669 24670 24671 24672 24673 24674 24675 24676 24677 24678 24679 24680 24681 24682 24683 24684 24685 24686 24687 24688 24689 24690 24691 24692 24693 24694 24695 24696 24697 24698 24699 24700 24701 24702 24703 24704 24705 24706 24707 24708 24709 24710 24711 24712 24713 24714 24715 24716 24717 24718 24719 24720 24721 24722 24723 24724 24725 24726 24727 24728 24729 24730 24731 24732 24733 24734 24735 24736 24737 24738 24739 24740 24741 24742 24743 24744 24745 24746 24747 24748 24749 24750 24751 24752 24753 24754 24755 24756 24757 24758 24759 24760 24761 24762 24763 24764 24765 24766 24767 24768 24769 24770 24771 24772 24773 24774 24775 24776 24777 24778 24779 24780 24781 24782 24783 24784 24785 24786 24787 24788 24789 24790 24791 24792 24793 24794 24795 24796 24797 24798 24799 24800 24801 24802 24803 24804 24805 24806 24807 24808 24809 24810 24811 24812 24813 24814 24815 24816 24817 24818 24819 24820 24821 24822 24823 24824 24825 24826 24827 24828 24829 24830 24831 24832 24833 24834 24835 24836 24837 24838 24839 24840 24841 24842 24843 24844 24845 24846 24847 24848 24849 24850 24851 24852 24853 24854 24855 24856 24857 24858 24859 24860 24861 24862 24863 24864 24865 24866 24867 24868 24869 24870 24871 24872 24873 24874 24875 24876 24877 24878 24879 24880 24881 24882 24883 24884 24885 24886 24887 24888 24889 24890 24891 24892 24893 24894 24895 24896 24897 24898 24899 24900 24901 24902 24903 24904 24905 24906 24907 24908 24909 24910 24911 24912 24913 24914 24915 24916 24917 24918 24919 24920 24921 24922 24923 24924 24925 24926 24927 24928 24929 24930 24931 24932 24933 24934 24935 24936 24937 24938 24939 24940 24941 24942 24943 24944 24945 24946 24947 24948 24949 24950 24951 24952 24953 24954 24955 24956 24957 24958 24959 24960 24961 24962 24963 24964 24965 24966 24967 24968 24969 24970 24971 24972 24973 24974 24975 24976 24977 24978 24979 24980 24981 24982 24983 24984 24985 24986 24987 24988 24989 24990 24991 24992 24993 24994 24995 24996 24997 24998 24999 25000 25001 25002 25003 25004 25005 25006 25007 25008 25009 25010 25011 25012 25013 25014 25015 25016 25017 25018 25019 25020 25021 25022 25023 25024 25025 25026 25027 25028 25029 25030 25031 25032 25033 25034 25035 25036 25037 25038 25039 25040 25041 25042 25043 25044 25045 25046 25047 25048 25049 25050 25051 25052 25053 25054 25055 25056 25057 25058 25059 25060 25061 25062 25063 25064 25065 25066 25067 25068 25069 25070 25071 25072 25073 25074 25075 25076 25077 25078 25079 25080 25081 25082 25083 25084 25085 25086 25087 25088 25089 25090 25091 25092 25093 25094 25095 25096 25097 25098 25099 25100 25101 25102 25103 25104 25105 25106 25107 25108 25109 25110 25111 25112 25113 25114 25115 25116 25117 25118 25119 25120 25121 25122 25123 25124 25125 25126 25127 25128 25129 25130 25131 25132 25133 25134 25135 25136 25137 25138 25139 25140 25141 25142 25143 25144 25145 25146 25147 25148 25149 25150 25151 25152 25153 25154 25155 25156 25157 25158 25159 25160 25161 25162 25163 25164 25165 25166 25167 25168 25169 25170 25171 25172 25173 25174 25175 25176 25177 25178 25179 25180 25181 25182 25183 25184 25185 25186 25187 25188 25189 25190 25191 25192 25193 25194 25195 25196 25197 25198 25199 25200 25201 25202 25203 25204 25205 25206 25207 25208 25209 25210 25211 25212 25213 25214 25215 25216 25217 25218 25219 25220 25221 25222 25223 25224 25225 25226 25227 25228 25229 25230 25231 25232 25233 25234 25235 25236 25237 25238 25239 25240 25241 25242 25243 25244 25245 25246 25247 25248 25249 25250 25251 25252 25253 25254 25255 25256 25257 25258 25259 25260 25261 25262 25263 25264 25265 25266 25267 25268 25269 25270 25271 25272 25273 25274 25275 25276 25277 25278 25279 25280 25281 25282 25283 25284 25285 25286 25287 25288 25289 25290 25291 25292 25293 25294 25295 25296 25297 25298 25299 25300 25301 25302 25303 25304 25305 25306 25307 25308 25309 25310 25311 25312 25313 25314 25315 25316 25317 25318 25319 25320 25321 25322 25323 25324 25325 25326 25327 25328 25329 25330 25331 25332 25333 25334 25335 25336 25337 25338 25339 25340 25341 25342 25343 25344 25345 25346 25347 25348 25349 25350 25351 25352 25353 25354 25355 25356 25357 25358 25359 25360 25361 25362 25363 25364 25365 25366 25367 25368 25369 25370 25371 25372 25373 25374 25375 25376 25377 25378 25379 25380 25381 25382 25383 25384 25385 25386 25387 25388 25389 25390 25391 25392 25393 25394 25395 25396 25397 25398 25399 25400 25401 25402 25403 25404 25405 25406 25407 25408 25409 25410 25411 25412 25413 25414 25415 25416 25417 25418 25419 25420 25421 25422 25423 25424 25425 25426 25427 25428 25429 25430 25431 25432 25433 25434 25435 25436 25437 25438 25439 25440 25441 25442 25443 25444 25445 25446 25447 25448 25449 25450 25451 25452 25453 25454 25455 25456 25457 25458 25459 25460 25461 25462 25463 25464 25465 25466 25467 25468 25469 25470 25471 25472 25473 25474 25475 25476 25477 25478 25479 25480 25481 25482 25483 25484 25485 25486 25487 25488 25489 25490 25491 25492 25493 25494 25495 25496 25497 25498 25499 25500 25501 25502 25503 25504 25505 25506 25507 25508 25509 25510 25511 25512 25513 25514 25515 25516 25517 25518 25519 25520 25521 25522 25523 25524 25525 25526 25527 25528 25529 25530 25531 25532 25533 25534 25535 25536 25537 25538 25539 25540 25541 25542 25543 25544 25545 25546 25547 25548 25549 25550 25551 25552 25553 25554 25555 25556 25557 25558 25559 25560 25561 25562 25563 25564 25565 25566 25567 25568 25569 25570 25571 25572 25573 25574 25575 25576 25577 25578 25579 25580 25581 25582 25583 25584 25585 25586 25587 25588 25589 25590 25591 25592 25593 25594 25595 25596 25597 25598 25599 25600 25601 25602 25603 25604 25605 25606 25607 25608 25609 25610 25611 25612 25613 25614 25615 25616 25617 25618 25619 25620 25621 25622 25623 25624 25625 25626 25627 25628 25629 25630 25631 25632 25633 25634 25635 25636 25637 25638 25639 25640 25641 25642 25643 25644 25645 25646 25647 25648 25649 25650 25651 25652 25653 25654 25655 25656 25657 25658 25659 25660 25661 25662 25663 25664 25665 25666 25667 25668 25669 25670 25671 25672 25673 25674 25675 25676 25677 25678 25679 25680 25681 25682 25683 25684 25685 25686 25687 25688 25689 25690 25691 25692 25693 25694 25695 25696 25697 25698 25699 25700 25701 25702 25703 25704 25705 25706 25707 25708 25709 25710 25711 25712 25713 25714 25715 25716 25717 25718 25719 25720 25721 25722 25723 25724 25725 25726 25727 25728 25729 25730 25731 25732 25733 25734 25735 25736 25737 25738 25739 25740 25741 25742 25743 25744 25745 25746 25747 25748 25749 25750 25751 25752 25753 25754 25755 25756 25757 25758 25759 25760 25761 25762 25763 25764 25765 25766 25767 25768 25769 25770 25771 25772 25773 25774 25775 25776 25777 25778 25779 25780 25781 25782 25783 25784 25785 25786 25787 25788 25789 25790 25791 25792 25793 25794 25795 25796 25797 25798 25799 25800 25801 25802 25803 25804 25805 25806 25807 25808 25809 25810 25811 25812 25813 25814 25815 25816 25817 25818 25819 25820 25821 25822 25823 25824 25825 25826 25827 25828 25829 25830 25831 25832 25833 25834 25835 25836 25837 25838 25839 25840 25841 25842 25843 25844 25845 25846 25847 25848 25849 25850 25851 25852 25853 25854 25855 25856 25857 25858 25859 25860 25861 25862 25863 25864 25865 25866 25867 25868 25869 25870 25871 25872 25873 25874 25875 25876 25877 25878 25879 25880 25881 25882 25883 25884 25885 25886 25887 25888 25889 25890 25891 25892 25893 25894 25895 25896 25897 25898 25899 25900 25901 25902 25903 25904 25905 25906 25907 25908 25909 25910 25911 25912 25913 25914 25915 25916 25917 25918 25919 25920 25921 25922 25923 25924 25925 25926 25927 25928 25929 25930 25931 25932 25933 25934 25935 25936 25937 25938 25939 25940 25941 25942 25943 25944 25945 25946 25947 25948 25949 25950 25951 25952 25953 25954 25955 25956 25957 25958 25959 25960 25961 25962 25963 25964 25965 25966 25967 25968 25969 25970 25971 25972 25973 25974 25975 25976 25977 25978 25979 25980 25981 25982 25983 25984 25985 25986 25987 25988 25989 25990 25991 25992 25993 25994 25995 25996 25997 25998 25999 26000 26001 26002 26003 26004 26005 26006 26007 26008 26009 26010 26011 26012 26013 26014 26015 26016 26017 26018 26019 26020 26021 26022 26023 26024 26025 26026 26027 26028 26029 26030 26031 26032 26033 26034 26035 26036 26037 26038 26039 26040 26041 26042 26043 26044 26045 26046 26047 26048 26049 26050 26051 26052 26053 26054 26055 26056 26057 26058 26059 26060 26061 26062 26063 26064 26065 26066 26067 26068 26069 26070 26071 26072 26073 26074 26075 26076 26077 26078 26079 26080 26081 26082 26083 26084 26085 26086 26087 26088 26089 26090 26091 26092 26093 26094 26095 26096 26097 26098 26099 26100 26101 26102 26103 26104 26105 26106 26107 26108 26109 26110 26111 26112 26113 26114 26115 26116 26117 26118 26119 26120 26121 26122 26123 26124 26125 26126 26127 26128 26129 26130 26131 26132 26133 26134 26135 26136 26137 26138 26139 26140 26141 26142 26143 26144 26145 26146 26147 26148 26149 26150 26151 26152 26153 26154 26155 26156 26157 26158 26159 26160 26161 26162 26163 26164 26165 26166 26167 26168 26169 26170 26171 26172 26173 26174 26175 26176 26177 26178 26179 26180 26181 26182 26183 26184 26185 26186 26187 26188 26189 26190 26191 26192 26193 26194 26195 26196 26197 26198 26199 26200 26201 26202 26203 26204 26205 26206 26207 26208 26209 26210 26211 26212 26213 26214 26215 26216 26217 26218 26219 26220 26221 26222 26223 26224 26225 26226 26227 26228 26229 26230 26231 26232 26233 26234 26235 26236 26237 26238 26239 26240 26241 26242 26243 26244 26245 26246 26247 26248 26249 26250 26251 26252 26253 26254 26255 26256 26257 26258 26259 26260 26261 26262 26263 26264 26265 26266 26267 26268 26269 26270 26271 26272 26273 26274 26275 26276 26277 26278 26279 26280 26281 26282 26283 26284 26285 26286 26287 26288 26289 26290 26291 26292 26293 26294 26295 26296 26297 26298 26299 26300 26301 26302 26303 26304 26305 26306 26307 26308 26309 26310 26311 26312 26313 26314 26315 26316 26317 26318 26319 26320 26321 26322 26323 26324 26325 26326 26327 26328 26329 26330 26331 26332 26333 26334 26335 26336 26337 26338 26339 26340 26341 26342 26343 26344 26345 26346 26347 26348 26349 26350 26351 26352 26353 26354 26355 26356 26357 26358 26359 26360 26361 26362 26363 26364 26365 26366 26367 26368 26369 26370 26371 26372 26373 26374 26375 26376 26377 26378 26379 26380 26381 26382 26383 26384 26385 26386 26387 26388 26389 26390 26391 26392 26393 26394 26395 26396 26397 26398 26399 26400 26401 26402 26403 26404 26405 26406 26407 26408 26409 26410 26411 26412 26413 26414 26415 26416 26417 26418 26419 26420 26421 26422 26423 26424 26425 26426 26427 26428 26429 26430 26431 26432 26433 26434 26435 26436 26437 26438 26439 26440 26441 26442 26443 26444 26445 26446 26447 26448 26449 26450 26451 26452 26453 26454 26455 26456 26457 26458 26459 26460 26461 26462 26463 26464 26465 26466 26467 26468 26469 26470 26471 26472 26473 26474 26475 26476 26477 26478 26479 26480 26481 26482 26483 26484 26485 26486 26487 26488 26489 26490 26491 26492 26493 26494 26495 26496 26497 26498 26499 26500 26501 26502 26503 26504 26505 26506 26507 26508 26509 26510 26511 26512 26513 26514 26515 26516 26517 26518 26519 26520 26521 26522 26523 26524 26525 26526 26527 26528 26529 26530 26531 26532 26533 26534 26535 26536 26537 26538 26539 26540 26541 26542 26543 26544 26545 26546 26547 26548 26549 26550 26551 26552 26553 26554 26555 26556 26557 26558 26559 26560 26561 26562 26563 26564 26565 26566 26567 26568 26569 26570 26571 26572 26573 26574 26575 26576 26577 26578 26579 26580 26581 26582 26583 26584 26585 26586 26587 26588 26589 26590 26591 26592 26593 26594 26595 26596 26597 26598 26599 26600 26601 26602 26603 26604 26605 26606 26607 26608 26609 26610 26611 26612 26613 26614 26615 26616 26617 26618 26619 26620 26621 26622 26623 26624 26625 26626 26627 26628 26629 26630 26631 26632 26633 26634 26635 26636 26637 26638 26639 26640 26641 26642 26643 26644 26645 26646 26647 26648 26649 26650 26651 26652 26653 26654 26655 26656 26657 26658 26659 26660 26661 26662 26663 26664 26665 26666 26667 26668 26669 26670 26671 26672 26673 26674 26675 26676 26677 26678 26679 26680 26681 26682 26683 26684 26685 26686 26687 26688 26689 26690 26691 26692 26693 26694 26695 26696 26697 26698 26699 26700 26701 26702 26703 26704 26705 26706 26707 26708 26709 26710 26711 26712 26713 26714 26715 26716 26717 26718 26719 26720 26721 26722 26723 26724 26725 26726 26727 26728 26729 26730 26731 26732 26733 26734 26735 26736 26737 26738 26739 26740 26741 26742 26743 26744 26745 26746 26747 26748 26749 26750 26751 26752 26753 26754 26755 26756 26757 26758 26759 26760 26761 26762 26763 26764 26765 26766 26767 26768 26769 26770 26771 26772 26773 26774 26775 26776 26777 26778 26779 26780 26781 26782 26783 26784 26785 26786 26787 26788 26789 26790 26791 26792 26793 26794 26795 26796 26797 26798 26799 26800 26801 26802 26803 26804 26805 26806 26807 26808 26809 26810 26811 26812 26813 26814 26815 26816 26817 26818 26819 26820 26821 26822 26823 26824 26825 26826 26827 26828 26829 26830 26831 26832 26833 26834 26835 26836 26837 26838 26839 26840 26841 26842 26843 26844 26845 26846 26847 26848 26849 26850 26851 26852 26853 26854 26855 26856 26857 26858 26859 26860 26861 26862 26863 26864 26865 26866 26867 26868 26869 26870 26871 26872 26873 26874 26875 26876 26877 26878 26879 26880 26881 26882 26883 26884 26885 26886 26887 26888 26889 26890 26891 26892 26893 26894 26895 26896 26897 26898 26899 26900 26901 26902 26903 26904 26905 26906 26907 26908 26909 26910 26911 26912 26913 26914 26915 26916 26917 26918 26919 26920 26921 26922 26923 26924 26925 26926 26927 26928 26929 26930 26931 26932 26933 26934 26935 26936 26937 26938 26939 26940 26941 26942 26943 26944 26945 26946 26947 26948 26949 26950 26951 26952 26953 26954 26955 26956 26957 26958 26959 26960 26961 26962 26963 26964 26965 26966 26967 26968 26969 26970 26971 26972 26973 26974 26975 26976 26977 26978 26979 26980 26981 26982 26983 26984 26985 26986 26987 26988 26989 26990 26991 26992 26993 26994 26995 26996 26997 26998 26999 27000 27001 27002 27003 27004 27005 27006 27007 27008 27009 27010 27011 27012 27013 27014 27015 27016 27017 27018 27019 27020 27021 27022 27023 27024 27025 27026 27027 27028 27029 27030 27031 27032 27033 27034 27035 27036 27037 27038 27039 27040 27041 27042 27043 27044 27045 27046 27047 27048 27049 27050 27051 27052 27053 27054 27055 27056 27057 27058 27059 27060 27061 27062 27063 27064 27065 27066 27067 27068 27069 27070 27071 27072 27073 27074 27075 27076 27077 27078 27079 27080 27081 27082 27083 27084 27085 27086 27087 27088 27089 27090 27091 27092 27093 27094 27095 27096 27097 27098 27099 27100 27101 27102 27103 27104 27105 27106 27107 27108 27109 27110 27111 27112 27113 27114 27115 27116 27117 27118 27119 27120 27121 27122 27123 27124 27125 27126 27127 27128 27129 27130 27131 27132 27133 27134 27135 27136 27137 27138 27139 27140 27141 27142 27143 27144 27145 27146 27147 27148 27149 27150 27151 27152 27153 27154 27155 27156 27157 27158 27159 27160 27161 27162 27163 27164 27165 27166 27167 27168 27169 27170 27171 27172 27173 27174 27175 27176 27177 27178 27179 27180 27181 27182 27183 27184 27185 27186 27187 27188 27189 27190 27191 27192 27193 27194 27195 27196 27197 27198 27199 27200 27201 27202 27203 27204 27205 27206 27207 27208 27209 27210 27211 27212 27213 27214 27215 27216 27217 27218 27219 27220 27221 27222 27223 27224 27225 27226 27227 27228 27229 27230 27231 27232 27233 27234 27235 27236 27237 27238 27239 27240 27241 27242 27243 27244 27245 27246 27247 27248 27249 27250 27251 27252 27253 27254 27255 27256 27257 27258 27259 27260 27261 27262 27263 27264 27265 27266 27267 27268 27269 27270 27271 27272 27273 27274 27275 27276 27277 27278 27279 27280 27281 27282 27283 27284 27285 27286 27287 27288 27289 27290 27291 27292 27293 27294 27295 27296 27297 27298 27299 27300 27301 27302 27303 27304 27305 27306 27307 27308 27309 27310 27311 27312 27313 27314 27315 27316 27317 27318 27319 27320 27321 27322 27323 27324 27325 27326 27327 27328 27329 27330 27331 27332 27333 27334 27335 27336 27337 27338 27339 27340 27341 27342 27343 27344 27345 27346 27347 27348 27349 27350 27351 27352 27353 27354 27355 27356 27357 27358 27359 27360 27361 27362 27363 27364 27365 27366 27367 27368 27369 27370 27371 27372 27373 27374 27375 27376 27377 27378 27379 27380 27381 27382 27383 27384 27385 27386 27387 27388 27389 27390 27391 27392 27393 27394 27395 27396 27397 27398 27399 27400 27401 27402 27403 27404 27405 27406 27407 27408 27409 27410 27411 27412 27413 27414 27415 27416 27417 27418 27419 27420 27421 27422 27423 27424 27425 27426 27427 27428 27429 27430 27431 27432 27433 27434 27435 27436 27437 27438 27439 27440 27441 27442 27443 27444 27445 27446 27447 27448 27449 27450 27451 27452 27453 27454 27455 27456 27457 27458 27459 27460 27461 27462 27463 27464 27465 27466 27467 27468 27469 27470 27471 27472 27473 27474 27475 27476 27477 27478 27479 27480 27481 27482 27483 27484 27485 27486 27487 27488 27489 27490 27491 27492 27493 27494 27495 27496 27497 27498 27499 27500 27501 27502 27503 27504 27505 27506 27507 27508 27509 27510 27511 27512 27513 27514 27515 27516 27517 27518 27519 27520 27521 27522 27523 27524 27525 27526 27527 27528 27529 27530 27531 27532 27533 27534 27535 27536 27537 27538 27539 27540 27541 27542 27543 27544 27545 27546 27547 27548 27549 27550 27551 27552 27553 27554 27555 27556 27557 27558 27559 27560 27561 27562 27563 27564 27565 27566 27567 27568 27569 27570 27571 27572 27573 27574 27575 27576 27577 27578 27579 27580 27581 27582 27583 27584 27585 27586 27587 27588 27589 27590 27591 27592 27593 27594 27595 27596 27597 27598 27599 27600 27601 27602 27603 27604 27605 27606 27607 27608 27609 27610 27611 27612 27613 27614 27615 27616 27617 27618 27619 27620 27621 27622 27623 27624 27625 27626 27627 27628 27629 27630 27631 27632 27633 27634 27635 27636 27637 27638 27639 27640 27641 27642 27643 27644 27645 27646 27647 27648 27649 27650 27651 27652 27653 27654 27655 27656 27657 27658 27659 27660 27661 27662 27663 27664 27665 27666 27667 27668 27669 27670 27671 27672 27673 27674 27675 27676 27677 27678 27679 27680 27681 27682 27683 27684 27685 27686 27687 27688 27689 27690 27691 27692 27693 27694 27695 27696 27697 27698 27699 27700 27701 27702 27703 27704 27705 27706 27707 27708 27709 27710 27711 27712 27713 27714 27715 27716 27717 27718 27719 27720 27721 27722 27723 27724 27725 27726 27727 27728 27729 27730 27731 27732 27733 27734 27735 27736 27737 27738 27739 27740 27741 27742 27743 27744 27745 27746 27747 27748 27749 27750 27751 27752 27753 27754 27755 27756 27757 27758 27759 27760 27761 27762 27763 27764 27765 27766 27767 27768 27769 27770 27771 27772 27773 27774 27775 27776 27777 27778 27779 27780 27781 27782 27783 27784 27785 27786 27787 27788 27789 27790 27791 27792 27793 27794 27795 27796 27797 27798 27799 27800 27801 27802 27803 27804 27805 27806 27807 27808 27809 27810 27811 27812 27813 27814 27815 27816 27817 27818 27819 27820 27821 27822 27823 27824 27825 27826 27827 27828 27829 27830 27831 27832 27833 27834 27835 27836 27837 27838 27839 27840 27841 27842 27843 27844 27845 27846 27847 27848 27849 27850 27851 27852 27853 27854 27855 27856 27857 27858 27859 27860 27861 27862 27863 27864 27865 27866 27867 27868 27869 27870 27871 27872 27873 27874 27875 27876 27877 27878 27879 27880 27881 27882 27883 27884 27885 27886 27887 27888 27889 27890 27891 27892 27893 27894 27895 27896 27897 27898 27899 27900 27901 27902 27903 27904 27905 27906 27907 27908 27909 27910 27911 27912 27913 27914 27915 27916 27917 27918 27919 27920 27921 27922 27923 27924 27925 27926 27927 27928 27929 27930 27931 27932 27933 27934 27935 27936 27937 27938 27939 27940 27941 27942 27943 27944 27945 27946 27947 27948 27949 27950 27951 27952 27953 27954 27955 27956 27957 27958 27959 27960 27961 27962 27963 27964 27965 27966 27967 27968 27969 27970 27971 27972 27973 27974 27975 27976 27977 27978 27979 27980 27981 27982 27983 27984 27985 27986 27987 27988 27989 27990 27991 27992 27993 27994 27995 27996 27997 27998 27999 28000 28001 28002 28003 28004 28005 28006 28007 28008 28009 28010 28011 28012 28013 28014 28015 28016 28017 28018 28019 28020 28021 28022 28023 28024 28025 28026 28027 28028 28029 28030 28031 28032 28033 28034 28035 28036 28037 28038 28039 28040 28041 28042 28043 28044 28045 28046 28047 28048 28049 28050 28051 28052 28053 28054 28055 28056 28057 28058 28059 28060 28061 28062 28063 28064 28065 28066 28067 28068 28069 28070 28071 28072 28073 28074 28075 28076 28077 28078 28079 28080 28081 28082 28083 28084 28085 28086 28087 28088 28089 28090 28091 28092 28093 28094 28095 28096 28097 28098 28099 28100 28101 28102 28103 28104 28105 28106 28107 28108 28109 28110 28111 28112 28113 28114 28115 28116 28117 28118 28119 28120 28121 28122 28123 28124 28125 28126 28127 28128 28129 28130 28131 28132 28133 28134 28135 28136 28137 28138 28139 28140 28141 28142 28143 28144 28145 28146 28147 28148 28149 28150 28151 28152 28153 28154 28155 28156 28157 28158 28159 28160 28161 28162 28163 28164 28165 28166 28167 28168 28169 28170 28171 28172 28173 28174 28175 28176 28177 28178 28179 28180 28181 28182 28183 28184 28185 28186 28187 28188 28189 28190 28191 28192 28193 28194 28195 28196 28197 28198 28199 28200 28201 28202 28203 28204 28205 28206 28207 28208 28209 28210 28211 28212 28213 28214 28215 28216 28217 28218 28219 28220 28221 28222 28223 28224 28225 28226 28227 28228 28229 28230 28231 28232 28233 28234 28235 28236 28237 28238 28239 28240 28241 28242 28243 28244 28245 28246 28247 28248 28249 28250 28251 28252 28253 28254 28255 28256 28257 28258 28259 28260 28261 28262 28263 28264 28265 28266 28267 28268 28269 28270 28271 28272 28273 28274 28275 28276 28277 28278 28279 28280 28281 28282 28283 28284 28285 28286 28287 28288 28289 28290 28291 28292 28293 28294 28295 28296 28297 28298 28299 28300 28301 28302 28303 28304 28305 28306 28307 28308 28309 28310 28311 28312 28313 28314 28315 28316 28317 28318 28319 28320 28321 28322 28323 28324 28325 28326 28327 28328 28329 28330 28331 28332 28333 28334 28335 28336 28337 28338 28339 28340 28341 28342 28343 28344 28345 28346 28347 28348 28349 28350 28351 28352 28353 28354 28355 28356 28357 28358 28359 28360 28361 28362 28363 28364 28365 28366 28367 28368 28369 28370 28371 28372 28373 28374 28375 28376 28377 28378 28379 28380 28381 28382 28383 28384 28385 28386 28387 28388 28389 28390 28391 28392 28393 28394 28395 28396 28397 28398 28399 28400 28401 28402 28403 28404 28405 28406 28407 28408 28409 28410 28411 28412 28413 28414 28415 28416 28417 28418 28419 28420 28421 28422 28423 28424 28425 28426 28427 28428 28429 28430 28431 28432 28433 28434 28435 28436 28437 28438 28439 28440 28441 28442 28443 28444 28445 28446 28447 28448 28449 28450 28451 28452 28453 28454 28455 28456 28457 28458 28459 28460 28461 28462 28463 28464 28465 28466 28467 28468 28469 28470 28471 28472 28473 28474 28475 28476 28477 28478 28479 28480 28481 28482 28483 28484 28485 28486 28487 28488 28489 28490 28491 28492 28493 28494 28495 28496 28497 28498 28499 28500 28501 28502 28503 28504 28505 28506 28507 28508 28509 28510 28511 28512 28513 28514 28515 28516 28517 28518 28519 28520 28521 28522 28523 28524 28525 28526 28527 28528 28529 28530 28531 28532 28533 28534 28535 28536 28537 28538 28539 28540 28541 28542 28543 28544 28545 28546 28547 28548 28549 28550 28551 28552 28553 28554 28555 28556 28557 28558 28559 28560 28561 28562 28563 28564 28565 28566 28567 28568 28569 28570 28571 28572 28573 28574 28575 28576 28577 28578 28579 28580 28581 28582 28583 28584 28585 28586 28587 28588 28589 28590 28591 28592 28593 28594 28595 28596 28597 28598 28599 28600 28601 28602 28603 28604 28605 28606 28607 28608 28609 28610 28611 28612 28613 28614 28615 28616 28617 28618 28619 28620 28621 28622 28623 28624 28625 28626 28627 28628 28629 28630 28631 28632 28633 28634 28635 28636 28637 28638 28639 28640 28641 28642 28643 28644 28645 28646 28647 28648 28649 28650 28651 28652 28653 28654 28655 28656 28657 28658 28659 28660 28661 28662 28663 28664 28665 28666 28667 28668 28669 28670 28671 28672 28673 28674 28675 28676 28677 28678 28679 28680 28681 28682 28683 28684 28685 28686 28687 28688 28689 28690 28691 28692 28693 28694 28695 28696 28697 28698 28699 28700 28701 28702 28703 28704 28705 28706 28707 28708 28709 28710 28711 28712 28713 28714 28715 28716 28717 28718 28719 28720 28721 28722 28723 28724 28725 28726 28727 28728 28729 28730 28731 28732 28733 28734 28735 28736 28737 28738 28739 28740 28741 28742 28743 28744 28745 28746 28747 28748 28749 28750 28751 28752 28753 28754 28755 28756 28757 28758 28759 28760 28761 28762 28763 28764 28765 28766 28767 28768 28769 28770 28771 28772 28773 28774 28775 28776 28777 28778 28779 28780 28781 28782 28783 28784 28785 28786 28787 28788 28789 28790 28791 28792 28793 28794 28795 28796 28797 28798 28799 28800 28801 28802 28803 28804 28805 28806 28807 28808 28809 28810 28811 28812 28813 28814 28815 28816 28817 28818 28819 28820 28821 28822 28823 28824 28825 28826 28827 28828 28829 28830 28831 28832 28833 28834 28835 28836 28837 28838 28839 28840 28841 28842 28843 28844 28845 28846 28847 28848 28849 28850 28851 28852 28853 28854 28855 28856 28857 28858 28859 28860 28861 28862 28863 28864 28865 28866 28867 28868 28869 28870 28871 28872 28873 28874 28875 28876 28877 28878 28879 28880 28881 28882 28883 28884 28885 28886 28887 28888 28889 28890 28891 28892 28893 28894 28895 28896 28897 28898 28899 28900 28901 28902 28903 28904 28905 28906 28907 28908 28909 28910 28911 28912 28913 28914 28915 28916 28917 28918 28919 28920 28921 28922 28923 28924 28925 28926 28927 28928 28929 28930 28931 28932 28933 28934 28935 28936 28937 28938 28939 28940 28941 28942 28943 28944 28945 28946 28947 28948 28949 28950 28951 28952 28953 28954 28955 28956 28957 28958 28959 28960 28961 28962 28963 28964 28965 28966 28967 28968 28969 28970 28971 28972 28973 28974 28975 28976 28977 28978 28979 28980 28981 28982 28983 28984 28985 28986 28987 28988 28989 28990 28991 28992 28993 28994 28995 28996 28997 28998 28999 29000 29001 29002 29003 29004 29005 29006 29007 29008 29009 29010 29011 29012 29013 29014 29015 29016 29017 29018 29019 29020 29021 29022 29023 29024 29025 29026 29027 29028 29029 29030 29031 29032 29033 29034 29035 29036 29037 29038 29039 29040 29041 29042 29043 29044 29045 29046 29047 29048 29049 29050 29051 29052 29053 29054 29055 29056 29057 29058 29059 29060 29061 29062 29063 29064 29065 29066 29067 29068 29069 29070 29071 29072 29073 29074 29075 29076 29077 29078 29079 29080 29081 29082 29083 29084 29085 29086 29087 29088 29089 29090 29091 29092 29093 29094 29095 29096 29097 29098 29099 29100 29101 29102 29103 29104 29105 29106 29107 29108 29109 29110 29111 29112 29113 29114 29115 29116 29117 29118 29119 29120 29121 29122 29123 29124 29125 29126 29127 29128 29129 29130 29131 29132 29133 29134 29135 29136 29137 29138 29139 29140 29141 29142 29143 29144 29145 29146 29147 29148 29149 29150 29151 29152 29153 29154 29155 29156 29157 29158 29159 29160 29161 29162 29163 29164 29165 29166 29167 29168 29169 29170 29171 29172 29173 29174 29175 29176 29177 29178 29179 29180 29181 29182 29183 29184 29185 29186 29187 29188 29189 29190 29191 29192 29193 29194 29195 29196 29197 29198 29199 29200 29201 29202 29203 29204 29205 29206 29207 29208 29209 29210 29211 29212 29213 29214 29215 29216 29217 29218 29219 29220 29221 29222 29223 29224 29225 29226 29227 29228 29229 29230 29231 29232 29233 29234 29235 29236 29237 29238 29239 29240 29241 29242 29243 29244 29245 29246 29247 29248 29249 29250 29251 29252 29253 29254 29255 29256 29257 29258 29259 29260 29261 29262 29263 29264 29265 29266 29267 29268 29269 29270 29271 29272 29273 29274 29275 29276 29277 29278 29279 29280 29281 29282 29283 29284 29285 29286 29287 29288 29289 29290 29291 29292 29293 29294 29295 29296 29297 29298 29299 29300 29301 29302 29303 29304 29305 29306 29307 29308 29309 29310 29311 29312 29313 29314 29315 29316 29317 29318 29319 29320 29321 29322 29323 29324 29325 29326 29327 29328 29329 29330 29331 29332 29333 29334 29335 29336 29337 29338 29339 29340 29341 29342 29343 29344 29345 29346 29347 29348 29349 29350 29351 29352 29353 29354 29355 29356 29357 29358 29359 29360 29361 29362 29363 29364 29365 29366 29367 29368 29369 29370 29371 29372 29373 29374 29375 29376 29377 29378 29379 29380 29381 29382 29383 29384 29385 29386 29387 29388 29389 29390 29391 29392 29393 29394 29395 29396 29397 29398 29399 29400 29401 29402 29403 29404 29405 29406 29407 29408 29409 29410 29411 29412 29413 29414 29415 29416 29417 29418 29419 29420 29421 29422 29423 29424 29425 29426 29427 29428 29429 29430 29431 29432 29433 29434 29435 29436 29437 29438 29439 29440 29441 29442 29443 29444 29445 29446 29447 29448 29449 29450 29451 29452 29453 29454 29455 29456 29457 29458 29459 29460 29461 29462 29463 29464 29465 29466 29467 29468 29469 29470 29471 29472 29473 29474 29475 29476 29477 29478 29479 29480 29481 29482 29483 29484 29485 29486 29487 29488 29489 29490 29491 29492 29493 29494 29495 29496 29497 29498 29499 29500 29501 29502 29503 29504 29505 29506 29507 29508 29509 29510 29511 29512 29513 29514 29515 29516 29517 29518 29519 29520 29521 29522 29523 29524 29525 29526 29527 29528 29529 29530 29531 29532 29533 29534 29535 29536 29537 29538 29539 29540 29541 29542 29543 29544 29545 29546 29547 29548 29549 29550 29551 29552 29553 29554 29555 29556 29557 29558 29559 29560 29561 29562 29563 29564 29565 29566 29567 29568 29569 29570 29571 29572 29573 29574 29575 29576 29577 29578 29579 29580 29581 29582 29583 29584 29585 29586 29587 29588 29589 29590 29591 29592 29593 29594 29595 29596 29597 29598 29599 29600 29601 29602 29603 29604 29605 29606 29607 29608 29609 29610 29611 29612 29613 29614 29615 29616 29617 29618 29619 29620 29621 29622 29623 29624 29625 29626 29627 29628 29629 29630 29631 29632 29633 29634 29635 29636 29637 29638 29639 29640 29641 29642 29643 29644 29645 29646 29647 29648 29649 29650 29651 29652 29653 29654 29655 29656 29657 29658 29659 29660 29661 29662 29663 29664 29665 29666 29667 29668 29669 29670 29671 29672 29673 29674 29675 29676 29677 29678 29679 29680 29681 29682 29683 29684 29685 29686 29687 29688 29689 29690 29691 29692 29693 29694 29695 29696 29697 29698 29699 29700 29701 29702 29703 29704 29705 29706 29707 29708 29709 29710 29711 29712 29713 29714 29715 29716 29717 29718 29719 29720 29721 29722 29723 29724 29725 29726 29727 29728 29729 29730 29731 29732 29733 29734 29735 29736 29737 29738 29739 29740 29741 29742 29743 29744 29745 29746 29747 29748 29749 29750 29751 29752 29753 29754 29755 29756 29757 29758 29759 29760 29761 29762 29763 29764 29765 29766 29767 29768 29769 29770 29771 29772 29773 29774 29775 29776 29777 29778 29779 29780 29781 29782 29783 29784 29785 29786 29787 29788 29789 29790 29791 29792 29793 29794 29795 29796 29797 29798 29799
|
2021-02-23 twu
* VERSION, index.html: Updated version number
2021-02-22 twu
* stage3hr.c: In removing overlaps, handling the case where hitlist or
hitpairlist is NULL. Using low and high instead of genomicstart and
genomicend to compute insertlength
* substring.c: Added commented code
* samprint.c: Using pairtype for concordant pairs
2021-02-14 twu
* index.html: Updated for latest version
* outbuffer.c: Added comment
2021-02-13 twu
* outbuffer.c: Calling setvbuf before printing file headers
* gmap.c, gsnap.c: Added comments
* output.c: Fixed code for deciding on failed input for single-end reads
* VERSION, index.html: Updated version number
* output.c: Using the results of Stage3pair_print_end to determine whether a
sequence failed to print
* stage3hr.c, stage3hr.h: Stage3pair_print_end now returns whether it
printed an alignment
* gsnap.c: Formatting change
* genome128_hr.c: Removed extraneous semicolon at end of macro
* outbuffer.c: Checking for a NULL string from Filestring_T before pushing
onto the failedinput Filestring_T
* resulthr.c: Fixed a memory leak
* filestring.c: Removed faulty assertion
2020-12-16 twu
* outbuffer.c: Added option for using write instead of fwrite
* outbuffer.c: Using setvbuf to set our own buffer size
* sam_sort.c: Using new function name SAM_header_fopen
* samheader.c, samheader.h: Changed function name from SAM_header_open_file
to SAM_header_fopen
* outbuffer.c: Fixed a bug where we were not checking for filestrings having
the type OUTPUT_NONE
2020-12-14 twu
* VERSION, index.html: Updated version number
* outbuffer.c: Printing SAM headers again
* output.c, samprint.c: Added comments to indicate where split output should
be set to OUTPUT_NONE, which is the initial value
* filestring.c: Added comment
* gmap.c, gsnap.c: Added final print for the single-threaded case
* index.html: Updated for latest version
* outbuffer.c: Fixed new output routines for filestrings assigned to
OUTPUT_NONE
* samflags.h: Revised comment
2020-12-13 twu
* VERSION: Updated version number
* stage3.c: Modified debugging statements
* outbuffer.c, outbuffer.h: Implemented Outbuffer_lines_T object and
printing output lines in batches
* path-solve.c, path-solve.h: Changed behavior to depend on splicingp rather
than novelsplicingp
* distant-rna.c, distant-rna.h: Searching for novel splice sites only if
novelsplicingp is true
* dynprog_genome.c: Added static variable for knownsplicingp
* extension-search.c, extension-search.h: Removed unused static variables
* gsnap.c: Changed default behavior for --find-dna-chimeras to be off when
only known splicing is requested. Using Outbuffer_lines routines
* gmap.c: Added --strand option. Fixed --max_deletionlength to
--max-deletionlength. Using Outbuffer_lines routines
* stage1hr.c, stage1hr.h: Using specified values for max_middle_insertions
and max_middle_deletions
* genome.c, samprint.c: Using new interface to Filestring_free
* filestring.c, filestring.h: Filestring_print now calls fwrite only once.
Implemented Filestring_string. Filestring_free now takes an argument
free_string_p
* sequence.c, sam_sort.c: Using a Filestring_T object in Sequence_stdout
* splice.c, splice.h: Using the specified value of novelsplicing instead of
being fixed as true previously
* pair.c: Added a static variable for knownsplicingp
* indel.c, indel.h: Using the specified values of max_end_insertions and
max_end_deletions, rather than a value of 3 previously
* kmer-search.c: Using new interface to Indel_solve_end_low and
Indel_solve_end_high
* get-genome.c: Restored behavior for coordinates requesting a reverse
complement segment
2020-12-08 twu
* samprint.c: For calls to Transcript_singleend_print, using the XY field
rather than XX
2020-11-28 twu
* genome.c, genome.h: Not printing messages to stderr for utility programs.
Implemented Genome_dump for faster printing of a given genome segment
* get-genome.c: Using USE_ALLOCATE access when memory mapping is not
available. Implemented a faster method of dumping genome contents
2020-11-20 twu
* index.html: Updated for latest version
* VERSION: Updated version number
* get-genome.c: Added option --transcriptdb to get-genome to retrieve a gene
structure
* gmap.c: Using new interface to Inbuffer_setup
* inbuffer.c, inbuffer.h: Providing NULL as arguments for genome and
genomealt to Request_new
2020-11-15 twu
* substring.c: Checking that pos5 < pos3 before Substring trim procedures
* kmer-search.c: In search_transcriptome_complete, aborting if trim5 >=
queryend
* genome128_hr.c: Removed obsolete debugging macros
* samprint.c: Added comments
* stage3hr.c, stage3hr.h: Handling SENSE_NULL distant splices (from
Distant_dna_solve) by Stage3end_substring_donor and
Stage3end_substring_acceptor
* stage1hr.c: Fixed a bug where first_read_p was always being set to true
for Distant_rna_solve
* kmer-search.c: In search_transcriptome_ends, making sure that the two
diagonals are on the same transcript or chromosome
* substring.c: Restored the filling buffer for the entire querylength
* splice.c: Returning the value of end_genomicpos from count procedures
* kmer-search.c: Trimming coordinates and then query. Checking query
coordinates before calling Genome_count_mismatches_substring
* genome.c: Restored trimming of ends at genome bounds
2020-11-14 twu
* extension-search.c: In process_seed, trimming query first at genome
bounds, then at chromosome bounds
* extension-search.c: Restoring the update to chrnum, chroffset, and chrhigh
in process_seed
* gmap.c, simplepair.c: Removed unused parameters
* oligoindex_hr.c: Hiding unused procedures
* output.c: Removed unused variables
* path-solve.c: Using new interface to Indel_resolve_middle_insertion and
Indel_resolve_middle_deletion procedures
* kmer-search.c: Using new interface to Indel_middle_insertion and
Indel_middle_deletion procedures
* splice.c, splice.h: Removed chrhigh as a parameter to Splice_resolve_sense
and Splice_resolve_antisense
* indel.c, indel.h: Removed chroffset and chrhigh as parameters to
Indel_middle_insertion and Indel_middle_deletion
* genome128_consec.c: Removed unused procedures
* genome.c: Removed reference to pos5 and pos3 in Genome_fill_buffer
* diag.c: Hiding obsolete procedures
* gsnap.c, path-solve.c, path-solve.h: Path_solve_setup no longer needs
genomelength
* genome.c, genome.h: Hiding Genome_fill_buffer_bounds
* indel.h: Procedures now take chroffset, chrhigh, troffset, and trhigh
* distant-dna.h: Distant_dna_setup no longer needs genomelength
* distant-dna.c, distant-rna.c, extension-search.c, indel.c, kmer-search.c,
path-solve.c, substring.c, terminal.c: Trimming query at chromosome bounds
rather than genome bounds
* splice.c, splice.h: Taking chroffset and chrhigh as parameters
* align.test.ok: New result has splice site probabilities
* trunk, VERSION, src, Makefile.gsnaptoo.am, boyer-moore.c, boyer-moore.h,
chimera.c, chimera.h, cigar.c, diag.c, diag.h, dinucl_bits.c,
dinucl_bits.h, distant-rna.c, dynprog.c, dynprog.h, dynprog_cdna.c,
dynprog_cdna.h, dynprog_end.c, dynprog_end.h, dynprog_genome.c,
dynprog_genome.h, dynprog_simd.c, dynprog_simd.h, dynprog_single.c,
dynprog_single.h, genome.c, genome.h, genome_canonical.c,
genome_canonical.h, genome_sites.c, genome_sites.h, get-genome.c, gmap.c,
gsnap.c, inbuffer.c, inbuffer.h, indel.c, junction.c, junction.h,
knownsplicing.c, maxent_hr.c, maxent_hr.h, oligoindex_hr.c,
oligoindex_hr.h, outbuffer.c, outbuffer.h, output.c, output.h, pair.c,
pair.h, pairpool.c, pairpool.h, path-solve.c, request.c, request.h,
snpindex.c, splice.c, splice.h, splicetrie.c, splicetrie.h,
splicetrie_build.c, stage1hr.c, stage2.c, stage2.h, stage3.c, stage3.h,
stage3hr.c, stage3hr.h, substring.c, substring.h, terminal.c: Merged
revisions 223430 through 223510 from branches/2020-11-04-mult-pairalign to
allow for multiple pairalignment in GMAP, which required providing genome
and genomealt to Genome procedures. Also allowing splice trim procedures
to assign sense when it is SENSE_NULL.
* stage3hr.c: No longer modifying scores when chrlength < querylength
* distant-rna.c: Modifying elt->qstart and elt->qend to pos5 and pos3 based
on genomic beginning and end
2020-11-04 twu
* dynprog.c, dynprog.h, dynprog_end.c, dynprog_end.h, dynprog_genome.c,
dynprog_genome.h, dynprog_single.c, dynprog_single.h, gmap.c, stage3.c,
stage3.h: Provided options --indel-open and --indel-extend for GMAP
* configure.ac, index.html, Makefile.gsnaptoo.am, access.c, bool.h,
filestring.c, filestring.h, gmap.c, gsnap.c, iit-read-univ.c,
iit-read-univ.h, inbuffer.c, inbuffer.h, mem.c, outbuffer.c, outbuffer.h,
samheader.c, samheader.h, samprint.c, shortread.c, shortread.h: Removed
code for MPI
* gmap.c: Removed unused code for align_relative
2020-10-27 twu
* gff3_genes.pl.in: Providing an option to exclude readthrough transcripts
* VERSION: Updated version number
* pair.c, simplepair.c: Allocating an extra byte for tokens, to prevent
compiler warnings
* access.c, access.h, add_rpk.c, alphabet.c, assert.c, atoi.c, atoiindex.c,
backtranslation.c, bigendian.c, bigendian.h, bitpack16-read.c,
bitpack16-write.c, bitpack32-read.c, bitpack32-write.c,
bitpack64-access.c, bitpack64-incr.c, bitpack64-read.c, bitpack64-read.h,
bitpack64-readtwo.c, bitpack64-serial-read.c, bitpack64-serial-read.h,
bitpack64-serial-write.c, bitpack64-speed-test.c, bitpack64-write.c,
block.c, boyer-moore.c, bp-read.c, bp-write.c, branchpoint.c,
bytecoding.c, bzip2.c, cellpool.c, changepoint.c, chimera.c,
chop_primers.c, chrnum.c, chrom.c, chrsegment.c, chrsubset.c, cigar.c,
cmet.c, cmetindex.c, color.c, compress-write.c, compress.c, compress.h,
compress128.c, concordance.c, concordance.h, cpuid.c, cum.c, datadir.c,
datum.c, diag.c, diagnostic.c, diagpool.c, dibase.c, dibaseindex.c,
distant-dna.c, distant-rna.c, doublelist.c, dynprog.c, dynprog.h,
dynprog_cdna.c, dynprog_end.c, dynprog_genome.c, dynprog_simd.c,
dynprog_single.c, epu16-bitpack64-access.c, epu16-bitpack64-incr.c,
epu16-bitpack64-read.c, epu16-bitpack64-read.h, epu16-bitpack64-readtwo.c,
epu16-bitpack64-write.c, except.c, except.h, exonscan.c,
extension-search.c, extents_genebounds.c, filestring.c,
filter-diagonals.c, filter-diagonals.h, fopen.c, fopen.h,
gamma-speed-test.c, gbuffer.c, gdiag.c, geneadjust.c, geneeval.c,
genome-write.c, genome.c, genome128-write.c, genome128.c,
genome128_consec.c, genome128_hr.c, genome_hr.c, genome_sites.c,
genomepage.c, genomeplot.c, genomicpos.c, genomicpos.h, genuncompress.c,
get-genome.c, getline.h, gmap.c, gmapindex.c, goby.c, gregion.c, gsnap.c,
gsnap_best.c, gsnap_concordant.c, gsnap_filter.c, gsnap_iit.c,
gsnap_multiclean.c, gsnap_tally.c, gsnap_terms.c, gsnapread.c, hint.c,
hitlistpool.c, iit-read-univ.c, iit-read.c, iit-read.h, iit-write-univ.c,
iit-write.c, iit_dump.c, iit_fetch.c, iit_get.c, iit_pileup.c, iit_plot.c,
iit_store.c, iit_tally.c, iit_update.c, iitdef.h, inbuffer.c, inbuffer.h,
indel.c, indexdb-cat.c, indexdb-write.c, indexdb-write.h, indexdb.c,
indexdb.h, indexdb_dibase.c, indexdb_dump.c, indexdb_hr.c, indexdbdef.h,
intersect-large.c, intersect-large.h, intersect.c, intersect.h,
interval.c, intlist.c, intlist.h, intlistpool.c, intron.c, junction.c,
kmer-search.c, knownsplicing.c, ladder.c, lgamma.c, list.c, list.h,
listpool.c, littleendian.c, littleendian.h, localdb-write.c,
localdb-write.h, localdb.c, localdb.h, localdbdef.h, mapq.c, master.c,
master.h, match.c, matchpool.c, maxent.c, maxent128_hr.c, maxent_hr.c,
md5-compute.c, md5.c, mdprint.c, mem.c, mem.h, memchk.c,
merge-diagonals-heap.c, merge-diagonals-heap.h,
merge-diagonals-simd-uint4.c, merge-diagonals-simd-uint4.h,
merge-diagonals-simd-uint8.c, merge-diagonals-simd-uint8.h,
merge-records-heap.c, merge-records-heap.h, merge-records-simd.c,
merge-records-simd.h, merge-uint4.c, merge-uint4.h, merge-uint8.c,
merge-uint8.h, method.c, mpidebug.c, nmath.c, oligo-count.c, oligo.c,
oligoindex.c, oligoindex.h, oligoindex_hr.c, oligoindex_localdb.c,
oligoindex_old.c, oligoindex_old.h, oligoindex_pmap.c, oligop.c,
orderstat.c, outbuffer.c, output.c, pair.c, pairingcum.c, pairingflats.c,
pairinggene.c, pairingstrand.c, pairingtrain.c, pairpool.c, parserange.c,
path-solve.c, pbinom.c, pdl_smooth.c, pdldata.c, plotdata.c, plotgenes.c,
pmapindex.c, popcount.c, popcount.h, random.c, rbtree.c, reader.c,
reads.c, reads_dump.c, reads_store.c, record.h, regiondb-write.c,
regiondb-write.h, regiondb.c, regiondb.h, regiondbdef.h, request.c,
result.c, resulthr.c, revcomp.c, sam_sort.c, samheader.c, samprint.c,
samread.c, sarray-read.c, sarray-search.c, sarray-write.c, sedgesort.c,
segment-search.c, segment-search.h, segmentpos.c, segue.c, semaphore.c,
semaphore.h, seqlength.c, sequence.c, sequence.h, shortread.c,
shortread.h, simplepair.c, smooth.c, snpindex.c, sortinfo.c,
spanningelt.c, splice.c, splicealt.c, spliceeval.c, splicefill.c,
splicegraph.c, splicescan.c, splicestringpool.c, splicetrie.c,
splicetrie_build.c, splicing-scan.c, splicing-score.c, stage1.c,
stage1hr.c, stage2.c, stage3.c, stage3hr.c, stopwatch.c, subseq.c,
substring.c, table.c, tableint.c, tableuint.c, tableuint8.c, tally.c,
tally_exclude.c, tally_expr.c, tallyadd.c, tallyflats.c, tallygene.c,
tallyhmm.c, tallystrand.c, terminal.c, transcript.c, transcript.h,
transcriptome.c, translation.c, trial.c, trindex.c, types.h, uint8list.c,
uint8list.h, uint8listpool.c, uint8table.c, uint8table.h, uint8table_rh.c,
uint8table_rh.h, uintlist.c, uintlist.h, uintlistpool.c, uinttable.c,
uinttable_rh.c, uinttable_rh.h, uinttableuint.c, uniqscan.c, univcoord.h,
univdiag.c, univdiagpool.c, univinterval.c, ushortlist.c, ushortlist.h:
Replaced all occurrences of <config.h> with 'config.h'
* access.c: Including fcntl.h
* access.h: Including stddef.h here instead of in access.c
* intlist.h: Providing a declaration for Intlist_free for systems where
HAVE_INLINE is false
* littleendian.c, littleendian.h: Hiding UINT8 from systems without
HAVE_64_BIT
2020-10-26 twu
* distant-rna.c: Increased allowable soft-clipping at distal end from 6 to
index1part
* stage1hr.c: Added a step, when multiple alignments exist on either end, to
impose consistency of sensedir
* stage3hr.c: If original sensedir is given, checking to see if it should be
changed to SENSE_NULL
* samprint.c: Transferring sensedir from mate only for concordant results
* substring.c: Making sensedir of substrings SENSE_NULL unless splices are
found at either end
2020-10-21 twu
* README: Fixed the command to generate refseq-to-chr.txt
* gff3_genes.pl.in: Using gene_name field if present, rather than ID in a
gene entry
* stage3hr.c: No longer using NONSPLICED_END_RESTORE to reduce
ref_score_within_trims
* stage3.c: Consolidated code for splice site probabilities into
intron_probs procedure, now called by assign_gap_types,
assign_intron_probs, and score_introns. In fill_in_gaps, probabilities
are used to decide between a short intron or long deletion
* gmap.c: Added option for --max-deletionlength. Increased default length
from 50 to 100
2020-10-20 twu
* kmer-search.c: In disambiguate_paths, sorting paths secondarily by number
of shared endpoints with the group. Also, handling the case where no exon
is in common with the entire overlapping cluster
2020-10-19 twu
* gsnap.c: Restored -E abbreviation for --distant-splice-penalty, needed by
Jenkins tests
* VERSION: Updated version number
* README: Added an extensive discussion of possible sources for genome and
SNP data, and described how to process them
* stage3hr.c, stage3hr.h: Removed unused parameter favor_multiexon_p from
Stage3hr_setup
* stage1hr.c: Providing debugging variables to Distant_rna_solve
* get-genome.c: Always calling Genome_setup after Genome_new
* genome.c: Added an assertion
* distant-rna.c, distant-rna.h, gsnap.c: Added a --fusion-sensitivity flag.
Increasing fusion sensitivity by default
* gvf_iit.pl.in: Added a -g flag for gunzipping each of the files on the
command line
* vcf_iit.pl.in: Added -C flag for keeping only lines marked COMMON. Added
-n flag for converting chromosome names, as for NCBI files
* gff3_genes.pl.in: Simplified code for printing genes. Removing
"transcript:" and "gene:" from transcript and gene IDs
* fa_coords.pl.in: Handling the --limit-to-names option. For circular
chromosomes, now using the new name of the chromosome
* gmap_build.pl.in: Added --limit-to-names flag
2020-10-17 twu
* inbuffer.c, stage2.c: Modified debugging statements
* gmap.c, stage3.c, stage3.h: Made the --selfalign procedure faster by
creating an alignment directly from the query sequence
2020-10-15 twu
* VERSION, index.html: Updated version number
* gmap.c: Added option --nomargin. Removed -N abbreviation for --nolengths
* pair.c, pair.h: Fixed printing of ruler for long wrap lengths
* kmer-search.c: Fixed parameters to Genome_count_mismatches_substring and
Substring_new in single_hits_gminus. Simplified indentation in
search_transcriptome_ends
2020-10-14 twu
* stage3hr.c: Changed remap_transcriptome_p into a macro
* genome128_hr.c: Added assertions that pos5 < pos3
* distant-rna.c: Ensuring that pos5 < pos3 when checking for mismatches
* substring.c: Checking for pos3 == pos5 and not computing mismatches for
that case
2020-10-13 twu
* kmer-search.c: Initializing abortp in the correct place for
transcriptome-guided genomic alignment
2020-10-12 twu
* fa_coords.pl.in: Checking for circular and altscaffold arguments to be
plain files
* substring.c: Fixed the printing of stars when one end is hardclipped and
one end is softclipped
* iit_get.c: Added another exception to printing a sequence
* gsnap.c: Fixed the determination of paired_end_p
* sam_sort.c: Printing output in multiples of a page size
2020-10-09 twu
* sam_sort.c: Using an output chunk of 1024 SAM lines to prevent excessively
large content strings
* trunk, VERSION, index.html: Revised version number
* src, sam_sort.c, samread.c, samread.h: Merged revisions 223120 through
223145 from branches/2020-10-06-improve-sam-sort to implement a faster
version of sam_sort with working option for split output
2020-10-06 twu
* sam_sort.c: Freeing stopwatch later. Added monitoring output. Revised
debugging statements.
* samread.c: Fixed one terminating character
* gmap.c: Fixed merge_middlepieces so all three pieces are merged
* stage3.c: Modified debugging statement
* gmap.c, gsnap.c: Using new interface to Outbuffer_setup
* outbuffer.c, outbuffer.h: Outbuffer_setup using new interface to
SAM_header_open_file
* sam_sort.c: Fixed the opening of empty files when --split-output is
requested
* samheader.c, samheader.h: SAM_header_open_file now takes paired_end_p as a
parameter
2020-10-05 twu
* samheader.c: Added call to Filestring_stringify to read in SAM headers
properly
* filestring.c: Fixed debugging statements
* sam_sort.c: Fixed Cell comparison function
* samread.c: Fixed parsing functions from file
* gmapindex.c: For COUNT action, printing both number of offsets and genome
length
* gmap_build.pl.in: No longer creating regiondb for large genomes
2020-09-17 twu
* Makefile.gsnaptoo.am: Including sam_sort again
2020-09-13 twu
* VERSION, index.html: Updated version number
* path-solve.c, distant-rna.c, stage3hr.c, terminal.c: Using new interfaces
to Substring trim functions
* gmap_build.pl.in: Fixed flag in help output
* substring.c, substring.h: For DNA-seq, not allowing extension of last
mismatch at end if it extends beyond chromosomal bounds
2020-07-10 twu
* pair.c: Added semicolon between Dir and coverage in gff3 output
2020-06-29 twu
* trunk, VERSION: Revised version number
* src: Merged revisions 222852 through 222926 from
branches/2020-06-12-end-trimming
* terminal.c: Merged revisions 222852 through 222926 from
branches/2020-06-12-end-trimming to use new interface to
Subsrtring_trim_qend_nosplice
* stage1hr.c, stage1hr.h: Merged revisions 222852 through 222926 to use both
max_mismatches_refalt and max_mismatches_ref
* gsnap.c: Merged revisions 222852 through 222926 to add
--max-mismatches-ref and to change --ignore-trim-in-filtering to
--filter-within-trims
* substring.c, substring.h: Merged revisions 222852 through 222926 from
branches/2020-06-12-end-trimming to backup 1 bp for mismatches at the ends
of reads for DNA-seq
* stage3hr.c, stage3hr.h, stage3hrdef.h: Merged revisions 222852 through
222926 from branches/2020-06-12-end-trimming to use refalt for scoring and
both ref and refalt for filtering
* indel.c: Merged revisions 222852 through 222926 from
branches/2020-06-12-end-trimming to use new interface to
Genome_count_mismatches_substring
* genome128_hr.c, genome128_hr.h: Merged revisions 222852 through 222926
from branches/2020-06-12-end-trimming to handle masked genomes
2020-06-22 twu
* gsnap.c: Using new interface to Indel_setup
* indel.h: Providing genomelength in Indel_setup
* indel.c: Using Genome_fill_buffer_ref instead of Genome_fill_buffer_blocks
to avoid issues at ends of the genome
* genome.c, genome.h: Implemented Genome_fill_buffer_ref
2020-06-18 twu
* cpuid.c: Fixed parameter list for Intel compilers
* stage3.c: Adding another exception for long end introns: end exon must be
less than 40 bp
2020-06-15 twu
* VERSION: Updated version number
* trunk, src, genome128_consec.c, genome128_hr.c, splice.c: Merged revisions
222853 through 222859 from branches/2020-06-12-end-trimming to increase
MIN_EXON_LENGTH from 9 to 20
2020-06-13 twu
* trunk, src, substring.c: Merged revision 222854 from
branches/2020-06-12-end-trimming to iterate through correct number of
mismatches from Genome trim procedures, and when splicing fails, using
trimpos rather than pos5 or pos3
2020-06-04 twu
* samprint.c: Fixed memory leak with mate_md_fp in nomapping alignments
* trunk, VERSION, config.site.rescomp.tst: Updated version number
* index.html: Updated for new version
* src, Makefile.gsnaptoo.am, cigar.c, cigar.h, gsnap.c, mdprint.c,
mdprint.h, samprint.c, samprint.h: Merged revisions 222824 through 222834
from branches/2020-06-03-MD to integrate computation of CIGAR and MD
strings
2020-06-03 twu
* trunk, cigar.c, cigar.h, samprint.c: Merged revisions 222820 through
222823 from branches/2020-06-03-MD to compute MD string correctly for
hardclipping on minus alignments and for the --sam-hardclip-use-S flag
* samprint.c: Removed debugging macro
* samprint.c: Removed code that led to a zero-length MD string when
hard-clipping was present
* substring.c: Changed format of debugging statement to handle both regular
and large genomes
* splice.c, oligoindex_hr.c, kmer-search.c, iit-read-univ.c: Removed unused
variables
* gsnap.c: Using new interfaces to Indexdb_new_genome and
Indexdb_new_transcriptome
* gmap.c, atoiindex.c, cmetindex.c, indexdb-cat.c: Using new interface to
Indexdb_new_genome
* genome.c, transcriptome.c: Removed unused variable
* compress-write.c: Fixed messages to stderr
* stage3.c: Changed type of intronlength from Chrpos_T to int. Changed
types of new_leftgenomepos and new_rightgenomepos to be int
* stage1hr.c: Using new interface to Terminal_solve_plus,
Terminal_solve_minus, and Distant_rna_solve
* distant-rna.c, distant-rna.h: Removed unused parameters queryuc_ptr and
queryrc
* terminal.c, terminal.h: Removed unused parameters queryuc_ptr and queryrc
from Terminal_solve_plus and Terminal_solve_minus, respectively
* uniqscan.c: Using new interface to Stage3_new_genome
* stage3hr.c: Using new interface to Junction_new_chimera
* junction.c, junction.h: Removed unused parameter sensedir from
Junction_new_chimera
* indexdb.c, indexdb.h: Removed unused parameter expand_offsets_p
* trunk, VERSION, config.site.rescomp.tst, index.html, src, distant-dna.c,
distant-dna.h, gsnap.c, indel.c, indel.h, kmer-search.c, path-solve.c,
splice.c, splice.h, stage3hr.c, stage3hr.h: Merged revisions 222790
through 222796 from branches/2020-06-03-TGGA to fix bugs in
transcriptome-guided genomic alignment
2020-06-02 twu
* stage3hrdef.h: Added comment
* substring.c: Made computations for mandatory trims similar to those for
querystart_chrbound and queryend_chrbound
* stage3hr.c: Computing fields mandatory_trim_querystart and
mandatory_trim_queryend and using them in computing coverage
* stage3hrdef.h: Added fields mandatory_trim_querystart and
mandatory_trim_queryend
* kmer-search.c: Changed types of genomic coords from Trcoord_T to
Univcoord_T
* get-genome.c, snpindex.c: Added calls to Genome_setup
2020-06-01 twu
* VERSION: Updated version number
* gmap.c: Using new interface to Genome_user_setup
* genome.c, genome.h: Added genomelength to Genome_user_setup
* gsnap.c: Moved Genome_setup before knownsplicing initialization
2020-05-31 twu
* gmap.c, gsnap.c: Using new interface to Genome_setup
* index.html: Updated for latest version
* VERSION: Updated version number
* README: Updated information
* genome.c: Fixed bug in specifying coordinates in Genome_fill_buffer_simple
* genome.c, genome.h: Modifying pos5 and pos3 in Genome_fill_buffer_simple
to avoid going outside of genome bounds
* stage3hr.c: Fixed a bug in Stage3end_remove_duplicates that failed to
return distant alignments
* gmap_build.pl.in: Checking if transcript FASTA or genes file is provided
but transcriptome name is not
* gsnap.c: Using new interfaces to setup procedures
* stage3hr.c, stage3hr.h, substring.c, substring.h, terminal.c, terminal.h,
distant-rna.h, extension-search.c, extension-search.h, kmer-search.h,
path-solve.c, path-solve.h: Providing genomelength to setup procedures
* kmer-search.c: Fixed calculations of pos5 and pos3 for transcriptome bounds
* distant-rna.c: Fixed bug in computation of pos3. Providing genomelength
to setup procedure
* stage3hr.c, stage3hr.h: Removed unused variables for Stage3hr_setup
* gsnap.c, substring.c, substring.h: Removed unused variables for setup
* get-genome.c: Added a --genes option for converting a genes file to FASTA
format
* gmap_build.pl.in: Added --genes option for building a transcriptome from a
genes file
* gff3_genes.pl.in: Added options for printing exon and/or CDNA fields.
Printing only exons by default
* get-genome.c: Removed unused code
* kmer-search.c: Fixed issue with uninitialized variable in
transcriptome-guided genomic alignment
2020-05-30 twu
* VERSION: Updated version number
* gmap_build.pl.in: Building a genome index based on the presence of genome
FASTA files
* path-solve.c: Handling the case where best_left_paths or best_right_paths
is NULL, due to an alignment attempt that yields an unacceptable path
* kmer-search.c: No longer using SIMD shortcuts for computing exoni. Not
comparing len against nindels, only exon_residual against nindels
* stage1hr.c: Edited comment
* stage3hr.c: Removed test against MIN_ALIGNMENT_LEN
* transcriptome.c: Added variable for debugging
* transcript.c: Fixed memory leak
* distant-rna.c: Computing fragment substring bounds for new
Genome_mismatches_left and Genome_mismatches_right
* distant-dna.c: Fixed memory leak. Allocating memory for new
Genome_mismatches_left and Genome_mismatches_right
* genome128_hr.c: For Genome_mismatches_left and Genome_mismatches_right,
enforcing that nmismatches <= max_mismatches
2020-05-29 twu
* VERSION: Updated version number
* iit_store.c: Fixed bug in allocating memory for string
* path-solve.c: Rearranged debugging code
* distant-rna.c: Using new interface to Substring_qstart_trim and
Substring_qend_trim
* stage3hr.c: In Stage3end_new_substitution, checking for pos5 and pos3
being outside of genome bounds
* substring.c, substring.h: Changed parameter names for
Substring_qstart_trim and Substring_qend_trim
* iit_store.c: Fixed bug with double freeing line in parsing GFF3 files
* concordance.c, concordance.h, distant-dna.c, distant-dna.h, kmer-search.c,
kmer-search.h, path-solve.c, path-solve.h, samprint.c, samprint.h,
simplepair.c, simplepair.h, stage1hr.c, stage1hr.h, terminal.c,
terminal.h: Using splicingp instead of novelsplicingp
* gsnap.c: Added variable splicingp and providing it to setup procedures
2020-05-28 twu
* stage1hr.c: Testing novelsplicingp before all code for antisense hits
* ladder.c: In Ladder_minimax_trim, handling the case where the antisense
ladders are NULL
* VERSION: Updated version number
* trunk, src, concordance.c, concordance.h, distant-dna.c, distant-dna.h,
distant-rna.c, distant-rna.h, extension-search.c, extension-search.h,
genome128_hr.c, gsnap.c, indel.c, indel.h, junction.c, junction.h,
kmer-search.c, kmer-search.h, ladder.c, ladder.h, path-solve.c,
path-solve.h, segment-search.c, segment-search.h, splice.c, splice.h,
stage1hr.c, stage3hr.c, stage3hr.h, stage3hrdef.h, substring.c,
substring.h, terminal.c, terminal.h: Merged revisions 222646 through
222711 from branches/2020-05-23-min-coverage to improve splice-plus-indel
alignments, allow multiple paths in path-solve procedures, and find
concordance separately for sense and antisense alignments
2020-05-23 twu
* indexdb-cat.c: Handling cases where sampling intervals are different
* gsnap.c: Using new interface to Extension_search_setup
* extension-search.c, extension-search.h: Generalized from an index1interval
of 3
2020-05-20 twu
* stage1hr.c: Always making calls to Stage3end_filter
* stage3hr.c: Added a debugging statement
2020-05-19 twu
* trunk, src, gsnap.c, indel.c, ladder.c, merge-diagonals-simd-uint4.c,
regiondb.c, splice.c, stage1hr.c, stage3hr.c, stage3hr.h, stage3hrdef.h,
terminal.c, terminal.h: Merged revisions 222614 through 222628 from
branches/2020-05-18-filtering to improve the filtering and choices among
alignments
* VERSION, config.site.rescomp.tst: Updated version number
2020-05-18 twu
* substring.c: Removed printing of NA for probabilities under NO_COMPARE
macro
2020-05-15 twu
* splice.c: Removed code that exited prematurely from search for indels plus
splicing
* splice.c: Removed unused code
2020-05-14 twu
* stage3hr.c: In Stage3end_new_substrings, checking if an alignment in a
circular chromosome exceeds chrlength
* substring.c: Added assertions
2020-05-13 twu
* stage3hr.c: Setting trim_querystart and trim_queryend, and then
querystart_chrbound and queryend_chrbound, revising according to trim
values
* stage3hr.c: Applying minimum alignment length to Stage3_new_substrings
* stage3hr.c: Requiring a minimum exon length before reducing score for
spliced ends
* extension-search.c: Using new interface to Univ_IIT_update_chrnum
* stage3hr.c: Fixed penalty for trims to work with spliced ends
* gsnap.c: Restored the -E abbreviation for --distant-splice-penalty
2020-05-12 twu
* trunk, config.site.rescomp.tst, src, distant-rna.c, extension-search.c,
extension-search.h, gsnap.c, kmer-search.c, kmer-search.h, path-solve.c,
path-solve.h, segment-search.c, segment-search.h, stage1hr.c, stage1hr.h,
stage3hr.c, stage3hr.h: Merged revisions 222561 through 222589 from
branches/2020-05-08-large-insertions to identify large insertions
* Makefile.gsnaptoo.am: Added comment
* stage3hr.c: Not compensating for short chrlengths if chrlength is 0
* stage1hr.c: Filtering singlehits5 and singlehits3
* terminal.c, distant-rna.c: Using new interface to Univ_IIT_get_chrnum
* genome128_hr.c: Setting final querypos to be pos5 - 1, rather than -1
* concordance.h: Removed unused header file
* iit-read-univ.c, iit-read-univ.h: Univ_IIT_get_chrnum,
Univ_IIT_update_chrnum, and Univ_IIT_get_trnum take low and high as
parameters
* segment-search.c: Checking for the need to go to a previous chrnum
* intersect-large.c: Adding parentheses for clarity
* kmer-search.c: Using new interface to Univ_IIT_get_trnum
* path-solve.c: In attach_qstart_diagonal and attach_qend_diagonal, checking
that left+pos5/pos3 are not before the beginning of the genome
2020-05-09 twu
* intersect-large.c: Fixed check for positions0 at beginning of genome
* extension-search.c: Using new interface to Univ_IIT_update_chrnum
2020-04-25 twu
* stage3hr.c: Redefining low and high in Stage3end_T object to be based on
the aligned endpoints
* extension-search.c: Checking for genomic bounds on each of the
univdiagonals
2020-04-24 twu
* intersect-large.c, intersect.c: Fixed Intersect_exact_indices routines to
handle small univdiagonals less than diagterm
2020-04-22 twu
* splice.c: Specifying a minimum splice prob
* stage3hr.c: Adding slop for insert length and splice score
* util, gmap_cat.pl.in: Removed unused lines
* trunk, config.site.rescomp.prd, config.site.rescomp.tst, src,
Makefile.gsnaptoo.am, concordance.c, distant-rna.c, genome128_hr.c,
genome128_hr.h, gmap.c, gsnap.c, indexdb.c, kmer-search.c, ladder.c,
path-solve.c, path-solve.h, splice.c, splice.h, splicetrie.c,
splicetrie.h, stage3.h, stage3hr.c, stage3hr.h, stage3hrdef.h,
substring.c, substring.h, terminal.c: Merged revisions 222160 through
222483 from branches/2020-03-13-exon-intron-scores to allow for masked
genomes
* simplepair.c: No longer printing transcript information, since it requires
a merging step
* samprint.c: Using new interfaces to procedures for printing transcripts
* stage3hr.c, stage3hr.h, stage3hrdef.h, transcript.c, transcript.h: Removed
transcripts5 and transcripts3 from Stage3pair_T object, and computing and
printing concordance when needed
* stage3hr.c: Put debugging statements within a macro
* stage3hr.c: Merged revision 222198 from branches/2020-03-13 to restore
hit_equal and hitpair_equal procedures for handling overlaps within loci
* substring.c: Merged revision 222188 from
branches/2020-03-13-exon-intron-scores to improve trimming
* stage1hr.c: Merged revision 222189 from
branches/2020-03-13-exon-intron-scores to call optimal_score_prefinal
before removing overlaps and optimal_score_final
* splice.c, splice.h: Merged revision 222187 from
branches/2020-03-13-exon-intron-scores to taking probability and
nconsecutive thresholds as parameters
2020-04-21 twu
* iit-read-univ.c, iit-read-univ.h: Removed obsolete procedure
Univ_IIT_interval_bounds_linear
* kmer-search.c: Using Univ_IIT_get_trnum
* trunk, configure.ac, src, Makefile.gsnaptoo.am, chrnum.h, distant-dna.c,
distant-rna.c, extension-search.c, extension-search.h, gmapindex.c,
gsnap.c, iit-read-univ.c, iit-read-univ.h, indexdb.c, indexdb.h,
kmer-search.c, localdb.c, merge-diagonals-heap.c,
merge-diagonals-simd-uint4.c, merge-diagonals-simd-uint8.c, path-solve.c,
path-solve.h, record.h, regiondb.c, regiondb.h, segment-search.c,
stage1hr.c, stage3hr.c, stage3hr.h, substring.c, substring.h, terminal.c,
util: Merged revisions 222404 through 222444 from
branches/2020-04-14-right-diagonals to use univdiagonals instead of lefts
2020-04-18 twu
* samprint.c: Allowing for XS field to be printed in transcriptome-guided
alignment
* index.html: Updated for version 2020-04-08
* gsnap.c: Using new interface to Transcriptome_new
* gmap_build.pl.in: Added options for building a transcriptome db
* transcriptome.c, transcriptome.h: Putting transcriptome info into a
subdirectory of the genome db
* trindex.c: Putting transcriptome info into a subdirectory in the genome db
* kmer-search.c: Fixed bug in computing adj
* Makefile.gsnaptoo.am: Restored trindex program
2020-04-15 twu
* output.c: Fixed bug in checking for a condition
2020-04-14 twu
* gmap.c, output.c, output.h: Added options cdna+introns and genomic+introns
to the --exons flag
2020-04-13 twu
* svncl.pl: Grouping files with identical comments
2020-04-12 twu
* iit_get.c: Setting coordstart and coordend to 0 when force_label_p is true
2020-04-10 twu
* VERSION: Updated version number
* access.c, access.h, iit-read-univ.c, iit-read.c: Removed procedures for
read/write memory mapping
2020-04-08 twu
* trunk, src, gmapindex.c, indexdb-cat.c, gmap_cat.pl.in: Merged revisions
222346 to 222387 from branches/2020-03-13-exon-intron-scores to remove the
-F flag from concatenation programs
2020-04-05 twu
* VERSION, config.site.rescomp.prd, index.html: Revised for latest version
* svncl.pl: Adding spaces between lines of multi-line comments
* svncl.pl, MAINTAINER: Replaced svncl.pl with a program that does not
depend on xml output
* configure.ac: Added comment
* dynprog_end.c: Fixed debugging macro
* gsnap.c: Turned shared memory off by default
* gmap.c: Turned shared memory off by default. Added option
--use-shared-memory
* compress-write.c, regiondb-write.c: Initializing value of current_pos in
concatenation procedures
2020-04-02 twu
* get-genome.c: Added option --add-circular
2020-03-27 twu
* iit_store.c: Added option --accesion-only
2020-03-23 twu
* compress-write.c, compress-write.h, gmapindex.c: Allowing for compressing
genomes from stdin
* iit_get.c: Removed debugging command
* compress-write.c, compress-write.h, gmapindex.c: Implemented option for
compressing FASTA files
* iit_get.c: Fixed printing of sequence with coordinates
* gmap.c, output.c, output.h, pair.c, pair.h: Differentiating between
mask_introns and mask_utr_introns
2020-03-18 twu
* gmap.c, output.c, output.h, pair.c, pair.h: Added option --mask-introns to
GMAP
2020-03-13 twu
* iit_get.c: Ignoring linefeed characters in handling coordinates
* stage3hr.c, stage3hrdef.h, substring.c, substring.h: Computing
querystart_chrbound and queryend_chrbound, and disregarding in comparing
against max_mismatches
* index.html: Added statement about nosimd versions being restored
2020-03-12 twu
* archive.html, index.html: Updated for latest version
* trunk, VERSION, config.site.rescomp.prd, config.site.rescomp.tst,
index.html, src, Makefile.gsnaptoo.am, atoi.c, atoi.h, atoiindex.c,
cmet.c, cmet.h, cmetindex.c, dynprog_end.c, gmap.c, gmapindex.c, gsnap.c,
iit-read.c, iit-read.h, indexdb-write.c, indexdb.c, indexdb.h,
intersect.c, kmer-search.c, kmer-search.h, localdb.c,
merge-diagonals-simd-uint8.c, path-solve.c, path-solve.h,
regiondb-write.c, regiondb-write.h, regiondb.c, regiondb.h, regiondbdef.h,
stage1hr.c, stage1hr.h, transcriptome.c, trindex.c, uint8list.c,
uint8list.h, util, fa_coords.pl.in, gmap_build.pl.in, gmap_cat.pl.in:
Merged revisions 221585 through 222138 from
branches/2020-02-01-local-fixed-size to implement a regiondb hash to
replace the localdb hash
* MAINTAINER: Revised instructions for rosalind
* bootstrap.gsnaptoo: Using automake and autoreconf in path
* index.html: Added comment about default behavior for localdb usage
2020-03-06 twu
* gsnap.c: Setting default behavior for localdb usage to be true for RNA-seq
and false for DNA-seq
* gmap.c: Setting npaths_primary and npaths_altloc when stage3list is NULL
* atoiindex.c, cmetindex.c, genome.c, indexdb-write.c, indexdb.c, localdb.c,
snpindex.c: Using new interface to Access_mmap
* access.c, access.h: Access_mmap now returns seconds
2020-02-20 twu
* trunk, index.html, src, iit_store.c, indexdb-cat.c, indexdb.c, util,
gmap_cat.pl.in: Merged revisions 221935 through 221944 from
branches/2020-02-01-local-fixed-size to handle circular chromosomes in
gmap_cat
* gmap_cat.pl.in, indexdb-cat.c: Allowing -F flag to handle multiple source
directories
* gmapindex.c: Improved user error message
* gmap_build.pl.in: Made none the default value for sorting. Removed usage
of sourcedir and -F in calls to gmapindex.
* gmapindex.c: Removed usage of -F except for concatenating genomes.
Allowing -F to handle multiple source directories
2020-02-19 twu
* README: Now references LICENSE file
* Makefile.am: Added LICENSE file for distribution
* LICENSE: Initial import
* COPYING: Removed old license. Now refers to the LICENSE file
* NOTICE: Removed references to suffix array code. Added journal reference
for Lemire and Boytsov
* compress-write.c: Implemented a simpler and more general algorithm for
Compress_cat
2020-02-17 twu
* gmapindex.c: Fixed bug when -F and -D values are different when
concatenating genomes
2020-02-13 twu
* indexdb.c: Fixed lengths for kmer size and sampling and error message to
user
* gmap_build.pl.in: Removed unused variable
* gmap_cat.pl.in: Fixed Id comment line
* gmap_cat.pl.in: Added Id comment line
* gmap_cat.pl.in: Removed references to sleep. Added better final message
* gmap_build.pl.in, gmap_cat.pl.in: Adding package version numbers to
version file
* Makefile.am: Removed obsolete scripts
* indexdb-cat.c: Updating required_index1part and required_index1interval so
they all match
* indexdb.c: Added user statement about required kmer and interval
* gmap_cat.pl.in: Checking compiler assumptions and creating version file
* compress-write.c: Made fixes to computation of flags
* trunk, index.html: Updated version
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst: Updated version
number
* gmap_cat.pl.in: Implemented --names option
* gmap_build.pl.in: Changed documentation
* gmap_cat.pl.in: Initial import
* index.html: Updated for latest version
* access.h: Added NOT_USED as an access type
* access.c: Adding a check for a null filename
* gmap_build.pl.in: Removed comment about gmap_setup
* indexdb-cat.c: Using destdir, which should be set correctly by gmap_cat
* iit_store.c: For universal IIT output, adding a circular type
* gmapindex.c: Added code for concatenating genomecomp
* gmap.c: Turning off contig output
* compress-write.c: Put debugging messages inside macro
* bitpack64-write.c, bitpack64-write.h: Using
Bitpack64_compute_q4_diffs_bidir_huge. Removed duplicate code
* README: Removed references to suffix array
* configure.ac: Added gmap_cat. Removed obsolete scripts
* Makefile.gsnaptoo.am: Added instructions for indexdb_cat
* indexdbdef.h: Added a field hugep
* indexdb.c: Modified user messages
* indexdb-cat.c: Implemented procedures for huge genomes and 8-byte positions
* indexdb-cat.c: Implemented uint8 procedure, but has errors
* indexdb-cat.c: Implemented merging of positions
* indexdb.c: Cleaned up Indexdb_new_genome procedure
2020-02-12 twu
* indexdb-cat.c: Initial import
* compress-write.c: Enclosed debugging statements into macros
* compress-write.c: Fixed bugs in Compress_cat
* compress-write.c: Simplified code for Compress_cat
2020-02-11 twu
* compress-write.c: Made fixes for shifts < 16
* compress-write.c, compress-write.h: Initial implementation of Compress_cat
for concatenating genomes
* Makefile.gsnaptoo.am: Removed trindex and sam_sort
* gmap.c, pair.c, sequence.c, sequence.h, stage3.c, stage3.h: Added option
--gff3-fasta-annotation to GMAP
2020-02-04 twu
* iit-read.c: Fixed IIT_dump for intervals with start and end both being 0
* iit_store.c: Handling the case where annotation has zero length
* gmap_build.pl.in: Returning pipe variable
* stage3.c: Using new interface to Pair_print_gff3
* sequence.c, sequence.h: Implemented Sequence_restofheader
* path-solve.c: No longer using localdb for large genomes
* pair.c, pair.h: For GFF3 output, printing FASTA headers as annotation
* localdb.c: Created a separate debugging category
2020-01-30 twu
* Makefile.gsnaptoo.am: Removed ushortlist.c
* dynprog.h, dynprog_cdna.h, dynprog_end.h, dynprog_genome.h,
dynprog_single.h, extension-search.h, genome-write.h, genome128-write.h,
iit-read-univ.h, ladder.h, record.h, stage2.h, stage3hrdef.h: Added
include for genomicpos.h
* chrnum.h, genome.h, genomicpos.h, merge-records-heap.h,
merge-records-simd.h, path-solve.h: Added include for univcoord.h
* snpindex.c: Making genomeblocks for snpindex the same length as for the
reference
* parserange.c: Initializing coordstart and coordend
* localdbdef.h, localdb.c, localdb.h: Using a single loctable
* localdb-write.c, localdb-write.h: Using Uintlist_T instead of Ushortlist_T
* iit_get.c: If coords are given, printing the corresponding substring of
the annotation
* gsnap.c, gmap.c: Allowing pipe signals
* gmapindex.c: Printing user messages about compression only when offsets
are compressed
* epu16-bitpack64-readtwo.c: Declaring a variable
2020-01-23 twu
* iit_store.c: Allowing input FASTA to have labels without intervals
2019-12-15 twu
* fa_coords.pl.in, gmap_build.pl.in, gmap_process.pl.in: Handling 1-column
names.txt file
* stage3.c: Changed some variables from int to Chrpos_T
* merge-diagonals-simd-uint8.c: Using correct printf format for uint8
* localdb.c, localdb.h, localdbdef.h, localdb-write.c, localdb-write.h:
Handling large genomes
* indexdb-write.c, indexdb-write.h: Changed procedure names to make them
clearer
* gmapindex.c: Using new interfaces to indexdb and localdb write procedures
* epu16-bitpack64-write.h: Added comment
* Makefile.gsnaptoo.am, ushortlist.c, ushortlist.h: Added Ushortlist_T
object for localdb
* pair.c: Added cDNA direction to GFF3 output
2019-12-12 twu
* path-solve.c: Changed type for deletionpos to be Univcoord_T
2019-09-12 twu
* src, concordance.c, concordance.h, distant-dna.c, distant-dna.h,
extension-search.c, extension-search.h, gsnap.c, junction.c, junction.h,
kmer-search.c, kmer-search.h, list.c, list.h, stage1hr.c, stage1hr.h,
stage3hr.c, stage3hr.h, stage3hrdef.h, substring.c, substring.h: Merged
revisions 220290 through 220325 from
branches/2019-09-11-zero-length-introns to handle cases where ambiguous
ends are resolved and where distant DNA alignments yield a zero-length
intron
* index.html: Updated for latest version
2019-08-09 twu
* inbuffer.c, inbuffer.h: Removed references to interleavedp from GMAP
* gsnap.c: Added --interleaved feature
* bzip2.c: Saving a file handle and closing it
* Makefile.gsnaptoo.am: Including bzip2.c and bzip2.h to relevant programs
* atoiindex.c, cmetindex.c: Initializing filenames to be NULL
2019-07-15 twu
* inbuffer.c, inbuffer.h, shortread.h: Added interleavedp parameter
* shortread.c: Implemented interleaved format for gzip- and bzip2-compressed
files.
* getline.c, getline.h: Implemented Getline_gzip and Getline_bzip2
* shortread.c: Implemented Shortread_read_interleaved_text
2019-06-11 twu
* stage1hr.c: Added debugging statements
* concordance.c: Limiting number of overlaps to avoid combinatorial
explosion in some cases
2019-05-20 twu
* index.html: Updated for latest version
* ax_ext.m4: Improved structure for AVX2 and AVX512
2019-05-12 twu
* gmap.c, gsnap_select.c, gsnapl_select.c, cpuid.c, cpuid.h, gmap_select.c,
gmapl_select.c: Adding support for avx512bw
* gsnap.c: Changed default parameter for --max-mismatches for DNA-seq
* Makefile.gsnaptoo.am: Adding programs for avx512bw
* configure.ac: Adding option for AVX512BW SIMD
* ax_cpuid_intel.m4, ax_cpuid_non_intel.m4: Adding test for AVX512BW support
* ax_ext.m4: Adding commands for AVX512BW
* univdiagpool.c: Adding assertions
* substring.c: Checking against substrings on the wrong chromosome
* stage1hr.c: Commenting out extended algorithm, which can cause problems
with repetitive reads
* output.c: Consider excessive output to be a fail for the purpose of the
--nofails flag.
* merge-uint8.c: Fixed SIMD command for AVX512 machines
2019-03-25 twu
* extension-search.c, extension-search.h: Implemented extension of elt sets
in the opposite direction
2019-03-19 twu
* segment-search.c: For alignments straddling a chromosome, recomputing
querypos and queryend to cover the new chromosome
* gsnap.c: Using new interface for Path_solve_setup
* path-solve.h: New interface for Path_solve_setup
* path-solve.c: Not allowing splices on circular chromosomes
* concordance.c: Using field sensedir_for_concordance
* stage3hr.c, stage3hrdef.h: Now using fields sensedir_for_concordance and
sensedir
* samprint.c: Removed references to Stage3end_sensedir_distant_guess
2019-03-15 twu
* trunk, VERSION, config.site.rescomp.tst, index.html, src,
Makefile.gsnaptoo.am, distant-dna.c, distant-dna.h, distant-rna.c,
distant-rna.h, gsnap.c, method.c, method.h, output.c, path-solve.c,
samprint.c, samprint.h, splice.c, splice.h, stage1hr.c, stage3hr.c,
stage3hr.h, stage3hrdef.h, substring.c, substring.h, terminal.c: Merged
revisions 218560 through 218674 from branches/2019-03-07-distant-dna to
implement distant splicing and to fix some bugs in spliced alignments
* genome128_hr.c: Added comment
2019-03-06 twu
* distant-dna.h, distant-dna.c: Initial import
2019-03-05 twu
* gsnap.c: Added option --use-local-hash
* trunk, VERSION, config.site.rescomp.tst, src: Updated for latest version
2019-03-04 twu
* localdb.c, localdb.h, path-solve.c: Merged revisions 218528 and 218529
from branches/2019-03-04-fix-repetitive to limit recursive procedures
* stage3hr.c: Fixed debugging statements
* stage1hr.c, kmer-search.c: Added debugging statements
* segment-search.c: Fixed debugging statement
* gsnap.c: Removed variables relating to stage2 suboptimal alignments
2019-03-02 twu
* path-solve.c: For compute_qstart_paths and compute_qend_paths, added a
max_depth criterion, and checking for repetitive positions
2019-03-01 twu
* stage3hr.c: Checking nmismatches when we are checking all assertions
* path-solve.c: Revising qstart of middle segment if an insertion is
present. Revised code for computing ninserts
* path-solve.c: Fixed addition of alts substring to best path from
all_child_paths, rather than to path
2019-02-26 twu
* VERSION, config.site.rescomp.prd, index.html: Updated for latest version
* substring.c: In trimming at ends without splicing, extending to the end if
nmismatches is 0
* stage3hr.c: For score_within_trims, adding a penalty for long ambiguous
ends
* substring.c: Restored previous algorithm for computing trim at ends with
no splice. For alts with good splice probability, counting substring as
nmatches rather than amb. For alts with poor splice probability, counting
substring as amb.
2019-02-25 twu
* path-solve.c: Fixed cases in compute_qstart_paths and compute_qend_paths
where terminalp was not being set
2019-02-22 twu
* trunk, config.site.rescomp.prd, config.site.rescomp.tst, src,
concordance.c, concordance.h, distant-rna.c, distant-rna.h,
extension-search.c, extension-search.h, genome128_hr.c, gsnap.c,
kmer-search.c, kmer-search.h, ladder.c, path-solve.c, path-solve.h,
resulthr.c, segment-search.c, segment-search.h, stage1hr.c, stage3hr.c,
stage3hr.h, stage3hrdef.h, substring.c, substring.h, terminal.c,
terminal.h: Merged revisions 218419 through 218472 from
branches/2019-02-19-restore-fusions
* index.html: Updated for latest version
2019-02-19 twu
* stage1hr.c: Using user_maxlevel_float for final filtering, and not for
searching
* concordance.c: Using score_ignore_trim instead of score_posttrim
* gsnap.c: Added option for --ignore-trim-in-filtering
* stage3hrdef.h: Changed field score_posttrim to score_ignore_trim
* stage3hr.c, stage3hr.h: Changed Stage3hr_filter_coverage to
Stage3hr_filter, which accounts for number of mismatches. Added parameter
ignore_trim_p. Changed score_posttrim to score_ignore_trim, and computing
this to be lower than score
2019-02-18 twu
* terminal.c: Using Univ_IIT_get_chrnum and new interface to Substring_new
* substring.c, substring.h: Substring_new now assumes that chrnum was set
correctly
* stage3hr.c: Using new interface to Substring_new
* stage1hr.c: Using new interface to Segment_identify procedures
* segment-search.c, segment-search.h: Handling alignments straddling more
than two chromosomes. Removed plusp as a parameter
* kmer-search.c: Using Univ_IIT_get_chrnum. Using new interface to
Substring_new
* iit-read-univ.c, iit-read-univ.h: Implemented Univ_IIT_update_chrnum and
Univ_IIT_get_chrnum
* extension-search.c: Calling Univ_IIT_update_chrnum to set chrnum
* stage3hr.c: If qend <= qstart, do not calculate number of nmismatches,
which is not defined. For nmatches, do not penalize for indels
* substring.c: If trimming changes querystart or queryend, recalculate
nmismatches
2019-02-15 twu
* stage1hr.c: Skipping alignment when querylength is less than index1part +
index1interval - 1
* segment-search.c: When advancing chrnum for straddled alignments, checking
that we do not go past the last chromosome
* VERSION: Updated version number
* indexdb-write.c: Added code for comparing counts with compression and
counts without compression
* bitpack64-write.c, bitpack64-write.h: For huge genomes, using an array of
UINT8 for calculations of genome position. Printing strerror for
file-related errors.
* substring.c: For circular chromosomes, checking if the entire substring
resides in the next chromosome and returning the circularpos at that query
position
* segment-search.c: When a straddle calls for advancing to a later
chromosome, using local data structures instead of mixing them with a call
to Univ_IIT_get_one
* path-solve.c: Using subtract_bounded to subtract the local1part amount
* localdb.c: In Localdb_get_diagonals, checking for low < high, since low ==
high can occur when a substring is at the beginning or end of a chromosome
* cigar.c: Checking for an initial M to be printed before printing any indel
or splice
2019-02-13 twu
* intersect-large.c, intersect.c: No longer initializing last_diagonal to be
0, and comparing against it, which fails if a diagonal is 0. Instead,
checking explicitly for the first case
2019-01-31 twu
* gsnap.c, path-solve.c, path-solve.h, segment-search.c, segment-search.h,
stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h: Removed GMAP parameters
from GSNAP code
* VERSION, config.site.rescomp.prd, archive.html, index.html: Updated for
latest version
* intersect-large.c, intersect.c: In Intersect_approx_lower and
Intersect_approx_higher, ignoring duplicates of diagonals0, in order to
ensure that the result is in ascending order
* substring.c: Using plusp in interpreting mandatory_trim_querystart and
mandatory_trim_queryend
* path-solve.c: When a mismatch extends a diagonal, computing the number of
mismatches at that time
* merge-diagonals-simd-uint4.c: Added code for checking that inputs are in
ascending order
* stage3hr.c: Improved tradeoffs between nmatches, nmatches_posttrim, splice
score, nsegments, insertlength, and outerlength in Stage3end_optimal_score
and Stage3pair_optimal_score_final
* extension-search.c, segment-search.c: Handling the case where the middle
or anchor diagonal straddles two chromosomes
2019-01-23 twu
* trunk, VERSION, src, Makefile.gsnaptoo.am, changepoint.h, compress.c,
concordance.h, diag.c, extension-search.h, gbuffer.c, gbuffer.h, genome.c,
genome.h, genome128_consec.c, genome128_hr.c, genome_hr.c, genome_sites.c,
gmap.c, gsnap.c, kmer-search.c, kmer-search.h, knownsplicing.c,
knownsplicing.h, oligoindex_localdb.h, outbuffer.h, output.c, pair.c,
pair.h, path-solve.c, path-solve.h, samprint.c, segment-search.c,
segment-search.h, simplepair.c, simplepair.h, splice.c, splice.h,
splicetrie.c, splicetrie.h, splicetrie_build.h, stage1hr.c, stage1hr.h,
stage3.c, stage3hr.c, stage3hr.h, substring.c, substring.h, types.h,
univcoord.h, univdiagpool.c: Merged revisions 218195 through 218285 from
branches/2019-01-17-split-gmap-gsnap to separate GMAP and GSNAP code
2019-01-22 twu
* stage1hr.c, cigar.c, path-solve.c: Added debugging statements
* stage3hr.c: At a given locus, checking for nmatches_posttrim before
checking for splice score
* splice.c, splice.h, substring.c: Added limit on consecutive matches in
scanning for spliceends
* path-solve.c: Restored previous code that did not look at extendedp
* splice.c: Added requirement for MIN_EXON_LENGTH in trimming ends
* gsnap.c: Added --sam_sparse_secondaries to omit SEQ and QUAL flags in
secondary alignments
* samprint.c, samprint.h: Using mate_plusp computed in Cigar_compute_main in
compute_flag, to give correct results for translocations. Added
sam_sparse_secondaries_p to omit SEQ and QUAL flags in secondary alignments
* path-solve.c: Restored usage of the sense condition to handle the
non-splicing condition
* filestring.c: Handling the case in Filestring_merge where source->string
is NULL
* path-solve.c: Checking if qstart or qend is extended from spliced
endpoints of middle diagonal, and if not extended, using the original
endpoints
* junction.c: Changed name of macro for debugging
* samprint.c, cigar.c: Handling the case where hard clipping removes all
substrings
* extension-search.c: Added debugging statement
2019-01-18 twu
* extension-search.c: Fixed bugs in process_seed for processing the
remainder of the queryfwd set or the queryrev set
* extension-search.c: Rewrote algorithm extensively to combine seeds and
sets from queryfwd and queryrev passes
* segment-search.c: Using new interface to Path_solve_from_diagonals
* path-solve.c, path-solve.h: Changed Path_solve_from_diagonals to take a
univdiagonal, qstart, and qend, instead of a Univdiag_T object
* kmer-search.c: Removed unused include file
2019-01-17 twu
* setup.genomecomp.ok: Revised gold standard for extra bytes at end
* segment-search.c: Fixed problem with allocation when total_npositions is
zero in Segment_identify_lower and Segment_identify_higher
* hitlistpool.c: Initial import
* gsnap.c: Remoed oligoindices_major, oligoindices_minor, pairpool,
diagpool, cellpool, and Dynprog_T objects as variables
* gmap.c: Using new interfaces to stage1, stage2, and stage3 procedures
* stage3.c, stage3.h: Removed unused parameters
* translation.c: Removed npairs as a parameter for backward procedures
* terminal.c, terminal.h: Removed mismatch_positions_alloc as a parameter
* stage2.c: Removed code based on anchoredp, anchor_querypos, and
anchor_position, which are now always false and 0
* stage1hr.c, stage1hr.h: Using new interfaces to kmer-search,
extension-search, terminal, and concordance procedures. Removed
oligoindices_minor, diagpool, and cellpool parameters to single_read and
paired_read procedures
* stage1.c, stage1.h: Using new interfaces to Block_process_oligo_5 and
Block_process_oligo_3. Removed sizelimit parameters to Stage1_compute
procedures
* splice.c, splice.h: Removed unused parameters to Splice_setup
* smooth.c: Removed exon_denominator as a parameter to
find_internal_bads_by_prob
* segment-search.c: Removed unused variable
* samprint.h: Removed preprocessor macros for GSNAP
* samprint.c: Using new interfaces to Substring_compute_chrpos and
Pair_print_sam
* path-solve.h: Removed interface to Path_solve_via_gmap
* path-solve.c: Using new interfaces to substring procedures
* pair.c, pair.h: Removed unused parameters for Pair_print_sam
* output.c: Using new interfaces to stage3 print procedures
* iit-write-univ.c: Removed omegas as a parameter to node_select
* iit-read.c, iit-read.h: Commented out obsolete procedure
2019-01-16 twu
* genome128_consec.c: Added macros around some procedures. Removed unused
procedures
* genome128_hr.c: Added macros around some procedures
* intersect.c, indexdb.c: Added LARGE_GENOMES macro to a procedure
* oligoindex_hr.c, oligoindex_hr.h: Removed unused parameters from
Oligoindex_untally
* kmer-search.c, kmer-search.h, extension-search.c, extension-search.h:
Removed unused parameters
* epu16-bitpack64-read.c, epu16-bitpack64-readtwo.c: Commented out print
procedures for debugging
* epu16-bitpack64-incr.c: Turned off CHECK macro
* dynprog_single.c, dynprog_single.h: Removed glengthL and glengthR as
parameters to Dynprog_microexon_int
* dynprog_genome.c, dynprog_genome.h: Removed unused parameters, including
calculation of canonical_reward
* distant-rna.c, distant-rna.h: Removed user_maxlevel as a parameter
* datadir.c: Removed unused variables
* concordance.c, concordance.h: Using new interface to Stage3pair_new.
Removed unused parameters
* stage3hr.c, stage3hr.h: Removed oligoindices_minor, diagpool, and cellpool
as parameters, used previously for resolving insides
* compress-write.c: Using different format statements for Univcoord_T
variables
* cigar.c: Removed trimlength as a parameter to length_cigar_M. Using new
interface to Substring_compute_chrpos
* substring.c, substring.h: Removed plusp as a parameter to
embellish_genomic and hardclip_high as a parameter to
Substring_compute_chrpos
* reader.c, reader.h: Commented out unused procedures
* block.c, block.h: Removed indexdb_sizelimit as a parameter to
Block_process_oligo_5 and Block_process_oligo_3
* atoiindex.c, cmetindex.c, snpindex.c: Using new interfaces to
Indexdb_bitpack_counter and Localdb_new_genome
* localdb.c, localdb.h: Removed expand_offsets_p as a parameter to
Localdb_new_genome
* indexdb-write.c, indexdb-write.h: Removed offsetsstrm and offsetspages as
parameters to Indexdb_bitpack_counter and Indexdb_bitpack_counter_huge
* trunk, VERSION, src, Makefile.gsnaptoo.am, access.c, atoiindex.c,
boyer-moore.h, cellpool.c, chrom.h, cigar.c, cmetindex.c,
compress-write.h, concordance.c, concordance.h, diag.h, diagpool.c,
distant-rna.c, distant-rna.h, extension-search.c, extension-search.h,
filestring.c, genome128_hr.c, genome_sites.h, genomicpos.c, gmap.c,
gmapindex.c, gregion.c, gsnap.c, hitlistpool.h, indel.c, indexdb.c,
indexdb.h, intersect-large.h, intersect.c, intersect.h, intlist.c,
intlist.h, intlistdef.h, intlistpool.c, intlistpool.h, junction.c,
junction.h, kmer-search.c, kmer-search.h, ladder.c, ladder.h, list.h,
listdef.h, listpool.c, listpool.h, localdb.c, localdb.h, localdbdef.h,
matchpool.c, maxent_hr.h, mem.h, merge-diagonals-heap.h,
merge-diagonals-simd-uint4.c, merge-diagonals-simd-uint4.h,
merge-diagonals-simd-uint8.h, merge-uint4.c, method.c, oligoindex_hr.c,
oligoindex_hr.h, outbuffer.c, outbuffer.h, output.c, output.h, pair.c,
pair.h, pairpool.c, path-solve.c, path-solve.h, result.h, resulthr.h,
samprint.c, samprint.h, segment-search.c, segment-search.h, shortread.c,
shortread.h, splice.c, splice.h, splicestringpool.c, splicetrie.c,
splicetrie_build.h, stage1hr.c, stage1hr.h, stage3.c, stage3.h,
stage3hr.c, stage3hr.h, stage3hrdef.h, substring.c, substring.h,
terminal.c, terminal.h, types.h, uint8listpool.c, uint8listpool.h,
uint8table_rh.c, uint8table_rh.h, uintlist.c, uintlist.h, uintlistpool.c,
uintlistpool.h, uinttable.c, uinttable_rh.c, uinttable_rh.h, uniqscan.c,
univcoord.h, univdiag.c, univdiag.h, univdiagdef.h, univdiagpool.c,
univdiagpool.h, univinterval.h: Merged revisions 216893 to 218146 from
branches/2018-10-08-path-solve to improve path-solve procedure
2018-10-18 twu
* genome-write.c: Adding 2 words to the end of genomecomp, needed for
accessing nextlow (ptr+4) in the fwd_partial and rev_partial procedures in
oligoindex_hr.c
2018-10-10 twu
* genome.c, bitpack64-serial-write.c, bitpack64-write.c, genome-write.c,
genome128.c, gmapindex.c, indexdb-write.c, indexdb.c, indexdb_hr.c,
kmer-search.c, outbuffer.c, pair.c, parserange.c, stage3.c, stage3hr.c,
substring.c: Replaced occurrences of 1U with 1
* sedgesort.c: Fixed bug in Sedgesort_uint8 where we assigned -1U instead of
(UINT8) -1 as the sentinel value
2018-10-09 twu
* path-solve.c: Removing endpoints from the left and right to see if a
continuing alignment works
* trunk, src, Makefile.gsnaptoo.am, concordance.c, concordance.h,
distant-rna.c, distant-rna.h, extension-search.c, extension-search.h,
gsnap.c, intlistpool.c, junction.c, junction.h, kmer-search.c,
kmer-search.h, list.h, listpool.c, listpool.h, pair.c, pair.h,
path-solve.c, path-solve.h, segment-search.c, segment-search.h,
stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h, terminal.c, terminal.h:
Merged revision 216940 from branches/2018-10-10-reduce-list-push to add
Listpool_T object for lists of substrings and junctions
* trunk, VERSION, src, Makefile.gsnaptoo.am, intersect-large.c,
intersect-large.h, intersect.c, intersect.h, path-solve.c, record.h,
segment-search.c, segment-search.h, stage1hr.c, stage1hr.h: Merged
revisions 216922 through 216936 from branches/2018-10-09-merge-records to
replace Merge_records procedures in segment search with Merge_diagonals
* localdb.c: Allocating extra space for array, needed for Sedgesort
* concordance.c: Removed unused code for filtering paired-end hits
* concordance.c: Restored computation of abort_pairing_p
2018-10-08 twu
* trunk, VERSION, src, Makefile.gsnaptoo.am, distant-rna.c, distant-rna.h,
extension-search.c, filter-diagonals.c, filter-diagonals.h, gsnap.c,
kmer-search.c, kmer-search.h, localdb.c, localdb.h,
merge-diagonals-heap.c, merge-diagonals-heap.h,
merge-diagonals-simd-uint4.c, merge-diagonals-simd-uint4.h,
merge-diagonals-simd-uint8.c, merge-diagonals-simd-uint8.h,
merge-records-heap.c, merge-records-heap.h, merge-records-simd.c,
merge-records-simd.h, merge-uint4.c, path-solve.c, path-solve.h,
segment-search.c, segment-search.h, splice.c, splice.h, stage1hr.c,
stage1hr.h, stage3hr.c, stage3hr.h, substring.c, substring.h, terminal.c,
terminal.h: Merged revisions 216889 to 216917 from
branches/2018-10-07-filter-diagonals to introduce a filtering step before
segment search, and to pre-allocate memory for Merge_records,
Merge_diagonals, Splice_resolve, and Substring_new procedures
* src, kmer-search.c: Fixed double-assignment of variable
* method.c, method.h, segment-search.c, segment-search.h, stage1hr.c:
Distinguishing between segment search for single-end reads and segment
search for anchored paired-end reads
2018-10-07 twu
* trunk, VERSION, config.site.rescomp.prd, src, Makefile.gsnaptoo.am,
concordance.c, concordance.h, extension-search.c, extension-search.h,
gsnap.c, intersect-large.c, intersect-large.h, intersect.c, intersect.h,
intlistpool.c, intlistpool.h, intpool.c, intpool.h, kmer-search.c,
kmer-search.h, ladder.c, ladder.h, localdb.c, mem.h,
merge-diagonals-heap.c, merge-diagonals-heap.h,
merge-diagonals-simd-uint4.c, merge-diagonals-simd-uint4.h,
merge-diagonals-simd-uint8.c, merge-diagonals-simd-uint8.h,
merge-heap-diagonals.c, merge-heap-diagonals.h, merge-heap-records.c,
merge-heap-records.h, merge-records-heap.c, merge-records-heap.h,
merge-records-simd.c, merge-records-simd.h, merge-simd-diagonals.c,
merge-simd-diagonals.h, merge-simd-records.c, merge-simd-records.h,
merge-uint4.h, merge-uint8.c, merge-uint8.h, method.c, method.h,
path-solve.c, path-solve.h, sedgesort.c, sedgesort.h, segment-search.c,
segment-search.h, stage1hr.c, stage1hr.h, stage3hr.c: Merged revisions
216741 to 216887 from branches/2018-10-01-gsnapl-speed to increased speed
of GSNAP and GSNAPL, especially for paired-end reads
* index.html: Updated for current version
2018-10-03 twu
* concordance.c, concordance.h, stage1hr.c: Using new interfaces to
Stage3pair_new and Concordance_pair_up procedures
* stage3hr.c, stage3hr.h: No longer filtering substrings based on
endtrim_allowed on one side. Performing resolve_insides at end of
Stage3pair_new
* segment-search.c: Fixed debugging statements
* path-solve.c: No longer calling a check creation of a substring of the
middle diagonal
* kmer-search.c: Hiding a debugging procedure
* acinclude.m4, simd-intrinsics.m4, configure.ac, genome128_consec.c,
genome128_hr.c: Added compiler checks for the SIMD intrinsics
_mm_extract_epi64 and _mm_popcnt_u64, and using them
* extension-search.c: Removed unused debugging procedure
* epu16-bitpack64-write.c: Modified comments
* bigendian.c, bigendian.h: Implemented FWRITE_USHORT and FWRITE_USHORTS for
bigendian machines
2018-10-02 twu
* substring.c: Fixed uninitialized fields querystart_pretrim and
queryend_pretrim in Substring_T object
2018-07-05 twu
* trunk, VERSION, config.site.rescomp.tst, index.html, src, path-solve.c:
Changed check on sense_endpoints to antisense_endpoints before trying to
remove an end segment
2018-06-29 twu
* types.h: Merged revisions 215752 to 215897 from
branches/2018-06-15-path-solve-junctions to define Univcoordlist_pop
* stage3hrdef.h: Merged revisions 215752 to 215897 from
branches/2018-06-15-path-solve-junctions to add nmatches_amb as a field
* stage3hr.c, stage3hr.h: Merged revisions 215752 to 215897 from
branches/2018-06-15-path-solve-junctions to consider trim amount in the
final comparison among alignments
* stage3.h: Merged revisions 215752 to 215897 from
branches/2018-06-15-path-solve-junctions to take queryseq as an argument
in merge procedures
* stage3.c: Merged revisions 215752 to 215897 from
branches/2018-06-15-path-solve-junctions to copy all pairs when performing
a merge, to peelback to indels on the medial side when taking the
continuous solution, and to remove indels in insert_gapholders
* pair.c, pair.h: Merged revisions 215752 to 215897 from
branches/2018-06-15-path-solve-junctions to implement Pair_split_circular
* junction.c, junction.h: Merged revisions 215752 to 215897 from
branches/2018-06-15-path-solve-junctions to implement Junction_new_generic
* gmap.c: Merged revisions 215752 to 215897 from
branches/2018-06-15-path-solve-junctions to split circular alignments to
cross the origin
* chimera.c, chimera.h: Merged revisions 215752 to 215897 from
branches/2018-06-15-path-solve-junctions to disallow chimeras to circular
chromosomes
* path-solve.c: Merged revisions 215752 to 215897 from
branches/2018-06-15-path-solve-junctions to create a new junction at the
end, rather than use a precomputed junction
* path-solve.c: At ends, pushing NULL instead of the previously computed
junction
* pair.c, pair.h: Implemented checking procedures
2018-06-15 twu
* stage3hr.c: Removed extraneous characters in debugging statement
2018-06-14 twu
* stage3.c: Fixed handling of end exons in end trimming procedures. Using
new interfaces to Pair_clip_bounded_list_5 and Pair_clip_bounded_list_3
* pair.c, pair.h: Implemented separate 5' and 3' versions of
Pair_clip_bounded_list
* chimera.c: Fixed debugging statements to use new interface to
Sequence_stdout
2018-05-30 twu
* pair.c: In converting pairarray to substrings, now resetting exon
variables after an insertion or deletion
* iit-read.c: Added a warning in using IIT_read for a version 1 IIT
* iit_get.c, iit_store.c: Fixed some memory leaks
* getline.c, getline.h: Returning string_length with Getline_wlinefeed
* iit_store.c: Using Getline_wlinefeed instead of Getline_wlength
* stage1hr.c: Fixed uninitialized variable
* substring.c: Fixed assertion
* stage3hr.c: Checking if there is enough space at ends of the chromosome
before resolving inner exons
* gmapindex.c: Assigning variable and not pointer when clearing empty space
at end of line
* atoiindex.c, cmetindex.c, indexdb-write.c, snpindex.c: Calling
Access_allocate_private properly for machines where mmap is not available
or disabled
* gregion.c: Improved procedure for finding unique gregions by sorting by
support instead of weight, by checking if query coordinates are
consistent, and handling cases where endpoints are equal
* Makefile.gsnaptoo.am: Not making iit_pileup
2018-05-25 twu
* Makefile.gsnaptoo.am: Added getline.c and getline.h to library
* get-genome.c: Using variable line instead of Buffer
* trunk, config.site.rescomp.prd, config.site.rescomp.tst, index.html, src,
gmap.c, oligoindex_hr.c, stage2.c, stage3.c: Merged revisions 215481
through 215483 from branches/2018-05-25-restore-compute-ends to restore
GMAP behavior from 2018-03-20 and use Stage3middle_T,
Stage3_compute_middle, and Stage3_comput_ends for better alignment at ends
* datadir.c: Fixed extraneous parenthesis
* datadir.c, Makefile.gsnaptoo.am, iit_get.c, iit_store.c: Using Getline
* gmap.c, chrsubset.c, iit-read-univ.c, iit-read.c, iit_store.c, samread.c,
sequence.c, shortread.c, splicing-scan.c, stage2.c, stage3hr.c: Ensuring
that calls to strncpy are followed by setting the end to be '\0'. Using
malloc instead of calloc in these situations
* Makefile.gsnaptoo.am, datadir.c, get-genome.c, getline.c, getline.h,
gmapindex.c: Using calls to Getline to prevent problems with buffer
overflow
* substring.c: Fixed calculation of mandatory_trim_left and
mandatory_trim_right for alias to be positive, rather than negative
2018-05-11 twu
* VERSION: Updated version number
* src: Made changes
* fa_coords.pl.in, gmap_build.pl.in, gmap_process.pl.in: Added options to
handle fastq files and to reverse complement all sequences
* stage1hr.c: No longer iterating through Segment_search
* stage1hr.c: Not calling Stage1_init or other Stage1 procedures when
querylength < index1part
* substring.c: Not doing aliasing and unaliasing on alt substrings.
Computing mandatory_trim_querystart and mandatory_trim_queryend instead of
mandatory_trim_left and mandatory_trim_right
* trindex.c: Revised instructions
* iit-read-univ.c, iit_get.c, iit_pileup.c, iit_tally.c: Freeing div name
allocated by new versions of Parserange routines
* genome128_consec.c, genome128_hr.c: Added header files needed for SSSE3
computers
2018-04-30 twu
* shortread.c: Formatting change
* parserange.c: Allow colons in accession names
* gmapindex.c: Allow colons in accession. Implementing revcomp by using '-'
sign for contig length
* get-genome.c: Freeing chromosome string after call to Parserange
2018-04-21 twu
* stage1hr.c: Merged revision 214805 from branches/2018-04-21-fix-anchors to
use Solve_segment_all instead of Concordance_filter_records for paired-end
reads
* stage3hr.c, stage1hr.c: Added variable remap_transcriptome_p
* stage3.c: Checking for a NULL stage3 in Stage3_split
* intersect.c: Fixed comments
* stage1.c: Using Oligospace_T to cast 0
* kmer-search.c: Fixed issues with transcript coordinates
2018-04-20 twu
* gmap.c, gsnap.c, inbuffer.c, inbuffer.h, sequence.c, sequence.h,
shortread.c, shortread.h: Adding a command-line option --read-files-command
* extension-search.c, segment-search.c, stage1hr.c: Using (Oligospace_T) as
a cast for 0
* Makefile.gsnaptoo.am, fopen.c, fopen.h: Added a command-line option
--read-files-command
* gsnap.c: Changed flag from --use-transcriptome to --use-transcriptome-only
2018-03-25 twu
* samprint.c: Using new interface to Pair_print_sam
* trunk, config.site.rescomp.prd, index.html, src, block.c, gmap.c,
indexdb.c, oligoindex_hr.c, pair.c, stage1.c, stage2.c, stage3.c,
translation.c: Merged revisions 214439 through 214446 from
branches/2018-03-24-restore-gmap to restore speed and sensitivity of GMAP
by eliminating use of Stage3_compute_ends, restoring use of multiple
oligoindices, and fixing CDS phases for GFF3 output
2018-03-24 twu
* indexdb.c: Removed option for expanding offsets
* gmap.c: Fixed memory issues with new calls to Stage3_merge_local and
Stage3_merge_chimera, and appending middlepieces to stage3list. Changed
default gff3_cds to be genomic. Ignoring option for --expand-offsets
* stage3.c, stage3.h: Removed cigar_tokens and intronp as fields for
Stage3_T objects. Implemented Stage3_copy and using it in
Stage3_merge_local and Stage3_merge_chimera
* pair.c, pair.h: Removed cigar_tokens and intronp as paramters to
Pair_print_sam
2018-03-23 twu
* substring.c: Added assertions about alts, to make sure we don't use
alignstart_trim or alignend_trim fields in those cases
* stage3hr.c: Checking for alts when getting chrpos_low or chrpos_high
* samprint.c: Formatting changes
* dynprog.h: Setting AMBIGUOUS score to be 3, to make cmet and atoi
alignments equivalent to standard
* cigar.c: For sam_hardclip_use_S option, returning hardclips to be 0 so
they don't affect the query sequence in SAM output
* stage3.c: Restored force of single gaps, to avoid problems with
add_dual_break later
2018-03-21 twu
* iit_tally.c: Initial import
2018-03-20 twu
* VERSION: Updated version number
* trunk, src, dynprog.c, dynprog.h, dynprog_cdna.c, dynprog_cdna.h,
dynprog_end.c, dynprog_end.h, dynprog_genome.c, dynprog_genome.h,
dynprog_simd.c, dynprog_simd.h, dynprog_single.c, dynprog_single.h,
gmap.c, gsnap.c, pair.c, path-solve.c, splicetrie.c, splicetrie.h,
stage3.c, stage3.h: Merged revisions 214343 through 214360 from
branches/2018-03-20-cmet-gmap to make non-standard modes work in GMAP
alignments
* sam-exons.pl.in: Initial import
* indexdb.c: Fixed preprocessor macro
2018-03-19 twu
* VERSION: Updated version number
* trunk, VERSION, config.site.rescomp.tst, configure.ac, index.html, src,
Makefile.gsnaptoo.am, atoi.c, atoi.h, atoiindex.c, cmet.c, cmet.h,
cmetindex.c, epu16-bitpack64-access.c, epu16-bitpack64-access.h,
epu16-bitpack64-incr.c, epu16-bitpack64-incr.h, epu16-bitpack64-read.c,
epu16-bitpack64-read.h, epu16-bitpack64-readtwo.c,
epu16-bitpack64-readtwo.h, epu16-bitpack64-write.c,
epu16-bitpack64-write.h, extension-search.c, extension-search.h,
genomicpos.c, genomicpos.h, gsnap.c, indexdb.h, indexdbdef.h,
kmer-search.c, kmer-search.h, localdb-write.c, localdb.c, localdb.h,
localdbdef.h, oligo.c, path-solve.c, path-solve.h, reader.c, stage1hr.c,
types.h: Merged revisions 214117 through 214304 from
branches/2018-03-11-cmet-localdb to implement non-standard modes with
support for localdb
* substring.c: Removed an assertion that is not always valid
* stage3hr.c, stage3hr.h: Commented out code for Stage3end_substring_high,
which is not used anymore
* samprint.c: Fixed typo in using querylength5 instead of querylength3 for
mate
* samflags.h: Incremented values for number of filestream outputs, to handle
XS output files properly
* outbuffer.c: Always skipping output for OUTPUT_NONE filestream
* Makefile.gsnaptoo.am: Hiding program iit_pileup
* oligoindex_hr.c, localdb.c: Allowing debugging statements to work
* kmer-search.c: Added debugging statements
* indexdb.c: Distinguishing between GSNAP pointer procedures, which need to
handle negative diagterms, and GMAP read procedures, where diagterms are
always non-negative
* gsnap.c: Added option for --sam-hardclip-use-S
* cigar.c, cigar.h: Calling low substring for all circular chromosome hits
to get correct chrpos in SAM output. Added provisions for
SAM_hardclip_use_S_p
2018-03-10 twu
* substring.c: Changed assertion to allow for '*'
* stage3.c: Added a clip of end5 against the chromosomal bound for
alignments on the minus strand
* intersect.c: Commenting out advance of positions past -diagterm
* indexdb.c: No longer checking for diagterm < 0, since diagterm <= 0 should
be true
* extension-search.c: Reverted advance of positions past -diagterm and added
assertions instead. Changed algorithm to not add an elt when the
positions are invalid.
* shortread.c: Preventing invalid read when accession contains only one or
two characters
* kmer-search.c, merge-heap-diagonals.c, merge-heap-records.c,
merge-simd-diagonals.c, merge-simd-records.c: Added assertions to check
that positions >= -diagterm
* extension-search.c, intersect.c: Skipping or advancing positions so they
exceed -diagterm
* oligo.c: Fixed initialization code so that Clang compiler will accept it
2018-03-09 twu
* localdb-write.c: Added a message to indicate when writing is done
* Makefile.gsnaptoo.am, iit_pileup.c: Added ability for the user to specify
a genomic range
* stage3hr.c, stage3hr.h: Fixed bug with circular read not being unaliased
because of a soft clip. Removed macro for soft clips avoiding
circularization. Removed circularalias values of +2 and -2, since
alignments now should stay within chromosomal bounds
* iit_pileup.c: Added code for using filestrings
* Makefile.gsnaptoo.am, iit-read.c, iit-read.h, iit_pileup.c: Restored
iit_pileup program. Rewrote to take a FASTA file as input
2018-03-05 twu
* stage3.c: Added back declaration of variables
* stage3.c: Removed unused variables and parameters
* samprint.c: Removed unused static variable
* path-solve.c: Using new interface to Stage3end_new_gmap
* stage3hr.c, stage3hr.h: Removed unused parameters to Stage3end_new_gmap.
Removed unused code
* gsnap.c: Removed unused parameter to process_request
* extension-search.c: Removed unused code
* stage1hr.c: Fixed memory error when abort_pairing_p is true
2018-03-04 twu
* localdb.c: Defined variables needed for large genomes
* trunk, VERSION, src, block.c, block.h, distant-rna.c, distant-rna.h,
extension-search.c, gmap.c, gsnap.c, kmer-search.c, kmer-search.h,
ladder.c, ladder.h, localdb.c, localdb.h, oligo.c, oligo.h, pair.c,
pair.h, path-solve.c, path-solve.h, segment-search.c, segment-search.h,
splice.c, splice.h, stage1.c, stage1.h, stage1hr.c, stage1hr.h, stage2.c,
stage3.c, stage3hr.c, stage3hr.h, substring.c, substring.h, terminal.c,
terminal.h: Merged revisions 213978 through 214024 from
branches/2018-03-01-ptr-and-diagterm to add potential support for cmet,
atoi, and ttoc modes
2018-03-03 twu
* cigar.c, cigar.h, gsnap.c: Obey behavior of --sam-use-0M flag
2018-03-02 twu
* trunk, config.site.rescomp.prd, src, Makefile.gsnaptoo.am, access.c,
atoiindex.c, bitpack64-readtwo.c, cmetindex.c, epu16-bitpack64-read.c,
epu16-bitpack64-readtwo.c, epu16-bitpack64-write.c, extension-search.c,
filesuffix.h, genome-write.c, genome128_hr.c, gmapindex.c, gsnap.c,
iit-write.c, iit_store.c, indel.c, indexdb-write.c, indexdb.c, indexdb.h,
indexdbdef.h, intersect.c, intersect.h, kmer-search.c, kmer-search.h,
localdb.c, merge-heap-diagonals.c, merge-heap-diagonals.h,
merge-heap-records.c, merge-heap-records.h, merge-simd-diagonals.c,
merge-simd-diagonals.h, oligoindex_hr.c, pair.c, path-solve.c,
segment-search.c, segment-search.h, smooth.c, snpindex.c, stage1hr.c,
stage1hr.h, stage2.c, stage3.c, stage3hr.c, substring.c, terminal.c,
types.h: Merged revisions 213921 to 213977 from
branches/2018-03-01-ptr-and-diagterm to add back support for large genomes
* stage3hr.c: Improved debugging statements
* kmer-search.c: Removed extraneous calls to free lists
* ladder.c, ladder.h: Implemented Ladder_cutoff to allow for ladders, but
restrict computation to MAX_HITS
* concordance.c: Calling Ladder_cutoff instead of Ladder_maxscore
* cigar.c: Fixed computation of SAM chrpos. Always calling
Stage3end_substring_low for a single SAM line
* concordance.c: Applying MAX_HITS to Concordance_pair_up_distant
2018-03-01 twu
* stage1hr.c: Fixed a typo: querypos_rc instead of querypos for minus
positions
* indexdb.c: Hiding Indexdb_ptr_with_diagterm from utility programs
* pairpool.c, pairpool.h, stage3.c: Using a better method for handling
chromosomal bounds in GMAP, by trimming pairs, rather than the pairarray
* trunk, VERSION, config.site.rescomp.prd, config.site.rescomp.tst, src,
Makefile.gsnaptoo.am, extension-search.c, extension-search.h, indexdb.c,
indexdb.h, intersect.c, intersect.h, kmer-search.c, kmer-search.h,
localdb.c, merge-heap-diagonals.c, merge-heap-diagonals.h,
merge-heap-records.c, merge-heap-records.h, merge-simd-diagonals.c,
merge-simd-diagonals.h, merge-simd-records.c, merge-simd-records.h,
segment-search.c, stage1hr.c, stage1hr.h: Merged revisions 213873 through
213921 from branches/2018-03-01-ptr-and-diagterm to use pointers to
positions and diagterms, rather than copying positions
* substring.c: In Substring_set_alt, handling amb_splice_pos properly as
being defined from left, rather than querystart
* Makefile.util.am: Added instructions for bitpack64-test
* segment-search.c: Freeing lists provided to Merge_records
* path-solve.c: Using FREE_ALIGN for freeing results of merge procedures
* localdb.c: Localdb_read_with_bounds now always returns aligned memory,
which can happen if merging is performed. Using new interface to merge
procedures
* kmer-search.c: Fixed a memory leak in exact procedure when the ends have
invalid oligos
* merge-heap-diagonals.c, merge-heap-records.c, merge-simd-diagonals.c,
merge-simd-records.c: Merge procedures no longer free their input lists or
streams. Streams can be aligned or not, so the caller needs to free them
* pair.c, pairpool.c, pairpool.h, stage3.c: Prevent GMAP results from going
beyond chromosomal bounds, when making the final pairarray
2018-02-28 twu
* stage1hr.c: Using new interfaces to exact and approx algorithms
* stage3hr.c, stage3hr.h: Renamed Stage3end_list_free to Stage3end_gc
* segment-search.c: Fixing memory leak
* kmer-search.c, kmer-search.h: Limiting results for exact algorithm with
max_hits, and limiting approx algorithm with both max_hits and sizelimit
* concordance.c: Limiting results for newladder with MAX_HITS
* uint8table.c, uint8table.h: Initial implementation
* Makefile.gsnaptoo.am, stage1hr.c: Using a Uint8table_T object to call
repeated occurrences of an oligo invalid
* substring.c: Allocating querylength+1 for mismatch_positions
* path-solve.c: Making a better call to Substring_new for the middle
diagonal, and handling the case where the result is NULL
* gsnap.c: Including header for concordance.h
* terminal.c, stage3hr.c: Using new interface to Substring_new
* path-solve.c: Removed all instances of middle_path
* extension-search.c, kmer-search.c: Using new interfaces to
Path_solve_from_diagonals and Substring_new
* segment-search.c: Using new interface to Path_solve_from_diagonals.
Letting that procedure determine which left and right diagonals are in the
correct chrnum
* path-solve.c, path-solve.h: Path_solve_from_diagonals now takes a middle
diagonal, rather than a middle path. Sets chrnum based on middle
diagonal, as determined by Substring_new. Also considers only left and
right diagonals in that chrnum.
* substring.c, substring.h: Substring_new now takes chrnum_fixed_p as a
parameter. If true, then the procedure does not recompute chromosomal
bounds
2018-02-27 twu
* segment-search.c: Using left instead of lowpos to determine chrnum, to be
safe
* substring.c: Checking for substring being below given chroffset, and
recomputing chromosomal bounds
* segment-search.c: When adding segments to left and right, checking to make
sure they fall at least partially into the same chromosome as the anchor
segment
* pair.c: Fixed argument for setting querystart_pretrim and queryend_pretrim
* substring.c: Fixed typo
* pair.c, substring.c, substring.h: Computing more accurate values for
querystart_pretrim and queryend_pretrim. Removed assertion about
left_bound and right_bound in Substring_count_mismatches_region
* concordance.c, stage1hr.c, stage3hr.c, stage3hr.h, stage3hrdef.h: Removed
private5p and private3p fields from Stage3pair_T object. Always copying
hits when making a pair, needed because the concordance procedure can now
delete hits
* ladder.c: Assigning a value to nhits in all cases from
Ladder_hits_for_score
* concordance.c, stage1hr.c: Restored assignment of abort_pairing_p
* ladder.c, ladder.h, stage3hr.c, stage3hr.h: Removing duplicates before
returning hits at a given score. Added a duplicates field to hold
duplicate hits, and a procedure for deleting them
2018-02-26 twu
* filestring.c: Now handling %p in format statement
* stage3hr.c: Changed one occurrence of Substring_ambiguous_p to
Substring_has_alts_p
* trunk, src, Makefile.gsnaptoo.am, cigar.c, concordance.c, concordance.h,
distant-rna.c, distant-rna.h, gsnap.c, kmer-search.c, kmer-search.h,
ladder.c, ladder.h, merge-heap-records.c, merge-heap-records.h,
merge-simd-records.c, merge-simd-records.h, merge-uint4.c, method.c,
method.h, pair.c, pair.h, path-solve.c, record.h, samprint.c,
segment-search.c, segment-search.h, splice.h, stage1hr.c, stage1hr.h,
stage3hr.c, stage3hr.h, stage3hrdef.h, substring.c, substring.h,
terminal.c, terminal.h, types.h, univdiag.c: Merged revisions 213654 to
213758 from branches/2018-02-22-limit-segment-search to add concordance
procedures, ladders, and filters for segment search, and to revise the
single-end and paired-end procedures
2018-02-22 twu
* trunk, VERSION, config.site.rescomp.tst, src, Makefile.gsnaptoo.am,
block.c, extension-search.c, genomicpos.h, gmap.c, gsnap.c, indexdb.c,
indexdb.h, intersect.c, intersect.h, intlist.c, intlist.h, junction.c,
kmer-search.c, localdb.c, localdb.h, merge-heap-diagonals.c,
merge-heap-diagonals.h, merge-heap-records.c, merge-heap-records.h,
merge-heap.c, merge-heap.h, merge-simd-diagonals.c,
merge-simd-diagonals.h, merge-simd-records.c, merge-simd-records.h,
merge-uint4.c, merge-uint4.h, merge.c, merge.h, oligo.c, pair.c, pair.h,
path-solve.c, segment-search.c, segment-search.h, splice.c, splicetrie.c,
splicetrie.h, stage1.c, stage1hr.c, stage1hr.h, stage2.c, stage2.h,
stage3.c, stage3hr.c, stage3hr.h, substring.c, substring.h, types.h,
uint8list.c, uint8list.h, uintlist.h, uniqscan.c, util, gmap_build.pl.in:
Merged revisions 213593 through 213652 from
branches/2018-02-21-large-genomes to add support for large genomes
2018-02-21 twu
* stage1hr.c: Checking for a set with zero elements before calling
Orderstat_int_pct
* segment-search.c: Excluding invalid oligos
* path-solve.c: When novelsplicing is false, creating just one hit
* oligo.c: Commented out unused procedures
* kmer-search.c: Handling invalid oligos correctly
* stage1hr.c: Making poly_A and poly_T oligos not valid. Added a min
sizelimit
* stage1hr.c: Added comment
* cigar.c, samprint.c, stage3hr.c, stage3hr.h: Removed code designed for old
meaning of Substring_ambiguous_p, now distinct from Substring_has_alts_p
* indexdb.c: Commented out SIMD procedures for utility programs. Added
support for AVX2
* trunk, src, extension-search.c, extension-search.h, indexdb.c, indexdb.h,
kmer-search.c, kmer-search.h, path-solve.c, path-solve.h,
segment-search.c, segment-search.h, splice.c, stage1hr.c, stage1hr.h,
stage3hr.c, stage3hr.h, substring.c, substring.h, types.h: Merged
revisions 213472 through 213572 from branches/2018-02-16-faster-one-kmer
to increase speed and accuracy
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst, index.html:
Updated for latest version
* stage1hr.c: Added debugging statements
2018-02-16 twu
* stage3.c: Replaced check with an assertion
* gmap.c: Filtering out Stage3_T objects with zero npairs
* junction.c, junction.h: Implemented Junction_typestring
* stage3.c: Making explicit check for npairs being 0 in Stage3_new
* pair.c: Handling extra exons with a specific type and transition rules
* gmap.c: Checking for Stage3_T object having zero pairs before pushing onto
list
* cigar.c: Checking hardclip when printing insertion token. Calling either
Stage3end_substring_low or Stage3end_substring_high
* stage3hr.c: Added provisions for junctions_HtoL
2018-02-13 twu
* localdb.c: Performing a check for possible negative coordinates
* cigar.c, cigar.h: Storing value of extended_cigar_p during setup
* gsnap.c: Added flags for --endtrim-length and --sam-extended-cigar
* stage3.c: In Stage3_recompute_coverage, added a check for npairs being 0
* shortread.c, shortread.h: Added support for end (3') trimming of each read
* kmer-search.c: Improved debugging statements
* gmap.c: Added checks for breakpoint being invalid, before running
Stage3_mergeable
* chimera.c: Added checks for npairs being zero
* gsnap.c: Removed the --expand-offsets flag
* access.c: Put a limit on the maximum number of attempts to kill an
unattached shared memory segment
* splice.c, substring.c: Checking for cases where the splice search
boundaries yield a negative coordinate. Redefined middle coordinate to be
alignstart and alignend, and not the positions 1 bp distal to these
coordinates.
* stage1hr.c: Increased all sizelimits for diagonals from 100 to 3000
* stage3.c: Defined all parameters for allowed iterations to be 1
* stage2.c: Added a step to filter stage2 middles, starts, and ends for
uniqueness
2018-02-11 twu
* substring.c: Improved debugging statement
* stage3hr.c: Added a comment
* splice.c: Improved debugging statements
* stage3hr.c: Now allowing Stage3pair_resolve_insides to resolve both ends
at the same time
* VERSION: Updated version number
* stage3hr.c: Added debugging statements
2018-02-10 twu
* pair.c: Fixed counting routines to skip over bad pairs correctly
* substring.c: Removed unused code
* stage3hr.c: In Stage3pair_resolve_insides and resolve_inside_general
procedures, when hits are changed for the hitpair, revising the values of
alts_resolve_5 and alts_resolve_3. Fixes a seg fault when
Stage3pair_eval_and_sort tries to resolve the hits a second time
* path-solve.c, samprint.c, stage3hr.c, stage3hr.h, substring.c,
substring.h: Changed variable names to distinguish between an ambiguous
splice length and alternative splice coords
* trunk, config.site.rescomp.tst, src, pair.c, pair.h, segment-search.c,
segment-search.h, stage1hr.c, stage3.c, stage3hr.c: Merged revisions
213287 through 213291 from branches/2018-02-10-fix-bugs to fix bugs
related to pairarrays that crossed chromosomal bounds, segments subsuming
others, and handling of dual breaks
* pair.c: Hiding function Pairarray_convert_to_substrings from GMAP
* trunk, index.html, src, cigar.c, cigar.h, distant-rna.c,
extension-search.c, filestring.c, filestring.h, gsnap.c, kmer-search.c,
output.c, pair.c, pair.h, path-solve.c, samprint.c, samprint.h,
segment-search.c, splice.c, stage1hr.c, stage3hr.c, stage3hr.h,
substring.c, substring.h: Merged revisions 213162 through 213277 from
branches/2018-02-07-improve-circular to standardize printing of circular
and translocation alignments
* stage3hr.c: Fixed printing of method label
2018-02-08 twu
* extension-search.c: Making sure that queryoffset does not go outside
bounds of 0 to query_lastpos
2018-02-07 twu
* substring.c: In Substring_new, giving initial values to trim_left and
trim_right
* substring.c: In Substring_new, calling trim_left_end and trim_right_end
from 0 and querylength, but trim_novel_spliceends from given querystart
and queryend. If novel spliceends yields a short exon, then using the
non-spliced trimming results.
* stage3hr.c: Calling compute_circularpos on all Substring_T objects
* samprint.c: Added debugging statements
* substring.c: In Substring_new, initializing necessary values of
Substring_T object earlier than first possible abort and call to
Substring_free
2018-02-06 twu
* trunk, config.site.rescomp.tst, index.html, src, extension-search.c,
gmap.c, oligo.c, oligo.h, reader.c, reader.h, stage1hr.c, svncl.pl: Merged
revisions 213097 through 213130 from branches/2018-02-05-distant-dna to
allow for updating of querypos around N's and fix of fatal bug in GMAP in
merge_left_and_right_readthrough
2018-02-05 twu
* substring.c: In trim_left_end and trim_right_end, computing trims relative
to querystart and queryend, rather than 0 and querylength. In
embellish_genomic_sam, filling in beginning and end with dashes and stars
to avoid genomic_refdiff having a different length than querylength
* Makefile.gsnaptoo.am: When MAKE_LIB is not defined, not copying headers
either
* VERSION, config.site.rescomp.prd: Updated version number
* get-genome.c, parserange.c, parserange.h: Allowing Parserange_universal to
return a value for whole_chromosome_p
* substring.c: In Substring_new, always making querystart and queryend
correspond to trim_left and trim_right, to fix errors in CIGAR strings
* access.c: Using a loop to create (and possibly deallocate) shared memory
* access.c: Upon startup, if a shared memory segment exists with no other
attached processes (possibly corrupted), then deallocating it and creating
a new one
* kmer-search.c: Simplified code in making sure that both ends are in the
same chromosome
* samprint.c: In SAM_compute chrpos, if hardclip_low yields a NULL
low_substring, then trying hardclip_high
* substring.c: Handling the case where both cdna and genome have 'N'
* stage3hr.c, stage3hr.h: Reversed revision 213080, which is giving CIGAR
errors. Restored Stage3end_substring_high
* substring.c: In Substring_new, added a minimum of 8 bp in general test 2
* stage3hr.c: For Stage3end_substring_low and Stage3end_substring_high, if
the hardclip passes all substrings, return the last one processed
* stage3hr.c: In Stage3end_new_substrings, handling the case where
substring1 and substringN have different chrnums assigned by Substring_new
* stage3hr.c: In Stage3end_new_substitution, handling the case where
Substring_new has assigned a different chrnum than the one given
* substring.c: Made trim incremental when performed as a preliminary step
before finding novel splice ends
* kmer-search.c: Made a more rigorous check that both ends are on the same
chromosome
* kmer-search.c: Making sure the two ends are on the same chromosome
* substring.c: Handling the case where the alignment goes over the upper
bound of the next higher chromosome
* substring.c: Removed residue from an SVN merge conflict
* trunk, src, Makefile.gsnaptoo.am, distant-rna.c, gsnap.c, path-solve.c,
stage1hr.c, stage3hr.c, stage3hr.h, terminal.c, terminal.h, types.h:
Merged revisions 213033 through 213070 from
branches/2018-02-05-add-terminals to add terminal alignments
2018-02-04 twu
* substring.c: Not applying general test 1 if orig_nmismatches < 0
* substring.c: Making outofbounds adjustments to be increments to the
existing trim_left and trim_right
* trunk, src, cigar.c, distant-rna.c, gsnap.c, kmer-search.c, samprint.c,
splice.c, stage1hr.c, stage3hr.c, stage3hr.h, substring.c, substring.h:
Merged revisions 213027 through 213053 from
branches/2018-02-04-simplify-substring-new to simplify Substring_new
procedure
* config.site.rescomp.prd, config.site.rescomp.tst, index.html, svncl.pl:
Updated version number
* stage3.c: Added a List_reverse command
* types.h: Added comments
* stage1hr.c: Fixed wrong assignment of first_read_p
* stage3hr.c: Checking for the eventrim region having bounds that don't make
sense
* substring.c: Added an assertion
* stage3.c: Restored pass 7 to remove dual breaks at the ends
* stage1hr.c: Fixed a memory leak for queryrc
* filestring.c: Added a check for fp being NULL, which can occur with the
flags --omit-concordant-uniq or --omit-concordant-mult
2018-02-02 twu
* VERSION: Updated version number
* configure.ac: Fixed command for --enable-lib
2018-02-01 twu
* trunk, index.html, src, block.c, block.h, extension-search.c,
extension-search.h, filestring.c, gmap.c, gsnap.c, indexdb.c, indexdb.h,
intersect.c, kmer-search.c, kmer-search.h, oligoindex_hr.c, pair.c,
pair.h, path-solve.c, segment-search.c, stage1.c, stage1.h, stage1hr.c,
stage1hr.h, stage2.c, stage3.c, stage3.h, stage3hr.c, substring.c,
transcriptome.c: Merged revisions 212952 through 212996 from
branches/2018-02-01-reverse-gmap-slowdown
* gmap.c: Fixed reference to array0 that should have been array1
* configure.ac: Disabled alloca by default
* stage3.c: Restored computation of Stage2_compute_starts and
Stage2_compute_ends
* pair.c: Handling GFF3 output when chrstring or accession is NULL
* oligoindex_localdb.c, oligoindex_localdb.h: Not using
Oligoindex_localdb_tally because of speed
* inbuffer.c: Added debugging statements
* gmap.c: Restored function of --pairalign option
2018-01-30 twu
* stage3hr.c: Made changes to debugging statements
* stage3.h: Updated interface for Stage3_setup
* stage3.c: Distinguishing between overall_end_distance_linear and
overall_end_distance_circular when making call to Stage2_compute_starts
and Stage2_compute_ends
* stage1hr.c: Making calls to remove_circular_alias and remove_duplicates
for single-end reads
* samprint.c: Fixed bug where alignment results to circular chromosome were
not being printed
* path-solve.c, kmer-search.c: Not allowing splice junctions for circular
chromosomes
* gsnap.c: Using new interface to Stage3_setup
* gmap.c: Using genomelength instead of genome_totallength. Initializing
circularp for usersegment
2018-01-29 twu
* gsnap.c, kmer-search.c, kmer-search.h, path-solve.c, path-solve.h: Not
allowing Junction_new_splice to be called when splicing is turned off
* trindex.c: Added some headers for open() procedure
* gmap.c, gsnap.c: Using interfaces to new setup procedures
* stage3.c, stage3.h: Limiting chrend for Stage2_compute_starts and
Stage2_compute_ends based on genome total length
* stage1hr.c, stage1hr.h: Added hook for distant DNA alignments
* samprint.c, samprint.h: Not printing XS field in SAM output when splicing
is turned off.
* pair.c, pair.h: Printing . in features field of GFF3 output when sense is
unknown. Not printing XS field in SAM output when splicing is turned off.
* stage3.c: Before running Stage2_compute_starts and Stage2_compute_ends,
removed the check on chrend going past chrhigh. The truncated coordinates
can cause chrend to be less than chrstart if the alignment straddles a
chromosomal bound
* stage3hr.c: Fixed a memory free error. In Stage3end_new_substrings, when
Stage3end_T object fails due to circular alias, letting Stage3end_free and
not the caller free the junctions
* stage3.c: Accepting a single alignment regardless of final score, if the
original queryjump or genomejump is negative
2018-01-27 twu
* substring.c: In Substring_new, computing trims first, then adjusting trims
for out-of-bound lengths, then adjusting query and genomic bounds
2018-01-26 twu
* VERSION: Updated version number
* intlist.c, intlist.h, list.c, list.h, uintlist.c, uintlist.h: Added code
for non-inlined functions
* transcript.c, transcript.h: Made Transcript_num non-inline
* configure.ac: Added macro AC_C_INLINE
* get-genome.c: Added include for intlist.h
2018-01-25 twu
* archive.html, index.html: Updated for latest version
* path-solve.c: Added a type conversion
* Makefile.gsnaptoo.am: Fixed typos in file names
* VERSION: Updated version number
* gsnap.c: Change in description for --action-if-cigar-error
* gmap.c: Implemented --action-if-cigar-error flag
* stage3.c: Putting an upper bound on the number of Boyer-Moore searches for
microexons, based on the number of 5' and 3' splice positions
* translation.c: Assigning aaphase_e for indels in CDS
* pair.c, pair.h, gsnap.c: Implemented flag --action-if-cigar-error
* oligoindex_hr.h, oligoindex_hr.c: Moved SIMD includes to header file,
since definition of Oligoindex_T object is now there
* gmap.c: Using new interface to Pair_setup
* Makefile.gsnaptoo.am: Turned off making of gmapl and gsnapl for all SIMD
types
* pair.c: Implemented patch by Nathan Weeks to remove extra token from
print_gff3_exons_forward. For GFF3 code, changed types for genomic
coordinates from int to Chrpos_T. Also initializing genomic coordinates
to 0 instead of -1
2018-01-24 twu
* oligoindex_hr.c: Added assertion
* stage3.c: Fixed computation of chrstart and chrend before final
Stage2_compute_start and Stage2_compute_ends for circular chromosomes
* trunk, src, Makefile.gsnaptoo.am, gmap.c, gsnap.c, localdb.c, localdb.h,
merge.c, oligoindex_hr.c, oligoindex_hr.h, oligoindex_localdb.c,
oligoindex_localdb.h, stage2.c, stage2.h, stage3.c: Merged revisions
212704 through 212742 from branches/2018-01-23-gmap-localdb
* VERSION, config.site.rescomp.tst: Updated version
* configure.ac: Added the conditional MAKE_LIB and a flag to control it
* pair.c: Added an assertion about cds_phase being non-negative
* Makefile.gsnaptoo.am: Building library only if MAKE_LIB is true
2018-01-23 twu
* gsnap.c: Allow --terminal-threshold flag for backward compatibility, but
ignore
2018-01-22 twu
* gsnap.c: Allowing the --use-sarray flag and ignoring it
* stage1hr.c: Fixed bugs regarding the use of querylength3 instead of
querylength5, and for handling the case where all paired-end hits are NULL
and we need to run GMAP on the complete paths.
2018-01-21 twu
* setup1.test.in: Turning off setup1.test
2018-01-19 twu
* trunk, src, Makefile.gsnaptoo.am, access.c, atoiindex.c,
bitpack64-access.h, bitpack64-incr.c, bitpack64-incr.h, bitpack64-read.c,
bitpack64-read.h, bitpack64-readtwo.c, bitpack64-write.c,
bitpack64-write.h, block.c, cellpool.c, cmetindex.c, diagpool.c,
distant-rna.c, distant-rna.h, dynprog.c, dynprog_cdna.c, dynprog_end.c,
dynprog_genome.c, dynprog_single.c, epu16-bitpack64-access.c,
epu16-bitpack64-access.h, epu16-bitpack64-incr.c, epu16-bitpack64-incr.h,
epu16-bitpack64-read.c, epu16-bitpack64-read.h, epu16-bitpack64-readtwo.c,
epu16-bitpack64-readtwo.h, epu16-bitpack64-write.c,
epu16-bitpack64-write.h, extension-search.c, extension-search.h,
filesuffix.h, genome.c, genome128_consec.c, genome128_hr.c,
genome128_hr.h, genome_sites.c, genome_sites.h, gmap.c, gmapindex.c,
gregion.c, gregion.h, gsnap.c, indel.c, indel.h, indexdb-write.c,
indexdb-write.h, indexdb.c, indexdb.h, indexdb_hr.c, indexdbdef.h,
intersect.c, intersect.h, intlist.c, intlist.h, junction.c, kmer-search.c,
kmer-search.h, list.c, list.h, littleendian.h, localdb-write.c,
localdb-write.h, localdb.c, localdb.h, localdbdef.h, matchpool.c,
maxent_hr.c, merge.c, merge.h, oligo.c, oligo.h, oligoindex.c, output.c,
pair.c, pair.h, pairpool.c, path-solve.c, path-solve.h, reader.c,
reader.h, resulthr.c, resulthr.h, samprint.c, sarray-read.c,
sarray-search.c, sarray-search.h, sedgesort.c, sedgesort.h,
segment-search.c, segment-search.h, smooth.c, snpindex.c, splice.c,
splice.h, splicestringpool.c, splicetrie.c, splicetrie.h, stage1.c,
stage1hr.c, stage1hr.h, stage2.c, stage3.c, stage3.h, stage3hr.c,
stage3hr.h, substring.c, substring.h, transcript.c, transcript.h, types.h,
uintlist.c, uintlist.h, uniqscan.c, univdiag.c, univdiag.h, univdiagdef.h,
util, gmap_build.pl.in: Merged revisions 210996 to 212658 from
branches/2017-11-04-faster-transcriptome. Put previous version into
tags/2018-01-19-version1-pre-transcriptome
* index.html: Revised for new version
* uniqscan.c: Using new interface to Stage1hr_setup
* transcript.c: Sorting transcripts before printing
* stage3hr.c, stage3hr.h: Stage3end_new_transcript aborting of number of
substrings and junctions don't match. Stage3end_new_substrings returning
new junctions to caller
* stage1hr.h, stage1hr.c: Using transcriptome_end_accept
* sarray-search.c: Various improvements to algorithm
2018-01-17 twu
* pair.c: Skipping gap characters in Pairarray_genomic_sequence
* gsnap.c: Fixed default value of max_middle_insertions
2017-11-17 twu
* sarray-search.c: Turned off qsort in favor of sedgesort
2017-11-15 twu
* stopwatch.c: Added standalone code for testing
* stage3.c: Fixed issue with trimming of chimeras not being re-extended back
to the breakpoint. Fixed issue with CIGAR strings from shortgap comps
being treated differently from indel comps.
* gsnap.c: Using new interface to Sarray_search. Printing worker runtimes
to stderr. Using --transcriptome-accept flag instead of old flags
* sarray-search.c, sarray-search.h: Removed genes_iit as a parameter
* indel.c: Added debugging statements
* dynprog_end.c: Handling the case where rev_goffset is negative
* outbuffer.c: Fixed fatal bug when trying to write SAM headers to
OUTPUT_NONE split output, which is now NULL
2017-10-30 twu
* stage3.c: Fixed trim_end_indel procedures to stop at a gap
2017-10-29 twu
* substring.c: Fixed a memory leak when embellish_genomic is called a second
time on a Stage3end_T object
* stage3hr.c: In Stage3end_new_gmap, in computing CIGAR, starting with
hardclips set to zero
* stage3.c: In traverse_single_gap, not allowing a force of the gappairs.
Implemented contiguous versions of peel_leftward and peel_rightward, but
not using yet.
* sarray-search.c: For transcriptome endpoints, restricting further the
cases where an indel should be ignored. Added some potential code for
genome endpoints, but not implemented
* pairpool.c: In Pairpool_join_end5 and Pairpool_join_end3, if the final
revision process still results in a negative queryjump or genomejump, then
don't attach the end at all
* get-genome.c: If map file is not in mapdir, then look in genomesubdir
* cigar.c, cigar.h, gsnap.c: Calling Cigar_setup to initialize static
variables
2017-10-20 twu
* trindex.c: Added error message if genes file not provided by user
* stage1hr.c, stage3hr.c, stage3hr.h: Doing remapping back to transcriptome
during pairing procedure
* get-genome.c: Added comment
* pair.c, pair.h, samprint.c: For GMAP-based alignments in GSNAP, printing
transcript info
* sarray-read.c, sarray-read.h: Changed relevant types from Univcoord_T to
Sarrayptr_T
* sarray-search.c, sarray-search.h: Changed relevant types from Univcoord_T
to Sarrayptr_T. Choosing closest indel on left and right ends. For
transcriptome, checking for overlapping or adjacent indels
2017-10-18 twu
* config.site.rescomp.prd, config.site.rescomp.tst, index.html: Updated for
latest version
* transcriptome.c, transcriptome.h: Implemented
Transcriptome_genomic_bounded_p
* transcript.c, transcript.h: Implemented Transcript_in_list_p. Returning
min_insertlength from Transcript_intersect_p
* stage3hr.c: Fixed bug in retrieving parent hit in Stage3end_remove_overlaps
* sarray-search.c: Fixed an issue where querystart was -1, which can result
when nmatches is 0
2017-10-17 twu
* gmap.c: Changed option name from --alt-initiation-codons to
--alt-start-codons
* stage1hr.c: Fixed typo in variable name
2017-10-12 twu
* stage3hr.c: Removed optimization to use the minimum of npaths and
maxpaths, since we need computations on all npaths to make a random
selection
* samprint.c: Handling dinucleotides gracefully if genomic sequence is NULL
* gsnap.c: Increased default expected_pairlength from 200 to 500. Removed
--maxsearch option, since it can lead to poor answers.
* outbuffer.c, samflags.h, samheader.c: Fixed problem introduced in 2017-05
which caused the --output-file option to produce a NULL file pointer
* chrom.c: Deferencing a character pointer
* pair.c: Fixed problem with initialization of endi in determining gff3
coordinates. Avoiding reading the pair at index of npairs.
* cellpool.c, cellpool.h, stage2.c: Adding non-overlapping paths, as well as
high-scoring paths
* stage1hr.c: Using new interface to Sarray_search_transcriptome
* substring.c, substring.h: Added functions for Substring_chrpos_low and
Substring_chrpos_high
* stage3hr.h: Added interfaces for chrpos_low and chrpos_high
* stage3hr.c: In transferring Transcript_T objects, checking for duplicates
* sarray-search.c, sarray-search.h: Allowing for genomic bounds on
transcriptome alignment, for use in re-aligning genomic hit to find
transcriptome coordinates
* gmap.c, translation.c, translation.h: Adding option for only ATG as
initiation codon. Making this the default.
* stage3.c: For trimming end exons, using a percentage of the querylength
instead of a fixed length
2017-10-03 twu
* stage3hr.c: Removed old version of pair_up_concordant_transcriptome
* stage3hr.c: In resolve_ambiguous_splice procedures, when ambiguity is
resolved, setting genomicstart or genomicend fields accordingly
* stage3hr.h: Removed interface for Stage3pair_pairtype
* stage3hr.c: Fixed the order for constructing genomic sequence from
substrings
* substring.h, substring.c: Not applying general test of goodness for
substrings from transcriptome-guided alignment
* sarray-search.c: Modified debugging statements
* samprint.c: Determining pairtype again, and allowing for the possibility
of concordant uniq
2017-10-02 twu
* trindex.c: Copying the genes IIT file to the transcriptome directory
* transcriptome.c: Allowing a transcriptome to be read without a genome, by
using divints instead of chrnums
* transcript.c: Revised debugging statements
* gsnap.c, uniqscan.c: Using new interface to Stage1hr_setup
* stage3hr.c, stage3hr.h: Implemented a separate procedure for pairing up
transcriptome alignments. Transcriptome hit types now take precedence.
Implemented Stage3end_substrings_genomic_sequence.
* stage1hr.c, stage1hr.h: Using new interface to Stage3pair_new. Re-mapping
to transcriptome from genomic suffix array alignment.
* junction.c, junction.h: Added function Junction_deletionpos
2017-09-29 twu
* stage3.c: Moved location of build_dual_breaks step to get better behavior
2017-09-27 twu
* stage3.c: Fixed over-aggressive use of minintronlen_ends from wrong end of
sequence
* chimera.c: Initializing a variable
2017-09-22 twu
* uinttableuint.c, uinttableuint.h: Initial import
* stage1hr.c, stage3hr.c, stage3hr.h: For transcriptome-guided genomic
alignment, placing results into concordant_uniq instead of
paired_uniq_long, if a transcript matches both ends
* pair.c, pair.h: Fixed computation of cds bounds for GFF3 output
* gsnap.c, uniqscan.c: Using new interface to Pair_setup
* gmap.c: Restored MAX_CHIMERA_ITER to be 3, but not iterating multiple
times for middle pieces. Added option --gff3-cds
* Makefile.gsnaptoo.am: Added uinttableuint.c to library
* translation.c: Assigning aaphase_g for final genomic codon
2017-09-11 twu
* gmap.c: Restored --intronlength option
* pair.c: Fixed gff3 cds output so it ignores indels
* pair.c: Replaced gff3 printing code for CDS with a call to code for exons
* pair.c, pair.h: Using new interface to transcript print functions
* stage3hr.c, stage3hr.h, transcript.c, transcript.h: Replacing separate
trnums, trstarts, and trends fields with transcripts field
* stage1hr.c: No longer filtering initially by transcript concordance
* gsnap.c, samprint.c, samprint.h: Using new interface to transcripts field
for Stage3end_T and Stage3pair_T objects
* gmap.c: No longer iterating on check_middle_local
* Makefile.gsnaptoo.am: Added transcript.c and transcript.h to programs that
need it from pair.c
2017-09-05 twu
* sarray-search.c, stage3hr.c: Commented out or fixed code for LARGE_GENOMES
to use Uint8list_T
* sarray-search.c, sarray-search.h: Commented out genome code for
LARGE_GENOMES
* trindex.c: Fixed code so it will compile. Fixed memory leaks.
* trunk, VERSION, gsl.m4, config.site.rescomp.prd, config.site.rescomp.tst,
index.html, src, Makefile.gsnaptoo.am, bitpack64-readtwo.c,
genome128_consec.c, genome128_consec.h, genome128_hr.c, genome128_hr.h,
get-genome.c, gmap.c, gmapindex.c, gsnap.c, iit-read-univ.c,
iit-read-univ.h, iit-read.c, iit-read.h, indel.c, indel.h, intlist.c,
intlist.h, junction.c, junction.h, mapq.c, mapq.h, output.c, pair.c,
pair.h, pairpool.c, samprint.c, samprint.h, sarray-read.c, sarray-read.h,
sarray-search.c, sarray-search.h, sarray-write.c, sarray-write.h,
splicealt.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h, substring.c,
substring.h, transcriptome.c, transcriptome.h, trindex.c, uniqscan.c:
Merged revisions 207858 to 209656 from branches/2017-07-01-transcripts to
allow for transcriptome-guided genomic alignment
* sarray-search.c: Fixed bug resulting from check of common diagonal over
circular origin
2017-09-01 twu
* index.html: Updated for latest version
* stage1hr.c: Fixed criterion for looking for spliceends with nmismatches
less than max_splice_mismatches on each end
* sarray-search.c: Checking right and left diagonals for collinearity with
middle diagonal in query coordinates
* gmapindex.c: Casting all Univcoord_T lengths to UINT4 for suffix array
procedures
* get-genome.c: Added option --gsequence to print exons and introns
2017-08-15 twu
* stage3hr.c: Changed checks on circularalias to circularpos
* sarray-search.c: Disallowing any splicing solution that goes around a
circular origin
* pair.c: Checking that we are not at the end of the alignment before doing
backward steps
* gmap.c: Removed option -G for uncompressed genome
* stage1hr.c: Disallowing any splicing solution that goes around a circular
origin. Incrementing counter when comparing against max_gmap_improvement.
Fixed a memory leak.
2017-07-27 twu
* stage3.c: In find_dual_break_spliceends, fixed a bug that generated
negative coordinates
2017-06-29 twu
* Makefile.gsnaptoo.am: Added some files to the library and the include
directory
* table.c, table.h: Added a function needed by gstruct
* interval.c, interval.h, iit-write.c, iit-write.h: Added a variable to make
a function compatible with the gstruct version
* gsnap.c: Added a header file
* dynprog_genome.c: Commented out assertions that do not hold in transcript
alignment
* chrom.c: Removed a faulty assertion
2017-06-21 twu
* stage3.c: For final call to insert_gapholders from path_compute_final,
filling the gap with nucleotides if queryjump == genomejump
* pair.c: For GFF3 output, not printing lines where genomestart and
genomeend coordinates are the same, typically resulting from a query skip
2017-06-20 twu
* Makefile.gsnaptoo.am: Added maxent_hr to lib and include
* stage3hr.c: Added assertions to make sure ilengths are not negative
* substring.c: In overlap checking procedures, decrementing high coordinate
by 1 if possible to match the procedures for clip_overlap and
merge_overlap in stage3hr.c
2017-06-19 twu
* VERSION, index.html: Updated version number
2017-06-16 twu
* gmap.c: Added to debugging statements
* stage3.c: In merge procedures, restoring original pairs to Stage3_T
objects if the merge fails
* gsnap.c: Turning off default of 0 for trim-mismatch-score and
trim-indel-score for DNA-Seq
2017-06-15 twu
* samprint.c, pair.c: For XM, handling the case where queryseq_mate is NULL
* shortread.c: Changed memory source of longstring to IN
* samprint.c: Added back a missing else clause after checking for
omit_concordant_uniq_p
* stage3.c: Added debugging statements for creating and freeing Stage3_T
objects
* stage1hr.c: Fixed memory leaks relating to floors and anchor segments
* sequence.c: Changed memory source of all contents to IN
* pair.c: Changed memory source of all tokens to OUT
* list.c, list.h: Implemented List_to_array_out_n
* intlist.c, intlist.h: Implemented Intlist_to_char_array_in
* gmap.c: Fixed memory leaks and memory bugs relating to chimera code.
Removed all references to a nonjoinable list, and using stage3list as the
master list for all procedures.
* genome.c: Changed source of alloc to IN
2017-06-14 twu
* index.html: Updated for latest version
* configure.ac: Removed unused macros
* src, util: Merged revisions 204076 through 207268 from
branches/2017-03-07-multimapper-genes
* stage3hr.c: Reverted from revision 207330 (revision 205421 from
branches/2017-03-07-multimapper-genes) to remove nindelbreaks field, since
it discriminates against some equivalently good alignments
* stage3hr.c: Merged revision 205421 from
branches/2017-03-07-multimapper-genes to add nindelbreaks field
* Makefile.gsnaptoo.am: Added commands for building lib and include
* uniqscan.c: Added Access_controlled_cleanup
* substring.c: Merged revisions 204076 through 205371 from
branches/2017-03-07-multimapper-genes to remove splicecoordN and to set
splicecoordD_knowni and splicecoordA_knowni.
* stage1hr.c: Merged revisions 204076 through 205713 from
branches/2017-03-07-multimapper-genes to find DNA chimeras in paired-end
reads and to double-check apparent perfect matches for actual number of
mismatches
* sequence.c, sequence.h: Added function Sequence_stdout_header
* sarray-read.c, sarray-read.h, sarray-search.c, sarray-search.h: Merged
revisions 204076 through 205420 from branches/2017-03-07-multimapper-genes
to move search functions from sarray-read.c to sarray-search.c
* samprint.c, samprint.h: Merged revisions 204076 through 206196 from
branches/2017-03-07-multimapper-genes to print information in XT field for
transcript splicing and to handle omitting of concordant alignments
* samheader.c: Don't open a file for OUTPUT_NONE
* popcount.c, popcount.h: Modified conditions for including our own popcount
instructions. No longer needed if built-in options are available
* pair.c: Modified compressed format to no longer print tokens or
dinucleotides
* littleendian.h: Added macros for FREAD_FLOATS and FWRITE_FLOATS
* iit-read.c, iit-read.h: Merged revisions 205322 through 206058 from
branches/2017-03-07-multimapper-genes to support --coding in get-genome
and to implement IIT_genestruct_chrpos
* gsnap.c: Merged revisions 204076 through 206184 from
branches/2017-03-07-multimapper-genes to add options for transcriptome
alignment and omitting concordant output.
* datadir.c: Modified messages when gmapdb is not found
* cigar.c: Fixed printing of "*" for cigar with mate is NULL or substrings
is NULL
* block.c: Merged revision 204180 from branches/2017-03-07-multimapper-genes
to generalize from 12-mers to oligo size for debugging output
* stage3hr.c, stage3hr.h: Merged revisions 206187 and 205714 from
branches/2017-03-07-multimapper-genes to add behavior for
--omit-concordant-uniq and --omit-concordant-mult and to add a
splice_score field for all splice types
2017-06-13 twu
* pair.c, stage3.c: Changed type of chroffset and chrhigh from Chrpos_T to
Univcoord_T in trim end functions
2017-06-12 twu
* src, gmap.c, output.c, pair.c, pair.h, stage3.c, stage3.h: Merged revision
204925 from branches/2017-04-02-genome-genome to add bedpe output
* diag.c, stage2.c: Merged revisions 207196 and 207198 from
branches/2017-04-02-genome-genome to improve genome-genome alignment
2017-06-09 twu
* stage3.c, output.c: Using functions now in pair.c
* samprint.c, cigar.c, pair.c, pair.h, stage3hr.c, stage3hr.h: Moved some
functions to pair.c
* get-genome.c: Allowing for --dump to work with --exons
* substring.h: Moved typedef of Substring_T early
* Makefile.gsnaptoo.am: Including cigar.c and cigar.h for uniqscan and
uniqscanl
2017-05-30 twu
* substring.c, substring.h: Commenting out procedures needed for chrpos_high
* stage3hr.c, stage3hr.h: Commenting out procedures needed for chrpos_high.
Using procedures from cigar.c.
* stage3.c: Using procedures from cigar.c
* stage1hr.c: Added debugging statement
* pair.c, pair.h, samprint.c, samprint.h: Moved CIGAR printing procedures to
cigar.c. Printing mate cigar in XM field instead of mate chrpos_high.
* gsnap.c: Using new interfaces to Output_setup and SAM_setup
* output.c, output.h, samprint.c, samprint.h: Moved setup of merge_samechr_p
from output.c to samprint.c
* Makefile.gsnaptoo.am, cigar.c, cigar.h: Added cigar.c and cigar.h for code
relating to printing of CIGAR strings
2017-05-25 twu
* sarray-read.c: Increased iteration condition, allowing sarray algorithm to
work when nmisses_allowed is zero.
2017-05-12 twu
* stage3.c: In Stage3_merge_chimera, doing peelback to remove any indels at
the chimeric junction
2017-05-11 twu
* output.c, pair.c, pair.h, samprint.c, samprint.h, stage3hr.c, stage3hr.h,
substring.c, substring.h: Added printing of mate chrpos high with an XM
field
* stage1hr.c: Removed exception for FREE_ALIGN when nstreams is 1
* iit_get.c: Commented out printing of total when reading queries from stdin
2017-05-10 twu
* chimera.c, chimera.h, gmap.c: Allowing search for chimera exon-exon
boundary to extend for 1 mismatch
* stage3.c, stage3.h: Implemented procedures Stage3_trim_left and
Stage3_trim_right
* gmap.c: Calling Chimera_find_breakpoint first to set bounds based on
sequence, and then Chimera_find_exonexon to find the exon boundary
* gmap.c: Increased value of CHIMERA_EXTEND from 8 to 20
* parserange.c: Added null terminating character after strncpy
2017-05-09 twu
* shortread.c: Fixed uninitialized variable in nextchar2 and invalid free
when skipping in second file
2017-05-08 twu
* uniqscan.c: Using new interface to Stage1hr_setup
* gsnap.c, stage1hr.c, stage1hr.h: Added --speed option for GSNAP
* gmap.c: Set default value for maxintronlen to be 500,000
2017-05-03 twu
* stage3hr.c: Changed the procedure for resolving overlapping and separate
alignments. Now filtering both the overlapping and separate alignments.
Using expected pairlength and pairlength deviation to select which one to
report.
* stage1hr.c: Turned off the shortcut to skip complete set algorithm if
suffix array has found something. Turned off the shortcut for GMAP
pairsearch/halfmapping if nconcordant > 0
* spanningelt.c: Changed a check procedure to abort rather than exit
* spanningelt.h: Fixed a typo in a comment
* merge.c: Made Merge_diagonals non-destructive, by copying the streams into
the heap
* indexdb_hr.c: Added a comment about Merge_uint4 being destructive
* iit-read.c, iit-read.h: Added IIT_gene_overlapp function used by
get-genome with the --coding flag
* get-genome.c: Added a --coding flag to report only genes that overlap in
their coding regions
2017-04-24 twu
* index.html: Updated for latest version
* Makefile.am: Added gtf_transcript_splicesites to CLEANFILES
* pair.c, pair.h: Taking mate_chrnum as an argument in SAM print function
* output.c, samprint.c, samprint.h: Computing chrnum and mate_chrnum at same
time as chrpos and mate_chrpos, to resolve issues with SAM output
2017-04-21 twu
* samprint.c, stage3hr.c, stage3hr.h: Fixed issue in mate chromosome printed
when mate is a translocation
2017-04-13 twu
* chrom.c: Fixed compare function for alpha_numeric entries
2017-04-12 twu
* Makefile.am, configure.ac: Added an entry for gtf_transcript_splicesites
* index.html: Changed for latest version
* gtf_transcript_splicesites.pl.in: Changed output format
* stage3.c: Revised debugging statements
* samflags.h, samprint.c, samprint.h: Adding supplementary flag. Adding
information to XT field for transcript splicing.
* gsnap.c: Turning off trim mismatch for DNA-Seq
* gmap.c: Added comments
* dynprog_end.c: Turned wideband off for extending from medial splicesite,
which was causing the end exon to be re-discovered as an indel
2017-04-11 twu
* gtf_transcript_splicesites.pl.in: Initial import
2017-03-17 twu
* substring.c: Commented out assertions that don't hold under SNP-tolerant
alignment
* stage3hr.c: Improved debugging statement
* stage1hr.c: Proceeding to spanning set procedure if the other end has more
hits than the number of concordant hits
* splicing-score.c: Added debugging calls to Maxent_hr procedures, to help
in development
* samprint.c, maxent_hr.c, junction.c: Added debugging statements
* Makefile.gsnaptoo.am: Added files for splicing_score
* iit-read.c: Fixed memory leak for intron-level known splicing
* intron.c, intron.h: Added some utility functions
* indel.c, indel.h: Modified Indel_resolve_middle_deletion to favor and
report intron dinucleotides for short deletions
* gsnap.c: Using new interface to Sarray_setup
* sarray-read.c, sarray-read.h: Checking short deletions with length between
min_intronlength and max_deletionlen to see if they are introns
2017-03-09 twu
* get-genome.c: Allowing dump of all sequences from a map file
2017-03-02 twu
* pairpool.c: Allowing querypos and genomepos of 0
2017-02-24 twu
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst: Updated version
number
* chrom.c: Added a type ALPHA_NUMERIC and sorting appropriately for those.
Stripping "Chr" as well as "chr" from names
* pairpool.c: In Pairpool_push, not doing anything if querypos or genomepos
is less than or equal to 0
* stage2.c: For convert_to_nucleotides, handling the case where path is NULL
* gmap.c: Added missing brace
* gmap.c, stage3.c, stage3.h: Added option --split-large-introns and
implemented procedure Stage3_split
* stage2.c: Renamed variable querypos to curr_querypos in some procedures,
so debug9 can be used
2017-02-16 twu
* iit-read.c: Handling the case in finding unique positions and splices
where a gene has no overlapping genes
2017-02-15 twu
* archive.html, index.html: Updated for latest version
* VERSION: Updated version number
* substring.c: Fixed calculation of mandatory_trim_left and
mandatory_trim_right
* indexdb.c: Assigning MMAPPED to positions_high_access when appropriate, to
avoid free() error at end of program
2017-02-14 twu
* output.c: Ignoring mergedp in restricting the final result to a single path
* gmap.c: Allowing value of --suboptimal-score to be a float. Ignoring
mergedp in handling the final result
* gmap_build.pl.in: Added flag to build genome index in parts
* substring.c: For default alignment format, filling in stars in regions
where the alignment goes past the beginning or end of the genome
* dynprog_single.c, stage3.c: Added checks against non-positive values for
rlength and glength in Dynprog_single_gap. Also requiring a positive
value for rlength in running Dynprog_single_gap over Dynprog_cdna_gap or
Dynprog_genome_gap.
* dynprog.c: Added debugging statements
* stage3.c, stage3.h: Added sort comparison procedures to help with local
chimeric joins on each chromosome
* gmap.c: In checking for local chimeric joins, processing each chromosome
separately
* stage3hr.c: Not resolving inside alignment when the coordinates look like
a scramble, which can occur with circular chromosomes
* stage1hr.c: Fixed a memory leak for a non-concordant pair. Fixed an
uninitialized variable for non-spliced alignment
* oligoindex_hr.h: Commented out obsolete code
* iit-read.c, iit-read.h: Added support for finding unique splices, and for
finding unique positions and splices in a set of genes
* gmap.c, gsnap.c: Fixed printing of SIMD capabilities for AVX2 and AVX512
* get-genome.c: Added ability to dump a map file, and the ability to print
unique positions among a set of genes
* genome128_hr.c: Changed builtin commands for trailing and leading zeroes
to use the long long versions for 64-bit words
* genome.c: Commented out messages to stderr for negative coordinates
* dynprog_genome.c: Increased rewards for canonical intron. Removed
penalties for indels next to a splice site
2017-02-08 twu
* get-genome.c: Printing presence/absence of unique splices also
2017-01-31 twu
* get-genome.c, iit-read.c, iit-read.h: Added option --nunique to print
number of unique positions
2017-01-27 twu
* oligoindex_hr.c: Fixed typos in atoi functions for SSE2 code
2017-01-13 twu
* stage3hr.c: Fixed a memory leak in resolving inner splices
* dynprog_end.c: Fixed conditional jump based on finalscore, by not checking
when endalign is QUERYEND_NOGAPS
* stage1hr.c: Fixed uninitialized value for successp. Using FREE_ALIGN macro
* spanningelt.c, indexdb_hr.c: Using MALLOC_ALIGN instead of MALLOC when
needed
* oligoindex_hr.c: Including atoi.h
* samprint.c, substring.c, substring.h: Fixed coordinates reported in XT
field, which depend on the donor and acceptor strands
* merge.c: Using macros FREE_ALIGN and CHECK_ALIGN
* mem.h: Defined macros FREE_ALIGN and CHECK_ALIGN
2017-01-10 twu
* genome128_hr.c: Fixed incorrect AVX macro
* oligoindex_hr.c: Changed _mm_bsrli_si128 to _mm_srli_si128. Added atoi
and ttoc modes to all code.
2017-01-09 twu
* gsnap.c: Removed option --microexon-spliceprob
2017-01-06 twu
* stage1hr.c: Using alignments with most matches, even if they are
translocations compared with other hitpairs
2017-01-02 twu
* genome128_hr.c: For handling middle rows, using <= and >= to endptr and
startptr, instead of < and >
2017-01-01 twu
* stage3.c: Using new interface to Dynprog_end5_gap and Dynprog_end3_gap
* stage1hr.c: In identify_all_segments, filtering out diagonals <
querylength from the merged array
* dynprog_single.c, dynprog_cdna.c, dynprog_genome.c: Using use8p_size
* dynprog_simd.h: Removing fixed definition for SIMD_MAXLENGTH_EPI8
* dynprog_simd.c: Added assertions for traceback procedures for vertical and
horizontal jumps not to go past the main diagonal. Put macros around
memory fences in debugging print procedures.
* dynprog_end.c, dynprog_end.h: Using use8p_size and introduced parameter
require_pos_score_p
* dynprog.c, dynprog.h: Introducing an array for use8p_size that depends on
the mismatch type
2016-12-30 twu
* stage3hr.c: Not converting splices when resolving insides of
paired-end-reads
2016-12-29 twu
* trunk, src, dynprog_genome.c, gsnap.c, pair.c, pair.h, sarray-read.c,
smooth.c, stage1hr.c, stage1hr.h, stage2.c, stage3.c, stage3.h,
stage3hr.c, stage3hr.h, substring.c, substring.h, uniqscan.c: Merged
revisions 201789 through 202030 from branches/2016-12-18-stage2-soa to
make various improvements to alignments
* stage1hr.c: Added debugging statements
* indexdb_hr.c: Checking for nmerged being 0
2016-12-16 twu
* ax_ext.m4: Not adding -mno options to an Intel compiler
* indexdb_hr.c: Returning an array created by malloc, rather than
_mm_malloc, from the merge version of Indexdb_merge_compoundpos
* sarray-read.c: Using qsort instead of Sedgesort, because of seg faults
observed on Intel compiler
* Makefile.gsnaptoo.am: Including merge.c, merge.h, merge-heap.c, and
merge-heap.h where needed
* stage1hr.c: Providing a version of identify_all_segments for LARGE_GENOMES
* indexdb_hr.c: Cleaned up code so there are three versions of
Indexdb_merge_compoundpos. Fixed the merge version.
* oligoindex_hr.c: Fixed faulty svn merge
* genome128_hr.c: Fixed faulty svn merge, and hid shift_lo and shift_hi
procedures
* trunk, src, Makefile.gsnaptoo.am, indexdb_hr.c, mem.h, merge-heap.c,
merge-heap.h, merge.c, merge.h, stage1hr.c: Merged revisions 200992
through 201743 from branches/2016-11-28-simd-merging to revise SIMD merge
code
* spanningelt.c, spanningelt.h: Merged revisions 200992 through 201743 from
branches/2016-11-28-simd-merging to change a calloc to a malloc
* trunk, ax_cpuid_intel.m4, ax_cpuid_non_intel.m4, ax_ext.m4, configure.ac,
src, Makefile.gsnaptoo.am, cpuid.c: Merged revisions 200476 through 201735
from branches/2016-11-14-avx512 to make provisions for AVX-512
* gmap.c: Merged revisions 200476 through 201735 from
branches/2016-11-14-avx512 to change Genome_hr_user_setup to
Genome_hr_setup
* gmap_select.c, gmapl_select.c, gsnap_select.c, gsnapl_select.c: Merged
revisions 200476 through 201735 from branches/2016-11-14-avx512 to add
provisions for AVX-512
* genome128_hr.c, genome128_hr.h: Merged revisions 200476 through 201735
from branches/2016-11-14-avx512 to add shift and wrap procedures
* oligoindex_hr.c, oligoindex_hr.h: Merged revisions 200476 through 201735
from branches/2016-11-14-avx512 to revise algorithms substantially
* oligoindex_old.c, oligoindex_old.h: Merged revisions 200476 through 201735
from branches/2016-11-14-avx512 to make checking code work with current
code
* stage2.c: Merged revisions 200476 through 201735 from
branches/2016-11-14-avx512 to fix debugging comment
* sarray-read.c: Merged revisions 200476 through 201735 from
branches/2016-11-14-avx512 to add AVX-512 code
* stage1hr.c: Fixed uninitialized variable
2016-12-13 twu
* trunk, VERSION, config.site.rescomp.prd, config.site.rescomp.tst, src,
genome128_hr.c: Merged revisions 201421 through 201532 from
branches/2016-12-09-genomebits-serial-simd to change structure of SIMD
code in genome128_hr.c
* index.html: Updated for version 2016-11-07
* configure.ac: Allowing sse4.1 and sse4.2 as responses to --with-simd-level
* samprint.c: Added missing pair of braces
* gsnap.c, stage1hr.c, stage1hr.h: Removed references to indel_knownsplice
mode for gmap
2016-11-18 twu
* oligoindex_hr.c: Fixed debugging statements to use SIMD commands in count
procedures
2016-11-16 twu
* ax_ext.m4: Removed -mno... flags for compilers
* configure.ac: Restricting response to --with-simd-level
* ax_cpuid_intel.m4: Fixed configure issue for AVX2 support using Intel
compiler
2016-11-14 twu
* pair.c: Removed initialization of static variables
* gsnap.c, outbuffer.c, outbuffer.h, output.c, output.h: Separate output
files for single-end and paired-end results
2016-11-07 twu
* sam_sort.c: Added printing at monitor intervals
* stage3hr.c: Checking for cases where insertions and deletions extend past
genomicpos 0
* samprint.c: Added preliminary code for printing extended cigar strings
* pair.c, pair.h: Added code for printing extended cigar strings. Not
printing BLAST e-values.
* indexdb.c: Removed unused statement
* gsnap.c, uniqscan.c: Using new interface to Pair_setup
* gmap_select.c, gmapl_select.c, gsnap_select.c, gsnapl_select.c: Using new
interface to CPUID_support
* gmap.c: Added option --sam-cigar-extended
* cpuid.c, cpuid.h: Added provisions for AVX512
2016-10-24 twu
* stage1hr.c: Not computing floors if querylength is less than index1part
* pair.c: Showing blast_evalue function for GMAP
* samprint.c: Removing assertions and aborts that do not hold for DNA-Seq
chimeras
2016-10-23 twu
* bitpack64-read.c, pair.c, samprint.c, stage3hr.c, stage3hr.h, substring.c,
substring.h: Printing BLAST e-values
2016-10-22 twu
* indexdb-write.c: For initializing counters, using packsizes from
offsetsmeta, rather than recomputing them from offsetsstrm
* bitpack64-write.c: Handling problem with ptri overflowing a signed int.
Now just advancing a pointer.
* bitpack64-read.h: Added interface for a function
* bitpack64-access.c, bitpack64-incr.c, bitpack64-read.c,
bitpack64-readtwo.c: Handling the case where nwritten*4 overflows an
unsigned int. Casting it first to UINT8.
2016-10-17 twu
* bitpack64-access.c: Fixed an increment of out in extract_28
2016-09-26 twu
* archive.html, index.html: Updated for latest version
2016-09-23 twu
* stage3.c: In solving dual introns, handling the case where single_gappairs
is NULL. Added code for gmapl.
* stage1.c: Modified debugging statements
* pair.c: Added a check for monotonicity of query coordinates to the
debugging procedure
* dynprog_genome.c: If procedure is returning NULL instead of the computed
gap pairs, then setting finalscore to be negative, so the result is not
used by the calling procedure
* access.c: If shm_attach fails, and using mmap instead, then not trying to
copy a file to a read-only memory segment
* Makefile.gsnaptoo.am: Added uint8list.c and uint8list.h for gmap
* stage2.c: Added back find_shifted_canonical procedure as unused code
2016-09-20 twu
* substring.h: Using sensedir as a field, instead of chimera_sensedir
* substring.c: Substring_new can use trimmed ends to determine the sensedir.
Using sensedir as a field, instead of chimera_sensedir
* stage3hr.h: Stage3end_new_gmap takes sensedir_knownp as an argument
* stage3hr.c: Stage3end_new_gmap takes sensedir_knownp as an argument, and
can use trimmed ends to determine the sensedir. Stage3end_new_substrings
can determine sensedir from its component substrings and junctions. For
comparing alignments, using nmatches rather than nmatches_posttrim
* stage3.h: Changed variable name
* stage3.c: Removing maxpeelback restriction on peeling back for introns.
For microexons, just transferring without checking. In comparing single
and dual gaps, not using middle exonprob to evaluate middle exon. In
solving dual breaks for microexons, allowing for multiple possible outer
splice positions. Changed order of operations to smooth first, then find
dual breaks, and then single introns.
* stage1hr.c: Deciding separately whether to run gmap on 5' and 3' ends,
depending on max_matches found on each end
* pair.c: Putting macro around GSNAP-specific output code for using mate
sensedir
* dynprog_genome.c: Not using probabilities to determine if dinucleotide
solution is good
* boyer-moore.c: Added debugging statement
* dynprog_genome.c: Removed backup algorithm for best score above a
probability threshold. Instead, using best probability among canonical or
semicanonical dinucleotides.
2016-09-16 twu
* stage3.c: Solving for microexons inside of traverse_dual_break. Solving
for dual breaks before solving introns.
* splice.c, splice.h: Splice_trim_novel_spliceends function now returning
new splicedir
* stage3.c: Added intron-specific functions for peelback, to handle long
similarity between exon ends and intron segments on the other end.
Function for finding novel spliceends now returns new splicedir, although
currently not used.
2016-09-15 twu
* samprint.c: Using new interface for Substring_sensedir
* pair.c: Printing mate sensedir, to be consistent with samprint code
2016-09-13 twu
* stage3hr.h: Removed obsolete functions
* stage3hr.c: Fixed cases where trim was added to amb_length. Removed
specific amb_length fields for GMAP alignments, and calculating instead
using trim_left_splicep and trim_right_splicep
* stage1hr.c: Modified debugging statements
* substring.h, substring.c: Removed an include statement
* splice.c, splice.h: Moved splice site probability calculations from
substring_trim_novel_spliceends to here
* stage3.c: Fixed gmap_trim_novel_spliceends to initialize mismatchp to be
true if the alignment does not extend to the end
* dynprog_genome.c: Turned off debugging
* substring.c: Added comments
* dynprog_genome.c: Fixed bug in decision-making for using bestscore when it
has a good probability. Previously, this switched to the
probability-based algorithm. Renamed variables to clarify the algorithms.
2016-09-12 twu
* stage3hr.c: For overlap calculation, using just trim, not trim plus amb
length
2016-09-09 twu
* stage3hr.h: Defining Stage3end_nmatches
* stage3hr.c: Defining nmatches to be nmatches_posttrim plus amb length.
Requiring minimum number of matches to allow a transloc splice. Favoring
definite ambig results, plus insertlength, over definite splices or
trimmed ambig, and then favoring definite splices over trimmed ambig.
* stage1hr.c: Using Stage3end_nmatches instead of
Stage3end_nmatches_posttrim to decide whether to run GMAP
* substring.h: Defining procedures for returning nmatches and amb lengths
* substring.c: Defining nmatches to be nmatches_posttrim plus ambiguous
length. Computing MAPQ over trimmed region to be consistent with
pair-based method. For new donor and acceptor substrings, extending the
trim calculation to 0 or querylength.
2016-09-07 twu
* stage1hr.c: Checking whether result of Stage3end_new_splice is NULL
* stage3hr.c: Using number of matches and nmatches_posttrim in
hit_goodness_cmp and hitpair_goodness_cmp. Requiring a minimum number of
matches in donor and acceptor before creating a transloc splice. Added
code for checking suffix array mismatches.
* sarray-read.c: After finding an insertion, modifying querystart of current
diagonal, so next substring operation starts from that position
* indel.c: Improved debugging statements
* bitpack64-incr.c: Fixed errors in code for transferring from bitpack sizes
22 to 24, and from 26 to 28
2016-09-02 twu
* gmap.c, gsnap.c, uniqscan.c: Using new interface to Indexdb_new_genome
* splice.c: When splice is not found, return -1 as values for nmismatches
* sarray-read.c: Allowing initial value of nmismatches to be used if it is
0. Fixed case involving ambiguous substrings.
* sarray-read.c: Setting nmismatches correctly in various cases, so we do
not have to recompute them. Looking at endpoints to determine if the
nmismatches value is correct.
2016-09-01 twu
* indexdb.c, indexdb.h: For the option --unload-shared-memory, use
allocation and not memory mapping to make sure we deallocate any shared
memory
2016-08-24 twu
* genome.c: Not accessing beyond end of blocks when enddiscard is 0
2016-08-16 twu
* VERSION: Updated version number
* README: Discussing MAX_STACK_READLENGTH
* gsnap.c, uniqscan.c: Using MAX_FLOORS_READLENGTH instead of MAX_READLENGTH
* configure.ac, Makefile.gsnaptoo.am: Using MAX_STACK_READLENGTH instead of
MAX_READLENGTH
* stage1hr.h: Adding max_floor_readlength to setup
* stage1hr.c: Removed local allocation of arrays of size MAX_READLENGTH.
Now checking querylength against MAX_STACK_READLENGTH to determine whether
to allocate from stack or heap. Adding max_floor_readlength to setup
* indel.c, mapq.c, sarray-read.c, splice.c: Removed local allocation of
arrays of size MAX_READLENGTH. Now checking querylength against
MAX_STACK_READLENGTH to determine whether to allocate from stack or heap
* stage3hr.c: Not allowing any indels to set trims in determining optimal
score
* stage1hr.c: Using pre-processor macro LONG_READLENGTHS to allocate
read-related memory on heap instead of stack. Setting spliceable_high_p
to be false for last segment. In computing end indels, ensuring that
shifti is not negative when looking up array value.
* shortread.c: Using MAX_EXPECTED_READLENGTH instead of MAX_READLENGTH
* stage3.c: Handling the case when trimming ends that exon is empty
* stage3hr.c: Restored setting of abort_pairing_p when nconcordant exceeds
maxpairedpaths
* gsnap.c, uniqscan.c: Using new interface to Pair_setup
* indel.c, mapq.c, sarray-read.c, splice.c, substring.c: Using pre-processor
macro LONG_READLENGTHS to allocate read-related memory on heap instead of
stack
* gmap.c, pair.c, pair.h: Added option --gff3-swap-phase
* bytecoding.c: Added explanation messages to remove shared memory segments
2016-08-12 twu
* trunk, config.site.rescomp.prd, configure.ac, src, Makefile.gsnaptoo.am,
filestring.c, genome_sites.c, gsnap.c, pair.c, samprint.c, sarray-read.c,
sedgesort.c, sedgesort.h, shortread.c, splice.c, stage1hr.c, stage3hr.c,
stage3hr.h, substring.c, substring.h, univdiag.c, univdiag.h, util: Merged
revisions 195608 to 196272 from branches/2016-08-09-genome-sites-hr, which
contains merged revisions from branches/2016-08-02-long-read-fusions and
2016-07-01-better-triage
* trunk, VERSION: Updated version number
* Makefile.gsnaptoo.am: Removed chrsubset.c and chrsubset.h for
splicing-score
* pair.c: Added variable to swap phase for gff3 output
* configure.ac: Added a line to disable maintainer mode for users
* config.site.rescomp.prd, config.site.rescomp.tst, archive.html,
index.html: Updated for latest version
* MAINTAINER: Added note about PATH
2016-08-08 twu
* gtf_genes.pl.in, gtf_introns.pl.in, gtf_splicesites.pl.in: Printing both
gene_id and gene_name
* atoi.c, cmet.c: Fixed reduce procedures for 64-bit computers
* Makefile.gsnaptoo.am: Added semaphore.c and semaphore.h to list of files
for splicing-score
* stage1hr.c: Fixed debugging statements
* stage3.c: Fixed issue where we tried to use pairs_pretrim after path_trim
altered the pairs
* samprint.c, substring.c, substring.h: Fixed XT field to print correct
junction coordinates
2016-08-02 twu
* stage3hr.c: Restoring final procedure based on nmatches in
Stage3pair_optimal_score
* stage3.c: Reverting from revision 195487 to allow extraexon comps again
and from revision 193238 to always insert dual break alignments
* comp.h, pair.c, pairpool.c: Reverting from revision 195484 to allow
extraexon comps again
2016-08-01 twu
* gtf_genes.pl.in, gtf_introns.pl.in, gtf_splicesites.pl.in: Imposing
preference order based on desired keys, rather than the text
* src, inbuffer.c, shortread.c: Merged revisions 195492 and 195493 to fix
problem where --force-single-end terminated when a file had reads that
were a multiple of --input-buffer-size
* stage3.c, comp.h, pair.c, pairpool.c: Using shortgap comp instead of
extraexon comp for representing dual breaks
* shortread.c: Fixed issues in reading multiple pairs of files from command
line
2016-07-23 twu
* atoiindex.c: Fixed calculation of oligo using new block algorithm
2016-07-11 twu
* stage1hr.c: For paired terminals, assigning final pairtype to be concordant
* archive.html: Exposed version 2015-07-23. Improved formatting.
* stage3hr.c, substring.c: Handling the special case when alignstart or
alignend is requested on an ambiguous substring
* stage3hr.c: Computing insertlength and concordance properly for
overlapping dual GMAP alignments
* stage1hr.c: For dynamic programming of anchor segments, proceeding from
closest to farthest segments, to favor shorter splice distances. Resolved
uninitialized variable when completeset algorithm is called but
spanningset was not called, so read_oligos was not called. Skipping
re-alignment of hits that are already have a type of GMAP.
* sarray-read.c: Resolving ambiguous ends if one dominates by both
probability and splice distance
* gmap.c, gsnap.c, uniqscan.c: Using new interface to Stage3_setup
* stage3.c, stage3.h: Adding dual break for both SAM and non-SAM output,
needed to give the correct CIGAR starting coordinate
* doublelist.c, doublelist.h, intlist.c, intlist.h, uintlist.c, uintlist.h:
Implemented procedures for keeping a single item in the list
* sarray-read.c: Among ambiguous splice segments, ranking by probability and
selecting closest one if it is less than half the distance of the second
one
* substring.c, stage3hr.c: Improved debugging statements
* sarray-read.c: Fixed value of substring1p passed to Substring_new_ambig
for alignments on the minus strand, which resulted in problems with the
--merge-overlap feature
2016-07-06 twu
* stage3.c, pair.c: Restored backward movement of ptr
* stage1hr.c: Fixed infinite loop due to circular list
* gsnap.c: Changed default end detail to be medium
* substring.c: Fixed standard GSNAP output for deletions, by reducing the
number of final dashes
* stage3hr.c: Added debugging statements
2016-06-30 twu
* pair.c, stage3.c: No longer going backward after an indel, which could
cause an infinite loop
* splice.c: Using looser criteria for accepting a splice
* sarray-read.c: Revising previous number of mismatches instead of replacing
it
* substring.c: Using correct memory category for substrings
* stage3hr.c: Penalizing bad introns
* pair.c, pair.h: Pair_nmismatches_region returning number of bad introns
* indel.c: Improved debugging statements
* gmap.c: Including -K for backward compatibility
* stage1hr.c: Merged revision 193193 from branches/2016-06-29-add-listpool
to change from lists to a vector for anchor_segments
2016-06-29 twu
* resulthr.c, resulthr.h: Added UNPAIRED_TERMINALS result type
* stage1hr.c: Handling unpaired_terminals. Consolidated memory allocation
for plus and minus cases in Stage1hr_T object.
2016-06-21 twu
* shortread.c: Made fixes for --force-single-end option to work properly
2016-06-15 twu
* configure.ac: Added provision for user-selected SIMD level
2016-06-09 twu
* cpuid.c: Providing more detailed information from standalone program
* splicetrie.c: Commented out a debugging statement
* splice.c: Restored check for sufficient splice probabilities on splices
* sarray-read.c: Restored _pext_u32 command
* Makefile.gsnaptoo.am, cpuid.c: Added cpuid main program
* stage3.c: Added comment
2016-06-08 twu
* ax_ext.m4: Added -mbmi2 flag for avx2
2016-06-03 twu
* VERSION: Updated version number
* stage1hr.c: Replaced constant value of 15 with
min_distantsplicing_end_matches
* indexdb.c: Removed sanity check on positions filesize, which can fail on
multiple simultaneous instances of the process
* stage1hr.c, stage3hr.c, stage3hr.h: Searching for distant splicing based
on trim
* sarray-read.c: Turning off AVX2-specific version of
fill_positions_filtered_first
* sam_sort.c: Fixed warning message. Fixed memory leak.
* gmap_select.c, gmapl_select.c, gsnap_select.c, gsnapl_select.c: Improved
warning messages
2016-05-25 twu
* pair.c: Fixed calculation of circularpos for plus strand
2016-05-24 twu
* trunk, VERSION, acinclude.m4, bootstrap.gsnaptoo, asm-bsr.m4, ax_ext.m4,
builtin-popcount.m4, configure.ac, index.html, src, Makefile.gsnaptoo.am,
cpuid.c: Merged revisions 189683 through 190434 from
branches/2016-05-12-power8
* uniqscan.c, gsnap.c: Using new interface to Stage3_setup
* iit-read-univ.c: Commented out warning when IIT file cannot be read
* gmap.c: Changed flag names to --max-intronlength-middle,
--max-intronlength-ends, and --trim-end-exons
* stage3.c, stage3.h: Checking for end exon length with minendexon variable
* config.site.rescomp.dev, config.site.rescomp.tst: Added a configuration
file for run-time checking
2016-05-17 twu
* stage3hr.c: Fixed uninitialized variable
* gmap.c, gsnap.c, uniqscan.c: Using new interface to Stage3_setup
* stage3.h: Added variable maxintronlen_ends
* stage3.c: In trimming ends, always going to intron, and not allowing
indel. Comparing end intron length with maxintronlen_ends.
2016-05-06 twu
* dynprog_simd.c: Computing X_prev_nogap correctly for the case of zero gap
penalty
* stage1hr.c: Modified debugging statements
* stage2.c: Added assertions
* acinclude.m4, ax_cpuid_intel.m4, ax_cpuid_non_intel.m4, ax_ext.m4,
configure.ac: Writing own cpuid configure checks based on same code as in
src/cpuid.c
2016-05-01 twu
* trunk, src, dynprog.c, dynprog.h, dynprog_genome.c, gmap.c, gsnap.c,
pair.c, pair.h, sarray-read.c, splice.c, stage1hr.c, stage3.c, stage3.h,
stage3hr.c, stage3hr.h, uniqscan.c: Merged revisions 188721 through 188751
from branches/2016-04-29-improve-alignments
* trunk, Makefile.gsnaptoo.am: Property changes
* VERSION, config.site.rescomp.prd: Updated version number
* config.site.rescomp.tst: Added sanitize flag
2016-04-29 twu
* papers: Removed papers directory from SVN
* src, Makefile.gsnaptoo.am, gmap.c, pair.c, stage3.c, translation.c,
translation.h: Merged revisions 188558 to 188717 from
branches/2016-04-27-alt-codons to allow for alternate genetic codes
2016-04-20 twu
* archive.html, index.html: Updated for latest version
* stage3.c: Not allowing any ambiguous matches at 3' or 5' ends when trimming
* datadir.c: Modified comments
* datadir.c: In find_fileroot, showing preference if <dbroot>.version is
found. Otherwise, handling the case where multiple .version files are
found.
2016-04-04 twu
* archive.html, index.html: Revised for latest version
* splice.c: Checking for more than 10% mismatches in either end. Using
value of min_shortend in Splice_resolve_sense and Splice_resolve_antisense.
* sarray-read.c: Modified debugging statements
2016-03-30 twu
* gmapindex.c: Not creating altscaffold IIT file if no alt scaffolds are
observed
* gmap.c, uniqscan.c: Using new interface to Univ_IIT_altlocp
* VERSION: Updated version number
* index.html: Updated for latest version
* stage3hr.c: Removed low_alias and high_alias fields. Using altlocp,
alias_starts, and alias_ends.
* resulthr.c: Using npaths_primary and npaths_altloc
* gsnap.c, iit-read-univ.c, iit-read-univ.h: Reading altloc IIT file
2016-03-29 twu
* sam_sort.c: Added option --restore-orig-order
* samprint.c: Removed print statement
* iit-read.c: Trying adding .iit suffix first
* stage3hr.c: Turning off DISTANT_SPLICE_SPECIAL, so we can find distant
splices. For substrings, updating found_score only when the new one is
better. Using nmismatches_whole for score field.
* sarray-read.c: Fixed debugging statements
2016-03-17 twu
* gff3_genes.pl.in, gff3_introns.pl.in, gff3_splicesites.pl.in: Handling
Parent and ID fields in exon and CDS types of recent NCBI gff3 files.
Handling new transcript types.
* pair.c: Changed occurrences of abs() to explicit conditional statements,
since abs() can give large integers with -m64 compiler flag
* stage2.c: Added parentheses
* gsnap.c, stage1hr.c, stage1hr.h, uniqscan.c: Added option --max-anchors
* samread.c: Added debugging statements
2016-02-19 twu
* VERSION: Updated for latest version
* index.html: Updated version latest version
* stage3hr.c: Turned off debugging
* stage1hr.c, stage3hr.c, substring.c: Fixed query coordinates for salvage
terminal procedure on minus strand
2016-02-18 twu
* stage3hr.c: Checking for stage2pairs being NULL when running GMAP on
substrings or previous GMAP
* gmap_build.pl.in: Restored removal of fasta_sources and coordsfile
temporary files
* gmap_build.pl.in: Added quotes around bindir programs
* stage3hr.c: Fixed creating stage 2 pairs for circular chromosomes
* stage3.c: Fixed debugging statements
2016-02-17 twu
* indel.c: Require more matches on both ends than the length of the insertion
* indel.c, oligoindex_hr.c, sarray-read.c, stage1hr.c, stage3hr.c,
stage3hr.h, substring.c, substring.h: Removed genomiclength as a field
from Substring_T objects. Fixed overflow bug for large insertion
substrings.
* sarray-read.c: Fixed code for SSE2 compilation
* stage3hr.c: Removed assertion, which is not valid
* indexdb.c: Handling stderr message for single sequence, where number of
seconds is not defined
* stage3hr.c: Changed assertion to handle large genomes
* stage1hr.c, smooth.c, stage3.c, translation.c, translation.h, gmap.c,
gregion.c, gregion.h, stage1.c, stage1.h, sarray-read.c, splice.c,
splice.h: Removed unused variables and parameters
* splicetrie_build.c, sarray-write.c, indexdb.c, indexdb-write.c,
gmapindex.c, dynprog_single.c: Removed unused variables
* splicetrie.c: Using new interface to dynprog procedures
* pbinom.c, outbuffer.c, iit-write.c, get-genome.c: Hiding unused procedures
* pair.c, indel.c, indel.h, sarray-read.c, sarray-read.h, stage1hr.c,
stage3.c, dynprog_single.c, dynprog.c, dynprog_cdna.c, dynprog_cdna.h,
dynprog_end.c, dynprog_end.h, dynprog_genome.c, dynprog_simd.c,
dynprog_simd.h: Removed unused parameters
* output.c, stage3.h: Removed unused parameters in Stage3_print_sam
* oligoindex_hr.c: Fixed comparison between unsigned and signed values
* genome128_hr.c: Hiding procedures specific to GSNAP
* dynprog_cdna.c, dynprog_end.c, dynprog_end.h, dynprog_genome.c,
dynprog_single.c, dynprog_single.h: Changed check for HAVE_SSE4_1 or
HAVE_SSE2 to just HAVE_SSE2
* compress.c, bitpack64-readtwo.c: Put macros around a variable
* stage3hr.c: Using new interfaces to stage 2 procedures
* splicetrie.c: Using new interfaces to dynprog procedures
* gmap.c, stage1hr.c, stage2.c, stage2.h, stage3.c: Removed stage2_source
and stage2_indexsize as return values from procedures
* stage3hr.c: Fixed comparisons of signed and unsigned integers
* gmap.c, stage3.c, stage3.h: Removed stage2 and stage3 benchmarking fields
from Stage3_T object
* output.c, samprint.c, samprint.h, chimera.c, pair.c, pair.h: Removed
unused variables and parameters from pair procedures
* dynprog_single.c, dynprog.c, dynprog.h, dynprog_cdna.c, dynprog_end.c,
dynprog_genome.c, dynprog_simd.c, pairpool.c, pairpool.h: Reduced
parameters for Pairpool_add_genomeskip and Dynprog_traceback_std
* gsnap.c, stage1hr.c, stage3hr.c, stage3hr.h, uniqscan.c: Putting
subopt_levels into Stage3hr_setup. Removing cutoff_level as parameter
from optimal score procedures
* stage1hr.c, stage3hr.c, stage3hr.h: Removed unused parameters from display
and eval procedures
2016-02-16 twu
* sarray-read.c, stage1hr.c, stage3hr.c, stage3hr.h: Removed unused
parameters for stage3hr procedures
* sarray-read.c, splice.c, stage1hr.c, stage3hr.c, stage3hr.h: Removed
unused parameters and variables
* stage1hr.c, stage3hr.c, stage3hr.h: Using new interfaces to functions
without first_read_p
* mapq.c, mapq.h, bitpack64-readtwo.c: Hiding unused functions
* bitpack64-read.c: Put correct macros around variable
* indel.c, splice.c: Using new interfaces to procedures without first_read_p
* sarray-read.c, substring.c, substring.h: Removed unused field first_read_p
* samprint.c: Removed unused parameter concordant_chrpos
2016-02-13 twu
* gsnap.c, stage1hr.c, stage1hr.h, uniqscan.c: Removed unused variables and
parameters in stage1hr.c
2016-02-12 twu
* gmap.c, gsnap.c, indexdb_hr.c, oligo.c, sarray-read.c, sarray-read.h,
stage1hr.c, stage2.c, stage2.h, stage3.c, stage3hr.c: Removed unused
variables and parameters from sarray procedures
* genome-write.c, genome-write.h, gmapindex.c: Removed altstrain_iit as a
parameter to Genome_write_comp32
* genome_sites.c, sequence.c: Hiding unused function
* genome_sites.c, genome_sites.h, access.c: Hiding unused functions
* genome128_hr.c, genome128_hr.h, indel.c, mapq.c, sarray-read.c, splice.c,
splicetrie.c, stage1hr.c, stage3hr.c, substring.c: Removed first_read_p as
a parameter from all genome128 procedures
* compress.c: Removed unused variables
* iit_get.c: Removed unused variables and parameters
* parserange.c, iit-read-univ.c: Removed unused variable
* iit-read.c, iit-read.h, stage3.c: Removed map_bothstrands_p as a parameter
to IIT_print_header
* iit-read-univ.c, iit-read.c, iit-read.h, iit_get.c: Removed sortp as a
parameter from IIT_get_values routines
* get-genome.c, gmap.c, gsnap.c, iit-read.c, iit-read.h, iit_dump.c,
iit_fetch.c, iit_get.c, parserange.c, snpindex.c, uniqscan.c: Removed
labels_read_p as a parameter from IIT_read
* access.c: Restored check for number of attached processes when deallocating
* iit-read.c, iit-read.h, splicing-scan.c: Removed parameter annotationonlyp
from IIT_dump
* genome128.c, gmapindex.c, sarray-read.c, snpindex.c, access.c, access.h,
atoiindex.c, cmetindex.c, genome.c, iit-read-univ.c, iit-read.c,
indexdb-write.c, indexdb.c, sarray-write.c: Removed eltsize as an argument
to Access_mmap routines
* gsnap.c, stage3hr.c, stage3hr.h, uniqscan.c: Added --end-detail flag
* access.c: No longer printing long string of periods and commas during
pre-load
2016-02-09 twu
* gmap.c, gsnap.c: Added message to remove shared memory manually
* access.c: Removed warning message
* spanningelt.c: Removed debugging code
* spanningelt.c: For debugging purposes
* src, atoiindex.c, cmetindex.c, gmapindex.c, indexdb.c, indexdb_hr.c,
sarray-write.c: Using new interface to Access_allocate_private
* sarray-read.c: Using new interface to Access_allocate_shared and
Access_allocate_private. Removed code for USE_CSA.
* genome.c, indexdb.c, indexdb.h, indexdbdef.h: Using new interface to
Access_allocate_shared and Access_allocate_private
* access.c, access.h: If shared allocation fails, now using memory mapping
if possible. Setting access variable.
* gsnap.c, gmap.c, uniqscan.c: Using new interface to Access_setup
* dynprog_cdna.c: Changed variable initialization
* semaphore.c: Changed variable names
* access.c: Storing all semaphore IDs, and looking at their
resident/freeable status. Handling emergency stops better.
2016-02-08 twu
* Makefile.gsnaptoo.am, access.c, semaphore.c, semaphore.h: Put semaphore
commands in a separate file. Fixed small bugs with deleting semaphores
and shared memory.
2016-02-05 twu
* stage2.c: Removed unnecessary calls to abs(). Replaced with a comparison
between gendistance and querydistance.
* shortread.c: Using size_t instead of unsigned long long
* bitpack64-write.c, indexdb_hr.c, junction.c, pair.c, parserange.c,
sarray-write.c, splicetrie_build.c, substring.c, uint8list.c: Using %llu
for formatting instead of %u
* access.c, access.h, genuncompress.c, iit-read-univ.c, iit-read.c,
indexdb-write.c, indexdb.c, sam_sort.c: Changed off_t to size_t for
filesize
* gsnap.c: Removed testing code
* gsnap.c: For testing purposes
2016-02-04 twu
* pairpool.c: Fixed assertion on genomepos
* stage3hr.c: Fixed computation of minus chromosome coordinates for circular
chromosomes
2016-02-03 twu
* gmap.c: Creating altlocp, alias_starts, and alias_ends for user-provided
genomic segment
* coords1.test.ok: Revised for alternate genomic contigs
* gsnap.c: Allowing for npaths_primary and npaths_alternate. Letting
insertion length be arbitrarily long when user does not specify
--max-middle-insertions.
* gmap.c, stage3hr.h, stage3.h: Allowing for npaths_primary and
npaths_alternate
* uniqscan.c: Using new interfaces to functions
* substring.h, substring.c: Trimming novel spliceends for substrings
* stage3hr.c: Allowing for npaths_primary and npaths_alternate. Changed
logic for extending substrings using GMAP. Implemented extension of GMAP
alignments.
* stage3.c: In find_novel_spliceends, using trim lengths at ends to define
two regions, one with a stronger and one with a weaker criterion for
splice site probability.
* stage2.c, stage2.h: Implemented Stage2_compute_starts and
Stage2_compute_ends for extending ends of alignments. Fixed a condition
for termination of while loop.
* stage1hr.h: Allowing for npaths_primary and npaths_alternate. Allowing
for arbitrarily long insertions when --max-middle-insertions is not set by
user.
* stage1hr.c: Allowing for npaths_primary and npaths_alternate. Making call
to extend gmap alignments. Allowing for arbitrarily long insertions when
--max-middle-insertions is not set by user
* sarray-read.h, sarray-read.c: Allowing for arbitrarily long insertions
when --max-middle-insertions is not set by user
* samprint.c, samprint.h: Allowing for npaths_primary and npaths_alternate.
Added parameter for artificial mate in --add-paired-nomappers.
* pairpool.c: Added assertion for genomepos
* pair.c, pair.h: Allowing for npaths_primary and npaths_alternate. Added
function Pair_trim_distances, used by new find_novel_spliceends function
for pairs.
* output.c: Allowing for npaths_primary and npaths_alternate. Using new
interface using artificial_mate_p.
* gmapindex.c: Allowing for alternate scaffolds
* dynprog_simd.c: Calling correct printing procedures for debugging
* dynprog_genome.c: Requiring finalscore to be >= 0
* Makefile.gsnaptoo.am: Added parserange.c and parserange.h for sam_sort
2016-01-15 twu
* src, result.c, result.h, resulthr.c, resulthr.h: Merging revision 182439
from branches/2014-09-04-secondary-chr to handle npaths_primary and
npaths_altloc
* src, iit-read.c, iit_store.c: Merging revision 182435 to use new interface
to Chrom_from_string
* src, parserange.c: Merging revision 182431 from
branches/2014-09-04-secondary-chr to check if contig_iit is NULL
* Makefile.gsnaptoo.am: Merging revisions 162111 and 182429 from
branches/2014-09-04-secondary-chr to add tableint.c, tableint.h,
parserange.c, and parserange.h where needed
* chrom.c, chrom.h: Merged revisions 162112 and 182427 from
branches/2014-09-04-secondary-chr to add fields for alt_scaffold_start and
alt_scaffold_end
* table.c, tableint.c, tableuint.c, tableuint8.c: Merged revision 162110
from branches/2014-09-04-secondary-chr to fix memory leak
* iit-read-univ.c, iit-read-univ.h: Merged revision 182424 from
branches/2014-09-04-secondary-chr to add function Univ_IIT_altlocp
* util, fa_coords.pl.in, gmap_build.pl.in, gmap_process.pl.in: Merged
revisions 146896 through 182422 from branches/2014-09-04-secondary-chr
* index.html: Updated to latest version
* archive.html: Added revision for 2014-12-31.v2
* stage1hr.c: Fixed array overflow in segmentation procedure
2016-01-14 twu
* uniqscan.c: Using new interface to Stage3hr_setup
* gsnap.c, stage3hr.c, stage3hr.h: Distinguishing between pairmax_linear and
pairmax_circular
* pair.c: Removed SOFT_CLIPS_AVOID_CIRCULARIZATION code in computing
circularpos, since it isn't needed
* substring.c, substring.h: Defining Substring_mandatory_trim_left and
Substring_mandatory_trim_right.
* stage3hr.c: Turning SOFT_CLIPS_AVOID_CIRCULARIZATION back on. Using
Substring_mandatory_left_trim and Substring_mandatory_right_trim.
2016-01-13 twu
* gmap_select.c, gmapl_select.c, gsnap_select.c, gsnapl_select.c: Checking
for ENOENT instead of EACCES
* substring.c: Handling trim_left_action and trim_right_action, instead of
trim_left_p and trim_right_p
* substring.h, stage3hr.c: Using trim_left_action and trim_right_action,
instead of trim_left_p and trim_right_p
2016-01-12 twu
* gmap_select.c, gmapl_select.c, gsnap_select.c, gsnapl_select.c: Returning
return code from execvp
* gmap_select.c, gmapl_select.c, gsnap_select.c, gsnapl_select.c: Handling
case where no path is given, by using execvp to find the correct program
* stage3hr.c, substring.c: Computing coordinates correctly in salvage
procedure for terminal alignments with too many mismatches
2016-01-11 twu
* iit-read.c: Removed debugging code which was not adding .iit ending to
file suffix
2016-01-08 twu
* dynprog_cdna.c, dynprog_end.c, dynprog_genome.c, dynprog_single.c: Added
parameters for Dynprog_standard in non-SIMD code
* samprint.c, stage3hr.c, stage3hr.h: Changed name of function from
Stage3end_substring2 to Stage3end_substringN
* gsnap.c, sarray-read.c, sarray-read.h: Not allowing for ambiguous splicing
on circular chromosomes
* dynprog_single.c: Added assertion
* dynprog_genome.c: Checking for the case where no intron is found in a gap
2016-01-07 twu
* stage1hr.c: Removed extra debugging code
* sarray-read.c, stage3hr.c: Handling Junction_gc from within
Stage3end_new_substrings
* stage1hr.c: Defining MAX_ANCHORS instead of EXHAUSTIVE_ANCHORS. Keeping
track of both all_segments and anchor_segments, and using whichever
satisfies MAX_ANCHORS.
* dynprog_cdna.c, dynprog_end.c, dynprog_genome.c, dynprog_simd.h,
dynprog_single.c: Replacing DEBUG14 and DEBUG16 with DEBUG_SIMD and
DEBUG_AVX2, respectively
* dynprog.c: Allocating space needed for AVX2 debugging
* dynprog.h: Providing SIMD variables for non-AVX2 debugging procedures
* dynprog_simd.c: Fixed SIMD variables in non-AVX2 debugging procedures
* dynprog_simd.c: Added code for AVX2
* dynprog_cdna.c, dynprog_end.c, dynprog_genome.c, dynprog_simd.h: Passing
debugging parameters for both SIMD and AVX2 debugging
* dynprog.c: Generalized allocation procedures to use ALIGN_SIZE
* stage3hr.c: In Stage3end_optimal_score and Stage3pair_optimal_score,
turning off comparison of score_eventrim with cutoff_level. In
Stage3end_new_terminal, if number of mismatches between pos5 and pos3
exceeds number allowed, then recomputing pos5 or pos3 that does fit within
the number allowed.
* stage1hr.c: Allowing for an exhaustive set of anchor segments
2015-12-19 twu
* types.h: Revised comment
* dynprog.h: Moved definitions of infinite gap penalties here
* dynprog.c: Adjusting for negative infinity in last_nogap in F loop
* atoi.c, cmet.c: Added types to make sure 64 bits are used
2015-12-18 twu
* dynprog_simd.c: Fixed initial conditions for all three types of initial
gap penalty
2015-12-15 twu
* dynprog.c: Added initializtion for ZERO_INITIAL_GAP_PENALTY. Made fixes
for initial Fgap calculation for standard initial gap penalty, by
initializing last_nogap appropriately.
* dynprog_simd.c: Using a filter on lband for the E calculation for the
first column when using a standard initial gap penalty
* dynprog_end.c: Added function needed for debugging
* dynprog_simd.c: For standard initial gap penalty, revising extend_ladder
for first column of values, in second and later blocks.
* dynprog_simd.c: Implemented code for ZERO_INITIAL_GAP_PENALTY and
INFINITE_INITIAL_GAP_PENALTY. Added filters on lband for first column of
values in ZERO_INITIAL_GAP_PENALTY.
2015-12-11 twu
* dynprog.c, dynprog.h: Added upperp and lowerp parameters to
Dynprog_standard, to give it the same behavior as the SIMD upper and lower
procedures
2015-12-10 twu
* VERSION: Updated version number
* oligoindex_hr.h: Removed duplicate definition of Shortoligmer_T
* trunk, config.site.rescomp.tst, src, Makefile.gsnaptoo.am,
Makefile.pmaptoo.am, alphabet.c, alphabet.h, atoi.h, bitpack64-read.c,
bitpack64-read.h, block.c, block.h, cmet.h, compress-write.c, dynprog.c,
dynprog_single.c, gmap.c, gmapindex.c, indexdb-write.c, indexdb-write.h,
indexdb.c, indexdb.h, oligoindex.h, oligoindex_hr.c, oligoindex_pmap.c,
oligoindex_pmap.h, oligop.c, oligop.h, pair.c, pmap_select.c, pmapindex.c,
stage1.c, stage2.c, stage2.h, stage3.c, types.h: Merged revisions 179384
through 180698 from branches/2015-11-20-pmap
2015-12-09 twu
* indexdb.c: Defining blocksize for PMAP
* stage3hr.c: Adjusting cutoff levels in Stage3end_pair_up_concordant to
consider the best alignment on each end, if has mismatches that exceed the
given cutoff level. Added a field distant_splice_p, and using that for
filtering, rather than chrnum == 0.
* output.c, samprint.c, samprint.h: Printing NH to be the maximum of the two
npaths in --add-paired-nomappers option
2015-12-07 twu
* trunk, VERSION, config.site.rescomp.prd, config.site.rescomp.tst, src,
Makefile.gsnaptoo.am, atoi.c, atoi.h, atoiindex.c, bitpack64-access.c,
bitpack64-access.h, bitpack64-incr.c, bitpack64-incr.h, bitpack64-read.c,
bitpack64-read.h, bitpack64-readtwo.c, bitpack64-readtwo.h,
bitpack64-write.c, bitpack64-write.h, block.c, block.h, cmet.c, cmet.h,
cmetindex.c, gdiag.c, genome_sites.c, gmap.c, gmapindex.c, gsnap.c,
indexdb-write.c, indexdb-write.h, indexdb.c, indexdb.h, indexdb_hr.c,
indexdb_hr.h, oligo.c, oligo.h, oligoindex_hr.c, sarray-read.c,
sarray-write.c, snpindex.c, spanningelt.c, spanningelt.h,
splicetrie_build.c, stage1.c, stage1hr.c, types.h: Merged revisions 179335
through 180340 from branches/2015-11-20-16mers to allow for genomic
indices up to 18-mers
* index.html: Updated for latest version
2015-12-04 twu
* samprint.c: Fixed printing under --add-paired-nomappers for unpaired
multiple alignments where npaths2 > npaths1
2015-12-02 twu
* stage2.c: In looking back, skipping positions where no hits were found
2015-12-01 twu
* configure.ac: Building only one level of SIMD
* ax_ext.m4: Added "no" flags for various SIMD levels
2015-11-19 twu
* gsnap.c: Fixed incorrect check on floating-point values for --min-coverage
* gmap_select.c, gmapl_select.c, gsnap_select.c, gsnapl_select.c: Picking
best available program at run-time
* stage1hr.c: In computing gmap using segments, introducing a min_genomepos
and max_genomepos
* samread.c: Fixed incorrect parsing needed by sam_sort, resulting from
missing break commands in case statements
* samprint.c: Fixed some issues with --add-paired-nomappers option
* inbuffer.c: Initializing variables
* pair.c, stage3hr.c: Excluding alignments to circular chromosomes that
extend below the first copy or above the second copy
* atoiindex.c, cmetindex.c, gmapindex.c: Using new interface to suffix array
write procedures
* sarray-write.h, sarray-write.c: Using a list of Cell_T objects to compute
exceptions
* bytecoding.c, bytecoding.h: Implemented procedures for writing bytes file
and interleaving bytes files.
* Makefile.gsnaptoo.am: Removed uinttable.c and uinttable.h from programs
with bytecoding.c
2015-11-13 twu
* Makefile.gsnaptoo.am: Added uinttable files needed by bytecoding
2015-10-29 twu
* stage3hr.c: Turning off SOFT_CLIPS_AVOID_CIRCULARIZATION. Handling
deletions on minus strand that go beyond genomic position 0.
2015-10-06 twu
* Makefile.gsnaptoo.am: Added "=1" to some pre-processor flags
* access.h: Added LOADED type for cases where IIT file is loaded from memory
instead of being read from file
* iit-write.c: Adding padding to character arrays in IIT file, so the
integer arrays are aligned.
* iit-read.c, iit-read.h: Added IIT_load function to obtain IIT from a
region of memory
2015-09-28 twu
* oligoindex_hr.c: Handling the case where left_plus_length < indexsize
2015-09-24 twu
* gmap.c, gsnap.c: Revised explanation message for illegal instructions
* datadir.c: Increased buffer size for dbversion file name
* access.c: Looping until we get a semaphore, either by creation or by using
existing one
2015-09-21 twu
* stage3hr.c: Using nsegments field for all alignment types. Filtering
results if nsegments is relatively high compared with the best result.
* stage3.c: Using higher standard for microexons. Using nmatches instead of
support for sufficient_splice_prob.
* stage1hr.c: Handling case of paired-end alignments where both ends do not
satisfy minimum coverage. Fixed debugging statements.
* splice.c: Using correct variable names inside FREEA calls
* pair.c, pair.h: Implemented Pair_maxnegscore
* dynprog_genome.c: Revised bridge_intron_gap procedures to make reasoning
clearer. Using weaker values for scoreI. Using maxnegscore to filter out
bad alignments.
2015-09-16 twu
* stage1hr.c: Removed debugging comment
* stage1hr.c, stage3hr.c: Fixed calls to Genome_get_segment_blocks_left,
where we were providing the left coordinate instead of the right one.
* genome.c: Added comment
2015-09-11 twu
* cpuid.c: Added to repository
* uniqscan.c: Using new interfaces to stage 1 procedures
* substring.c, substring.h: Removed reject_trimlength
* stage3hr.c, stage3hr.h: Added procedures for filtering by coverage
* stage1hr.h, gsnap.c: Added --min-coverage and removed --terminal-threshold
* stage1hr.c: Added --min-coverage. Changed criteria for running
find_terminals.
* gmap_build.pl.in: Allowing for spaces in destination directory
2015-09-08 twu
* stage3hr.c: Fixed debugging statement
2015-09-01 twu
* gmap.c: Setting some uninitialized variables for chimera
* chimera.c: When chimeric breakpoint is beyond chromosomal bounds,
returning NN for dinucleotides
* stage3hr.c: Favoring non-zero sensedirs when sorting results
* splice.c: Fixed variable names for FREEA command, which was observed by
compiler only when alloca is not available.
* oligoindex_hr.c: Initializing some return variables when exiting trimming
procedure early
* chimera.c, chimera.h, gmap.c, pair.c: Fixed issues when chimeras extend to
beginning or end of chromosomes, causing a search for donor and acceptor
nucleotides beyond chromosomal bounds. Fixed Pair_pathscores to extend to
last pair of path.
* dynprog_genome.c: Resolved fatal bug in bridging intron gaps when when no
probabilities are found
2015-08-31 twu
* stage1.c: No longer using alloca for array of Batch_T objects
2015-08-28 twu
* gmap.c: Changed MALLOC of array to MALLOC_OUT. Fixed code for memusage.
* outbuffer.c: Changed FREE of outputs to FREE_KEEP
* stage3.c: Using known splices in pick_cdna_direction. Changed MALLOC of
Stage3_new to MALLOC_OUT.
2015-08-27 twu
* stage3.c: Changed criterion for evaluating splice neighborhood to allow
for short ends
2015-08-24 twu
* sarray-read.c: Using max_mismatches_allowed from original call to suffix
array algorithm, and not allowing it to be unlimited
* splice.c: Added debugging statements
* genome.c: Setting end of genomealt string to be NULL
* dynprog_genome.c: Changed loop end condition to avoid accessing
uninitialized variables
* stage3hr.c: Calling Genome_get_segment_blocks_left with chroffset and not
chrhigh. Setting resolve value to be -1 in case of AMB_UNRESOLVED_TOOCLOSE.
2015-08-19 twu
* gsnap.c: Removed extraneous space after newline
* access.c: Added header file
2015-08-13 twu
* gsnap.c: Using new interface to SAM_setup
* oligoindex_hr.h: Changing Inquery_T types to be unsigned
* oligoindex_hr.c: Removed old code that caused counts to be incremented
twice
* access.c, access.h, atoiindex.c, cmetindex.c, genome.c, gmapindex.c,
indexdb-write.c, indexdb.c, sarray-read.c, sarray-write.c, snpindex.c:
Replaced Access_allocate with Access_allocate_private and
Access_allocate_shared
* gsnap.c, samprint.c, samprint.h: Added option
--paired-flag-means-concordant
2015-08-11 twu
* genome.c, indexdb.c, indexdbdef.h, sarray-read.c: Storing keys from shared
memory to check semaphores to see if the memory should be retained
* access.c, access.h, gsnap.c: Added code for preloading and unloading of
shared memory
* configure.ac: No longer offering options to enable or disable CPU types
* ax_ext.m4: Various changes to handle CPU types and features
* oligoindex_hr.c, oligoindex_hr.h: Created Inquery_T type to handle both
SSE2 and non-SSE code
* genome128_hr.c: Fixed branch for clz/ctz for SSE4.2
* configure.ac: Removing mpi for now
2015-08-10 twu
* ax_ext.m4: Requiring BMI2 as part of AVX2
* sarray-read.c: Allowing stream_load of si128 only for HAVE_SSE4_1
* genome.c, genome.h, indexdb.c, indexdb.h, sarray-read.h: Commented unused
procedures for shmem_remove
* trunk, Makefile.am, acinclude.m4, ax_compiler_vendor.m4, ax_ext.m4,
config.site.rescomp.tst, configure.ac, src, Makefile.gsnaptoo.am, cpuid.h,
genome128_hr.c, gmap.c, gmap_select.c, gmapindex.c, gmapl_select.c,
gsnap.c, gsnap_select.c, gsnapl_select.c, popcount.c, popcount.h,
sarray-read.c: Merged revisions 171384 through 171613 from
branches/2015-08-06-run-time-variants to allow for run-time variants
* trunk, VERSION, config.site.rescomp.tst, index.html, src, oligoindex_hr.c,
oligoindex_hr.h, sarray-read.c: Merged revisions 170634 to 171595 to add
code for AVX2
2015-08-05 twu
* oligoindex_hr.c: Changed debugging statements
2015-08-04 twu
* oligoindex_hr.c: Added debugging statements
2015-08-03 twu
* gsnap.c, samprint.c, samprint.h: Added flag --add-paired-nomappers
2015-07-28 twu
* oligoindex_hr.c: Restored missing line in counting of 9-mers
2015-07-23 twu
* VERSION: Updated version number
* stage1hr.c: Removed an abort command from debugging
* sarray-read.c: Using new interface to Bytecoding lcp_next function.
Commented out code that is not used when SUBDIVIDE_ENDS is not defined.
* bytecoding.c, bytecoding.h: Call to lcp_next now returns child_next
2015-07-22 twu
* VERSION: Updated version number
* dynprog_genome.c: Fixed boundaries that led to negative coordinates for
splice site candidates.
* stage1hr.c: Removed unused variables
* stage1hr.c: Removed allvalidp as parameter to align_end and align_pair.
* stage1hr.c: Setting spanningsetp and completesetp to false if querylength
< min_kmer_readlength
* stage1hr.c: Removed restriction on min_readlength. Running only suffix
array, if possible, if reads are too short.
* access.c: Changed user message
* sarray-write.c: Changing plcp[n] to be 0 instead of -1
* sarray-read.c: Improved debugging results
* access.c: Printing user message if shmem fails
2015-07-17 twu
* get-genome.c, sequence.c, sequence.h: Added flags for --stream-chars and
--stream-ints
2015-06-26 twu
* trunk, Makefile.gsnaptoo.am, 2015-statgen, algorithm.tex, discussion.tex,
features.tex, introduction.tex, util: Modified mergeinfo
* config.site.rescomp.tst: Updated version
* index.html: Updated for version 2015-06-23
* archive.html: Updated for version 2014-12-31
* README: Removed references to Goby
* src, access.c, bigendian.c, bigendian.h, bitpack64-access.c,
bitpack64-read.c, bitpack64-readtwo.c, bytecoding.c, compress.c,
compress.h, genome-write.c, genome.c, genome.h, genome128_hr.c,
iit-read-univ.c, indexdb.c, indexdb_hr.c, sarray-read.c, sarray-write.c,
snpindex.c, types.h, univinterval.h: Merged revisions 167282 through
168383 from branches/2015-06-10-bigendian to support bigendian
architectures
* Makefile.dna.am, Makefile.util.am: Added instructions for check-bigendian
2015-06-23 twu
* VERSION, config.site.rescomp.tst: Updated version number
* algorithm.tex, biblio.bib, discussion.tex, features.tex, introduction.tex,
toplevel.tex: Final version
* stage1hr.c: Added comments
* gmap.c: Removed message about different batch levels
* gsnap.c: Added option --master-is-worker for MPI version
* access.c: Using malloc whenever shmget fails
2015-06-15 twu
* stage1hr.c: Removed extra #endif statements
* trunk, VERSION, config.site.rescomp.tst, Makefile.gsnaptoo.am,
2015-statgen, Ambiguous-splicing.eps, Hierarchical-GMAP.eps,
Large-hash-table.eps, Overlapping-alignment.eps, biblio.bib, toplevel.tex,
util: Updated version number
* stage1hr.c: Fixed indentation
* src, genome.c, genome128_hr.c, gmap.c, gsnap.c, indexdb.c, mode.h,
sarray-read.c, stage1hr.c, substring.c, uniqscan.c: Merged revisions
165630 through 167691 from branches/2015-05-13-ttoc to implement ttoc mode
* splice.c: Applied revision 167580 from releases/public-2014-12-17. In
group_by_segmenti_aux and group_by_segmentj_aux, checking plusp for each
individual hit in deciding whether to group donor or acceptor.
* bitpack64-readtwo.c: Added debugging statements
* sarray-read.c: Defining a variable for debugging
* oligoindex_hr.c: Defining reverse_nt for machines without SSE4.1
2015-06-11 twu
* stage3hr.c: Changed occurrences of Uintlist_next to Uint8list_next for
LARGE_GENOMES
* oligoindex_hr.c: Providing alternative to _mm_extract_epi32 for machines
without SSE4.1
* acinclude.m4, shm-flags.m4, configure.ac, access.c: Including check for
SHM_NORESERVE
* Makefile.gsnaptoo.am: Removed -lrt
* sarray-read.c: Initializing chromosome values to be those for chrnum 1 to
handle left == 0
2015-06-10 twu
* VERSION, index.html: Updated version number
* sarray-write.c, gmapindex.c: Removing rankfile
* gmap_build.pl.in: Changed flag from --no-sarray to --build-sarray
* atoiindex.c, cmetindex.c: Added flag --build-sarray
2015-06-09 twu
* indel.c: Added debugging statements
* stage1hr.c: Bypassing gmap on region if mappingend is less than or equal
to mappingstart, which can happen if the region is pushed to the beginning
or end of the chromosome
* stage3hr.c: Assigning loop variable to given junctions before we push
left_ambig
2015-06-06 twu
* stage3.c: Reversed last revision, and put trim_novel_spliceends at
beginning of path_trim, since putting it at the end results in an infinite
loop
* stage3hr.c: Added debugging statement
* stage3.c: Moved trimming of novel spliceends from beginning of path_trim
procedure to end
* pair.c: Fixed computation of circularpos for minus alignments
2015-06-05 twu
* samprint.c: Removed unused variables
* stage3hr.c: In printing translocations, getting separate chrs for the two
halves. Turned on TRANSLOC_SPECIAL.
* samprint.c: In printing halfdonors and halfacceptors, comparing endlengths
to trimlengths to determine whether to print H or S in CIGAR string
* samprint.c: Fixed printing of CIGAR strings for minus alignments
2015-06-04 twu
* stage1hr.c: Added lowpos and highpos to Segment_T object. Rewrote dynamic
programming procedures for converting segments to pairs.
2015-06-03 twu
* stage1hr.c: In converting segments to GMAP, changed criteria for dynamic
programming to be relative to anchor_segment and not to segment[k].
2015-06-02 twu
* sarray-read.c, stage3hr.c: Using new interface to Substring_new_ambig
* substring.c, substring.h: Setting trim_left and trim_right for ambiguous
substrings
* stage3hr.c, substring.c, substring.h: Renamed outofbounds variables to
outofbounds_start and outofbounds_end. Handling the case where the
alignment is out of bounds to the left of the current chromosome.
* VERSION: Updated version number
* archive.html, index.html: Made changes for new version
* stage1hr.c: Handling the case where floors is NULL, such as for a poly-A
read
* stage3hr.c: Fixed genomic segments for converting substrings to GMAP
* stage1hr.c: For converting segments to GMAP, fixed criteria for allowing
non-monotonic query orders and possible insertions
* stage3hr.c: Fixed bug in referring to uninitialized substring
* substring.c, substring.h: Removed left_genomicseg field
* stage3hr.c: In converting substrings to GMAP, using correct genomic
nucleotide now
* gsnap.c: Made batch level 4 the default
* stage1hr.c: Reordered search algorithms. Limiting number of anchor
segments, and pairing up those instead. Disabling doublesplicing
algorithm.
* sarray-read.h, sarray-read.c: Removed references to sarray_gmap
* pair.c, pair.h: For GSNAP default output format, no longer printing pair
info for single-end reads
* memory-check.pl: Handling results for non-threaded runs
* sarray-read.c: Fixed memory leak
2015-06-01 twu
* stage1hr.c: Deferring read_oligos until we need them for spanning set or
complete set algorithms
* stage1hr.c: Fixed call to single hit alignment of GMAP. Made batch level
4 the default for memory.
* stage1hr.c: Allowing terminal alignments only if no single-end alignments
are found, or if no concordant alignments are found.
* sarray-read.c: Fixed memory leak
* stage1hr.c: Limiting number of anchor segments. Implementing terminal
alignments.
* stage3hr.c: Using new interface to Substring procedures
* substring.c, substring.h: Removed unused variables
* samprint.c: Removed obsolete code for printing specific GSNAP types
* stage1hr.c: Implemented finding of terminals based on anchor segments
* stage3hr.c: Fixed accumulation of ilength_high. In comparing GMAP against
substrings, iterating through all substrings.
* stage3hr.c, stage3hr.h, substring.c, substring.h: Fixed issues with
substring boundaries, computing genomic_diff, and marking mismatches
* stage2.c: Removed GMAP-specific code from GSNAP
* samprint.c: Changed call to get querylengths
* genome128_hr.c, genome128_hr.h: Removed mismatch_offset
2015-05-31 twu
* samprint.c: Removed references to Pair_check_cigar. Changed calls to get
cdna_direction to those for sensedir.
* pair.c: Removed printing of state
* substring.c: Removed references to genomicstart_adj and genomicend_adj in
converting substrings to pairs
* stage3hr.h: Removed interface for Stage3end_indel_pos
* stage3hr.c: Changed calls to Substring_new for insertion and deletion
types to conform to new substrings standards, where each substring has its
genomicstart and genomicend adjusted for indels. Removed indel_pos and
indel_low fields from Stage3end_T object. Removed code for printing
separate GSNAP types.
* stage3hr.c: Setting trim_left, trim_right, trim_left_splicep, and
trim_right_splicep for substring hit type
2015-05-29 twu
* stage3hr.c: Fixed coordinate error in test_hardclips
* stage3hr.c: Fixed typo
* samprint.c, stage3hr.c: Fixed issues in finding substring_low for minus
alignments using hardclip_low
* stage3hr.c: Fixed computation of ilength for substrings
* trunk, README, Makefile.gsnaptoo.am, 2015-statgen, Ambiguous-splicing.eps,
DP-triangles.eps, Diagonalization.eps, Hierarchical-GMAP.eps,
Large-hash-table.eps, Overlapping-alignment.eps, SIMD-oligomers.eps,
Vertical-format.eps, algorithm.tex, biblio.bib, context.tex,
discussion.tex, features.tex, introduction.tex, toplevel.tex, src, diag.c,
diag.h, diagpool.c, diagpool.h, doublelist.c, doublelist.h,
genome128_hr.c, gmap.c, gsnap.c, indel.c, indel.h, intlist.c, intlist.h,
junction.c, junction.h, list.c, list.h, oligoindex_hr.c, oligoindex_hr.h,
pair.c, pair.h, samprint.c, samprint.h, sarray-read.c, sarray-read.h,
sequence.c, splice.c, splice.h, splicing-score.c, stage1hr.c, stage1hr.h,
stage2.c, stage2.h, stage3.c, stage3.h, stage3hr.c, stage3hr.h,
substring.c, substring.h, uint8list.c, uint8list.h, uintlist.c,
uintlist.h, uniqscan.c, univdiag.c, univdiag.h, univdiagdef.h, util:
Merged revisions 162218 to 166640 from branches/2015-03-28-sarray-gmap,
2015-03-31-new-sarray-, 2015-05-07-sarray-ambig, 2015-05-21-segment-gmap,
and 2015-05-22-fast-oligoindex
* trunk, config.site.rescomp.tst: Updated version number
* index.html: Made changes for 2014-12-29
* samprint.c: Moved position of #endif line
2015-05-27 twu
* substring.c, stage3.c: Fixes to debugging statements
* samprint.c, samprint.h: Revisions to SAM_compute_chrpos
* output.c: Using new interface to SAM_compute_chrpos
2015-05-19 twu
* src, gmapindex.c: Allowing genomecomp to be a command-line argument.
Merged changes from branches-2015-05-15-compressed-sarray to allow for
compressed suffix arrays.
* util, gmap_build.pl.in: Providing genomecomp file as a command-line
argument, instead of piping it into gmapindex
* sarray-write.c, sarray-write.h: Merged changes from
branches/2015-05-15-compressed-sarray to allow for compressed suffix
arrays, but removed csafile needed for debugging
* sarray-read.c: Turning off code for compressed suffix arrays
* indexdb-write.c, indexdb-write.h: Allowing the case where genomelength is
less than index1part
* bitpack64-write.h: Improved comments
* access.c: Merged changes from branches/2015-05-15-compressed-sarray to
assign *fd, even if file is empty
* sarray-read.c: Merged code for compressed suffix array. Implemented
different methods for Elt_fill_positions_filtered, depending on whether
the filtering occurs more than once.
* gmap.c: Using new interface to Pair_setup
2015-05-15 twu
* output.c: Not computing chrpos for SAMECHR_SPLICE and TRANSLOC_SPLICE
hittypes
* gmap.c, gsnap.c, pair.c, pair.h, uniqscan.c: Fixed issue with printing
nsnpdiffs for GMAP alignments
* stage3hr.c: Turned on TRANSLOC_SPECIAL to remove translocations when
non-translocation alignments are found. Using effective_chr for printing
purposes. Pushing both substrings for a distant splice. Using querystart
and queryend instead of querystart_adj and queryend_adj for computing
insertlength.
* samprint.c: Using Substring_compute_chrpos to compute chrpos based on
substrings instead of Stage3end_T object
* substring.c, substring.h: Implemented Substring_compute_chrpos
2015-05-01 twu
* iit-read.c: Checking for the possibility in IIT_get_highs_for_low and
IIT_get_lows_for_high of a zero-length array.
* stage3hr.c: Fixed order of LtoH substrings for deletions
* oligoindex_hr.c: Replaced count_fwdrev_simd with individual
count_*mer_fwd|rev_simd procedures
2015-04-30 twu
* substring.c: Revised some debugging statements
* stage3hr.c: Retaining old information about sarrayp when copying a
Stage3_T object
* stage3.c: Initializing max_nmatches to be 0 in end-trimming procedures
* Makefile.gsnaptoo.am: Added -lrt to get shm commands
* algorithm.tex, context.tex, features.tex, introduction.tex: Augmented
captions
* biblio.bib, toplevel.tex: Added references
* discussion.tex: Added material
* algorithm.tex, features.tex, introduction.tex: Added citations
* discussion.tex: Added text
* context.tex: Added description of GSTRUCT
* context.tex, discussion.tex: Moved HTSeqGenie to context.tex
* introduction.tex: Added caption
* features.tex: Revisions
* Diagonalization.eps, Hierarchical-GMAP.eps, Large-hash-table.eps,
Overlapping-alignment.eps, SIMD-oligomers.eps: Revised figures
* algorithm.tex: Expanded caption
2015-04-29 matthejb
* discussion.tex: + adding content to discussion
2015-04-29 twu
* context.tex, algorithm.tex: Revisions
* algorithm.tex: Revisions to diagonalization
* toplevel.tex: Changed symbols for logical operations
2015-04-28 matthejb
* discussion.tex: + initial additions to discussion by MB
2015-04-28 twu
* algorithm.tex: Revisions to linear genome
* algorithm.tex: Moved material on large genomes from features.tex to here
* introduction.tex, features.tex: Revisions
* algorithm.tex: Moved section on ranking alignments and eliminating
duplicates to features.tex
2015-04-27 twu
* discussion.tex: Added notes
* algorithm.tex: Changed table
* features.tex, introduction.tex: Revisions
2015-04-27 michafla
* biblio.bib, context.tex: first draft of gmapR writeup
2015-04-24 twu
* Hierarchical-GMAP.eps, algorithm.tex, features.tex, introduction.tex,
toplevel.tex: Revisions
* Ambiguous-splicing.eps, DP-triangles.eps, Diagonalization.eps,
Hierarchical-GMAP.eps, Large-hash-table.eps, Overlapping-alignment.eps,
SIMD-oligomers.eps, Vertical-format.eps: Added figures
2015-04-23 twu
* papers, 2015-statgen, algorithm.tex, context.tex, discussion.tex,
features.tex, introduction.tex, toplevel.tex: Added directory for editing
papers
2015-04-07 twu
* splice.c: Fixed probability calculation for an ambiguous splice
2015-03-27 twu
* stage3hr.c: Allowing insertlength to be negative, up to -pairmax, to allow
for overlaps. For debugging messages involving insert length, using
chromosomal coordinates.
* stage1hr.c: Added address of GMAP alignment to debugging messages
* chimera.c: Added information about querypos and homology to XT field for
GMAP
* samprint.c: Removed old version of adjust_hardclips
2015-03-26 twu
* filestring.c: Turned off debugging output to stdout
* outbuffer.c, master.c, master.h, gsnap.c, filestring.c: Allow possibility
in MPI for output to stdout
* mpidebug.h: Added tag for writing to stdout
* mpidebug.c: Handling debugging output for MPI_BOOL_T as an unsigned char
* gsnap.c: Allowing MPI with only a single thread per rank, by calling
Master_parser as a detached thread
* sarray-read.c: Allowing memory mapping for indexij_access
2015-03-25 twu
* gmap.c, gsnap.c: Added USE_MPI checks around final MPI_Barrier
* VERSION: Updated version number
* trunk, configure.ac, index.html, src, access.c, access.h, atoiindex.c,
cmetindex.c, genome.c, genome.h, get-genome.c, gmap.c, gmapindex.c,
gsnap.c, iit-read-univ.c, iit-read.c, indexdb-write.c, indexdb.c,
indexdb.h, indexdbdef.h, outbuffer.c, sarray-read.c, sarray-read.h,
sarray-write.c, snpindex.c, uniqscan.c: Merged revisions 161768 through
161939 from branches/2015-03-23-shmem to implement shared memory
2015-03-24 twu
* stage3hr.c: In test_hardclips, checking if low and high coordinates are
equal
* stage3hr.c: Fixed comparison of chrpos in adjust_hardclips_right and
adjust_hardclips_left
2015-03-23 twu
* stage3hr.c: In adjust_hardclips, advancing both low_querypos and
high_querypos on either failure, to prevent infinite loop
* stage3hr.c: In adjust_hardclips, advancing either low_querypos or
high_querypos if needed
2015-03-22 twu
* stage3hr.c: Doing a final test_hardclip when shift right and shift left
are not possible
* substring.c: In alias_circular and unalias_circular, updating
genomicstart_adj and genomicend_adj
* stage3hr.c: Changed endpoint test in Stage3end_substring_low
* substring.c: Removed debugging string
* substring.c, substring.h: Added fields genomicstart_adj and genomicend_adj
for substring2 of insertions and deletions to handle computations with
querypos to obtain a genomic position
* stage3hr.c: Using genomicstart_adj and genomicend_adj in insertions and
deletions to handle computations with querypos to obtain a genomic position
2015-03-21 twu
* substring.c, substring.h: Substring_convert_to_pairs now takes
genomicstart_indel_adj
* stage3hr.c: No longer changing left2, genomicstart2, and genomicend2 for
substring2 of insertions and deletions. Providing indel adjustments
instead to Substring_convert_to_pairs.
* pair.c: Made Pairarray_contains_p routine look for any case of a gap or
indel for a given querypos
2015-03-20 twu
* stage3hr.c: In adjust_hardclips, for dual GMAP, added the ability to shift
low_querypos or high_querypos independently to make the low genomicpos and
high genomicpos equal.
* stage3hr.c: In test_hardclips, for dual GMAP, checking that the
coordinates match for the two ends
* stage3hr.c: On recomputing of hardclips near center, decrementing the
higher value to make the clipping more even
* stage3hr.c: Fixed bug in defining left2 for deletion
* stage3hr.c: In find_ilengths, returning false instead of aborting
* VERSION: Updated version number
* list.c, list.h: Implemented List_pop_out
* substring.c: Fixed genomic coordinates to be 0-based when converting from
substrings to pairs
* stage3hr.c: In test_hardclips, fixed bug with uninitialized values. In
adjust_hardclips, checking querypos, querypos-1, and querypos+1 again.
Also, for dual GMAP, checking that genomepos matches for the given
low_querypos and high_querypos, meaning that alignments are similar.
Always doing a recompute of ilengths after adjust_hardclips. Implemented
stripping of gaps and indels that occur between the two parts when doing a
merge overlap.
* stage3hr.c: Subtracting 1 from alignstart or alignend in computing
overlaps. The find_ilengths function returns false if a common point is
not found. Added a test_hardclips step and separate right and left shifts
for adjust_hardclip. Computing a separate genomicstart2 for substring2 of
insertions and deletions.
2015-03-19 twu
* pair.c, pair.h: Implemented Pairarray_lookup
* stage3hr.c: Computing second hardclip from its ilength, not overlap. In
finding common point involving GMAP, skipping introns and indels. Added
code to check that merged overlap pieces are next to each other.
2015-03-18 twu
* stage3hr.c: Fixed bug in some of the initial loops of adjust_hardclips
* splice.c, stage1hr.c: Using only sensedir and not sensep in calling
Substring_new_donor, acceptor, and shortexon
2015-03-17 twu
* stage3hr.c, substring.c, substring.h: Removed unused variables and
parameters. Using sensedir instead of sensep.
* samprint.c: Removed unused parameters and variables
* substring.c, substring.h: Making Substring_print_shortexon use sensedir
instead of sensep. Removed unused parameters.
* stage3hr.c: Calling Substring_print_donor, acceptor, and shortexon
procedures with sensedir instead of sensep
* pair.c, pair.h: Removed unused parameters
* VERSION: Updated version number
* output.c: Using new interface to SAM_compute_chrpos
* samprint.c, samprint.h: Corrected calculations in SAM_compute_chrpos
* stage3hr.c, stage3hr.h: Using substring_LtoH instead of substring_low and
substring_high. Added initial shift in adjust_hardclips. Fixed
calculation of overlap to depend only on common_left and common_right.
* substring.c, substring.h: Changed Substring_chrstart and Substring_chrend
to Substring_alignstart_chr and Substring_alignend_chr
* output.c, samprint.c, samprint.h: Did a reverse merge to undo revision
160876 which used substring_hardclipped instead of substring_low
2015-03-12 twu
* VERSION: Updated version number
* output.c, samprint.c, samprint.h: Revised SAM_compute_chrpos to search for
the hardclipped substring, rather than using substring_low
* stage3hr.c: Changed comment
* shortread.c: Initializing nextchar2 in various procedures
* gsnap.c: Fixed small memory leak
2015-03-11 twu
* stage3hr.c: Adjusting hardclips by checking adjacent positions left and
right of the crossover querypos.
* substring.c: Removed comment
* stage3hr.c: Restored correct ilength calculations for minus strand
2015-03-06 twu
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst, index.html:
Updated version number
* stage3hr.c: Added comparisons in hitpair_sort_cmp to fix issue where
duplicate alignments were not being put together for removal
* oligoindex_hr.c: Implemented bit twiddling and SIMD-based method for
computing reverse_nt
2015-03-03 twu
* stage3.c: Removed automatic trimming of ends less than 12 bp. Fixed bug
in assigning splice pair in end trimming procedures.
* ax_ext.m4: Performing run test for tzcnt_u32 and tzcnt_u64
* stage3hr.c: Made minor fixes in --clip-overlap feature, including fixes to
gaps and overlaps, more even division of overlaps, and preference for
clipping heads rather than tails in cases of ties
* stage3.c: Turning off branch that can lead to bad CIGAR strings
* inbuffer.c: Defining variable needed when MPI_FILE_INPUT is specified
* gsnap.c: Doing a chromosome_iit_setup before worker_setup
* genome128_hr.c: Using HAVE_TZCNT instead of HAVE_BMI1
2015-02-25 twu
* stage1hr.c, stage3hr.c, stage3hr.h: Printing an accession when reporting a
CIGAR error
* inbuffer.c, inbuffer.h: Changed nspaces to be an unsigned int
* gsnap.c: Moved pthread_attr_init to places just before they are needed
* Makefile.gsnaptoo.am: Added master.c and master.h as extra files to be
distributed
* master.c: Added pre-processor macros
2015-02-24 twu
* gsnap.c: Added pre-processor macro around inclusion of master.h
* trunk, VERSION, config.site.rescomp.prd, config.site.rescomp.tst,
Makefile.gsnaptoo.am, index.html, src, filestring.c, filestring.h,
gsnap.c, inbuffer.c, inbuffer.h, master.c, master.h, mpidebug.c,
mpidebug.h, util: Merged revisions 158119 through 159424 from
branches/2015-02-05-mpi-workers-0 to allow for worker threads in rank 0
2015-02-12 twu
* gmap.c: Added debugging statements
* chimera.c, pair.c, pair.h: Providing Pair_pathscores with a
pre_extension_slop parameter. Distinguishing between call to
Pair_pathscores when finding non-extended paths to pair up, and when
finding a breakpoint between the final, extended paths.
* outbuffer.c: Rearranged procedures for compilation to work
* pair.c: In Pair_print_sam, always doing a Pair_compute_cigar
* outbuffer.c: Printing SAM headers on empty files
2015-02-10 twu
* gmap.c: Allowing PMAP to have variables for gff3_separators_p
* gmap.c, gsnap.c, pair.c, pair.h, uniqscan.c: For gff3 output, always
adding a separator line. Added --gff3-add-separators flag to GMAP.
* stage1.c: In find_range, limiting number of results to 100 to avoid
getting bogged down in repeats
* gff3_genes.pl.in, gff3_introns.pl.in, gff3_splicesites.pl.in: For gff3
files without a gene name, always read $chr from line
2015-02-05 twu
* pair.c: GMAP always recompute cigar_tokens, in case merging has affected
them
2015-02-04 twu
* pair.c: Added slop in computing Pair_pathscores, to allow for better
identification of translocations
* gmap.c: Improved debugging statements
* chimera.c: Changed type of some debugging statements
2015-02-02 twu
* trunk, VERSION, src, gmap.c, gsnap.c, pair.c, pair.h, samprint.c,
stage3.c, stage3hr.c, stage3hr.h, uniqscan.c: Merged revisions 157793
through 157918 from branches/2015-01-30-cigar-check to create and check
cigar strings when Stage3_T or Stage3end_T objects are created
2015-01-30 twu
* stage1hr.c: Using new interface to Stage3_compute
* gmap.c: Using new interface to Stage3_compute and Stage3_new. No longer
calling Stage3_recompute_goodness.
* pair.c, pair.h: Implemented Pair_fracidentity_array, which returns goodness
* stage3.h, stage3.c: Changed Stage3_recompute_goodness to
Stage3_compute_mapq. Always recomputing matches and goodness when
this->pairarray is assigned. Removed references to
END_KNOWNSPLICING_SHORTCUT.
2015-01-29 twu
* stage3.c: In Stage3_cmp, using npairs and matches as secondary criteria
beyond goodness
* gmap.c: Cleaned up unused variables and parameters. Using new interface
to Stage3_compute
* filestring.c: Added ability to handle %f
* stage3.c, stage3.h: Cleaned up unused variables and parameters
* stage1hr.c: Using new interface to Stage3_compute
* pair.c: Using false instead of 0
2015-01-28 twu
* gmap.c: Added call to Outbuffer_cleanup()
* outbuffer.c: Moved lock outside of loop to prevent a race condition
* inbuffer.c: Removed check of nextchar == EOF, which causes standard GSNAP
and GMAP not to terminate
2015-01-27 twu
* shortread.c, shortread.h: Fixed some issues with variable names for MPI
code
* outbuffer.c, outbuffer.h: Added Outbuffer_cleanup, which frees array of
outputs
* inbuffer.c: Allowing for gzipped and bzipped2 files in MPI version by
sending and receiving filecontents
* gsnap.c: Calling Outbuffer_cleanup
* gmap.c: Revealed variable needed for debugging
* filestring.c, filestring.h: Implemented Filestring_send and Filestring_recv
* compress.c: Fixed comment
* shortread.c: Made code consistent across text, gzip and bzip2. Added
hooks for filling a Filestring_T object in gzip and bzip2 procedures.
2015-01-26 twu
* index.html: Updated for 2014-12-17.v2
* shortread.c, shortread.h, mpidebug.c, mpidebug.h: Using workers_comm in
MPI_fopen
* inbuffer.c, inbuffer.h: Passing workers_comm to Shortread_read_filecontents
* gsnap.c: Introduced a workers_comm so MPI_File_open and MPI_File_close can
be restricted to that group
* shortread.c: Added debugging statements for opening and closing files
* gsnap.c: Added debugging statements for opening and closing files. For
MPI master using MPI_File input, explicitly closing those inputs.
* gsnap.c: Using new interfaces to Inbuffer_setup, Inbuffer_new, and
Inbuffer_master_process. Master rank 0 no longer calling Inbuffer_new.
* gmap.c: Using new interface to Inbuffer_new
* inbuffer.c: No longer making a special case in fill_buffer for MPI when
nextchar at end of block is EOF.
* shortread.c, shortread.h: MPI procedures for reading from filecontents
also close and open input files
* inbuffer.h, inbuffer.c: Moved nspaces into Inbuffer_T object and into
Inbuffer_new instead of Inbuffer_setup. Made Inbuffer_master_process free
of an Inbuffer_T object.
2015-01-23 twu
* inbuffer.c: Added comments
* gsnap.c: Created separate worker_setup and worker_cleanup procedures
2015-01-22 twu
* inbuffer.c: Assigning filecontents buffers to the IN category for memusage
* trunk, config.site.rescomp.prd, src, gsnap.c, inbuffer.c, inbuffer.h,
shortread.c, shortread.h: Merged revisions 157242 to 157253 from
branches/2015-01-22-mpi-file-block to have worker ranks read blocks into a
buffer
* memchk.c, popcount.c, bitpack64-read.h, bitpack64-serial-read.h,
compress.h, dynprog.h, except.h, genomicpos.h, iit-read.h,
indexdb-write.h, indexdb.h, indexdbdef.h, popcount.h, sequence.h: Added
include of config.h
* configure.ac: Changed variable name from USE_MPI_FILE to USE_MPI_FILE_INPUT
* samheader.h, iit-read-univ.h: Added include of <mpi.h>
* oligoindex_pmap.h, bigendian.h, fopen.h, iitdef.h, littleendian.h, mem.h,
oligoindex.h, types.h: Added explanation of why config.h needs to be
included
* gsnap.c, inbuffer.c, inbuffer.h, shortread.c, shortread.h: Checking for
both USE_MPI and USE_MPI_FILE_INPUT in using MPI_File for input
* chrsubset.h, access.h, alphabet.h, backtranslation.h, block.h,
boyer-moore.h, bp-read.h, bp-write.h, bytecoding.h, bzip2.h, chrom.h,
chrsegment.h, datadir.h, diag.h, diagdef.h, diagpool.h, genome-write.h,
genome128-write.h, genome_hr.h, genome_sites.h, genomepage.h, gregion.h,
iit-write-univ.h, iit-write.h, indel.h, indexdb_hr.h, interval.h,
intlist.h, intpool.h, intron.h, match.h, matchdef.h, matchpool.h,
maxent128_hr.h, maxent_hr.h, oligo.h, oligop.h, pairdef.h, parserange.h,
reader.h, stage1.h, tableuint8.h, tally.h, translation.h, univinterval.h:
Added blank line for formatting
* filestring.h, mpidebug.h, oligoindex_hr.h, samprint.h, sortinfo.h: Removed
include of config.h, since not necessary
* atoi.h, bitpack64-write.h, cmet.h: Added $Id$ string
2015-01-21 twu
* stage3hr.c: Turning on SOFT_CLIPS_AVOID_CIRCULARIZATION again to avoid
duplicates in circular chromosomes
* ax_mpi.m4: Added cc to list of possible values for MPICC, for systems that
use a wrapper called cc
* shortread.c: Fixed parsing issues for blank lines and ends of files
* configure.ac: Added configure flag --enable-mpi-file
* Makefile.gsnaptoo.am: Removed mpi_gmap for now
* gsnap.c, pair.c, pair.h: Added noprint option for --action-if-cigar-error
and made it the default
* gsnap.c, inbuffer.c: Made -q or --part flag work for MPI code
* inbuffer.c: Added ending brace for MPI code
2015-01-20 twu
* shortread.c: Fixed bug in a print statement where a pointer was not being
provided. In input_oneline, making a single read to get nextchar.
* inbuffer.c: Not doing fseek if nextchar is EOF
* gsnap.c: Removed a debugging statement
* filestring.c: Increased size of buffer
* outbuffer.c, outbuffer.h: Added parameter for output_file
* gmap.c: Using new interface to Outbuffer_setup and
Outbuffer_print_filestrings
* samheader.c, samheader.h, iit-read-univ.c, iit-read-univ.h, gsnap.c,
filestring.c, filestring.h: Applied changes from
branches/2015-01-17-mpi-seq
* outbuffer.c, outbuffer.h: Applied changes from
branches/2015-01-17-mpi-seq. Removed code for Outbuffer_mpi_process.
* inbuffer.c: Removed requestid variable from fill_buffer for GMAP
* gmap.c: Put in dummy variables for Inbuffer_new
* trunk, config.site.rescomp.tst, src, filestring.c, filestring.h, gsnap.c,
inbuffer.c, inbuffer.h, mpidebug.c, mpidebug.h, outbuffer.c, shortread.c,
shortread.h: Merged revisions 156908 to 157083 from
branches/2015-01-17-mpi-seq to change the input side of mpi_gsnap
* index.html: Updated for version 2014-12-17
* VERSION: Updated version number
* samprint.c: Consolidated print statements
* output.c: Defining abbrev for a nomapper
* diag.c: Added debugging statement
2015-01-15 twu
* gmap.c, pair.c, stage2.c, stage3.c: Merged revisions 156824 to 156843 from
branches/2015-01-15-fix-chimeras to make better decisions for last exons
having partial alignments
* oligoindex_hr.c: Allowing diagonals where ptr->i < querylength. Reveals
alignments that were otherwise missed.
* gmap.c: Fixed debugging statements to use Sequence_stdout instead of
Sequence_print
* chimera.c, chimera.h, gmap.c: Fixed algorithm for finding non-exon-exon
chimeric breakpoint and finding dinucleotides
2015-01-14 twu
* stage3hr.c: In anomalous_splice_p procedures, checking for samechr_splice
hittypes
* stage1hr.c: Not applying GMAP to samechr_splice hittypes
2015-01-07 twu
* oligoindex_hr.c: Fixed type for positions_space field in Oligoindex_T
* trunk, src, oligoindex_hr.c, oligoindex_hr.h, oligoindex_old.c,
oligoindex_old.h, stage2.c: Merged revisions 154793 through 156263 from
branches/2014-12-06-stage2-larger-kmers to allow for 9-mers in stage 2
* config.site.rescomp.prd, config.site.rescomp.tst, VERSION: Updated version
number
* index.html: Added changes for version 2014-12-16 (v2)
* substring.c: Fixed assertions to account for out-of-bounds regions
* README: Added explanation of XI field
* pair.c, samprint.c, shortread.c, shortread.h: Added code for XI field
2015-01-05 twu
* stage1hr.c: Using correct typecast of ambcoords to (Uint8list_T) NULL for
large genomes
* stage2.c: Fixed uninitialized variable for firstactive
2014-12-16 twu
* gsnap.c, uniqscan.c: Using new interface to Stage3hr_setup
* stage3hr.c, stage3hr.h: Computing outofbounds_left and outofbounds_right.
Using new interface to Substring_new.
* substring.c, substring.h: Added provision for outofbounds_left and
outofbounds_right, to be considered part of trimming
* gsnap.c: Changed input sequence to open input streams to get one character
and determine if it is FASTQ format, and then to do Shortread_setup, and
then to fill the inbuffer.
* sarray-read.c: Fixed typo: spliceends_antisense => spliceends_sense
* substring.c: Removed debugging statement
2014-12-15 twu
* samheader.c: Not printing tabs if there are no headers
* sam_sort.c: Setting fileposition variable for each file
* filestring.c: Handling the case where filestring is NULL
2014-12-12 twu
* doublelist.c: Fixed type error in doublelist_to_array_out
* trunk, config.site.rescomp.prd, Makefile.gsnaptoo.am, src, gsnap.c,
samprint.c, stage1hr.c, stage1hr.h, stage3hr.c, substring.c, substring.h,
uniqscan.c: Merged revisions 154499 through 155289 from
branches/2014-12-03-dna-chimeras
* VERSION, config.site.rescomp.prd, index.html: Updated version number
* sam_sort.c: Revised sam_sort to handle multiple input files
* trunk, Makefile.am, VERSION, bootstrap.gsnaptoo, ax_mpi.m4, config.site,
config.site.rescomp.prd, config.site.rescomp.tst, configure.ac,
memory-check.pl, mpi, src, Makefile.gsnaptoo.am, access.c,
backtranslation.c, backtranslation.h, bool.h, chimera.c, chimera.h,
filestring.c, filestring.h, genomicpos.c, genomicpos.h, get-genome.c,
gmap.c, gsnap.c, iit-read-univ.c, iit-read-univ.h, iit-read.c, iit-read.h,
inbuffer.c, inbuffer.h, md5.c, md5.h, mem.c, mem.h, mpidebug.c,
mpidebug.h, outbuffer.c, outbuffer.h, output.c, output.h, pair.c, pair.h,
request.c, request.h, resulthr.c, resulthr.h, revcomp.c, sam_sort.c,
samflags.h, samheader.c, samheader.h, samprint.c, samprint.h,
sarray-read.c, segmentpos.c, segmentpos.h, sequence.c, sequence.h,
shortread.c, shortread.h, stage1hr.c, stage2.c, stage2.h, stage3.c,
stage3.h, stage3hr.c, stage3hr.h, substring.c, substring.h, translation.c,
translation.h, types.h, uniqscan.c: Merged revisions 154226 to 155279 from
branches/2014-11-27-mpi to implement MPI versions and to use Filestring_T
objects for all output
* genome.c, genome.h: Changed type of gbuffer from unsigned char to char
2014-12-10 twu
* oligoindex_hr.c: Added code for handling 9-mers
2014-12-06 twu
* stage1hr.c: Fixed typo in assigning probs_acceptor
2014-12-05 twu
* trunk, VERSION, config.site.rescomp.prd, src, doublelist.c, doublelist.h,
gsnap.c, samprint.c, sarray-read.c, splice.c, stage1hr.c, stage1hr.h,
stage3hr.c, stage3hr.h, uniqscan.c: Merged revisions 154673 through 154777
from branches/2014-12-04-stage1-ambig to compute ambiguous splicing better
in suffix array, stage1, and combining splices. Fixed memory leak and
changed criteria for comparing across hits
2014-12-04 twu
* samprint.c, stage3hr.c, stage3hr.h: Merged revisions 154673 through 154678
from branches/2014-12-04-stage1-ambig to change XA field
* index.html: Updated for latest version
* configure.ac: Added more detailed messages about our own loading of
config.site files to counteract the warning message from the standard
autoconf loading
2014-12-03 twu
* uniqscan.c: Using new interface to Substring_setup
* gsnap.c: Replaced --terminal-output-minlength with --reject-trimlength
* stage1hr.c, stage1hr.h: Calling Sarray_search_greedy with nmisses_allowed
being cutoff_level, and not querylength. Using reject_trimlength instead
of terminal_output_minlength.
* stage3hr.c, stage3hr.h: Replaced Stage3_filter_terminals with
Stage3_reject_trimlengths
* substring.c, substring.h: Implemented new logic based on
reject_trimlength. True terminals from the GSNAP algorithm are allowed at
this point (but taken care of now by Stage3end_reject_trimlengths).
* sarray-read.c: Improved debugging statements
* stage3hr.c: No longer trying to clip overlaps when the two ends are not in
a concordant orientation
* outbuffer.c: Using new interface to SAM_print_nomapping
* samprint.c, samprint.h: Allowing for non-zero npaths to be printed in
SAM_print_nomapping as an NH field, which can occur with the
--quiet-if-excessive feature
* samread.c: Allowing for the possibility that XO is the first field in a
SAM line
* stage3hr.c: Fixed problem with --merge-distant-samchr feature giving the
wrong chrpos on SAM output on distant splices, since this was being
treated the same as a translocation (chrnum == 0)
* samread.c: Terminating parse_XO procedures for either '\0' or '\n'
2014-12-02 twu
* gmap.c, gsnap.c: Including default variables in --help statement
* stage3hr.c: Calculating common_shift to get more even splits between the
two paired ends, by accounting for the common shared point between
common_right and common_left.
* sarray-read.c: Fixed typo in a for loop
* sam_sort.c: Made --no-sam-headers option work correctly
* sam_sort.c, samheader.c, samheader.h: For --split-output function, writing
SAM header files to each output file
2014-11-27 twu
* archive.html, index.html: Updated for latest version
2014-11-25 twu
* README: Added comment about sam_sort and --split-output
* sam_sort.c, samflags.h, samread.c, samread.h: Added --split-output and
--append-output options
* outbuffer.c: Changed abbrev NM in comment
* stage1hr.c: Changed calculation of amb_nmatches to amb_length
* stage3hr.c: Swapping ilength_low and ilength_high for GMAP when alignment
is minus
2014-11-24 twu
* stage1hr.c: Turning off debugging
* bootstrap.gsnaptoo: Running automake to add missing files
* trunk, util, src, outbuffer.c, pair.c, pair.h, samprint.c, samprint.h,
stage3hr.c, stage3hr.h, substring.c, substring.h: Merged revisions 153682
to 154020 from branches/2014-11-20-redo-overlap to compute overlap better
using ilength53 and ilength35 and a common shift
* sarray-read.c: Merged revisions 153682 to 154020 to handle ambiguous
splicing better
* trunk, INSTALL, VERSION, config.site.rescomp.prd, index.html: No longer
keeping track of INSTALL
* config.guess, config.sub, ltmain.sh: No longer keeping track of
config.guess, config.sub, or ltmain.sh
* gmap_build.pl.in: Added comment about meaning of -D flag
* acinclude.m4, configure.ac: Adding check for MPI
* ax_mpi.m4: Added code for MPI
* src, access.c, bitpack64-read.c, bitpack64-readtwo.c, bitpack64-write.c,
compress-write.c, genome-write.c, genome.c, genome128-write.c,
genome128.c, genome_sites.c, get-genome.c, gmapindex.c, iit-read-univ.c,
iit-read.c, iit_get.c, iit_store.c, indel.c, indexdb-write.c, indexdb.c,
indexdb_hr.c, mem.c, oligoindex_hr.c, sam_sort.c, samheader.c, snpindex.c,
stage1hr.c, stage3.c, table.c, tableuint8.c, uniqscan.c, univinterval.c:
Merged revisions 153114 to 153944 from branches/2014-11-12-make-check-i386
to make tests work in i386 computers
* stage3hr.c: Not using ambiguous splices to update found_score
* stage2.c: Removed adjacentp as unused variables
* samprint.c, samprint.h: For circular alignments, checking for sole HS
pattern. Also checking for chrpos > chrlength, and subtracting chrlength
if necessary.
* pair.c, pair.h: Added Cigar_action_T. Added Pair_check_cigar. Removed
prev as an unused variable.
* iit-write-univ.c: Handling the case if total_nintervals is 0
* gmap.c, gsnap.c: Added --action-if-cigar-error
2014-11-17 twu
* bytecoding.c: Removed unused variable
* gmapindex.c: Printing genome length to stderr
* bytecoding.c: Using a buffer of 10,000,000 block-sizes, and writing
iteratively, rather than a single buffer and single write.
2014-11-13 twu
* gmapindex.c: Made some changes in casting. Fixed printf format to use
%llu.
* stage3hr.c, stage3hr.h: Renamed amb_nmatches to amb_length.
* samprint.c: In adjust_hardclips, not changing hardclips if shift downward
fails. Renamed amb_nmatches to amb_length.
* splice.c, stage1hr.c: Renamed amb_nmatches to amb_length. Providing
Substring_match_length_orig to amb_length in Stage3end_new_splice and
Stage3end_new_shortexon
2014-10-31 twu
* iit-read-univ.c, iit-read.c: Using %llu and casting to (long long int) for
printing offset and filesize
* gmap.c, gsnap.c: Using %zu for printing results of sizeof().
2014-10-29 twu
* stage3hr.c: Restoring revision of SAM insertlength for ends involving GMAP
when method is successful
* stage3hr.c: Fixed SAM output of insert length of 0 when no overlap is
found in a GMAP alignment
* stage3.c: Added debugging statements
* gmap.c, gmapindex.c, gsnap.c: Added output statement at end of checking
compiler assumptions
* README: Added comment about change from PG: to XG:
* ax_ext.m4, configure.ac: Added option to enable or disable sse4.2
* samprint.c: Fixed typo in adjust_hardclips. Also, when querypos increase
fails, trying querypos decrease.
* samprint.c: Fixed infinite loop in adjust_hardclips
2014-10-28 twu
* stage3hr.c: Fixed bug with uninitialized variables
* outbuffer.c: Fixing potential data race as noted by valgrind for
this->ntotal between input and output threads, although not problematic
before, because this->ntotal increases monotonically
2014-10-27 twu
* stage3hr.c: Fixed computation of overlap between GMAP and non-GMAP
alignments
2014-10-22 twu
* gregion.c: Checking size before deciding to use alloca or malloc.
2014-10-16 twu
* stage2.c: Fixed an uninitialized variable in grand_fwd and grand_rev
procedures, plus the checks on maxintronlen in computing
grand_fwd_lookforward and grand_rev_lookforward.
* shortread.c: Allowing queryseq1 to be equal to SKIPPED. Removed unused
parameter acc from input_oneline routines.
* VERSION, index.html: Updated version number
* stage1hr.c: Added debugging statement
* sarray-read.c: Don't limit filling of best elt based on nmatches being
more than half of the read length
* stage3hr.c: Restored previous behavior where soft clips avoid
circularization
* indexdb-write.c, sarray-write.c: Removed unnecessary includes of popcount.h
* bitpack64-write.c, genome128_hr.c: In lookups of clz_table, removing the
intermediate variable "top".
2014-10-15 twu
* stage3hr.c: Not allowing soft-clipping at ends to avoid circularization.
Added pre-processor macro SOFT_CLIPS_AVOID_CIRCULARIZATION to preserve
previous code.
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst, index.html:
Updated version number
* outbuffer.c: Added gff3 header for GMAP gff3 output to stdout. Added HD
and PG headers for GMAP sam output to --split-output files.
2014-10-14 twu
* Makefile.gsnaptoo.am: Changed variable name to SIMD_CFLAGS
* configure.ac: Changed variable name to SIMD_CFLAGS. Setting -mpopcnt
based only on acx_mpopcnt_ok, and not on individual builtin functions.
* builtin-popcount.m4: Setting CFLAGS instead of LIBS. Checking for builtin
functions regardless of whether -mpopcnt works.
* ax_ext.m4: Setting CFLAGS instead of LIBS. Changed variable name to
SIMD_CFLAGS
* builtin-popcount.m4, configure.ac: Changed macro name to
ACX_BUILTIN_POPCOUNT
* acinclude.m4, builtin-popcount.m4, builtin.m4: Renamed program to
builtin-popcount.m4
* popcnt.m4: Added comment
* builtin.m4: Added check from popcnt.m4
2014-10-13 twu
* shortread.c: Made input procedures robust to incomplete entries
* indexdb.c: Fixed bug where munmap was called twice on positions_high for
GSNAPL and GMAPL.
2014-10-09 twu
* VERSION: Updated version number
* ax_ext.m4: Seeing of avx and avx2 are enabled
* configure.ac: Added ability to disable avx and avx2
* gmap_build.pl.in: Calling gmapindex initially to check for compiler
assumptions
* gmapindex.c: Added check for compiler assumptions
* gmap.c, gsnap.c: Added description of --check option to --help message
* gmap.c, gsnap.c: Improved check for compiler assumptions, and added
--check option
* samprint.c: Fixed flag for merged overlap to not have PAIRED_READ set.
When clipdir == 0, not calling adjust_hardclips.
2014-10-07 twu
* sarray-read.c: Fixed bug when setting array_stop and finalptr is less than
4
2014-10-01 twu
* samprint.c: Further fixes to mapping quality for merged alignments
* samprint.c: For merged alignments, printing mapping quality of 40
* pair.c, samprint.c: Putting XB and XP tags after XH
* VERSION: Updated version number
* samprint.c: Calling GMAP procedure for querystart and queryend when
necessary in adjust_hardclips
* stage3.c: Initializing variables
* pair.c: Passing correct values for hardclip_low and hardclip_high to
hardclip_pairs
* samprint.c: Removed static initialization of hide_soft_clips_p
* trunk, src, outbuffer.c, pair.c, pair.h, samprint.c, samprint.h: Merged
revisions 149547 through 149570 from branches/2014-10-01-hardclip-adj to
improve adjustment of hardclipping
* stage3.c: Removed diagnosticp from some procedures
2014-09-30 twu
* sam_sort.c: Providing timing information to user
* sam_sort.c, samread.c, samread.h: Changed algorithm to parse just for
linelengths initially, which allows for the SAM file to be read using
buffers
2014-09-29 twu
* VERSION, index.html: Updated version number
* sam_sort.c: Fixed usage statement
* samread.c, samread.h: Commented out unused procedures
* samread.c: Added header file
* samheader.c, samheader.h, chimera.c, chimera.h, get-genome.c, gmap.c,
gsnap.c, iit-read-univ.c, iit-read-univ.h, outbuffer.c, outbuffer.h,
pair.c, pair.h, samprint.c, samprint.h, shortread.c, shortread.h,
stage3.c, stage3.h, uniqscan.c: Removing computation of .sortinfo file
* Makefile.gsnaptoo.am: Removed sortinfo.c and sortinfo.h
* sam_sort.c, samflags.h, samheader.c, samheader.h, samread.c, samread.h:
Made sam_sort independent of a .sortinfo file. Computes sortinfo
information directly from the SAM input file
2014-09-26 twu
* outbuffer.c: Providing sortinfo to Pair_print_sam_nomapping
* VERSION, index.html: Updated version number
* trunk, index.html, src, outbuffer.c, pair.c, pair.h, sam_sort.c,
samflags.h, samprint.c, samprint.h, samread.c, samread.h, sortinfo.c,
sortinfo.h, stage3.c: Merged revisions 148987 through 149172 from
branches/2014-09-25-sam-sort to yield a working version of sortinfo feature
* stage3hr.c: Reverting to previous version 149160 in trunk
* stage3hr.c: Rewrote Stage3pair_remove_overlap procedures to use
hitpair_overlap_score_cmp, hitpair_overlap_test,
hitpair_equiv_preference_cmp, and hitpair_equiv_tst
2014-09-25 twu
* samprint.c: Not merging paired ends if there is no overlap
2014-09-24 twu
* pair.c: Enabled call to Sortinfo_update to work for GMAP
* VERSION, index.html: Updated version number
* chimera.c, chimera.h, pair.c, samprint.c: Using @ instead of : for
coordinates for XT field. Made GMAP XT field same as that for GSNAP.
* config.site.rescomp.tst: Allowing builtin_popcount
* README: Added description of XH field
* Makefile.gsnaptoo.am: Added samheader.c to sam_sort
* sam_sort.c: Handling single-end alignments
* samread.c: Setting terminating character at end of computation
* samheader.c, samheader.h: Implemented change of HD header to SO:sorted
* sam_sort.c, samread.c, samread.h: Handling hard-clipped alignments
* pair.c, pair.h, samprint.c, shortread.c, shortread.h: Added XH field to
provide hard-clipped sequence
* substring.c: For --merge-overlap feature, in deciding whether to add
insertion, deletion, or intron between the pieces, using <= instead of <
to decide.
* samprint.c: In SAM_compute_chrpos, always using substring_low to compute
chrpos
* indel.c: Initializing variables
* sam_sort.c: Implemented --dups-only and --uniq-only
* sortinfo.c: Fixed typo in comment
* pair.c, pair.h, samprint.c, sortinfo.c, sortinfo.h: Sortinfo_update uses
sign on chrnum to indicate whether a read is the low or high end
* iit-read-univ.c: Using safer computation of an average
* Makefile.gsnaptoo.am, sam_sort.c, samread.c, samread.h: Implemented
--mark-dups function into sam_sort. Works except for hard-clipping.
* samread.c, samread.h: Brought over copy from GSTRUCT
* sam_sort.c: Introduced readindex, needed for marking duplicates
2014-09-23 twu
* Makefile.gsnaptoo.am, iit-read-univ.c, iit-read-univ.h, sam_sort.c,
sortinfo.c: Using genome chromosome_iit file in sam_sort, instead of
storing information in .sortinfo files
* VERSION, index.html: Updated version number
* trunk, src, Makefile.gsnaptoo.am, chimera.c, chimera.h, chrnum.h,
get-genome.c, gmap.c, gsnap.c, iit-read-univ.c, iit-read-univ.h,
outbuffer.c, outbuffer.h, pair.c, pair.h, sam_sort.c, samheader.c,
samheader.h, samprint.c, samprint.h, shortread.c, shortread.h, sortinfo.c,
sortinfo.h, stage1hr.h, stage3.c, stage3.h, types.h, uint8list.h,
uintlist.h, uniqscan.c: Merged revisions 148659 through 148720 from
branches/2014-09-23-sam-sort to add --make-sortinfo option and sam_sort
program
* splice.c: Fixed bug in reading Stage3end_T object that was already freed
2014-09-22 twu
* configure.ac: Loading default ./config.site file only if it exists
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst, configure.ac,
index.html: Handling cases differently if CONFIG_SITE file begins with ./.
Allowing for multiple files in CONFIG_SITE.
* Makefile.gsnaptoo.am: Interchanged order of POPCNT_CFLAGS and SIMD_FLAGS
* samprint.c: Fixed bugs introduced in adding --merge-overlap feature
* INSTALL, config.guess, config.sub, ltmain.sh: Updated to more recent
version of autoconf
* bootstrap.gsnaptoo: Changed /usr/bin/touch to touch
* configure.ac: Testing first for ./ in front of CONFIG_SITE. Added
AC_SUBST of POPCNT_CFLAGS
* shortread.c, shortread.h: Removed unused procedures
* stage1hr.c: Changed nmisses_allowed for sarray method to be querylength,
for both --use-sarray=2 and normal method
2014-09-21 twu
* pair.c: Changed PG field to XG
* configure.ac: Change flag to --disable-builtin-popcount
* splice.c: In procedures that group for ambiguous hits, separating sense
and antisense segments
* substring.c, stage3hr.c, stage1hr.c: Improved debugging statements
2014-09-19 twu
* stage3.c: Using new interface to Pair_print_sam
* ax_ext.m4, builtin.m4: Restoring value of LIBS at end of procedure
* builtin.m4: Setting LIBS to -mpopcnt before running tests
* gsnap.c: Added comment that --merge-overlap is a beta implementation
* trunk, VERSION, config.site.rescomp.tst, index.html, src, gsnap.c, list.c,
list.h, outbuffer.c, outbuffer.h, pair.c, pair.h, samprint.c, samprint.h,
shortread.c, shortread.h, stage3hr.c, stage3hr.h, substring.c,
substring.h: Merged revisions 148190 through 1418357 from
branches/2014-10-18-merge-overlap to add --merge-overlap feature
* ax_ext.m4: Changed name of cpu features to sse4.1 and sse4.2
* configure.ac: Showing pthread and popcnt flags at end
* README: Explaining XG flag
* samprint.c: Changed PG flag to XG flag
* samheader.c: Commenting out code for extra @PG lines
* configure.ac: Setting POPCNT_CFLAGS based on result from builtin.m4
* builtin.m4: Setting a variable ax_cv_compile_builtin_ext
* ax_ext.m4: Setting LIBS properly before running AC_LINK_IFELSE
2014-09-18 twu
* genome128_hr.c: Using Harley's method to reduce number of popcount
operations for SSE2
* genome128_hr.c: Fixed HAVE_MM_POPCNT alternative for SSE2-based popcount
* archive.html, index.html: Updated for version 2014-09-18
2014-09-17 twu
* VERSION: Updated version number
* genome128_hr.c: Added macro for debug4
* genome128_hr.c: Using _mm_extract_epi16 for SSE2 code instead of casting
to (UINT4 *), because casting leads some compilers to generate wrong
ordering of statements
2014-09-16 twu
* ax_ext.m4, configure.ac: Providing information to user about SIMD cpu
features available and compiler flags to be used
* gmap.c, gsnap.c: Added extra information to --version about type of popcnt
supported
* ax_ext.m4: Changed ordering of ifthen clauses. Added variable to indicate
compiler or linker problems.
* configure.ac: Added warnings for compiler or linker problems for SIMD
extensions
2014-09-15 twu
* configure.ac: Changed flag name from --enable-popcnt to
--enable-builtin-popcnt
* genome128_hr.c, sarray-read.c: Using mm_popcnt when _popcnt not available
* ax_ext.m4: Revamped tests to check for CPU, compile, and linking, in that
order. Renamed variables more systematically.
* pair.c, pair.h, pairpool.c, pairpool.h, stage3.c, stage3.h: Moved
Pairpool_clip_bounded to pair.c, and created Pair_clip_bounded_pairs for
computing chimeras by the working thread and Pair_clip_bounded_array for
truncation by the output thread. This enables the --truncate flag to work
again for GMAP.
* gmap.c: Improved error message when invalid argument is given to -f
* stage3.c: Made cdna direction choices based on splice site scores more
stringent, so both donor and acceptor sites have to be significantly
better.
2014-09-12 twu
* gmap.c, outbuffer.c, pair.c, pair.h, stage3hr.c, stage3hr.h, substring.c,
substring.h: For BLAST m8 output, adding endings to accessions for
paired-end reads
* gmap.c, gsnap.c, outbuffer.c, outbuffer.h, pair.c, pair.h, stage3hr.c,
stage3hr.h, substring.c, substring.h, uniqscan.c: Added implementation of
BLAST m8 output format
2014-09-10 twu
* VERSION: Updated version number
* ax_ext.m4: Fixed typo for handling AVX
2014-09-08 twu
* uniqscan.c: Using new interface to Stage1hr_setup
* stage1hr.c, stage1hr.h, gsnap.c: Added option --use-sarray=2 to use only
suffix array algorithm
* stage3hr.c: Stopped using alloca for hitlists, since they can cause stack
overflow. Made loops more efficient for pair_up_concordant_aux.
* gmap.c: Stopping memory error when a chimera is found, --npaths is set to
1, and one part of the chimera fails the conditions for --min-identity or
--min-trimmed-coverage
2014-09-04 twu
* sarray-read.c: Implemented faster SIMD algorithm for
Elt_fill_positions_filtered
* sarray-read.c: Implemented Elt_fill_positions_filtered using alloca and
copying from stack, instead of guessing allocation
2014-09-03 twu
* VERSION, config.site.rescomp.prd: Updated version number
* stage3hr.c: In pair_remove_bad_superstretches, keeping track of better and
worse children separately, and handling list order correctly. Now chooses
shorter insert lengths correctly. Added OUTERLENGTH_SLOP.
2014-09-02 twu
* stage3hr.c: In Stage3end_optimal_score_aux and
Stage3pair_optimal_score_aux, counting indels only if they are within
trim_left and trim_right
* gsnap.c, stage3hr.c, stage3hr.h, uniqscan.c: Added option
--order-among-best to GSNAP to control randomization among best alignments
* stage3hr.c: When SCORE_INDELS is true for comparing alignments, not
counting indel_penalty in new->penalties to avoid double-counting
* stage2.c, stage3hr.c, stage3.c: Turned off debugging
* stage1hr.c: Calling GMAP pairsearch if indels5 or indels3 is not NULL, as
well as if found_score is too high
* trunk, src: Merged revisions 145502 to 146146 from
branches/2014-08-19-stack-alloca and 146146 to 146618 from
branches/2014-08-27-parallel-stage2
* oligoindex_hr.c, diag.c: Merging revisions 145502 to 146146 from
branches/2014-08-19-stack-alloca to ignore check on querylength and to use
alloca for GSNAP
* stage2.c, stage2.h: Merging revisions 146146 to 146618 from
branches/2014-08-27-parallel-stage2 to make stage 2 computation faster
* list.c, list.h, spanningelt.c, spanningelt.h, stage1hr.c, stage1hr.h:
Merging revisions 145502 to 146146 from branches/2014-08-19-stack-alloca
to work directly on arrays of Spanningelt_T objects
* dynprog_simd.c, dynprog_single.c: Merging revisions 146146 to 146618 from
branches/2014-08-27-parallel-stage2 to fix debugging procedures and to use
a stricter check for using 8-bit SIMD
* gmap.c, gsnap.c, stage3.c, stage3.h: Merging revisions 145502 to 146146
from branches/2014-08-19-stack-alloca to use stage2_alloc only for GMAP
initial stage 2 computation
* stage1hr.c: Fixed debugging statements
2014-08-25 twu
* trunk, configure.ac: Merged revisions 145989 through 145990 from
branches/2014-08-19-stack-alloca to adde flag to configure.ac
* trunk, VERSION, index.html, src, boyer-moore.c, boyer-moore.h, chimera.c,
chop_primers.c, diag.c, doublelist.c, doublelist.h, dynprog.c,
dynprog_cdna.c, dynprog_end.c, dynprog_genome.c, dynprog_simd.c,
dynprog_single.c, genome.c, genome.h, genome128_hr.c, genome_sites.c,
genomicpos.h, gmap.c, gregion.c, gregion.h, gsnap.c, indel.c, intlist.c,
intlist.h, list.c, list.h, mapq.c, mem.c, mem.h, oligo.c, oligoindex_hr.c,
outbuffer.c, pair.c, pair.h, pairpool.c, parserange.c, samprint.c,
sarray-read.c, shortread.c, shortread.h, smooth.c, splice.c, splicetrie.c,
stage1.c, stage1hr.c, stage1hr.h, stage2.c, stage2.h, stage3.c, stage3.h,
stage3hr.c, substring.c, uint8list.c, uint8list.h, uintlist.c, uintlist.h,
uinttable.c, uinttable.h, gvf_iit.pl.in: Merged revisions 145503 through
145988 from branches/2014-08-19-stack-alloca to use alloca
2014-08-20 twu
* uniqscan.c: Using new interface to Shortread_new
* shortread.c, shortread.h: Made Shortread_new extern again
* trunk, INSTALL, README, config.guess, config.sub, ltmain.sh,
config.site.rescomp.prd, config.site.rescomp.tst, src, gmap.c, gsnap.c,
indel.c, mapq.c, mem.c, mem.h, sarray-read.c, shortread.c, shortread.h,
splice.c, stage1hr.c, stage3hr.c, stage3hr.h, substring.c: Merged
revisions 145503 through 145603 from branches/2014-08-19-stack-alloca to
use alloca instead of statck arrays based on MAX_READLENGTH and to handle
reads longer than MAX_READLENGTH
2014-08-19 twu
* VERSION, index.html: Updated version number
* configure.ac: Added configure flag for enabling or disabling ssse3
instructions
* ax_ext.m4: Checking whether user wants SSSE3 instructions
* stage2.c, stage3.c: Not putting gapholders into starts and ends. Removing
gapholders from middle before calling Pairpool_join_end5 and
Pairpool_join_end3. Gapholders were causing problems with the join
operation.
* pairpool.c, pairpool.h: Implemented Pairpool_remove_gapholders
2014-08-04 twu
* stage3.c: Not setting best_pairs or best_path when the result is NULL
* shortread.c: Ignoring spaces in read
* pairpool.c: In joining paths, handling the case when one path is NULL
2014-07-29 twu
* ltmain.sh: Updated from 2.2.6 to 2.2.6b
* config.sub, config.guess: Updated from 2008 version to 2009 version
* INSTALL: Updated from 2007 version to 2009 version
* archive.html, index.html: Made changes for new version
* gff3_genes.pl.in, gff3_introns.pl.in, gff3_splicesites.pl.in: Handling
GFF3 files that have both exon and CDS fields
2014-07-21 twu
* stage3hr.c: Restored random behavior for equivalent alignments
* VERSION: Updated version number
* atoiindex.c, cmetindex.c: Making correct calls to
Sarray_discriminating_chars
* dynprog_simd.c: Added code for having no initial gap penalties, to be used
for bam_indelfix
2014-07-19 twu
* dynprog_simd.c: Improvements to debugging procedures to handle 3-digit
indices
2014-07-16 twu
* gmap.c: Using new interface to Stage3_compute
* stage3hr.h: Added interfaces for Stage3end_shortexonA_distance and
Stage3end_shortexonD_distance
* stage3hr.c: Added hook for amb_prob in Stage3end_new_gmap. Removed
penalty for ambig end lengths if amb_prob > 0.9. Fixed pointer advances
in removing bad superstretches.
* stage3.c: Made fixes to choice of cdna_direction: using presence/absence
of intron types, rather than number, and decreased binomial threshold for
alignments around intron. Fixed handling of multiple start or end paths
by joining them at the outset.
* stage2.c: Allowing for multiple end cells from each rootposition
* dynprog_genome.c: Altered decision-making between best alignment and
probability-based alignment
* dynprog_simd.c: Added debugging statements
* dynprog.h: Made open gap penalties more uniform for different defect rates
* pair.c: Fixed calculation of nmatches at end for Pair_trim_ends
* pairpool.c: Enhanced debugging statement
2014-07-15 twu
* stage3.h: Stage3_compute now returns ambig_prob_5 and ambig_prob_3
* stage3.c: For ambiguous ends, no longer calling clean_pairs_end5,
clean_path_end3, trim_end5_exon_indels, or trim_end3_exon_indels
* stage1hr.c: Passing amb_prob_5 and amb_prob_3 to Stage3end_new_gmap
* pair.c, pair.h: In Pair_trim_ends, not trimming ambiguous ends
* stage3.c: In trim_end5_exon_indels and trim_end3_exon_indels, counting
trimmed ends as mismatches and handling large indels differently from
small indels. Added hooks for ambig_prob.
* pair.c, pair.h, stage3hr.c: Eliminating ambig_end_nmatches from
consideration in Pair_nmismatches_region
* stage3hr.c: Restored eventrim algorithm from revision 140363, which had
only a single eventrim calculation and not separate calculations for the
two ends.
2014-07-03 twu
* VERSION: Updated version number
* dynprog_end.c: Fixed typo in calling matrix for 16_upper twice, instead of
16_upper and 16_lower
* splicetrie.c: Handling the case where Dynprog_end5_splicejunction or
Dynprog_end3_splicejunction returns NULL
* dynprog_simd.h: Decreased value of SIMD_MAXLENGTH_EPI8 from 40 to 30 to
prevent issues with overflows
* dynprog_simd.c, dynprog.c: In traceback, using main loop to decide whether
to handle dir == DIAG
* dynprog_end.c: In traceback, using main loop to decide whether to handle
dir == DIAG. In Dynprog_end5_splicejunction and
Dynprog_end3_splicejunction, requiring finalscore to be positive before
doing any traceback.
* sarray-write.c: Made monitoring statements work for
Sarray_discriminating_chars
* sarray-write.c: Implemented batch reading method for
Sarray_discriminating_chars
2014-07-02 twu
* gmapindex.c, sarray-write.c, sarray-write.h: Made the building of the LCP
array and discriminating chars array more memory efficient by writing
temporary files for rank and permuted sarray
* genome.c, genome.h: Changed type of counts to be Univcoord_T
* access.c: Fixed bug in handling of final partial block. Added debugging
code for checking results.
* access.c: Increased FREAD_BATCH to 100 million bytes. Modified
Access_allocated to always read in batches of size FREAD_BATCH.
2014-07-01 twu
* trunk, config.site.rescomp.prd, config.site.rescomp.tst, src,
genome128_hr.c, gmap.c, intlist.c, intlist.h, samprint.c, sarray-read.c,
splice.c, splice.h, stage1hr.c, stage3.c, stage3hr.c, stage3hr.h,
substring.c, uint8list.c, uint8list.h, uintlist.c, uintlist.h: Merged
revisions 140131 through 140367 from branches/2014-06-27-fix-amb to
implement separate eventrim scores for start/end of read, fix
cmet-stranded and cmet-nonstranded modes, implement separate
sense/antisense for Splice_solve_single, and rewrite of ambiguous
parameters from left/right to donor/acceptor
* index.html: Updated for version 2014-06-10
* archive.html: Added link to version 2011-12-28
* VERSION: Updated version number
* stage3hr.c: In Stage3end_new_shortexon, setting amb_nmismatches_start and
amb_nmismatches_end separately
* stage3.c: Using score_introns (which looks at splice site neighborhood),
instead of score_alignment to count canonical introns. Using defect_rate
to determine whether to rely on splice site probabilities.
* stage1hr.c: Added blank lines
* gmap.c: Preventing leftpos and rightpos from exceeding query coordinates
in solving for chimeras. Not using extension in finding remaining
alignment, since it makes alignment harder.
2014-06-30 twu
* stage3.c: Transferring microexon pairs without looking at probabilities
* dynprog_single.c: Using MIN_MICROEXON_LENGTH instead of 8
2014-06-25 twu
* stage3hr.c: Fixed assignment of amb_nmatches_start and amb_nmatches_end
for shortexons on minus strand
* stage3hr.c: Removed debugging code
* sarray-write.c: In Sarray_compute_child, cleaning out stack at end,
because skipping it results in an incorrect child array
2014-06-24 twu
* stage3hr.c: Commenting out assertions that are not always true
* stage1hr.c: Assigning correct values of amb_nmatches_donor and
amb_nmatches_acceptor to Stage3end_new_shortexon
2014-06-11 twu
* VERSION: Updated version number
* trunk, config.site.rescomp.prd, config.site.rescomp.tst, src,
sarray-read.c, splice.c, stage1hr.c, stage3hr.c, stage3hr.h: Merged
revisions 138722 through 138743 from
branches/2014-06-01-amb-shortexon-fix1 to allow for ambiguous shortexons
and to place a limit of MAX_LOCALSPLICING_POTENTIAL on splicing and
shortexons
* archive.html, index.html: Put version 2014-05-15.v3 into archive
* stage1hr.c: Not processing splicing or shortexons if number of
possibilities exceeds MAX_LOCALSPLICING_POTENTIAL
* gsnap.c: In memusage debugging, printing accession for each thread at the
start of its processing
* substring.c, substring.h: Added function Substring_chimera_prob_2
* segmentpos.c, segmentpos.h: Added function Segmentpos_compare_order
* samprint.c: Removed unused variable
* iitdef.h: Added FILENAME_SORT as a sorting type
* gsnap.c: Added commas to memusage debugging output
* dynprog_end.c: Removed assertions, which do not hold
2014-06-09 twu
* gmap_build.pl.in, chrom.c, chrom.h, gmapindex.c: Added option for sorting
chromosomes by order in a file
2014-06-04 twu
* VERSION: Updated version number
* README, configure.ac: Increased default MAX_READLENGTH for GSNAP from 250
to 300
* dynprog_simd.c: Fixed formatting
* dynprog_genome.c, dynprog_cdna.c: Using correct interface to
Dynprog_standard for non-SSE2 systems
* dynprog_end.c: Enabled non-SSE2 compilation to work. Made traceback
procedures follow those in dynprog_simd.c.
* dynprog.c, dynprog.h: Modified traceback_std (non-SIMD) to behave the same
as the SIMD traceback routines. Improved debugging output.
* stage3.c: Added comments
* dynprog.c, dynprog.h: Exposed Dynprog_standard, needed for systems without
SSE2 instructions
2014-06-03 twu
* gmap.c, gsnap.c: In checking behavior of _mm_extract_epi8, just reporting
results and not exiting based on behavior
* genome128_hr.c: Casting _mm_extract_epi16 to unsigned short, or a zero
extended result, which is technically the correct behavior
* dynprog_simd.c: Being very explicit about casting between int and Score8_T
and Score16_T types
* dynprog.h: Removed conditionals around defining Score_8 and Score_16 types
* compress.c: Calling _mm_free to match _mm_malloc
2014-05-30 twu
* trunk, VERSION, config.site.rescomp.prd, config.site.rescomp.tst,
index.html, src, compress.c, genome128_hr.c: Merged revisions 137093
through 137694 from branches/2014-05-23-genome128-32bit-shortchut to
implement 32-bit shortcuts for 128-bit genomebits
* stage3hr.c: Not checking any more for duplicate Stage3end_T objects
* stage2.c: Eliminated penalty when exon length < EXON_DEFN, which misses
short exons
2014-05-29 twu
* stage1hr.c: Restored usage of paired_usedp to avoid excess calls to GMAP
for halfmapping alignments
* stage1hr.c: Not using paired_usedp in computing GMAP for halfmapping
alignments
* dynprog_simd.c: Fixed traceback procedures to follow correct paths on gaps
* dynprog.c, dynprog.h, dynprog_genome.c, dynprog_single.c: Moved gap
penalties to dynprog.h, and reduced open penalties to allow for multiple
indels
2014-05-28 twu
* stage3.c: In merge_local_single, checking filledp to see if merge failed.
When merge fails, recompute pairarray for this_left and this_right.
2014-05-27 twu
* dynprog_simd.c: Implemented replacement for _mm_min_epi8 for non-SSE4.1
systems
2014-05-23 twu
* dynprog_simd.c: Fixed computations of E and H for values near
NEG_INFINITY, to prevent horizontal or vertical jumps into the empty
triangle. Added an E_mask variable to set horizontal/vertical scores into
the empty triangle to be NEG_INFINITY. Setting directions_nogap
explicitly to DIAG along the main diagonal to take care of ties between E
and H.
* dynprog_end.c: Using computed lband and uband in find_best_endpoint
procedures, instead of trying to recompute them, which led to incorrect
results
* get-genome.c, iit-read-univ.c: Fixed zero-based behavior of -L option to
one-based behavior. Zero-based behavior introduced in revision 99737 on
2013-06-27.
* index.html: Made changes for version 2014-05-15
* gmap.c, gsnap.c: Added better test for behavior of max operation in SSE4.1
* stage1hr.c: Fixed a memory leak involving ambcoords
* stage1hr.c: Using nmatches_posttrim in evaluating alignments. Doing a
comparison between concordant alignments involving terminals and
halfmapping alignments, to determine the best solution.
* stage3hr.c: Fixed assignment of amb_nmatches to correct end for minus
alignments in Stage3end_new_splice. Extending hardclips by
amb_nmatches_start and amb_nmatches_end in Stage3pair_overlap.
* stage3hr.c, stage3hr.h: Changed Stage3end_nmatches and Stage3pair_nmatches
to Stage3end_nmatches_posttrim and Stage3pair_nmatches_posttrim
2014-05-21 twu
* samprint.c, samprint.h: Fixed typo in keeping a parameter
* README: Clarified effect of --failed-input option
* README: Fixed typo
* acx_mmap_fixed.m4, acx_mmap_variable.m4: Fixed type incompatibility when
char * is cast to int
* stage1hr.c: Changed categories for some debugging statements
* gmap.c, gsnap.c, outbuffer.c, outbuffer.h, samprint.c, samprint.h,
stage3hr.c, stage3hr.h: Changed --fails-as-input flag to --failed-input
flag, which takes an argument. Printing failed inputs in addition to
nomapping output.
* genome.c: Made error message clearer when genomebits128 file not found
2014-05-17 twu
* stage3hr.c: Setting amb_nmatches_start, amb_nmatches_end,
start_ambiguous_p, and end_ambiguous_p based on amb_nmatches for halfdonor
and halfacceptor splices, even when ambcoords_left and ambcoords_right are
NULL
2014-05-16 twu
* stage1hr.c: Minor fixes to debugging statements
* stage1hr.c: Turned on macro for finding middle alignments.
* stage1hr.c: Turned on finding of middle alignments in find_terminals. Set
length threshold to be querylength/3 instead of index1part.
2014-05-15 twu
* stage3.c: Removed revisions of coordinates near indels, which is not
needed any more with the latest dynamic programming procedures
2014-05-13 twu
* VERSION, index.html: Updated version number
* dynprog_simd.c: For systems with SSE2 but not SSE4.1, subtracting 128 from
pairscore in F loop of Dynprog_simd_8, and from initial column in
Dynprog_simd_8_lower, to obtain correct results
* dynprog_genome.c: Removed debugging statements
* dynprog_genome.c: Fixed an uninitialized variable, best_prob
* trunk, README, src, Makefile.gsnaptoo.am, bytecoding.c, samprint.c,
sarray-read.c, sarray-read.h, splice.c, splice.h, stage1hr.c, stage3hr.c,
stage3hr.h, substring.c, substring.h, uint8list.c, uint8list.h,
uintlist.c, uintlist.h: Merged revisions 135802 through 136084 from
branches/2014-05-09-novel-ambiguous to consolidate ambiguous splices to
save on memory usage and to consider them as a single concordant alignment
* pairpool.c: Not advancing coordinate at start of Pairpool_add_queryskip
and Pairpool_add_genomeskip
2014-05-09 twu
* trunk, VERSION, config.site.rescomp.prd, config.site.rescomp.tst,
index.html, src, sarray-read.c, splice.c, splice.h, stage1hr.c,
stage3hr.c, stage3hr.h, substring.c, substring.h: Merged revisions 135797
through 135801 from branches/2014-05-09-novel-ambiguous to save state
before implementing novel ambiguous positions
* gsnap.c: Improved output for MEMUSAGE
* dynprog_simd.c: Speeding up traceback for upper and lower triangles, to
take advantage of the fact that the main diagonal is filled with DIAGs.
* sarray-read.c, stage1hr.c, stage3hr.c, stage3hr.h: Grouping multiple
splice segments connected to a given splice site, and selecting the best
among these. If multiple best choices are found, creating an ambiguous
splice.
* dynprog_simd.c: In traceback procedures, added check after each indel to
see if we are in row 0 or column 0, and if so, not to trust the value in
directions_nogap.
2014-05-08 twu
* splicetrie_build.c: Fixed comparison with MAX_SITES_ALLOCATED for
intron-level splicing files
* gsnap.c: Setting POOL_FREE_INTERVAL to be 1
* gsnap.c, outbuffer.c: Using new interface to memusage routines
* mem.c, mem.h: Changed variable names for memusage. Added memusage report
for keep pool.
* dynprog_simd.c: Fixed bug with infinite loop at row 1 or column 1, leading
to error in memory allocation for pairpool
* dynprog_simd.c, pairpool.c: Adding 1 to r and c only on final indel
* pairpool.c: When adding indel (queryskip or genomeskip), not advancing
coordinate by 1, since that can cross over chromosomal bounds
2014-05-07 twu
* VERSION: Updated version number
* Makefile.gsnaptoo.am, cellpool.c, cellpool.h, gmap.c, smooth.c,
stage1hr.c, stage1hr.h, stage2.c, stage2.h, stage3.c, stage3.h,
uniqscan.c: Added Cellpool_T object to handle allocations of Cell_T in
stage 2
* gsnap.c: Calling Pairpool_reset, Diagpool_reset, and Cellpool_reset before
processing each request. Previously, this memory was not being freed
until the end of the process.
* splicestringpool.c: Changed memory procedures to use standard pool instead
of keep pool
* dynprog_simd.c: Fixed saturation bug in F loop when trying to add
pairscore. Setting E and H so non-diag directions are placed in row 0 and
column 0. At end of traceback procedures, adding final indel to (0,0).
* stage3.c: Having make_pairarray_merge return a boolean to indicate success
or failure. Trying to use the old pairarray in case of failure.
* gmap.c, gsnap.c, uniqscan.c: Using new interface to Splicetrie_retrieve
procedures, using Splicestringpool_T object
* Makefile.gsnaptoo.am, splicestringpool.c, splicestringpool.h,
splicetrie_build.c, splicetrie_build.h: Using Splicestringpool_T object to
reduce number of memory allocations for Splicestring_T objects. Using
local array for sites to also reduce number of memory allocations.
* splicetrie_build.c: Allocating struct Interval_T when copies are needed,
to reduce the number of calls to allocate memory. Allocating triecontents
as an array instead of uintlist, also to reduce the number of calls to
allocate memory.
* interval.c, interval.h: Allocating struct Interval_T when copies are
needed, to reduce the number of calls to allocate memory
2014-05-06 twu
* dynprog.h: Added definitions of POS_INFINITY_8 and POS_INFINITY_16
* pair.c: Returning correct type of NULL
* dynprog_cdna.c, dynprog_genome.c: Fixed potential memory leak with
SNP-tolerant alignment
* genome.c: If glength == 0, Genome_get_segment_blocks now returns NULL
* stage3.c: Changed condition for not running Dynprog_single_gap from
(queryjump < 0 || genomejump < 0) to (queryjump <= 0 || genomejump <= 0)
2014-05-05 twu
* stage1hr.c: Checking for anomalous splice (samechr splice) before trying
to compute mapping position from distal end
* compress.c: Bypassing SSSE3 version of Compress_shift when
DEFECTIVE_SSE2_COMPILER is true
2014-05-02 twu
* ax_ext.m4: Not allowing AVX unless immintrin.h is present
* indexdb.c: Freeing offsetspages for large genomes
* gsnap.c: For MEMUSAGE, changing pool free interval to be 1
* dynprog.c: Freeing nt_to_int_array
* dynprog_simd.c: Simplified traceback procedures
2014-05-01 twu
* stopwatch.c: Removed comment
* stage3hr.c: Allocating ambi and amb_nmismatches into output memory pool
* stage1hr.c: Limiting the number of terminals
* mem.c: Fixed memusage_reset procedure and revised debugging messages to
print pool type
* intlist.c, intlist.h: Implemented Intlist_to_array_out
* compress.c, compress.h: Implemented SSSE3 procedure and fixed bug in SSE2
procedure for Compress_shift
* genome128_hr.c: Fixed block_diff_snp procedure to perform the complete
calculation for query, ref, and alt sequences.
* genome.c: Fixed Genome_get_segment_blocks_left and _right to provide
correct alternate genomic segment
* dynprog_simd.c: Setting values along bands to be DIAG to avoid going out
of bounds. Revised loops for gaps in traceback procedures. Compensating
for open value in comparing E vs H for dir_horiz and dir_vert. Using
nt_to_int_array. Improved debugging print procedures.
* dynprog.c, dynprog.h: Introduced nt_to_int_array. Setting score for
AMBIGUOUS to be 0, and setting N-N to be that score.
2014-04-24 twu
* ax_ext.m4: Removing -mavx and -mavx2 compiler flags for now. Being added
to Mac OS X Mavericks, where they are causing problems.
* stopwatch.c: Added macro for specifying POSIX C time
* stage3hr.c: Allowing pair_insert_length_trimmed to handle non-concordant
paired ends
* genome128_hr.c: Implemented code for defective SSE2 compilers that cannot
handle shifts with a non-immediate scalar
* samprint.c: Fixed printing of XS for half-donor and half-acceptor reads
2014-04-21 twu
* iit-read-univ.c: Printing tp:circular for circular chromosomes
* stage3.c: Using new interface to Pair_circularpos
* stage3hr.c: Computing insertlength properly for circular chromosomes when
ends have different aliases. Handling duplicates of aliases better.
* pair.c, pair.h: Computing alias correctly in Pair_circularpos
* pair.c: Fixed bug when Pair_trim_ends is called when pairs is NULL
2014-04-19 twu
* trunk, src, Makefile.gsnaptoo.am, atoiindex.c, cmetindex.c, genome.c,
genome.h, genome128_hr.c, genome128_hr.h, gmapindex.c, gsnap.c, indel.c,
indel.h, indexdb.c, mapq.c, mapq.h, oligo.c, oligoindex_hr.c,
sarray-read.c, sarray-read.h, sarray-write.c, sarray-write.h, splice.c,
splice.h, splicetrie.c, splicetrie.h, stage1hr.c, stage1hr.h, stage3hr.c,
stage3hr.h, substring.c, substring.h: Merged revisions 133654 through
133759 from branches/2014-04-18-cmet-atoi-suffix-arrays to fix CMET and
ATOI procedures
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst: Updated version
number
* index.html: Updated for version 2014-04-17
* stage3.c: Fixed uninitialized variable
2014-04-17 twu
* indexdb.c: Creating correct bitpack filenames when snps_root is given
* snpindex.c: Handling the case when an IIT file is not provided on the
command-line, and installation is not needed
2014-04-10 twu
* gmap_build.pl.in: Fixed documentation for compression types
* dynprog_simd.c, dynprog_simd.h, dynprog_single.c: Added traceback
procedure for Score16_T variables
2014-04-09 twu
* snpindex.c: Creating genomebits128 format instead of genomebits format
* sarray-read.c: Made changes in code (commented out) to try to infer
lcp-intervals for short oligomers from 12-mer index, but still has some
bugs
* archive.html, index.html: Made changes for version 2014-04-08 and
2013-03-28.v2
2014-04-08 twu
* VERSION: Updated version number
* gmap.c: Removed unnecessary const
* stage2.c: Applying maxintronlen in deciding whether to create a grand
lookback
* pairpool.c: Fixed bug in Pairpool_compact_copy
* stage3.c: Removed debugging statement
* trunk, src, Makefile.gsnaptoo.am, dynprog.c, dynprog.h, dynprog_cdna.c,
dynprog_cdna.h, dynprog_end.c, dynprog_end.h, dynprog_genome.c,
dynprog_genome.h, dynprog_simd.c, dynprog_simd.h, dynprog_single.c,
dynprog_single.h, gmap.c, gsnap.c, sarray-read.c, sequence.c, sequence.h,
splicetrie.c, stage3.c, stage3.h, uniqscan.c, util: Merged --reintegrate
branches/2014-04-04-dynprog-shift to change dynprog_end routines to use
upper/lower algorithm without F loops
* index.html: Updated for version 2014-04-06
* pairpool.c, pairpool.h: Implemented Pairpool_compact_copy
* oligoindex_hr.c: Freeing storage memory in Oligoindex_array_T object
* stage1.c: Checking extensions to make sure they fall within
--max_totallength
* chimera.c, chimera.h, gmap.c, stage3.c, stage3.h: Checking chimeras to see
that they satisfy --min-trimmed-coverage and --min-identity filters
* mem.c: Turned off DEBUG macro
* iit-read-univ.c: Restored warning message when IIT file cannot be read
* gmapindex.c, sarray-write.c, sarray-write.h: Implemented
Sarray_child_uncompress
* gmap.c: No longer proceeding to align if Stage2_scan yields max_ncovered
of less than 10% of the querylength.
* genome128_hr.c: Made additional changes to avoid _mm_extract_ps on
non-SSE4.1 systems
2014-04-07 twu
* sarray-read.c: When effective querylength is less than bucket indexsize,
no longer trying to infer lcp-interval from bucket array, but just
starting from entire lcp-interval.
2014-04-06 twu
* gmap.c, gsnap.c, uniqscan.c: Checking for valid float values between 0.0
and 1.0
* dynprog_genome.c, splicetrie.c: Fixed headers for dynprog routines
* dynprog_single.c: Using a variable in the F loop to see if H needs to be
reloaded
* oligoindex_hr.c: Allocating storage for Oligoindex_array_T
2014-04-05 twu
* trunk, src, Makefile.gsnaptoo.am, dynprog.c, dynprog.h, dynprog_genome.c,
dynprog_genome.h, dynprog_single.c, dynprog_single.h, gmap.c, gsnap.c,
pairpool.c, pairpool.h, stage3.c, stage3.h, uniqscan.c, util: erge
reintegrated branches/2014-04-04-dynprog-shift to use new SIMD routines
that are row-first and reduce use of F loops
* util, src: Removed property changes
* VERSION: Updated version number
* gmap.c, gsnap.c, oligoindex_hr.c, oligoindex_hr.h, stage1hr.c, stage1hr.h,
stage2.c, stage2.h, stage3.c, stage3.h, uniqscan.c: Created a separate
Oligoindex_array_T object, which also holds storage
* configure.ac: Added AC_FUNC_ALLOCA
* mem.h: Added hooks for memory allocation using alloca
* bitpack64-readtwo.c: Added missing #endif
* indexdb.c: Fixed type of offsets from UINT4 back to Positionsptr_T
* iit-read-univ.c: aking Univ_FNode_T struct separate from FNode_T struct
* gmapindex.c: Checking genomelength before trying to create suffix array or
LCP/child/DC arrays
* bitpack64-readtwo.c: Fixed bug where not enough 128-bit registers were
provided for large genomes
* bitpack64-read.c: Eliminated an extra addition from computing offset in
large genomes
2014-04-03 twu
* uniqscan.c: Using new interface to Oligoindex_new routines
* index.html: Updated for 2014-04-01 version
* archive.html: oved 2014-03-28 and earlier versions to archive
* acinclude.m4, ax_ext.m4, ax_gcc_x86_avx_xgetbv.m4: Updated ax_ext.m4 and
added ax_gcc_x86_avx_xgetbv
* uniqscan.c: Using new interface to Stage3_setup
* stage3.c, stage3.h: Providing min_end_indel_matches to end trimming
procedures
* sarray-read.c, sarray-read.h: Added separate access control for lcpchilddc
* oligoindex_hr.c, oligoindex_hr.h: Allocating dedicated space needed for
Oligoindex_get_mappings, to avoid memory allocation/deallocation
* gsnap.c: Using new interface to Oligoindex_new routines, Stage3_setup, and
Sarray_new
* gmap.c: Using new interface to Oligoindex_new routines and Stage3_setup
* genome128_hr.c: Provided alternative to _mm_extract_ps, which also
requires SSE4.1
2014-04-02 twu
* dynprog.c: odified debugging statements
* ax_ext.m4: Adding warning messages when immintrin.h is not found and could
be used
* ax_ext.m4: Checking for immintrin.h before allowing popcnt, lzcnt, or bmi1
* ax_ext.m4: Improved warning messages
* trunk, VERSION, ax_ext.m4, config.site.rescomp.prd,
config.site.rescomp.tst, src, Makefile.gsnaptoo.am, atoiindex.c,
bitpack64-access.c, bitpack64-read.c, bitpack64-read.h,
bitpack64-readtwo.c, bitpack64-readtwo.h, bitpack64-serial-read.c,
bitpack64-serial-read.h, bitpack64-serial-write.c,
bitpack64-serial-write.h, bitpack64-write.c, bitpack64-write.h,
bytecoding.c, bytecoding.h, cmetindex.c, compress-write.c,
compress-write.h, compress.c, compress.h, compress128.c, dynprog.c,
genome-write.c, genome-write.h, genome.c, genome128-write.c,
genome128-write.h, genome128_hr.c, genome128_hr.h, genome_hr.c,
genome_hr.h, genome_sites.c, gmap.c, gmapindex.c, gsnap.c,
iit-read-univ.c, iit-read.h, iit-write-univ.h, iit-write.h, iitdef.h,
indel.c, indexdb-write.c, indexdb-write.h, indexdb.c, indexdb.h,
indexdb_hr.c, indexdbdef.h, mapq.c, sarray-read.c, sarray-read.h,
sarray-write.c, sarray-write.h, snpindex.c, splice.c, splicetrie.c,
splicetrie_build.c, stage1hr.c, stage2.c, stage3.c, stage3hr.c,
substring.c, types.h, uniqscan.c, util, gmap_build.pl.in: ajor change.
Merged revisions 131573 to 132142 from branches/2014-03-26-bitpack-esa to
use genomebits128 format, bp64-columnar, and enhanced suffix arrays
* VERSION: Updated version number
* index.html: Revised for latest version
* stage3hr.c: Not using number of introns to determine equivalence in
hit_equiv_cmp and hitpair_equiv_cmp
* genome_sites.c: Resolved comparison between unsigned and signed values for
-1
2014-04-01 twu
* boyer-moore.c, dynprog.c, stage3.c: Implemented changes to restore finding
microexons
* shortread.c: Handling Casava reads ending in ";1". Fixed problem where -q
and --allow-pe-name-mismatch together caused a fatal bug.
* genome_hr.c: Changed debugging output for splice fragments to print
unsigned shorts
2014-03-28 twu
* indexdb.c: Changed type of offsets (called only for regular bitpack
procedure) from Positionsptr_T * to UINT4 *
* indexdb_hr.c: Calling correct procedures for LARGE_GENOMES
* substring.c, substring.h, stage3hr.c: Fixing comparisons of coordinates to
handle circular chromosomes
* stage3hr.h: Removed queryseq as arguments to Stage3end_remove_duplicates
* stage1hr.c: Calling Stage3end_remove_duplicates after
Stage3end_remove_circular_alias
2014-03-27 twu
* stage1hr.c, stage3hr.c, stage3hr.h: Trying to salvage alias +1 within
Stage3end_remove_circular_alias, and calling that rather than
Stage3end_unalias_circular
* stage3hr.c: In Stage3_new_splice, not trying to merge long-distance
splices at this time, which can lead to bad coordinates for GMAP
* stage1hr.c: Initializing variable, previously not initialized
* pair.c: Fixed assertion on CIGAR length to include hardclips
* gsnap.c: Including splice.h
* chimera.c: Rearrange order of loops in Chimera_bestpath
2014-03-26 twu
* stage3hr.c: Fixed bug where overlap across circular chromosome origin is
entirely trimmed, leading to an "SH" cigar string
* trunk, VERSION, config.site.rescomp.prd, config.site.rescomp.tst,
index.html, util: Updated version number
* gmap.c: Checking return value of Chimera_bestpath
* chimera.c, chimera.h: Chimera_bestpath returning a value to indicate if a
chimera was found
* stage3.c: In Stage3_merge_local, traversing cDNA gap any time intronlength
is less than 0
* pairpool.c: Created separate debugging category for Pairpool_clean_join
* gsnap.c, splice.c, splice.h: Using min_shortend to control splice length
at ends
* sarray-read.c: Fixed problem where splice or deletion could extend into
next chromosome
2014-03-17 twu
* translation.c: Fixed bug where assign_cdna_backward was returning ncdna
instead of codon
2014-03-12 twu
* sarray-read.c: Put sarray_search loop into a new function,
find_longest_match
* sarray-read.c: Improved sarray_search by looking up genome only when lcp
advances by more than one position
* oligoindex_hr.c: Using faster method for checking for zero 128-bit
register when SSE4.1 is available
* dynprog.c, gdiag.c, indexdb_hr.c, oligo.c, pair.c, sarray-read.c,
spanningelt.c, stage1.c, stage1hr.c, stage2.c: Using safer method for
computing average of lowi and highi in binary search
* access.c: Handling empty files
2014-02-28 twu
* gmap.c, stage3.c, stage3.h: Removed alignment_score_fwd and
alignment_score_rev from Stage3_T object
* VERSION: Updated version number
* stage3.h, gmap.c: Using maxintronlen_bound in Stage3_mergeable
* stage3.c: Not trimming exons at end based on splice site probabilities
* stage2.c: In D4 section, when diffdistance <= EQUAL_DISTANCE_NO_SPLICING,
adding CONSEC_POINTS_PER_MATCH, to extend chains further
* gmap.c: Performing iterations for finding a local join, before iterations
for finding a chimera.
* stage3.c: Revised criteria for pick_cdna_direction. When cdna_direction
== 0, assigning fwd or rev introntype instead of NONINTRON.
* Makefile.dna.am, Makefile.gsnaptoo.am, get-genome.c, gmap.c, outbuffer.c,
outbuffer.h, stage1.c, stage1.h: Removed references to Chrsubset_T. Made
-c flag work again by setting universal coordinate bounds in stage 1.
* gmap.c: Checking coverage of nonchimericbest against each chimeric part,
and if coverage is large enough on each side, picking non-chimeric
alignment over chimera.
* stage3.c, stage3.h: Not determining mergeable based on cdna_directions of
the left and right part. In that situation, fixing the alignment by
recomputing to find the best cdna_direction.
* pair.c: Disallowing transitions even 10 bp outside of alignments
* chimera.c: Calling Pair_pathscores directly, instead of through
Stage3_pathscores
* gmap_build.pl.in: Removed reference to localdir. Putting tempfiles for
<genome>.coords and <genome>.sources into destination directory.
2014-02-27 twu
* sarray-read.c: Slight speed improvement in handling pre-alignment loop.
Now using 0 instead of 4.
* stage3hr.c: Restored old version of Stage3end_remove_overlaps, with
different algorithm from that for paired ends
2014-02-26 twu
* stage1hr.c: For paired-end reads, when only one end is too short, aligning
the other end as part of a halfmapping alignment
2014-02-24 twu
* stage3hr.c: Implemented recursive, list-based approach to removing bad
superstretches in paired-end alignments, instead of O(n^3) algorithm,
which occasionally hanged in repetitive regions
* gmap.c: Added debugging statements for chimera
* stage2.c: Reducing NINTRON_PENALTY_MISMATCH from 32 to 1, because short
exons were being missed. Also, eliminated querydist_credit and restored
querydist_penalty.
* chimera.c: In Chimera_local_join_p, checking genomic positions to make
sure they make sense
2014-02-21 twu
* gsnap.c, pair.c, pair.h, samprint.c, samprint.h: Implemented option
--hide-soft-clips
* stage3.c: Returning value from merge_local_single
* substring.c, substring.h: Added function Substring_queryend_orig
2014-02-20 twu
* gsnap.c: Using new interface to Stage2_setup
* src, chimera.c, diagpool.c, gmap.c, oligoindex.c, oligoindex_hr.c, pair.c,
stage2.c, stage2.h, stage3.c, uniqscan.c: erged revisions 128075 to
128117 from branches/2014-02-20-chimera-breakpoint to fix cases where
breakpoint was found outside of alignments
2014-02-19 twu
* dynprog.c: Handling lower-case nucleotides correctly in dynamic
programming. Handling alternate alleles equally in dynamic programming.
* bitpack64-read.c, indexdb.c: Fixed bugs in handling bitpackpages file for
huge genomes
2014-02-18 twu
* pair.c, pair.h, stage3.c: Added numbers of matches, mismatches, indels and
unknowns to GFF3 output
* VERSION: Updated version
* ax_ext.m4: Fixed typo
* gmap_build.pl.in: Not printing warning message about -T unless it is used.
Deleting .sources file.
* chimera.c, chimera.h, gmap.c: Added donor_watsonp and acceptor_watsonp to
fields for GMAP
* stage3hr.c: Fixed bug in printing header twice for concordant
translocations in standard GSNAP output format
* get-genome.c: Added output for vareffect
* samprint.c: Added strands to XT field
* indexdb-write.c: When checking bitpack64, printing warning messages rather
than exiting.
2014-02-14 twu
* sarray-read.c: Fixed a bug where r for an lcp-interval was being
decremented from 0 to -1U.
2014-02-13 twu
* samprint.c: Added distant splice information to XT field, but without
strand information
* samprint.c: Added XC flag to indicate a circular alignment
2014-02-11 twu
* stage1hr.c: Removed unnecessary variables that have been made global
within the file
* branchpoint.c, branchpoint.h, splicing-score.c: Added bpa-all option to
print all marginals
* iit-read.c: Handling cases in signed matching where sign == 0
* get-genome.c: Added option --aslabel
* sarray-read.c: Turning off debugging
2014-02-02 twu
* get-genome.c: Added debugging statements
2014-01-31 twu
* get-genome.c: Making typed queries when user provides a typestring
* trunk, README, src, Makefile.dna.am, Makefile.gsnaptoo.am, branchpoint.c,
branchpoint.h, gsnap.c, indel.c, indel.h, sarray-read.c, splicing-score.c,
stage1hr.c, translation.c, uniqscan.c, util: Merged revisions 125127 to
125307 from branches/2014-01-30-lsm-branch-point to add bpa analysis to
splicing-score
2014-01-29 twu
* splicing-score.c: Added -v flag to handle alternate alleles
2014-01-21 twu
* fa_coords.pl.in: Fixed syntax error
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst, index.html,
src, closeparen-table.pl, excess-leftward.pl, excess-rightward.pl,
openparen-table.pl, select1-table.pl: Updated version number
* Makefile.dna.am, Makefile.gsnaptoo.am: Added SIMD_FLAGS and POPCNT_FLAGS
to all programs that include compress.c
* sarray-read.c: Added benchmarking procedure Sarray_traverse_children
* oligo.c: Added "U" to end of integer constants where necessary
* gsnap.c: Fixed typo in --help statement
* gmap.c: Fixed revision of extension length relative to end-segment length
for finding chimeras
2014-01-15 twu
* gmap_build.pl.in: Fixed variable name
2013-12-23 twu
* sarray-read.c: Added comments
* Makefile.gsnaptoo.am: Added files for bp-read and bp-write
* bp-read.c, bp-read.h, bp-write.c, bp-write.h, bp.h, gmapindex.c,
sarray-write.c, sarray-write.h, types.h: Merged revisions 119699 to 122378
from branches/2013-11-27-child-bitvector to add code for bitvector
representation of child array
* bitpack64-read.c: Removed extraneous addition for second half of block for
packsize of 32
2013-12-20 twu
* util, gmap_build.pl.in: Merged revisions 119700 through 122189 from
branches/2013-11-27-child-bitvector/util to write genome indices in
destination directory
* fa_coords.pl.in, gmap_build.pl.in, gmap_process.pl.in: Writing FASTA
sources to a file
2013-12-19 twu
* Makefile.dna.am: Added gamma-speed-test and bitpack64-speed-test
* splicing-score.c: Checking if coordinates are valid
* gamma-speed-test.c: Printing nanoseconds per query
* bitpack64-speed-test.c: Using new interface to bitpack64 read commands
2013-12-17 twu
* stage1hr.c, stage3hr.c, stage3hr.h: Requiring single or unpaired terminals
to be 2/3rds of the querylength
* sarray-read.c: Fixed memory leak
2013-12-16 twu
* dynprog.c: Fixed a bug where the wrong genomic position was provided for a
high-probability microexon.
* stage1hr.c: Not passing in max_terminal_length to find_terminals
* sarray-read.c: Allowing initial lcp interval for the first nucleotide
* gmapindex.c: Changed name of child_uncompress procedure
* bitpack64-write.c: Handling the case for Bitpack64_write_direct where the
packsize is 0.
* sarray-write.c, sarray-write.h: Included code for bp version of child
information
2013-12-13 twu
* trunk, config.site.rescomp.tst, src, Makefile.dna.am,
Makefile.gsnaptoo.am, atoiindex.c, bitpack64-access.c, bitpack64-access.h,
bitpack64-read.c, bitpack64-read.h, bitpack64-write.c, bitpack64-write.h,
chimera.c, cmetindex.c, dynprog.c, dynprog.h, genome.c, genome.h, gmap.c,
gmapindex.c, indexdb-write.c, indexdb-write.h, indexdb.c, indexdb.h,
indexdb_hr.c, indexdbdef.h, oligoindex_hr.c, oligoindex_hr.h,
oligoindex_pmap.c, oligoindex_pmap.h, pmapindex.c, sarray-read.c,
sarray-write.c, sarray-write.h, sequence.c, sequence.h, snpindex.c,
splicetrie.c, splicetrie.h, splicing-score.c, stage1hr.h, stage2.c,
stage2.h, stage3.c, stage3.h, util: Merged revisions 120874 through 121506
from branches/2013-12-10-pmap
2013-12-04 twu
* substring.c: Fixed issue where circularpos was assigned in the region of
soft-clipping, leading to an improper CIGAR string
* pair.c: In GFF output, printing '.' instead of '?' for unknown strand
2013-11-27 twu
* index.html: Added statement about portability of NAN
* iit_store.c: Handling NAN when it is not available
* VERSION, index.html: Updated version number
* popcount.c, popcount.h: Added popcount.c and popcount.h to store tables
* Makefile.gsnaptoo.am: Added popcount.c and popcount.h
* genome_sites.c: Added include of popcount.h.
* genome_hr.c: Moved tables to popcount.c, and added include of popcount.h.
* indexdb-write.c: Revised size of memory reserved for bitpackptrs. Moved
tables to popcount.c, and added include of popcount.h.
* sarray-write.c: Revised size of memory reserved. Added include of
popcount.h
2013-11-22 twu
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst, index.html:
Updated version number
* stage3.c: In build_pairs_singles, build_pairs_introns, and
build_pairs_dualintrons, remove gaps at the beginning and any gap at the
end of the path.
* outbuffer.c: Fixed bug in using --fails-as-input on single-end reads
* stage3.c: Replaced one call to insert_gapholders with List_reverse
* Makefile.dna.am: Added files needed for gmapindex and uniqscan
* stage3.c: Inserting gapholders before calling assign_intron_probs
2013-11-21 twu
* iit-read.c: Fixed issue with IIT_fieldvalue returning the previous line
from what is desired
2013-11-20 twu
* stage3hr.c: Slight improvement in efficiency of randomization procedure
* stage3hr.c: Using a different formula for generating a random integer
* stage3hr.c: Put an explicit type conversion to double before RAND_MAX
* stage3hr.c: Now picking primary alignment randomly among best alignments
with ties
* trunk, VERSION, config.site.rescomp.prd, config.site.rescomp.tst,
index.html, src, Makefile.dna.am, Makefile.gsnaptoo.am, dynprog.c,
iit-read.c, iit-read.h, iit-write.c, iit-write.h, iit_get.c, iit_store.c,
iitdef.h, littleendian.h, util: Merged revisions 115687 to 115891 from
branches/2013-11-19-iit-values
* archive.html: Moved version 2013-10-28 to archive
* README: Changed discussion to use gmap_build and not gmap_setup
* configure.ac, Makefile.am: Removing gmap_setup program
* gmap_build.pl.in: Added --contigs-are-mapped and --fasta-pipe options.
Providing arguments to subroutines.
* archive.html: Added link to old database format using gamma compression
* index.html: Provided link to old genome database formats
* src, spanningelt.c: Fixed bug in code for spanning elt in regular-sized
genomes
* index.html: Updated for version 2013-10-28
2013-11-19 twu
* README: Changed N1 and N2 abbreviations to NM
* configure.ac, Makefile.am, ensembl_genes.pl.in: Added ensembl_genes program
* gmap_build.pl.in: Clarified usage of --circular feature when --names
feature is also used
* bitpack64-read.c: Declared variable needed for non-SSE2 compilation
* stage2.c: Revised debugging statement
* gmap.c: Added debugging statement
* oligoindex_hr.c: Made debugging statements print counts for each oligomer
* oligoindex.c: Resetting query_evaluated_p in Oligoindex_untally, to avoid
interactions between queryseqs
2013-11-18 twu
* dynprog.c: Not adding dashes if gap is long. Not performing simple genome
gap if finalp is true.
* Makefile.dna.am, Makefile.gsnaptoo.am: Added sarray-read.c and
sarray-read.h to gsnapl and uniqscanl
* uniqscan.c: Using new interface to Stage3hr_setup
* stage3hr.c, outbuffer.c: Removed unused code
* smooth.c, stage3.c: In traverse_dual_intron, if dual scores win over
single score, protecting that exon from being smoothed by size
* pairdef.h: Added comment about protectedp
* pair.c, pair.h: Added function Pair_protect_list
* gmap.c: Reduced threshold for discarding gegion from 0.80*max_ncovered to
0.25.
* stage3.c: Reduced amount of peelback. Made fixes to improve finding of
short exons, including reducing criteria for dual_canonical_p, and
eliminating smoothing by size.
* access.c, atoiindex.c, bitpack64-read.c, bitpack64-read.h,
bitpack64-speed-test.c, cmetindex.c, genome_hr.c, genome_hr.h, gmap.c,
gmapindex.c, gsnap.c, iit-read-univ.c, indexdb-write.c, indexdb-write.h,
indexdb.c, indexdb.h, indexdb_hr.c, indexdbdef.h, littleendian.c,
littleendian.h, snpindex.c, spanningelt.c, spanningelt.h, stage1hr.c,
stage2.c, types.h: Merged changes from branches/2013-10-16-huge-genomes to
allow for huge genomes, where offsets (or length of positions file)
exceeds 2^32 entries
* table.c, table.h: Changed type of keyfree procedure to take const void as
argument type
* stage3hr.c, stage3hr.h, shortread.c, shortread.h: Allowing
--fails-as-input to work with new xs files
* samflags.h: Changed N1 and N2 abbreviations to be NM
* outbuffer.c, samprint.c, samprint.h: Allowing --fastq-as-input to work
with new xs files
* gmap_build.pl.in: Merged changes from branches/2013-10-16-huge-genomes to
count offsets and to allow for huge genomes
2013-11-15 twu
* README: Added discussion of XS categories
* stage1hr.c: Setting ignore_found_score to be found_score, so the input
value will be correct for those procedures
* stage1.c: For short sequences (< 4 times the default matchsize),
performing scan_ends twice, once with a short matchsize and once with a
default matchsize, to improve sensitivity and specificity
* outbuffer.c, samflags.h, samprint.c, samprint.h, stage3hr.c, stage3hr.h:
Added xs (for quiet-if-excessive) output files and categories. Added
file_setup procedures for SAM and standard output types for GSNAP.
* gmap.c: Removed --quiet-if-excessive option from GMAP, because not working
properly
* gmap_build.pl.in: Added comment
* indexdb-write.c: Not performing check of bitpack compression when there
are no k-mers
* sarray-write.c: Include code for reading sarray using fread instead of mmap
2013-11-14 twu
* Makefile.gsnaptoo.am, gmapindex.c, sarray-write.c, sarray-write.h: Using
less memory for creating LCP array and saindex. Using permuted LCP
algorithm. Memory mapping suffix array file and compressed LCP files.
* genome_hr.c: Made some fixes to Genome_consecutive_matches_pair
2013-11-13 twu
* genome_hr.c, genome_hr.h: Implemented Genome_consecutive_matches_pair
2013-10-29 twu
* indexdb.c: Using a separate sanity check on positions filesize when
expanding bitpack offsets
* stage1.c: Reducing intial matchsize for short reads to be less than half
the read length
* stage3hr.c: Fixed bug where insert length was being computed between GMAP
and a TRANSLOC_SPLICE using pair_insert_length on a NULL substring.
2013-10-25 twu
* samheader.c: Changed SO:unknown to SO:unsorted
* gregion.c: Added comment
2013-10-24 twu
* gregion.c: In Gregion_extend, when chrend goes past chrhigh, setting it to
chrlength - 1 and not chrlength
* stage3.c: In peel_leftward and peel_rightward, removing initial pairs that
are gaps or indels, so we don't leave a gap or indel on the top.
* genome_sites.c: Fixed bug in large genomes where -1U was being compared to
-1UL
2013-10-23 twu
* samprint.c: For non-concordant pairs, setting clipdir to 0 when calling
SAM_compute_chrpos.
* stage3hr.c: Now calling pair_insert_length_unpaired for unpaired
alignments involving GMAP, which helps to eliminate duplicate alignments.
* README, acinclude.m4, sse2_shift_defect.m4, configure.ac: Added a test to
see if compiler can handle SSE2 shift commands properly, and setting
config.h variable automatically
2013-10-22 twu
* VERSION: Updated version number
* dynprog.c: Fixed bug in checking for either rlength or glength to be too
long
2013-10-21 twu
* stage3.c: Added hooks for HMM step, but still not using it
* stage3.h, gmap.c: Allowing stage3debug value of middle
* oligoindex.c: Adjusting lookback for querypos with no hits. Making
lookback value equal for all oligoindices.
2013-10-18 twu
* gmap.c, stage3.c, stage3.h: Made stage3debug work
2013-10-15 twu
* stage1hr.c: Restored previous criteria for distant splicing
* stage1hr.c: Loosened criteria for distant splice probabilities. Checking
for gmap_allowance on single-end GMAP and terminal improvement GMAP
alignments.
* samprint.c: Fixed bug in printing deletion resulting in consecutive M
tokens
2013-10-11 twu
* indexdb.c: Fixed memory leak when snp_root is given and desired file is
not found
* boyer-moore.c: Checking if result of Genome_get_segment_blocks_left is NULL
* dynprog.c, genome.c: In Genome_get_segment_blocks_right and
Genome_get_segment_blocks_left, returning NULL if the entire segment is
outside the chromosomal bounds
* psl_splices.pl.in: Replaced hard-coded path
2013-10-10 twu
* stage3.c: Fixed typo in variable name
* stage2.c: Adding query_offset to pairs in non-standard (cmet or atoi) modes
2013-10-09 twu
* VERSION, index.html: Updated version
* README: Added section on compiler issues. Added section numbers. Revised
some sections for latest version.
* configure.ac: Added option --with-defective-sse2-compiler
* diag.c: Added debugging statements
* compress.c: Added code for defective SSE2 compilers
* stage3hr.c: In cases of overreach when concordant is expected, returning
NULL
* gmap_build.pl.in: Added --no-sarray option to gmap_build
* stage3.c: Making sure not to leave an INDEL_COMP or SHORTGAP_COMP on the
path/pairs after peelback. Treating SHORTGAP_COMP the same as INDEL_COMP
throughout the code.
* VERSION: Updated version number
* stage3hr.c: Fixed insert length calculation for GMAP/substring to go from
start5 to end3. Fixed overreach criteria to consider only substrings that
entirely pass the end of the other hit.
2013-10-08 twu
* substring.c, substring.h: Added procedures for segment-based trimmed
overlap
* stage3hr.c: Added a general segment-based algorithm for finding trimmed
insert length. Skipping cases with dual overreach.
* stage3hr.c: Fixed bug in computing start and end positions in computing
non-GMAP trimmed insert length. Added code for handling overreach, by
changing splices to halfsplices.
* stage3hr.c: Fixed bug in computing overlap between non-GMAP hits
2013-10-07 twu
* stage3hr.c, substring.c, substring.h: Checking all possible substring
combinations in computing trimmed insertlength for non-GMAP hits
2013-10-04 twu
* stage1hr.c: Fixed the computation of goal for finding the next chromosome
bounds
* sarray-read.c: Handling the case where nmisses_allowed < 0. When saindex
yields no result, setting low and high to be the beginning and end of the
suffix array, rather than iterating through saindex for a hit.
* gmap.c, stage3.c, stage3.h: In Stage3_merge_local, recovering gracefully
if clipping leads to a NULL list for either Stage3_T object
* substring.c, substring.h: Revised calculation of substring trimmed
insertlength to account for trimming of 3' end
* stage3hr.c: Revised calculation of GMAP-GMAP trimmed insertlength to
account for trimming
* samprint.c: Revised criteria for printing D token in deletion. Made
compute_cigar follow the logic of print_cigar.
2013-10-03 twu
* samprint.c: Not computing pair overlaps when a pair involves a circular
alignment
* stage3hr.c: Improvements to computing insertlength for calculating pair
overlap. Using querylength instead of querylength_adj. Searching through
GMAP pairarray to find intersecting genomic position. Added checks for
(left + genomiclength) > chrhigh.
2013-10-02 twu
* stage3.c: Reduced some parameters for looping in stage 3. Not looping for
GSNAP.
* bitpack64-access.c: Added an access procedure for packsize of zero
* bitpack64-read.c: Added a read procedure for packsize of zero
* sarray-write.c: Changed type of len to size_t
* indexdb-write.c: Handling the case when all offset differences are zero,
resulting in a packsize of zero
* bitpack64-write.c: Allow writing when packsize is zero
* stage3.c: Using new interface to Pair_print_sam
* config.site.rescomp.prd, config.site.rescomp.tst: Updated version number
* archive.html, index.html: Made changes for new version, including the one
with suffix arrays
* outbuffer.c, pair.c, pair.h, samprint.c, samprint.h, stage3hr.c,
substring.c, substring.h: Made numerous changes to fix --clip-overlap,
involving trimming and GMAP alignments
2013-10-01 twu
* sarray-read.c: Fixed memory leak
* stage3.c: Turned off HMM in stage 3, because of possible core dumps
* VERSION, config.site.rescomp.prd: Updated version number
* stage3.c: Allowing iteration of outer loop based on dual_break_p.
Restored filtering by HMM to remove bad sections. Removing noncanonical
end exons only if both canonical and noncanonical introns exist.
* index.html: Updated for latest version
* fa_coords.pl.in, gmap_build.pl.in: Allowing user to specify circular
chromosomes using a file instead of the command line.
2013-09-30 twu
* bitpack64-read.c: Fixed Bitpack64_offsetptr_only to work on poly-T.
* trunk, src, bitpack64-read.c, dynprog.c, indexdb-write.c, sarray-write.c,
util: Merged revisions from branches/2013-09-27-bidir-bitpack64 to
implement bidirectional bitpack64 format
* stage3hr.c: Fixed Stage3pair_overlap to handle trimmed ends
* access.c: Disabling memory allocation on Macintoshes when single fread
fails
* stage3hr.c, stage1hr.c: Added debugging statements
* resulthr.c: Handing a missing case in Pairtype_string
* gsnap.c: No longer setting terminal_output_minlength to be MAX_READLENGTH
* genome_hr.c: For determining trim mismatches, setting query_unk_mismatch_p
to be false
* access.c: Eliminated check for i >= 0, which is always true for size_t
* trunk, src, dynprog.c, util: Merged revisions 109420 through 109556 from
releases/internal-2013-09-27
* stage1hr.c, stage3hr.c, stage3hr.h: Merged revisions 109420 through 109556
from releases/internal-2013-09-27 to check for concordance after running
GMAP improvement on paired results
* madvise-flags.m4: Added check for MADV_SEQUENTIAL
* dynprog.c: Making NEG_INFINITY_16 and NEG_INFINITY_8 visible outside SIMD
code
* gmap.c: No longer bounding chimera_margin by CHIMERA_SLOP
* stage3.c: Avoiding indels at chimeric join by cleaning ends, extending
with no gaps, and then clipping at breakpoint
* substring.c: Fixed assertions relative to chrhigh
* samprint.c: Fixed bug in --clip-overlap SAM output where deletion occurs
at the clip site
2013-09-27 twu
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst: Updated version
number
* access.c: Fixed workaround for Macintosh fread bug to handle large files
* uniqscan.c, gsnap.c: Providing --gmap-allowance
* samprint.c: Fixing problem with mate flags when the mate is a translocation
* sarray-read.h: Passing max_end_deletions into Sarray_setup
* sarray-read.c: If read from beginning does not go halfway across read,
trying a search from the middle of the read to see if it finds
substitutions
* substring.c: No longer allowing donor or acceptor substrings in circular
chromosomes, thereby preventing translocations to those chromosomes
* stage1hr.c, stage3hr.c, stage3hr.h: Using
Stage3end_better_equiv_unpaired_p as a way to determine whether to do
spanning set or complete set algorithms after suffix array method.
* stage1hr.h: Added gmap_allowance to filter bad GMAP alignments
* stage1hr.c: Added gmap_allowance to filter bad GMAP alignments. No longer
allowing GMAP splicing on circular chromosomes. No GMAP improvement on
sarray hits. Doing spanning set algorithm after suffix array algorithm if
nconcordant == 0.
2013-09-26 twu
* sarray-read.c: Applying max_end_deletions. In collect_elt_matches,
looking at multiple hits between goal and high.
2013-09-25 twu
* substring.c: Disallowing donor and acceptor substrings on the duplicate
length of a circular chromosome
2013-09-24 twu
* trunk, src, dynprog.c, util: Reintegrated changes from
releases/internal-2013-09-01
* stage3.c, stage3.h: Merged changes from releases/internal-2013-09-01 to
clean ends of local merges
* sarray-read.c, sarray-read.h: Merged changes from
releases/internal-2013-09-01 to fix problem with insertions that are too
long
* pairpool.c: Merged changes from releases/internal-2013-09-01 to add
debugging statements
* pair.c: Merged changes from releases/internal-2013-09-01 to fix
Pair_start_bound and Pair_end_bound
* gsnap.c: Merged changes from releases/internal-2013-09-01 to use new
interfaces to Sarray_setup
* gmap.c: Merged changes from releases/internal-2013-09-01 to use new
interfaces to stage 3 chimera commands
* VERSION: Updated version number
* bitpack64-speed-test.c, gamma-speed-test.c: Finding both start and end of
position blocks
* sarray-read.c, sarray-read.h: Make some fields available for benchmarking
purposes
2013-09-19 twu
* pairpool.c: In Pairpool_clean_join, making sure that no gaps are left at
the medial ends
* gmap.c, stage3.c, stage3.h: In Stage3_merge_local_splice, when a new
intron is required, performing a full call to path_compute_dir and
path_compute_final
* gmap.c: Fixed behavior of -n 1 in GMAP, so it does not print chimeric
alignments
* substring.c: Added new assertions to make sure alignstart and alignend do
not exceed chrhigh
* sarray-read.c: For indels and splices, using subtract_bounded and
add_bounded to make sure the other piece is on the same chromosome
2013-09-18 twu
* stage3hr.c: Stage3pair_overlap inverts return value for minus alignments
* pair.c, stage3hr.c: Reworking of computation for pair overlap
2013-09-17 twu
* bitpack64-speed-test.c: Added code for uncompressing offsets
* stage3hr.c: Computing pair overlap based on difference between totallength
and insertlength, not alignment coordinates
* indexdb.c: Fixed user message about expanding bitpackcomp
* stage1hr.c: Fixed memory leak when newpair involving GMAP is not kept
* stage3.c: When queryjump > nullgap and genomejump < 16 (not enough
material for stage 2), performing cDNA gap rather than dual break. In
path_compute_final, resolving dual breaks as single gaps.
* stage3hr.c: Fixed memory leak when resolving inner splices
* stage3hr.c: Fixed memory leak
* shortread.c: Fixed invalid read in chopping adapters when read lengths are
unequal
* stage3.c: Removed uninitialized variable
2013-09-16 twu
* bitpack64-speed-test.c: Initial import into SVN
* Makefile.dna.am: Made instructions match those for Makefile.gsnaptoo.am.
Added gamma-speed-test and bitpack64-speed-test.
* gamma-speed-test.c: Simplified code to focus only on decoding
* splicing-score.c: Fixed code for universal chromosome IIT
* indexdb.c, indexdb.h: Exposing different types of filename retrieval
* gregion.c: Considering fraction of overlap
* samprint.c: Fixed CIGAR strings for splicing on minus strand with
clip-overlap
2013-09-13 twu
* gmap.c: Increased default value for shortsplicedist from 200,000 to
2,000,000. For chimeric alignments that overlap, fixed bug where
maxpeelback was negative, and now computing based on midpoint of the ends.
* stage3.h: In Stage3_mergeable, using shortsplicedist
* stage3.c: No longer calling Smooth_pairs_by_netgap. In Stage3_mergeable,
using shortsplicedist. For extend5 and extend3, handling case where path
or pairs is NULL after peelback.
2013-09-12 twu
* index.html: Updated for version 2013-09-11
2013-09-11 twu
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst: Updated version
numbers
* stage1.c: Added debugging statement
* gregion.c: For determining overlaps, using chrstart and chrend instead of
extentstart and extentend
* pairpool.c, pairpool.h, stage3.c: In merging for local splice, added
procedure Pairpool_clean_join to peel back both ends to remove negative
genomejumps
* stage3hr.c: Fixed memory leak on new pairs. Handling the case where pair
overlap is negative.
* stage1hr.c: Fixed memory leak on terminal alignments that fail
terminal_output_minlength test
* shortread.c: Not checking for Illumina endings on shortreads that are
skipped
* dynprog.c: Initializing value of introntype
* pair.c: Fixed memory leak under --clip-overlap feature
* stage3.c: Requiring 25 bp on both sides of a chimeric alignment
2013-09-10 twu
* sarray-read.c: Fixed bug in Elt_fill_positions_all where position could be
negative
2013-09-09 twu
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst: Updated version
number
* stage3.c: Restored finding of microexons when sequence quality is high
* pair.c: Making value of trim_5p dependent on trim_left
* iit_get.c: Fixed behavior of iit_get to work when reading queries from
stdin
* stage3.c: In build_pairs_introns, when finalp is true, solving cDNA gaps
as single gaps
* stage3.c: When finalp is true, not relying upon stored solutions for any
traversal functions. Performing iterative calls to trimming non-canonical
introns at ends.
* stage3.c: In traverse_single_gap, not relying upon previous result if
finalp is true
* stage2.c: Checking to make sure calls to genome canonicalp procedures do
not have negative coordinates
* gmap.c: Initializing genome_sites for pairalign and usersegment queries
* samprint.c: Allowing printing of results where chrpos == 0
2013-09-06 twu
* index.html: Updated for latest version
* stage1hr.c: Fixed bug in add_bounded where coordinates extended past
chrhigh
* dynprog.c: Fixed problem where prob_trunc was not being initialized
* fa_coords.pl.in, gmap_build.pl.in: Added -n flag for substituting
chromosome names
2013-09-05 twu
* gmap.c, gsnap.c: Allowed compilation when pthreads are disabled
2013-09-04 twu
* stage1hr.c: Fixed a fatal bug in find_middle_indels when floors is NULL,
from all oligos being omitted
* README: Added discussion about output types
* stage3.c: Put an absolute limit on peelback. In build_pairs_introns, when
queryjump exceeds nullgap, traversing a dual break.
* pair.c: Eliminated possible infinite loop in
Pair_guess_cdna_direction_array
* gsnap.c: Fixed error message
2013-09-03 twu
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst: Updated version
number
* outbuffer.c, stage3.c, stage3.h: Providing XO abbrev information for SAM
output from GMAP
* outbuffer.c, pair.c, pair.h, samflags.h, samprint.c, samprint.h: Added XO
flag to SAM output
2013-08-31 twu
* gsnap.c, stage3hr.c, stage3hr.h, uniqscan.c: Fixed genomicstart for
distant samechr splices when --merge-distant-samechr is specified
* gregion.c: Fixed bug where overlaps were found in error because we used
chromosomal coordinates instead of univeral coordinates
* dynprog.c: Made single and paired indel penalties the same
* README, configure.ac: Modified to include gvf_iit program
* Makefile.am, gvf_iit.pl.in: Added gvf_iit program
2013-08-30 twu
* gsnap.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h, uniqscan.c:
Filtering terminals with minlength less than value of
--terminal-output-minlength
* stage3hr.c, stage3hr.h: Creating Stage3pair_T for terminals only if the
terminal lengths exceed terminal_output_minlength
2013-08-29 twu
* trunk, src, dynprog.c, dynprog.h, gmap.c, gsnap.c, pair.c, pair.h,
stage1hr.c, stage2.c, stage3.c, stage3.h, stage3hr.c, stage3hr.h, util:
Did reintegration merge (revisions 106186 to 106267) of
branches/2013-08-20-stage23-work
* trunk, src, util: Commit property changes
2013-08-28 twu
* trunk, config.site.rescomp.tst, src, Makefile.gsnaptoo.am, diag.c, diag.h,
doublelist.c, dynprog.c, dynprog.h, genome_sites.c, genome_sites.h,
gmap.c, gsnap.c, pairpool.c, pairpool.h, smooth.c, smooth.h, stage1.c,
stage1hr.c, stage2.c, stage2.h, stage3.c, stage3.h, stage3hr.c,
uniqscan.c, util: Merged revisions 105272 to 106186 from
branches/2013-08-20-stage23-work to improve stage 2 and stage 3 procedures
2013-08-20 twu
* trunk, src, dynprog.c, gmap.c, scores.h, util: Reintegrated changes from
branches/2013-08-20-stage23-work to fix alignment rankings and evaluation
of chimeric alignment
* index.html: Updated for latest version
* gmap_build.pl.in: Handling case where -d argument contains a directory
* stage3.c: Fixed infinite loop in peelback. Made some changes in stage 3
procedures.
* smooth.c, smooth.h: Restoring procedures for smoothing by net gap
* pairpool.c: If all pairs are outside bounds, returning NULL
* dynprog.c: Reduced indel penalties for low-quality alignments
2013-08-19 twu
* outbuffer.c: Adding q() line for R output
2013-08-13 twu
* stage3hr.c: Restoring filter where terminal alignments are removed if
mismatches in trimmed region are too high.
* stage1hr.c: In computing genomic bounds for GMAP alignment, truncating at
chromosomal boundaries
* chimera.c, chimera.h, gmap.c: For local chimeric joins, requiring that
pieces are locally joinable in Chimera_bestpath
2013-08-07 twu
* dynprog.c: Added cache flush command needed for opencc compiler using -O3
optimization on AMD computers
2013-08-06 twu
* genome_sites.c: Added lookup tables needed when popcnt is not available
* genome_hr.c: Made debugging statements work when popcnt is not available
2013-08-02 twu
* acinclude.m4, configure.ac: Removed popcnt.m4 and relying upon ACX_BUILTIN
* Makefile.gsnaptoo.am, outbuffer.c, samheader.c, samheader.h, samprint.c,
samprint.h: Moved SAM header code to new samheader.c file
* gmap.c, gsnap.c, outbuffer.c, outbuffer.h, samprint.c, samprint.h: Added
@HD and @PG header lines to SAM output
2013-07-25 twu
* trunk, NOTICE, config.site.rescomp.prd, src, Makefile.gsnaptoo.am,
dynprog.c, fastlog.h, mapq.c, mapq.h, oligoindex.c, oligoindex_hr.c,
pair.c, pair.h, pairpool.c, smooth.c, smooth.h, stage1hr.c, stage2.c,
stage3.c, stage3hr.c, stage3hr.h, substring.c, substring.h,
univinterval.c, univinterval.h, util: Reintegrated revisions from
branches/2013-07-24-fastlog to use a fast approximate log function, to
change calloc to malloc in several places, to eliminate a smoothing step,
and to improve the oligoindex SIMD procedures slightly
2013-07-24 twu
* trunk, src, Makefile.gsnaptoo.am, compress.c, compress.h, dynprog.c,
dynprog.h, genome_hr.c, gmap.c, pair.c, pairpool.c, pairpool.h,
stage1hr.c, stage2.c, stage3.c, stage3.h, stage3hr.c, stage3hr.h, table.c,
table.h, univinterval.c, univinterval.h, util: Merged revisions 102329 to
102725 from branches/2012-07-22-stage3-dir to restructure stage 3
algorithm, including reduction of insert_gapholders and early
determination of cdna direction; to add gmap_history to memoize GMAP
alignment results; and SIMD instructions for Compress_shift
2013-07-23 twu
* stage3.c: Changing Pairpool_pop to List_transfer_one in some procedures
2013-07-22 twu
* trunk, config.site.rescomp.prd, config.site.rescomp.tst, src, dynprog.c,
dynprog.h, stage3.c, util: Reintegrated revisions from
branches/2013-07-22-stage3-dir to determine cdna direction early in stage 3
* VERSION, index.html: Updated version number
* dynprog.c: Added memory fences at beginning of each SIMD loop
* stage2.c: Returned to safer code that keeps ranges 3 and 4 separate
* dynprog.c: Added needed variables
* dynprog.c: Making a single call to find splice sites from IIT file,
instead of individual calls. Removed old code.
* iit-read.c, iit-read.h: Added functions IIT_get_lows_signed and
IIT_get_highs_signed
* dynprog.c: In bridge_intron_gap procedures, using genomic sequence instead
of get_genomic_nt. Also, implemented code for a single call to
splicesites IIT, instead of one call for each position.
2013-07-19 twu
* gmap_setup.pl.in: Added backslash escape that was missing
* gsnap.c: Fixed help string
* sarray-read.c: Handling N's in query sequence
* types.h, compress-write.h: Added comment
* iit-write-univ.c: Fixed message
* types.h: Prefer to use unsigned long long for UINT8
* stage3hr.c: Changed debugging statement
* stage1hr.c: Changed single-end statement for terminals to look like
paired-end statement
* gsnap.c: Turned default for --terminal-threshold back to 2 for DNA-Seq
alignments, since terminal alignments are needed for some GMAP alignments
* bitpack64-write.c: Using _mm_store_si128 instead of type casting and a
memory fence
* iit-write-univ.c: Added message about coordinate sizes
* iit-write.c: Including types.h just to make sure we have it
* stage1hr.c: Keeping lowprob splices, unless dominated by other splices
* gsnap.c, uniqscan.c: Restored --pairexpect and --pairdev flags. Giving
information to Stage3hr_setup.
* stage3hr.h, stage3hr.c: Using insertlength, outerlength, and splice
probabilities to resolve difficult cases
* dynprog.c: Added memory fences between SIMD and non-SIMD code
2013-07-18 twu
* bitpack64-write.c: Added comment
* bitpack64-write.c: Added _mm_lfence to take care of incorrect SIMD
behavior.
* sarray-write.c: Fixed procedure for checking bitpack compression
* indexdb.c: Fixed procedure for expanding bitpack
* indexdb-write.c: Added procedure for checking bitpack compression
* indexdb.c: Added extra information to message when expanding offsets
* bitpack64-access.c: Added extra information to error message
* bitpack64-read.c: Fixed Bitpack64_block_offsets function
2013-07-17 twu
* trunk, config.site.rescomp.tst, src, dynprog.c, util: Reintegrated
revisions from branches/2013-07-16-faster-stage2
* stage2.c: Reintegrated revisions from branches/2013-07-16-faster-stage2 to
speed up stage2 procedure
* trunk, src, dynprog.c, goby.c, goby.h, inbuffer.c, shortread.c,
shortread.h, uniqscan.c, util: Reintegrated changes from
branches/2013-07-17-reduce-memset to not allocate memory for short reads
that are skipped
* VERSION, index.html: Updated version number
* dynprog.c: Preventing read of uninitialized variable at matrix position
0,0 during traceback
* dynprog.c: Not initializing directions_Egap or directions_nogap
* stage1.c: In find_first_pair, restored decision of which end to advance
based on number of hits
2013-07-16 twu
* gregion.c: Turned off debugging
* stage1.c: Allowing arbitrarily long scanning of ends to find first pair
* sequence.c: Modified definition of Sequence_trimlength
* gregion.c: Handling case where extentstart goes past genomic position 0
* gmap.c: Handling case where genomebits is absent
* genome.c: Handling case of negative genomic coordinates
* diag.c, oligoindex_hr.c, stage3.c: Fixed ends to capture last oligomers of
sequence and chimeras exactly
* stage2.c: Added comment
* bitpack64-access.c: Fixed size of accessor table
* dynprog.c: For non-SSE4.1 8-bit SIMD, using a separate pairscore array
incremented by 128.
2013-07-15 twu
* dynprog.c: Turned off debugging
* trunk, src, dynprog.c, util: Reintegrated revisions from
branches/2013-07-15-sse2-simd8 allowing SSE computers to run 8-bit SIMD
dynamic programming procedures
* VERSION, config.site.rescomp.prd, index.html: Updated version number
* dynprog.c: Before calling Boyer-Moore procedure, requiring that textlen >=
querylen
* dynprog.c: Not calling Boyer-Moore procedure if textright <= textleft
* dynprog.c: Fixed variable name for SSE2 code
* gsnap.c: Fixed --help comment for --mode flag
* gmap.c: Including a header file for compress-write.h
* compress-write.c: Fixed bug in writing genomebits for user-provided
segment in GMAP
* genomicpos.c, genomicpos.h: Fixed formatting routine to work on large files
* trunk, Makefile.am, src, Makefile.gsnaptoo.am, bitpack64-access.c,
bitpack64-access.h, bitpack64-write.c, bitpack64-write.h, gmapindex.c,
gsnap.c, indexdb-write.c, sarray-read.c, sarray-write.c, sarray-write.h,
util, gmap_build.pl.in, gmap_setup.pl.in: Reintegrated revisions from
branches/2013-07-11-compress-lcp to compress lcp file
* stage1hr.c: Disallowing diagonals < querylength, which lead to left < 0,
again
* boyer-moore.c, dynprog.c: Using new Genome_get_segment_blocks_left and
Genome_get_segment_blocks_right procedures
* genome.c, genome.h: Added separate Genome_get_segment_blocks_left and
Genome_get_segment_blocks_right
2013-07-14 twu
* stage1hr.c: In batch_init and identify_all_segments, allowing diagonal to
be less than querylength, needed for insertions in reads at the beginning
of the genome.
* genome.c: Fixed memory leak when genomebits file is not available
* Makefile.am, NOTICE: Added NOTICE file for distribution
* Makefile.gsnaptoo.am: Using new saca-k.c and saca-k.h files
* compress-write.c, genome.c: Fixed bug where coordinates above 2^31 were
being treated as negative values
* iit_store.c: For IITs with divs (versions above 1), converting from
Univinterval_T objects to Interval_T objects
* saca-k.c, saca-k.h, sarray-write.c: Moved suffix array construction
procedures to a separate file. Using latest version of SACA-K code.
* gmap_build.pl.in: Added command for making suffix array
2013-07-11 twu
* trunk, VERSION, config.site.rescomp.prd, config.site.rescomp.tst, src,
Makefile.gsnaptoo.am, compress-write.c, compress-write.h, compress.c,
compress.h, compress128.c, compress128.h, genome-write.c, genome-write.h,
genome.c, genome.h, genome128-write.c, genome128-write.h, genome128.c,
genome128.h, genome128_hr.c, genome128_hr.h, genome_hr.c, genome_hr.h,
genome_sites.c, genome_sites.h, get-genome.c, gmap.c, gmapindex.c,
gsnap.c, indexdb-write.c, maxent128_hr.c, maxent128_hr.h, snpindex.c,
splice.c, splicetrie.c, splicetrie_build.c, stage1hr.c, stage2.c,
uniqscan.c, util, gmap_build.pl.in: Merged changes from
branches/2013-07-05-new-genomecomp to implement 32-bit unshuffled
representation of genome
* gmap_setup.pl.in: Added offsets, bitpackptrs, and bitpackcomp suffixes
* gff3_splicesites.pl.in: Removed debugging statement
* boyer-moore.c, dynprog.c: Using Genome_get_segment_blocks to avoid making
repeated calls to Genome_get_char_blocks
* genome.c, genome.h: Implemented Genome_get_segment_blocks
* sarray-read.c: Fixed bug in known splicing, where index j was not being
incremented
* samprint.c: Fixed hardclipping ends for circular alignments
2013-07-10 twu
* stage1hr.c: Fixed debugging statement
2013-07-09 twu
* gff3_introns.pl.in: Fixed warning statements
2013-07-05 twu
* setup1.test.in: Fixed test to use gmap_build, rather than gmap_setup
* oligoindex_hr.c: Handling special cases now within general case. Not
allowing count/store rev SIMD procedures to have ptr go below 0. Fixed
debugging comparison of std and SIMD results.
* gmap.c, stage2.c: Using new interfaces to Oligoindex_untally and
Oligoindex_clear_inquery
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst: Updated version
number
* sarray-read.c, sarray-write.c: Changed lcp file suffix to be .salcp.
Checking to see if suffix array files are present, and returning NULL
gracefully if not.
* oligoindex.c, oligoindex.h: Clearing Oligoindex_T data structures after
alignment by going through oligomers in the query, rather than using
memset on the entire structure.
* uniqscan.c: Using new interface to Stage1hr_setup
* gsnap.c: Added --use-sarray flag
* stage1hr.c, stage1hr.h: Added parameter use_sarray_p to setup procedure.
Improved warning message for very short reads.
* gff3_genes.pl.in: Printing genes with a single exon
* gff3_genes.pl.in, gff3_introns.pl.in, gff3_splicesites.pl.in: Allowing
type CDS to be equivalent to exon
2013-07-04 twu
* oligoindex_hr.c: Removed global memcpy of oligospace in
allocate_positions, and copying individual pointers at the same time as
positions are assigned.
* gsnap.c, uniqscan.c: Using new interfaces to Stage1hr procedures
* stage1hr.c, stage1hr.h: Generalized mergeable list to be used for middle
indels as well as splicing
* oligoindex_hr.c: Fixed values for startdiscard and enddiscard in special
case of count_fwdrev_simd and store_fwdrev_simd when startptr + 3 ==
endptr.
* sarray-read.c: Implemented SIMD procedure for scanning array in
Elt_fill_positions_filtered
2013-07-03 twu
* gsnap.c: For a single read, changing sarray access to be USE_MMAP_ONLY
* stage1hr.c: Removed requirement of nconcordant == 0 from performing GMAP
terminal alignments. The concordant pairs can sometimes include pairs of
terminal alignments.
* sarray-read.c: In calls to Genome_count_mismatches_limit, changed
nmismatches_allowed parameter from incorrect value of 2 to nmisses_allowed.
* sarray-read.c: Increased value of SARRAY_EXCESS_HITS from 1000 to 100,000.
Providing separate actions for USE_MMAP_PRELOAD and USE_MMAP_ONLY in
Sarray_new.
* trunk, src, Makefile.gsnaptoo.am, access.c, genome.c, genome.h,
genome_hr.c, genome_hr.h, gmapindex.c, gsnap.c, pair.c, pair.h,
samprint.c, sarray-read.c, sarray-read.h, sarray-write.c, sarray-write.h,
spanningelt.c, splice.c, splice.h, stage1hr.c, stage3hr.c, stage3hr.h,
types.h, util: Merged revisions 100273 to 100402 from
branches/2013-07-02-suffix-array-redo to implement suffix array algorithm
2013-07-02 twu
* gsnap.c: Added comment
* gmap_build.pl.in: When compression type is specified to be none, setting
base size to be equal to the k-mer size.
* gmapindex.c: Fixed bug from failing to initialize compression_types.
Changing to NO_COMPRESSION when base size is equal to k-mer size.
* indexdb-write.c, indexdb-write.h: When compression_type is NO_COMPRESSION,
writing file as "offsets", rather than "offsetscomp".
* indexdb.c, indexdb_hr.c: Checking for NO_COMPRESSION case first
* oligoindex_hr.c: Fixed case for SIMD where startptr and endptr are 3 units
apart (adjacent blocks).
* shortread.c: Ignoring accession and header for second queryseq in
paired-end FASTA format over two files
* indexdb-write.c: Computing basesize separately for bitpack and gamma
compression
* atoiindex.c, cmetindex.c, gmapindex.c, indexdb.c, indexdb.h, snpindex.c:
Added offsets_only_p argument to Indexdb_get_filenames
* gmap_build.pl.in: Added -z flag for specifying compression types
* trunk, config.site.rescomp.prd, src, util: Merged changes from
branches/2013-07-01-faster-splicing to speed up find_singlesplices and
find_doublesplices
* gsnap.c, uniqscan.c: Using new interface to Stage1hr procedures
* stage1hr.c, stage1hr.h: Merged changes from
branches/2013-07-01-faster-splicing to use a spliceable array. Removed
code for quicksort version of identify_all_segments.
* genome_hr.c: Added comment about unshuffle procedure
* VERSION, config.site.rescomp.tst: Updated version number
* gff3_genes.pl.in, gff3_introns.pl.in, gff3_splicesites.pl.in: Handling
GFF3 files that lack gene lines
* indexdb.c: Changed stderr message when allocating memory for output
pointers
* bitpack64-read.c: Using a procedure dispatch table instead of a switch
statement
2013-07-01 twu
* atoiindex.c, cmetindex.c, indexdb-write.c, indexdb-write.h, snpindex.c:
Added support for bitpack compression
* gmapindex.c, indexdb.c, indexdb.h, indexdb-write.c, indexdb-write.h: Added
ability to write positions from offsets compressed with bitpacking method
* bitpack64-read.c: Implemented Bitpack64_block_offsets to compute all
offsets
2013-06-30 twu
* bitpack64-read.c: Gave distinct variable names for debugging statements
* indexdb-write.c, bitpack64-write.c: Added code so bitpack offsets file can
be written without SIMD instructions.
* bitpack16-read.c, bitpack32-read.c: Fixed debugging code
* bitpack64-read.c: Made some speed improvements in non-SIMD code by only
adding as many terms as needed.
* bitpack64-read.c: Added code for decoding of bitpack files without SSE2
present
* Makefile.gsnaptoo.am, bitpack64-read.c, bitpack64-read.h,
bitpack64-write.c, bitpack64-write.h, gmapindex.c, indexdb-write.c,
indexdb.c, indexdb_hr.c, indexdbdef.h: Changed back to 64-element blocks,
with only two pieces of meta-information, pointer and offset0, and
packsize inferred from successive pointers.
* bitpack32-read.c, bitpack32-read.h: Performing cumulative sums within each
unpack procedure
2013-06-29 twu
* bitpack32-read.c, bitpack32-read.h, bitpack32-write.c, bitpack32-write.h,
indexdb-write.c, indexdb.c, indexdb_hr.c, indexdbdef.h, gmapindex.c,
Makefile.gsnaptoo.am: Changed from 64-element blocks to 32-element
half-blocks
* indexdb.c, indexdb.h: Added Indexdb_get_filenames procedure that returns a
Filenames_T object. Put definition of Filenames_T into header file.
* genome_hr.c, genome_hr.h: Using Offsetscomp_T type
* bitpack16-write.c: Changed write procedure to take a pointer as argument,
to avoid compiler warnings
* bitpack16-read.h: Added procedure to compute all offsets
* gmapindex.c, indexdb-write.h: Determining selection of filenames based on
compression type for offsets.
* bitpack16-read.c, indexdb-write.c: Changed forrmat to put all metablock
information, including offset and packsize, into the pointers file
2013-06-28 twu
* indexdbdef.h: Added values for compression types, and compression type
field in Indexdb_T object.
* indexdb_hr.c: Calling bitpack or gamma compression at run time, as needed
* indexdb.c, indexdb.h: Searching for bitpack format, then gamma format, and
then no compression. Created Filenames_T object to standardize routines.
* indexdb-write.c, indexdb-write.h: Added compression_types as a parameter,
so multiple formats can be written
* bitpack16-read.c, bitpack16-read.h: Moved data structures to static
variables, so they do not need to be passed as arguments each time.
* bitpack16-read.c, bitpack16-read.h: Using Blocksize_T type
* indexdb-write.c, indexdb-write.h: Restored lost code from large genome
revisions
* Makefile.gsnaptoo.am: Added bitpack files for large genome programs
* trunk, config.site.rescomp.prd, src, Makefile.gsnaptoo.am,
bitpack16-read.c, bitpack16-read.h, bitpack16-write.c, bitpack16-write.h,
indexdb-write.c, indexdb-write.h, indexdb.c, indexdb_hr.c, util:
Reintegrated revisions from branches/2013-06-14-bitpacking to add bitpack
compression code
* trunk, index.html, src, util: Added link to large genomes version
* oligoindex_hr.c: Merged revisions 99785 to 99781 from
branches/2013-06-27-simd-oligo to add SIMD code for counting and storing
olimers
2013-06-27 twu
* types.h: Added comment
* stage1.c, gsnap.c, gmap.c: Using Width_T type
* spanningelt.c, spanningelt.h: Using Width_T types
* pair.c, pair.h: Changed types for binary search to be Chrpos_T
* indexdbdef.h, indexdb.c, indexdb.h: Using Width_T and Blocksize_T types
* indexdb-write.c: Resolved compiler warnings about signed/unsigned
comparisons
* genome_hr.c, genome_hr.h: Using Blocksize_T type for offsetscomp_blocksize
* block.c, block.h: Using Width_T type for oligosize
* types.h: Added Width_T and Blocksize_t types
* trunk, README, configure.ac, src, Makefile.gsnaptoo.am, access.c,
alphabet.c, alphabet.h, atoi.c, atoiindex.c, bigendian.c, bigendian.h,
block.c, block.h, boyer-moore.c, boyer-moore.h, chimera.c, chimera.h,
chrnum.c, chrnum.h, chrom.c, chrom.h, chrsubset.c, chrsubset.h, cmet.c,
cmetindex.c, compress.c, compress.h, diag.c, diag.h, diagdef.h, dynprog.c,
dynprog.h, gdiag.c, genome-write.c, genome-write.h, genome.c, genome.h,
genome_hr.c, genome_hr.h, genomicpos.c, genomicpos.h, genuncompress.c,
get-genome.c, gmap.c, gmapindex.c, goby.c, goby.h, gregion.c, gregion.h,
gsnap.c, gsnap_tally.c, iit-read-univ.c, iit-read-univ.h, iit-read.c,
iit-read.h, iit-write-univ.c, iit-write-univ.h, iit-write.c, iit_dump.c,
iit_get.c, iit_store.c, iitdef.h, indexdb-write.c, indexdb-write.h,
indexdb.c, indexdb.h, indexdb_hr.c, indexdb_hr.h, indexdbdef.h,
interval.c, interval.h, intron.c, intron.h, littleendian.c,
littleendian.h, mapq.c, mapq.h, match.c, match.h, matchdef.h, matchpool.c,
matchpool.h, maxent_hr.c, maxent_hr.h, oligo.c, oligo.h, oligoindex.c,
oligoindex.h, oligoindex_hr.c, oligoindex_hr.h, oligop.c, oligop.h,
outbuffer.c, outbuffer.h, pair.c, pair.h, pairdef.h, parserange.c,
parserange.h, samprint.c, samprint.h, segmentpos.c, segmentpos.h,
snpindex.c, spanningelt.c, spanningelt.h, splicetrie.c, splicetrie.h,
splicetrie_build.c, splicetrie_build.h, stage1.c, stage1.h, stage1hr.c,
stage1hr.h, stage2.c, stage2.h, stage3.c, stage3.h, stage3hr.c,
stage3hr.h, substring.c, substring.h, tableint.c, tableuint.c,
tableuint.h, tableuint8.c, tableuint8.h, types.h, uint8list.c,
uint8list.h, uintlist.c, uintlist.h, uniqscan.c, univinterval.c,
univinterval.h, iittest.iit.ok, util: Reintegrated changes from
branches/2012-02-14-biggenomes to handle large genomes
* index.html: Updated comments for latest version
* trunk, src, dynprog.c, oligoindex.c, oligoindex.h, oligoindex_hr.c,
stage2.c, util: Merged revisions 99648 to 99702 from
branches/2013-06-25-simd-8 to use unsigned chars for counts, and 8-bit
SIMD instructions for allocate_positions
* stage3.c: Removed assertion that c != g when filling in a single bp gap,
which can occur with overabundant oligomers, such as poly-A
2013-06-26 twu
* VERSION, index.html: Updated version number
* trunk, ax_ext.m4, config.site, config.site.rescomp.tst, configure.ac, src,
boyer-moore.c, dynprog.c, gmap.c, gsnap.c, stage3hr.c, util: Merged
revisions 99462 to 99647 from branches/2013-06-25-simd-8 to allow for
dynamic programming with 8-bit chars when SSE4.1 is available
* stage3.c: Allowing insert_gapholders to fill in single mismatches, rather
than inserting a gap. Fixed peel_rightward and peel_leftward, so the
extrapeel step does not remove gaps.
* stage3hr.c: Fixed issues with wrong queryseq sequence for second end being
printed in standard GSNAP output
* dynprog.c: Further check to make sure traceback_local does not give rise
to negative querypos coordinates.
2013-06-25 twu
* dynprog.c: Distinguishing between NEG_INFINITY and NEG_INFINITY_DISPLAY
* dynprog.c: Made traceback_local code follow that for traceback
* VERSION, index.html: Updated version number
* dynprog.c: Fixed bug in traceback_local where r is negative
2013-06-18 twu
* VERSION, index.html: Updated version number
* stage3.c: Using new interface to Pair_print_sam, which requires two
accessions
* gsnap.c, outbuffer.c, pair.c, pair.h, samprint.c, samprint.h, shortread.c,
shortread.h, stage3hr.c: Added flag --allow-pe-name-mismatch
* oligoindex_hr.c: Fixed faulty cmpgt statement based on 16-bit quantities,
instead of 32-bit quantities
2013-06-14 twu
* dynprog.c: Removed unnecessary initialization for SIMD code
* trunk, util, src, gsnap.c, iit-read.c, iit-read.h, stage1hr.c, stage1hr.h:
Merged revisions 98458 to 98523 from branches/2013-06-13-sort-not-merge to
speed up update of chromosome bounds in identify_all_segments
* VERSION: Removed newline from file
2013-06-13 twu
* VERSION, config.site.rescomp.tst, index.html: Updated version number
* trunk, src, gmap.c, gsnap.c, indexdb.c, stage1hr.c, util: Merged revisions
98429 to 98457 from branches/2013-06-13-sort-not-merge to add code for
qsort, although code not used
* config.site.rescomp.prd: Removed -g flag from production CFLAGS
* trunk, config.site.rescomp.prd, index.html, src, dynprog.c, util: Merged
revisions 97749 to 98423 from branches/2013-06-05-dynprog-sse to use SIMD
instructions for dynamic programming procedures
2013-06-12 twu
* dynprog.c: Fixed an unassigned value for rlo. Fixed code in
make_splicejunction_3 where the standard splicejunction was reverse
complemented, but not splicejunction_alt.
2013-06-10 twu
* oligoindex_hr.c: Using compile-time constants to clarify code
* gmap.c, gsnap.c: Added run-time check of compiler assumptions
2013-06-09 twu
* dynprog.c: Fixes to debugging code. Slightly more efficient
initialization.
2013-06-08 twu
* oligoindex_hr.c: Computing skips of empty counts to increase speed of
allocate_positions
2013-06-07 twu
* oligoindex_hr.c: Using a convert instruction instead of a store
instruction to compute the final sum in allocate_positions.
* oligoindex_hr.c: In allocate_positions, using SIMD commands to check when
positions need to be computed
* gsnap.c: Turning off terminal alignments by default for DNA-Seq alignment.
Added "all" option for --gmap-mode.
* trunk, config.site.rescomp.prd, config.site.rescomp.tst, src, dynprog.c,
pair.c, Makefile.am, util: Merged revisions 97750 to 97963 from
branches/2013-06-05-dynprog-sse for a faster and better implementation of
dynamic programming procedures
* indexdb.c: Removed forced writing of positions by chunks for debugging
* Makefile.dna.am, Makefile.gsnaptoo.am: Removed PTHREAD_CFLAGS from LDFLAGS
and moved SIMD_FLAGS to CFLAGS
* config.site: Added sections for bzlib, simd, and popcnt
* gmap.c: Moved free of usersegment to end, after we decide whether we need
to free genome_blocks
* configure.ac: Moved check for SIMD to be close to that for check for popcnt
* VERSION: Updated version number
* configure.ac: Added flags --enable-simd and --disable-simd to control
check for SIMD features
* indexdb.c: Added a second attempt to write positions file in chunks if the
first write fails. Added a sanity check in reading in a genomic index
that the positions file has the expected size.
2013-06-05 twu
* configure.ac: Fixed comment
* ax_ext.m4: Changed comment lines for config.h to be standard definitions
of 1
* Makefile.dna.am, Makefile.gsnaptoo.am: Added SIMD_FLAGS
* gmap.c, gsnap.c: Added available SIMD functions to --version output
* indexdb.c: Added check that positions file has the expected size
* oligoindex.c, oligoindex.h, oligoindex_hr.c: Using SIMD functions for
allocate_positions
* acinclude.m4, configure.ac: Added check for SIMD support
* ax_ext.m4: Fixed spelling errors
* ax_check_compile_flag.m4, ax_gcc_x86_cpuid.m4, ax_ext.m4: Initial import
into SVN
* dynprog.c: Created separate source code for jump late and jump early
conditions
2013-05-28 twu
* pair.c: Fixed bug in final revcomp of sequence, where genomealt was not
being complemented
2013-05-22 twu
* archive.html: Storing version 2013-03-31 into archive
* index.html: Updated for version 2013-05-09
* iit_get.c: No longer printing total, when getting queries from stdin
* parserange.c: Handling a case where interpreting query as a contig, and
the result is NULL
* pair.c: Fixed computation of coverage for GFF output
2013-05-17 michafla
* Makefile.am, bootstrap.dna, bootstrap.gmaponly, bootstrap.gsnaptoo,
bootstrap.pmaptoo, bootstrap.three, configure.ac: Port from release: use
only 'config' for m4 macros
2013-05-09 twu
* dynprog.c, dynprog.h, splicetrie.c: In making splice junctions, checking
for junctions that go to the left of genomic position 0.
2013-05-08 twu
* samprint.c: Fixed SAM output for translocations, affected by changes to
hardclipping
2013-05-07 twu
* stage2.c: In find_shifted_canonical, checking for leftpos and rightpos
that exceed chrhigh
2013-05-06 twu
* uniqscan.c: Using new interface to Stage1_single_read
* gsnap.c, stage1hr.c, stage1hr.h, uniqscan.c: Passing
min_distantsplicing_end_matches and min_distantsplicing_identity to
Stage1hr_setup. Made GSNAP more sensitive to shorter reads, such as 50 bp.
* stage3.c: Removed line that is not reached.
2013-05-03 twu
* gmap.c: Added comment in --help output about chimeric alignments being an
exception to the filtering options
* stage3.c, stage3.h: Added functions Stage3_recompute_coverage and
Stage3_passes_filter
* gmap.c: Added options --min-trimmed-coverage and --min-identity
* stage2.c: Further check to make sure splice site does not occur in first 3
bp of segment in find_shifted_canonical
2013-05-02 twu
* stage2.c: In find_shifted_canonical, preventing discovery of splice sites
in first 3 bp of segment
* index.html: Made changes for version 2013-03-31 (v5)
* gmap.c: Checking value of Stage3_merge_chimera before creating chimera and
running merge_left_and_right_transloc.
* stage3.c, stage3.h: Stage3_merge_chimera checks for pairs on left and
right being non-NULL, and returns a bool
* pairpool.c: Fixed Pairpool_clip_bounded so it handles the case where the
list is NULL.
2013-04-30 twu
* get-genome.c: Added --exact flag
* stage3hr.c: Removed exclusions in Stage3pair_overlap that prevented tails
from being hard clipped.
* shortread.c: Fixed hard clipping of reads and quality strings to match new
definitions of hardclip_low and hardclip_high
2013-04-26 twu
* iit_get.c: Added --exact option
* chimera.c: Adding range of chimeric overlap in XT SAM field
* outbuffer.c: Using new interface to SAM_print
* stage3hr.h, stage3hr.c: Revised Stage3pair_overlap to return hardclip5,
hardclip3, and clipdir.
* samprint.h: Added clipdir as a parameter to SAM_print
* samprint.c: Handling clipping of overlaps when the low end of the first
read and the high end of the second read should be clipped. This is
indicated by clipdir of -1, and occurs when the insert length is so short
that the two reads have passed each other.
2013-04-12 twu
* pair.c, pair.h, samprint.c: Providing effective_chrnum of mate to
Pair_print_sam, in case GMAP alignment has a translocation for a mate
2013-04-11 twu
* gmap.c: Made changes for PMAP to compile
2013-04-10 twu
* gmap.c: Fixed memory issues in merging middle pieces
2013-04-09 twu
* Makefile.dna.am: Fixed name of file
2013-04-05 twu
* stage1hr.c, stage3hr.c, stage3hr.h: When terminal_threshold is set to a
high value, using trim_left_raw and trim_right_raw to exclude GMAP hits.
* samprint.c: Reverting to previous version, where chrpos of 0 indicates
nomapping
* oligoindex_hr.c: Fixed an issue where minus oligomers were extending past
the beginning
* stage3.c: Removed the unused variables nintrons, nnonintrons, intronlen,
and nonintronlen.
2013-04-04 twu
* gmap.c, outbuffer.c, stage3.h: Changed name from map_genes to map_ranges,
to avoid confusion
* iit_store.c: Handling empty files gracefully
* iit-write.c: If no intervals are found, then returning gracefully instead
of exiting
* outbuffer.c: If allow_chimeras_p is false and chimera is present, then
effective_maxpaths is 0.
* gmap.c: Added flag --no-chimeras
* stage3hr.c: Added debugging statement
* samprint.c: Using NULL hit instead of zero chrpos to indicate lack of
mapping
2013-04-03 twu
* dynprog.c, stage3.c: Fixed uninitialized variable for g_alt in
get_genomic_nt
* stage3hr.c, substring.c, substring.h: Updating nmismatches_bothdiff also
2013-04-02 twu
* stage3hr.c: Handling the new pairtype UNSPECIFIED.
* stage1hr.c: Allowing align_pair_with_gmap to change final_pairtype. Calls
Stage3pair_new with pairtype UNSPECIFIED.
* resulthr.c, resulthr.h: Added function Pairtype_string
2013-04-01 twu
* VERSION, config.site.rescomp.tst: Updated version number
* stage3.c: Added debugging statements
* splicetrie.c: Using computed miss_score. Both miss_score and
threshold_miss_score are negative.
* smooth.c, smooth.h: Added code for marking short exons, but not used
* pair.c, pair.h: Removed unused procedures
* dynprog.h, dynprog.c: Computing miss_score. Both miss_score and
threshold_miss_score are negative.
2013-03-30 twu
* stage3.c: Fixed bug in handling list for middle exon in
build_pairs_dualintrons. In traverse_single_gap, when forcep == false, not
adding pairs if finalscore < 0.
2013-03-28 twu
* stage3.c: Skipping pass 9 (distalmedial) and relying instead of
trim_noncanonical_exons. In pass 8, for extend_endings, not removing
indel gaps, and setting quit_on_gap_p true, to preserve indels at ends.
Not setting ambig_end_lengths in trim_noncanonical_exons, reserving it
instead for trim_novel_spliceends. Performing final extension of ends at
end of path_compute, and not at beginning of path_trim.
2013-03-27 twu
* stage1hr.c: Allowing redo of GMAP pairs based on inconsistent senses.
* gmap.c, gsnap.c: Increased maxpeelback from 11 to 20
* stage3.c: Added quit_on_gap_p parameter to peel_rightward and
peel_leftward. This allows smoothing procedures after traverse_single_gap
to merge gaps, and dynamic programming traversal of introns to add indels.
Calling remove_indel_gaps before dynamic programming solutions of introns.
Added computation of max_intron_score, and using it in
pick_cdna_direction to determine if sense is NULL.
* Makefile.dna.am: Added commands for splicing-score program
* stage3hr.c: Handling case in Stage3pair_new where expect_concordant_p is
false for a GMAP alignment
* stage1hr.h, gsnap.c, uniqscan.c: Added distances_observed_p to
Stage1hr_setup
* splicetrie_build.c: When observed distance is greater than
localsplicedist, storing observed distance
* stage1hr.c: In find_splicepairs_distant, if splice distance is within the
known maximum distance, then treating as a local splice, rather than a
distant splice.
* substring.c: Added code for Substring_intragenic_splice_p, but using
version in stage1hr.c instead
* stage3hr.c: In Stage3_determine_pairtype, using effective_chrnum instead
of chrnum, so new pairs with PAIRED_UNSPECIFIED from GMAP runs work.
* stage3hr.h: Added function Stage3pair_sense_consistent_p.
* stage3hr.c: Changed ambig_end_interval, used in penalty, to 8. In
Stage3end_pick_cdna_direction, returning a non-zero cdna_direction. Added
function Stage3pair_sense_consistentp.
* stage3.c: Allowing ambig end only if medial prob > 0.95. Moved
trim_novel_spliceends earlier, so it is effective. In
pick_cdna_direction, making a final use of alignment score. Checking for
divide by zero in computing defect_rate.
* stage1hr.c: Using matches over entire read to determine whether to perform
GMAP. Performing halfmapping of GMAP against terminals only if nconcordant
is 0. Added a redo step on align_pair_with_gmap if the senses are
inconsistent.
* inbuffer.c, shortread.c, shortread.h: Allowing FASTA input files to be
paired
2013-03-26 twu
* stage1hr.c: For GMAP alignments with long trimmed ends, comparing
terminal_threshold against user_maxlevel in deciding whether to drop them.
2013-03-25 twu
* stage3hr.c, stage1hr.c: Allowing GMAP improvement to be run on paired
results
* samprint.c: Adding PG:Z:T output for terminal alignments
* stage3hr.c: Switching back to old GMAP filter, where nmatches_posttrim
must exceed querylength/2.
* stage1hr.c: Applying terminal_threshold test to GMAP alignments
2013-03-21 twu
* dynprog.c: Allowing only two mismatches in a distant splice
* gmap.c, gsnap.c: Made results of --version consistent
* stage1hr.c: Applying match length criterion to terminal GMAP alignments
* stage1hr.c: Incrementing nconcordant only for high-quality GMAP pairsearch
results, where nmatches is high enough, instead of using GMAP score.
2013-03-14 twu
* splicetrie_build.c: Fixed type
* gsnap.c, shortread.c, shortread.h: Implemented --force-single-end flag
* index.html: Added comment about --force-single-end flag
* README: Added description of stranded and nonstranded cmet and atoi modes.
* README: Added information about --force-single-end flag and multiple FASTQ
files on the command line.
* configure.ac: Removed --with-samtools flag
2013-03-13 twu
* VERSION, config.site.rescomp.tst, index.html: Updated version number
* Makefile.dna.am, gmap.c, oligoindex_hr.c, stage3.c, stage3.h: Made changes
so PMAP would compile
* index.html: Made changes for release 2013-03-05
* splicetrie_build.c: Added code for get_exons, which may be needed for
getting splice sites from a genes IIT file
* genome-write.h: Changed UINT4 to Genomecomp_T
* types.h: Added Genomecomp_T
* stage3.c: Added minimum length for running stage2 in a dual break
* stage2.c: Improved debugging statements
2013-03-12 twu
* genome_hr.c: Returning correct reverse chrpos for
prev_dinucleotide_position_rev. Printing universal coordinates on blocks
for debugging. Improved debugging statements for finding dinucleotides.
* stage3.c: Fixed pass 7, so path is expected in loop, and then is converted
to pairs at the end.
* pair.c: In Pair_start_bound and Pair_end_bound, skipping pairs with
querypos < 0, which represent gaps.
2013-03-06 twu
* indexdb.c: Removed code for writing word-by-word
* gsnap.c: Fixed bug where --adapter-strip=off had no effect.
2013-03-05 twu
* fa_coords.pl.in, gmap_build.pl.in, gmap_process.pl.in: Allowing FASTA file
names and gzipped file names to contain spaces
* gsnap.c: Added entry for distant-splice-identity
* VERSION, config.site.rescomp.tst: Updated version number
* stage3hr.c: Restored check on bad GMAP alignments
* stage1hr.c: Taking both long and short terminals, with short terminals
based on half the number of mismatches. Always setting long and short
terminal pos based on one-third of the read length.
* gmap.c: Fixed issues with computation of chimeric middle pieces, including
memory freeing
2013-03-04 twu
* dynprog.c: Fixed bug with uninitialized variable introntype in
Dynprog_genome_gap
2013-02-28 twu
* stage1hr.c: No longer using Stage3_short_alignment_p to rule out GMAP hits
* gsnap.c, stage3hr.c, stage3hr.h: Using min-nconsecutive criterion instead
of min-coverage criterion for keeping GMAP hits.
2013-02-26 twu
* stage1hr.c: Introduced idea of long terminals and short terminals, with
terminal length = querylength/3. Allowing GMAP pairsearch to set
nconcordant.
* stage1hr.c: Reverted to not recording GMAP pairsearch successes as
nconcordant
* stage3hr.c: No longer calling Stage3_bad_stretch_p on GMAP alignment. Not
allowing end indels to set trims for alignment comparisons. Penalizing
indels in alignment comparisons.
* stage3.c, stage3.h: Added procedure Stage3_good_part
* stage1hr.c: Recording GMAP pairsearch successes as nconcordant
2013-02-25 twu
* index.html: Updated to reflect new version number
* VERSION: Updated version number
* gsnap.c: Increased values of max_gmap_pairsearch and max_gmap_terminals
from 10 and 5, respectively, to 50 and 50
* stage1hr.c: Sorting terminals by matches before running GMAP against a
limited number of them
* uniqscan.c: Printing given sequence in addition to uniqueness result
* stage3.c: Reduced requirement for GMAP from querylength/2 to querylength/3
* gsnap.c: Reduced gmap_min_coverage from 0.50 to 0.33
* stage1hr.c: Counting nhits during subs and indels, and exiting when the
value exceeds maxpaths_search
2013-02-22 twu
* gff3_introns.pl.in, gff3_splicesites.pl.in: Disallowing negative intron
lengths
* gff3_genes.pl.in, gff3_introns.pl.in: Skipping blank lines.
* gff3_splicesites.pl.in: Fixed bug in warning message. Skipping blank
lines.
* stage3hr.c: Setting value of guided_insertlength for exact hits
* gsnap.c, outbuffer.c, outbuffer.h, samprint.c, samprint.h, stage1hr.c,
stage1hr.h: Created separate variables maxpaths_search and maxpaths_report
* shortread.c: Fixed potential bug if sequence is longer than sequence_length
* uniqscan.c: Computing full sequence first, then iterating from start until
we reach a unique alignment.
2013-02-21 twu
* samprint.c: Fixed assertion for circularpos to handle trimming at ends
* substring.c: Handling trimmed regions in Substring_circularpos
2013-02-15 twu
* VERSION, config.site.rescomp.tst, index.html: Updated version number
* stage1hr.c: Changed terminal length for one-half to one-third
* stage1hr.c: Fixed bug in the terminal position used for comparing
mismatches in find_terminals
2013-02-14 twu
* stage3hr.c: Preventing distant splices from being considered as circular
aliases and removed
* stage1hr.c: Fixed typo in comment
2013-02-12 twu
* stage1hr.c: Increased MAXCHIMERAPATHS from 3 to 100
2013-02-07 twu
* stage1hr.c, stage3hr.c, stage3hr.h: Added Stage3end_eval_and_sort_guided,
to sort one end of unpaired alignments when the other end has a unique
alignment.
* get-genome.c: Fixed bug in printing accession for sequence, based on
coordinates, for minus strand
2013-02-06 twu
* uniqscan.c: Running scan for entire sequence length. Reduced
terminal-threshold to 5.
2013-02-05 twu
* VERSION, index.html: Updated version number
* genome-write.c, genome_hr.c, indexdb.c, outbuffer.c, stage1hr.c, stage3.c,
substring.c: Fixed static analysis errors found by Nathan Weeks using the
Clang 3.1 compiler
2013-01-24 twu
* VERSION, config.site.rescomp.tst, archive.html, index.html: Updated
version number
* iit_get.c: Allowing -N flag to be used with tally IIT files
* outbuffer.c: Fixed bug where SAM headers were duplicated in .nomapping file
2013-01-17 twu
* pair.c: Added PG:Z:M flag for alignments using GMAP method within GSNAP
* stage3.c: Made further changes to peel_leftward and peel_rightward to
prevent an indel from being on top of pairs or path
* Makefile.dna.am: Add bzip2 files
* stage3.c: Changed peel_leftward and peel_rightward so they do not leave a
gap or indel at the top of the pairs or path.
2013-01-16 twu
* iit_store.c: Added debugging statement
* snpindex.c: Added information for warning statement from check_acgt
* configure.ac: Added configure commands for bzip2 library
* README: Added comment about bunzip2
* Makefile.gsnaptoo.am, bzip2.c, bzip2.h, gsnap.c, inbuffer.c, inbuffer.h,
sequence.c, sequence.h, shortread.c, shortread.h: Added procedures for
handling bunzip2
* snpindex.c: Fixed issue where warning messages referred to wrong labels
* stage1hr.c: Added debugging information about hits used as anchors for GMAP
* stage3hr.c: Restored checks on chromosomal bounds for Stage3_new_gmap, and
returning NULL when bounds are exceeded
2012-12-19 twu
* VERSION, config.site.rescomp.tst, index.html: Updated version number
* stage3.c: In assign_gap_types, handling case where cdna_direction == 0
* gmap.c: Calling Stage3_guess_cdna_direction at appropriate places after
merge_left_and_right_readthrough. Using maxextension instead of
max_intronlength for Stage 1 computations.
* stage3.h: Added function Stage3_guess_cdna_direction
* stage3.c: Using new interface to Pairpool_push_gapalign. Not performing
guess of cdna_direction in Stage3_merge_local_splice.
* stage3hr.c, stage3hr.h: Stage3_pair_up_concordant now limits number of
samechr results
* stage1hr.c: Using new interface to Stage3_pair_up_concordant, which now
takes nsamechr
* stage1.c, stage1.h: Using extensionlen instead of max_intronlength.
Providing variables in Stage1_setup.
* pairpool.c, pairpool.h: Handling introntype field in Pair_T object
* pairdef.h: Added introntype as field for Pair_T object
* pair.c, pair.h: Added function Pair_fix_cdna_direction_array
* outbuffer.c, outbuffer.h: Adding commas to output of memory usage
* mem.c, mem.h: Added reporting of peak memory usage
* intron.c: For cdna_direction of 0, returning introntype rather than
NONINTRON
2012-12-18 twu
* stage3.c: In Stage3_merge_local_splice, if intronlength is small or
negative, calling Stage3_merge_local_single.
* mem.c: Increased size of hash table
* gmap.c: Putting results of middle search into middlepieces, and iterating
through that list. Searching for local readthroughs first, but linking
local middle piece to both ends in any case.
2012-12-14 twu
* gsnap.c: Fixed --help documentation to show default for -a is off
2012-12-11 twu
* VERSION, index.html: Updated version number
* stage3.c: In Stage3_merge_local_splice, not advancing querypos for a
deletion
* trunk, VERSION, src, gmap.c, pair.c, result.c, stage3.c, stage3.h, util:
Merged changes from branches/2012-11-11-middle-piece to allow for
searching of middle pieces in GMAP
* uinttable.c, table.c: Fixed memory leak
* samprint.c: Fixed issue with CIGAR string and hard-clipping
* gmap.c: Not finding chimeras if --nosplicing is requested
2012-12-10 twu
* indexdb.c: In writing gammas, using a buffer for offsetcomp, so we do not
write words one at a time, which is slow on some filesystems
* samprint.c: In compute_cigar, handling D and N types based on querypos
* samprint.c: Rewrote compute_cigar
2012-12-07 twu
* gmap.c: Added separate procedure evaluate_query for use before all calls
to Stage1_compute. Turned on check for repetitivep.
* VERSION, config.site.rescomp.tst, index.html: Updated version number
* archive.html: Added entry for 2012-07-20.v2
* gmap_build.pl.in: Added -e or --nmessages flag
* samprint.c: Fixed problem with CIGAR string and clip-overlap function
* shortread.c: No longer assuming that slashes in the input FASTQ file are
present consistently
2012-12-06 twu
* stage1hr.c: Added debugging messages
* iit_get.c: Added printing of flanking results in stdin mode. Added -C
flag to force interpretation of queries as coordinates.
* iit-read.c: Fixed coord_search_low and coord_search_high to prevent it
from going below given chromosome
* gsnap.c: Revised --help message to give correct formula for fast index size
* gmapindex.c, genome-write.c, genome-write.h: Added nmessages as a parameter
* gmap.c: Using new interface to Stage3_merge_chimera
* get-genome.c: Added --signed flag
* stage3.h: Doing full trimming of inside ends of chimeras
* stage3.c: Doing full trimming of inside ends of chimeras. Added commented
code from revised 2012-07-20 version to prevent chimera extensions from
crossing chromosomal coordinates.
* pair.c: Handling case for MD string in reverse query direction of I-to-D
and N-to-D transitions
2012-11-28 twu
* get-genome.c: Fixed sign to be 0 when coordend == coordstart
2012-11-27 twu
* VERSION, index.html: Updated version number
* gmap_process.pl.in: Turning off handling of pipes in FASTA headers
* iit_get.c: Using new interface for IIT_get_flanking_typed
* iit-read.c, iit-read.h: Added procedures for getting signed results
* get-genome.c: Added --signed flag
* samprint.c: Fixed bugs in cigar strings for hard clipping and
translocations
2012-11-21 twu
* VERSION, index.html: Updated version number
* gmap.c: Added a limit to iterations of chimera search to prevent an
infinite loop
2012-11-19 twu
* VERSION, index.html: Updated version number
* gmap.c: On chimera search, calling original found subsequence, in case
stage 1 was led astray by the ends
* stage2.c: Changed debugging call
* dynprog.c: Not aborting from Dynprog_microexon_int if cdna_direction is 0,
which can happen with chimeric joins. Returning NULL instead.
* stage2.c: Moved some debugging statements around
* stage1.c: Added debugging statements for printing final results
* gregion.c: Increasing MAX_GENOMICLENGTH from 1 million bp to 2 million bp
* stage3.c: Using make_pairarrays instead of make_pairarrays_chimera in
Stage3_merge_local_splice. Calling this even for dual break.
* stage2.c: Made slight differences in coordinates for Oligoindex_hr_tally
on minus strand
* oligoindex_hr.c: Added debugging statements
* oligoindex.c, oligoindex.h: Added debugging procedure
* pair.c: Changed all types of exon_genome positions from int to Genomicpos_T
2012-11-16 twu
* stage3.c: Fixed bug in computing chrstart and chrend for
Stage2_compute_one from traverse_dual_break. Making all splice_genomepos
consistently as chromosomal coordinates.
* stage3hr.c: Handling case in Stage3_cdna_direction where stage3 is NULL
* pair.c, pair.h, samprint.c: Fixed CIGAR strings when clip-overlap hits an
insertion
2012-11-15 twu
* pairpool.c: Added assert.h
* trunk, config.site.rescomp.tst, src, boyer-moore.c, boyer-moore.h, diag.c,
diag.h, diagdef.h, dynprog.c, dynprog.h, genome.c, genome_hr.c,
genome_hr.h, gmap.c, gregion.c, gregion.h, iit-read.c, iit_dump.c,
oligoindex.c, oligoindex.h, oligoindex_hr.c, oligoindex_hr.h, pair.c,
pair.h, pairpool.c, splicetrie.c, splicetrie.h, splicetrie_build.c,
stage1hr.c, stage2.c, stage2.h, stage3.c, stage3.h, util: Merged revisions
78884 to 79299 from branches/2012-11-11-middle-piece to use chromosomal
coordinates, chroffset, and chrhigh
* README, VERSION, index.html: Updated version number
* genome.h: Changed interface to Genome_fill_buffer_simple_alt
* splicetrie_build.c: Using new interface to Genome_fill_buffer_simple_alt
* genome.c: Fixed bugs in substituting SNPs into string
(uncompress_mmap_snps_subst)
2012-11-09 twu
* VERSION, index.html: Updated version number
* stage3.c, stage3.h: Implemented Stage3_set_genomicend
* stage2.c: Remove MAX_GENOMICLENGTH restriction
* gregion.c: Impose MAX_GENOMICLENGTH on gregion
* gmap.c: Revising genomicend after pieces are merged
* indexdb.h: Fixed type conflict involving Oligospace_T with definition in
indexdb.c
* stage2.c: Raised MAX_GENOMICLENGTH from 1 million to 10 million bp
* stage3.c: Fixed identification of insertion in Stage3_mergeable
* dynprog.c: Added debugging statement
* gregion.c: Fixed bug where gregion could extend beyond chrhigh
2012-11-07 twu
* stage3hr.c: Using fact that circularp[0] == false to handle translocations
* iit-read.c: Fixed IIT_circularp for lookup of 1-based chrnum values
* VERSION, index.html, config.site.rescomp.prd: Updated version number
* gmap.c, gsnap.c, uniqscan.c: Using new interface to Stage3hr_setup
* stage3hr.c: Checking hit->alias before checking hit->plusp
* stage3hr.c, stage3hr.h: Using information about circularp in
compute_circularpos and in computing hit->alias.
* iit-read.c, iit-read.h: Implemented IIT_circularp function
* stage1hr.c: Added checks to make sure greedy mapping positions do not
result in a genomic segment with negative length
2012-11-06 twu
* VERSION, index.html: Updated version number
* access.c: Changed printf commands for off_t to use %ju
* genome_hr.c: Fixed computation of wildcard SNP positions to exclude
positions where reference allele is 'N'.
* configure.ac, access.c: Handling compiler warning messages when sizeof
off_t is not 8
* dynprog.c, dynprog.h, gsnap.c, uniqscan.c: Removed genome and genomealt
from Dynprog_setup
* stage1.c: No longer filtering for support anymore, since it leads to poor
behavior
* gmap.c: Fixed a bug where a NULL genomealt was being passed to
Chimera_find_exonexon
2012-10-31 twu
* coords1.test.ok: Changed to match current output of program, which allows
for circular chromosomes
* Makefile.gsnaptoo.am: Made files match those of Makefile.dna.am
* archive.html, index.html: Made changes for new version
* README: Added discussion of --force-xs-flag, the XW and XV fields, and
circular chromosomes.
* VERSION, config.site.rescomp.tst: Updated version number
* stage2.c: Prefer shorter intron in all cases
* gsnap.c, stage1hr.c, stage1hr.h, uniqscan.c: Implemented variable for
splice distancs at novel ends
* stage2.c: For first intron, favoring shorter intron lengths in cases of
ties
* stage1hr.c: Refinements made to computing GMAP genomic region, by checking
if a fallback mappingstart or end is available. Performing n best GMAP
alignments for pairsearch. Going back to sorting terminals by matches in
GMAP terminals.
* stage3hr.c: Modifications made to score_eventrim for GMAP, by ignoring
small trims and restoring indel penalty. Fixed bug in computing
non_terminal3_p.
* stage3.c: Calculating ambig end lengths better in trimming noncanonical
ends. Not doing trim of novel splice end if ambig end length already set.
* samprint.c, samprint.h: Implemented --force-xs-direction flag
2012-10-30 twu
* dynprog.c: Limiting indels to 3 bp around splice sites
* stage2.c: Edited comment on debugging usage
* stage3.c: In trimming non-canonical end exons, when end exon is actually
canonical but with poor probability, we trim the exon but set
ambig_end_length, so the alignment can compete with other alignments.
* stage3.c: Made bad stretch algorithm more tolerant
2012-10-29 twu
* genome_hr.c: Fixed compile for GMAP
* pair.h: Added code to handle force-xs-direction. Using
mate_cdna_direction when necessary.
* pair.c: Added code to handle force-xs-direction. Using
mate_cdna_direction when necessary. Counting N's as mismatches for
trimming purposes.
* genome_hr.c, genome_hr.h, substring.c: Counting N's as mismatches for the
purposes of end trimming
2012-10-28 twu
* substring.c, substring.h: Added function Substring_chimera_sensedir
* outbuffer.c, outbuffer.h: Removed snps_p as a parameter
* gmap.c, gsnap.c, uniqscan.c: Added --force-xs-dir flag
2012-10-27 twu
* stage3.c, stage3.h: Removed snps_p
2012-10-26 twu
* dynprog.c: Fixed debugging statements
* gmap.c, gsnap.c, uniqscan.c: Using new interface to Stage3_setup
* dynprog.c, stage3.c, stage3.h, intron.c: Commented out constants that
should not be used by PMAP
* samprint.c, pair.c, pair.h: Printing XW and XV only when SNP-tolerant
alignment is used
* uniqscan.c: Using new interfaces to procedures.
* gmap.c, gsnap.c: Added flag --md-lowercase-snp. Using new interfaces to
procedures.
* dynprog.c: Using jump_late_p in cdna_gap and genome_gap, which can
sometimes give rise to indels
* stage1hr.c: Removed nsalvage from debugging statements
* stage3.c: Changed Stage3_bad_stretch_p to count each indel gap as one
mismatch, regardless of length
* pair.c: Removed debugging statements
2012-10-25 twu
* stage3.c, stage3.h: Using new interface to Intron_type. Obtaining
alternate genomic segments in some procedures.
* stage2.c, stage2.h: Obtaining alternate allele and putting into Pair_T
object when SNP-tolerant alignment is used
* splicetrie.c, splicetrie.h: Using new interfaces to Dynprog splicejunction
procedures
* outbuffer.c, outbuffer.h: Added snps_p field
* iit-read.c, iit-read.h: Added function IIT_interval_type
* genome_hr.c: In finding dinucleotides, using alternate genome
* genome.c, genome.h: In function used to create splicejunctions, returning
alternate genomic segment
* chimera.c, chimera.h, intron.c, intron.h, maxent_hr.c, maxent_hr.h,
dynprog.c: Using alternate alleles in evaluating splice sites
* samprint.c, samprint.h, pair.c, pair.h: Printing lowercase MD for known
SNP variants
* access.c, access.h: Added function Access_file_equal
* dynprog.h: Taking alternate genome sequence in splicejunction procedures
* dynprog.c: Using alternate allele in computing dynamic programming matrices
* snpindex.c: Made snpindex with with circular chromosomes
* snpindex.c: Checking if given IIT file and installed IIT file are the same
2012-10-23 twu
* trunk, VERSION, config.site.rescomp.tst, src, boyer-moore.c,
boyer-moore.h, diag.c, diag.h, dynprog.c, dynprog.h, genome.c, gmap.c,
oligoindex_hr.c, oligoindex_hr.h, stage1hr.c, stage2.c, stage2.h,
stage3.c, stage3.h, util: Merged revisions 77378 to 77446 from
branches/2012-10-21-no-genomicseg to remove genomicseg parameters
* stage3.c: Fixed uninitialized variables knownsplice5p, knownsplice3p, and
intronlength.
2012-10-21 twu
* gsnap.c: Using new interface to Genome_setup and SAM_setup.
* gmap.c: Using new interface to Genome_setup. Using -V and -v flags as in
GSNAP.
* stage1.c: Fixed matchsize to be double index1part when index1part < 12
* dynprog.c, genome.c, pair.c, pair.h, pairdef.h, pairpool.c, pairpool.h,
stage2.c, stage3.c: Added genomealt to Pair_T object and assigning this
value in Pairpool_push routines. Creating GENOMEALT_DEFERRED value until
we resolve all occurrences of genomicseg.
* boyer-moore.c, dynprog.c, dynprog.h, genome.c, genome.h, maxent_hr.c,
pair.c, stage2.c, stage3.c: Genome_get_char_blocks and get_genomic_nt now
return alternate allele
* samprint.h: Added snps_iit to SAM_setup
* samprint.c: Computing mismatches for both refdiff and bothdiff, and
printing XW and XV fields if running in SNP-tolerant mode
2012-10-20 twu
* gmap.c: Clarified that chimera search can be turned off by setting value
to 0
* indexdb_hr.c: Fixed debugging statement
* snpindex.c: Fixed program to work with sampling intervals other than 3 bp.
Performing file copy of IIT file to maps subdirectory.
* access.c, access.h: Added function Access_file_copy
* samprint.c: Fixed bug where circularpos was called before results arrays
were retrieved
2012-10-19 twu
* stage3hr.c: Using hitpair score_eventrim in Stage3pair_optimal_score,
instead of individual score5 and score3.
* trunk, VERSION, config.site.rescomp.tst, src, Makefile.dna.am, chrom.c,
chrom.h, dynprog.c, gamma-speed-test.c, gdiag.c, genome-write.c,
genome-write.h, genome.c, genome.h, gmap.c, gmapindex.c, gregion.c,
gregion.h, gsnap.c, gsnap_tally.c, iit-read.c, iit-read.h, iit_dump.c,
iit_store.c, indexdb.c, outbuffer.c, pair.c, pair.h, samprint.c,
samprint.h, splicetrie_build.c, stage1.c, stage1.h, stage1hr.c, stage3.c,
stage3.h, stage3hr.c, stage3hr.h, substring.c, substring.h, uniqscan.c,
util, fa_coords.pl.in, gmap_build.pl.in, gmap_process.pl.in: Merged
revisions 76693 to 77345 from branches/2012-10-15-circular to handle
circular chromosomes
* outbuffer.c: Fixed write mode for appending to a file
2012-10-18 twu
* dynprog.c: Allowing get_genomic_nt to retrieve negative coordinates
2012-10-15 twu
* gmap.c, gsnap.c, outbuffer.c, outbuffer.h: Added --append option to append
results to output files
* memory-check.pl: Added -9 flag to print successive maximum memory usage
2012-10-12 twu
* types.h: Added OLIGOSPACE_NOT_LONG
* stage3.c: Handling PMAP case for final guess at cdna_direction
* stage2.c: Not doing stage 2 if genomiclength > MAX_GENOMICLENGTH
* indexdb.c: Fixed issues with %lu using OLIGOSPACE_NOT_LONG
* sequence.c, sequence.h: Sequence_read_unlimited returns nextchar
* inbuffer.c, inbuffer.h, outbuffer.c, outbuffer.h: Handling multiple pairs
of sequences in --pairalign mode
* gmap.c, gsnap.c: Requiring -m to be 0.10 or less when it is a float
* stage3hr.c, stage3hr.h: In pair_up_concordant, treating hits and terminals
separately. When all results are double terminals, treating as if it were
final.
* stage1hr.c: Added variable gmap_rerun_p. Fixed memory leak. Removed use
of segment->usedp. Changed some uses of starti and endi. GMAP result
must be significantly better than original hit (reducing misses by half).
GMAP pairsearch run only if hit list is small enough. Keeping hits and
terminals in separate lists.
2012-10-10 twu
* substring.c: Fixed bug in not copying trim_left_splicep and
trim_right_splicep
2012-10-01 twu
* stage3.c: Added ability for Stage3_merge_local_splice to make a deletion
instead of an intron
* stage3.c: Made code for coordinate change in Stage3_merge_local_splice
match that of Stage3_merge_local_single
2012-09-26 twu
* gmap.c, gsnap.c, pair.c, pair.h, stage3.c, stage3.h, uniqscan.c: Added
--require-splicedir flag and code for guessing at cdna direction.
2012-09-24 twu
* gmap.c, gsnap.c, stage3.c, stage3.h, stage3hr.c, stage3hr.h, uniqscan.c:
Removed --pairexpect and --pairdev flags. Removed expected_pairlength and
pairlength_deviation variables.
* gsnap.c, uniqscan.c: Using new interface to setup procedures
* stage1hr.c: Turned off debugging
* substring.c, substring.h: Added check for splice site at trimmed position,
to be used in even-trimming
* stage3hr.c, stage3hr.h: Added Stage3end_trim_left and Stage3end_trim_right
commands. In Stage3pair_optimal_score, eliminating hits only if both hit5
and hit3 are worse than optimal in pre-final stages. In final stages,
using the sum of hit5 and hit3. Eliminated absdifflength field.
* stage1hr.c: For determining GMAP bounds, computing both close
mappingstart/mappingend (at distal end) and middle
mappingstart/mappingend, and using close if available. If close
mappingstart/mappingend does not give a good alignment at the end, then
trying full pairmax plus shortsplicedist.
2012-09-21 twu
* stage1hr.c: Consolidated calls to Stage2_compute and Stage3_compute into a
run_gmap procedure. Computing close_genomicstart and close_genomicend
values, but using full pairmax for now.
2012-09-20 twu
* stage1hr.c: In computing mappingstart and mappingend for GMAP region,
evaluating each diagonal for shortsplicedist vs querylength extension
2012-09-19 twu
* stage3hr.c: Computing trim_left_splicep and trim_right_splicep for GMAP
alignments. Using trim_left_splicep and trim_right_splicep to determine
trim amount.
* stage1hr.c: Taking lowprob splice only if both ends have minimum support
(set at 20) and no subs or indels were found previously
2012-09-18 twu
* stage1hr.c: Requiring one splice site to be sufficient for lowprob
splices. Finding best splice first by nmismatches, and then by prob.
* gsnap.c, stage1hr.c, stage1hr.h, uniqscan.c: Using min_intronlength to
prevent deletions from showing up as lowprob splices
2012-09-17 twu
* gmap.c, stage3.c, stage3.h: Implemented procedures to favor paths first by
best goodness score, and then by shorter genomiclength
2012-09-14 twu
* stage3.c: Compute best_absmq_score, even if it is negative
* dynprog.c, pair.c, pair.h, stage3.c: Not protecting distal indels after
known splice sites
* gsnap.c: For cmet-stranded and cmet-nonstranded mode, make
--terminal-threshold=100 the default
* stage3.c: Allowing for merging when there is excess query sequence at the
breakpoint
2012-09-13 twu
* oligoindex_hr.c: For minus strand, not subtracting 1 from left if left is
0, which caused the entire sequence to be skipped
* datadir.c: Including comment about -F flag for cmetindex and atoiindex
* snpindex.c: Allowing snpindex to work when there is no gammaptrs file
2012-09-12 twu
* stage3hr.c: Changed a free command from FREE to FREE_OUT
2012-08-10 twu
* oligoindex.c: Fixed assertion for PMAP to use 3*querylength
2012-07-20 twu
* pair.c: Restored capitalized Coverage for standard output
* index.html: Added note about lower-case coverage and identity tags in GFF3
* pair.c: Changed Coverage and Identity in GFF3 output to be lower case
* Makefile.three.am: Fixed Makefile instructions
* VERSION, config.site.rescomp.tst, index.html: Updated version number
* gmap_build.pl.in: Fixed use of short and long options
2012-07-18 twu
* stage3.c: Merging pairs list of left and right parts for local merge, so
that the resulting Stage3_T object can be used iteratively to find a
chimera.
* gmap.c: Added DEBUG2A macro to show details of chimera detection
2012-07-17 twu
* gmap.c, pair.c, pair.h, stage3.c, stage3.h: Implemented iterative
algorithm for finding chimeras
2012-07-12 twu
* VERSION, config.site.rescomp.tst, index.html: Updated version number
* pair.h, pair.c, samprint.c: Fixed problem in SAM output with an unpaired
alignment with one end being a GMAP alignment
2012-07-09 twu
* gff3_introns.pl.in, gff3_splicesites.pl.in: Added -Q flag to suppress
messages to stderr
* gff3_genes.pl.in, gff3_introns.pl.in, gff3_splicesites.pl.in: Checking for
transcript lines as well as mRNA lines
2012-07-06 twu
* README: Added instructions for using vcf_iit
* configure.ac, Makefile.am, vcf_iit.pl.in: Added vcf_iit program for
processing VCF files
2012-07-05 twu
* dbsnp_iit.pl.in: Added exception types InconsistentAlleles and
SingleAlleleFreq
* dbsnp_iit.pl.in: Printing exception handling rules to stderr even if
exception file not given
2012-07-03 twu
* VERSION, index.html: Updated version number
* parserange.c: No longer checking for labels that match a contig
* stage3.c: In extend_ending5 and extend_ending3, checking for a gap between
gappairs and the rest of the read
2012-06-27 twu
* gmap.c: Fixed bug which prevented -1 flag from working
* stage1.c: Made stage 1 work for PMAP
* stage3hr.c: Turned off debugging
* stage3hr.c: For hitpair_equiv_cmp, not looking at score or nmatches anymore
2012-06-25 twu
* VERSION, index.html: Updated version number
* splicetrie_build.c: Fixed contents of splicesite_i in splicestrings, after
sorting of splice sites
2012-06-20 twu
* uniqscan.c: Using new interface to Stage3hr_setup
* VERSION, index.html: Updated version number
* index.html, gsnap.c, stage3hr.c, stage3hr.h: Added --gmap-min-coverage flag
* stage2.c: Changed find_shifted_canonical to go directly to Genome_hr
procedures instead of allocating memory and saving past results
* gsnap.c, stage1hr.c: Added indel_knownsplice as option to --gmap-mode
2012-06-19 twu
* gmap_build.pl.in: Added -M flag for handling NCBI MD files
2012-06-18 twu
* stage3hr.c: Cleaned up optimal_score commands for removing terminal
alignments in final stage. Using trim_terminals_p argument in calling
compute_mapq functions.
* pair.c, pair.h, substring.c, substring.h: Added trim_terminals_p argument
to compute_mapq functions
2012-06-15 twu
* index.html, pair.c, substring.c: Reverted back to old behavior for
computing MAPQ in entire read, but trimming off ends of type TERM
* index.html: Added comments for changes
* gmap_build.pl.in: Using long flag names
* pair.c, pair.h, stage3hr.c, substring.c: Computing MAPQ scores over
trim-kept region, instead of entire substring
* VERSION, config.site.rescomp.tst, index.html: Updated version number
* stage3.c: In trim_noncanonical_end_exons, keeping known introns only if
nmismatches == 0
2012-06-14 twu
* stage3hr.c: Allowing Stage3end_remove_overlaps to work with translocations
* stage1hr.c: Allowing for multiple translocations to be reported. Not
updating nconcordant for GMAP pair revisions
* outbuffer.c, resulthr.c, samprint.c: Allowing for multiple single-end and
unpaired translocations to be printed
* resulthr.c, samprint.c: Allowing for multiple paired translocations to be
printed
* stage3hr.c, stage3hr.h: Changed Stage3pair_remove_overlaps and
hitpair_sort_cmp, so they work on translocations
* stage3hr.c: Allowing multiple concordant translocations to be printed
* stage1hr.c: Not skipping GMAP on terminal alignments. Performing
align_concordant_with_gmap on with_terminal list.
* resulthr.c, resulthr.h, pair.c: Changed pairtype TRANSLOCATION to
CONCORDANT_TRANSLOCATIONS
* stage1hr.c, stage3hr.c, stage3hr.h: Added a category of hitpairs called
with_terminal, with lower priority than samechr or conc_transloc
* stage2.c: Increased value of SHIFT_EXTRA to fix a fatal bug
2012-06-13 twu
* stage3.c: Counting indels and short gaps as mismatches in
Stage3_bad_stretch_p
2012-06-12 twu
* index.html: Added comment about improved detection of translocations
within read ends
* stage3hr.c: Computing substring_for_concordance for both translocations
(chrnum == 0) and intrachromosomal rearrangements (shortdistancep == false)
* stage1hr.c: Checking for bad stretch in GMAP hits, as soon as we call
Stage3end_new_gmap
* index.html: Updated version number
2012-06-11 twu
* trunk, VERSION: Updated version
* Makefile.gsnaptoo.am: Removed extra includes of cmet and atoi files for
GMAP
* oligoindex_hr.c: Getting the final oligomers when computing mappings
* stage3.c: Fixed computation of mappingstart and mappingend for traversing
dual breaks on crick strand
* stage1.c: Restoring old scan ends algorithm
* stage1hr.c: Removed unused debugging macro
* stage3.c: In trimming novel splice ends, allowing perfect matches to
extend into intron
* psl_introns.pl.in: Added print command
* Makefile.gsnaptoo.am: Added file dependencies
* stage3.c: Using QUERYEND_NOGAPS for pass 9a and pass 9b for GSNAP, so
trimming will work. Fixed computation of mappingstart and mappingend in
traverse_dual_break.
2012-06-06 twu
* Makefile.dna.am, stage3hr.c: Adding an absolute sufficient minlength for a
terminal, besides using querylength
* VERSION, config.site.rescomp.tst, index.html: Updated version number
* src: Committing property changes from last merge
* gmap.c: Increased max_nalignments from 3 to 10
* stage1hr.c: Fixed bug in find_terminals, where querypos3 was used to
compute start_endtype and querypos5 was used to compute end_endtype,
instead of querypos5 and querypos3, respectively.
* stage3hr.c: Allowing both ends to be of type TERM in a terminal, and
checking for mismatches only between the trimmed ends. Requiring that
final length is querylength/3.
* dynprog.c: Dropped mismatch scores, which helps GMAP extend ends and find
chimeras.
* stage3.c: Changed endalign for pass 9a and 9b from QUERYEND_NOGAPS to
BEST_LOCAL. This fixes an issue in GMAP where ends are truncated, and
chimeras not found, as introduced in revision 64732 on 2012-05-22.
* stage2.c: Fixed bug in condition on suboptimal stage 2 paths, where we
were requiring fewer than max_nalignments results plus the score ==
bestscore. The condition should have been a disjunction, not a
conjunction.
* stage1hr.c: Skipping computation of GMAP on single-end terminal
alignments, since that is a duplication of effort
* stage1hr.c, stage3hr.c, stage3hr.h, substring.c, substring.h: Restored
assignment of endtypes to terminal alignments. Using them again determine
whether to extend terminals left or right for GMAP alignments.
* stage1hr.c: Integrated two criteria for finding terminals: old method
based on counting mismatches from ends, and new method based on width of
(querypos3 - querypos5).
2012-06-05 twu
* stage3.c: Fixed bug in local chimera alignment with uninitialized value
for genomicseg_ptr
2012-06-01 twu
* genome.c: Added assertions to Genome_get_char and Genome_get_char_blocks
to check for negative coordinates
* dynprog.c: Removed debugging statements
* stage3.c, dynprog.c: Fixed get_genomic_nt to check for both genomicpos
between 0 and genomiclength, and pos between chroffset and chrhigh
* VERSION, index.html, config.site.rescomp.tst: Updated version number
* dynprog.c, stage3.c: Checking only for genomepos < 0 in get_genomic_nt,
not for chrpos between chroffset and chrhigh, which may need further
debugging for chimeras.
* dynprog.c: Checking for genomepos < 0 in get_genomic_nt.
* stage3.c: For Stage3_extend_left and Stage3_extend_right, using
get_genomic_nt instead of going directly to Genome_get_char. Checking for
genomepos < 0 in get_genomic_nt.
* stage3hr.c: In Stage3pair_remove_overlaps, allowing separate pair to
subsume overlapping pair only if it is better
* stage3.c, dynprog.c: Fixed check of chrpos to compare genomicpos against
chrhigh
* dynprog.c, stage3.c: Checking for chrpos between 0 and chrhigh - 1,
inclusive
* dynprog.c, dynprog.h, gmap.c, gregion.c, gregion.h, splicetrie.c,
splicetrie.h, stage1hr.c, stage3.c, stage3.h: Passing chrhigh along with
chroffset to all procedures
* dynprog.c, genome.c: When chromosomal coordinate is negative, returns '*'
instead of 'N'. Traceback procedures in dynamic programming will not add
pairs with '*' genomic nucleotides.
* util: Merged changes from last branch
* README: Added note that MAX_READLENGTH applies only to GSNAP
* stage2.c, stage3.c: Merged changes from
branches/2012-06-01-merge-single-gap to fix problems with merging single
gap on minus strand
2012-05-31 twu
* stage3.c: Protected another debugging statement from referring to
genomicseg
* gsnap.c: Fixed documentation for --fails-as-input flag.
* gmap.c: Added --fails-as-input string to getopt processing. Fixed
documentation for --fails-as-input flag.
* dynprog.c: Added messages to stderr before all abort statements
* translation.c: Requiring translation_leftpos and translation_rightpos to
be between 0 and querylength-1.
2012-05-25 twu
* gmap_build.pl.in: If -k 15 specified, but not -b, setting basesize to be 12
2012-05-24 twu
* Makefile.gsnaptoo.am: Added uniqscan program
* stage1hr.c: Decreased max_nalignments from 3 to 2
* dynprog.c: For known splicesites, adjusted low and high boundaries so
contlength is always between 0 and endlength-1, inclusive.
* stage3.c: Not reducing genomejump at ends anymore
2012-05-23 twu
* VERSION, index.html: Updated version number
* gmap.c, stage1hr.c: Increased max_nalignments for stage 2 to 3
* stage3hr.c: Turned off check for cdna_direction != 0 and SENSE_NULL in
declaring a GMAP alignment as bad
* stage3.c: Changed pass 9 from queryend_indels to queryend_nogaps, to avoid
false positive indels at ends and to prepare for noncanonical end trimming
* splicetrie.c: Improved debugging statements
* pair.c: Added information about knowngapp and protectedp in printing pair
information for debugging purposes
2012-05-22 twu
* stage3.c: In trimming of noncanonical introns near end, making an
exception for known introns
* dynprog.c: Replaced noindel version of Dynprog_end_splicejunction
functions with version allowing indels
* stage3.c, stage3.h, stage3hr.c: In Stage3_bad_stretch_p, excluding trimmed
regions on ends
* uniqscan.c: Using new interfaces to setup procedures
* stage1hr.c: Added debugging statements
* genome_hr.c: Removed debugging statements
* gamma-speed-test.c: Using new interface to setup procedures
* dynprog.c, dynprog.h, stage3.c: Introduced new endalign type,
QUERYEND_GAP, and using it in pass 8. Restored call of Dynprog_end
procedures in trim ends using BEST_LOCAL, which does not try to find an
intron.
* stage3.c: Better handling of ends: pass 8, best_local plus known splicing;
pass 9, queryend_indels; pass 10, queryend_nogaps. Medial gap not using
known splicing. Simplified trim_ends procedure. Not removing or
re-inserting known intron gaps. Bayesian computation of mapping scores
for GMAP alignments.
* splicetrie.c: Computing separate offsets for anchor and far splicesites
for use in Dynprog_end_splicejunction procedures. Not calling
Dynprog_add_known_splice procedures.
* pairdef.h, pairpool.c, pairpool.h: Added knowngapp field to Pair_T object
* dynprog.c, dynprog.h: Allowing compute_scores procedures to work on
genomicseg (for splicejunctions). Added functions
Dynprog_end5_splicejunction and Dynprog_end3_splicejunction to replace
add_known_splice procedures. Calling traceback_local_nogaps in two parts.
Dynprog_end_gap procedures returning final score even for QUERYEND_INDELS.
Made debugging statements work without genomicseg.
2012-05-21 twu
* samprint.c: Checking if MD string output is empty and if so printing "0"
2012-05-18 twu
* stage3hr.c: For paired-end reads, in cases of tie score, sorting results
by genomic position
* splicetrie_build.c: For intron-level splicing information, sorting
individual splicesites by ascending genomic position
* splicetrie.c, splicetrie.h: Computing spliceoffset needed to construct
splicejunctions. Calling Dynprog_local_nogaps procedures. Requiring
dynprog score > 0 on known splicejunctions.
* dynprog.c, dynprog.h: Fixed bug in making A-G and C-T ambiguous scores for
all modes. Implemented traceback procedure using given sequence and no
gaps for handling known splicejunctions. End dynamic programming
procedure now returns a final score for queryend_nogaps endalign.
Implemented make_contjunction procedures to retrieve the continuous part
of splicejunctions. Made make_splicejunction_3 consistent with
make_contjunction_3.
* stage3hr.c: Turned off check for min_splice_prob on GMAP alignments, since
it appears not to work for known splicesites
* pair.c: Pair_dump_list now prints line to indicate start of list
2012-05-17 twu
* dynprog.c: Handling cases where length1 == 0 or length2 == 0, which
otherwise cause fatal errors
2012-05-16 twu
* stage3.c: Setting use_genomicseg_p to false in all cases
* stage2.c: Using Oligoindex_hr_tally even if user genomic segment provided
* gmap.c: Computing genomicend even if user genomic segment provided
* genome-write.c: Added extra 4 words to end of genome blocks to accommodate
nextlow in Oligoindex_hr procedures
* Makefile.gsnaptoo.am: Added source files for GMAP
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst, index.html:
Updated version number
* stage1.c, stage1.h: Added procedure for nonstranded alignment. Turning
off scan_ends algorithm and using only sampling. Using indexdb size limit
for standard mode, but not for cmet-nonstranded mode.
* stage2.c: Looking up genomic nt for all alignment pairs in
convert_to_nucleotides
* uniqscan.c, gsnap.c: Using new interfaces to setup routines
* gmap.c: Using different indexdb size thresholds for standard and
cmet-nonstranded modes
2012-05-15 twu
* stage1hr.c: Using new interfaces to stage 2 procedures
* stage3.c, stage3.h: Handling AMBIGUOUS_COMP the same as MATCH_COMP and
DYNPROG_MATCH_COMP. Removed genomic_offset argument from Stage2_compute.
Fixed intermediate alignment results for debugging by returning pairs
instead of path from path_compute.
* oligoindex_hr.c, oligoindex_hr.h: Fixed Cmet_reduce commands for
CMET_STRANDED mode
* match.c: Fixed memory leak in a debugging procedure, Match_print
* gregion.c, gregion.h: Made Gregion_filter_support function available
again. Added function Gregion_genestrand.
* Makefile.dna.am, dynprog.c, dynprog.h, genome.c, genome.h, gmap.c,
stage2.c, stage2.h: Added code to make GMAP work on cmet-stranded and
cmet-nonstranded modes
2012-05-14 twu
* stage3hr.c: Disallowing new Stage3pair_T object if its insertlength
exceeds pairmax and we expect a concordant pair
* trunk, config.site.rescomp.tst, src, Makefile.dna.am, block.c, block.h,
boyer-moore.c, boyer-moore.h, dynprog.c, dynprog.h, genome-write.c,
genome-write.h, genome.c, genome.h, genome_hr.c, genome_hr.h, gmap.c,
gsnap.c, intlist.c, intlist.h, oligoindex.c, oligoindex.h,
oligoindex_hr.c, oligoindex_hr.h, pair.c, pair.h, splicetrie.c, stage1.c,
stage1.h, stage1hr.c, stage2.c, stage2.h, stage3.c, stage3.h, stage3hr.c,
stage3hr.h, uniqscan.c, util: Merged revisions 63606 to 64016 from
branches/2012-05-08-genomic-nts to read genomic nt rather than generate
genomicseg
2012-05-10 twu
* trunk, config.site.rescomp.tst, archive.html, index.html, src, dynprog.c,
util: Merged revisions 63773 through 63823 from
branches/2012-05-10-better-affine-gap to make score matrix symmetric, put
sequence2 in outer loop, fix boundary conditions, and improve efficiency
* resulthr.c: Fixed uninitialized value for X2 on halfmapping_mult alignments
* samprint.c: Fixed uninitialized value for X2 on halfmapping_uniq alignments
2012-05-07 twu
* gtf_genes.pl.in, gtf_introns.pl.in, gtf_splicesites.pl.in: Made -E flag
use exon_number field
* gtf_genes.pl.in, gtf_introns.pl.in, gtf_splicesites.pl.in: Added -E flag
to ignore exon_number fields in GTF file
* VERSION: Updated version
* oligoindex_hr.c: Reverting to version that zeroes out counts for oligomers
that are overabundant or not in query
* gmap.c, stage1hr.c, stage2.c, stage2.h: Providing a limit on the number of
suboptimal alignments returned from stage 2. Limit set to 2 for GMAP and
1 for GSNAP.
* gsnap.c: Added getopt handler for --sam-multiple-primaries
2012-05-03 twu
* trunk, VERSION, config.site.rescomp.tst, src, dynprog.c, gmap.c, pair.c,
pair.h, stage1hr.c, stage3.c, stage3.h, stage3hr.c, stage3hr.h, util:
Merged revisions 63036 to 63240 from branchese/2012-05-01-affine-gap to
implement an affine gap algorithm for dynamic programming
* uniqscan.c: Using new interface to Stage1hr procedure that contains
first_absmq as an argument
* shortread.c: Generalized handling of old Illumina paired-end format ending
in :0 or :<digit>.
* genome.c: Fixed function Genome_fill_buffer_simple_alt so it returns
ref+alt, instead of empty+alt.
* indexdb.c: Made writing of offsetscomp file faster when blocksize == 1 (or
k == b), by using a single write command instead of looping.
* goby.c, outbuffer.c: Implemented patches for Goby 2.0
* gmap.c, gsnap.c, outbuffer.c, pair.c, pair.h, result.c, result.h,
resulthr.c, resulthr.h, samprint.c, samprint.h, stage1hr.c, stage1hr.h,
stage3.c, stage3.h, stage3hr.c, stage3hr.h: Added flag
--sam-multiple-primaries to allow multiple alignments to be marked as
primary, if their mapping scores are equally good
* shortread.c: Handling older style Illumina paired-end reads that end in
":0"
2012-05-02 twu
* stage3.c: Fixed debugging statements
* oligoindex_hr.c: Not zeroing out counts[i] for oligomers that are either
overabundant or not in query. Saves time in allocate_positions function
and in store functions.
2012-04-27 twu
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst, index.html:
Updated version
* gmap.c: Using new interface to Stage3_compute
* stage3hr.c, stage3hr.h: Using nmatches_posttrim (adjusted by scores for
indels) to compare alignments, except for pre-final terminals. Performing
iteration in optimal_score procedures to use updated trim boundaries as
poor alignments are removed.
* stage3.h: No longer use nmatches_pretrim in Stage3_compute
* stage3.c: Removed use of QUERYEND_INDELS at ends of reads, and using
QUERYEND_NOGAPS instead, to reduce time spent in dynamic programming
* stage1hr.c: Added usedp field to Segment_T object, and marking it true if
the segment is used in making a Stage3end_T object. Skipping these
segments in finding terminals.
* pair.c, pair.h: Added pos5 and pos3 arguments to Pair_nmatches
* genome.c: Made uncompress_mmap faster by translating high and low words 16
bits at a time. Adding N's only if flags is not zero.
* dynprog.c: Modified bridge_intron_gap so it searches for indels on left or
right of splice, but not both. Changes algorithm from quadratic to linear
time.
2012-04-25 twu
* stage1hr.c, stage3hr.c, stage3hr.h: Revisions made to hit_goodness_cmp and
hitpair_goodness_cmp. Using genomiclength in hit_goodness_cmp, final
round. For terminals, returning 0 in preliminary rounds. Not using
scores for terminals, and using scores only if GMAP or indels are
involved. In pair_up procedure, not using terminal scores to update
found_score.
* indexdb.c: Fixed uninitialized variables that caused problems with older
GMAP indices
2012-04-23 twu
* dynprog.c: Created separate functions, compute_scores_lookup_fwd and
compute_scores_lookup_rev
* splicetrie.c: Fixed calls to Dynprog_end5_gap and Dynprog_end3_gap to use
endalign instead of to_queryend_p.
2012-04-20 twu
* VERSION: Updated version
* stage3hr.c: Added check for too many indel breaks in GMAP
* stage1hr.c, stage3hr.c, stage3hr.h: Storing cdna_direction in hit of type
GMAP and using it instead of sensedir when printing
2012-04-19 twu
* VERSION, index.html: Updated version number
* stage3.c: Turned off debugging
* stage3.c: Fixed uninitialized variable in trim_novel_spliceends
* dynprog.c, dynprog.h, stage3.c: Using correct endalign types in
Dynprog_end5_known and Dynprog_end3_known
* pair.c: Fixed issue with uninitialized variable in printing splicesite
labels
2012-04-18 twu
* result.c, result.h: Added mergedp variable
* outbuffer.c, outbuffer.h: Handling results where mergedp is true
* gmap.c: Allowing chimera finding to be turned off by setting -x to be 0.
Added mergedp variable so merged alignments generate only a single result.
2012-04-16 twu
* VERSION, config.site.rescomp.tst, index.html: Updated version
* samprint.c: Handling case where clip-overlap results in a NULL substring
* gmap.c: In call to Stage3_merge_local_single, clipping parts around
breakpoint instead of chimerapos and chimeraequivpos, to avoid issues
where maxpeelback is insufficient
* stage3.c, stage3.h: Renamed variable from extendp to max_extend_p
* get-genome.c: Added --forsam flag to generate header for SAM files
2012-04-10 twu
* VERSION: Updated version
* stage1hr.c: Fixed bug from uninitialized variable
* gmap.c, stage3.c, stage3.h: Added a criterion for extending left and right
chimera ends to consecutive mismatches, based on queryjump and genomejump
being unequal.
2012-04-09 twu
* gsnap.c, uniqscan.c: Using new interface to Stage3hr_setup
* stage1hr.c: Finding terminals by a new method. Instead of counting
mismatches from end, requiring only that querypos3 - querypos5 is greater
than index1part. Now searching terminals on single-end reads even if a
GSNAP alignment has been found. Removed nsalvage == 0 requirement for
searching terminals and paired-end reads.
* substring.c, substring.h: Added procedure Substring_set_endtypes
* stage3hr.c, stage3hr.h: Changed optimal score procedures to use max of
max-terminal and min-other for prefinal rounds, and max of max-GMAP and
min-other for final rounds. For GMAP eventrim scores, not counting
indels, and adding a penalty for long ambiguous ends, by dividing by
index1part + (index1interval - 1). Terminal alignments now compute their
own endtypes.
2012-04-06 twu
* stage2.c: Fixed fatal bug when looking for shifted canonical splice site
by checking that rightpos is less than genomicend.
* gmap.c, stage3.c, stage3.h: For chimeras, extending ends until three
consecutive mismatches are found. At final breakpoint, cleaning indels
from ends.
2012-04-04 twu
* stage1hr.c: For single-end reads, finding distant splicing only when no
other hits have been found
* VERSION, index.html: Updated version
* pair.c, pair.h, stage3.c: Fixed bug in Stage3_mergeable where we require
end1 and start2 pairs to be computed
* splicetrie.c: Allowing 1 mismatch in distal exon
* stage3hr.c: Changed debugging statement to report score_eventrim
* stage3.c: Rewrite trim_novel_spliceends to scan pairs first to find
genomicpos bounds and then iterate through genomicpos. Allowing
pick_cdna_direction to return SENSE_NULL if no introns exist.
* stage1hr.c: For GMAP terminals, also checking for a bad stretch in GMAP
result after the call to align_halfmapping_with_gmap
* stage3.c: Fixed bug in trim_novel_spliceends when pairs is NULL
2012-04-03 twu
* stage3.c: Turned off trimming of novel splice ends for GMAP
* index.html: Made changes to reflect new version
* VERSION: Updated version
* stage3hr.c: Fixed issue where no non-terminal alignment existed, resulting
in using min trim length of MAX_READLENGTH
* splicetrie.c: Allowing only 1 mismatch (previously 2) in internal splice
region of 6 bp, and no mismatches in external splice region (previously
depended on extension). This avoids bad splicing due to poor gene models.
* pair.c, pair.h, stage3.c: Using pairs instead of pairarray in determining
whether chimera ends are connectable
* pair.c, pair.h, stage3hr.c: Counting indels in GMAP alignments only within
eventrim region
* stage3.c: Added function trim_novel_spliceends
* pair.c: Requiring that cdna_direction not be zero when printing splice
site probabilities at the ends
2012-04-02 twu
* trunk, VERSION, src, chimera.c, chimera.h, gmap.c, outbuffer.c, pair.c,
pair.h, pairpool.c, pairpool.h, stage1hr.c, stage3.c, stage3.h,
stage3hr.c, stage3hr.h, util: Merged revisions 60621 to 60936 from
branches/2012-03-27-gmap-chimeras to improve GMAP chimeras and to apply a
uniform eventrim procedure in stage 3 optimal score procedures
2012-03-30 twu
* gamma-speed-test.c: Added to SVN repository
* config.site.rescomp.tst, configure.ac, memory-check.pl, atoiindex.c,
cmetindex.c, gmap.c, gsnap.c, indexdb.c, pdldata.c, snpindex.c: Added
--enable-mmap flag to configure. Added small fixes to allow programs to
work without mmap.
2012-03-29 twu
* stage3hr.c: Allowing trimming on both ends of a terminal alignment
* pair.c: Handling case where hard clipping is not possible
2012-03-27 twu
* stage1hr.c: Handling the case where GMAP alignment is attempted on a
translocation
2012-03-23 twu
* VERSION, index.html: Updated version
* stage1hr.c: Added multiple checks for GMAP bad stretch
2012-03-22 twu
* index.html: Updated for latest version
* stage1hr.c: Fixing fatal bug when max_end_insertions is set to less than 3
2012-03-21 twu
* VERSION: Updated version
* index.html, archive.html: Updated for new version
* gsnap.c: Reduced default value of distant-splice-penalty from 2 to 1
* gsnap.c: Reduced default value of distant-splice-penalty from 3 to 2
* stage1hr.c: Changed criterion for GMAP salvage from
Stage3end_bad_stretch_p to Stage3end_score > cutoff_level, because
previous criterion caused GSNAP to miss distant splicing
* dbsnp_iit.pl.in: Checking whether the exceptions field is defined in the
snp file
2012-03-20 twu
* gmap.c, outbuffer.c, stage3.c, stage3.h: Removed some unused parameters
* VERSION: Updated version
* stage3hr.c: Changed score for GMAP alignments to be post-trim matches
minus penalties for splicing and indels. Allowing Stage3end_bad_stretch_p
to handle GMAP alignments.
* stage3.c: Not doing any peelback on the extension after trim_ends
* stage1hr.c: Moved check for GMAP bad stretch from elimination as a very
bad alignment to a salvage status
* dynprog.h: Added comment
2012-03-19 twu
* pair.c, pair.h, stage3.c: Added an extend step after the ends are trimmed,
to get as long an extension as possible.
* stage1hr.c, stage3.c, stage3.h: Added a check for GMAP alignment length,
in addition to the bad stretch check
* gmap.c: Using new interface to Stage3_compute
* substring.c: Added comment
* stage1hr.c, stage3.c, stage3.h, stage3hr.c, stage3hr.h: Removed
computation of gmap_nconsecutive, and implemented Stage3_bad_stretch_p to
evaluate GMAP alignments instead.
2012-03-16 twu
* gff3_genes.pl.in, gff3_introns.pl.in, gff3_splicesites.pl.in: Filtering
out comment lines beginning with '#'.
* gff3_genes.pl.in, gff3_introns.pl.in, gff3_splicesites.pl.in: Printing
results for last mRNA in each gene
2012-03-15 twu
* stage3.c: In traversing dual break, accepting the stage 2 solution only if
the entire query sequence is bridged
* gff3_genes.pl.in, gff3_introns.pl.in, gff3_splicesites.pl.in,
gtf_genes.pl.in, gtf_introns.pl.in, gtf_splicesites.pl.in: Made gff3 and
gtf programs handle exons in arbitrary order
* gtf_genes.pl.in, gtf_introns.pl.in: Checking for comment lines
* gtf_genes.pl.in, gtf_introns.pl.in: Allowing GTF file to lack exon_number
field
* gtf_genes.pl.in: Checking both gene_name and gene_id to get gene name
2012-03-13 twu
* gmap.c, inbuffer.c, inbuffer.h: Added -1 flag for self-align feature
2012-03-12 twu
* configure.ac, README, config.site: Changed default value of MAX_READLENGTH
from 200 to 250
* stage1hr.c: Added missing else statement
* dynprog.c, splicetrie.c, splicetrie.h: Changed Splicetrie_solve_end5 and
Splicetrie_solve_end3 to take triecontents, trieoffsets, and j, rather
than triestart, and to check for a null pointer.
* stage3hr.c: No longer returning NULL from Stage3end_new_gmap when result
crosses before beginning of genome or at end of chromosome.
* stage1hr.c: Removed possibility of dereferencing uninitialized memory when
skipping over diagonals straddling beginning of genome.
2012-03-09 twu
* gmap.c, stage3.c, stage3.h: Made changes in Stage3_merge_single compatible
with PMAP
* Makefile.dna.am, gsnap_fasta.c: Moved gsnap_fasta.c and bam_fasta and
sam_fasta programs to GSTRUCT repository
* get-genome.c: Fixed -E option for printing exons from gene map files.
Added -S option for printing sequence from gene map files.
* get-genome.c: Removed R from getopt string
* get-genome.c: Removed -R flag and references to map_relativep
2012-03-08 twu
* gmap.c: Removed debugging statement
* gmap.c: Fixed test for CHIMERA_SLOP to handle minus strand alignments
correctly. Added a remove duplicates step to stage3list_from_gregions.
Using new interface to Stage3_merge_single and Stage3_merge_splice.
* stage3.c, stage3.h: Implemented functions Stage3_merge_single and
Stage3_merge_splice. The first function uses dynamic programming to solve
the region between the two parts.
* pair.c, pair.h: Implemented function Pair_set_genomepos_list
2012-03-07 twu
* gsnap.c: Fixed default values in --help statement for number of GMAP runs
allowed
* stage1hr.c: Turned bad stretchp back on as a criterion for running GMAP
improvement in paired-end reads. Implemented GMAP improvement for
single-end reads.
* substring.c, substring.h: Implemented Substring_genestrand
* stage3hr.c, stage3hr.h: Added genestrand as a field for Stage3end_T object
2012-03-02 twu
* stage3.c: Added another check to make sure we don't try to solve for dual
breaks at the ends of an alignment
2012-03-01 twu
* dynprog.c: Making certain that left_prob and right_prob are initialized to
0.0
* splicetrie_build.c: Limiting distance for splicetries to shortsplicedist
* stage3.c: Checking for endgappairs == NULL, before trying to access
*endgappairs
2012-02-29 twu
* configure.ac: Changed default .config.site to ./config.site
* dynprog.c: Fixed cases where left_prob and right_prob were not assigned
* stage3.c: Making singlep a local variable in traverse_dual_genome_gap
2012-02-28 twu
* stage3.c: Setting *singlep to false to fix bug in traversing dual genome
gaps where left goodness or right goodness was called after a dual gap win.
* gmap.c: Calling Alphabet_setup, Oligo_setup, Oligop_setup, Indexdb_setup,
and Stage1_setup only when genome is provided
2012-02-27 twu
* gmap.c, stage3.c, stage3.h: Restored cdna_direction == 0 when no splices
are present. Transferring overall cdna_direction to first Stage3_T object
in a chimera.
* stage3.c: Disallowing cdna_direction to be set to 0
* gmap.c: Added call to Oligo_setup
* indexdb.c: Looking for offsetscomp_suffix exactly
* gmap.c: Using new interface to Stage3_compute
2012-02-24 twu
* VERSION: Updated version number
* stage3.c: Handling case where non-canonical splice is exactly in the
middle of the read
* stage1hr.c, stage3.c, stage3.h, stage3hr.c, stage3hr.h: Replaced
high_quality computation on gmap alignments with gmap_nconsecutive
* outbuffer.c, gsnap.c, goby.c, goby.h: Implemented new Goby code
* gtf_splicesites.pl.in: Skipping comment lines that begin with '#'.
* gtf_splicesites.pl.in: Removed requirement for exon_number field in GTF
file
* gmap_build.pl.in: Added -T flag to specify temporary build directory
* gmap_build.pl.in: Deleting .coords file
2012-02-14 twu
* gmap_build.pl.in: Fixed filenames for 1-digit values
* oligo.c: Added repetitive oligos for 6-, 7-, and 8-mers.
* indexdb.c: Improved warning message
* gmap_build.pl.in: Made default value for base size to be kmer size
* configure.ac: Moved read of config.site and setting of CFLAGS earlier, so
default CFLAGS is not set by autoconf.
* setup1.test.in, Makefile.am, setup.ref12123positions.ok,
setup.ref123positions.ok: Changed name of positions file to reflect new
naming scheme
* README: Added comment about new naming for positions file
* VERSION: Updated version
* config.site.rescomp.prd, config.site.rescomp.tst: Changed dates
* configure.ac: Adding check for CFLAGS and setting default to be -O3
* config.site: Commenting out default CFLAGS variable
* README: Added comment about adding -m64 in CFLAGS for Macintosh machines
* gsnap.c, stage1hr.c, stage1hr.h: Added control of gmap_indel_knownsplice
feature to gsnap program
* stage3hr.c, stage3hr.h: Added function to help run GMAP on indels to find
known splicing
* stage1hr.c: Added function to run GMAP on indels to find known splicing
* stage1.c: Using new interface to Reader_new
* pmapindex.c: Removed -R flag for processing reverse strand of genome.
Consolidated code for computing offsets and positions.
* pair.c: Eliminated extra points subtracted from Pair_nmatches, so that the
function reports the correct number of matches
* dynprog.c: Fixed bug where known splicing was being called when length2
was 0, resulting in bad endpoints for binary search
2012-02-03 twu
* stage1hr.c: Using spansize in computing floors. Reduced value of
STAGE2_MIN_OLIGO from 5 to 3.
* gsnap.c, uniqscan.c: Using new interface to Stage1hr_setup and
Spanningelt_setup
* reader.c, reader.h: Removed blocksize as a field for Reader_T object
* oligo.c, oligo.h: Storing oligosize as a static variable, and not using it
anymore as a parameter to Oligo_next or Oligo_skip.
* block.c: Removed oligosize as a field from Block_T object
* stage1hr.c, stage3hr.c, stage3hr.h, substring.c, substring.h: Revised
Stage3end_optimal_score to compare terminal and non-terminal alignments
using an eventrim procedure, based on the maximal trims found in the
non-terminal alignments, and re-computing the mismatch scores in those
regions.
* stage1hr.c: Running Stage3end_remove_overlaps to remove terminals that
overlap with a GSNAP alignment. Using spansize to compute fast_level.
Removed index1part (oligosize) from calls to Reader_new and Oligo_next.
2012-02-02 twu
* stage1hr.c, stage1hr.h: Generalized from 6 to STAGE2_MIN_OLIGO +
index1interval, for deciding whether a found diagonal is close enough to
end to redefine region for GMAP alignment. Using spansize to compute
floors, and counting all mismatches if spansize != index1part.
* spanningelt.c, spanningelt.h: Added function Spanningelt_setup, which
computes spansize for a given index1part and index1interval
* pairdef.h, pairpool.c, stage3.c: Added field end_intron_p to Pair_T
object, and using it in traversing dual genome gaps. When applicable,
solving introns where left intron or right intron is omitted.
* stage1hr.c: Improved debugging messages by printing chromosomal
coordinates rather than universal coordinates
* trunk, config.site.rescomp.prd, config.site.rescomp.tst, memory-check.pl,
src, Makefile.dna.am, atoiindex.c, cmetindex.c, genomicpos.c,
genomicpos.h, gmapindex.c, indexdb.c, indexdb_hr.c, pmapindex.c,
snpindex.c, spanningelt.c, spanningelt.h, stage1hr.c, types.h, util,
gmap_build.pl.in, gmap_setup.pl.in: Merged revisions 56772 to 56962 from
branches/2012-01-31-index1interval to allow genomic indices up to 16-mers
and sampling intervals from 1 to 3
2012-02-01 twu
* resulthr.c, stage3hr.c, stage3hr.h: If anomalous splices (of type
SAMECHR_SPLICE) occur within a read for a unique mapping, changing
resulttype from _UNIQ to _TRANSLOC.
* outbuffer.c: Fixed issue with double opening of nomapping output file,
first as single and then as paired
2012-01-31 twu
* gmapindex.c: Allowing only values 1, 2, and 3 for sampling interval
2012-01-30 twu
* gmap.c, gsnap.c, indexdb.c, indexdb.h, stage1hr.c, stage1hr.h, uniqscan.c:
Added --sampling flag and passing index1interval to Stage1hr_setup
* uniqscan.c: Using new interface to Indexdb_new_genome
* indexdb.c: Fixed incorrect variable name (oligo instead of oligoi).
* gmap_build.pl.in: Removed fixed value of -q 3 in calls to gmapindex
* gmap_build.pl.in: Added -q flag for sampling interval
* gmap.c, gmapindex.c, gsnap.c: Allowing user to enter -k 16
* block.c, indexdb_dump.c, indexdb_hr.c, stage1.c, stage1hr.c: Fixed mask
calculation to use unsigned long, so kmer of 16 works
* atoiindex.c, cmetindex.c, indexdb.c, indexdb.h, snpindex.c: Enabled
writing of genome indices with kmer of 16
* oligo.c: Allowing k-mer size of 16
* genome_hr.c: Clarified types of gamma and value to be Positionsptr_T and
firstbit to be int.
2012-01-27 twu
* oligo.c: Allowing k-mer size to go down to 9
* indexdb_hr.c, indexdbdef.h, indexdb.c: Eliminated separate data storage
for offsets when expand_offsets_p is true. Instead, expanding gammas into
offsetscomp and making gammaptrs just the identity function.
2012-01-19 twu
* indexdb.c, indexdb.h: Allowing selection of base size, and returning found
base size. Simplified logic for selecting indexdb.
* gmap.c, gsnap.c: Added --basesize flag to allow user to select base size
* gmapindex.c: Allowing k-mer size to be 15 or less
2012-01-18 twu
* gmapindex.c: Added break after handling flag -b
2012-01-13 twu
* VERSION: Updated version number
* Makefile.three.am: Rebuilt instructions from Makefile.gsnaptoo.am, plus
Makefile.dna.am for PMAP and PMAPINDEX
* Makefile.gsnaptoo.am: Removed gsnap_tally from distribution
* README: Change URL for Web site
* alphabet.c: No longer checking new get_codon procedures against old ones
* pmapindex.c: Revised limit for 8-mers to be alphabet size of 13 or less
* README.PMAP: Added a README file for PMAP
* indexdb.c, indexdb.h: Providing alphabet and alphabet_size to caller of
Indexdb_new_genome
* gmap.c: Added --alphabet flag to PMAP to specify a particular alphabet
* atoiindex.c, cmetindex.c, genome_hr.h, snpindex.c, types.h: Moved
definition of Storedoligomer_T from indexdbdef.h to types.h
* block.c, block.h, stage1.c: Removed msb computation and storage in Block_T
object, and initializing instead by Oligop_setup
* alphabet.c, alphabet.h, oligop.c, oligop.h: Providing aa_index_table to
Oligop procedures at run time
* Makefile.dna.am, alphabet.c, alphabet.h, indexdb.c, indexdb.h,
indexdbdef.h, pmapindex.c: Created Alphabet_T object. Moved relevant
codon-based procedures from indexdb.c to alphabet.c. Replaced
get_codon_fwd and get_codon_rev with lookup tables. Allowed pmapindex to
generate indices with different alphabets and alphabet sizes.
* gmapindex.c: Added check for k-mer size being greater than or equal to
base size
* gmap.c: Made default k-mer size 7 for PMAP
* Makefile.dna.am, indexdb.c, pmapindex.c: Made PMAP index files compatible
with compressed hash tables and added flags to be consistent with other
auxiliary indexing programs
2012-01-11 twu
* VERSION: Updated version
* index.html: Updated Web page for latest version
* mapq.c: Fixed debugging statement
* stage3hr.c: In output_cmp procedures, sorting first by nmatches (pretrim),
and then by mapq. Added procedure to enforce monotonicity of mapq scores.
* stage1hr.c, stage3hr.c, stage3hr.h: Running optimal_score first allowing
all GMAP alignments, then removing overlaps, and then running
optimal_score again without special provision for GMAP alignments.
* gmap.c, gsnap.c, pair.c, pair.h, uniqscan.c: Added flag --sam-use-0M to
control printing of 0M in CIGAR strings
* gmap.c, gsnap.c, stage3.h, uniqscan.c: Providing output_sam_p to
Stage3_setup
* stage3.c: In fill_in_gaps, handling dual breaks by inserting query and
genomic segments when output_sam_p is true.
* pair.c: In compute_md_string, setting state to be IN_MATCHES after seeing
I token, so we don't print two successive ^ tokens.
* stage3.c: Added a pass to remove adjacent insertions and deletions and fix
single gaps again.
2012-01-10 twu
* gmap_build.pl.in: Doing chmod on gammaptrs only if kmersize > basesize
2012-01-09 twu
* gsnap.c, stage1hr.c, stage3hr.c, stage3hr.h: Computing GMAP alignment
score using standard splicing penalties and indel penalties
2012-01-06 twu
* README: Added information about maximum genome length. Added information
about SAM fields.
* atoiindex.c, cmetindex.c: Eliminated memory leaks and reduced memory usage
from 20 GB to 12 GB for a human-sized genome
* gmap.c, gsnap.c, uniqscan.c: Using new interfaces to Dynprog_setup and
Stage3_setup to provide information about novelsplicingp. Passing
fileroot instead of dbroot to Datadir_find_mapdir.
* stage3hr.c: Stage3end_optimal_score and Stage3pair_optimal_score now
keeping all GMAP results
* stage3.c, stage3.h: Marking pairs as disallowed when novelsplicingp is
false and dynamic programming cannot find a solution provided by the
splicing_iit file. Trimming end introns that are disallowed.
* stage1hr.c: Added debugging message
* dynprog.c, dynprog.h: The bridge_intron_gap function now handling runs
where novelsplicingp is false and known splicing is at intron level.
Dynprog_genome_gap returning NULL in such cases.
* pair.c, pairdef.h, pairpool.c: Added field disallowedp to Pair_T object
* iit-read.c, iit-read.h: Added functions IIT_low_exists and
IIT_high_exists, used for intron-level known splicing. Improved warning
message for invalid IIT files.
2012-01-05 twu
* gmapindex.c: Added check for total genomic length exceeding 4 GB
* snpindex.c: Reduced maximum memory usage for human genome from 12 GB to 8
GB. Eliminated memory leaks.
2012-01-03 twu
* trunk, src, util: Merged revisions 50470 to 50909 from
branches/gmap-2011-10-24-mult-stage2
* config.site.rescomp.tst, config.site.rescomp.prd: Changed date
2011-12-28 twu
* README: Changed comment about splicing file with known introns possibly
being buggy
* VERSION: Updated version number
* index.html: Added changes for new version
* iit-read.c: If add_iit_p is true, trying filename with .iit suffix first
before trying filename as given
* stage3hr.c: Added check for low coordinate of new GMAP object being to the
left of coordinate 0
2011-12-21 twu
* shortread.c: Allowing for /3 endings in second end of Illumina short reads
2011-12-13 twu
* index.html: Made update for new version
* VERSION: Updated version number
* stage3.c: Fixed issue in trying to solve dual introns with negative query
coordinates
2011-12-09 twu
* spanningelt.c, spanningelt.h, stage1hr.c: Alternate fix to problem with
positions being NULL, while npositions > 0. Fixed Spanningelt_set so it
updates npositions along with positions.
* VERSION: Updated version number
* archive.html: Moved old version here
* index.html: Added new version. Added information about users group.
* stage1hr.c: Added an extra check for positions[querypos] not being NULL,
needed for nonstranded modes
2011-12-07 twu
* gmap_build.pl.in: Fixed bug in not assigning cmd variable
2011-12-05 twu
* shortread.c: Fixed bug in printing null accession for second end when
using --fails-as-input flag
* gmap_build.pl.in: Checking return codes from system calls
2011-12-02 twu
* VERSION, index.html: Updated version number
* gsnap.c: Changed name of flag from --ambig-splice-notrim to
--ambig-splice-noclip
* pair.c: Pushing "0M" between adjacent I and D operations in a cigar string
* gmap.c, gsnap.c, uniqscan.c: Using new interface to Splicetrie_setup.
Providing --ambig-splice-notrim flag in GSNAP.
* splicetrie.c, splicetrie.h: Providing behavior to turn clipping off at
ambiguous known splice sites, useful if trying to turn off all soft
clipping
* stage1hr.c: Fixed debugging statement
2011-11-30 twu
* stage1.c: Fixed variable name so PMAP could compile
2011-11-29 twu
* dynprog.c, maxent_hr.c, maxent_hr.h, pair.c, stage1hr.c, stage3.c,
substring.c: Checking for case where splice_pos minus margin goes beyond
beginning of chromosome
* VERSION, index.html: Updated version number
* maxent_hr.c: Checking for case where splice_pos is smaller than margin
* pair.c, samprint.c: Changed value of NM tag in SAM output to be edit
distance (mismatches plus gaps)
* stage1hr.c: Fixed debugging statement
2011-11-27 twu
* VERSION, index.html: Updated version number
* gsnap.c: Calling SAM_setup
* outbuffer.c: For GMAP, when quiet_if_excessive_p is true and npaths >
maxpaths, not printing any output
* pair.c, pair.h, stage3.c: Printing HI tag in SAM output
* samprint.c, samprint.h: When quiet_if_excessive_p is true and npaths_mate
> maxpaths, setting MATE_UNMAPPED in flag. Printing HI tag. Added
function SAM_setup.
2011-11-25 twu
* VERSION, index.html: Updated version number
* stage3.c: For GMAP, using QUERYEND_INDELS instead of QUERYEND_NOGAPS
* stage1.c: Added check for querylength being too short
* gmap.c: Changed information in --help about how to turn off chimera
detection
* stage3.c: In Stage3_mergeable, added check for firstpart_npairs or
secondpart_npairs being 0
2011-11-23 twu
* index.html, VERSION: Updated version number
* stage1.c: Made direct calls to fields in Match_T objects for speed. Made
changes so debugging macros work.
* dynprog.c: Made special traceback procedure for queryend_nogaps
2011-11-21 twu
* stage3hr.c: Fixed bug where Stage3end_remove_overlaps was not keeping ends
where paired_usedp was true. In Stage3end_remove_overlaps and
Stage3pair_remove_overlaps, terminal alignments lose to all other types.
* stage3hr.h: Put GMAP hittype before TERMINAL hittype
* gsnap.c: Increased default values for max_gmap_pairsearch,
max_gmap_terminal, and max_gmap_improvement
* stage3hr.c: Changed code for Stage3end_remove_overlaps to parallel that
for Stage3pair_remove_overlaps
* samprint.c: Allowing for GMAP alignments to be printed for single-end reads
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst: Updated version
number
* stage3.c: Added comment about pass 8 and 9
* index.html: Made changes for 2011-11-20 version
* uniqscan.c: Using new interface to Stage1_single_read
* stage3hr.c: Changed the code for Stage3end_optimal_score to be consistent
with that of Stage3pair_optimal_score. Allowing terminal alignments to be
considered along with other alignments.
* stage1hr.c: For single-end alignment, initializing done_level to
user_maxlevel, rather than opt_level, to be consistent with the code for
paired-end alignment. However, the values are the same anyway.
* gsnap.c: Changed the default value for terminal_threshold from 3 to 2.
Expanded the --help entry for --terminal-threshold.
* index.html: Made changes for version 2011-11-17
2011-11-20 twu
* stage3.c: Added more restrictions on non-canonical splices at ends of read
* stage3hr.c, stage3hr.h: Added functions Stage1end_start_endtype and
Stage1end_end_endtype for terminal alignments
* gsnap.c: Using new interface to Stage1_single_read
* gmap.c, stage3.c, stage3.h: Changed name of variable
* stage1hr.c, stage1hr.h: Implemented GMAP terminal mode for single-end reads
* dynprog.c: Lowered rewards for canonical introns, to help find
non-canonical introns
* stage3.c: In fill_in_gaps, if splicingp is false, then filling in a
deletion, not an intron
2011-11-19 twu
* stage3.c: Moved trim_noncanonical procedures from path_trim to
path_compute. After trim_noncanonical procedures, doing an extension using
QUERYEND_NOGAPS.
* stage1hr.c: Using -3*nmismatches and -4 for an indel in evaluating end
indels, corresponding to default values for trim_mismatch_score and
trim_indel_score.
2011-11-18 twu
* stage3.c: Using endalign instead of to_queryend_p. When endalign is
QUERY_NOGAPS, not doing peelback. In pass 8, changed extendp from false
to true (QUERYEND_INDELS).
* gsnap.c, uniqscan.c: Added flag --trim-indel-score
* dynprog.c, dynprog.h: Replaced to_queryend_p with endalign, with types
BEST_LOCAL, QUERYEND_INDELS, and QUERYEND_NOGAPS. For extensions to
queryend, always returning gappairs.
* atoiindex.c, cmetindex.c: Updated --help to indicate how --kmer and -D are
chosen by default
2011-11-17 twu
* VERSION: Updated version number
* gmap.c, gsnap.c, stage1hr.c, stage3.c, stage3.h, uniqscan.c: Revised
calculation of insertlength inside trim_noncanonical_ends procedures to
compensate for using maximum overlap in computing genomicseg.
* stage3.c: Made fill_in_gaps procedure replace short non-canonical introns
with deletions. Removed this feature from assign_gap_types. Added
additional checks on translation coordinates to stay within array bounds.
* dynprog.c: Added debugging statements
* substring.c: Added comment
* shortread.c, shortread.h, stage1hr.c: For GMAP algorithm in GSNAP,
assuming maximum overlap, rather than trying to compute overlap
* outbuffer.c: Printing all paths when -E flag is given to GMAP
* stage3hr.c: Improved error message when ambig end splicetype has an
unexpected value
* dynprog.c: Removed unused code for END_KNOWNSPLICING_SHORTCUT. Always
assigning *ambig_end_length in Dynprog_end5_known and Dynprog_end3_known.
2011-11-14 twu
* uniqscan.c: Using new interface to Stage3hr_setup
* stage3.c: Not substituting for long deletions at end of path_compute
* stage1hr.c, stage3hr.c, stage3hr.h: Removed end_indel_p parameters to
Stage3end_new_insertion and Stage3end_new_deletion
* stage1hr.c: Preventing call to Genome_count_mismatches_limit by
find_doublesplices where pos5 >= pos3
* VERSION: Updated version number
* configure.ac: Added flag --enable-popcnt
* dynprog.c: Initialing value of ambig_end_length
* stage3.c: Skipping gap pairs at the beginning of alignments in fill_in_gaps
* goby.c: Using new interface to Result_array
2011-11-12 twu
* VERSION: Updated version number
* stage1hr.c, stage3hr.c: Checking in Stage3end_new_gmap if genomicstart or
genomicend exceeds chrhigh, and if so, returns NULL.
2011-11-11 twu
* stage3.c: Moved assigning of gap types to end of path_compute, rather than
beginning of path_trim
* stage3.c: Inserting gap pairs and adding gap types at beginning of
path_trim
* gsnap.c, outbuffer.c, outbuffer.h, samprint.c, samprint.h, stage1hr.c,
stage3hr.c, stage3hr.h: Divided DISTANT_SPLICE type to SAMECHR_SPLICE and
TRANSLOC_SPLICE. Making merge_samechr_p act only at print time, which
allows SAMECHR_SPLICE to undergo pair_up_concordant again. Stopping
clip-overlap on distant splices.
2011-11-10 twu
* VERSION: Updated version number
* stage3hr.c: Keeping hits in optimal scoring if nmatches_posttrim is
sufficiently high relative to the best hit
* gsnap.c, uniqscan.c: Increased value of max_deletionlength from 30 to 50
* gmap.c, stage1hr.c: Using new interface to Stage3_compute
* stage3.c, stage3.h: Changed flow to path_compute on both cdna directions,
then pick_cdna_direction, then path_trim, which removes non-canonical end
exons.
* stage3hr.c, stage3hr.h, substring.c, substring.h: Added nmatches_posttrim
and using it to break ties resulting from equal values of nmatches. Added
a general test for Substring_new based on matches and mismatches before
trimming.
* pair.c: Modified Pair_nmatches to add penalties for an indel and for a
splice site with low probabilities. Modified compute_md_string on I
tokens to skip only for insertion pairs.
* dynprog.c: Fixed procedure find_best_endpoint_to_queryend to look only at
r == length1
* stage3hr.c: Turned off separate treatment of terminal alignments in
Stage3pair_optimal_score. Always trimming ends of insertions and
deletions (previously depended on value of end_indel_p).
2011-11-09 twu
* stage3.c: Using a sliding scale in trimming end exons
* pair.c: Added debugging macro for compute_md_string
* pair.c, pair.h: Taking cdna_direction as a parameter in
Pair_print_exonsummary, instead of determining sense separately for each
intron
* stage3.c: Restored alignment score in pick_cdna_direction with different
values of significant difference for GMAP and GSNAP. Considering a
non-canonical intron as canonical if both splice site probabilities are
high.
* splicing-score.c: Fixed getopt so -D takes an argument
* pairpool.c, pairpool.h: Revised Pairpool_count_bounded to return number of
pairs at start.
* gmap.c: Added comments
* stage3.c: Restored use of alignment scores in pick_cdna_direction, after
comparing number of noncanonical splices
2011-11-08 twu
* VERSION: Updated version number
* shortread.c: No longer printing warning message about not finding "/1" or
"/2" endings
* gsnap.c, uniqscan.c: Added max_deletionlength variable
* gmap.c, outbuffer.c, outbuffer.h, pair.c, pair.h, samprint.c: No longer
converting short noncanonical splices from type N to type D, since this is
now performed in stage 3. Removed cigar_noncanonical_splices_p variable.
* stage3.c, stage3.h: In assign_gap_types, converting noncanonical splices
smaller than max_deletionlength into deletions
* gsnap.c, stage3hr.c, stage3hr.h, uniqscan.c: Treating distant splices on
the same chromosome as a translocation by default. Added a flag
--merge-distant-samechr to get previous behavior.
* stage3.c: Using maxintronlen in trimming end exons
* stage3.c: Removed alignment scores from pick_cdna_direction. Revised
procedures for trimming noncanonical end exons and doing distal/medial
comparison by adding procedures canonicalp and good_end_intron_p, with
latter using probabilities.
* gmap.c: Increased parameter for maxpeelback_distalmedial from 24 to 100
* dynprog.c: Added debugging statements
2011-11-07 twu
* pair.c: Computing MD string from cigar tokens
* stage2.c: Restored querydist penalty
* VERSION: Revised version number
* index.html: Added entry for new version
* stage1hr.c: Commented out second round of terminal alignments
* gmap.c, gsnap.c, uniqscan.c: Set trim_indel_score to -4 to be consistent
with previous value
* gmap.c, gsnap.c, uniqscan.c: Calling Pair_setup
* substring.c: Extending trimming toward ends in case of ties
* stage3.c: Extending ends completely before final trim
* pair.h, pair.c: Extending trimming toward ends in case of ties. Added
Pair_setup function to use trim_mismatch_score value provided by user.
* stage2.c: Put code for suboptimal starts into a compiler directive
* gmap.c, gsnap.c, uniqscan.c: Provided new defaults for
suboptimal_score_start and suboptimal_score_end, based on simulations
* gmap.c, gsnap.c, stage2.c, stage2.h, uniqscan.c: Introduced parameters for
suboptimal_score_end and suboptimal_score_start
2011-11-06 twu
* gmap.c: Added flag for --suboptimal-score
* gmap.c, gsnap.c, outbuffer.c, pair.c, pair.h, result.c, result.h,
resulthr.c, resulthr.h, samprint.c, samprint.h, stage1hr.c, stage3.c,
stage3.h, stage3hr.c, stage3hr.h, uniqscan.c: Restoring old MAPQ score.
Making absolute MAPQ score a separate calculation, and printing it in an
XQ flag.
2011-11-05 twu
* stage1hr.c: Added comments
* stage3hr.c: Added field indel_low, and using to prefer indels at low
genomic coords
* stage1hr.c: Fixed computation of firstbound and lastbound so end indels
are found on short reads, such as 36-mers.
2011-11-04 twu
* pair.c: Added code to compute_cigar to merge duplicate token types
* stage2.c: Fixed uninitialized value for last_canonicalp
* stage2.c: Implemented ability to generate suboptimal paths based on
different initial positions
2011-11-03 twu
* gmapindex.c: Removed unnecessary file open for -P flag
* gmapindex.c: Fixed memory leaks for -G flag
* gmapindex.c: Fixed memory leaks for -A flag
2011-11-01 twu
* pairpool.c: Commented out copy of shortexonp
* dynprog.c, dynprog.h, gmap.c, gsnap.c, stage3.c, uniqscan.c: Removed
endlength requirement for microexons. Returning prob2 and prob3 from
Dynprog_microexon_int and applying two standards, depending on whether an
indel was originally present
* pair.c: Printing shortexon information
* pairpool.c: Copying shortexon information in copying pairs
* smooth.c: Removed unused parameters
* stage3.c: Adding endlength requirement for finding microexons
* smooth.c: Printing result of smoothing as pairs
* stage3.c: Made penalties harsher for indels at end near poor splice sites
* stage2.c: Computing best overall score during dynamic programming process
* uniqscan.c: Using new interface to Stage3hr_setup
* stage2.c: Made changes so PMAP could compile
* stage3.h: Added parameter favor_mode to Stage3_compute
* stage3.c: Counting indel near splice as 2 mismatches. Added endlength
requirement of 12 for indel near splice.
* stage2.c: Going through all hits to accumulate cells. No longer using
number of links to set root scores.
* gmap.c, stage1hr.c: Passing value of favor_mode to Stage3_compute
* smooth.c: Relaxing probability requirement for end exons in GSNAP from
0.05 to 0.10.
2011-10-31 twu
* stage3.c: In trimming non-canonical end exons, not combining nearindelp
with splice probs, requiring 1 mismatch or less for bingop, and allowing
AT-AC introns. Extending alignments to queryend before trimming
non-canonical end exons.
* stage3.c: In trimming end exons, using a sliding scale based on intron
length. Also penalizing for indels and mismatches close to exon-exon
boundary.
2011-10-28 twu
* gsnap.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h: Providing
expected_pairlength and pairlength_deviation values in Stage3hr_setup, and
removing from Stage1hr procedures.
* stage3.c: In end exons, checking if indel present, and if so, requiring
that splice probabilities both be greater than 0.9.
* gsnap.c, stage3hr.c, stage3hr.h: Restoring pairlength deviation. Using
expected pairlength and pairlength deviation to discriminate among
paired-end reads.
* stage2.c: Adding all hits from final querypos directly to celllist, rather
than updating rootscores. Fixed update of rootscore information to use
current querypos and hit. Dynamic programming starting from querypos 0,
rather than querypos 1.
* diag.c: Restricted update of diagonal in middle region, to avoid affecting
subsequent beginning and end regions
* splicetrie_build.c: Fixed handling of intron intervals, by introducing
INTRON_HIGH_TO_LOW
2011-10-27 twu
* stage3hr.c: Made other fixes to allow copying of GMAP hit types
* stage1hr.c: Making copies where necessary for multiple GMAP subpaths, and
freeing old Stage3pair_T objects at the appropriate time. Calling
Stage3pair_remove_overlaps on double GMAP alignments.
* pairpool.c: Using CALLOC_OUT in Pairpool_copy_array
* gsnap.c: Printing sequence name when debugging memusage
* stage1hr.c: Handling multiple subpaths from stage 2 computation
* stage2.c: Removed restriction on number of positions for final non-zero
querypos
* stage3hr.c: Allowing Stage3end_T objects of type GMAP to be copied
* pairpool.c, pairpool.h: Added function Pairpool_copy_array
* stage2.c: Fixed location of compiler directive for PMAP
* stage3.c: Restoring negative points for non-canonical introns in computing
goodness
* stage2.c: Adding root scores for final non-zero and specific querypos
* gmap.c, list.c, list.h, stage1hr.c, stage1hr.h, stage2.c, stage2.h: Going
back to changes from revision 50508 for multiple subpaths from stage2,
plus revisions 50504 to 50909 from branches/gmap-2011-10-24-mult-stage2
for a root position method for finding optimal and suboptimal subpaths
2011-10-25 twu
* src, gmap.c, list.c, list.h, stage1hr.c, stage1hr.h, stage2.c, stage2.h:
Reverted to version 50507, before changes made to allow multiple paths
from a stage2 computation
2011-10-24 twu
* VERSION: Updated version number
* archive.html, index.html: Added changes for 2011-10-24 version
* gmap.c, list.c, list.h, stage1hr.c, stage1hr.h, stage2.c, stage2.h: Merged
revisions 50469 to 50504 from branches/2011-10-24-mult-stage2 to allow for
multiple stage2 results from the same genomic segment for GMAP.
* stage3.c: Assigning sensedir in all cases, based on intron scores if
necessary
* gmap.c, gsnap.c, outbuffer.c, pair.c, pair.h, result.c, result.h,
resulthr.c, resulthr.h, samprint.c, samprint.h, stage1hr.c, stage1hr.h,
stage3hr.c, stage3hr.h, uniqscan.c: Computing MAPQ score relative to best
alignment. Printing X2 field in SAM output to provide second best MAPQ
score.
* stage3.c, stage3.h: When sense_try is provided, assigning sensedir
2011-10-16 twu
* VERSION: Updated version number
* index.html: Added comment about change to GMAP
* gmap.c: Sorting stage3list before evaluating for chimeras
2011-10-14 twu
* VERSION: Updated version number
* index.html: Updated for new version
* configure.ac: Grouped together checks for built-in procedures
* splicetrie_build.c, splicetrie_build.h: Checking for splice sites being
beyond the chromosome length boundary
* splicetrie.c: Handling case where trieoffsets has a NULL_POINTER, which
can occur with intron-type splicing
* splicetrie_build.c: Handling case where nsites from a given splice site is
zero
2011-10-13 twu
* popcnt.m4: Checking each built-in instruction only if available
* asm-bsr.m4: Assigning a value to x
* VERSION: Updated version number
* gsnap.c: Removed 'S' from getopt
* gsnap.c: Fixed naming of --splicingdir flag
* samprint.c: Not printing mate chr or mate chrpos when mate has an
excessive number of paths
* popcnt.m4: Changed test for -mpopcnt from a compiler test to a run test,
to make sure instruction is legal
2011-10-12 twu
* dbsnp_iit.pl.in: Fixed syntax
* stage1hr.c: Fixed debugging statements
* VERSION: Updated version number
* README, dbsnp_iit.pl.in: Made changes to handle exceptions within the snp
file
* stage1hr.c: Fixed detection of novel splice ends for distant splicing
* maxent_hr.c: Fixed debugging commands
* stage3hr.c: Added comment
* stage3hr.c: Eliminating terminals in Stage3end_optimal_score when
non-terminals are present
* iit-read.c: Checking for case where nintervals is zero
* iit-write.c: Fixed memory leak in IIT_build
2011-10-10 twu
* stage3hr.c: Removing -nindels from calculation of querylength_adj
* gsnap.c: Limiting minimum value of indexdb_size_threshold
* indexdb.c: Fixed bug in computing Indexdb_mean_size using compressed hash
table
* genome_hr.c: Adding +1 to ctr only when necessary
2011-10-09 twu
* genome_hr.c, genome_hr.h: In gamma decoding, changed from division of
shift by 2 to using shift - 1, made all variables unsigned ints, and added
code for branchless computation.
2011-10-07 twu
* gmap.c, gsnap.c, inbuffer.c, inbuffer.h: Generalized --filter-chastity to
work on either or both ends of a paired-end read
* index.html: Changed wording
* index.html: Added information about --filter-chastity option
* goby.c, gsnap.c, inbuffer.c, shortread.c, shortread.h, uniqscan.c:
Implemented --filter-chastity option
* genome_hr.c: Implemented a 2-shift method for decoding gammas, which
avoids a branch
* config.site.rescomp.prd, config.site.rescomp.tst: Updated for new version
* index.html: Entered new version 2011-10-07
* VERSION: Updated version number
* splicetrie.c: Fixed a bug where splicesites_i was not being initialized to
NULL
* genome_hr.c: Removed dispatch procedures. Moved part of read_gamma to top
of loop, to guarantee that the final ptr location is correct.
* genome_hr.c: Removed nbits. Implemented macros for clear_lowbit and
clear_highbit.
* genome_hr.c: Implemented both dispatch and non-dispatch methods for
getting offsetptrs from gammas
2011-10-06 twu
* genome_hr.c: Rearranged order of gamma computations within loops
* genome_hr.c: For gamma commands, trying builtin clz first, then bsr in
assembly, then table lookup.
* Makefile.dna.am, Makefile.gsnaptoo.am: Added POPCNT_CFLAGS
* config.site: Made CFLAGS=-O3 and added comments about testing -mpopcnt
* acinclude.m4, asm-bsr.m4, popcnt.m4, configure.ac: Added tests for
-mpopcnt compiler flag and bsr function in assembly
* genome_hr.c: Added assertion statements to make sure builtin_clz is not
called with a value of 0
2011-10-05 twu
* VERSION: Updated version number
* index.html: Made changes for latest version
* configure.ac: Added gff3 utility programs
* stage3hr.c: Sped up comparison of overlapping and separate paired-end
alignments by using arrays instead of lists.
* stage3hr.c: Simplified procedure for finding bad superstretches
* stage1hr.c: Fixed bugs in pushing indels to low genomic position in
solve_middle_insertion and solve_middle_deletion
* gsnap.c: Commenting out --genes option
* stage3hr.c: Performing separate runs of Stage3pair_remove_overlaps on
overlapping and non-overlapping alignments
* stage3hr.c: Added check for bad superstretches in Stage3pair_overlap
* gsnap.c, stage1hr.c, stage1hr.h, uniqscan.c: Implemented a fast,
integrated method for novel and known double splicing, and using it by
default
* access.c: Added error messages when mmap fails
2011-10-04 twu
* stage3hr.c: Picking a winner in case of ties in Stage3pair_remove_overlaps
* stage1hr.c: Fixed calculation of genomicstart and genomicend for GMAP
alignments, to extend as if there were no trimming
* stage3hr.c: Added debugging statements
* stage1hr.c: Made finding of novel doublesplices faster
* psl_introns.pl.in, psl_splicesites.pl.in: Moved print_exons into a
subroutine
* psl_introns.pl.in, psl_splicesites.pl.in: Using donor_okay_p and
acceptor_okay_p subroutines
* gtf_introns.pl.in: Changed warning message to refer to intron, not exon
* gtf_genes.pl.in: Removed unused variables
* Makefile.am, gff3_genes.pl.in, gff3_introns.pl.in, gff3_splicesites.pl.in:
Added programs to handle GFF3 files
* Makefile.dna.am: Removed programs for processing BAM files
* iitdef.h, iit-write.c: Restored NUMERIC_ALPHA_SORT to sort types
* stage1hr.c: Fixed issue with finding novel doublesplices where not all
middle segments were tested
2011-10-03 twu
* iitdef.h, iit-write.c, iit-write.h: Added code for creating IIT files
internally
* archive.html, index.html: Made changes for version 2011-10-01
2011-10-02 twu
* VERSION: Updated version number
2011-10-01 twu
* sequence.c: Removed compiler directive to undef HAVE_ZLIB
* stage1hr.c: Allowing reads to be equal to index1part+2, not just greater
than
* Makefile.dna.am, uniqscan.c: Added program uniqscan
2011-09-30 twu
* Makefile.dna.am, dynprog.c, dynprog.h, gmap.c, gsnap.c, splicetrie.c,
splicetrie.h, stage1hr.c, stage1hr.h, stage3.c, stage3.h: Moved
splicesites and trieoffsets into local static variables to avoid passing
them as parameters
* stage1hr.c: Moved check of genestrand value outside loop for retrieving
oligos
* substring.c: Created separate procedures for mark_mismatches for stranded
and non-stranded cases
* genome_hr.c, gsnap.c, indexdb.c, mode.h, stage1hr.c, substring.c:
Implemented stranded and non-stranded versions of cmet and atoi
substitutions
* trunk, VERSION, config.site.rescomp.prd, config.site.rescomp.tst, src,
compress.c, genome_hr.c, genome_hr.h, gsnap.c, mapq.c, mapq.h,
splicetrie.c, splicetrie.h, stage1hr.c, stage3hr.c, stage3hr.h,
substring.c, substring.h, util: Merged revisions 48527 to 48790 from
branches/2011-09-28-atoi
2011-09-29 twu
* stage1hr.c: Previously returning NULL if either end of a paired-end read
had no oligos. Now returning NULL if both ends have no oligos.
* stage3hr.c: Requiring that overlap be greater than both ends of a
paired-end read
2011-09-28 twu
* oligoindex_hr.c: Not computing oligoindex if left_plus_length <= left
* psl_introns.pl.in, psl_splicesites.pl.in: Added warning if intron lengths
are negative
* splicetrie_build.c: Added check for negative distances in a splicing_iit
file
* genome_hr.c: Fixed assignment to predicate inside of assert statement
2011-09-22 twu
* VERSION: Updated version number
* gsnap.c: Fixed typo in help statement
* table.c, uinttable.c: Allowing retrieval of keys to work, even if table is
empty
* substring.c: Allowing case where nothing is found in splicesites_iit,
because it is for introns and not splicesites
* splicetrie.c: Fixed case where splicetrie_obs is not NULL, but
splicetrie_max is NULL, which occurs if the splicing file is for introns,
and not splicesites
* splicetrie_build.c: Fixed memory leak in building splicetrie for introns
splicing file
2011-09-20 twu
* cappaths.c, spliceclean.c: Moved cappaths and spliceclean programs to
GSTRUCT repository
* Makefile.dna.am, genecompare.c: Moved program genecompare to GSTRUCT
repository
* Makefile.dna.am: Moved cappaths to GSTRUCT repository
* Makefile.dna.am, splicegene.c, spliceturn.c: Moved spliceturn and
splicegene to GSTRUCT repository
2011-09-16 twu
* gsnap.c: Provided further clarification in help statement about
min-localsplice-endlength
* gsnap.c: Checking that min_distantsplicing_end_matches is greater than or
equal to kmer size. Clarified some help statements.
* README: Added recommendation to use known splice sites, rather than known
introns
* README: Clarified that a given set of known splice sites can find
alternative splicing.
* except.c: Fixed Except_advance_stack to return a value if pthreads not
available
* Makefile.dna.am: Moved instructions for spliceclean and splicealt to
GSTRUCT repository
* psl_introns.pl.in: Removed extraneous "v" at beginning of line
2011-09-14 twu
* VERSION: Updated version number
* index.html: Updated page to show version 2011-09-14
* inbuffer.c, sequence.c, shortread.c, shortread.h: Revised read procedures
to handle multiple files correctly
2011-09-13 twu
* pair.c, pair.h, samprint.c: For SAM output of GMAP alignments, printing
correct value of NH for number of hits
* stage3.c, stage3.h, gmap.c, gsnap.c: Added parameter for min_intronlength
* Makefile.dna.am, bam_pileup.c, bam_tally.c, bamread.c, bamread.h,
gsnap_extents.c, gsnap_splices.c: Moved files to gstruct
* stage3.c: Reduced value of MIN_NONINTRON from 50 to 9 to avoid declaring
short introns as indels
* pair.c: Fixed Pair_print_sam to work properly for chimeric alignments
* stage3.c: Cleaning up gaps and indels from ends at end of stage 3
* stage1hr.c: Fixed debugging statement
2011-09-09 twu
* VERSION: Updated version number
* dynprog.c: Fixed computation of lband and rband in find_best_endpoint and
find_best_endpoint_to_queryend
* dynprog.c: Added protections against length1 being negative
2011-09-07 twu
* index.html: Made changes for 2011-09-07 version
* samprint.c, pair.c, pair.h, stage3.c: Removed almost all unused parameters
* outbuffer.c, stage3.c, stage3.h: Removed most unused parameters
* pair.c: Fixed compiler messages about comparison of signed and unsigned
ints
* compress.c, compress.h: Fixed compiler messages about comparison of
unsigned and signed ints
* svncl.pl: Made changes to fix merges of words between lines and preserve
original numbers of spaces
* VERSION: Updated version number
* shortread.c: For input error involving line that is too long, printing
accession where problem occurred.
* Makefile.am: Including full set of psl and gtf parsing programs with or
without fulldist
* genome.c: Added include line for genomicpos.h
* Makefile.dna.am: Included genomicpos.c and genomicpos.h for
extents_genebounds
* README: Made changes to reflect new gtf_introns program
* configure.ac, Makefile.am, gtf_introns.pl.in: Added gtf_introns program.
Also putting psl_introns back into the public distribution. Made changes
accordingly in README file.
* dynprog.c, dynprog.h, stage3.c: When searching for gappairs_alt using
probabilities, bounding the search based on a score threshold computed
from the original score
* stage3.c: For GSNAP, no trimming of noncanonical ends based on
probabilities, since need to compare fwd and rev directions. Stopped
final trimming of short end exons. For gappairs_alt, accepting if it
results in high-probability splice sites. For pick_cdna_direction, using
separate donor and acceptor scores and using alignment score again.
* stage3.c: Put final pass to find canonical introns before trimming of dual
breaks at ends
* stage3.c: Fixed problem with trimming dual breaks where it was trimming
indels. In trimming noncanonical exons at end, reduced NONCANONICAL_ACCEPT
from 20 to 15, and added NONCANONICAL_PERFECT_ACCEPT. In
pick_cdna_direction, turned off use of indel_alignment_score, and added
nmatches - nmismatches - 3*nindels with MATCHES_SIGDIFF after use of
totalintronscore.
* pair.c, dynprog.c: Counting nindels correctly near splice sites
* gtf_splicesites.pl.in: Allowing GTF file to use tag gene_id instead of
gene_name
2011-09-06 twu
* stage1hr.c: Fixed masking of oligos for cmet and atoi modes
* gsnap.c: Added 'S' flag to getopt command. Removed 'R' flag from getopt
and from help message.
* datadir.c: Fixed error message to remove the word default
* stage3.c: Peeling back 1 pair in a dual break (previously turned off) to
avoid having a gap on either side.
* dynprog.c, dynprog.h, stage3.c: Changed bridge_intron_gap to have an
explicit parameter for use_probabilities_p. Using indel_alignment_score
now in pick_cdna_direction.
* splicetrie.c, splicetrie.h: Providing contlength as parameter when making
3' splicejunctions
* genome.c, genome.h: Added functions for Genome_fill_buffer_blocks that do
not print final null at end of string, needed for making splicejunctions.
* dynprog.c, dynprog.h: Fixed creation of splicejunctions. For 5'
splicejunctions, not printing final null at end of string. For 3'
splicejunctions, printing distal sequence at splicejunction[contlength].
2011-09-02 twu
* atoiindex.c, cmetindex.c: Masking to oligomer indices to given index1part
size
* gmap.c: Fixed bug in pairalign where min_matches not be adjusted downward
to MIN_MATCHES
2011-09-01 twu
* stage3.c: Fixed bug where longer end of dual break was being trimmed, not
shorter end
* dynprog.c: Allow for cDNA insert of up to 3 bp at splice site
* dynprog.c, dynprog.h, gmap.c, gsnap.c, stage3.c: Introduced
--microexon-spliceprob flag and allowing microexons only if one of the
splice sites exceeds this value
* stage3.c: Always keeping gappairs if finalp is true (was forcep)
* dynprog.c: Always using splice site probabilities to find introns if
finalp is true, and allowing indels nearby
* stage3.c: Added a factor for dual break query jump to avoid dual breaks at
end with small query jumps
* gmap.c: Fixed indentation
* scores.h, pair.c: Added negative points in pathscore for non-canonical
intron or for when cdna direction is indeterminate
* chimera.c: Not allowing chimeric transition into a gap
* stage1hr.c: Using new interface to Stage3_compute
* gsnap.c: Using new interface to Stage3_setup
* gmap.c: Added --nosplicing flag. Fixed memory leak when matches <
min_matches.
* stage3.c, stage3.h: Made splicingp a static variable. Added a step to
remove dual breaks from the ends of an alignment.
* stage2.c: Turning off link back to grand_fwd_hit when splicingp is false
* stage2.c: Using macros for diffdist_penalty under splicing and
non-splicing cases
* stage2.c: Consolidated loops for use_shifted_canonical_p == true and ==
false. Removed compiler branches when SHIFT_EXTRA is not defined.
2011-08-31 twu
* stage3.c: In trimming noncanonical exons from end, requiring end intron to
be canonical and have donor prob or acceptor prob >= 0.9.
* chimera.c, chimera.h, pair.c, pair.h, stage3.c, stage3.h: Prohibiting
chimeric join at a query position containing a gap
* gsnap.c: Added comment in --help output about how to turn off terminal
alignments
* stage1hr.c: Using terminal_threshold in paired-end reads
* gsnap.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h: Replaced
terminal_penalty with terminal_threshold
2011-08-30 twu
* gmap.c: Fixed entry for --pairalign in long_options
* gmap.c: If min_matches exceeds MIN_MATCHES, use MIN_MATCHES
* outbuffer.c, pair.c, pair.h, stage3.c, stage3.h: In compressed output,
printing accession of usersegment instead of null dbversion
* gmap.c: Implemented --pairalign flag for aligning a pair of sequences via
stdin
* splicetrie_build.c: Fixed compiler warning about unused variable
* stage3.c: Improved debugging output
* genome.c, gmap.c, inbuffer.c, inbuffer.h, sequence.c, sequence.h:
Implemented --cmdline flag to align two sequences provided on the command
line
* shortread.c: Changed warning message when /1 or /2 endings not found
* access.c: Changed warning message for Macs
* stage3.c: Fixed add_querypos_offset, which previously excluded gap pairs
2011-08-29 twu
* genome.c: Fixed compiler warnings about comparing ints and unsigned ints
* indexdb.c: Printing genomesubdir and then individual index file names in
monitoring message
* indexdb.c: Using commas in initial monitoring message
* genome.c: Using commas in initial monitoring message. Allowing allocation
if mmap not available.
* gsnap.c: Modified messages about RNA-Seq and DNA-Seq
* stage1hr.c: Put GMAP modes on a single line in monitoring message
* indexdb.c: Restored debugging messages
* iit-read.c: Casting all size_t to (unsigned long) in error messages
* access.c: Added check for failure of fread, which happens with Macs on
large genomes
* pair.c: Fixed case of a negative distance in printing GMAP alignment
beyond chromosomal bounds
* substring.c, substring.h: Using new interface to Pair_print_gsnap
* segmentpos.c, indexdb_hr.c: Fixed compiler warning about comparing int and
unsigned int
* oligo.c: Commented out unused procedures for dibase
* stage1hr.c: Using new interfaces to Stage3end_remove_overlaps and
Pair_print_gsnap
* stage3hr.c, stage3hr.h: Removed unused parameters in
Stage3end_remove_overlaps
* pair.c, pair.h: Fixed bug in GSNAP standard output in printing GMAP
alignments beyond chromosomal bounds
* stage1hr.c: For finding GMAP mapping bounds using segments, checking
plus_nsegments and minus_nsegments > 0, rather than plus_segments and
minus_segments == NULL.
2011-08-27 twu
* VERSION: Updated VERSION
* pair.c: Switched from aaphase_g to aaphase_e, since aaphase_e correctly
codes for all three positions of the stop codon
* config.site: Removed references to samtools
* gsnap.c, shortread.c, shortread.h: Added flags --fastq-id-start and
--fastq-id-end. Stripping Illumina paired-end endings more intelligently.
2011-08-26 twu
* dynprog.c, dynprog.h, splicetrie.c, stage3.c: Removed some unused
parameters
* dynprog.c, dynprog.h, stage3.c: Removed code for INTRON_HELP
* gmap.c: Changed author list
* dynprog.c, dynprog.h, splicetrie.c, splicetrie.h: Removed references to
gbuffer
2011-08-25 twu
* gsnap.c: Fixed warning messages for -N and -s. In --help message,
notifying user that full pathnames are allowed.
* stage3.h: Providing accessor commands for chrnum, chrstart, and chrend
* stage3.c: Fixed bug in determining coordinates for Stage3_mergeable
* gmap.c: Modified some debugging statements for chimeras
2011-08-19 twu
* stage1hr.c: Adding at least querylength beyond distal mappingstart or
mappingend to obtain distal genomicstart or genomicend
* oligoindex.c: Added to size of genomicdiag by 1. Added assertion about
exceeding those bounds.
* gsnap.c: Added warning messages about interpretation of -N and -s flags
* genome_hr.c: Fixed bugs preventing program from compiling
* gmap_build.pl.in: Not providing -s flag to fa_coords
2011-08-16 twu
* archive.html, index.html: Updated for version 2011-08-15
* COPYING: Changed Developer
* gmap_setup.pl.in: Removed instructions about gmapdb_lc and gmapdb_lc_masked
* README: Added default value for MAX_READLENGTH
* README: Revised for new features
* indexdb.c: Added warning message if no gammaptrs file is produced
* gmapindex.c: Allocating an extra two chars in the offsets file names for
the basesize
* archive.html, index.html: Changes made for 2011-03-28 version
2011-08-15 twu
* fa_coords.pl.in: Limiting number of warning messages about duplicate
contigs
* atoiindex.c, cmetindex.c: Improved monitoring messages
* setup1.test.in, Makefile.am, setup.ref12123positions.ok,
setup.ref123positions.ok: Changed tests for new gamma file format
* atoiindex.c, cmetindex.c, indexdb.c: Handling special case where
index1part == basesize by not writing gammaptrs file and reading
offsetscomp file directly into offsets.
* atoiindex.c, cmetindex.c: Modified procedures to work with new compressed
offsets file format
* indexdb.c: Handling case where kmer == basesize
2011-08-14 twu
* gmap_setup.pl.in: Removed -S flag and added -s flag
* gmap_build.pl.in: Changed message to indicate that default order is chrom
order
* fa_coords.pl.in: Removed -S flag
* gmapindex.c: Made chrom sort order the default
* gmap.c, gsnap.c: Added --splicingdir flag
* gsnap_splices.c: Turned off warning messages about non-canonical splices
* bam_tally.c: Fixed warning message output going to stdout
* bam_tally.c: Fixed printing of print_allele_counts_simple. Allocating and
freeing tallies within parse_bam procedure for each chromosome.
* indexdb.c: Added missing closing brace
* gsnap_extents.c: Using find_strand procedure from gsnap_splices, which
trusts strand from SAM output
* spliceturn.c: Fixed eliminatep to be indexed by universal IIT index
2011-08-13 twu
* indexdb.c, indexdb.h, indexdb_hr.c, indexdbdef.h: Allowing backward
compatibility with pre-gamma genomic indices. Using littleendian and
bigendian versions of gamma procedures. Implemented more compatibility
with bigendian machines.
* genome_hr.c, genome_hr.h: Instead of allocated/mmapped versions of gamma
procedures, creating littleendian and bigendian versions
* spliceclean.c: Changed wording of monitoring messages from "Resolve" to
"Choose". Providing count information even when cannot choose between fwd
and rev.
* spliceturn.c: Printing monitoring message about number of splices
eliminated
* bam_tally.c: Improved warning messages for genotypes inconsistent with
reference allele
* atoiindex.c, cmetindex.c: Using offsetscomp instead of offsets in variable
names.
* snpindex.c: Fixed bug with freeing gammaptrs_filename and
offsetscomp_filename too early. Using offsetscomp instead of offsets in
variable names.
* indexdb.h: Removed unused procedures
* indexdb.c: Checking for rare case that ctr == 0 after all gammas read, and
not advancing ptr in that case
* bam_tally.c: Added Tally_T structure to simplify data structures and speed
up program
* bam_tally.c: Removed quality_score_constant. Handling empty quality
strings correctly.
* bamread.c: Handling empty quality strings correctly
* indexdb.c, indexdb_hr.c: Handling an offsetscomp_access condition that is
not possible, to eliminate compiler warnings
* gmap.c, stage1hr.c: Reduced value of minendexon from 12 to 9, since we are
using nmatches - nmismatches. Results in much better results at ends.
* stage3hr.c: Restored usage of score before nmatches in remove_overlaps
procedures
* stage3.c: For trimming at ends, using nmatches - nmismatchs to evaluate.
Fixed bug in pick_cdna_direction where value for cdna_direction not
assigned correctly. Indels for bad introns, using requiring each end
probability to be greater than 0.9, and taking alternate gappairs if
nmismatches is less.
2011-08-12 twu
* trunk, VERSION, config.site.rescomp.prd, config.site.rescomp.tst, src,
Makefile.dna.am, Makefile.gsnaptoo.am, access.h, atoiindex.c, cmetindex.c,
genome_hr.c, genome_hr.h, gmap.c, gmapindex.c, gsnap.c, iit-read.c,
indexdb.c, indexdb.h, indexdb_hr.c, indexdbdef.h, pmapindex.c, snpindex.c,
splicetrie_build.c, types.h, util, gmap_build.pl.in, gmap_setup.pl.in:
Merged revisions 44539 to 44852 from branches/2011-08-09-elias-gamma,
implementing gamma coding to represent offsets in genomic indices
2011-08-10 twu
* stage1hr.c: Put GMAP pairsearch back in front of distant splicing.
Characterizing GMAP pairsearch results according to quality, and updating
either nconcordant or nsalvage. Distant splicing done if nconcordant is
0. Terminals done if both nconcordant and nsalvage are 0.
* stage3.c: Not using nnoncanonical in pick_cdna_direction. Restored
assignment of SENSE_NULL if no canonical site is found.
* stage3hr.c: In Stage3end_remove_overlaps and Stage3pair_remove_overlaps,
using nmatches rather than score as the primary measure
* samprint.c, stage1hr.c, stage3hr.h: Introduced new hittype DISTANT_SPLICE
and not trying to do GMAP alignment on those
* stage1hr.c: Moved distant splicing ahead of GMAP pairsearch
* stage3hr.c: Added debugging information
2011-08-09 twu
* substring.c: Trimming terminals with fixed -3 mismatch score, while
allowing other ends to be controlled with user-specified
trim_mismatch_score.
* dynprog.c: Using probabilities to find splice sites only if finalp is true
2011-08-08 twu
* bamread.c: Added missing #endif statement
* stage3.c: Limiting SENSE_NULL only to ties between fwd and rev
* stage3.c: Preventing semicanonical splices at end from being trimmed.
Using product of probabilities to decide whether an indel is next to a bad
intron
* dynprog.c: If canonical intron cannot be found, using probabilities to
find best splice junction
* bamread.c, bamread.h: Added function Bamread_splice_strand
* samprint.c: Making cigar_noncanonical_splices_p true
* sequence.c, shortread.c: Made fixes in new handling of eoln situations
2011-08-07 twu
* stage3hr.c, stage3hr.h, substring.c, substring.h: Using runlength IIT for
resolving multiple mappings
* spliceturn.c: Sorting splices in order of observed counts, and eliminating
in that order
* snpindex.c: Fixed coordinates in error messages
* gsnap.c: Added option for using runlength IIT to resolve multiple mappings
* bam_tally.c: Added option for printing runlengths
* bam_tally.c: Printing genotype. Added --diffs-only flag.
2011-08-06 twu
* dynprog.c: Providing separate rewards for GC-AG and AT-AC introns, with
stronger reward for GC-AG.
* stage3.c: Not removing bad non-canonical exons at end. Computing combined
probability score for donor and acceptor splice sites.
* spliceturn.c: Fixed program to not depend on distinction between known and
new splices
* gsnap_splices.c: Added --minsupport flag
* gmap.c: Added break after case 'z'
* stage3hr.c: Turned off TALLY_RATIO, and checking instead for presence or
absence of overlap
* gsnap_splices.c: Added --mincount flag
* gsnap.c: Adding .iit to splicesites file when searching locally
* bam_pileup.c: Printing accessions at start and end of reads
* indexdb.c: Checking for full offsets_suffix, not just "offsets"
* dynprog.c, dynprog.h, stage3.c: Fixed bug in assigning the wrong value to
splicingp. For indels next to bad introns, checking the alternative to
see if it is free of mismatches.
* Makefile.dna.am, Makefile.gsnaptoo.am, atoiindex.c, cmetindex.c,
indexdb.c, indexdb.h, snpindex.c: Created a general Indexdb_get_filenames
procedure and using it for snpindex, cmetindex, and atoiindex, so they
work on all k-mer types
* stage3hr.c, substring.c, substring.h: Removed some unused parameters and
variables
* outbuffer.c, stage3.c, stage3.h: Removed some unused parameters
* stage3.c: Not calling pick_cdna_direction when splicingp is false
2011-08-05 twu
* bam_pileup.c: Initial import into SVN
* outbuffer.c: Restored Paths and Alignments sections to "gmap -4" output
(continuous by exon).
* VERSION: Updated version number
* stage3hr.c: In Stage3end_gene_overlap, initializing foundp
* iit-read.c: In IIT_gene_overlap, initializing allocp and freeing matches.
* gtf_genes.pl.in: Revised ends and starts for genes on minus strand.
* psl_genes.pl.in: Fixed 0-basis of starts. Revised ends and starts for
genes on minus strand.
* psl_genes.pl.in: Added headers for gene format
* configure.ac, Makefile.am: Revised set of files distributed
2011-08-04 twu
* Makefile.am, gtf_genes.pl.in, psl_genes.pl.in: Added psl_genes and
gtf_genes programs
* gmap.c, iit-read.c, stage1.c, stage1.h, gregion.c: Removed unused
parameters
* inbuffer.h: Removed parameter pc_linefeeds_p
* shortread.h: Removed braces
* sequence.c, shortread.c: Removed call to find_bad_char, since we are
checking for '\r' directly before '\n'
* sequence.c, shortread.c: Added checks so we don't read p[-1] when the
first character in the string is already '\n'
* sequence.c, shortread.c: Checking for carriage return before every line
feed
* gsnap.c, inbuffer.c, shortread.h: Removed --pc-lines option
* Makefile.am, dbsnp_iit.pl.in, fa_coords.pl.in, gmap_compress.pl.in,
gmap_process.pl.in, gmap_uncompress.pl.in, gtf_splicesites.pl.in,
md_coords.pl.in, psl_introns.pl.in, psl_splices.pl.in,
psl_splicesites.pl.in: Stripping CR-LF from input files
* gsnap.c, iit-read.c, iit-read.h, pair.c, pair.h, stage3hr.c, stage3hr.h,
substring.c, substring.h: Added option to favor multi-exon genes
* gmap.c, gsnap.c: Checking for valid int and float arguments
* stage3hr.c: Fixed bug in resolve_multimapping procedures
* stage3hr.c: Fixed bug in not initializing antistranded_penalty
* gsnap.c, iit-read.c, iit-read.h, pair.c, pair.h, stage1hr.c, stage3hr.c,
stage3hr.h, substring.c, substring.h: Added -g flag and genes_iit. Added
procedures for resolving multimapping using known genes and tally.
2011-08-03 twu
* gmap.c: Making call to Splicetrie_setup, so -s flag works for known splice
sites
* Makefile.dna.am, Makefile.gsnaptoo.am, Makefile.three.am, Makefile.am,
coords1.test.in, iit.test.in, setup1.test.in: Fixed "make check" so it
works for Cygwin on Windows, where copying of programs from src does not
work
* dynprog.c, dynprog.h, stage3.c: Computing splice site probabilities when
user genomic segment is provided
* stage3.c: In assign_gap_types, using known splicesites_iit to assign
splice site probabilities of 1.
* pair.c, pairdef.h, pairpool.c, stage3.c: In addition to trimming
noncanonical exons close to the end, trimming bad canonical exons close to
the end. Now computing splice site probabilities in assign_gap_types.
* dynprog.c: Checking for indel plus bad intron only when finalp is true,
because earlier passes may need some time to iterate to reach a final
solution.
* stage3.c: Checking for pairs being NULL before calling Pair_trim_ends
* gmap.c, pair.c, pair.h, stage1hr.c, stage3.c, stage3.h, stage3hr.c,
stage3hr.h: Using matches post-trim for deciding if the alignment has
sufficient quality, but using nmatches_pretrim for ranking and scoring
purposes.
2011-08-02 twu
* stage3.c: Turned final pass 6 back on, which was inadvertently turned off
* dynprog.c, pair.c, pair.h, pairdef.h, pairpool.c: Protecting pairs at end
against trimming by GMAP if they are found by splicetrie at known splice
sites
* stage3.c: Not running Dynprog_single_gap if queryjump or genomejump of a
dual break is equal to 1, since that just leads to two indels.
* pair.c: For trimming of ends, changed penalty for indel score from -6 to
-4.
* dynprog.c: Corrected coordinates for splice site probabilities and
dinucleotides. Disallowing indels near splice sites if either probability
is less than 0.9.
* stage3.c: Keeping non-canonical intron if there is sufficient exon
evidence at the end. Iterating trimming of non-canonical introns at the
end.
* pair.c: Printing "method:gmap" even if assertions are turned off
* dynprog.c: Checking probability of intron found by bridge_intron_gap and
discarding the solution if it finds both an indel and a bad intron.
* chrom.c, diag.c, diag.h, dynprog.c, gmap.c, oligoindex_hr.c, outbuffer.c,
pair.c, pair.h, smooth.c, stage1.c, stage1hr.c, stage2.c, stage3.c,
stage3.h: Removed various unused parameters
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst: Revised version
number
* stage1hr.c: Fixed debugging statements
* substring.c: Added a general test of goodness for a substring based on its
numbers of matches and mismatches.
* stage3.c: Restored trimming of non-canonical end exons, but removed code
that trimmed exons less than 20 bp when known splicing was available.
* iit-read.c, iit-read.h: Added function IIT_exists_with_divno_typed_signed
* stage1hr.c: Providing an allowance for GMAP score when want_high_quality_p
is true. Turning off use of score as primary criterion in choosing
splices, since it does not take advantage of known splicing.
* stage3.c: Not using nmismatches in pick_cdna_direction. If
splicesites_iit is available, assigning known splicesites a probability of
1.0.
2011-08-01 twu
* stage1hr.c, stage1hr.h: When antistranded_penalty has a value, using
mismatches plus penalty to decide between sense and antisense, rather than
using probabilities.
* stage3.c: Changed to_queryend_p to be true for distalmedial_ending, since
it is comparing alternatives.
* stage3hr.c, stage3hr.h: Adding antistranded_penalty to score
* stage3.c: In pick_cdna_direction, using presence of indels in combination
with bad intron
* stage1hr.c, stage1hr.h: Using antistranded penalty to determine to force
GMAP to look for antisense result. Requiring GMAP to be high quality in
all cases. Basing GMAP quality on nmismatches plus gap opens. Choosing
best result among all paired types, so concordant no longer predominates
over others.
* samflags.h: Added comment to show some common flags
* gsnap.c: Added parameter for antistranded penalty
* chrom.c, chrom.h, gmapindex.c, gsnap_extents.c, gsnap_splices.c,
gsnap_terms.c, iit-read.c, iit_store.c, iitdef.h, segmentpos.c,
segmentpos.h, spliceclean.c: Introducing new chrom sort in addition to
previous numeric_alpha sort
* chrom.c: Fixed bugs in parsing out initial "chr" from strings
2011-07-31 twu
* gsnap.c: Changed default for all gmap parameters from 2 to 3
* stage1hr.c: Added macros add_bounded and subtract_bounded to keep
computations within chromosomal bounds
* gmap.c, stage2.c, stage2.h: Created a specialized procedure for
score_querypos when splicing is true and no shifted canonicals are used
* oligoindex.c: Using only 8-mers for oligoindices_major in GSNAP for stage 2
* trunk, config.site.rescomp.tst, configure.ac, src, gsnap.c, stage1hr.c,
stage1hr.h, stage3hr.c, stage3hr.h, util, gmap_build.pl.in,
gtf_splicesites.pl.in, psl_splicesites.pl.in: Merged revisions 44034 to
44047 from branches/2011-07-31-gmap-then-terminals
* stage1hr.c: For GMAP halfmapping, when overlap is found, take widest
possible starting point between the overlap calculation and the normal
calculation
* stage3.c: Added gappairs in debugging statements
* shortread.c, shortread.h, stage1hr.c: In computing GMAP halfmapping,
checking for existence of primers and extending GMAP region if they exist
* stage3hr.c, stage3hr.h: Added function Stage3end_best_score_paired
* stage1hr.c: Running GMAP pairsearch only on ends with scores better than
those already paired
* gmap.c, gsnap.c: Printing arguments before they are parsed
* stage2.c: When shifted_canonical_p is false, not computing rev scores,
since they are the same as the fwd scores
* stage2.c: Improved debugging statements
* gmap.c, stage1hr.c, stage2.c, stage2.h, stage3.c: Added parameter
use_shifted_canonical_p. Using now only for cross-species alignment in
GMAP.
* stage3hr.c, stage3hr.h: Added function Stage3pair_sort_bymatches. For
substitutions with 0 mismatches, classifying hit as EXACT rather than SUB,
so duplicates are eliminated properly.
* stage1hr.c, stage1hr.h: Allowing multiple concordant results to undergo
GMAP improvement, up to max_gmap_improvement. Sorting results by matches
before GMAP improvement.
* gsnap.c: Introduced separate parameter for max_gmap_improvement. Hid
pairexpect and terminal-penalty parameters.
2011-07-30 twu
* stage1hr.c: Fixed errors in find_terminals with analysis of mismatches and
use of floors
* stage1hr.c: Requiring high quality GMAP unpaired method when we observe a
paired toolong alignment
* gsnap.c: Made halfmapping,unpaired,improve the default GMAP method
* stage3hr.c, stage3hr.h: Scoring GMAP based on total number of matches.
Implemented Stage3end_best_score.
* stage1hr.c: Using new test based on Stage3end_best_score to avoid
terminals in anticipation of GMAP halfmapping. Allowing poor quality GMAP
in GMAP improvement. Filtering results by optimal score before GMAP
improvement using an infinite cutoff.
* stage3.c: In pick_cdna_direction, allowing bad-scoring canonical introns
to determine sense
* Makefile.dna.am, Makefile.gsnaptoo.am, gmap.c, sense.h, stage1hr.c,
stage3.c, stage3.h, stage3hr.h: Created new header file sense.h.
Returning sensedir from pick_cdna_direction and Stage3_compute.
* gsnap.c, stage1hr.c, stage1hr.h: Introduced separate parameter for
trigger-score-for terminals
* stage1hr.c: Turned off option to avoid terminals if halfmapping
anticipated later
* stage3.c: Increased threshold for trying microexon from 0 acceptable
mismatches to 2 in high quality sequences
* stage1hr.c: Applying Stage3pair_optimal_score after GMAP improvement step
* pair.c, pair.h, samprint.c, stage3hr.c, substring.c, substring.h:
Implemented computation of MAPQ scores for GMAP alignments
* mapq.c, mapq.h: Moved some constants to mapq.h, so pair.c can access them
* gsnap.c, stage1hr.h: Changed name of GMAP method from concordant_uniq to
improvement
* stage1hr.c: Conducting search for terminals even if gmap_halfmapping_p is
true, if both single ends have no hits so far
* splicetrie.c, splicetrie.h: Checking internal exon region at splice site
before doing a search for a short-end splice.
* oligoindex_hr.c: Removed assertions about sequencepos
* stage1hr.c: In using segments to bound GMAP region, looking at querypos5
and querypos3 to help decide whether to extend mapping region by
shortsplicedist.
* stage3.c: In pick_cdna_direction, no longer using indel_alignment_score
* gsnap.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h: Introducing
separate parameters for GMAP methods. Sorting singlehits5 and singlehits3
before running GMAP halfmapping/unpaired.
* gmap.c, gsnap.c, oligoindex_hr.c, oligoindex_hr.h, stage1hr.c, stage1hr.h,
stage2.c, stage2.h, stage3.c: Merged revisions 43984 to 43994 from
branches/2011-07-30-oligoindex-mapping-region to introduce mappingstart
and mappingend in addition to genomicstart and genomicend for finding
oligoindex mappings in stage 2
* pair.c: In trimming, counting an indel only as a single mismatch
* stage3.c: Turning off the step to remove noncanonical introns, since it
fails in some cases
* stage1hr.c, stage2.c: Not doing terminals if GMAP pairsearch or
halfmapping is available
* stage2.c: Ignore log message for 43986. Correct log message should be:
Introduced NEAR_END_LENGTH to define ends of reads where we can ignore
EXON_DEFN.
* gsnap.c: Made halfmapping,cuniq the default for gmap-mode.
* gmap.c: Reduced values of CHIMERA_SLOP
2011-07-29 twu
* stage3hr.c: For single-end reads, favoring non-ambiguous alignments over
ambiguous ones when they have identical genomicstarts or genomicends.
* dynprog.c: In Dynprog_end5_known and Dynprog_end3_known, if extension not
found to query end and if ambiguous splicing not found, then doing dynamic
programming again to best end, rather than to query end.
* pair.c, pair.h, stage3.c: Trimming of ends of GMAP alignments
* stage1hr.c: Allowing very poor alignments to be reported by GMAP method,
now that we have trimming at ends
* gsnap.c, stage1hr.c, stage1hr.h: Avoiding duplication of chr marker
segments and chr marker segment at the beginning. Passing chromosome_iit
and nchromosomes into Stage1hr_setup.
2011-07-28 twu
* stage1hr.c: Using binary search on segments to bound region for GMAP
alignments
2011-07-27 twu
* stage1hr.c: Created debugging version that identifies plus_segments and
minus_segments within GMAP halfmapping region
* stage2.c: Re-using result of find_shifted_canonical when leftpos remains
the same
* gsnap.c: Made default for GMAP halfmapping with 2 candidates
* genome_hr.c: Added comments in splicesite_positions that offset - 1 is
verified
* stage1hr.c: Fixed issues with fast_level and cutoff_level for very short
reads, where fast_level < 0
* stage1hr.c: Made plus_segments, plus_nsegments, minus_segments, and
minus_nsegments fields within the Stage1_T object, so they can be used
later in limiting range for GMAP alignments
* genome_hr.c, genome_hr.h, stage2.c: Implemented lookup as needed of the
previous dinucleotide from the genomic blocks
2011-07-26 twu
* stage1hr.c: Added chr marker segments to indicate boundaries between
chromosomes and simplify inner loops for localsplicing and indels
* Makefile.dna.am: Removed gbuffer.c and gbuffer.h from file lists
* stage3.c, stage3.h: Multiple revisions to pick_cdna_direction
* stage2.c: Requiring exon length > EXON_DEFN for canonical splicing only in
middle of queryseq
* gsnap.c, stage1hr.c, stage1hr.h: Allowing control over individual gmap
modes
* chimera.c, dynprog.c, dynprog.h, extents_genebounds.c, genome.c, genome.h,
get-genome.c, gmap.c, gsnap_tally.c, iit_plot.c, match.c, pair.c,
sequence.c, sequence.h, splicetrie.c, splicetrie.h, splicing-score.c,
substring.c: Making complement in place. Using new interface to
Genome_get_segment.
2011-07-21 twu
* Makefile.am, gtf_splicesites.pl.in: Added gtf_splicesites program
* config.site.rescomp.tst, VERSION: Updated version number
* indexdb.c: Fixed error message for PMAP
* trunk, config.site.rescomp.tst, src, gsnap.c, indexdb.c, outbuffer.c,
outbuffer.h, resulthr.c, resulthr.h, splicealt.c, splicetrie.c,
splicetrie.h, splicetrie_build.c, splicetrie_build.h, stage1hr.c,
stage1hr.h, stage3hr.c, stopwatch.c, substring.c, util: Merged revisions
43003 to 43362 from branches/2011-07-15-fast-knownsplices in creating
splicecomp and creating specialized procedures for find_spliceends for
shortends and distant splicing
* stage2.c: Fixed bug in older canonical dinucleotides procedure that
overwrite -1 values in initial positions
* stage1hr.c: Revised comments
* Makefile.dna.am: Added command for splicealt
* psl_splicesites.pl.in, psl_introns.pl.in: Fixed warning messages
* gmap_build.pl.in: Added initial definition for -B flag
2011-07-16 twu
* stage2.c: Fixed problem in get_last, needed by find_shifted_canonical,
when first few positions lack a value of -1.
2011-07-13 twu
* VERSION: Revised version number
* genome-write.c: Added fix for genomes in PC-DOS file format
* trunk, VERSION, config.site.rescomp.prd, src, atoiindex.c, cmetindex.c,
gmap.c, gmapindex.c, gsnap.c, indexdb.c, oligo.c, stage1.c, stage1hr.c,
stage1hr.h, stage3.c, stage3hr.c, stage3hr.h, util, gmap_build.pl.in,
gmap_setup.pl.in: Merged revisions 42540 through 42858 from
branches/2011-07-08-index-14mers to handler 13-mers and 14-mers, and to
fix various bugs
2011-07-08 twu
* samprint.c: Using new interface to Pair_print_sam
* outbuffer.c, pair.c, pair.h, stage3.c, stage3.h: Using usersegment
accession in SAM and GFF3 output when -g flag is specified.
* goby.h, gsnap.c, outbuffer.c, shortread.c, shortread.h, stage3hr.c,
stage3hr.h, substring.h: Merged external changes for Goby from 2011-07-01
and 2011-07-08
* Makefile.gsnaptoo.am: Added files for oligoindex_hr.c and oligoindex_hr.h
* goby.c: Applied external patch from 2011-07-08
* trunk, VERSION, config.site.rescomp.tst, src, Makefile.dna.am,
genome_hr.c, genome_hr.h, gmap.c, gsnap.c, oligoindex.c, oligoindex.h,
oligoindex_hr.c, oligoindex_hr.h, stage1hr.c, stage1hr.h, stage2.c,
stage2.h, stage3.c, stage3hr.c, stage3hr.h, util: Merged revisions 42282
to 42511 from branches/2011-07-05-oligoindex-hr to speed up GMAP
oligoindex and provide options for GMAP
* oligoindex.c: Stop initializing all diagonals. Initialize only when
needed.
* splice-sites-hr.pl: Added -S flag for semicanonical splice sites
* indexdb.c: Fixed Indexdb_new_genome for PMAP
* gmap.c: Changed default index1part for PMAP
2011-07-07 twu
* splice-sites-hr.pl: Initial import into SVN
2011-07-05 twu
* shortread.c: Allows for non-digit, then "1" or "2" for paired-end reads
* dynprog.c: Added protection against negative genomic coordinates in
dynamic programming at 5' and 3' ends
* shortread.c: Modified error statement
* stage1hr.c: Adding querylength to genomicbound in cases where no overlap
is found, just in case the overlap is not found.
* stage1hr.c: Requiring high quality when aligning concordant unique hits
with GMAP
2011-07-04 twu
* stage3.c: Trimming end exons if less than 20 bp and known splicing is
available
* stage1hr.c: Improved debugging statements for GMAP alignments
* splicetrie.c: Counting mismatches correctly, without penalty, when known
splicing exceeds observed distances
* gsnap.c, stage3hr.c, stage3hr.h: Adding ambiguous matches to nmatches when
favor_ambiguous_p is true, which happens if we have known splicing without
observed distances
* dynprog.c, dynprog.h: Implemented find_best_endpoint_to_queryend
* stage3.c: Trimming semi-canonical exons at end also
* splicetrie.c: Fixed bug in failing to push coordinates for ambiguous cases
* pair.c: Fixed symbol for rev semi-canonical pair in debugging output
* iit_store.c: Improved error message when parsing coords
* iit_store.c: Printing entire problematic line when a parsing error occurs
2011-07-03 twu
* stage3.c: Revised pick_cdna_direction to count semicanonical splices, and
to use nmatches - nmismatches
2011-07-02 twu
* stage3hr.c: Fixed bug in always eliminating second hit in
Stage3end_remove_overlaps
2011-07-01 twu
* Makefile.am, setup.ref123positions.ok, setup.ref3positions.ok,
setup1.test.in, setup2.test.in: Made changes for renaming of ref3positions
to ref123positions
* gmap.c, gsnap.c, stage2.c, stage2.h: Added setup for stage 2 to handle
GMAP alignments without splicing
* pair.c: Putting "0M" between adjacent deletion and insertion in CIGAR
string
* gsnap.c, stage1hr.c, stage1hr.h: Providing --allow-gmap flag to control
whether GMAP alignments are allowed
* stage3.c: Adding gap holders before nonconcordant exon trimming. Checking
if nonconcordant end exon is less than halfway from end. In picking cDNA
direction, checking only for presence or absence of nonconcordant or
concordant introns.
* stage2.c: Put back diffdist_penalty
* stage1hr.c: Using better genomicbounds when running GMAP alignments.
Introduced check for very bad GMAP alignments (nmatches < querylength/2).
Comparing GMAP against original hit for aligning concordant uniques with
GMAP. Trying top hits with GMAP to find concordant pairs, rather than
checking to see if the total number of hits is less than a threshold.
* pair.c, pair.h, shortread.c, stage3hr.c, stage3hr.h, substring.c,
substring.h: Added procedures for computing better genomicbounds on GMAP
alignments, based on overlap between the paired ends
* gsnap.c, splicetrie.c, splicetrie.h: Added option for amb_closest_p
behavior, where shortest intron among ambiguous ones is picked
* genome-write.c: Added index1part as parameter to some procedures
2011-06-30 twu
* stage3hr.c: Allowing terminal ends to win on the basis of nmatches,
instead of score
* stage3.c: Added pass 6a to remove noncanonical end exons
* stage2.c: Restored NINTRON_PENALTY_MISMATCH from 4 to 8
* stage1hr.c: Generalized minimum querylength for one-miss algorithm to
handle 15-mers. Order is now GMAP, terminal 1, terminal 2, and distant
splicing, with terminals done if found_score > trigger_score_for_gmap.
* gmap.c, gsnap.c, iit-read.c, iit-read.h, outbuffer.c, outbuffer.h: Added
flags for read group library and platform in SAM headers
* compress.c, compress.h, genome-write.c, genome-write.h, gmapindex.c:
Allowing for 15-mer genomic indices when writing an uncompressed genome
using a file
* spanningelt.c: Generalized from 12-mers to 15-mers
2011-06-29 twu
* gmap_build.pl.in: Fixed installation for 12-mer and 15-mer indices
* gmap.c, gsnap.c: Added -B 5 option to allocate offsets file
* trunk, src, Makefile.dna.am, access.h, atoiindex.c, block.c, block.h,
cmetindex.c, gdiag.c, gmap.c, gmapindex.c, gsnap.c, indexdb.c, indexdb.h,
indexdb_dump.c, indexdb_hr.c, indexdb_hr.h, indexdbdef.h, oligo-count.c,
oligo.c, oligo.h, oligop.c, oligop.h, outbuffer.c, pmapindex.c,
snpindex.c, spanningelt.c, splicetrie.c, stage1.c, stage1.h, stage1hr.c,
stage1hr.h, stage2.c, stage3.c, stage3.h, stage3hr.c, stage3hr.h,
substring.c, substring.h, util, gmap_build.pl.in, gmap_setup.pl.in: Merged
revisions 41633 to 41936 from branches/2011-06-22-index-15mers to allow
for 15-mers in genomic indices
2011-06-24 twu
* stage1hr.c: Computing query_compress if necessary before deciding whether
to perform halfmapping GMAP on concordant unique
* pair.c: Fixed bug in printing start: instead of end: when endtype2 is END
2011-06-22 twu
* trunk, config.site.rescomp.prd, config.site.rescomp.tst, src, stage1hr.c,
util: Merged revisions 41618 to 41631 from
branches/2011-06-22-gmap-earlier to move GMAP algorithm before distant
splices
* dynprog.c, dynprog.h, gmap.c, gsnap.c, iit-read.c, intron.c, intron.h,
stage1hr.c, stage1hr.h, stage3.c, stage3.h, stage3hr.c, stage3hr.h: Merged
revisions 41516 to 41609 from branches/2011-06-14-terminals to try GMAP up
to 5 times before terminals if no concordant matches can be found.
2011-06-21 twu
* dynprog.c, dynprog.h, gmap.c, iit-read.c, iit-read.h, maxent_hr.c, pair.c,
pair.h, splicetrie.c, splicetrie.h, stage1hr.c, stage2.c, stage3.c,
stage3.h, stage3hr.c, stage3hr.h, substring.c, substring.h: Merged
revisions 41410 to 41516 from branches/2011-06-14-terminals. Applying GMAP
for halfmapping multiple and unpaired unique results. Allowing ambiguous
splice ends for GMAP alignments. Fixed computation of splice junctions.
Allowing canonical exons to be found for short exons in GMAP alignments.
Improved pick_cdna_direction. Made fixes to GSNAP output for GMAP
alignments.
2011-06-18 twu
* src, stage3hr.c: Merged change from branches/2011-06-14-terminals to check
for subsumption in paired alignments
2011-06-17 twu
* samprint.c, stage1hr.c, stage3hr.c, stage3hr.h, substring.c, substring.h:
Merged revisions 41223 to 41410 from branches/2011-06-14-terminals to use
lexicographic comparison in Stage3end_remove_duplicates; to prevent known
splicing from extending past chromosomal bounds; to prevent end indels
from going past right end of chromosome; to iterate through both
mismatch_positions_cont and mismatch_positions_shift simultaneously in end
indel procedures; and to compute nmatches over entire substring.
2011-06-14 twu
* genome.c: Allowing fill_buffer_simple procedures to fill past the left end
of the genome
* splicetrie.c, splicetrie.h, stage1hr.c: Preventing short-overlap splicing
from going past chromosomal boundaries
* VERSION: Updated version
* gmap.c: Increased default chimera margin from 20 to 40
* trunk, src, gmap.c, stage3.c, stage3.h, util: Merged revisions 41185 to
41214 from branches/2011-06-13-gmap-merge. Merging chimeric parts when
possible.
* trunk, config.site.rescomp.tst, src, maxent.c, stage3hr.c, util: Merged
revisions 40955 to 41210 from releases/internal-2011-06-09
* trunk, src, dynprog.c, stage1hr.c, stage3hr.c, stage3hr.h, util: Merged
revisions 40682 to 41210 from releases/internal-2011-06-06
* stage1hr.c: Turning on NEW_TERMINALS branch
2011-06-13 twu
* shortread.c, shortread.h: Added function Shortread_find_overlap
2011-06-12 twu
* stage1hr.c: Added hooks for computing terminals at zero penalty if
necessary
2011-06-10 twu
* stage3hr.c: Allowing use of Stage3end_substringD and Stage3end_substringA
by half splices
* samprint.c, stage3hr.c, substring.c, substring.h: Merged changes from
releases/internal-2011-06-09 to change internal representation of splice
from substring1 for donor and substring2 for acceptor. Now substring1 and
substring2 are in query order. Needed to avoid problems when a splice was
labeled both as donor/acceptor and as acceptor/donor.
* gsnap.c: Checking for value of adapter stripping flag. Turning on adapter
stripping by default.
2011-06-09 twu
* gmap.c, gsnap.c: Joining worker threads instead of detaching them, so
Inbuffer_free can be called safely
* gmap.c, stage3.c, stage3.h: Merge chimeric parts into a single continuous
alignment if possible.
* VERSION: Updated version
* gmap.c: Allowing chimera switchpoint to occur one base pair earlier
* chimera.c: Improved debugging statements
* genome.c: Made Genome_fill_buffer refer to its local genome argument, not
the global one. Needed to fix a bug in snpindex.
* Makefile.dna.am, Makefile.gsnaptoo.am, compress.h, dynprog.h, outbuffer.c,
pair.c, pair.h, pairpool.h, samprint.c, splicetrie.h, splicetrie_build.h,
stage3.c, stage3.h: Printing XT flag for translocation information in SAM
output of both GMAP and GSNAP
* chimera.c, chimera.h, gmap.c: Changed algorithm for finding chimera
boundary in GMAP to maintain best number of mismatches, and then to find
highest splice site probabilities within that range
2011-06-08 twu
* gsnap.c: Fixed mode flag to take a required argument
* gmap.c, outbuffer.c, outbuffer.h: Added .transloc split output file for
GMAP
* gmap.c: Hiding -s flag from help output
* gmap.c, stage3.c, stage3.h: Revising pairarray genomepos coordinates of
GMAP chimeras to be chromosomal coordinates, so SAM output is correct
* pair.c: Implemented hard clipping in SAM output for GMAP chimeras
2011-06-06 twu
* samprint.c: Made the sign of insertlength for a translocation depend on
the concordant substring
* VERSION: Updated version number
* stage3hr.c: Fixed computation of pair_insert_length when there is no
overlap
* stage1hr.c: Added debugging statements
* stage1hr.c: Corrected privatep flags and memory freeing for halfmapping
unique cases solved by GMAP.
* outbuffer.c: Corrected argument list for GMAP when MEMUSAGE is turned on
* stage3hr.c, stage3hr.h: Added function Stage3end_effective_chrnum
* samprint.c: For mates that are translocations, using the effective chrnum
in printing the mate location
* resulthr.c: Fixed bug in assigning UNPAIRED_TRANSLOC category
2011-06-05 twu
* stage1hr.c: Added checks for new pair from Stage3pair_new being NULL
* stage3.c: Placed a limit on iterations of building ends using known
splicing
* trunk, src, util: Merged property changes on subdirectories from
branches/2011-06-04-gmap-genomicseg
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst: Updated version
number
* dynprog.c, gmap.c, pair.c, stage1hr.c, stage3.c, stage3hr.c, stage3hr.h:
Merged changes from revision 40649 to 40663 from
branches/2011-06-04-gmap-genomicseg to provide correct bounds on GMAP
alignment in GSNAP and to improve various issues in alignment, including
close indels
2011-06-03 twu
* stage3hr.c: Changed paired_seenp back to paired_usedp in
Stage3end_remove_duplicates, because not all pairs are seen in previous
pair_up procedures, resulting in a fatal bug.
* config.site.rescomp.prd, config.site.rescomp.tst, VERSION: Updated version
number
* gmap.c, splicetrie.c, splicetrie.h, stage1hr.c, stage3.c, stage3.h: Making
use of jump_late_p
* dynprog.c, dynprog.h: Added provision for jump_late_p. Fixed issue in
jump_penalty where it was not consistent with jump_penalty_init. Now both
procedures compute extend*length.
2011-06-02 twu
* pair.c: Fixed printing of dashes in GSNAP standard output
* dynprog.c, dynprog.h, gmap.c, stage1hr.c, stage3.c, stage3.h: Allowing
close combinations of insertions and deletions, by allowing onesidegapp to
be false and letting extraband_single equal 3 instead of 0. Controlled by
--allow-close-indels flag in GMAP. Default set to be on in GMAP and in
GSNAP.
* stage1hr.c: Not performing Stage3end_remove_duplicates on exact matches,
which should not be necessary.
* stage3hr.c: In Stage3end_remove_duplicates, checking against paired_seenp,
instead of paired_usedp, for speed.
* stage3hr.c: Reverted back to revision 40489 of Stage3_pair_up_concordant,
which does has the pairing procedures inline.
* stage3hr.c: Attempt to have different lists for old and new hits, but this
seems to slow down the program.
2011-06-01 twu
* stage3hr.c: Moved parts of Stage3_pair_up_concordant into separate
procedures
* stage1hr.c, stage3hr.c, stage3hr.h: Performing GMAP on concordant unique
results where one end is of type TERMINAL
* gsnap.c: Changed default indel penalty from 1 to 2
* stage3hr.h: Formatting change
* stage3hr.c, substring.c, substring.h: Using nchimera_novel in
Stage3end_remove_overlaps
* stage3hr.c: In Stage3pair_remove_overlap, favoring longer insert lengths,
if all else is equal
* pair.c: Moved MD string for SAM output before NH tag to be consistent with
other GSNAP SAM output
* stage1hr.c, stage3hr.c, stage3hr.h, substring.c, substring.h: Making a
distinction between Stage3end_remove_duplicates and
Stage3end_remove_overlaps
* stage3hr.c: Reverted to old method of finding pair insert length, where
all substrings are checked.
* stage1hr.c: In pairing algorithm, moved short-overlap splicing and distant
splicing into a single singlesplicing class, so duplicates are handled
properly.
* gsnap.c: Added documentation for --use-tally flag
2011-05-30 twu
* inbuffer.c, inbuffer.h: Changed nspaces and nread to unsigned int
* gmap.c, gsnap.c, outbuffer.c, outbuffer.h: Made output buffer size a
user-definable parameter
* gmap.c, gsnap.c, outbuffer.c, outbuffer.h: Made more changes to output
thread. Made noutput a local variable. Clearing backlog in ordered output
when necessary.
* stage1hr.c: Added a dinucleotide check for repetitive sequences
* gmap.c, gsnap.c, mem.c, mem.h, outbuffer.c, request.c, result.c,
resulthr.c, sequence.c, shortread.c, stage1hr.c, stage1hr.h, stage3.c,
stage3hr.c, substring.c: Replaced LEAKCHECK system with MEMUSAGE system
* list.c, list.h: Added specialized procedures for using specific memory
pools for memusage
* inbuffer.c: Added comment
* diagpool.c, diagpool.h, pairpool.c, pairpool.h: Added procedures for
reporting memory usage. Using memory from keep portion.
* outbuffer.c: Cleaned up pthread code for output thread. Added MAXQUEUE to
clear out outbuffer.
2011-05-28 twu
* config.site.rescomp.tst: Added -Wextra to CFLAGS
* gsnap.c, stage3hr.c: Eliminating hitpair duplicates based on hittypes of
ends. Allowing MAPQ score to go as high as 96.
2011-05-27 twu
* gsnap.c, stage1hr.c, stage3hr.c, stage3hr.h: Implemented mapq-unique-score
* internal-2011-02-27, AUTHORS, COPYING, INSTALL, MAINTAINER, Makefile.am,
NEWS, README, VERSION, acinclude.m4, bootstrap.dna, bootstrap.gmaponly,
bootstrap.gsnaptoo, bootstrap.pmaptoo, bootstrap.three, config,
acx_mmap_fixed.m4, acx_mmap_variable.m4, acx_pthread.m4, builtin.m4,
config.guess, config.sub, expand.m4, fopen.m4, ltmain.sh,
madvise-flags.m4, mmap-flags.m4, pagesize.m4, perl.m4, struct-stat64.m4,
config.site, config.site.rescomp.prd, config.site.rescomp.tst,
configure.ac, dev, maint, memory-check.pl, share, archive.html,
index.html, src, Makefile.dna.am, Makefile.gmaponly.am,
Makefile.gsnaptoo.am, Makefile.pmaptoo.am, Makefile.three.am,
Makefile.util.am, access.c, access.h, add_rpk.c, assert.c, assert.h,
atoi.c, atoi.h, atoiindex.c, backtranslation.c, backtranslation.h,
bam_tally.c, bamread.c, bamread.h, bigendian.c, bigendian.h, block.c,
block.h, bool.h, boyer-moore.c, boyer-moore.h, cappaths.c, changepoint.c,
changepoint.h, chimera.c, chimera.h, chop_primers.c, chrnum.c, chrnum.h,
chrom.c, chrom.h, chrsegment.c, chrsegment.h, chrsubset.c, chrsubset.h,
cmet.c, cmet.h, cmetindex.c, color.c, color.h, comp.h, complement.h,
compress.c, compress.h, convert.t.c, cum.c, datadir.c, datadir.h, datum.c,
datum.h, diag.c, diag.h, diagdef.h, diagnostic.c, diagnostic.h,
diagpool.c, diagpool.h, dibase.c, dibase.h, dibaseindex.c, doublelist.c,
doublelist.h, dynprog.c, dynprog.h, except.c, except.h, exonscan.c,
extents_genebounds.c, fopen.h, gbuffer.c, gbuffer.h, gdiag.c,
geneadjust.c, genecompare.c, geneeval.c, genome-write.c, genome-write.h,
genome.c, genome.h, genome_hr.c, genome_hr.h, genomepage.c, genomepage.h,
genomeplot.c, genomicpos.c, genomicpos.h, genuncompress.c, get-genome.c,
getopt.c, getopt.h, getopt1.c, gmap.c, gmapindex.c, goby.c, goby.h,
gregion.c, gregion.h, gsnap.c, gsnap_best.c, gsnap_concordant.c,
gsnap_extents.c, gsnap_fasta.c, gsnap_filter.c, gsnap_iit.c,
gsnap_multiclean.c, gsnap_splices.c, gsnap_tally.c, gsnap_terms.c,
gsnapread.c, gsnapread.h, hint.c, hint.h, iit-read.c, iit-read.h,
iit-write.c, iit-write.h, iit_dump.c, iit_fetch.c, iit_get.c,
iit_pileup.c, iit_plot.c, iit_store.c, iit_update.c, iitdef.h, inbuffer.c,
inbuffer.h, indexdb.c, indexdb.h, indexdb_dibase.c, indexdb_dibase.h,
indexdb_dump.c, indexdb_hr.c, indexdb_hr.h, indexdbdef.h, interval.c,
interval.h, intlist.c, intlist.h, intlistdef.h, intpool.c, intpool.h,
intron.c, intron.h, lgamma.c, lgamma.h, list.c, list.h, listdef.h,
littleendian.c, littleendian.h, mapq.c, mapq.h, match.c, match.h,
matchdef.h, matchpool.c, matchpool.h, maxent.c, maxent.h, maxent_hr.c,
maxent_hr.h, md5-compute.c, md5.c, md5.h, mem.c, mem.h, memchk.c, mode.h,
nmath.c, nmath.h, nr-x.c, nr-x.h, oligo-count.c, oligo.c, oligo.h,
oligoindex.c, oligoindex.h, oligop.c, oligop.h, orderstat.c, orderstat.h,
outbuffer.c, outbuffer.h, pair.c, pair.h, pairdef.h, pairingcum.c,
pairingflats.c, pairinggene.c, pairingstrand.c, pairingtrain.c,
pairpool.c, pairpool.h, parserange.c, parserange.h, pbinom.c, pbinom.h,
pdl_smooth.c, pdldata.c, pdldata.h, pdlimage.c, plotdata.c, plotdata.h,
plotgenes.c, plotgenes.h, pmapindex.c, random.c, random.h, rbtree.c,
rbtree.h, rbtree.t.c, reader.c, reader.h, reads.c, reads.h, reads_dump.c,
reads_store.c, request.c, request.h, result.c, result.h, resulthr.c,
resulthr.h, revcomp.c, samflags.h, samprint.c, samprint.h, samread.c,
samread.h, scores.h, segmentpos.c, segmentpos.h, segue.c, separator.h,
seqlength.c, sequence.c, sequence.h, shortread.c, shortread.h, smooth.c,
smooth.h, snpindex.c, spanningelt.c, spanningelt.h, spliceclean.c,
spliceeval.c, splicefill.c, splicegene.c, splicegraph.c, splicescan.c,
splicetrie.c, splicetrie.h, splicetrie_build.c, splicetrie_build.h,
spliceturn.c, splicing-scan.c, splicing-score.c, stage1.c, stage1.h,
stage1hr.c, stage1hr.h, stage2.c, stage2.h, stage3.c, stage3.h,
stage3hr.c, stage3hr.h, stopwatch.c, stopwatch.h, subseq.c, substring.c,
substring.h, table.c, table.h, tableint.c, tableint.h, tableuint.c,
tableuint.h, tally.c, tally.h, tally_exclude.c, tally_expr.c, tallyadd.c,
tallyflats.c, tallygene.c, tallyhmm.c, tallystrand.c, translation.c,
translation.h, trial.c, trial.h, types.h, uintlist.c, uintlist.h,
uinttable.c, uinttable.h, svncl.pl, tests, align.test.in, align.test.ok,
coords1.test.in, coords1.test.ok, defs, fa.iittest, iit.test.in,
iit_get.out.ok, iittest.iit.ok, map.test.ok, setup.genomecomp.ok,
setup.idxpositions.ok, setup.ref3positions.ok, setup1.test.in,
setup2.test.in, ss.chr17test, ss.her2, util, dbsnp_iit.pl.in,
ddsgap2_compress.pl, fa_coords.pl.in, gmap_build.pl.in,
gmap_compress.pl.in, gmap_process.pl.in, gmap_reassemble.pl.in,
gmap_setup.pl.in, gmap_uncompress.pl.in, gmap_update.pl.in,
gsnap-fetch-reads.pl, gsnap-fetch-reads.pl.in, gsnap-remap.pl,
gsnap-remap.pl.in, md_coords.pl.in, psl_introns.pl.in, psl_splices.pl.in,
psl_splicesites.pl.in, sam_merge.pl.in, sam_restore.pl.in,
sim4_compress.pl, sim4_uncompress.pl, spidey_compress.pl, whats_on, trunk:
Restored gmap trunk subdirectory
* VERSION: Updated version
* substring.c: Computing MAPQ on entire substring, not on trimmed portion
* dynprog.c, dynprog.h, gmap.c, stage1hr.c, stage3.c, stage3.h: For genomic
GMAP alignments in GSNAP, not assigning any canonical reward, not
computing pairs_rev, and not scoring introns.
* splicetrie.c, splicetrie.h, stage1hr.c: Removed unused parameter
splicetypes
* gsnap.c, stage1hr.c, stage1hr.h: Removed unused parameters, including
queryptr and queryrc
* src, Makefile.dna.am, Makefile.gsnaptoo.am, dynprog.c, dynprog.h,
genome.c, genome.h, genome_hr.c, genome_hr.h, gmap.c, gsnap.c, mapq.c,
mapq.h, maxent_hr.c, maxent_hr.h, oligoindex.c, outbuffer.c, outbuffer.h,
pair.c, pair.h, samprint.c, samprint.h, splicetrie.c, splicetrie.h,
splicetrie_build.c, splicetrie_build.h, stage1hr.c, stage1hr.h, stage3.c,
stage3.h, stage3hr.c, stage3hr.h, substring.c, substring.h: Merged
revisions 40182:40234 from branches/2011-05-27-no-block-vars to reduce
number of parameters
2011-05-26 twu
* VERSION: Updated version number
* dynprog.c, dynprog.h, splicetrie.c, splicetrie.h, stage1hr.c, stage3.c,
stage3.h: Restored complete searching of known splicesites for dynamic
programming of ends
* stage1hr.c, dynprog.c, dynprog.h, splicetrie.c, splicetrie.h, stage3.c,
stage3.h: Created hybrid procedure for performing dynamic programming at
5' and 3' ends with known splicing
* dynprog.c, dynprog.h, splicetrie.c, splicetrie.h, stage3.c, stage3.h:
Wrote faster procedure for performing dynamic programming at 5' and 3'
ends with known splicing, but does not handle distal indels.
* inbuffer.c, gsnap.c, request.c, request.h, shortread.c, shortread.h:
Performing chopping of adapters only after paired-end alignment fails to
give concordant or paired result.
* stage1hr.c: Removed query as a parameter. Changed knownsplice limits.
* uintlist.c, uintlist.h: Added procedure Uintlist_to_string
* mapq.c, mapq.h, stage3hr.c, stage3hr.h, substring.c, substring.h: Removed
query as parameter to procedures
* genome_hr.c, genome_hr.h: Removed query as parameter to some procedures
2011-05-25 twu
* stage3hr.c: Moved assertions about private5p and private3p to correct place
* gsnap.c, inbuffer.c, request.c, request.h, shortread.c, shortread.h: When
potential paired-end adapter is found, checking alignment first without
chopping adapters, and then if no concordant or paired alignments are
found, then re-aligning with adapters chopped.
* stage3hr.c: Removing only duplicates that have not been used yet in a pair
* stage1hr.c: Doing Stage3pair_privatize before Stage3pair_eval
* pair.c: Added NH tag for GMAP alignments in GSNAP
* get-genome.c: Enabling re-use of contig_iit
* shortread.c: Fixed bug in printing pairedend fasta. Was printing both
revcomp and forward sequence for queryseq2.
2011-05-24 twu
* stage1hr.c, stage3hr.c, stage3hr.h: Reduced amount of memory copying in
making Stage3pair_T objects
* get-genome.c, parserange.c, parserange.h: Made operation of get-genome
from stdin more efficient by making only one open of chromosome_iit and
contig_iit
* VERSION: Updated version
* gsnap.c: Added message to indicate when alignment is starting
* stage1hr.c: Doing pairing only when individual alignments are performed
* outbuffer.c: Fixed debugging statement
* gmap.c: Added information to --help output on the -f flag about other
output types
* gsnap.c: Changed default value of genome_unk_mismatch_p to be 1
2011-05-23 twu
* samprint.c: Fixed sign insert size when read and mate have identical chrpos
* VERSION: Updated version
* samprint.c: Added NH flag to indicate number of paths
* gsnap_concordant.c: Defined concordance to allow for overlapping reads
* stage1hr.c: Introduced DEBUG4K for known doublesplicing
* dynprog.c, dynprog.h, gmap.c, intlist.c, intlist.h, splicetrie.c,
splicetrie.h, stage3.c, stage3.h: Removing duplicate results from
splicetrie when SNPs are allowed
* samprint.c: Fixed bug in printing cigar for two-thirds shortexon on minus
strand
* gmap.c: Removed include of mode.h
* Makefile.dna.am, Makefile.gsnaptoo.am, atoi.c, atoi.h, atoiindex.c,
genome_hr.c, genome_hr.h, gmap.c, gsnap.c, indexdb.c, indexdb.h, mode.h,
stage1hr.c, stage1hr.h, substring.c, substring.h: Using Mode_T instead of
cmetp. Incorporated atoi mode.
* oligo.c, oligo.h: Removed Oligo_setup
* convert.t.c: Initial import into SVN
* gsnap_fasta.c: Added code stub for handling BAM input
* gsnap.c: Added a thread-specific key for storing the request, and
accessing it with the signal handler, which no longer throws an exception.
* stage3.c: Checking for case in distalmedial comparison where medial
location extends past given genomicseg.
* stage1hr.c: Replaced indirect function calls with direct calls to
read_oligos_cmet and read_oligos_standard.
* oligo.c: Replaced indirect function calls with direct calls to oligo_read
and oligo_revise. Not handling dibasep anymore.
* genome_hr.c, genome_hr.h: Replaced indirect function calls with static
inline procedures
2011-05-22 twu
* stage1.c: Removed dibase parameter in calling Reader_new
* stage3hr.c: Removed assertion check for plusp equality in
pair_insert_length. For splice translocations, redefining plusp based on
substring_for_concordance.
* gsnap.c: Fixed output of query when exception occurs
* except.c: Fixed handling of exceptions by removing unnecessary call to
Except_advance_stack.
* gsnap.c: Removed dibasep and cmetp as parameters.
* stage1hr.c, stage1hr.h: Removed dibasep as a parameter. Also eliminated
cmetp as a parameter.
* substring.c, substring.h, stage3hr.c, stage3hr.h, splicetrie.c,
splicetrie.h, reader.c, reader.h, mapq.c, mapq.h, genome_hr.c,
genome_hr.h: Removed dibasep as a parameter
* oligo.c, oligo.h: Removed dibasep as a parameter. Added setup procedure
to assign procedure for dibase operation.
2011-05-21 twu
* genome_hr.c, genome_hr.h, gsnap.c, mapq.c, mapq.h, splicetrie.c,
splicetrie.h, stage1hr.c, stage3hr.c, stage3hr.h, substring.c,
substring.h: Setting block_diff procedure in genome_hr.c during setup, and
removing many uses of the cmetp variable.
* dynprog.c: Include header for splicetrie.h
2011-05-20 twu
* cmet.c, cmet.h, genome_hr.c: Moved mark_a, mark_c, mark_g, and mark_t data
and procedures from cmet.c to genome_hr.c
* Makefile.gsnaptoo.am: Made files for GMAP and GSNAP match those in
Makefile.dna.am
* Makefile.dna.am: Minor rearrangement of filenames
* samprint.c: Fixed printing of SAM output for translocations
* stage3hr.c: Using both hit5 and hit3 end points in hitpair_equal_cmp. Put
tally ahead of score in ranking subsumed hitpairs.
* splicetrie.c: Constraining max_mismatches_allowed to be less than
one-third of the end length
* substring.c: Added header file for pair.h
* stage3hr.c: Not using absdifflength bingo at all
* gsnap.c: Removed references to pairlength deviation
* stage3.c: In peel_leftward and peel_rightward, when running into a second
gap, transferring endgappairs first before transferring peeled pairs.
* dynprog.c: Using List_push_existing instead of Pairpool_push_existing to
save on memory
* stage3hr.c: Using only best splice within constraints and not pairlength
in resolving ambiguous inside splices
* stage1hr.c: Fixed definition of collect_all_p for shortexons
* splicetrie.c, splicetrie.h: Removed old code. Implemented collect_all_p
in Splicetrie_search_left and Splicetrie_search_right procedure.
2011-05-19 twu
* stage1hr.c, stage3hr.c, stage3hr.h: Requiring concordant pairs to have a
non-zero insert length
* stage1hr.c, stage3hr.c, stage3hr.h: Recording number of ambiguous matches
after known splicing, and subtracting from nmatches when ambiguous inner
splicing yields no candidates.
2011-05-18 twu
* gmap.c, stage1hr.c, stage2.c, stage2.h, stage3.c: Allowing stage 2 to
favor either left or right part of genomicseg
* gsnap.c: Using the same value for middle and end indel penalties. Changed
flags to allow only one indel penalty to be specified.
* stage3hr.c: In Stage3pair_remove_duplicates, allowing for ties within
cluster. Using hittype to rank hitpairs in hitpair_equal_cmp, but not for
distinguishing hitpairs in hitpair_equal_no_hittype_cmp.
* stage3hr.c: Fixed bug in using hitpair_equal_cmp
* stage3hr.c: Using same procedure, hitpair_equal_cmp (previously called
hitpair_position_cmp), both for sorting and for recognizing equal hitpairs.
* stage3hr.c: In Stage3pair_remove_duplicates, added hittype in sorting and
removing exact duplicates. Going through clusters separately from left
and right and checking for subsumption against initial alignment, not the
previous one.
2011-05-17 twu
* stage3hr.c: In Stage3pair_remove_duplicates, using tally within clusters.
Removing absdifflength bingo from Stage3pair_optimal_score.
* README, config.site, configure.ac: Setting default value of MAX_READLENGTH
to be 200
* gsnap.c: Providing value of MAX_READLENGTH in printing --version output
* dibase.c, inbuffer.c, mapq.c, shortread.c, stage3hr.c, substring.c:
Removed unnecessary includes of stage1hr.h, needed previously to obtain
MAX_READLENGTH
* Makefile.dna.am: Providing MAX_READLENGTH to gsnap. Provided files for
bam_fasta.
* README, config.site, configure.ac, Makefile.gsnaptoo.am, dibase.c,
gsnap.c, inbuffer.c, mapq.c, reads_store.c, samprint.c, shortread.c,
stage1hr.c, stage1hr.h, stage3hr.c, substring.c: Changed MAX_QUERYLENGTH
to MAX_READLENGTH and allowing value to be defined as an argument to
configure
* samprint.c, shortread.c, shortread.h: Printing chopped primers in SAM
output
* stage3hr.c: Moved code for resolving inside ambiguous splices to separate
procedures. Allowing mate of a GMAP alignment to resolve its inside
ambiguous splice.
* dynprog.c, dynprog.h, gmap.c, splicetrie.c, splicetrie.h, stage1hr.c,
stage3.c, stage3.h: Limiting region of known splice extension for GMAP
alignments in GSNAP involving paired-end reads. Region now cannot extend
past mate.
2011-05-16 twu
* stage3hr.c: Allowing overlapping of paired-end reads when resolving
ambiguous splices on insides
* dynprog.c, dynprog.h, gmap.c, stage1hr.c, stage3.c, stage3.h: Counting
ambiguous end matches in stage 3 alignment
* substring.c: Hid debugging statement
* gsnap.c: Using user-provided dir for tally IIT when available
* substring.c, substring.h, gsnap.c, stage1hr.c, stage3hr.c, stage3hr.h:
Implemented a multiclean procedure using a tally IIT file
2011-05-15 twu
* stage3hr.c: Not using hitpair type in making comparisons across multiple
alignments. Using subsumption instead of overlap in
Stage3pair_remove_duplicates.
2011-05-14 twu
* stage3hr.c: Changed removal of Stage3pair_T duplicates within overlapping
clusters from an O(n^2) algorithm to an O(n) algorithm
2011-05-13 twu
* stage1hr.c: Added debugging statement
* stage3hr.c: Assigning plusp for translocations based on overall
genomestart and genomeend. Setting substring_low and substring_high for
translocations to be the part that is concordant.
* samprint.c: Fixed bug in printing of translocations
* gsnap.c, genome_hr.c, genome_hr.h: Added options --query-unk-mismatch and
--genome-unk-mismatch, and made both default false, meaning that query N
and genome N no longer count as mismatches
2011-05-12 twu
* samprint.c, stage3hr.c, stage3hr.h: Handling case where clipping of
overlap removes entire alignment
* pair.c: Fixed bug in number of dashes in GSNAP output on deletions
* substring.c: Fixed GSNAP and SAM output for bisulfite alignments
* stage1hr.c: Eliminating cases with bad GMAP alignments, either with
non-canonical splices or with too many mismatches
* stage3.c: On extend_ending5 and extend_ending3, returning dynamic
programming results, even if finalscore is negative
2011-05-11 twu
* snpindex.c: Checking for presence of IIT file at destination, and
providing a better reminder message
* snpindex.c: Added reminder message at end to install IIT file
* gsnap.c: Added warnings under --use-cmet flag if cmet index files are not
present
* substring.c: Removed references to MAX_END_DELETIONS
* stage1hr.c, substring.c: Allocating gbuffer when it exceeds the amount
allocated statically
* stage3hr.c, substring.c, substring.h: Added preference in removing
duplicates for known splice sites over novel ones
* gsnap.c: Fixed documentation for --clip-overlap flag
* outbuffer.c, pair.c, pair.h, samprint.c, samprint.h, stage3hr.c,
stage3hr.h, substring.c, substring.h: Performing search for correct
hardclipping boundary. Computing chrpos and mate_chrpos in advance of
printing SAM output. For Pair_binary_search, performing forward and
backward search of middlei to avoid gaps. Fixed computation of overlap in
some cases involving GMAP alignments.
2011-05-10 twu
* samprint.c: Made logic of print_cigar follow that of print_md_string
* outbuffer.c, samprint.c, shortread.c, stage3hr.c: Fixes made to MD string
with hard clipping of overlaps
2011-05-09 twu
* pair.c: Fixed infinite loops in binary search procedures
* resulthr.c: Removed unused variable
* pair.c, pair.h, samprint.c: Performing hard clipping by computing a
subsequence on pairarray
* stage3hr.c: Allowing for NULL arguments in Stage3end_substring_low and
Stage3end_chrnum, now possible in samprint.c procedures
* stage1hr.c: Rewrote calculation for genomicseg
* stage3hr.c: Fixed calculation of insert length involving GMAP alignment
* samprint.c, pair.c: Fixed printing of SAM chromosomal pos
* pair.c, pair.h, samprint.c: Further implementation of hard clipping for
overlapping paired-end reads
2011-05-07 twu
* pair.c: Fixed bug caused by wrong order of parameters
* gsnap.c, outbuffer.c, outbuffer.h, pair.c, pair.h, samprint.c, samprint.h,
stage3.c, stage3hr.c, stage3hr.h, substring.c, substring.h: Added code for
hardclipping overlaps between paired ends. Improved computation of insert
lengths involving GMAP alignments. Increased default shortsplicedist to
200000.
* gmap.c: Changed default shortsplicedist to 200000
* get-genome.c: Added -E option for printing exons for gene maps
* genome.c: Made changes to perror statements
2011-05-05 twu
* samprint.c: Made changes to MD string to handle hard clipping
* stage1hr.c: Reduced genomicseg for GMAP from pairmax + shortsplicedist to
just pairmax
* dynprog.c: Added a new debugging category for known splicing at ends
* outbuffer.c, samprint.c, samprint.h, stage3hr.c, stage3hr.h: Computing
cigar strings to allow for hard clipping
* pair.c, pair.h, stage3.c: Fixed SAM flags when printing GMAP alignment in
GSNAP
* bam_tally.c: Made bam_tally work on entire genome
2011-05-04 twu
* README, chrom.c, chrom.h, gmapindex.c, fa_coords.pl.in, gmap_build.pl.in,
gmap_process.pl.in, gmap_setup.pl.in: Made changes for fa_coords to sort
chromosomes in .coords file, for gmap_process to provide universal
coordinate information to gmapindex, and for gmapindex to sort based on
this order. Ignoring leading "chr" in sorting chromosomes.
* chrom.c: Ignoring leading "chr" in chromosome name for sorting purposes
2011-05-03 twu
* stage1hr.c: Fixed bug in calling Substring_chrhigh
2011-05-02 twu
* gsnap.c, stage1hr.c, stage3hr.c, stage3hr.h, substring.c, substring.h:
Storing both chroffset and chrhigh in Stage3end_T and Substring_T objects.
Checking chromosomal boundaries when performing GMAP algorithm in GSNAP.
2011-05-01 twu
* gmap_build.pl.in: Added Id property
* gmap_build.pl.in: Added a -w flag for sleeping between steps
2011-04-27 twu
* splicetrie.c: Removed unused variables
* goby.c, gsnap.c, outbuffer.c, pair.c, pair.h, resulthr.c, resulthr.h,
samprint.c, samprint.h, stage1hr.c, stage1hr.h, stage3.c, stage3.h,
stage3hr.c, stage3hr.h: Changes in handling of translocations: (1) Created
new "_transloc" output files. (2) Removing XT flag from SAM output. (3)
Enforcing reported translocations to be unique. (4) Creating a new
category for translocations in pair_up procedure.
* gmap.c: Fixed bug in calling Genome_blocks on a user-provided segment
* diag.c: Added debugging information about minactive and maxactive at ends
of query
2011-04-26 twu
* stage2.c: Not checking for pct_coverage or ncovered when querylength < 150
* tally_expr.c: Removed dependence on pre-computed total in tally IIT file
* README: Showing examples from both hg18 and hg19 in retrieving known
splicesite tracks from UCSC
* gsnap.c, stage1hr.c, stage1hr.h: Removed min_localsplicing_end_matches
parameter. For short overlaps, now checking only that endlength >=
support.
* pair.c: Not printing first read or second read bit in GMAP samse output
2011-04-25 twu
* shortread.c: Fixed bug in printing queryseq in SAM output when
hardclipping is present
* samprint.c: Formatting changes
* pair.c: Removed warning message when splicesite not found in GMAP
* gsnap.c: Changed documentation for -N
2011-04-22 twu
* parserange.c: Added warning message if divstring not found in IIT file
* get-genome.c, iit_get.c: Removed extra linefeed introduced in old IIT
versions
* stage1hr.c: Added check for NULL pairarray
2011-04-21 twu
* Makefile.dna.am, dynprog.c, dynprog.h, gmap.c, indexdb_hr.c, sequence.c,
stage3.c: Made changes so PMAP would compile
* trunk, src, Makefile.dna.am, dynprog.c, dynprog.h, genome.c, genome.h,
gmap.c, goby.c, goby.h, gregion.c, gregion.h, gsnap.c, iit-read.c,
maxent.c, maxent_hr.c, outbuffer.c, pair.c, pair.h, pairdef.h, resulthr.c,
samprint.c, samprint.h, sequence.c, sequence.h, shortread.c, shortread.h,
splicetrie.c, splicetrie.h, stage1.c, stage1.h, stage1hr.c, stage1hr.h,
stage3.c, stage3.h, stage3hr.c, stage3hr.h, substring.c, substring.h,
translation.c, translation.h, util: Merged revisions 38122 to 38539 from
branch 2011-04-14-halfmapping-gmap
2011-04-15 twu
* bam_tally.c: Changed colon to tab when printing chromosomal coordinates
2011-04-14 twu
* trunk, src, genome_hr.c, gsnap.c, stage1hr.c, stage1hr.h, stage3hr.c,
stage3hr.h, util: Merged revisions 37902 to 38171 from
branches/2011-04-10-end-indels
* bam_tally.c: Fixed bug in iterating through list in position_printable_p
* bam_tally.c: Added -n and -X flags to control depth and variant strands
required. Added -B flag to control block format output.
* bam_tally.c: Changed -A flag into separate -C and -Q flags to print
details about cycles and quality scores
* stage3hr.c: Allowing optimal_score procedures to consider terminal scores
if all alignments are terminal
* trunk, VERSION, src, dynprog.c, dynprog.h, gmap.c, intron.c, intron.h,
maxent.c, maxent.h, stage3.c, stage3.h, util: Merged 38078:38120 from
branch 2011-04-13-gmap-knownsplicing
2011-04-13 twu
* snpindex.c: Added variables for bigendian machines
2011-04-10 twu
* stage1hr.c: Using function Genome_fill_buffer_blocks for debugging
* config.site.rescomp.prd: Revised version number
* genome.c, genome.h: Added function Genome_fill_buffer_blocks
* substring.c: Made fixes to printing of SNPs in GSNAP output
* README, configure.ac, Makefile.am, psl_introns.pl.in: Added psl_introns
program. Explaining in README file about known site-level and known
intron-level splicing.
* gmap.c, outbuffer.c, pair.c, pair.h, stage3.c, stage3.h: Added option for
-f introns output
2011-04-09 twu
* gsnap.c, iit-read.c, iit-read.h, interval.c, interval.h, splicetrie.c,
splicetrie.h: Added code to allow known splicing based on introns, rather
than splice sites
2011-03-29 twu
* bam_tally.c: Fixed bug in checking too early for chrpos_high > alloc_high
* VERSION: Updated version number
2011-03-28 twu
* cmetindex.c, snpindex.c: Made program work on bigendian machines
* stage3hr.c, stage3hr.h, substring.c, substring.h: Moved definition of
Hittype_T from substring.h to stage3hr.h
* trunk, VERSION, config.site.rescomp.tst, src, goby.c, goby.h, gsnap.c,
inbuffer.c, inbuffer.h, outbuffer.c, samprint.c, samprint.h, stage3hr.c,
substring.c, util: Merged changes 36746 through 37246 from branch
2011-03-17-goby-paired-end
2011-03-26 twu
* resulthr.c: Fixed bug in assignment of translocationp for single-end reads
2011-03-25 twu
* gmap.c, gsnap.c: Added clarification about memory mapping in message
* bamread.c: Fixed a memory leak caused by unnecessary copying of read
string from BAM.
* Makefile.dna.am, bamread.c, bamread.h, gsnap_splices.c: Added bam_splices
program
* bam_tally.c: Fixed bug with non-initialized count_plus or count_minus
* bam_tally.c: Fixed bug in extracting genomic reference nt from Genome_T.
Added option for signed counts.
* configure.ac, Makefile.dna.am: Added Automake conditional to control when
bam_tally can be made
* bamread.c: Added compiler directives to protect code when samtools is not
available
* bam_tally.c: Fixed usage statement
* bam_tally.c: Added --pairmax option
* bam_tally.c: Fixed help message and added some printing options
* bamread.c: Stopped printing of chromosomes to stdout
* Makefile.dna.am: Added compiler commands for bam_tally
* bam_tally.c: Added clipping at ends of requested genomic region
* samread.c, samread.h: Added function Samread_print_cigar
* bam_tally.c: Implementation of working version
2011-03-24 twu
* bam_tally.c: Implementation of overall alloc and block structure
* bam_tally.c, bamread.c, bamread.h: Implemented memory freeing procedures
* bam_tally.c: Initial import into SVN
2011-03-18 twu
* gsnap.c: Rearranged some lines
* gmap.c: Added --quality-protocol flag and 'j' flag to set to quality print
shift
* gmap.c, outbuffer.c, outbuffer.h: Added --quiet-if-excessive option to GMAP
* gsnap.c, outbuffer.c: Adding globals for invert_first_p and
invert_second_p to stage3hr.c
* stage1hr.c, stage3hr.c, stage3hr.h: Determining effective_chrnum,
genomicstart, and genomicend for splice translocations based on inner
substrings. No longer generating copies for each substring when chrnum ==
0.
* outbuffer.c: Changed a 0 to a false
* README: Added explanation of dbsnp_iit program and output reporting for
translocations
* configure.ac, Makefile.am, dbsnp_iit.pl.in: Added dbsnp_iit program
* stage3.h: Fixed declaration
* resulthr.c: Put debugging statements into debug macro
* outbuffer.c, pair.c, pair.h, resulthr.c, resulthr.h, samprint.c,
samprint.h, stage3.c, stage3hr.c, stage3hr.h: Removed separate fp_transloc
file. Adding (transloc) string to GSNAP output, and XT flag to SAM output
for translocation results.
* substring.c: Added error message for endtype_string
* outbuffer.c, resulthr.c, resulthr.h, samprint.c, stage3hr.c: Removed
SINGLEEND_TRANSLOCATION and PAIREDEND_TRANSLOCATION types
2011-03-11 twu
* README: Added more information about the --gunzip option, about the
command-line usage for paired-end reads, and about extended FASTA inputs.
* README: Added information about -s flag for psl_splicesites
* VERSION: Changed version number
* snpindex.c: Added option to limit number of warning messages
2011-03-10 twu
* snpindex.c: Fixed warning messages that previously reported the wrong SNPs
and coordinates that were problematic
* gsnap.c, mem.c, outbuffer.c, shortread.c: Enabled LEAKCHECK in GSNAP to
check for memory leaks
* gmap_build.pl.in: Fixed bug in creating .maps subdirectory
2011-03-09 twu
* VERSION: Updated version
* README: Added information about latest syntax for running snpindex
2011-03-08 twu
* stage1hr.c, stage1hr.h: Passing new parameter nsplicepartners_skip
* stage3hr.c, stage3hr.h: Eliminating splice translocations if a
non-translocation exists
* outbuffer.c, outbuffer.h, resulthr.c, resulthr.h, samprint.c, samprint.h:
Providing a translocation result type and printing split output to a new
file
* indexdbdef.h: Removing unused macro definition
* gsnap.c, splicetrie.c, splicetrie.h: Providing a minimum intron length in
building the splicetrie
* snpindex.c: Providing user options to specify sourcedir and destdir
* cmetindex.c: Fixed amount of space allocated for filename
2011-03-06 twu
* gmap.c, gsnap.c: Changed warning message about memory mapping
* stage1hr.c: Removed a conversion for bigendian machines in
Batch_init_simple.
* spanningelt.c: Added a necessary conversion for bigendian machines
* indexdb.c, indexdb_hr.c: Making FILEIO access to positions look the same
as MMAPPED access for bigendian machines
* bigendian.c, littleendian.c: Using unsigned char instead of char
2011-03-04 twu
* inbuffer.c: Initializing value of pc_linefeeds_p
* gmap.c, gsnap.c: Changed advice message about -B 3 and -B 4
* iit_store.c: Using 2^32-1 as a constant instead of 2^32
* stage1hr.c: Added additional places where Bigendian_convert_uint should be
applied
* bigendian.c: Removed monitoring message
2011-03-03 twu
* stage3hr.c: Sorting paired-end reads by insert length
* gsnap.c: Removed --dibase flag from --help output
* inbuffer.c, shortread.c, shortread.h: Added field for pc_linefeeds_p.
Changed variable name from pc_line_feeds_p.
* gsnap.c, inbuffer.c, inbuffer.h, shortread.c, shortread.h: Added option to
strip PC line feeds from input
2011-03-02 twu
* sequence.c: Enabled GMAP to read FASTQ files
* gsnap_fasta.c: Made the default behavior to print all sequences
* pair.c: Fixed problem with double tabs in SAM output. Now printing NM tag
in SAM output.
* config.site.rescomp.prd, config.site.rescomp.tst, VERSION: Changed version
number
* sam_merge.pl.in: Printing nomapping lines from original GSNAP output
* configure.ac, Makefile.am, sam_restore.pl.in: Added program sam_restore
* spliceturn.c: Added option to print splices, rather than splicesites
* samread.c, samread.h: Added functions to print modified SAM reads
* gsnap_splices.c: Trusting SAM splice directions by default. Added an
explicit variable to indicate if the splice is canonical.
* gsnap_multiclean.c: Added a flag -C to pick either concordant or
non-concordant behavior. Allowing nonmapped queries to be printed, if no
other alignment is available.
* gsnap_filter.c: Modified to print nonmapped SAM entries. Removed
inconsistencies in printing some lines to fp_one and some to fp_many.
* gsnap_fasta.c: Prints the query from the SAM line with the most matches
(i.e., not hardclipped). Added an option --oneway to print both ends of a
paired-end read in the same direction.
* gsnap_concordant.c: Modified to print nonmapped SAM entries, and to add
mate information for concordant pairs. Code follows that in
gsnap_multiclean.c
2011-03-01 twu
* splicetrie.c: Increased value of MAX_DUPLICATES from 100 to 1000
* gsnap.c: Checking for case where splicesites_iit has no sites
corresponding to given genome
* pair.c: Fixed MD string to print genomic nt, rather than query nt
2011-02-28 twu
* pair.c: Implemented MD string in SAM output for GMAP
2011-02-26 twu
* spliceturn.c: Added --new flag to print only new splices
* spliceclean.c: Added break after case 0 in getopt
* substring.c, substring.h: Fixed insert length calculation to be based on
genomicstart and genomicend. Removed querylength_adj field.
* stage3hr.c: Preventing terminal alignments from setting minscore in
Stage3_optimal_score and Stage3pair_optimal_score
* stage1hr.c: Fixed and simplified calculation of floor_left and floor_right
for diagonals
2011-02-25 twu
* configure.ac, Makefile.am, psl_splices.pl.in: Added psl_splices program
* sam_merge.pl.in: Including inserted query segment in computing number of
matches.
* spliceturn.c: Eliminating only new splices (numeric labels). Searching
for nearest surrounding known splices.
* spliceclean.c: Printing original label for each splice
* gsnap_splices.c: Added flag to require canonical splices. Checking for
canonical dinucleotides regardless of sam XS string.
* samprint.c: Fixed bug in MD string for shortexon. Added debugging
statements.
* stage1hr.c, stage3hr.c, stage3hr.h, substring.c, substring.h: Added
querylength_adj to compensate for indels when computing insert length
* pair.c: Fixed bug in tokens where Ilength was being printed instead of
Dlength
* spliceturn.c: Printing intron lengths. Added labels to output when
splices are turned.
* shortread.c: Requiring first file to have "/1" and second file to have
"/2" if slashes are present
* samread.c: Allowing for XS:A:? to indicate unknown splice direction
* pair.c: Changed minimum intron length for non-concordant splice in cigar
string from 100 to 20
* gsnap_tally.c: Changed default handling of quality scores to be Sanger
protocol
* gsnap_splices.c: Using available XS flag in SAM output. Printing
non-directional splices as both forward and reverse. Printing
non-canonical dinucleotide pairs.
* gsnap_multiclean.c: Skipping, rather than aborting on, concordant pairs
with different numbers of hits, due to translocations
* gmap.c: Changed flag for printing noncanonical splices in cigar string
* samprint.c, stage3hr.c, substring.c, substring.h: Fixed MD string to
exclude part that is soft-clipped
* spliceclean.c, tally.c, tally.h: Added default-count option. Fixed bug
where splice occurred beyond extents. Fixed some memory leaks.
2011-02-24 twu
* psl_splicesites.pl.in: Added missing parentheses
* configure.ac, Makefile.am, sam_merge.pl.in: Added sam_merge program
* README: Added description of gmap_build
* gsnap.c, mapq.c: Made sanger the default for quality protocol
2011-02-23 twu
* shortread.c: Fixed bug in chopping paired-end reads of different lengths
* sequence.c: Fixed bug in skipping initial '<', '>', or '+' in quality
string
* gsnap_concordant.c: Requiring chr strings in the two ends to be equal
* pair.c: Printing XS:A:? when splice direction is not known, because it is
non-canonical
* sequence.c, sequence.h: Added compiler directives to hide quality string
from PMAP
* gsnap_fasta.c: Printing extended FASTA with quality strings for GMAP output
* gmap.c, outbuffer.c, outbuffer.h, pair.c, pair.h, sequence.c, sequence.h,
stage3.c, stage3.h: Implemented ability to read extended FASTA with
quality strings and print them in SAM format
* mapq.c: Added recommendation to use --quality-protocol=sanger
* gsnap_concordant.c, samread.c, samread.h: Printing altered mapq scores
* gmap.c, outbuffer.c, outbuffer.h, pair.c, pair.h, stage3.c, stage3.h:
Added flag to print non-concordant splices as N in SAM cigar string,
rather than as D
* gsnap_concordant.c, samread.c, samread.h: Added calculation and printing
of insert length
* gsnap.c: Added information about quality protocols to --help output
2011-02-21 twu
* shortread.c: Fixed bugs in handling gzipped paired-end files
* exonscan.c, extents_genebounds.c, gdiag.c, geneadjust.c, get-genome.c,
gsnap_extents.c, gsnap_splices.c, gsnap_tally.c, gsnap_terms.c,
iit_plot.c, pairinggene.c, segue.c, snpindex.c, splicefill.c,
splicegene.c, splicegraph.c, splicescan.c, spliceturn.c, splicing-scan.c,
splicing-score.c, tally_expr.c, tallygene.c: Changed parameter for
Genome_new from batchp to access mode
* psl_splicesites.pl.in: Added -R flag to report non-canonical splice sites
* gmap_build.pl.in: Fixed name of maps subdirectory
* sequence.c, shortread.c: Fixed bugs further in closing NULL file pointer
* psl_splicesites.pl.in: Fixed bugs in syntax. Added -s flag to specify
start column.
* util, fa_coords.pl.in, gmap_build.pl.in, gmap_process.pl.in: Merged -r
35283:35500 from branches/gmapindex-multifile/util
* Makefile.am: Added psl_splicesites to the list of files to be cleaned
* README, configure.ac, Makefile.am, psl_splicesites.pl.in: Added
psl_splicesites program to process UCSC alignment tracks into a splicesite
file.
* sequence.c, shortread.c: Fixed bug from trying to close a NULL file pointer
* trunk, README, VERSION, config.site.rescomp.tst, maint, memory-check.pl,
src, gsnap.c, interval.c, mapq.c, mapq.h, samprint.c, splicetrie.c,
splicetrie.h, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h, substring.c,
substring.h: Merged revisions 35346:35468 from branches/tentative-splices
to store and resolve ambiguous splice ends
* outbuffer.c: Added warning messages when an output file cannot be written
2011-02-16 twu
* mapq.c: Changed allowable range of quality scores to go from 0 to 96
* gsnap.c: Changed default min_shortend from 1 to 2
* stage1hr.c: Removed unused code for old maxent procedures
* splicetrie.c, splicetrie.h, stage1hr.c: Providing splicesites to
Splicetrie_find_short procedures, useful for debugging
* stage3hr.c: Removed debugging statement
* stage1hr.c: Restored ability to find short-overlap splicing in 1-2 bp at
ends of read
* stage3hr.c: For insert length of 0, setting absdifflength to be infinite
* sequence.c, shortread.c: Closing files when multiple ones are provided on
the command line
* pair.c: Implemented printing of intron distances in splice sites (-f 6)
output
* gsnap.c, interval.c, interval.h, splicetrie.c, splicetrie.h, stage1hr.c,
stage1hr.h: Implemented usage of known splice distances for single splices
* get-genome.c: Printing output correctly for version 5 IIT files with
information in rest of header
2011-02-15 twu
* stage1hr.c: Removed unnecessary checks for nsplicesites > 0
* stage1hr.c: Computing and storing known splicesites_i for each segment once
* shortread.c: Removed abort left in for debugging
* shortread.c: Fixed bug in selecting region of short read to look for
adapter stripping
* stage3hr.c: Fixed indentation
* gsnap_concordant.c: Fixed procedure to report all concordant pairs with
correct distance and orientation
* gsnap.c, inbuffer.c, shortread.c, shortread.h: Simplified procedure for
adapter stripping using a linear algorithm, instead of dynamic programming.
* Makefile.dna.am, Makefile.gsnaptoo.am: Revised instructions for cmetindex
* cmetindex.c: Enabled -F and -D flags for specify source and destination
directories. Added --version and --help flags.
* gmap_build.pl.in: Added -B flag to optionally specify bindir
* gmap_setup.pl.in: Modified usage statement to include GSNAP
* configure.ac, Makefile.am, gmap_build.pl.in: Added gmap_build program
2011-02-14 twu
* VERSION: Updated version
* README: Added statement about usage of -m flag
* gsnap_concordant.c: Fixed typo leading to wrong output file
* outbuffer.c: Fixed typo
* stage3.c, stage3.h: Allowing flags for GMAP to indicate SAM output is
paired-end
* shortread.c: Allowing extended FASTA format to include a quality string
* sequence.c: Removed debugging statement
* samflags.h: Clarified comment
* pair.c, pair.h: Allowing flags for GMAP to indicate SAM output is
paired-end. Printing XS flag for strand direction.
* outbuffer.c, outbuffer.h: Added variables for indicating SAM output is
paired-end
* gsnap_terms.c: Using procedure in samread.c for computing chrpos_high.
* gsnap_multiclean.c, samread.c, samread.h: Added file variable for printing
altered flags. Added procedure for computing chrpos_high.
* gsnap_filter.c: Added program for sam_filter
* gsnap_fasta.c: Handling GMAP and GSNAP output correctly in a single
procedure
* gsnap_extents.c: Enabling GMAP indexdb for sam_extents
* gsnap_concordant.c: Recomputing all SAM flags
* gmap.c: Changed format names to samse and sampe
* Makefile.gsnaptoo.am: Include maxent_hr.c and .h for GMAP. Removed
maxent.c and .h from GSNAP.
* Makefile.dna.am: Include maxent_hr.c and .h for PMAP. Included programs
sam_fasta and sam_concordant.
2011-02-13 twu
* gsnap_concordant.c: Added file for concordant_mult. Printing concordant
results in adjacent pairs of SAM lines.
* gsnap_concordant.c: Initial import into SVN
2011-02-12 twu
* gsnap_fasta.c: Printing extended FASTA output for GMAP, using '>' and '<'
* gsnap_fasta.c: Provided separate output types for GMAP and for GSNAP
* pair.c, sequence.c, sequence.h: Enabled reading of extended FASTA using
'>' and '<' to indicate first and second reads, and putting information in
flag of SAM output.
2011-02-11 twu
* stage3hr.c: Implemented a different method for using bingo pairlengths, by
calculating a minscore for those pairlengths with the bingo characteristic
* stage3hr.c: Using pairlength_deviation to eliminate pairs, even if
non-overlapping
* gsnap_extents.c, gsnap_iit.c, gsnap_splices.c, gsnap_tally.c,
gsnap_terms.c: Changed defaults of concordant and unique to be false in
all programs
* gsnap_fasta.c: Added printing of quality strings. Implemented -A flag to
print all sequences.
* stage1hr.c: Rearranged order of singlesplice_minus procedure
* gsnap_terms.c: Fixed check of concordance and uniqueness for last sequence
* shortread.c: Moved return statement to correct place
* gmap.c, outbuffer.c: Added compiler directives for case when pthreads not
available
* datadir.c: Changed warning message about needing to recompile GMAP package
* outbuffer.c, samprint.c, samprint.h, stage3hr.c, stage3hr.h: Split
fp_paired_uniq into separate files for inversions, scrambles, and long
inserts
2011-02-10 twu
* gsnap.c, substring.c, substring.h: Removed notion of termdonor and
termacceptor typeints
* samprint.c: Printing insert length for paired alignments. Changed method
for determining sign.
* substring.c, substring.h: Added functions for finding overlaps and insert
lengths between two substrings
* stage3hr.c: Changed criterion for scramble to be the absence of any
overlap in the wrong relative positions. Revamped computation of insert
length to look for overlapping substrings.
* inbuffer.c: Removed second check of nleft == 0 when initially it is not
* substring.c: Added other fields in Substring_T that were not being copied.
Removed splicesites_offset from Substring_T object.
* gsnap.c, outbuffer.c, outbuffer.h, stage1hr.c, stage3hr.c, stage3hr.h,
substring.c, substring.h: Added compiler directives for using new
maxent_hr procedures. Fixed problem in substring.c where chimera_knownp_2
was not being copied.
* maxent_hr.c: Using jump tables based on shift
2011-02-09 twu
* maxent_hr.c: Removed duplicate calls in reading genome_blocks
* stage3hr.h: Fixed definition of sense consistency for inversion pairs
* stage3hr.c, stage3hr.h: Changed splicing sense test to work for inversion
pairs
* stage1hr.c: Distinguished a binary_search procedure to be used for
bigendian computers in processing positions from the indexdb file.
* stage1hr.c: Added missing code for computing nmismatches for one type of
splice end
* Makefile.dna.am, maxent_hr.c, maxent_hr.h, stage1hr.c: Implemented fast
calculation of maxent splice site probabilities
* stage1hr.c: Removed trimpos calculation from find_spliceends. Not useful
when we can compute short-overlaps.
* Makefile.dna.am, gsnap_best.c, gsnap_fasta.c: Added programs for
extracting alignment with best MAPQ score and converting alignment output
to FASTA.
* stage1hr.c: Using genome-based splice site detection for splice ends
* gsnap.c: Fixed name of flag suboptimal-levels in help statement
* shortread.c: Fixed bug in not allocating space for final '\0' to contents.
Commented out check for PC line feeds.
2011-02-08 twu
* genome_hr.c: Fixed excessive shift in calculating high_halfbit
* stage1hr.c: Put back checks for zero nsegments
* outbuffer.c: Some output seems to be missed on occasion. Rewrote to use
ndone and noutput.
* stage1hr.c: Making all loops on segments go to segments[nsegments] instead
of nsegments-1
* stage1hr.c: Implemented novel double splice detection using new
genome-based splice site detection. Removed leftspan/rightspan test and
replaced with counting mismatches.
* outbuffer.c: Fixed outbuffer to check for donep flag, and for inbuffer to
signal when donep is set
2011-02-07 twu
* genome_hr.c: Fixed bug in dealing with high halfbit
2011-02-06 twu
* stage1hr.c: Fixed memory leak caused by computing floors twice
* stage1hr.c: Raised minimum splice prob support to 0.80. Extending range
for single splicing to 2 nt from each end. Implemented double splice
detection involving known splice sites.
* stage3hr.c: Eliminating identical Stage3_T pairs properly
* outbuffer.c: Fixed bug that caused GMAP to handle maxpaths parameter
incorrectly
* genome_hr.c, genome_hr.h, stage1hr.c, substring.c: Implemented working
procedure for finding splice sites from compressed genome, and merging
with known sites. Using this procedure to find single splices. Merged
single-splice procedures for plus and minus strand, and moved handling of
plus and minus strands inside of Substring_T procedures.
2011-02-05 twu
* genome_hr.c, genome_hr.h: Implemented fast determination of splice site
locations
* stage1hr.c: Removed unused variables
* genome_hr.c: Added tables for splice site positions
* dev: New directory for developer work
2011-02-04 twu
* stage1hr.c: Integrated multiple procedures for merging heaps to find
segments into a single procedure (plus a specialized one for terminals
only). Removed separate Splicesegment_T object and using a general
Segment_T object.
* stage1hr.c: Divided single splicing, known double splicing, and novel
double splicing into separate procedures.
* gsnap.c, stage1hr.c, stage1hr.h: Provided flag for detecting novel double
splices, and turned the feature off by default.
* stage1hr.c: Implemented faster method for finding double splices.
Commented out code no longer valid for setting splice_pos_start and
splice_pos_end in finding single splices.
* stage1hr.c: Increased speed of finding local splices by storing leftmost
and rightmost querypos. Hiding double-splicing for now.
2011-02-03 twu
* stage1hr.c, substring.c, substring.h: Handling splicesites_offset in
donor, acceptor, and shortexon Substring_T types. Handling two values of
splicesites_i in shortexon.
* stage1hr.c: Implemented detection of double-splicing at novel splice
sites. Integrated detection of single-splicing and double-splicing.
Removed some unused code based on USE_CHARS rather than nucleotides.
* stage3hr.h: Added missing declaration of function
* outbuffer.c: Finishing up printing of remaining output
2011-02-02 twu
* stage1hr.c: Offset knowni arrays by +1, so we can clear by setting to 0,
rather than by setting to -1.
* outbuffer.c, pair.c, pair.h: Now GMAP prints nomapping results in SAM
format
* pair.c: Fixed printing of splice site scores for antisense cDNAs
* stage1hr.c: Removed restriction on finding terminals only when nconcordant
was 0.
* trunk, src, gsnap.c, stage1hr.c, stage1hr.h: Merged r33262:34617 from
branch suboptimal-alignments, adding parameter for terminal length, and
not using 10*maxpaths for computation
* config.site.rescomp.tst, VERSION: Updated version number
* README: Updated description to include information about paired alignments
and the advantages of known splice sites
* stage1hr.c: Added better debugging statements. Renamed "shortend"
procedures to "short-overlap".
* splicetrie.c: Requiring that 3 * nmismatches < nmatches to report a true
search result
* gsnap.c, outbuffer.c, resulthr.c, resulthr.h, samprint.c, samprint.h,
stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h: Implemented three types of
alignment: concordant, paired, and unpaired, with three subtypes of
paired: inversion, toolong, and scramble. Detecting paired alignments in
Stage3_pair_up_concordant. Converting unpaired uniq to paired uniq when
appropriate.
2011-02-01 twu
* stage3hr.c: Adding information about unpaired type (interchrom, toolong,
scramble, inversion) for unpaired_uniq results
* outbuffer.c: Changed type for nread and ncomputed fields from bool to int
* gsnap.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h: Added parameter
for pairlength_deviation
* outbuffer.c: Protecting print loops with surrounding lock and unlock
instructions. Removed debugging flag.
* Makefile.dna.am, Makefile.gsnaptoo.am, Makefile.three.am, gmap.c, gsnap.c,
inbuffer.c, inbuffer.h, ioboard.c, ioboard.h, outbuffer.c, outbuffer.h:
Fixed bug where multithreading was hanging. Moved IOBoard_T information
to Outbuffer_T.
* stage3hr.c: Rewrite of Stage3_pair_up_concordant to get all concordant
pairs
* gmap.c, gsnap.c: Not printing program name as an arg
* gmap.c, gsnap.c: Printing version and calling arguments to stderr
* stage3hr.c: Using a pointer instead of a count to mark paired_seenp
* outbuffer.c: Using RRlist_T to represent queue for ordered output
* outbuffer.c: Replaced doubly linked list with singly linked list for queue
2011-01-31 twu
* stage3hr.c: Added additional check to prevent negative insert lengths
* outbuffer.c: Storing results in a queue, instead of a list
* Makefile.dna.am: Removed reads_store and reads_dump
* stage3hr.c: Removed keyword "quality:" from output
* shortread.c: Fixed bug that removed all quality strings
* stage3hr.c: Fixed bug in finding concordance between overlapping ends of
read
* indexdb.c: Fixed bug in compiler directive for MMAP
2011-01-28 twu
* goby.c, goby.h, gsnap.c, inbuffer.c, inbuffer.h, samprint.c, shortread.c,
shortread.h, stage3hr.c: Simplified procedures in shortread.c.
Implemented parsing of barcodes.
* mapq.c, mapq.h, substring.c: Fixed calculation of MAPQ with separate
coordinates for checking genomic string and quality string
* shortread.c, shortread.h: Initial import into SVN. Contains Sequence_T
functions specific to GSNAP.
* Makefile.dna.am, Makefile.gsnaptoo.am, Makefile.three.am, chop_primers.c,
genome.c, goby.c, goby.h, gsnap.c, inbuffer.c, outbuffer.c, reads_store.c,
request.c, request.h, samprint.c, samprint.h, sequence.c, sequence.h,
stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h, substring.c, substring.h:
Separated Sequence_T functions into Sequence_T and Shortread_T
* gmap.c, outbuffer.c, outbuffer.h: Made outbuffer work for PMAP by removing
references to sam_header_p and related variables
* gsnap.c: Removed old sam.h include
* substring.c: Turned off debugging
* spliceclean.c: Version 34373 was an accidental reversion. Going back to
version 34369, where we are adding use of genebounds_iit and adding
functionality for resolving splice directions
* spliceturn.c: Version 34374 was an accidental reversion. Going back to
version 34369, where information goes to stderr.
* splicegene.c: Version 34375 was an accidental reversion. Going back to
version 34369, which actually does stop fixing of terminalp in acceptors
where next donor is terminal.
* splicefill.c: Version 34377 was an accidental reversion. Going back to
version 34369, which uses tally, adds smoothing, does not use slopes to
find edges, and does not check for edgedistance.
* pair.c: Using new samflags.h
* Makefile.dna.am, Makefile.gsnaptoo.am, Makefile.three.am, gsnap_extents.c,
gsnap_multiclean.c, gsnap_splices.c, gsnap_tally.c, gsnap_terms.c, sam.c,
sam.h, samflags.h, samprint.c, samprint.h: Change file name from sam.c to
samprint.c. Moved definitions of SAM flags to samflags.h.
* splicefill.c: Removed smoothing. Using slopes to find edges. Checking
for edgedistance.
* splicegene.c: Stopped fixing of terminalp in acceptors cases where next
donor was terminal
* spliceturn.c: Providing information to stdout about splices that are turned
* spliceclean.c: Removed use of genebounds_iit and functionality for
resolving splice directions
* changepoint.c: Changed function for both ends, but not used anyway
* src, Makefile.dna.am, Makefile.gsnaptoo.am, Makefile.three.am,
blackboard.c, blackboard.h, changepoint.c, chop_primers.c, gmap.c,
gsnap.c, iit_pileup.c, inbuffer.c, inbuffer.h, ioboard.c, ioboard.h,
outbuffer.c, outbuffer.h, reads_get.c, reqpost.c, reqpost.h, request.c,
request.h, result.c, result.h, resulthr.c, resulthr.h, sequence.c,
sequence.h, spliceclean.c, splicefill.c, splicegene.c, spliceturn.c,
stage3.c, stage3.h, tableuint.c, tableuint.h, tally_exclude.c: Merging
changes to threads system from new-threads branch
* indexdb.c: Removed extraneous allocation of memory for offsets
* pair.c, pair.h, stage3.c, stage3.h: Using a single parameter for
sourcename for GFF3 output
* substring.c: Fixed calculation of region to check for MAPQ scoring
* mapq.c: Added check for wrong segment to check in computing MAPQ
2011-01-24 twu
* result.c: Altered debugging statements
* gmap.c: Fixed memory leak in chimera detection
2011-01-21 twu
* gmap.c: Fixed case where best0 or best1 was duplicated in rest of
stage3list
* result.c: Added debugging statements
* gmap.c: Removed debugging comment
* stage3.c, stage3.h: Added function Stage3_identity_cmp to help with
chimera detection
* gmap.c: Removed check for chimeras based on alignment break. Handling
cases where the same stage3 object is in both lists.
* chimera.c, pair.c, pair.h: Simplified Pair_matchscores and computing over
querylength. In Chimera_bestpair, check for cases where the same stage3
object is in both lists.
2011-01-20 twu
* Makefile.gsnaptoo.am: Added chimera.c to build of gmap
* VERSION: Updated version to 2011-01-21
* gsnap.c: Always creating a .nomapping file with --split-output option
* stage1hr.c: Changed debugging statements for shortexon
* splicetrie.c: Changed debugging statements
* sequence.c: Not printing space at end of accession
* gsnap.c: Turning on splicetrie precomputation by default
* gmap.c: Fixed bug in separating chimeric paths
* gmap.c: Not sorting first part of stage3list when chimera is present
* Makefile.dna.am: Added uintlist.c to gsnap_iit
* chimera.c: Made detection of alignment break work again
* splicetrie.c: Implemented handling of duplicate leaves
2011-01-19 twu
* splicegene.c: Handling genebounds.iit as input
* gsnap.c: Added --sam-headers-batch option
* gsnap_iit.c: Changed output to look like gene map format
* gsnap_extents.c: Fixed handling of non-spliced reads in sam_extents
2011-01-18 twu
* sam.h: Added constant for clearing NOT_PRIMARY bit
* sam_tally.c: Removed from CVS
* gsnap_multiclean.c, samread.c, samread.h: Implemented printing of altered
flag
* gsnap_terms.c: Made program provide same output with SAM input.
Implemented filtering for concordant pairs. Removed filtering by
max_endlength.
* gsnap_tally.c: Implemented filtering for concordant pairs
* gsnap_splices.c: Made program provide same output with SAM input
* gsnap_extents.c: Made program with SAM input
* gsnap_multiclean.c: Turned off debugging statements
* Makefile.dna.am, gsnap_multiclean.c: Implemented sam_multiclean
2011-01-15 twu
* gsnap_multiclean.c, multiclean.c: Renamed file
* gsnap_tally.c: Added check for concordantp in SAM input. Fixed bug in
initializing a variable.
* sequence.c: Made paired adapter detection more stringent, allowing only 1
mismatch
* gsnap.c, sam.c, sequence.c, sequence.h, stage3hr.c: Fixed bugs in printing
full quality string in GSNAP output
* gsnap.c, sam.c, sequence.c, sequence.h, stage3hr.c: Printing full quality
string (not chopped for adapter) in GSNAP output
2011-01-10 twu
* stage3.c: Fixed compilation for PMAP
* gmap.c: Added compiler directives to hide SAM output which is not used in
PMAP
* translation.c: Added compiler directives to hide functions that are not
used in PMAP
* oligop.c: Fixed compiler warnings about array index being char
* Makefile.dna.am: Removed bam_pileup from being made
* gmap.c: Added documentation for new output flags
* gsnap.c: Changed output flag from -7 to --split-output
* chimera.c, chimera.h, genome.c, get-genome.c, gmap.c, iit-read.c,
iit-read.h, md5-compute.c, pair.c, pair.h, revcomp.c, segmentpos.c,
segmentpos.h, sequence.c, sequence.h, stage1.c, stage3.c, stage3.h,
subseq.c, translation.c, translation.h: Implemented split output to files
* iit-read.c: Fixed bug in handling NULL IITs
* gmap.c, pair.c, pair.h, sequence.c, sequence.h, stage3.c, stage3.h:
Implemented printing of chimeras in SAM output
2011-01-09 twu
* trunk, gmap.c, pair.c, pair.h, result.c, result.h, stage3.c, stage3.h,
translation.c: Merged all changes from chimera branch
* Makefile.pmaptoo.am: Update commands
* Makefile.dna.am: Added commands for bam_pileup
2011-01-07 twu
* gmap.c, stage3.h: Added new debugging point for result after all cycles.
* stage3.c: Not forcing solution for dual breaks. Using separate maxiter
limits.
* stage3.c: Changed comments for fix_adjacent_indels
2011-01-06 twu
* Makefile.three.am: Added files to GSNAP
* pair.c: Changed debugging output for Pair_dump to show the comp
* stage2.c: Added a check for all zero scores when trying to find alignment
end point
2011-01-05 twu
* stage3.c: Added a final cleaning of ends
* stage3.c: Added procedure to fix adjacent indels
* gmap.c, pair.c, pair.h, segmentpos.c, segmentpos.h, stage3.c, stage3.h:
Removed references to zerobasedp
* pair.c: Using last_querypos and last_genomepos explicitly instead of
prev->querypos and prev->genomepos. Fixed issues with SAM output.
2011-01-04 twu
* gmap.c: Added compiler directives to prevent PMAP from seeing SAM output
code
* backtranslation.h: Fixed typo in declaration
* gsnap.c: Fixed comment
* gmap.c, pair.c, pair.h, stage3.c, stage3.h: Printing headers and
read-groups in SAM output
2011-01-03 twu
* MAINTAINER: Updated instructions for ChangeLog
* config.guess: Update of config.guess by latest autoconf
* INSTALL: Update of INSTALL message by latest autoconf
* stage3hr.c: Added assertions about sign of nindels
* gmap_setup.pl.in: Handling case where user gives -d argument with trailing
slash
* gsnap.c: Added missing break after -o flag
2010-12-22 coryba
* gsnap_tally.c: changed compiler directives to get gmap build to work
* sam.c: *minor change to have the MD field output a 0 after the deletion if
an insertion is adjacent to a deletion **IGB can now parse gsnap's SAM
output
2010-12-15 twu
* gsnap.c, mapq.c, sequence.c: Added flag --quality-protocol
2010-12-12 twu
* stage1hr.c: Fixed bugs in storing splicesites_i
2010-12-10 twu
* pair.c: Fixed bug in dealing with EXTRAEXON_COMP
* gsnap_tally.c: Added flag for minimum mapq
* gsnap.c, mapq.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h,
substring.c: Merged r32485:32693 from branch gsnap-trim-penalty into the
trunk
2010-12-08 twu
* config.site.rescomp.tst: Updated to include with_samtools
* bamread.c: Hid declaration of bam_init_header_hash when samtools is not
enabled
2010-12-07 twu
* substring.c: Implemented marking of methylation changes
* stage1.c: Performing a single uniqueness step at end
* stage2.c: Using global or local winner for end of stage 2
* indexdb_dibase.c: Using Access_mode_T for Indexdb_new_genome
* indexdb.c: Minor fixes
* gregion.c, gregion.h: Providing hooks for Gregion_filter_clean
* gmap.c, gsnap.c: Using allocate as default mode if mmap not available
2010-12-06 twu
* gsnap_tally.c, bamread.c, bamread.h, gsnap_extents.c, gsnap_splices.c,
gsnap_terms.c, samread.c, samread.h: Returning mapping quality from SAM
and BAM inputs
* gmap.c: Improved default information for --batch feature in --help
* get-genome.c: Fixed mapping labels from stdin
* gsnap.c: Changed default memory access to be level 2
2010-12-04 twu
* stage3hr.c: Disallowing concordant pairs between two terminal alignments
* stage1hr.c, stage3hr.c, stage3hr.h: Placed restriction on terminal
alignments to have fewer than allowed mismatches within region after
trimming
* stage3hr.c: Changed Stage3pair_remove_duplicates to resolve overlaps using
absdifflength
* gsnap.c: Changed --help output to show default batch mode of 4
* gmap.c: Providing more batch modes in GMAP
* access.h, genome.c, genome.h, gsnap.c, indexdb.c, indexdb.h: Providing
more batch modes in GSNAP
2010-12-03 twu
* stage1hr.c: Made done_level always less than or equal to user_maxlevel
2010-12-02 twu
* samread.h: Added Id tag
* sam.c: Changed terminal alignments to use soft clipping, since hard
clipping information appears to be removed in making BAM files.
* parserange.c, parserange.h: Implemented simple parser for regions
* gsnap_tally.c: Implemented limited region for indexed BAM files in
bam_tally. Added -P flag for printing probabilities.
* bamread.c, bamread.h: Implemented indexed BAM files
* Makefile.dna.am: Added parserange.c and .h to bam_tally
2010-12-01 twu
* Makefile.dna.am, bamread.c, bamread.h, gsnap_tally.c: Implemented
bam_tally. Changed standard tally output back to previous format.
* config.site, configure.ac: Made changes to include samtools library
2010-11-30 twu
* Makefile.dna.am, Makefile.gsnaptoo.am, Makefile.three.am, blackboard.c,
blackboard.h, gmap.c, gsnap.c, sequence.c, sequence.h: Implemented the
ability to uncompress gzip files by GSNAP
* README, config.site, configure.ac: Made changes to reflect a new zlib
option
2010-11-29 twu
* gsnap.c: Fixed bug in output to multiple files where GSNAP single-end
nomapping goes to stdout.
2010-11-24 twu
* get-genome.c: Fixed stdin input to get-genome for non-map requests
2010-11-22 twu
* Makefile.dna.am, Makefile.pmaptoo.am, Makefile.three.am: Added uinttable.c
and uinttable.h to pmap
* Makefile.dna.am, Makefile.gmaponly.am, Makefile.gsnaptoo.am,
Makefile.pmaptoo.am, Makefile.three.am: Added uinttable.c and uinttable.h
to gmap
* stage2.c: Fixed bug in determining overall grand winner
* sam.c: Moved read group field to be first
* iit-read.c, iit-read.h: Implemented print_comment option
* gmap.c: Providing nchrs to stage1 procedure
* gregion.c, gregion.h: Implemented extentstart and extentend for comparing
gregions. Added code for a Gregion cleaning step.
* stage1.c, stage1.h: Added hooks for a Gregion cleaning step
2010-11-18 twu
* gmap.c, pair.c, pair.h, stage3.c, stage3.h: Implemented --print-comment
for map output. Removed old code for universal coordinate IIT files.
* genome.h, stage3hr.h: Formatting changes
* goby.c, goby.h, gsnap.c, sequence.c, sequence.h: Changes made for new Goby
code
* substring.c: Always initializing trim_left and trim_right
* pdl_smooth.c, spliceturn.c, multiclean.c: Initial import into svn
* sam.c: Made fixes in printing mate information
* splicing-scan.c: Combining splice and terminal splicesites
* Makefile.dna.am: Removed hexamer-score.c and .h from extents_genebounds
* extents_genebounds.c: Added debugging information
* tally_expr.c: Using new interface to IIT_annotation. Providing option to
print gc-content.
* Makefile.dna.am: Removed tally_exclude from bin files
* Makefile.dna.am: Removed iit_pileup from bin files
* fopen.m4: Added _cv_ to all variable names
* Makefile.am: Added ACLOCAL_AMFLAGS
* VERSION: Updated version
* bootstrap.dna, bootstrap.gmaponly, bootstrap.gsnaptoo, bootstrap.three:
Using autoreconf. Added --install to some files to allow building from
svn.
* sam.c: For unpaired_uniq, performing sorting first and then selecting mate
for each end.
* sam.c: Restored null mate for unpaired_mult
* sam.c: Providing mate information in unpaired_mult
* iit-read.c: Printing tabs in SAM headers
* Makefile.dna.am, Makefile.gmaponly.am, Makefile.gsnaptoo.am,
Makefile.three.am: Changes to Makefile.am files
2010-11-17 twu
* gsnap.c: Added a flag for --no-sam-headers
* resulthr.c, resulthr.h: Added a printing command for resulttype
* sam.c: For unmapped reads, always providing a mate if available
* sequence.c, sequence.h, gmap.c, pair.c, pair.h, stage3.c, stage3.h: Added
ability to print GMAP alignments in SAM output format
* gsnap.c, substring.c, substring.h: Added --show-refdiff option
* README: Added further information about SNP-tolerant alignment and
wildcard SNPs.
2010-11-16 twu
* README: Made changes in instructions for -V and -v flags
* gsnap.c, iit-read.c, iit-read.h, sam.c, sam.h: Implemented SM and RG
fields.
* gsnap.c: Added warning about paired-end output in Goby
* goby.c, goby.h: Using new interface to sequence.h
* datadir.c: Minor formatting change
* get-genome.c, sequence.c, sequence.h: Handling printing of wildcard SNPs
2010-11-15 twu
* sequence.c, sequence.h: Changed name of procedure
* revcomp.c: Added flag for --byline
* reads_store.c: Fixed bug in freeing memory too early
* gsnapread.c: Reading quality string based on presence of third tab
* gsnap.c: Removed short versions of some flags
* sam.c: Using nmismatches_refdiffs in NM output
* stage3hr.c, stage3hr.h, substring.c, substring.h: Fixed trimming based on
SNPs. Computing different types of nmismatches.
* add_rpk.c, exonscan.c, genecompare.c, plotgenes.c, tally.c, tallygene.c:
Using new interface to IIT_annotation
* genome_hr.c, snpindex.c: Enabled representation of wildcard SNPs
* get-genome.c: Added -V flag to specify a directory for alternate genome
information
* substring.c, substring.h, gsnap.c, mapq.c, mapq.h, sam.c, stage1hr.c,
stage3hr.c, stage3hr.h: Added computation of mapping quality
2010-11-11 twu
* sequence.c, sequence.h: Fixes to printing of query sequences for failed
alignments
* goby.c, goby.h, gsnap.c: Always shutting down Protobuf if compiled in.
Calling gobyAlEntry_appendTooManyHits, even under quiet-if-excessive
option. Changes to flag descriptions.
* genome_hr.c, genome_hr.h, stage1hr.c, stage3hr.c, stage3hr.h, substring.c,
substring.h: Marking all mismatches by using query_compress,
genome_blocks, and snp_blocks.
2010-11-10 twu
* goby.c, goby.h, gsnap.c, sam.c, sam.h, stage3hr.c, stage3hr.h: Allowing
for three paired-end orientations instead of circular option. Added
--fails-as-input flag. Fixed issues with handling --failsonly option.
* Makefile.gsnaptoo.am: Added blank line
* Makefile.gmaponly.am: Added parserange.c and parserange.h
* iit-read.c, iit-read.h: Revised IIT_annotation to handle version 5 IIT
files
* iit_get.c: Using new interface to IIT_annotation
* genome.c, genome.h: Added procedures for returning ntcounts in a segment
* gmap.c: Using long options without short options for --version and --help
* pair.c: Fixed output of -f 8 format
* sequence.c, sequence.h: Added procedures for printing GSNAP queries
2010-11-09 twu
* goby.c, goby.h, Makefile.dna.am, Makefile.gsnaptoo.am, Makefile.three.am,
gsnap.c, stage3hr.c, stage3hr.h: Added functionality for Goby file formats
* configure.ac: Added hooks for Goby compile-time option
* config.site: Added information for Goby compile-time option
* README: Added comments for FASTQ files, -z flag, and Goby functionality
* gsnap_tally.c: Fixed problem with underflow in taking exp() of log
likelihood
* gsnap_tally.c: Added flag for using a constant quality score. Printing
1-p instead of p.
2010-11-08 twu
* get-genome.c: Making only a single open of genome or genomealt
2010-11-07 twu
* genome.c, get-genome.c: Using new interface to IIT_annotation
2010-11-04 twu
* gsnap_terms.c: Added flags and parameters for mincount, min_endlength, and
max_endlength.
2010-10-31 twu
* reads_get.c: Made several changes in parsing
* gsnap.c, sam.c, sam.h, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h,
substring.c, substring.h: Made SNP and splicesite parameters local to
Substring procedures
2010-10-29 twu
* gsnap_tally.c: Added -A flag for controlling printing of ref details.
Removed unused global parameters. Fixed bug in retrieving genomic
reference.
2010-10-28 twu
* sam.c: Added XS flag to indicate splice direction
* configure.ac: Additions needed for new libtool version
* bootstrap.gsnaptoo: Running full set of autoconf programs
* iit.test.in: Revised version of test
* config.guess, config.sub: New version of libtool programs
* config.guess, config.sub, ltmain.sh: Previous version of libtool programs
* gsnap_tally.c: Added computation of genotype probabilities
* indexdb_dibase.c, indexdb_dibase.h, setup.ref3positions.ok: Initial import
into SVN
2010-10-27 twu
* gsnap_tally.c: Sorting shifts and quality scores
* gsnap_tally.c: Keeping track of and reporting shifts and quality for
reference matches
* list.c: Added tentative code for dealing with NULL lists in List_to_array
* chop_primers.c: Using new interface to Sequence_print_header
* translation.c, translation.h: Added ability to start CDS from a given
position
* tally_expr.c: Added ability to show mincount
* tally.c, tally.h: Added functions Tally_mean_double and Tally_quantile
* seqlength.c, pair.c: Using new interface to Sequence_print_digest
* parserange.c: Fixed bug in returning coordstart
* md5-compute.c: Using new interface to MD5_print
* iit_store.c: Storing rest of header in annotation. Using new interface to
IIT_write.
* iit_get.c: Changed stats to compute mean over entire width, not just
non-zero positions
* gsnap_iit.c: Using new interface to Gsnapread_parse_line
* gsnap_tally.c: Subtracting 64 from quality scores, as standard for Illumina
* gsnap_extents.c, gsnap_splices.c, gsnap_terms.c: Using new interface to
Samread
* gsnap_tally.c: Printing quality scores relative to highest one seen
* sam.c: Changed separator for extra fields to be a tab, rather than a space.
2010-10-26 twu
* gsnap.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h, substring.c,
substring.h: Implemented trim-mismatch-score for controlling trimming
* samread.c, samread.h, gsnapread.c, gsnapread.h: Implemented retrieval of
quality string
* gsnap_tally.c: Printing mismatch information by position and quality
* gmapindex.c: Using new interface to IIT_write
* gmap.c: Protecting against calling List_to_array with an empty list
* get-genome.c: Fixed bug introduced by new default snps_mode
* diag.c: Protecting against call to List_to_array on an empty list
* stage3hr.c, stage3hr.h: Reversing quality string in GSNAP output when
necessary, and using quality shift.
* gsnap.c: Fixed memory leak when npaths is zero. Reversing quality string
in GSNAP output when necessary.
2010-10-25 twu
* gsnap_tally.c: Changed output format to show all signed query positions
* gsnap_tally.c: Changed output format to show all query positions
* gsnap_tally.c: Incorporated sam_tally into this source code
2010-10-24 twu
* splicetrie.h: Changed Splicetrie_dump
* splicetrie.c: Changed Splicetrie_dump. Handling case where pos5 == pos3
in short-exon splicing. Added debugging statements.
* stage1hr.c: Fixed a bug in handling one case of ambiguous splice ends in
short-exon splicing.
* splicetrie.c: Allowing only one mismatch at most for searching at ends in
short-exon splicing when ends are 16 nt or shorter.
* splicetrie.c: Combined Trie_new and Trie_output into a single procedure
2010-10-23 twu
* splicetrie.c: Removed debugging statement
* gsnap.c, splicetrie.c, splicetrie.h, stage1hr.c, stage1hr.h: Enabled
computation of splice tries on the fly
* gsnap.c, splicetrie.c, splicetrie.h: Divided Splicetrie_build process into
two steps, with one computing nsplicepartners.
* gsnap.c, splicetrie.c, splicetrie.h: Using unsigned ints rather than char
* to store splicestrings and compute tries.
2010-10-22 twu
* splicetrie.c: Ignoring cases where splice site has an N
* stage3hr.c: Changed assertion to use effective_chrnum rather than chrnum
* stage1hr.c: Using new interface to Splicetrie procedures. Revised
parameters for distant splicing.
* splicetrie.c, splicetrie.h: Checking short-end and short-exon splicing
against extension
* gsnap.c: Automatically setting pairmax if not specified for RNA-Seq
2010-10-20 twu
* stage3hr.c: Changed ambiguous splice procedure to remove longer splice
* stage1hr.c: Using new interfaces to Splicetrie procedures.
* splicetrie.c, splicetrie.h: Fixed various bugs. Implemented separate
procedures for short-ends and for longer ends (needed for short-exon
alignments).
2010-10-19 twu
* splicetrie.c, splicetrie.h, stage1hr.c: Checking entire subtree against
splicefrags when using alternate genome and reaching a non-leaf with no
string remaining. Removed unused parameters.
* stage1hr.c: Using new interface to Splicetrie_dump
* get-genome.c: Changed flags for using SNPs
* genome.c, genome.h: Added function Genome_fill_buffer_simple_alt
* splicetrie.c, splicetrie.h: Fixed use of nmismatches from splicefrags.
Fixed use of alternate genome. Using 4 bytes instead of 2 bytes for
reloffsets. Not using suboptimal separation.
2010-10-18 twu
* stage1hr.c, stage3hr.c, stage3hr.h, substring.c, substring.h: Removed
chrhigh from Substring_T and Stage3_T objects, and removed from segment
objects in stage 1.
* gsnap.c, splicetrie.c, splicetrie.h, stage1hr.c: Completed transition to
using splicetries. Removed unnecessary variables and code.
* genome_hr.c, genome_hr.h, gsnap.c, splicetrie.c, splicetrie.h, stage1hr.c,
stage1hr.h: Enabling use of splicefrags with splicetrie. Enabled
processing of alternate alleles.
2010-10-16 twu
* Makefile.dna.am, Makefile.gsnaptoo.am, gsnap.c, splicetrie.c,
splicetrie.h, stage1hr.c, stage1hr.h: Implemented tries for short-end
splicing
2010-10-15 twu
* gsnapread.c, gsnapread.h: Returning number of mismatches
* stage3.c, stage3.h, gmap.c: Added ability to specify where CDS begins
* get-genome.c: Added -A flag for dumping entire genome. Handling -m flag
correctly for stdin input.
2010-10-14 twu
* pair.c: Implemented printinf of coverage, identity, and phases in GFF
output
* iit-write.c, iit-write.h, iitdef.h: Implemented version 5 of IIT format,
which allows different pointer sizes for labels and annotations
* gsnap.c, iit-read.c, iit-read.h, md5.c, md5.h, resulthr.c, resulthr.h,
sam.c, sam.h, sequence.c, sequence.h, stage3hr.c, stage3hr.h, substring.c,
substring.h: Enabled printing of output into multiple files
* stage1hr.c: Allowing 1 mismatch in shortexon end, but requiring a
separation of 2 from next best alignment. Consolidated code into
find_left_splice and find_right_splice.
2010-10-13 twu
* gsnap.c: Changed default quality-shift to be 0
* stage3hr.c, stage3hr.h, substring.c, substring.h, sam.c: Implemented
calculation and printing of MD string
* gsnap_tally.c: Fixed bug in freeing data. Setting min_readlength to 0.
Using new interface to Gsnapread.
2010-10-06 twu
* spliceclean.c: Added ability to print excluded splices
* spliceclean.c: In resolving splice direction, using a 10-to-1 threshold
and checking adjacent splices if necessary.
* gsnap.c: Turned on indels by setting default indel-penalty to be 1.
2010-10-04 twu
* stage1hr.c: Added restriction on number of mismatches allowed in short
exons
2010-10-02 twu
* spliceclean.c: Added ability to resolve between competing splices based on
fwd and rev extents. Added ability to print endpoints and midpoints.
Added flag to bypass cleaning step.
* stage3hr.c: Allowing pair overlaps when splices are involved. Using new
way of computing low and high for pairs.
2010-09-30 twu
* gsnap_extents.c: Initial creation
* Makefile.dna.am: Added extents_genebounds
2010-09-29 twu
* spliceclean.c: Added ability to print runlengths and splicesites. Added
ability to filter based on uniqueness, concordance, or maxminsupport.
* extents_genebounds.c: Initial creation
2010-09-28 twu
* Makefile.dna.am: Added multiclean, gsnap_extents, and gsnap_terms
* gsnap_terms.c: Initial creation of gsnap_terms
2010-09-22 twu
* iit_dump.c: Implemented printing of IIT in runlength or integral output
* gsnap.c, sam.c, sam.h, sequence.c, stage3hr.c: Fixed handling of circular
reads and implemented printing of SAM output for circular reads.
* gmap.c, stage3.c, stage3.h, translation.c, translation.h: Added feature to
start protein coding sequence from first query position.
2010-09-20 twu
* splicegene.c: Added the ability to output genes as well as paths
2010-09-08 twu
* mem.c: Added comment about use of LEAKCHECK
* list.c: Removed unused variable
* iit-read.c, iit-read.h, iit_dump.c: Added sort functionality for iit_dump
* snpindex.c: Using -V flag to allow user to specify destination directory
* substring.c: Added check to avoid checking for mismatches past end of
string
* stage1hr.c: Simplified computation of leftbound and rightbound in
short-end splicing
* stage1hr.c: Using different stopi for novel and known splicing. Fixed
possible bug in reading past mismatches_left and mismatches_right. Fixed
calculation of chrend in finding right bound for short-end splicing.
2010-09-04 twu
* splicegene.c: Incorporated cappaths functionality
* splicegene.c: Removed global variables related to linear fitting
* splicegene.c: Fixed memory leaks. Added filtering based on mean number of
splices.
2010-09-03 twu
* splicegene.c: Removed global variables
* splicegene.c: Added ability to handle all chromosomes in a single run.
Fixed some memory leaks.
* gsnap.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h, substring.c,
substring.h: Introduced -V flag for specifying snpsdir, and now using -v
flag for indicating SNPs file. Removed geneprob option and procedures.
* stage1hr.c, stage3hr.c, substring.c, substring.h: Made terminals extend to
beginning and end of read, with trimming starting from there. Endtypes
based on presence of trimming.
2010-09-02 twu
* gsnapread.c, sam.c, stage1hr.c, stage3hr.c, stage3hr.h, substring.c,
substring.h: Added left and right endtypes to Substring_T object, and
using them for printing exact, substitution, and terminal alignments.
Renamed variables in Hittype_T enum. Added ambiguous alignments.
Restored usage of score in Stage3_remove_duplicates. Using number of
mismatches to compute nmatches in Stage3_T objects. Revised computation
of terminal alignments.
2010-09-01 twu
* gsnap.c, stage1hr.c, stage1hr.h: Introduced min_shortend as a parameter
with flag -l. The find_left_splice and find_right_splice procedures now
compete with extension.
2010-08-31 twu
* gmap.c, stage3.c, stage3.h: Introducing sense_filter in addition to
sense_try. Counting non-canonical introns, rather than canonical ones to
determine sense, and adding a small penalty for introns to bias against
short exons.
* stage1hr.c: Using new parameter to turn off concordant translocations with
terminal alignments. Clarified usage of query, queryuc_ptr, and queryrc.
* stage3hr.c, stage3hr.h: Added flag to control concordant translocations
* genome_hr.c, genome_hr.h: Fixed bug in handling fragments when query
length is 16. Removed query parameter from Genome_trim_left and
Genome_trim_right procedures.
* stage1hr.c: Stopped placing restrictions on stopi in finding splice ends.
Requiring minimum endlength for short end splicing.
2010-08-30 twu
* stage1hr.c: Fixed bug with donor, acceptor, and shortexons that were NULL.
Fixed logic with novel splice sites in local splicing.
2010-08-26 twu
* stage1hr.c: Fixed bug attempting to make shortexon of length 0
2010-08-25 twu
* splicing-scan.c: Initial import into SVN
* Makefile.util.am: Renamed revcomp program to rc
* Makefile.gsnaptoo.am: Added gsnapread.c and gsnapread.h for gsnap_tally
* Makefile.dna.am: Added rc and splicing-scan
* get-genome.c: Removed unused parameter
* gsnap_splices.c, gsnapread.c: Allowing program to handle short exon
alignments with multiple splices
* parserange.c: Added check for coordinate lengths that exceed 32 bits
* substring.c: Commented old location of sub: field in donor and acceptor
substrings
* stage3hr.c, stage3hr.h: Including chrhigh in substrings
* stage1hr.c: Including chrhigh in segments and substrings. Implemented
usage of splice distance in short exon alignments.
* gsnap_iit.c, gsnap_splices.c, gsnap_tally.c, gsnapread.c, gsnapread.h:
Gsnapread_parse_line returns information about types of endpoints
2010-08-24 twu
* substring.c, substring.h: Including chrhigh as a field in Substring_T
* sam.c, stage1hr.c, stage3hr.c, stage3hr.h, substring.c, substring.h:
Implemented shortexon alignment
* gsnap.c: Changed message for reading splicesites IIT file
* genome_hr.c: Fixed bug in using flags for shortend splicing
2010-08-23 twu
* gsnap.c: Enabled reading of a local splicesite file
2010-08-19 twu
* stage1hr.c: Added check for query_compress to be non-NULL before
find_terminals for single-end alignment
* stage1hr.c: Fixed bug where query_compress needed to be computed before
finding terminals
2010-08-18 twu
* iit_store.c: Using string_compare and string_hash functions from table.c
* iit-write.c: Moved position of free() statement
* iit-read.c: Fixed debugging output
* genome_hr.c, genome_hr.h, gsnap.c, stage1hr.c, stage1hr.h: Using
splicefrags to increase speed of finding short-end splicing
* stage3hr.c: Setting nindels to be zero for a terminal alignment
2010-08-13 twu
* gsnap.c, indexdb.c, indexdb.h, indexdb_hr.c, indexdb_hr.h, spanningelt.c,
spanningelt.h, stage1hr.c, stage1hr.h: Allowing GSNAP to run when
positions are read as FILEIO.
* stage1hr.c, stage3hr.c, stage3hr.h, substring.c, substring.h: Using
splicesites array rather than splicesites_iit in short-end splicing
* stage1hr.h: Increased MAX_QUERYLENGTH from 200 to 500
2010-08-09 twu
* indexdb_dump.c: Added file left out in SVN conversion
* indexdb_dump.c: Undo addition to wrong directory
* stage1hr.c: Removed requirement for nconcordant == 0 in deciding to
compute local splices and short-end splices.
2010-08-06 twu
* reads.c, reads_store.c: Eliminating labelorder in file format. Cleaned up
memory leaks.
2010-08-05 twu
* reads.c: Improved speed of dumping procedure
* reads.c, reads_get.c, reads_store.c: Allowing either 4-byte or 8-byte
label and read pointers
* Makefile.dna.am, reads.c, reads.h, reads_get.c, reads_store.c: Enabled
compression of reads
* Makefile.dna.am, reads.c, reads.h, reads_store.c: Using a div structure in
file format
* Makefile.dna.am, reads.c, reads.h, reads_dump.c, reads_get.c,
reads_store.c: Using our own file format for storing reads, rather than
BerkeleyDB
* iit-read.c: Fixed bug in fileio reading of annotations
* access.c, access.h, add_rpk.c, assert.c, assert.h, backtranslation.c,
backtranslation.h, bigendian.c, bigendian.h, blackboard.c, blackboard.h,
block.c, block.h, bool.h, boyer-moore.c, boyer-moore.h, cappaths.c,
changepoint.c, changepoint.h, chimera.c, chimera.h, chop_primers.c,
chrnum.c, chrnum.h, chrom.c, chrom.h, chrsegment.c, chrsegment.h,
chrsubset.c, chrsubset.h, cmet.c, cmet.h, cmetindex.c, color.c, color.h,
comp.h, complement.h, compress.c, compress.h, cum.c, datadir.c, datadir.h,
datum.c, datum.h, diag.c, diag.h, diagdef.h, diagnostic.c, diagnostic.h,
diagpool.c, diagpool.h, dibase.c, dibase.h, dibaseindex.c, doublelist.c,
doublelist.h, dynprog.c, dynprog.h, except.c, except.h, exonscan.c,
fopen.h, gbuffer.c, gbuffer.h, gdiag.c, geneadjust.c, genecompare.c,
geneeval.c, genome-write.c, genome-write.h, genome.c, genome.h,
genome_hr.c, genome_hr.h, genomepage.c, genomepage.h, genomeplot.c,
genomicpos.c, genomicpos.h, genuncompress.c, get-genome.c, getopt.c,
getopt.h, getopt1.c, gmap.c, gmapindex.c, gregion.c, gregion.h, gsnap.c,
gsnap_iit.c, gsnap_splices.c, gsnap_tally.c, gsnapread.c, gsnapread.h,
hint.c, hint.h, iit-read.h, iit-write.c, iit-write.h, iit_dump.c,
iit_fetch.c, iit_get.c, iit_plot.c, iit_store.c, iit_update.c, iitdef.h,
indexdb.c, indexdb.h, indexdb_hr.c, indexdb_hr.h, indexdbdef.h,
interval.c, interval.h, intlist.c, intlist.h, intlistdef.h, intpool.c,
intpool.h, intron.c, intron.h, lgamma.c, lgamma.h, list.c, list.h,
listdef.h, littleendian.c, littleendian.h, match.c, match.h, matchdef.h,
matchpool.c, matchpool.h, maxent.c, maxent.h, md5-compute.c, md5.c, md5.h,
mem.c, mem.h, memchk.c, nmath.c, nmath.h, nr-x.c, nr-x.h, oligo-count.c,
oligo.c, oligo.h, oligoindex.c, oligoindex.h, oligop.c, oligop.h,
orderstat.c, orderstat.h, pair.c, pair.h, pairdef.h, pairingcum.c,
pairingflats.c, pairinggene.c, pairingstrand.c, pairingtrain.c,
pairpool.c, pairpool.h, parserange.c, parserange.h, pbinom.c, pbinom.h,
pdldata.c, pdldata.h, pdlimage.c, plotdata.c, plotdata.h, plotgenes.c,
plotgenes.h, pmapindex.c, random.c, random.h, rbtree.c, rbtree.h,
rbtree.t.c, reader.c, reader.h, reqpost.c, reqpost.h, request.c,
request.h, result.c, result.h, resulthr.c, resulthr.h, revcomp.c, sam.c,
sam.h, sam_tally.c, samread.c, samread.h, scores.h, segmentpos.c,
segmentpos.h, segue.c, separator.h, seqlength.c, sequence.c, sequence.h,
smooth.c, smooth.h, snpindex.c, spanningelt.c, spanningelt.h,
spliceeval.c, splicegene.c, splicegraph.c, splicescan.c, splicing-score.c,
stage1.c, stage1.h, stage1hr.c, stage1hr.h, stage2.c, stage2.h, stage3.c,
stage3.h, stage3hr.c, stage3hr.h, stopwatch.c, stopwatch.h, subseq.c,
substring.c, substring.h, table.c, table.h, tableint.c, tableint.h,
tallyadd.c, tallyflats.c, tallygene.c, tallyhmm.c, tallystrand.c,
translation.c, translation.h, trial.c, trial.h, types.h, uintlist.c,
uintlist.h, uinttable.c, uinttable.h: Added keyword property for Id
* tally_expr.c: Fixed bug in reporting number of exons and in skipping exons
2010-08-04 twu
* stage3hr.c: Improved debugging statements
* stage3.c: Fixed bug when ngap was larger than gaps in dual_break
* samread.c: Added old but unused code
* iit-read.c, iit-read.h: Added function IIT_get_typed_signed
* gsnap_splices.c: Added parameter for shortsplicedist
* gsnapread.c, gsnapread.h: Added function Gsnapread_accession
* gsnap_tally.c: Fixed bug in using advance_one_hit
* Makefile.dna.am, gsnap_iit.c: Created gsnap_iit
* Makefile.dna.am, reads_get.c, reads_store.c: Created programs reads_store
and reads_get.
* splicefill.c: Using median filtering as a first step
* splicefill.c: Removed probabilistic calculations
* splicefill.c: Version with probabilistic calculations
2010-08-03 twu
* splicefill.c: Using tally information to find edges. Using Poisson and
exponential models.
* iit-read.c, iit-read.h, snpindex.c: Providing messages about chromosomes
in the genome and in the SNPs IIT file
2010-08-02 twu
* Makefile.dna.am, splicefill.c: Initial creation of splicefill program
2010-08-01 twu
* gsnap_tally.c, gsnapread.c, gsnapread.h: Able to separate low and high
ends of paired-end reads
2010-07-31 twu
* Makefile.dna.am, gsnap_tally.c: Using parsing functions in gsnapread.c
* Makefile.dna.am, gsnap_splices.c, gsnapread.c, gsnapread.h: Moved parsing
functions to gsnapread.c
2010-07-30 twu
* spliceclean.c: Preserving information in rest of header
* gsnap_splices.c: Printing maxminsupport and nconcordant information
* Makefile.dna.am, spliceclean.c: Enabled spliceclean to handle all
chromosomes. Using tables to store splices.
* spliceclean.c: Fixed bugs in parsing input
2010-07-29 twu
* spliceclean.c: Added procedure to free memory
* gsnap_splices.c: Fixed bug from freeing table keys too early
* gsnap_splices.c: Enabled program to handle all chromosomes in a single run
* Makefile.dna.am, gsnap_splices.c, iit-read.c, uinttable.c, uinttable.h:
Using a table to store splice sites in gsnap_splices.c
2010-07-28 twu
* gsnap_splices.c: Removed -F and -R flags for separate strands
* Makefile.dna.am, gsnap_splices.c, sam_splices.c: Integrated sam_splices.c
and gsnap_splices.c into a single file
* mem.c, genome.c: Removed unused variable
* iit-read.c, substring.c: Removed unused code
* blackboard.c: Returning bool type explicitly
* sequence.c: Resolving compiler warning about type casting
* gsnap_splices.c, sam_splices.c, spliceclean.c: Allowing -s flag to print
annotations about known splicesites
* struct-stat64.m4: Added missing m4 file
* Makefile.am, cvs2cl.pl, svncl.pl: Replace cvs2cl.pl with svncl.pl
* CVSROOT: Removed CVSROOT directory
2010-07-27 twu
* assert.h: Changed compiler variable
* VERSION, config.site.rescomp.prd, index.html: Revised for 2010-07-27
release
* bootstrap.dna: Using autoreconf
* README: Modified statement about -m flag and about types in SNP IIT files
* MAINTAINER: Added statement about assert.h
* tally_expr.c: Standardized output format
* gsnap.c: Made -q flag work correctly for single-thread mode. Printing run
time at end of each run.
* gmap.c: Calling correct exception for a sigtrap
2010-07-26 twu
* Makefile.dna.am, Makefile.gsnaptoo.am: Using datadir in snpindex
* iit-read.c, iit-read.h: Fixed IIT_index function
* snpindex.c: Using datadir. Fixed error messages.
* stage3hr.h, substring.c, substring.h: Removed fields for halfintrons.
* stage3hr.c: Fixed bug in removing duplicates. Removed fields for
halfintrons.
* stage1hr.c, stage1hr.h: Implemented short-end splicing for known splice
sites
* mem.c: Changed monitoring statement to print only in debug mode
* iit-read.c, iit-read.h: Added procedure for typed and signed intervals
based on divno
* gsnap.c: New interface to stage 1 procedures
2010-07-23 twu
* VERSION, index.html: Revised for 2010-07-23 release
* spliceclean.c: Processing forward and reverse splices separately
* gsnap.c: Fixed bug where -a flag modified trim_maxlength
* assert.h: Turned off assertion checking
* Makefile.dna.am: Added tally_exclude
* substring.c: Modified debugging statements for trimming
* stage3hr.c: Added debugging statements
* iit-read.c, iit-read.h: Added function IIT_interval_sign
2010-07-22 twu
* tally_expr.c: Allowing printing over all positions
* tally_expr.c: Allowing multiple tallies
2010-07-21 twu
* gsnap.c, sam.c, sam.h, sequence.c, sequence.h: Fixed handling of quality
scores to match that of sequence. Added -j flag to specify amount of
shift for quality scores.
* setup1.test.in: Putting test chromosome in subdirectory
* setup2.test.in: Revised test for new gmapindex, but test not being used
currently
* iit.test.in: Not testing for diff in iittest.iit
* align.test.ok, coords1.test.ok, map.test.ok: Changed expectations to match
latest program output
* iittest.iit.ok: Using latest IIT version
* Makefile.am: Using ref3offsets and ref3positions instead of idxoffsets and
idxpositions
* acx_mmap_fixed.m4, acx_mmap_variable.m4: Added stdlib.h and unistd.h
headers
* bootstrap.dna, bootstrap.gsnaptoo, bootstrap.three,
config.site.rescomp.prd, config.site.rescomp.tst, gsnap-fetch-reads.pl,
gsnap-fetch-reads.pl.in, gsnap-remap.pl, gsnap-remap.pl.in, cum.c,
dibaseindex.c, geneadjust.c, pairingtrain.c, splicegraph.c, tallyadd.c,
tallygene.c, tallystrand.c: Initial import into CVS
* config.site.gne: Removed old config file
* acinclude.m4: Including builtin m4 code
* MAINTAINER: Added notes about checking Bigendian behavior
* archive.html, index.html: Revised for 2010-07-20 release
* configure.ac: Better checking for VERSION
* bootstrap.pmaptoo: Added --force flag
* bootstrap.gmaponly: Added autoreconf step
* README, VERSION: Changed for 2010-07-20 release
* gmap_process.pl.in: Removed check for contig version
* gmap_update.pl.in: Not updating chromosome or contig IIT files
* gmap_setup.pl.in: Providing -q and -Q flags for GMAP and PMAP indexing
intervals.
* gsnap_splicing.pl: Program is superseded by C program gsnap_splices
* gsnap_splicing.pl: Various changes. Program is superseded by C program
gsnap_splices.
* gmap_compress.pl.in, gmap_reassemble.pl.in, gmap_uncompress.pl.in,
md_coords.pl.in: Using "use warnings" instead of "-w" flag
* fa_coords.pl.in: Handling duplicate occurrences of a chromosome. Limiting
number of warnings.
* Makefile.am: Added gmap_update
* gmap.c, pair.c, pair.h, stage3.c, stage3.h: Implemented -f 4 GFF estmatch
format based on patch from Shaun Jackman and Eoghan Harrington of British
Columbia Genome Sciences Centre.
* chimera.c: Commented out problematic code, to be resolved later
* get-genome.c: Fixed coordinates when retrieving map file contents
* pairingstrand.c, tallyhmm.c, geneeval.c: Using new Parserange_universal
function
* tally.c, tally.h: Treating counts as long ints
* splicegene.c: Changed algorithm
* spliceeval.c: Removed unused code
* plotgenes.c, plotgenes.h: Several changes, including trying to resolve
fatal errors
* pdldata.c, pdldata.h: Implemented Pdldata_new and Pdldata_write
* pairinggene.c: Counting found splices as flats
* pairingflats.c: Changed algorithm for finding flat regions
* pairingcum.c: Treating high and low reads separately
* oligo-count.c: Using new interface to Reader_new.
* lgamma.c, lgamma.h: Handling counts as long ints
* hint.c, hint.h: Changed models
* genecompare.c: Separate output for forward and reverse chromosome strands.
* gdiag.c: Removed some output. Using new interfaces to IIT_read.
* dibase.c, dibase.h, exonscan.c: Change in algorithm
* chimera.c: Using Path_matchscores instead of Stage3_matchscores
* cappaths.c: Using xintercepts instead of slopes
* boyer-moore.c, boyer-moore.h: Added procedures for chop_primers.c
* add_rpk.c: Change of output format
2010-07-20 twu
* stage1hr.c: Tightened requirements further for splice site probabilities
on distant splicing.
* stage3hr.c: Using nmatches to filter pairs containing terminal alignments
* gsnap.c: Changed advice on RNA-Seq settings for -m.
* archive.html, index.html: Released version 2010-03-10
* substring.c, substring.h: Computing nmatches directly
* stage3hr.h: Removed score parameter from Stage3_new_terminal
* stage3hr.c: Selecting best among terminal alignments. Computing nmatches
directly.
* stage1hr.c: Changed algorithm for finding terminal alignments. Requiring
distant splicing to have high splice probabilities.
* sam_splices.c: Computing readlengths on each end of splice separately
* gsnap.c, gsnap_splices.c: Added debugging code
2010-07-19 twu
* stage1hr.c: Using sequences as numeric in some cases
* maxent.c, maxent.h: Added procedures to handle sequences as numeric
* gsnap.c: Added a comment to the --help message
* genome_hr.c, genome_hr.h: Added a procedure to retrieve a dinucleotide
* genome.c, genome.h: Added a procedure to retrieve sequences as numeric
* Makefile.dna.am, Makefile.gmaponly.am, Makefile.gsnaptoo.am,
Makefile.pmaptoo.am: Revised files and programs as needed
2010-07-16 twu
* stage3hr.c: Requiring that dual translocations be concordant only across
the same two chromosomes.
* smooth.c: Conserving listcells where possible
* oligoindex.c, oligoindex.h: Removed computation of fingerprint
* list.c, list.h: Implemented List_transfer_one and List_push_existing
* gsnap.c: Performing trimming by default
* dynprog.c: Ensuring that finalscore is assigned in Dynprog_genome_gap.
* stage3hr.c, substring.c, substring.h: Providing a minlength parameter to
Substring_new, so end indels do not get eliminated.
* chop_primers.c: Initial import into CVS
* sam_tally.c: Trimming uses -3 for mismatches and resets negative scores to
zero. Handling hard clipping.
* gsnap_tally.c: Trimming uses -3 for mismatches and resets negative scores
to zero
* gsnap_splices.c: Sorting splices using both ends.
* sam_splices.c: Handling AT-AC introns. Sorting splices using both ends.
* samread.c, samread.h: Returning acc in parsing line
* sam.h: Renamed NOT_PRIMARY bit
* sam.c: Implemented hard clipping of sequences for SAM output. Enabled
printing of distant splices onto two separate lines. Using NOT_PRIMARY
bit in flag.
* sequence.c, sequence.h: Implemented hard clipping of sequences for SAM
output
* stage3.c: Removed unused procedures. Conserving listcells when possible.
* stage2.c: Removed unused procedures
* gmap.c: Removed references to Intpool_T
* pairpool.c: Setting initial value for state
* matchpool.c, matchpool.h: Implemented Matchpool_free_memory procedure
* mem.c, mem.h: Added procedures for computing memory usage
* stage3hr.c, substring.c, substring.h: Now trimming from both ends of
terminal alignment. Explicitly specifying which ends to trim.
* substring.h: Replaced Substring_T with T
* substring.c: Allowing Substring_new to return NULL if trimmed alignments
are poor. Replaced Substring_T with T. Resetting score to zero when it
becomes negative in trimming.
* stage1hr.c, stage3hr.c: Allowing Substring_new and Stage3_new to return
NULL if trimmed alignments are poor.
* substring.c: Changed mismatch score from -1 to -3 for trimming
* stage1hr.c, stage3hr.h: Added notion of ambiguous splices.
* stage3hr.c: Added notion of ambiguous splices. Fixing removal of
duplicates.
2010-07-14 twu
* stage3hr.c, stage3hr.h: Implemented Stage3_substring_low
* samread.c: Added debugging comments
* sam.c, sam.h: Moved flag constants to sam.h. Using Stage3_substring_low
to print chromosomal pos.
* sam_splices.c: Simplified loop
* gsnap_splices.c: Having lines_gc return NULL
* gsnap_tally.c: Fixed trimming. Turning off trimming by default.
* sam_tally.c: Initial import into CVS
* sam_splices.c, samread.c, samread.h: Fixed bug in specifying allowed
dinucleotides. Moved parsing procedures to samread.c.
2010-07-13 twu
* sam_splices.c: Initial import into CVS
2010-07-10 twu
* spliceclean.c: Changed variable names
* stage2.c, stage2.h, stage3.h: Removed stage2 fingerprint
* gmap.c: Added freeing of pairpool and diagpool memory at certain intervals.
* pair.c, pair.h, stage3.c: Moved HMM code from pair.c to stage3.c
* pairpool.c, pairpool.h: Implemented Pairpool_free_memory function
* diagpool.c, diagpool.h: Implemented Diagpool_free_memory function
* gsnap.c: Added ability to remove adapters from paired-end reads.
Providing option for maxlength on trimming.
* gmap.c: Using Stage2_scan method to rank gregions. Providing additional
diagnostic options.
* diag.c, diag.h, diagpool.h: Added ability to allocate memory for
diagonals, rather than using diagpool
* tally_expr.c: Fixed bug in using IIT index
* substring.h: Added handling of terminal reads
* substring.c: Using trimming maxlength. Fixed printing of sequences with
adapters.
* stage3hr.c: Fixed identification of duplicates. Using total matches to
compare results, rather than score.
* stage3.c, stage3.h: Using an HMM to find bad sections and fixing resulting
dual breaks.
* stage2.c, stage2.h: Added Stage2_scan procedure. Providing diagonals for
diagnostic purposes. Computing a fingerprint.
* stage1.c: Using a boolean to see if weight exists rather than depending on
floating point value
* sequence.h: Added handling of finding adapters. Computing sequence
quality for trimming.
* sequence.c: Fixed bug where fastq quality line begins with ">". Added
removal of adapters from paired-end data.
* sam.h: Removed genome from argument lists
* sam.c: Fixed bugs in coordinates, epecially involving trimmed reads.
Handling terminal reads.
* result.c, result.h: Added ability to report intermediate gregions or
diagonals
* oligoindex.h: Added computation of fingerprint
* oligoindex.c: Added necessary clearing of oligoindex.
2010-07-09 twu
* pairdef.h, pairpool.c: Added Pair_goodness_hmm procedure.
* pair.c, pair.h: Added Pair_goodness_hmm procedure. Added printing of
stage2 fingerprint.
* orderstat.c: Removed reliance on a floating point equality
* mem.c, mem.h: Added leak check procedures
* match.c, match.h, matchdef.h: Using a boolean to record whether weight is
zero or not, rather than relying on floating point
* indexdb_hr.c: Added comment
* indexdb.c: Fixed printf procedure
* iit-read.h: Removed unused IIT_print prototype
* iit-read.c: Fixed print_record procedure
* gsnap_tally.c: Fixed trimming procedure. Added reference nucleotide in
all lines. Fixed processing of all chromosomes.
* gsnap_splices.c: Fixed parsing. Made uniquep false by default. Added
info about nextensions and nunique.
* gregion.c, gregion.h: Added fields ncovered and source, plus function
Gregion_cmp
* get-genome.c: Removed unused function print_map
* genome.c, genome.h: Added function Genome_get_char
* dynprog.c: Added space for formatting
* stage1hr.h: Setting a maxlength on trimming
* stage1hr.c: Finding terminals rather than halfintrons. Fixed case where
splice ends are adjacent in genome.
2010-07-02 twu
* stage3hr.h: Added support for a terminal alignment.
* stage3hr.c: Added support for a terminal alignment. Removed
halfintron_support field.
2010-05-28 twu
* iit_store.c: Fixed issues with removing and re-inserting null divstring.
2010-05-26 twu
* stage1hr.c: Added trim_maxlength. Added nmismatches to halfintron
alignments.
* stage3hr.h: Added trim_maxlength.
* stage3hr.c: Added trim_maxlength. Checking pairlength on samechr_single
to see if concordant.
2010-05-21 twu
* iit-read.c, iit-read.h, stage3.c: Fixed printing of chromosome in map
results
2010-05-20 twu
* stage3hr.c: Finding concordant pairs against translocations with chrnum ==
0, by making copies for each chrnum and storing in effective_chrnum.
2010-05-17 twu
* substring.c, substring.h: Added halfintron support field.
* stage3hr.c, stage3hr.h: Implemented sense consistency in paired-end
alignments
* stage1hr.c: Fixed bugs in previous implementation of half introns
2010-05-16 twu
* stage1hr.c, stage3hr.c: Implemented new way of handling half introns, by
storing best half intron for sense and for antisense
2010-05-14 twu
* resulthr.c, resulthr.h, stage1hr.c, stage3hr.c, stage3hr.h: Added
procedure for finding samechr pairs if no concordant ones found. Revised
result types to include PAIREDEND_SAMECHR_SINGLE and
PAIREDEND_SAMECHR_MULTIPLE.
2010-05-13 twu
* stage1hr.c: Added conditional compilation statements for filtering
halfintrons
* gsnap.c, stage3hr.c, stage3hr.h: Handling failsonly and nofails flags for
paired-end data. Printing FASTQ format for failsonly on single-end data.
2010-04-16 twu
* iit-write.c: Fixed bug in freeing data when number of intervals is zero
2010-04-12 twu
* iit-read.c: Commented out IIT_index function
* sam.c: Fixed situation where query has no mapping and mate is an
interchromosomal splice
2010-04-05 twu
* tally_expr.c: Initial import into CVS
2010-04-02 twu
* iit_get.c: Added allele information to -T option
2010-03-24 twu
* gsnap.c, sequence.c, sequence.h: Implemented processing of FASTQ files
* gmap.c: Using new interface to blackboard.c
* blackboard.c, blackboard.h: Added input2 to Blackboard_T object
* stage3hr.c, stage3hr.h: Fixed classification of paired-end reads when one
or both ends have a translocation.
2010-03-09 twu
* stage3hr.c: Revised half_intron_score. Using that score when comparing
overlapping half_introns with one another.
* gsnap.c, stage1hr.c, stage1hr.h: Added parameter for
min_distantsplicing_identity
* stage1hr.c: Providing querylength information when making Stage3_T splice
objects
* stage3hr.c, stage3hr.h: Adding a penalty to half-intron alignments based
on the amount of sequence that was not aligned.
* stage3hr.c: Changed output for samechr results
* substring.c: Printing sub:0 instead of exact
* stage1hr.c: Checking for exact matches that cross chromosomal boundaries
2010-03-08 twu
* resulthr.c: Making all paired reads of type concordant
* stage3hr.c: Added printing of samechr as a special case of
PAIREDEND_AS_SINGLES_UNIQUE.
* sam.h: Added mate information to nomapping result.
* sam.c: Removed unused code. Fixed printing of query string. Added mate
information to nomapping result.
* gsnap_tally.c: Handling new output format for GSNAP
* gsnap.c: Using new interface for SAM_print_nomapping
* README: Added more information about GSNAP features and output
2010-03-04 twu
* iit-read.c, iit-read.h: Added function IIT_dump_sam
* gsnap.c: Renamed resulttypes
* resulthr.c, resulthr.h: Added resulttype PAIREDEND_AS_SINGLES_UNIQUE
* substring.c, substring.h: Added function Substring_match_length
* stage3hr.h: Computing chrnum, chroffset, genomicstart, and genomicpos at
Stage3_T level for splices.
* stage3hr.c: Pairing up at each successive score level. Computing chrnum,
chroffset, genomicstart, and genomicpos at Stage3_T level for splices.
* stage1hr.c: Fixed bug allowing deletion to extend past genomicpos 0.
Fixed cases where known splicing occurs near end of sequence. Removing
duplicate hits before pairing up ends.
* sam.c: Made multiple changes to generate correct SAM output
2010-03-01 twu
* substring.c, substring.h: Removed unnecessary parameters during printing
* stage3hr.h: Removed unnecessary parameters.
* stage3hr.c: Added support information to splices, and using it to select
best half introns. Removing unnecessary parameters during printing.
Checking for abort in pairing process, based on local counts.
* stage1hr.c: Added support information in making splices. Not checking for
sufficiency for half introns. Using an abort_pairing_p flag, and when
true, recomputing ends as singles.
* splicing-score.c: Using parserange module. Allowing range to be specified.
* iit-read.h: Removed unused parameter
* iit-read.c: Changed format strings to eliminate compiler warnings
* genome.c: Added parentheses around some conditional statements
2010-02-26 twu
* stage3.c: Removed unused parameters from print functions
* sequence.c: Handling sequence at end of file without line feed
* reader.c: Commented out unused code
* gsnap.c: Added flags for SAM and quiet-if-excessive. Dropped flags for
probability thresholds.
* datadir.h: Added external interface for a function
2010-02-25 twu
* sam.c: Fixed bug where numbers of deletions was being reported as a
negative number
* genome_hr.c, genome_hr.h, stage1hr.c: Removed computation of snpdiffs by
genome_hr
* genome_hr.h, genome_hr.c: Added code for performing trimming. Using
macros for clearing and setting outside regions in start and end blocks.
* stage1hr.c: Added trimming of splice ends to avoid extending into region
of many mismatches. Saving all splice ends that have sufficient sequence
and probability support.
2010-02-23 twu
* substring.c: Fixed printing of splices. Fixed bugs in retrieving SNP
information.
* stage3hr.h: Returning found score from all functions that create a
Stage3_T object
* stage3hr.c: Fixed computation of pair length. Fixed search for concordant
pairs.
* stage1hr.h: Removed unused parameters
* stage1hr.c: Using found score rather than found number of mismatches.
Fixed cases where indel pos was outside of query range.
* spanningelt.c: Fixed typecast error
* sam.c, sam.h: Implemented SAM output for paired-end reads
2010-02-12 twu
* resulthr.c, resulthr.h, stage1hr.c, stage3hr.c, stage3hr.h, substring.c,
substring.h: Changed output format to have separate columns for alignment
information and pair information. Standardized output routines. Three
categories for paired-end reads: concordant, samechr, and unpaired.
2010-02-11 twu
* sam.c, sam.h, stage1hr.c, stage3hr.c, stage3hr.h, substring.c,
substring.h: Rearranged and cleaned up code for making substrings
2010-02-10 twu
* gmap_process.pl.in: Removed code that removed version numbers on accessions
2010-02-03 twu
* indexdb.c: Fixed string formatting
* snpindex.c: Fixed some printing statements
* get-genome.c: Changed call to parserange to match new interface
* uintlist.c, uintlist.h: Added Uintlist_find command
* table.c, tableint.c: Added stdlib.h header file
* stage3.h: Added genome to print_alignment for splice sites scores in output
* stage3.c: Allowing null gaps again
* stage2.c: Added separate data types for a 1-dimensional matrix and
2-dimensional matrix representation
* stage1hr.c: Prevented splicing unless both dinucleotides are present
* stage1.c: Removed extensions of gregions
* sequence.c: Commented out unused functions
* resulthr.c, resulthr.h: Renamed result type to PAIRED_AND_PAIRABLE
* parserange.c, parserange.h: Implemented parse_query function
* pair.c, pair.h: Added donor and acceptor scores to output
* orderstat.c, orderstat.h: Added functions for long int
* oligoindex.c, oligoindex.h: Added parameter oned_matrix_p
* nr-x.h: Added ppois functions
* nr-x.c: Added ppois functions. Fixed bug in pbinom for zero observed
counts.
* list.c, list.h: Rewrote function for List_insert
* intlist.c: Handling case of empty list better for conversion to string
* interval.c, interval.h: Added functions for sorting intervals by position
* indexdb.c: Added debugging statements
* iit_plot.c: Using new interface to Genome_new
* iit_get.c: Implemented statistics function. Using long int for tally
IITs. Using parserange module.
* iit-read.h: Added function for divlength
* iit-read.c: New implementation of sorting of intervals by position
2010-02-02 twu
* gmapindex.c: Increased expected table size for number of chromosomes.
Stopping warning messages after 100 printed.
* gmap.c: Added genome parameter to Stage3_print_alignment
* get-genome.c: Using parserange module. Implemented flanking segments.
* genome_hr.c: Removed unused variables for certain compile-time conditions
* genome-write.c: Stopping warning messages after 50 are printed
* gdiag.c: Formatting changes
* except.c: Using pointers to exception frame objects
* dynprog.c: Reduced PAIRED_OPEN penalty from -24 to -18
* diag.h: Added function Diag_range
* diag.c: Reduced EXTRA_BOUNDS parameter
* datadir.c: Fixed bug where insufficient buffer space was provided for one
string
* backtranslation.h: Removed void in formal parameter lists
* backtranslation.c: Casting character array indices to ints
* splicegene.c: Attempted to find genebounds on all sites
2010-02-01 twu
* splicegene.c: Implemented finding and reporting of alternate splice forms
* splicegene.c: Differentiated donor and acceptor sites. Handling reverse
strand in reverse direction. Noting conflicts when either endpoint is
close to an endpoint on the other.
2010-01-31 twu
* splicegene.c: Completely new rewrite based on pairinggene.c. Attempt to
assign genebounds based on tally high and tally low.
2010-01-30 twu
* splicegene.c: Using tally high and low. Added hooks for alternate splice
site.
* spliceclean.c: Performing validation based on ratio of count to maxcount
over region
2010-01-29 twu
* spliceclean.c: Using less memory. Attempted validation of splices based
on envelope.
* spliceclean.c: Initial import into CVS
* pairingcum.c: Implemented filtering based on significance at endpoints
2010-01-27 twu
* pairingcum.c: Added computation on floors as well as ceilings
2010-01-25 twu
* pairinggene.c: Testing flat regions against the splice IIT to determine if
they are intron-like. Also adding splice edges to the original list.
Splices will therefore need to be filtered.
* pairinggene.c: Reverted to previous version using only observed GSNAP
splices
* pairinggene.c: Improved algorithm for distinguishing between intergenic
flats and intron flats.
2010-01-24 twu
* geneeval.c: Initial import into CVS
* pairinggene.c: Improved algorithm for detecting intergenic regions. For
flats, we can use a loose criterion without a level threshold, because of
the ordering constraint. We are using both edges from flats and from
gsnap splices. We added a procedure for distinguishing between intergenic
regions and long exons based on the counts_tally.
* pairinggene.c: Using iblocks instead of nblocks to control exon segments,
so essentially all combinations of introns are considered
* pairinggene.c: Reading in edges from the splices_iit file, presumably
after filtering
* pairinggene.c: Attempt to get more edges by looking up splice edges when a
flat does not yield clean ones
* cappaths.c: Added analysis of slopes and attempt to find a flat region
* pairinggene.c: Fixed bug with negative unsigned int
* pairinggene.c: For objective function, using the count of observed splices
from GSNAP.
* pairinggene.c: Eliminated concept of an eblock (or exon block). Trying
all intron combinations, since intergenic blocks are sufficient to contain
the search space.
* pairinggene.c: Fixed bug where an up was a terminal, which hid the
downstream down. Added some debugging code.
* pairinggene.c: Added checking of edges based on genome splice sites
2010-01-23 twu
* pairinggene.c: Made intergenic regions go between flats, and increased the
length requirement. Using auto_exonlength for adding exons.
* pairinggene.c: Restricted intergenic blocks to be between adjacent down to
up edges
* pairinggene.c: Reordered procedures to minimize memory usage
* pairinggene.c: Implemented a new algorithm for constructing the graph,
using various blocks and building the graph in stages
* tally.c: Added functions for the median and for adding a runlength to an
existing count
* pairinggene.c: Fixed error in formula for computing down edge
* splicing-score.c: Initial import into CVS
* pairingcum.c: Fixed a bug where the cum was being put at the wrong
position, causing the down edge to be 1 position too small.
* pairinggene.c: Implemented trimming of ends
* pairinggene.c: Implemented a new test for intergenic regions based on
finding a long flat region in the counts, which should not happen in an
exon.
2010-01-22 twu
* pairinggene.c: Added a test for sharpness based on an area ratio
* pairinggene.c: Fixed dynamic programming procedure
* pairinggene.c: Keeping a min-max test on whether introns are acceptable,
but using mean levels of introns and exons for scoring.
* pairinggene.c: Using zero-based check on pairingfull to test for
intergenic regions. Added a greedy addition of introns.
* pairinggene.c: Attempt to use pairing full information and gradual
downsloping to find UTRs.
* pairinggene.c: Using intron level minus exon level to determine edges with
greater sensitivity. Implemented scores as double, rather than int,
although currently using mincount and maxcount.
* pairinggene.c: Changed from onepath dynamic programming to multiple paths
with terminals. Using explicit objects for exons and introns.
* pairinggene.c: Implemented finding of initial ups
2010-01-21 twu
* pairinggene.c: Initial import into CVS. Dynamic programming based on
splicegene.c
2010-01-20 twu
* spliceeval.c: Implemented computation of reception zone and init/term
status of splice sites
* gsnap.c, stage1hr.c, stage3hr.c, stage3hr.h: Implemented trimming of ends
of sequences
* stage1hr.c: Allowing GC-AG splicing as well as GT-AG. Using a sliding
scale of splice site probabilities based on alignment support.
* stage3hr.c, stage3hr.h: Added code for using a geneprob IIT file to assist
in finding splice sites
* gsnap.c: Added a -g flag for using a geneprob IIT file to assist in
finding splice sites
2010-01-19 twu
* tally.c, tally.h: Added function Tally_mean()
* tallyflats.c: Analyzing both fwd and rev tallies and storing in a single
IIT file
* splicegene.c: Using variability in pairing.unk rather than tallyflats to
determine intragenic regions
* spliceeval.c: Computing slopes internally, rather than relying upon
pairingflats
* pairingflats.c: Added median smoothing
* lgamma.c, lgamma.h: Added ppois function
* genecompare.c: Initial import into CVS
* geneeval.c: Changed name to genecompare.c
* geneeval.c: Added ability to handle comment lines in gene
2010-01-17 twu
* lgamma.c, lgamma.h, random.c, random.h, pairingstrand.c, tallyflats.c:
Initial import into CVS
* tally.c, tally.h: Added functions
* geneeval.c: Printing goldstandard information in comment line
* cappaths.c: Using pairing fwd and rev iits
* spliceeval.c: Removed unused code. Added procedures for merging
pairingflats.
* splicegene.c: Added a check for validity tally flats by looking at tally
information
* tallyflats.c: Keeping track separately of zero regions and flat regions.
Changed parameters.
* gsnap_splices.c: Removed unused code
* gsnap_tally.c: Added flags for picking specific strands and for forced
trimming at ends
* pairingflats.c: Storing regions and then printing them. Have three
states, for zero, flat, and bumpy.
* pairingcum.c: Print all run lengths, even those with level 0
* splicegene.c: Using tallyflats to determine boundaries for donor to
acceptor
* splicegene.c: Removed donorprob and acceptorprob. Recording and printing
all extra information from each splice. Removed unused paths code.
2010-01-16 twu
* pairingflats.c: Initial import into CVS
* spliceeval.c: Added a intron_transition buffer at the ends of each intron,
where level changes are ignored.
2010-01-15 twu
* splicegene.c: Using a gap test on pairing IITs to determine whether to
link donor to previous acceptor.
* spliceeval.c: Now computing statistics based on edge finding using Poisson
model and number of consecutive zeroes.
* spliceeval.c: Printing mean pairing levels of each splice
2010-01-13 twu
* tallyhmm.c: Integrated parserange and tally modules. Removed hints.
Added edge detection. Simplified state model.
2010-01-12 twu
* tally.c, tally.h: Implemented an exon test and a scanning solution for
pairing information.
* splicegene.c: Using an exon test to determine if we can join splices
* littleendian.c, geneeval.c: Initial import into CVS
* bigendian.c: Created distinct function names for 64-bit procedures. Added
procedures for OUTPUT_BIGENDIAN. Fixed compiler warning messages about
truncating unsigned ints to chars.
* bigendian.h: Created distinct function names for 64-bit procedures
2010-01-11 twu
* cappaths.c: Initial import into CVS
2010-01-09 twu
* tally.c, tally.h: Initial import into CVS
2010-01-08 twu
* pairingcum.c: Initial import into CVS
2010-01-07 twu
* splicegene.c: Computing exonbounds for each donor
2010-01-05 twu
* tallyhmm.c: Using edges rather than edgepairs
2010-01-04 twu
* parserange.c, parserange.h, spliceeval.c: Initial import into CVS
* splicegene.c: Added iterative method to remove conflicting splices
* splicegene.c: Computing one path over forward and one path over reverse
strands, instead of collecting terminals
2010-01-03 twu
* splicegene.c: Added reading and printing of probability values. Added
debugging statements for Paths_remove_dominated
2009-12-28 twu
* stage1hr.c, stage1hr.h: Added separate stage for half introns. Added hook
for geneprob_iit eval.
2009-12-22 twu
* splicegene.c: Initial import into CVS
* gsnap_splices.c: Added command for dumping graph
2009-12-21 twu
* gsnap.c, stage3hr.h: Added ability to print output in SAM format
* stage3hr.c: Added ability to print output in SAM format. Fixed bug in
identifying pairing.
2009-12-17 twu
* exonscan.c: Added function for writing edges
2009-12-10 twu
* stage1hr.c: Fixed bug in insertion at end of query sequence. Removed
requirement for HALF_INTRON_END_LENGTH. Made separate done levels for 5'
and 3' ends in paired alignment.
2009-12-04 twu
* gsnap_tally.c: Added ability to run on forward or reverse complement
strand only
* gsnap_tally.c: Added ability to run on all chromosomes
2009-11-25 twu
* stage1hr.h: Added new masktypes
* stage1hr.c: Created a single procedure for omit_oligos. Altered xfirst
and xlast calculation.
2009-11-20 twu
* stage1hr.c: Made slight efficiency improvements in accessing floor->score
array
2009-11-18 twu
* stage2.c: Combined features of versions 235 and 237 for both GMAP and PMAP
to work.
* stage2.h: Updated interface
* stage2.c: Fixed bug where processed was updated too soon
* pairpool.c, pairpool.h: Added function Pairpool_transfer_n
* orderstat.c: Commented out debugging function
* gmap.c, oligoindex.c: Restored variables specific to gmap
* oligoindex.c, oligoindex.h: Added major and minor oligoindices
* gmap.c: Added Oligoindex_clear_inquery in all cases
* stage2.c: Restored stage 2 to working condition
2009-11-06 twu
* maxent.c, maxent.h: Added functions for reporting log odds scores
* littleendian.h: Added interface for WRITE_UINT
* list.c: Added check for NULL in List_truncate
* iit_fetch.c: Added flag for computing cumulative total of an iit. Removed
unused variables.
* iit_get.c: Added flags for computing mean and overall total of tally iit.
2009-11-04 twu
* add_rpk.c: Initial import into CVS
2009-10-30 twu
* exonscan.c, hint.c, hint.h, tallyhmm.c: Using edgepair and splice
information in transitions, and tally and pairing information in
emissions. Providing separate training information for transitions and
emissions.
2009-10-27 twu
* hint.c, hint.h: Initial import into CVS
2009-10-26 twu
* stage3.c: Removed unused variables
2009-10-14 twu
* pair.c: Fixed bug in PSL output
2009-10-08 twu
* exonscan.c, tallyhmm.c: Multiple changes. Version used for rGASP
submission 2.
2009-10-02 twu
* gsnap.c, stage1hr.c, stage1hr.h: Allowing user to specify max mismatches
as a fraction of read length
* stage3hr.c, stage3hr.h: Made printing of score and insert length more
consistent. Made filtering of paired hits by score and duplicates
consistent with filtering of single hits.
* resulthr.c, resulthr.h: Removed Pairedresult_T type
* stage3hr.c: Made printing of insert length consistent for paired-end reads
* gsnap.c, stage1hr.c, stage1hr.h: Added parameters for minimum end matches
for local and distant splicing
2009-10-01 twu
* spanningelt.c: Made intersection procedures remove duplicates
* snpindex.c: Formatting change
* gsnap.c: Added parameters for second part of novel splicing and half
intron minimum support
* genome_hr.c, genome_hr.h: Returning ncolordiffs
* gbuffer.h: Added procedures for allocing and freeing contents
* blackboard.c: Fixed problem with hanging when using -q batch feature
* stage1hr.c: Made min_end_matches work on middle indels
* stage3hr.h: Added printing of colordiffs and score.
* stage3hr.c: Added printing of colordiffs and score. Fixed problem with
printing splice on second, inverted read.
* stage1hr.h: Added half_intron_min_support parameter.
* stage1hr.c: Added half_intron_min_support parameter. Fixed bug where
deletion indels were mixed up with colordiffs. Fixed bug where splice
junctions were evaluated past beginning of genome.
2009-09-21 twu
* tallyhmm.c: Uses hints from splices iit and altexons iit files. Added
median filtering.
* exonscan.c: Added ability to get splice sites from splices iit and
altexons iit file
* tallyhmm.c: Adding information from splices_iit and altexons_iit files
* tallyhmm.c: Implemented two-strand solution as default, with ability to
force 1-strand solution. Provided hooks for splices and altexons iit
files.
* gsnap_tally.c: Added flag for handling 2-base encoded GSNAP output
* gsnap_splices.c: Eliminated printing of overlapping paths.
2009-09-20 twu
* gsnap_splices.c: Fixed various bugs
* gsnap_splices.c: Added ability to find alternate skipped or extra exons at
each acceptor.
* gsnap_splices.c: Initial import into CVS
2009-09-19 twu
* tallyhmm.c: Implemented faster way of computing running percentiles
* tallyhmm.c: Implemented ability to read lambda parameters from a file.
Attempted to add a SINGLE exon state and allow transitions from NON to
SINGLE even when no edges were present.
* gsnap_tally.c: Had program determine own trimming based on scoring matches
and mismatches from the ends.
* iit_get.c: Added -M flag for reporting mean of a region in a tally IIT
file.
* gsnap_splicing.pl: Initial import into CVS
2009-09-18 twu
* exonscan.c: Printing information about sharp edges
* exonscan.c: Fixed bug in recording history. Added hook for allowing GC
donor site. Added splice model probability to name of splice site.
* tallyhmm.c: Added flag for printing lambdas
* tallyhmm.c: Added ability to handle multiple sites at the same position,
by making a mixture of transition tables. Wrote down transition table
explicitly.
2009-09-17 twu
* tallyhmm.c: Added smoothing to estimation of lambdas. Added routines for
printing genes.
* tallyhmm.c: Consolidated separate fivefwd, fiverev, threefwd, and threerev
sites back into up and down sites.
* tallyhmm.c: Working version of Viterbi algorithm, but still need output of
segments.
* tallyhmm.c: Initial import into CVS
* segue.c: Attempt to use objective function based on sum of counts,
relative to threshold.
* segue.c: Reduced states to be much simpler, where only one strand can be
coding at a time.
2009-09-16 twu
* segue.c: Added functions for printing genes by exons.
* segue.c: Fixed lookback lengths between sites. Added flag for specifying
knownsites iit.
* segue.c: Optimizing using mean square error. Simplified code for
traversing graph.
* segue.c: Fixed bug with computing cumulative gammln. Version works on
test data set.
* segue.c: Complete rewrite to handle both strands simultaneously
2009-09-15 twu
* exonscan.c: Using LR test to take all acceptable gene ends. Using
separate end bounds for finding gene ends.
* exonscan.c: Using both edge algorithms, stepfunction and linear fit. Made
different objects for Edge_T and Diff_T. Using goodness-of-fit instead of
xintercept for finding gene ends.
* exonscan.c: No longer using bootstrap method, but relying on testing of
sites using goodness of fit. Evaluating missing edges for both ups and
downs based only on greedy splice sites, and then performing both testing
of sites and tracing.
2009-09-14 twu
* exonscan.c: Using x-intercepts instead of step function to detect edges
* exonscan.c: Added hooks for a history-recording mechanism.
* exonscan.c: Fixed some bugs with array indices. Added flags for debugging.
* segue.c: Using splice model scores to evaluate introns
2009-09-13 twu
* exonscan.c: Made numerous tweaks to the scanning algorithm. Incorporated
finding of ends into scanning, using x-intercepts. Always finding ends
when an edge with a splice lacks a match.
* exonscan.c: Implemented two-phase method on stepfunction results, first
picking steps with highest probabilities, and then bootstrapping
neighboring steps.
* exonscan.c: Allowed best prob again for stepfunction results. Fixed bug
in code in scanning procedure. Distinguishing between donor and acceptor
types for matching edges.
* exonscan.c: Large numbers of changes. Implemented scanning method,
testing at positions with good splice site scores, for finding other ends.
Using adjacency information to decide whether to scan. Implemented
testing procedures for ends. Removed unused code.
2009-09-12 twu
* exonscan.c: Implemented a Gibbs sampling method to speed up identification
of changepoint, but reverted back to testing goodness of fit exhaustively
over a limited range.
* exonscan.c: Implemented a strategy of finding edges only for those that
appear to be missing. Using changepoint to find those edges, with
maximizing goodness of fit.
* exonscan.c: Made reasonably good step function based on log scale.
* exonscan.c: Implemented rampfunction as anchored to a step result.
2009-09-11 twu
* exonscan.c: Implemented a ramp detector using linear fitting, but too
sensitive
* exonscan.c: Using a cumulative tally to speed up computation of segment
means
* exonscan.c: Added hooks for a redo changepoint step
* segue.c: Implemented traversal of minus strand. Implemented reading of
splicepairs.
* segue.c: Implemented scoring of exons using log likelihood. Implemented
dynamic programming and printing of paths.
2009-09-10 twu
* exonscan.c: Added splice sites based on analyzing local data, using
methodology from splicescan.
* splicescan.c: Implemented posterior log odds calculations.
* exonscan.c: Added filtering of other ends. Cleaned up unused code.
* exonscan.c: Made output format consistent with that of splicescan
* segue.c: Initial import into CVS
* exonscan.c: Added finding of nearest good splice sites.
2009-09-09 twu
* exonscan.c: Method based on finding exons. However, will need to switch
to a dynamic programming method.
* exonscan.c: Initial import into CVS
2009-09-08 twu
* splicescan.c: Added options for separate output files, training mode only,
and random output.
* stage1hr.c: Fixed algorithm for end indels. Provided hooks for 2-base
encoding.
2009-09-07 twu
* splicescan.c: Fixed calculations
2009-09-06 twu
* splicescan.c: Added ability to use a known splice site IIT file
* splicescan.c: Initial import into CVS
2009-09-03 twu
* stage1hr.c: Fixed bug where singlehits5 and singlehits3 not being
initialized. Set limits on local splicing hits and attempts.
2009-09-02 twu
* stage3.c, stage3.h: Allowing a re-do of stage 2 for bad exons in middle
2009-08-31 twu
* gsnap.c: Using single-end hits already computed when paired alignments not
found.
* gmap.c: Added minor oligoindices
* changepoint.c: Added comment
* stage3hr.c, stage3hr.h: Introduced faster pair-up procedure. Sorting
paired-end solutions by score.
* stage1hr.h: For paired-end alignment, returning single-end hits.
* stage1hr.c: Fixes to paired-end alignment: (1) stopping when excessive
splicing hits or paired hits found, (2) using new pair_up procedure, (3)
fixed pairing code, (4) returning single-end hits. For dibase alignment,
skipping spanning set.
* iit_store.c: Using total label and annotation lengths to decide if format
should use 8-byte quantities.
* iit_get.c: Added flag to explicitly indicate coordinate is a label. Added
flag to print all zeroes in tally mode.
2009-08-28 twu
* stage3hr.c, stage3hr.h: Taking a splicing penalty for all splices. Added
code for marking dibase mismatches.
* stage1hr.c, stage1hr.h: Made procedure work for 2-base encoded reads
* oligo.c: Added code to read 2-base encoded queries
* reader.c, reader.h: Added field to indicate if Reader_T is for dibase
queries
* littleendian.h: Added code for handling 8-byte quantities
* iit-read.c, iit-write.c, iitdef.h: Added version 4 format, which uses
8-byte quantities to store label pointers and annotation pointers.
* gsnap_tally.c: Added trimming on left and right
* gsnap.c: Added flag for 2-base mode. Added local splice penalty.
* genome_hr.c, genome_hr.h: Provided hooks for dibase procedures
* genome.c, genome.h: Provided exposure to uncompress_mmap directly from
blocks, needed by dibase procedures.
* dibase.c, dibase.h: Initial import into CVS
* compress.c: Added code for compressing 2-base color genomes, but not
necessary.
* bigendian.c, bigendian.h: Added functions for 8-byte quantities
* access.c: Changed types in debugging statements for off_t
2009-08-21 twu
* oligo.c: If state is invalid, skipping forward until a valid state is found
* sequence.h: Added FILE * parameter for oneline outputs.
* sequence.c: Added FILE * parameter for oneline outputs. Added hooks for
skipping dashes, but appears to be buggy.
* gsnap.c: Added flag to turn off output (quiet) if too many are found.
* stage1hr.c: Classified half introns as long distance
2009-08-20 twu
* stage1hr.c: Moved half introns after distant splicing. Set fast_level to
be 1, if user hasn't already specified it.
2009-08-19 twu
* gmap_setup.pl.in: Using "use warnings" instead of -w flag
2009-08-18 twu
* stage3hr.h: Added quiet-if-excessive flag.
* stage3hr.c: Fixed problem where total_nmismatches not being set for
indels. Making printing of excessive paths consistent with single-end
behavior.
* stage1hr.c: Fixed problem where pair_up function was creating circular
loops by calling List_append more than once.
* list.c, list.h: Added function List_dump
2009-08-17 twu
* gsnap-to-iit.c, gsnap_tally.c: Renamed gsnap-to-iit.c to gsnap_tally.c
* stage1hr.c: Fixed bug where program tried to find deletions at end
extending past coordinate 0U.
2009-08-14 twu
* stage3hr.c: Revised paired-end output to show npaths and indication of
paired or unpaired.
* bigendian.c, gsnap.c, mem.c, iit-read.c, sequence.c, genome.c, indexdb.c,
indexdb_hr.c: Fixed compiler warnings from -Wall
* iit-read.c: Fixed bug where divno not checking the last div.
* sequence.h: Added a function to the interface
* genome.c: Using SNP_FLAGS for getting alternate genome
* indexdb.c, indexdb.h: Added procedure for reading with diagterm and
sizelimit
* indexdb_hr.c: Providing information about nmerged
* maxent.h: Defined a variable to specify maximum storage required
* maxent.c: Added code for computing splices from revcomp sequences
* cmet.c: Added a missing type
* genome_hr.c, stage3hr.c: Fixed compiler warnings from -Wall.
* stage1hr.h: Added Floors_free() to interface.
* stage1hr.c: Fixed bug where unsigned int * assigned to signed int *.
Fixed compiler warnings from -Wall.
2009-08-04 twu
* list.c, list.h: Added function List_truncate
* pair.c, pair.h: Added function Pair_fracidentity_max
* stage3.h: Added some interfaces
* stage3.c: Forward/reverse decision based on local scoring around each
intron. Distal/medial step now truncates distal exon at best point, and
iterates. When edges cross in changepoint step, now chopping shortest end.
* stage2.c: Allowing rightward and leftward shifts in finding shifted
canonical introns. Fixed bug in scoring for reverse introns. Adjusted
scoring for canonical introns.
* stage1.h: Imposing a size limit on position lists, so large ones are
ignored.
* stage1.c: Imposing a size limit on position lists, so large ones are
ignored. Finding best solutions for each level of numbers of exons.
2009-07-30 twu
* stage3.c: Using existing pairs/path in end exons, rather than recomputing
in distal/medial calculation. Moved distal/medial after changepoint.
* stage3.h: Moved distal/medial calculation to be after changepoint
* README: Added information about processing reads from bisulfite-treated DNA
* gsnap.c, stage3hr.c, stage3hr.h: Added printing of SNP information
* stage1hr.c: Making correct decision on when to find splice ends for half
introns
* gmap.c: Added indexdb_size_threshold. Made -9 flag do checking, but not
print full diagnostics.
* dynprog.c: Made gap penalties at ends the same as for middle
* diag.c: Added debugging statement
* Makefile.gsnaptoo.am: Included snpindex
2009-07-29 twu
* README: Added information about providing information to GSNAP about known
splice sites and SNPs
* gsnap.c: Increased default max_middle_insertions from 6 to 9
* stage1hr.c: Checking entire query for nsnpdiffs when snp_blocks is present
2009-07-07 twu
* stage1.c: In find_best_path, calculating median of segments and requiring
that medians ascend or descend. Adjusting scores for overlaps between
segments.
2009-07-01 twu
* gsnap.c, stage1hr.c, stage1hr.h: Added separate probability thresholds for
local and distant splicing
* stage1hr.c, stage1hr.h: Re-implemented paired-end alignment
2009-06-29 twu
* gsnap.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h: Introduced a
penalty for distant splicing
* gsnap.c: Introduced masktype
* stage3hr.c, stage3hr.h: Introduced chimera_prob field for Stage3_T object.
Increased distantsplicing penalty from 1 to 2.
* stage1hr.c: For novel splice sites in local splicing, requiring canonical
dinucleotides plus sufficient probability score in either donor or
acceptor. Fixed bug in recognizing plus antisense splicing. Restricted
half introns to known splice sites.
* stage1hr.h: Added masktype
* stage1hr.c: Added masktype. Added back half introns.
2009-06-28 twu
* stage1hr.c: Merged finding of distant splice pairs using known or novel
splice sites.
* stage1hr.c: Consolidated finding of splice pairs using known or novel
splice sites. Making a single call to retrieve genomic segment for local
splicing.
2009-06-26 twu
* stage1hr.c: Removed unused tournament and middle_indel_p code
* stage1hr.c: Using floors for novel distant splicing
* stage1hr.c: Using floors for finding novel local splicing
* stage3hr.h: Added field for paired_up
* stage3hr.c: Added field for paired_up. Removing duplicates that differ in
indel gap length.
2009-06-25 twu
* stage1hr.h: Implemented paired-end alignment
* stage1hr.c: Implemented paired-end alignment. Fixed computation of floors
for middle indels. Fixed computation of find_segments_all for novel
splicing.
2009-06-18 twu
* Makefile.dna.am, Makefile.gsnaptoo.am: Removed segmentpool files from
Makefile.am
* gsnap.c, segmentpool.c, segmentpool.h, stage1hr.c, stage1hr.h: Removed all
references to segmentpool
* stage1hr.c: Removed code for Segmentpool_T
* segmentpool.c, segmentpool.h: Removed some fields
* stage1hr.c: Fixed bug resulting from resetting nmismatches_all
unnecessarily
* stage3.c: Fixed bug in computing goodness_rev
2009-06-09 twu
* dynprog.c: Increased penalties for mismatch at ends and for opening indels
around introns.
* gsnap.c: Increased default definition of shortsplicedist from 200000 to
500000.
* stage1hr.c: Fixed bugs in dealing with empty plus_segments or
minus_segments.
2009-06-08 twu
* stage1hr.c: Created variables for faster retrieval of floor scores.
Reporting half introns only if both local and distant known splicing fail.
2009-06-07 twu
* stage1hr.c: Fixed computation of floors, floor_xfirst, and floor_xlast
* stage1.c: Fixed bug in debugging variables
* stage1hr.c: Implemented code that does not use Segmentpool_T object
* pair.c: Treating ambiguous characters as mismatches for computing
matchscores
* gmap.c: Proving an option for splicesites output. Allowing use of a SNP
genome version.
* dynprog.c: Giving ambiguous characters a negative score in GMAP, but not
PMAP
* stage3.c: Computing direction using just canonical introns and indel
openings
* stage1hr.c: Removed middle_indel_p field in Segment_T object
2009-06-06 twu
* segmentpool.c, segmentpool.h, stage1hr.c: Removed floor from the
Segmentpool_T object
* segmentpool.c, segmentpool.h: Added a procedure for pushing without any
floors
* stage3hr.c: Added a penalty for distant splicing
* stage1hr.c: Performing local and distant splicing in separate levels
2009-06-05 twu
* pair.c, pair.h, stage3.h: Added a function for printing splicesites
* stage3.c: Reinstated checking of goodness to determine direction, but now
considering just canonical introns and indels. Added a final call to
assign_gap_types after trimming.
2009-06-04 twu
* stage1hr.c: Small improvements to code for binary_search and dual_search
* stage1hr.c: Made improvements in code for dual_search by removing lowi and
lowj and updating pointers for positions1 and positions2.
2009-06-03 twu
* stage1hr.c: Using single splicesites list for novel local splicing
* stage1hr.c: Implemented slightly more efficient code for dual_search.
* gsnap.c, stage1hr.c, stage1hr.h: Improved speed of finding known
splicesites by doing one dual_search for all splice types.
* stage1hr.c: Revised dual_search procedure to handle overlapping
splicesites and positions correctly.
2009-06-02 twu
* genome_hr.c: Fixed bug in handling pos5 for Genome_mismatches_left.
* stage1hr.c: Fixed dual_search so it handles all overlapping splicesites
and intervals
* stage1hr.c: Implemented faster code for finding novel splice ends
* stage1hr.c: Changed Floors_T to be just a single set of scores. Using
floors to prune known splice ends.
2009-06-01 twu
* stage3hr.c: Removed genomiclength from Stage3_T object. Using chrnum of 0
to indicate distant splicing.
* stage1hr.c: Removed oldindels code
2009-05-29 twu
* stage1hr.c: Using compressed nucleotide-level alignment for end indels.
Preventing firstbound and lastbound from going past read boundaries.
* stage1hr.c: Fixed assignment of shortdistancep in splice pairs. Fixed
assignments of prior penalties.
2009-05-25 twu
* stage3hr.c: Eliminating duplicates where genomicstart and genomicend are
equal
* stage1hr.c: Excluding splice positions at ends in novel local splicing
* genome_hr.c: Fixed bug in mismatches_right_snps where startdiscard was not
being applied
2009-05-24 twu
* stage3hr.h: Added genomicend to Stage3_T object
* stage3hr.c: Made removal of duplicates work for splicing by adding
genomicend to Stage3_T object. Made removal of duplicate splice ends
faster.
* stage1hr.c: Eliminating splice ends where splice position occurs too close
to the beginning. Fixed triage to treat novel splicing and known splicing
equally, and to give preferences to substitutions over indels over
splicing when hits are found at each type.
* gsnap.c: Passing only one splice prob to stage 1
* stage3hr.c: Lookin up splicesites iit only if site was known
* stage1hr.h: Passing only one splice prob.
* stage1hr.c: Implemented finding of novel distant splice pairs. Fixed bug
in dual_search.
2009-05-23 twu
* stage1hr.c: Implemented novel local splicing, which includes known splice
sites
2009-05-22 twu
* indexdb_hr.c, indexdb_hr.h, spanningelt.c, stage1hr.c: Implemented gallop
search
* stage1hr.c, stage1hr.h: Implemented a new flow through the different
algorithms.
* gsnap.c: Made shortsplicedist an unsigned int. Changed name of spliceprob
to minspliceprob.
* stage3hr.c: Using total number of mismatches to score spliced reads
* stage1hr.c: Implemented dual intersection method for finding known splice
sites
* genome_hr.c, genome_hr.h: Added substring parameters to
Genome_count_mismatches_limit
2009-05-20 twu
* stage1hr.h: Changed name from spliceprob to minspliceprob
* stage1hr.c: Moved mismatches, indels, and splicing into a single
procedure. Using firstbound and lastbound for half introns. Added
provisions for a stretch procedure which uses a spanning set with
nrequired = 1.
* stage2.c: Modified debugging output
* oligoindex.c: Removed unnecessary clearing step
* gmap.c: Fixed bug where program failed to clear oligoindices after a poor
or repetitive sequence.
2009-05-17 twu
* genome_hr.c, gsnap.c: Enabled methylation mode on snp databases
* indexdb.c: Improved pre-loading messages for snp databases
* snpindex.c: Changed naming convention for snp databases
* stage1hr.c: Fixed bug in calling new substitution with fixed value for
cmetp.
* cmetindex.c: Made the program work for snp databases
* stage3hr.c, stage3hr.h: Stage3 now handles all marking of mismatches.
Implemented marking of methylation for indels.
* stage1hr.c: Moved functions for counting and marking mismatches to
stage3hr.c
* gsnap.c: Removed call to specify methylation printing
* genome.c, genome.h: Removed function for signaling methylation printing
2009-05-16 twu
* stage1hr.h: Providing second indexdb and size_threshold to procedures.
* stage1hr.c: Omitting frequent 12-mers in the middle and poly-AT 12-mers at
the ends. Performing another round without omitting 12-mers if necessary.
Added first implementation for handling methylation data.
* oligo.c, oligo.h: Changed definition of repetitive to mean only shifts of
1, 2, or 3 nucleotides. Added procedure to mark frequent oligos, but not
used.
* indexdb.c, indexdb.h: Added procedure to compute indexdb mean size.
* gsnap.c: Added flag to deal with methylation data. Passing size_threshold
to stage 1 procedure.
* genome_hr.c, genome_hr.h: Added procedures to deal with methylation data.
* cmetindex.c: Creating two indexdb's, one for plus strand and one for minus
strand. Moved conversion tables to cmet.c. Removed conversion of genome.
2009-05-14 twu
* cmet.c, cmet.h: Initial import into CVS
2009-05-13 twu
* cmetindex.c: Initial import into CVS. Implements reverse genome.
* iit-read.c: Added warning for use of IIT_string_from_position
* stage2.c: Fixed bug in using wrong indexsize for a given oligoindex
* oligoindex.c, oligoindex.h: Added a procedure to return indexsize
* get-genome.c: Fixed printing of coordinates
* stage1hr.c, stage1hr.h: Changed polyat to omitted. Created separate
procedure to mark omitted, which omits repetitive oligos except at the
ends, except for poly-AT at the ends.
* stage3hr.c, stage3hr.h: Made removal of duplicates faster
* stage1hr.c, stage1hr.h: Added ability to handle methylation data. Using
simplified version of Genome_fill_buffer that does not check chromosome
bounds.
* snpindex.c: Changed name of SNP genome file from genome to genomecomp.
* genome.c, genome.h, gsnap.c: Added ability to handle methylation data
2009-05-11 twu
* gsnap.c: Made min_end_matches a user-adjustable parameter. Made
maxchimerapaths the same as maxpaths.
* stage1hr.h: Made min_end_matches a user-adjustable parameter
* stage1hr.c: Fixed computation of end indels so it finds maximal length
from end. Made min_end_matches a user-adjustable parameter. Removed
allocation of polyat outside of 0..query_lastpos.
2009-05-10 twu
* stage1hr.c: Fixed bug in turning off end indels. Implemented faster
method for computing firstbound and lastbound for xfirst and xlast
computation.
2009-05-08 twu
* stage3hr.c: Fixed sorting so it uses hittype, and not indel separation
* oligo.c, oligo.h: Added function Oligo_mark_repetitive
* stage1hr.c: Fixed computations of xfirst and xlast in presence of
repetitive oligos. Increased speed of computing nmismatches_long in end
indels.
* genome_hr.c, genome_hr.h: Modified Genome_mismatches_left and
Genome_mismatches_right to take pos5 and pos3 as arguments.
2009-05-07 twu
* stage1hr.c: Fixed floor formulas again. Generalized idea of polyat to
mean all repetitive oligos. For middle indels, computing middle floor
explicitly when polyat oligos are present. For end indels, computing
firstbound and lastbound to handle cases with polyat oligos at the ends.
Reordered compute_end_indels to starting computing from 1 deletion.
2009-05-06 twu
* spanningelt.c, spanningelt.h, stage1hr.c: Removed unused code
* spanningelt.c, spanningelt.h: Added code for spanning set computation of
end indels, but not used.
* gsnap.c, stage1hr.h: Added floors_array
* stage1hr.c: Made changes to floors: (1) Made floor_middle formula handle
middle indels, (2) Created Floors_T object to precompute floors and handle
polyat oligomers. Fixed computation of end indels, now can handle
mismatches. Fixed bug in computing max_indel_sep. Added code for
possible fast computation of end indels, but not used.
2009-05-05 twu
* stage1hr.c: Replaced arithmetic expression for smallesti with if statement.
* stage3hr.c: Changed sorting order so non-indel alignments rank higher than
indel alignments.
* stage1hr.c: Computing floors between two segments in finding middle indels
2009-05-04 twu
* stage1hr.c: Added limit to number of middle indels found
2009-05-01 twu
* stage1hr.c: Implemented incremental execution of fast mismatch algorithm.
* stage1hr.c: Fixed setting of indel separation in triage. Prevented
boostpos from being a compoundpos. Turned off tournament tree, since it
was causing a crash.
* spanningelt.c: Fixed bug occurring when all spanningelts have no positions
2009-04-30 twu
* stage1hr.c: Fixed bug with setting mismatch levels for suboptimal results.
Solving all multiple_mm solutions in a single run.
* stage3hr.c: Fixed bugs in Stage3_remove_duplicates
* stage3hr.h: Added function for counting number of optimal hits.
* stage3hr.c: Fixed problem with undesired removal of second splice site
within a given genomic region. Added function for counting number of
optimal hits.
* stage1hr.c: Removed code for REFINE_MISSES. Added back missing statement
setting duplicates_possible_p.
* gsnap.c: Removed minlevel and maxlevel and replaced with suboptimal
mismatches. Made sort always happen. Changed flag name from invertp to
circular-output.
* stage1hr.c, stage3hr.c, stage3hr.h: Using score in sorting procedure.
Added provision for minlevel in Stage3_optimal_score.
* stage1hr.c, stage1hr.h: Refined miss_querypos5 and miss_querypos3
boundaries. Implemented triage for minlevel_mismatches and
maxlevel_mismatches instead of minlevel and maxlevel, and accounted for
fast mismatch algorithm.
2009-04-27 twu
* gmapindex.c: Allowing contigs with a single nucleotide
2009-04-26 twu
* spanningelt.c, spanningelt.h: Removed boosterset idea and returning only
the minscore within the spanningset.
* stage1hr.c: Removed all recursion from identify_multimiss_iter. Added
feature to modify spanningset list and update a counter when a spanningelt
is empty. Removed boosterset idea and restored a single boostpos.
Correctly implemented fast multimiss algorithm.
* spanningelt.c, spanningelt.h, stage1hr.c: Created separate scores for
candidate generation and for pruning. Implemented idea of a boosterset,
instead of a single boostpos, but seems to be slower.
2009-04-25 twu
* Makefile.dna.am, Makefile.gsnaptoo.am, indexdb_hr.c, indexdb_hr.h,
spanningelt.c, spanningelt.h, stage1hr.c: Created formal Spanningelt_T
object and rewrote algorithms to use it.
2009-04-24 twu
* stage1hr.c: Preliminary implementation of a multimiss algorithm
generalized from the onemiss algorithm.
2009-04-23 twu
* stage3hr.c: Fixed bug that resulted in duplicate outputs
* stage1hr.c: Reverting to previous version
* stage1hr.c: Implemented vertical solution for find_segments_multiple_mm,
which handles each querypos one at a time, but result is much slower.
* stage1hr.c: Removed code from before DELAY_READING. Implemented
tournament trees, which require slightly fewer instructions than heaps.
2009-04-22 twu
* Makefile.gmaponly.am, Makefile.pmaptoo.am: Included changepoint files.
* Makefile.dna.am: Included changepoint files. Included indexdb_dump
program.
* bigendian.c: Added code to output files in bigendian format
* stage3.c, stage3.h: Parameterized TRIM_END_PVALUE. Fixed map feature of
GMAP for new IIT format.
* stage3hr.c: Loosened criteria for duplicate hits that was eliminating
overlapping matches with the same number of mismatches.
* indexdb_hr.c, indexdb_hr.h, stage1hr.c: Using binary_threshold instead of
parent_ndiagonals
* indexdb.c: Fixed bug for bigendian machines
* iitdef.h: Changed type of divsort to be int, apparently for compiler
warnings?
* iit-read.c, iit-read.h: Implemented algorithm for map feature of GMAP to
use new IIT format
* gmap.c: Using divint crosstable for map feature.
2009-04-21 twu
* stage1hr.c: Implemented a delay in converting positions to diagonals until
needed.
2009-04-08 twu
* stage3.c: Using number of matches to trim ends, not total length
2009-04-04 twu
* indexdb_hr.c, stage1hr.c: Made exact and sub-1 code work correctly on
bigendian machines without having to copy memory.
2009-04-03 twu
* gmap.c, stage3.c, stage3.h: Modified -H flag to let user control
minendexon length
* stage1hr.c: Skipping computation on poly-A or poly-T sequences
2009-04-02 twu
* stage3.c: Added check for null pairs before generating matchscores
* pair.c, pair.h: Revised procedures for producing matchscores
* gsnap.c: Adding number of paths to output. Removed check for inplace
being possible.
* snpindex.c: Added comment
* indexdb.c, indexdb.h: Removed procedures that create sentinels.
* indexdb_hr.c: Not using sentinels. For compoundpos, always reading in
place and converting to bigendian when needed.
* stage1hr.c, stage1hr.h: Not using sentinels. For exact and sub:1, always
reading in place and converting to bigendian when needed.
* stage1.c: Added limits on number of gregions to speed up program
* stage3.c, stage3.h: Using changepoint algorithm and iterative trimming of
ends to improve ends of alignment.
* indexdb.c, indexdb.h: Added hooks for sentinel in indexdb files
* gregion.c, gregion.h: Rewrote support filtering to use either a fixed
difference (for longer sequences) or a percentage difference (for shorter
ones). For extending sequences, using querylength for adequate support or
short support.
* gmap.c: Improved documentation for --help
* genome-write.c: Always print out number of bad characters
* changepoint.c, changepoint.h: Modified procedures to ignore -1 values in
input
2009-03-31 twu
* stage1hr.c: Using heaps for nomiss and onemiss, but not for exact.
Replacing NULL lists with sentinels when necessary.
* stage1.c: Implemented limit on number of gregions before finding unique
ones, to prune nonspecific, slow sequences.
2009-03-26 twu
* stage1hr.c: Using pointers and relying on sentinels to advance through
lists.
* stage1hr.c: Removed old code
* stage1hr.c: Changed sub:1 recursive procedures to iterative
* stage1hr.c: Reverting to previous version
* stage1hr.c: Attempt to use information from third and later lists to
advance first list
* stage1hr.c: Using results of second list to speed up intersection
2009-03-25 twu
* stage1hr.c: Implemented a faster version of performing intersection for
exact matches.
* stage1hr.c: Starting exact matches with intersection of first two lists.
* stage1hr.c: Made non-heap compoundpos_find procedure the default
* indexdb_hr.c, indexdb_hr.h: Implemented a non-heap version of searching
through a union of positions.
* indexdb_hr.c: Added hooks for handling indexdb with sentinels. Removed
unused code.
* stage1hr.c: Implemented iterative version of find_exact_aux. Depending on
sentinel to increase speed of loop.
2009-03-24 twu
* gmap.c, pair.c, pair.h, stage3.c, stage3.h: Fixed printing in psl format
when user provides a segment
* compress.c, compress.h, genome-write.c: Fixed reporting of non-ACGTNX
characters
* gmap.c, stage3.c, stage3.h: Removed -k flag and trimexonpct parameter
* iit_fetch.c: Added ability to compute ratios from pdl files
* iit-read.c, iit-read.h: Provided procedures for dumping version 1 IITs
* get-genome.c: Fixed dump output of contigs
* compress.c: Reduced number of error messages for non-ACGTNX characters
* stage3.c: No longer using non-canonical introns for trimming end exons,
only binomial test. Fixed bad behavior for theta values close to 1.0. Not
reporting alignments with fewer than 20 matches.
2009-03-19 twu
* iit_fetch.c: Added hook for splices iits
* plotgenes.c, plotgenes.h: Added function for handling splices. Made sure
nbins > 0.
2009-03-18 twu
* plotgenes.c, plotgenes.h: Added function Plotgenes_fetch_points. Renamed
some functions.
* gmap_setup.pl.in: Gives files on command line to fa_coords and
gmap_process, which can then add linefeed if necessary to lines.
* gmap_process.pl.in: Allowed program to read from either stdin or files on
command line. In the latter case, it adds linefeed if necessary to lines.
* fa_coords.pl.in: Re-indented program. Allowed program to read from either
stdin or files on command line.
* iit_fetch.c: Implemented handling of PDL files. Added -s flag to specify
sample number.
* stage3.c: Trimming using a binomial test on end exons
* gdiag.c, genome.c, genome.h, get-genome.c, gsnap-to-iit.c, gsnap.c,
indexdb.c, indexdb.h, oligo-count.c, snpindex.c: Modified programs to take
a snp_root argument to -V
* gmap.c: Added "Processed" message to stderr at end of batch run
* Makefile.dna.am, Makefile.gmaponly.am, Makefile.gsnaptoo.am,
Makefile.pmaptoo.am: Updated Makefiles
* pair.c, pair.h: Added function Pair_fracidentity_simple
* pbinom.c, pbinom.h: Initial import into CVS
2009-03-17 twu
* gsnap.c: Considering too many paths as a failure type for the --nofails
and --failsonly flags
2009-03-10 twu
* stage1.c: Added variable needed in debugging
* plotgenes.c, plotgenes.h: Renamed functions relating to fetching
* iit_fetch.c: Removed code relating to printing
* iit-read.c: Fixed issues for bigendian machines and for fileio access
* datadir.c, datadir.h, gmap.c: Added extra functionality to show available
databases and map files
* iit_fetch.c: Initial import into CVS. Copied from iit_plot.c
* blackboard.c, changepoint.c, compress.c, diag.c, dynprog.c,
genome-write.c, genome.c, get-genome.c, gmap.c, gmapindex.c, gregion.c,
iit-read.c, iit-write.c, indexdb.c, indexdb_hr.c, intpool.c, oligo.c,
oligoindex.c, oligoindex.h, pair.c, reader.c, sequence.c, smooth.c,
stage1.c, stage2.c, stage3.c, translation.c: Removed unused variables
based on SGI compiler warnings
2009-03-09 twu
* iit-read.c: Fixed reading of labels and annotations using fileio on
bigendian machines.
* bigendian.c, bigendian.h: Added command for reading uint using fileio
* stage3.c: Fixed bug where gap was left at 3' end before extending the end.
* stage1hr.c: Fixed counting of mismatches on revcomp sequences
* indexdb_hr.c: Made heapsize an int, rather than unsigned int, in all
procedures
* indexdb.c: Moved variable declarations above debugging statement to
satisfy SGI compiler.
* gsnap.c: Changed variable names involving -q flag to "part_"
* get-genome.c, iit_get.c: Printing divstring only if ndivs > 1
* iit-read.h: Added ability to retrieve divstring from universal coordinates.
* iit-read.c: Added ability to retrieve divstring from universal
coordinates. Fixed bug occurring during reading annotatinos using fileio.
* iit-write.c: Fixed bug relating to use of null annotlist to indicate
altstrain IIT.
2009-03-06 twu
* pair.c, pair.h, stage3.c: Fixed bug where -P and -Q flags were printing
C-to-N on antisense cDNAs.
2009-02-20 twu
* gmap_setup.pl.in: Added FILE_ENDINGS variable
* get-genome.c: Removed debugging flag
* sequence.c: Improved handling of blank lines
* indexdb.c, snpindex.c: Changed offsets and positions to occur at modulo 0
intervals based on chrpos, not universal position.
* iit_get.c: Added feature to print results from tally types of IITs
* iit-read.c: Fixed bug in re-fetching matches based on index
* gsnap.c: Cleared flags for batch and novelsplicing
* types.h: Removed an extraneous open brace
* stage3hr.c, stage3hr.h: Calculating splice site model score at print time
for all splice sites.
* stage1hr.c: Providing sense information for donor and acceptor substrings.
Testing if duplicate matches are possible due to minlevel and eliminating
them.
2009-02-05 twu
* stage3hr.c, stage3hr.h: Distinguishing between splice objects that require
copying of substrings and those that do not.
* stage1hr.c: Incorporated splice site probabilities into finding splice
sites by distance.
* stage1hr.c: Initial implementation of finding splice pairs by distance
* gsnap.c: Retrieving and printing known splicesite information at print
time.
* iit-read.c, iit-read.h: Added function IIT_get_typed_with_divno
* stage3hr.c, stage3hr.h: Providing chroffset information to Substring_T
object. Retrieving and printing known splicesite information at print
time.
* stage1hr.c: Providing chroffset information to Substring_T object
2009-02-04 twu
* stage1hr.c, stage1hr.h, gsnap.c: Implemented faster method for applying
known splice sites
* stage1hr.c, stage1hr.h: Fixed bug when allvalidp5 or allvalidp3 is false
in paired end reads. Implemented Stage1_retrieve_splicesites.
* resulthr.c, resulthr.h: Renamed paired_translocation as paired_as_singles
* iit-read.c: Fixed bug in evaluating nexactmatches
* get-genome.c: Made retrieval of map information work with universal
coordinates.
* stage3hr.c: Fixed bug where second read of paired_as_singles was not being
printed
* gsnap.c: Renamed paired_translocation to paired_as_singles
2009-02-03 twu
* genome_hr.c: Now requiring query to match either reference or alternate
allele at SNPs. Otherwise, it counts as a mismatch.
2009-02-02 twu
* get-genome.c: Added feature to print snp information
* gdiag.c, gsnap-to-iit.c, iit_plot.c: Using new interface to Genome_new
* stage3hr.c: Eliminating novel splice site when it duplicates a known
splice site
* stage1hr.c: Fixed bug where plus multiple mismatches are dropped when
minus batches are all empty. Added flexibility to floor_xfirst and
floor_xlast to allow for indels adjacent to end 12-mers. Changed
condition endpoint for indels at ends. Using genome_hr procedure to count
mismatches for indels. Finding all shortdistance splices with known
splice sites first, before finding novel splice sites.
* snpindex.c: Writing revised version of genome. Skipping cases where snp
type is inconsistent with reference genome. Taking snps_iit as a
command-line argument.
* sequence.c, sequence.h: Added Sequence_print_two function for snps
* gsnap.c, genome_hr.c, genome_hr.h, stage1hr.c, stage1hr.h: Using genomealt
instead of snps_iit
* gmap.c: Switched -v and -V flags. Using new interface to Genome_new.
* genome_hr.c: Corrected calculation of mismatches_right. Corrected offset
and now subtracting number of leading zeroes. Using clz_table instead of
log_table. Improved debugging statements.
* genome.c, genome.h: Added ability to read Genome_T object as alternate or
snp only versions.
2009-02-01 twu
* gdiag.c, oligo-count.c: Using new interface to Indexdb_new_genome
* gmapindex.c, pmapindex.c: Removed altstrain code
* indexdb.c, indexdb.h: Storing positions only at 0 mod 3.
* gsnap.c: Made batch loading the default for multiple input sequences.
Testing for both new names and old names of reference offsets and
positions files. Made flag for splicing refer to novel splicing.
* gmap.c: Made batch loading the default for multiple input sequences.
Testing for both new names and old names of reference offsets and
positions files.
* get-genome.c: Added debugging statement
* genome_hr.c: Fixed retrieval of intervals from snps_iit file, which are
1-based.
* stage1hr.c, stage1hr.h: Implemented snp differences for splicing and exact
matches. Allowing identification of novel splicing in addition to known
splice sites.
* stage3hr.c: Implemented reporting of snp differences for indels.
* snpindex.c: Fixed bug in using chromosomal position instead of universal
position. Using parameterized suffix for reference offsets and positions
files.
2009-01-30 twu
* snpindex.c: Initial import into CVS
2009-01-28 twu
* genome_hr.c: In Genome_mismatches_left and Genome_mismatches_right, adding
a 'sentinel' mismatch position to list if haven't reached max_mismatches.
* stage1hr.c: Implemented tolerance and reporting of snp differences for
novel splice sites.
* stage3hr.c, stage3hr.h: Added ability to print number of SNP differences
separately. Thie feature not yet implemented for indels though.
* stage1hr.c, stage1hr.h: Added ability to tolerate known SNPs and count
differences at those sites separately. Thie feature not yet implemented
for novel splices though.
* segmentpool.c, segmentpool.h: Changed name of chrlow to chroffset
* iit-read.c: Fixed computation of divint_crosstable
* genome_hr.c, genome_hr.h, gsnap.c: Added ability to tolerate known SNPs
and count differences at those sites separately.
2009-01-27 twu
* gsnap-to-iit.c: Increased buffer sizes
* get-genome.c: Made get-genome work on map files
* plotgenes.c: Edited comments
* gsnap.c: Provided user with the ability to set parameters for size of
middle and end insertions and deletions.
2009-01-22 twu
* stage1hr.c: Added steps to remove duplicate paired-end results
* stage1hr.c: Fixed memory leak
* gsnap.c: Increased default maxpaths from 20 to 100. Added -e flag to
--help output. Providing max_mismatches parameter to paired-end procedure.
* stage3hr.h: Added function Stage3_remove_old.
* stage3hr.c: Revised definition of paired-end length to go from
beginning-of-read to beginning-of-read. Added function Stage3_remove_old.
* stage1hr.c, stage1hr.h: Passing max_mismatches parameter for paired reads
2009-01-21 twu
* indexdb_hr.c: Fixed bug where heapify was performed after binary_search
used up all available diagonals in a batch.
* gmap_setup.pl.in: Fixed bug in escaping a variable
* pagesize.m4: Made comment line clearer
* builtin.m4: Initial import into CVS.
* Makefile.am: Added gmap_reassemble
* fa_coords.pl.in: Made -S flag the default. Added -C flag to look
explicitly for chromosomal information.
* md_coords.pl.in: Added check for unmapped contigs
* gmap_setup.pl.in: Maded -S flag the default behavior. Added -C and -O
flags. Added clean procedure when making coords.genome.
* stage3hr.c, stage3hr.h: Including label as part of Substring_T
* stage1hr.h: Added procedures for finding splices against known splicesites
iit.
* stage1hr.c: Added procedures for finding splices against known splicesites
iit. Corrected computation of distances on inversions.
* indexdb.c: Cleaned code to ensure gsnap finds the right offsets file
* iit-write.c: Added check for a null typestring
* iit_dump.c: Fixed debugging output
* iit-read.h: Added procedures to search based on divint and to get a
crosstable of divints.
* iit-read.c: Fixed IIT_debug. Added procedures to search based on divint
and to get a crosstable of divints.
* gsnap.c: Added flags for maxmismatches, splicing penalties, and splicing
iit. Added flags for failsonly and nofails in output.
* gregion.c: Added abort if genomicend < genomicstart
* gmapindex.c: Eliminated reading of strain information and assignment to
contigtypelist. Increased size of chrpos string from 100 to 8192.
* gmap.c: Reformatted output for --help
2009-01-15 twu
* stage1hr.c: Created user-specified parameters for splicing probabilities
and length.
* stage1hr.c: Fixed bug in printing coordinates of splicing results on minus
strand
2009-01-14 twu
* gsnap.c: Added flags for excluding failed alignments, or limiting to those
2009-01-13 twu
* stage1hr.c: Added additional check to prevent straddling across chromosomes
* indexdb_hr.c, indexdb_hr.h, stage1hr.c: Implemented more efficient way of
ignoring extensions past beginning of genome.
2009-01-07 twu
* stage1hr.c: Hacks put in to exclude diagonals that are less than
querylength
* stage1hr.c: Fixed issues with wrong indel_pos chosen in middle insertions,
and not checking up to specified number of indels.
2008-12-24 twu
* indexdbdef.h: Reverted to version 1.2
* indexdb.c: Reverted to version 1.121
* indexdb.c, indexdbdef.h: Attempt to use a compressed indexdb file
2008-12-22 twu
* stage1hr.c: Reading floor 2 during find_segments_multiple_mm. Returning
min_mismatches_seen from find_onemiss_matches.
2008-12-21 twu
* indexdb_hr.c: Put check for size of compoundpos outside of loop.
* stage1hr.c: Added bounds on location of mismatch in onemiss search
* stage1hr.c: Reverted back to version 1.106 that has hanging compoundpos
positions for exact and onemiss matches.
* stage1hr.c: Version of stage 1 with hooks for disallowing compoundpos
positions that hang over ends. However, this appears to add 40% to number
of instructions.
2008-12-20 twu
* stage1hr.c: Setting pointers->compoundpos to NULL after it becomes empty,
to prevent further computation on it.
* indexdb_hr.c, indexdb_hr.h: Added function Compoundpos_intersect
* stage1hr.c: Eliminated compoundpos positions in creating segments from
multiple mismatches. Delayed sorting of segments until needed for middle
insertions and deletions. Setting floor to zero in cases where poly-AT is
present. For end indels, computing oligomer_start and oligomer_end based
on results of actual mismatches found.
* stage1hr.c: Fixed bug in debugging statement
* gmapindex.c: Added more compiler checks to hide alternate strain code
* Makefile.util.am: Added program all-orfs
* Makefile.dna.am, Makefile.gsnaptoo.am: Removed chrsubset.c from gsnap
sources
2008-12-18 twu
* stage3hr.c, stage3hr.h: Using left instead of genomicpos5 for creating
Stage3_T objects.
* stage1hr.c: Using left instead of genomicpos5 for creating Stage3_T
objects. Consolidated code for plus and minus segments. Changed parameter
list for find_segments_multiple_mm to prepare for finding hits within that
procedure.
* stage1hr.c: Clarified calculations of floors
2008-12-17 twu
* stage1hr.c: Using init and search routines for Compoundpos_T objects
* separator.h: Changed coordinate separator from "--" to ".."
* intlist.c, intlist.h: Added function Intlist_sort_ascending()
* indexdb_hr.c, indexdb_hr.h: Implemented init and search routines for
Compoundpos_T objects
* iit-write.c, iit_store.c: Added monitoring output
* stage1hr.c: Clarified processing of pointers in search for onemiss matches.
2008-12-16 twu
* stage1hr.c: Allowing spanning 12-mers for exact and onemiss searches to go
in either forward or reverse direction, and picking the optimal direction.
* stage1hr.c: Allowing compoundpos positions to be used for boosting, by
merging them during search for exact matches.
* stage1.c: Fixed a bug where the querypos of sentinel was set incorrectly.
Now using querylength, not -1. Added a check to prevent gregions with
negative values for genomicstart.
* indexdb_hr.c, indexdb_hr.h: Removed partner_diagonals from Compoundpos_T
object and removed reduce function.
* stage1hr.c: Fixed potential problem with sentinel. The querypos part of
sentinel now set to querylength, not -1, to guarantee it stops the loop.
Introduced Pointers_T object to simplify exact and onemiss code.
2008-12-15 twu
* stage1hr.c: Introduction of Compoundpos_T object for speeding up
computation in exact and onemiss algorithms
* intlist.c, intlist.h: Added Intlist_insert_second() function
* indexdb_hr.c, indexdb_hr.h: Introduction of Compoundpos_T object and
operations
2008-12-14 twu
* intlist.c: Made cell_ascending() and cell_descending() static.
* genome_hr.c, genome_hr.h: Created Genome_count_mismatches_limit. Also
added code for a oneloop version of Genome_count_mismatches.
* gsnap.c: Removed chrsubset feature
* stage3hr.c, stage3hr.h: Clarified that Substring_new_donor and
Substring_new_acceptor should receive forward query sequence.
* stage1hr.c: Removed querylength from call to select_positions_for_exact().
* stage1hr.h: Removed Chrsubset_T object.
* stage1hr.c: Using new call to Genome_count_mismatches_limit. Replaced
uses of queryseq with queryuc_ptr and queryrc. Introduced query_lastpos
to replace multiple calculations of (querylength - INDEX1PART). Removed
Chrsubset_T object.
* stage1hr.c: Fixed onemiss algorithm so it handles short reads less than
2*INDEX1PART in length. Changed occurrences of oligobase to INDEX1PART.
Removed oligobase and querylength from Stage1_T object.
* genome_hr.c, genome_hr.h: Made Genome_count_mismatches more efficient, by
using pointers and stepping through query and genome blocks sequentially.
2008-12-13 twu
* stage1hr.c: Implemented new method for identifying single mismatches,
similar to that for finding exact matches.
2008-12-12 twu
* result.h: Added a new failure type for short sequences
* pmapindex.c: Changed default index1interval from 3 to 6
* oligo.c: Added a comment.
* oligo-count.c: Using revised interface to indexdb.c.
* indexdb_hr.c, indexdb_hr.h: Removed addition of a diagterm for lookups
involving a left or right shift.
* indexdb.c, indexdb.h: Changed some function names. Added function to
determine if inplace reading is possible. Added parameter to require
sampling of 3 in indexdb.
* gmap.c: Using revised interface to indexdb.c. Added check and message for
sequences shorter than INDEX1PART.
* gdiag.c: Using revised interface to indexdb.c
* block.c: Using revised function names in indexdb.c
* segmentpool.c, segmentpool.h: Removed break5 and break3 from Segmentpool_T
object.
* stage1hr.c: Removed break5 and break3 from Segmentpool_T object
* stage1.c: Providing correct adjustment to diagonals for the minus strand,
by adding the length of the oligomer.
* stage3hr.c, stage3hr.h: Added function to compare Stage3_T objects by
genomic location
* gsnap.c: Added flag to sort results by genomic location
* stage1hr.c: No longer saving segments during check for single mismatches.
Checking and saving substitution hits within each heap merge.
* gsnap.c: Added user flag for setting indel penalty
* stage1hr.c, stage1hr.h: Made indel_penalty a parameter adjustable by the
user.
2008-12-11 twu
* stage1hr.c: Moved around code for special cases that prevent searching for
end indels at beginning or end of the sequence.
* gsnap.c: Added information about whether inplace reading of indexdb is
possible. Added information to version command.
* segmentpool.c, segmentpool.h: Added floor, floor_xfirst, and floor_xlast
fields to the Segment_T object.
* stage1hr.c: Removed separate lists of segments by floor. Keeping only a
single list of plus segments and of minus segments, with floor information
stored in the Segment_T object.
* stage1hr.h: Providing information about whether inplace reading of
diagonals is possible.
* stage1hr.c: Delayed addition of diagterm until after search for exact
matches. Allowing reads of diagonals from indexdb to be inplace when
possible
2008-12-10 twu
* types.h: Added check for size of unsigned long long as an 8-byte word
* stage1hr.c: Using 64-bit words, if available, to speed up comparison of
batches in heap merge.
* stage1hr.c: Replaced calls to List_head, List_next, Intlist_head, and
Intlist_next with primitives.
* Makefile.pmaptoo.am: Revised source files
* Makefile.gsnaptoo.am: Added gsnap-to-iit program. Added segmentpool.
* Makefile.gmaponly.am: Added chrom.c to iit utilities
* Makefile.dna.am: Removed test programs for Compress_T procedures
* Makefile.dna.am: Added test programs for Compress_T procedures. Added
segmentpool.
* segmentpool.c, segmentpool.h: Initial import into CVS
* gsnap.c, stage1hr.c, stage1hr.h: Added segmentpool
2008-12-09 twu
* stage1hr.c: Enforcing diagonals to be within chromosomal bounds. Removed
unused code.
* genome.c: Fixed check for chromosome bounds
* genome_hr.c, genome_hr.h: Removed checks for crossing of chromosome
boundaries. Relying upon calling procedures to enforce this.
2008-12-08 twu
* gsnap.c: Added flags for minlevel and maxlevel. Cleaned up unused flags.
* genome_hr.c, genome_hr.h: Made functions return maximum number of
mismatches if they cross a chromosome bound.
2008-12-05 twu
* stage1hr.c: On end indels, checking to see if indel_pos is non-positive.
Passing chromosome_iit to Genome_mismatches_left and
Genome_mismatches_right.
* gsnap.c: Added statement at end of batch processing to indicate number of
queries processed.
2008-12-02 twu
* subseq.c: Added U and l flags
* stage1hr.c: For middle indels, putting indel at leftmost genomic position.
Fixed filtering criteria for end indels. Counting mismatches and left
and right to identify candidates for end indels.
* stage1.c: Added debugging statements
* gdiag.c, gsnap-to-iit.c: Using new interface to Genome_fill_buffer
* gmap.c: Fixed genomiclength for user-provided genomic sequence. Stopped
trimming of sequence.
* diag.c: Loosened criteria for MAX_DIAGONALS and MIN_SCORE.
2008-11-25 twu
* genome_hr.c: Made Genome_mismatches_left and Genome_mismatches_right fill
mismatch_positions entries 0..max_mismatches.
* stage1hr.c: Implemented new algorithms for middle insertions and
deletions, using Genome_mismatches_left and Genome_mismatches_right.
* stage1hr.c: Using new interface to Genome_count_mismatches
* genome_hr.c, genome_hr.h: Added functions Genome_mismatches_left and
Genome_mismatches_right. Added builtin bit-vector functions.
2008-11-24 twu
* genome_hr.c, genome_hr.h: Allowed specification of pos5 and pos3 in
Genome_count_mismatches
* genome_hr.h: Using new Compress_T object.
* genome_hr.c: Using new Compress_T object. Removed unused code.
* stage1hr.c: Using new Compress_T object
* stage3hr.c: Fixed memory leak
* compress.c, compress.h: Introduced Compress_T object
2008-11-23 twu
* genome_hr.c, genome_hr.h: Initial entry into CVS
* gsnap.c: Added flag for handling circular-end reads
* stage3hr.c, stage3hr.h: Making a single invertp work on paired-end and
circular-end reads
* stage1hr.c: Using direct comparison against compressed genome to count
mismatches
* genome.c, genome.h: Added function Genome_blocks. Made Genome_fill_buffer
return nunknowns.
* compress.c, compress.h: Added functions Compress_new and Compress_shift
2008-11-20 twu
* sequence.c, sequence.h: For circular-end reads, keeping reverse complement
of queryseq2, but swapping queryseq1 and queryseq2.
* stage1hr.h: Placed local exon-exon mappings after multiple substitutions
and indels in the hierarchy of levels.
* stage2.c: Commented out unused procedure
* stage3.c: Added debugging statement
* stage3hr.c, stage3hr.h: Enabled printing of circular-end reads
2008-11-13 twu
* stage1hr.c: Allowing mismatches with splicing
2008-11-11 twu
* gsnap.c, stage1hr.h: Removed unused parameters
* stage1hr.c: Using new single-end read algorithms for paired-end reads.
Fixed problems with query sequences that contain non-ACGT characters.
* stage3hr.c: Fixed problems with printing inverted sequence for paired-end
reads
* gmap_reassemble.pl.in: Initial entry into CVS
* stage3hr.c: Fixed bug in Stage3_remove_duplicates
* stage1hr.c: Revised splicing parameters. Fixed calculation of maxfloor.
2008-11-10 twu
* stage3hr.c: Favoring substitutions over equivalent indels in removing
repeats.
* stage1hr.c: Treating max_middle_insertions and max_middle_deletions
separately in solving middle indels. Removed scores from
compute_end_indels. Not resetting min_mismatches after single_mm, because
of effect of poly_at oligos.
* gsnap.c, stage1hr.h: Allowing separate parameters for middle and end
insertions and deletions.
* stage3hr.c: Added check and warning messages if observed mismatches is
different from the number expected
* stage1hr.c: Introduced floor system for computing indels. Defined
calculation of middle and end indels more clearly, with separate
parameters for middle and end insertions and deletions.
2008-11-06 twu
* stage1hr.c: Made fixes for splicing to work
* indexdb_hr.h: Removed obsolete functions
* indexdb_hr.c: Fixed masking of left shifts
* indexdb.c: Commented out warning message for multiple index files
* gsnap.c: Partially implemented minlevel and maxlevel controls for Stage1.
Removed references to Stage3chimera_T objects
* resulthr.c, resulthr.h: Removed references to Stage3chimera_T objects
* stage3hr.h: Implemented new structure for Stage3 objects: single reads may
have one or more substrings.
* stage3hr.c: Implemented new structure for Stage3 objects: single reads may
have one or more substrings. Modified print procedure for indels to allow
for mismatches.
* stage1hr.h: Removed chimerap variable since Stage3 single reads are all of
the same type now.
* stage1hr.c: Fixed polyat assessment at ends of query. Storing first and
last diagonals and computing mismatches on both. Changed ptr->indels to
be consistently positive for insertions and negative for deletions. Using
new Stage3 objects. Solving middle indels with mismatches. Using minlevel
and maxlevel to control computing behavior on different alignment types.
2008-11-03 twu
* stage1hr.h: Made max_insertions and max_deletions parameters. Added
minlevel and maxlevel.
* stage1hr.c: Cleaned up procedures for single mismatches and multiple
mismatches. Added oligobase to minus diagonals to prevent negative
coordinates. Made max_insertions and max_deletions parameters. Added
minlevel and maxlevel.
2008-10-28 twu
* md_coords.pl.in: Revised instructions to user
2008-10-24 twu
* stage3.c: In comparing paths_fwd and paths_rev, using just number of
matches
* stage2.c: Also performing stage2 if there is a sufficient value for
ncoverage
* stage1.c: Removed matchsize and matchinterval from Stage1_T object, and
allowing option in scan_ends of iterating on different matchsizes. In
removal of repeated oligomers, now also removing neighboring oligomers.
Now filtering gregions by support.
* reader.c: Added fields so Reader_reset_ends resets correctly
* gregion.c, gregion.h: Added function to filter by support
* gmap.c: Fixed error message for -z flag
* diag.c, diag.h: Returning ncovered from Diag_update_coverage
* block.c, block.h: Removed high-resolution option
2008-10-23 twu
* stage3.c: Handling case where gap is at beginning of path. Trimming end
exons until a canonical intron is reached.
* stage1.c: Identifying repeated oligos at the outset
* pair.c: Made counting of ambiguous matches more uniform
* gsnap-to-iit.c: Added information about unique positions. Added ability
to halt at a given position.
* gregion.c: Modified print statement
2008-10-10 twu
* pair.c: Made N's in query sequence align as mismatches in GMAP.
* gmapindex.c: Removing "chr" from chrsubset file
* stage1hr.c: Tightened criteria for finding exon-exon junctions. Not
reading 10- or 11-mers at ends if the 12-mer is invalid.
* stage3hr.c: Made macros for text constants
* match.c, match.h: Added function Match_print
* gsnap.c: Changed default of trim flag to be false
* gmap.c: Added jobdiv capability to GMAP
2008-10-02 twu
* blackboard.c: Simplified the fix for the hang for input done with no inputs
2008-10-01 twu
* blackboard.c: Fixed hang that occurs when no input was ever received,
which happens with the jobdiv option when the input has fewer sequences
than the first batch modulus.
2008-09-26 twu
* iit_get.c: Allowing retrieval of labels that contain colons, by checking
first to see if the first part of the label is a divstring.
* iit-read.c, iit-read.h: Added function to determine divint without reading
entire IIT.
2008-09-23 twu
* stage3.h: Added PRE_ENDS as a debugging endpoint
* stage3.c: Modified solutions at ends. First, we decide between distal and
medial, with distal penalized for non-canonical introns. Then, we simply
extend the ends without peelback and permitting an initial gap.
* stage2.c: Introducing a minimum pct_coverage
* oligoindex.c, oligoindex.h: Allowed suffnconsecutive to be a different
value in each level of resolution.
* match.c, match.h: Moved function Match_get_coords to gregion.c
* gregion.h: Added fields for weight and support.
* gregion.c: Added fields for weight and support. Duplicate gregions are
now resolved in favor of the gregion with the greatest weight, or if
equal, the greatest support.
* gmap.c: Increased extraband at end from 3 to 6
* dynprog.c: Allowing a gap to start the alignment of end5 and end3.
Introducing a parameter init_jump_penalty_p to control this.
* diag.c: Replaced 0 with 0U in some cases
* stage1.c: Using weights on matches and on gregions to focus on genomic
regions with most specificity.
2008-09-19 twu
* stage1.c: Added penalty for intron length in find_best_path to reduce
excessively large regions. If segments are used, then clearing gregions
and starting over.
2008-09-16 twu
* stage1hr.c: Fixed bug with false positives on middle indel. Fixed bug
with combinations of insertions and deletions in find_segments_multiple_mm.
2008-09-15 twu
* gmap.c: Allowing multiple paths for alignment against user-provided
segment. Explicitly recomputing goodness over all stage3 objects.
Allowing user to specify direction of introns.
* stage2.c: Giving points for indexsize-equivalent number of matches if it
starts a new chain.
* stage1hr.c: Fixed problem with insertions in first 12-mer. Now treating
as a mismatch, as we did for insertions in the last 12-mer.
* oligoindex.h: Removed debug_graphic_p from argument list
* oligoindex.c: Fixed memory leak
* md5-compute.c: Added ability to handle multiple input files
* diag.c, diag.h, diagdef.h: Computing scores for each diagonal and
requiring a minimum score
2008-09-09 twu
* stage1hr.c: Fixed bug in exact match to end of chromosome, resulting in
negative coordinates of the next chromosome.
* stage3.h: Added function for recomputing goodness
* stage3.c: Made widebandp true on all single gap solutions. Extending 5'
and 3' ends, rather than comparing distal with medial, when defect rate is
high. Recomputing goodness using just matches if best hit is poor.
2008-09-08 twu
* diag.c, diag.h: Moved some functions from oligoindex.c to diag.c
* oligoindex.c, oligoindex.h: Implemented different mapping resolutions by
using multiple oligoindices. Using a separate lookback for each
resolution.
* stage2.c, stage2.h: Implemented different mapping resolutions by using
multiple oligoindices.
2008-09-04 twu
* configure.ac: Added check for stat64
* acinclude.m4: Including config/acx_mmap_fixed.m4,
config/acx_mmap_variable.m4, and config/struct-stat64.m4.
* VERSION: Updated version
* README: Augmented instructions for new gmap_setup flags and made mention
of GSNAP.
* dynprog.c, dynprog.h: New functions added for dealing with an internal gap
* indexdb.c: Fixed problem in reading offsets and positions file based on
interval of 6.
* gmap.c: Made default canonical mode to be 1
* stage3.c: Reverted to revision 1.300 with newer code kept for stage3debug.
* stage2.c: Reverted to revision 1.221 with newer code kept for converting
oligomers to nucleotides
* smooth.h: Removed stage2_indexsize
* smooth.c: Reverted to revision 1.41, plus removal of stage2_indexsize
* oligoindex.h: Reverted to revision 1.47, plus most recent wobble masking
and code for multiple oligoindices
* oligoindex.c: Reverted to revision 1.108, plus most recent wobble masking
and code for multiple oligoindices
* diagpool.c: Removed initialization for bestscore and prev fields
* diagdef.h: Removed score, bestscore, and prev fields
* diag.h: Reverted to revision 1.5 with some functions moved from
oligoindex.c.
* diag.c: Reverted to revision 1.7 with some functions moved from
oligoindex.c.
* stage3.h: Calling stage 2 directly
* stage3.c: More attempts to rearrange steps
* stage2.c, stage2.h: Bypasses former stage 2 and returns best path of
diagonals, converted to nucleotides
* smooth.c: Changed function for finding internal shorts
* oligoindex.c, oligoindex.h: Changed Oligoindex_get_mappings to return a
list of diagonals
* iit-read.h: Added comments to explain arguments
* gmap.c: Having stage2 return a path
* diagpool.c: Added initialization for bestscore and prev
* diagdef.h: Added fields for bestscore and prev
* diag.c, diag.h: Added functions Diag_compare_querystart and Diag_best_path
2008-08-15 twu
* diag.c, diag.h, gmap.c, oligoindex.c, oligoindex.h, stage2.c, stage2.h:
Implementation of oligoindex step at multiple resolutions
* stage2.c: Rearranged procedures in preparation for multiple oligoindices.
* oligoindex.c, oligoindex.h: Moved various functions from oligoindex.c to
diag.c. Added various variables to Oligoindex_T struct. Rearranged
procedures in preparation for multiple oligoindices.
* diag.c, diag.h: Moved various functions from oligoindex.c to diag.c
* stage2.h: Added a version of stage 2 that can be called from within stage
3.
* stage2.c: Using active hits, instead of minactive and maxactive bounds.
Added hooks for relying upon splice site scores. Made conversion to
nucleotides handle arbitrary masks. Added penalty for diffdistance not a
multiple of 3. Added a version of stage 2 that can be called from within
stage 3.
* stage1hr.c: Exiting if a single polyat 12-mer found, to prevent false
indels from being found in find_segments_multiple_mm.
* oligoindex.h: Computing active hits around each diagonal, instead of
minactive and maxactive bounds.
* oligoindex.c: Added wobble masking. Computing dominance by using scores,
based on number of diagonals overlapping each querypos.
* indexdb_hr.c: Added masking for all left shifts
* indexdb.c: Fixed problem where highest resolution indexdb was not being
used
* gmap.c: Using new interface to Oligoindex_set_inquery
* diag.c, diag.h, diagdef.h: Added score to Diag_T object
* block.c: Added error message
2008-08-11 twu
* oligoindex.c, oligoindex.h: Passing character strings to procedures,
rather than Sequence_T objects.
2008-08-10 twu
* gmap.c: Made changes to debug requests from stage3
2008-08-09 twu
* stage3.c: Rearranging steps to improve cross-species performance. Work
still in progress.
2008-08-08 twu
* stage1hr.c: Removed old code
* stage1.c: Made heap and segment algorithm work for PMAP
* binarray.c, binarray.h: Removed binarray source code
* sequence.c: Redefined trim_end for PMAP to exclude the terminal stop codon
added
* pmapindex.c: Including index1interval in filename for PMAP databases
* matchpool.c: Removed old code that referred to positions, not diagonals
* match.c: Simplified a procedure
* indexdbdef.h: For PMAP, allowed index1interval to be determined by
available databases
* indexdb_hr.c, indexdb_hr.h: Moved Indexdb_read_no_subst command to
indexdb.c
* indexdb.c, indexdb.h: Moved Indexdb_read_no_subst command here. Including
index1interval into filename for PMAP databases.
* gmap.c: Changed variable name from samplingp to lowidentityp
* block.c: Bypassing oligo.c and calling Indexdb commands directly
2008-08-06 twu
* Makefile.dna.am, Makefile.gmaponly.am, Makefile.gsnaptoo.am,
Makefile.pmaptoo.am, stage1.c, stage1.h: Removed binarray
2008-08-05 twu
* binarray.c, binarray.h, stage1.c: Transitioning away from bins and toward
segments. Intermediate code contains both sets of functions.
2008-08-01 twu
* block.c, block.h, gregion.c, gregion.h, matchpool.c, matchpool.h,
stage1.c: Just before change to using diagonals, with directives
indicating changes
2008-07-30 twu
* gsnap.c: Changed batch specification so it runs from 0 to n-1.
* stage1hr.c: Changed hierarchy of results to be exact, sub:1, local
splicing, half introns, sub:2, sub:3, sub:4, indels, distant splicing.
Increased speed for computing splice ends. Limiting nmismatches for each
splice end, so not checking nmismatches for splicing after that.
* gsnap.c: Reduced default maxpaths to 20 and maxchimerapaths to 2
2008-07-29 twu
* stage1hr.c: Improved identification of repetitive oligos
* sequence.c: Better handling of FASTA files that end with blank lines
* stage1hr.c, stage1hr.h: Implemented different sizes for insertions and
deletions
* stage3hr.c, stage3hr.h: Added function Stage3_remove_duplicates
* stage1hr.c: Made 12-mer mod 3 strategy work for multiple mismatches,
indels, and exon-exon junctions.
2008-07-28 twu
* stage1hr.c: Removed special variables for -2, -1, querypos+1, and
querypos+2. Removed middle_indel_p.
* stage1hr.c: Made paired reads use new 12-mer strategy for exact and 1-sub
* stage1hr.h: Changed variable name to expected_pairlength
* datadir.c: Improved error message when genome db not found
* indexdb_hr.c, indexdb_hr.h, intlist.c, intlist.h, stage1hr.c: Implemented
faster version of exact and 1-sub using 12-mers
2008-07-26 twu
* indexdb_hr.c: Removed oligo_hr.h and oligo_hr.c. Added code for reading
left and right subst of 1 and 2 nts.
* oligo_hr.c, oligo_hr.h: Removed oligo_hr.h and oligo_hr.c
* Makefile.dna.am, Makefile.gmaponly.am, Makefile.gsnaptoo.am,
Makefile.pmaptoo.am, block.c, indexdb.c, indexdb.h, stage1hr.c: Removed
oligo_hr.h and oligo_hr.c.
* indexdb_hr.h: Initial import into CVS
2008-07-17 twu
* stage3hr.c: Fixed handling of trimming for inverted hits. Fixed handling
of hits that have negative genomic coordinates.
* stage1hr.c: Fixed handling of trimming at ends
* pair.c: Changed output to show "genome" instead of "chr"
* Makefile.dna.am, Makefile.gmaponly.am, Makefile.gsnaptoo.am,
Makefile.pmaptoo.am, diagnostic.c, diagnostic.h, gmap.c, result.c,
result.h, stage1.c, stage1.h: Added Diagnostic_T to hold information
* chimera.c, chimera.h, get-genome.c, iit_plot.c, match.c, match.h,
stage3.c: Using new interface to Genome_get_segment
* genome.c, genome.h: Printing out-of-bounds characters on all cases where
coordinates exceed chromosomal boundaries.
2008-07-07 twu
* stage1hr.c: Added comment
* stage1.c: Added high-resolution sampling
* oligo.c, oligo.h: Removed burden of leftreadshift to caller
* indexdb.c, indexdb.h: Added function to provide indexing interval
* block.c, block.h: Added high-resolution behavior to Block_T object
* gmap.c: Added hybrid behavior for finding canonical introns: low reward
for high-identity sequences and high reward otherwise.
* stage1.h: Removed obsolete functions
* stage1.c: Renamed variables
* pair.c, pair.h, stage3.c: Printing separate runtimes for stage2
diagonalization and alignment
* stage2.h: Computing separate runtimes for stage2 diagonalization and
alignment.
* stage2.c: Reinstating limitation on maximum number of active hits.
Computing separate runtimes for stage2 diagonalization and alignment.
* stage1.c, stage1.h: Reporting whether sampling was used
* gmap.c: Using smaller stage 2 indexsize when stage 1 sampling is done
* plotgenes.c, plotgenes.h: Added ability to handle values
* pdldata.c: Using Access_mmap function
* gdiag.c, gsnap-to-iit.c: Using new interface to Genome_fill_buffer
* subseq.c: Added initial '>' to header
* stage3.c: Using new interface to IIT_print
* stage1.c: Removed references to Matchpair_T
* pmapindex.c: Removed -l as an input flag
* oligo.c, oligo.h: Added code for identifying repetitive oligos
* match.c, match.h: Added code for dealing with pairs of matches
* matchpair.c, matchpair.h: Removed Matchpair_T code
* indexdb.h: Restoring previous definition of sufficient support
* iit_update.c: Using new interface to IIT_read
* iit_plot.c: Handling values, in addition to counts and genes
* gregion.c, gregion.h: Added fields to Gregion_T
* gmap.c: Interpreting optarg as strings, not integers
* get-genome.c: Removed -F and -R flags. Using -R flag for relative
coordinates.
* block.h: Separated interfaces for GMAP and PMAP
* block.c: Added hook for removing repetitive oligos
* binarray.c: Taking all boxes in final step. Reduced debugging output.
* Makefile.dna.am, Makefile.gmaponly.am, Makefile.gsnaptoo.am,
Makefile.pmaptoo.am: Added binarray.c and .h and removed matchpair.c and .h
* matchpair.c: Added code for bins
* stage1.c: Added two levels of ntopboxes
2008-07-03 twu
* binarray.c, binarray.h, stage1.c: Implemented working version of binarray
algorithm
* Makefile.util.am, revcomp.c, seqlength.c, subseq.c: Added utility programs
for internal use
2008-07-02 twu
* binarray.c, binarray.h: Initial import into CVS
2008-07-01 twu
* stage1.c, stage1.h: Initial implementation of bins
2008-06-30 twu
* stage3.c, iit-read.c, iit-read.h: Added ability to print relative
coordinates
* stage1hr.c: Improved handling of heaps. Added code for handling
out-of-bounds conditions.
* sequence.c, sequence.h: Added command for printing revcomp of sequence
* interval.c, indexdb.c: Added debugging statements
* indexdb_hr.c: Replaced separate variables for heapsize and delta into a
single header. Added code for doing all reads, then doing all writes.
* iitdef.h: Storing separate mmap pointers for parts of IIT
* gmap.c: Removed -R flag
* gsnap.c: Added ability to handle input sequence in batches
* genome.c, genome.h: Changed out-of-bounds symbol to be '*'.
2008-06-25 twu
* gmap.c: Made low reward for canonical sequences to be the default
* get-genome.c: Fixed calculation of genomiclength
* iit-read.h: Removed unused function.
* iit-read.c: Now doing memory mapping of pointers rather than reading all
of them. Fixed bug in reporting second chromosomal coordinate. Fixed bug
in sorting segments by coordinate.
2008-05-19 twu
* indexdb_hr.c: Made process_heap inline. Removed delta from Batch_T.
* indexdb_hr.c: Various code introduced to improve speed of heapify operation
2008-05-09 twu
* access.c, access.h: Fixed mmap calls with offset so offset is on a page
boundary
* stage3hr.c, stage3hr.h: Added new functions for filtering and sorting
chimeras. Fixed calls of scrambled exons.
* stage1hr.c: Major efficiency improvements in heapify and other heap
functions for merging diagonals
* indexdb_hr.c: Major efficiency improvements in heapify and other heap
functions for merging batches
* stage1hr.c: Changed filtering methodology for exon-exon junctions
2008-05-08 twu
* indexdb_hr.c: Using pointers to memory-mapped positions file, and adding
shift-plus-diagterm as heap builds the final array of positions.
2008-05-07 twu
* indexdb_hr.c: Restored previous version
* indexdb_hr.c: Attempt to reduce D2 cache miss rate, but actually increases
it by 10x.
2008-05-06 twu
* stage3hr.c: Using correct type for Stage3chimera_t objects
* stage1hr.c: Set chimerap flag correctly
* indexdb_hr.c: Faster counting of entries in cases where duplicates are not
allowed
* indexdb.c: Minor syntactic changes
* gsnap.c: Turned off reading of labels for map iit files
2008-05-05 twu
* gmapindex.c, pmapindex.c: Providing chromosome_iit to procedures for
writing offset and position files
* stage3hr.c, stage3hr.h: Values for chrnum are pre-computed rather than
computed here
* stage1hr.c: Using new interfaces to Genome_fill_buffer and Stage3_new
routines
* indexdb.c, indexdb.h: No longer storing oligomers at ends of chromosomes
* iit-read.c, iit-read.h, iitdef.h: Providing specific fields for memory
mapping of labels and annotations. Reading all pointers for labels and
annotations.
* genome.c, genome.h: Trimming correctly at chromosome boundaries.
Returning chrnum.
* access.h: Added function to mmap at a particular offset.
* access.c: Added function to mmap at a particular offset. Added check for
struct stat64.
* stage1hr.c: Searching for indels only if substitution fails
2008-05-04 twu
* gsnap.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h: Added trimming of
mismatches at ends of substitutions
2008-04-25 twu
* stage3hr.c, stage3hr.h: Printing information about paired result type and
about structural variations in spliced reads.
* stage3.c: Removed include of maxent.h
* stage1hr.c, stage1hr.h: Added hierarchy of paired result types. Checking
for cross repetitiveness.
* sequence.c: Improved debugging statements
* resulthr.c, resulthr.h: Storing information about paired result type
* iit_plot.c, plotgenes.c, plotgenes.h: Fixed handling of new IIT map format
* pair.c: Changed output format for IIT-readable files (-f 7)
* list.c, list.h: Added function of List_to_array that reports list length
* iit-read.c: Fixed handling of flanking intervals
* gsnap.c: Adding information about paired result type and providing
information about max number of paired paths.
* Makefile.gsnaptoo.am: Using tables for IITs. Removed iit_update.
* Makefile.dna.am: Using tables for IIT. Removing gdiag and iit_update.
2008-04-23 twu
* iit_get.c: Fixed bug in handling queries from stdin
2008-04-22 twu
* stage3hr.c: Slight improvement in efficiency in eliminating duplicates or
dominated paired end solutions.
* dynprog.c: Reduced mismatch penalty for low quality sequences. Equalizing
extension penalty for single gaps, regardless of sequence quality.
* maxent.c: Added debugging statements
* stage3hr.c: Fixed bugs in eliminating duplicate or dominated paired-end
results
* iit-read.c: Fixed memory leak for entire IIT structure
* stage3.c: In dual break, peeling pairs first.
* stage3.c: Improved handling of dual breaks by scanning genomic segment
2008-04-21 twu
* iit_store.c: Fixed bug in handling intervals without divs
* iit-write.c: Added error message if total_nintervals is zero.
* iit-read.c: Modified output for IIT_dump
* fa.iittest, iit_get.out.ok: Modified IIT input/output for new interval
format
* stage3hr.c, stage3hr.h: Added code for printing half introns. Now storing
chrnum when Stage3_T objects are computed. Using chrnum to determine
whether two paired ends are connectable.
* stage1hr.c: Using scores to determine whether indel beats substitution.
Added code for finding half introns. Now storing chrnum when Stage3_T
objects are computed.
* indexdb_hr.c: Added code, not currently used, for using doubles to find
longer oligomers.
2008-04-15 twu
* stage1hr.c, stage1hr.h: Increased MAX_INDELS, and using it instead of
hard-coded 3
* plotdata.c, segmentpos.c, stage3hr.c, chrnum.c, chrnum.h, chrsubset.c,
gdiag.c, genomepage.c, genomeplot.c: Using new interface to IIT_label
* plotgenes.c, plotgenes.h, stage3.c: Using new interface to IIT_get and
IIT_label
* table.c, table.h: Added functions Table_string_compare and
Table_string_hash
* pmapindex.c, iit_dump.c, iit_plot.c: Using new interface to IIT_read
* match.c, pair.c: Using new interface to Chrnum_to_string
* indexdb.c: Using IIT_total_nintervals
* indexdbdef.h: Moved definition of Indexdb_T to a separate file
* iitdef.h: Added fields for whether labels were read, and for offsets to
various parts of the iit file.
* iit_store.c: Using new version for reporting intervals
* iit_get.c: Using new interface to IIT_get and IIT_read. Added ability to
center annotations at a given column.
* iit-write.c: Fixed bugs for divs with no intervals
* iit-print.c, iit-print.h: Moved IIT_print procedures back to iit-read.c.
* iit-read.c, iit-read.h: Fixed bug in handling divs with no intervals.
Allowing memory mapping of labels and intervals and their pointers (in
addition to annotations). Moved IIT_print procedures back to this file.
* gsnap.c: Providing flag for user to specify consecutive matches, to
control speed
* gsnap-to-iit.c: Removed flag for old GSNAP version output format
* gmapindex.c: Using tables to provide information to IIT_write
* get-genome.c, gmap.c: Using new interface to IIT_get and IIT_read
* genome-write.c: Using new interface to IIT_get
2008-04-10 twu
* gregion.c, match.c, matchpool.c: Made IIT_get_one pass additional parameter
2008-04-01 twu
* stage1hr.c: Various methods to improve speed, including separate
processing for plus and minus strands, use of threshold_noligomers and a
user-specified threshold_score for finding segments for multiple
mismatches.
2008-03-31 twu
* stage1hr.c: Removed old code based on fixed (nonrecursive) oligosize
* stage1hr.h: Changed variable names
* stage1hr.c: Using new variable names for paired-end lengths. Generalized
mask for oligosize.
* gsnap.c: Removed -a flag and replaced it with -S flag. Changed flags for
paired-end lengths.
* chrnum.c, chrom.c, chrom.h, chrsubset.c, segmentpos.c: Using new interface
to IIT routines with divs.
* get-genome.c: Moved Chrom_string_from_position function to iit-print.c
* stage3hr.h: Changed variable names for paired-end lengths.
* stage3hr.c: Using new interface to IIT routines wiht divs. Changed
variable names for paired-end lengths.
* stage1hr.c: Made indel alignments extend inward from ends as far as
possible.
* stage1hr.c: Added new routine for computing indels without using dynamic
programming matrix. Maximizes matches from left to right.
2008-03-27 twu
* iit-print.c, iit-print.h, iit-read.c, iit-read.h, iit-write.c,
iit-write.h, iit_get.c, iit_store.c, iitdef.h: Introduced version 3 of IIT
format, to handle multiple divs.
2008-03-20 twu
* Makefile.dna.am, Makefile.gsnaptoo.am: Removed block_hr and blockdef files
* pmapindex.c: Removed both uppercase and lowercase flags, and added -l flag
to make the distinction
* stage3hr.c: Changed order of output so type of match comes before genomic
location
* stage1hr.c: Handling short reads with lowercase characters. Using
Oligo_hr functions rather than Block_T functions.
* sequence.c, sequence.h: Added functions to handle short reads with
lowercase characters
* oligo_hr.c, oligo_hr.h: Moved leftreadshift step out of oligo_hr functions
* oligo-count.c: Using new interface to Block_new
* indexdb_hr.c: Removed checking for duplicates
* indexdb.c, indexdb.h: Added ability to mask lowercase characters in genome
* gsnap.c: Made program work for query sequences with lower case
* gmapindex.c: Removed uppercase and lowercase flags and added -l flag.
Making ".masked" indexdb files for masked genomes (where lowercase nts not
indexed).
* genome.c, genome.h: Changed name of variable
* block.c, block_hr.c, block_hr.h, blockdef.h: Restored definition of
Block_T to block.c
2008-03-05 twu
* gsnap.c: Using new interfaces to Stage1 procedures
* stage1hr.c: Deleted debugging statements that give a seg fault
* stage1hr.c, stage1hr.h: Generalized procedures to use arbitrary oligosize
* stage1.c: Using new interface to Block_new
* block.c, block.h, block_hr.c, blockdef.h, oligo.c, oligo.h, oligo_hr.c,
oligo_hr.h: Generalized procedures to handle arbitrary oligosize
* indexdb_hr.c: Fixed bugs in adding wildcard nucleotides
* indexdb.c: Fixed bug in recognizing index file at interval 6
* Makefile.dna.am, Makefile.gmaponly.am, Makefile.gsnaptoo.am: Added new
source files
* stage3hr.c: Generalized print procedures to handle arbitrarily long reads
* stage1hr.c, stage1hr.h: Added recursive procedures for paired end reads
* stage1.c: Using original GMAP calls to Block_T procedures
* oligo.c, oligo.h, oligo_hr.c, oligo_hr.h: Moved GSNAP-specific procedures
to a separate file
* intlist.c, intlist.h: Added function Intlist_ascending_by_key
* indexdb_hr.c: Moved GSNAP-specific procedures to a separate file.
* indexdb.h: Using "id<number>" as file suffix for offsets and positions
files.
* indexdb.c: Moved definition of Indexdb_T object to a separate file.
Separated GSNAP-specific procedures to a separate file. Using
"id<number>" as file suffix for offsets and positions files.
* gmapindex.c: Removed -e flag for specifying subindexing
* block_hr.c, block_hr.h: Made separate file for GSNAP-specific procedures
* blockdef.h: Put Block_T definition into a separate file
* block.h: Removed Block_T procedures specific to GSNAP
* block.c: Put Block_T definition into a separate file. Removed GSNAP
parameters for GMAP calls to Block_T procedures.
2008-03-04 twu
* stage1hr.c: Implemented recursive method for finding exact matches.
Binary search not yet added.
2008-03-03 twu
* Makefile.am, cvs2cl.pl: Made maintainer Perl machine-independent
* fa_coords.pl.in, gmap_process.pl.in, gmap_setup.pl.in, md_coords.pl.in:
Made different make commands for gmapdb_highres and gmapdb_lowres
* Makefile.dna.am, Makefile.gmaponly.am, Makefile.gsnaptoo.am: Customized
each Makefile.am for its specific task
2008-02-29 twu
* gmapindex.c, iit-read.c, segmentpos.c: Using new interface to obtain
strings from Chrom_T objects
* chrom.c, chrom.h: Restricted criteria for considering initial part of
chromosome string as numeric. Now storing initial string directly.
* gsnap.c: Using new interface to print commands
2008-02-28 twu
* stage3hr.c, stage3hr.h: Changed output to be more uniform, in a 1-column
format
* list.c: Added include of string.h
* iit_plot.c: Made program able to print counts
* iit-read.c: Added more informative error messages when offset appears
incorrect relative to filesize. Removed output of type in print_record.
* gsnap-to-iit.c: Handles new GSNAP output format. Handles remapping to
genome.
* get-genome.c: Made program work correctly on chromosomally tagged IIT map
files
* genomepage.c, genomepage.h: Removed sequence as a parameter
* pair.c, pair.h, stage3.c: Modified output of exon map
* plotgenes.c, plotgenes.h: Added function for printing counts
* Makefile.dna.am, Makefile.gsnaptoo.am, blackboard.c, blackboard.h, gmap.c,
gsnap.c, params.c, params.h, reqpost.c, reqpost.h: Removed Params_T object
2008-02-26 twu
* gsnap-to-iit.c: Handling new version of gsnap output (after remapping).
* gsnap-to-iit.c: Added -b flag to specify blocksize. Made default
blocksize 10000.
2008-02-13 twu
* iit_plot.c, plotgenes.c, plotgenes.h: Fixed printing of genes in ascii
format
2008-02-08 twu
* plotgenes.c, plotgenes.h: Added binning by pixel. Removed allgenesp for
plot_counts.
2008-02-07 twu
* gsnap-to-iit.c, plotgenes.c: Modified count format for IITs to store
information in batches
* plotgenes.c: Added printing of alternate counts. Fixed problem for calls
to IIT_get_typed.
* gsnap-to-iit.c: Initial import into CVS
2008-02-06 twu
* iit_plot.c: Increased top margin. Added -V flag for handing count data.
* plotgenes.c, plotgenes.h: Added function for plotting count data.
Handling signs for both versions 1 and 2 of IIT files.
* iit-read.h: Added interface for IIT_version()
* iit-read.c: Added abort statement for negative coordinates
* sequence.c: Added functions for skipping sequences
* indexdb.c: Commented out some information output to stderr
* iit_get.c: If iit file not found, try adding ".iit" suffix
* stage3hr.c: Printing distances for spliced reads only if distance value is
nonzero
* stage1hr.c: Fixed calculation of distances in spliced reads
2008-02-05 twu
* Makefile.gmaponly.am, Makefile.gsnaptoo.am, Makefile.pmaptoo.am: Added
compiler commands for iit_plot
* iit_plot.c: Taken from mapplot.c in gdp.
* genomepage.c, genomepage.h: Extracted commands from gdata-write in gdp.
* plotgenes.c, plotgenes.h: Incorporated changes from gdp. Improved
plotting capabilities.
* list.c, list.h: Incorporated changes from gdp. Added List_from_string.
* color.c: Incorporated changes from gdp. Removed yellow.
2008-01-30 twu
* gsnap.c: Limited reporting of exon-exon paths. Added -E flag to turn off
finding of exon-exon solutions.
* genome.c, genome.h: Made Genome_fill_buffer return a false value if it
goes into negative genome coordinates.
2008-01-29 twu
* stage1hr.c: Skipping cases that result in negative genomic coordinates.
Skipping cases of finding first indels when alignment doesn't extend to
the end.
* stage3hr.c, stage3hr.h: Made fixes for handling exon-exon junctions
* stage1hr.c: Fixed problems in handling various combinations of
sense/antisense and plus/minus strands for exon-exon junctions.
* gmap.c: Made finding canonical introns the default. Made -X flag take an
argument.
2008-01-16 twu
* stage1hr.c, stage3hr.c, stage3hr.h: Improved algorithm for finding and
ranking chimeras
2008-01-14 twu
* gmap.c, pair.c, pair.h, stage3.c, stage3.h: Added output in IIT exon map
format
* stage1hr.c, stage3hr.c, stage3hr.h: Added printing of number of mismatches
for chimeras
* dynprog.c: Added type cast for memset.
* stage1hr.c: Reduced max mismatches to 4. Penalized mismatches further in
finding breakpoints for chimeras.
* stage3hr.c: Reporting breakpoint coordinates for chimeras
* stage1hr.c: Increased penalty for mismatches to help find correct
breakpoint for chimeras
2008-01-11 twu
* Makefile.gmaponly.am, Makefile.gsnaptoo.am, Makefile.pmaptoo.am,
stage1hr.c, stage3hr.c, stage3hr.h: Added probabilistic calculations of
splice sites
* gsnap.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h: Implemented code
for identifying chimeras
* resulthr.c, resulthr.h: Added a new type for chimeras
2008-01-09 twu
* stage1hr.c: Fixed typo in variable name
* resulthr.c, resulthr.h, stage3hr.c, stage3hr.h, gsnap.c: Added the ability
to report paired-end cases that fail to co-localize
* gmap.c: Cleaned up some code
* gsnap.c, stage1hr.c, stage1hr.h: Added -o flag to specify optimum length
2008-01-08 twu
* gsnap.c: Added -I flag for specifying inversion of second read of paired
end read
* sequence.c, sequence.h: Added procedure for printing revcomp of a short
read
* stage3hr.c, stage3hr.h: Provided options for printing second read either
in original direction or as reverse complement.
* stage1hr.c: Fixed various memory leaks
* gsnap.c: Added flag to print all solutions, either for single read or for
paired end read.
* stage3hr.c, stage3hr.h: Added procedures for sorting results of single
read mappings
* stage1hr.c, stage1hr.h: Added ability to print all solutions in single read
* stage3hr.c, stage3hr.h: Added sorting of results by closeness to optimal
distance
* gsnap.c: Removed unused variables. Removed instant printing feature.
* stage1hr.c, stage1hr.h: Removed instant printing feature
2008-01-07 twu
* resulthr.c, resulthr.h: Generalized Result_T object so it can print either
single or paired end results
* sequence.c, sequence.h: Implemented procedure for reading short reads,
either single or paired ends.
* request.c, request.h: Enabled storage of paired reads in Request_T object
* stage3hr.c, stage3hr.h: Implemented routines for storing and printing
paired ends
* stage1hr.c: Implemented separate strategy for handling reads with poly-A
or poly-T 12-mers. In such cases, need to test 12-mers exhaustively.
* stage1hr.c, stage1hr.h: Initial implementation of mapping for paired
reads. For consistency, changed indel to be same rank as sub:2 for single
reads. Generalized separator used in printing results.
2008-01-04 twu
* stage1hr.c: Separated single read strategy into separate components
* sequence.c: Fixed a memory leak.
* stage3hr.c, stage3hr.h: Added a stage3 procedure specific for GSNAP
* sequence.c, sequence.h: Added a read procedure that converts input to
uppercase
* gsnap.c, resulthr.c, resulthr.h: Made GSNAP algorithm return results
rather than printing them
* Makefile.gmaponly.am, Makefile.gsnaptoo.am, Makefile.pmaptoo.am: Removed
gregion.c and added stage3hr.c to GSNAP build
* stage1hr.c, stage1hr.h: Made algorithm return results rather than printing
them. Fixed a bug in handling cases with mismatches on both ends.
2007-12-19 twu
* gmap.c, pair.c, pair.h, stage3.c, stage3.h: Added -4 flag for printing
alignments per exon
* gsnap.c: Removed unused code
* stage1hr.c: Removed unused header files and objects
* stage1.c, stage1.h: Added function Stage1_size()
* resulthr.c, resulthr.h: Added specialized Result_T for GSNAP
* reqpost.h: Added conditional include of resulthr.h for GSNAP
* params.c, params.h: Removed obsolete fields
* matchpair.c, matchpair.h: Removed obsolete functions
* iit-read.c: Removed warning message about not finding a file
* stage3.h: Made maponly mode work with Gregion_T objects.
* stage3.c: Fixed bug where genomicuc_ptr was NULL. Made maponly mode work
with Gregion_T objects.
* gregion.c, gregion.h: Added fields to Gregion_T for maponly mode.
* gmap.c: Restored old maponlyp code. Added error message if pthreads fails
on operating system.
* genuncompress.c: Added flag to print one character per line
* blackboard.c, blackboard.h: Added function to see if blackboard is done
* Makefile.gmaponly.am, Makefile.gsnaptoo.am, Makefile.pmaptoo.am: Added
commands for GSNAP
2007-12-07 twu
* pair.c: Fixed bug with trim_end, now an exclusive coordinate rather than
an inclusive one.
* cvs2cl.pl: Added changelog program to CVS
* gmap.c: Changed calls to Stage1_compute to match new interface
* stage1hr.h: Removing unused parameters
* stage1hr.c: Corrected positions of 12-mers for sequences shorter than 36
nt. Reduced final threshold score to 12. Checking for repetitive sequence.
* stage1.c, stage1.h: Separated stage 1 low-resolution procedure from
high-resolution procedure.
* block.c, block.h, indexdb.c, indexdb.h, oligo.c, oligo.h: Added procedure
for counting number of genomic positions for a given oligomer
* gsnap.c: Now calling Stage1hr_compute directly. Removed some unused code.
* gmap.c: Restored maponly mode
2007-12-06 twu
* stage1hr.c: Initial attempt to generalized procedure to handle oligomers
shorter than 36, using initial testing of 3 12-mers.
2007-12-01 twu
* stage1hr.c: Fixed bugs in computation and printing of middle indels.
Fixed bug when best_querypos (in terms of npositions) was -1.
* stage1hr.c: Added strategy of using most specific oligomer to drive search
for exact matches.
* stage1hr.c: Added keep_score parameter to find_segments.
* indexdb.c: Made heapify faster. Hard-coded left_nts and right_nts to be 1
in read_shifted.
* genome.c: In Genome_fill_buffer, checking for negative starting
coordinate, and filling with N's if necessary.
2007-11-30 twu
* stage1hr.c: Limited dynamic programming to just the non-matching oligomer,
whenever possible.
* stage1hr.c: Added a triplet matching step with binary search to find exact
matches. Fixed a bug in find_segments in handling the last diagonal.
2007-11-29 twu
* stage1hr.c: Changed output format to show substitutions, insertions, and
deletions. Made speed improvements in heap algorithm.
* gsnap.c: Simplified code for handling short reads. Stopped usage of
oligoindex.
* stage1hr.c: Implemented version that handles indels. Some speed
improvements in reporting exact matches when found.
2007-11-27 twu
* stage1hr.c: Removed unused code
* stage1hr.c, stage1hr.h: Working version implemented for 36-mers, allowing
for substitutions
* stage1.h: Taking queryseq as an argument for Stage1_compute (needed for
gsnap).
* stage1.c: Using new interface to Block_process_oligo
* Makefile.am: Makefile.am now generated by bootstrap from other files
* stage2.c: Added debugging statements
* sequence.c, sequence.h: Added procedure Sequence_print_oneline
* rbtree.c, rbtree.h, rbtree.t.c, gregion.c, gregion.h: Initial import into
CVS
* result.c, result.h: Added procedure Result_blank
* params.c, params.h: Removed truncstep
* oligoindex.c: Added correct calculation of badoligos
* oligo.c, oligo.h: Providing diagterm information to lookups from indexdb
* indexdb.c, indexdb.h: Changed high-resolution indexdb to be subclassified
by adjacent nucleotides, rather than by phase.
* gsnap.c: Adding separate main program for gsnap.
* block.h: Added function Block_skipto. Giving diagterm information to
Oligo_lookup.
* block.c: Added function Block_skipto. Revised coordinates assigned to
last_querypos. Giving diagterm information to Oligo_lookup.
* Makefile.gmaponly.am, Makefile.pmaptoo.am: Added hooks for gsnap
* fa_coords.pl.in: Improved handling of cases where chromosome is not parsed
* gmap_setup.pl.in: Added -H flag to generate high-resolution gmap dbs.
2007-11-26 twu
* iit_store.c: Fixed bug in handling GFF files
2007-11-14 twu
* indexdb.c, indexdb.h: Implemented precise positioning by organizing
composite positions according to phase
2007-11-13 twu
* result.c, result.h: Remove stage 1 diagnostic information
* matchpair.c, matchpair.h: Making matchpair generate gregion as output from
stage 1
* gmap.c: Using new interface to stage 1. Removed maponly output.
* Makefile.gmaponly.am, Makefile.pmaptoo.am, stage1hr.c, stage1hr.h: Moved
high-resolution stage 1 algorithm to a different file
* stage1.c, stage1.h: Eliminated diagnostic fields. Made interface for
low-resolution version compatible with high-resolution version.
2007-11-02 twu
* stage3.c, stage3.h: Removed matchpairend and Stage3_direct procedure
* stage1.c, stage1.h: Reverting back to 2007-09-28 version
* pmapindex.c: Changed order of arguments in a function call
* params.c, params.h: Added slots for truncstep and chromosomal transitions.
* list.c, list.h: Added functions List_insert and List_reinsert.
* indexdb.c, indexdb.h: Added function Indexdb_shiftedp. For
high-resolution indexdbs, added code to merge batches using either a queue
or a heap.
* iit-read.c, iit-read.h: Added function IIT_transitions_subset.
* gmapindex.c: Added -e flag to specify high-resolution genomic indices
* chrsubset.c, chrsubset.h: Added function Chrsubset_transitions. Added
assumption to Chrsubset_includep.
* chrnum.c, chrnum.h: Added function Chrnum_print_position
* block.h: Added function Block_donep.
* block.c: Improved debugging output. Added function Block_donep.
* access.h: Added function to report if file exists.
* access.c: Improved error messages. Added function to report if file
exists.
2007-10-16 twu
* stage1.c: Refined high-resolution algorithm
2007-10-11 twu
* orderstat.c: Included appropriate header files for memcpy
2007-10-08 twu
* reader.c: Made reader go all the way to the ends of the sequence
* sequence.c: Fixed computation of trimlength
* indexdb.c, indexdb.h: Implemented read and write procedures for new
genomic index format (trading off position resolution for adjacent
nucleotide contents).
2007-10-07 twu
* stage1.c: Implemented mapping at ends
2007-10-06 twu
* stage1.c: Completed initial mapping from middle outward
* stage1.c: Added computation of best subpaths
2007-10-03 twu
* stage1.c: Implemented high-resolution mapping, and arbitrarily long
matches for the middle of the sequence outward.
2007-09-30 twu
* stage1.c: Attempt to use diagonals to find genomic position
2007-09-29 twu
* genome.c, genome.h: Added Genome_totallength function
* stage1.c, stage1.h: Added procedure to match doubles of truncated indexdb
entries
2007-09-28 twu
* gmap_setup.pl.in: In -S mode (treating each contig as a chromosome),
turning off sorting of chromosomes and contigs.
* gmapindex.c: Added -S flag to turn off sorting of chromosomes and contigs
* table.c, table.h, tableint.c, tableint.h: Added ability to return keys
sorted by timeindex
* stage1.c, trial.c, trial.h: Changes made to scan query sequence from
middle outward
* VERSION: Updated version
2007-09-27 twu
* gmap.c: Fixed bug for -f 9 and -E output when no paths were found
2007-09-26 twu
* VERSION: Updated version
* index.html: Revised features for 2007-09-26 version
* gmap_update.pl.in: Made new IIT file permissions the same as the old
permissions
* iit-read.c: Added error messages to various conditions in IIT_read
2007-09-25 twu
* sequence.c: Fixed reading of sequences with multiple PC line feeds
2007-09-20 twu
* stage1.c: Kept code that depended on USE_MATCHPOOL and removed alternate
(old) code
* block.c, block.h: Put save variables inside Block_T object
2007-09-19 twu
* VERSION: Updated version number
* MAINTAINER: Added reminder to do cvs tag
* iit-read.c, iit-read.h, iit-write.c: Moved compute_flanking procedure from
iit-read.c to iit-write.c
* configure.ac, Makefile.am, gmap_update.pl.in: Added gmap_update program
* Makefile.am: Added compile instructions for iit_update
* iit-write.c: Made stringlen of type off_t (to handle annotations of length
greater than can be handled by int). Added check to make sure stringlen
is non-zero.
* iit-read.c: Made stringlen of type off_t (to handle annotations of length
greater than can be handled by int)
* archive.html, index.html: Made changes for 2007-09-20 release
2007-09-18 twu
* Makefile.gmaponly.am, Makefile.pmaptoo.am, iit-read.c, iit-read.h,
iit-write.c, iit-write.h, iit_update.c: Implemented iit_update program
* iit_store.c: Added -v flag to specify desired version
2007-09-17 twu
* oligoindex.c, stage2.c: Changed R output for diagonal graphics
2007-09-12 twu
* gmap.c: Added a check to make sure we don't push NULL for Stage3_T object.
* dynprog.c: Fixed bug in Dynprog_dual_break; need to compute matrix scores
only to the minimum of length1 and length2.
2007-09-11 twu
* dynprog.c: Fixed problem where Dynprog_dual_break was exiting
unnecessarily; need to be concerned only about shorter distance.
* stage3.c: Added cDNA direction to debugging statements
* iit_get.c, stage3.c: Added sign argument for getting flanking entries
* pair.c: Added provision in PMAP to limit coverage to 100% (could exceed
previously because of implicit stop codon added at end of query sequence).
* iit-read.c, iit-read.h: Added a sign argument for getting flanking entries
* get-genome.c: Added flags for accessing from map files entries of a
particular direction or tag
* stage1.c: Performing filtering based on clustersize only if too many
entries and at least one cluster is large.
2007-09-04 twu
* stage1.c: Removed filtering based on too many matching pairs
2007-08-30 twu
* gbuffer.c, gbuffer.h, gmap.c: Removed unused code and parameters from
Gbuffer_T
* gmap.c: Allocating memory for genomicseg only as needed
2007-08-28 twu
* blackboard.c, blackboard.h, gmap.c, sequence.c, sequence.h: Added ability
to read input from multiple sequence files
* oligoindex.h: Changed calls to reset oligoindex
* oligoindex.c: Fixed hang that resulted when no oligomer positions were
found. Eliminated an extra call to Oligoindex_set_inquery.
* stage2.c: Changed call to Oligoindex to reset after tally
* stage1.c: Modified debugging output
* match.c, mem.c: Enhanced debugging output
* stage2.c, stage2.h: Returned to previous algorithm for finding shifted
canonical dinucleotides, but now allocating memory dynamically.
* gbuffer.c, gbuffer.h: Removed pre-allocated memory for finding shifted
dinucleotides
* stage2.c: Attempt to conserve memory used in finding shifted canonical
dinucleotides. However, results in speed penalty.
2007-08-26 twu
* gbuffer.c, gbuffer.h: Removed unused matchscores variable and unnecessary
memory allocation.
2007-08-23 twu
* gmap_uncompress.pl.in: Added a missing space in the output
* gmap_uncompress.pl.in: Added coordinates output (with flag '-f 9')
2007-08-22 twu
* pair.c: Fixed potential divide-by-zero bug
2007-08-20 twu
* gmap_setup.pl.in: Added a .SUFFIXES: command at top to prevent unexpected
behaviors
2007-08-18 twu
* stage2.c: Added step to recover when all scores at a querypos are
negative, by continuing from grand result.
2007-08-16 twu
* pair.c, pair.h, stage2.c, stage2.h, stage3.c: Computing defect rate in
middle of stage 3, instead of in stage 2
2007-08-15 twu
* oligoindex.c: Restored amino acid alphabet to 20 from 18.
* oligoindex.c: Fixed typo in variable name
* stage2.c: Inactivated limit on number of active hits
* Makefile.gmaponly.am, Makefile.pmaptoo.am: Added orderstat.c and
orderstat.h to code
* orderstat.c, orderstat.h: Modified procedures to compute order statistics
in place and for both doubles and ints.
* oligoindex.c: Computing overabundance based on upper percentile of
non-zero counts
* doublelist.h: Added Id info to header
* gmap.c: Fixed memory leak when user segment is provided
* orderstat.c, orderstat.h: Added orderstat to CVS
* oligoindex.c, oligoindex.h: Trial to eliminate limit on maxoligohits
* stage2.c: Improved output for graphical debugging
* oligoindex.c: Improved debugging output
2007-08-13 twu
* stage2.c: Fixed problem with debugging output
* stage1.c: Added pruning by path sizes
* matchpair.c, matchpair.h: Added a procedure for finding path size of a
given matchpair
2007-07-16 twu
* Makefile.gmaponly.am, Makefile.pmaptoo.am: Created two specialized
Makefile.am files
* Makefile.am: Preparing for iit and genome libraries
* bootstrap, bootstrap.gmaponly, bootstrap.pmaptoo: Created separate
bootstrap routines for gmap and gmap-plus-pmap
* VERSION: Updated version number
* iit-read.c: Computing alphas and betas for iit_dump
* iit-read.c: Computing alphas and betas only when needed for flanking
* gmapindex.c, iit-read.c, iit-read.h, iit-write.c, iit-write.h, iit_get.c,
iit_store.c, iitdef.h: Added fields for annotation in IITs
* indexdb.c: Added monitoring information
2007-06-25 twu
* genome-write.c, segmentpos.c: For version 2 IITs and later, getting sign
directly from IIT, rather than from annotation.
* get-genome.c, plotgenes.c: Providing sortp parameter to IIT_get
* iit_store.c: Added -v flag to print IIT version
* iit_get.c: Added -U flag to indicate unsigned results
* genome.c: Added comment
* gmapindex.c: No longer writing segment length in contig IITs
* iit-write.c: Writing alphas and betas for correct calculation of flanking
intervals.
* iit-read.c, iit-read.h: Using alphas and betas for correct calculation of
flanking intervals. Added functions IIT_types, IIT_get_all, and
IIT_get_all_typed.
* interval.h: Added interface for Interval_sign
* iitdef.h: Added space for alphas and betas, needed for correct calculation
of flanking intervals
2007-06-22 twu
* dynprog.h: Defined UNKNOWNJUMP to be used for temporary gapholders during
stage 3 calculations.
* dynprog.c: Returning NULL on all failures, without gapholders (which are
now inserted by calling procedures in stage 3). Allowing 5' and 3'
extensions to work to maxlength allowed.
2007-06-21 twu
* iitdef.h: Added version to IIT_T
* interval.c: Storing sign for each interval
* iit-read.c, iit-read.h, iit-write.c, iit-write.h: Introduced version 2
format, which stores sign for each interval. Not using annotation anymore
to represent sign. Added function IIT_find_multiple.
2007-06-20 twu
* pairpool.c: Improved debugging statements
* gdiag.c, get-genome.c, gmap.c, iit-read.c, iit-read.h, iit_get.c,
interval.c, interval.h, plotgenes.c, segmentpos.c, stage3.c: Added ability
to sort intervals by coordinates in IIT_get routines
* mem.c: In comments, showing how TRAP should be defined
* stage3.c: Whenever dynprog procedure returns NULL, make sure to put back
peeled pairs and insert a gapholder. Fixes a bug in BQ672778 against
hg18. Jumps in gapholders now calculated only in certain procedures.
2007-06-06 twu
* stage3.c: Removed abort commands on peels that run into gaps
* dynprog.c: Expanded on comment
2007-06-04 twu
* VERSION, index.html: Updated version
* dynprog.c: Removed insertion of gapholder for a single gap that is too
long to solve.
2007-06-02 twu
* stage1.c: Added debugging statement to signal end of stage 1
* dynprog.c: Eliminated allocation of temporary Dynprog_T objects
* pair.c: Now printing query_skip in -A and -S output
2007-05-29 twu
* dynprog.c: Lowered gap penalties for single gaps. Fixed bug in solving
dynamic programming for lower-case input sequences.
* stage3.c: Added procedures for cleaning non-matches at ends of alignment,
which are always called.
* stage3.c: Removing all nonmatches at 5' and 3' ends
2007-05-25 twu
* VERSION: Updated version
* index.html, archive.html: Made changes to reflect 2007-05-25 version
* pair.c, pair.h, stage3.c: Added coverage and identity information to GFF3
output
2007-05-24 twu
* configure.ac: Checking both fixed and variable mapping for mmap
* fa_coords.pl.in: Fixed problem in parsing lines containing the word
"chromosome"
* stage3.c: Fixed bug in solving dual breaks
* pair.c: Fixed PSL output so query and target gaps are computed directly
from the block starts and lengths.
* gmap.c: Added flag -j for showing dual breaks
* dynprog.c: Fixed bug where solution of dual break exceeded minimum gap
2007-05-15 twu
* stage3.h: Added do_final_p parameter to Stage3_compute
* stage3.c: Incorporating procedure to trim bad middle exons
* stage1.c: Performing salvage if total number of matches is relatively low
* smooth.c: Added to exon length in smoothing for short exons
* pair.c: Added test code to print information about extra exons
* intron.c: Removing call to abort.
* dynprog.c: Added code to bridge dual break with rewards for canonical
introns. Tweaked some parameters, including less penalty for gap
extensions.
2007-04-25 twu
* mem.c: Added error messages for memory allocation problems
* gmap.c: Added -X flag for heavily favoring canonical and semi-canonical
introns
2007-04-23 twu
* gmap.c: Made strict translation the default again
* indexdb.c: Added "U" to integers in bit operations
* genome.c: Using two arrays instead of one for translate.
* compress.c: Using two arrays instead of one for translate. Added various
abort checks.
* translation.c: Removed unnecessary checks of extraexonp
* pair.c: Fixed Pair_dump_one so it handles extraexonp flag
* pairpool.c: Fixed bug in assigning extraexonp
* pair.c: Changed add_intronlengths slightly
* acx_mmap_variable.m4: Moved AC_DEFINE out of macro, to be called
explicitly in configure.ac
* acx_mmap_fixed.m4: Added macro for testing mmap with MAP_FIXED
* gmap.c: Made frameshift-tolerant translation the default
* pair.c: Changed output of CDS in gff3 mode to produce an in-frame protein
sequence
* translation.h, stage3.h: Added strictp for PMAP.
* translation.c: Added strictp for PMAP. Handling extraexonp items in
alignment.
* stage3.c: Giving extraexonp information to pairpool procedures for
gapalign items
* pair.h: Added procedure for computing fractional error.
* pair.c: Added provisions for handling extraexonp flag. Added procedure
for computing fractional error.
* pairpool.c, pairpool.h: Added provisions for handling extraexonp flag
* pairdef.h: Added flag for extra cDNA exon
* comp.h: Added comp for extra cDNA exon
* stage3.c: Added procedures for trimming internal exons with poor matches,
and for finding extra exons in a dual break. Added strictp for PMAP.
* dynprog.c, dynprog.h: Added procedures for finding extra exons in a dual
break
* compress.c: Provided more informative error message
* oligoindex.c: Handled an arithmetic error caused by divide by zero
* gmap.c: Added flag -H for handling trimming middle exons and reporting of
dual breaks. Removed flag -j. Allowed strictp to be used in PMAP.
2007-04-16 twu
* align.test.ok, map.test.ok: Made test output match current output
* acx_mmap_variable.m4, configure.ac: Added check for mmap using
MAP_VARIABLE (because AIX fails on MAP_FIXED)
* gmap.c: Fixed global variables not used in PMAP
* stage2.c: Fixed check for negative scores to work for PMAP
* stage1.c: Placing bound on number of samples taken over query sequence
* stage2.c: For very long sequences, pruning based on clear coverage
* oligoindex.c, oligoindex.h: Added computation of clear coverage (percent
of query sequence covered by relatively few diagonals).
* matchpair.c: Added debugging statements
2007-04-06 twu
* stage2.c: Added restriction on distance for grand lookback. Added check
on nactive for scoring at querypos.
* stage2.c: Added nactive as a filter in stage 2
2007-04-02 twu
* stage2.c: Added comment about a way to restrict debugging output
* iit-read.c: Added start/end information to endpoints in dumping counts
2007-03-22 yunli
* modules: *** empty log message ***
2007-03-21 yunli
* modules: *** empty log message ***
2007-03-08 twu
* stage2.c: Quitting if totalpositions is zero
* gmap.c: Enabled table output (-f 9) for relative alignment mode (-w).
2007-03-03 twu
* stage2.c: Fixed bug which led to the partial fill problem in filling
oligomers
2007-03-02 twu
* Makefile.am: Added pthread_libs for iit utilities
2007-03-01 twu
* gdiag.c: Added printing of centromere regions. Printing marginals more
efficiently.
* Makefile.am: Added chrnum and chrsubset files to gdiag
* iit-read.c, iit-read.h: Made IIT_transitions function return signs
2007-02-28 twu
* gdiag.c: Added flag for ignoring main diagonal. Fixed problem with
printing revcomp diagonals. Added code for computing different types of
patterns.
2007-02-21 twu
* gdiag.c: Added hooks for user to enter chromosomal subsets
* complement.h: Added a character code string that doesn't convert to
uppercase, for gdiag.
2007-02-20 twu
* gdiag.c: Added ability to use map iit files. Skipping over masked regions
in determining lookback.
* gdiag.c: Fixed bug in retrieving last part of sequence from gmapdb.
Providing flag to ignore lowercase (e.g., masked) characters in query.
* gdiag.c: Added ability for user to provide genomic segment
* Makefile.am: Added iit-write source files for gdiag
* gdiag.c: Fixed bug where genomestart could be less than genomeend in a
diagonal. Made separate procedures for updating forward and revcomp
diagonals.
* indexdb.c, indexdb.h: Added procedure IIT_read_inplace for gdiag.
* iit-write.h: Added procedure IIT_new to allow creation and use of iit in
the same program.
* iit-write.c: Added procedure IIT_new to allow creation and use of iit in
the same program. Simplified code for Node_fwrite.
* iit-read.c, iit-read.h: Added procedure IIT_transitions
2007-02-18 twu
* gdiag.c: Added a ring structure to increase speed
* genome.c, genome.h: Added procedures for gdiag
* gdiag.c: Made speed improvements by not storing full 24-mers, but rather
storing results of previous 12-mers
* gdiag.c: Added calculation of diagonals, ability to read query from
gmapdb, and storage of intervals.
2007-02-16 twu
* Makefile.am, gdiag.c: Added program gdiag
2007-02-12 twu
* gmap.c: Increased parameter for maxoligohits
* stage3.c: Lowered parameter for intronlen
* stage2.h: Removed unused function
* stage2.c: Changed distance penalty to consider both gendistance (now
linearly, instead of logarithmically) and querydistance (quadratically).
Using both maxnconsecutive and pct_coverage to decide whether to continue
with stage 2.
* stage1.c: Reduced parameter for number of trials
* pair.c: Fixed calculation of coverage for PMAP
* oligoindex.h: Moved parameters here
* oligoindex.c: Implemented algorithm for PMAP. Allowing a diagonal to
dominate only if it is completely consecutive.
2007-02-09 twu
* iit-read.c, iit-read.h: Added function to dump labels
* gmap.c: Fixed bugs with map iit files: bad test for distinguishing between
universal map files and chromosomal map files, and incorrectly checking
map tags against chromosomal iit.
2007-02-08 twu
* oligoindex.c, oligoindex.h: Added computation of percent coverage by
diagonals
* access.c: Added a debugging statement
* gmap.c: Fixed floating point error when trimoligos is zero
* gmap.c, oligoindex.c, oligoindex.h, stage2.c, stage2.h: Added graphical
debugging output for stage 2
2007-02-07 twu
* smooth.c: Distinguishing use of genomejump and queryjump lengths in
pre-single-gap smoothing versus post-single-gap smoothing.
* gmap.c: Fixed floating exception when sequence has no oligos
* stage2.c: Initializing guide in 5' trim region, until first hits are found.
* gmap.c: Changed default pruning behavior to be no pruning.
* oligoindex.c: Made speed improvements in scanning diagonals. Removed old
code for computing maxconsecutive.
* oligoindex.h, stage2.c: Changed name of variable
* oligoindex.c: Eliminated convex hull algorithm and implemented method
based on ordering of diagonals.
* diag.c, diag.h: Removed unused procedures
2007-02-06 twu
* oligoindex.c: Implemented a convex hull algorithm to determine minactive
and maxactive bounds.
* diag.c, diag.h: Added a procedure to sort diagonals based on closeness to
origin
* oligoindex.c, oligoindex.h, stage2.c: Restored computation of
maxgoodconsecutive to filter out bad stage1 candidates
* stage2.c: Allowing fill of nucleotides to occur even when
querypos/lastquerypos or genomepos/lastgenomepos are too close.
* oligoindex.c: Added new procedure for determining dominance among diagonals
* diag.c, diag.h: Added procedure for sorting by nconsecutive
* stage2.c: Using minactive and maxactive to bound current querypos, and
active to determine available hits for previous querypos.
* oligoindex.c, oligoindex.h: Computing diagonals inside
Oligoindex_get_mappings procedure. Implemented simple dominance procedure.
* diag.c, diag.h, diagdef.h, diagpool.c, diagpool.h: Added fields for
nconsecutive and dominancep
2007-02-05 twu
* oligoindex.c: Changes in parameters
* stage2.c: Changes to debugging output
* oligoindex.c: Speed improvements by inlining calls to Intlist accessors
* stage2.h: Removed unused function
* oligoindex.h, stage2.c: Using length as a criterion instead of
nconsecutive for proceeding to dynamic programming
* oligoindex.c: Reduced requirement for nconsecutive in scanning diagonals.
Keeping cumulative track of highest and lowest diagonals. Added
extra_bounds to diagonal bounds.
* Makefile.am, diag.c, diag.h, diagdef.h, diagpool.c, diagpool.h, gmap.c:
Adding Diag_T and Diagpool_T objects for scanning diagonals
* oligoindex.c, oligoindex.h, stage2.c, stage2.h: Scanning diagonals to set
bounds on active oligomers
2007-02-04 twu
* Makefile.am, gmap.c, intpool.c, intpool.h, oligoindex.c, oligoindex.h,
stage2.c, stage2.h: Added Intpool_T object to manage storage for Intlist_T
objects
* gmap.c: Using new interface to Stage3_compute
* stage2.c: In dynamic programming, added a lookback to the grand best
querypos and hit
* intlist.c, intlistdef.h: Provided exposure to internal structure of
Intlist_T
2007-02-03 twu
* oligoindex.c, oligoindex.h, stage2.c: Implemented faster algorithm for
identifying active stage 2 oligomers.
* stage2.h: Removed unused procedures
* oligoindex.c, oligoindex.h, stage2.c: Implemented procedures to skip over
unused mappings, based on active
2007-02-02 twu
* gmap.c, oligoindex.c, oligoindex.h, stage2.c, stage2.h: Implemented new
stage 2 procedure. Now using oligoindex at minindexsize and filtering
those hits according to a local search. Initial search is based on
maxindexsize. Uncovered ends of the alignment receive a looser local
search criterion. Increased stage 2 lookback from 60 to 100.
* translation.c: Fixed bug where first cDNA amino acid appeared under a cDNA
space.
* stage1.c: Adding one more trial for long sequences
* pair.h, stage3.h: Computing coverage and now trimmed coverage at print
time.
* pair.c: Computing coverage and now trimmed coverage at print time. Added
output line for trimmed coverage. Added trim information in compressed
(-Z) output. Adjusting output of trim boundaries based on alignment.
* stage3.c: Computing coverage and now trimmed coverage at print time.
Increased parameter for minintronlength.
* gmap.c: Removed -X (cross-species) flag
2007-01-31 twu
* oligoindex.c: Added code for 18-amino-acid alphabet
* gmap.c: Made user_stage1p false
* stage2.c: Trying to make penalties consistent across different cases
* stage3.c: Initialized variables in Stage3_T object
* pair.c: Fixed memory leak in gff exon mode (-f 2)
2007-01-07 twu
* stage3.c: If both single and dual gap solutions are canonical, picking
solution with best score.
2007-01-06 twu
* dynprog.c: Made reward for final canonical intron uniform across defect
rates. Boosted reward for final semicanonical intron to match that for
canonical intron.
2007-01-05 twu
* pair.c: Fixed dinucleotide output in compressed (-Z) format when
user-provided genomic segment has lower-case characters.
2007-01-04 twu
* stage2.c: Reduced value of EQUAL_DISTANCE, to favor better local alignment
over longer global alignment
* stage3.c: Counting exons only after gaps filled in
* pair.c, pair.h: Added procedure for counting exons after gaps filled in
2007-01-03 twu
* dynprog.c: Made one-sided gap behavior true only for single gaps and end
gaps
* stage3.c: Removed fix_pmap_holes function and all references to it
2006-12-18 twu
* index.html, VERSION: Updated version
* translation.c: Prevented assignment of incomplete last codon on cDNA side
in strict mode
* stage1.c: Removed unused variables
* matchpair.c: Increased EXTRA_SHORTEND
* gmap.c: Reduced default trimexonpct. Changed bandwidths for single and
gap gaps.
* dynprog.c: Added onesidegap behavior, which allows gaps on either genomic
or cDNA side, but not both. Added concept of fixeddestp, which is not
true for the ends.
2006-12-15 twu
* VERSION: Updated version
* index.html: Made changes to reflect new version and strict translation as
default
* gmap.c: Reduced extraband_end
* dynprog.c, gmap.c: Reduced extraband_single to prevent gaps from being
inserted on both sides
* gmap.c, pair.c, pair.h, sequence.c, sequence.h, stage3.c: For PMAP, adding
an implicit stop codon at end of sequence if not already present, and
distinguishing between computational fulllength and given fulllength.
* smooth.c: Changed probability threshold for identifying short exons
2006-12-14 twu
* gmap.c: Made strict translation the default, and tolerant translation
turned on by -Y flag
* stage3.c: Providing pound signs in dual breaks in diagnostic output.
Replacing backtranslation characters with 'N' in PMAP output. Removed
microexon search from PMAP. Using single gap procedure instead of
fix_pmap_holes procedure for PMAP.
* pair.c: Counting ambiguous characters as matches in all instances of
computing percent identity
* indexdb.h: Added variables for 5-aa mers
* dynprog.c: Limiting bandwidth in single gap alignment to be dependent on
differences in segment lengths
* backtranslation.c: Not performing backtranslation if any genomic codon
position is blank.
* md5.t.c: Removed unused file
2006-12-13 twu
* dynprog.c: Reduced width of band in single gaps when lengths are equal
* translation.c: Fixed strict translation mode, so it begins as same
location as genomic translation.
* stage3.c: Removed step of merging adjacent dynamic programming. Using two
different smoothing steps. Protected small introns from being solved as
single gaps in final intron pass.
* smooth.c, smooth.h: Created two separate smoothing procedures, one based
on net gap, and one based on size.
* dynprog.c: For cDNA gaps, inserting indel pairs only if both gaps are small
2006-12-12 twu
* map.test.ok: Added blank line at end
* VERSION: Updated version
* MAINTAINER: Added reminder to check cvs log to make sure files are all up
to date
* stage3.c: Removing gaps at 5' and 3' ends after end extensions. Checking
for division by zero in trim_bad_exons.
* Makefile.am: Simplified list of source files
2006-12-08 twu
* archive.html: Updated to reflect 2006-12-08 version
* archive.html, Makefile.am, align.test.in, align.test.ok, coords1.test.in,
coords1.test.ok, iit_dump.test.in, iit_get.test.in, iit_store.test.in,
map.test.in, map.test.ok, setup1.test.in, setup2.test.in: Merged into trunk
* defs: Initial import into CVS
* VERSION, config.site.gne, share, index.html, pmap_setup.pl.in: Merging
into trunk
* MAINTAINER: Merging into main trunk
* iit.test.in: Combined iit_store, iit_get and iit_dump tests into one script
* stage1.c: Increased definition of short sequence (for allowing cluster
mode) for PMAP
* match.c: Fixed printing of sequences in debugging statements
2006-12-05 twu
* stage3.c: Fixed miscount problem with filling in short introns. Increased
MININTRONLEN_FINAL significantly.
2006-12-01 twu
* stage1.c: Printing chromosome name in debugging statements
* match.c, match.h: Added procedure Match_chr
2006-11-30 twu
* stage3.c: Allowing and correcting for gaps after gaps
* smooth.c: Using difference between genomejump and queryjump to define
introns for the purposes of smoothing.
* configure.ac: Added check for sigaction
* README: Updated README file
* md_coords.pl.in: Fixed behavior when user wants only the reference strain
* gmap_setup.pl.in: Changed name from raw to fullascii. Changed default for
PMAP from 7 to 6.
* gmap_process.pl.in: Added check to see that all contigs are processed
* stage3.c: Assigning gap pairs after final extensions of 5' and 3' ends
* smooth.c: Removed include of unused header file
* genome-write.c, gmapindex.c, indexdb.c, indexdb.h, pmapindex.c: Added
genome name to monitoring statements
* gmap.c: Stopped warning message for -B when flag was not provided
2006-11-28 twu
* stage1.c: Revised heuristics for determining maxtotallen and lengths for
extensions
* gmap.c: Ignoring batch flag if only a single sequence is given
2006-11-27 twu
* stage1.c: Removed unused code
* matchpair.c: Adding extra extension length when continuousp is false
* gmap.c: Revised default lengths for single intron length and total genomic
length
* dynprog.c: Added checks for genomic segment at ends being shorter than
query segment
* indexdb.h: Using 6-mers with full alphabet in PMAP
* indexdb.c: Improved monitoring statements
* matchpair.c, matchpair.h: Revised procedures for computing support and
extensions. Integrated procedures for filtering of unique and duplicate
matchpairs.
* oligoindex.c: Returned to 20 amino acids in stage 2
* params.c, params.h, stage1.h: Removed unused variable
* stage1.c: Integrated matchpairs into a single list. Revised procedures
for extending genomic region based on 12-mers.
* stage3.c: Allowed arbitrarily long incursion into previous dynprog during
peelback.
* stage2.c: Separated fwd_consecutive and rev_consecutive. Made values
consistent regardless of indexsize.
2006-11-21 twu
* stage1.c: Fixed extensions for PMAP
* indexdb.c: Reversed previous changes to try to make idxpositions file
point to end of oligomer for reverse strand matches.
* indexdb.c: Made idxpositions file point to end of oligomer for reverse
strand matches. Improved debugging output.
* stage1.c: Added a binary search routine
2006-11-20 twu
* indexdb.h, stage1.c: Made changes for PMAP to work with 6-mer pmapdb
* oligoindex.c: Fixed debugging statements for PMAP
* pair.c: Revised psl protein output for matches to the negative genome
strand
* backtranslation.c, backtranslation.h: Made an extern procedure for
computing consistent codon for a given amino acid.
* translation.c, translation.h: Made get_codon an extern procedure
* stage3.c: Added procedure for fixing alignment holes in PMAP. Applying
higher standard for accepting dual intron solutions.
* stage2.c: Fixing bugs in identifying stage 2 candidates to abort
* gmap.c: Setting trim variables appropriately in maponly mode
* dynprog.c: In PMAP, rounding up or down to finish codon
2006-11-16 twu
* stage3.c: Using intron types to evaluate bad exons at ends. Adding
another round of extensions at ends after trimming of bad exons. Restored
correction for genomepos at left end skip when filling in introns.
* dynprog.c: Assigning intron type for microexons added at ends of alignment
* gbuffer.c, gbuffer.h: Removed unused variables
* stage2.c: Removed unused variables. Using correct value for
maxconsecutive instead of last one.
2006-11-15 twu
* stage3.c: Using uppercase string, with U-to-T conversion, to identify
mismatches in peelback procedures.
* backtranslation.c, translation.c: Using uppercase string, with U-to-T
conversion, instead of toupper().
* sequence.c: Using new complement and uppercase strings
* pair.c: Using new name for (lowercase) complement string. Including 'U'
and 'u' as known bases for computing percent identity.
* indexdb.c: Using uppercase string, which also performs U-to-T conversion,
instead of toupper().
* genome-write.c, genome.c: Using new name for (lowercase) complement string.
* compress.c: Using uppercase string instead of toupper.
Compress_get_char() no longer converts characters to uppercase.
* dynprog.c: Made U and T a matching pair. Commented out old code dealing
with lowercase characters.
* complement.h: Added strings for uppercase of complement, and for U-to-T
conversion during uppercase
* sequence.c: Enabled removal of spaces in read procedure
2006-11-14 twu
* dynprog.c: Reduced extension penalties for single gaps
* stage3.c: Fixed bug in filling in gaps where leftpair has a genome gap.
Increased size of MININTRONLEN to avoid finding introns in single gap
regions.
2006-11-12 twu
* stage3.h: Added parameter for number of flanking sequences to
Stage3_print_map
* stage3.c: In 5' and 3' extensions, evaluating continuations before and
after a gap if one is found during peeling, and performing microexon
search medial to the gap
* dynprog.c: Returning null in genome gap if queryjump <= 1
* get-genome.c: Added -u flag for printing flanking intervals
* iit-read.c, iit-read.h: Added option to print iit entries in reverse order
* indexdb.h: Restored previous parameters
* pair.c: Added pointer to pair in debugging output
2006-11-11 twu
* stage3.c: Fixed computation of bad end exons. Included short end exons in
definition of bad end exons. Finding bad end exons after 5' and 3'
extensions. Fixed declaration of sense/antisense when no canonical or
semicanonical introns are present. Removing end introns during peelback
before 5' and 3' extensions. Removed unused code for trimming alignment
at ends.
2006-11-06 twu
* gmap.c: Added flags for printing flanking IIT hits and for trimming end
exons
* stage3.c: Fixed bug in trimming empty alignment
* smooth.c: Fixed bug in handling lower-case query sequences
2006-11-01 twu
* translation.c: Fixed bug in strict translation
* iit-read.h: Added procedures for finding flanking hits.
* iit-read.c: Made IIT_get more efficient. Added procedures for finding
flanking hits.
* iit_get.c: Added -u flag for printing flanking hits
* stage3.c, stage3.h: Allowing trimming of bad exons at ends. Increased
peelback at ends. Added iterative cycles of intron finding within
smoothing and dual intron cycles.
* smooth.c: Relaxing requirements for short exons at ends, because of later
trimming of poor exons at ends
* pair.c: Adding printing of intron type for debugging
* gmap.c: Stopping deletion of global_except_key, because worker threads may
still need it. Increasing standards for defining a sequence to be
repetitive. Eliminating -U flag for trimming alignments, and adding -k
flag for specifying trimming of exons at ends.
* except.c: Stopping deletion of global_except_key, because worker threads
may still need it
* blackboard.c: Letting each thread destroy its own reqpost
2006-10-31 twu
* gmap.c: Added -Y flag for performing strict translation of cDNA sequence.
Removed worker_assignments variable, and using global blackboard variable
instead to handle exceptions.
* stage3.c, stage3.h, translation.c, translation.h: Added strictp flag for
protein translation
* oligoindex.c: Dropped oligospace requirements for PMAP by reducing amino
acid alphabet in stage 2 from 20 to 16.
* gmapindex.c, indexdb.c, pmapindex.c: Fixed memory allocation for filename
* except.c: Fixed location of compiler directive
* blackboard.c: Put mutex locks outside of updates to input counter and
output counter. This is to be cautious, since only input thread and
output thread, respectively, should be affecting these counters.
2006-10-24 twu
* stage3.c: Fixed undefine_nucleotides to handle gapholders
* oligoindex.c: Using calloc instead of malloc for initializing oligoindex
space
* gmap.c: Reduced indexsizes in PMAP, so they won't overflow in some machines
* backtranslation.c: Fixed usage of translation_start and translation_end
2006-10-20 twu
* gmap.c: Printing messages to stderr when no paths are found, in all cases
where sequence headers are not printed.
* translation.c: Fixed coordinates for translation start and end
* stage3.c: Fixed bug with NULL path passed to undefine_nucleotides
* pair.c: Changed gff3 procedures to treat translation start and end values
as query positions, not alignment indices.
2006-10-16 twu
* stage3.c: Making sure that gaps are inserted after smoothing procedure
deletes exons
* stage2.c: Clarified differences between amino acid indexsize and
nucleotide indexsize. Cleaned up code for filling in oligomers.
* smooth.c: Reduced definition of a gap between exons
* oligop.c: Included possibility of 12-amino acid alphabet for 8-mers.
* indexdb.h: Included possibility of 12-amino acid alphabet for 8-mers.
Provided compile-time values for file suffixes.
* indexdb.c: Included possibility of 12-amino acid alphabet for 8-mers
* pmapindex.c: Performing complete build with a single command
* gmapindex.c: Using compiler-time value for suffix
* gmap.c: Printing value of INDEX1PART in help output for PMAP
2006-10-13 twu
* oligoindex.c: Added debugging statements
* translation.c: Fixed bug with translating cDNA beyond the genomic stop
* stage3.c: Reorganized passes through the alignment. Made peelback
routines more robust.
* smooth.h: Using stage 2 indexsize in smoothing procedures
* smooth.c: Major rewrite of smoothing procedures
* dynprog.c: Added another mechanism to prevent microexon from having a gap
at either end
2006-10-12 twu
* gmap.c: Allowing "-t 0" to mean non-threaded behavior. Using new
thread-safe exception handler.
* dynprog.c, dynprog.h: Fixed traceback for cDNA gaps
* except.c, except.h: Re-implemented thread-safe exception handler to remove
memory leaks. Now using exception frames in stack rather than in heap.
* stage3.c: Fixed peelback to codon boundaries for PMAP. Relaxed forcep
requirement for single gaps. Recognizing cases where prior genome or cDNA
gap solution was obtained.
* stage3.h: Removed ngap from parameter lists when possible
* stage1.c: Initialized a diagnostic variable
* pair.c, pair.h: Removed unused code and variables
2006-10-11 twu
* except.c, except.h: Implemented thread-safe version of exception handler
* gmap.c: Added -j flag to control printing of dual breaks
* except.c, except.h: Reformatted exception handling code. Using pointers
to frames.
2006-10-10 twu
* stage3.c, stage3.h: Rewrote peelback routines. More accurate handling of
coordinates and checking of coordinates and gaps.
* dynprog.c: Advancing query and genome coordinates in cases of skips
* smooth.c: Revised trimming at ends to use individual exons, rather than
the sum of exon and intron lengths
* pairpool.c: Showing pointer to pair in debugging statements
* pair.c: Showing queryjump and genomejump in debugging statements
* gmap.c: Added -0 flag to inactivate exception handler, and -7 and -8 flags
to show results of stage 2 and smoothing, respectively.
* except.c, except.h: Added mechanism to inactivate exception handler
2006-10-09 twu
* access.c: Fixed compiler warning about reference to void *.
* block.c, chimera.c, oligo.c, sequence.c, stage1.c: Removed unused variables
* compress.c, dynprog.c, genome-write.c, intron.c: Added necessary header
file
* genome.c: Fixed compiler warning about mismatched variable types.
* gmap.c: Added flag for pruning level. Inactivated conversion of signals
to exceptions with diagnostic flag. Removed references to badoligos.
* indexdb.c: Added necessary header file. Fixed compiler warning about
mismatched variable types.
* matchpool.h: Added declarations of external functions
* oligoindex.c, oligoindex.h: Computing estimate of maxconsecutive when
mappings are obtained
* pair.c, pair.h: Added diagnostic information about stage 2 maxconsecutive.
* result.c, result.h: Added diagnostic information about initial query check
* smooth.c: Handling possible gaps at ends of alignment
* stage2.c: Using maxconsecutive estimate from Oligoindex_get_mappings to
determine whether to proceed with stage 2.
* stage3.c: Added diagnostic information about stage 2 maxconsecutive.
Fixed procedure for removing adjacent dynamic programming to remove all
gaps, and then to reinsert them later.
2006-10-06 twu
* oligoindex.c, oligoindex.h: Added counting of replicate oligos
* md_coords.pl.in: Added information about number of contigs in each strain
* configure.ac: Removed obsolete tests. Fixed problem in setting share
directory. Added maintainer option.
* gmap.c: Distinguishing between poor and repetitive sequences. Providing
-p flag to control pruning behavior.
* result.h: Distinguishing between poor and repetitive sequences
* sequence.c: Set skiplength correctly on empty sequences
* gmap.c: Added -W flag to force GMAP to compute repetitive or poor sequences
* oligoindex.c: Limited definition of badoligo to consider only non-ACGT
characters, and not to consider number of hits.
* stage3.c: Fixed bug arising from gaps left at ends of alignment
* dynprog.c: Disallowing bridges of introns and cDNA insertions that lead to
coordinate errors
* gmap.c: Changed thread-based exception handling to kill all other threads
and to report all worker assignments
2006-10-05 twu
* stage3.h: Made checking of coordinates occur in diagnostic mode.
* stage3.c: Made checking of coordinates occur in diagnostic mode. Fixed
case where cDNA gap turned into a single gap after peelback.
* stage1.c: Fixed memory leak
* smooth.c: Fixed bug resulting from apparent negative exon and negative
intron lengths.
* oligoindex.c: Restored pruning of sequences with bad oligos.
* gmap.c: Added handlers to convert signals into exceptions, to indicate the
problematic sequence. Restored pruning of sequences with bad oligos.
2006-10-04 twu
* result.c, stage1.c, stage1.h: Added reporting of more diagnostic
information
* stage2.c: Fixed problems with uninitialized variable
* matchpair.c: Fixed problem with uninitialized variable
* gmap.c, pair.c, pair.h, result.c, result.h: Printing diagnostic
information upon request
* access.c: Using a Stopwatch_T object
* stage1.c, stage1.h, stage2.c, stage2.h, stage3.c, stage3.h: Storing
diagnostic information
* smooth.c: Fixed memory leak
* oligoindex.c: Stopped initializing data buffer for Oligoindex_T object
* stopwatch.c, stopwatch.h: Created a Stopwatch_T object
2006-10-03 twu
* stage3.c: Removed unnecessary list reversal
* pair.c: Allowing jump in querypos in pair check procedure
2006-10-02 twu
* gmap.c: Provide stage 2 information in diagnostic output. Use stage 2
information to prune bad alignments before stage 3.
* stage3.c: Provide stage 2 information in diagnostic output. Allow a
single open in scoring a single intron compared with dual introns.
* stage2.h: Interface provides number of canonical and non-canonical introns
* stage2.c: Returned to using gendistance for computing penalties, except
for diffdistance in deadp. Fixed bug in tallying unknown types of introns.
* sequence.c: Fixed problems with reading control-M characters (PC line
feed) in input.
* pair.h: Reporting stage 2 information in diagnostic output.
* pair.c: Reporting stage 2 information in diagnostic output. Counting
indels in computing percent identity for each exon.
* dynprog.c: Eliminated extra reward for finding semicanonical introns in
final pass
* stage2.c: Need to take abs() when measuring diffdistance. Scoring
behavior checked against revision 1.157. Making stage2 information
available for diagnostic output.
2006-09-30 twu
* gmap.c: Added -8 flag to show results of stage 2 calculation
* boyer-moore.c: Revised procedure to handle ambiguous characters for PMAP.
* dynprog.h: Added dynprogindex information.
* dynprog.c: Added table for performing Boyer-Moore searches of microexons
for PMAP. Reduced penalties for extending gaps. Added separate rewards
for final pass of finding canonical introns. Added dynprogindex
information.
* smooth.c: Systematically checking ends for smoothing. Using matches
instead of lengths to evaluate exons. Added probabilistic checking for
marking middle exons.
* stage3.h: Passing stage2p as a parameter to Stage3_compute.
* stage3.c: Major changes to algorithm. Added iteration through smoothing,
dual intron, and single intron passes. Checking peel back to determine if
canonical intron needs to be recomputed. Added final pass to find introns
with higher reward. Added dynprogindex information. Using dynprogindex
information in peeling leftward and rightward.
* stage1.c: New criterion for setting usep to false, namely, if support is
less than a certain fraction of the maximum observed support.
* pair.h: Pair_check_array now returns a bool.
* pair.c: Handling more cases of short gaps as indels. Printing
dynprogindex in diagnostic and debugging output.
* stage2.c: Reverted to algorithm from revision 1.157. Using diffdistance
instead of gendistance. Making sufflookback depend on mapfraction.
2006-09-28 twu
* stage2.c: Changes made to scoring algorithm, but not well-motivated.
Fixed bugs in predicting cDNA direction.
2006-09-18 twu
* stage3.c: Fixed bug when recomputing over adjacent dynamic programming
regions at end of sequence
* stage2.c: Revised rules for giving credit for query distance, giving none
if difference in distance is greater than min intron length.
* stage1.c: Doubling genomic region with each iteration, until sufficient
support found for a matchpair.
* matchpair.c, matchpair.h: Computing and storing fraction of stage 1 support
* gbuffer.c, gbuffer.h, gmap.c: Allowed Genome_T object to exceed default
length of genomic segment
* dynprog.c: Reduced penalties for gap extension, to match reductions in
mismatch penalties
* Makefile.am: Provided target machine during compilation
2006-09-11 twu
* gmap.c: Included build target in version output. Increased oligomer size
in PMAP from 3-4 to 4-5.
2006-09-08 twu
* stage2.c: Added oligos to output of debugging statements
* configure.ac: Using AC_FUNC_FSEEKO to check for fseeko. Added comment
line for $Id$.
2006-09-07 twu
* stage2.c: Added debugging statements for finding shifted canonical introns
* stage1.c: Increased trimlength and extension past ends for PMAP
* gmap.c: Increased maxextension to 120000
2006-09-01 twu
* translation.c: Making sure to assign values to variables when number of
alignment pairs is fewer than the minimum
* pair.c: Fixed bug in printing CDS of GFF3 format
* stage3.c: For PMAP, trimming ends of alignment to codon boundaries
* translation.c: Removed check for minimum number of pairs for PMAP
* dynprog.c: Changed calls to Pairpool_push. Added dynprogindex
information. Reduced penalty for mismatches.
* dynprog.h: Changed calls to Pairpool_push. Added dynprogindex information.
* stage3.c: Fixed bug where peeling back yielded wrong coordinates. Changed
calls to Pairpool_push. Added dynprogindex information. Recomputing
regions with adjacent dynamic programming solutions.
* matchpair.c, stage2.c: Changed calls to Pairpool_push
* smooth.c: Added debugging statements for exon and intron lengths
* sequence.c: Fixed bug where return type should be int, not bool.
* pairpool.c, pairpool.h: Distinguished between gapholder and gapalign
elements. Added dynprogindex to Pairpool_push.
* pair.c: Added debugging option for printing dynprogindex
* pairdef.h: Added dynprogindex to struct. Reordered fields.
* bool.h: Defining bool to be an unsigned char instead of an enumerated type
2006-08-03 twu
* fa_coords.pl.in: Added pattern Chr_ seen in some TIGR genomes. Changed
variable name from chronlyp to concatenatedp.
* oligoindex.c: Added check for query lengths shorter than index size
* get-genome.c, iit_get.c: Allowed program to take coordinate requests from
stdin
* iit-read.c, iit-read.h, iit_dump.c: Added option to dump counts of each
segment
* gmap.c: Printing calling arguments in gff mode
2006-06-12 twu
* pair.c, gmap_compress.pl.in, gmap_uncompress.pl.in: Using bp to denote
query length, instead of nt
* stage3.c: Turned off gap checking
* gmap_compress.pl.in: Putting cDNA length into the Coverage field
* gmap_uncompress.pl.in: Getting cDNA length from the Coverage field
2006-05-31 twu
* params.c, params.h: Adding maxoligohits as a parameter
* oligoindex.c, oligoindex.h: Using maxoligohits parameter, and reducing it
for cross-species alignment (to avoid random and misleading matches)
* stage2.h: Using maxoligohits parameter
* stage2.c: For cross-species alignment, increasing enough_consecutive
parameter and not opportunistically increasingly sampling interval
* stage1.h: Reduced SINGLEINTRONLENGTH to 100000
* stage1.c: Using maxextension parameter instead of SINGLEINTRONLENGTH
directly
* gmap.c: Limited crossspecies parameters to maxextension and maxoligohits.
2006-05-25 twu
* stage2.c: Introduced detection of semicanonical introns and penalty for
these. Removed distpenalty_dead, and introduced distpenalty_noncanonical;
motivated by ENST0356720. Decreased distpenalty; motivated by
ENST0356222. Introduced procedure for querydist_credit, bounded below by
zero. Decreated querydist points when gendistance equals querydistance;
motivated by ENST0354988.
* gmap.c: Using single intron length as basis for maxextension
* stage3.c: Made initial pass of build_pairs_singles work only when
genomejump equals queryjump; motivated by ENST0341339. Made acceptable
mismatches for dual introns depend again on defect rate.
* smooth.c: Removed deletion of longest middle exon in a series of short
exons. Motivated by ENST0348697.
* stage1.c, stage1.h: Using single intron length to extend genomic segment
at ends. Motivated by ENST0358972.
* oligoindex.c: Increased thetadiff for trimming repetitive oligos from 2 to
20. Motivated by ENST0357282.
* gbuffer.c, gbuffer.h: Added data structures for storing positions of
semicanonical dinucleotides
* dynprog.c, dynprog.h: Made microexon p-value threshold depend on the
defect rate. Increased genomejump needed for single gap penalties to
apply. Motivated by ENST0262608.
2006-05-23 twu
* stage2.c: Moved preprocessor directive outside of macro (needed for gcc3
compiler).
2006-05-22 twu
* gmap.c, stage3.c, stage3.h: Changed variable name from extend_mismatch_p
to trimalignmentp
* changepoint.c: Changed criterion from a differences in theta to a ratio
2006-05-19 twu
* stage3.c: Removed trimming of alignments in PMAP
* stage1.c: Changed some parameters to increase sensitivity
* chrsubset.c, chrsubset.h: Added function Chrsubset_make
* translation.c: Fixed assignment of amino acids to genomic sequence in PMAP
* stage3.h: Minor formatting change
* stage3.c: Printing trimmed query coordinates in path summary. Pruning
stage 3 result of coverage is less than MINCOVERAGE.
* sequence.h: Added appropriate MAXSEQLEN for PMAP
* reader.c, reader.h: Allowing reading in each direction to proceed to the
ends of the query sequence
* pairpool.c: Setting initial value for aa_g and aa_e
* pair.c, pair.h: Printing trimmed query coordinates in path summary
* oligoindex.c: Reinstated trimming of query sequence based on changepoint
analysis
* mem.c: Fixed compiler warning about pointer arithmetic on void *.
* matchpair.c: Added comments
* gmap.c: Performing trimming of query sequence in more cases. Changed name
of "mutation reference" to "reference sequence".
* dynprog.c: Removed step function penalty based on codons. Reduced
extension penalty to obtain better behavior.
* stage2.c: Changed position for starting to compute mismatch gaps. Added
trimming at ends for PMAP.
* stage1.c: Improved calculation of genome segment length, based on expected
exon and intron sizes. In sampling mode, continuing sampling at current
position of block pointers.
* block.c, block.h: Added procedures for saving and restoring blocks
2006-05-15 twu
* Makefile.am, gbuffer.c, gbuffer.h, gmap.c, matchpair.c, matchpair.h,
pair.c, pair.h, stage2.c, stage2.h, stage3.c, stage3.h: Created Gbuffer_T
object to use as workspace for various calculations
* translation.c: Fixed uninitialized variable
* dynprog.c, pair.c: Made cDNA gaps into type SHORTGAP_COMP instead of
INDEL_COMP, so they get treated properly by the changepoint analysis
* stage3.c: Fixed memory leak
* changepoint.c: Changed changepoint parameters slightly
2006-05-14 twu
* stage3.c: Added checks to make sure both qgenome lengths are adequately
long in dual intron gaps
* dynprog.c: Increased penalties for mismatches
* gmap.c: Changed 'U' flag to mean no trimming of poor alignments at ends
2006-05-13 twu
* gmap.c: Changed interfaces to some Stage3_T functions
* pair.c, pair.h: Added query length to Coverage line
* stage1.c: Fixed bug where maxtrial wasn't set
* stage2.c: Removed final assignment of dinucleotide positions
* stage3.h: Changed some interfaces to Pair_T functions
* stage3.c: Added some shortcuts for changepoint analysis. Changed some
interfaces to Pair_T functions.
2006-05-12 twu
* gmap.c: Provided initial values to some variables
* oligoindex.c: Reduced MAXHITS parameter from 200 to 20
* stage1.c: Limiting trials for same-species alignment. Limiting salvage
algorithm to short sequences and cross-species alignment.
* stage2.c: Implemented faster method for finding shifted canonical introns
* stage2.c: Saving mappings for each indexsize, and going back to best one.
Introduced idea of sufficient and minimum map fraction, and aborting if
minimum map fraction not satisfied.
* gmap.c, stage3.c, stage3.h: Added option to print output in IIT FASTA map
format
* pair.c, pair.h: Removed parameter from Pair_print_iit_map
* pair.c, pair.h: Removed old code. Added a procedure for printing an IIT
map.
* sequence.c: Removed printing of '>' from Sequence_print_header
* iit-read.c: Fixed bug in printing results from map iit
* stage2.c: Added debugging statements
2006-05-11 twu
* gmap.c, pair.c, pair.h, stage3.c, stage3.h: Included filename of
user-provided genomic seg as source in gff3 output
* iit_dump.c: Included header for getopt_long
* README: Added more information about IIT utilities
* iit-read.c, iit-read.h: Added annotation-only option to IIT_dump
* get-genome.c: Changed program description statement slightly
* Makefile.am, iit_dump.c, iit_get.c, iit_store.c: Added long options and
documentation for the IIT utilities
* iit_store.c: Added support for quotation marks in gff3 features
* chrnum.c, chrnum.h, chrsubset.c: Using new Chrnum_to_string interface
* gmap.c, stage3.c, stage3.h: Added cDNA_match option for gff3 output
* pair.c, pair.h: Added cDNA_match option for gff3 output, including Gap
attribute. Using new Chrnum_to_string interface.
* gmap.c, params.c, params.h, stage3.c, stage3.h: Added procedures for
allowing chromosome-tagged IIT map files, in addition to strand-tagged IIT
map files.
* iit-read.c, iit-read.h: Added functions for retrieving multiple types and
for getting label when access mode is fileio.
2006-05-10 twu
* iit_get.c: Allowing user to specify multiple types
* iit_store.c: Modified gff3 parsing to assign only one tag for each row.
Using feature column as a source for labels.
* pair.c, pair.h: Added routines for output in GFF3 format
* Makefile.am: Added trimming of alignment based on changepoint analysis
* stage2.c: Fixed bug in scanning for reverse canonical intron
* backtranslation.c, pairdef.h, pairpool.c, translation.c: Introduced phase
* gmap.c: Added flags for gff3 output
* stage3.c, stage3.h: Added procedure for trimming pairs. Added gap when
bounds don't make sense for dual intron gaps. Introduced gff3 format and
phase.
* changepoint.c, changepoint.h: Initial import into CVS
* stage2.c: Added debugging statements for the result of stage 2 prior to
trimming
* pair.c, pair.h: Added a function for computing matchscores from an
alignment
* dynprog.c: Changed various gap penalties, especially at ends of sequence
* iit_get.c: Changed atol() to strtoul(), because atol() was truncating
numbers above 2^31 in machines with long ints of 4 bytes.
2006-05-08 twu
* stage2.c: Changed condition for re-computing dead links from an "or" to an
"and" on both directions. Added trimming of ends, based on consecutive
matches.
2006-05-07 twu
* stage2.c: Cleaned up procedure for finding introns in PMAP, which can be
shifted. Cleaned up counts of canonical and total introns.
* stage3.c: Fixed problems with shortcut for existing introns and with
coordinates for dual genome pairs
* oligoindex.c: Added debugging statement
2006-05-06 twu
* stage2.c: Reworking of stage 2 scoring to make it more robust for
low-identity sequences. Includes identification of possible canonical
sites by shifting boundaries.
* stage3.c, stage3.h: Using dynamic programming paths computed for dual
intron gaps
* gmap.c, stage2.c, stage2.h: Computing indexsize adaptively
* smooth.c, smooth.h: Removed indexsize from smoothing procedures
* params.c, params.h: Added minindexsize and maxindexsize to params
* oligoindex.c, oligoindex.h: Changed PMAP indexsize to be in aa.
Calculating mapfraction.
* pair.c: Keeping ambiguous character and comp in PMAP alignments
* intron.c, intron.h: Added function to return string for printing intron
type
* gmap.c, oligoindex.c, oligoindex.h: Put indexsize parameters inside of
Oligoindex_T object
2006-05-05 twu
* md_coords.pl.in: Improved handling of alternate strains
2006-05-04 twu
* iit_store.c: Implemented parsing of gff format
* gmap.c: Incremented stuttercycles for PMAP
* indexdb.c: In monitoring commands, printing positions with commas
* matchpair.c, matchpair.h, stage1.c: Using maxintronlen instead of querylen
as criterion for removing hits before clustering
* stage2.c: Checking two possible query positions for intron in PMAP
* dynprog.c, dynprog.h: Removed obsolete parameters for computing genome gaps
* stage3.c: Fixed boundaries for check of dual introns
* fa_coords.pl.in: Improved monitoring messages to indicate when coordinates
are parsed and when they are concatenated.
* md_coords.pl.in: Fixed bug in handling alternate strains
2006-05-03 twu
* md5-compute.c, oligo-count.c: Using new version of Sequence_read
2006-05-02 twu
* stage2.c: Removed statement that does not apply to PMAP
* matchpair.c: Fixed computation of support for PMAP. Added debugging
statements
* match.c: Fixed genomic segment retrieved for debugging
* iit-read.c: Minor editing changes
* datadir.c: Improved error message when genome directory isn't found
* compress.c: Removing spaces from reading of uncompressed sequence
* stage1.c: Increased matchpairs allowed. Fixed position adjustment for
reverse strand matches on PMAP.
* indexdb.c: Shifted positions for .prxpositions down by one.
2006-04-21 twu
* gmap_setup.pl.in: Fixed bugs in printing of instructions
* fa_coords.pl.in: Augmented patterns allowed for specifying chromosomal
location of contigs
* Makefile.am, chrsubset.c, chrsubset.h, get-genome.c: Added ability to
print all chromosomal subsets from get-genome
* datadir.c: Improved error message
* README: Added information about the -q flag. Added additional forms for
specifying chromosomal location of contigs.
2006-04-18 twu
* gmap_setup.pl.in: Added the -q and -Q flags for specifying indexing
intervals
2006-04-07 twu
* gmap_setup.pl.in: Fixed bugs with new install statements
* gmap_setup.pl.in: Added comment about editing .chrsubset file. Creating
genome.maps directory.
* stage3.c: Turned off CHECK
* README: Added comment about editing .chrsubset file
2006-04-06 twu
* gmap_compress.pl.in: Changed program to handle intron lengths in exon
summary
* stage2.c, stage2.h: Introduced limit on individual intron lengths
* stage1.c, stage1.h: Changed variable name from maxintronlen to maxtotallen
* gmap.c: Added separate flag for limiting individual intron lengths
* pair.c: Added intronlengths to exon summary
* sequence.h: Increased maximum sequence length to be 1000000.
2006-04-04 twu
* stage3.c: Building singles if a short exon is deleted during smoothing
* smooth.c: Improved debugging statements
2006-03-24 twu
* gmap_setup.pl.in: Printing a copy of the install procedure to a file
2006-03-20 twu
* match.c: Match_npairings returns an int
* md_coords.pl.in: Passing back maxwidth as a result.
* gmap.c: Made changes for compatibility with PMAP.
* stage3.c, stage3.h: Giving maxpeelback information to dynamic programming
routine, so it can use single gap penalties for long intron gaps. Made
changes for compatibility with PMAP.
* smooth.c, smooth.h: Changed smoothing routine to be based on net intron
lengths. Sequences of small exons are removed if they yield a net intron
length of approximately zero.
* dynprog.c, dynprog.h: Disallowing intron or cDNA gaps to be placed at the
edge of the segment, which caused an error to occur in the check_gaps
routine. Using single gap penalties for long intron gaps.
2006-03-17 twu
* sequence.c: Added handler for cases where requested subsequence start and
end are beyond the bounds of the sequence
* gmap.c, stage1.c, stage1.h: Added concept of maxtrial, to be used for
chimera (subsequence) problems
* stage3.c: Added an exception handle for errors in checking gaps
* dynprog.c: Disallowed intron or cDNA gaps to be inserted at ends of the
subsequence, which results in an unexpected gap.
2006-03-05 twu
* gmap.c: Providing maponlyp information to Sequence_read, to turn
skiplength warning message on or off.
* sequence.h: Set MAXSEQLEN to be 200000
* pair.c, stage3.c: Revision of procedures to handle sequences with
skiplength
* stage1.c: Expanded maxintronlen to include skiplength
* sequence.c, sequence.h: Addition of skiplength. Rewriting of code for
reading sequences to handle skipping of middle correctly.
2006-03-04 twu
* gmap.c: Reworking of maponlyp case to generate a Stage3_T object
* stage3.c, stage3.h: Implementedq Stage3_direct function for maponlyp case.
Cleaned up merge function for combining two Stage3_T objects.
* stage1.c, stage1.h: Cleaned up various procedures in stage 1 computation.
Simplified function identify_matches. Eliminating extensions for maponlyp
case.
* matchpair.c, matchpair.h: Added function for making a path from a
matchpair object
* matchpool.c: Simplified code for handling positions on reverse genomic
strands.
* match.c, match.h: Added function for printing the oligomer for a match.
Simplified code for handling positions on reverse genomic strands.
* oligoindex.c: Turned off code for changepoint analysis for trimming ends
* pair.c, pair.h: Modified printing of path summary for maponlyp
* result.c, result.h: Removed Stage1_T objects from Result_T
* genome.c: Added debugging statements
* block.c, oligo.c, oligo.h: Fixed problem where oligomers read from left
side need to be shifted down to low 12-mer. This corrects problem with
match coordinates being off by 4.
2006-03-02 twu
* gmap.c: Revised code for computing chimeras
* chimera.c, chimera.h: Made Chimera_T object created only when completely
specified
* stage3.c: Added a step to allow for subseq_offset, if present
* sequence.c, sequence.h: Added subseq_offset to Sequence_T
2006-03-01 twu
* dynprog.c, dynprog.h: Restored one gap behavior on ends. Using
cdna_direction information on single gaps.
* stage3.c: Forcing single gaps to be solved. Adding cdna_direction
information for single gaps. Fixed problem with short indels being
inserted backward.
* oligoindex.c, oligoindex.h: Implemented new scheme for detecting
repetitive sequence on ends, based on changepoint analysis
* smooth.c: Fixed memory leak.
* translation.c: Added check so we won't go beyond ends. Assigned variables
when npairs is too few.
2006-02-27 twu
* stage3.c, stage3.h: Minor bug fixes
2006-02-26 twu
* match.c, match.h, matchdef.h, matchpool.c, stage1.c: Keeping track of
number of pairings for each match, and placing a limit on the number of
matchpairs generated for each match with a "promiscuous" variable
2006-02-25 twu
* stage2.c: Made behavior similar for sequence and reverse complement,
including bug fix and using diffdistance rather than querydistance
2006-02-24 twu
* pairpool.c, pairpool.h: Added procedure for counting result of bounding
operation
* pair.c, pair.h: Counting amino acids directly for protein PSL output.
Fixed problem in coordinates output where chrstring was NULL.
* dynprog.c: Increased penalty for gaps in single alignments and made them
uniform across sequence quality
* smooth.c, smooth.h: Rewrite of code to use arrays instead of lists.
Reduced definition of short exon. Now deleting consecutive strings of
short exons.
* translation.c: Noting large insertions and deletions of amino acids, even
if not a multiple of 3
2006-02-23 twu
* chimera.c, chimera.h, gmap.c, stage3def.h: Moved various functions back to
stage3.c
* stage3.c, stage3.h: Performing substitution of gaps only for final cDNA
direction
* oligoindex.c, oligoindex.h: Turned off trimming of sequence for reference
sequences and for protein sequences
* intron.c, intron.h: Using cdna_direction information in assigning
Intron_type
* dynprog.c, pairpool.c, pairpool.h, stage2.c: Passing in gapp as a
parameter to Pairpool_push
* translation.c: Fixed bug with marking backwards cDNAs relative to
reference sequence
2006-02-22 twu
* translation.c: Fixed minor bugs in new implementation
* Makefile.am: Rewrite of code for determining mutations and for printing
the results. Removed mutation.c and mutation.h.
* mutation.c, mutation.h, pair.c, pair.h, translation.c: Rewrite of code for
determining mutations and for printing the results
2006-02-21 twu
* stage3.c: Moved some chimera functions from stage3.c to chimera.c. Set
acceptable_mismatches for microexons to be 2.
* Makefile.am, chimera.c, chimera.h, stage3.h, stage3def.h: Moved some
chimera functions from stage3.c to chimera.c
* dynprog.c: Increased probability standard for finding microexons
2006-02-19 twu
* translation.c: Fixed bug where cDNA translation was incomplete
* stage3.c: Fixed bug in substitution for gaps when ngap is not 3
* stage3.c, stage3.h: Complete rewrite of stage 3 to use gap pairs
* translation.c: Increased parameter for ignoring amino acid mismatches at
ends of query sequence
* smooth.c, smooth.h: Made changes to handle new gap pairs
* pair.c: No longer assigning coordinates for query sequence and genomic
segment within gaps
* matchpair.c, matchpair.h: Limiting 12-mer hits that are considered in
clustering method to those that have a neighboring hit within the query
length
* dynprog.c, dynprog.h: Inserting a single gap pair for introns and cDNA
insertions instead of filling in nucleotides
* stage1.c: Reduced extension of genomic segment when cluster mode is
required
* gmap.c: Put output to stderr when path not found in compressed output
* intron.c, intron.h: Moved Intron_type function here
* pairpool.c, pairpool.h: Added explicit functios for handling gap pairs
* pairdef.h: Added fields for queryjump and genomejump, to be used for gaps
2006-02-08 twu
* translation.c: Set minimum number of pairs required for a translation
2006-02-07 twu
* gmap.c: Now checking for existence of -g or -d flag before proceeding
* stage3.c: Fixed problem when solving an intron and unable to peel back
anything.
2006-02-05 twu
* dynprog.c: Fixed problem with extending 5' and 3' ends with assumption of
no gap. Added extra efficiency based on this assumption.
2006-01-19 twu
* README: Enhanced usage statement for gmap_setup
* gmap_setup.pl.in: Cleaned up flags. Added messages after each make
procedure. Enhanced usage statement.
* gmap_process.pl.in: Removed code for a separate strain file
* gmap_process.pl.in: Added provision for a separate strain file, but
commented out code
* md_coords.pl.in: Fixed problem when MD file has fewer than 6 lines. Put
output into an array for printing out in one batch. Improved handling of
strains.
* fa_coords.pl.in: Put output into an array for printing out in one batch.
* pmap_setup.pl.in, Makefile.am: Removed pmap_setup program
* stage3.c: Added procedure to fix short gaps
* gmapindex.c: Added ability to read reference strain from coords file
* gmap.c: Added provision for different stage 2 index size for PMAP
2006-01-17 twu
* pair.c: Fixed problem with protein PSL coordinates
2005-12-15 twu
* backtranslation.c, backtranslation.h: Fixed problems in backtranslation
when genomic segment has lower case characters
* gmap.c, stage3.c, stage3.h: Preserved diagnostic info in PMAP through
backtranslation
* pair.c: Changed printing of cDNA on ambiguous comps to be lower case if
appropriate
* dynprog.c: Changed ends from 1 gap to no gaps. Changed open/extend
penalties at ends (which may be irrelevant now).
* matchpool.c, stage1.c: Fixed problems with genomic position in reverse
complement matches in PMAP.
* translation.c: Fixed problems with ends of cDNA and genomic translation
for PMAP. Set margin to zero for computing amino acid changes.
* iit-read.c: Commented out abort
2005-12-14 twu
* sequence.c: Fixed uninitialized heap
2005-12-13 twu
* gmap.c, stage2.c, stage2.h: Added pruning before stage 2 based on number
of potentially consecutive hits and short paths
* oligoindex.c, oligoindex.h: Added computation of potentially consecutive
hits in the query
* stage1.c: Added filtering of matchlist based on support
* matchpair.c, matchpair.h: Added storage of support and usep in Matchpair_T
object
2005-12-09 twu
* gmap.c, stage2.c: Removed code for finding PMAP unaligned access error
* gmap.c, stage2.c: Added code for finding PMAP unaligned access error
* backtranslation.c, oligoindex.c: Removed code for checking assertions
* backtranslation.c, oligoindex.c: Added code for checking assertions
2005-12-08 twu
* pair.c: Streamlined determination of amino acid coordinates in alignment
output
* indexdb.c: Fixed bug in handling offsets in alternative strains in PMAP
* dynprog.c: Reformulated assignment of pointers in two-dimensional array
2005-12-06 twu
* translation.c: Formatting change
* stage1.c: Turned on use of matchpool. Fixed problem where list was not
reset to NULL.
* pair.c: Changed dir:unknown to dir:indet
* oligoindex.c: Fixed uninitialized variable in GMAP
* matchpool.c: Improved debugging statements
* matchpair.c: Increased standard for stage 1 support
2005-12-05 twu
* oligoindex.c: Made code compatible with both GMAP and PMAP
* backtranslation.c, dynprog.c: Reduced memory allocation for
two-dimensional array into a one-dimensional array
* matchpool.c, pairpool.c: Removed initial creation of chunks
* oligoindex.c: Fixed bug in PMAP where stop codon in the genomic sequence
created a value that exceeded oligospace
* pair.c, pair.h: Added a way for the thread worker id to be printed with
the result. Removed ambiguous comp characters from gmap.
* gmap.c, reqpost.c, reqpost.h, result.c, result.h, stage3.c, stage3.h:
Added a way for the thread worker id to be printed with the result
* matchpool.c: Added commands for saving and restoring pointers, so memory
can be re-used
* match.c, match.h, stage1.c: Added compiler conditions for using matchpool
method.
* genome.c: Fixed messages to user
* chrsubset.c: Changed format of output
* translation.c: Fixed bug in translating backward cDNAs. Extended
translation all the way to the end.
2005-12-04 twu
* Makefile.am, gmap.c, matchpair.c, matchpair.h, matchpairdef.h,
matchpairpool.c, matchpairpool.h, stage1.c, stage1.h: Removed special
memory allocation routines for matchpairs
* Makefile.am, gmap.c, match.c, match.h, matchpair.c, matchpair.h,
matchpairdef.h, matchpairpool.c, matchpairpool.h, matchpool.c,
matchpool.h, stage1.c, stage1.h: Added special memory allocation routines
for matches and matchpairs
* iit-read.c: Added an exception handler
* pair.c: Commented out unused procedure
* genome.c: Added include of except.h
2005-12-02 twu
* gmap.c: Fixed memory leak
* translation.c: Added separate routine for printing list of mutations.
Fixed problem where number of cDNA nucleotides in codon is 4 or 5.
* stage2.c: Clarified different code for gmap and pmap
* stage1.c: Added checking routine for Stage1 object
* access.c, mem.c: Augmented debugging statements
* sequence.c: Fixed case where first sequence of FASTA file has no header,
but subsequent sequences do.
* nr-x.c: Initial import into CVS
* pair.c: Added printing of aapos to all positions in "f -9" mode
* mutation.c: Simplified logic of merge functions
2005-11-29 twu
* match.h: Provided interface for new functions
* gmap.c: Fixed bug due to switched parameters
* stage3.c: Added comment
* config.site: Added information about defaults
* README: Added information about Cygwin and defaults
2005-11-28 twu
* stage1.c: Added include of match.h
2005-11-23 twu
* acinclude.m4, fopen.m4, configure.ac: Added commands to check for 'b' or
't' flag to fopen
* pmap.c: Removed obsolete file
* Makefile.am, access.c, chrsubset.c, datadir.c, fopen.h, genome-write.c,
genomeplot.c, gmap.c, gmapindex.c, iit-read.c, iit-write.c, iit_store.c,
indexdb.c, oligo-count.c, pdldata.c, pmapindex.c: All calls to fopen now
generalized to handle systems that allow or disallow the 'b' or 't' flag
2005-11-22 twu
* VERSION: Updated version
* Makefile.am: Removed coords1.test, which is now performed by setup1.test
and setup2.test
* setup1.test.in, setup2.test.in: Added prerequisite of fa_coords program
for setup tests
* README: Made instructions for raw genome build match changes in gmap_setup
* gmap_setup.pl.in: Changed name of make command
* gmap_setup.pl.in: Clarified comments
* gmap.c: Made npaths output correct when user provides a segment
* match.c, matchdef.h, stage1.c: Storing reciprocal of nentries to avoid
repeating this calculation multiple times later
* setup1.test.in, setup2.test.in: Made changes in test to match changes in
program
* align.test.ok, map.test.ok: Made change in output from Mutations to Amino
acid changes
* Makefile.am: Made change in name of coords file
* README: Made instructions consistent with changes in programs
2005-11-21 twu
* fa_coords.pl.in: Changed a flag. Output now going to stdout rather than
stderr.
* gmap_setup.pl.in: Now making the call to fa_coords or md_coords within the
Makefile
* matchpair.c: Turned off debugging
* match.c, match.h, matchdef.h: Storing number of entries for each match
* indexdb.c: Moved one type of debug macro into its own category
* stage1.c: Weighted dangling computation according to number of entries for
each match
* translation.c: Fixed bug where pointer went past beyond sequence
2005-11-19 twu
* chrsubset.c, gmap.c: Added printing of chrsubset information.
Consolidated printing of npaths information into a single function.
* backtranslation.c: Using the two aamarkers. Allowing matches to codons
even for frameshifts.
* mutation.c: Allowed merging of adjacent insertions
* translation.c: Made PMAP assignment of genomic amino acids conform to GMAP
code for assignment of cDNA amino acids
* translation.c: Added further translation of cDNA beyond genomic stop
codon, if possible
* translation.c: Streamlined code for amino acids to cDNA sequence
* translation.c: Overhaul of method for assigning amino acids to cDNA
sequence, now based on separate marking and assignment of codons.
* pair.c, pairdef.h, pairpool.c: Created separate aamarkers for genomic and
cDNA sequence
2005-11-18 twu
* gmap.c, params.c, params.h, stage3.c, stage3.h, translation.c,
translation.h: Added flag for specifying maximum number of amino acid
changes to show
* stage1.c: Fixed memory leak
* matchpair.c: Fixed read of uninitialized heap when bestsize == 0
* matchpair.c, matchpair.h: Removed storage of support value
* gmap.c, matchpair.c, matchpair.h, stage1.c: Moved sequence pruning
procedures from gmap.c to matchpair.c
* stage1.c: Fixed bug which caused loop to continue unnecessarily
* gmap.c, result.c, stage1.h: Added complete option for freeing Stage 1
objects
* stage1.c: Introduced idea of stepping through trials to identify poor
genomic matches
* matchpair.c, matchpair.h: Introduced method for salvaging individual
12-mer hits
* gmap.c, stage1.c, stage1.h: Simplified call to Stage1_matchlist
* stage1.c: Cleaning up parameters in preparation for cycling through stage 1
2005-11-17 twu
* indexdb.c: Added forward/backward to pre-loading messages for pmap
* translation.c: Skipping mutation calls on non-standard amino acids
* backtranslation.c: Fixed bug when trying to backtranslate non-standard
amino acids
* gmap_setup.pl.in: Added intermediate commands to Makefile
2005-11-11 twu
* backtranslation.c: Improved matching of genomic codon to cdna codon.
* translation.c: Added debugging statement
* pair.c: Restored printing of genomic sequence for ambiguous matches in pmap
2005-11-10 twu
* genome-write.c: Added read of linefeed after FASTA entry in raw genome
files. Improved speed of writing blocks of zeros or X's.
* gmapindex.c: Fixed bug in skip_sequence for raw genome files
* get-genome.c: Implemented printing of raw genome files
* gmap.c, stage3.c, stage3.h: Moved final translation and backtranslation
steps into print procedures
* Makefile.am, backtranslation.c, backtranslation.h, translation.c,
translation.h: Moved nucleotide consistency procedures for pmap into
backtranslation.c
* pair.c: Removed consistency conversion. Now being done by backtranslation
procedures. Removed meaning of AMBIGUOUS_COMP for compressed output of
pmap.
* dynprog.c: Added actual coordinates to debugging statements
* translation.h: Made backtranslation procedure more rigorous.
* translation.c: Made backtranslation procedure more rigorous. Added
debugging statements.
2005-11-09 twu
* get-genome.c: Changed -r flag to also indicate use of the uncompressed
genome file
* get-genome.c, sequence.c, sequence.h: Added uncompressed raw format for
printing genome segment
* genome-write.c, genome-write.h, gmapindex.c: Added uncompressed raw format
for genome file
2005-11-08 twu
* pair.c: Reformulated printing of protein-based PSL output
* intlist.c: Added include of stdio.h
* chimera.c: Removed include of nmath.h
* gmap.c: Allowed coordinate output for pmap. Changed flag to -f 9.
2005-11-04 twu
* gmap_compress.pl.in: Allowed handling of PMAP output
* gmap_uncompress.pl.in: Fixed bug in printing last line of alignment
2005-11-01 twu
* gmap.c, pair.c, pair.h, stage3.c, stage3.h: Allowed introns to be printed
in exon mode
* matchpair.c: Imposed the requirement that minsize be 2 or more away from
bestsize
2005-10-31 twu
* gmap.c, pair.c, pair.h, stage3.c, stage3.h: Added ability to print exons
using genomic sequence
2005-10-28 twu
* indexdb.c: Using lseek instead of fseek/fseeko for writing a positions
file on disk
* access.c, access.h: Added a function for opening a file as read/write
2005-10-27 twu
* VERSION: Updated version
* Makefile.am: Restored setup2.test
* setup2.test.in: Made test use new gmap_setup script
* setup1.test.in: Removed install step
* gmap_setup.pl.in: Fixed bug in clean statement
* configure.ac: Added check for fseeko
* gmap.c: Added information about various type sizes to -V flag
* compress.c, genome-write.c, indexdb.c: Using fseeko if available
2005-10-25 twu
* gmap_setup.pl.in: Fixed bug where -W flag was in the wrong branch
* pair.c: Removed extraneous linefeed in compressed output
* VERSION: Updated version number
* pair.c: Fixed psl output
* README: Clarified use of ./configure flags. Added instructions for the -C
flag in fa_coords.
* gmap_setup.pl.in: Restored -W flag for writing directly to file
* gmap_setup.pl.in: Added instructions for clean to Makefile
* fa_coords.pl.in: Added -C flag to make each sequence a separate chromosome
* pair.c, pair.h, stage3.c: Added printing of cDNA direction in compressed
output
* bigendian.h, mem.h: Added include of config.h.
2005-10-21 twu
* VERSION: Updated version
* README: Added instructions for running make after gmap_setup
* MAINTAINER: Added reminder to check for DEBUG mode
* pair.c, pair.h, stage3.c, stage3.h: Restored printing of strain information
* pdldata.c: Fixed typo
* oligo-count.c: Using new interface to indexdb
* gmap.c: Added error message if user tries to use strain information and
file is not found
* gmap.c: Restored printing of strain information. Added conversion to
upper case for altstrain sequence.
* genome.c, indexdb.c: Added printing of number of bytes
* access.h: Added MAX32BIT
* access.c: Added debugging statements
* Makefile.am: Added needed files
2005-10-18 twu
* configure.ac: Added warning message if mmap not available
* fa_coords.pl.in: Added ability to read from stdin
* setup1.test.in: Added install command
* access.c, access.h: Made Access_filesize an external routine
* genuncompress.c, pdldata.c: Using routines from access.c
* Makefile.am: Added access.c and access.h to programs with IIT_T object
* chrsubset.c: Added include of config.h
* genome-write.c, genomeplot.c, get-genome.c, iit_dump.c, iit_get.c,
pmapindex.c, segmentpos.c: Changed calls to IIT_free and IIT_annotation
* gmapindex.c: Removing free of accsegmentpos_table, which fails on some
computers
* gmap.c: Reading user-provided genomic segment and reference sequence
before FASTA query
* iit-write.c, iit-write.h: Made write version of IIT_free static and
renamed it.
* iit-read.h: Changed interface to IIT_annotation.
* iit-read.c: Added FILEIO mode for reading IIT_T objects. Changed
interface to IIT_annotation.
* iitdef.h: Made mutex part of IIT_T object. Added offset to IIT_T object
for FILEIO mode.
* indexdb.c: Made mutexes part of Indexdb_T object. Changed calls to
IIT_annotation.
* genome.c: Made mutex part of Genome_T object
* access.h: Added flag for randomp. Added function for read/write mmap.
* access.c: Moved file size determination to a separate function
2005-10-14 twu
* gmap.c: Moved reading of input sequences to beginning
* indexdb.c: Minor fixes
* access.h: Returning length and time for Access_immediate
* access.c: Returning length and time for Access_immediate. Forcing read of
pages during pre-load.
* datadir.c, gmap.c: Removed unused variables
* genome.c, stage3.c: Added necessary include file
* result.c: Addressed compiler warning
* pair.c: Fixed faulty print statement in pslformat_nt
* indexdb.c: Added necessary include file. Removed unnecessary variables.
* dynprog.c: Applied type conversion for char to access array
* Makefile.am: Added access.c and access.h
* blackboard.c, blackboard.h, gmap.c: Added nextchar to Blackboard_T object
* gmap.c: Now reading first sequence in main thread, and using existence of
a second sequence to determine whether to start threads and to pre-read
offsets file for GMAP. Conditioning some flags based on existence of mmap
and threading support.
* datadir.h: Removed unnecessary include
* intlist.c: Minor fix to resolve gcc compiler warning
* access.c, access.h, genome.c, indexdb.c: Standardized file access routines
and moved them to access.c
2005-10-13 twu
* genomeplot.c, plotdata.c, plotdata.h: Fixed ASCII printing of universal
coordinates when a range is selected
* matchpair.c: Fixed calculation of genome length for segment
* sequence.c: Fixed Sequence_read_unlimited to handle sequences without a
header line.
2005-10-12 twu
* gmap_setup.pl.in: Changed program to generate a Makefile
* fa_coords.pl.in, md_coords.pl.in: Deleted comment about gmap_setup running
time
* gmap_process.pl.in: Initial import into CVS
* Makefile.am: Added instructions for gmap_process
* setup1.test.in: Modified setup test for new interface to utility programs
* Makefile.am: Modified setup test to put binary files in tests directory
* MAINTAINER: Minor note to self
* Makefile.am: Made FULLDIST work for gmap sources
* gmap.c: Made separate flags for batch for offsets and batch for positions
file. Simplified input thread.
* indexdb.h: Made separate flags for batch for offsets and batch for
positions file
* indexdb.c: Added memory mapping for offsets files under PMAP. Made
separate flags for batch for offsets and batch for positions file.
* oligoindex.c: Removed stop codon from oligomers in stage 2
* Makefile.am: Moved beta code for GMAP into a separate program
* stage2.c: Moved PMAP conditionals out of debugging statements
2005-10-11 twu
* Makefile.am: Removed conditional distribution of files
* translation.c: Including comp.h header
* gmap.c, pmapindex.c: Changed PMAP indexing interval to be based on amino
acids.
* oligop.c: Removed STOP from amino acid alphabet.
* gmapindex.c: Generating chrsubset file at same time as chromosome file
* indexdb.h: Removed STOP from amino acid alphabet. Changed PMAP interval
to be based on amino acids.
* indexdb.c: Simplified conversion of oligomer to amino acid index for PMAP.
Removed STOP from amino acid alphabet. Computing each protein frame
separately.
* configure.ac: Added large file support with AC_SYS_LARGEFILE. Removed
setup test number 2. Added gmap_process.
* acinclude.m4: Removed macros for O_LARGEFILE
* open-flags.m4: Removed file open-flags.m4
2005-10-10 twu
* acinclude.m4, open-flags.m4: Added check for O_LARGEFILE in open
2005-10-07 twu
* gmap_setup.pl.in: Restored -W flag and improved it
2005-10-06 twu
* VERSION: Updated version
* configure.ac: Added hook for pmap_setup.pl
* README: Added explanation of full, uncompressed genome, and of batch modes
* gmap_setup.pl.in: Added checks to make sure desired files are built.
Added printing of commands to stdout.
* pmap_setup.pl.in: Added checks to make sure desired files are built
* gmap_compress.pl.in, gmap_uncompress.pl.in, pair.c: Altered format of
compressed output to indicate ambiguous matches
* genome.c: Fixed batch loading of full genomes greater than 2 gigabytes
* gmap.c: Modified message about batch mode and multiple threads mode
* stage2.c: Parameterized alignment characters and defined them centrally in
comp.h. Restored previous intron penalties based on length.
* pair.c: Parameterized alignment characters and defined them centrally in
comp.h. Now printing ambiguous nucleotide matches.
* dynprog.c: Parameterized alignment characters and defined them centrally
in comp.h. Added separate table for consistent nucleotide pairs.
* Makefile.am, comp.h, pairpool.c, stage3.c, translation.c: Parameterized
alignment characters and defined them centrally in comp.h
2005-10-05 twu
* gmap.c: Changed batch mode to be of two types: pre-loading of indices
only, and pre-loading of both indices and genome.
* gmap.c: Clarified various user messages
* indexdb.c: Added an explicit check for a nonsensical offsets file
* pair.c: Made margin width determined dynamically in printing the alignments
2005-10-03 twu
* dynprog.c: Removed reverse intron possibilities from PMAP
* gmapindex.c: Restored monitoring output for logging contigs
* indexdb.c: Added fwd/rev to monitoring commands for indexing offsets and
position files
* compress.c: Added monitoring commands for compressing and uncompressing
files
* gmap_setup.pl.in: Clarified behavior and instructions for building a full
(uncompressed) genome file
* fa_coords.pl.in: Abbreviated monitoring output, with a parameter that
controls which contigs to ignore
* Makefile.am: Added make instructions for pmap_setup
* pmap_setup.pl.in: Initial import into CVS
2005-10-01 twu
* gmap.c: Performing translation of query sequence and genomic segment to
upper case. Turned off stage 1 for user-provided genomic segment in PMAP.
Provided -G flag for specifying full genome, if it exists.
* genome.c: Turned warning into error, if user wishes to use a full genome
and none exists
* dynprog.c: Allowed intron gap parameter to be arbitrarily large
* pair.c, stage2.c, translation.c: Fixed handling of user-provided genomic
segment with lower case characters for PMAP
* stage1.c: Improved debugging statement
* oligoindex.c: Minor formatting change
* mem.c: Enhanced trap features
* boyer-moore.c: Removed assertions
2005-09-30 twu
* boyer-moore.c, dynprog.c, dynprog.h, stage3.c, stage3.h: Made stage 3 use
upper case for query sequence and genomic segment when needed, but
original sequences for building alignment
* oligoindex.c, oligoindex.h, stage2.c, stage2.h: Made stage 2 use upper
case for query sequence and genomic segment for oligomer chaining, but
original sequences for building alignment
* oligo.c, oligop.c, stage1.c, stage1.h: Made stage 1 assume upper case
query sequence
* pair.c: Removed call to toupper
* complement.h, sequence.c, sequence.h: Provided utilities for making
uppercase and alias versions of sequences
* compress.c: Added toupper as reason for including ctype.h
* plotdata.c: Revised autoscale function
2005-09-29 twu
* stage2.c: For PMAP, fixed bug where C terminus of query sequence was not
aligned. Eliminated computation of reverse intron direction for PMAP.
* oligoindex.c: Modified comments
* gmap.c: Removed Sequence_trim for PMAP, and reduced stage 2 indexsize.
* pair.c: Changed psl output to reflect definition of a block to be a region
without indels or gaps, instead of an exon
2005-09-22 twu
* oligoindex.c: Make amino acid index for stage 2 (with 21 amino acids)
distinct from that of stage 1 (with 16)
* indexdb.c, indexdb.h, oligop.c: Compressing 21 amino acids into 16 to
allow offsets of amino acid 7-mers to fit into less than 2 GB
2005-09-21 twu
* stage1.c: Parameterized size of oligomers for PMAP
* gmap.c, indexdb.h: Parameterized interval for stage 1 when user provides a
genomic segment
* pmapindex.c: Parameterized size of oligomers
* matchpair.c: Turned off debugging
* indexdb.c, indexdb.h: Introduced indexing of 7-mers by PMAP
* VERSION: Updated version
* gmap_setup.pl.in: Commented out -W flag for forcing write to file. Added
option -G for making an uncompressed version of the genome (.genome file).
* fa_coords.pl.in: Allowed both chr and Chr in parsing for chromosomal
mapping
* config.site: Clarified possible choices for LDFLAGS
* matchpair.c: Penalizing clusters spread out in repetitive genomic regions
* pdlimage.c: Made images in color
2005-09-20 twu
* gmapindex.c: Commented out monitoring statement about logging contigs
* stage1.c: Fixed a bug involving subtraction of two unsigned ints into a
signed int, occurring for chromosomes greater than 2^31 in length.
2005-09-19 twu
* stage2.c: Fixed bug when stage 2 fails
* pair.c: Fixed assessment of unknown bases for PMAP queries
* matchpair.c: Fixed computation of stretch for PMAP protein queries
* indexdb.c: Removed debugging flag
* iit_get.c: Added termination message and flushing output when input coming
from stdin
* gmap.c: Added debugging messages
2005-09-16 twu
* Makefile.am, pdlimage.c: Initial addition of pdlimage to CVS.
2005-09-08 twu
* iit-read.c, iit-read.h, stage3.c: Added option to print levels of map
results
* intlist.c, intlist.h: Added command for Intlist_to_string
* gmap.c: Modified directory printing to go to a given file pointer. Added
information about default directory to print_version command.
* datadir.c, datadir.h, get-genome.c: Modified directory printing to go to a
given file pointer
* genomeplot.c, plotdata.c, plotdata.h: Changed format of positions file.
Changed title for summary genome plots.
2005-09-06 twu
* get-genome.c: Added ability to print levels of map contents. Fixed bug in
interpreting an entire chromosome.
2005-09-02 twu
* genomeplot.c, pdldata.c, pdldata.h, plotdata.c, plotdata.h: Generalized
variable for transform and added reciprocal
* genomeplot.c, plotdata.c, plotdata.h: Added option to autoscale
* genomeplot.c, pdldata.c, pdldata.h, plotdata.c, plotdata.h: Added options
for computing summary of multiple samples
* genomeplot.c, plotdata.c, plotdata.h: Added functions for printing a
threshold line, and for printing output in ascii format.
* pdldata.c: Now removing line feeds from annotations. If no annotations
are available, using sample numbers.
2005-08-29 twu
* plotgenes.c, plotgenes.h: Improved display of genes
* genomeplot.c: Made changes so PDL file is read only when necessary. Added
extra room for showing genes.
2005-08-26 twu
* genomeplot.c, plotdata.c, plotdata.h: Printing accession header only if
one sample per page. Reduced default number of genomes per page to 12.
* genomeplot.c, pdldata.c, pdldata.h, plotdata.c: Added ability to read
sample identifiers from a separate file for PDL input
* genomeplot.c: Added ability to plot a single page
2005-08-23 twu
* gmap.c, pair.c, pair.h, stage3.c, stage3.h: For PMAP, allowed PSL output
in both nucleotide coordinates and protein coordinates
* Makefile.am, genomeplot.c, plotgenes.c, plotgenes.h: Added ability to plot
genes
2005-08-18 twu
* gmap.c, stage3.c, stage3.h: Added option for printing coordinates
* plotdata.c, plotdata.h: Added options for printing dots and overlapping
samples
* pair.c, pair.h: Added option for printing coordinates. Trying to fix PSL
output for PMAP.
* genomeplot.c: Added option for printing dots and overlapping samples
* color.c, color.h: Added color brewer palette
* plotdata.c: Prevented printing of empty strings
2005-08-16 twu
* plotdata.c, plotdata.h: Fixed bug when only a subset of genes is selected.
Added commands for gif output.
* get-genome.c, gmap.c: Showing available map files when valid one is not
entered
* datadir.c, datadir.h: Added function to list directory contents
* genomeplot.c: Allowed user to specify a list of samples to plot
* intlist.c, intlist.h: Added function Intlist_from_string
* stage3.c: Fixed mapping to account for cDNA direction
2005-08-10 twu
* Makefile.am, genomeplot.c, pdldata.c, pdldata.h, plotdata.c, plotdata.h:
Allowed genomeplot to read PDL files
2005-08-07 twu
* iit-read.c: Improved debugging statements
* gmap.c, stage3.c, stage3.h: Added option to map by exons
* pair.c, pair.h: Added function to retrieve exon bounds
2005-08-04 twu
* Makefile.am, gmap.c: Merged pmap main code into gmap.c
* sequence.c, sequence.h: Added functionality for pmap chimeras
* stage3.c, stage3.h: Changed function to take queryntlength instead of
queryseq. Made function work with both gmap and pmap.
* chimera.c, chimera.h: Changed functions to take queryntlength instead of
queryseq
* pair.c: Made PSL format for proteins print protein coordinates
2005-08-03 twu
* chimera.c, chimera.h, gmap.c: Changed chimera algorithm to potentially
search both sides of an incomplete alignment
2005-08-02 twu
* stage3.c: Increased size of merge length for chimeric exon-exon junctions
* sequence.c: Restored trimming of subsequences
2005-08-02 gcavet
* modules: put back to original state
* modules: added dev module
2005-08-01 twu
* stage3.c, stage3.h: Changed chimeric margin detection to work on both ends
* stage1.c: Added debugging statements
* chimera.c, chimera.h, gmap.c: Changed chimeric search to work on both ends
that fail to align
* sequence.c: Turned trimming off for subsequences
* pair.c, pair.h: Added indel penalties at appropriate end for chimeric path
scores
* Makefile.am, get-genome.c: Allowed user to look up information in map iit
files
* chimera.c: Tested code for checking if breakpoint is outside the alignment
2005-07-29 twu
* datum.c, datum.h, genomeplot.c, plotdata.c, plotdata.h: Allowed colors to
be specified in input file
* chrsubset.c, chrsubset.h, genomeplot.c, plotdata.c: Implemented
user-selected genomic range
2005-07-28 twu
* plotdata.c: Allowed program to handle NaNQs
* genomeplot.c, plotdata.c, plotdata.h: Added log and signed cube root
functions
* genomeplot.c, plotdata.c, plotdata.h: Added ability to print genome on a
single line
2005-07-27 twu
* genomeplot.c: Added ability to handle multiple samples
* plotdata.c, plotdata.h: Added ability to detect and read header lines.
Made code for starting and ending pages extern.
* genomeplot.c, plotdata.c, plotdata.h: Implemented printing of circular
genome
2005-07-26 twu
* pair.c: Corrected query coordinates of chimera in compressed mode
* pairpool.c: Fixed problem where a gap was left at the 5' end of a bounded
transfer.
* datum.c, datum.h, genomeplot.c, plotdata.c, plotdata.h: Added ability to
read specified colors for each line
* datum.c, datum.h, genomeplot.c, plotdata.c, plotdata.h: Allowed printing
of segments between chromosomes
2005-07-25 twu
* pmap.c, stage3.c, stage3.h: Allowed chimeric pieces to be merged over
longer length if ends have strong splice sites
* stage1.c: Restored reader overlap for longer sequences
* gmap.c, iit-read.c, iit-read.h: Added chromosomal positions to map
information
* chrnum.c, chrnum.h: Added function to get offset for a chrnum
* Makefile.am: Added chrsegment.h to sources for genomeplot
2005-07-22 twu
* gmap.c, pmap.c, stage3.c, stage3.h: Allowed two parts of chimera to merge
if close on the genome
2005-07-21 twu
* stage3.c, stage3.h, translation.c, translation.h: Clarified code specific
to PMAP and GMAP
* sequence.c: Changed sequence header for PMAP to refer to amino acids
* pair.c, pair.h: Added ability to print inferred nucleotide sequence for
PMAP
* oligop.c: Clarified meaning of INDEX1PART to be number of amino acids
* pmapindex.c: Clarified meaning of INDEX1PART to be number of nucleotides
* oligo-count.c: Using new interface for Reader_new
* indexdb.c, indexdb.h: Clarified meaning of INDEX1PART to be number of
nucleotides for PMAP.
* pmap.c: Removed -q flag for specifying stage 1 interval, and removed -T
flag for truncating sequence at full-length protein. Specified -Q flag to
be printing of inferred nucleotide sequence.
* gmap.c: Removed -q flag
* Makefile.am: Added beta source files for pmap
* stage1.c: Introduced min intron length. Clarified meaning of INDEX1PART
to be number of amino acids. Added debugging statements.
* reader.c, reader.h: Allowed crossover of start pointer and end pointer so
that middle oligomers will be read. Should help in mapping of short
sequences.
* translation.c, translation.h: Moved combinatorial testing of codons to
translation step
* stage3.h: Performing protein translation only when necessary.
* stage3.c: Considering only forward intron directions for pmap. Performing
protein translation only when necessary.
* stage2.c: Considering only forward intron directions for pmap
* sequence.c: Changed header line for pmap
* pair.c: Made further changes to accommodate plus sign in alignment
* pair.c: Introduced plus sign in alignment
* oligoindex.c: Improved efficiency of analyzing genomic segment, by storing
indices for each frame
* genome.c: Added debugging statements
* dynprog.c, dynprog.h: Changed combinatorial instantiation of codons to a
single instantiation
* Makefile.am, params.c, params.h, pmap.c: Gave pmap the same overall
behavior as gmap, including multi-threading and flag options
2005-07-19 twu
* Makefile.am, block.c, block.h, dynprog.c, dynprog.h, gmap.c, indexdb.c,
indexdb.h, oligoindex.c, oligop.c, oligop.h, pair.c, pmap.c, pmapindex.c,
sequence.c, sequence.h, stage1.c, stage1.h, stage2.c, stage2.h, stage3.c,
stage3.h, translation.c, translation.h: Introduced pmap and pmapindex
2005-07-15 twu
* get-genome.c: Added range format to allow negative lengths
* genome.c: Added exception when requested length exceeds allocated buffer
length
* except.c: Added printing of exception message
* chimera.c: Fixed problem when donor or acceptor length exceeded allocated
buffer length
* Makefile.am: Fixed handling of non-distributed source code
* align.test.ok: Changed genomic coordinate to match new computation of
coordinates in gaps
* VERSION: Updated version number
2005-07-13 twu
* dynprog.c: Made genomic positions on left and right ends of gap constant,
to avoid problems in stage 3 computations
* memchk.c: Made procedures thread-safe
* stage3.c, stage3.h: Fixed genomic positions on left and right ends of gap.
Removing gaps at 5' end, possibly introduced by smoothing.
* mem.c, mem.h: Improved memory trap procedures
* matchpair.c: Added check before freeing some possibly null structures
* genome.c: Removed duplicate FREE of filename
* gmap.c: Fixed genomic positions on left and right ends of gap. Fixed bug
when chimera was not reset to NULL.
* pair.c, pair.h: Fixed genomic positions on left and right ends of gap
2005-07-12 twu
* stage3.c: Fixed bugs in computing dual introns, dealing with previously
computed gaps, and returning coordinates for empty peelbacks.
* stage1.c: Increased parameters for maximum number of matching pairs
considered
* pairpool.c: Enhanced debugging output
* pair.c, pair.h: Added procedure for printing a single pair
* mutation.c: Changed unnamed unions to named unions
2005-07-08 twu
* stage3.c: Added extra check to make sure pairs is non-empty
* gmap.c: Initialized chimera to be NULL
* oligoindex.c: Fixed bug caused by writing to a random location when
indexsize < 8.
* mem.c: Improved trap code
* memchk.c: Changed types to be consistent with regular version of memory
manager
* memchk.c: Added checking implementation of memory manager
* stage3.c: Fixed a segmentation fault bug.
* stage2.c: Changed distpenalty to ignore query distance and
max_intronlength, and simplified computation. These values were probably
not affecting previous computation anyway.
* mutation.c: Fixed problem caused by removal of unnamed union
* iitdef.h: Included header file for off_t type.
* gmap.c: Changed maxpeelback for cross-species mode back to previous value
* VERSION: Updated version
* configure.ac: Added check for caddr_t type. Added check for madvise flags.
* gmap.c: Removed unnecessary variables and arguments. Changed variable
type of nworkers.
* oligoindex.h, pairpool.c, pairpool.h, stopwatch.c, stopwatch.h: Added
formal void argument
* compress.c, genome.c, genome.h, matchpair.c, oligoindex.c, stage2.c:
Removed unnecessary variables
* block.c, boyer-moore.c, dynprog.c, dynprog.h, matchpair.h, oligo.c,
oligo.h, stage1.c, stage1.h: Removed unnecessary arguments
* sequence.c, sequence.h: Added formal void argument. Changed some variable
types.
* indexdb.c, match.c, segmentpos.c: Changed print statement
* pair.c: Added static specification to some functions
* iit-read.c, iitdef.h: Changed some variable types
* indexdb.c: Increased interval of monitoring output from 1 million nt to 10
million nt
* gmap_uncompress.pl.in: Fixed bug in argument list
* bigendian.c, boyer-moore.c, chrom.c, chrsubset.c, chrsubset.h, compress.c,
datadir.c, dynprog.c, except.c, genome-write.c, genome.c, get-genome.c,
gmap.c, gmapindex.c, iit-read.c, iit-write.c, iit_dump.c, iit_get.c,
iit_store.c, indexdb.c, interval.h, intlist.c, list.c, match.c,
matchpair.c, md5-compute.c, md5.c, mutation.c, oligo-count.c, oligo.c,
oligoindex.c, pair.c, pairpool.c, params.h, reqpost.c, segmentpos.c,
segmentpos.h, sequence.c, smooth.c, stage1.c, stage2.c, stage3.c,
translation.c, uintlist.c: Made changes to satisfy pedantic gcc compiler
warnings and to comply with ANSI C
* acinclude.m4: Added autoconf macro for madvise flags
* MAINTAINER: Added comment about strict compiler checking
* madvise-flags.m4: Initial import into CVS
2005-07-07 twu
* gmap.c: Changed parameters to prevent segmentation fault in cross-species
mode
* gmap.c, stage3.c, stage3.h, pair.c, pair.h: Added psl output format
* stage1.c: Increased matchpairs allowed at pre-unique stage
* match.c, match.h: Trivial formatting change
* gmapindex.c: Removed some unused variables
* get-genome.c: Changed usage statement for coordinate interval
* datadir.c: Added error message when genome subdirectory is not readable
* chrnum.c, chrnum.h: Added Chrnum_length command, needed for psl output
format
2005-06-23 twu
* VERSION: Updated version for release
* stage2.c: Increased cross-species penalty for intron length
* gmap.c: Added other constraints on using oligo depth. Reporting failure
type. Separated out beta source files from gmap.
* result.c, result.h: Added failure type
* chrsubset.c, chrsubset.h, plotdata.c: Added checks if chrsubset is NULL.
* genomeplot.c: Added getopt to genomeplot. Added mode for printing
segments.
* Makefile.am: Added getopt to genomeplot. Separated out beta source files
from gmap.
2005-06-21 twu
* genomeplot.c, plotdata.c, plotdata.h: Fixed coloring of raw data,
depending on whether segmentation is performed.
* Makefile.am: Moved chrsegment functionality to genomeplot
* gmap.c: Giving crossspecies flag to Stage 2
* genomeplot.c: Getting segment breakpoints back in three separate lists
* chrsegment.c, chrsegment.h: Added re-checking of segment breakpoints
* intlist.c, intlist.h: Added Intlist_delete function
* stage2.c, stage2.h: Implemented different intron penalties for
crossspecies mode
* stage1.c: Restored full functionality for crossspecies mode
2005-06-16 twu
* stage1.c: Added check for too many matchpairs before applying
Matchpair_filter_unique
* chrsegment.c, chrsegment.h: Using chromosomal positions in calculations
* genomeplot.c, plotdata.c, plotdata.h: Modified calls to Plotdata_values
and Plotdata_chrpositions
* genomeplot.c, iit-read.c, iit-read.h: Added function IIT_length
* plotdata.c: Storing chrpositions and values as individual arrays
* genomeplot.c: Using a tree structure to store segment results.
* chrsegment.c, chrsegment.h: Using a tree structure to store segment
results. Added check for single breakpoint in addition to double
breakpoints.
2005-06-15 twu
* chrsegment.c, chrsegment.h, genomeplot.c: Implemented recursive
segmentation, generating a list of segments
* iit-read.c: Fixed problem with memory fault
* Makefile.am, chrsegment.c, chrsegment.h, genomeplot.c, plotdata.c,
plotdata.h: Merged chrsegment functionality into genomeplot
* genomeplot.c: Fixed some memory leaks
* datum.c, datum.h: Added Datum_T object for use by Plotdata_T
* Makefile.am: Added program chrsegment and added Datum_T object to
genomeplot
* chrsegment.c, nr-x.h: Added program chrsegment
* plotdata.c, plotdata.h: Now storing data as sorted within each chromosome
* chrsubset.c, chrsubset.h: Added function to compute and retrieve old
indices
2005-06-14 twu
* Makefile.am, chrsubset.c, chrsubset.h, color.c, color.h, doublelist.c,
doublelist.h, genomeplot.c, plotdata.c, plotdata.h: Added program
genomeplot
* uintlist.c: Fixed typo
* iit-read.c: Skipping freeing of memory, since it sometimes gives a memory
fault.
* Makefile.am, chimera.c, maxent.c, maxent.h, splice-site.c, splice-site.h:
Changed splice site predictor from scoring matrix to maxent method
2005-06-10 twu
* indexdb.c: Added error message when user-provided genomic segment is
invalid
2005-06-03 twu
* chimera.c, chimera.h: Added detection of exon-exon boundary for chimeras
in both forward and reverse directions
* gmap.c, pair.c, pair.h, stage3.c, stage3.h: Added output of cDNA direction
of exon-exon boundary for chimeras
2005-06-02 twu
* dynprog.c, dynprog.h, gmap.c, stage3.c: Restored previous behavior for
finding microexons. Changed meaning of end_microexons_p to be an
allowance for longer introns at the ends.
* chimera.c: Improved debugging output
2005-06-01 twu
* stage3.c, stage3.h: Added utilities for new chimera functions.
* gmap.c: Added stage 3 calls for truncating full length. Using Chimera_T
objects and new chimera functions.
* stage1.c, stage1.h: Using ends only for cross-species mode in stage 1
* result.c, result.h: Created Chimera_T object.
* pair.c, pair.h: Added utility programs for chimera evaluation.
* chimera.c, chimera.h: Added search for exon-exon boundaries in chimeras.
Created Chimera_T object.
* Makefile.am, splice-site.c, splice-site.h: Added splice site calculations
to chimera evaluation.
2005-05-24 twu
* gmap.c: Moved translate calls up to gmap.c. Added hook for -T flag for
truncating full-length sequence.
* stage3.c, stage3.h: Using function Pairpool_transfer_bounded. Moved
translate calls up to gmap.c.
* pairpool.c, pairpool.h: Added function Pairpool_transfer_bounded.
2005-05-20 twu
* VERSION: Revised version
* stage3.c, stage3.h: Turned off default microexon finding at ends. Cleaned
up margin function for identifying chimeras.
* pair.c: Changed computation of matchscores
* oligoindex.c: Changed definition of oligodepth.
* gmap.c: Added -U flag to turn on microexons at ends. Changed code for
chimeras, and changed meaning of -x flag.
* chrsubset.c: Added check on freeing object.
* chimera.c: Fixed debugging statement
* Makefile.am: Added beta testing flag.
2005-05-09 twu
* Makefile.am: Added compiler instructions for pthreads to various programs
* VERSION: Modified version number.
* README: Added information about -E feature of fa_coords and gmap_setup,
and information about editing coords.txt.
* MAINTAINER: Added reminder to modify VERSION.
* gmap_setup.pl.in: Removed reverse complement procedures here; now being
done by gmapindex. Allowed specification of a command.
* fa_coords.pl.in: Introduced chromosome NA for headers that cannot be
parsed. Allowed specification of a command. Improved handling of Celera
genomes.
* genome.c, indexdb.c: Put mutexes around read procedures for the
combination of multi-threading and non-memory mapped reading of file.
* gmap.c: Fixed bug from uninitialized querysubseq.
* pair.c, pair.h, result.c, result.h, stage3.c, stage3.h: Allowed printing
of range of chimera breakpoints
* interval.c, interval.h: Changed interface to some functions
* iit-read.c, iit-write.c: Fixed bug in debug version of dump. Changed
calls to Interval_T functions.
* gmapindex.c: Changed count_sequence() to read a line at a time
* genome-write.c: Properly handling contigs marked as reverse complement.
* gmap.c: Using fscore threshold to determine statistical significance.
Reporting equivalent positions for breakpoint.
* chimera.c, chimera.h: Using fscore threshold to determine statistical
significance
2005-05-06 twu
* gmap_setup.pl.in: Handling other NCBI cases where version numbers are
missing
* genome-write.c, indexdb.c: Minor changes in monitoring output
* VERSION: Updated version number
* README: Added explanation of output ordering with multiple threads
* coords1.test.ok: Changed to add new comment line in coords.txt
* README: Minor textual change
* gmap_setup.pl.in: Added -q flag for specifying indexing interval. Allowed
comment lines to be in coords.txt.
* md_coords.pl.in: Improved messages to user.
* fa_coords.pl.in: Added handling of unmapped contigs for Ensembl genomes.
Improved messages to user. Added check for possible conversions of
alternate chromosomes to alternate strains.
* gmap_uncompress.pl.in: Fixed bug due to old code that referred to the -R
flag
* gmap.c: Enhanced result to show number of matches, mismatches, and indels
in alternative to chimera. Introduced maxpaths of 0 to indicate output of
both paths of chimera if present, otherwise one path.
* pair.c, pair.h, sequence.c, sequence.h, stage3.c, stage3.h: Removed
references to ntrimmed
* result.c, result.h: Enhanced result to show number of matches, mismatches,
and indels in alternative to chimera
* gmap.c: Remove check for badoligos. Modified logic for computing
chimeras. Made calls to initialization and termination routines for
Dynprog_T.
* chimera.c: Fixed memory leak
* pair.c: Removed printing of ntrimmed nucleotides
* stage3.c, stage3.h: Added functions for reporting matches, mismatches,
indels, and margin of a Stage3_T object
* translation.c: Added initial values for translation_start and
translation_end
* stage2.c: Removed computation of stage2 support. Simplified loop
conditions.
* oligoindex.c, oligoindex.h: Removed computation of stage2 support
* dynprog.c, dynprog.h: Replaced functions with arrays for computing
pairdistances and jump penalties
2005-05-05 twu
* oligoindex.c, oligoindex.h: Changed memory allocation scheme, by setting
ALLOCSIZE == MAXHITS. Assigning blocks in ascending order of available
slots. Computing trim_start and trim_end. Reporting support for stage 2.
* gmap.c: Changed calls to Sequence_read(). Using oligomer-based method for
trimming query sequence.
* md5-compute.c, oligo-count.c: Changed calls to Sequence_read()
* sequence.c, sequence.h: Removed poly-A and poly-T detection in favor of
oligomer-based trimming at ends.
* stage2.c, stage2.h: Added check for stage 2 support.
* stage1.c: Restored terminal sampling for short sequences. Fixed potential
bug with subtracting unsigned ints. Enhanced debugging messages.
* md5-compute.c, oligo-count.c, sequence.c, sequence.h: Modified functions
to report next char in input.
* matchpair.c, matchpair.h: Added reporting of stage1 support and stage1
stretch.
* gmap.c: Added checks for bad input sequences based on oligo depth, bad
oligos, stage1 support, and stage2 support. Moved message about batch
mode earlier, if evidence of a second sequence is present.
* gmap.c: Chopping chimeras at breakpoint, and providing a flag to allow
overlaps at the breakpoint.
* stage3.c, stage3.h: Simplified interface to Stage3_copy.
* pair.c, pair.h: Removed coverage correction for genomic gaps. Added way
to turn off merge_gaps during copying of pairs.
2005-05-04 twu
* stage2.c: Made changes in individual instructions to improve speed
* oligoindex.c: Added overabundant field
* chimera.c: Speeded up computation
* gmap.c: Using explicit step for marking oligos in the query. Terminating
attempt at mapping if oligo depth exceeds 2. Fixed memory leak.
* stage2.c, stage2.h: The variable badsequencep is now fed into
Stage2_compute.
* stage1.c: Killed terminal sampling for short sequences. Reduced values
for maxentries. Both done to improve speed.
* oligoindex.c, oligoindex.h: Added an explicit step for marking oligos in
the query, which needs to be done only once for each query sequence.
* chimera.h: Added computation of margin.
* chimera.c: Added computation of margin. Improved debugging output.
* gmap.c: Fixed bug where bestfrom == bestto. Added check for sufficient
margin at ends before finding chimera.
* gmap_compress.pl.in: Changed compression routine to handle chimera
information
* chrsubset.c: Fixed bug where stdin was closed if .chrsubset file didn't
exist
* stage3.h: Added function to compute matchscores for chimera detection.
* stage3.c: Changed calls to Sequence_T functions. Added function to
compute matchscores for chimera detection.
* stage2.c: Performing Stage 2 from trim start to trim end, instead of
entire sequence. Changed calls to Sequence_T functions.
* stage1.c: Changed calls to Sequence_T and Reader_T functions
* sequence.c, sequence.h: Cleaned up interface. Added ability to print
trimmed part of sequence.
* Makefile.am, chimera.c, chimera.h, gmap.c, nmath.c, nmath.h, pair.c,
pair.h: Added chimera detection based on Chow test
* md5-compute.c: Changed call to Sequence_T function. Using full sequence
now for MD5 computation.
* matchpair.c: Removed call to Sequence_T function
* oligoindex.c: Changed calls to Sequence_T functions
* oligo-count.c: Changed call to Reader_new
* get-genome.c: Changed call to Sequence_print
* reader.c, reader.h: Storing querystart and queryend in Reader_T object
* block.c, block.h: Removed unnecessary field
2005-05-03 twu
* gmap.c, gmapindex.c, indexdb.c, indexdb.h: Allowed indexing interval of
12-mers to be specified at run time
* configure.ac: Added check for madvise function
* README: Added Ensembl format as a recognized coordinate format
* md_coords.pl.in: Improved prompt for alternate chromosomes
* genome.c, iit-read.c, indexdb.c: Put compiler flags around madvise
* datadir.c: Deleted line that was causing problems when the GMAPDB
environment variable was set
2005-05-01 twu
* fa_coords.pl.in: Further fixed coordinates
* fa_coords.pl.in: Removed addition of 1 to coordinates. Added parsing for
Ensembl format.
* gmap_setup.pl.in: Testing accessions with and without version numbers
* md_coords.pl.in: Making -U and -A flags standard. Can exclude unmapped
contigs and alternate chromosomes with chrsubsets.
* md_coords.pl.in: Fixed case where direction eq "0".
* oligoindex.c: Modified memory allocation scheme to have a fixed block of
memory that expands when necessary.
* iit_get.c: Added -A back to allowed flags.
* chrsubset.c: Added debug statements
* VERSION: Updated version
2005-04-20 twu
* sequence.c: Kept poly-A and poly-T limits when specifying subsequences.
* pair.c: Added an exception handler. Removed minor bug where first pair
was handled twice.
* gmapindex.c: Allowed compress and uncompress routines to take a filename
as an argument. Added wraplength option for uncompress.
* gmap.c: Fixed bug in specifying wrong sequence length for computing
chimeras. Removed limit on number of paths for finding chimeras. Added
exception handler.
* except.c: Modified behavior of exception handler
* genuncompress.c: Fixed problem if positions were greater than allowed for
signed ints.
* compress.c, compress.h: Added wraplength option to Compress_uncompress.
2005-04-19 twu
* sequence.c, stage1.c: Added checks for null before freeing memory.
* gmap.c: Made IIT_get return an array of ints, rather than an Intlist, to
reduce repeated small memory allocations. Placed a limit on npaths for
finding chimeras.
* get-genome.c, iit-read.c, iit-read.h, iit_get.c, segmentpos.c, stage3.c:
Made IIT_get return an array of ints, rather than an Intlist, to reduce
repeated small memory allocations.
* mem.c: Added debugging statements.
2005-04-18 twu
* dynprog.c: Added memory allocation routines in cases where problem size
exceeds maxlength of Dynprog_T. Removed unused code for affine gap
penalties.
2005-04-12 vivekr
* cvswrappers: Added binary extensions
2005-03-11 twu
* gmap_setup.pl.in, md_coords.pl.in: Allowed for contigs to be reverse
complement
* fa_coords.pl.in: Removed unused functions
* gmap.c, get-genome.c: Moved dump functions to get-genome
* segmentpos.c: Fixed bug when alternate strain contig exists but reference
is to reference strain
* get-genome.c, iit-read.c, iit-read.h: Changed output of dump functions
* README: Added instructions for specifying reverse coordinates
* VERSION: Changed version number
2005-03-09 twu
* gmapindex.c, iit-read.c: Now storing information about reverse
complementing of contigs
* match.c, pair.c, pair.h, segmentpos.c, segmentpos.h, stage3.c: Limited
printing of contigs to those that are relevant for a given strain.
* get-genome.c, gmap.c: Fixed bug when using the -R release flag.
2005-03-08 twu
* get-genome.c: Changed default behavior to print just the reference strain.
Added a flag to print all strains.
2005-03-04 twu
* chrsubset.c: Fixed minor memory leak
* VERSION: Updated version
* README: Added explanation of chromosome subsets
* chrsubset.c: Changed Chrsubset_T object to be NULL when a blank list is
read in .chrsubset file.
* gmap.c: Incorporated chrsubset. Fixed printing of option flags.
* gmap_setup.pl.in: Added creation of chrsubset file
* whats_on: Changed directories where genomic maps are located
* Makefile.am, chrsubset.c, chrsubset.h, params.c, params.h, stage1.c,
stage1.h: Added capability to search on chromosome subsets
* separator.h: Changed separator back to dashes
* iit-read.c: Changed format of dumping typestrings for .altstrain.type file.
* gmapindex.c: Added writing of .altstrain.type file.
* stage1.c: Removed unused code. Using stage1size instead of INDEX1PART in
some places.
* gmap.c: Added error message.
* datadir.c: Removed unused error message.
2005-03-03 twu
* stage1.c: Introduced idea of dangling matches at ends, and using it to
determine when to sample further at each end, and when to sample from the
middle.
2005-03-02 twu
* separator.h: Changed separator from dashes to dots.
* stage1.c: Fixed a bug in find_3prime_matches. Changed sampling to avoid
terminal sampling, and to redo sampling just before nskip is zero. This is
done to avoid long computation times with terminal sampling on long cDNAs.
2005-03-01 twu
* matchpair.c, matchpair.h: Added a boundmethod type.
* stage1.c: Added code for finding matches using triplets, but not using it.
Removing terminal sampling, and performing a redo of last sampling instead.
2005-02-18 twu
* params.c, params.h: Removed maxintronlen from the params structure.
* gmap.c: Increased default maxintronlen to 1.2M, and provided a flag to
allow user to change this value.
* perl.m4, configure.ac: Changed name of macro
* configure.ac: Added check for Perl with needed modules. Added warning
messages to bottom of configure script.
* config.site: Added option for user to specify a value for PERL
* acinclude.m4: Added check for Perl with needed modules
* perl.m4: Added check for Perl with appropriate modules
* VERSION: Set version number
* COPYING, config.site: Changed wording slightly
* README: Removed optional comment after make check
2005-02-17 twu
* datadir.c, gmap_setup.pl.in: Allowed subdirectory to be present in the -d
flag
* config.site: Fixed advice on installing in build directory
* README: Fixed some textual errors
2005-02-16 twu
* gmap_setup.pl.in: Modified instruction text
* md_coords.pl.in: Added guessing of columns
* genome.c, genuncompress.c, iit-read.c, indexdb.c: Added type cast to avoid
compiler warnings for munmap.
* configure.ac: Removed capitalization
* VERSION: Updated version
* configure.ac: Capitalized message when compilation of pthreads fails
* Makefile.am: Added subdirectories
* iit_get.out.ok, iittest.iit.ok: Added okay files for IIT programs
* AUTHORS: Minor text change
* gmap_setup.pl.in: Changed usage statement
* acx_pthread.m4: Updated macro to latest version
* configure.ac: Added tests for IIT programs. Changed call to ACX_PTHREAD.
* config.site.gne: Changed name from genomedir to gmapdb
* config.site: Added lines for PTHREAD_CFLAGS and PTHREAD_LIBS
* MAINTAINER: Added instructions for building .ok files for tests
* align.test.in, coords1.test.in, map.test.in, setup1.test.in,
setup2.test.in: Added ${srcdir} where necessary to make distcheck happy
* Makefile.am, fa.iittest, iit.test.in, iit_dump.test.in, iit_get.test.in,
iit_store.test.in: Added tests for IIT programs
* gmap.c: Changed ENABLE_PTHREADS to HAVE_PTHREAD. Added reporting of
features to version command.
* blackboard.c, reqpost.c: Changed ENABLE_PTHREADS to HAVE_PTHREAD
* iit_store.c: Changed flags and calling convention
* Makefile.am: Removed ENABLE_PTHREADS and POPT_LIBS.
* acinclude.m4, acx-pthread.m4, acx_pthread.m4: Changed name of file
* README: Completed instructions
* COPYING: Completed license terms
* acinclude.m4, config, acx-pthread.m4, expand.m4, mmap-flags.m4,
pagesize.m4: Put m4 macros into separate files
* configure.ac: Commented out code for AC_PROG_LIBTOOL. Added some compiler
checks.
2005-02-15 twu
* gmap_setup.pl.in: Removed IO::Dir. Changed behavior if -I flag not given.
Added -9 for debugging behavior.
* fa_coords.pl.in, md_coords.pl.in: Removed IO::Dir
* iit-read.h, iit-write.h: Fixed compiler complaint about double typedef for
IIT_T
* iit-read.c: Fixed one-off problem with IIT_totallength.
* genome-write.c: Fixed montoring statements.
* gmap.c: Put pthreads information in version text.
* gmapindex.c: Fixed problem in comparing an int (255) with EOF (-1) on some
machines.
* Makefile.am, align.test.in, align.test.ok, coords1.test.in,
coords1.test.ok, map.test.in, map.test.ok, setup.genomecomp.ok,
setup.idxpositions.ok, setup1.test.in, setup2.test.in, ss.chr17test:
Expanded test suite
* Makefile.am, ss.cdna, ss.chr17test, ss.her2: Initial addition to CVS
repository.
2005-02-14 twu
* gmap_setup.pl.in, md_coords.pl.in: Moved functionality to separate
md_coords program
* Makefile.am: Added fa_coords program
* fa_coords.pl.in: Added file to CVS repository.
* block.c, block.h, compress.c, dynprog.c, dynprog.h, genome-write.c,
iit-read.c, iit-write.c, indexdb.c, interval.c, intron.c, match.c,
match.h, matchpair.c, matchpair.h, md5.c, md5.h, md5.t.c, oligo.c,
oligo.h, pair.h, pairpool.c, pairpool.h, reader.c, request.c, result.h,
segmentpos.c, segmentpos.h, smooth.c, smooth.h, stage1.c, stage3.c,
stopwatch.c, translation.h: Cleaned up included headers
* table.c, table.h, tableint.c, tableint.h, chrom.c, chrom.h: Clarified
meaning of unsigned type.
* reqpost.h: Using Blackboard_T in interface.
* oligo-count.c: Fixed call to Block_new.
* listdef.h: Added a define for T.
* iitdef.h: Moved typedef to iit-read.h and iit-write.h.
* iit_get.c: Removed popt library calls.
* iit-read.h, iit-write.h: Moved include of iitdef.h to .c files.
* get-genome.c: Using SEPARATOR now instead of DASH.
* datadir.c, datadir.h: Formatting changes.
* gmap.c, oligoindex.c, oligoindex.h, params.c, params.h, stage2.c,
stage2.h: Moved get_mappings command to be in oligoindex.c. Moved
indexsize to be stored in Params_T.
* complement.c, complement.h, genome.c, pair.c, sequence.c, translation.c:
Changed complement table to be a macro.
* blackboard.h: Added comments about include of reqpost.h.
* Makefile.am: Cleaned up source files needed for each binary.
* shortoligomer.h: Removed file. Definition needed only by oligoindex.c.
* bigendian.h, genuncompress.c, iit-write.c, littleendian.h: Conditionally
include littleendian.h.
* iit-read.h: Added function to compute total length.
* iit-read.c: Conditionally include littleendian.h. Added function to
compute total length.
* indexdb.h: Allow user to force building of positions file in file.
* indexdb.c: Conditionally include littleendian.h. Allow user to force
building of positions file in file.
* genome-write.c: Added explanation of file format.
* genome.c: Changed type from unsigned int to UINT4. Conditionally include
littleendian.h.
* compress.c, compress.h: Added ability to create genome file in memory, if
enough is available. Changed type from unsigned int to UINT4.
* Makefile.am, genome-write.c, genome-write.h, gmapindex.c: Moved procedures
for writing genome file to a new file. Added ability to create genome
file in memory, if enough is available.
2005-02-10 twu
* iit_get.c: Added include for strings.h to handle rindex.
* bigendian.h, genome.c, genuncompress.c, indexdb.c, sequence.c: Added
includes for stddef.h to handle size_t
* genome.c, genuncompress.c, iit-read.c, indexdb.c: Added check for
HAVE_SYS_STAT_H
* gmap.c, gmapindex.c, oligo-count.c: Removed include of sys/stat.h
* iit-read.c: Commented out include of sys/param.h
* genome.c, indexdb.c: Commented out include of errno.h
* except.c: Removed code for mailing error messages to developer.
* genome.c, genuncompress.c, gmapindex.c, iit-read.c, iit_store.c,
indexdb.c, md5-compute.c, stopwatch.c: Added checks for HAVE_UNISTD_H and
HAVE_FCNTL_H.
* blackboard.c, compress.c, datadir.c, genome.c, genuncompress.c, gmap.c,
gmapindex.c, iit-read.c, iit_store.c, indexdb.c, oligo-count.c, reqpost.c,
stopwatch.c: Added check for HAVE_SYS_TYPES_H
* genome.c, genomicpos.c, iit-write.c, indexdb.c, match.c, md5.c,
oligoindex.c, pair.c, sequence.c: Created separate macros for handling
absence of memcpy and memmove.
* genome.c, genomicpos.c, iit-write.c, indexdb.c, match.c, md5.c,
oligoindex.c, pair.c, sequence.c: Included macros for handling computers
without memcpy or memmove.
* datadir.c: Included macros for handling computers without dirent.h.
2005-02-10 jmurray
* cvswrappers: Added binary extensions
2005-02-07 twu
* chimera.c, translation.c: Fixed rcsid lines
* Makefile.am, uinttable.c, uinttable.h: Removed files uinttable.c and
uinttable.h
* bigendian.c: Added ending quotation mark to rcsid.
* bigendian.h, chimera.h, scores.h, separator.h: Added Id comment to
beginning of header files.
* assert.c, assert.h, bigendian.c, bigendian.h, blackboard.c, blackboard.h,
block.c, block.h, boyer-moore.c, chimera.c, chrnum.c, chrnum.h, chrom.c,
chrom.h, complement.c, complement.h, compress.c, datadir.h, dynprog.c,
dynprog.h, except.c, except.h, genome.c, genome.h, genomicpos.c,
genomicpos.h, get-genome.c, gmap.c, gmapindex.c, iit-read.c, iit-read.h,
iit-write.c, iit-write.h, iit_dump.c, iit_get.c, iit_store.c, indexdb.c,
indexdb.h, interval.c, interval.h, intlist.c, intlist.h, intron.c,
intron.h, list.c, list.h, match.c, match.h, matchpair.c, matchpair.h,
md5-compute.c, md5.c, md5.h, mem.c, mem.h, mutation.c, mutation.h,
oligo-count.c, oligo.c, oligo.h, oligoindex.c, oligoindex.h, pair.c,
pair.h, pairpool.c, pairpool.h, params.c, params.h, reader.c, reader.h,
reqpost.c, reqpost.h, request.c, request.h, result.c, result.h,
segmentpos.c, segmentpos.h, sequence.c, sequence.h, smooth.c, smooth.h,
stage1.c, stage1.h, stage2.c, stage2.h, stage3.c, stage3.h, stopwatch.c,
stopwatch.h, table.c, table.h, tableint.c, tableint.h, translation.c,
translation.h, uintlist.c, uintlist.h, uinttable.c, uinttable.h: Moved
HAVE_CONFIG_H from .h file to .c file.
* datadir.c: Added check to see if closedir succeeded.
* Makefile.am: Augmented list of bin programs.
* get-genome.c, match.c, pair.c, pair.h, stage3.c, stage3.h: Changing
variable names to genomesubdir, fileroot, and dbversion.
* gmap.c: Added -g flag. Changing variable names to genomesubdir, fileroot,
and dbversion.
* params.c, params.h: Made dbversion a static variable.
* genome.c, genome.h, indexdb.c, indexdb.h: Changing variable names to
genomesubdir and fileroot.
* datadir.c, datadir.h: Now searching subdirectory to find name of fileroot,
which can be different from subdirectory name.
* pair.c: Removed unnecessary math.h header. Added initialization of donor
and acceptor arrays.
* getopt.c: Removed internationalization code.
* gmap.c: Removed unnecessary math.h header. Changed location of map
directory for each genome.
* matchpair.c, oligoindex.c, segmentpos.c, smooth.c, stage3.c: Removed
unnecessary math.h header.
* indexdb.c, indexdb.h: Allowed user to build positions file directly to
disk, if sufficient memory is unavailable.
* mem.c, mem.h: Added procedures for allocating memory without throwing an
exception.
* gmapindex.c: Changed flags. Allowed user to build positions file directly
to disk, if sufficient memory is unavailable.
* chrom.c: Eliminated printing of initial zero on non-numeric chromosomes.
2005-02-03 twu
* gmap_setup.pl.in: Removed -R flag, and symbolic links. Fixed problems
with parsing unmapped contigs in seq_contig.md files.
* gmapindex.c: Added debugging statements.
2005-01-27 twu
* config.site: Added warning about non-absolute paths.
* README: Added comments about downloading a genome database.
* Makefile.am: Added extra commands for "make distcheck" to be happy.
Removed genome example.
* MAINTAINER: Added comment about --enable-fulldist
2005-01-26 twu
* config.site, configure.ac, Makefile.am, datadir.c: Changed GENOMEDIR to
GMAPDB.
2005-01-25 twu
* ss.AA005326, ss.cdna: Changed name of example cDNA sequence.
2005-01-24 twu
* MAINTAINER: Added recommended steps for creating a distribution.
* Makefile.am, chrnum.c, chrnum.h, chrom.c, chrom.h, genome.c, genome.h,
get-genome.c, gmap.c, gmapindex.c, match.c, match.h, matchdef.h,
matchpair.c, pair.c, pair.h, segmentpos.c, segmentpos.h, stage1.c,
stage3.c, stage3.h: Made changes to allow chromosome names to be
arbitrarily long
* gmap_setup.pl.in: Removed restriction on chromosome name length. Stripped
spaces from beginning and end of input. Added step to create initial
genomedir.
* config.site: Changed defaults in config.site.
* getopt.c, getopt.h, getopt1.c: Added gnu getopt_long function
* tests, ss.AA005326: Added test sequence.
* Makefile.am: Created Makefile.am in util subdirectory
* gmap_setup.pl.in: Fixed bug due to missing quotation mark
* configure.ac: Removed dependence upon popt library
* Makefile.am, get-genome.c, gmap.c: Added gnu getopt_long procedure
* README: Changed prerequisites. Improved formatting.
2005-01-23 twu
* gmap_setup.pl.in: Added procedures for handling UCSC genomes.
* iit_store.c: Using Tableint_T instead of Table_T for types.
* Makefile.am: Removed some unnecessary source files.
* configure.ac: Added ACX_EXPAND, turned off popt, and fixed problem when no
threads compilation is possible.
* config.site.gne: Added comments for profiling and making .third file.
* acinclude.m4: Added macro for ACX_EXPAND.
* README: Added mention of examples and make check.
* Makefile.am: Added extra dist files for examples.
* oligoindex.c: Created a union type to make clear the possible storage of
either a position or a pointer to an array positions.
* datadir.c: Removed unused function.
* gmapindex.c, table.c, table.h, tableint.c, tableint.h, uinttable.c,
uinttable.h: Added an end value to avoid problems when table length is 0.
* Makefile.am, tableint.c, tableint.h, uinttable.c, uinttable.h: Made
specific table types.
* gmap.c: Removed duplicate getopt line.
* iit_get.c: Fixed compilation bug when popt not available.
* gmapindex.c: Used specific table types and keys/values functions.
* table.c, table.h: Made functions Table_keys and Table_values
* gmap_uncompress.pl.in: Using BINDIR for substitution.
* Makefile.am: Removed Makefile.am
* gmap_setup.pl.in: Major changes made to provide both interactive and
command-line use.
2005-01-22 twu
* configure.ac: Allowed hyphens to be in the version number
* MAINTAINER, bootstrap, config.site, config.site.gne: Added local
config.site to CVS directory
* MAINTAINER: Added notes for maintainer
* README: Simplifying the installation instructions
* configure.ac: Made configuration easier by adding VERSION and config.site
files. Removed MAPDIR. Added Perl scripts.
* VERSION, config.site: Made configuration easier by adding VERSION and
config.site files.
* gmap_compress.pl.in, gmap_uncompress.pl.in: Changed file from .pl version
to .pl.in version.
* Makefile.am: Moved Perl scripts to util subdirectory.
* datadir.c, datadir.h, gmap.c: Moved map files to a subdirectory in genome
directory.
* gmapsetup.pl.in: Moved file to util subdirectory.
* whats_on: Changed location of map files to be inside genome directories.
* gmap_compress.pl, gmap_uncompress.pl: Changing scripts from .pl to .pl.in
version
* README, configure.ac, Makefile.am, compress.c, datadir.c, genome.c,
get-genome.c, gmap.c, gmapsetup.pl.in, iit-read.c, indexdb.c,
segmentpos.c, gmap_compress.pl, gmap_compress.pl.in, gmap_setup.pl.in,
snap.c, snapbuild.pl.in, snapindex.c, snap_compress.pl,
snap_uncompress.pl: Renamed program from snap to gmap
* gmap_compress.pl, gmap_compress.pl.in, snap_compress.pl: Better handling
of MD5 info and aa lines.
* gmap_uncompress.pl, gmap_uncompress.pl.in, snap_uncompress.pl: Handling
arbitrary flags in compression.
* mutation.c, mutation.h, pair.c, pair.h, pairdef.h, translation.c: Added
refquerypos to print nucleotide position of mutations.
* get-genome.c: Fixed problem with empty header for reference sequence when
specific strain is requested.
2005-01-19 twu
* translation.c: Fixed problem with printing of an AA in an intron.
* mutation.c: Consolidated point mutations near a segmental mutation.
2005-01-06 twu
* translation.c: Fixed detection of deletion mutations where aapos was
advancing in a gap.
* mutation.c, mutation.h, translation.c: Fixed cases where a single-position
mutation was reported next to a segmental mutation.
* stage3.c: Added debugging statements for relative alignment.
2005-01-05 twu
* translation.c: Allowed lower case letters to translate appropriately to a
codon.
2004-12-21 twu
* stage3.c: Performing microexon search for all defect rates. Adjusted
acceptable mismatches for low-quality sequences.
2004-12-20 twu
* translation.c: Increased IGNORE_MARGIN to deal with nucleotide coordinates.
* stage3.c: Changed criteria for performing microexon search.
* translation.c, translation.h: Fixed detection of large deletions relative
to reference sequence. Fixed printing of cDNA aa in a gap.
* stage3.c: Changed criterion for starting microexon search to add
mismatches and indels. Fixed detection of large deletions relative to
reference sequence.
* gmap.c, snap.c: Set chimera threshold to 0 for default. Reduced band from
10 to 7.
* dynprog.c: Reduced pvalue thresholds for microexons.
2004-12-19 twu
* gmap.c, snap.c: Turned on chimera functionality. Increased dynamic
programming band from 7 to 10.
* stage1.c: Changed function for maxintronlen.
* smooth.c: Increased SHORTMIDEXON_LEN from 40 to 80.
* dynprog.c: Removed definition for INFINITY, which wasn't being used.
2004-12-18 twu
* stage2.c: Created define parameter SAMPLE_INTERVAL.
* gmap.c, snap.c: Change maxintronlen to be maxintronlen_bound, and compute
new maxintronlen depending on current query length. Increased size of
extraband_single and extraband_paired.
* params.c, params.h, stage1.c, stage1.h: Change maxintronlen to be
maxintronlen_bound, and compute new maxintronlen depending on current
query length.
* dynprog.c: Changed compute_scores_affine to have parameter list compatible
with compute_scores (with codon penalty).
* stage3.c: Subtracting points for non-canonical introns in determining
direction. Doing middle introns of sequence before doing 3' and 5' ends.
* dynprog.c: Increased pvalue thresholds.
2004-12-13 twu
* gmap.c, pair.c, pair.h, snap.c, stage3.h: Added function for printing cDNA
exons.
* stage3.c: Fixed problem where we shouldn't perform single-gap dynamic
programming because unable to peel forward and peel back.
* translation.c: Created separate mutation types for substitution,
insertion, and deletion. Allowed filling in of last amino acid.
* mutation.c, mutation.h: Created separate mutation types for substitution,
insertion, and deletion.
2004-12-09 twu
* stage3.c: Changed calls to Translate module.
* translation.c: Simplified code for computing protein bounds. Handled the
case where full length is specified, but no full length protein exists.
* mutation.c, mutation.h: Added procedures for handling multiple insertions
and deletions.
* translation.c, translation.h: Changed algorithm for translate_est_forward
and translate_est_backward.
2004-12-08 twu
* translation.c, translation.h: Changed algorithms for translate_est_forward
and translation_est_backward. Added printing of nucleotide differences.
* gmap.c, snap.c, stage3.c, stage3.h: Added options for printing either
genomic or cDNA version of protein.
* pair.c, pair.h: Added function Pair_dump_aapos.
* mutation.c, mutation.h: Added functions for retrieving amino acids from
mutation.
* dynprog.c: Added slight penalty against gaps next to an intron.
2004-12-05 twu
* Makefile.am, mutation.c, mutation.h, stage3.c, translation.c,
translation.h: Simplified computation of translations and mutations.
* pair.c, pair.h, pairdef.h, pairpool.c: Now printing both genomic and cDNA
proteins.
2004-12-02 twu
* stage3.h: Removed unused chimera code.
* stage3.c: Removed unused chimera code. Changed criteria for finding
microexons at end; now performed only when extension is poor and sequence
quality is high.
* dynprog.h, gmap.c, snap.c: Allowed user option to extend alignment past
last match.
* dynprog.c: Fixed bug in adding gap to replace dashes.
* pairpool.c: Added debugging statement for creation of pairs.
* smooth.c: Added check for negative exon length.
2004-11-29 twu
* dynprog.c, dynprog.h: Added symbols for an intron if applicable to a large
horizontal jump. Increased maximum microexon size.
* stage3.c: Added peel_back and peel_forward to 5' and 3' ends before doing
search for microexons.
2004-11-22 twu
* scores.h, stage3.c: Added credit for dual half-canonical introns.
* stage1.c, stage1.h: Removed unused code.
* dynprog.c: Added parameters for PVALUE for microexon and end exon searches.
2004-11-18 twu
* result.h: Changed interface for Result_new to match implementation.
* Makefile.am: Added scores.h to Makefile.am.
2004-11-15 twu
* stage3.c, stage3.h: Commented out code for extending pairs in a chimera.
* gmap.c, snap.c: Fixed problem in rearranging best two paths for chimera.
* pair.c: Stopped printing of the terminal amino acid '*'.
* genome.c, indexdb.c: Added printing of a dot every 10000 pages.
2004-10-12 twu
* gmap.c, snap.c: For chimeras that extend too long, now chopping off the
extra part.
* stage3.c, stage3.h: Added procedure for doing a bounded copy of a Stage 3
object.
* pairpool.c, pairpool.h: Added procedure for doing a bounded copy of a path.
2004-10-06 twu
* Makefile.am, chimera.c, chimera.h, gmap.c, snap.c: Changed procedure for
chimeras to find best pair and to order the chimeras according to query
sequence.
* scores.h: Moved scores for determining goodness into a separate file.
* stage3.c, stage3.h: Added procedure for copying a Stage3 object.
* result.c, result.h: Changed chimera information to be a position, rather
than a boolean.
* dynprog.c: Added code for allowing right angles, but not using at present.
* pairpool.c: Changed print statement to work only in debug mode.
* pair.c, pair.h: Added procedure for computing scores along a path.
2004-09-30 twu
* gmap.c, match.c, pair.c, pair.h, sequence.c, sequence.h, snap.c, stage3.c,
stage3.h: Added MD5 checksum for compressed output.
* stage1.c: Added notation about using position for revcomp matches in IITs.
* iit-read.c: Changed debugging statements to print unsigned ints.
2004-09-27 twu
* stage3.c: Fixed problem when peeling an extra pair if it's a gap.
2004-09-09 twu
* stage3.c: Restored behavior of crossing just one short exon for dual
genome gap.
* stage3.c: Peeled back one more matching pair. For dual intron gap, now
skipping multiple short exons and keeping the longest one.
* gmap.c, snap.c: Increased maxpeelback from 10 to 11.
* pairpool.c, pairpool.h: Added command Pairpool_transfer_copy, although not
currently used.
* pair.c, pair.h: Added command Pair_check_list.
* dynprog.c: Added end reward for bridging a cDNA gap.
* md5-compute.c: Changed behavior from a single sequence to a FASTA file of
multip[le sequences.
* Makefile.am: Added object file for md5-compute.
2004-09-02 twu
* stage3.c: Fixed floating exception bug when middle_exonlength is
non-positive.
* stage2.c: Fixed problem of reading uninitialized value.
2004-08-30 twu
* dynprog.c: Added check for non-positive span.
2004-07-28 twu
* stage2.c: Changed some penalties. Using bad sequence information to
increase lookback.
* oligoindex.c, oligoindex.h: Added check for bad sequences (with several
non-ACGTN characters).
* dynprog.c: Added check for zero span.
2004-06-25 twu
* gmap.c, snap.c: Added flag to search only reference strain.
* stage2.c: Increased definition of ENOUGH_CONSECUTIVE. Added penalties for
deadp.
* stage3.c: Penalizing noncanonical introns in comparing across different
paths.
* segmentpos.c: Changed output of contig length.
* pair.c, pair.h: Reporting number of noncanonical introns. Allowing
goodness to be reported during debugging.
2004-06-23 jtang
* cvswrappers: Added binary extensions
2004-06-20 twu
* stage2.c: Simplified decision making for mismatch gaps. Increased
penalties on gendistance and querydistance.
2004-06-16 twu
* whats_on: Added get_sequences function.
* gmap_uncompress.pl, gmap_uncompress.pl.in, snap_uncompress.pl: Implemented
code to interpret new compression scheme.
* stage2.c: Changed penalty functions into macros for speed. Made some
other changes to improve speed.
* dynprog.c: Fixed bug involving uninitialized variable.
2004-06-15 twu
* stage2.c: Penalizing intron length per 2000 nt instead of 1000.
* stage3.c: Using goodness scores to decide between single and dual introns.
Searching for microexons only when sequence quality is medium to high.
* dynprog.c, dynprog.h: Reporting nopens and nindels from Dynprog_genome_gap.
2004-06-12 twu
* stage3.c: Added evaluation of middle exon length in deciding between
single and dual introns.
* stage2.c: Reduced size of INTRON_DEFN. Further penalized large query
distances.
* stage1.c: Increased length of additional ends of genome segment.
* oligoindex.c: Improved debugging statements.
* dynprog.c, dynprog.h: Allowing up to 1 mismatch on either side for
microexon search. Reporting position of exonhead in Dynprog_genome_gap for
use in traversing dual genome gap.
2004-06-10 twu
* stage2.c: Added procedure to determine maximum intron length at a given
querypos, and based penalties to be linear with that.
* smooth.c: Added check for nullness of intronlengths.
* dynprog.c: Added probabilistic check on microexon length for a given
genomic span.
2004-06-09 twu
* pair.c, pair.h, stage3.c: Keeping track of semicanonical introns and
scoring them to decide on strand.
* smooth.c: Made decision about deleting end exons based on probability.
2004-06-08 twu
* get-genome.c: Fixed a bug in determining whether a query is a range.
2004-06-07 twu
* stage3.c: Added debugging statement for microexons.
* dynprog.c: Added minimum length for introns when looking for microexons.
* stage2.c: Changed penalties to be more consistent on mismatches between
different conditions, including deadp. For deadp, now requiring that
abs(gendistance - querydistance) or querydistance be less than INTRON_DEFN.
* smooth.c: Increased threshold on ends to be 20.
* pair.c: For determining fracidentity (and selecting between forward and
reverse strands), now counting semicanonical introns as canonical.
* stage2.c: For deadp, increased lookback.
* gmap.c, snap.c: Increased maxintronlen from 1 million bp to 2 million bp.
Motivated by HER4 (NM_005235).
* gmap.c, snap.c: Increased nullgap from 80 to 600.
* stage2.c: Modified stage 2 scoring for mismatch alignments. Invoked deadp
when fwd or rev score is zero.
2004-06-04 twu
* translation.c: Further fixed the bug involving uninitialized heap
(translation_start/translation_end extending beyond sequence boundaries).
* stage2.c, stage2.h: Rewrote code into separate procedure. Increased
gendistance penalty. Changed penalties when querypos is dead.
* gmap.c, snap.c: Created separate parameters for extraband_end and
extraband_paired. Renamed maxlookback to nullgap. Created nsufflookback
parameter. Removed repetition of stage 2.
* params.c, params.h, stage3.c, stage3.h: Created separate parameters for
extraband_end and extraband_paired. Renamed maxlookback to nullgap.
Created nsufflookback parameter.
* dynprog.h: Created separate parameters for extraband_end and
extraband_paired.
* dynprog.c: Created separate parameters for extraband_end and
extraband_paired. Extending last nucleotide at ends if possible. Removing
gaps at ends.
* translation.c: Fixed a bug involving reading/writing of uninitialized heap.
2004-06-02 twu
* stage3.c: Doubled intron space required for a paired gap solution to be
attempted.
* dynprog.c: Implemented gap penalties that are non-affine, with extensions
being the same within a codon.
* translation.c: Fixed bug where codon was assigned improperly at a cDNA gap.
* dynprog.c, dynprog.h, stage3.c: Added a conservative search for microexons
at the 5' and 3' ends.
* smooth.c: Increased pruning of ends from 8 back to 16.
2004-06-01 twu
* stage2.c, stage2.h: Added Stage2_pathlength function.
* gmap.c, snap.c: ncreased maxpeelback from 8 to 10 and allowed program to
redo stage2 with increased suflookback if cDNA not covered.
* stage3.h: Changed MININTRONLEN from 9 to 6 and moved definition into .c
file.
* stage3.c: Made search for microexon dependent on number of mismatches.
* dynprog.c, dynprog.h: Made Dynprog_genome_gap return number of matches and
mismatches.
2004-05-26 twu
* dynprog.c, dynprog.h, stage3.c: Made microexon search work in reverse
direction. Fixed memory leak.
* boyer-moore.h: Added RCS Id.
* boyer-moore.c: Removed debugging statement. Added RCS Id.
* Makefile.am, boyer-moore.c, boyer-moore.h, dynprog.c, dynprog.h, stage3.c:
Added procedure for finding microexons. Works for forward direction only.
* stage3.c: Increased goodness score for canonical intron when deciding
between forward and reverse directions.
* sequence.c: Fixed read procedure to handle PC line feeds.
* dynprog.c: Changed end extension to allow one gap and to proceed if number
of matches is greater than or equal to number of mismatches.
* gmap.c, snap.c, stage3.c, stage3.h, translation.c, translation.h: Added
option for assuming a full-length sequence.
* pair.c: Changed printing of protein coordinates to correspond to first
amino acid on each line.
* gmap.c, snap.c: Added ability to print protein sequence. Fixed some flags.
* translation.c, translation.h: Fixed calculation of translation coordinates.
* pair.c, pair.h, stage3.c, stage3.h: Added printing of protein coordinates.
* indexdb.c: Revised monitoring statement.
* dynprog.c: Revised extensions of 5' and 3' ends to use best score with no
gap, even if negative. This extends ends when there is one match and one
mismatch.
2004-05-14 vivekr
* cvswrappers: Added binary extensions
2004-05-05 twu
* compress.c, gmap.c, indexdb.c, pair.c, snap.c, stage3.c, stage3.h,
translation.c, translation.h: Made improvements to relative translation
routines.
* Makefile.am, compress.c, compress.h, gmapindex.c, indexdb.c, snapindex.c:
Moved compress and uncompress routines to a new file.
2004-04-22 twu
* translation.c: Fixed frameshift-tolerant protein computation for cases
where cDNA deletion is 3 or more nt.
* gmap.c, snap.c, stage3.c, stage3.h, translation.c, translation.h: Added
feature for fixing frameshifts in reference-based protein computation and
made it the default.
* stage3.c, translation.c, translation.h: Changed internal data format for
calculating translations.
* translation.c: Fixed array bounds bug in translating from reference.
* whats_on: Added flag for showing original headers.
2004-04-21 twu
* gmap.c, pair.c, pair.h, pairdef.h, pairpool.c, params.c, params.h, snap.c,
stage3.c, stage3.h, translation.c, translation.h: Added protein
calculation for ESTs based on a reference mRNA.
2004-04-19 twu
* gmap.c, pair.c, pair.h, params.c, params.h, snap.c, stage3.c, stage3.h,
translation.c, translation.h: Changes to allow calculation of mutation
effect given a specific mutation
2004-04-18 twu
* genome.c, genome.h: Fixed code for patching strains.
* genomicpos.c: Cleaned up code for adding commas.
* stage1.c: Changed variable name from stutter to stutterdist.
* gmap.c, snap.c: Added internal flag to control strain searching feature.
2004-04-17 twu
* whats_on: Added ability to print align.iit files, rather than map.iit
files.
* gmap_uncompress.pl, gmap_uncompress.pl.in, snap_uncompress.pl: Added
inversion mode.
* gmap_compress.pl, gmap_compress.pl.in, snap_compress.pl: Added code to
skip protein sequence lines.
2004-03-30 twu
* Makefile.am, gmap.c, snap.c: Added routines for printing protein sequence.
* iit-read.c, iit-read.h: Added procedure for listing all types.
* stage3.c, stage3.h: Fixed memory leak and bug when stage3 result is NULL.
* translation.c, translation.h: Added routines for printing peptide sequence.
* get-genome.c: Allowed user to select a particular strain to align against.
* pair.c, pair.h: Added routine for printing peptide. Clarified code for
handling inversions on minus strand. Fixed bug in compression for '#'
character.
2004-02-23 twu
* stage3.c: Added special cases for single mismatch and single cDNA
insertion.
* stage1.c: Defined maximum on finding match pairs, to eliminate slow
response on nonsense sequences, such as poly-G.
* stage3.c: In pass 3, force single gap to be crossed even if finalscore is
negative, to complete alignment.
2004-02-19 twu
* stage3.c, stage3.h: Removed unused variable minendtrigger.
* gmap.c, params.c, params.h, snap.c: Removed global user-specified
parameters from Params_T.
* gmap.c, snap.c: Removed fraction_threshold parameter. Changed default
chimera_threshold to 0.50.
* params.c, params.h: Removed fraction_threshold parameter.
2004-02-18 twu
* gmap.c, snap.c: Made chimera threshold definable by user, and set default
to 0.70.
* stage3.c: Re-defined criterion for a gap to be when queryjump <= 0 and
genomejump <= 0, which holds true after single gaps are filled. Prevented
filling in a genome gap when its alignment score is negative.
* smooth.c, smooth.h: Re-defined criterion for a gap to be when queryjump <=
0 and genomejump <= 0, which holds true after single gaps are filled.
* matchpair.c, matchpair.h: Made size bound a fraction of the best, rather
than subtraction.
* stage1.c: Increased maxentries parameters. Made size bound a fraction of
the best, rather than subtraction.
* gmap.c, snap.c: Removed calls to Stage1_matchpairlist. Now performing
sampling by default.
* smooth.c: Reduced size definition of short intron. Made intron definition
depend only on genome distance, which now includes single gaps that
weren't filled in in stage 3.
* stage2.c: Made a macro for query distance penalty.
* stage3.c: Fixed problem with peeled is NULL. Added decision to not fill
in single gap if the score is negative, and to restore peeled pairs in
that case.
* smooth.c: Fixed memory leak.
2004-02-17 twu
* stage3.c: Fixed bug where program would skip over a pair after gappairs
was added.
* stage3.c: Made peel_back and peel_forward end at a non-gap, by
backtracking from peeled.
2004-02-16 twu
* stage3.c: Increased reward for canonical introns from 5 to 8.
2004-02-15 twu
* stage3.c: Fixed bugs in peel_back and peel_forward. Fixed bug in
computing goodness scores.
* smooth.c, smooth.h, stage3.c: Giving information about number of short
exons found in smoothing to stage 3 to help improve speed.
* stage3.c: Removed occurrences of indexsize. Cleaned up procedure for
finding middle exons in dual intron procedure.
* smooth.c, smooth.h: Rewrote smoothing procedure to be a cleaner procedure.
Analyzing both ends to prune short exons.
* pair.c, pair.h: Added Pair_debug_alignment procedure.
2004-02-14 twu
* smooth.c, smooth.h, stage2.c, stage3.c: Changed stage 2 to produce a
nucleotide-based path, rather than 8-mer path. Changed smoothing and
stage 3 accordingly. Made all intron distance penalties equal in stage 2.
* dynprog.c, dynprog.h, stage3.c: Rearranged stage 3 to solve dual introns
before other introns and large gaps. Performing smoothing iteratively
with dual introns.
* stage2.c: Replaced calculations of gendistance_penalty with macros.
* smooth.c: Increased minexonlen for smoothing, because smoothing has been
made iterative.
* pair.c: Using memcpy commands instead of copying individual fields. Added
diagnostic printing of short exons.
* Makefile.am, smooth.c, smooth.h, stage3.c: Added files smooth.c and
smooth.h and moved Smooth_path there
* matchpair.c: Fixed memory leak.
2004-02-13 twu
* gmap.c, snap.c, stage2.c, stage2.h, stage3.c, stage3.h: Changed stage 2
and stage 3 algorithms to interleave in the following order: dynamic
programming on single gaps, then smoothing, then dynamic programming on
ends and large gaps. Allows dual intron algorithm to work even when
middle exon has small mismatches or gaps.
* pair.c: Fixed merge_one_gap to handle user-selected ngap != 3.
* dynprog.c, dynprog.h: Moved definitions of defect rate boundaries to
header file.
* stage2.c: Subtracting (querydistance+7)/8 on mismatches to penalize once
per 8-mer. Subtracting 1 for each intron to reduce number of introns,
especially when a/1000 + b/1000 < (a+b)/1000.
* stage2.c: Added separate intron penalties for consistent, unknown, and
inconsistent introns. Increased lengths of short middle exons marked for
dual genome gap.
* stage2.h, stage3.c: Added intron length to goodness score.
* stage2.c: Implemented two parallel computations in Stage 2 under forward
and reverse assumptions. Removed firstregion and lastregion computations
from smoothing.
2004-02-12 twu
* gmap.c, snap.c: Removed universalp flag (-U).
* iit_get.c: Added annotation only mode (-A).
2004-02-11 twu
* whats_on: Simplified code greatly.
* gmap_compress.pl, gmap_compress.pl.in, snap_compress.pl: Removed space
before first token. Fixed bug in reporting genomic exon length rather
than cDNA exon length.
* dynprog.c, dynprog.h, stage3.c: Added computation of nonintronlen for
goodness ranking.
* stage2.c: Introduced Link_T to hold dynamic programming data.
* stage2.c: Modified smoothing to have keep, delete, and mark options.
* stage2.c, stage2.h, stage3.c: Added category for introns of unknown
direction.
* gmap.c, snap.c: Made default for chimerasearchp false again. Added an
automatic mode for chimera search if coverage is less than 50%.
2004-02-10 twu
* gmap.c, snap.c: Made chimera search the default.
* stage2.c: Added counts of forward, reverse, and non-canonical introns to
the dynamic programming procedure, and used consistency in computing
scores.
* stage3.c: Added debugging macros.
* stage2.c: Added intron penalty only for noncanonical introns.
* stage1.c: Added debugging statements.
* matchpair.c: Reduced MAXCANDIDATES from 30 to 10.
* dynprog.c, pair.c, pairdef.h: Added '~' character for non-canonical gaps
converted to insertions, to avoid penalizing them as non-intron gaps.
2004-02-09 twu
* gmap.c, iit-read.c, iit-read.h, snap.c, stage3.c, stage3.h: Changed map
output to include strand if both strands are requested.
* dynprog.c: Restored horizontal or vertical jump of 1 next to intron.
* datadir.c: Changed error message.
* stage2.c: Add penalty for number of non-canonical introns. Accumulate
best score for introns, even if negative, and use that if no other score
exceeds 0.
* iit-read.h: Added name to IIT structure.
* gmap.c, pair.c, pair.h, params.c, params.h, snap.c, stage3.h: Added
compression feature.
* stage3.c: Added compression feature. Added debug mode to show output from
stage 2.
* gmapindex.c, snapindex.c, get-genome.c, iit-read.c, iit_dump.c, iit_get.c,
iitdef.h: Added optional name to IIT structure.
2004-02-06 twu
* dynprog.c: Advanced counter within gaps to the next position.
* pair.c, pair.h, pairdef.h, pairpool.c: Added the shortexonp field for
pairs.
* stage2.c: For smoothing of short exons, marking positions as short, rather
than deleting them. Increased length threshold for short exons, because
we now have a mechanism for handling them well.
* gmap.c, snap.c: Added a dynprogM for handling short exons.
* stage3.c, stage3.h: Removed special procedure for dual genome gaps.
Instead comparing a single genome gap with two half genome gaps for short
exons.
* dynprog.c: Removed special procedure for dual genome gaps. Instead, for
short exons, comparing a single genome gap with two half genome gaps.
* dynprog.c: Passing pointers to revsequence and revoffset from stage3 to
dynprog procedures where appropriate. Added preliminary code for dual
genome gap.
* dynprog.h, stage3.c: Passing pointers to revsequence and revoffset from
stage3 to dynprog procedures where appropriate.
* get-genome.c, gmap.c, pair.c, pair.h, params.c, params.h, sequence.c,
sequence.h, snap.c, stage3.c, stage3.h: Added option for specifying wrap
length.
* dynprog.c: Fixed problem with sequence being short by 1 nt in conversion
of gap to insertion.
* dynprog.c: Convert short non-canonical introns into insertions.
* dynprog.c: Removed reverse_sequence and creation of reverse sequence. Now
using a boolean to determine whether to use negative indices.
2004-02-05 twu
* README, configure.ac, Makefile.am, datadir.c, datadir.h, gmap.c, params.c,
params.h, snap.c, stage3.c, stage3.h: Changed references to "bounds" to
"map".
* ddsgap2_compress.pl: Made much faster.
* get-genome.c: Fixed get-genome for reverse complement. Added debugging
statements.
* dynprog.c: Added specific constraints on whether to allow gaps adjacent to
the intron, depending on sequence quality.
2004-02-03 twu
* dynprog.c, dynprog.h: Removed conservative option. Added comments to
explain rationale behing scoring scheme.
* gmap.c, params.c, params.h, snap.c, stage3.c, stage3.h: Removed
conservative option.
* stage3.c: Removed peelback on sequence ends. Continued peelback through
small gaps and mismatches. Included comp of '-' in pruning of gaps at end.
* iit-read.c: Added debugging code.
* genome.c: Fixed faulty reasoning when patch has expansion or contraction.
* dynprog.c: Raised penalties on paired gap alignment to prevent
gap-match-gap being preferred to two mismatches. Added checks to bridging
across introns to prevent genomic insertion or more than one cDNA
insertion.
2004-02-02 twu
* pairdef.h: Revised comment about definition of gapp.
* pair.c: Removed comment.
* dynprog.c: Fixed debugging statements for pairs pushed on horizontal or
vertical moves.
* gmap.c, pair.c, pair.h, snap.c, stage3.c, stage3.h: Added printing of
bounds information as a separate section.
2004-01-31 twu
* Makefile.am: Added uintlist.c and uintlist.h to source lists where
necessary.
* gmapindex.c, snapindex.c: Made contig intervals inclusive.
* iit_get.c: Changed isnumber to isnumberp to avoid conflict on some Unix
machines.
* iit_get.c: Handle case where strlen of annotation is 0. Add carriage
return after annotation if necessary. If one numeric argument given, try
as a label, then as a number.
* iit-read.c: Handle case where strlen of annotation is 0.
* genome.c, get-genome.c: Reverted to previous IIT format, where we don't
store lengths explicitly. For sequences, can determine actual length from
annotation strlen.
* iit-read.c, iit-read.h, iit-write.c, iit-write.h, iit_store.c, interval.c,
interval.h: Reverted to previous format, where we don't store lengths
explicitly.
* iit_dump.c: Added warning if IIT_read fails.
* gmapindex.c, snapindex.c: Reverted to previous format, where we don't
store lengths explicitly. For FASTA files, count sequence length and store
as annotation in contig_iit.
* stage3.c: Added Pair_check procedure.
* dynprog.c: Fixed problem with dynamic programming not going back to
beginning. Fixed bridging across cDNA gaps.
* datadir.c, datadir.h: Created two data directories, one for genome files
and one for bounds files.
* pair.c, pair.h: Added Pair_check function.
* configure.ac, Makefile.am, gmap.c, snap.c: Created two data directories,
one for genome files and one for batch files.
2004-01-27 twu
* dynprog.c: Reduced mismatch and gap penalties at ends to extend ends more
completely.
* stage1.c: Increased length of very small sequences from 30 to 40.
2004-01-26 twu
* gmap.c, snap.c, stage1.h: Changed criterion for good alignment on short
sequences to be based on coverage rather than percent identity.
* stage1.c: Sampling exhaustively on short sequences.
* stage2.c: Removed tiebreaker based on genomic distance. Ignoring
gendistance penalty if no better score can be found, which allows program
to find distant 5' exons.
* pair.c, pairpool.c, stage3.c: Restored large gap and '#' character when
queryjump exceeds maxlookback.
* match.c: Fixed bug where accessions were looked up on chromosomal
coordinates instead of universal coordinates.
* Makefile.am, datadir.c, datadir.h, snapconfig.c: Removed snapconfig and
run-time configuration of SNAP, which doesn't work on statically built
binaries.
2004-01-23 twu
* gmap.c, snap.c: Updated print_usage statement for non-popt systems.
* snapconfig.c: Added a usage statement.
* iit_dump.c: Added a debug flag.
* iit-write.c: Writing out elements of structs individually, instead of
depending on an fwrite of the struct.
* iit-read.c, iit-read.h: Fixed problem with Bigendian reads of iit files.
Added IIT_debug function.
* Makefile.am: Provided different dist and nodist instructions depending on
FULLDIST.
* stage1.c: Set maxentries during sampling to be 10 times that of scanning.
Set stage1size for short sequences to be 12-mers for < 40 nt, and 18-mers
for 40-80 nt.
* pair.c, pair.h, pairpool.c: Removed '#' is a character in alignment.
* dynprog.c, dynprog.h, stage3.c: Treated cDNA gaps (extra cDNA material) in
a way analogous to genome gaps.
* get-genome.c: Changed name of function from isnumber to isnumberp to avoid
name conflict with some systems (like MacOSX) that define isnumber in
ctype.h.
2004-01-20 twu
* stage3.c: Fixed bug where dynamic programming of ends wouldn't go all the
way to the end of the genomic segment.
* dynprog.c: Fixed debug statement.
* Makefile.am: Added file matchdef.h
* dynprog.c, dynprog.h, gmap.c, pair.c, pair.h, params.c, params.h, snap.c,
stage3.c, stage3.h: Added parameter for length of intron gap shown.
* stage1.c: Added a second maxentries parameter to prevent slowness on long
repeated inputs, like CA...CA.
2004-01-19 twu
* stage3.c: Allowed cDNA direction to be indeterminate.
* matchpair.c, stage1.c: Fixed clustering to work with minsize of 1.
* dynprog.c: Reduced points for match, which improves some alignments.
2004-01-16 twu
* gmap.c, params.c, params.h, snap.c, stage1.c, stage1.h: Removed nsamples
as a global parameter.
* bootstrap, configure.ac, Makefile.am: Added libtool and
--enable-static-linking feature.
* gmap.c, snap.c: Implemented incremental clustering based on progressively
smaller sampling intervals. Added ability to print alignment continuously.
* matchpair.c, matchpair.h, stage1.c: Implemented incremental clustering
based on progressively smaller sampling intervals.
* match.c, matchdef.h: Moved structure definition to matchdef.h
* block.c, block.h, reader.c, reader.h: Added ability to reset ends of block.
* stage3.c, stage3.h: Added printing of number of unknowns and of gap
openings in cDNA and genome.
* params.c, params.h: Added parameter for continuous output of alignment.
* pair.c, pair.h: Added output of number of unknowns. Added procedure for
continuous output of alignment.
* dynprog.c: Created different penalties for gaps in single and paired gaps.
2004-01-14 desany
* loginfo: Added Yan to e-mail notifications.
* loginfo: Finally figured out where to put the quote (I think).
* loginfo: e-mail command tweak
* loginfo: Tweaking the e-mail command.
* loginfo: Sending log messages to desany when cgh module updates are
committed.
2004-01-14 twu
* configure.ac: Added feature for static linking.
* params.c, params.h: Using two parameters for stutter: stuttercycles and
stutterhits.
* gmap.c, snap.c: Performing sampling only when necessary. Using popt help
when available.
* stage1.c, stage1.h: Performing sampling only when necessary. Limiting
size and changing parameters for bestlist.
* matchpair.c, matchpair.h: Eliminated unused code in filtering procedure.
* indexdb.c: Fixed fread_int to be fread_uint.
* iit-read.c: Added abort statement when more than one interval retrieved by
IIT_get_one.
* get-genome.c: Fixed bug with accessing chromosome_iit after being freed.
Using popt help when available.
* oligo.c, oligo.h: Added Oligo_skip function.
* block.c, block.h: Removed maxtries and added Block_skip.
2004-01-12 twu
* gmap.c, snap.c, stage1.c: Changed strategy to use clusters of matches,
after first pair found.
* gmapindex.c, snapindex.c: Eliminated check for genome database in
compression mode.
* stage2.c: Changed distance penalty to 1 point per 1000 nt.
* pair.c, pair.h, stage3.c: Keeping separate track of query indels and
target indels.
* genome.c, genome.h, get-genome.c: Implemented check for gbufferlen when
shifting old sequence.
* separator.h: Added file for separator information.
2004-01-09 twu
* Makefile.am, get-genome.c: Changed program to use chromosome_iit and
contig_iit, rather than text files.
* genome.c: Fixed bug from call to madvise on NULL region.
* iit-read.c, iit-read.h: Added function IIT_read_linear.
* gmapindex.c, snapindex.c: Storing length in interval of contig_iit, rather
than in annotation.
* stage1.c: Changed paired algorithm to use sum of reciprocals of number of
hits.
* get-genome.c: Removed unnecessary decompression functions (now in
genome.c).
* gmap.c, snap.c: Fixed bug where fraction_threshold was declared as int
rather than double.
* stage1.c: Revised algorithm to count number of query hits on 5' and 3'
ends.
* Makefile.am, datadir.c, datadir.h, get-genome.c, gmap.c, snap.c,
snapconfig.c: Moved datadir functions to a separate file.
* gmapindex.c, snapindex.c: Changed format of text files .chromosome and
.contig.
2004-01-08 twu
* genome.c, genome.h, get-genome.c, gmap.c, iit-write.c, iit-write.h,
snap.c: Allowed genomic patches to be longer or shorter than their
endpoints.
* gmapindex.c, snapindex.c: Allowed intervals to have length that is
different from their endpoints. Changed format for fasta file input to
snapindex.
* iit_store.c, interval.c, interval.h, segmentpos.c, segmentpos.h: Allowed
intervals to have length that is different from their endpoints.
* iit-read.c: Added carriage returns to annotations, if absent.
2004-01-07 twu
* gmap.c, params.c, params.h, snap.c: Made fraction_threshold a parameter.
* stage2.c: Changed calculation of penalty for large genome distances to be
done only when necessary.
* snapconfig.c: Changed feedback message.
* genome.c, indexdb.c: Improved warning messages when memory mapping fails.
2004-01-05 twu
* snapdir.c: Changed name of snapdir to snapconfig.
* gmap.c, match.c, match.h, params.c, params.h, result.c, result.h, snap.c:
Restored alignment using stage 1 only.
* stage1.c, stage1.h: Moved decision of stage1size and maxentries to here.
* genome.c: Added warning message of memory mapping of genome fails.
* genome.c: Restored batch memory mapping of genome.
* stage1.c: Greatly increased MAXENTRIES parameter.
* gmap.c, params.c, params.h, snap.c: Made stage1size dependent upon
sequence length, with short sequences getting stage1size of 12.
* gmap_compress.pl, gmap_compress.pl.in, snap_compress.pl, whats_on:
Generalized parse for coordinate separator.
* get-genome.c: Restored -- as coordinate separator.
2003-12-19 twu
* gmap.c, sequence.c, sequence.h, snap.c, stage3.c, stage3.h: New approach
to chimeras, involving a subsequence and new stage1 procedure.
* stage2.c: Added distance penalty for long introns.
* Makefile.am, pair.c, segmentpos.c: Included separator.h
* pair.c, segmentpos.c, segmentpos.h: Removed unnecessary parameters in
Segmentpos_print_accessions.
* get-genome.c: Change in coordinate separator from -- to ..
2003-12-17 twu
* gmap.c, match.c, match.h, matchpair.c, matchpair.h, snap.c, stage1.c,
stage1.h: Changed procedures for finding chimeras to try singlelist of the
appropriate side.
* pair.c, segmentpos.c: Changing coordinate output from -- to ..
* stage3.c, stage3.h: Changed procedures for finding chimeras to try
singlelist of the appropriate side. Fixed bug in computing chimeric
goodness.
* dynprog.c, dynprog.h: Provided separate parameters for ends, removed
multiplicative reward, and changed all score calculations to be integers.
2003-12-16 twu
* matchpair.c, stage1.c, stage1.h: Fixed bug with position calculations on
large chromosomes (> 2 Gig).
* gmap.c, matchpair.c, matchpair.h, snap.c, stage1.c, stage1.h: Based
algorithm for finding extensions on 12-mers.
* chrnum.c, chrnum.h: Added function for computing chromosomal string and
position from genomic position.
2003-12-15 twu
* gmap.c, matchpair.c, matchpair.h, params.c, params.h, snap.c: Made
extension linear depending on query length.
* stage1.c, stage1.h: Made cluster list depend on size of largest cluster.
2003-12-14 twu
* stage2.c: Added a minimum exon length for ends during smoothing.
* stage1.c, stage1.h: Added a last-resort procedure for trying all matches
found in stage 1. Enhanced debugging statements.
* gmap.c, snap.c: Added a last-resort procedure for trying all matches found
in stage 1.
* oligoindex.c, shortoligomer.h: Returned to old method for store_positions,
because it appears to be faster.
* genome.c, genome.h, get-genome.c: Enhanced debugging statements.
* matchpair.c: Added assertions about strands and relative position of
matches.
* stage2.c: Returned to old method for store_positions. Fixed smoothing for
a single exon.
2003-12-13 twu
* oligoindex.c, shortoligomer.h, types.h: Further attempt to increase speed
of store_positions.
* gmap.c, snap.c: Fixed memory leak when stage3array is recomputed.
* oligoindex.c, oligoindex.h, stage2.c: Increasing speed of store_positions
by reducing number of calls to calloc.
* gmap.c, matchpair.c, matchpair.h, params.c, params.h, snap.c, stage1.c,
stage1.h: Changed cluster algorithm to rank clusters based on size and
process the top ones based on sum of sizes.
2003-12-12 twu
* genome.c: Added check for enddiscard being 0.
* stage2.c: Did an in-lining of intron_score.
* gmap.c, params.c, params.h, snap.c, stage1.c, stage1.h, stage3.c,
stage3.h: Added new cluster algorithm for stage 1, used when paired
algorithm fails to produce an alignment with high identity.
* gmap.c, snap.c: Added ability to modify binary file to include default
genome directory.
* snapconfig.c, snapdir.c: Initial import into CVS.
2003-12-10 twu
* gmapindex.c, indexdb.c, indexdb.h, snapindex.c: Added ability to generate
idxoffsets and idxpositions files from compressed genome.
* gmap.c, snap.c: Changed the uncompressed flag from -G to -g.
* gmapindex.c, snapindex.c: Implemented direct writing of compressed genome
file.
2003-12-09 twu
* iit_store.c: Fixed bug where non-copied string is entered into table.
* iit_get.c: Improved error message.
* iit_dump.c: Added function for showing all types.
* table.c: Added debugging statements.
* gmap.c, params.c, params.h, snap.c: For user-provided segments, skipping
stage 1 (although can be specified by the user), to achieve increased
speed.
* sequence.c, sequence.h: Restored function Sequence_revcomp.
2003-12-04 twu
* stage1.c: Restored cluster algorithm for short sequences.
* gmap.c, snap.c: Generalized definition of chimera, and reduced percentage
to 80%.
2003-12-03 twu
* Makefile.am, iit-read.c, iit-read.h, iit_get.c: Augmented iit_get to
handle types and file input.
* gmap.c, intlist.c, intlist.h, sequence.c, sequence.h, snap.c: Allowed
user-specified genomic segment to have arbitrary length.
* gmap.c, snap.c: Restored -U flag for reporting in universal coordinates.
* iit-read.c: Fixed bug in IIT_dump_formatted.
* Makefile.am, md5-compute.c: Added program md5-compute.
2003-12-01 twu
* gmap.c, params.c, params.h, snap.c: Added message to user when FASTA file
is run without batch mode.
2003-11-28 twu
* oligo.c: Changed debug statements.
* reader.c: Cleaned up pointer calculation.
* sequence.h: Removed Sequence_revcomp, which is not used.
* sequence.c: Revised comments.
* stage2.c: In-lined gap_score.
* indexdb.c: More bug fixes for bigendian machines on user-provided segments.
* indexdb.c: Fixed a problem with bigendian machines for user-provided
segments.
* gmap.c, snap.c: Added releasestring in attempt to find version file.
* genome.c, genome.h: Added option for replacing X's with N's.
* get-genome.c: Added option for replacing X's with N's. Fixed bug when
closing a null file pointer.
* iit_store.c: Append .iit to given filename, instead of replacing existing
suffix.
2003-11-26 twu
* gmapindex.c, snapindex.c: Removed -U flag.
* gmapindex.c, indexdb.c, indexdb.h, snapindex.c: Reverted back to using
uncompressed genome for making idxoffsets and idxpositions.
* gmap.c, snap.c: Changed flag for uncompressed genome from -G to -U.
* gmapindex.c, indexdb.c, indexdb.h, snapindex.c: Attempt to build
idxoffsets and idxpositions from genomecomp, but has problems.
* genome.c: Added automated switching between compressed and uncompressed
genome, if the requested one cannot be found.
* iit_store.c: Keeping last carriage return of annotation.
* iit_get.c: If iit file cannot be found, try appending .iit.
* gmapindex.c, snapindex.c: Finding labels in IIT directly instead of
converting to a table.
* iit-read.c, iit-read.h, iit-write.c, iitdef.h: Changed IIT format to store
alphabetic order of labels, so that labels can be found by binary search.
2003-11-25 twu
* genome.c, genome.h, gmap.c, pair.c, pair.h, params.c, params.h, snap.c,
stage3.c, stage3.h: Added popt handling of options. Renamed various
program options.
* iit-read.c, iit-read.h, iit_get.c: Added ability to search IITs by label.
* get-genome.c: Changed usage statement for popt autohelp.
* Makefile.am: Changed name of variable to POPT_LIBS.
* acinclude.m4, configure.ac: Added AC_DEFINE for HAVE_LIBPOPT. Set various
defines to have value 1.
* gmapindex.c, iit-read.c, iit-read.h, iit-write.c, iit-write.h, iit_get.c,
iit_store.c, iitdef.h, match.c, segmentpos.c, snapindex.c: Change made to
format of IIT file. Now allowing each interval to be labeled.
* indexdb.c: Fix made for the case where an oligomer earlier than TT...TT is
the last one and points to totalcounts.
2003-11-24 twu
* gmap_compress.pl, gmap_compress.pl.in, snap_compress.pl: Added notation
for chimeric sequences.
* acinclude.m4: Added check for MAP_FAILED. Added sys/types.h when checking
for pthreads (needed for Sun compiler).
* assert.h, bigendian.h, blackboard.h, block.h, chrnum.h, complement.h,
dynprog.h, except.h, genome.h, genomicpos.h, iit-read.h, iit-write.c,
iit-write.h, iit_dump.c, iit_get.c, iit_store.c, indexdb.h, interval.h,
intlist.h, intron.h, list.h, match.h, matchpair.h, md5.h, mem.h, oligo.h,
oligoindex.h, pair.h, pairpool.h, params.h, reader.h, reqpost.h,
request.h, result.h, segmentpos.h, sequence.h, stage1.h, stage2.h,
stage3.h, stopwatch.h, table.h, uintlist.h: Included config.h in all
header files, to catch redefinition of const, which is needed for the Sun
compiler.
* stage3.c: Commented out code that is never reached.
* genome.c, indexdb.c: Modified messages to stderr for batch mode.
* blackboard.c, gmap.c, reqpost.c, snap.c: Added sys/types.h to handle
pthread_t, needed by Sun compiler.
* assert.c: Kept only the header file definition of assert, due to problem
with Sun compiler.
* iit-read.c, table.c: For functions passed as arguments, added pointer and
parentheses around parameter list.
* stage2.c: Changed some exon length parameters.
2003-11-19 twu
* gmap.c, snap.c, stage3.c, stage3.h: Added additional check for chimeras,
based on top two hits.
* bigendian.c, indexdb.c: Moved masking to the logical or statements to
address a bug on MacOSX.
2003-11-18 twu
* gmap.c, snap.c: Made directory searching process more flexible, by looking
for version file at toplevel and subdirectory of datadir.
* genome.c, indexdb.c: Fixed calls to mmap and munmap when mmap fails.
Moved stopwatch start before madvise command.
* bigendian.c, genome.c, indexdb.c: Added masks to chars when converting to
an int or unsigned int, due to problem observed on DEC Alpha.
* genome.c, indexdb.c: Corrected conversion of littleendian to bigendian
numbers. Added lseek and read procedures when mmap is not present or
fails.
* bigendian.c: Corrected conversion of littleendian to bigendian numbers.
* Makefile.am: Generate ChangeLog only when CVS directory present.
2003-11-17 twu
* Makefile.am: Used LDADD instructions to call libraries instead of LDFLAGS.
(Required for program to load on SGI.) Moved SCRIPTS under FULLDIST.
* configure.ac: Renamed POPT_LDFLAGS to POPT_LIBS.
* bootstrap: Added --copy flag to automake.
* Makefile.am: Added dist-hook to make ChangeLog up to date.
* config: Removed secondary config files generated by automake.
* gmapindex.c, snapindex.c: Fixed bug where X's were not being filled in,
because variable declared as int, rather than unsigned int.
* block.c, block.h: Removed obsolete function.
* acinclude.m4: Moved to top-level directory.
* ChangeLog: Removed from repository. Can be generated as needed.
* bootstrap: Added --add-missing flag.
* README: Added message about config.site.
* bootstrap: Initial import into CVS. Added because autoreconf doesn't work
with a config subdirectory.
* configure.ac: Made toplevel configure.ac work with a config subdirectory.
* gmap.c, snap.c, stage1.c, stage1.h: Changed algorithm to declare chimera
only after alignment is done, and to use salvaged matches in that case.
* stage3.c, stage3.h: Stored genomicstart and genomicend as part of Stage3_T
structure.
* ddsgap2_compress.pl: Initial import into CVS.
* whats_on, install-sh, missing, mkinstalldirs, sim4_compress.pl,
sim4_uncompress.pl, snap_compress.pl, snap_uncompress.pl, snapbuild.pl.in,
spidey_compress.pl: Moved to subdirectory.
* compile, config.guess, config.sub, depcomp: Removed secondary config files
(generated by automake).
* Makefile.am: Adding top-level Makefile.am
* assert.c, assert.h, bigendian.c, bigendian.h, blackboard.c, blackboard.h,
block.c, block.h, bool.h, chrnum.c, chrnum.h, complement.c, complement.h,
dynprog.c, dynprog.h, except.c, except.h, genome.c, genome.h,
genomicpos.c, genomicpos.h, genuncompress.c, get-genome.c, iit-read.c,
iit-read.h, iit-write.c, iit-write.h, iit_dump.c, iit_get.c, iit_store.c,
iitdef.h, indexdb.c, indexdb.h, interval.c, interval.h, intlist.c,
intlist.h, intron.c, intron.h, list.c, list.h, listdef.h, match.c,
match.h, matchpair.c, matchpair.h, md5.c, md5.h, md5.t.c, mem.c, mem.h,
oligo-count.c, oligo.c, oligo.h, oligoindex.c, oligoindex.h, pair.c,
pair.h, pairdef.h, pairpool.c, pairpool.h, params.c, params.h, reader.c,
reader.h, reqpost.c, reqpost.h, request.c, request.h, result.c, result.h,
segmentpos.c, segmentpos.h, sequence.c, sequence.h, shortoligomer.h,
snap.c, snapindex.c, stage1.c, stage1.h, stage2.c, stage2.h, stage3.c,
stage3.h, stopwatch.c, stopwatch.h, table.c, table.h, types.h, uintlist.c,
uintlist.h: Moved source files to subdirectory.
* iit-read.c, iit-read.h: Added function IIT_get_typed.
* indexdb.c: Removed debugging message.
* snap.c, gmap.c, stage3.c, stage3.h: Improved determination of when an
alternate strain applies, based on the aligned genomic segment. Added
strain type to sorting of results.
* stage1.c: Bypassing the cluster algorithm.
* snap.c, gmap.c: Added ability to determine datadir from environment
variable or configuration file.
* get-genome.c: Added popt processing of command-line options.
* genome.c: Added bigendian conversions for compressed genome, which is
memory mapped.
* configure.ac, Makefile.am: Added check for popt library.
2003-11-15 twu
* snapindex.c, gmapindex.c: Fixed pointer bug.
* stage2.c, stage2.h: Removed directional check on stage 2 smoothing.
Introduced separate length criterion for first long exon.
* stage3.c, stage3.h: Implemented checks and procedures for chimeric
sequences. Removed directional check on stage 2 smoothing.
* pair.c, pair.h, result.c, result.h, snap.c, gmap.c, stage1.c, stage1.h:
Implemented checks and procedures for chimeric sequences.
* genome.c: Changed debug statements from stderr to stdout.
2003-11-14 twu
* stage1.c: Changed identify_matches to assume the absence of duplicates.
* stage2.c: Changed criterion for short first and last exon during smoothing
to be half of the corresponding region.
* stage3.c: Fixed debugging statements.
* snap.c, gmap.c, stage3.c, stage3.h: Fixed bug where a strain was falsely
reported due to duplicate stage 3 objects and deletion of the one for the
reference.
* sequence.c: Reduced poly-A tail left from 7 to 1.
* pair.c: Made print procedure backward compatible with old altstrain_iits.
* pair.c, params.c, params.h, snap.c, snapindex.c, gmap.c, gmapindex.c: Made
changes to include name of reference strain.
* snap.c, gmap.c: Fixed typo in comment.
* iit-read.c: Fixed memory leak when altstrain_iit doesn't exist.
* Makefile.am: Integrated get-genome into snap code.
* get-genome.c, sequence.c, sequence.h: Major rewrite of get-genome, to
integrate it into existing snap code.
* genome.c, genome.h, snap.c, gmap.c: Handled case where more than one patch
from a given strain is applicable to a given genomic segment.
* intlist.c: Added check for null list in Intlist_to_array.
* indexdb.c: Changed idxpositions to eliminate duplicates during writing and
to skip bad values during reading.
2003-11-13 twu
* snap_compress.pl, gmap_compress.pl, gmap_compress.pl.in: Revised program
to parse strain info.
* stage1.c: Added some comments.
* block.c, block.h, oligo-count.c, oligo.c, oligo.h, Makefile.am: Revised
oligo-count to use the new code.
* Makefile.am: Added build for get-genome.
* get-genome.c: Major cleaning of code. Added ability to read from
compressed genome files.
* oligo-count.c: Initial import into CVS. Dated 2003-07-16.
* genome.c, genome.h, genomicpos.c, genomicpos.h, matchpair.c, matchpair.h,
pair.c, pair.h, snap.c, gmap.c, stage3.c, stage3.h: Added ability to align
to multiple strains.
* stage1.c: Cleaned up some bugs on handling stutter. Implemented check for
duplicates in idxpositions.
* indexdb.c, indexdb.h, snapindex.c, gmapindex.c: Changed strategy for
idxoffsets and idxpositions for strains. Now storing the union of all
strains.
2003-11-12 twu
* snapbuild.pl.in, Makefile.am, gmapsetup.pl.in, gmap_setup.pl.in: Fixed
procedure for making snapbuild.
* Makefile.am: Added procedure for making snapbuild script.
* configure.ac: Added feature for enabling full distribution.
* snapbuild.pl: Changed file from snapbuild.pl to snapbuild.pl.in.
* configure.ac, params.h, Makefile.am: Cleaned up specification of data
directory and version file.
* params.c: Added provisions for reading altstrain IIT.
* snap.c, gmap.c: Cleaned up specification of data directory and version
file. Added provisions for reading altstrain IIT.
* snapindex.c, gmapindex.c: Fixed problem with slashes in alternate strain
name.
* stage1.c: Cleaned up code for stage1.c. Fixed memory leak for paired
algorithm. Added chromosomal constraint for cluster algorithm.
2003-11-11 twu
* iit-read.c, iit-read.h, iit-write.c, iit-write.h, iit_dump.c, iit_get.c,
iit_store.c, iitdef.h, indexdb.c, indexdb.h, interval.c, interval.h,
segmentpos.c, segmentpos.h, snap.c, snapbuild.pl, snapbuild.pl.in,
snapindex.c, Makefile.am, gmap.c, gmapindex.c, gmapsetup.pl.in,
gmap_setup.pl.in: Changes made to introduce types into IITs, and to build
SNAP databases with alternate strain information.
* match.c, match.h, stage1.c: Changes to stage 1 algorithm: (1) choice of 5'
or 3' advancement based on number of hits, (2) stutter based on positions
with hits, (3) computed fraction of paired hits on each end.
2003-11-10 twu
* snapbuild.pl, snapbuild.pl.in, gmapsetup.pl.in, gmap_setup.pl.in: Initial
import into CVS.
* Makefile.am: Added object files for bigendian.
* iit-read.c: Added header file for bigendian.h
* bigendian.c, indexdb.c: Fixed problem in bigendian conversion.
* sequence.c: Fixed problem in handling sequence files without headers.
* iit-read.c: Changed most elements of IIT_T to be fread, rather than
mmapped. Added code for program to work on bigendian architectures.
* indexdb.c: Changed offsets file to be fread, rather than mmapped. Added
code for program to work on bigendian architectures.
* iitdef.h: Added comments.
* bigendian.c, bigendian.h, configure.ac, genuncompress.c, iit-write.c,
snapindex.c, Makefile.am, gmapindex.c: Added code for program to work on
bigendian architectures.
2003-11-08 twu
* acinclude.m4, configure.ac: Made VERSION automatically equal the current
date.
* Makefile.am: Removed reference to iit_convert.
* genome.c: Turned off batch loading of genome.
* sequence.c, snap.c, gmap.c: Rest of header printed in output. Exceptional
file terminations handled better.
2003-11-07 twu
* params.c, params.h, snap.c, gmap.c, stage1.c, stage1.h: Added cluster
algorithm for short query sequences.
* block.c, block.h, longoligomer.c, longoligomer.h, match.c, match.h,
matchpair.c, matchpair.h, oligo.c, oligo.h, Makefile.am: Removed
longoligomers.
* genome.c: Fixed print statement for batch mode.
* snap.c, gmap.c: Restored dump_segs functionality.
* snapindex.c, gmapindex.c: Changed name of table from chroffset to
chrlength.
* iit-read.c, iit-read.h: Added function IIT_dump_formatted.
2003-10-27 twu
* iit_get.c, iit_store.c: Removed carriage return at end of annotation.
* iit-read.c, iit-read.h, iit_dump.c, Makefile.am: Added a program for
dumping IIT files.
* snapindex.c, gmapindex.c: Added better comments.
* whats_on: Fixed program to use new IIT file format.
* table.c: Removed assertion checks for key being non-zero, which doesn't
work for a chromosome of 0.
* INSTALL: Copied generic installation instructions.
* COPYING: Created copyright notice.
2003-10-25 twu
* iit-write.c: Made Node_make static.
2003-10-24 twu
* indexdb.c: Fixed format of batch statement.
* iit-read.c, iit-write.c, iit_get.c, match.c, segmentpos.c, snapindex.c,
gmapindex.c: Changed annotations in .iit files to have '\0' characters at
the ends, so they can be used in the file, without copying.
2003-10-23 twu
* interval.c: Added comment about sorting procedures.
* iit_get.c, iit_store.c: Changed program to use the IIT implementation in
this directory.
* iit-read.c: Added madvise command.
* genome.c, indexdb.c: Changed reporting of touching pages under batch mode.
* Makefile.am: Added iit_store and iit_get.
* genome.c, indexdb.c: Revised touching of pages for batch mode.
* assert.h, blackboard.h, block.h, bool.h, chrnum.h, complement.h,
dynprog.h, except.h, genome.h, genomicpos.h, iit-read.h, iit-write.h,
iitdef.h, indexdb.h, interval.h, intlist.h, intron.h, list.h, listdef.h,
longoligomer.h, match.h, matchpair.h, md5.h, mem.h, oligo.h, oligoindex.h,
pair.h, pairdef.h, pairpool.h, params.h, reader.h, reqpost.h, request.h,
result.h, segmentpos.h, sequence.h, shortoligomer.h, stage1.h, stage2.h,
stage3.h, stopwatch.h, table.h, types.h, uintlist.h: Added RCS Id string
to header files.
* snap.c, gmap.c: Removed call to strdup.
* snapindex.c, gmapindex.c: Removed printing of superaccessions for NCBI
genomes.
* segmentpos.c: Removed unused procedures based on Berkeley DB.
* chrnum.c: Fixed problem with numeric-alpha ordering of chromosomes. XU
now follows X and precedes Y.
2003-10-22 twu
* acinclude.m4, genome.c, indexdb.c: Added macros to check for pagesize
determination.
* config.h.in: Removed derived file.
* configure.ac: Cleaned up unnecessary autoconf macros.
* acinclude.m4, config.h.in, genome.c, genuncompress.c, iit-read.c,
indexdb.c: Improved autoconf checks and header files for mmap.
* snapindex.c, gmapindex.c: Fixed problem with freeing memory.
* segmentpos.c: Fixed small error with printing accession bounds.
* chrnum.c, iit-read.c, iit-read.h, iit-write.c, iit-write.h, segmentpos.c,
snap.c, snapindex.c, gmap.c, gmapindex.c: Fixed memory leaks.
* acinclude.m4, block.h, chrnum.c, chrnum.h, config.h.in, configure.ac,
database.c, database.h, genomicpos.c, genomicpos.h, get-genome.c,
iit-read.c, iit-read.h, iit-write.c, iit-write.h, interval.c, interval.h,
match.c, match.h, offset.c, offset.h, offsetdb.c, offsetdb.h, oligo.c,
oligo.h, pair.c, pair.h, params.c, params.h, segmentpos.c, segmentpos.h,
sequence.c, snap.c, snapindex.c, Makefile.am, gmap.c, gmapindex.c,
stage1.c, stage1.h, stage3.c, stage3.h, table.c, table.h: Eliminated
dependence upon Berkeley DB.
* table.c, table.h: Initial import into CVS.
2003-10-21 twu
* acinclude.m4, config.h.in, configure.ac, genome.c, genuncompress.c,
iit-read.c, indexdb.c: Added checks for various mmap flags.
* iitdef.h: Restructed IIT_T commands.
* iit-read.c, iit-read.h, iit-write.c, iit-write.h, interval-read.c,
interval-read.h, interval.c, interval.h, pair.c, snap.c, Makefile.am,
gmap.c: Restructured Interval_T and IIT_T implementations so they don't
depend on BerkeleyDB, and added ability to write IITs.
* acinclude.m4, database.c: Added provision for BerkeleyDB version 4.1.
* iit_store.c: Changed format of input file to have only intervals on the
header line.
* iit_get.c: Changed program to use new IIT format.
* iit_store.c: Fixed problem with annotlist being reversed.
* iit_store.c: Changed format of iit file to include annotations.
2003-10-20 twu
* sequence.c: Corrected type for return value of fgetc.
* oligo.c: Corrected type for return value of Reader_getc.
* stage1.h: Removed db.h as an included header.
* acinclude.m4: Added -rpath flag during linking of Berkeley DB.
* Makefile.in, configure: Removing from CVS.
* Makefile.in, configure: Result of autoreconf.
* Makefile.am: Added header files to SOURCES.
* configure.ac: Added no-dependencies option.
* iit-read.c: Removed MAP_VARIABLE from mmap call, because not recognized by
Linux.
* sequence.c: Renamed variable strlen to avoid compiler error on Linux.
* Makefile.in: Added various auxiliary files.
* Makefile.in, compile, config.guess, config.sub, depcomp, install-sh,
configure: Initial import into CVS.
* missing, mkinstalldirs: Provided updated version.
* genome.c, genomicpos.c, iit-read.c, iit-read.h, indexdb.c, intlist.c,
intlist.h, match.c, md5.c, mem.c, offset.c, offsetdb.c, oligoindex.c,
pair.c, segmentpos.c, segmentpos.h, Makefile.am, uintlist.c, uintlist.h:
Addressed compiler warnings from gcc.
2003-10-19 twu
* acinclude.m4, blackboard.c, configure.ac, reqpost.c, snap.c, Makefile.am,
gmap.c: Allowed pthreads to be enabled or disabled.
2003-10-18 twu
* assert.c, block.c, chrnum.c, complement.c, database.c, dynprog.c,
except.c, genome.c, genomicpos.c, genuncompress.c, get-genome.c,
iit-read.c, indexdb.c, interval-read.c, intron.c, list.c, longoligomer.c,
match.c, matchpair.c, md5.c, offset.c, offsetdb.c, oligo.c, oligoindex.c,
pair.c, pairpool.c, params.c, reader.c, reqpost.c, request.c, result.c,
segmentpos.c, sequence.c, snap.c, snapindex.c, gmap.c, gmapindex.c,
stage1.c, stage2.c, stage3.c, stopwatch.c: Added RCS Id string correctly
* assert.c, block.c, chrnum.c, complement.c, database.c, dynprog.c,
except.c, genome.c, genomicpos.c, genuncompress.c, get-genome.c,
iit-read.c, indexdb.c, interval-read.c, intron.c, list.c, longoligomer.c,
match.c, matchpair.c, md5.c, offset.c, offsetdb.c, oligo.c, oligoindex.c,
pair.c, pairpool.c, params.c, reader.c, reqpost.c, request.c, result.c,
segmentpos.c, sequence.c, snap.c, snapindex.c, gmap.c, gmapindex.c,
stage1.c, stage2.c, stage3.c, stopwatch.c: Added rcsid strings.
* blackboard.c, block.c, complement.c, database.c, dynprog.c, except.c,
iit-read.c, interval-read.c, intron.c, list.c, matchpair.c, md5.c, mem.c,
mem.h, oligo.c, oligoindex.c, pair.c, pairpool.c, params.c, reader.c,
reqpost.c, request.c, result.c, sequence.c, stage2.c, stopwatch.c:
Rearranged header includes.
* longoligomer.h: Defined T for both cases of HAVE_64_BIT.
* longoligomer.c: Added conditional compiling based on HAVE_64_BIT.
* offset.h: Added necessary header file stdio.h.
* types.h: Added compiler directives from config.h.
* configure.ac: Initial changes to configure.scan to make autoconf and
automake work for the cc compiler.
* Makefile: Removed Makefile from CVS, because it is now generated from
Makefile.am by automake, and then from Makefile.in by configure.
* AUTHORS, COPYING, ChangeLog, INSTALL, NEWS, README, acinclude.m4, config,
missing, mkinstalldirs, config.h.in, Makefile.am: Added files for autoconf
and automake to work.
* configure.ac: Initial configure.ac from configure.scan produced by
autoscan.
2003-10-17 twu
* gencompress.c, snapindex.c, gmapindex.c: Moved gencompress function inside
snapindex (previously in gencompress.c).
* segmentpos.c: Changed type of relstart and relend to int, due to problems
with long.
2003-10-16 twu
* dynprog.c: Removed splice-site.c.
* commafmt.c, commafmt.h, genomicpos.c, genomicpos.h, match.c, pair.c,
segmentpos.c: Moved commafmt command to genomicpos.c.
* types.h: Defined UINT8 only if HAVE_64_BIT is defined.
* splice-site.c, splice-site.h: Removed splice-site.c from CVS.
* readcirc.c, readcirc.h: Removing readcirc from CVS.
* radixsort.c, radixsort.h: Removing radixsort from CVS.
* boyer-moore.c, boyer-moore.h: Removed Boyer-Moore procedures from CVS.
* longoligomer.h: Introduced constants and procedures for Longoligomer_T on
32-bit systems.
* snapindex.c, gmapindex.c: Changed output type of write_genome_file.
* indexdb.c, indexdb.h: Introduced Storedoligomer_T.
* iit-read.c: Added type cast from void * to char *.
* oligo.c, oligo.h: Created 32-bit versions of procedures.
* match.c: Removed functions Match_print() and oligo_nt().
* stage1.c: Removed mask from Block_T. Removed function Match_print().
* block.c, block.h: Removed mask from Block_T.
* longoligomer.c: Added object Longoligomer_T for 32-bit systems.
2003-10-13 twu
* Makefile, chrnum.c, database.c, gencompress.c, genome.c, genome.h,
genuncompress.c, stage1.c, chrnum.h, match.c, match.h, offset.c, offset.h,
pair.c, pair.h, segmentpos.c, segmentpos.h, snap.c, snapindex.c, gmap.c,
gmapindex.c, stage3.c, stage3.h: Changed unsigned int to more descriptive
types.
* genomicpos.c, genomicpos.h, longoligomer.h, shortoligomer.h: Added new
types.
* offsetdb.c, offsetdb.h: Added type for Chrnum_T. Removed function
Offset_position_to_chr.
* oligoindex.c, stage1.c, stage2.c: Changed unsigned long and unsigned int
to more descriptive types.
* add-chrpos-to-endpoints.c: Removed file used for prototyping.
* rsort-check.c, rsort-test.c: Removed utility files for radixsort.
* sequence.c: Removed code for computing CRC32 checksum.
* sample-oligos.c: Removed sample-oligos.c, which was used for prototyping.
* Makefile: Removed cksum-fa
* prb.c, prb.h: Removed prb.c and prb.h, which implemented red-black trees.
* block.c, block.h, indexdb.c, indexdb.h, match.c, match.h, oligo.c,
oligo.h, oligoindex.c, oligoindex.h: Changed unsigned long and unsigned
int to more informative types.
* cksum.c: Removed cksum.c, which is now computed in sequence.c
* cksum-fa.c: Removed cksum-fa.c, which was a utility program.
* cell.c, cell.h: Removed Cell_T, which was designed for the HashDB storage
scheme for genomic oligomers.
* pair.c, pair.h, sequence.c, sequence.h, stage3.c: Added provision for
correcting coverage in the presence of genomic gaps at the ends.
* chrnum.c: Fixed a bug in printing output.
2003-10-09 twu
* stage3.c, stage3.h: Added reward for spliced cDNAs based on number of
exons, if it's greater than 2. Also, added flag for conservative behavior
for splice site prediction, by reducing the reward for canonical splice
sites. Note, however, that such behavior causes SNAP to perform poorly in
the presence of sequence errors.
* dynprog.c, dynprog.h, params.c, params.h, snap.c, gmap.c: Added flag for
conservative behavior for splice site prediction, by reducing the reward
for canonical splice sites. Note, however, that such behavior causes SNAP
to perform poorly in the presence of sequence errors.
2003-10-07 twu
* iit-read.c: Adapt to new format of bounds database contents.
* pair.c: Makes correct call to IIT_get when coordinates are in reverse
order.
2003-08-19 twu
* Makefile, iit-read.c, iit.c, iit.h, interval-read.c, interval.c,
interval.h, pair.h, params.h, snap.c, gmap.c, stage3.h: Changed filenames
from iit.c and interval.c to iit-read.c and interval-read.c
* whats_on: Generalized procedure for identifying FASTA files containing
ESTs.
* sequence.c: Fixed conversion of char to unsigned char.
* Makefile, bounds.c, bounds.h, iit-read.c, iit-read.h, iit.c, iit.h,
interval-read.c, interval-read.h, interval.c, interval.h, pair.c, pair.h,
params.c, params.h, snap.c, gmap.c, stage3.c, stage3.h: Changed calls to
iit to open the files just once.
* bounds.c, bounds.h: Adding bounds.c file to compute bounds.
* Makefile, database.c, database.h, iit-read.c, iit.c: Added ability to use
a gene bounds iit file.
* interval-read.c, interval-read.h, interval.c, interval.h: Revised version
from berkeleydb CVS repository.
* pair.c, pair.h, params.c, params.h, snap.c, gmap.c, stage3.c, stage3.h:
Added capability to use a gene bounds iit file.
2003-08-18 twu
* iit-read.h, iit.h: Initial import into CVS.
* iit_get.c: Compare only to query length.
* get-genome.c: Fixes procedure isrange to make a copy of the string.
2003-07-07 twu
* whats_on: Changed behavior to not die if directory isn't found.
* chrnum.c, chrnum.h, segmentpos.c: Fixed sorting and printing for
chromosomes like 2L.
* stage3.h: Removed Stage3_goodness as an external procedure.
* stage3.c: Changed goodness within a given chromosomal segment to include
canonical introns, but goodness between chromosomal segments to exclude
this.
* stage2.c: Increased MAXHITS from 20 to 1000. Previous value was too low
and led to splicing errors.
* get-genome.c: Changed program to try segment first as a chromosome, then
as a contig.
* offsetdb.c: Improved output statements to print beginning and ending of
chromosomes.
2003-06-19 twu
* dynprog.c: Changed penalties. Made reward for extension multiplicative.
* Makefile, md5.c, md5.h, md5.t.c, params.c, params.h, sequence.c,
sequence.h, snap.c, snap_compress.pl, snap_uncompress.pl, gmap.c, types.h,
gmap_compress.pl, gmap_compress.pl.in, gmap_uncompress.pl,
gmap_uncompress.pl.in: Added MD5 calculations.
* stage3.c: Added debugging statements for finalscore.
* cksum-fa.c: Added comments.
2003-06-17 twu
* Makefile: Rearranged lines.
* snap.c, gmap.c: Fixed calculation of indexdb to occur only once for
user-provided segment.
* sequence.c, sequence.h: Added computation for crc32.
2003-06-13 twu
* dynprog.c: Changed reward for partial match to be zero.
* stage3.c: Fixed bug where pairs_fwd or pairs_rev might be NULL.
2003-06-03 twu
* stage2.c, stage2.h: Created separate paths for forward and revcomp
directions after smoothing. Added back intron score during calculations.
* stage3.c: Separated calculations of forward and revcomp paths.
* snap.c, gmap.c: Increased size of maxlookback.
* pair.c, pair.h: Added calculation of number of canonical exons.
* dynprog.c: Setting finalscore as a return value.
* stage3.c: Added number of canonical exons to goodness criterion. Added
"Stage 3" to debug statements.
2003-05-27 twu
* snap.c, gmap.c, stage2.c, stage2.h, stage3.c, stage3.h: Moved alignment of
different cDNA direction from stage 2 to stage 3.
* pair.c, pair.h: Changed Pair_fracidentity to work on a list, rather than
an array.
2003-05-22 twu
* stage3.c: Changed goodness function to ignore number of canonical introns.
* snap.c, gmap.c: Added parameter for sufflookback, potentially different
from maxlookback, but found that setting maxlookback >> sufflookback led
to long, poor alignments, so set maxlookback = sufflookback.
* params.c, params.h, stage2.c, stage2.h: Added separate parameter for
sufflookback, to be used in stage 2, and possibly different from
maxlookback, used in stage 3.
2003-05-03 twu
* genome.c, genuncompress.c, iit-read.c, iit.c, indexdb.c: Removed
MAP_VARIABLE from mmap command, because it is not available in Linux.
* hash-test.c, hashdb-read.c, hashdb-read.h, hashdb-write.c, hashdb-write.h,
hashdb.c, hashdb.h: Removed hashdb files, which have been replaced by
indexdb.
* whats_on: Added error message.
* Makefile: Removed old Makefile commands.
* snapgenerate.c, snapindex.c, gmapindex.c: Moved functions from
snapgenerate.c to snapindex.c, so only snapindex is needed to create SNAP
genome files.
* genuncompress.c: Initial import into CVS.
2003-04-30 twu
* stage2.c: Added check for MAXHITS in stage 2, to prevent slowness problems
from repetitive cDNAs in repetitive genomic segments (such as AA704019).
* stage1.c: Added debugging statement.
* snap_uncompress.pl, gmap_uncompress.pl, gmap_uncompress.pl.in: Fixed
problem where gpos was not handled correctly for the minus strand.
* chrnum.c: Fixed problem where signed chromosomes were being printed
incorrectly.
2003-04-29 twu
* whats_on: Fixed problem where genomic coordinates were in the order of
largest, then smallest (reverse strand).
2003-04-27 twu
* stage3.c: Removed queryoffset.
* stage2.c, stage2.h: Removed queryoffset. Made sampling interval variable.
Added bounding of a querypos to a single hit if its top score exceeds its
second highest score.
* snap.c, gmap.c: Changed lookback and extramaterial_paired.
* sequence.c: Changed trimming to leave non-poly-A/T oligomers.
* chrnum.c, database.c, get-genome.c, segmentpos.c: Changed interpretation
of chromosome numbers to allow all single letters and all numbers.
* offsetdb.c: Extra blank line.
2003-04-16 twu
* Makefile, accpos.c: Removed file accpos.c, which isn't being used anymore.
* genome.c, sequence.c, sequence.h: Removed offset as a parameter for
Sequence_genomic_new.
* mem.c: Removed upper limit check on allocating memory.
* pair.h: Removed queryoffset from print routines.
* pair.c: Removed queryoffset from print routines. Fixed calculation of
genomic distances for Crick strand.
2003-04-14 rkh
* config, cvswrappers: *** empty log message ***
2003-04-09 rkh
* config: *** empty log message ***
2003-04-07 twu
* dynprog.c: Reduced rewards for canonical introns.
* pair.c: Added conversion to uppercase.
* mem.c: Added check for unexpectedly large allocations.
2003-04-02 twu
* stage3.c: Made separate procedures for 3' and 5' ends. Turned off
Boyer-Moore extension at ends. Added checks to prevent dynamic
programming past end of sequence.
* params.c: Removed freeing of version.
* pairpool.c: Added additional debugging checks.
* pair.c: Improved output for user-provided segments.
* indexdb.c, indexdb.h, match.c, matchpair.c, matchpair.h, offset.c,
sequence.c, sequence.h, snap.c, gmap.c: Now performing stage 1 on
user-provided segments. This eliminates poor alignments when the
user-provided segment is longer than stage 1 would have provided.
* segmentpos.c: Added limit to number of accessions reported.
* request.c, request.h, blackboard.c, blackboard.h: Changed from name from
genomicseg to usersegment.
* stage1.c: Removed offset from call to Block_T procedures.
* genome.c: Renamed some procedures.
* dynprog.c: Increased penalties for mismatch.
* chrnum.c: Allowed chromosome 0.
* block.c, block.h: Removed offset from list of parameters.
2003-03-27 twu
* pair.c, snap_compress.pl, snap_uncompress.pl, stage3.c, gmap_compress.pl,
gmap_compress.pl.in, gmap_uncompress.pl, gmap_uncompress.pl.in: Changed
alignment output for dual breaks.
2003-03-25 twu
* dynprog.c: Created an inline procedure and scheme for scoring canonical
and alternate introns. Increased penalties for mismatches.
* intron.c, intron.h: Moved most functions to other files, to increase speed.
* pair.c, pair.h, pairdef.h, pairpool.c, stage3.c: Added field to Pair_T
object to denote a gap.
* sequence.c: Fixed bug that caused large amounts of memory to be allocated.
2003-03-24 twu
* snap_compress.pl, gmap_compress.pl, gmap_compress.pl.in: Introduced a
better error statement.
* stage2.c: Changed sampling to start at -1 after the first 8-mer missed,
and then go back by the Nyquest rate.
* stage3.c: Introduced peelback for single gaps.
2003-03-21 twu
* genome.c, pair.c, pair.h, sequence.c, sequence.h, snap.c, gmap.c,
stage3.c: Fixed algorithm to handle poly-T starts as well as poly-A ends.
Added extra information to Sequence_T structure and output procedures to
handle this correctly.
* snap.c, gmap.c: Fixed problems with Stage3_T objects that were not
assigned to NULL. Added flushing of output for debugging.
* dynprog.c: Fixed dynamic programming on ends so the genomic segment won't
stick out.
2003-03-20 twu
* spidey_compress.pl: Modified routine to look for spaces of at least 10,
instead of 20.
* dynprog.c: Added a separate reward for canonical introns, depending on the
defect rate.
* list.c, list.h: Added a command for setting the head of a list.
* pair.c: Fixed counting of indels.
* pairpool.c: Created new debugging commands.
* snap.c, gmap.c: Added trimming of first or last exon in stage 2 if the
defect rate is high enough and the exons are too long. Increased lookback
from 60 to 90.
* stage3.c: Modified peelback to go past nonconsecutive hits, stopping only
at an intron.
* stage2.c, stage2.h: Added trimming of first or last exon if the defect
rate is high enough and the exons are too long.
2003-03-16 twu
* stage2.c: Added hooks for making smooth_path depend on defect_rate, but
this appears to be a bad idea.
* pair.c: Improved consistency check to work when cdna_direction is
initially zero.
* dynprog.c, dynprog.h, stage3.c: Changed effect of defect rate to be on
mismatches and gaps, rather than intron scores.
2003-03-15 twu
* pair.c, pair.h, snap.c, gmap.c, stage2.c, stage2.h, stage3.c, stage3.h:
Added check for consistency of intron directions, and ability to back
track to stage 2 with forced cdna_directions if the stage 3 result is
inconsistent.
2003-03-14 twu
* dynprog.c, dynprog.h, pair.c, pair.h, stage2.c, stage2.h, stage3.c: Added
estimation of defect_rate in stage 2, and used it to change parameters in
dynamic programming and extension of ends.
2003-03-12 twu
* stage3.c: Changed limitation on Boyer-Moore search to be a certain number
of hits. This compensates for the fact that smaller oligomers will occur
more frequently than longer ones, and that longer ones are more
statistically significant.
* stage3.c, stage3.h: Limited length of Boyer-Moore search at ends. Changed
name of minendsearch to minendtrigger.
* params.c, params.h, snap.c, gmap.c: Changed name of minendsearch to
minendtrigger.
2003-03-11 twu
* dynprog.c: Extended the search range of bridge_gap, so that it finds
introns even at the bounds of the dynamic programming.
* params.c, params.h, snap.c, gmap.c, stage3.c, stage3.h: Added parameter
for minendsearch.
* dynprog.c: Fixed safety check in intron_score for reading off end of
segment.
* stage3.c: Rearranged computation of stage 3, such that middle is computed
first, then cDNA direction is recomputed, then 5' and 3' ends are computed.
* pair.c, pair.h: Added function for computing cDNA direction from list of
pairs.
* dynprog.c: Adjusted various dynamic programming scores. Fixed coordinates
in gap. Added check for very short introns.
* stage3.c: Discrimination between paired gap dynamic programming at ends
and in middle.
* dynprog.c, dynprog.h: Major rewrite of dynamic programming procedures.
Changed from Gotoh algorithm to pure banded procedure. Reversing
sequences when necessary, so all computations are symmetric.
2003-03-10 twu
* sim4_compress.pl, spidey_compress.pl: Added output of the number of exons.
* stage3.c: Added check for genomejump being zero or negative, which would
give rise to a position beyond the genomic segment.
* pair.c: Added check for zero denominator.
* boyer-moore.c: Added check for sequence to consist entirely of valid
nucleotides.
2003-03-09 twu
* spidey_compress.pl: Added printing of exon lengths, intron lengths, and
dinucleotides, to match new output of snap_compress.pl. Fixed problems
with parsing Spidey output.
* sim4_compress.pl: Added printing of exon lengths, intron lengths, and
dinucleotides, to match new output of snap_compress.pl.
* snap_compress.pl, gmap_compress.pl, gmap_compress.pl.in: Fixed problem
when reverse intron is GT-AG.
* snap_uncompress.pl, gmap_uncompress.pl, gmap_uncompress.pl.in: Fixed bug
that occurs when snap was called with -N, without printing intron lengths.
2003-03-08 twu
* stage1.c: Fixed a memory leak from not freeing Stage1_T object.
* block.c: Fixed bug which caused a memory leak because we were overwriting
a previous querypos.
* oligo.c: Fixed debug message.
2003-03-07 twu
* snap.c, gmap.c: Reduced stage1size for short query sequences (< 60 bp).
* match.c, match.h, stage1.c, stage1.h: Fixed Match_print to print the
correct oligo.
* get-genome.c: Changed header to contain the version number.
* snap_uncompress.pl, gmap_uncompress.pl, gmap_uncompress.pl.in: Added exon
lengths to compressed output.
* snap_compress.pl, gmap_compress.pl, gmap_compress.pl.in: Added exon
lengths to compressed output. Removed printing of dinucleotides for
canonical introns.
2003-03-06 twu
* stage3.c: Cleaned up code extensively. Added Boyer-Moore searches on both
ends of cDNA.
* dynprog.c, dynprog.h: Cleaned up code by making separate procedures for
single gap in middle, and 5' and 3' ends.
* pair.c, pair.h: Added procedure for dumping a list of pairs.
* boyer-moore.c: Removed debugging statements.
2003-03-04 twu
* Makefile, boyer-moore.c, boyer-moore.h: Addition of Boyer-Moore string
search.
* stage3.c: Consolidated peelback code. Beginning to insert Boyer-Moore
code.
* stage2.c: Fixed bug where index was -1.
* block.c, block.h, oligo.c, oligo.h, stage1.c: Fixed code to use stage1size
instead of INDEX1PART in certain places.
* snap_compress.pl, gmap_compress.pl, gmap_compress.pl.in: Fixed code to
handle genomic accession when genomic sequence is provided by the user.
* pair.c, sequence.c, sequence.h, snap.c, gmap.c: Fixed code to print out
genomic accession when genomic sequence is provided by the user.
* match.c: Fixed code to print just forward oligo.
2003-03-03 twu
* intron.c, pair.c: Changed '===...===' to represent a non-canonical intron.
* dynprog.c: Reduced reward to semi-canonical introns to be slightly less
than that for canonical introns.
* stage3.c: Changed output of large gaps from '=========' to '###...###'.
* snap_uncompress.pl, gmap_uncompress.pl, gmap_uncompress.pl.in: Made
changes to accommodate enhancements to SNAP, namely use of '#' for large
gaps and switch of intronends and intronlengths info.
* snap_compress.pl, gmap_compress.pl, gmap_compress.pl.in: Initial import
into CVS.
2003-03-02 twu
* snap_uncompress.pl, gmap_uncompress.pl, gmap_uncompress.pl.in: Initial
import into CVS.
* Makefile, dynprog.c, intron.c, intron.h, pair.c, splice-site.c: Removed
use of splice site matrices and added identification of semi-canonical
dinucleotides.
2003-02-28 twu
* params.c, params.h, snap.c, gmap.c, stage3.c, stage3.h: Changed
extramaterial at the end and for paired to be parameters.
* get-genome.c: Changed program to check only first four letters of genomic
name.
2003-02-11 twu
* pair.c, pair.h, stage3.c: Adjusted goodness score of alignment by number
of canonical introns.
* dynprog.c, dynprog.h, params.c, params.h, snap.c, gmap.c, stage3.c,
stage3.h: Parameterized band size in dynamic programming and increased
bands for cross-species alignment.
* oligoindex.c, oligoindex.h, params.c, params.h, snap.c, gmap.c, stage2.c,
stage3.c, stage3.h: Parameterized INDEXSIZE and made it different for
cross-species alignment.
* stage2.c: Added smooth_path step in stage 2 to remove short spurious exon
hits.
2003-02-03 twu
* params.c, params.h: Replaced dbroot with version.
* snap.c, gmap.c: Added reporting of version to program.
2003-01-27 twu
* stage3.c: Fixed problem where a base pair was missed on the 5' end.
* stage2.c: Fixed problems where genomic matches can overlap.
* pair.c: Fixed problems in computing exon endpoints.
2003-01-22 twu
* stage3.c: Reverted back to old method of building pairs in the middle.
* pair.c: Added post-processing check for a gap at the end of the alignment.
* oligoindex.c: Added check for poly-T.
2003-01-03 twu
* stage3.c: Made some changes to eliminate large gaps at the 3' end.
* snap.c, gmap.c: Improved handling of case where user provides both cDNA
and genomic files.
* stage3.c: Fixed bug when no pairs are found.
* sequence.c: Fixed bug in failing to initialize.
2002-12-30 twu
* params.c, params.h: Added parameter for fwdonlyp.
* pairpool.c: Fixed small memory leak.
* params.h: Changed genomeinvert from a bool to an int.
* pair.c: Fixed bug where pointer was advanced before freeing it.
2002-12-11 twu
* snap.c, gmap.c: Fixed problem where complement table was not initialized
early enough.
2002-12-10 twu
* sequence.c, sequence.h, snap.c, gmap.c: Improved procedure for trimming
poly-A tails.
* pair.c: Increased space for positions from 12 to 14.
* complement.h: Removed extraneous semicolon.
* stage2.c: Fixed problem where no matching 8-mers are found.
2002-12-04 twu
* snapindex.c, gmapindex.c: Write accession names to .aux file, even if they
do not start with NT_ or GA_.
* snap.c, gmap.c: Added routines for adding signs to chromosomes, inverting
the genome, printing intron lengths, and trimming poly-A tails.
* pair.c, pair.h, params.c, params.h, stage3.c, stage3.h: Added routines for
adding signs to chromosomes, inverting the genome, and printing intron
lengths.
* sequence.c, sequence.h: Added routines for trimming poly-A tails.
* chrnum.c, chrnum.h, match.c: Added routines for adding signs to
chromosomes.
* Makefile, complement.c, complement.h, genome.c: Added files for handling
complements.
2002-11-26 twu
* match.c, pair.c: Changed printing of FWD/REV to +/-. Added printing of
intron lengths.
* pair.c, pair.h, params.c, params.h, snap.c, gmap.c, stage3.c, stage3.h:
Added ability to print genome first in alignment.
2002-11-25 twu
* snap.c, gmap.c: Added iteration code for cross-species alignments.
* Makefile, matchpair.c, matchpair.h: Added an object Matchpair_T to hold
pairs of Match_T objects.
* segmentpos.c: Added a check against freeing a null value.
* result.c, result.h: Created a Stage1_T object that can hold state, for
resuming stage 1 calculations later.
* pair.c, stage3.c: Changed definition of coverage to be based on length of
query sequence that aligns.
* dynprog.c: Changed allocation procedures for Matrix_T and Directions_T.
Provided hooks for doing band-limited memory clearing, but this won't work
with the Gotoh P1 and Q1 matrices.
* stage2.c: Added a seenone check to protect against long stretches of N's
in the genome.
* stage1.c, stage1.h: Created a Stage1_T object that can hold state, for
resuming stage 1 calculations later. Stage1_T contains a list of
Matchpair_T objects, and some procedures have been moved to matchpair.c.
* params.c, params.h: Added parameters for crossspecies and changed name of
maxextend to maxstutter.
2002-11-20 twu
* dynprog.c, dynprog.h, stage3.c: When calling dynprog, now passing pointers
to subsequence rather than copying subsequences.
* block.c, block.h: Simplified procedure for processing oligos by Block_T
object.
* params.c, params.h: Added parameters for stage1size and maxlookback.
* pair.c, pair.h, stage3.c, stage3.h: Added counts for unknowns and
reporting of coverage.
* oligoindex.c: Increased size of memory blocks from 10 to 50.
* oligoindex.c: Replaced realloc function with explicit calls to calloc and
free, because Third Degree reported occasional errors with realloc.
* indexdb.c, indexdb.h, oligo.c, oligo.h, snap.c, gmap.c, stage1.c,
stage1.h: Major change to stage 1 procedure to work on either 24-mers or
18-mers.
* mem.c: Added blank line.
* match.c, match.h: Added procedure for Match_copy and simplified Match_new.
* genome.c: Inlined procedure fill_buffer.
* genome.c: Simplified routine for fill_buffer.
2002-11-15 twu
* indexdb.c, indexdb.h: Added code for ignoring poly A hits. Added
procedure for reading 12-mer positions.
* pair.c: Removed debugging statement.
* block.c, match.c, match.h, oligo.c, stage1.c, stage1.h: Parameterized
stage1size.
2002-11-11 twu
* pair.c, pair.h: Distinguished between mismatches and indels. Fixed cases
where gaps need to be merged (e.g., affy.HGU95A.34233_i_at, which created
===...======...=== when an 8-mer fell into a gap and was then aligned to
either end of the gap by dynamic programming.
* params.c, params.h: Added flag for low stringency.
* stage3.c, stage3.h: Changed definition of LARGEQUERYGAP to be maxlookback.
Distinguished between mismatches and indels.
* snap.c, gmap.c: Changed definition of LARGEQUERYGAP to be maxlookback.
Added flag for lowstringency (12-mers).
* dynprog.c, dynprog.h, stage2.c: Changed definition of LARGEQUERYGAP to be
maxlookback.
* result.c: Improved check on whether to free array in result.
* params.c, params.h, stage2.c, stage2.h: Made maxlookback a parameter.
* snap.c, gmap.c: Introduced heap memory for each thread for dynamic
programming. Made maxlookback a parameter.
* stage3.c, stage3.h: Introduced heap memory for each thread for dynamic
programming. Restricted peelback for consecutive positions.
* dynprog.c, dynprog.h: Introduced heap memory for each thread for dynamic
programming.
2002-11-08 twu
* dynprog.c: Changed dynamic programming procedure to be banded.
* stage2.c, stage3.c: Revised stage 2 procedure to jump every INDEXSIZE,
keep track of consecutive matches, and have a maximum lookback. Changed
stage 3 procedure accordingly, including increasing peelback to INDEXSIZE.
* snap.c, gmap.c: Changed default behavior to be ordered output.
* genome.c: Revmoed pre-loading for genome, and used madvise(MADV_DONTNEED)
instead.
2002-11-07 twu
* dynprog.c: Added a zero gap penalty on the ends. Changed mismatch penalty
to be less than a match penalty, and reduced intron reward accordingly.
* stage3.c: Added a peelback on the 5' end, because it's just like half of a
paired gap alignment.
* snap.c, gmap.c: Removed hack used for debugging.
* stage2.c: Introduced concept of a maximum lookback, and will now go beyond
the previous limit if no hit has been found.
* genome.c: Changed genome to be pre-paged when user specifies it.
* indexdb.c: Changed type of i from int to size_t.
2002-11-06 twu
* pair.c, pair.h, stage3.c: Changed pairs in stage 3 object to be allocated
as a separate block, so they can be output at a later time.
* blackboard.c, blackboard.h, reqpost.c, reqpost.h, request.c, request.h,
result.c, result.h, snap.c, gmap.c: Updated multithreading system to
handle ordered output with better throughput by adding an output queue.
* genome.c, indexdb.c, pair.c: Added header for string.h to eliminate
compiler warnings about strlen type.
* pairpool.c: Increased chunk size from 10000 to 20000.
* stage2.c: Added debugging comments to generate a graph.
* stage3.c: In-lined calls to List_T and Pair_T accessor functions.
* pair.c, stage1.c: In-lined calls to List_T accessor functions.
* list.c, list.h: Added function to return value of the last element of a
list.
2002-11-05 twu
* pairpool.c, pairpool.h: Removed calls to realloc(), because they do not
preserve pointer values. Replaced with allocation of chunks of memory as
needed.
* dynprog.c, stage2.c: Changed two-dimensional matrices to be
one-dimensional with pointer.
* blackboard.h, snap.c, gmap.c: Made minor tweaks to blackboard object,
primarily alterating ninputs and noutputs outside the lock, and changing
signal of end of output to be a null result.
* reqpost.c, reqpost.h: Added orderedp flag to send only appropriate signals.
* blackboard.c: Made minor tweaks to blackboard object, primarily alterating
ninputs and noutputs outside the lock.
2002-11-04 twu
* dynprog.c, list.c, listdef.h, pairpool.c, pairpool.h, stage2.c, stage3.c:
Added a pool of List_T cells for each thread to reduce heap contention.
* Makefile, blackboard.c, dynprog.c, dynprog.h, genome.c, genome.h, pair.c,
pairdef.h, pairpool.c, pairpool.h, reqpost.c, reqpost.h, request.c,
request.h, sequence.c, sequence.h, snap.c, gmap.c, stage2.c, stage2.h,
stage3.c, stage3.h: Provided each worker thread with separate sources of
heap memory for genomic sequence and for Pair_T objects. Intended to
reduce heap contention.
* stage2.c: Created define for MAXLOOKBACK.
* stage1.c, stage1.h: Changed constants to be based on those in indexdb.h.
* indexdb.c, indexdb.h: Changed stage 1 lookup to be based on 12-mers,
rather than 8-mers.
2002-11-02 twu
* indexdb.c: Implemented binary search on third 8-mer.
* snap.c, gmap.c: Allowed user to specify full path of database in the -d
flag.
* Makefile: Added stopwatch to Makefile.
* indexdb.c: Changed preloading of indexdb to touch each page effectively,
not by using memcpy(), which fails to load in pages.
* sequence.c: Added check for first call to fgetc(input) being EOF.
2002-11-01 twu
* pair.c: Changed print routine to work properly on user-supplied genomic
segments.
* genome.c, indexdb.c: Changed pre-load to use fread/fopen/fwrite, rather
than memcpy, which fails to load the pages into memory.
* Makefile, block.c, block.h, oligo.c, oligo.h, params.c, params.h, snap.c,
snapindex.c, gmap.c, gmapindex.c, stage1.c, stage1.h: Changed stage 1
database to use index table of 8-mers, rather than a hash table of 24-mers.
* stopwatch.c, stopwatch.h: Added stopwatch function to program.
* genome.c, genome.h: Added batch mode by using mmap/memcpy, but this
appears to fail on a clustered file system.
* indexdb.c, indexdb.h: Implemented Indexdb_T as a substitute for Hashdb_T.
2002-10-31 twu
* stage2.c: Changed stage 2 procedure to consider both forward and reverse
complement introns in one pass. Fixed a small bug in intron_score to
require position >= 2.
2002-10-29 twu
* stage2.c, stage3.c: Replaced calls to Sequence_char with direct array
access.
* Makefile, blackboard.h, genome.c, genome.h, hashdb-read.c, match.c,
offset.c, oligoindex.c, pair.c, reqpost.c, reqpost.h, segmentpos.c,
snap.c, gmap.c, stage2.c, stage3.c: Made various fixes for compiler
warnings.
* stage3.c: Separated procedures for middle single gap and end single gap.
Decreased size of single gap dynamic programming procedure for 5' and 3'
ends to have genomejump = 2*queryjump.
* snap.c, gmap.c: Increased default extension to 30000 nt.
* dynprog.c: Prevented horizontal jumps on 3' end of splice site. Adjusted
score parameters.
2002-10-28 twu
* dynprog.c, oligoindex.h, stage2.c: Changed oligomer size in stage 2 from
10 to 8, and adjusted dynamic programming parameters accordingly.
Prevented genomic gap at the 5' edge of an intron. Made initial
cdna_direction test more robust.
* snap.c, gmap.c: Fixed calls to SNAP that don't involve any sequence (the
-C and -L flags).
* stage3.c: Reduced minimum intron size from 10 to 9.
* stage1.c: Substituted the constant HASHSIZE for 24.
* dynprog.c: Increased reward for intron. A score of 10 fails to identify a
canonical intron with a gap.
* pair.c: Fixed misreporting of query start coordinate.
* snap.c, gmap.c: Fixed small memory leak.
2002-10-27 twu
* Makefile, dynprog.c, splice-site.c, splice-site.h: Added splice site
calculations to find best intron.
* dynprog.c, dynprog.h, stage2.c, stage2.h, stage3.c, stage3.h: Improved
stage 2 dynamic programming procedure to consider introns (only for
consecutive query positions), to compute gap penalty based on difference
of genomejump and queryjump, and to consider cDNA directions separately.
* snap.c, gmap.c: Improved handling of arguments for database search and for
alignment to genomic segment.
* stage1.c, stage1.h: Fixed stage 1 to consider Watson and Crick strands
separately.
* params.c, params.h, snap.c, gmap.c, stage1.c, stage1.h: Made extension in
stage 1 a user-definable parameter.
* blackboard.c, blackboard.h, pair.c, request.c, request.h, sequence.c,
sequence.h, snap.c, gmap.c: Provided ability to align cDNA against
user-provided genomic segment.
* dynprog.c: Gave credit to half introns.
2002-10-26 twu
* genome.c, genome.h, oligoindex.c, oligoindex.h, snap.c, gmap.c, stage1.c,
stage1.h, stage2.c, stage2.h, stage3.c, stage3.h: Changed genomicseg to be
of type Sequence_T.
* block.c: Changed debug flag.
2002-10-25 twu
* Makefile, request.c, request.h, sequence.c, sequence.h, snap.c, gmap.c,
stage1.c, stage1.h, stage2.c, stage2.h, stage3.c, stage3.h: Renamed
Queryseq_T to Sequence_T.
* queryseq.c, queryseq.h: Renamed Queryseq_T to Sequence_T, to allow genomic
sequences to be represented this way.
* pair.c, pair.h, stage3.c, stage3.h: Simplified argument lists of some
functions.
* params.c, params.h, result.c, result.h: Allowed first-order approximation
using stage1 results.
* stage2.c: Increased extension on left and right to find small terminal
exons.
* stage1.c, stage1.h: Fixed assessment of whether getpair succeeded or
failed.
* match.c, match.h, snap.c, gmap.c: Added first-order approximation, to use
just stage1 results.
* block.c, block.h, oligo.c: Fixed bugs in Block_next_to_stoppos when the
query sequence has many non-ACGT characters.
* Makefile, snap.c, gmap.c: Made compressed genome the default.
* hashdb-read.c, hashdb-write.c: Reverted to old hashtable format, which
contains only two arrays.
2002-10-24 twu
* dynprog.c, dynprog.h, pair.c, pair.h, params.c, params.h, snap.c, gmap.c,
stage3.c, stage3.h: Added diagnostic mode to print out asterisks instead
of vertical bars where dynamic programming was done.
* genome.c, genome.h: Added ability to read compressed genomes.
* Makefile, gencompress.c: Added compression routine for genomes.
* blackboard.c, blackboard.h, reqpost.c, reqpost.h, snap.c, gmap.c: Added
anyorder behavior to blackboard, and made it default.
2002-10-22 twu
* stage2.c: Removed code for memory freeing of positions, which is now
performed by Oligoindex_T.
* oligoindex.c: Changed type of positions from void ** to unsigned int **,
to make code clearer and more robust.
* pair.c, pair.h, params.c, params.h, snap.c, gmap.c, stage3.c, stage3.h:
Added option to print universal genomic coordinates.
* genome.c, hashdb-read.c: Changed mmap to from MAP_PRIVATE to MAP_SHARED.
* genome.c, genome.h, params.c, params.h, snap.c, gmap.c, stage2.c,
stage2.h: Changed Genome_T to be memory-mapped, rather than using fopen,
which is needed for multithreading.
* stage1.c: Fixed bug where salvage procedure fails to find anything.
* mem.c: Enhanced mem.c to give actual location of failure.
* oligoindex.c, oligoindex.h, snap.c, gmap.c, stage2.c, stage2.h: Changed
algorithm for stage 2 to allocate genomic positions dynamically in
Oligoindex_T. To limit number of positions stored, we prescan the
queryseq to see what oligomers are relevant.
* stage2.c: Reverted back to previous Stage 2 strategy where we stored
genomic sequence in oligoindex and scanned query sequence.
* oligoindex.c, oligoindex.h: Simplified routines greatly.
2002-10-21 twu
* stage2.c: Changed stage 2 strategy to index the query sequence rather than
the genomic sequence. This should result in some speed up.
* reader.c, reader.h, stage1.c: Made Reader_new function depend on sequence
rather than Queryseq_T object.
* list.c, list.h: Added function List_index.
* oligoindex.c, oligoindex.h: Hard-coded interval, rather than passing it in.
* pair.c, snap.c, gmap.c: Added flag to avoid showing contig coordinates.
* params.c, params.h: Added Params_T object.
* Makefile, blackboard.c, blackboard.h, reqpost.c, reqpost.h, request.c,
request.h, result.c, result.h, snap.c, gmap.c: Major change to make
program multithreaded. Introduced Blackboard_T, new Reqpost_T, and new
Result_T objects.
* stage3.h: Added header file.
* stage2.c, stage2.h: Cleaned up procedures. Passed in querylength via
queryseq.
* stage1.c, stage1.h: Cleaned up procedures. Made hashinterval a constant.
* queryseq.c, sequence.c: Removed macro for DEBUG2.
* oligoindex.c, oligoindex.h: Allowed offsets for oligoindex to be created
separately (for individual worker threads.)
* block.c: Changed name from Result_T to Match_T.
* block.c, match.c, match.h, stage1.c, stage1.h, stage2.c, stage2.h: Changed
name of Result_T to Match_T.
* result.c, result.h: Renamed Result_T to Match_T.
* stage3.c: Turned off debug statements.
2002-10-20 twu
* queryseq.c, sequence.c: Fixed small memory leak.
* Makefile, align.c, align.h, snap.c, gmap.c, stage2.c, stage2.h, stage3.c,
stage3.h: Created Stage 3 and moved part of Stage 2 commands there.
* Makefile, align.c, align.h, queryseq.c, queryseq.h, reader.c, reader.h,
sequence.c, sequence.h, snap.c, gmap.c, stage1.c, stage1.h, stage2.c,
stage2.h: Added a separate Queryseq_T object, and moved some functions
from Reader to Querypos.
* pair.c: Removed bottom ruler.
* block.c, block.h, snap.c, gmap.c, stage1.c, stage1.h: Removed
multithreading from stage 1 (hash table reads).
* dynprog.c: Made maximize_entry inline to speed up dynamic programming.
* stage2.c: Added another 24 (hashsize) to extensions. Without this, for
some reason, we miss the ends.
* Makefile, dynprog.c, penalties.c, penalties.h: Removed Penalties_T object
in order to increase speed.
* stage2.c: Changed extension to be based on remaining distance from end.
2002-10-19 twu
* snap.c, gmap.c: Eliminated parameters maxentries and indexsize.
* oligoindex.c, oligoindex.h: Made changes to improve speed, by eliminating
unnecessary arrays.
* stage2.c, stage2.h: Made changes to improve speed, including making
build_pairs_middle iterative and using the fill_oligo function where
possible.
* align.c: Changed Pair_T object to reflect the actual case of the query
sequence.
* oligoindex.c: Simplified construction of Oligoindex_T object.
* Makefile, align.c, align.h, oligoindex.c, oligoindex.h, snap.c, gmap.c,
stage2.c, stage2.h: Made indexsize a hardcoded parameter. Allocated space
for Oligoindex once at beginning of program.
* align.c, stage2.c: Using Pair_T object instead of Result_T object
throughout stage 2.
* stage1.c: Fixed memory leak.
* align.c, align.h, stage2.c: Major change to improve stage 2 efficiency.
Using arrays instead of lists for the dynamic programming alignment.
* oligoindex.c: Changed debug statements from fprintf to printf.
* offset.c, offset.h: Added back to repository.
2002-10-18 twu
* stage1.c, stage2.c: Allowed alignments even if we can't find a matching
pair on the 5' and 3' ends.
* oligoindex.c: Changed debug statements.
* align.c: Removed debug statement.
* Makefile: Added rule for counting lines of code.
2002-10-17 twu
* Makefile, segmentpos.c, segmentpos.h, snap.c, snapgenerate.c, gmap.c:
Generated dump procedure to work on either the text offset file or the
offset BerkeleyDB. Added accession length to the output.
* match.c, match.h, result.c, result.h: Cleaned up unused or obsolete
procedures.
* hashdb-read.c: Made hashindex memory mapped again. Added madvise()
commands to help with memory mapping.
* stage1.c: Changed algorithm for stage 1 to extend for 2 hash intervals
past the first connectable pair of hits.
* block.c, block.h: Added ability to stop block at a certain position.
2002-10-15 twu
* dynprog.c, dynprog.h, stage1.c, stage2.c: Added counts of matches and
mismatches on dynamic programming of single gaps, and used this to exclude
dynamic programming results on 5' and 3' ends.
* snap.c, gmap.c, stage2.c, stage2.h: Increased EXTENSION from 90 to 1000.
Included check for genomicpos2 against chromosomal length.
* hashdb-read.c, hashdb-write.c: Reading hashindex into memory instead of
memory mapping it.
* dynprog.c: Cosmetic changes to debug macro.
* align.c: Cleaned out unused code.
* snap.c, gmap.c: Added flag for specifying maxentries (in stage 2).
* stage2.c: Fixed one-off error on requesting dynamic programming of 5' end.
2002-10-14 twu
* hashdb-write.h: Added log file.
* snapindex.c, gmapindex.c: Fixed minor bug relating to log file.
* hashdb-write.c: Added file pointer for a log file.
* snapindex.c, gmapindex.c: Raised default maxentries value from 5 to 20.
Added file pointer for a log file.
* hashdb-write.c: Commented out monitoring statements.
* hashdb.c: Developed new hash function to give the same hash value for an
oligo and its reverse complement. This should improve page access for the
hash lookup.
* stage2.c: Improved debugging statement.
* snap.c, gmap.c: Removed effect of maxentries in stage 2, which was causing
some alignments to be short.
* align.c: Changed debug statements from fprintf to printf.
* hashdb-read.c: Fixed problem in binary search where we subtracted 1U from
0U.
* dynprog.c: Allowed dynamic programming to identify introns even if
lowercase.
* database.c: Reformatting.
* hashdb-read.c: Fixed bug where function was returning NULL prematurely.
* snapindex.c, gmapindex.c: Made second pass read only on auxfile.
2002-10-13 twu
* Makefile, block.h, database.c, database.h, match.c, oligo.h, request.h,
result.c, snap.c, snapindex.c, gmap.c, gmapindex.c, stage1.h: Removed
traces of PureDB package.
* Makefile, cell.c, cell.h, snapindex.c, gmapindex.c: Changed snapindex
program to a two-pass process. The first pass saves the .aux file, and
the second pass creates the hash table. This simplifies the Cell_T object
greatly.
* cell.c, cell.h, hashdb-write.c, snapindex.c, gmapindex.c: Sorting cell
entries by hashvalue then by oligo.
2002-10-12 twu
* hashdb-read.c: Fixed minor bug in binary search routine.
2002-10-11 twu
* Makefile, block.c, hashdb-read.c, hashdb-write.c: Changed structure of
hashdb to have three tables: oligo_offset, oligos, and positions.
* hashdb-write.c: Checking totalsize of contents and setting file size
initially to that.
* hashdb-read.c: Memory mapping hash contents now.
2002-10-10 twu
* snapgenerate.c: Fixed memory leaks.
* snap.c, gmap.c: Fixed memory leak from failure to free Offset_T object.
* snap.c, gmap.c: Fixed bug where datadir was freed.
* hashdb-read.c: Fixed bug where memory mapped offsets were freed.
* dynprog.c: Fixed bug where cL+1 or cL+2 exceeded length2L.
* oligo.c: Revised procedures to handle lowercase letters in the query
sequence.
* hashdb-read.c: Fixed bug where nentries wasn't being set.
* Makefile: Divided hashdb into separate read and write files.
* block.c, block.h, oligo.c, oligo.h, request.c, request.h, snap.c, gmap.c:
Changing from PureDB to our own Hashdb_T.
* stage1.c, stage1.h: Changing from PureDB to our own Hashdb_T. Also fixed
bug where results3 was not being initialized to NULL.
* snapindex.c, gmapindex.c: Divided hashdb file into a separate read and
write file.
* snapgenerate.c: Using Offset_T object now, after we have written the
chromosome file.
* hashdb-write.c: Fixed bug causing unaligned access errors, by splitting
header into two 4-byte unsigned ints.
* hashdb-read.c: Fixed bug where length = 0. Also fixed bug causing
unaligned access errors, by splitting header into two 4-byte unsigned ints.
* hashdb-read.c, hashdb-read.h, hashdb-write.c, hashdb-write.h, hashdb.c,
hashdb.h: Split Hashdb functions into separate read and write files.
* hashdb.c, hashdb.h: Provided option to switch between unsigned long and
unsigned int for hashoffset_t.
* hashdb.c, hashdb.h: Changed offsets to be memory-mapped rather than read
by file.
* Makefile, hashdb.c, hashdb.h, snapindex.c, gmapindex.c: Changing hash
database to our own format.
* snapindex.c, gmapindex.c: Fixed bug where last oligo would not get stored.
2002-10-09 twu
* pair.c, segmentpos.c: Added missing header file for commafmt.
* Makefile: Revised object files needed for snapindex and snapgenerate.
* Makefile, accpos.c, add-chrpos-to-endpoints.c, block.c, block.h, cell.c,
database.c, database.h, match.c, match.h, offsetdb.c, offsetdb.h,
result.c, result.h, segmentpos.c, segmentpos.h, snap.c, snapgenerate.c,
snapindex.c, gmap.c, gmapindex.c, stage1.c, stage1.h: Changed offset reads
from a database to a structure read from a flat file.
* get-genome.c: Added -U flag to generate unmasked sequences.
* offset.c, offset.h: Renamed files from offset.* to offsetdb.h
* snap.c, gmap.c: Implemented print_details.
* stage1.c, stage1.h: Implemented print_details. Fixed problem where
dominated bounds were not being eliminated.
* stage2.c: Increased the peelback to identify introns. Added debugging
statements.
* align.c: Fixed greediness for finding introns. Removed gap penalty and
reward for intron. Instead, implemented a tie breaker for scores based on
genomic distance. Increased the peelback to identify introns.
2002-10-08 twu
* align.c, block.c, match.c, match.h, result.c, result.h, snap.c, gmap.c,
stage1.c, stage1.h, stage2.c: Changed stage 1 of algorithm to find bounds
using 5' and 3' hits.
* Makefile, pair.c, pair.h, stage2.c: Changed goodness to be differences of
matches and mismatches.
2002-10-07 twu
* dynprog.c: Changed recursive functions of traceback and scoreback to be
iterative.
* snap.c, gmap.c, stage2.c: Added check for large query gaps and avoided
doing dynamic programming on those. Also added check for allpaths being
NULL from stage 2.
* align.c: Toggled DEBUG.
* Makefile, snap.c, gmap.c, stage2.c, stage2.h: Added ability to print
alignment summaries only.
* pair.c: Ignored N's in computing percent identity.
* Makefile, pair.c, pair.h, stage2.c: Added number of exons to calculations
and output.
* snap.c, gmap.c, stage2.c, stage2.h: Made alignment procedure the default.
Now sorting paths based on the goodness of the alignment.
* pair.c, pair.h, stage2.c: Removed npairs from some parameter lists.
* pair.c, pair.h, stage2.c, stage2.h: Added calculation for goodness, based
on percent identity.
* match.c, pair.c, pair.h, result.c, segmentpos.c, segmentpos.h, snap.c,
gmap.c, stage2.c, stage2.h: Now printing endpoints based on alignments, if
available.
* list.c: Fixed bug in List_last.
* list.c, list.h: Added a List_last procedure.
* Makefile, match.c, match.h, result.c, result.h, snap.c, gmap.c, stage1.c,
stage1.h, stage2.c, stage2.h: Created a Stage2_T object and reorganized
calculations, in preparation for using the alignments to rank the results.
* Makefile, snap.c, gmap.c: Added parameter for maxaligns, the maximum
number of alignments to print.
2002-10-06 twu
* dynprog.c: Fixed read of unallocated hash.
* align.c: Fixed read of uninitialized variable.
* align.c, dynprog.c, pair.c, pair.h, stage2.c: Added ability to recognize
introns in revcomp direction, and to print correct indices for Crick
strand matches.
* match.c, result.c: Simplified use of zerobasedp.
* snap.c, gmap.c, stage1.c, stage1.h, stage2.c, stage2.h: Changed variable
names to distinguish between hashsize and indexsize.
* dynprog.c, genome.c, stage2.c: Fixed errors with the sequence and genomic
indices.
* stage1.c: Removed list reversal to match new scheme for doing stage 1
dynamic programming.
* align.c, pair.c: Enhanced debugging information.
* align.c: Revised code to make sure that we don't pick unwanted paths after
the first. We set the usedp flags and recompute dynamic programming on
subsequent rounds to avoid using those results. This should affect only
stage 1, because maxpaths equals 1 on stage2.
* align.c: Removed gappenalty for stage 1 computation. This was causing
problems with multiple paths for HER2.
* pair.c, pair.h, stage2.c: Added procedure for summary of exons.
* snap.c, gmap.c: Made printout slightly better.
* match.c, result.c: Fixed miscount on number of matches.
* pair.c: No change.
* genome.c, genome.h: Added modules to retrieve genome sequences.
* dynprog.c: Minor restructuring of procedures.
* dynprog.c: Fixed coordinates in gap. Changed gap output for non-introns.
* pair.c: Added printing of rulers in alignments.
* stage2.c: Fixed memory leaks.
2002-10-05 twu
* dynprog.c: Fixed major problem in paired gap assessments. Need to
subtract, not add, the entry in the right matrix.
* stage2.c: Changed criteria for single and paired gaps, based on a minimum
intron length. Created special case for the 3' end.
* penalties.c: Changed middle gap penalties to have bigger opening and
smaller extend penalties.
* dynprog.c, dynprog.h: Changed concepts from short and long gaps to single
and paired gaps.
* stage2.c: Added peelback procedure to help identify correct intron.
Otherwise, the greedy oligo matching procedure can mask the intron
boundaries.
* dynprog.c: Fixed bug for traceback on longgap, where we didn't start from
the lower right cell.
* align.c, stage2.c: Increased size of stage 2 oligos from 8 to 10.
* align.c, oligoindex.c, oligoindex.h, snap.c, gmap.c, stage2.c, stage2.h:
Added ability to limit maxentries in stage 2.
* dynprog.c: Changed alignment character for dynamic programming to help
with debugging.
* dynprog.c, stage2.c: Implemented dynamic programming across long gaps.
* dynprog.c: Reordered priorities in traceback to be (1) continue in same
direction, (2) diagonal, (3) vertical, and (4) horizontal.
* dynprog.c, dynprog.h, penalties.c, penalties.h, stage2.c: Cleaned up
dynamic programming code for the three cases of FIVE, MIDDLE, and THREE.
Added stub for dynamic programming of long gaps.
* pair.c, pair.h, stage2.c: Made improvements to the alignment output.
2002-10-04 twu
* match.c, pair.c, pair.h, result.c, snap.c, gmap.c, stage2.c: Added
improvements to the alignment output.
* stage2.c: Added code to handle the 5' end properly.
* penalties.c: Changed some values for the penalty parameters.
* dynprog.c: Changed opening penalties to not include the extension. Added
special procedures for 5' and 3' ends of sequence, essentially
implementing part of Smith-Waterman on each end. Added special cases in
traceback for 5' and 3' ends, but may not be necessary in light of the
other changes.
* stage2.c: Added querypos and genomepos to the Pair object. Reorganized
various functions.
* pair.c, pair.h: Added querypos and genomepos to the Pair object.
* reader.h: Added another option to cDNAEnd_T.
* align.c: Fixed the precise bounds around an intron.
* Makefile, dynprog.c, dynprog.h, penalties.c, penalties.h: Added penalties
object. Provided ability to specify different penalties for left, middle,
and right part of sequence.
* stage2.c: Moved printing procedure to another file. Fixed small bug that
caused us to miss printing a base.
* pair.c, pair.h: Removed printing of loci names from alignment.
* Makefile, align.c, dynprog.c, dynprog.h, pair.c, stage2.c: Added dynamic
programming routine to take care of small gaps.
* Makefile, align.c, align.h, match.c, match.h, matrix.c, matrix.h,
oligoindex.c, oligoindex.h, pair.c, pair.h, path.c, path.h, penalties.c,
penalties.h, reader.c, reader.h, result.c, result.h, snap.c, gmap.c,
stage1.c, stage1.h, stage2.c, stage2.h: Major change to algorithm to have
two stages: one using hash table (24-mers) and another using an index
table (8-mers). Still need to incorporate a dynamic programming step for
gaps in the final alignment.
2002-10-02 twu
* whats_on: Changed program to work with new data directory for alignment
results.
2002-10-01 twu
* snap.c, gmap.c: Fixed problem where intronlen == 0. Now requiring
intronlen > 0. Added extra carriage return when zero paths found.
2002-09-27 twu
* Makefile: Reduced number of object files used in SNAP.
* get-genome.c: Fixed use of fscanf to match the .chromosome and .contig
file format.
* path.c, path.h: Simplified call to Path_compute to eliminate scoremat.
* penalties.c, penalties.h: Added procedure to create a default penalties
object.
* match.c, result.c: Added line for number of matches.
* snap.c, gmap.c: Fixed bug where resultlist was uninitialized. Allowed
resultstring of 0. Simplified call to Path_compute.
* whats_on: Added -R flag for release number.
2002-09-25 twu
* intlist.c, intlist.h, scoremat.c, scoremat.h: No longer need Intlist_T or
Scoremat_T.
* Makefile, path.c, path.h, snap.c, gmap.c: Removing Sequence_T. Using char
* instead to represent sequences.
* reader.c, reader.h: Added Reader_pointer function.
* penalties.c, intlist.c, matrix.c, scoremat.c: Using CALLOC/FREE macros.
* snap.c, gmap.c: Inadvertent commit. Adding routines to perform
nucleotide-level dynamic programming.
* ring.c, ring.h: Removed Ring_T. Apparently not used by other seqalign
files.
* path.c, path.h: Premature commit. Adding routines to analyze only
submatrices.
* Makefile: Adding files from seqalign.
* intlist.c, intlist.h: Added files from seqalign to do nucleotide-level
dynamic programming.
* get-genome.c: Added flag for release string. Changed type of positions
from long to unsigned int.
* offset.c, offset.h, offsetdb.c, offsetdb.h: Added datadir to
Offset_read_file.
* match.c, match.h, result.c, result.h: Added Result_path command.
* snapgenerate.c, snapindex.c, gmapindex.c: Simplified strcpy/strcat calls
to sprintf.
* matrix.c, matrix.h, penalties.c, penalties.h, ring.c, ring.h, scoremat.c,
scoremat.h, path.c, path.h: Added to program for doing nucleotide-level
dynamic program. Taken from seqalign.
* path.c: Inadvertent commit. Still editing.
2002-09-24 twu
* segmentpos.c, segmentpos.h, snapindex.c, gmapindex.c: Added
superaccessions to accsegmentpos_db.
2002-09-23 twu
* radixsort.c: Fixed syntax error when monitoring is turned off.
* Makefile, radixsort.c: Added monitoring routine for radix sort.
2002-09-19 twu
* snapgenerate.c: Removed debug line.
* snapindex.c, gmapindex.c: Added option for using lowercase characters.
2002-09-18 twu
* Makefile, snapgenerate.c: Added program snapgenerate, to create text
.chromosome, .contig, and .chromosome files.
* snapindex.c, gmapindex.c: Clarified the variable auxfile.
* snapindex.c, gmapindex.c: Clarified the variable dbroot.
2002-09-17 twu
* oligo.c: Made comment to explain Third Degree warning.
* Makefile, block.c, block.h, match.c, match.h, reqpost.c, reqpost.h,
result.c, result.h, snap.c, gmap.c: Made changes to sample query sequence
at a test interval and perform dynamic programming.
2002-09-16 twu
* endpoints.c, endpoints.h: Removed from source.
2002-09-12 twu
* block.c: Changed debug flag.
* Makefile: Changed C compiler flags.
* snap.c, gmap.c: Changed default directory to be in /usr/seqdb2_nb.
2002-08-30 twu
* endpoints.c, match.c, match.h, result.c, result.h, snap.c, gmap.c: Made
changes to facilitate garbage collection, including adding a matchedp flag
to results, and putting singleton results into an endpoint.
* block.c: Changed debug messages.
* endpoints.c, endpoints.h: Changed print routine. Added code for query
length.
* snap.c, gmap.c: Added consolidation of endpoints, and ranking of those to
generate a single result.
2002-08-29 twu
* block.c, match.c, oligo.c, result.c, segmentpos.c: Added debug macros.
* endpoints.c, endpoints.h: Added commands for sorting endpoints and testing
for adjacency.
* reader.c: Fixed test when startptr == endptr.
* snap.c, gmap.c: Implemented divide-and-conquer strategy on query sequence.
2002-08-28 twu
* snapindex.c, gmapindex.c: Turned off printing of subaccession messages.
2002-08-22 twu
* snapindex.c, gmapindex.c: Added timing statistics.
* block.c, oligo.c: Fixed coordinate calculations. May need to check.
* snapindex.c, gmapindex.c: Added dump function.
* radixsort.c, radixsort.h: Changed accessor function to get a character
rather than a pointer. Fixed algorithm for case where byte equals strlen.
* cell.c, cell.h: Changed accessor function to get a character rather than a
pointer.
* Makefile, rsort-check.c, rsort-test.c: Added a test and check routine for
radixsort.
* Makefile, snapindex.c, gmapindex.c: Removed unnecessary files for
snapindex.
2002-08-21 twu
* Makefile, block.c, block.h, endpoints.c, endpoints.h, match.c, match.h,
offset.c, offsetdb.c, oligo.c, oligo.h, readcirc.c, reader.c, reader.h,
request.h, result.c, result.h, snap.c, gmap.c: Major change to implement
divide-and-conquer strategy.
* read.c, read.h: Changed name of file from read.c to readcirc.c
* block.c, block.h, endpoints.c: Partial changes to implement
divide-and-conquer strategy.
* snapindex.c, gmapindex.c: Improved diagnostic messages.
* Makefile, offset.c, offsetdb.c, radixsort.c, radixsort.h, read.c,
readcirc.c, segmentpos.c, snapindex.c, gmapindex.c: Fixed minor compiler
warnings.
* cell.c: Using pointers rather than lists to store multiple positions for
an oligo. Fixed quicksort compare function accordingly.
* snapindex.c, gmapindex.c: Using pointers rather than lists to store
multiple positions for an oligo.
* radixsort.c: Added small speed hacks.
* Makefile: Added quicksort as an option.
* Makefile, cell.c, cell.h, radixsort.c, radixsort.h, snapindex.c,
gmapindex.c: Added radix sort as a replacement for quicksort.
2002-08-20 twu
* oligo.c: Fixed key_size for partial bytes.
* snapindex.c, gmapindex.c: Changed location of oligo file to be in dbenv
directory, not a subdirectory.
2002-08-15 twu
* Makefile, block.c, block.h, oligo.c, oligo.h, request.c, request.h,
snap-withenv.c, snap.c, snapindex.c, gmap.c, gmapindex.c: Made changes to
accommodate sizes less than 32-mers.
* match.c, match.h, offset.c, offset.h, offsetdb.c, offsetdb.h, result.c,
result.h, snap.c, gmap.c: Added ability to read chromosome information
from file, but not done by default right now.
2002-08-11 twu
* Makefile, block.c, block.h, dpentry.c, dpentry.h, endpoints.c,
endpoints.h, match.c, match.h, result.c, result.h, snap.c, gmap.c: Changed
algorithm to work inward from both ends and find a single match.
2002-08-10 twu
* Makefile, snapindex.c, gmapindex.c: Fixed program to handle cases where
interval is less than size.
2002-07-19 twu
* segmentpos.c: Changed type of querylen.
* whats_on: Changed suffix for db filenames.
* iit-read.c, iit.c, interval-read.c, interval-read.h, interval.c,
interval.h: Changed binary storage format to be a single file.
* get-genome.c: Changed input format to accept a single string.
2002-07-13 twu
* iit_get.c: Added ability to query symbolic db.
2002-07-12 twu
* iit_get.c: Fixed bugs in the algorithm.
* iit_get.c: Allowed user to specify a single point, rather than an interval.
* iit_store.c: Added an output message when Berkeley DB file is done.
* iit_get.c, iit_store.c: Integrated interval tree into
db_load/retrieve_endpoints.
* Makefile, basic.h, iit-read.c, iit.c, interval-read.c, interval-read.h,
interval.c, interval.h: Rewrote interval tree to handle interval queries
and to write tree to and read tree from files.
2002-07-11 twu
* basic.h, iit-read.c, iit.c: Added code for integer interval trees from
Edelsbrunner's alpha shapes.
* get-genome.c: Added ability to convert a single coordinate.
* prb.c: Fixed minor typos.
* prb.c, prb.h: Revised format and separated interface from implementation.
* prb.c, prb.h: Added routines for red-black trees with parent pointers from
libavl 2.0
2002-07-10 twu
* dpentry.c: Changed criterion to consider query coverage.
* endpoints.c: Fixed problem with negative relative positions.
2002-07-09 twu
* endpoints.c, endpoints.h, snap.c, gmap.c: Revised output format of SNAP.
Coordinates are now given for each accession.
* offset.c, offset.h, offsetdb.c, offsetdb.h, snap.c, gmap.c: Revised
chromosome dump procedure to print lengths as well as offsets.
* Makefile, add-chrpos-to-endpoints.c: Added program for adding chromosomal
position to endpoints.
* whats_on: Modified program to work with new version of SNAP.
2002-07-08 twu
* dpentry.c, match.c, match.h, result.c, result.h, snap.c, gmap.c: Restored
nleads as a criterion in dynamic programming. Added features to help with
debugging.
* Makefile, get-genome.c: Freed get-genome from using BerkeleyDB databases,
which are too slow to open.
* get-genome.c: Added ability to report coordinates.
* iit_get.c: Added check for zero matches.
* whats_on: Preliminary changes (inadvertent checkin).
* Makefile, get-genome.c: Created get-genome program.
* accpos.c: Added offset for chromosomes.
* accpos.c, database.c, database.h, snap.c, snapindex.c, gmap.c,
gmapindex.c: Changed location of data files.
2002-07-07 twu
* iit_get.c: Fixed bug with testing dumpp.
2002-07-06 twu
* endpoints.c, endpoints.h, match.c, result.c: Added check for boomerang
paths, where the genomic length is 0.
2002-07-05 twu
* whats_on: Added whats_on from ../snap.
* spidey_compress.pl: Added meta-level compression.
* spidey_compress.pl: Added spidey_compress.pl from ../snap.
* iit_get.c: Added dump utility.
* iit_get.c, iit_store.c: Added programs for storing and retrieving records
based on endpoints.
2002-07-04 twu
* sim4_uncompress.pl: Added retrieval function for get-genome.
2002-07-03 twu
* sim4_compress.pl, sim4_uncompress.pl: Added further compression by
counting repeated tokens.
* sim4_compress.pl, sim4_uncompress.pl, util: Added sim4
compression/uncompression routines from snap CVS archive.
* dpentry.c, match.c, result.c: Changed from using slopes (quotients) to
intron measurements (differences).
* endpoints.c, snap.c, gmap.c: Changed output slightly, e.g., en-dash for
number ranges.
* Makefile, accpos.c: Created program accpos, for finding genomic position
of accessions.
* segmentpos.c, segmentpos.h: Added procedure for finding partially matching
accessions.
* snapindex.c, gmapindex.c: Made creation of aux-only database faster.
* snap.c, gmap.c: Fixed small bug in error message.
* segmentpos.c, segmentpos.h: Made Segmentpos_print extern.
* database.c: Changed accsegmentpos_db from hash to B-tree.
* database.c, database.h, snap.c, snapindex.c, gmap.c, gmapindex.c: Merged
two database procedures.
* segmentpos.c, segmentpos.h: Added procedure for reading from
accsegmentpos_db.
* Makefile, cell.c, cell.h, database.c, database.h, endpoints.c,
endpoints.h, segmentpos.c, segmentpos.h, snap.c, snapindex.c, gmap.c,
gmapindex.c: Added another database, from accession name to segmentpos,
and renamed databases.
2002-07-02 twu
* Makefile, block.c, block.h, snap.c, gmap.c: Added specification for
minimum separation between leads.
* Makefile: Removed segmentpos dump flag from db.test
* endpoints.c, endpoints.h, snap.c, gmap.c: Changed to 1-based coordinates
as default.
* snapindex.c, gmapindex.c: Removed segment dump, because it can be
performed by snap.
* snap.c, gmap.c: Added several command-line options.
* segmentpos.c, segmentpos.h: Enhanced dump procedure to report absolute
genomic positions.
* match.c, match.h, result.c, result.h: Storing signed genome_coverage into
dpentry and checking for impossible slopes (< 0.9).
* offset.c, offset.h, offsetdb.c, offsetdb.h: Added dump procedure.
* endpoints.c: Added printing of subaccessions for Celera genome. Added
commas to output of positions. Changed dominated function to look for any
overlap instead of complete coverage.
* Makefile, dpentry.c, dpentry.h: Changed comparison function to use slopes.
* chrnum.c: Added check for uninitialized chromosome.
2002-07-01 twu
* block.h, buffer-thread-attempt.c, buffer-thread-attempt.h, buffer.c,
buffer.h, dbentry.c, dbentry.h, entry.c, entry.h, hits.c, hits.h, oligo.c,
sort.c, sort.h, table.c, table.h: Removed unused files.
* Makefile, block.c, block.h, database.c, database.h, endpoints.c,
endpoints.h, hash-oligos.c, hit.c, hit.h, match.c, match.h, oligo.c,
oligo.h, request.c, request.h, result.c, result.h, scan.c, scan.h,
segmentpos.c, snap.c, snapindex.c, gmap.c, gmapindex.c: Changed oligo_db
from BerkeleyDB to PureDB. Created object for endpoints. Removed unused
files.
2002-06-29 twu
* segmentpos.c, segmentpos.h, snap.c, gmap.c: Added genomic position to the
output.
* match.c, result.c, snap.c, gmap.c: Fixed memory leaks.
* block.c, block.h, dpentry.c, dpentry.h, match.c, match.h, request.c,
request.h, result.c, result.h, snap.c, gmap.c: Added minimum spanning
tree. Version appears to work well.
* Makefile, block.c, block.h, dpentry.c, dpentry.h, match.c, match.h,
request.c, request.h, result.c, result.h, snap.c, gmap.c: Early version of
dynamic programming that stores H best paths at each hit.
2002-06-28 twu
* match.c, match.h, result.c, result.h, segmentpos.c, snap.c, gmap.c: Added
simple dynamic programming and best pair techniques.
* Makefile, block.c, block.h, database.c, database.h, match.c, match.h,
oligo.c, oligo.h, reqpost.c, request.c, request.h, result.c, result.h,
snap.c, snapindex.c, gmap.c, gmapindex.c: Implemented working version of
snap that uses multiple oligo_dbs with requests and strings results
together from 5' and 3' ends.
* commafmt.c, commafmt.h: Added source code for adding commas to numbers.
2002-06-25 twu
* Makefile: Added specification of directory for dbenv.
* database.c, snapindex.c, gmapindex.c: Added provisions for transactions,
to try to speed up build of database.
2002-05-29 twu
* snapindex.c, gmapindex.c: Allowed the user to specify a directory for the
BerkeleyDB environment.
2002-05-27 twu
* Makefile, segmentpos.c, snapindex.c, gmapindex.c: Added specification of
segmentfile as flag -g.
* Makefile, database.c, database.h, snapindex.c, gmapindex.c: Removed
genome_db and delta_db from snapindex.
* Makefile, segmentpos.c, segmentpos.h, snapindex.c, gmapindex.c: Added
ability to dump segments (in order) from segmentpos_db
* Makefile: Changed flags for C compiler.
* segmentpos.c: Added check to get previous segment only in some cases.
2002-05-22 twu
* Makefile, oligo.c, oligo.h, read.c, read.h, readcirc.c, readcirc.h,
scan.c, scan.h, segmentpos.c, segmentpos.h, snap.c, gmap.c: Working
version of snap using a scan of genomic and delta information.
2002-05-21 twu
* Makefile, cell.c, cell.h, database.c, database.h, hit.c, hit.h, offset.c,
offset.h, offsetdb.c, offsetdb.h, oligo.c, oligo.h, scan.c, scan.h,
snap.c, snapindex.c, gmap.c, gmapindex.c, table.c, table.h: Made changes
to store delta position of genomic oligos and to store oligos of query
sequence.
2002-05-08 twu
* Makefile, database.c, database.h, hash-oligos.c, snap.c, snapindex.c,
gmap.c, gmapindex.c: Consolidated sample-oligos and hash-oligos into
snapindex. Specified oligo dbtype by using Berkeley DB constants.
2002-05-03 twu
* Makefile, cell.c, cell.h, database.c, database.h, hit.c, read.c,
readcirc.c, sample-oligos.c: Separated database commands for oligos from
the other database (aux).
2002-04-26 twu
* Makefile, hit.c, oligo.c, read.c, readcirc.c, snap.c, gmap.c: Removed
environment. Began implementation of dynamic programming.
* Makefile, chrnum.c, chrnum.h, database.c, database.h, hash-oligos.c,
hit.c, hit.h, offset.c, offset.h, offsetdb.c, offsetdb.h, oligo.c,
oligo.h, read.c, read.h, readcirc.c, readcirc.h, snap.c, gmap.c:
Re-implementation of SNAP using new database created by hash-oligos.
2002-04-24 twu
* segmentpos.c, segmentpos.h: Handled problems with chromosome string to
integer conversions.
* hash-oligos.c: Handled problems with chromosome string to integer
conversions. Rearranged calls to db->open so that each db is opened only
once.
* Makefile, btree.c, btree.h, hash-oligos.c, hash.c, hash.h, oligo.c,
oligo.h: Consolidated code into fewer files.
* Makefile: Changed CFLAGS to optimize speed.
* cell.h: Matched up .h file with .c file.
* Makefile, genomicpos.c, genomicpos.h, hash-oligos.c, segmentpos.c,
segmentpos.h: Now storing genomic locations as global positions, which
require keeping track of chromosomal offsets.
* Makefile, btree.c, btree.h, cell.c, cell.h, entry.c, entry.h,
genomicpos.c, genomicpos.h, hash-oligos.c, sample-oligos.c: Major change
to allow B-trees, to avoid storing adjacent oligos, to store genomic
positions, and to write oligos in binary format.
2002-04-22 twu
* Makefile, assert.c, assert.h, block.c, block.h, bool.h,
buffer-thread-attempt.c, buffer-thread-attempt.h, buffer.c, buffer.h,
cksum-fa.c, cksum.c, dbentry.c, dbentry.h, entry.c, entry.h, except.c,
except.h, hash-oligos.c, hash-test.c, hash.c, hash.h, hits.c, hits.h,
list.c, list.h, match.c, match.h, mem.c, mem.h, oligo.c, oligo.h, read.c,
read.h, readcirc.c, readcirc.h, reqpost.c, reqpost.h, request.c,
request.h, result.c, result.h, sample-oligos.c, snap-withenv.c, snap.c,
sort.c, sort.h, src, gmap.c: Initial import into CVS.
2000-05-08 paf
* config, cvswrappers, loginfo, modules, CVSROOT, checkoutlist, commitinfo,
editinfo, notify, rcsinfo, taginfo, verifymsg: initial checkin
2000-05-08 (no author)
* branches, tags, trunk: Standard project directories initialized by cvs2svn.
|