1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394
|
Beta Release 0.1.8 (11 July, 2010)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Notable functional changes:
* Added the `reheader' command which replaces a BAM header with a new
header. This command is much faster than replacing header by
BAM->SAM->BAM conversions.
* Added the `mpileup' command which computes the pileup of multiple
alignments.
* The `index' command now stores the number of mapped and unmapped
reads in the index file. This information can be retrieved quickly by
the new `idxstats' command.
* By default, pileup used the SOAPsnp model for SNP calling. This
avoids the floating overflow in the MAQ model which leads to spurious
calls in repetitive regions, although these calls will be immediately
filtered by varFilter.
* The `tview' command now correctly handles CIGARs like 7I10M and
10M1P1I10M which cause assertion failure in earlier versions.
* Tview accepts a region like `=10,000' where `=' stands for the
current sequence name. This saves typing for long sequence names.
* Added the `-d' option to `pileup' which avoids slow indel calling
in ultradeep regions by subsampling reads locally.
* Added the `-R' option to `view' which retrieves alignments in read
groups listed in the specified file.
Performance improvements:
* The BAM->SAM conversion is up to twice faster, depending on the
characteristic of the input.
* Parsing SAM headers with a lot of reference sequences is now much
faster.
* The number of lseek() calls per query is reduced when the query
region contains no read alignments.
Bug fixes:
* Fixed an issue in the indel caller that leads to miscall of indels.
Note that this solution may not work well when the sequencing indel
error rate is higher than the rate of SNPs.
* Fixed another issue in the indel caller which may lead to incorrect
genotype.
* Fixed a bug in `sort' when option `-o' is applied.
* Fixed a bug in `view -r'.
APIs and other changes:
* Added iterator interfaces to random access and pileup. The callback
interfaces directly call the iterator interfaces.
* The BGZF blocks holding the BAM header are indepedent of alignment
BGZF blocks. Alignment records shorter than 64kB is guaranteed to be
fully contained in one BGZF block. This change is fully compatible
with the old version of samtools/picard.
Changes in other utilities:
* Updated export2sam.pl by Chris Saunders.
* Improved the sam2vcf.pl script.
* Added a Python version of varfilter.py by Aylwyn Scally.
(0.1.8: 11 July 2010, r613)
Beta Release 0.1.7 (10 November, 2009)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Notable changes:
* Improved the indel caller in complex scenariors, in particular for
long reads. The indel caller is now able to make reasonable indel
calls from Craig Venter capillary reads.
* Rewrote single-end duplicate removal with improved
performance. Paired-end reads are not touched.
* Duplicate removal is now library aware. Samtools remove potential
PCR/optical dupliates inside a library rather than across libraries.
* SAM header is now fully parsed, although this functionality is not
used in merging and so on.
* In samtools merge, optionally take the input file name as RG-ID and
attach the RG tag to each alignment.
* Added FTP support in the RAZF library. RAZF-compressed reference
sequence can be retrieved remotely.
* Improved network support for Win32.
* Samtools sort and merge are now stable.
Changes in other utilities:
* Implemented sam2vcf.pl that converts the pileup format to the VCF
format.
* This release of samtools is known to work with the latest
Bio-Samtools Perl module.
(0.1.7: 10 November 2009, r510)
Beta Release 0.1.6 (2 September, 2009)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Notable changes:
* In tview, do not show a blank screen when no reads mapped to the
corresponding region.
* Implemented native HTTP support in the BGZF library. Samtools is now
able to directly open a BAM file on HTTP. HTTP proxy is also
supported via the "http_proxy" environmental variable.
* Samtools is now compitable with the MinGW (win32) compiler and the
PDCurses library.
* The calmd (or fillmd) command now calculates the NM tag and replaces
MD tags if they are wrong.
* The view command now recognizes and optionally prints FLAG in HEXs or
strings to make a SAM file more friendly to human eyes. This is a
samtools-C extension, not implemented in Picard for the time
being. Please type `samtools view -?' for more information.
* BAM files now have an end-of-file (EOF) marker to facilitate
truncation detection. A warning will be given if an on-disk BAM file
does not have this marker. The warning will be seen on BAM files
generated by an older version of samtools. It does NO harm.
* New key bindings in tview: `r' to show read names and `s' to show
reference skip (N operation) as deletions.
* Fixed a bug in `samtools merge -n'.
* Samtools merge now optionally copies the header of a user specified
SAM file to the resultant BAM output.
* Samtools pileup/tview works with a CIGAR with the first or the last
operation is an indel.
* Fixed a bug in bam_aux_get().
Changes in other utilies:
* Fixed wrong FLAG in maq2sam.
(0.1.6: 2 September 2009, r453)
Beta Release 0.1.5 (7 July, 2009)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Notable changes:
* Support opening a BAM alignment on FTP. Users can now use "tview" to
view alignments at the NCBI ftp site. Please read manual for more
information.
* In library, propagate errors rather than exit or complain assertion
failure.
* Simplified the building system and fixed compiling errors caused by
zlib<1.2.2.1.
* Fixed an issue about lost header information when a SAM is imported
with "view -t".
* Implemented "samtool.pl varFilter" which filters both SNPs and short
indels. This command replaces "indelFilter".
* Implemented "samtools.pl pileup2fq" to generate FASTQ consensus from
pileup output.
* In pileup, cap mapping quality at 60. This helps filtering when
different aligners are in use.
* In pileup, allow to output variant sites only.
* Made pileup generate correct calls in repetitive region. At the same
time, I am considering to implement a simplified model in SOAPsnp,
although this has not happened yet.
* In view, added '-u' option to output BAM without compression. This
option is preferred when the output is piped to other commands.
* In view, added '-l' and '-r' to get the alignments for one library or
read group. The "@RG" header lines are now partially parsed.
* Do not include command line utilities to libbam.a.
* Fixed memory leaks in pileup and bam_view1().
* Made faidx more tolerant to empty lines right before or after FASTA >
lines.
Changes in other utilities:
* Updated novo2sam.pl by Colin Hercus, the key developer of novoalign.
This release involves several modifications to the key code base which
may potentially introduce new bugs even though we have tried to minimize
this by testing on several examples. Please let us know if you catch
bugs.
(0.1.5: 7 July 2009, r373)
Beta Release 0.1.4 (21 May, 2009)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Notable changes:
* Added the 'rmdupse' command: removing duplicates for SE reads.
* Fixed a critical bug in the indel caller: clipped alignments are not
processed correctly.
* Fixed a bug in the tview: gapped alignment may be incorrectly
displayed.
* Unified the interface to BAM and SAM I/O. This is done by
implementing a wrapper on top of the old APIs and therefore old APIs
are still valid. The new I/O APIs also recognize the @SQ header
lines.
* Generate the MD tag.
* Generate "=" bases. However, the indel caller will not work when "="
bases are present.
* Enhanced support of color-read display (by Nils Homer).
* Implemented the GNU building system. However, currently the building
system does not generate libbam.a. We will improve this later. For
the time being, `make -f Makefile.generic' is preferred.
* Fixed a minor bug in pileup: the first read in a chromosome may be
skipped.
* Fixed bugs in bam_aux.c. These bugs do not affect other components as
they were not used previously.
* Output the 'SM' tag from maq2sam.
(0.1.4: 21 May 2009, r297)
Beta Release 0.1.3 (15 April, 2009)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Notable changes in SAMtools:
* SAMtools is more consistent with the specification: a) '*' in the
QUAL field is allowed; b) the field separator is TAB only and SPACE
is treated as a character in a field; c) empty header is allowed.
* Implemented GLFv3 support in pileup.
* Fixed a severe bug in fixmate: strand information is wrongly
overwritten.
* Fixed a bug in alignment retrieval: alignments bridging n*16384bp are
not correctly retrieved sometimes.
* Fixed a bug in rmdup: segfault if unmapped reads are present.
* Move indel_filter.pl to samtools.pl and improved the filtering by
checking the actual number of alignments containing indels. The indel
pileup line is also changed a little to make this filtration easier.
* Fixed a minor bug in indexing: the bin number of an unmapped read is
wrongly calculated.
* Added `flagstat' command to show statistics on the FLAG field.
* Improved indel caller by setting the maximum window size in local
realignment.
Changes in other utilities:
* Fixed a bug in maq2sam: a tag name is obsolete.
* Improvement to wgsim: a) added support for SOLiD read simulation; b)
show the number of substitutions/indels/errors in read name; c)
considerable code clean up.
* Various converters: improved functionality in general.
* Updated the example SAM due to the previous bug in fixmate.
(0.1.3: 15 April 2009, r227)
Beta Release 0.1.2 (28 January, 2008)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Notable changes in SAMtools:
* Implemented a Bayesian indel caller. The new caller generate scores
and genotype and is potentially more accurate than Maq's indel
caller. The pileup format is also changed accordingly.
* Implemented rmdup command: remove potential PCR duplicates. Note that
this command ONLY works for FR orientation and requires ISIZE is
correctly set.
* Added fixmate command: fill in mate coordinates, ISIZE and mate
related flags from a name-sorted alignment.
* Fixed a bug in indexing: reads bridging 16x kbp were not retrieved.
* Allow to select reads shown in the pileup output with a mask.
* Generate GLFv2 from pileup.
* Added two more flags for flagging PCR/optical duplicates and for QC
failure.
* Fixed a bug in sort command: name sorting for large alignment did not
work.
* Allow to completely disable RAZF (using Makefile.lite) as some people
have problem to compile it.
* Fixed a bug in import command when there are reads without
coordinates.
* Fixed a bug in tview: clipping broke the alignment viewer.
* Fixed a compiling error when _NO_CURSES is applied.
* Fixed a bug in merge command.
Changes in other utilities:
* Added wgsim, a paired-end reads simulator. Wgsim was adapted from
maq's reads simulator. Colin Hercus further improved it to allow
longer indels.
* Added wgsim_eval.pl, a script that evaluates the accuracy of
alignment on reads generated by wgsim.
* Added soap2sam.pl, a SOAP2->SAM converter. This converter does not
work properly when multiple hits are output.
* Added bowtie2sam.pl, a Bowtie->SAM converter. Only the top hit will
be retained when multiple hits are present.
* Fixed a bug in export2sam.pl for QC reads.
* Support RG tag at MAQ->SAM converter.
* Added novo2sam.pl, a NovoAlign->SAM converter. Multiple hits and
indel are not properly handled, though.
* Added zoom2sam.pl, a ZOOM->SAM converter. It only works with the
default Illumina output.
(0.1.2: 28 January 2008; r116)
Beta Release 0.1.1 (22 December, 2008)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
The is the first public release of samtools. For more information,
please check the manual page `samtools.1' and the samtools website
http://samtools.sourceforge.net
|