File: changes.rst

package info (click to toggle)
python-pybedtools 0.10.0-4
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid, trixie
  • size: 16,620 kB
  • sloc: python: 10,030; cpp: 899; makefile: 142; sh: 57
file content (796 lines) | stat: -rw-r--r-- 33,453 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
.. include:: includeme.rst

Changelog
=========

Changes in v0.10.1
------------------

2024-04-09

* Remove last traces of Python 2.7 support by removing ``six`` dependency (thanks Valentyn Bezshapkin)
* Support building on later C++ toolchains (thanks Cameron Smith)
* Support ``pathlib.Path`` in ``BedTool.cat()`` (fixes #405)
* Improvements to testing: add tests for Python 3.12, more explicity setup/teardown


Changes in v0.9.1
-----------------

2023-07-23

* Dropping support for Python 3.6 and 3.7
* Respect sorting of chromsize files (thanks @mgperry)
* Updated setup.py to correctly reflect the MIT license change elsewhere (`#374
  <https://github.com/daler/pybedtools/issues/374>`, thanks @hyandell)
*  Support plotting lengths of intervals and custom DPI (`#367
   <https://github.com/daler/pybedtools/issues/367>`, `#366
   <https://github.com/daler/pybedtools/issues/366>`), thanks @yunfeiguo)
* Remove outdated hard-coded check for 510 files in ``intersect`` and instead
  defer to local machine's ``ulimit``
* Enabling building/installing on Python 3.11 (thanks @daz10000)
* Allow np.int64 start/stop positions to be used when creating Interval objects (`#390 <https://github.com/daler/pybedtools/issues/390>`)
* properly close filehandles in .save_seq (thanks @PeterRobots)
* include minimal pyproject.toml file (thanks @afg1)


Changes in v0.9
---------------

2022-01-23

The biggest change is that pybedtools is now under the MIT license, following
the lead of BEDTools itself.

Bugfixes:

* Bugfix: `Interval` objects representing VCF lines now have their `start`
  attribute correctly zero-based, as indicated in the docs and consistent with
  other 1-based formats. See `#355 <https://github.com/daler/pybedtools/issues/355>`_.
* Bugfix: Manually creating `Interval` objects using the `otherfields` argument
  now correctly converts to C++ strings. See `#348
  <https://github.com/daler/pybedtools/issues/348>`_.
* Bugfix: Workaround for `BedTool.intersect` which in some versions of BEDTools
  requires a specific order of arguments. Fixes `#345
  <https://github.com/daler/pybedtools/issues/345>`_ and also is a better way
  of addressing `#81 <https://github.com/daler/pybedtools/issues/81>`_.

Code cleanup:

* Removed some remnants of Python 2.7 support (thanks @DavidCain)
* Updates to setup.py classifiers to better reflect state of code (thanks @DavidCain)
* Sorted filenames in setup.py to improve reproducibility of build (thanks @lamby)
* Tests converted to run on GitHub Actions (see `#339
  <https://github.com/daler/pybedtools/pull/339>`_).

Changes in v0.8.2
-----------------

2021-03-13: Minor updates

* Removed scripts directory from installed version. These are still available
  in the GitHub repo, but were causing import issues with Python 3.9 and were
  not well-used in the first place.
* Bugfix: unicode is better handled in gzipped files (thanks @naumenko-sa, see
  `#320 <https://github.com/daler/pybedtools/pull/320>`_)
* Bugfix: correctly ignore warnings even with capital letters (thanks
  @JureZmrzlikar, see `#326 <https://github.com/daler/pybedtools/pull/326>`_)
* Bugfix/improvements: update tests and code to work with Python 3.8 (see `#324
  <https://github.com/daler/pybedtools/pull/324>`_). Also addresses `#322
  <https://github.com/daler/pybedtools/issues/322>`_.
* Improvement: updates tests to work with bedtools v2.30
* Improvement: integration of `genomepy
  <https://github.com/vanheeringen-lab/genomepy>`_, which if installed will
  help retrieve chromsizes files for less commonly used assemblies (thanks
  @simonvh, see `#323 <https://github.com/daler/pybedtools/pull/323/>`_)


Changes in v0.8.1
-----------------

2019-12-27: This version has minor updates and bugfixes:

* Bugfix: Fixes to `pbt_plotting_example.py` (thanks Steffen Möllera @smoe)
* Bugfix: Using `BedTool.saveas()` when a BedTool already points to a compressed file
  correctly saves (`#308 <https://github.com/daler/pybedtools/pull/308>`_)
* Improvement: Deprecate `pandas.read_table` (thanks André F. Rendeiro
  @afrendeiro)
* Improvement: overhaul testing on travis-ci
* Improvement: BedTool objects support either strings or Path objects (`#287
  <https://github.com/daler/pybedtools/pull/287>`_, thanks @drchsl)
* Improvement: MySQL host can be configured (`#301
  <https://github.com/daler/pybedtools/pull/301>`_, thanks André F. Rendeiro
  @afrendeiro)
* Improvement: Better version string parsing (`#289
  <https://github.com/daler/pybedtools/pull/289>`_, thanks Steffen Möllera
  @smoe), fixes `#275 <https://github.com/daler/pybedtools/issues/275>`_ and others.
* Improvement: Proper CRAM support: `#307 <https://github.com/daler/pybedtools/pull/307>`_
* Improvement: Raise an exception when the `-b` argument to `BedTool.intersect` has more
  than 510 files (`#303 <https://github.com/daler/pybedtools/pull/306>`_)
* `*.h` files now included in the distribution (thanks @blaiseli)
* Improvement: Update tests to work with bedtools v2.29.2


Changes in v0.8.0
-----------------
This version further improves testing, improves the way C++ files are included
in the package, and fixes many long-standing bugs.

* Using pytest framework rather than nose for testing
* Updated `setup.py` to be more robust and to more clearly separate
  "cythonization" into .cpp files
* Updated test harness for testing in independent conda environments
* All issue tests go in their own test module
* Included Python 3.7 tests (note that at the time of this writing, pysam is
  not yet available on bioconda so that dependency is pip-installed in the
  test) (`#254 <https://github.com/daler/pybedtools/issues/254>`_)
* Updated tests to reflect BEDTool 2.27.1 output (`#260
  <https://github.com/daler/pybedtools/issues/260>`_`#261
  <https://github.com/daler/pybedtools/issues/261>`_)
* Removed the `contrib.classifier` module, which has been unsupported for
  a while.
* More informative error messages for UCSC tools if they're missing (`#227
  <https://github.com/daler/pybedtools/issues/227>`_)
* BedTool objects that are the result of operations that create files that are
  not BED/GTF/GFF/BAM can be more easily converted to pandas.DataFrame with
  `disable_auto_names=True` arg to `BedTool.to_dataframe()` (`#258
  <https://github.com/daler/pybedtools/issues/258>`_)
* Added aliases to existing methods to match current BEDTools commands, e.g.
  the `BedTool.nucleotide_content` method can now also be called using
  `BedTool.nuc` which is consistent with the `bedtools nuc` command line name.
* New wrapper for `bedtools split`. The wrapper method is called `splitbed` to
  maintain backwards compatibility because `pybedtools.BedTool` objects have
  long had a `split` method that splits intervals based on a custom function.
* New wrapper for `bedtools spacing`.
* `BedTool.from_dataframe` handles NaN in dataframes by replacing with `"."`,
  and is more explicit about kwargs that are passed to `pandas.DataFrame`
  (`#257 <https://github.com/daler/pybedtools/issues/257>`_)
* Raise FileNotFoundError when on Python 3 (thanks Gosuke Shibahara, (`#255
  <https://github.com/daler/pybedtools/issues/255>`_)
* Relocated BEDTools header and .cpp files to the `pybedtools/include`
  directory, so they can more easily be linked to from external packages
  (`#253 <https://github.com/daler/pybedtools/issues/253>`_)
* Add test for (`#118 <https://github.com/daler/pybedtools/issues/118>`_)
* `BedTool.tabix_contigs` will list the sequence names indexed by tabix
  (`#180 <https://github.com/daler/pybedtools/issues/180>`_)
* `BedTool.tabix_intervals` will return an empty generator if the coordinates
  provided are not indexed, unless `check_coordinates=True` in which case the
  previous behavior of raising a ValueError is triggered (`#181
  <https://github.com/daler/pybedtools/issues/181>`_)
* Bugfix: Avoid "ResourceWarning: unclosed file" in `helpers.isBGZIP` (thanks
  Stephen Bush)
* Bugfix: Interval objects created directly no longer have their filetype set
  to None (`#217 <https://github.com/daler/pybedtools/issues/217>`_)
* Bugfix: Fixed the ability to set paths and reload module afterwards (`#218
  <https://github.com/daler/pybedtools/issues/218>`_, `#220
  <https://github.com/daler/pybedtools/issues/220>`_, `#222
  <https://github.com/daler/pybedtools/issues/222>`_)
* Bugfix: `BedTool.head()` no longer uses an IntervalIterator (which would
  check to make sure lines are valid BED/GTF/GFF/BAM/SAM). Instead, it simply
  prints the first lines of the underlying file.
* Bugfix: functions passed to `BedTool.filter` and `BedTool.each` no longer
  silently pass ValueErrors (`#231
  <https://github.com/daler/pybedtools/issues/231>`_)
* Bugfix: Fixed IndexError in IntervalIterator if there was an empty line (`#233
  <https://github.com/daler/pybedtools/issues/233>`_)
* Bugfix: Add additional constraint to SAM file detection to avoid incorrectly
  detecting a BED file as SAM (`#246
  <https://github.com/daler/pybedtools/issues/246>`_)
* Bugfix: accessing Interval.fields after accessing Interval.attrs no longer
  raises ValueError (`#246 <https://github.com/daler/pybedtools/issues/246>`_)

Changes in v0.7.10
------------------
Various bug fixes and some minor feature additions:

* Support for comma-separated lists for `mapBed` (thanks Chuan-Sheng Foo)
* Support many calls to `tabix_intervals` without hitting a "Too many open
  files" error (`#190 <https://github.com/daler/pybedtools/issues/190>`_)
* Clarify documentation for `genome_coverage` when used with default
  parameters (`#113 <https://github.com/daler/pybedtools/issues/113>`_)
* Ignore stderr from samtools on older zlib versions (`#209 <https://github.com/daler/pybedtools/issues/209>`_, thanks Gert Hulselmans)
* Support fetching all regions from a chromosome (`#201 <https://github.com/daler/pybedtools/issues/210>`_, thanks Matt Stone)
* Add wrapper for `shiftBed` (`#200 <https://github.com/daler/pybedtools/issues/200>`_, thanks Saket Choudhary)
* Fix `truncate_to_chrom` in Python 3 (`#203 <https://github.com/daler/pybedtools/issues/203>`_, thanks Saulius Lukauskas)
* When making bigWigs, use `bedSort` to ensure the sort order matches that expected by UCSC tools (`#178 <https://github.com/daler/pybedtools/issues/178>`_)
* Fix newline handling of `pysam.ctabix.TabixIterator` output (`#196 <https://github.com/daler/pybedtools/issues/196>`_)


Changes in v0.7.9
-----------------
Minor bugfix release:

* add `contrib.bigwig.bigwigtobedgraph` (thanks Justin Fear)
* fix `BedTool.seq()` in Python 3
* fix intron creation (`#182 <https://github.com/daler/pybedtools/pull/182>`_, thanks @mmendez12)
* add `six` as an explicit requirement (`#184 <https://github.com/daler/pybedtools/pull/184>`_, thanks @jrdemasi)
* improvements to setup (`<https://github.com/daler/pybedtools/issues/185>`_)
* make pandas fully optional

Changes in v0.7.8
-----------------
* Be more careful about BAM vs bgzipped files (#168)
* `BedTool.bgzip` now preserves the header when sorting
* In Python 3, parsed BEDTools help string is decoded properly
* Ensure integer number of processes in Python 3 (thanks Illa Shamovsky)
* Add details on IOError messages for broken pipe error
* Make converting to pandas.DataFrames easier with non-standard BED files (thanks Panos Firmpas)

Changes in v0.7.7
-----------------
* Chromsizes for dm6 and mm10 assemblies added to `genome_registry`
* Better Python 3 compatibility in the `long_range_interaction` module
* New `featurefuncs.UniqueID` class, useful for ensuring all features in a file
  have a unique ID in their name field.
* Fix error message when a specified genome file doesn't exist (thanks Saket Choudhary)

Changes in v0.7.6
-----------------
* New module `pybedtools.contrib.long_range_interaction` for working with
  HiC-like data.

Changes in v0.7.5
-----------------
* When using tabix-indexed files, `tabix` and `bgzip` are no longer required to
  be installed separately. Only `pysam` is needed.

* Recent BEDTools releases support multiple files for the `-b` argument of
  `bedtools intersect`. This version of `pybedtools` now supports multiple
  files as well. Note that it is still possible to provide a list of strings
  representing intervals as the `b` argument to `BedTool.intersect`. To
  differentiate between a list of intervals and a list of filenames, the first
  item converted into an `Interval` object; if it fails then consider the items
  to be filenames; otherwise assume strings representing intervals. This check
  only occurs if the `b` argument is a list or tuple; other iterable types are
  always assumed to be intervals.

Changes in v0.7.4
-----------------
Bugfix release.

- fix `#147 <https://github.com/daler/pybedtools/issues/147>`_ so that warnings
  are simply passed to the user without raising exceptions
- in setup.py, allow depedencies to have "loose" versions with suffixes like
  "rc1"
- fix in `BedTool.cat()` on empty files (thanks Brad Chapman (`PR #149
  <https://github.com/daler/pybedtools/pull/149>`_)

Changes in v0.7.1
-----------------
This is largely a bugfix release with the following changes:

- fix for some BAM headers (thanks Gabriel Platt)
- unified IntervalIterator to address some streaming issues (fixes #143)
- fix bug where `__add__` was not re-raising exceptions (thanks Brad Chapman
  and Dan Halligan)


Changes in v0.7.0
-----------------
This release reflects a major upgrade in the underlying code in order to
support both Python 2 and Python 3 using the same code. Aside from trivial
things like converting print statements to functions and using `next()` instead
of `.next()`, this required a substantial rewrite to support the way strings
are handled in Python 3 (in Cython and wrapped C++) and how relative modules
work.

Importantly, after converting them to Python 2- and 3-compatible syntax *all
previous tests pass* so to the end user should not notice any differences
except those noted below.

Strings from Interval fields are unicode
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
For consistency between Python 2 and 3, all strings from Interval objects are
now unicode. That is, in Python 2, previously we would get this::

    >>> a = pybedtools.example_bedtool('a.bed')
    >>> a[0].name
    'feature1'

Now, we get this::

    >>> a = pybedtools.example_bedtool('a.bed')
    >>> a[0].name
    u'feature1'


samtools no longer a dependency
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
The dependency for samtools has been removed, which simplifies the installation
process. Instead, `pysam` is used for handling BAM files.

In order for existing tests to pass, `pysam.AlignedSegment` objects are
currently converted to `pybedtools.Interval` objects when iterating over a BAM
file. This will come at a performance cost if you are iterating over all reads
in a BAM file using the `pybedtools.BAM` object.

In the future, iterating over a BAM file will yield `pysam.AlignedSegment`
objects directly, but for now you can use the `pybedtools.BAM.pysam_bamfile`
attribute to access the underlying `pysam.AlignmentFile`

Cython no longer a dependency
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
The Cythonized ``.cxx`` files are now shipped with the `pybedtools`
distribution, so Cython is no longer a requirement for installation.

You will however need to have Cython installed if you're developing pybedtools.

Remote BAM support clarification
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Previously, `pybedtools` was able to support remote BAMs by loosely wrapping
samtools, but BAM files still needed to be fully downloaded to disk before
using with BEDTools. This was done automatically, but through an inefficient
mechanism.

Pysam does support remote BAMs, and as before, a BAM file needs to be created
on disk for use with BEDTools. But now this needs to be explicitly done by the
user, which should result in better performance.


Iterating over intervals
~~~~~~~~~~~~~~~~~~~~~~~~
Previously, when iterating over a `BedTool` object, different machinery would
be invoked depending on whether the BedTool was pointing to a file (a
cbedtools.IntervalFile would be invoked), to another iterator of Interval
objects, or to a stream like from the stdout of a BEDTools call
(cbedtools.IntervalIterator in both cases).

Everything is now an IntervalIterator, simplifying the path towards
performance optimization.

gzip support
~~~~~~~~~~~~
Thanks to Saulius Lukauskas, gzip handling is now improved, and calling
`BedTool.saveas()` with a `.gz` extension on the filename will automatically
compress the output.

Docker
~~~~~~
In the github repo there is a `docker` directory containing Dockerfiles to set
up isolated testing environments. These Dockerfiles also demonstrate how to set
up a complete environment starting from a base Ubuntu install.

Tests
~~~~~
All tests from v0.6.9 (which was Python 2 only) have been made Python 2/3
compatible and all previous tests pass.

If you have docker installed, from the top level directory, you can run the
full tests like this::

    cd docker
    ./full-tests.sh

This will build docker containers for Python 2 and Python 3 with all
depedencies, export the parent directory to the container, and run the test
suite.


Conda packages
~~~~~~~~~~~~~~
You can now install the latest versions of tabix, bedtools, pysam, and
pybedtools from conda, dramatically speeding up installation time. These
mechanisms are used for automated testing as well (see the ``condatest.sh``
script in the github repo).

To use these packages in your own environment(s), specify the `daler` conda
channel like this::

    conda install -c daler pybedtools

Note that this will not install BEDTools or tabix unless you explicitly say
so::

    conda install -c daler pybedtools bedtools tabix

.. note::

    This currently only works on Linux; contributions to Mac conda recipes (see
    the `conda` dir in the github repo) would be welcomed.

Changes in v0.6.9
-----------------
Minor bug fix release.

* improved the automatic field name handling when converting an interval file to
  a `pandas DataFrame`.
* fixed a bug in `IntervalFile` methods `all_hits`, `any_hits` and `count_hits`
  where zero-length features were being counted multiple times (thanks Brent
  Pedersen and Kyle Smith)
* bgzip and tabix paths can now be configured separately (thanks Rob Beagrie)
* fixed a bug where streaming BAM files were read fully into memory (thanks
  Alexey Sergushichev)

Changes in v0.6.8
-----------------

Bugfix: Thanks to Gabriel Pratt, `pybedtools` is no longer plagued by open filehandles
in the C code causing the notorious "Too many files open" error.

Changes in v0.6.7
-----------------
Now compatible with BEDTools v2.21.0.

The one exception is that the new `bedtools intersect` functionality that
allows multiple `-b` files is not yet implemented in `pybedtools`.

New features:

* `BedTool.fisher()` wraps the new BEDTools `fisher` tool.  The result is
  an object containing parsed results.

* `BedTool.colormap_normalize()` accepts a `percentile` argument, useful when
  applying colormaps to data with a handful of extreme outliers

* `BedTool.to_datafame()` converts a `BedTool` object into a `pandas.DataFrame`
  with columns named after the appropriate fields for the filetype (thanks
  Radhouane Aniba for the suggestion)

* `BedTool.tail()` to complement `BedTool.head()` (thanks Radhouane Aniba for
  the suggestion)

* Add hg38 and hg38.default chromsizes

Minor bug fixes:

* Ensure tuple-like args to `parallel_apply` (fixes #109)

* Temp fix for BEDTools v2.20.0 which required the `-w` arg to come before the
  `-s` arg in `bedtools makewindows` (#81)

* Better (i.e., UCSC Genome Browser-compliant) defaults for `featurefuncs.expand_fields`.

* Fix for BedTool.all_hits() and any_hits() which will now show hits for
  zero-length features intersecting with other zero-length features with the
  same coordinates.



Changes in v0.6.6
-----------------
This is a compatibility release, updated for BEDTools v2.20.0.

There is one API change that affects the behavior of overloaded operators (that
is, using `+` and `-` with BedTool objects) when one of the BedTool objects
represents an empty file.

Assume `a` is a BedTool object representing a regular BED file but `b` is
empty.  Previously:

    * a + b = a
    * b + a = b
    * a - b = a
    * b - a = a
    * b - b = b
    * a + a = a

The following changes have been made (indicated in **bold**), which hopefully
make more logical sense:

    * **a + b = b**
    * b + a = b
    * a - b = a
    * **b - a = b**
    * b - b = b
    * a + a = a

Changes in v0.6.5
-----------------
This is a minor bug-fix release:

* Fix for BedTool.all_hits() and any_hits() which will now show hits for
  zero-length features with the same coordinates, like the rest of BEDTools.

* Improved error-handling to avoid Python interpreter crashing in cases when
  a BED file on the filesystem becomes unavailable after a BedTool object has
  been created for it.


Changes in v0.6.4
-----------------

* Full integration with BEDTools v2.18.  This includes some compatibility fixes
  for the new buffered output capabilities of BEDTool `intersect` and wrapping
  the new `bedtools sample` tool.

* Overloaded operators (`+` and `-`) allow empty files as input, even using
  BEDTools v2.18+.

* Travis-CI builds now use BEDTools v2.18+ for tests.

* Fix for :func:`pybedtools.featurefuncs.midpoint` (thanks ny-shao)

* Fix to :meth:`BedTool.randomstats` (thanks Michael Reschen)


Changes in v0.6.3
-----------------

* New :mod:`pybedtools.parallel` module for working with many operations in
  parallel.  See the docs for :func:`pybedtools.parallel.parallel_apply` for
  details.

* :func:`pybedtools.contrib.bigbed.bigbed` for converting to bigBed format,
  along with auto-SQL creation as needed.

* New function :func:`pybedtools.contrib.bigbed.bigbed_to_bed`, so now bigBed
  -> BED and BED -> bigBed interconversions are trivial.

* Support for remote BAMs by passing `remote=True` when creating
  a :class:`BedTool` object

* New method :meth:`BedTool.at` for subsetting a BedTool by a set of (sorted)
  indexes.

* New functions :func:`featurefuncs.gff2bed` and :func:`featurefuncs.bed2gff`
  for use with :meth:`BedTool.each`, for easy converting GFF/GTF to BED

* New function :func:`add_color` for applying matplotlib colormaps to BED
  files; see also new method :meth:`pybedtools.BedTool.colormap_normalize`.

* :class:`pybedtools.plotting.BinaryHeatmap` class for working with results
  from :meth:`BedTool.multi_intersect`.

* :meth:`BedTool.each` now also has some filter capabilities (if provided
  function's return value evaluates to False, feature will be skipped)

* Better detection for samtools (thanks Luca Beltrame)

* Expand BEDToolsError (thanks Ryan Layer)

* Creating a BedTool from a list of intervals now saves to temp file instead of treating
  like a consume-once iterator (#73)

* Various fixes to keyword arg handling to match semantics of BEDTools.

* Command line help and improved docs for the `peak_pie.py` script.

* Fix to GFF attributes (thanks Libor Mořkovský)

* Fix to labels in :mod:`pybedtools.contrib.venn_maker.py` (thanks Luca
  Pinello)

* Make the naive scaling (to million mapped reads) in
  :func:`pybedtools.contrib.bigwig.bam_to_bigwiq` optional.

* Fix for :meth:`BedTool.cat` to handle cases where at least one input is an
  empty file

* Removed SciPy dependency

* Every commit is built with Travis-CI for continuous integration testing of
  changes to source code.

Changes in v0.6.2
-----------------

* Wrapped new tools available in BEDTools 2.17: :meth:`BedTool.jaccard` and
  :meth:`BedTool.reldist` wrap the new `bedtools jaccard` and `bedtools
  reldist` respectively.

* Initial implementations of building blocks for computing statistics,
  :meth:`BedTool.absolute_distance` and :meth:`BedTool.relative_distance`

* :func:`pybedtools.featurefuncs.three_prime`,
  :func:`pybedtools.featurefuncs.five_prime`, and
  :func:`pybedtools.featurefuncs.TSS` modifier functions that can be passed to
  :meth:`BedTool.each`

* :func:`pybedtools.contrib.plotting.binary_heatmap` for visualizing results
  from :meth:`BedTool.multi_intersect`

* Fixed a long-standing issue where streaming :class:`BedTool` objects did not
  close their open file handles (stdout).  When working with many (i.e. tens
  of thousands) files, this caused the operating system to hit its open file
  limit.  This is now fixed.

* :meth:`BedTool.random_op`, a new mechanism for implementing operations that
  you would like to apply over tens of thousands of shuffled interval files.
  This makes it easy to extend the existing :mod:`pybedtools` multiprocessing
  functionality.

* :func:`pybedtools.contrib.bigwig.bam_to_bigwig`, a helper function to create
  a libary-size-scaled bigWig file from an input BAM file.

* :class:`pybedtools.contrib.plotting.TrackCollection` class, which handles
  plotting multiple files at once, using a provided "stylesheet" configuration
  to tweak colors etc.

* :class:`pybedtools.contrib.plotting.BedToolsDemo` and
  :class:`pybedtools.contrib.plotting.ConfiguredBedToolsDemo`, useful for
  running many graphical demos of BEDTools operations using the same
  "stylesheet" configuration.  Run :file:`pybedtools/contrib/plotting.py` for
  a demo.

* chromsizes dictionaries for common assemblies now have a `default` attribute,
  which is an OrderedDict of a default set of chromosome.  For example,
  ``pybedtools.chromsizes('hg19').default`` contains only the entries for the
  autosomes and X and Y.

* :meth:`BedTool.cat` now works better with multiprocessing

* added `include_distribution` kwarg to :meth:`BedTool.randomstats`, which will
  attach the full distribution of all the randomized files to the results
  dictionary.

* New method implementing Jaccard statistic (with pvalue using randomizations):
  :meth:`BedTool.random_jaccard`

* :func:`featurefuncs.extend_fields` helper function to pad fields with `'.'`,
  useful for manipulating features with the :meth:`BedTool.each` method

* Fixed a bug where BAM files, when written to disk via :meth:`BedTool.saveas`,
  were saved as SAM files.

* Better GTF/GFF detection, and if the input had quoted attribute values, then
  the output will, too

* various minor bug fixes and improvments as documented in the github commit
  logs....


Changes in v0.6.1
-----------------
* New :class:`pybedtools.contrib.plotting.Track` class allows plotting of
  features with matplotlib.  The `Track` class subclasses
  `matplotlib.collections.PolyCollection`, making it rather fast for 1000s of
  features.

* See the `scripts/pbt_plotting_example.py` script for a way of visually showing
  the results of BEDTools operations . . . great for teaching BEDTools to new
  users.

* New :meth:`BedTool.liftover` method (needs a chain file from UCSC and the
  `liftover` program installed)

* :class:`BedTool` creation using tuples/lists of values -- everything is
  converted to string before creating an :class:`Interval` object.

* bugfix: :meth:`BedTool.window_maker` now handles the `genome` kwarg correctly

* bugfix: `pybedtools.cleanup(remove_all=True)` now works correctly when using
  the default temp dir


Changes in v0.6
---------------
* Overhaul in online documentation to hopefully make functionality easier to
  find and/or discover.  See :ref:`pybedtools reference` for summary tables of
  the different parts of :mod:`pybedtools`; each entry is linked to further
  class/method/function-specific docs.  These more detailed docs also have
  links to view the source code from within the HTML docs for more exploration.

* :func:`pybedtools.contrib.venn_maker` function that acts as an interface to
  the VennDiagram R package -- just give it some BED files and it'll do the
  rest.

* Debug mode -- :func:`pybedtools.debug_mode` -- for verbose logging messages.

* Fixed an open file leak (OSError: too many open files) that occured when
  opening thousands of streaming bed files in a single session.

* Initial support for tabix files.  Useful for extracting features from
  a single region when you don't need a full intersection.

* New :mod:`pybedtools.contrib` module (in the spirit of Django's `contrib`)
  where higher-level functionality will be built.

* :class:`pybedtools.contrib.Classifier` class for identifying the classes of
  intervals.  Useful for making pie charts of intronic/exonic/intergenic etc
  classes of peaks.  Note that this is somewhat redundant with the new `mapBed`
  program in BEDTools.

* Experimental :class:`pybedtools.contrib.IntersectionMatrix` class for
  handling pairwise intersections of a large number of interval files --
  including a local sqlite3 database to avoid re-computing already up-to-date
  results.

* :class:`Interval` objects are now hashable (it's just a hash of the string
  representation) so that you can use them as dictionary keys.

* :meth:`BedTool.split` method, which accepts a function returning an iterable
  of :class:`Interval` objects. The function is applied to each interval.
  Useful for, say, splitting each gene into TSS, TTS, upstream and downstream
  features.

* :meth:`BedTool.truncate_to_chrom` method, which truncates features to the
  chromosome sizes of the provided genome.  Useful for when you try uploading
  a MACS-generated track to the UCSC genome browser, but it complains because
  peak boundaries have been extended outside chromosome boundaries . . . this
  method fixes the problem.

* :class:`BedTool` objects now have full functionality of :class:`IntervalFile`
  objects -- that is, they have the methods :meth:`BedTool.any_hits`,
  :meth:`BedTool.all_hits`, and :meth:`BedTool.count_hits` for doing
  single-interval tests.  Sometimes this will be faster than using the tabix
  support, sometimes it won't -- it's best to try both, depending on your data.

* String representations of :class:`Interval` objects now have a newline at the
  end, just like a raw lines from a BED/GFF/VCF file.  Previously, this was
  inconsistent and sometimes led to extra blank lines in "streaming"
  :class:`BedTool` instances . . . which in turn led to problems with BEDTools
  programs using the chromsweep algorithm.

* Concatentate multiple files with one call to :meth:`BedTool.cat` (thanks Jake
  Biesinger)

* Wrapped previous BEDTools programs:
    * `unionBedGraphs` (:meth:`BedTool.union_bedgraphs`)
    * `pairToBed` (:meth:`BedTool.pair_to_bed`)
    * `pairToPair` (:meth:`BedTool.pair_to_pair`)
    * `bedpeToBam` (:meth:`BedTool.bedpe_to_bam`)

* Wrapped new BEDTools programs:
    * `mapBed` (:meth:`BedTool.map`)
    * `clusterBed` (:meth:`BedTool.cluster`)
    * `randomBed` (:meth:`BedTool.random`)
    * `multiIntersectBed` (:meth:`BedTool.multi_intersect`)
    * `expandCols` (:meth:`BedTool.expand`)
    * `windowMaker` (:meth:`BedTool.window_maker`)
    * `bamToFastq` (:meth:`BedTool.bam_to_fastq`)

* Made venn_gchart and venn_mpl tests more stable

* Automatic documenting of which args are passed implicitly for BedTool method
  calls

* More robust mechanisms for specifying custom paths for BEDTools installation
  as well as optional tabix, samtools, and R installations.  This makes it
  easier to explicitly specify which versions of the tools to use.

* Improvements to GFF attributes: handle unescaped "=" (from sim4db GFFs) and
  make Attribute class properly dict-like (thanks Libor Mořkovský)

Changes in v0.5.5
-----------------
* Use `additional_args` kwarg to pass arguments verbatim to the underlying
  BEDTools programs.  This is necessary for arguments like
  `genomeCoverageBed`'s `-5` argument, since `5=True` is not a valid Python
  expression.  For example, you can use::

     import pybedtools
     a = pybedtools.example_bedtool('a.bed')
     a.genome_coverage(bg=True, strand='+', genome='hg19', additional_args='-5')

* Brent Pedersen added support for just 2 BED files in the Venn diagram scripts

* :meth:`BedTool.all_hits` uses the underlying BEDTools C++ API to get all hits
  in a file for a particular Interval::

    a = pybedtools.example_bedtool('a.bed')
    interval = Interval('chr1', 1, 5000)
    a.all_hits(interval)

* New semantics for comparisons of Interval objects.  Visual documentation of
  this coming soon.

* More tests for latest BEDTools code

* Interval instances are now pickleable; they can now be used across processes
  for parallel code.


Changes in v0.5
---------------
* support for running random intersections in parallel.  See
  :meth:`BedTool.randomstats` and :meth:`BedTool.randomintersection` (thanks,
  Jake Biesinger)

* Cython `Interval.__copy__()` for compatibility with `copy` module

* `seek()` and `rewind()` methods for `IntervalFile` class, used for Aaron
  Quinlan's new chromsweep algorithm (https://github.com/arq5x/chrom_sweep)
  (thanks, Aaron)

* support and tests for new BEDTools programs `multiBamCov`, `tagBam`, and `nucBed`

* `output="out.bed"` kwarg for all wrapped methods for explicitly specifying
  where to save output -- no more moving tempfiles

* docs improvements:
    * direct comparison with a shell script to illustrate benefit of
      `pybedtools`; see :ref:`shell_comparison`
    * more installation details
    * 0- and 1-based coordinates discussed early on (the 3 brief examples page,
      :ref:`3examples`)
    * development history and open collaboration model (see :ref:`devmodel`)