1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185
|
# MACS: Model-based Analysis for ChIP-Seq
     
[](https://pypistats.org/packages/macs3)
Latest Release:
* Github: [](https://github.com/macs3-project/MACS/releases)
* PyPI: [](https://pypi.org/project/MACS3/)
* Bioconda:[](https://anaconda.org/bioconda/macs3)
* Debian Med: [](https://packages.debian.org/stable/macs)[](https://packages.debian.org/sid/macs)
## Introduction
With the improvement of sequencing techniques, chromatin
immunoprecipitation followed by high throughput sequencing (ChIP-Seq)
is getting popular to study genome-wide protein-DNA interactions. To
address the lack of powerful ChIP-Seq analysis method, we presented
the **M**odel-based **A**nalysis of **C**hIP-**S**eq (MACS), for
identifying transcript factor binding sites. MACS captures the
influence of genome complexity to evaluate the significance of
enriched ChIP regions and MACS improves the spatial resolution of
binding sites through combining the information of both sequencing tag
position and orientation. MACS can be easily used for ChIP-Seq data
alone, or with a control sample with the increase of
specificity. Moreover, as a general peak-caller, MACS can also be
applied to any "DNA enrichment assays" if the question to be asked is
simply: *where we can find significant reads coverage than the random
background*.
## Changes for MACS (3.0.2)
### Features added
1) Introduce a new emission model for the `hmmratac` function. Now
users can choose the simpler Poisson emission `--hmm-type poisson`
instead of the default Gaussian emission. As a consequence, the
saved HMM model file in json will include the hmm-type information
as well. Note that in order to be compatible with the HMM model
file from previous version, if there is no hmm-type information in
the model file, the hmm-type will be assigned as gaussian. #635
2) `hmmratac` now output narrowPeak format output. The summit
position and the peak score columns reported in the narrowPeak
output represents the position with highest foldchange value
(pileup vs average background).
3) Add `--cutoff-analysis-steps` and `--cutoff-analysis-max` for
`callpeak`, `bdgpeakcall`, and `hmmratac` so that we can
have finer resolution of the cutoff analysis report. #636 #642
4) Reduce memory usage of `hmmratac` during decoding step, by
writing decoding results to a temporary file on disk (file
location depends on the environmental TEMP setting), then loading
it back while identifying state pathes. This change will decrease
the memory usage dramatically. #628 #640
5) Fix instructions for preparing narrowPeak files for uploading
to UCSC browser, with the `--trackline` option in `callpeak`. #653
6) For gappedPeak output, set thickStart and thickEnd columns as
0, according to UCSC definition.
### Bugs fixed
1) Use `-O3` instead of `-Ofast` for compatibility. #637
### Documentation
1) Update instruction to install macs3 through conda/bioconda
2) Reorganize MACS3 docs and publish through
https://macs3-project.github.io/MACS
3) Description on various file formats used in MACS3.
## Install
The common way to install MACS is through
[PYPI](https://pypi.org/project/macs3/)) or
[conda](https://anaconda.org/macs3/macs3). Please check the
[INSTALL](docs/INSTALL.md) document for detail.
MACS3 has been tested using GitHub Actions for every push and PR in
the following architectures:
* x86_64 (Ubuntu 22, Python 3.9, 3.10, 3.11)
* aarch64 (Ubuntu 22, Python 3.10)
* armv7 (Ubuntu 22, Python 3.10)
* ppc64le (Ubuntu 22, Python 3.10)
* s390x (Ubuntu 22, Python 3.10)
* Apple chips (Mac OS 13, Python 3.11)
In general, you can install through PyPI as `pip install macs3`. To
use virtual environment is highly recommended. Or you can install
after unzipping the released package downloaded from Github, then use
`pip install .` command. Please note that, we haven't tested
installation on any Windows OS, so currently only Linux and Mac OS
systems are supported. Also, for aarch64, armv7, ppc64le and s390x,
due to some unknown reason potentially related to the scientific
calculation libraries MACS3 depends on, such as Numpy, Scipy,
hmm-learn, scikit-learn, the results from `hmmratac` subcommand may
not be consistent with the results from x86 or Apple chips. Please be
aware.
## Usage
Example for regular peak calling on TF ChIP-seq:
`macs3 callpeak -t ChIP.bam -c Control.bam -f BAM -g hs -n test -B -q 0.01`
Example for broad peak calling on Histone Mark ChIP-seq:
`macs3 callpeak -t ChIP.bam -c Control.bam --broad -g hs --broad-cutoff 0.1`
Example for peak calling on ATAC-seq (paired-end mode):
`macs3 callpeak -f BAMPE -t ATAC.bam -g hs -n test -B -q 0.01`
There are currently 14 functions available in MACS3 serving as
sub-commands. Please click on the link to see the detail description
of the subcommands.
Subcommand | Description
-----------|----------
[`callpeak`](docs/callpeak.md) | Main MACS3 Function to call peaks from alignment results.
[`bdgpeakcall`](docs/bdgpeakcall.md) | Call peaks from bedGraph file.
[`bdgbroadcall`](docs/bdgbroadcall.md) | Call nested broad peaks from bedGraph file.
[`bdgcmp`](docs/bdgcmp.md) | Comparing two signal tracks in bedGraph format.
[`bdgopt`](docs/bdgopt.md) | Operate the score column of bedGraph file.
[`cmbreps`](docs/cmbreps.md) | Combine bedGraph files of scores from replicates.
[`bdgdiff`](docs/bdgdiff.md) | Differential peak detection based on paired four bedGraph files.
[`filterdup`](docs/filterdup.md) | Remove duplicate reads, then save in BED/BEDPE format file.
[`predictd`](docs/predictd.md) | Predict d or fragment size from alignment results. In case of PE data, report the average insertion/fragment size from all pairs.
[`pileup`](docs/pileup.md) | Pileup aligned reads (single-end) or fragments (paired-end)
[`randsample`](docs/randsample.md) | Randomly choose a number/percentage of total reads, then save in BED/BEDPE format file.
[`refinepeak`](docs/refinepeak.md) | Take raw reads alignment, refine peak summits.
[`callvar`](docs/callvar.md) | Call variants in given peak regions from the alignment BAM files.
[`hmmratac`](docs/hmmratac.md) | Dedicated peak calling based on Hidden Markov Model for ATAC-seq data.
For advanced usage, for example, to run `macs3` in a modular way,
please read the [advanced usage](docs/Advanced_Step-by-step_Peak_Calling.md). There is a
[Q&A](docs/qa.md) document where we collected some common questions
from users.
## Contribute
Please read our [CODE OF CONDUCT](CODE_OF_CONDUCT.md) and [How to
contribute](CONTRIBUTING.md) documents. If you have any questions,
suggestion/ideas, or just want to have conversions with developers and
other users in the community, we recommend using the [MACS
Discussions](https://github.com/macs3-project/MACS/discussions)
instead of posting to our
[Issues](https://github.com/macs3-project/MACS/issues) page.
## Ackowledgement
MACS3 project is sponsored by [](https://czi.co/EOSS). And we particularly want to thank the user community for their supports, feedbacks and contributions over the years.
## Citation
2008: [Model-based Analysis of ChIP-Seq
(MACS)](https://genomebiology.biomedcentral.com/articles/10.1186/gb-2008-9-9-r137)
## Other useful links
* [Cistrome](http://cistrome.org/)
* [bedTools](http://code.google.com/p/bedtools/)
* [UCSC toolkits](http://hgdownload.cse.ucsc.edu/admin/exe/)
* [deepTools](https://github.com/deeptools/deepTools/)
```{toctree}
:maxdepth: 2
:hidden:
index.md
docs/INSTALL.md
docs/subcommands_index.md
docs/fileformats_index.md
docs/tutorial.md
docs/qa.md
CODE_OF_CONDUCT.md
CONTRIBUTING.md
|