File: quast.md

package info (click to toggle)
multiqc 1.21%2Bdfsg-3
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 30,164 kB
  • sloc: python: 52,323; javascript: 7,064; sh: 76; makefile: 21
file content (60 lines) | stat: -rw-r--r-- 2,067 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
---
name: QUAST
url: http://quast.bioinf.spbau.ru/
description: >
  A Quality Assessment Tool for Genome Assemblies by
  the Center for Algorithmic Biotechnology.
---

[QUAST](http://quast.bioinf.spbau.ru/) evaluates genome assemblies by
computing various metrics, including

- N50, length for which the collection of all contigs of that length or
  longer covers at least 50% of assembly length
- NG50, where length of the reference genome is being covered
- NA50 and NGA50, where aligned blocks instead of contigs are taken
- Misassemblies, misassembled and unaligned contigs or contigs bases
- Genes and operons covered

The QUAST MultiQC module parses the `report.tsv` files generated by
QUAST and adds key metrics to the report General Statistics table.
All statistics for all samples are saved to `multiqc_data/multiqc_quast.txt`.

#### Configuration

By default, the QUAST module is configured to work with large _de-novo_ genomes,
showing thousands of contigs, mega-base pairs and other sensible defaults.

If these aren't appropriate for your genomes, you can configure them as follows:

```yaml
quast_config:
  contig_length_multiplier: 0.001
  contig_length_suffix: "Kbp"
  total_length_multiplier: 0.000001
  total_length_suffix: "Mbp"
  total_number_contigs_multiplier: 0.001
  total_number_contigs_suffix: "K"
```

The default module values are shown above. See the
[main MultiQC documentation](http://multiqc.info/docs/#configuring-multiqc)
for more information about how to configure MultiQC.

#### MetaQUAST

The QUAST module will also parse output from
[MetaQUAST](http://quast.sourceforge.net/metaquast) runs (`metaquast.py`).

The `combined_reference/report.tsv` file is parsed, and folders
`runs_per_reference` and `not_aligned` are ignored.

If you want to run MultiQC against auxiliary MetaQUAST runs, you must
explicitly pass these files to MultiQC:

```bash
multiqc runs_per_reference/reference_1/report.tsv
```

Note that you can pass as many file paths to MultiQC as you like
and use glob expansion (eg. `runs_per_reference/*/report.tsv`).