---
name: Kraken
url: https://ccb.jhu.edu/software/kraken2/
description: >
  is a taxonomic classification tool that uses exact k-mer matches to find
  the lowest common ancestor (LCA) of a given sequence.
---

The MultiQC module supports outputs from both Kraken and Kraken 2.

It works with report files generated using the `--report` flag, which look like the following:

```tsv
11.66	98148	98148	U	0	unclassified
88.34	743870	996	-	1	root
88.22	742867	0	-	131567	  cellular organisms
88.22	742866	2071	D	2	    Bacteria
87.95	740514	2914	P	1239	      Firmicutes
```
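
As a rough illustration of how one line of such a report can be read, here is a minimal Python sketch. It assumes the standard six-column, tab-separated layout (clade percentage, fragments in the clade, fragments assigned directly to the taxon, rank code, NCBI taxonomy ID, indented name). The `KrakenRow` type and `parse_report_line` function are hypothetical names chosen for this example and are not part of MultiQC or Kraken.

```python
from typing import NamedTuple


class KrakenRow(NamedTuple):
    """One row of a Kraken report (hypothetical helper type)."""

    percent: float         # percentage of fragments covered by the clade
    clade_fragments: int   # fragments covered by the clade rooted at this taxon
    direct_fragments: int  # fragments assigned directly to this taxon
    rank_code: str         # e.g. U, D, P, or "-" for unranked
    tax_id: int            # NCBI taxonomy ID
    name: str              # scientific name, indentation removed


def parse_report_line(line: str) -> KrakenRow:
    """Parse one tab-separated report line (assumes exactly six columns)."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 6:
        raise ValueError(f"expected 6 tab-separated fields, got {len(fields)}")
    percent, clade, direct, rank, tax_id, name = fields
    return KrakenRow(
        percent=float(percent),
        clade_fragments=int(clade),
        direct_fragments=int(direct),
        rank_code=rank.strip(),
        tax_id=int(tax_id),
        name=name.strip(),  # leading spaces only encode the tree depth
    )


row = parse_report_line("88.22\t742866\t2071\tD\t2\t    Bacteria")
print(row.rank_code, row.name, row.clade_fragments)
```

Reports with extra columns would need a different unpacking step; this sketch deliberately rejects them rather than guessing.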

A bar graph is generated showing, for each sample, the number of fragments that
fall into the top 5 categories at each taxonomic rank. The top categories are chosen
by summing each category's library percentage across all samples and keeping those with the highest totals.
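
The selection described above can be sketched in a few lines of Python. This is an illustrative sketch, not the module's actual code: the `top_taxa` function, its input shape (a mapping of sample name to `(rank_code, name, percent)` tuples), and the alphabetical tie-breaking are all assumptions made for the example.

```python
from collections import defaultdict


def top_taxa(samples, rank_code, top_n=5):
    """Return the names of the top_n taxa at one rank.

    samples: {sample_name: [(rank_code, taxon_name, percent), ...]}
    Taxa are ordered by their percentage summed over all samples;
    ties are broken alphabetically (an assumption of this sketch).
    """
    totals = defaultdict(float)
    for rows in samples.values():
        for row_rank, name, percent in rows:
            if row_rank == rank_code:
                totals[name] += percent
    ranked = sorted(totals.items(), key=lambda item: (-item[1], item[0]))
    return [name for name, _ in ranked[:top_n]]


samples = {
    "sample_1": [
        ("D", "Bacteria", 88.22),
        ("P", "Firmicutes", 87.95),
        ("P", "Proteobacteria", 0.20),
    ],
    "sample_2": [
        ("P", "Firmicutes", 10.00),
        ("P", "Proteobacteria", 60.00),
    ],
}
print(top_taxa(samples, "P", top_n=1))
```

Because the percentages are summed over every sample, a taxon that dominates one sample can still be outranked by one that is moderately abundant in many.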

The number of top categories to plot can be customized in the config file:

```yaml
kraken:
  top_n: 5
```