File: jellyfish.md

package info (click to toggle)
multiqc 1.9%2Bdfsg-3
  • links: PTS, VCS
  • area: main
  • in suites: bullseye
  • size: 27,568 kB
  • sloc: python: 28,657; javascript: 3,233; sh: 74; makefile: 24
file content (19 lines) | stat: -rw-r--r-- 1,194 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
---
Name: Jellyfish
URL: http://www.cbcb.umd.edu/software/jellyfish/
Description: >
    JELLYFISH is a tool for fast, memory-efficient counting of k-mers in DNA.
---

JELLYFISH is a tool for fast, memory-efficient counting of k-mers in DNA. A k-mer is a substring of length k, and counting the occurrences of all such substrings is a central step in many analyses of DNA sequence. JELLYFISH can count k-mers using an order of magnitude less memory and an order of magnitude faster than other k-mer counting packages by using an efficient encoding of a hash table and by exploiting the "compare-and-swap" CPU instruction to increase parallelism.

The MultiQC module for Jellyfish parses *only* `*_jf.hist` files. The general usage of jellyfish to be parsed by MultiQC module needs to be:

 - `gunzip -c file.fastq.gz | jellyfish count -o file.jf  -m ...`
 - `jellyfish histo -o file_jf.hist -f file.jf`

In case a user wants to customise the matching pattern for jellyfish, then multiqc can be run with the option `--cl_config "sp: { jellyfish: { fn: 'PATTERN' } }"` where `PATTERN` is the pattern to be matched. For example:

```
multiqc . --cl_config "sp: { jellyfish: { fn: '*.hist' } }"
```