File: TODO.md

package info (click to toggle)
python-seqcluster 1.2.9%2Bds-7
  • links: PTS, VCS
  • area: contrib
  • in suites: forky, sid
  • size: 113,648 kB
  • sloc: python: 5,308; makefile: 175; sh: 122; javascript: 55
file content (57 lines) | stat: -rw-r--r-- 2,356 bytes parent folder | download | duplicates (5)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
# General pipeline

- [ ] normalized using median of gene expression ratio, same idea than DESeq2, for viz purpose  
- [X] add length enrichment function
- [X] implement bayes function to decide which sequences go where
- [X] sequence in the cluster - text
- [X] create modules to save sqlite3 results
- [X] add doc about web servers with downstream analysis:
** http://compbio.uthsc.edu/miR2GO/home.php
- [X] create small stats: # of sequences, # in seqs.ma, # in alignment, # in clusters
- [X] counts of aligned reads
- [X] load matrix ma after alignment parsed
- [X] NOW html to connect to sqlite3 db to viz results

- [X] implement tRNA-scan
- [ ] NOW implement small rna prediction: CoRaL is an option for everything http://wanglab.pcbi.upenn.edu/coral/  or
- [X] change to python alignment tool
- [-] implement mirna prediction http://evryrna.ibisc.univ-evry.fr/miRBoost/index.html (move inside bcbio)
- [ ] implement submire target prediction module http://research.nhgri.nih.gov/software/SubmiRine/user_guide.shtml
- [X] Add to report: https://github.com/pkerpedjiev/forna/

# modules

* QC: 
  - [ ] figures 5' positions by # unique sequences, this by class: mirna, trna ...
* report:
  - [ ] add (R file) mirna path enrichment to report
  - [X] limit number of sequences in html, like 1000 top


# miraligner

Future implementation at http://mirtop.github.io

inputs will be fastq or bam file. Will need to download the mirbase annotation (and hairpin.fa) if not given (need version). Species is required (for now).

- [X] create data set to test module
- [X] align with razer3 if option on
- [X] fix alignement error: get correct positions and guess additions
- [X] decide best hits
- [X] compare to mirbase ref: t5 t3 add mism
- [ ] ouput in SAM/BAM with custom flag for isomiRs
- [X] make it compatible with old miraligner output and isomiRs


# downstream

- [ ] isomirViz: html code to browse miraligner results for a project: preparation with miraligner module
- [ ] seqclusteR: will be an object that make easier to retrieve information like counts, rownames, annotations ... tidy tables, and figures

# documentation

- [X] add install subcommand for mirbase installation/genomes
- [X] add html report link to documentation
- [X] figure showing general pipeline steps: bcbio-nextgen
- [X] add documentation for seqclusterViz html reader