.. _split_fasta_on_sample_ids:

.. index:: split_fasta_on_sample_ids.py

*split_fasta_on_sample_ids.py* -- Split a single post-split_libraries.py fasta file into per-sample fasta files.
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

**Description:**

Split a single post-`split_libraries.py <./split_libraries.html>`_ fasta file into per-sample fasta files. This script requires that the sequence identifiers are in post-`split_libraries.py <./split_libraries.html>`_ format (i.e., SampleID_SeqID). A fasta file will be created for each unique SampleID.
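
The grouping described above can be sketched in a few lines of Python. This is an illustration only, not the script's actual implementation: the function names are hypothetical, and it assumes that the SampleID is everything before the last underscore in the first whitespace-delimited field of the sequence label.

::

	from collections import defaultdict


	def sample_id_from_label(label):
	    """Return the SampleID portion of a 'SampleID_SeqID' sequence label."""
	    first_token = label.split()[0]        # drop any trailing comment fields
	    return first_token.rsplit("_", 1)[0]  # assumption: split on the last '_'


	def group_records_by_sample(fasta_lines):
	    """Map each SampleID to its list of (label, sequence) records."""
	    records_by_sample = defaultdict(list)
	    label, chunks = None, []
	    for line in fasta_lines:
	        line = line.strip()
	        if line.startswith(">"):
	            # A new header closes the previous record, if there was one.
	            if label is not None:
	                sample_id = sample_id_from_label(label)
	                records_by_sample[sample_id].append((label, "".join(chunks)))
	            label, chunks = line[1:], []
	        elif line:
	            chunks.append(line)
	    # Store the final record once the input is exhausted.
	    if label is not None:
	        sample_id = sample_id_from_label(label)
	        records_by_sample[sample_id].append((label, "".join(chunks)))
	    return dict(records_by_sample)


	# Made-up input in SampleID_SeqID format.
	example = [
	    ">S1_0 read_a",
	    "ACGT",
	    ">S2_1 read_b",
	    "GGCC",
	    "TT",
	    ">S1_2 read_c",
	    "TTAA",
	]
	grouped = group_records_by_sample(example)

Here ``grouped`` holds two keys, ``S1`` and ``S2``, and each key would correspond to one per-sample fasta file.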


**Usage:** :file:`split_fasta_on_sample_ids.py [options]`

**Input Arguments:**

.. note::

	
	**[REQUIRED]**
		
	-i, `-`-input_fasta_fp
		The input fasta file to split
	-o, `-`-output_dir
		The output directory [default: None]
	
	**[OPTIONAL]**
		
	`-`-buffer_size
		The number of sequences to read into memory before writing to file (you usually won't need to change this) [default: 500]
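
The effect of a buffer size can be illustrated with a short Python sketch. It is not the script's own code: the function names and the ``<SampleID>.fasta`` output naming are assumptions made for the example. Records are collected in memory and appended to the per-sample files each time the buffer fills up.

::

	import os
	import tempfile
	from collections import defaultdict


	def flush_buffer(buffer, output_dir):
	    """Append buffered records to one file per sample, then empty the buffer."""
	    for sample_id, records in buffer.items():
	        # Hypothetical naming scheme: <output_dir>/<SampleID>.fasta
	        path = os.path.join(output_dir, sample_id + ".fasta")
	        with open(path, "a") as out:
	            for label, seq in records:
	                out.write(">%s\n%s\n" % (label, seq))
	    buffer.clear()


	def write_per_sample(records, output_dir, buffer_size=500):
	    """Write (sample_id, label, seq) records, flushing every buffer_size records."""
	    buffer = defaultdict(list)
	    buffered = 0
	    for sample_id, label, seq in records:
	        buffer[sample_id].append((label, seq))
	        buffered += 1
	        if buffered >= buffer_size:
	            flush_buffer(buffer, output_dir)
	            buffered = 0
	    flush_buffer(buffer, output_dir)  # write whatever is left over


	# Made-up records; a tiny buffer forces a flush partway through.
	demo_records = [
	    ("S1", "S1_0", "ACGT"),
	    ("S2", "S2_1", "GGCC"),
	    ("S1", "S1_2", "TTAA"),
	]
	demo_dir = tempfile.mkdtemp()
	write_per_sample(demo_records, demo_dir, buffer_size=2)

A larger buffer means fewer file openings at the cost of more memory. Because the files are opened in append mode, the resulting files are the same whatever buffer size is used.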


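**Output:**

One fasta file is written to the output directory for each unique SampleID found in the input file.


**Example:**

Split seqs.fna into one fasta file per sample and store the resulting fasta files in 'out':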

::

	split_fasta_on_sample_ids.py -i seqs.fna -o out/