File: split_fasta_on_sample_ids.rst

package info (click to toggle)

qiime 1.4.0-2

links: PTS, VCS
area: main
in suites: wheezy
size: 29,704 kB
sloc: python: 77,837; haskell: 379; sh: 113; makefile: 103

file content (44 lines) | stat: -rw-r--r-- 1,354 bytes

.. _split_fasta_on_sample_ids:

.. index:: split_fasta_on_sample_ids.py

*split_fasta_on_sample_ids.py* -- Split a single post-split_libraries.py fasta file into per-sample fasta files.
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

**Description:**

Split a single post-`split_libraries.py <./split_libraries.html>`_ fasta file into per-sample fasta files. This script requires that the sequences identitifers are in post-`split_libraries.py <./split_libraries.html>`_ format (i.e., SampleID_SeqID). A fasta file will be created for each unique SampleID.


**Usage:** :file:`split_fasta_on_sample_ids.py [options]`

**Input Arguments:**

.. note::

	
	**[REQUIRED]**
		
	-i, `-`-input_fasta_fp
		The input fasta file to split
	-o, `-`-output_dir
		The output directory [default: None]
	
	**[OPTIONAL]**
		
	`-`-buffer_size
		The number of sequences to read into memory before writing to file (you usually won't need to change this) [default: 500]


**Output:**




Split seqs.fna into one fasta file per sample and store the resulting fasta files in 'out'

::

	split_fasta_on_sample_ids.py -i seqs.fna -o out/