File: TOPP_IDFileConverter.html

package info (click to toggle)
openms 1.11.1-5
  • links: PTS, VCS
  • area: main
  • in suites: jessie, jessie-kfreebsd
  • size: 436,688 kB
  • ctags: 150,907
  • sloc: cpp: 387,126; xml: 71,547; python: 7,764; ansic: 2,626; php: 2,499; sql: 737; ruby: 342; sh: 325; makefile: 128
file content (111 lines) | stat: -rw-r--r-- 12,237 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
<HTML>
<HEAD>
<TITLE>IDFileConverter</TITLE>
<LINK HREF="doxygen.css" REL="stylesheet" TYPE="text/css">
<LINK HREF="style_ini.css" REL="stylesheet" TYPE="text/css">
</HEAD>
<BODY BGCOLOR="#FFFFFF">
<A href="index.html">Home</A> &nbsp;&middot;
<A href="classes.html">Classes</A> &nbsp;&middot;
<A href="annotated.html">Annotated Classes</A> &nbsp;&middot;
<A href="modules.html">Modules</A> &nbsp;&middot;
<A href="functions_func.html">Members</A> &nbsp;&middot;
<A href="namespaces.html">Namespaces</A> &nbsp;&middot;
<A href="pages.html">Related Pages</A>
<HR style="height:1px; border:none; border-top:1px solid #c0c0c0;">
<!-- Generated by Doxygen 1.8.5 -->
</div><!-- top -->
<div class="header">
  <div class="headertitle">
<div class="title">IDFileConverter </div>  </div>
</div><!--header-->
<div class="contents">
<div class="textblock"><p>Converts identification engine file formats.</p>
<center> <table class="doxtable">
<tr>
<td align="center" bgcolor="#EBEBEB">potential predecessor tools  </td><td valign="middle" rowspan="3"><img class="formulaInl" alt="$ \longrightarrow $" src="form_91.png"/> IDFileConverter <img class="formulaInl" alt="$ \longrightarrow $" src="form_91.png"/> </td><td align="center" bgcolor="#EBEBEB">potential successor tools   </td></tr>
<tr>
<td valign="middle" align="center" rowspan="1">TPP tools: PeptideProphet, ProteinProphet  </td><td valign="middle" align="center" rowspan="2">TPP tools: ProteinProphet<br/>
(for conversion from idXML to pepXML)   </td></tr>
<tr>
<td valign="middle" align="center" rowspan="1">Sequest protein identification engine   </td></tr>
</table>
</center><p>Conversion from the TPP file formats pepXML and protXML to <a class="el" href="namespaceOpenMS.html" title="Main OpenMS namespace. ">OpenMS</a>' idXML is quite comprehensive, to the extent that the original data can be represented in the simpler idXML format.</p>
<p>In contrast, support for converting from idXML to pepXML is limited. The purpose here is simply to create pepXML files containing the relevant information for the use of ProteinProphet.</p>
<p><b>Details on additional parameters:</b></p>
<p><code>mz_file:</code> <br/>
Some search engine output files (like pepXML, mascotXML, Sequest .out files) may not contain retention times, only scan numbers. To be able to look up the actual RT values, the raw file has to be provided using the parameter <code>mz_file</code>. (If the identification results should be used later to annotate feature maps or consensus maps, it is critical that they contain RT values. See also <a class="el" href="TOPP_IDMapper.html">IDMapper</a>.)</p>
<p><code>mz_name:</code> <br/>
PepXML files can contain results from multiple experiments. However, the idXML format does not support this. The <code>mz_name</code> parameter (or <code>mz_file</code>, if given) thus serves to define what parts to extract from the pepXML.</p>
<p><code>scan_regex:</code> <br/>
For Mascot results exported to XML, the scan numbers (used to look up retention times using <code>mz_file</code>) should be given in the "pep_scan_title" XML elements, but the format can vary. If the defaults fail to extract the scan numbers, a Perl-style regular expression can be given through the advanced parameter <code>scan_regex</code>, and will be used instead. The regular expression should contain a named group "SCAN" matching the scan number or "RT" matching the actual retention time. For example, if the format of the "pep_scan_title" elements is "scan=123", where 123 is the scan number, the expression "scan=(?&lt;SCAN&gt;\\d+)" can be used to extract the number. (However, the format in this example is actually covered by the defaults.)</p>
<p>Some information about the supported input types: <a class="el" href="classOpenMS_1_1MzIdentMLFile.html">mzIdentML</a> <a class="el" href="classOpenMS_1_1PepXMLFile.html">pepXML</a> <a class="el" href="classOpenMS_1_1ProtXMLFile.html">protXML</a> <a class="el" href="classOpenMS_1_1IdXMLFile.html">idXML</a> <a class="el" href="classOpenMS_1_1MascotXMLFile.html">mascotXML</a> <a class="el" href="classOpenMS_1_1OMSSAXMLFile.html">omssaXML</a> <a class="el" href="classOpenMS_1_1SequestOutfile.html">Sequest .out directory</a></p>
<p><b>The command line parameters of this tool are:</b> </p>
<pre class="fragment">
IDFileConverter -- Converts identification engine file formats.
Version: 1.11.1 Nov 14 2013, 11:18:15, Revision: 11976

Usage:
  IDFileConverter &lt;options&gt;

Options (mandatory options marked with '*'):
  -in &lt;path/file&gt;*      Input file or directory containing the output of the search engine.
                        Sequest: Directory containing the .out files
                        pepXML: Single pepXML file.
                        protXML: Single protXML file.
                        mascotXML: Single Mascot XML file.
                        omssaXML: Single OMSSA XML file.
                        idXML: Single idXML file.
                        (valid formats: 'pepXML', 'protXML', 'mascotXML', 'omssaXML', 'idXML')
  -out &lt;file&gt;*          Output file (valid formats: 'idXML', 'mzid', 'pepXML', 'FASTA')
  -out_type &lt;type&gt;      Output file type -- default: determined from file extension or content
                        (valid: 'idXML', 'mzid', 'pepXML', 'FASTA')
                        
  -mz_file &lt;file&gt;       [Sequest, pepXML, mascotXML only] Retention times will be looked up in this file (val
                        id formats: 'mzML', 'mzXML', 'mzData')
                        
  -mz_name &lt;file&gt;       [pepXML only] Experiment filename/path (extension will be removed) to match in the 
                        pepXML file ('base_name' attribute). Only necessary if different from 'mz_file'.
  -use_precursor_data   [pepXML only] Use precursor RTs (and m/z values) from 'mz_file' for the generated 
                        peptide identifications, instead of the RTs of MS2 spectra.
                        
Common TOPP options:
  -ini &lt;file&gt;           Use the given TOPP INI file
  -threads &lt;n&gt;          Sets the number of threads allowed to be used by the TOPP tool (default: '1')
  -write_ini &lt;file&gt;     Writes the default configuration file
  --help                Shows options
  --helphelp            Shows all options (including advanced)

</pre><p> <b>INI file documentation of this tool:</b> <div class="ini_global">
<div class="legend">
<b>Legend:</b><br>
 <div class="item item_required">required parameter</div>
 <div class="item item_advanced">advanced parameter</div>
</div>
  <div class="node"><span class="node_name">+IDFileConverter</span><span class="node_description">Converts identification engine file formats.</span></div>
    <div class="item item_advanced"><span class="item_name" style="padding-left:16px;">version</span><span class="item_value">1.11.1</span>
<span class="item_description">Version of the tool that generated this parameters file.</span><span class="item_tags"></span><span class="item_restrictions"> </span></div>    <div class="node"><span class="node_name">++1</span><span class="node_description">Instance '1' section for 'IDFileConverter'</span></div>
      <div class="item"><span class="item_name item_required" style="padding-left:24px;">in</span><span class="item_value"></span>
<span class="item_description">Input file or directory containing the output of the search engine.<br>Sequest: Directory containing the .out files<br>pepXML: Single pepXML file.<br>protXML: Single protXML file.<br>mascotXML: Single Mascot XML file.<br>omssaXML: Single OMSSA XML file.<br>idXML: Single idXML file.<br></span><span class="item_tags">input file</span><span class="item_restrictions">*.pepXML,*.protXML,*.mascotXML,*.omssaXML,*.idXML</span></div>      <div class="item"><span class="item_name item_required" style="padding-left:24px;">out</span><span class="item_value"></span>
<span class="item_description">Output file</span><span class="item_tags">output file</span><span class="item_restrictions">*.idXML,*.mzid,*.pepXML,*.FASTA</span></div>      <div class="item"><span class="item_name" style="padding-left:24px;">out_type</span><span class="item_value"></span>
<span class="item_description">output file type -- default: determined from file extension or content<br></span><span class="item_tags"></span><span class="item_restrictions">idXML,mzid,pepXML,FASTA</span></div>      <div class="item"><span class="item_name" style="padding-left:24px;">mz_file</span><span class="item_value"></span>
<span class="item_description">[Sequest, pepXML, mascotXML only] Retention times will be looked up in this file</span><span class="item_tags">input file</span><span class="item_restrictions">*.mzML,*.mzXML,*.mzData</span></div>      <div class="item item_advanced"><span class="item_name" style="padding-left:24px;">ignore_proteins_per_peptide</span><span class="item_value">false</span>
<span class="item_description">[Sequest only] Workaround to deal with .out files that contain e.g. "+1" in references column,<br>but do not list extra references in subsequent lines (try -debug 3 or 4)</span><span class="item_tags"></span><span class="item_restrictions">true,false</span></div>      <div class="item"><span class="item_name" style="padding-left:24px;">mz_name</span><span class="item_value"></span>
<span class="item_description">[pepXML only] Experiment filename/path (extension will be removed) to match in the pepXML file ('base_name' attribute). Only necessary if different from 'mz_file'.</span><span class="item_tags"></span><span class="item_restrictions"> </span></div>      <div class="item"><span class="item_name" style="padding-left:24px;">use_precursor_data</span><span class="item_value">false</span>
<span class="item_description">[pepXML only] Use precursor RTs (and m/z values) from 'mz_file' for the generated peptide identifications, instead of the RTs of MS2 spectra.</span><span class="item_tags"></span><span class="item_restrictions">true,false</span></div>      <div class="item item_advanced"><span class="item_name" style="padding-left:24px;">scan_regex</span><span class="item_value"></span>
<span class="item_description">[mascotXML only] Regular expression used to extract the scan number or retention time. See documentation for details.</span><span class="item_tags"></span><span class="item_restrictions"> </span></div>      <div class="item item_advanced"><span class="item_name" style="padding-left:24px;">log</span><span class="item_value"></span>
<span class="item_description">Name of log file (created only when specified)</span><span class="item_tags"></span><span class="item_restrictions"> </span></div>      <div class="item item_advanced"><span class="item_name" style="padding-left:24px;">debug</span><span class="item_value">0</span>
<span class="item_description">Sets the debug level</span><span class="item_tags"></span><span class="item_restrictions"> </span></div>      <div class="item"><span class="item_name" style="padding-left:24px;">threads</span><span class="item_value">1</span>
<span class="item_description">Sets the number of threads allowed to be used by the TOPP tool</span><span class="item_tags"></span><span class="item_restrictions"> </span></div>      <div class="item item_advanced"><span class="item_name" style="padding-left:24px;">no_progress</span><span class="item_value">false</span>
<span class="item_description">Disables progress logging to command line</span><span class="item_tags"></span><span class="item_restrictions">true,false</span></div>      <div class="item item_advanced"><span class="item_name" style="padding-left:24px;">test</span><span class="item_value">false</span>
<span class="item_description">Enables the test mode (needed for internal use only)</span><span class="item_tags"></span><span class="item_restrictions">true,false</span></div></div>
 </div></div><!-- contents -->
<HR style="height:1px; border:none; border-top:1px solid #c0c0c0;">
<TABLE width="100%" border="0">
<TR>
<TD><font color="#c0c0c0">OpenMS / TOPP release 1.11.1</font></TD>
<TD align="right"><font color="#c0c0c0">Documentation generated on Thu Nov 14 2013 11:19:24 using doxygen 1.8.5</font></TD>
</TR>
</TABLE>
</BODY>
</HTML>