File: converter_cmd.html

package info (click to toggle)
lamarc 2.1.10.1%2Bdfsg-3
  • links: PTS, VCS
  • area: main
  • in suites: buster
  • size: 77,052 kB
  • sloc: cpp: 112,339; xml: 16,769; sh: 3,528; makefile: 1,219; python: 420; perl: 260; ansic: 40
file content (564 lines) | stat: -rw-r--r-- 25,775 bytes parent folder | download | duplicates (3)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
<!-- header fragment for html documentation -->
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<HTML>
<HEAD>

<META NAME="description" CONTENT="Estimation of population parameters using genetic data usi
ng a maximum likelihood approach with Metropolis-Hastings Monte Carlo Markov chain importanc
e sampling">
<META NAME="keywords" CONTENT="MCMC, Markov chain, Monte Carlo, Metropolis-Hastings, populat
ion, parameters, migration rate, population size, recombination rate, maximum likelihood">

<TITLE>LAMARC Documentation: Data file conversion</title>
</HEAD>


<BODY BGCOLOR="#FFFFFF">
<!-- coalescent, coalescence, Markov chain Monte Carlo simulation, migration rate, effective
 population size, recombination rate, maximum likelihood -->
<P>(<A HREF="panels.html">Back</A> | <A HREF="index.html">Contents</A>
| <A HREF="xmlinput.html">Next</A>)</P>
<H2> <A NAME="command-file">Converter Command File Reference</A></H2>
<UL>
<LI><A HREF="converter_cmd.html#intro">Converter Command File Introduction</A></LI>
<LI> <A HREF="converter_cmd.html#cmd_overview">Command File Overview</a></LI>
<LI> <A HREF="converter_cmd.html#traits">Traits</a></LI>
<LI> <A HREF="converter_cmd.html#inherit">Tags Specifying Inheritance and Mutation Models</a></LI>
<LI> <A HREF="converter_cmd.html#regions">Regions</a></LI>
<LI> <A HREF="converter_cmd.html#segments">Segments</a></LI>
<LI> <A HREF="converter_cmd.html#populations">Populations</a></LI>
<LI> <A HREF="converter_cmd.html#panels">Panels</a></LI>
<LI> <A HREF="converter_cmd.html#infiles">Data files</a></LI>
<LI> <A HREF="converter_cmd.html#outfile">Specifying the Name of the Produced Lamarc file</a></LI>
<LI> <A HREF="converter_cmd.html#comment">Miscellaneous Tags</a></LI>
<LI> <A HREF="converter_cmd.html#phase">Specifying Relationships Between Individuals and Data Samples</a></LI>
</UL>

<h3><A NAME="intro">Converter Command File Introduction</A></h3>

<P>The converter command file is an 
<A HREF="xmlinput.html#overview">XML-format</a> text file
which can be used to bypass the <a href="converter.html">converter</a>
GUI interface and
directly provide information to the converter.
</p>

<h4>When to use a Converter Command File</h4>

<P>
For most LAMARC users, running the lamarc file converter in GUI mode
will be the quickest and most intuitive way to convert data files
for use in LAMARC. However, there are a few situations in which 
it may be necessary to write a converter command file.
These situations include:
</P>
<ul>
<li>automating conversion for use in simulation studies,</li>
<li>using a new converter feature for which there is not yet
a GUI interface, and</li>
<li>reading in information that is tedious and error prone to
enter by hand (such as 
<a href="genetic_map.html#segment-coord">locations</a> for SNP data).</li>
</ul>
<P>
If a command file is needed to access a particular feature,
it can be read into the converter either in batch mode or
from the GUI.
</P>

<h4>An Example Converter Command File</h4>

<p>
An example converter command file with matching MIGRATE
data files is provided in the <A
HREF="batch_converter/">batch_converter/</a> directory.  The file <A
HREF="batch_converter/sample-conv-cmd.html">sample-conv-cmd.xml</a> (actual xml is is <A HREF="batch_converter/sample-conv-cmd.xml">here</a>)
 annotated with comments, and should be a good guide to what's going on.
</p>

<h4>How to Create a Converter Command File</h4>

<p>
The simplest way to create your own file is probably a combination of:
</P>
<ul>
<li>copying the
<a href="batch_converter/sample-conv-cmd.xml">provided example</a>,</li>
<li>preparing an example in the GUI and then using the
<tt>File &gt; Write Batch Command File</tt> menu command, and </li>
<li>editing a final version based on the above two items.
</ul>

<P>
The rest of this section is provided as a reference should copying 
from examples is not sufficient for your needs.
</P>

<h4>How to Use a Converter Command File</h4>

<P>
You can use your converter command file by:
</P>
<ul>
<li>Reading it in from the GUI with the 
    <tt>&quot;File &gt;Read Command File&quot;</tt> menu item</li>
<li>Providing it using the <tt>-c</tt> command line
argument to the converter in either
<a href="converter.html#gui-mode">GUI</a>
or <a href="converter.html#batch-mode">batch</a> mode.</li>
</ul>

<H3 style="page-break-before: always"><A NAME="cmd_overview">Command File Overview</A></H3>

<P>The top level tag of the file is a
<b><tt>&lt;lamarc-converter-cmd&gt;</tt></b> tag. 
Its possible immediate children are listed in the table below.
Note that none of these child tags are required. This is because,
generally speaking, fragments of complete converter command files are
allowed to be read in from the GUI.
</P>

<table border=1 >
<tr><th colspan=4>Top Level Tags in Lamarc Converter Command File</th></tr>
<tr><th>parent tag</th><th>child tag</th><th>child required</th><th>child instances allowed</th></tr>
<tr><td rowspan=8>&lt;lamarc-converter-cmd&gt;</td>
        <td><a href="#traits">&lt;traits&gt;</a></td>
        <td>optional</td><td>SINGLE</td></tr>
<tr><td><a href="#regions">&lt;regions&gt;</a></td>
        <td>optional</td><td>SINGLE</td></tr>
<tr><td><a href="#populations">&lt;populations&gt;</a></td>
        <td>optional</td><td>SINGLE</td></tr>
<tr><td><a href="#phase">&lt;individuals&gt;</a></td>
        <td>optional</td><td>SINGLE</td></tr>
<tr><td><a href="#panels">&lt;panels&gt;</a></td>
        <td>optional</td><td>SINGLE</td></tr>
<tr><td><a href="#infiles">&lt;infiles&gt;</a></td>
        <td>optional</td><td>SINGLE</td></tr>
<tr><td><a href="#outfile">&lt;outfile&gt;</a></td>
        <td>optional</td><td>SINGLE</td></tr>
<tr><td><a href="#comment">&lt;lamarc-header-comment&gt;</a></td>
        <td>optional</td><td>SINGLE</td></tr>
</table>

<H3> <A NAME="traits">Traits</A></H3>

<p>The <tt>&lt;traits&gt;</tt> tag is used only when <a href="mapping.html">trait mapping</a>.
If you are not mapping traits, you may skip ahead to the <a href="#regions">regions</a> section.
</p>

<p>
The &lt;traits&gt; tag contains definitions of one or more
of the following objects.
</p>
<ul>
<li>&lt;trait-info&gt;, used to specify a trait name and associated alleles, and</li>
<li>&lt;phenotype&gt; definitions, used to specify a model for an observed
trait manifestation.</li>
</ul>
Below is 
<a href="#table-trait">a table discribing the relevant XML tags</a>.
You can also find an
<a href="mapping.html#trait-info-defs">examples trait-info definition</a>
and
<a href="mapping.html#phenotype-defs">examples of phenotype definitions</a>
in the <a href="mapping.html">section on trait mapping</a>.</p>

<h4 style="page-break-before: always"><a name="table-trait">Table of Sub-Tags of &lt;traits&gt;</a></h4>
<table border=1 >
<tr><th colspan=4>Tags Describing Traits in Lamarc Converter Command File</th></tr>
<tr><th>parent tag</th><th>child tag</th><th>child required</th><th>child instances allowed</th></tr>
<tr>
    <td rowspan=2>&lt;traits&gt;</td>
            <td>&lt;trait-info&gt;</td>
                <td>optional</td><td>multiple</td></tr>
        <tr><td>&lt;phenotype&gt;</td>
                <td>optional</td><td>multiple</td></tr>
<tr>
    <td rowspan=2>&lt;trait-info&gt;</td>
            <td>&lt;name&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td>
        <tr><td>&lt;allele&gt;</td>
                <td>REQUIRED</td><td>multiple</td></tr>
<tr>
    <td rowspan=2>&lt;phenotype&gt;</td>
            <td>&lt;name&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td>
        <tr><td>&lt;genotype-resolutions&gt;</td>
                <td>REQUIRED</td><td>multiple</td></tr>
<tr>
    <td rowspan=2>&lt;genotype-resolutions&gt;</td>
            <td>&lt;trait-name&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td></tr>
        <tr><td>&lt;haplotypes&gt;</td>
                <td>REQUIRED</td><td>multiple</td></tr>
<tr>
    <td rowspan=2>&lt;haplotypes&gt;</td>
            <td>&lt;alleles&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td></tr>
        <tr><td>&lt;penetrance&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td></tr>
<tr> <th>tag</th><th colspan=3>contents</th></tr>
<tr><td >&lt;allele&gt;</td>
        <td colspan=3><em>unique name; should not contain spaces</em></td></tr>
<tr><td >&lt;alleles&gt;</td>
        <td colspan=3><em>ordered list of names (from &lt;allele&gt; tags of corresponding trait), separated by spaces</em></td></tr>
<tr><td >&lt;penetrance&gt;</td>
        <td colspan=3><em>value between 0 and 1; indicates the chance that an individual with these specific alleles will display the enclosing trait</em></td></tr>
<tr><td >&lt;name&gt;</td>
        <td colspan=3><em>unique name; should not contain spaces</em></td></tr>
<tr><td >&lt;trait-name&gt;</td>
        <td colspan=3><em>unique name; should not contain spaces</em></td></tr>
</table>

<H3> <A NAME="inherit">Tags Specifying Inheritance and Mutation Models:
&lt;regions&gt; and &lt;segments&gt;</A></H3>


<p>
In section <a href="genetic_map.html">
Modeling Linkage Properties and Relative Mutation Rates of Your Data</a>
of the documentation
</p>


<H3> <A NAME="regions">Regions</A></H3>

<table style="page-break-before: always" border=1 >
<tr><th colspan=4>Specifying Inheritance Relationships</th></tr>
<tr><th>parent tag</th><th>child tag</th><th>child required</th><th>child instances allowed</th></tr>
<tr>
    <td>&lt;regions&gt;</td>
            <td>&lt;region&gt;</td><td>REQUIRED</td><td>multiple</td></tr>
<tr>
    <td rowspan=4>&lt;region&gt;</td>
            <td>&lt;name&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td></tr>
        <tr><td>&lt;effective-popsize&gt;</td>
                <td>optional</td><td>SINGLE</td></tr>
        <tr><td><a href="#segments">&lt;segments&gt;</a></td>
                <td>optional</td><td>SINGLE</td></tr>
        <tr><td>&lt;trait-location&gt;</td>
                <td>optional</td><td>multiple</td></tr>
<tr><td>&lt;trait-location&gt;</td><td>&lt;trait-name&gt;</td>
                <td>REQUIRED for mapping<br>optional for others</td><td>SINGLE</td></tr>
<tr> <th>tag</th><th colspan=3>contents</th></tr>
<tr><td >&lt;effective-popsize&gt;</td>
        <td colspan=3><em>value greater than 0; defaults to 1;
            the relative <a href="glossary.html#effpopsize">effective population size
            of samples from this region.</em></td></tr>
<tr><td >&lt;trait-name&gt;</td>
        <td colspan=3><em>unique name; should not contain spaces</em></td></tr>
</table>

<H3> <A NAME="segments">Segments</A></H3>


<table style="page-break-before: always" border=1 >
<tr><th colspan=4>Specifying Properties of Data Samples</th></tr>
<tr><th>parent tag</th><th>child tag or <em>attribute</em></th><th>child required</th><th>child instances allowed</th></tr>
<tr><td>&lt;segments&gt;</td><td>&lt;segment&gt;</td>
                <td>REQUIRED</td><td>multiple</td></tr>
<tr>
    <td rowspan=9>&lt;segment&gt;</td>
            <td><em>datatype</em></td>
                <td>REQUIRED</td><td><em>-</em></td></tr>
        <tr><td><em>marker-proximity</em></td>
                <td>optional</td><td><em>-</em></td></tr>
        <tr><td>&lt;name&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td></tr>
        <tr><td>&lt;markers&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td></tr>
        <tr><td>&lt;map-position&gt;</td>
                <td>optional</td><td>SINGLE</td></tr>
        <tr><td>&lt;length&gt;</td>
                <td>optional</td><td>SINGLE</td></tr>
        <tr><td>&lt;locations&gt;</td>
                <td>optional</td><td>SINGLE</td></tr>
        <tr><td>&lt;first-position-scanned&gt;</td>
                <td>optional</td><td>SINGLE</td></tr>
        <tr><td>&lt;unresolved-markers&gt;</td>
                <td>optional</td><td>SINGLE</td></tr>
<tr> <th>tag</th><th colspan=3>contents</th></tr>
<tr><td >&lt;markers&gt;</td>
        <td colspan=3><em>number of sites with data; for dna this is 
            the number of sites sequenced; for snp data it is the number 
            of snps; for kallele and microsat data it is the number
            of distinct sites at which kallele/msat data was collected.</em></td></tr>
<tr><td >&lt;map-position&gt;</td>
        <td colspan=3><em>location of &lt;first-position-scanned&gt; in 
        <a href="genetic_map.html#region-coord">region-wide coordinates</a></em></td></tr>
<tr><td >&lt;length&gt;</td>
        <td colspan=3><em>total number of bases searched for data</em></td></tr>
<tr><td >&lt;locations&gt;</td>
        <td colspan=3><em>the location of each particular data site of
        your data in <a href="genetic_map.html#segment-coord">segment coordinates</a></em></td></tr>
<tr><td >&lt;first-position-scanned&gt;</td>
        <td colspan=3><em>the location of the first sampled location in
        your data in <a href="genetic_map.html#segment-coord">segment coordinates</a></em></td></tr>
<tr><th>attribute</th><th>value</th><th colspan=2>meaning</th></tr>
<tr><td rowspan=4><em>datatype</em></td>
        <td>dna</td><td colspan=2>DNA data</td></tr>
    <tr><td>snp</td><td colspan=2>SNP data</td></tr>
    <tr><td>kallele</td><td colspan=2>k-allele data</td></tr>
    <tr><td>microsat</td><td colspan=2>microsatellite data</td></tr>
<tr><td rowspan=2><em>marker-proximity</em></td>
        <td>linked</td><td colspan=2>individual data markers likely to be inherited together</td></tr>
    <tr><td>unlinked</td><td colspan=2>individual data markers are independently inherited</td></tr>
</table>

<P><H3> <A NAME="populations">Populations</A></H3>

<P>
The <tt>&lt;populations&gt;</tt> tag is used to name distinct
<a href="glossary.html#population">populations</a>.
If your data files have named populations, the population names here
should match the names that are in your files.</P>

<table border=1 >
<tr><th colspan=4>Specifying population names with the &lt;populations&gt; tag</th></tr>
<tr><th>parent tag</th><th>child tag</th><th>child required</th><th>child instances allowed</th></tr>
<tr>
    <td>&lt;populations&gt;</td>
        <td>&lt;population&gt;</td>
        <td>Y</td><td>Y</td>
        </tr>
<tr> <th>tag</th><th colspan=3>contents</th></tr>
    <tr><td>&lt;population&gt;</td>
        <td colspan=3><em>a name unique among all populations, regions, and segments</em></td></tr>
</table>



<h3 style="page-break-before: always"> <A NAME="infiles">Data files</A></H3>

<P>The <tt>&lt;infiles</tt>&gt; tag will tell the converter where to find
your data, and how to associate each file with the previously-defined
regions, segments, and populations.
</p>

<table border=1>
<tr><th colspan=4>Tags Describing Input Files in Lamarc Converter Command File</th></tr>
<tr><th>parent tag</th><th>child tag or <em>attribute</em></th><th>child required</th><th>child instances allowed</th></tr>
<tr><td>&lt;infiles&gt;</td><td>&lt;infile&gt;</td>
                <td>REQUIRED</td><td>multiple</td></tr>
<tr>
    <td rowspan=7>&lt;infile&gt;</td>
            <td><em>datatype</em></td>
                <td>REQUIRED</td><td><em>-</em></td></tr>
        <tr><td><em>format</em></td>
                <td>optional</td><td><em>-</em></td></tr>
        <tr><td><em>sequence-alignment</em></td>
                <td>optional</td><td><em>-</em></td></tr>
        <tr><td>&lt;name&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td></tr>
        <tr><td>&lt;segments-matching&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td></tr>
        <tr><td>&lt;pop-matching&gt;</td>
                <td>optional</td><td>SINGLE</td></tr>
        <tr><td>&lt;individuals-from-samples&gt;</td>
                <td>optional</td><td>SINGLE</td></tr>
    <td>&lt;individuals-from-samples&gt;</td>
            <td><em>type</em></td>
                <td>REQUIRED</td><td><em>-</em></td></tr>
    <td rowspan=2>&lt;population-matching&gt;</td>
            <td><em>type</em></td>
                <td>REQUIRED</td><td><em>-</em></td></tr>
        <tr><td>&lt;population-name&gt;</td>
                <td>depends on value of <em>type</em> attribute</td><td>multiple</td></tr>
    <td rowspan=2>&lt;segments-matching&gt;</td>
            <td><em>type</em></td>
                <td>REQUIRED</td><td><em>-</em></td></tr>
        <tr><td>&lt;segment-name&gt;</td>
                <td>depends on value of <em>type</em> attribute</td><td>multiple</td></tr>
<tr> <th>tag</th><th colspan=3>contents</th></tr>
<tr><td>&lt;individuals-from-samples&gt;</td>
    <td colspan=3>the number of adjacent samples to bundle into a single individual</td></tr>
<tr><th>attribute</th><th>value</th><th colspan=2>meaning</th></tr>
    <tr><td rowspan=4><em>datatype</em></td>
        <td>dna</td><td colspan=2>DNA data</td></tr>
    <tr><td>snp</td><td colspan=2>SNP data</td></tr>
    <tr><td>kallele</td><td colspan=2>k-allele data</td></tr>
    <tr><td>microsat</td><td colspan=2>microsatellite data</td></tr>
    <tr><td rowspan=2><em>format</em></td>
        <td>migrate</td><td colspan=2>input file is a migrate file</td></tr>
    <tr><td>phylip</td><td colspan=2>input file is a phylip file</td></tr>
    <tr><td rowspan=2><em>sequence-alignment</em></td>
        <td>interleaved</td><td colspan=2>the first line of each sequence appears, followed 
        by all second lines, then all third lines, etc.</td></tr>
    <tr><td>sequential</td><td colspan=2>each entire sequence appears
        in the file before the next one starts.</td></tr>
    <tr><td><em>type</em> for &lt;individuals-from-samples&gt;</td>
        <td>byAdjacency</td>
        <td colspan=2>bundle adjacent samples into individuals</td></tr>
    <tr><td rowspan=3><em>type</em> for &lt;population-matching&gt;</td>
        <td>byList</td><td colspan=2>
            Each population referred to in the file is to be
            assigned to a particular population defined in this file.  If this type
            is used, sub-tags of the type <tt>&lt;population-name</tt>&gt; should be
            used to define those populations (each should have a name that matches a
            population defined in the <tt>&lt;populations</tt>&gt; tag, above).
            </td></tr>
        <tr><td>byName</td><td colspan=2>
            The file itself contains information about what
            populations the data refers to.  These names must match the names given
            in the 'population' tag, above.
            </td></tr>
        <tr><td>single</td><td colspan=2>
            All individuals in the file are to be assigned to a
            single population.  That population must then be defined by a
            <tt>&lt;population-name</tt>&gt; subtag.
            </td></tr>
    <tr><td rowspan=2><em>type</em> for &lt;segments-matching&gt;</td>
        <td>byList</td><td colspan=2>
            Each segment referred to in the file is to be
            assigned to a particular segment defined in this file.  If this type
            is used, sub-tags of the type <tt>&lt;segment-name</tt>&gt; should be
            used to define those segment (each should have a name that matches a
            defined segment).
            </td></tr>
        <tr><td>single</td><td colspan=2>
            All individuals in the file are to be assigned to a
            single segment.  That segment must then be defined by a
            <tt>&lt;segment-name</tt>&gt; subtag.
            </td></tr>
</table>



<H3> <A NAME="outfile">Specifying the Name of the Produced Lamarc file</A></H3>

<P>
<tt>&lt;outfile</tt>&gt;, where you can specify the name of the file that
you want the converter to produce,
</P>

<table border=1>
<tr><th colspan=4>Tags Describing Output Files in Lamarc Converter Command File</th></tr>
<tr><th>tag</th><th>contents</th></tr>
<tr><td>&lt;outfile&gt;</td>
    <td><em>name of outfile to produce; defaults to <tt>infile.xml</tt></em></td></tr>
</table>


<H3> <A NAME="comment">Miscellaneous Tags</A></H3>

<table border=1>
<tr><th colspan=4>Miscellaneous Tags in Lamarc Converter Command File</th></tr>
<tr><th>tag</th><th>contents</th></tr>
<tr><td>&lt;lamarc-header-comment&gt;</td>
    <td><em>text of comment to be inserted in lamarc file</em></td></tr>
</table>

<H3 style="page-break-before: always">
<A NAME="phase">Specifying Relationships Between Individuals and Data Samples</A> </H3>

<p>For most LAMARC analyses, it is not necessary to specify which
pairs (or more) of data sequences belong to the same individual. 
However, there are a few cases where it may be necessary, including:
</p>
<ul>
<li><a href="mapping.html">Trait mapping</a>, since traits are observed
    for individuals.</li>
<li>When haplotypes are incompletely resolved from individuals.</li>
<LI>When combining nucleotide data (defined by sample) and microsats
(defined by individuals).
</ul>
<p>
Assigning samples to individuals, and optionally assigning trait
phenotypes or information about haplotype resolution to them
is done with the &lt;individuals&gt; tag.
An example can be found in section
<a href="mapping.html#pheno-to-ind">Assigning Phenotypes to Individuals</a>
of the
<a href="mapping.html">Trait Mapping</a> documentation.
</p>

<table border=1>
<tr><th colspan=4>Specifying Relationships between Individuals and Sample Data in Converter Command File</th></tr>
<tr><th>parent tag</th><th>child tag</th><th>child required</th><th>child instances allowed</th></tr>
<tr><td>&lt;individuals&gt;</td><td>&lt;individual&gt;</td>
                <td>optional</td><td>multiple</td></tr>
<tr>
    <td rowspan=5>&lt;individual&gt;</td>
            <td>&lt;name&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td>
    <tr><td>&lt;sample&gt;</td>
            <td>REQUIRED</td><td>multiple</td></tr>
    <tr><td>&lt;phase&gt;</td>
            <td>optional</td><td>multiple</td></tr>
    <tr><td>&lt;has-phenotype&gt;</td>
            <td>optional</td><td>multiple</td></tr>
    <tr><td>&lt;genotype-resolutions&gt;</td>
            <td>optional</td><td>multiple</td></tr>
<tr>
    <td>&lt;sample&gt;</td>
            <td>&lt;name&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td></tr>
<tr>
    <td rowspan=2>&lt;phase&gt;</td>
            <td>&lt;segment-name&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td></tr>
            <tr><td>&lt;unresolved-markers&gt;</td>
                <td>REQUIRED</td><td>SINGLE</td></tr>
<tr> <th>tag</th><th colspan=3>contents</th></tr>
    <tr><td>&lt;name&gt;</td>
        <td colspan=3><em>a name unique among all individuals and samples</em></td></tr>
    <tr><td>&lt;has-phenotype&gt;</td>
        <td colspan=3><em>a &lt;phenotype&gt;name already defined in the
        <a href="#table-trait">&lt;traits&gt;</a> section</em></td></tr>
    <tr><td>&lt;genotype-resolution&gt;</td>
        <td colspan=3><em>an &quot;anonymous&quot; phenotype belonging to the enclosing individual only.
            See <a href="#table-trait">&lt;traits&gt; subtags table</a> for definition</em></td></tr>
    <tr><td>&lt;segment-name&gt;</td>
        <td colspan=3><em>the name of the segment to which this set of phase information applies</em></td></tr>
    <tr><td>&lt;unresolved-markers&gt;</td>
        <td colspan=3><em>sites for which data markers are unresolved for this individual and segment</em></td></tr>
</table>

<p>To see an example of the &lt;phase&gt;, &lt;segment-name&gt; and
&lt;unresolved-markers&gt; tags in use, see the file <a
HREF="batch_converter/sample-conv-cmd.html">sample-conv-cmd.xml</a> (actual xml is <a HREF="batch_converter/sample-conv-cmd.xml">here</a>)

<P>The values for the 'unresolved-markers' tag should be site labels.  The
first valid site in a segment is the value of the 'first-position-scanned'
tag for that segment, and the last valid site is determined by the length of
the segment.  If the segment does not have as many markers in it as valid
sites (as for SNP data), the values here should match the values in the
'locations' tag for the segment.  In the example file, the second segment of
the second chromosome has SNP data with markers at positions 13, 19, 35, 77,
102, 112, and 204.  These are therefore the only valid values for the
'phase' tag for this segment.</P>

<H3 style="page-break-before: always">
<A NAME="panels">Specifying Panel Correction Information</A> </H3>

<p>Panel member counts should be entered only if the user wishes to invoke Panel Correction. They need not be specified for all regions, only those for which one has the number of sequences used to create the panel. 
</p>
<p>WARNING: Do not estimate the number of sequences used to create a panel, it will make your results indefensible. If you do not have the actual number of sequences, you should not use Panel Correction. Your mutation rates will be lower, but that's the best you can do without knowing more about how the panel was created. 
</p>
<table border=1>
<tr><th colspan=4>Specifying Panel Correction Information in Converter Command File</th></tr>
<tr><th>parent tag</th><th>child tag</th><th>child required</th><th>child instances allowed</th></tr>
<tr><td>&lt;panels&gt;</td><td>&lt;panel&gt;</td>
                <td>optional</td><td>multiple</td></tr>
<tr>
    <td rowspan=4>&lt;panel&gt;</td>
            <td>&lt;panel-name&gt;</td>
                <td>optional</td><td>SINGLE</td>
    <tr><td>&lt;panel-region&gt;</td>
            <td>REQUIRED</td><td>SINGLE</td></tr>
    <tr><td>&lt;panel-pop&gt;</td>
            <td>REQUIRED</td><td>SINGLE</td></tr>
    <tr><td>&lt;panel-size&gt;</td>
            <td>REQUIRED</td><td>SINGLE</td></tr>
</table>


<P>(<A HREF="panels.html">Back</A> | <A HREF="index.html">Contents</A>
| <A HREF="xmlinput.html">Next</A>)</P>

<!--
//$Id: converter_cmd.html,v 1.19 2016/04/19 21:01:32 lpsmith Exp $
-->
</BODY>
</HTML>