File: pfsearchV2.1

package info (click to toggle)
pftools 3%2Bdfsg-3
  • links: PTS, VCS
  • area: main
  • in suites: buster
  • size: 2,344 kB
  • sloc: fortran: 11,706; ansic: 10,085; makefile: 85; sh: 65
file content (133 lines) | stat: -rw-r--r-- 3,610 bytes parent folder | download | duplicates (4)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
.TH PFSEARCHV2 1 "February 1998" "pftools 2.1"
.SH NAME
pfsearchV2 \- search a protein or DNA sequence library for sequence segments 
matching a profile
.SH SYNOPSIS
.B pfsearchV2 
[ -abflLrsuxyz ] [ profile-file | - ]
            [ seq-library-file | - ]    [C=#] [W=#] 
.SH DESCRIPTION
.B pfsearchV2 
compares a query profile against a DNA or protein sequence library.
The result is an unsorted list of profile-sequence matches written to the standard output. 
A variety of output formats containing different information can be specified 
via the options 
.I -a, -l,  -L, -r, -u, -s,
.I -x, -y, -z,
and the command-line parameter 
.I C=#.
.I profile-file
contains a profile in PROSITE format.
.I seq-library-file
contains a sequence library in EMBL/SWISS-PROT
format (assumed by default) or in Pearson/Fasta
format (indicated by option 
.I -f). 
.B pfsearchV2 
can be used as a filter if - is used instead of one of the input 
filenames. 
.SH OPTIONS 
.TP
\-a
Report optimal alignment scores for 
all sequences regardless of the cut-off value. 
This option simultaneously forces DISJOINT=UNIQUE.   
.TP
\-b
Search the complementary strands of DNA sequences as well.
.TP
\-f
Input sequence-library is in Pearson/Fasta format.
.TP
\-l
Indicate by number the highest cut-off level exceeded by the
match score in the output list. 
.TP
\-L
Indicate by character string the highest cut-off level exceeded by the match score
in the output list. Note that the generalized profile format includes a text
string field to specify a name for a cut-off level. The \-L option causes the
program to display the first two characters of this text string 
(usually something like "!", "?", "??", etc.) at the beginning of
each match description. 
.TP
\-r
Use raw scores rather than normalized 
scores for match selection. Normalized scores 
will not be listed in the output. 
.TP
\-s
List the sequences of the matched regions as well. 
The output will be a Pearson/Fasta-formatted sequence
library.
.TP
\-u
Forces DISJOINT=UNIQUE. 
.TP
\-x
List profile-sequence alignments 
in pftools PSA format.  
.TP
\-y
Display alignments between the profile and the matched sequence regions in 
a human-friendly format.   
.TP
\-z
Indicate starting and ending position of the matched profile range. The latter
position will be given as a negative offset from the end of the profile. Thus
the range [    1,    -1] means entire profile.
.SH PARAMETERS 
.TP
C=#
Cut-off value.
Over-writes the level zero cut-off value specified in
the profile.
An integer argument is interpreted as a raw score value,
a decimal argument as a normalized score value. An integer value 
forces option 
\-r.
.TP
W=#
Output width. 
Output lines will be truncated after 
.I W
characters. 
Default: W=132. 
.SH EXAMPLES 
.TP
(1)
.B pfsearchV2
\-f sh3.prf sh3.seq C=6.0

Searches the Pearson/Fasta-formatted protein sequence library sh3.seq
for SH3 domains with a cut-off value of 6.0 normalized score units.
sh3.seq contains 20 SH3 domain-containing protein sequences from SWISS-PROT
release 32.
sh3.prf contains the PROSITE entry SH3/PS50002.
.TP
(2)
.B pfsearchV2
\-bx ecp.prf CVPBR322 |
.B psa2msa   
\-du |    
.B readseq
\-p -fMSF > ecp.msf

Generates a multiple sequence alignment  of potential
.I E. coli
promoters on both strands of plasmid pBR322.
ecp.prf contains a profile for
.I E. coli   
promoters.  
CVPBR322 contains EMBL entry J01749|CVPBR322.
The result file
.I ecp.msf   
can further be processed by GCG programs accepting MSF files as input.

.I See
also manual pages of 
.B psa2msa. 
.SH AUTHOR
Philipp Bucher
.br
Philipp.Bucher@isrec.unil.ch