1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148
|
Here are some examples that you might want to try out. The
executables of course should be installed before hand somewhere in
your path. Or you can call them as ../bin/genewise etc if you haven't
installed them.
If you get warnings like
Warning Error
Could not open codon.table
then you haven't set the WISECONFIGDIR enviroment variable.
You can set it now by going
setenv WISECONFIGDIR ../wisecfg
Print out the wise2 documentation (docs/wise2.ps) and read
the 'Introduction for the impatient user'
Here are some examples:
a) Comparing a protein sequence to a genomic clone
genewise road.pep hngen.fa
(the protein is from drosophila)
b) Comparing a protein HMM to a genomic clone
genewise -hmmer rrm.HMM hngen.fa
c) Try the following options on either a) or b)
-kbyte XXXX where XXXX is the maximum amount of memory
on your machine
-genes - gives gene prediction output
-trans - gives translations of the predicted genes
-embl - give EMBL format Feature table output
do genewise -help and have a look at other formats you
could make
d) try
genewise -hmmer rrm.HMM hngen.fa -alg 2193L
for the most complicated algorithm genewise has to offer.
For comparisons using ESTs
a) protein to est
estwise road.pep hn_est.fa
Notice the frameshift with the ! in the translation
b) profile HMM to est
estwise -hmmer rrm.HMM hn_est.fa
For database comparisons
a) an dna sequence against a database of profile HMMs
estwisedb -pfam db.hmm vav.dna
or
genewisedb -pfam db.hmm vav.dna
----------------------------------------------------------------
Using dba. dba (Dna Block aligner) compares two DNA sequences
trying to find co-linear blocks of homology with the potential
of a small number of gaps but with large intervening sequences
between the blocks of variable length.
The idea is to be able to find regulatory regions in non coding
sequences from different species.
For example, using the first intron from keratin18 genes from
mouse and human gives the following output:
adnah:[/wise2/example]<53>: ../bin/dba keratin_intron.human keratin_intron.mouse
score = 72.52
56 140 109 192 69% 3 gaps, MATCH_A
321 395 358 431 89% 1 gaps, MATCH_C
401 496 467 559 70% 3 gaps, MATCH_A
560 579 593 613 95% 1 gaps, MATCH_D
622 691 646 712 80% 5 gaps, MATCH_B
This indicates that there are 5 blocks of homology. The blocks are
parameterised into homology levels being (when run using the default
parameters)
A approx 60-70% identity
B approx 70-80% identity
C approx 80-90% identity
D approx 90-100% identity
The approximation is due to the fact that the models are parameterised
probabilistically with gaps being taken into account. The probabilites
for the different blocks are by default
A 0.65
B 0.75
C 0.85
D 0.95
(each probability representing the probability of finding a idenity
pair in the block).
By and large blocks around the identity range will fall into the
correct homology block, but the interplay of the parameters about
block probability means that the calculated %idenity need not fall
into the expected block type.
The -align option is very verbose and probably not that useful.
dba has been intergrated into the alfresco java viewer. In alfresco
you can interactively decide what regions to compare and shows the
results integrated with other analysis, eg repeats. Alfresco (and
dba) was written by Niclas Jareborg. A web site for it is at
http://www.sanger.ac.uk/Software/Alfresco
|