File: valid.msg

package info (click to toggle)
ncbi-tools6 6.0.2-1.1
  • links: PTS
  • area: main
  • in suites: potato
  • size: 33,932 kB
  • ctags: 50,922
  • sloc: ansic: 590,557; sh: 716; makefile: 375; csh: 234; lisp: 81; cpp: 57; perl: 48
file content (301 lines) | stat: -rw-r--r-- 10,340 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
MODULE valid
$$ SEQ_INST, 1
$^   ExtNotAllowed, 1
# This is a comment
A Bioseq "extension" is used for special classes of Bioseq. This class of
Bioseq should not have one but it does. This is probably a software error.

$^   ExtBadOrMissing, 2
This class of Bioseq requires an "extension" but it is missing or of the
wrong type. This is probably a software error.

$^   SeqDataNotFound, 3
No actual sequence data was found on this Bioseq. This is probably a
software problem.

$^   SeqDataNotAllowed, 4
The wrong type of sequence data was found on this Bioseq. This is probably a
software problem.

$^   ReprInvalid, 5
This Bioseq has an invalid representation class. This is probably a software
error.

$^   CircularProtein, 6
This protein Bioseq is represented as circular. Circular topology is
normally used only for certain DNA molecules, for example, plasmids.

$^   DSProtein, 7
This protein Bioseq has strandedness indicated. Strandedness is normally a
property only of DNA sequences. Please unset the strandedness.

$^   MolNotSet, 8
It is not clear whether this sequence is nucleic acid or protein. Please set
the appropriate molecule type (Bioseq.mol).

$^   MolOther, 9
Most sequences are either nucleic acid or protein. However, the molecule
type (Bioseq.mol) is set to "other". It should probably be set to nucleic
acid or a protein.

$^   FuzzyLen, 10
This sequence is marked as having an uncertain length, but the length is
known exactly.

$^   InvalidLen, 11
The length indicated for this sequence is invalid. This is probably a
software error.

$^   InvalidAlphabet, 12
This Bioseq has an invalid alphabet (e.g. protein codes on a nucleic acid or
vice versa). This is probably a software error.

$^   SeqDataLenWrong, 13
The length of this Bioseq does not agree with the length of the actual data.
This is probably a software error.

$^   SeqPortFail, 14
Something is very wrong with this entry. The validator cannot open a SeqPort
on the Bioseq. Further testing cannot be done.

$^   InvalidResidue, 15
Invalid residue codes were found in this Bioseq.

$^   StopInProtein, 16
Stop codon symbols were found in this protein Bioseq.

$^   PartialInconsistent, 17
This segmented sequence is described as complete or incomplete in several
places, but these settings are inconsistent.

$^   ShortSeq, 18
This Bioseq is unusually short (less than 4 amino acids or less than 11
nucleic acids). GenBank does not usually accept such short sequences.

$^   NoIdOnBioseq, 19
No SeqIds were found on this Bioseq. This is probably a software error.

$^   BadDeltaSeq, 20
Delta sequences should only be HTGS-1 or HTGS-2.

$^   LongHtgsSequence, 21
HTGS-1 or HTGS-2 sequences must be < 350 KB in length.

$^   LongLiteralSequence, 22
Delta literals must be < 350 KB in length.

$^   SequenceTooLong, 23
Individual sequences must be < 350 KB in length.

$^   ConflictingIdsOnBioseq, 24
Two SeqIds of the same class was found on this Bioseq. This is probably a software error.

$$ SEQ_DESCR, 2

$^   BioSourceMissing, 1
The biological source of this sequence has not been described correctly.  A
Bioseq must have a BioSource descriptor that covers the entire molecule.
Additional BioSource features may also be added to recombinant molecules,
natural or otherwise, to designate the parts of the molecule. Please add the
source information.

$^   InvalidForType, 2
This descriptor cannot be used with this Bioseq. A descriptor placed at the
BioseqSet level applies to all of the Bioseqs in the set. Please make sure
the descriptor is consistent with every sequence to which it applies.

$^   Inconsistent, 3
There are two descriptors of the same type which are inconsistent with each
other. Please make them consistent.

$^   Unknown, 4
An unknown or "other" modifier was used.

$^   NoPubFound, 5
No publications were found in this entry which refer to this Bioseq. If a
publication descriptor is added to a BioseqSet, it will apply to all of the
Bioseqs in the set. A publication feature should be used if the publication
applies only to a subregion of a sequence.

$^   NoOrgFound, 6
This entry does not specify the organism that was the source of the sequence.
Please name the organism.

$^   MultipleBioSources, 7
There are multiple BioSource or OrgRef descriptors in the same chain with
the same taxonomic name. Their information should be combined into a single
BioSource descriptor.

$^   NoMolInfoFound, 8
This sequence does not have a Mol-info descriptor applying to it.  This indicates
genomic vs. message, sequencing technique, and whether the sequence is incomplete.

$$ GENERIC, 3

$^   NonAsciiAsn, 1
There is a non-ASCII type character in this entry.

$^   Spell, 2
There is a potentially misspelled word in this entry.

$^   AuthorListHasEtAl, 3
The author list contains et al, which should be replaced with the
remaining author names.

$^   MissingPubInfo, 4
The publication is missing essential information, such as title or
authors.

$$ SEQ_PKG, 4

$^   NoCdRegionPtr, 1
A protein is found in this entry, but the coding region has not been
described. Please add a CdRegion feature to the nucleotide Bioseq.

$^   NucProtProblem, 2
Both DNA and protein sequences were expected, but one of the two seems to be
missing. Perhaps this is the wrong package to use.

$^   SegSetProblem, 3
A segmented sequence was expected, but it was not found. Perhaps this is the
wrong package to use.

$^   EmptySet, 4
No Bioseqs were found in this BioseqSet. Is that what was intended?

$^   NucProtNotSegSet, 5
A nuc-prot set should not contain any other BioseqSet except segset.

$^   SegSetNotParts, 6
A segset should not contain any other BioseqSet except parts.

$^   SegSetMixedBioseqs, 7
A segset should not contain both nucleotide and protein Bioseqs.

$^   PartsSetMixedBioseqs, 8
A parts set should not contain both nucleotide and protein Bioseqs.

$^   PartsSetHasSets, 9
A parts set should not contain BioseqSets.

$$ SEQ_FEAT, 5

$^   InvalidForType, 1
This feature type is illegal on this type of Bioseq.

$^   PartialProblem, 2
There are several places in an entry where a sequence can be described as
either partial or complete. In this entry, these settings are inconsistent.
Make sure that the location and product Seq-locs, the Bioseqs, and the
SeqFeat partial flag all agree in describing this SeqFeat as partial or
complete.

$^   InvalidType, 3
A feature with an invalid type has been detected. This is most likely a
software problem.

$^   Range, 4
The coordinates describing the location of a feature do not fall within the
sequence itself. A feature location or a product Seq-loc is out of range of
the Bioseq it points to.

$^   MixedStrand, 5
Mixed strands (plus and minus) have been found in the same location. While
this is biologically possible, it is very unusual. Please check that this is
really what you mean.

$^   SeqLocOrder, 6
This location has intervals that are out of order. While whis is biologically
possible, it is very unusual. Please check that this is really what you mean.

$^   CdTransFail, 7
A fundamental error occurred in software while attempting to translate this
coding region. It is either a software problem or sever data corruption.

$^   StartCodon, 8
An illegal start codon was used. Some possible explanations are: (1) the
wrong genetic code may have been selected; (2) the wrong reading frame may
be in use; or (3) the coding region may be incomplete at the 5' end, in
which case a partial location should be indicated.

$^   InternalStop, 9
Internal stop codons are found in the protein sequence. Some possible
explanations are: (1) the wrong genetic code may have been selected; (2) the
wrong reading frame may be in use; (3) the coding region may be incomplete
at the 5' end, in which case a partial location should be indicated; or (4)
the CdRegion feature location is incorrect.

$^   NoProtein, 10
Normally a protein sequence is supplied. This sequence can then be compared
with the translation of the coding region. In this entry, no protein Bioseq
was found, and the comparison could not be made.

$^   MisMatchAA, 11
The protein sequence that was supplied is not identical to the translation
of the coding region. Mismatching amino acids are found between these two
sequences.

$^   TransLen, 12
The protein sequence that was supplied is not the same length as the
translation of the coding region. Please determine why they are different.

$^   NoStop, 13
A coding region that is complete should have a stop codon at the 3'end.  A
stop codon was not found on this sequence, although one was expected.

$^   TranslExcept, 14
An unparsed \transl_except qualifier was found. This indicates a parser
problem.

$^   NoProtRefFound, 15
The name and description of the protein is missing from this entry. Every
protein Bioseq must have one full-length Prot-ref feature to provide this
information.

$^   NotSpliceConsensus, 16
Splice junctions typically have GT as the first two bases of the intron
(splice donor) and AG as the last two bases of the intron (splice acceptor).
This intron does not conform to that pattern.

$^ OrfCdsHasProduct, 17
A coding region flagged as orf has a protein product.  There should be no
protein product bioseq on an orf.

$^ GeneRefHasNoData, 18
A gene feature exists with no locus name or other fields filled in.

$^ ExceptInconsistent, 19
A coding region has an exception gbqual but the excpt flag is not set.

$^ ProtRefHasNoData, 20
A protein feature exists with no name or other fields filled in.

$^ GenCodeMismatch, 21
The genetic code stored in the BioSource is different than that for this CDS.

$^ RNAtype0, 22
RNA type 0 (unknown RNA) should be type 255 (other).

$^ UnknownImpFeatKey, 23
An import feature has an unrecognized key.

$^ UnknownImpFeatQual, 24
An import feature has an unrecognized qualifier.

$^ WrongQualOnImpFeat, 25
This qualifier is not legal for this feature.

$^ MissingQualOnImpFeat, 26
An essential qualifier for this feature is missing.

$^ PsuedoCdsHasProduct, 27
A coding region flagged as pseudo has a protein product.  There should be no
protein product bioseq on a pseudo CDS.

$^ IllegalDbXref, 28
The database in a cross-reference is not on the list of officially
recognized database abbreviations.

$^ FarLocation, 29
The location has a reference to a bioseq that is not packaged in this record.