1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393
|
Version 1.23 Thu Dec 1 14:45:22 EST 2011
* Added $VERSION to Bio::GMOD::DB::Config so that when installing
Bio::DB::Das::Chado it would be happy.
* Changed all perl script #! lines to use /usr/bin/env perl instead of the
mishmash that was being used in various scripts.
* Fixed the GFF3 preprocessor so that if an insert into the sorting table fails,
the script will die to avoid losing GFF lines in the resulting file.
* Modified Makefile.PL so that if using PREFIX it won't stomp on the
GMOD_ROOT envirnment variable (that is, everything goes in PREFIX, except what
goes in GMOD_ROOT).
Version 1.22 Thu Nov 17 12:48:10 EST 2011
* Created a script to automatically create the schemas dirs for a release and
add them to the manifest.
Version 1.21 Thu Oct 20 10:52:55 EDT 2011
* Added rudimentary support for non-public schemas in Postgres, though it's
only good for Tripal at the moment.
* Fixed database initialization step "make prepdb" so that the version of the
schema gets set in the chadoprop table.
* Fixed bug in gmod_chado_properties.pl that caused it to crash when updating
the schema version.
Version 1.2 Tue Aug 10 13:59:05 EDT 2010
* Added some error checking to gmod_fasta2gff3.pl (Rob)
* Updated Bio::Chado::Builder to detect recent versions of go-perl. They
no longer have .pl at the end of the go2fmt script name (Rob)
* The GFF3 bulk loader will now ignore ##sequence-region directives;
previously, their presence caused the loader to fail (Scott)
* added a check for the nodes file downloaded from NCBI. There have been
cases of missing nodes/parents. Code dies if parent is missing. User
must check the nodes file. This needs further investigation with NCBI
taxonomy. (Naama)
* Fix the code in the gff-bridge in the sequence module so that GFF dumping
from GBrowse will work correctly (Scott)
* Added a cvprop table to capture things like obo file version from
a loaded cv. (Scott)
* Removed the "DB:" prefix from all of the names of databases that go
into the db table. (Scott)
* Removed the dependency on Log::Log4perl (it was overkill for what it
was being used for) (Scott)
* Bumped the version to 1.2 because of the new schema module. (Scott)
* Added gmod_chado_properties.pl to examine the chadoprop table and return
values on the command line (like the schema version) (Scott)
* Added gmod_update_chado.pl to update the schema automatically, and added
a make target (make update) to make updating easier. (Scott)
* Removed cell_type ontology from 'make ontologies' list as it wasn't loading
via stag_storenode.pl at release time (Scott)
Version 1.11 Tue May 25 11:10:47 EDT 2010
* fixed an issue with accessing the primary_id of
Bio::Annotation::DBLink objects (Naama)
* Fixes issue with dependencies on CXGN-specific code. Now all
db connection parameters for cxgn - contributed scripts should be passed
in a gmod_config file, or in command line options as listed in the POD.
* Made gmod_materialized_view_tool.pl able to accept all configuration
parameters on the command line (this is to make it work with automatic
MV creation for full text searching) (Scott).
* fixed messed up gmod_load_cvterms.pl -p and -u options (Naama)
Version 1.1
* Fixes to gmod_gff3_preprocessor.pl so that file splitting will work
as expected.
* Minor fix to Bio::GMOD::DB::Adaptor (used by gmod_bulk_load_gff3.pl)
to silence a uninint warning.
* Created a plpgsql function to share exons (called share_exons) that
takes exon features that have the same boundaries as another exon and reduces
them to one feature and fixes up feature_relationship entries so that
the remaining exon belongs to the right transcript/genes. This may
be needed for Apollo. Thanks to Robin Houston at GeneDB/Sanger Centre
for contributing much of the code.
* Created a function to order exons (called order_exons) that will create
entries in the feature_relationship.rank column to number exons (0 based)
from beginning to end in a given transcript. This was originally needed
for Apollo, but by the time this function was done, Ed Lee had fixed up
Apollo to not need it.
* Fixed the audit module so that the triggers in it work with versions
of PostgreSQL greater that 8.0.
* Added so-bridge views. These are a set of views on the feature table
the use SO terms for the names of the views, making it easy to query for
genes by doing something like this: SELECT * FROM so.gene WHERE name='foo'.
See http://gmod.org/Chado_SO_Views for more information.
* Added a protein_code_gene view (depends on so-bridge); thanks to Chris
Mungall.
* Updated all_feature_names view to search featureprop.value and
dbxref.accession and added organism_id so GBrowse can limit
by organism.
* Fixed gmod_bulk_load_gff3.pl to deal correctly with the case that
mulitple organisms have the same common name. If the user tries to
use a common name that corresponds to more than one organism, it will
die gracefully.
* Fixed gmod_bulk_load_gff3.pl so that featureloc.srcfeature_id will
not be sometimes set incorrectly when there are multiple organisms
with chromosomes (or other srcfeatures) with the same name.
* Added a __DIE__ signal handler to gmod_bulk_load_gff3.pl so that the
databsae will be cleaned up in most cases before the loader croaks.
* Fixed gmod_bulk_load_gff3.pl so that when loading dna sequence from the
fasta section it won't accidentally add sequence to a feature that has
the same uniquename but different organism_id to the actual target.
* Adding a update_sequences method for use by gmod_bulk_load_gff3.pl
that will check the next value available to the database sequences
that generate primary keys and update them when necessary. This
can happen when another database load adds data to chado but does
not update the sequence.
* Adding the ability to specify that gmod_bulk_load_gff3.pl use the
system tmp directory rather than using the current working directory
for writing the temporary load files.
* Added a --fastafile option to gmod_bulk_load_gff3.pl to use when
loading fasta files.
* Removed the cause of the error messages at the end of ontology loading:
it was a patch to correct for bad behavior that DBIx::DBStag which was
fixed some time ago.
* Added gmod_bulk_pub_loader.pl contributed by SGN.
* Added gmod_make_cvtermpath.pl contributed by SGN.
Below this line are changes up until a few years ago and are included
for historical purposes.
---------------------------------------------------------------------------
VERSION 0.01
============
No change log
VERSION 0.02
============
No change log
VERSION 0.03
============
all modules
ditched timeentered and timelastmod columns in every table
We've decided that auditing is implementation specific.
We're presently working on the FB implementation, and
will make that available soon
changed all dbxrefstr foreign keys to dbxref_id
added indexes for every foreign key column in every table
all linking tables:
Added primary keys to all linking tables, for consistency's
sake, since we already had primary keys in some linking tables.
I hope nobody objects...?
general module:
dbxref
removed dbxref.dbxrefstr
removed unique key (dbxrefstr)
cv module:
New table:
added table cv to hold cv names
cvterm
changed accordingly
cvpath
changed accordingly
cvterm_synonym
changed the name of table cvterm_synonym to cvtermsynonym,
since this isn't a linking table.
organism module:
organism_dbxref
added unique key (organism_id,dbxref_id)
organism
made genus, species, taxgroup fields not null
pub module
pub
added miniref
added unique key (miniref)
pub_author
added unique key (author_id,pub_id)
author
added unique key (surname,givenname,suffix)
sequence module
Deleted table
feature_organism
feature
added foreign key organism_id (not null)
added uniquename column (not null)
added unique key (organism_id,uniquename)
made type_id not null
featureloc
added unique key (feature_id,locgroup,rank)
feature_dbxref
changed unique key to (feature_id,dbxref_id)
- was (feature_dbxref_id, dbxref_id) OK???
feature_relationship
changed type_id int attribute to not null
synonym
added 'type_id' to the unique key (synonym,type_id)
genetic module
NOTE: We don't have unique keys on genotype, phenotype, and interaction.
Since we're going to be having a schema review on this module soon,
I decided not to worry about these for the time being.
feature_genotype
added unique key (feature_id,genotype_id)
feature_phenotype
added unique key (feature_id,phenotype_id)
phenotype_cvterm
added unique key (phenotype_id,cvterm_id,prank)
interaction_subj
added unique key (feature_id,interaction_id)
interaction_obj
added unique key (feature_id,interaction_id)
companalysis module
analysis
added unique key (program,programversion)
analysisfeature
changed unique key to (feature_id,analysis_id)
was (feature_id)
expression module
expression
doesn't have a unique key! But what would it be?
feature_expression
added unique key (expression_id,feature_id)
expression_cvterm
added unique key (expression_id,cvterm_id)
expression_pub
added unique key (expression_id,pub_id)
eimage
doesn't have a unique key! What would it be?
expression_image
added unique key (expression_id,eimage_id)
map module
I've skipped this one except for ditching the auditing attributes
and dbxrefstr stuff... still needs lots of work! Maybe we could
do a mini-review on this module sometime soon? I've got some
examples of how I use a rejigged featurepos table to manage the
fragmented a.gambiae genome, but I'd bet what I did can be
improved upon.
VERSION 0.04
============
GENERAL CHANGES
---------------
added Makefile
*prop tables have been made uniform: pkey_id has been changed to
type_id. all fk references to the cvterm table should now either be
type_id or cvterm_id. pval field has been changed to value, and prank
has been changed to rank. no pranks here!
all subject/predicate/object using tables previously used
subjterm_id/reltype_id/objterm_id. this has been changed to
subject_id/type_id/object_id to be more generally applicable and consistent
with other uses of cvterm references.
MODULE CHANGES
--------------
audit module
no changes
bin
added of scripts to autogenerate graphviz and "diagram"
ER diagrams from schema.
added script to autogenerate Class::DBI middleware layer.
this introduces an SQL::Translator dependency
companalysis
no changes
cv
cv.cvname is now cv.name. cv.cvdefinition is now cv.definition.
cvterm.termdefinition is now cvterm.definition.
tables cvrelationship and cvpath have been renamed as cvtermrelationship
and cvtermpath, and the appropariate field and index names have changed.
expression
no changes
general
contact table has been added. data should start moving over from
pub.author soon, and the contact table should be fleshed out by
one of the pub projects (ie pubsearch)
db table pk is now a serial int, rather than a varchar. dbxref
now references db.db_id as dbxref.db_id instead of dbxref.dbname
dbxrefprop and dbxrefrelationship are commented out pending review
genetic
no changes
map
no changes
organism
organism.abbrev changed to organism.abbreviation
pub
introduced a fk from author table to contact table in general module
sequence
no changes
www
added for gmod-web. this is likely to have many of the link
tables removed in favor of using the general-purpose "contact"
table in general module, and using a new tag-value based approach
to storing user bookmarks/lists/preferences/etc
VERSION 0.04
============
MODULE CHANGES
--------------
audit module
no changes
bin
no changes
companalysis
table analysisinvocation removed as redundant with current
usage of analysis.
table analysisinvocationprop changed back to analysisprop.
cv
renamed table cvtermrelationship to cvterm_relationship
expression
no changes
general
tables tableinfo and project moved to metadata.sql.
tables dbxrefprop and dbxrefrelationship removed for now
since they caused problems with the build.
genetic
interaction.backgroundgenotype_id changed to
interaction.background_genotype_id.
map
no changes
organism
no changes
pub
deleted the fk from author table to contact table in the
general module, as this relation is effectively
unimplementable for the vast majority of author records.
sequence
feature_relationship keys subjfeature_id, objfeature_id,
and relrank changed to subject_id, object_id, and rank,
respectively, for consistency with the rest of the schema.
www
no change
2005-02-08 - [cjm] db.contact_id nullable
- cvterm.dbxref_id NOT NULL
- added comments to cv table
2005-03-10 - [cjm] feature_cvterm_dbxref added
2005-04-20 - [cjm] changed comment style and added new comments
cv and sequence modules only
2005-06-07 - [cjm] added: feature_cvterm.is_not and feature_cvterm_dbxref
2006-04-11 - [cjm] split genetic module into genetic+phenotype modules
|