Chado Natural Diversity Module Working Group

From GMOD
Revision as of 22:17, 27 January 2010 by Clements (Talk | contribs)


This page or section is under construction.

The Chado Natural Diversity Working Group has been established with the aim of getting the Chado Natural Diversity Module into the production version of Chado.


Background and Timeline

This section describes important events in the development of the module. Detailed discussion of functionality is in a separate section below.

2007

The initial version of the Natural Diversity Module was developed by several people associated with NESCent. The initial application was Heliconius research. This first version, like subsequent versions, is directly inspired by the Genomic Diversity and Phenotype Data Model (GDPDM), which comes out of Cornell. The GDPDM has great documentation and is also described in this presentation.

2009

In the fall of 2009, Sook Jung of Washington State downloaded the initial version (becoming the first known user outside Heliconius) and started looking at it with the goal of using it for GDR, a plant genome database. Sook found that a number of things weren't clear, and her input led to a rethinking of the design and to the formation of this working group.

2010

January 2010 GMOD Satellite Meeting

Several working group participants met twice during PAG 2010, immediately before the January 2010 GMOD Meeting.


Discussion

At this time (January 2010), most of this discussion is about making changes relative to the version that was created in 2007 with Heliconius in mind. That version is referred to below as HDB.

Observational Taxonomy

HDB has several different levels of biological unit, each represented by a different set of tables.

There is also a bevy of relationships between these tables:

Organism M:M Biotype
Biotype 1:M Stock (there are 3 different 1:M relationships)
Stock 1:M Individual
Crossexperiment 1:M Individual
Individual 1:M Crossexperiment
Individual 1:M Specimen
Biotype M:M Individual

All of these tables describe some unit or group of living things, ranging from species (organism) down to tissue in hand (specimen). For each type of data (phenotype, images, ...), the HDB design has several structurally identical tables, one for each of the various levels. This hierarchy is also specific to butterflies.

Observational Taxonomy Proposal

When Sook Jung mapped the HDB version to tree biology, a number of issues came up, many of which boil down to:

  • Species/biotype/stock/individual/cross hierarchy doesn't work for trees (living trees, not abstract ones).
  • Lineage doesn't work for trees.

This highlights that HDB is not a very Chadoesque design. We need to genericize the design to support arbitrary hierarchies, lineages, and mating types. This will support many more users and allow them to store images, phenotypes, genotypes, properties, etc. for whatever level of the hierarchy they have data for.

We can't touch Organism, as it's a key table in every Chado instance out there. However, everything else is open to change.

Observational Unit?

The GDPDM has observational units, which represent whatever level of sample you have data for. I find that name descriptive, but awkward. Unfortunately, I can't think of a better name. Suggestions are welcome.

Specifics:

  1. Try to combine biotype, stock, individual, and crossexperiment into a single table, tentatively called obs_unit (with a nod to GDPDM).
  2. Investigate also folding specimen into obs_unit.
  3. An observational unit's place in the observational taxonomy will be indicated by a new column in obs_unit that points to the CV table. For butterflies, the possible values might be "species", "biotype", "stock", "individual", and possibly "specimen".
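
To make the proposal concrete, here is a minimal sketch of what a single obs_unit table with a type column might look like. It uses Python's built-in sqlite3 module purely for illustration; it is not the working group's design. Only the table name obs_unit and the level names come from the discussion above. The column names, the simplified cvterm table, the obs_unit_relationship table, the "member_of" term, and the sample rows are all assumptions made for this example.

```python
import sqlite3

# Hypothetical schema. Only the name "obs_unit" comes from the proposal;
# every column, and the other two tables, are assumptions for illustration.
SCHEMA = """
CREATE TABLE cvterm (                 -- simplified stand-in for the CV table
    cvterm_id INTEGER PRIMARY KEY,
    name      TEXT NOT NULL UNIQUE
);
CREATE TABLE obs_unit (               -- one table for every level of the hierarchy
    obs_unit_id INTEGER PRIMARY KEY,
    name        TEXT NOT NULL,
    type_id     INTEGER NOT NULL REFERENCES cvterm (cvterm_id)
);
CREATE TABLE obs_unit_relationship (  -- assumed way to link units to each other
    subject_id INTEGER NOT NULL REFERENCES obs_unit (obs_unit_id),
    object_id  INTEGER NOT NULL REFERENCES obs_unit (obs_unit_id),
    type_id    INTEGER NOT NULL REFERENCES cvterm (cvterm_id)
);
"""

# Level names suggested above for butterflies.
LEVELS = ["species", "biotype", "stock", "individual", "specimen"]

# Shared SELECT used to fetch a unit together with the name of its level.
UNIT_WITH_LEVEL = (
    "SELECT u.obs_unit_id, u.name, t.name "
    "FROM obs_unit u JOIN cvterm t ON t.cvterm_id = u.type_id "
)


def build_demo():
    """Create an in-memory database with a few made-up observational units."""
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(SCHEMA)

    term = {}
    for name in LEVELS + ["member_of"]:  # "member_of" is a hypothetical term
        cur = conn.execute("INSERT INTO cvterm (name) VALUES (?)", (name,))
        term[name] = cur.lastrowid

    unit = {}
    samples = [("biotype X", "biotype"), ("stock 1", "stock"),
               ("individual A", "individual")]
    for name, level in samples:
        cur = conn.execute(
            "INSERT INTO obs_unit (name, type_id) VALUES (?, ?)",
            (name, term[level]))
        unit[name] = cur.lastrowid

    # Each pair is (child, parent): the child is a member of the parent.
    for child, parent in [("individual A", "stock 1"), ("stock 1", "biotype X")]:
        conn.execute(
            "INSERT INTO obs_unit_relationship (subject_id, object_id, type_id) "
            "VALUES (?, ?, ?)",
            (unit[child], unit[parent], term["member_of"]))
    conn.commit()
    return conn


def units_of_type(conn, level):
    """Return the names of all units at the given level, sorted by name."""
    rows = conn.execute(
        "SELECT u.name FROM obs_unit u "
        "JOIN cvterm t ON t.cvterm_id = u.type_id "
        "WHERE t.name = ? ORDER BY u.name", (level,))
    return [r[0] for r in rows]


def lineage(conn, name):
    """Walk relationship links upward; return (unit name, level) pairs.

    Assumes each unit has at most one parent and that there are no cycles.
    """
    row = conn.execute(UNIT_WITH_LEVEL + "WHERE u.name = ?", (name,)).fetchone()
    path = []
    while row is not None:
        path.append((row[1], row[2]))
        row = conn.execute(
            "SELECT u.obs_unit_id, u.name, t.name "
            "FROM obs_unit_relationship r "
            "JOIN obs_unit u ON u.obs_unit_id = r.object_id "
            "JOIN cvterm t ON t.cvterm_id = u.type_id "
            "WHERE r.subject_id = ?", (row[0],)).fetchone()
    return path


if __name__ == "__main__":
    demo = build_demo()
    print(units_of_type(demo, "stock"))
    print(lineage(demo, "individual A"))
```

Because the level is just a CV term, a tree database could register its own terms and reuse the same tables without a schema change, which is the kind of arbitrary hierarchy the proposal is after.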

Links

Membership

If you are interested, please add your name below. (Either update this page directly, or send your contact info to Dave Clements.)

Name | Email | Affiliation | Comments
Dave Clements (organizer) | clements@nescent.org | NESCent, GMOD | Please let me know if you are interested in participating in this group, or if you have any questions.
Sook Jung | sook * bioinfo.wsu.edu | Washington State University, GDR |
Meg Staton | mestato * yahoo.com | CUGI |
Stephen Ficklin | ficklin * clemson.edu | CUGI |
Dorrie Main | dorrie * wsu.edu | Washington State University, GDR |
Scott Cain | | OICR / GMOD |

