Case Studies for the RASH Process
Three main case studies have been chosen to ellucidate and demonstrate the RASH process:
- The development of schema for the SWISS-PROT and PIR protein sequence databanks and their reconciliation to each other and a unifying schema.
- The development of schema for each of the Saccharomyces Genome Database (SGD), MIPS Yeast Genome Database (MYDG) and the Yeast Proteome Database (YPD) and their reconciliation to the GIMS (Genome Information Management System) as a unifying schema. GIMS has an object database schema and takes information from each of the yeast databases, so portions of each resource's schema are reconciled with the GIMS schema.
- The reconciliation of a variety of schema fragments from several resources, including those above. this would include the concept of gene, function, and other sequence features.
Each of these case studies highlights some commonly occuring features of the reconciliation task in bioinformatics:
- The manifestation of implicit schema from flat-file databanks.
- the development of schema from a collection of relational tables.
- The development of schema from a web based access interface to a resource.
- Such developments from resources with the full spectrum of documentation from complete to only the resource itself.
- the reconciliation of schema to each other and to a unifying schema.
- The reconciliation of complete schema to the reconciliation of schema fragments.
- Inferring the conceptualisation of commonly occuring concepts, such as gene, which have a variety of legitimate forms.