|
|
Notes on the History of Animal QTL Database The initial proposals, plans, and design scratches on developing a (pig) QTL database was made within the NAGRP Pig Genome Coordination Program in the fall of 1998. The vision layout, complete database design, data preparation and preliminary exploration on the approaches was not made until 2002/2003 when I returned to Iowa State. The pig QTL data collection and tools development works were done in 2004, which result in our first paper on PigQTLdb published (Mammalian Genome 2005, Volume 16(10):792-800). Subsequently, the QTLdb was expanded for new functionality and utility. The first of which was the addition of the set of curator/editor tools the chromosomal viewer tools. The second of which was the expansion of the QTLdb to include cattle/chicken QTL data in 2005/2006. The third of which was additions of structural alignment of the QTL maps with RH maps, human maps, FPC maps etc., with includsion of SNPs, Microarray elements, and new microsatellites for alignments in 2006/7 (see FAQ #11 for details). As a result, two more papers were published (Nucleic Acids Research, 2007, 35 [Database issue]: D604-D609; Mammalian Genome, 18: 1-4, 2007 on a new database name, Animal QTLdb). During the course of the project development, many attempts were made to utilize existing tools. Initially, we contacted a number of research groups and attempted several existing tools, which include: Roslin Institute for Anubis tool, Cold Spring Harbor Lab for cMAP, University of Tennessee Bioinformatics group for their Mouse/Human Bone Density QTL Database, Sweden RatMap Group for its QTL tools, Texas A&M University for a Bovine QTL Viewer, among others. For various reasons none of these tools could be worked out with the available facilities, satisfection of our check list of goals in terms of functions and utilities. As the last resort we explored a Perl/GD and MySQL approach to program out the needed tools for integrating, displaying and analyzing the stored QTL data. Under an agreement between the NAGRP Bioinformatics Coordinator and the NCBI, a copy of the database is implemented at the NCBI. As the development work went, the NCBI made a preliminary release with partial data in June 2004 (Release 1); and the NAGRP made a second release (Release 2) in December, 2004 as the data collection and new tools development came to a completion. As initially planned, the pig QTL data is synchronized between the NCBI and NAGRP databases. Due to the NCBI database structural change and re-development (e.g. Case of Locus Link database and the creation of Gene database), the QTL synchronization was at a pause during 2006-2009. The data synchronization was resumed in 2009, with continued efforts in making the data stream smooth. The QTLdb release was made at lease once per year (See Box 1).
Box 1. QTLdb Release History
The database and its peripheral tools at the NAGRP implementation was designed mainly for users to compare, confirm and locate the most plausible locations on a chromosomes for QTL. This is aimed at providing a tool for structural genomic information mining for genes responsible for quantitative traits important to animal production; The NCBI implementation has all marker information matched to marker records in NCBI's UniSTS database. This allows automatic matching of markers to public sequence data by e-PCR. Data on the NCBI and the NAGRP Animal Genome servers are cross-referenced to each other. This function allows all of the unique information on each site to appear to be integrated on the same database server via worldwide web. A noticible extended feature developed into the QTLdb is that diverse types of structural genome features, such as microsatellites, SNPs, microarray elements, tiled FPC BACs, can be aligned with QTL map locations to aid the data mining. Further more, different map types, such as RH maps, genome sequence maps, human maps, can also be align where available data is available to establish links. This functionality is further enhanced with the use of GBrowse tool for alignment of essentially any structural genomic features, such as transcripts, mRNA and gene structures mapped to the genome. Related publications throughout the course of this work:
| |
|
First draft: January 3, 2005
Modified : January 4, 2006 Modified : January 11, 2007 Modified : January 08, 2008 Modified : September 25, 2009 |
By Zhiliang Hu
Associate Scientist Dept of Animal Science Iowa State University |
|
|
© 2003-
NAGRP - Bioinformatics Coordination Program. Contact: NAGRP Bioinformatics Team |