© 2002-2005
CSGR The University of Georgia
All data resulting from this award will be made freely available to the non-profit scientific community in a timely manner (see below), and will be made accessible to the public via Web-based resources. To ensure against loss of biological materials or distribution delays, all shared physical resources, such as DNA clones, will be maintained at a minimum of two locations (as described below), each of which will be able and authorized to respond to requests for distribution. The proposed work will yield comparative genetic and physical data, as well as DNA sequence and expression data that will be relevant to many major crops. Publications that describe the broad messages arising from these data will be promptly prepared for leading refereed journals. The vast majority of data, including the identities and arrangements of individual BAC clones, DNA sequences, etc, will not be appropriate for the published literature, but are of utmost importance for ongoing research in many taxa.
Release of sequence and other community resources: All STC data will be deposited into the Genome Survey Sequence section (GSS) of Genbank within one month of sequence generation. The STC data (flat files, trace files and quality files) will also be made available at our ftp site (ftp.genome.clemson.edu). 'Re-sequencing' data from Cornell and UGA, including SSR, SNP, and candidate gene/promoter sequences, will be deposited in the same timeframe, and also maintained in the database. As of this writing, from the current funding period, 58,102 ESTs have been deposited in GenBank, all of which meet widely-used quality standards (Q16 on ABI3700, Phred 20 on ABI377). Comparable quality standards will continue to be used.
Regarding physical map revisions, communication between FPC and the CSGR database will have an update component and a web display component. The update component consists of nightly cronjobs (NB at the Sanger Centre, FPC exchanges data with an Oracle database using this scenario). A cronjob will execute a batch FPC process to write a file of contigs and read a file of markers generated by the CSGR database. Another cronjob will execute a batch process to read the FPC generated file into the CSGR database and write the marker file. The files will be placed in directories that will be accessible by both databases.
Plan for short-term and long-term distribution of project outcomes. Responsibility for IP management lies with the lead institution. BAC libraries have been made, and are distributed and curated by the Wing (SB, rice, sugarcane) and Paterson (SP) labs. Each library is backed up at both locations, although the indicated location has distribution responsibility (requests should be directed accordingly). QBOTs are located at both locations. Cost structure is described above.
Amplified cDNA libraries, and individual clones from which ESTs are derived were generated under the initial funding period, and may be requested under the commitments made at that time (from the Cordonnier-Pratt/Pratt lab). Backup copies are maintained at one or more of the other labs with interest in particular libraries.
Clones will also be offered to the American Type Culture Collection; however in view of the large volumes of space that are occupied by our libraries (hundreds of plates per library), they have shown little interest in the past. Given our access to QBOTs, we feel comfortable serving as the primary distribution points.
Long-term data access is provided by a Web-based resource, maintained at the University of Georgia. Matching support from UGA (Patel, Cherry letters) assures that there will be a network administrator to help co-PI Gingle with routine aspects of computational needs, and free Gingle to concentrate on programming. In addition, UGA underwrites the long-term maintenance of the online and functional status of the database (see letter from G. Patel). Stand-alone software tools developed under this support will be available to the non-profit research community for the purposes of academic research through free academic license agreements. Distribution will be through WWW download pages or via ftp.