It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
Expressed Sequence Tag (EST) sequencing is one of the most efficient means for gene discovery and gene expression profiling. With a good resource of ESTs, a large number of molecular markers can be identified, and issues related to alternative splicing and differential poly adenylation can be addressed at the genome-wide scale. Through the Community Sequencing Program, a catfish EST sequencing project was selected by the DOE’s Joint Genome Institute (JGI). In this project, a total of 12 cDNA libraries were constructed including eight from channel catfish (Ictalurus punctatus) and four from blue catfish (I. furcatus). A total of 600,000 sequencing attempts were made, generating a total of 438,321 quality ESTs. With previously existing ESTs in GenBank, this project brings the total of ESTs to nearly 500,000 in the catfish. The JGI EST sequencing had an overall sequencing success rate of 73% with an average length of 576 bp. All the ESTs were assembled using CAP3, resulting in 111,578 unique sequences, including 45,306 contigs and 66,272 singletons. Of these unique sequences, over 35% had significant similarities to known genes by BLASTX searches, which allowed the identification of 14,776 unique genes in the catfish. A total of 1,350 and 849 full length cDNAs have been identified from channel catfish and blue catfish, respectively. The ESTs are an enormous resource for SNP identification. The quality assessment parameters for EST-derived were established based on a pilot study with 384 SNPs. In order to select reliable SNPs, contigs containing four or more ESTs should be used and the minor allele sequence should be represented at least twice. Genotyping primers should be designed from a single exon, completely avoiding introns. Application of such quality assessment measures, along with large resources of ESTs, should provide effective means for SNP identification in species where genome sequence resources are lacking. Over 300,000 putative SNPs have been identified, of which over 48,000 are high quality SNPs as defined by contig size of at least four sequences and the minor allele presence of at least twice in the contig. The EST resource should also be valuable for identification of microsatellites, comparative genome analysis. This large scale EST sequencing project would allow the identification of majority of catfish transcriptome. The parallel analysis of ESTs from the two closely related ictalurid catfishes should also provide powerful means for the evaluation of ancient and recent gene duplications, and for the development of high-density microarrays in catfish. The inter- and intra-specific SNPs identified from all catfish EST dataset assembly will greatly benefit the catfish introgression breeding selection and whole genome association studies. All ESTs have been deposited in GenBank.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer