Cargando…

Comparative genomics in cyprinids: common carp ESTs help the annotation of the zebrafish genome

BACKGROUND: Automatic annotation of sequenced eukaryotic genomes integrates a combination of methodologies such as ab-initio methods and alignment of homologous genes and/or proteins. For example, annotation of the zebrafish genome within Ensembl relies heavily on available cDNA and protein sequence...

Descripción completa

Detalles Bibliográficos
Autores principales: Christoffels, Alan, Bartfai, Richard, Srinivasan, Hamsa, Komen, Hans, Orban, Laszlo
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1764476/
https://www.ncbi.nlm.nih.gov/pubmed/17254304
http://dx.doi.org/10.1186/1471-2105-7-S5-S2
_version_ 1782131619387146240
author Christoffels, Alan
Bartfai, Richard
Srinivasan, Hamsa
Komen, Hans
Orban, Laszlo
author_facet Christoffels, Alan
Bartfai, Richard
Srinivasan, Hamsa
Komen, Hans
Orban, Laszlo
author_sort Christoffels, Alan
collection PubMed
description BACKGROUND: Automatic annotation of sequenced eukaryotic genomes integrates a combination of methodologies such as ab-initio methods and alignment of homologous genes and/or proteins. For example, annotation of the zebrafish genome within Ensembl relies heavily on available cDNA and protein sequences from two distantly related fish species and other vertebrates that have diverged several hundred million years ago. The scarcity of genomic information from other cyprinids provides the impetus to leverage EST collections to understand gene structures in this diverse teleost group. RESULTS: We have generated 6,050 ESTs from the differentiating testis of common carp (Cyprinus carpio) and clustered them with 9,303 non-gonadal ESTs from CarpBase as well as 1,317 ESTs and 652 common carp mRNAs from GenBank. Over 28% of the resulting 8,663 unique transcripts are exclusively testis-derived ESTs. Moreover, 974 of these transcripts did not match any sequence in the zebrafish or fathead minnow EST collection. A total of 1,843 unique common carp sequences could be stringently mapped to the zebrafish genome (version 5), of which 1,752 matched coding sequences of zebrafish genes with or without potential splice variants. We show that 91 common carp transcripts map to intergenic and intronic regions on the zebrafish genome assembly and regions annotated with non-teleost sequences. Interestingly, an additional 42 common carp transcripts indicate the potential presence of new splicing variants not found in zebrafish databases so far. The fact that common carp transcripts help the identification or confirmation of these coding regions in zebrafish exemplifies the usefulness of sequences from closely related species for the annotation of model genomes. We also demonstrate that 5' UTR sequences of common carp and zebrafish orthologs share a significant level of similarity based on preservation of motif arrangements for as many as 10 ab-initio motifs. CONCLUSION: Our data show that there is sufficient homology between the transcribed sequences of common carp and zebrafish to warrant an even deeper cyprinid transcriptome comparison. On the other hand, the comparative analysis illustrates the value in utilizing partially sequenced transcriptomes to understand gene structure in this diverse teleost group. We highlight the need for integrated resources to leverage the wealth of fragmented genomic data.
format Text
id pubmed-1764476
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-17644762007-01-09 Comparative genomics in cyprinids: common carp ESTs help the annotation of the zebrafish genome Christoffels, Alan Bartfai, Richard Srinivasan, Hamsa Komen, Hans Orban, Laszlo BMC Bioinformatics Proceedings BACKGROUND: Automatic annotation of sequenced eukaryotic genomes integrates a combination of methodologies such as ab-initio methods and alignment of homologous genes and/or proteins. For example, annotation of the zebrafish genome within Ensembl relies heavily on available cDNA and protein sequences from two distantly related fish species and other vertebrates that have diverged several hundred million years ago. The scarcity of genomic information from other cyprinids provides the impetus to leverage EST collections to understand gene structures in this diverse teleost group. RESULTS: We have generated 6,050 ESTs from the differentiating testis of common carp (Cyprinus carpio) and clustered them with 9,303 non-gonadal ESTs from CarpBase as well as 1,317 ESTs and 652 common carp mRNAs from GenBank. Over 28% of the resulting 8,663 unique transcripts are exclusively testis-derived ESTs. Moreover, 974 of these transcripts did not match any sequence in the zebrafish or fathead minnow EST collection. A total of 1,843 unique common carp sequences could be stringently mapped to the zebrafish genome (version 5), of which 1,752 matched coding sequences of zebrafish genes with or without potential splice variants. We show that 91 common carp transcripts map to intergenic and intronic regions on the zebrafish genome assembly and regions annotated with non-teleost sequences. Interestingly, an additional 42 common carp transcripts indicate the potential presence of new splicing variants not found in zebrafish databases so far. The fact that common carp transcripts help the identification or confirmation of these coding regions in zebrafish exemplifies the usefulness of sequences from closely related species for the annotation of model genomes. We also demonstrate that 5' UTR sequences of common carp and zebrafish orthologs share a significant level of similarity based on preservation of motif arrangements for as many as 10 ab-initio motifs. CONCLUSION: Our data show that there is sufficient homology between the transcribed sequences of common carp and zebrafish to warrant an even deeper cyprinid transcriptome comparison. On the other hand, the comparative analysis illustrates the value in utilizing partially sequenced transcriptomes to understand gene structure in this diverse teleost group. We highlight the need for integrated resources to leverage the wealth of fragmented genomic data. BioMed Central 2006-12-18 /pmc/articles/PMC1764476/ /pubmed/17254304 http://dx.doi.org/10.1186/1471-2105-7-S5-S2 Text en Copyright © 2006 Christoffels et al; licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Christoffels, Alan
Bartfai, Richard
Srinivasan, Hamsa
Komen, Hans
Orban, Laszlo
Comparative genomics in cyprinids: common carp ESTs help the annotation of the zebrafish genome
title Comparative genomics in cyprinids: common carp ESTs help the annotation of the zebrafish genome
title_full Comparative genomics in cyprinids: common carp ESTs help the annotation of the zebrafish genome
title_fullStr Comparative genomics in cyprinids: common carp ESTs help the annotation of the zebrafish genome
title_full_unstemmed Comparative genomics in cyprinids: common carp ESTs help the annotation of the zebrafish genome
title_short Comparative genomics in cyprinids: common carp ESTs help the annotation of the zebrafish genome
title_sort comparative genomics in cyprinids: common carp ests help the annotation of the zebrafish genome
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1764476/
https://www.ncbi.nlm.nih.gov/pubmed/17254304
http://dx.doi.org/10.1186/1471-2105-7-S5-S2
work_keys_str_mv AT christoffelsalan comparativegenomicsincyprinidscommoncarpestshelptheannotationofthezebrafishgenome
AT bartfairichard comparativegenomicsincyprinidscommoncarpestshelptheannotationofthezebrafishgenome
AT srinivasanhamsa comparativegenomicsincyprinidscommoncarpestshelptheannotationofthezebrafishgenome
AT komenhans comparativegenomicsincyprinidscommoncarpestshelptheannotationofthezebrafishgenome
AT orbanlaszlo comparativegenomicsincyprinidscommoncarpestshelptheannotationofthezebrafishgenome