Cargando…

Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome

BACKGROUND: It is widely accepted that comparative sequence data can aid the functional annotation of genome sequences; however, the most informative species and features of genome evolution for comparison remain to be determined. RESULTS: We analyzed conservation in eight genomic regions (apterous,...

Descripción completa

Detalles Bibliográficos
Autores principales: Bergman, Casey M, Pfeiffer, Barret D, Rincón-Limas, Diego E, Hoskins, Roger A, Gnirke, Andreas, Mungall, Chris J, Wang, Adrienne M, Kronmiller, Brent, Pacleb, Joanne, Park, Soo, Stapleton, Mark, Wan, Kenneth, George, Reed A, de Jong, Pieter J, Botas, Juan, Rubin, Gerald M, Celniker, Susan E
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2002
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC151188/
https://www.ncbi.nlm.nih.gov/pubmed/12537575
http://dx.doi.org/10.1186/gb-2002-3-12-research0086
_version_ 1782120664446009344
author Bergman, Casey M
Pfeiffer, Barret D
Rincón-Limas, Diego E
Hoskins, Roger A
Gnirke, Andreas
Mungall, Chris J
Wang, Adrienne M
Kronmiller, Brent
Pacleb, Joanne
Park, Soo
Stapleton, Mark
Wan, Kenneth
George, Reed A
de Jong, Pieter J
Botas, Juan
Rubin, Gerald M
Celniker, Susan E
author_facet Bergman, Casey M
Pfeiffer, Barret D
Rincón-Limas, Diego E
Hoskins, Roger A
Gnirke, Andreas
Mungall, Chris J
Wang, Adrienne M
Kronmiller, Brent
Pacleb, Joanne
Park, Soo
Stapleton, Mark
Wan, Kenneth
George, Reed A
de Jong, Pieter J
Botas, Juan
Rubin, Gerald M
Celniker, Susan E
author_sort Bergman, Casey M
collection PubMed
description BACKGROUND: It is widely accepted that comparative sequence data can aid the functional annotation of genome sequences; however, the most informative species and features of genome evolution for comparison remain to be determined. RESULTS: We analyzed conservation in eight genomic regions (apterous, even-skipped, fushi tarazu, twist, and Rhodopsins 1, 2, 3 and 4) from four Drosophila species (D. erecta, D. pseudoobscura, D. willistoni, and D. littoralis) covering more than 500 kb of the D. melanogaster genome. All D. melanogaster genes (and 78-82% of coding exons) identified in divergent species such as D. pseudoobscura show evidence of functional constraint. Addition of a third species can reveal functional constraint in otherwise non-significant pairwise exon comparisons. Microsynteny is largely conserved, with rearrangement breakpoints, novel transposable element insertions, and gene transpositions occurring in similar numbers. Rates of amino-acid substitution are higher in uncharacterized genes relative to genes that have previously been studied. Conserved non-coding sequences (CNCSs) tend to be spatially clustered with conserved spacing between CNCSs, and clusters of CNCSs can be used to predict enhancer sequences. CONCLUSIONS: Our results provide the basis for choosing species whose genome sequences would be most useful in aiding the functional annotation of coding and cis-regulatory sequences in Drosophila. Furthermore, this work shows how decoding the spatial organization of conserved sequences, such as the clustering of CNCSs, can complement efforts to annotate eukaryotic genomes on the basis of sequence conservation alone.
format Text
id pubmed-151188
institution National Center for Biotechnology Information
language English
publishDate 2002
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1511882003-03-13 Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome Bergman, Casey M Pfeiffer, Barret D Rincón-Limas, Diego E Hoskins, Roger A Gnirke, Andreas Mungall, Chris J Wang, Adrienne M Kronmiller, Brent Pacleb, Joanne Park, Soo Stapleton, Mark Wan, Kenneth George, Reed A de Jong, Pieter J Botas, Juan Rubin, Gerald M Celniker, Susan E Genome Biol Research BACKGROUND: It is widely accepted that comparative sequence data can aid the functional annotation of genome sequences; however, the most informative species and features of genome evolution for comparison remain to be determined. RESULTS: We analyzed conservation in eight genomic regions (apterous, even-skipped, fushi tarazu, twist, and Rhodopsins 1, 2, 3 and 4) from four Drosophila species (D. erecta, D. pseudoobscura, D. willistoni, and D. littoralis) covering more than 500 kb of the D. melanogaster genome. All D. melanogaster genes (and 78-82% of coding exons) identified in divergent species such as D. pseudoobscura show evidence of functional constraint. Addition of a third species can reveal functional constraint in otherwise non-significant pairwise exon comparisons. Microsynteny is largely conserved, with rearrangement breakpoints, novel transposable element insertions, and gene transpositions occurring in similar numbers. Rates of amino-acid substitution are higher in uncharacterized genes relative to genes that have previously been studied. Conserved non-coding sequences (CNCSs) tend to be spatially clustered with conserved spacing between CNCSs, and clusters of CNCSs can be used to predict enhancer sequences. CONCLUSIONS: Our results provide the basis for choosing species whose genome sequences would be most useful in aiding the functional annotation of coding and cis-regulatory sequences in Drosophila. Furthermore, this work shows how decoding the spatial organization of conserved sequences, such as the clustering of CNCSs, can complement efforts to annotate eukaryotic genomes on the basis of sequence conservation alone. BioMed Central 2002 2002-12-30 /pmc/articles/PMC151188/ /pubmed/12537575 http://dx.doi.org/10.1186/gb-2002-3-12-research0086 Text en Copyright © 2002 Bergman et al., licensee BioMed Central Ltd
spellingShingle Research
Bergman, Casey M
Pfeiffer, Barret D
Rincón-Limas, Diego E
Hoskins, Roger A
Gnirke, Andreas
Mungall, Chris J
Wang, Adrienne M
Kronmiller, Brent
Pacleb, Joanne
Park, Soo
Stapleton, Mark
Wan, Kenneth
George, Reed A
de Jong, Pieter J
Botas, Juan
Rubin, Gerald M
Celniker, Susan E
Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome
title Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome
title_full Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome
title_fullStr Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome
title_full_unstemmed Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome
title_short Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome
title_sort assessing the impact of comparative genomic sequence data on the functional annotation of the drosophila genome
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC151188/
https://www.ncbi.nlm.nih.gov/pubmed/12537575
http://dx.doi.org/10.1186/gb-2002-3-12-research0086
work_keys_str_mv AT bergmancaseym assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT pfeifferbarretd assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT rinconlimasdiegoe assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT hoskinsrogera assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT gnirkeandreas assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT mungallchrisj assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT wangadriennem assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT kronmillerbrent assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT paclebjoanne assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT parksoo assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT stapletonmark assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT wankenneth assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT georgereeda assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT dejongpieterj assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT botasjuan assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT rubingeraldm assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome
AT celnikersusane assessingtheimpactofcomparativegenomicsequencedataonthefunctionalannotationofthedrosophilagenome