Cargando…

Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution

BACKGROUND: Approximately 1000 protein encoding genes common for vertebrates are still unannotated in avian genomes. Are these genes evolutionary lost or are they not yet found for technical reasons? Using genome landscapes as a tool to visualize large-scale regional effects of genome evolution, we...

Descripción completa

Detalles Bibliográficos
Autores principales: Huttener, R., Thorrez, L., Veld, T. In’t, Granvik, M., Van Lommel, L., Waelkens, E., Derua, R., Lemaire, K., Goyvaerts, L., De Coster, S., Buyse, J., Schuit, F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449477/
https://www.ncbi.nlm.nih.gov/pubmed/34537008
http://dx.doi.org/10.1186/s12862-021-01905-7
_version_ 1784569427559186432
author Huttener, R.
Thorrez, L.
Veld, T. In’t
Granvik, M.
Van Lommel, L.
Waelkens, E.
Derua, R.
Lemaire, K.
Goyvaerts, L.
De Coster, S.
Buyse, J.
Schuit, F.
author_facet Huttener, R.
Thorrez, L.
Veld, T. In’t
Granvik, M.
Van Lommel, L.
Waelkens, E.
Derua, R.
Lemaire, K.
Goyvaerts, L.
De Coster, S.
Buyse, J.
Schuit, F.
author_sort Huttener, R.
collection PubMed
description BACKGROUND: Approximately 1000 protein encoding genes common for vertebrates are still unannotated in avian genomes. Are these genes evolutionary lost or are they not yet found for technical reasons? Using genome landscapes as a tool to visualize large-scale regional effects of genome evolution, we reexamined this question. RESULTS: On basis of gene annotation in non-avian vertebrate genomes, we established a list of 15,135 common vertebrate genes. Of these, 1026 were not found in any of eight examined bird genomes. Visualizing regional genome effects by our sliding window approach showed that the majority of these "missing" genes can be clustered to 14 regions of the human reference genome. In these clusters, an additional 1517 genes (often gene fragments) were underrepresented in bird genomes. The clusters of “missing” genes coincided with regions of very high GC content, particularly in avian genomes, making them “hidden” because of incomplete sequencing. Moreover, proteins encoded by genes in these sequencing refractory regions showed signs of accelerated protein evolution. As a proof of principle for this idea we experimentally characterized the mRNA and protein products of four "hidden" bird genes that are crucial for energy homeostasis in skeletal muscle: ALDOA, ENO3, PYGM and SLC2A4. CONCLUSIONS: A least part of the “missing” genes in bird genomes can be attributed to an artifact caused by the difficulty to sequence regions with extreme GC% (“hidden” genes). Biologically, these “hidden” genes are of interest as they encode proteins that evolve more rapidly than the genome wide average. Finally we show that four of these “hidden” genes encode key proteins for energy metabolism in flight muscle. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12862-021-01905-7.
format Online
Article
Text
id pubmed-8449477
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-84494772021-09-20 Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution Huttener, R. Thorrez, L. Veld, T. In’t Granvik, M. Van Lommel, L. Waelkens, E. Derua, R. Lemaire, K. Goyvaerts, L. De Coster, S. Buyse, J. Schuit, F. BMC Ecol Evol Research Article BACKGROUND: Approximately 1000 protein encoding genes common for vertebrates are still unannotated in avian genomes. Are these genes evolutionary lost or are they not yet found for technical reasons? Using genome landscapes as a tool to visualize large-scale regional effects of genome evolution, we reexamined this question. RESULTS: On basis of gene annotation in non-avian vertebrate genomes, we established a list of 15,135 common vertebrate genes. Of these, 1026 were not found in any of eight examined bird genomes. Visualizing regional genome effects by our sliding window approach showed that the majority of these "missing" genes can be clustered to 14 regions of the human reference genome. In these clusters, an additional 1517 genes (often gene fragments) were underrepresented in bird genomes. The clusters of “missing” genes coincided with regions of very high GC content, particularly in avian genomes, making them “hidden” because of incomplete sequencing. Moreover, proteins encoded by genes in these sequencing refractory regions showed signs of accelerated protein evolution. As a proof of principle for this idea we experimentally characterized the mRNA and protein products of four "hidden" bird genes that are crucial for energy homeostasis in skeletal muscle: ALDOA, ENO3, PYGM and SLC2A4. CONCLUSIONS: A least part of the “missing” genes in bird genomes can be attributed to an artifact caused by the difficulty to sequence regions with extreme GC% (“hidden” genes). Biologically, these “hidden” genes are of interest as they encode proteins that evolve more rapidly than the genome wide average. Finally we show that four of these “hidden” genes encode key proteins for energy metabolism in flight muscle. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12862-021-01905-7. BioMed Central 2021-09-18 /pmc/articles/PMC8449477/ /pubmed/34537008 http://dx.doi.org/10.1186/s12862-021-01905-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Huttener, R.
Thorrez, L.
Veld, T. In’t
Granvik, M.
Van Lommel, L.
Waelkens, E.
Derua, R.
Lemaire, K.
Goyvaerts, L.
De Coster, S.
Buyse, J.
Schuit, F.
Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution
title Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution
title_full Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution
title_fullStr Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution
title_full_unstemmed Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution
title_short Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution
title_sort sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449477/
https://www.ncbi.nlm.nih.gov/pubmed/34537008
http://dx.doi.org/10.1186/s12862-021-01905-7
work_keys_str_mv AT huttenerr sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution
AT thorrezl sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution
AT veldtint sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution
AT granvikm sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution
AT vanlommell sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution
AT waelkense sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution
AT deruar sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution
AT lemairek sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution
AT goyvaertsl sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution
AT decosters sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution
AT buysej sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution
AT schuitf sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution