Cargando…
Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution
BACKGROUND: Approximately 1000 protein encoding genes common for vertebrates are still unannotated in avian genomes. Are these genes evolutionary lost or are they not yet found for technical reasons? Using genome landscapes as a tool to visualize large-scale regional effects of genome evolution, we...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449477/ https://www.ncbi.nlm.nih.gov/pubmed/34537008 http://dx.doi.org/10.1186/s12862-021-01905-7 |
_version_ | 1784569427559186432 |
---|---|
author | Huttener, R. Thorrez, L. Veld, T. In’t Granvik, M. Van Lommel, L. Waelkens, E. Derua, R. Lemaire, K. Goyvaerts, L. De Coster, S. Buyse, J. Schuit, F. |
author_facet | Huttener, R. Thorrez, L. Veld, T. In’t Granvik, M. Van Lommel, L. Waelkens, E. Derua, R. Lemaire, K. Goyvaerts, L. De Coster, S. Buyse, J. Schuit, F. |
author_sort | Huttener, R. |
collection | PubMed |
description | BACKGROUND: Approximately 1000 protein encoding genes common for vertebrates are still unannotated in avian genomes. Are these genes evolutionary lost or are they not yet found for technical reasons? Using genome landscapes as a tool to visualize large-scale regional effects of genome evolution, we reexamined this question. RESULTS: On basis of gene annotation in non-avian vertebrate genomes, we established a list of 15,135 common vertebrate genes. Of these, 1026 were not found in any of eight examined bird genomes. Visualizing regional genome effects by our sliding window approach showed that the majority of these "missing" genes can be clustered to 14 regions of the human reference genome. In these clusters, an additional 1517 genes (often gene fragments) were underrepresented in bird genomes. The clusters of “missing” genes coincided with regions of very high GC content, particularly in avian genomes, making them “hidden” because of incomplete sequencing. Moreover, proteins encoded by genes in these sequencing refractory regions showed signs of accelerated protein evolution. As a proof of principle for this idea we experimentally characterized the mRNA and protein products of four "hidden" bird genes that are crucial for energy homeostasis in skeletal muscle: ALDOA, ENO3, PYGM and SLC2A4. CONCLUSIONS: A least part of the “missing” genes in bird genomes can be attributed to an artifact caused by the difficulty to sequence regions with extreme GC% (“hidden” genes). Biologically, these “hidden” genes are of interest as they encode proteins that evolve more rapidly than the genome wide average. Finally we show that four of these “hidden” genes encode key proteins for energy metabolism in flight muscle. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12862-021-01905-7. |
format | Online Article Text |
id | pubmed-8449477 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-84494772021-09-20 Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution Huttener, R. Thorrez, L. Veld, T. In’t Granvik, M. Van Lommel, L. Waelkens, E. Derua, R. Lemaire, K. Goyvaerts, L. De Coster, S. Buyse, J. Schuit, F. BMC Ecol Evol Research Article BACKGROUND: Approximately 1000 protein encoding genes common for vertebrates are still unannotated in avian genomes. Are these genes evolutionary lost or are they not yet found for technical reasons? Using genome landscapes as a tool to visualize large-scale regional effects of genome evolution, we reexamined this question. RESULTS: On basis of gene annotation in non-avian vertebrate genomes, we established a list of 15,135 common vertebrate genes. Of these, 1026 were not found in any of eight examined bird genomes. Visualizing regional genome effects by our sliding window approach showed that the majority of these "missing" genes can be clustered to 14 regions of the human reference genome. In these clusters, an additional 1517 genes (often gene fragments) were underrepresented in bird genomes. The clusters of “missing” genes coincided with regions of very high GC content, particularly in avian genomes, making them “hidden” because of incomplete sequencing. Moreover, proteins encoded by genes in these sequencing refractory regions showed signs of accelerated protein evolution. As a proof of principle for this idea we experimentally characterized the mRNA and protein products of four "hidden" bird genes that are crucial for energy homeostasis in skeletal muscle: ALDOA, ENO3, PYGM and SLC2A4. CONCLUSIONS: A least part of the “missing” genes in bird genomes can be attributed to an artifact caused by the difficulty to sequence regions with extreme GC% (“hidden” genes). Biologically, these “hidden” genes are of interest as they encode proteins that evolve more rapidly than the genome wide average. Finally we show that four of these “hidden” genes encode key proteins for energy metabolism in flight muscle. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12862-021-01905-7. BioMed Central 2021-09-18 /pmc/articles/PMC8449477/ /pubmed/34537008 http://dx.doi.org/10.1186/s12862-021-01905-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Huttener, R. Thorrez, L. Veld, T. In’t Granvik, M. Van Lommel, L. Waelkens, E. Derua, R. Lemaire, K. Goyvaerts, L. De Coster, S. Buyse, J. Schuit, F. Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution |
title | Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution |
title_full | Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution |
title_fullStr | Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution |
title_full_unstemmed | Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution |
title_short | Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution |
title_sort | sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449477/ https://www.ncbi.nlm.nih.gov/pubmed/34537008 http://dx.doi.org/10.1186/s12862-021-01905-7 |
work_keys_str_mv | AT huttenerr sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution AT thorrezl sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution AT veldtint sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution AT granvikm sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution AT vanlommell sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution AT waelkense sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution AT deruar sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution AT lemairek sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution AT goyvaertsl sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution AT decosters sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution AT buysej sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution AT schuitf sequencingrefractoryregionsinbirdgenomesarehotspotsforacceleratedproteinevolution |