Cargando…
Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster
Novel genes are essential for evolutionary innovations and differ substantially even between closely related species. Recently, multiple studies across many taxa showed that some novel genes arise de novo, that is, from previously noncoding DNA. To characterize the underlying mutations that allowed...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10519401/ https://www.ncbi.nlm.nih.gov/pubmed/37442576 http://dx.doi.org/10.1101/gr.277482.122 |
_version_ | 1785145954393915392 |
---|---|
author | Grandchamp, Anna Kühl, Lucas Lebherz, Marie Brüggemann, Kathrin Parsch, John Bornberg-Bauer, Erich |
author_facet | Grandchamp, Anna Kühl, Lucas Lebherz, Marie Brüggemann, Kathrin Parsch, John Bornberg-Bauer, Erich |
author_sort | Grandchamp, Anna |
collection | PubMed |
description | Novel genes are essential for evolutionary innovations and differ substantially even between closely related species. Recently, multiple studies across many taxa showed that some novel genes arise de novo, that is, from previously noncoding DNA. To characterize the underlying mutations that allowed de novo gene emergence and their order of occurrence, homologous regions must be detected within noncoding sequences in closely related sister genomes. So far, most studies do not detect noncoding homologs of de novo genes because of incomplete assemblies and annotations, and long evolutionary distances separating genomes. Here, we overcome these issues by searching for de novo expressed open reading frames (neORFs), the not-yet fixed precursors of de novo genes that emerged within a single species. We sequenced and assembled genomes with long-read technology and the corresponding transcriptomes from inbred lines of Drosophila melanogaster, derived from seven geographically diverse populations. We found line-specific neORFs in abundance but few neORFs shared by lines, suggesting a rapid turnover. Gain and loss of transcription is more frequent than the creation of ORFs, for example, by forming new start and stop codons. Consequently, the gain of ORFs becomes rate limiting and is frequently the initial step in neORFs emergence. Furthermore, transposable elements (TEs) are major drivers for intragenomic duplications of neORFs, yet TE insertions are less important for the emergence of neORFs. However, highly mutable genomic regions around TEs provide new features that enable gene birth. In conclusion, neORFs have a high birth-death rate, are rapidly purged, but surviving neORFs spread neutrally through populations and within genomes. |
format | Online Article Text |
id | pubmed-10519401 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-105194012023-12-01 Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster Grandchamp, Anna Kühl, Lucas Lebherz, Marie Brüggemann, Kathrin Parsch, John Bornberg-Bauer, Erich Genome Res Research Novel genes are essential for evolutionary innovations and differ substantially even between closely related species. Recently, multiple studies across many taxa showed that some novel genes arise de novo, that is, from previously noncoding DNA. To characterize the underlying mutations that allowed de novo gene emergence and their order of occurrence, homologous regions must be detected within noncoding sequences in closely related sister genomes. So far, most studies do not detect noncoding homologs of de novo genes because of incomplete assemblies and annotations, and long evolutionary distances separating genomes. Here, we overcome these issues by searching for de novo expressed open reading frames (neORFs), the not-yet fixed precursors of de novo genes that emerged within a single species. We sequenced and assembled genomes with long-read technology and the corresponding transcriptomes from inbred lines of Drosophila melanogaster, derived from seven geographically diverse populations. We found line-specific neORFs in abundance but few neORFs shared by lines, suggesting a rapid turnover. Gain and loss of transcription is more frequent than the creation of ORFs, for example, by forming new start and stop codons. Consequently, the gain of ORFs becomes rate limiting and is frequently the initial step in neORFs emergence. Furthermore, transposable elements (TEs) are major drivers for intragenomic duplications of neORFs, yet TE insertions are less important for the emergence of neORFs. However, highly mutable genomic regions around TEs provide new features that enable gene birth. In conclusion, neORFs have a high birth-death rate, are rapidly purged, but surviving neORFs spread neutrally through populations and within genomes. Cold Spring Harbor Laboratory Press 2023-06 /pmc/articles/PMC10519401/ /pubmed/37442576 http://dx.doi.org/10.1101/gr.277482.122 Text en © 2023 Grandchamp et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) . |
spellingShingle | Research Grandchamp, Anna Kühl, Lucas Lebherz, Marie Brüggemann, Kathrin Parsch, John Bornberg-Bauer, Erich Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster |
title | Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster |
title_full | Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster |
title_fullStr | Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster |
title_full_unstemmed | Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster |
title_short | Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster |
title_sort | population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in drosophila melanogaster |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10519401/ https://www.ncbi.nlm.nih.gov/pubmed/37442576 http://dx.doi.org/10.1101/gr.277482.122 |
work_keys_str_mv | AT grandchampanna populationgenomicsrevealsmechanismsanddynamicsofdenovoexpressedopenreadingframeemergenceindrosophilamelanogaster AT kuhllucas populationgenomicsrevealsmechanismsanddynamicsofdenovoexpressedopenreadingframeemergenceindrosophilamelanogaster AT lebherzmarie populationgenomicsrevealsmechanismsanddynamicsofdenovoexpressedopenreadingframeemergenceindrosophilamelanogaster AT bruggemannkathrin populationgenomicsrevealsmechanismsanddynamicsofdenovoexpressedopenreadingframeemergenceindrosophilamelanogaster AT parschjohn populationgenomicsrevealsmechanismsanddynamicsofdenovoexpressedopenreadingframeemergenceindrosophilamelanogaster AT bornbergbauererich populationgenomicsrevealsmechanismsanddynamicsofdenovoexpressedopenreadingframeemergenceindrosophilamelanogaster |