Cargando…

Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster

Novel genes are essential for evolutionary innovations and differ substantially even between closely related species. Recently, multiple studies across many taxa showed that some novel genes arise de novo, that is, from previously noncoding DNA. To characterize the underlying mutations that allowed...

Descripción completa

Detalles Bibliográficos
Autores principales: Grandchamp, Anna, Kühl, Lucas, Lebherz, Marie, Brüggemann, Kathrin, Parsch, John, Bornberg-Bauer, Erich
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10519401/
https://www.ncbi.nlm.nih.gov/pubmed/37442576
http://dx.doi.org/10.1101/gr.277482.122
_version_ 1785145954393915392
author Grandchamp, Anna
Kühl, Lucas
Lebherz, Marie
Brüggemann, Kathrin
Parsch, John
Bornberg-Bauer, Erich
author_facet Grandchamp, Anna
Kühl, Lucas
Lebherz, Marie
Brüggemann, Kathrin
Parsch, John
Bornberg-Bauer, Erich
author_sort Grandchamp, Anna
collection PubMed
description Novel genes are essential for evolutionary innovations and differ substantially even between closely related species. Recently, multiple studies across many taxa showed that some novel genes arise de novo, that is, from previously noncoding DNA. To characterize the underlying mutations that allowed de novo gene emergence and their order of occurrence, homologous regions must be detected within noncoding sequences in closely related sister genomes. So far, most studies do not detect noncoding homologs of de novo genes because of incomplete assemblies and annotations, and long evolutionary distances separating genomes. Here, we overcome these issues by searching for de novo expressed open reading frames (neORFs), the not-yet fixed precursors of de novo genes that emerged within a single species. We sequenced and assembled genomes with long-read technology and the corresponding transcriptomes from inbred lines of Drosophila melanogaster, derived from seven geographically diverse populations. We found line-specific neORFs in abundance but few neORFs shared by lines, suggesting a rapid turnover. Gain and loss of transcription is more frequent than the creation of ORFs, for example, by forming new start and stop codons. Consequently, the gain of ORFs becomes rate limiting and is frequently the initial step in neORFs emergence. Furthermore, transposable elements (TEs) are major drivers for intragenomic duplications of neORFs, yet TE insertions are less important for the emergence of neORFs. However, highly mutable genomic regions around TEs provide new features that enable gene birth. In conclusion, neORFs have a high birth-death rate, are rapidly purged, but surviving neORFs spread neutrally through populations and within genomes.
format Online
Article
Text
id pubmed-10519401
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-105194012023-12-01 Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster Grandchamp, Anna Kühl, Lucas Lebherz, Marie Brüggemann, Kathrin Parsch, John Bornberg-Bauer, Erich Genome Res Research Novel genes are essential for evolutionary innovations and differ substantially even between closely related species. Recently, multiple studies across many taxa showed that some novel genes arise de novo, that is, from previously noncoding DNA. To characterize the underlying mutations that allowed de novo gene emergence and their order of occurrence, homologous regions must be detected within noncoding sequences in closely related sister genomes. So far, most studies do not detect noncoding homologs of de novo genes because of incomplete assemblies and annotations, and long evolutionary distances separating genomes. Here, we overcome these issues by searching for de novo expressed open reading frames (neORFs), the not-yet fixed precursors of de novo genes that emerged within a single species. We sequenced and assembled genomes with long-read technology and the corresponding transcriptomes from inbred lines of Drosophila melanogaster, derived from seven geographically diverse populations. We found line-specific neORFs in abundance but few neORFs shared by lines, suggesting a rapid turnover. Gain and loss of transcription is more frequent than the creation of ORFs, for example, by forming new start and stop codons. Consequently, the gain of ORFs becomes rate limiting and is frequently the initial step in neORFs emergence. Furthermore, transposable elements (TEs) are major drivers for intragenomic duplications of neORFs, yet TE insertions are less important for the emergence of neORFs. However, highly mutable genomic regions around TEs provide new features that enable gene birth. In conclusion, neORFs have a high birth-death rate, are rapidly purged, but surviving neORFs spread neutrally through populations and within genomes. Cold Spring Harbor Laboratory Press 2023-06 /pmc/articles/PMC10519401/ /pubmed/37442576 http://dx.doi.org/10.1101/gr.277482.122 Text en © 2023 Grandchamp et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Research
Grandchamp, Anna
Kühl, Lucas
Lebherz, Marie
Brüggemann, Kathrin
Parsch, John
Bornberg-Bauer, Erich
Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster
title Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster
title_full Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster
title_fullStr Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster
title_full_unstemmed Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster
title_short Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster
title_sort population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in drosophila melanogaster
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10519401/
https://www.ncbi.nlm.nih.gov/pubmed/37442576
http://dx.doi.org/10.1101/gr.277482.122
work_keys_str_mv AT grandchampanna populationgenomicsrevealsmechanismsanddynamicsofdenovoexpressedopenreadingframeemergenceindrosophilamelanogaster
AT kuhllucas populationgenomicsrevealsmechanismsanddynamicsofdenovoexpressedopenreadingframeemergenceindrosophilamelanogaster
AT lebherzmarie populationgenomicsrevealsmechanismsanddynamicsofdenovoexpressedopenreadingframeemergenceindrosophilamelanogaster
AT bruggemannkathrin populationgenomicsrevealsmechanismsanddynamicsofdenovoexpressedopenreadingframeemergenceindrosophilamelanogaster
AT parschjohn populationgenomicsrevealsmechanismsanddynamicsofdenovoexpressedopenreadingframeemergenceindrosophilamelanogaster
AT bornbergbauererich populationgenomicsrevealsmechanismsanddynamicsofdenovoexpressedopenreadingframeemergenceindrosophilamelanogaster