Cargando…

New Genomic Signals Underlying the Emergence of Human Proto-Genes

De novo genes are novel genes which emerge from non-coding DNA. Until now, little is known about de novo genes’ properties, correlated to their age and mechanisms of emergence. In this study, we investigate four related properties: introns, upstream regulatory motifs, 5′ Untranslated regions (UTRs)...

Descripción completa

Detalles Bibliográficos
Autores principales: Grandchamp, Anna, Berk, Katrin, Dohmen, Elias, Bornberg-Bauer, Erich
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8871994/
https://www.ncbi.nlm.nih.gov/pubmed/35205330
http://dx.doi.org/10.3390/genes13020284
_version_ 1784657129016131584
author Grandchamp, Anna
Berk, Katrin
Dohmen, Elias
Bornberg-Bauer, Erich
author_facet Grandchamp, Anna
Berk, Katrin
Dohmen, Elias
Bornberg-Bauer, Erich
author_sort Grandchamp, Anna
collection PubMed
description De novo genes are novel genes which emerge from non-coding DNA. Until now, little is known about de novo genes’ properties, correlated to their age and mechanisms of emergence. In this study, we investigate four related properties: introns, upstream regulatory motifs, 5′ Untranslated regions (UTRs) and protein domains, in 23,135 human proto-genes. We found that proto-genes contain introns, whose number and position correlates with the genomic position of proto-gene emergence. The origin of these introns is debated, as our results suggest that 41% of proto-genes might have captured existing introns, and 13.7% of them do not splice the ORF. We show that proto-genes which emerged via overprinting tend to be more enriched in core promotor motifs, while intergenic and intronic genes are more enriched in enhancers, even if the TATA motif is most commonly found upstream in these genes. Intergenic and intronic 5′ UTRs of proto-genes have a lower potential to stabilise mRNA structures than exonic proto-genes and established human genes. Finally, we confirm that proteins expressed by proto-genes gain new putative domains with age. Overall, we find that regulatory motifs inducing transcription and translation of previously non-coding sequences may facilitate proto-gene emergence. Our study demonstrates that introns, 5′ UTRs, and domains have specific properties in proto-genes. We also emphasize that the genomic positions of de novo genes strongly impacts these properties.
format Online
Article
Text
id pubmed-8871994
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88719942022-02-25 New Genomic Signals Underlying the Emergence of Human Proto-Genes Grandchamp, Anna Berk, Katrin Dohmen, Elias Bornberg-Bauer, Erich Genes (Basel) Article De novo genes are novel genes which emerge from non-coding DNA. Until now, little is known about de novo genes’ properties, correlated to their age and mechanisms of emergence. In this study, we investigate four related properties: introns, upstream regulatory motifs, 5′ Untranslated regions (UTRs) and protein domains, in 23,135 human proto-genes. We found that proto-genes contain introns, whose number and position correlates with the genomic position of proto-gene emergence. The origin of these introns is debated, as our results suggest that 41% of proto-genes might have captured existing introns, and 13.7% of them do not splice the ORF. We show that proto-genes which emerged via overprinting tend to be more enriched in core promotor motifs, while intergenic and intronic genes are more enriched in enhancers, even if the TATA motif is most commonly found upstream in these genes. Intergenic and intronic 5′ UTRs of proto-genes have a lower potential to stabilise mRNA structures than exonic proto-genes and established human genes. Finally, we confirm that proteins expressed by proto-genes gain new putative domains with age. Overall, we find that regulatory motifs inducing transcription and translation of previously non-coding sequences may facilitate proto-gene emergence. Our study demonstrates that introns, 5′ UTRs, and domains have specific properties in proto-genes. We also emphasize that the genomic positions of de novo genes strongly impacts these properties. MDPI 2022-01-31 /pmc/articles/PMC8871994/ /pubmed/35205330 http://dx.doi.org/10.3390/genes13020284 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Grandchamp, Anna
Berk, Katrin
Dohmen, Elias
Bornberg-Bauer, Erich
New Genomic Signals Underlying the Emergence of Human Proto-Genes
title New Genomic Signals Underlying the Emergence of Human Proto-Genes
title_full New Genomic Signals Underlying the Emergence of Human Proto-Genes
title_fullStr New Genomic Signals Underlying the Emergence of Human Proto-Genes
title_full_unstemmed New Genomic Signals Underlying the Emergence of Human Proto-Genes
title_short New Genomic Signals Underlying the Emergence of Human Proto-Genes
title_sort new genomic signals underlying the emergence of human proto-genes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8871994/
https://www.ncbi.nlm.nih.gov/pubmed/35205330
http://dx.doi.org/10.3390/genes13020284
work_keys_str_mv AT grandchampanna newgenomicsignalsunderlyingtheemergenceofhumanprotogenes
AT berkkatrin newgenomicsignalsunderlyingtheemergenceofhumanprotogenes
AT dohmenelias newgenomicsignalsunderlyingtheemergenceofhumanprotogenes
AT bornbergbauererich newgenomicsignalsunderlyingtheemergenceofhumanprotogenes