New Genomic Signals Underlying the Emergence of Human Proto-Genes
De novo genes are novel genes which emerge from non-coding DNA. Until now, little has been known about de novo genes’ properties in relation to their age and mechanisms of emergence. In this study, we investigate four related properties: introns, upstream regulatory motifs, 5′ untranslated regions (UTRs)...
Main Authors: | Grandchamp, Anna; Berk, Katrin; Dohmen, Elias; Bornberg-Bauer, Erich |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2022 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8871994/ https://www.ncbi.nlm.nih.gov/pubmed/35205330 http://dx.doi.org/10.3390/genes13020284 |
_version_ | 1784657129016131584 |
---|---|
author | Grandchamp, Anna; Berk, Katrin; Dohmen, Elias; Bornberg-Bauer, Erich |
author_facet | Grandchamp, Anna; Berk, Katrin; Dohmen, Elias; Bornberg-Bauer, Erich |
author_sort | Grandchamp, Anna |
collection | PubMed |
description | De novo genes are novel genes which emerge from non-coding DNA. Until now, little has been known about de novo genes’ properties in relation to their age and mechanisms of emergence. In this study, we investigate four related properties: introns, upstream regulatory motifs, 5′ untranslated regions (UTRs) and protein domains, in 23,135 human proto-genes. We found that proto-genes contain introns, whose number and position correlate with the genomic position of proto-gene emergence. The origin of these introns is debated, as our results suggest that 41% of proto-genes might have captured existing introns, and 13.7% of them do not splice the ORF. We show that proto-genes which emerged via overprinting tend to be more enriched in core promoter motifs, while intergenic and intronic genes are more enriched in enhancers, even if the TATA motif is most commonly found upstream in these genes. Intergenic and intronic 5′ UTRs of proto-genes have a lower potential to stabilise mRNA structures than exonic proto-genes and established human genes. Finally, we confirm that proteins expressed by proto-genes gain new putative domains with age. Overall, we find that regulatory motifs inducing transcription and translation of previously non-coding sequences may facilitate proto-gene emergence. Our study demonstrates that introns, 5′ UTRs, and domains have specific properties in proto-genes. We also emphasize that the genomic positions of de novo genes strongly impact these properties. |
format | Online Article Text |
id | pubmed-8871994 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-8871994 2022-02-25 New Genomic Signals Underlying the Emergence of Human Proto-Genes Grandchamp, Anna; Berk, Katrin; Dohmen, Elias; Bornberg-Bauer, Erich Genes (Basel) Article De novo genes are novel genes which emerge from non-coding DNA. Until now, little has been known about de novo genes’ properties in relation to their age and mechanisms of emergence. In this study, we investigate four related properties: introns, upstream regulatory motifs, 5′ untranslated regions (UTRs) and protein domains, in 23,135 human proto-genes. We found that proto-genes contain introns, whose number and position correlate with the genomic position of proto-gene emergence. The origin of these introns is debated, as our results suggest that 41% of proto-genes might have captured existing introns, and 13.7% of them do not splice the ORF. We show that proto-genes which emerged via overprinting tend to be more enriched in core promoter motifs, while intergenic and intronic genes are more enriched in enhancers, even if the TATA motif is most commonly found upstream in these genes. Intergenic and intronic 5′ UTRs of proto-genes have a lower potential to stabilise mRNA structures than exonic proto-genes and established human genes. Finally, we confirm that proteins expressed by proto-genes gain new putative domains with age. Overall, we find that regulatory motifs inducing transcription and translation of previously non-coding sequences may facilitate proto-gene emergence. Our study demonstrates that introns, 5′ UTRs, and domains have specific properties in proto-genes. We also emphasize that the genomic positions of de novo genes strongly impact these properties. MDPI 2022-01-31 /pmc/articles/PMC8871994/ /pubmed/35205330 http://dx.doi.org/10.3390/genes13020284 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Grandchamp, Anna Berk, Katrin Dohmen, Elias Bornberg-Bauer, Erich New Genomic Signals Underlying the Emergence of Human Proto-Genes |
title | New Genomic Signals Underlying the Emergence of Human Proto-Genes |
title_full | New Genomic Signals Underlying the Emergence of Human Proto-Genes |
title_fullStr | New Genomic Signals Underlying the Emergence of Human Proto-Genes |
title_full_unstemmed | New Genomic Signals Underlying the Emergence of Human Proto-Genes |
title_short | New Genomic Signals Underlying the Emergence of Human Proto-Genes |
title_sort | new genomic signals underlying the emergence of human proto-genes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8871994/ https://www.ncbi.nlm.nih.gov/pubmed/35205330 http://dx.doi.org/10.3390/genes13020284 |
work_keys_str_mv | AT grandchampanna newgenomicsignalsunderlyingtheemergenceofhumanprotogenes AT berkkatrin newgenomicsignalsunderlyingtheemergenceofhumanprotogenes AT dohmenelias newgenomicsignalsunderlyingtheemergenceofhumanprotogenes AT bornbergbauererich newgenomicsignalsunderlyingtheemergenceofhumanprotogenes |