Cargando…

De novo emergence, existence, and demise of a protein-coding gene in murids

BACKGROUND: Genes, principal units of genetic information, vary in complexity and evolutionary history. Less-complex genes (e.g., long non-coding RNA (lncRNA) expressing genes) readily emerge de novo from non-genic sequences and have high evolutionary turnover. Genesis of a gene may be facilitated b...

Descripción completa

Detalles Bibliográficos
Autores principales: Petrzilek, Jan, Pasulka, Josef, Malik, Radek, Horvat, Filip, Kataruka, Shubhangini, Fulka, Helena, Svoboda, Petr
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9733328/
https://www.ncbi.nlm.nih.gov/pubmed/36482406
http://dx.doi.org/10.1186/s12915-022-01470-5
_version_ 1784846348395216896
author Petrzilek, Jan
Pasulka, Josef
Malik, Radek
Horvat, Filip
Kataruka, Shubhangini
Fulka, Helena
Svoboda, Petr
author_facet Petrzilek, Jan
Pasulka, Josef
Malik, Radek
Horvat, Filip
Kataruka, Shubhangini
Fulka, Helena
Svoboda, Petr
author_sort Petrzilek, Jan
collection PubMed
description BACKGROUND: Genes, principal units of genetic information, vary in complexity and evolutionary history. Less-complex genes (e.g., long non-coding RNA (lncRNA) expressing genes) readily emerge de novo from non-genic sequences and have high evolutionary turnover. Genesis of a gene may be facilitated by adoption of functional genic sequences from retrotransposon insertions. However, protein-coding sequences in extant genomes rarely lack any connection to an ancestral protein-coding sequence. RESULTS: We describe remarkable evolution of the murine gene D6Ertd527e and its orthologs in the rodent Muroidea superfamily. The D6Ertd527e emerged in a common ancestor of mice and hamsters most likely as a lncRNA-expressing gene. A major contributing factor was a long terminal repeat (LTR) retrotransposon insertion carrying an oocyte-specific promoter and a 5′ terminal exon of the gene. The gene survived as an oocyte-specific lncRNA in several extant rodents while in some others the gene or its expression were lost. In the ancestral lineage of Mus musculus, the gene acquired protein-coding capacity where the bulk of the coding sequence formed through CAG (AGC) trinucleotide repeat expansion and duplications. These events generated a cytoplasmic serine-rich maternal protein. Knock-out of D6Ertd527e in mice has a small but detectable effect on fertility and the maternal transcriptome. CONCLUSIONS: While this evolving gene is not showing a clear function in laboratory mice, its documented evolutionary history in Muroidea during the last ~ 40 million years provides a textbook example of how a several common mutation events can support de novo gene formation, evolution of protein-coding capacity, as well as gene’s demise. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-022-01470-5.
format Online
Article
Text
id pubmed-9733328
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-97333282022-12-10 De novo emergence, existence, and demise of a protein-coding gene in murids Petrzilek, Jan Pasulka, Josef Malik, Radek Horvat, Filip Kataruka, Shubhangini Fulka, Helena Svoboda, Petr BMC Biol Research Article BACKGROUND: Genes, principal units of genetic information, vary in complexity and evolutionary history. Less-complex genes (e.g., long non-coding RNA (lncRNA) expressing genes) readily emerge de novo from non-genic sequences and have high evolutionary turnover. Genesis of a gene may be facilitated by adoption of functional genic sequences from retrotransposon insertions. However, protein-coding sequences in extant genomes rarely lack any connection to an ancestral protein-coding sequence. RESULTS: We describe remarkable evolution of the murine gene D6Ertd527e and its orthologs in the rodent Muroidea superfamily. The D6Ertd527e emerged in a common ancestor of mice and hamsters most likely as a lncRNA-expressing gene. A major contributing factor was a long terminal repeat (LTR) retrotransposon insertion carrying an oocyte-specific promoter and a 5′ terminal exon of the gene. The gene survived as an oocyte-specific lncRNA in several extant rodents while in some others the gene or its expression were lost. In the ancestral lineage of Mus musculus, the gene acquired protein-coding capacity where the bulk of the coding sequence formed through CAG (AGC) trinucleotide repeat expansion and duplications. These events generated a cytoplasmic serine-rich maternal protein. Knock-out of D6Ertd527e in mice has a small but detectable effect on fertility and the maternal transcriptome. CONCLUSIONS: While this evolving gene is not showing a clear function in laboratory mice, its documented evolutionary history in Muroidea during the last ~ 40 million years provides a textbook example of how a several common mutation events can support de novo gene formation, evolution of protein-coding capacity, as well as gene’s demise. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-022-01470-5. BioMed Central 2022-12-08 /pmc/articles/PMC9733328/ /pubmed/36482406 http://dx.doi.org/10.1186/s12915-022-01470-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Petrzilek, Jan
Pasulka, Josef
Malik, Radek
Horvat, Filip
Kataruka, Shubhangini
Fulka, Helena
Svoboda, Petr
De novo emergence, existence, and demise of a protein-coding gene in murids
title De novo emergence, existence, and demise of a protein-coding gene in murids
title_full De novo emergence, existence, and demise of a protein-coding gene in murids
title_fullStr De novo emergence, existence, and demise of a protein-coding gene in murids
title_full_unstemmed De novo emergence, existence, and demise of a protein-coding gene in murids
title_short De novo emergence, existence, and demise of a protein-coding gene in murids
title_sort de novo emergence, existence, and demise of a protein-coding gene in murids
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9733328/
https://www.ncbi.nlm.nih.gov/pubmed/36482406
http://dx.doi.org/10.1186/s12915-022-01470-5
work_keys_str_mv AT petrzilekjan denovoemergenceexistenceanddemiseofaproteincodinggeneinmurids
AT pasulkajosef denovoemergenceexistenceanddemiseofaproteincodinggeneinmurids
AT malikradek denovoemergenceexistenceanddemiseofaproteincodinggeneinmurids
AT horvatfilip denovoemergenceexistenceanddemiseofaproteincodinggeneinmurids
AT katarukashubhangini denovoemergenceexistenceanddemiseofaproteincodinggeneinmurids
AT fulkahelena denovoemergenceexistenceanddemiseofaproteincodinggeneinmurids
AT svobodapetr denovoemergenceexistenceanddemiseofaproteincodinggeneinmurids