Cargando…

De Novo Origin of Protein-Coding Genes in Murine Rodents

BACKGROUND: New genes in eukaryotes are created through a variety of different mechanisms. De novo origin from non-coding DNA is a mechanism that has recently gained attention. So far, de novo genes have been described in a handful of organisms, with Drosophila being the most extensively studied. We...

Descripción completa

Detalles Bibliográficos
Autores principales: Murphy, Daniel N., McLysaght, Aoife
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3504067/
https://www.ncbi.nlm.nih.gov/pubmed/23185269
http://dx.doi.org/10.1371/journal.pone.0048650
_version_ 1782250566570737664
author Murphy, Daniel N.
McLysaght, Aoife
author_facet Murphy, Daniel N.
McLysaght, Aoife
author_sort Murphy, Daniel N.
collection PubMed
description BACKGROUND: New genes in eukaryotes are created through a variety of different mechanisms. De novo origin from non-coding DNA is a mechanism that has recently gained attention. So far, de novo genes have been described in a handful of organisms, with Drosophila being the most extensively studied. We searched for genes that have appeared de novo in the mouse and rat lineages. METHODOLOGY: Using a rigorous and conservative approach we identify 75 murine genes (69 mouse genes and 6 rat genes) for which there is good evidence of de novo origin since the divergence of mouse and rat. Each of these genes is only found in either the mouse or rat lineages, with no candidate orthologs nor evidence for potentially-unannotated orthologs in the other lineage. The veracity of each of these genes is supported by expression evidence. Additionally, their presence in one lineage and absence in the other cannot be explained by sequencing gaps. For 11 of the 75 candidate novel genes we could identify a mouse-specific mutation that led to the creation of the open reading frame (ORF) specifically in mouse. None of the six rat-specific genes had an unequivocal rat-specific mutation creating the ORF, which may at least be partly due to lower data quality for that genome. CONCLUSIONS: All 75 candidate genes presented in this study are relatively small and encode short peptides. A large number of them (51 out of 69 mouse genes and 3 out of 6 rat genes) also overlap with other genes, either within introns, or on the opposite strand. These characteristics have previously been documented for de novo genes. The description of these genes opens up the opportunity to integrate this evolutionary analysis with the rich experimental data available for these two model organisms.
format Online
Article
Text
id pubmed-3504067
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35040672012-11-26 De Novo Origin of Protein-Coding Genes in Murine Rodents Murphy, Daniel N. McLysaght, Aoife PLoS One Research Article BACKGROUND: New genes in eukaryotes are created through a variety of different mechanisms. De novo origin from non-coding DNA is a mechanism that has recently gained attention. So far, de novo genes have been described in a handful of organisms, with Drosophila being the most extensively studied. We searched for genes that have appeared de novo in the mouse and rat lineages. METHODOLOGY: Using a rigorous and conservative approach we identify 75 murine genes (69 mouse genes and 6 rat genes) for which there is good evidence of de novo origin since the divergence of mouse and rat. Each of these genes is only found in either the mouse or rat lineages, with no candidate orthologs nor evidence for potentially-unannotated orthologs in the other lineage. The veracity of each of these genes is supported by expression evidence. Additionally, their presence in one lineage and absence in the other cannot be explained by sequencing gaps. For 11 of the 75 candidate novel genes we could identify a mouse-specific mutation that led to the creation of the open reading frame (ORF) specifically in mouse. None of the six rat-specific genes had an unequivocal rat-specific mutation creating the ORF, which may at least be partly due to lower data quality for that genome. CONCLUSIONS: All 75 candidate genes presented in this study are relatively small and encode short peptides. A large number of them (51 out of 69 mouse genes and 3 out of 6 rat genes) also overlap with other genes, either within introns, or on the opposite strand. These characteristics have previously been documented for de novo genes. The description of these genes opens up the opportunity to integrate this evolutionary analysis with the rich experimental data available for these two model organisms. Public Library of Science 2012-11-21 /pmc/articles/PMC3504067/ /pubmed/23185269 http://dx.doi.org/10.1371/journal.pone.0048650 Text en © 2012 Murphy, McLysaght http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Murphy, Daniel N.
McLysaght, Aoife
De Novo Origin of Protein-Coding Genes in Murine Rodents
title De Novo Origin of Protein-Coding Genes in Murine Rodents
title_full De Novo Origin of Protein-Coding Genes in Murine Rodents
title_fullStr De Novo Origin of Protein-Coding Genes in Murine Rodents
title_full_unstemmed De Novo Origin of Protein-Coding Genes in Murine Rodents
title_short De Novo Origin of Protein-Coding Genes in Murine Rodents
title_sort de novo origin of protein-coding genes in murine rodents
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3504067/
https://www.ncbi.nlm.nih.gov/pubmed/23185269
http://dx.doi.org/10.1371/journal.pone.0048650
work_keys_str_mv AT murphydanieln denovooriginofproteincodinggenesinmurinerodents
AT mclysaghtaoife denovooriginofproteincodinggenesinmurinerodents