Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players

Bibliographic Details
Main Author: Delihas, Nicholas
Format: Online Article Text
Language: English
Published: Public Library of Science 2020
Subjects: Research Article
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7098633/
https://www.ncbi.nlm.nih.gov/pubmed/32214344
http://dx.doi.org/10.1371/journal.pone.0230236
collection PubMed
description Pathways leading to formation of non-coding RNA and protein genes are varied and complex. We report finding a conserved repeat sequence present in human and chimpanzee genomes that appears to have originated from a common primate ancestor. This sequence is repeatedly copied in human chromosome 22 (chr22) low copy repeats (LCR22) or segmental duplications and forms twenty-one different genes, which include the human long intergenic non-coding RNA (lincRNA) family FAM230, a newly discovered lincRNA gene family termed conserved long intergenic non-coding RNAs (clincRNA), pseudogene families, as well as the gamma-glutamyltransferase (GGT) protein gene family and the RNA pseudogenes that originate from GGT sequences. Of particular interest are the GGT5 and USP18 protein genes that appear to have formed from a homologous repeat sequence that also forms the clincRNA gene family. The data point to ancestral DNA sequences, conserved through evolution and duplicated in humans by chromosomal repeat sequences, that may serve as functional genomic elements in the development of diverse genes.
id pubmed-7098633
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-7098633 2020-04-03 PLoS One Research Article Public Library of Science 2020-03-26 Text en
© 2020 Nicholas Delihas http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
topic Research Article