Cargando…
Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players
Pathways leading to formation of non-coding RNA and protein genes are varied and complex. We report finding a conserved repeat sequence present in human and chimpanzee genomes that appears to have originated from a common primate ancestor. This sequence is repeatedly copied in human chromosome 22 (c...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7098633/ https://www.ncbi.nlm.nih.gov/pubmed/32214344 http://dx.doi.org/10.1371/journal.pone.0230236 |
_version_ | 1783511216860168192 |
---|---|
author | Delihas, Nicholas |
author_facet | Delihas, Nicholas |
author_sort | Delihas, Nicholas |
collection | PubMed |
description | Pathways leading to formation of non-coding RNA and protein genes are varied and complex. We report finding a conserved repeat sequence present in human and chimpanzee genomes that appears to have originated from a common primate ancestor. This sequence is repeatedly copied in human chromosome 22 (chr22) low copy repeats (LCR22) or segmental duplications and forms twenty-one different genes, which include the human long intergenic non-coding RNA (lincRNA) family FAM230, a newly discovered lincRNA gene family termed conserved long intergenic non-coding RNAs (clincRNA), pseudogene families, as well as the gamma-glutamyltransferase (GGT) protein gene family and the RNA pseudogenes that originate from GGT sequences. Of particular interest are the GGT5 and USP18 protein genes that appear to have formed from an homologous repeat sequence that also forms the clincRNA gene family. The data point to ancestral DNA sequences, conserved through evolution and duplicated in humans by chromosomal repeat sequences that may serve as functional genomic elements in the development of diverse genes. |
format | Online Article Text |
id | pubmed-7098633 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-70986332020-04-03 Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players Delihas, Nicholas PLoS One Research Article Pathways leading to formation of non-coding RNA and protein genes are varied and complex. We report finding a conserved repeat sequence present in human and chimpanzee genomes that appears to have originated from a common primate ancestor. This sequence is repeatedly copied in human chromosome 22 (chr22) low copy repeats (LCR22) or segmental duplications and forms twenty-one different genes, which include the human long intergenic non-coding RNA (lincRNA) family FAM230, a newly discovered lincRNA gene family termed conserved long intergenic non-coding RNAs (clincRNA), pseudogene families, as well as the gamma-glutamyltransferase (GGT) protein gene family and the RNA pseudogenes that originate from GGT sequences. Of particular interest are the GGT5 and USP18 protein genes that appear to have formed from an homologous repeat sequence that also forms the clincRNA gene family. The data point to ancestral DNA sequences, conserved through evolution and duplicated in humans by chromosomal repeat sequences that may serve as functional genomic elements in the development of diverse genes. Public Library of Science 2020-03-26 /pmc/articles/PMC7098633/ /pubmed/32214344 http://dx.doi.org/10.1371/journal.pone.0230236 Text en © 2020 Nicholas Delihas http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Delihas, Nicholas Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players |
title | Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players |
title_full | Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players |
title_fullStr | Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players |
title_full_unstemmed | Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players |
title_short | Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players |
title_sort | formation of human long intergenic non-coding rna genes, pseudogenes, and protein genes: ancestral sequences are key players |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7098633/ https://www.ncbi.nlm.nih.gov/pubmed/32214344 http://dx.doi.org/10.1371/journal.pone.0230236 |
work_keys_str_mv | AT delihasnicholas formationofhumanlongintergenicnoncodingrnagenespseudogenesandproteingenesancestralsequencesarekeyplayers |