Cargando…

Origin of the human L1 elements: Proposed progenitor genes deduced from a consensus DNA sequence()

A consensus sequence for the human long interspersed repeated DNA element, L1H8 (LINE or KpnI sequence), is presented. The sequence contains two open reading frames (ORFs) which are homologous to ORFs in corresponding regions of L1 elements in other species. The L1H8 ORFs are separated by a small ev...

Descripción completa

Detalles Bibliográficos
Autores principales: Scott, Alan F., Schmeckpeper, Barbara J., Abdelrazik, Mona, Comey, Catherine Theisen, O'Hara, Bruce, Rossiter, Judith Pratt, Cooley, Tim, Heath, Peter, Smith, Kirby D., Margolet, Louise
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Published by Elsevier Inc. 1987
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7135745/
https://www.ncbi.nlm.nih.gov/pubmed/3692483
http://dx.doi.org/10.1016/0888-7543(87)90003-6
_version_ 1783518125409435648
author Scott, Alan F.
Schmeckpeper, Barbara J.
Abdelrazik, Mona
Comey, Catherine Theisen
O'Hara, Bruce
Rossiter, Judith Pratt
Cooley, Tim
Heath, Peter
Smith, Kirby D.
Margolet, Louise
author_facet Scott, Alan F.
Schmeckpeper, Barbara J.
Abdelrazik, Mona
Comey, Catherine Theisen
O'Hara, Bruce
Rossiter, Judith Pratt
Cooley, Tim
Heath, Peter
Smith, Kirby D.
Margolet, Louise
author_sort Scott, Alan F.
collection PubMed
description A consensus sequence for the human long interspersed repeated DNA element, L1H8 (LINE or KpnI sequence), is presented. The sequence contains two open reading frames (ORFs) which are homologous to ORFs in corresponding regions of L1 elements in other species. The L1H8 ORFs are separated by a small evolutionarily nonconserved region. The 5′ end of the consensus contains frequent terminators in all three reading frames and has a relatively high GC content with numerous stretches of weak homology with AluI repeats. The 5′ ORF extends for a minimum of 723 bp (241 codons). The 3′ ORF is 3843 bp (1281 codons) and predicts a protein of 149 kD which has regions of weak homology to the polymerase domain of various reverse transcriptases. The 3′ end of the consensus has a 208-bp nonconserved region followed by an adenine-rich end. The organization of the L1H8 consensus sequence resembles the structure of eukaryotic mRNAs except for the noncoding region between ORFs. However, due to base substitutions or truncation most elements appear incapable of producing mRNA that can be translated. Our observation that individual elements cluster into subfamilies on the basis of the presence or absence of blocks of sequence, or by the linkage of alternative bases at multiple positions, suggests that most L1 sequences were derived from a small number of structural genes. An estimate of the mammalian L1 substitution rate was derived and used to predict the age of individual human elements. From this it follows that the majority of human L1 sequences have been generated within the last 30 million years. The human elements studied here differ from each other, yet overall the L1H8 sequences demonstrate a pattern of species-specificity when compared to the L1 families of other mammals. Possible mechanisms that may account for the origin and evolution of the L1 family are discussed. These include pseudogene formation (retroposition), transposition, gene conversion, and RNA recombination.
format Online
Article
Text
id pubmed-7135745
institution National Center for Biotechnology Information
language English
publishDate 1987
publisher Published by Elsevier Inc.
record_format MEDLINE/PubMed
spelling pubmed-71357452020-04-08 Origin of the human L1 elements: Proposed progenitor genes deduced from a consensus DNA sequence() Scott, Alan F. Schmeckpeper, Barbara J. Abdelrazik, Mona Comey, Catherine Theisen O'Hara, Bruce Rossiter, Judith Pratt Cooley, Tim Heath, Peter Smith, Kirby D. Margolet, Louise Genomics Article A consensus sequence for the human long interspersed repeated DNA element, L1H8 (LINE or KpnI sequence), is presented. The sequence contains two open reading frames (ORFs) which are homologous to ORFs in corresponding regions of L1 elements in other species. The L1H8 ORFs are separated by a small evolutionarily nonconserved region. The 5′ end of the consensus contains frequent terminators in all three reading frames and has a relatively high GC content with numerous stretches of weak homology with AluI repeats. The 5′ ORF extends for a minimum of 723 bp (241 codons). The 3′ ORF is 3843 bp (1281 codons) and predicts a protein of 149 kD which has regions of weak homology to the polymerase domain of various reverse transcriptases. The 3′ end of the consensus has a 208-bp nonconserved region followed by an adenine-rich end. The organization of the L1H8 consensus sequence resembles the structure of eukaryotic mRNAs except for the noncoding region between ORFs. However, due to base substitutions or truncation most elements appear incapable of producing mRNA that can be translated. Our observation that individual elements cluster into subfamilies on the basis of the presence or absence of blocks of sequence, or by the linkage of alternative bases at multiple positions, suggests that most L1 sequences were derived from a small number of structural genes. An estimate of the mammalian L1 substitution rate was derived and used to predict the age of individual human elements. From this it follows that the majority of human L1 sequences have been generated within the last 30 million years. The human elements studied here differ from each other, yet overall the L1H8 sequences demonstrate a pattern of species-specificity when compared to the L1 families of other mammals. Possible mechanisms that may account for the origin and evolution of the L1 family are discussed. These include pseudogene formation (retroposition), transposition, gene conversion, and RNA recombination. Published by Elsevier Inc. 1987-10 2004-11-30 /pmc/articles/PMC7135745/ /pubmed/3692483 http://dx.doi.org/10.1016/0888-7543(87)90003-6 Text en Copyright © 1987 Published by Elsevier Inc. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Scott, Alan F.
Schmeckpeper, Barbara J.
Abdelrazik, Mona
Comey, Catherine Theisen
O'Hara, Bruce
Rossiter, Judith Pratt
Cooley, Tim
Heath, Peter
Smith, Kirby D.
Margolet, Louise
Origin of the human L1 elements: Proposed progenitor genes deduced from a consensus DNA sequence()
title Origin of the human L1 elements: Proposed progenitor genes deduced from a consensus DNA sequence()
title_full Origin of the human L1 elements: Proposed progenitor genes deduced from a consensus DNA sequence()
title_fullStr Origin of the human L1 elements: Proposed progenitor genes deduced from a consensus DNA sequence()
title_full_unstemmed Origin of the human L1 elements: Proposed progenitor genes deduced from a consensus DNA sequence()
title_short Origin of the human L1 elements: Proposed progenitor genes deduced from a consensus DNA sequence()
title_sort origin of the human l1 elements: proposed progenitor genes deduced from a consensus dna sequence()
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7135745/
https://www.ncbi.nlm.nih.gov/pubmed/3692483
http://dx.doi.org/10.1016/0888-7543(87)90003-6
work_keys_str_mv AT scottalanf originofthehumanl1elementsproposedprogenitorgenesdeducedfromaconsensusdnasequence
AT schmeckpeperbarbaraj originofthehumanl1elementsproposedprogenitorgenesdeducedfromaconsensusdnasequence
AT abdelrazikmona originofthehumanl1elementsproposedprogenitorgenesdeducedfromaconsensusdnasequence
AT comeycatherinetheisen originofthehumanl1elementsproposedprogenitorgenesdeducedfromaconsensusdnasequence
AT oharabruce originofthehumanl1elementsproposedprogenitorgenesdeducedfromaconsensusdnasequence
AT rossiterjudithpratt originofthehumanl1elementsproposedprogenitorgenesdeducedfromaconsensusdnasequence
AT cooleytim originofthehumanl1elementsproposedprogenitorgenesdeducedfromaconsensusdnasequence
AT heathpeter originofthehumanl1elementsproposedprogenitorgenesdeducedfromaconsensusdnasequence
AT smithkirbyd originofthehumanl1elementsproposedprogenitorgenesdeducedfromaconsensusdnasequence
AT margoletlouise originofthehumanl1elementsproposedprogenitorgenesdeducedfromaconsensusdnasequence