Cargando…

The nucleotide landscape of polyXY regions

PolyXY regions are compositionally biased regions composed of two different amino acids. They are classified according to the arrangement of the two amino acid types ‘X′ and ‘Y′ into direpeats (composed of alternating amino acids, e.g. ‘XYXYXY’), joined (composed of two consecutive stretches of each...

Descripción completa

Detalles Bibliográficos
Autores principales: Mier, Pablo, Andrade-Navarro, Miguel A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10652141/
https://www.ncbi.nlm.nih.gov/pubmed/38022702
http://dx.doi.org/10.1016/j.csbj.2023.10.054
_version_ 1785136146242600960
author Mier, Pablo
Andrade-Navarro, Miguel A.
author_facet Mier, Pablo
Andrade-Navarro, Miguel A.
author_sort Mier, Pablo
collection PubMed
description PolyXY regions are compositionally biased regions composed of two different amino acids. They are classified according to the arrangement of the two amino acid types ‘X′ and ‘Y′ into direpeats (composed of alternating amino acids, e.g. ‘XYXYXY’), joined (composed of two consecutive stretches of each amino acid, e.g. ‘XXXYYY’) and shuffled (other arrangements, e.g., ‘XYXXYY’). They have been characterized at the amino acid level in all domains of life, and are described as often found within intrinsically disordered regions. Since DNA replication slippage has been proposed as a driver of repeat variation, and given that some polyXY have a repetitive nature, we hypothesized that characterizing the nucleotide coding of various types of polyXY could give hints about their origin and evolution. To test this, we obtained all polyXY regions in the human transcriptome, categorized them, and studied their coding nucleotide sequences. We observed that polyXY exacerbates the codon biases, and that the similarity between the X and Y codons is higher than in the background proteome. Our results support a general mechanism of emergence and evolution of polyXY from single-codon polyX. PolyXY are revealed as hotspots for replication slippage, particularly those composed of repeats: joined and direpeat polyXY. Inter-conversion to shuffled polyXY disrupts nucleotide repeats and restricts further evolution by replication slippage, a mechanism that we previously observed in polyX. Our results shed light on polyXY composition and should simplify the determination of their functions.
format Online
Article
Text
id pubmed-10652141
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-106521412023-10-31 The nucleotide landscape of polyXY regions Mier, Pablo Andrade-Navarro, Miguel A. Comput Struct Biotechnol J Short Communication PolyXY regions are compositionally biased regions composed of two different amino acids. They are classified according to the arrangement of the two amino acid types ‘X′ and ‘Y′ into direpeats (composed of alternating amino acids, e.g. ‘XYXYXY’), joined (composed of two consecutive stretches of each amino acid, e.g. ‘XXXYYY’) and shuffled (other arrangements, e.g., ‘XYXXYY’). They have been characterized at the amino acid level in all domains of life, and are described as often found within intrinsically disordered regions. Since DNA replication slippage has been proposed as a driver of repeat variation, and given that some polyXY have a repetitive nature, we hypothesized that characterizing the nucleotide coding of various types of polyXY could give hints about their origin and evolution. To test this, we obtained all polyXY regions in the human transcriptome, categorized them, and studied their coding nucleotide sequences. We observed that polyXY exacerbates the codon biases, and that the similarity between the X and Y codons is higher than in the background proteome. Our results support a general mechanism of emergence and evolution of polyXY from single-codon polyX. PolyXY are revealed as hotspots for replication slippage, particularly those composed of repeats: joined and direpeat polyXY. Inter-conversion to shuffled polyXY disrupts nucleotide repeats and restricts further evolution by replication slippage, a mechanism that we previously observed in polyX. Our results shed light on polyXY composition and should simplify the determination of their functions. Research Network of Computational and Structural Biotechnology 2023-10-31 /pmc/articles/PMC10652141/ /pubmed/38022702 http://dx.doi.org/10.1016/j.csbj.2023.10.054 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Short Communication
Mier, Pablo
Andrade-Navarro, Miguel A.
The nucleotide landscape of polyXY regions
title The nucleotide landscape of polyXY regions
title_full The nucleotide landscape of polyXY regions
title_fullStr The nucleotide landscape of polyXY regions
title_full_unstemmed The nucleotide landscape of polyXY regions
title_short The nucleotide landscape of polyXY regions
title_sort nucleotide landscape of polyxy regions
topic Short Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10652141/
https://www.ncbi.nlm.nih.gov/pubmed/38022702
http://dx.doi.org/10.1016/j.csbj.2023.10.054
work_keys_str_mv AT mierpablo thenucleotidelandscapeofpolyxyregions
AT andradenavarromiguela thenucleotidelandscapeofpolyxyregions
AT mierpablo nucleotidelandscapeofpolyxyregions
AT andradenavarromiguela nucleotidelandscapeofpolyxyregions