Cargando…

The difficulty of aligning intrinsically disordered protein sequences as assessed by conservation and phylogeny

Intrinsically disordered proteins (IDPs) are proteins that lack a stable 3D structure but maintain a biological function. It has been frequently suggested that IDPs are difficult to align because they tend to have fewer conserved residues compared to ordered proteins, but to our knowledge this has n...

Descripción completa

Detalles Bibliográficos
Autores principales: Riley, Andrew C., Ashlock, Daniel A., Graether, Steffen P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10343074/
https://www.ncbi.nlm.nih.gov/pubmed/37440576
http://dx.doi.org/10.1371/journal.pone.0288388
_version_ 1785072651478237184
author Riley, Andrew C.
Ashlock, Daniel A.
Graether, Steffen P.
author_facet Riley, Andrew C.
Ashlock, Daniel A.
Graether, Steffen P.
author_sort Riley, Andrew C.
collection PubMed
description Intrinsically disordered proteins (IDPs) are proteins that lack a stable 3D structure but maintain a biological function. It has been frequently suggested that IDPs are difficult to align because they tend to have fewer conserved residues compared to ordered proteins, but to our knowledge this has never been directly tested. To compare the alignments of ordered proteins to IDPs, their multiple sequence alignments (MSAs) were assessed using two different methods. The first compared the similarity between MSAs produced using the same sequences but created with Clustal Omega, MAFFT, and MUSCLE. The second assessed MSAs based on how well they recapitulated the species tree. These two methods measure the “correctness” of an MSA with two different approaches; the first method measures consistency while the second measures the underlying phylogenetic signal. Proteins that contained both regions of disorder and order were analyzed along with proteins that were fully disordered and fully ordered, using nucleotide, codon and peptide sequence alignments. We observed that IDPs had less similar MSAs than ordered proteins, which is most likely linked to the lower sequence conservation in IDPs. However, comparisons of tree distances found that trees from the ordered sequence MSAs were not significantly closer to the species tree than those inferred from disordered sequence MSAs. Our results show that it is correct to say that IDPs are difficult to align on the basis of MSA consistency, but that this does not equate with alignments being of poor quality when assessed by their ability to correctly infer a species tree.
format Online
Article
Text
id pubmed-10343074
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-103430742023-07-14 The difficulty of aligning intrinsically disordered protein sequences as assessed by conservation and phylogeny Riley, Andrew C. Ashlock, Daniel A. Graether, Steffen P. PLoS One Research Article Intrinsically disordered proteins (IDPs) are proteins that lack a stable 3D structure but maintain a biological function. It has been frequently suggested that IDPs are difficult to align because they tend to have fewer conserved residues compared to ordered proteins, but to our knowledge this has never been directly tested. To compare the alignments of ordered proteins to IDPs, their multiple sequence alignments (MSAs) were assessed using two different methods. The first compared the similarity between MSAs produced using the same sequences but created with Clustal Omega, MAFFT, and MUSCLE. The second assessed MSAs based on how well they recapitulated the species tree. These two methods measure the “correctness” of an MSA with two different approaches; the first method measures consistency while the second measures the underlying phylogenetic signal. Proteins that contained both regions of disorder and order were analyzed along with proteins that were fully disordered and fully ordered, using nucleotide, codon and peptide sequence alignments. We observed that IDPs had less similar MSAs than ordered proteins, which is most likely linked to the lower sequence conservation in IDPs. However, comparisons of tree distances found that trees from the ordered sequence MSAs were not significantly closer to the species tree than those inferred from disordered sequence MSAs. Our results show that it is correct to say that IDPs are difficult to align on the basis of MSA consistency, but that this does not equate with alignments being of poor quality when assessed by their ability to correctly infer a species tree. Public Library of Science 2023-07-13 /pmc/articles/PMC10343074/ /pubmed/37440576 http://dx.doi.org/10.1371/journal.pone.0288388 Text en © 2023 Riley et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Riley, Andrew C.
Ashlock, Daniel A.
Graether, Steffen P.
The difficulty of aligning intrinsically disordered protein sequences as assessed by conservation and phylogeny
title The difficulty of aligning intrinsically disordered protein sequences as assessed by conservation and phylogeny
title_full The difficulty of aligning intrinsically disordered protein sequences as assessed by conservation and phylogeny
title_fullStr The difficulty of aligning intrinsically disordered protein sequences as assessed by conservation and phylogeny
title_full_unstemmed The difficulty of aligning intrinsically disordered protein sequences as assessed by conservation and phylogeny
title_short The difficulty of aligning intrinsically disordered protein sequences as assessed by conservation and phylogeny
title_sort difficulty of aligning intrinsically disordered protein sequences as assessed by conservation and phylogeny
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10343074/
https://www.ncbi.nlm.nih.gov/pubmed/37440576
http://dx.doi.org/10.1371/journal.pone.0288388
work_keys_str_mv AT rileyandrewc thedifficultyofaligningintrinsicallydisorderedproteinsequencesasassessedbyconservationandphylogeny
AT ashlockdaniela thedifficultyofaligningintrinsicallydisorderedproteinsequencesasassessedbyconservationandphylogeny
AT graethersteffenp thedifficultyofaligningintrinsicallydisorderedproteinsequencesasassessedbyconservationandphylogeny
AT rileyandrewc difficultyofaligningintrinsicallydisorderedproteinsequencesasassessedbyconservationandphylogeny
AT ashlockdaniela difficultyofaligningintrinsicallydisorderedproteinsequencesasassessedbyconservationandphylogeny
AT graethersteffenp difficultyofaligningintrinsicallydisorderedproteinsequencesasassessedbyconservationandphylogeny