Cargando…

Correlations between alignment gaps and nucleotide substitution or amino acid replacement

To assess the conventional treatment in evolutionary inference of alignment gaps as missing data, we propose a simple nonparametric test of the null hypothesis that the locations of alignment gaps are independent of the nucleotide substitution or amino acid replacement process. When we apply the tes...

Descripción completa

Detalles Bibliográficos
Autores principales: Seo, Tae-Kun, Redelings, Benjamin D., Thorne, Jeffrey L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9407537/
https://www.ncbi.nlm.nih.gov/pubmed/35972964
http://dx.doi.org/10.1073/pnas.2204435119
_version_ 1784774388369850368
author Seo, Tae-Kun
Redelings, Benjamin D.
Thorne, Jeffrey L.
author_facet Seo, Tae-Kun
Redelings, Benjamin D.
Thorne, Jeffrey L.
author_sort Seo, Tae-Kun
collection PubMed
description To assess the conventional treatment in evolutionary inference of alignment gaps as missing data, we propose a simple nonparametric test of the null hypothesis that the locations of alignment gaps are independent of the nucleotide substitution or amino acid replacement process. When we apply the test to 1,390 protein alignments that are informed by protein tertiary structure and use a 5% significance level, the null hypothesis of independence between amino acid replacement and gap location is rejected for ∼65% of datasets. Via simulations that include substitution and insertion–deletion, we show that the test performs well with true alignments. When we simulate according to the null hypothesis and then apply the test to optimal alignments that are inferred by each of four widely used software packages, the null hypothesis is rejected too frequently. Via further simulations and analyses, we show that the overly frequent rejections of the null hypothesis are not solely due to weaknesses of widely used software for finding optimal alignments. Instead, our evidence suggests that optimal alignments are unrepresentative of true alignments and that biased evolutionary inferences may result from relying upon individual optimal alignments.
format Online
Article
Text
id pubmed-9407537
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-94075372023-02-16 Correlations between alignment gaps and nucleotide substitution or amino acid replacement Seo, Tae-Kun Redelings, Benjamin D. Thorne, Jeffrey L. Proc Natl Acad Sci U S A Biological Sciences To assess the conventional treatment in evolutionary inference of alignment gaps as missing data, we propose a simple nonparametric test of the null hypothesis that the locations of alignment gaps are independent of the nucleotide substitution or amino acid replacement process. When we apply the test to 1,390 protein alignments that are informed by protein tertiary structure and use a 5% significance level, the null hypothesis of independence between amino acid replacement and gap location is rejected for ∼65% of datasets. Via simulations that include substitution and insertion–deletion, we show that the test performs well with true alignments. When we simulate according to the null hypothesis and then apply the test to optimal alignments that are inferred by each of four widely used software packages, the null hypothesis is rejected too frequently. Via further simulations and analyses, we show that the overly frequent rejections of the null hypothesis are not solely due to weaknesses of widely used software for finding optimal alignments. Instead, our evidence suggests that optimal alignments are unrepresentative of true alignments and that biased evolutionary inferences may result from relying upon individual optimal alignments. National Academy of Sciences 2022-08-16 2022-08-23 /pmc/articles/PMC9407537/ /pubmed/35972964 http://dx.doi.org/10.1073/pnas.2204435119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Seo, Tae-Kun
Redelings, Benjamin D.
Thorne, Jeffrey L.
Correlations between alignment gaps and nucleotide substitution or amino acid replacement
title Correlations between alignment gaps and nucleotide substitution or amino acid replacement
title_full Correlations between alignment gaps and nucleotide substitution or amino acid replacement
title_fullStr Correlations between alignment gaps and nucleotide substitution or amino acid replacement
title_full_unstemmed Correlations between alignment gaps and nucleotide substitution or amino acid replacement
title_short Correlations between alignment gaps and nucleotide substitution or amino acid replacement
title_sort correlations between alignment gaps and nucleotide substitution or amino acid replacement
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9407537/
https://www.ncbi.nlm.nih.gov/pubmed/35972964
http://dx.doi.org/10.1073/pnas.2204435119
work_keys_str_mv AT seotaekun correlationsbetweenalignmentgapsandnucleotidesubstitutionoraminoacidreplacement
AT redelingsbenjamind correlationsbetweenalignmentgapsandnucleotidesubstitutionoraminoacidreplacement
AT thornejeffreyl correlationsbetweenalignmentgapsandnucleotidesubstitutionoraminoacidreplacement