Cargando…

SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment

Understanding how proteins mutate is critical to solving a host of biological problems. Mutations occur when an amino acid is substituted for another in a protein sequence. The set of likelihoods for amino acid substitutions is stored in a matrix and input to alignment algorithms. The quality of the...

Descripción completa

Detalles Bibliográficos
Autores principales: Barlowe, Scott, Coan, Heather B., Youker, Robert T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5490468/
https://www.ncbi.nlm.nih.gov/pubmed/28674656
http://dx.doi.org/10.7717/peerj.3492
_version_ 1783246990019133440
author Barlowe, Scott
Coan, Heather B.
Youker, Robert T.
author_facet Barlowe, Scott
Coan, Heather B.
Youker, Robert T.
author_sort Barlowe, Scott
collection PubMed
description Understanding how proteins mutate is critical to solving a host of biological problems. Mutations occur when an amino acid is substituted for another in a protein sequence. The set of likelihoods for amino acid substitutions is stored in a matrix and input to alignment algorithms. The quality of the resulting alignment is used to assess the similarity of two or more sequences and can vary according to assumptions modeled by the substitution matrix. Substitution strategies with minor parameter variations are often grouped together in families. For example, the BLOSUM and PAM matrix families are commonly used because they provide a standard, predefined way of modeling substitutions. However, researchers often do not know if a given matrix family or any individual matrix within a family is the most suitable. Furthermore, predefined matrix families may inaccurately reflect a particular hypothesis that a researcher wishes to model or otherwise result in unsatisfactory alignments. In these cases, the ability to compare the effects of one or more custom matrices may be needed. This laborious process is often performed manually because the ability to simultaneously load multiple matrices and then compare their effects on alignments is not readily available in current software tools. This paper presents SubVis, an interactive R package for loading and applying multiple substitution matrices to pairwise alignments. Users can simultaneously explore alignments resulting from multiple predefined and custom substitution matrices. SubVis utilizes several of the alignment functions found in R, a common language among protein scientists. Functions are tied together with the Shiny platform which allows the modification of input parameters. Information regarding alignment quality and individual amino acid substitutions is displayed with the JavaScript language which provides interactive visualizations for revealing both high-level and low-level alignment information.
format Online
Article
Text
id pubmed-5490468
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-54904682017-07-03 SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment Barlowe, Scott Coan, Heather B. Youker, Robert T. PeerJ Bioinformatics Understanding how proteins mutate is critical to solving a host of biological problems. Mutations occur when an amino acid is substituted for another in a protein sequence. The set of likelihoods for amino acid substitutions is stored in a matrix and input to alignment algorithms. The quality of the resulting alignment is used to assess the similarity of two or more sequences and can vary according to assumptions modeled by the substitution matrix. Substitution strategies with minor parameter variations are often grouped together in families. For example, the BLOSUM and PAM matrix families are commonly used because they provide a standard, predefined way of modeling substitutions. However, researchers often do not know if a given matrix family or any individual matrix within a family is the most suitable. Furthermore, predefined matrix families may inaccurately reflect a particular hypothesis that a researcher wishes to model or otherwise result in unsatisfactory alignments. In these cases, the ability to compare the effects of one or more custom matrices may be needed. This laborious process is often performed manually because the ability to simultaneously load multiple matrices and then compare their effects on alignments is not readily available in current software tools. This paper presents SubVis, an interactive R package for loading and applying multiple substitution matrices to pairwise alignments. Users can simultaneously explore alignments resulting from multiple predefined and custom substitution matrices. SubVis utilizes several of the alignment functions found in R, a common language among protein scientists. Functions are tied together with the Shiny platform which allows the modification of input parameters. Information regarding alignment quality and individual amino acid substitutions is displayed with the JavaScript language which provides interactive visualizations for revealing both high-level and low-level alignment information. PeerJ Inc. 2017-06-27 /pmc/articles/PMC5490468/ /pubmed/28674656 http://dx.doi.org/10.7717/peerj.3492 Text en ©2017 Barlowe et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Barlowe, Scott
Coan, Heather B.
Youker, Robert T.
SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment
title SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment
title_full SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment
title_fullStr SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment
title_full_unstemmed SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment
title_short SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment
title_sort subvis: an interactive r package for exploring the effects of multiple substitution matrices on pairwise sequence alignment
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5490468/
https://www.ncbi.nlm.nih.gov/pubmed/28674656
http://dx.doi.org/10.7717/peerj.3492
work_keys_str_mv AT barlowescott subvisaninteractiverpackageforexploringtheeffectsofmultiplesubstitutionmatricesonpairwisesequencealignment
AT coanheatherb subvisaninteractiverpackageforexploringtheeffectsofmultiplesubstitutionmatricesonpairwisesequencealignment
AT youkerrobertt subvisaninteractiverpackageforexploringtheeffectsofmultiplesubstitutionmatricesonpairwisesequencealignment