Cargando…
SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment
Understanding how proteins mutate is critical to solving a host of biological problems. Mutations occur when an amino acid is substituted for another in a protein sequence. The set of likelihoods for amino acid substitutions is stored in a matrix and input to alignment algorithms. The quality of the...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5490468/ https://www.ncbi.nlm.nih.gov/pubmed/28674656 http://dx.doi.org/10.7717/peerj.3492 |
_version_ | 1783246990019133440 |
---|---|
author | Barlowe, Scott Coan, Heather B. Youker, Robert T. |
author_facet | Barlowe, Scott Coan, Heather B. Youker, Robert T. |
author_sort | Barlowe, Scott |
collection | PubMed |
description | Understanding how proteins mutate is critical to solving a host of biological problems. Mutations occur when an amino acid is substituted for another in a protein sequence. The set of likelihoods for amino acid substitutions is stored in a matrix and input to alignment algorithms. The quality of the resulting alignment is used to assess the similarity of two or more sequences and can vary according to assumptions modeled by the substitution matrix. Substitution strategies with minor parameter variations are often grouped together in families. For example, the BLOSUM and PAM matrix families are commonly used because they provide a standard, predefined way of modeling substitutions. However, researchers often do not know if a given matrix family or any individual matrix within a family is the most suitable. Furthermore, predefined matrix families may inaccurately reflect a particular hypothesis that a researcher wishes to model or otherwise result in unsatisfactory alignments. In these cases, the ability to compare the effects of one or more custom matrices may be needed. This laborious process is often performed manually because the ability to simultaneously load multiple matrices and then compare their effects on alignments is not readily available in current software tools. This paper presents SubVis, an interactive R package for loading and applying multiple substitution matrices to pairwise alignments. Users can simultaneously explore alignments resulting from multiple predefined and custom substitution matrices. SubVis utilizes several of the alignment functions found in R, a common language among protein scientists. Functions are tied together with the Shiny platform which allows the modification of input parameters. Information regarding alignment quality and individual amino acid substitutions is displayed with the JavaScript language which provides interactive visualizations for revealing both high-level and low-level alignment information. |
format | Online Article Text |
id | pubmed-5490468 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-54904682017-07-03 SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment Barlowe, Scott Coan, Heather B. Youker, Robert T. PeerJ Bioinformatics Understanding how proteins mutate is critical to solving a host of biological problems. Mutations occur when an amino acid is substituted for another in a protein sequence. The set of likelihoods for amino acid substitutions is stored in a matrix and input to alignment algorithms. The quality of the resulting alignment is used to assess the similarity of two or more sequences and can vary according to assumptions modeled by the substitution matrix. Substitution strategies with minor parameter variations are often grouped together in families. For example, the BLOSUM and PAM matrix families are commonly used because they provide a standard, predefined way of modeling substitutions. However, researchers often do not know if a given matrix family or any individual matrix within a family is the most suitable. Furthermore, predefined matrix families may inaccurately reflect a particular hypothesis that a researcher wishes to model or otherwise result in unsatisfactory alignments. In these cases, the ability to compare the effects of one or more custom matrices may be needed. This laborious process is often performed manually because the ability to simultaneously load multiple matrices and then compare their effects on alignments is not readily available in current software tools. This paper presents SubVis, an interactive R package for loading and applying multiple substitution matrices to pairwise alignments. Users can simultaneously explore alignments resulting from multiple predefined and custom substitution matrices. SubVis utilizes several of the alignment functions found in R, a common language among protein scientists. Functions are tied together with the Shiny platform which allows the modification of input parameters. Information regarding alignment quality and individual amino acid substitutions is displayed with the JavaScript language which provides interactive visualizations for revealing both high-level and low-level alignment information. PeerJ Inc. 2017-06-27 /pmc/articles/PMC5490468/ /pubmed/28674656 http://dx.doi.org/10.7717/peerj.3492 Text en ©2017 Barlowe et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Barlowe, Scott Coan, Heather B. Youker, Robert T. SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment |
title | SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment |
title_full | SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment |
title_fullStr | SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment |
title_full_unstemmed | SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment |
title_short | SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment |
title_sort | subvis: an interactive r package for exploring the effects of multiple substitution matrices on pairwise sequence alignment |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5490468/ https://www.ncbi.nlm.nih.gov/pubmed/28674656 http://dx.doi.org/10.7717/peerj.3492 |
work_keys_str_mv | AT barlowescott subvisaninteractiverpackageforexploringtheeffectsofmultiplesubstitutionmatricesonpairwisesequencealignment AT coanheatherb subvisaninteractiverpackageforexploringtheeffectsofmultiplesubstitutionmatricesonpairwisesequencealignment AT youkerrobertt subvisaninteractiverpackageforexploringtheeffectsofmultiplesubstitutionmatricesonpairwisesequencealignment |