Cargando…
SigniSite: Identification of residue-level genotype-phenotype correlations in protein multiple sequence alignments
Identifying which mutation(s) within a given genotype is responsible for an observable phenotype is important in many aspects of molecular biology. Here, we present SigniSite, an online application for subgroup-free residue-level genotype–phenotype correlation. In contrast to similar methods, SigniS...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692133/ https://www.ncbi.nlm.nih.gov/pubmed/23761454 http://dx.doi.org/10.1093/nar/gkt497 |
_version_ | 1782274577445945344 |
---|---|
author | Jessen, Leon Eyrich Hoof, Ilka Lund, Ole Nielsen, Morten |
author_facet | Jessen, Leon Eyrich Hoof, Ilka Lund, Ole Nielsen, Morten |
author_sort | Jessen, Leon Eyrich |
collection | PubMed |
description | Identifying which mutation(s) within a given genotype is responsible for an observable phenotype is important in many aspects of molecular biology. Here, we present SigniSite, an online application for subgroup-free residue-level genotype–phenotype correlation. In contrast to similar methods, SigniSite does not require any pre-definition of subgroups or binary classification. Input is a set of protein sequences where each sequence has an associated real number, quantifying a given phenotype. SigniSite will then identify which amino acid residues are significantly associated with the data set phenotype. As output, SigniSite displays a sequence logo, depicting the strength of the phenotype association of each residue and a heat-map identifying ‘hot’ or ‘cold’ regions. SigniSite was benchmarked against SPEER, a state-of-the-art method for the prediction of specificity determining positions (SDP) using a set of human immunodeficiency virus protease-inhibitor genotype–phenotype data and corresponding resistance mutation scores from the Stanford University HIV Drug Resistance Database, and a data set of protein families with experimentally annotated SDPs. For both data sets, SigniSite was found to outperform SPEER. SigniSite is available at: http://www.cbs.dtu.dk/services/SigniSite/. |
format | Online Article Text |
id | pubmed-3692133 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-36921332013-06-25 SigniSite: Identification of residue-level genotype-phenotype correlations in protein multiple sequence alignments Jessen, Leon Eyrich Hoof, Ilka Lund, Ole Nielsen, Morten Nucleic Acids Res Articles Identifying which mutation(s) within a given genotype is responsible for an observable phenotype is important in many aspects of molecular biology. Here, we present SigniSite, an online application for subgroup-free residue-level genotype–phenotype correlation. In contrast to similar methods, SigniSite does not require any pre-definition of subgroups or binary classification. Input is a set of protein sequences where each sequence has an associated real number, quantifying a given phenotype. SigniSite will then identify which amino acid residues are significantly associated with the data set phenotype. As output, SigniSite displays a sequence logo, depicting the strength of the phenotype association of each residue and a heat-map identifying ‘hot’ or ‘cold’ regions. SigniSite was benchmarked against SPEER, a state-of-the-art method for the prediction of specificity determining positions (SDP) using a set of human immunodeficiency virus protease-inhibitor genotype–phenotype data and corresponding resistance mutation scores from the Stanford University HIV Drug Resistance Database, and a data set of protein families with experimentally annotated SDPs. For both data sets, SigniSite was found to outperform SPEER. SigniSite is available at: http://www.cbs.dtu.dk/services/SigniSite/. Oxford University Press 2013-07 2013-06-11 /pmc/articles/PMC3692133/ /pubmed/23761454 http://dx.doi.org/10.1093/nar/gkt497 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Articles Jessen, Leon Eyrich Hoof, Ilka Lund, Ole Nielsen, Morten SigniSite: Identification of residue-level genotype-phenotype correlations in protein multiple sequence alignments |
title | SigniSite: Identification of residue-level genotype-phenotype correlations in protein multiple sequence alignments |
title_full | SigniSite: Identification of residue-level genotype-phenotype correlations in protein multiple sequence alignments |
title_fullStr | SigniSite: Identification of residue-level genotype-phenotype correlations in protein multiple sequence alignments |
title_full_unstemmed | SigniSite: Identification of residue-level genotype-phenotype correlations in protein multiple sequence alignments |
title_short | SigniSite: Identification of residue-level genotype-phenotype correlations in protein multiple sequence alignments |
title_sort | signisite: identification of residue-level genotype-phenotype correlations in protein multiple sequence alignments |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692133/ https://www.ncbi.nlm.nih.gov/pubmed/23761454 http://dx.doi.org/10.1093/nar/gkt497 |
work_keys_str_mv | AT jessenleoneyrich signisiteidentificationofresiduelevelgenotypephenotypecorrelationsinproteinmultiplesequencealignments AT hoofilka signisiteidentificationofresiduelevelgenotypephenotypecorrelationsinproteinmultiplesequencealignments AT lundole signisiteidentificationofresiduelevelgenotypephenotypecorrelationsinproteinmultiplesequencealignments AT nielsenmorten signisiteidentificationofresiduelevelgenotypephenotypecorrelationsinproteinmultiplesequencealignments |