Cargando…
Prediction of multi-drug resistance transporters using a novel sequence analysis method
There are many examples of groups of proteins that have similar function, but the determinants of functional specificity may be hidden by lack of sequence similarity, or by large groups of similar sequences with different functions. Transporters are one such protein group in that the general functio...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000Research
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4743146/ https://www.ncbi.nlm.nih.gov/pubmed/26913187 http://dx.doi.org/10.12688/f1000research.6200.2 |
_version_ | 1782414306898345984 |
---|---|
author | McDermott, Jason E. Bruillard, Paul Overall, Christopher C. Gosink, Luke Lindemann, Stephen R. |
author_facet | McDermott, Jason E. Bruillard, Paul Overall, Christopher C. Gosink, Luke Lindemann, Stephen R. |
author_sort | McDermott, Jason E. |
collection | PubMed |
description | There are many examples of groups of proteins that have similar function, but the determinants of functional specificity may be hidden by lack of sequence similarity, or by large groups of similar sequences with different functions. Transporters are one such protein group in that the general function, transport, can be easily inferred from the sequence, but the substrate specificity can be impossible to predict from sequence with current methods. In this paper we describe a linguistic-based approach to identify functional patterns from groups of unaligned protein sequences and its application to predict multi-drug resistance transporters (MDRs) from bacteria. We first show that our method can recreate known patterns from PROSITE for several motifs from unaligned sequences. We then show that the method, MDRpred, can predict MDRs with greater accuracy and positive predictive value than a collection of currently available family-based models from the Pfam database. Finally, we apply MDRpred to a large collection of protein sequences from an environmental microbiome study to make novel predictions about drug resistance in a potential environmental reservoir. |
format | Online Article Text |
id | pubmed-4743146 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | F1000Research |
record_format | MEDLINE/PubMed |
spelling | pubmed-47431462016-02-23 Prediction of multi-drug resistance transporters using a novel sequence analysis method McDermott, Jason E. Bruillard, Paul Overall, Christopher C. Gosink, Luke Lindemann, Stephen R. F1000Res Method Article There are many examples of groups of proteins that have similar function, but the determinants of functional specificity may be hidden by lack of sequence similarity, or by large groups of similar sequences with different functions. Transporters are one such protein group in that the general function, transport, can be easily inferred from the sequence, but the substrate specificity can be impossible to predict from sequence with current methods. In this paper we describe a linguistic-based approach to identify functional patterns from groups of unaligned protein sequences and its application to predict multi-drug resistance transporters (MDRs) from bacteria. We first show that our method can recreate known patterns from PROSITE for several motifs from unaligned sequences. We then show that the method, MDRpred, can predict MDRs with greater accuracy and positive predictive value than a collection of currently available family-based models from the Pfam database. Finally, we apply MDRpred to a large collection of protein sequences from an environmental microbiome study to make novel predictions about drug resistance in a potential environmental reservoir. F1000Research 2015-05-29 /pmc/articles/PMC4743146/ /pubmed/26913187 http://dx.doi.org/10.12688/f1000research.6200.2 Text en Copyright: © 2015 McDermott JE et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Method Article McDermott, Jason E. Bruillard, Paul Overall, Christopher C. Gosink, Luke Lindemann, Stephen R. Prediction of multi-drug resistance transporters using a novel sequence analysis method |
title | Prediction of multi-drug resistance transporters using a novel sequence analysis method |
title_full | Prediction of multi-drug resistance transporters using a novel sequence analysis method |
title_fullStr | Prediction of multi-drug resistance transporters using a novel sequence analysis method |
title_full_unstemmed | Prediction of multi-drug resistance transporters using a novel sequence analysis method |
title_short | Prediction of multi-drug resistance transporters using a novel sequence analysis method |
title_sort | prediction of multi-drug resistance transporters using a novel sequence analysis method |
topic | Method Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4743146/ https://www.ncbi.nlm.nih.gov/pubmed/26913187 http://dx.doi.org/10.12688/f1000research.6200.2 |
work_keys_str_mv | AT mcdermottjasone predictionofmultidrugresistancetransportersusinganovelsequenceanalysismethod AT bruillardpaul predictionofmultidrugresistancetransportersusinganovelsequenceanalysismethod AT overallchristopherc predictionofmultidrugresistancetransportersusinganovelsequenceanalysismethod AT gosinkluke predictionofmultidrugresistancetransportersusinganovelsequenceanalysismethod AT lindemannstephenr predictionofmultidrugresistancetransportersusinganovelsequenceanalysismethod |