Cargando…

Prediction of multi-drug resistance transporters using a novel sequence analysis method

There are many examples of groups of proteins that have similar function, but the determinants of functional specificity may be hidden by lack of sequence similarity, or by large groups of similar sequences with different functions. Transporters are one such protein group in that the general functio...

Descripción completa

Detalles Bibliográficos
Autores principales: McDermott, Jason E., Bruillard, Paul, Overall, Christopher C., Gosink, Luke, Lindemann, Stephen R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000Research 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4743146/
https://www.ncbi.nlm.nih.gov/pubmed/26913187
http://dx.doi.org/10.12688/f1000research.6200.2
_version_ 1782414306898345984
author McDermott, Jason E.
Bruillard, Paul
Overall, Christopher C.
Gosink, Luke
Lindemann, Stephen R.
author_facet McDermott, Jason E.
Bruillard, Paul
Overall, Christopher C.
Gosink, Luke
Lindemann, Stephen R.
author_sort McDermott, Jason E.
collection PubMed
description There are many examples of groups of proteins that have similar function, but the determinants of functional specificity may be hidden by lack of sequence similarity, or by large groups of similar sequences with different functions. Transporters are one such protein group in that the general function, transport, can be easily inferred from the sequence, but the substrate specificity can be impossible to predict from sequence with current methods. In this paper we describe a linguistic-based approach to identify functional patterns from groups of unaligned protein sequences and its application to predict multi-drug resistance transporters (MDRs) from bacteria. We first show that our method can recreate known patterns from PROSITE for several motifs from unaligned sequences. We then show that the method, MDRpred, can predict MDRs with greater accuracy and positive predictive value than a collection of currently available family-based models from the Pfam database. Finally, we apply MDRpred to a large collection of protein sequences from an environmental microbiome study to make novel predictions about drug resistance in a potential environmental reservoir.
format Online
Article
Text
id pubmed-4743146
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher F1000Research
record_format MEDLINE/PubMed
spelling pubmed-47431462016-02-23 Prediction of multi-drug resistance transporters using a novel sequence analysis method McDermott, Jason E. Bruillard, Paul Overall, Christopher C. Gosink, Luke Lindemann, Stephen R. F1000Res Method Article There are many examples of groups of proteins that have similar function, but the determinants of functional specificity may be hidden by lack of sequence similarity, or by large groups of similar sequences with different functions. Transporters are one such protein group in that the general function, transport, can be easily inferred from the sequence, but the substrate specificity can be impossible to predict from sequence with current methods. In this paper we describe a linguistic-based approach to identify functional patterns from groups of unaligned protein sequences and its application to predict multi-drug resistance transporters (MDRs) from bacteria. We first show that our method can recreate known patterns from PROSITE for several motifs from unaligned sequences. We then show that the method, MDRpred, can predict MDRs with greater accuracy and positive predictive value than a collection of currently available family-based models from the Pfam database. Finally, we apply MDRpred to a large collection of protein sequences from an environmental microbiome study to make novel predictions about drug resistance in a potential environmental reservoir. F1000Research 2015-05-29 /pmc/articles/PMC4743146/ /pubmed/26913187 http://dx.doi.org/10.12688/f1000research.6200.2 Text en Copyright: © 2015 McDermott JE et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method Article
McDermott, Jason E.
Bruillard, Paul
Overall, Christopher C.
Gosink, Luke
Lindemann, Stephen R.
Prediction of multi-drug resistance transporters using a novel sequence analysis method
title Prediction of multi-drug resistance transporters using a novel sequence analysis method
title_full Prediction of multi-drug resistance transporters using a novel sequence analysis method
title_fullStr Prediction of multi-drug resistance transporters using a novel sequence analysis method
title_full_unstemmed Prediction of multi-drug resistance transporters using a novel sequence analysis method
title_short Prediction of multi-drug resistance transporters using a novel sequence analysis method
title_sort prediction of multi-drug resistance transporters using a novel sequence analysis method
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4743146/
https://www.ncbi.nlm.nih.gov/pubmed/26913187
http://dx.doi.org/10.12688/f1000research.6200.2
work_keys_str_mv AT mcdermottjasone predictionofmultidrugresistancetransportersusinganovelsequenceanalysismethod
AT bruillardpaul predictionofmultidrugresistancetransportersusinganovelsequenceanalysismethod
AT overallchristopherc predictionofmultidrugresistancetransportersusinganovelsequenceanalysismethod
AT gosinkluke predictionofmultidrugresistancetransportersusinganovelsequenceanalysismethod
AT lindemannstephenr predictionofmultidrugresistancetransportersusinganovelsequenceanalysismethod