Cargando…

SCMMTP: identifying and characterizing membrane transport proteins using propensity scores of dipeptides

BACKGROUND: Identifying putative membrane transport proteins (MTPs) and understanding the transport mechanisms involved remain important challenges for the advancement of structural and functional genomics. However, the transporter characters are mainly acquired from MTP crystal structures which are...

Descripción completa

Detalles Bibliográficos
Autores principales: Liou, Yi-Fan, Vasylenko, Tamara, Yeh, Chia-Lun, Lin, Wei-Chun, Chiu, Shih-Hsiang, Charoenkwan, Phasit, Shu, Li-Sun, Ho, Shinn-Ying, Huang, Hui-Ling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4682407/
https://www.ncbi.nlm.nih.gov/pubmed/26677931
http://dx.doi.org/10.1186/1471-2164-16-S12-S6
_version_ 1782405884425535488
author Liou, Yi-Fan
Vasylenko, Tamara
Yeh, Chia-Lun
Lin, Wei-Chun
Chiu, Shih-Hsiang
Charoenkwan, Phasit
Shu, Li-Sun
Ho, Shinn-Ying
Huang, Hui-Ling
author_facet Liou, Yi-Fan
Vasylenko, Tamara
Yeh, Chia-Lun
Lin, Wei-Chun
Chiu, Shih-Hsiang
Charoenkwan, Phasit
Shu, Li-Sun
Ho, Shinn-Ying
Huang, Hui-Ling
author_sort Liou, Yi-Fan
collection PubMed
description BACKGROUND: Identifying putative membrane transport proteins (MTPs) and understanding the transport mechanisms involved remain important challenges for the advancement of structural and functional genomics. However, the transporter characters are mainly acquired from MTP crystal structures which are hard to crystalize. Therefore, it is desirable to develop bioinformatics tools for the effective large-scale analysis of available sequences to identify novel transporters and characterize such transporters. RESULTS: This work proposes a novel method (SCMMTP) based on the scoring card method (SCM) using dipeptide composition to identify and characterize MTPs from an existing dataset containing 900 MTPs and 660 non-MTPs which are separated into a training dataset consisting 1,380 proteins and an independent dataset consisting 180 proteins. The SCMMTP produced estimating propensity scores for amino acids and dipeptides as MTPs. The SCMMTP training and test accuracy levels respectively reached 83.81% and 76.11%. The test accuracy of support vector machine (SVM) using a complicated classification method with a low possibility for biological interpretation and position-specific substitution matrix (PSSM) as a protein feature is 80.56%, thus SCMMTP is comparable to SVM-PSSM. To identify MTPs, SCMMTP is applied to three datasets including: 1) human transmembrane proteins, 2) a photosynthetic protein dataset, and 3) a human protein database. MTPs showing α-helix rich structure is agreed with previous studies. The MTPs used residues with low hydration energy. It is hypothesized that, after filtering substrates, the hydrated water molecules need to be released from the pore regions. CONCLUSIONS: SCMMTP yields estimating propensity scores for amino acids and dipeptides as MTPs, which can be used to identify novel MTPs and characterize transport mechanisms for use in further experiments. AVAILABILITY: http://iclab.life.nctu.edu.tw/iclab_webtools/SCMMTP/
format Online
Article
Text
id pubmed-4682407
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46824072015-12-21 SCMMTP: identifying and characterizing membrane transport proteins using propensity scores of dipeptides Liou, Yi-Fan Vasylenko, Tamara Yeh, Chia-Lun Lin, Wei-Chun Chiu, Shih-Hsiang Charoenkwan, Phasit Shu, Li-Sun Ho, Shinn-Ying Huang, Hui-Ling BMC Genomics Research BACKGROUND: Identifying putative membrane transport proteins (MTPs) and understanding the transport mechanisms involved remain important challenges for the advancement of structural and functional genomics. However, the transporter characters are mainly acquired from MTP crystal structures which are hard to crystalize. Therefore, it is desirable to develop bioinformatics tools for the effective large-scale analysis of available sequences to identify novel transporters and characterize such transporters. RESULTS: This work proposes a novel method (SCMMTP) based on the scoring card method (SCM) using dipeptide composition to identify and characterize MTPs from an existing dataset containing 900 MTPs and 660 non-MTPs which are separated into a training dataset consisting 1,380 proteins and an independent dataset consisting 180 proteins. The SCMMTP produced estimating propensity scores for amino acids and dipeptides as MTPs. The SCMMTP training and test accuracy levels respectively reached 83.81% and 76.11%. The test accuracy of support vector machine (SVM) using a complicated classification method with a low possibility for biological interpretation and position-specific substitution matrix (PSSM) as a protein feature is 80.56%, thus SCMMTP is comparable to SVM-PSSM. To identify MTPs, SCMMTP is applied to three datasets including: 1) human transmembrane proteins, 2) a photosynthetic protein dataset, and 3) a human protein database. MTPs showing α-helix rich structure is agreed with previous studies. The MTPs used residues with low hydration energy. It is hypothesized that, after filtering substrates, the hydrated water molecules need to be released from the pore regions. CONCLUSIONS: SCMMTP yields estimating propensity scores for amino acids and dipeptides as MTPs, which can be used to identify novel MTPs and characterize transport mechanisms for use in further experiments. AVAILABILITY: http://iclab.life.nctu.edu.tw/iclab_webtools/SCMMTP/ BioMed Central 2015-12-09 /pmc/articles/PMC4682407/ /pubmed/26677931 http://dx.doi.org/10.1186/1471-2164-16-S12-S6 Text en Copyright © 2015 Liou et al. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Liou, Yi-Fan
Vasylenko, Tamara
Yeh, Chia-Lun
Lin, Wei-Chun
Chiu, Shih-Hsiang
Charoenkwan, Phasit
Shu, Li-Sun
Ho, Shinn-Ying
Huang, Hui-Ling
SCMMTP: identifying and characterizing membrane transport proteins using propensity scores of dipeptides
title SCMMTP: identifying and characterizing membrane transport proteins using propensity scores of dipeptides
title_full SCMMTP: identifying and characterizing membrane transport proteins using propensity scores of dipeptides
title_fullStr SCMMTP: identifying and characterizing membrane transport proteins using propensity scores of dipeptides
title_full_unstemmed SCMMTP: identifying and characterizing membrane transport proteins using propensity scores of dipeptides
title_short SCMMTP: identifying and characterizing membrane transport proteins using propensity scores of dipeptides
title_sort scmmtp: identifying and characterizing membrane transport proteins using propensity scores of dipeptides
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4682407/
https://www.ncbi.nlm.nih.gov/pubmed/26677931
http://dx.doi.org/10.1186/1471-2164-16-S12-S6
work_keys_str_mv AT liouyifan scmmtpidentifyingandcharacterizingmembranetransportproteinsusingpropensityscoresofdipeptides
AT vasylenkotamara scmmtpidentifyingandcharacterizingmembranetransportproteinsusingpropensityscoresofdipeptides
AT yehchialun scmmtpidentifyingandcharacterizingmembranetransportproteinsusingpropensityscoresofdipeptides
AT linweichun scmmtpidentifyingandcharacterizingmembranetransportproteinsusingpropensityscoresofdipeptides
AT chiushihhsiang scmmtpidentifyingandcharacterizingmembranetransportproteinsusingpropensityscoresofdipeptides
AT charoenkwanphasit scmmtpidentifyingandcharacterizingmembranetransportproteinsusingpropensityscoresofdipeptides
AT shulisun scmmtpidentifyingandcharacterizingmembranetransportproteinsusingpropensityscoresofdipeptides
AT hoshinnying scmmtpidentifyingandcharacterizingmembranetransportproteinsusingpropensityscoresofdipeptides
AT huanghuiling scmmtpidentifyingandcharacterizingmembranetransportproteinsusingpropensityscoresofdipeptides