Cargando…
Using cellular automata images and pseudo amino acid composition to predict protein subcellular location
The avalanche of newly found protein sequences in the post-genomic era has motivated and challenged us to develop an automated method that can rapidly and accurately predict the localization of an uncharacterized protein in cells because the knowledge thus obtained can greatly speed up the process i...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer-Verlag
2005
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7087770/ https://www.ncbi.nlm.nih.gov/pubmed/16044193 http://dx.doi.org/10.1007/s00726-005-0225-6 |
_version_ | 1783509401023283200 |
---|---|
author | Xiao, X. Shao, S. Ding, Y. Huang, Z. Chou, K.-C. |
author_facet | Xiao, X. Shao, S. Ding, Y. Huang, Z. Chou, K.-C. |
author_sort | Xiao, X. |
collection | PubMed |
description | The avalanche of newly found protein sequences in the post-genomic era has motivated and challenged us to develop an automated method that can rapidly and accurately predict the localization of an uncharacterized protein in cells because the knowledge thus obtained can greatly speed up the process in finding its biological functions. However, it is very difficult to establish such a desired predictor by acquiring the key statistical information buried in a pile of extremely complicated and highly variable sequences. In this paper, based on the concept of the pseudo amino acid composition (Chou, K. C. PROTEINS: Structure, Function, and Genetics, 2001, 43: 246–255), the approach of cellular automata image is introduced to cope with this problem. Many important features, which are originally hidden in the long amino acid sequences, can be clearly displayed through their cellular automata images. One of the remarkable merits by doing so is that many image recognition tools can be straightforwardly applied to the target aimed here. High success rates were observed through the self-consistency, jackknife, and independent dataset tests, respectively. |
format | Online Article Text |
id | pubmed-7087770 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | Springer-Verlag |
record_format | MEDLINE/PubMed |
spelling | pubmed-70877702020-03-23 Using cellular automata images and pseudo amino acid composition to predict protein subcellular location Xiao, X. Shao, S. Ding, Y. Huang, Z. Chou, K.-C. Amino Acids Article The avalanche of newly found protein sequences in the post-genomic era has motivated and challenged us to develop an automated method that can rapidly and accurately predict the localization of an uncharacterized protein in cells because the knowledge thus obtained can greatly speed up the process in finding its biological functions. However, it is very difficult to establish such a desired predictor by acquiring the key statistical information buried in a pile of extremely complicated and highly variable sequences. In this paper, based on the concept of the pseudo amino acid composition (Chou, K. C. PROTEINS: Structure, Function, and Genetics, 2001, 43: 246–255), the approach of cellular automata image is introduced to cope with this problem. Many important features, which are originally hidden in the long amino acid sequences, can be clearly displayed through their cellular automata images. One of the remarkable merits by doing so is that many image recognition tools can be straightforwardly applied to the target aimed here. High success rates were observed through the self-consistency, jackknife, and independent dataset tests, respectively. Springer-Verlag 2005-07-28 2006 /pmc/articles/PMC7087770/ /pubmed/16044193 http://dx.doi.org/10.1007/s00726-005-0225-6 Text en © Springer-Verlag/Wien 2005 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Xiao, X. Shao, S. Ding, Y. Huang, Z. Chou, K.-C. Using cellular automata images and pseudo amino acid composition to predict protein subcellular location |
title | Using cellular automata images and pseudo amino acid composition to predict protein subcellular location |
title_full | Using cellular automata images and pseudo amino acid composition to predict protein subcellular location |
title_fullStr | Using cellular automata images and pseudo amino acid composition to predict protein subcellular location |
title_full_unstemmed | Using cellular automata images and pseudo amino acid composition to predict protein subcellular location |
title_short | Using cellular automata images and pseudo amino acid composition to predict protein subcellular location |
title_sort | using cellular automata images and pseudo amino acid composition to predict protein subcellular location |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7087770/ https://www.ncbi.nlm.nih.gov/pubmed/16044193 http://dx.doi.org/10.1007/s00726-005-0225-6 |
work_keys_str_mv | AT xiaox usingcellularautomataimagesandpseudoaminoacidcompositiontopredictproteinsubcellularlocation AT shaos usingcellularautomataimagesandpseudoaminoacidcompositiontopredictproteinsubcellularlocation AT dingy usingcellularautomataimagesandpseudoaminoacidcompositiontopredictproteinsubcellularlocation AT huangz usingcellularautomataimagesandpseudoaminoacidcompositiontopredictproteinsubcellularlocation AT choukc usingcellularautomataimagesandpseudoaminoacidcompositiontopredictproteinsubcellularlocation |