Cargando…

Prediction and Characterization of Missing Proteomic Data in Desulfovibrio vulgaris

Proteomic datasets are often incomplete due to identification range and sensitivity issues. It becomes important to develop methodologies to estimate missing proteomic data, allowing better interpretation of proteomic datasets and metabolic mechanisms underlying complex biological systems. In this s...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Feng, Nie, Lei, Wu, Gang, Qiao, Jianjun, Zhang, Weiwen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3114432/
https://www.ncbi.nlm.nih.gov/pubmed/21687592
http://dx.doi.org/10.1155/2011/780973
_version_ 1782206064536584192
author Li, Feng
Nie, Lei
Wu, Gang
Qiao, Jianjun
Zhang, Weiwen
author_facet Li, Feng
Nie, Lei
Wu, Gang
Qiao, Jianjun
Zhang, Weiwen
author_sort Li, Feng
collection PubMed
description Proteomic datasets are often incomplete due to identification range and sensitivity issues. It becomes important to develop methodologies to estimate missing proteomic data, allowing better interpretation of proteomic datasets and metabolic mechanisms underlying complex biological systems. In this study, we applied an artificial neural network to approximate the relationships between cognate transcriptomic and proteomic datasets of Desulfovibrio vulgaris, and to predict protein abundance for the proteins not experimentally detected, based on several relevant predictors, such as mRNA abundance, cellular role and triple codon counts. The results showed that the coefficients of determination for the trained neural network models ranged from 0.47 to 0.68, providing better modeling than several previous regression models. The validity of the trained neural network model was evaluated using biological information (i.e. operons). To seek understanding of mechanisms causing missing proteomic data, we used a multivariate logistic regression analysis and the result suggested that some key factors, such as protein instability index, aliphatic index, mRNA abundance, effective number of codons (N (c)) and codon adaptation index (CAI) values may be ascribed to whether a given expressed protein can be detected. In addition, we demonstrated that biological interpretation can be improved by use of imputed proteomic datasets.
format Online
Article
Text
id pubmed-3114432
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-31144322011-06-17 Prediction and Characterization of Missing Proteomic Data in Desulfovibrio vulgaris Li, Feng Nie, Lei Wu, Gang Qiao, Jianjun Zhang, Weiwen Comp Funct Genomics Research Article Proteomic datasets are often incomplete due to identification range and sensitivity issues. It becomes important to develop methodologies to estimate missing proteomic data, allowing better interpretation of proteomic datasets and metabolic mechanisms underlying complex biological systems. In this study, we applied an artificial neural network to approximate the relationships between cognate transcriptomic and proteomic datasets of Desulfovibrio vulgaris, and to predict protein abundance for the proteins not experimentally detected, based on several relevant predictors, such as mRNA abundance, cellular role and triple codon counts. The results showed that the coefficients of determination for the trained neural network models ranged from 0.47 to 0.68, providing better modeling than several previous regression models. The validity of the trained neural network model was evaluated using biological information (i.e. operons). To seek understanding of mechanisms causing missing proteomic data, we used a multivariate logistic regression analysis and the result suggested that some key factors, such as protein instability index, aliphatic index, mRNA abundance, effective number of codons (N (c)) and codon adaptation index (CAI) values may be ascribed to whether a given expressed protein can be detected. In addition, we demonstrated that biological interpretation can be improved by use of imputed proteomic datasets. Hindawi Publishing Corporation 2011 2011-05-04 /pmc/articles/PMC3114432/ /pubmed/21687592 http://dx.doi.org/10.1155/2011/780973 Text en Copyright © 2011 Feng Li et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Li, Feng
Nie, Lei
Wu, Gang
Qiao, Jianjun
Zhang, Weiwen
Prediction and Characterization of Missing Proteomic Data in Desulfovibrio vulgaris
title Prediction and Characterization of Missing Proteomic Data in Desulfovibrio vulgaris
title_full Prediction and Characterization of Missing Proteomic Data in Desulfovibrio vulgaris
title_fullStr Prediction and Characterization of Missing Proteomic Data in Desulfovibrio vulgaris
title_full_unstemmed Prediction and Characterization of Missing Proteomic Data in Desulfovibrio vulgaris
title_short Prediction and Characterization of Missing Proteomic Data in Desulfovibrio vulgaris
title_sort prediction and characterization of missing proteomic data in desulfovibrio vulgaris
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3114432/
https://www.ncbi.nlm.nih.gov/pubmed/21687592
http://dx.doi.org/10.1155/2011/780973
work_keys_str_mv AT lifeng predictionandcharacterizationofmissingproteomicdataindesulfovibriovulgaris
AT nielei predictionandcharacterizationofmissingproteomicdataindesulfovibriovulgaris
AT wugang predictionandcharacterizationofmissingproteomicdataindesulfovibriovulgaris
AT qiaojianjun predictionandcharacterizationofmissingproteomicdataindesulfovibriovulgaris
AT zhangweiwen predictionandcharacterizationofmissingproteomicdataindesulfovibriovulgaris