Cargando…
A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles
BACKGROUND: The discovery of biomarkers is an important step towards the development of criteria for early diagnosis of disease status. Recently electrospray ionization (ESI) and matrix assisted laser desorption (MALDI) time-of-flight (TOF) mass spectrometry have been used to identify biomarkers bot...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2253516/ https://www.ncbi.nlm.nih.gov/pubmed/18215299 http://dx.doi.org/10.1186/1471-2105-9-38 |
_version_ | 1782151111269941248 |
---|---|
author | Ghosh, Samiran Grant, David F Dey, Dipak K Hill, Dennis W |
author_facet | Ghosh, Samiran Grant, David F Dey, Dipak K Hill, Dennis W |
author_sort | Ghosh, Samiran |
collection | PubMed |
description | BACKGROUND: The discovery of biomarkers is an important step towards the development of criteria for early diagnosis of disease status. Recently electrospray ionization (ESI) and matrix assisted laser desorption (MALDI) time-of-flight (TOF) mass spectrometry have been used to identify biomarkers both in proteomics and metabonomics studies. Data sets generated from such studies are generally very large in size and thus require the use of sophisticated statistical techniques to glean useful information. Most recent attempts to process these types of data model each compound's intensity either discretely by positional (mass to charge ratio) clustering or through each compounds' own intensity distribution. Traditionally data processing steps such as noise removal, background elimination and m/z alignment, are generally carried out separately resulting in unsatisfactory propagation of signals in the final model. RESULTS: In the present study a novel semi-parametric approach has been developed to distinguish urinary metabolic profiles in a group of traumatic patients from those of a control group consisting of normal individuals. Data sets obtained from the replicates of a single subject were used to develop a functional profile through Dirichlet mixture of beta distribution. This functional profile is flexible enough to accommodate variability of the instrument and the inherent variability of each individual, thus simultaneously addressing different sources of systematic error. To address instrument variability, all data sets were analyzed in replicate, an important issue ignored by most studies in the past. Different model comparisons were performed to select the best model for each subject. The m/z values in the window of the irregular pattern are then further recommended for possible biomarker discovery. CONCLUSION: To the best of our knowledge this is the very first attempt to model the physical process behind the time-of flight mass spectrometry. Most of the state of the art techniques does not take these physical principles in consideration while modeling such data. The proposed modeling process will apply as long as the basic physical principle presented in this paper is valid. Notably we have confined our present work mostly within the modeling aspect. Nevertheless clinical validation of our recommended list of potential biomarkers will be required. Hence, we have termed our modeling approach as a "framework" for further work. |
format | Text |
id | pubmed-2253516 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-22535162008-02-25 A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles Ghosh, Samiran Grant, David F Dey, Dipak K Hill, Dennis W BMC Bioinformatics Research Article BACKGROUND: The discovery of biomarkers is an important step towards the development of criteria for early diagnosis of disease status. Recently electrospray ionization (ESI) and matrix assisted laser desorption (MALDI) time-of-flight (TOF) mass spectrometry have been used to identify biomarkers both in proteomics and metabonomics studies. Data sets generated from such studies are generally very large in size and thus require the use of sophisticated statistical techniques to glean useful information. Most recent attempts to process these types of data model each compound's intensity either discretely by positional (mass to charge ratio) clustering or through each compounds' own intensity distribution. Traditionally data processing steps such as noise removal, background elimination and m/z alignment, are generally carried out separately resulting in unsatisfactory propagation of signals in the final model. RESULTS: In the present study a novel semi-parametric approach has been developed to distinguish urinary metabolic profiles in a group of traumatic patients from those of a control group consisting of normal individuals. Data sets obtained from the replicates of a single subject were used to develop a functional profile through Dirichlet mixture of beta distribution. This functional profile is flexible enough to accommodate variability of the instrument and the inherent variability of each individual, thus simultaneously addressing different sources of systematic error. To address instrument variability, all data sets were analyzed in replicate, an important issue ignored by most studies in the past. Different model comparisons were performed to select the best model for each subject. The m/z values in the window of the irregular pattern are then further recommended for possible biomarker discovery. CONCLUSION: To the best of our knowledge this is the very first attempt to model the physical process behind the time-of flight mass spectrometry. Most of the state of the art techniques does not take these physical principles in consideration while modeling such data. The proposed modeling process will apply as long as the basic physical principle presented in this paper is valid. Notably we have confined our present work mostly within the modeling aspect. Nevertheless clinical validation of our recommended list of potential biomarkers will be required. Hence, we have termed our modeling approach as a "framework" for further work. BioMed Central 2008-01-23 /pmc/articles/PMC2253516/ /pubmed/18215299 http://dx.doi.org/10.1186/1471-2105-9-38 Text en Copyright © 2008 Ghosh et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Ghosh, Samiran Grant, David F Dey, Dipak K Hill, Dennis W A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles |
title | A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles |
title_full | A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles |
title_fullStr | A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles |
title_full_unstemmed | A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles |
title_short | A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles |
title_sort | semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2253516/ https://www.ncbi.nlm.nih.gov/pubmed/18215299 http://dx.doi.org/10.1186/1471-2105-9-38 |
work_keys_str_mv | AT ghoshsamiran asemiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles AT grantdavidf asemiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles AT deydipakk asemiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles AT hilldennisw asemiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles AT ghoshsamiran semiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles AT grantdavidf semiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles AT deydipakk semiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles AT hilldennisw semiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles |