Cargando…
Data mining of plasma peptide chromatograms for biomarkers of air contaminant exposures
BACKGROUND: Interrogation of chromatographic data for biomarker discovery becomes a tedious task due to stochastic variability in retention times arising from solvent and column performance. The difficulty is further compounded when the effects of exposure (e.g. to environmental contaminants) and bi...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2270821/ https://www.ncbi.nlm.nih.gov/pubmed/18234112 http://dx.doi.org/10.1186/1477-5956-6-6 |
_version_ | 1782151768321294336 |
---|---|
author | Karthikeyan, Subramanian Kumarathasan, Premkumari Vincent, Renaud |
author_facet | Karthikeyan, Subramanian Kumarathasan, Premkumari Vincent, Renaud |
author_sort | Karthikeyan, Subramanian |
collection | PubMed |
description | BACKGROUND: Interrogation of chromatographic data for biomarker discovery becomes a tedious task due to stochastic variability in retention times arising from solvent and column performance. The difficulty is further compounded when the effects of exposure (e.g. to environmental contaminants) and biological variability result in varying numbers and intensities of peaks among chromatograms. RESULTS: We developed a software tool to correct the stochastic time shifts in chromatographic data through iterative selection of landmark peaks and isometric interpolation to improve alignment of all chromatographic peaks. To illustrate application of the tool, plasma peptides from Fischer rats exposed for 4 h to clean air or Ottawa urban particles (EHC-93) were separated by HPLC with autofluorescence detection, and the retention time shifts between chromatograms were corrected (dewarped). Both dewarped and non-dewarped datasets were then mined for models containing peptide peaks that best discriminate among the treatment groups using ClinproTools™. In general, models generated by dewarped datasets were able to better classify test sample chromatograms into either clean air or EHC-93 exposure groups, and 0 or 24 h post-recovery time groups. Peak areas of peptides in a model that produced the best discrimination of treatment groups were analyzed by two-way ANOVA with exposure (clean air, EHC-93) and recovery time (0 h, 24 h) as factors. Statistically significant (p < 0.05) time-dependent and exposure-dependent increases and decreases were noted establishing these as biomarker candidates for further validation. CONCLUSION: Our software tool provides a simple and portable approach for alignment of chromatograms with complex, bi-directional retention time shifts prior to data mining. Reliable biomarker discovery can be achieved through chromatographic dewarping using our software followed by pattern recognition by commercial data mining applications. |
format | Text |
id | pubmed-2270821 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-22708212008-03-21 Data mining of plasma peptide chromatograms for biomarkers of air contaminant exposures Karthikeyan, Subramanian Kumarathasan, Premkumari Vincent, Renaud Proteome Sci Methodology BACKGROUND: Interrogation of chromatographic data for biomarker discovery becomes a tedious task due to stochastic variability in retention times arising from solvent and column performance. The difficulty is further compounded when the effects of exposure (e.g. to environmental contaminants) and biological variability result in varying numbers and intensities of peaks among chromatograms. RESULTS: We developed a software tool to correct the stochastic time shifts in chromatographic data through iterative selection of landmark peaks and isometric interpolation to improve alignment of all chromatographic peaks. To illustrate application of the tool, plasma peptides from Fischer rats exposed for 4 h to clean air or Ottawa urban particles (EHC-93) were separated by HPLC with autofluorescence detection, and the retention time shifts between chromatograms were corrected (dewarped). Both dewarped and non-dewarped datasets were then mined for models containing peptide peaks that best discriminate among the treatment groups using ClinproTools™. In general, models generated by dewarped datasets were able to better classify test sample chromatograms into either clean air or EHC-93 exposure groups, and 0 or 24 h post-recovery time groups. Peak areas of peptides in a model that produced the best discrimination of treatment groups were analyzed by two-way ANOVA with exposure (clean air, EHC-93) and recovery time (0 h, 24 h) as factors. Statistically significant (p < 0.05) time-dependent and exposure-dependent increases and decreases were noted establishing these as biomarker candidates for further validation. CONCLUSION: Our software tool provides a simple and portable approach for alignment of chromatograms with complex, bi-directional retention time shifts prior to data mining. Reliable biomarker discovery can be achieved through chromatographic dewarping using our software followed by pattern recognition by commercial data mining applications. BioMed Central 2008-01-30 /pmc/articles/PMC2270821/ /pubmed/18234112 http://dx.doi.org/10.1186/1477-5956-6-6 Text en Copyright © 2008 Karthikeyan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Karthikeyan, Subramanian Kumarathasan, Premkumari Vincent, Renaud Data mining of plasma peptide chromatograms for biomarkers of air contaminant exposures |
title | Data mining of plasma peptide chromatograms for biomarkers of air contaminant exposures |
title_full | Data mining of plasma peptide chromatograms for biomarkers of air contaminant exposures |
title_fullStr | Data mining of plasma peptide chromatograms for biomarkers of air contaminant exposures |
title_full_unstemmed | Data mining of plasma peptide chromatograms for biomarkers of air contaminant exposures |
title_short | Data mining of plasma peptide chromatograms for biomarkers of air contaminant exposures |
title_sort | data mining of plasma peptide chromatograms for biomarkers of air contaminant exposures |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2270821/ https://www.ncbi.nlm.nih.gov/pubmed/18234112 http://dx.doi.org/10.1186/1477-5956-6-6 |
work_keys_str_mv | AT karthikeyansubramanian dataminingofplasmapeptidechromatogramsforbiomarkersofaircontaminantexposures AT kumarathasanpremkumari dataminingofplasmapeptidechromatogramsforbiomarkersofaircontaminantexposures AT vincentrenaud dataminingofplasmapeptidechromatogramsforbiomarkersofaircontaminantexposures |