Cargando…

Novel Approaches to Visualization and Data Mining Reveals Diagnostic Information in the Low Amplitude Region of Serum Mass Spectra from Ovarian Cancer Patients

The ability to identify patterns of diagnostic signatures in proteomic data generated by high throughput mass spectrometry (MS) based serum analysis has recently generated much excitement and interest from the scientific community. These data sets can be very large, with high-resolution MS instrumen...

Descripción completa

Detalles Bibliográficos
Autores principales: Johann, Donald J., McGuigan, Michael D., Tomov, Stanimire, Fusaro, Vincent A., Ross, Sally, Conrads, Thomas P., Veenstra, Timothy D., Fishman, David A., Whiteley, Gordon R., Petricoin, Emanuel F., Liotta, Lance A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: IOS Press 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3851062/
https://www.ncbi.nlm.nih.gov/pubmed/15258334
http://dx.doi.org/10.1155/2004/549372
_version_ 1782294216516304896
author Johann, Donald J.
McGuigan, Michael D.
Tomov, Stanimire
Fusaro, Vincent A.
Ross, Sally
Conrads, Thomas P.
Veenstra, Timothy D.
Fishman, David A.
Whiteley, Gordon R.
Petricoin, Emanuel F.
Liotta, Lance A.
author_facet Johann, Donald J.
McGuigan, Michael D.
Tomov, Stanimire
Fusaro, Vincent A.
Ross, Sally
Conrads, Thomas P.
Veenstra, Timothy D.
Fishman, David A.
Whiteley, Gordon R.
Petricoin, Emanuel F.
Liotta, Lance A.
author_sort Johann, Donald J.
collection PubMed
description The ability to identify patterns of diagnostic signatures in proteomic data generated by high throughput mass spectrometry (MS) based serum analysis has recently generated much excitement and interest from the scientific community. These data sets can be very large, with high-resolution MS instrumentation producing 1–2 million data points per sample. Approaches to analyze mass spectral data using unsupervised and supervised data mining operations would greatly benefit from tools that effectively allow for data reduction without losing important diagnostic information. In the past, investigators have proposed approaches where data reduction is performed by a priori “peak picking” and alignment/warping/smoothing components using rule-based signal-to-noise measurements. Unfortunately, while this type of system has been employed for gene microarray analysis, it is unclear whether it will be effective in the analysis of mass spectral data, which unlike microarray data, is comprised of continuous measurement operations. Moreover, it is unclear where true signal begins and noise ends. Therefore, we have developed an approach to MS data analysis using new types of data visualization and mining operations in which data reduction is accomplished by culling via the intensity of the peaks themselves instead of by location. Applying this new analysis method on a large study set of high resolution mass spectra from healthy and ovarian cancer patients, shows that all of the diagnostic information is contained within the very lowest amplitude regions of the mass spectra. This region can then be selected and studied to identify the exact location and amplitude of the diagnostic biomarkers.
format Online
Article
Text
id pubmed-3851062
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher IOS Press
record_format MEDLINE/PubMed
spelling pubmed-38510622013-12-17 Novel Approaches to Visualization and Data Mining Reveals Diagnostic Information in the Low Amplitude Region of Serum Mass Spectra from Ovarian Cancer Patients Johann, Donald J. McGuigan, Michael D. Tomov, Stanimire Fusaro, Vincent A. Ross, Sally Conrads, Thomas P. Veenstra, Timothy D. Fishman, David A. Whiteley, Gordon R. Petricoin, Emanuel F. Liotta, Lance A. Dis Markers Other The ability to identify patterns of diagnostic signatures in proteomic data generated by high throughput mass spectrometry (MS) based serum analysis has recently generated much excitement and interest from the scientific community. These data sets can be very large, with high-resolution MS instrumentation producing 1–2 million data points per sample. Approaches to analyze mass spectral data using unsupervised and supervised data mining operations would greatly benefit from tools that effectively allow for data reduction without losing important diagnostic information. In the past, investigators have proposed approaches where data reduction is performed by a priori “peak picking” and alignment/warping/smoothing components using rule-based signal-to-noise measurements. Unfortunately, while this type of system has been employed for gene microarray analysis, it is unclear whether it will be effective in the analysis of mass spectral data, which unlike microarray data, is comprised of continuous measurement operations. Moreover, it is unclear where true signal begins and noise ends. Therefore, we have developed an approach to MS data analysis using new types of data visualization and mining operations in which data reduction is accomplished by culling via the intensity of the peaks themselves instead of by location. Applying this new analysis method on a large study set of high resolution mass spectra from healthy and ovarian cancer patients, shows that all of the diagnostic information is contained within the very lowest amplitude regions of the mass spectra. This region can then be selected and studied to identify the exact location and amplitude of the diagnostic biomarkers. IOS Press 2004 2004-07-14 /pmc/articles/PMC3851062/ /pubmed/15258334 http://dx.doi.org/10.1155/2004/549372 Text en Copyright © 2004 Hindawi Publishing Corporation.
spellingShingle Other
Johann, Donald J.
McGuigan, Michael D.
Tomov, Stanimire
Fusaro, Vincent A.
Ross, Sally
Conrads, Thomas P.
Veenstra, Timothy D.
Fishman, David A.
Whiteley, Gordon R.
Petricoin, Emanuel F.
Liotta, Lance A.
Novel Approaches to Visualization and Data Mining Reveals Diagnostic Information in the Low Amplitude Region of Serum Mass Spectra from Ovarian Cancer Patients
title Novel Approaches to Visualization and Data Mining Reveals Diagnostic Information in the Low Amplitude Region of Serum Mass Spectra from Ovarian Cancer Patients
title_full Novel Approaches to Visualization and Data Mining Reveals Diagnostic Information in the Low Amplitude Region of Serum Mass Spectra from Ovarian Cancer Patients
title_fullStr Novel Approaches to Visualization and Data Mining Reveals Diagnostic Information in the Low Amplitude Region of Serum Mass Spectra from Ovarian Cancer Patients
title_full_unstemmed Novel Approaches to Visualization and Data Mining Reveals Diagnostic Information in the Low Amplitude Region of Serum Mass Spectra from Ovarian Cancer Patients
title_short Novel Approaches to Visualization and Data Mining Reveals Diagnostic Information in the Low Amplitude Region of Serum Mass Spectra from Ovarian Cancer Patients
title_sort novel approaches to visualization and data mining reveals diagnostic information in the low amplitude region of serum mass spectra from ovarian cancer patients
topic Other
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3851062/
https://www.ncbi.nlm.nih.gov/pubmed/15258334
http://dx.doi.org/10.1155/2004/549372
work_keys_str_mv AT johanndonaldj novelapproachestovisualizationanddataminingrevealsdiagnosticinformationinthelowamplituderegionofserummassspectrafromovariancancerpatients
AT mcguiganmichaeld novelapproachestovisualizationanddataminingrevealsdiagnosticinformationinthelowamplituderegionofserummassspectrafromovariancancerpatients
AT tomovstanimire novelapproachestovisualizationanddataminingrevealsdiagnosticinformationinthelowamplituderegionofserummassspectrafromovariancancerpatients
AT fusarovincenta novelapproachestovisualizationanddataminingrevealsdiagnosticinformationinthelowamplituderegionofserummassspectrafromovariancancerpatients
AT rosssally novelapproachestovisualizationanddataminingrevealsdiagnosticinformationinthelowamplituderegionofserummassspectrafromovariancancerpatients
AT conradsthomasp novelapproachestovisualizationanddataminingrevealsdiagnosticinformationinthelowamplituderegionofserummassspectrafromovariancancerpatients
AT veenstratimothyd novelapproachestovisualizationanddataminingrevealsdiagnosticinformationinthelowamplituderegionofserummassspectrafromovariancancerpatients
AT fishmandavida novelapproachestovisualizationanddataminingrevealsdiagnosticinformationinthelowamplituderegionofserummassspectrafromovariancancerpatients
AT whiteleygordonr novelapproachestovisualizationanddataminingrevealsdiagnosticinformationinthelowamplituderegionofserummassspectrafromovariancancerpatients
AT petricoinemanuelf novelapproachestovisualizationanddataminingrevealsdiagnosticinformationinthelowamplituderegionofserummassspectrafromovariancancerpatients
AT liottalancea novelapproachestovisualizationanddataminingrevealsdiagnosticinformationinthelowamplituderegionofserummassspectrafromovariancancerpatients