Cargando…

A kernel-based integration of genome-wide data for clinical decision support

BACKGROUND: Although microarray technology allows the investigation of the transcriptomic make-up of a tumor in one experiment, the transcriptome does not completely reflect the underlying biology due to alternative splicing, post-translational modifications, as well as the influence of pathological...

Descripción completa

Detalles Bibliográficos
Autores principales: Daemen, Anneleen, Gevaert, Olivier, Ojeda, Fabian, Debucquoy, Annelies, Suykens, Johan AK, Sempoux, Christine, Machiels, Jean-Pascal, Haustermans, Karin, De Moor, Bart
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2684660/
https://www.ncbi.nlm.nih.gov/pubmed/19356222
http://dx.doi.org/10.1186/gm39
_version_ 1782167253380235264
author Daemen, Anneleen
Gevaert, Olivier
Ojeda, Fabian
Debucquoy, Annelies
Suykens, Johan AK
Sempoux, Christine
Machiels, Jean-Pascal
Haustermans, Karin
De Moor, Bart
author_facet Daemen, Anneleen
Gevaert, Olivier
Ojeda, Fabian
Debucquoy, Annelies
Suykens, Johan AK
Sempoux, Christine
Machiels, Jean-Pascal
Haustermans, Karin
De Moor, Bart
author_sort Daemen, Anneleen
collection PubMed
description BACKGROUND: Although microarray technology allows the investigation of the transcriptomic make-up of a tumor in one experiment, the transcriptome does not completely reflect the underlying biology due to alternative splicing, post-translational modifications, as well as the influence of pathological conditions (for example, cancer) on transcription and translation. This increases the importance of fusing more than one source of genome-wide data, such as the genome, transcriptome, proteome, and epigenome. The current increase in the amount of available omics data emphasizes the need for a methodological integration framework. METHODS: We propose a kernel-based approach for clinical decision support in which many genome-wide data sources are combined. Integration occurs within the patient domain at the level of kernel matrices before building the classifier. As supervised classification algorithm, a weighted least squares support vector machine is used. We apply this framework to two cancer cases, namely, a rectal cancer data set containing microarray and proteomics data and a prostate cancer data set containing microarray and genomics data. For both cases, multiple outcomes are predicted. RESULTS: For the rectal cancer outcomes, the highest leave-one-out (LOO) areas under the receiver operating characteristic curves (AUC) were obtained when combining microarray and proteomics data gathered during therapy and ranged from 0.927 to 0.987. For prostate cancer, all four outcomes had a better LOO AUC when combining microarray and genomics data, ranging from 0.786 for recurrence to 0.987 for metastasis. CONCLUSIONS: For both cancer sites the prediction of all outcomes improved when more than one genome-wide data set was considered. This suggests that integrating multiple genome-wide data sources increases the predictive performance of clinical decision support models. This emphasizes the need for comprehensive multi-modal data. We acknowledge that, in a first phase, this will substantially increase costs; however, this is a necessary investment to ultimately obtain cost-efficient models usable in patient tailored therapy.
format Text
id pubmed-2684660
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26846602009-05-20 A kernel-based integration of genome-wide data for clinical decision support Daemen, Anneleen Gevaert, Olivier Ojeda, Fabian Debucquoy, Annelies Suykens, Johan AK Sempoux, Christine Machiels, Jean-Pascal Haustermans, Karin De Moor, Bart Genome Med Research BACKGROUND: Although microarray technology allows the investigation of the transcriptomic make-up of a tumor in one experiment, the transcriptome does not completely reflect the underlying biology due to alternative splicing, post-translational modifications, as well as the influence of pathological conditions (for example, cancer) on transcription and translation. This increases the importance of fusing more than one source of genome-wide data, such as the genome, transcriptome, proteome, and epigenome. The current increase in the amount of available omics data emphasizes the need for a methodological integration framework. METHODS: We propose a kernel-based approach for clinical decision support in which many genome-wide data sources are combined. Integration occurs within the patient domain at the level of kernel matrices before building the classifier. As supervised classification algorithm, a weighted least squares support vector machine is used. We apply this framework to two cancer cases, namely, a rectal cancer data set containing microarray and proteomics data and a prostate cancer data set containing microarray and genomics data. For both cases, multiple outcomes are predicted. RESULTS: For the rectal cancer outcomes, the highest leave-one-out (LOO) areas under the receiver operating characteristic curves (AUC) were obtained when combining microarray and proteomics data gathered during therapy and ranged from 0.927 to 0.987. For prostate cancer, all four outcomes had a better LOO AUC when combining microarray and genomics data, ranging from 0.786 for recurrence to 0.987 for metastasis. CONCLUSIONS: For both cancer sites the prediction of all outcomes improved when more than one genome-wide data set was considered. This suggests that integrating multiple genome-wide data sources increases the predictive performance of clinical decision support models. This emphasizes the need for comprehensive multi-modal data. We acknowledge that, in a first phase, this will substantially increase costs; however, this is a necessary investment to ultimately obtain cost-efficient models usable in patient tailored therapy. BioMed Central 2009-04-03 /pmc/articles/PMC2684660/ /pubmed/19356222 http://dx.doi.org/10.1186/gm39 Text en Copyright ©2009 Daemen et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Daemen, Anneleen
Gevaert, Olivier
Ojeda, Fabian
Debucquoy, Annelies
Suykens, Johan AK
Sempoux, Christine
Machiels, Jean-Pascal
Haustermans, Karin
De Moor, Bart
A kernel-based integration of genome-wide data for clinical decision support
title A kernel-based integration of genome-wide data for clinical decision support
title_full A kernel-based integration of genome-wide data for clinical decision support
title_fullStr A kernel-based integration of genome-wide data for clinical decision support
title_full_unstemmed A kernel-based integration of genome-wide data for clinical decision support
title_short A kernel-based integration of genome-wide data for clinical decision support
title_sort kernel-based integration of genome-wide data for clinical decision support
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2684660/
https://www.ncbi.nlm.nih.gov/pubmed/19356222
http://dx.doi.org/10.1186/gm39
work_keys_str_mv AT daemenanneleen akernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT gevaertolivier akernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT ojedafabian akernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT debucquoyannelies akernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT suykensjohanak akernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT sempouxchristine akernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT machielsjeanpascal akernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT haustermanskarin akernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT demoorbart akernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT daemenanneleen kernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT gevaertolivier kernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT ojedafabian kernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT debucquoyannelies kernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT suykensjohanak kernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT sempouxchristine kernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT machielsjeanpascal kernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT haustermanskarin kernelbasedintegrationofgenomewidedataforclinicaldecisionsupport
AT demoorbart kernelbasedintegrationofgenomewidedataforclinicaldecisionsupport