Cargando…

Knowledge driven decomposition of tumor expression profiles

BACKGROUND: Tumors have been hypothesized to be the result of a mixture of oncogenic events, some of which will be reflected in the gene expression of the tumor. Based on this hypothesis a variety of data-driven methods have been employed to decompose tumor expression profiles into component profile...

Descripción completa

Detalles Bibliográficos
Autores principales: van Vliet, Martin H, Wessels, Lodewyk FA, Reinders, Marcel JT
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648763/
https://www.ncbi.nlm.nih.gov/pubmed/19208120
http://dx.doi.org/10.1186/1471-2105-10-S1-S20
_version_ 1782164982489677824
author van Vliet, Martin H
Wessels, Lodewyk FA
Reinders, Marcel JT
author_facet van Vliet, Martin H
Wessels, Lodewyk FA
Reinders, Marcel JT
author_sort van Vliet, Martin H
collection PubMed
description BACKGROUND: Tumors have been hypothesized to be the result of a mixture of oncogenic events, some of which will be reflected in the gene expression of the tumor. Based on this hypothesis a variety of data-driven methods have been employed to decompose tumor expression profiles into component profiles, hypothetically linked to these events. Interpretation of the resulting data-driven components is often done by post-hoc comparison to, for instance, functional groupings of genes into gene sets. None of the data-driven methods allow the incorporation of that type of knowledge directly into the decomposition. RESULTS: We present a linear model which uses knowledge driven, pre-defined components to perform the decomposition. We solve this decomposition model in a constrained linear least squares fashion. From a variety of options, a lasso-based solution to the model performs best in linking single gene perturbation data to mouse data. Moreover, we show the decomposition of expression profiles from human breast cancer samples into single gene perturbation profiles and gene sets that are linked to the hallmarks of cancer. For these breast cancer samples we were able to discern several links between clinical parameters, and the decomposition weights, providing new insights into the biology of these tumors. Lastly, we show that the order in which the Lasso regularization shrinks the weights, unveils consensus patterns within clinical subgroups of the breast cancer samples. CONCLUSION: The proposed lasso-based constrained least squares decomposition provides a stable and relevant relation between samples and knowledge-based components, and is thus a viable alternative to data-driven methods. In addition, the consensus order of component importance within clinical subgroups provides a better molecular characterization of the subtypes.
format Text
id pubmed-2648763
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26487632009-03-03 Knowledge driven decomposition of tumor expression profiles van Vliet, Martin H Wessels, Lodewyk FA Reinders, Marcel JT BMC Bioinformatics Research BACKGROUND: Tumors have been hypothesized to be the result of a mixture of oncogenic events, some of which will be reflected in the gene expression of the tumor. Based on this hypothesis a variety of data-driven methods have been employed to decompose tumor expression profiles into component profiles, hypothetically linked to these events. Interpretation of the resulting data-driven components is often done by post-hoc comparison to, for instance, functional groupings of genes into gene sets. None of the data-driven methods allow the incorporation of that type of knowledge directly into the decomposition. RESULTS: We present a linear model which uses knowledge driven, pre-defined components to perform the decomposition. We solve this decomposition model in a constrained linear least squares fashion. From a variety of options, a lasso-based solution to the model performs best in linking single gene perturbation data to mouse data. Moreover, we show the decomposition of expression profiles from human breast cancer samples into single gene perturbation profiles and gene sets that are linked to the hallmarks of cancer. For these breast cancer samples we were able to discern several links between clinical parameters, and the decomposition weights, providing new insights into the biology of these tumors. Lastly, we show that the order in which the Lasso regularization shrinks the weights, unveils consensus patterns within clinical subgroups of the breast cancer samples. CONCLUSION: The proposed lasso-based constrained least squares decomposition provides a stable and relevant relation between samples and knowledge-based components, and is thus a viable alternative to data-driven methods. In addition, the consensus order of component importance within clinical subgroups provides a better molecular characterization of the subtypes. BioMed Central 2009-01-30 /pmc/articles/PMC2648763/ /pubmed/19208120 http://dx.doi.org/10.1186/1471-2105-10-S1-S20 Text en Copyright © 2009 van Vliet et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
van Vliet, Martin H
Wessels, Lodewyk FA
Reinders, Marcel JT
Knowledge driven decomposition of tumor expression profiles
title Knowledge driven decomposition of tumor expression profiles
title_full Knowledge driven decomposition of tumor expression profiles
title_fullStr Knowledge driven decomposition of tumor expression profiles
title_full_unstemmed Knowledge driven decomposition of tumor expression profiles
title_short Knowledge driven decomposition of tumor expression profiles
title_sort knowledge driven decomposition of tumor expression profiles
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648763/
https://www.ncbi.nlm.nih.gov/pubmed/19208120
http://dx.doi.org/10.1186/1471-2105-10-S1-S20
work_keys_str_mv AT vanvlietmartinh knowledgedrivendecompositionoftumorexpressionprofiles
AT wesselslodewykfa knowledgedrivendecompositionoftumorexpressionprofiles
AT reindersmarceljt knowledgedrivendecompositionoftumorexpressionprofiles