Cargando…

MVDA: a multi-view genomic data integration methodology

BACKGROUND: Multiple high-throughput molecular profiling by omics technologies can be collected for the same individuals. Combining these data, rather than exploiting them separately, can significantly increase the power of clinically relevant patients subclassifications. RESULTS: We propose a multi...

Descripción completa

Detalles Bibliográficos
Autores principales: Serra, Angela, Fratello, Michele, Fortino, Vittorio, Raiconi, Giancarlo, Tagliaferri, Roberto, Greco, Dario
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4539887/
https://www.ncbi.nlm.nih.gov/pubmed/26283178
http://dx.doi.org/10.1186/s12859-015-0680-3
_version_ 1782386165491433472
author Serra, Angela
Fratello, Michele
Fortino, Vittorio
Raiconi, Giancarlo
Tagliaferri, Roberto
Greco, Dario
author_facet Serra, Angela
Fratello, Michele
Fortino, Vittorio
Raiconi, Giancarlo
Tagliaferri, Roberto
Greco, Dario
author_sort Serra, Angela
collection PubMed
description BACKGROUND: Multiple high-throughput molecular profiling by omics technologies can be collected for the same individuals. Combining these data, rather than exploiting them separately, can significantly increase the power of clinically relevant patients subclassifications. RESULTS: We propose a multi-view approach in which the information from different data layers (views) is integrated at the levels of the results of each single view clustering iterations. It works by factorizing the membership matrices in a late integration manner. We evaluated the effectiveness and the performance of our method on six multi-view cancer datasets. In all the cases, we found patient sub-classes with statistical significance, identifying novel sub-groups previously not emphasized in literature. Our method performed better as compared to other multi-view clustering algorithms and, unlike other existing methods, it is able to quantify the contribution of single views on the final results. CONCLUSION: Our observations suggest that integration of prior information with genomic features in the subtyping analysis is an effective strategy in identifying disease subgroups. The methodology is implemented in R and the source code is available online at http://neuronelab.unisa.it/a-multi-view-genomic-data-integration-methodology/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0680-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4539887
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45398872015-08-19 MVDA: a multi-view genomic data integration methodology Serra, Angela Fratello, Michele Fortino, Vittorio Raiconi, Giancarlo Tagliaferri, Roberto Greco, Dario BMC Bioinformatics Research Article BACKGROUND: Multiple high-throughput molecular profiling by omics technologies can be collected for the same individuals. Combining these data, rather than exploiting them separately, can significantly increase the power of clinically relevant patients subclassifications. RESULTS: We propose a multi-view approach in which the information from different data layers (views) is integrated at the levels of the results of each single view clustering iterations. It works by factorizing the membership matrices in a late integration manner. We evaluated the effectiveness and the performance of our method on six multi-view cancer datasets. In all the cases, we found patient sub-classes with statistical significance, identifying novel sub-groups previously not emphasized in literature. Our method performed better as compared to other multi-view clustering algorithms and, unlike other existing methods, it is able to quantify the contribution of single views on the final results. CONCLUSION: Our observations suggest that integration of prior information with genomic features in the subtyping analysis is an effective strategy in identifying disease subgroups. The methodology is implemented in R and the source code is available online at http://neuronelab.unisa.it/a-multi-view-genomic-data-integration-methodology/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0680-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-08-19 /pmc/articles/PMC4539887/ /pubmed/26283178 http://dx.doi.org/10.1186/s12859-015-0680-3 Text en © Serra et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Serra, Angela
Fratello, Michele
Fortino, Vittorio
Raiconi, Giancarlo
Tagliaferri, Roberto
Greco, Dario
MVDA: a multi-view genomic data integration methodology
title MVDA: a multi-view genomic data integration methodology
title_full MVDA: a multi-view genomic data integration methodology
title_fullStr MVDA: a multi-view genomic data integration methodology
title_full_unstemmed MVDA: a multi-view genomic data integration methodology
title_short MVDA: a multi-view genomic data integration methodology
title_sort mvda: a multi-view genomic data integration methodology
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4539887/
https://www.ncbi.nlm.nih.gov/pubmed/26283178
http://dx.doi.org/10.1186/s12859-015-0680-3
work_keys_str_mv AT serraangela mvdaamultiviewgenomicdataintegrationmethodology
AT fratellomichele mvdaamultiviewgenomicdataintegrationmethodology
AT fortinovittorio mvdaamultiviewgenomicdataintegrationmethodology
AT raiconigiancarlo mvdaamultiviewgenomicdataintegrationmethodology
AT tagliaferriroberto mvdaamultiviewgenomicdataintegrationmethodology
AT grecodario mvdaamultiviewgenomicdataintegrationmethodology