Cargando…
MVDA: a multi-view genomic data integration methodology
BACKGROUND: Multiple high-throughput molecular profiling by omics technologies can be collected for the same individuals. Combining these data, rather than exploiting them separately, can significantly increase the power of clinically relevant patients subclassifications. RESULTS: We propose a multi...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4539887/ https://www.ncbi.nlm.nih.gov/pubmed/26283178 http://dx.doi.org/10.1186/s12859-015-0680-3 |
_version_ | 1782386165491433472 |
---|---|
author | Serra, Angela Fratello, Michele Fortino, Vittorio Raiconi, Giancarlo Tagliaferri, Roberto Greco, Dario |
author_facet | Serra, Angela Fratello, Michele Fortino, Vittorio Raiconi, Giancarlo Tagliaferri, Roberto Greco, Dario |
author_sort | Serra, Angela |
collection | PubMed |
description | BACKGROUND: Multiple high-throughput molecular profiling by omics technologies can be collected for the same individuals. Combining these data, rather than exploiting them separately, can significantly increase the power of clinically relevant patients subclassifications. RESULTS: We propose a multi-view approach in which the information from different data layers (views) is integrated at the levels of the results of each single view clustering iterations. It works by factorizing the membership matrices in a late integration manner. We evaluated the effectiveness and the performance of our method on six multi-view cancer datasets. In all the cases, we found patient sub-classes with statistical significance, identifying novel sub-groups previously not emphasized in literature. Our method performed better as compared to other multi-view clustering algorithms and, unlike other existing methods, it is able to quantify the contribution of single views on the final results. CONCLUSION: Our observations suggest that integration of prior information with genomic features in the subtyping analysis is an effective strategy in identifying disease subgroups. The methodology is implemented in R and the source code is available online at http://neuronelab.unisa.it/a-multi-view-genomic-data-integration-methodology/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0680-3) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4539887 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45398872015-08-19 MVDA: a multi-view genomic data integration methodology Serra, Angela Fratello, Michele Fortino, Vittorio Raiconi, Giancarlo Tagliaferri, Roberto Greco, Dario BMC Bioinformatics Research Article BACKGROUND: Multiple high-throughput molecular profiling by omics technologies can be collected for the same individuals. Combining these data, rather than exploiting them separately, can significantly increase the power of clinically relevant patients subclassifications. RESULTS: We propose a multi-view approach in which the information from different data layers (views) is integrated at the levels of the results of each single view clustering iterations. It works by factorizing the membership matrices in a late integration manner. We evaluated the effectiveness and the performance of our method on six multi-view cancer datasets. In all the cases, we found patient sub-classes with statistical significance, identifying novel sub-groups previously not emphasized in literature. Our method performed better as compared to other multi-view clustering algorithms and, unlike other existing methods, it is able to quantify the contribution of single views on the final results. CONCLUSION: Our observations suggest that integration of prior information with genomic features in the subtyping analysis is an effective strategy in identifying disease subgroups. The methodology is implemented in R and the source code is available online at http://neuronelab.unisa.it/a-multi-view-genomic-data-integration-methodology/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0680-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-08-19 /pmc/articles/PMC4539887/ /pubmed/26283178 http://dx.doi.org/10.1186/s12859-015-0680-3 Text en © Serra et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Serra, Angela Fratello, Michele Fortino, Vittorio Raiconi, Giancarlo Tagliaferri, Roberto Greco, Dario MVDA: a multi-view genomic data integration methodology |
title | MVDA: a multi-view genomic data integration methodology |
title_full | MVDA: a multi-view genomic data integration methodology |
title_fullStr | MVDA: a multi-view genomic data integration methodology |
title_full_unstemmed | MVDA: a multi-view genomic data integration methodology |
title_short | MVDA: a multi-view genomic data integration methodology |
title_sort | mvda: a multi-view genomic data integration methodology |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4539887/ https://www.ncbi.nlm.nih.gov/pubmed/26283178 http://dx.doi.org/10.1186/s12859-015-0680-3 |
work_keys_str_mv | AT serraangela mvdaamultiviewgenomicdataintegrationmethodology AT fratellomichele mvdaamultiviewgenomicdataintegrationmethodology AT fortinovittorio mvdaamultiviewgenomicdataintegrationmethodology AT raiconigiancarlo mvdaamultiviewgenomicdataintegrationmethodology AT tagliaferriroberto mvdaamultiviewgenomicdataintegrationmethodology AT grecodario mvdaamultiviewgenomicdataintegrationmethodology |